Natural Language Processing for Social Media. Diana Inkpen

Natural Language Processing for Social Media - Diana  Inkpen


Скачать книгу

       2 http://aigicrv.org/2016/

       3 https://aclweb.org/anthology/W/W12/#2100

       4 https://aclweb.org/anthology/W/W13/#1100

       5 https://aclweb.org/anthology/W/W14/#1300

       6 http://www.aclweb.org/

       7 http://microposts2016.seas.upenn.edu/

       8 http://eacl2017.org/

       9 http://cci.drexel.edu/bigdata/bigdata2017/

       10 https://sites.google.com/site/socialnlp2020/

       Acknowledgments

      This book would not have been possible without the hard work of many people. We would like to thank our colleagues and students at the University of Southern California and our colleagues at the NLP research group at the University of Ottawa. We would like to thank in particular Prof. Stan Szpakowicz from the University of Ottawa for his comments on the early draft of the book, and two anonymous reviewers for their useful suggestions for revisions and additions. We thank Prof. Graeme Hirst of the University of Toronto and Michael Morgan from Morgan & Claypool Publishers for their continuous encouragement.

      Anna Atefeh Farzindar and Diana Inkpen

      March 2020

      CHAPTER 1

       Introduction to Social Media Analysis

      Social media is a phenomenon that has recently expanded throughout the world and quickly attracted billions of users. This form of electronic communication through social networking platforms allows users to generate its content and share it in various forms of information, personal words, pictures, audio, and videos. Therefore, social computing is formed as an emerging area of research and development that includes a wide range of topics such as Web semantics, artificial intelligence, natural language processing, network analysis, and Big Data analytics.

      Over the past few years, online social networking sites (Facebook, Twitter, YouTube, Flickr, MySpace, LinkedIn, Metacafe, Vimeo, etc.) have revolutionized the way we communicate with individuals, groups, and communities, and have altered everyday practices [Boyd and Ellison, 2007].

      The broad categories of social media platforms are: content-sharing sites, forums, blogs, and microblogs. On content sharing sites (such as Facebook, Instagram, Foursquare, Flickr, YouTube) people exchange information, messages, photos, videos, or other types of content. On Web user forums (such as StackOverflow, CNET forums, Apple Support) people post specialized information, questions, or answers. Blogs (such as Gizmodo, Mashable, Boing Boing, and many more) allow people to post messages and other content and to share information and opinions. Micro-blogs (such as Twitter, Sina Weibo, Tumblr) are limited to short texts for sharing information and opinions. The modalities of sharing content in order: posts; comments to posts; explicit or implicit connections to build social networks (friend connections, followers, etc.); cross-posts and user linking; social tagging; likes/favorites/starring/voting/rating/etc.; author information; and linking to user profile features.1 In Table 1.1, we list more details about social media platforms and their characteristics and types of content shared [Barbier et al., 2013].

      Social media statistics for January 2014 have shown that Facebook has grown to more than 1 billion active users, adding more than 200 million users in a single year. Statista,2 the world’s largest statistics portal, announced the ranking for social networks based on the number of active users. As presented in Figure 1.1, the ranking shows that Qzone took second place with more than 600 million users. Google+, LinkedIn, and Twitter completed the top 5 with 300 million, 259 million, and 232 million active users, respectively.

Type Characteristics Examples
Social Networks A social networking website allows the user to build a web page and connect with a friend or other acquaintance in order to share user-generated content. MySpace, Facebook, LinkedIn, Meetup, Google Plus+
Blogs and Blog Comments A blog is an online journal where the blogger can create the content and display it in reverse chronological order. Blogs are generally maintained by a person or a community. Blog comments are posts by users attached to blogs or online newspaper posts. Huffington Post, Business Insider, Engadget, and online journals
Microblogs A microblog is similar to a blog but has a limited content. Twitter, Tumblr, Sina Weibo, Plurk
Forums An online forum is a place for members to discuss a topic by posting messages. Online Discussion Communities, phpBB Developer Forum, Raising Children Forum
Social Bookmarks Services that allow users to save, organize, and search links to various websites, and to share their bookmarks of Web pages. Delicious, Pinterest, Google Bookmarks
Wikis These websites allow people to collaborate and add content or edit the information on a community-based database. Wikipedia, Wikitravel, Wikihow
Social News Social news encourage their community to submit news stories, or to vote on the content and share it. Digg, Slashdot, Reddit
Media Sharing A website that enables users to capture videos and pictures or upload and share with others. YouTube, Flickr, Snapchat, Instagram, Vine

      Statista also provided the growth trend for both Facebook and LinkedIn, illustrated in Figure 1.2 and Figure 1.3, respectively. Figure 1.2 shows that Facebook, by reaching 845 million users at the end of 2011, totaled 1,228 million users by the end of 2013. As depicted in Figure 1.3, LinkedIn also reached 277 million users by the end of 2013, whereas it only had 145 million users at the end of 2011. Statista also calculated the annual income for both Facebook and LinkedIn, which in 2013 totalled US$7,872 and US$1,528 million, respectively.

image
Скачать книгу