Social Network Integration in Document Summarization

Atefeh Farzindar

Source Title: Innovative Document Summarization Techniques: Revolutionizing Knowledge Understanding

DOI: 10.4018/978-1-4666-5019-0.ch006

Abstract

In this chapter, the author presents the new role of summarization in the dynamic network of social media and its importance in the semantic analysis of social media and large data. The author introduces how summarization tasks can improve social media retrieval and event detection, and discusses the challenges posed by social media data compared with traditional documents. The author presents approaches to social media summarization and methods for update summarization, network activities summarization, event-based summarization, and opinion summarization. The author reviews the existing evaluation metrics for summarization and the shared evaluation tasks in social-data-related tracks organized by ACL, TREC, TAC, and SemEval. In conclusion, the author discusses the importance of this dynamic discipline and the great potential of automatic summarization in the coming decade, in the context of changes in mobile technology, cloud computing, and social networking.

Chapter Preview

1. Introduction

Automatic summarization of traditional media such as written press and articles has been a popular research domain over the past 25 years. Document summarization is typically performed to save reading time by reducing the amount of information presented to users. Several online news agencies use clustering techniques to categorize news articles and provide pseudo-summaries. In addition, summarizing specific types of documents, such as legal decisions, has drawn considerable attention both in research and in the marketing of automatic systems (Farzindar and Lapalme, 2004). The purpose of these approaches is to exploit the thematic structure of documents in order to improve the coherence and readability of the summary. In recent years, we have been facing new challenges in processing social media data and integrating it into document summarization. Texts in social media are extremely noisy and ungrammatical; they do not adhere to conventional rules and are subject to continuously changing conventions.
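
To make the idea of reducing the amount of text presented to a reader concrete, the following is a minimal sketch of frequency-based extractive summarization. It is a generic illustration and not the method of any system cited in this chapter; the function names, the tiny stopword list, and the naive sentence splitter are all assumptions chosen for brevity.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (illustration only).
STOPWORDS = {"the", "a", "an", "of", "to", "in", "and", "is", "are",
             "for", "on", "it", "that", "this", "was", "with"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase the sentence and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOPWORDS]


def summarize(text, max_sentences=2):
    """Keep the sentences whose words are most frequent in the whole text."""
    sentences = split_sentences(text)
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        words = tokenize(sentence)
        # Average word frequency, so long sentences are not favored by length alone.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    # Restore document order among the selected sentences.
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)


if __name__ == "__main__":
    sample = ("The court rejected the appeal. The weather was mild. "
              "The appeal court cited an earlier court decision.")
    print(summarize(sample, max_sentences=2))
```

A scorer this simple assumes clean, well-punctuated input. On noisy social media text, both the sentence splitter and the word counts would degrade, which is one way to see why such data calls for different methods.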

Over the past few years, online social networking sites (Facebook, Twitter, YouTube, Flickr, MySpace, LinkedIn, Metacafe, Vimeo, etc.) have revolutionized the way we communicate with individuals, groups and communities, and altered everyday practices (Boyd and Ellison, 2007). Nearly one in four people worldwide will use social networks in 2013, according to the eMarketer report “Worldwide Social Network Users: 2013 Forecast and Comparative Estimates” (New Media Trend Watch, 2013). Social media has become a primary source of intelligence because it is often the first response to key events, carried by highly dynamic content generated by 1.73 billion users in 2013. Social media statistics for 2012 show that Facebook has grown to more than 800 million active users, adding more than 200 million in a single year. Twitter now has 100 million active users and LinkedIn has over 64 million users in North America alone (Digital Buzz, 2012). Recently, workshops such as Semantic Analysis in Social Media (Farzindar and Inkpen, 2012) and the NAACL/HLT workshop on Language Analysis in Social Media (Farzindar et al., 2013) have increasingly focused on the impact of social media on our daily lives, both on a personal and a professional level.

Social media data is the collection of open source information that can be obtained publicly via blogs and micro-blogs, Internet forums, user-generated FAQs, chat, podcasts, online games, tags, ratings and comments. Social media data has several distinctive properties. The conversations are social in nature and are posted in real time. Geo-locating a group of topically related conversations is important, since these conversations include emotions, neologisms, credibility/rumors and incentives. The texts are unstructured, are presented in many formats, and are written by different people in many languages and styles. Typographical mistakes and chat slang have also become increasingly prevalent on social networking sites like Facebook and Twitter. The authors are not professional writers, and the sources are scattered in pockets across thousands of places on the Web.

Monitoring and analyzing this rich and continuous flow of user-generated content can yield unprecedentedly valuable information, which would not have been available from traditional media outlets. This has given rise to the emerging discipline of Social Media Analytics, which draws from Social Network Analysis, Machine Learning, Data Mining, Information Retrieval (IR), automatic summarization, and Natural Language Processing (NLP) (Melville et al., 2009). Summarization can play a key role in both the semantic analysis of social media and Social Media Analytics.

In the context of analyzing social networks and document summarization, finding powerful methods and algorithms to extract the relevant data from large volumes of content, in varied and free formats and from multiple sources and languages, is a scientific challenge. Automatic processing and summarization of such data requires evaluating appropriate research methods for information extraction, automatic categorization, clustering, data indexing and statistical machine translation.

The sheer volume of social media data and the incredible rate at which new content is created make manual summarization, or any other meaningful manual analysis, largely infeasible. In many applications the amount of data is too large for a decision maker to evaluate and analyze effectively in real time.
