Visualization of High-Level Associations from Twitter Data

Luca Cagliero, Naeem A. Mahoto

Source Title: Packaging Digital Information for Enhanced Learning and Analysis: Data Visualization, Spatialization, and Multidimensionality

DOI: 10.4018/978-1-4666-4462-5.ch008

Abstract

The Data Mining and Knowledge Discovery (KDD) process focuses on extracting useful information from large datasets. To support analysts in making decisions, considerable research effort has been devoted to visualizing the extracted data mining models effectively. Particular attention has been paid to the discovery of strong association rules from textual data coming from social networks; these rules represent potentially relevant correlations among document terms. However, state-of-the-art rule visualization tools do not allow experts to visualize data correlations at different abstraction levels. Hence, the effectiveness of existing approaches is limited, especially when dealing with fairly sparse data. This chapter presents Twitter Generalized Rule Visualizer (TGRV), a novel text mining and visualization tool. It aims at supporting analysts in exploring the results of the generalized association rule mining process applied to textual data coming from Twitter and supplied with WordNet taxonomies. Taxonomies are used for aggregating document terms into higher-level concepts. Generalized rules represent high-level associations among document terms. By exploiting taxonomy-based models, experts may examine the discovered data correlations from different perspectives and uncover interesting knowledge. Changing the perspective from which data correlations are visualized is shown to improve the readability and the usability of the generated rule-based model. The experimental results show the applicability and the usefulness of the proposed visualization tool on real textual data coming from Twitter. The visualized data correlations are shown to be valuable for advanced analysis, such as topic trend and user behavior analysis.

Introduction

Data Mining and Knowledge Discovery (KDD) focuses on extracting useful information from large datasets (Tan et al., 2005). Descriptive data mining techniques (e.g., clustering, association rule mining) entail discovering interesting and hidden patterns from the analyzed data. In the last several years, a significant research effort has been devoted to applying data mining techniques to textual data published on social networks. In particular, the analysis of the textual User-Generated Content (UGC) published on Twitter (http://twitter.com) has achieved promising results in the context of user behavior profiling (Li et al., 2008; Mathioudakis & Koudas, 2010) and topic trend discovery (Cheong & Lee, 2009; Cagliero & Fiori, in press).

Association rule mining (Agrawal et al., 1993) is a widely used exploratory data mining technique that allows discovering valuable correlations among data. An association rule is an implication A → B, where A and B are sets of items occurring in the source data. In the context of textual data analysis, a rule represents an implication between a pair of term sets occurring in the analyzed documents. To make the rule mining process computationally tractable, a minimum support threshold is commonly enforced to select only the associations among terms that occur frequently in the analyzed data. As a drawback, traditional rule mining algorithms (e.g., Apriori (Agrawal & Srikant, 1994), FP-Growth (Han et al., 2000)) are sometimes ineffective in mining valuable knowledge, because of the excessive level of detail of the mined information. For instance, when coping with real-world textual data, most of the associations among terms occur rarely in the analyzed data and, thus, may be discarded by enforcing a minimum support threshold. To overcome this issue, Agrawal & Srikant (1995) proposed to discover generalized association rules. Generalized rules are rules that may also contain high-level (generalized) terms. By exploiting a taxonomy (i.e., a set of is-a hierarchies) built over the analyzed textual documents, terms are aggregated into higher-level concepts, which are more likely to be frequent in the analyzed data. Hence, generalized rules represent underlying term correlations at different abstraction levels. Generalized rule mining from textual data has already been addressed in different application contexts, including social data analysis (Cagliero & Fiori, in press) and biomedical literature analysis (Berardi et al., 2005).
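The effect of taxonomy-based generalization on support can be illustrated with a minimal Python sketch. It is not the TGRV implementation or any of the cited algorithms: the toy tweets, the hand-written is-a taxonomy (standing in for WordNet), the threshold value, and the helper names `generalize`, `support`, and `confidence` are all assumptions made for illustration.

```python
# Hypothetical toy data: each tweet is reduced to a set of terms.
tweets = [
    {"dog", "park"},
    {"cat", "park"},
    {"dog", "vet"},
    {"cat", "vet"},
    {"hamster", "park"},
]

# Hypothetical is-a taxonomy (term -> parent concept), standing in for WordNet.
taxonomy = {"dog": "pet", "cat": "pet", "hamster": "pet"}


def generalize(transaction, taxonomy):
    """Extend a transaction with every ancestor of each of its terms."""
    extended = set(transaction)
    for term in transaction:
        parent = taxonomy.get(term)
        while parent is not None:
            extended.add(parent)
            parent = taxonomy.get(parent)
    return extended


def support(itemset, transactions):
    """Fraction of transactions that contain every item of the itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Support of the whole rule divided by the support of its antecedent."""
    antecedent_support = support(antecedent, transactions)
    if antecedent_support == 0:
        return 0.0
    rule_support = support(set(antecedent) | set(consequent), transactions)
    return rule_support / antecedent_support


min_support = 0.5  # illustrative threshold, not taken from the chapter
extended_tweets = [generalize(t, taxonomy) for t in tweets]

# The low-level term set {dog, park} is infrequent and would be discarded.
low_level = support({"dog", "park"}, tweets)

# The generalized term set {pet, park} aggregates dog, cat, and hamster.
high_level = support({"pet", "park"}, extended_tweets)

print(low_level >= min_support, high_level >= min_support)
print(confidence({"pet"}, {"park"}, extended_tweets))
```

In this toy dataset, {dog, park} occurs in one tweet out of five and falls below the threshold, whereas the generalized set {pet, park} occurs in three out of five and passes it, which is the situation the paragraph above describes for sparse textual data.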

To support analysts in the knowledge discovery process, a parallel research effort has been devoted to proposing visual tools adapted to several well-known KDD tasks. In the context of association rule mining, the proposed systems commonly focus on either visualizing the mining results effectively to ease the expert validation task (Leung et al., 2008; Wong et al., 1999; Meng, 2010) or allowing experts to drive the data mining process (Fayyad et al., 2001; Li et al., 2011). However, to the best of our knowledge, the problem of visualizing generalized rules mined from textual data coming from social networks has not been investigated so far.
