Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

From Frequent Features to Frequent Social Links

Erick Stattner, Martine Collard

Source Title: International Journal of Information System Modeling and Design (IJISMD) 4(3)

DOI: 10.4018/jismd.2013070104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Standard data mining techniques have been applied and adapted for eliciting knowledge from social networks, by achieving classical tasks such as classification, search for frequent patterns or link prediction. Most works have exploited only the network topological structure, and therefore cannot be used to answer questions involving nodes features. For instance, the frequent pattern discovery task generally refers to the search for sub-networks frequently found in a single network or in a set of networks. In the same area, this paper focuses on the concept of frequent link that stands as a regularity found in a network on links between node groups that share common characteristics. The extraction of such links from a social network is a particularly challenging and computationally intensive problem, since it is much dependent on the number of links and attributes. In this study, the authors propose a solution for reducing the search space of frequent links, by filtering the nodes features on a criterion of frequency. The authors make the assumption that frequent links occur between sets of features that are themselves frequent. This property is used to reduce the search space and speed up the extraction process. The authors empirically show that it is well founded, and they discuss the efficiency of the solution in terms of computation time and number of frequent patterns found depending on several frequency thresholds.

Article Preview

Top

1. Introduction

Standard data mining algorithms have been relying on the implicit assumption that data are IID (independent and identically distributed). They consider datasets as collections of independent instances of single relations. While this restriction appears to be consistent with the classical statistical inference problem, it ignores inherent dependencies and correlations involved in numerous real-world phenomena that frequently emerge from complex interactions between entities. For instance in traditional data mining processes, it is common to search for purchase models of individuals by focusing on their own attributes (age, city, school, center of interest, etc). Nevertheless, it is obvious that social relationships between individuals, such as friendship or professional relationships, maintain their environment and determine their behavior and decisions.

The last decade social links have become the subject of a novel active research area, the “Science of Network” (Barabasi, 2002, Watts, 2004, Borner et al., 2007), a new way of studying relationships maintained between entities with new techniques regarding older traditional approaches followed in sociology and discrete mathematics. While previous works provided most popular results among which Milgram's experiment (Milgram, 1967) on small world phenomenon or Bott's work (Bott, 1957) on urban families, a great emphasis has been put on network structures and gave important findings. Network structures have been analysed according to recently defined measures such as the degree, density, diameter, clustering coefficient or shortest-path, etc.

The so-called and very novel social network mining area, also called link mining, (Getoor & Diehl, 2005) has attempted to apply the concepts of data mining on networks. Standard data mining techniques (classification, prediction, clustering, frequent pattern discovery), are used to explore the topological structure of the network without considering the nodes attributes. For instance, this is the case of methods for extracting frequent patterns from social networks, which are mostly approaches that only exploit the topological structure to extract sub-networks found frequently in a set of networks or a single much larger network.

However, both sources of information (structure and nodes features) seem to be relevant to take full advantage of the knowledge hidden into the network. While the links describe the nature of the relationships between individuals into the network, the attributes may provide relevant information on their role, position or influence. In the case of the spread of diseases for example, although the social link between two individuals is the main vector of the transmission process, properties such as the age or the origins, can influence quite significantly the probability that an individual contracts the disease, and therefore can influence the whole diffusion process.

Given the increasing number of available data sets, in which, in addition to social links, some information on nodes is also available, recent methods have integrated the complementary dimension. In this field, new algorithms or adaptations of existing algorithms have been proposed in visualization (Snasel et al., 2009), classification (Kuznetsov & Ignatov, 2009) or identification of communities (Gaume et al., 2010).

In this paper, we address the problem of the search for frequent patterns in social networks and we focus particularly on a new approach that combines both the network structure and the attributes of nodes for discovering regular patterns among the links that connect nodes with common characteristics. Such patterns are called “frequent links” and provide knowledge on the groups of node the most connected in the network.

For instance let us assume a social network built on mobile phone contacts, i.e in which two individuals (nodes) are connected if we observe at least one call from one of them to the other one. Each individual is characterized by its home country, the age and the professional status. A frequent link in this kind of network would provide for instance the following knowledge: “20% of calls are between European business men and Chinese people”.

Complete Article List

Search this Journal:

Reset

Volume 15: 1 Issue (2024)

Volume 14: 1 Issue (2023)

Volume 13: 8 Issues (2022): 7 Released, 1 Forthcoming

Volume 12: 4 Issues (2021)

Volume 11: 4 Issues (2020)

Volume 10: 4 Issues (2019)

Volume 9: 4 Issues (2018)

Volume 8: 4 Issues (2017)

Volume 7: 4 Issues (2016)

Volume 6: 4 Issues (2015)

Volume 5: 4 Issues (2014)

Volume 4: 4 Issues (2013)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

From Frequent Features to Frequent Social Links

Abstract

1. Introduction

Complete Article List