Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Ontology-Based Clustering in a Peer Data Management System

Carlos Eduardo Santos Pires, Rocir Marcos Leite Santiago, Ana Carolina Salgado, Zoubida Kedad, Mokrane Bouzeghoub

Source Title: International Journal of Distributed Systems and Technologies (IJDST) 3(2)

DOI: 10.4018/jdst.2012040101

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Peer Data Management Systems (PDMSs) are advanced P2P applications in which each peer represents an autonomous data source making available an exported schema to be shared with other peers. Query answering in PDMSs can be improved if peers are efficiently disposed in the overlay network according to the similarity of their content. The set of peers can be partitioned into clusters, so as the semantic similarity among the peers participating into the same cluster is maximal. The creation and maintenance of clusters is a challenging problem in the current stage of development of PDMSs. This work proposes an incremental peer clustering process. The authors present a PDMS architecture designed to facilitate the connection of new peers according to their exported schema described by an ontology. The authors propose a clustering process and the underlying algorithm. The authors present and discuss some experimental results on peer clustering using the approach.

Article Preview

Top

Introduction

Peer Data Management Systems (PDMSs) (Kantere et al., 2009; Tatarinov et al., 2003) are advanced P2P applications in which each peer represents an autonomous data source that makes available an exported schema. Such schema represents the data to be shared with other peers. Peers communicate through an overlay network, i.e., a virtual (logical) network which runs as an overlay on top of a physical network (Doval & O’Mahony, 2003). According to the overlay topology employed, P2P systems are classified into three categories (Androutsellis-Theotokis & Spinellis, 2004): unstructured, structured, and hybrid. Some works also consider a fourth one called super-peer (Yang & Garcia-Molina, 2003).

One of the most studied data management issue on PDMSs is query answering (Hose et al., 2008; Montanelli & Castano, 2008), which consists in propagating a query, submitted at any of the peers, on paths of limited depth in the corresponding overlay network (Lodi et al., 2008). At each routing step, the query is reformulated to the exported schema of its new host based on the respective schema mappings (Souza et al., 2009).

Query answering issues in PDMSs can be improved if peers are efficiently disposed in the overlay network according to their similarity with respect to the content they are willing to share (Castano et al., 2004). The set of peers can then be partitioned into clusters of peers in order to maximize the semantic similarity between the peers participating into the same cluster. Peer clustering has several benefits, the most important one being the fact that the queries are answered only by a few (but relevant) peers (Raftopoulou et al., 2009).

Due to the excessive number of peers, their autonomous nature, and the heterogeneity of their schemas, the creation and maintenance of clusters is considered a challenging aspect in the current stage of development of PDMSs (Kantere et al., 2008). This work proposes an incremental process for clustering peers in a PMDS. To achieve this objective, we present a PDMS architecture which is mainly designed to facilitate the connection of new peers according to their corresponding exported schema. In this architecture, peers are organized in semantic communities (Castano & Montanelli, 2005) and within a community peers are grouped into semantically related clusters. As schemas are represented as an OWL ontology (OWL, 2011), the clustering process makes intensive use of ontology management services, such as matching (Castano et al., 2006; Euzenat & Shvaiko, 2007), merging, and summarization (Pires et al., 2010).

Some approaches have been proposed to the problem of peer clustering. One of the first solutions was proposed for P2P file sharing systems (Yang & Garcia-Molina, 2003). This approach can be applied when the peers have the same structure and vocabulary, which is not the case in our setting. Some of the existing solutions (Castano & Montanelli, 2005; Doulkeridis et al., 2006; Kantere et al., 2008) assume that the P2P network is already populated with a predetermined number of peers and the clustering process is done in an ad-hoc manner. Few solutions (Li & Vuong, 2005; Lodi et al., 2008) consider the problem of forming clusters from scratch. In Li and Vuong (2005), a simple and asymmetric global measure is used to compute the semantic similarity between two peers’ schemas; the authors assume that peers in a PDMS share exactly the same ontology. The PDMS proposed in Lodi et al. (2008) concentrates all efforts related to peer clustering in a centralized structure called Access Point Structure (APS) which is updated whenever a peer joins or leaves the PDMS. The frequency of updates in the APS can be intense and consequently bring scalability problems to the system.

Complete Article List

Search this Journal:

Reset

Volume 15: 1 Issue (2024)

Volume 14: 2 Issues (2023)

Volume 13: 8 Issues (2022)

Volume 12: 4 Issues (2021)

Volume 11: 4 Issues (2020)

Volume 10: 4 Issues (2019)

Volume 9: 4 Issues (2018)

Volume 8: 4 Issues (2017)

Volume 7: 4 Issues (2016)

Volume 6: 4 Issues (2015)

Volume 5: 4 Issues (2014)

Volume 4: 4 Issues (2013)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Ontology-Based Clustering in a Peer Data Management System

Abstract

Introduction

Complete Article List