
Large Scale Matching Issues and Advances

Sana Sellami (LIRIS, France), Aicha-Nabila Benharkat (LIRIS, France), and Youssef Amghar (LIRIS, France)

Source Title: Ontology Theory, Management and Design: Advanced Tools and Models

DOI: 10.4018/978-1-61520-859-3.ch009

Abstract

Information technology domains (the semantic web, e-business, digital libraries, life sciences, etc.) now abound with a large variety of data (e.g., database schemas, XML schemas, ontologies) and raise a hard problem: semantic heterogeneity. Matching techniques are called upon to overcome this challenge by aligning these data. In this chapter, the authors study large scale matching approaches. They survey the techniques of large scale matching, in which a large number of schemas/ontologies and attributes are involved. They cover the two families of schema matching techniques, pair-wise and holistic, as well as a set of useful optimization techniques, and they compare the existing schema/ontology matching tools. The domain is highly active, and large scale matching still needs many more advances. The authors then provide conclusions concerning important open issues and potential synergies of the technologies presented.

Introduction

We are witnessing an explosive growth of data in business and scientific areas. Many databases and information sources are available through the web, covering different domains: the semantic Web, the deep Web, e-business, biology, digital libraries, etc. In such domains, the data generated are heterogeneous and voluminous; e.g., schemas with several thousand elements are common in e-business applications. Currently, the greatest challenge is to integrate such heterogeneous collections of data. Matching techniques automatically find correspondences between these data in order to allow their integration in information systems. Matching has attracted considerable interest in both research and practice. Matching is an operation that takes data as input (e.g., XML schemas, ontologies, relational database schemas) and returns the semantic similarity values of their elements. However, matching these data at large scale is a laborious process. The standard approach of trying to match the complete input schemas often leads to degraded performance. Various schema matching systems have been developed to solve the problem semi-automatically. Since schema matching is a semi-automatic task, efficient implementations are required to support interactive user feedback. In this context, scalable matching becomes a problem to be solved.
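To make the matching operation described above concrete, the following is a minimal sketch of a pair-wise matcher, not the method of any system surveyed in this chapter. It assumes that schemas are flat lists of element names and that similarity is purely name-based, computed with Python's standard `difflib.SequenceMatcher`; real matchers typically combine linguistic, structural, and instance-level evidence. The function names, the 0.6 threshold, and the sample schemas are all hypothetical.

```python
from difflib import SequenceMatcher
from itertools import product


def name_similarity(a: str, b: str) -> float:
    """String similarity in [0, 1] between two element names (case-insensitive)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_schemas(source, target, threshold=0.6):
    """Compare every source element with every target element.

    Returns (source_element, target_element, score) triples whose score
    reaches the threshold, best matches first. The threshold value is an
    illustrative assumption, not a recommended setting.
    """
    pairs = [
        (s, t, name_similarity(s, t))
        for s, t in product(source, target)
    ]
    kept = [p for p in pairs if p[2] >= threshold]
    return sorted(kept, key=lambda p: p[2], reverse=True)


# Hypothetical element names, made up for illustration.
purchase_order = ["orderId", "customerName", "shipAddress"]
invoice = ["order_id", "custName", "billingAddress", "total"]

correspondences = match_schemas(purchase_order, invoice)
for s, t, score in correspondences:
    print(f"{s} <-> {t}: {score:.2f}")

# The number of comparisons is the product of the two schema sizes.
comparisons = len(purchase_order) * len(invoice)
print(f"{comparisons} element comparisons")
```

Because every element of one schema is compared with every element of the other, the work grows with the product of the schema sizes. For schemas with several thousand elements each, this amounts to millions of comparisons per schema pair, which is one reason exhaustive matching degrades at large scale.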

This chapter describes recent research on large scale schema and ontology matching. In the past years there has been a considerable amount of research in the area of matching, both for database schemas and, more recently, for ontologies. Several surveys (Rahm & Bernstein, 2002; Shvaiko & Euzenat, 2005) cover many of the existing approaches. The survey by Rahm & Bernstein (2002) is devoted to a classification of schema matching approaches and a comparative review of matching systems. The survey by Shvaiko & Euzenat (2005) presents a new classification that takes into account some novel schema/ontology matching approaches. A number of approaches and principles have been developed for matching small or medium data (schemas or ontologies). A major challenge that largely remains to be tackled is to scale up semantic matching in two ways: to a large number of data to be aligned or matched, and to very large data. While the former is primarily addressed in the database area, the latter has been addressed by researchers in schema and ontology matching. Based on this observation, we provide a survey of work in the large scale area that differs from those of Rahm & Bernstein (2002) and Shvaiko & Euzenat (2005). We first present the main features of large scale matching. We then survey the existing large scale matching approaches, called holistic and pair-wise, and show how these approaches deal with the scalability problem. We discuss the related strategies and topics: optimization techniques, machine learning algorithms, statistical algorithms, etc. We describe the existing schema/ontology matching tools in the literature and compare them. This analysis of state-of-the-art techniques allows us to draw some conclusions and observations about the existing matching approaches and systems.

This chapter is organized as follows. Section 2 presents the motivation for the large scale matching problem. Section 3 discusses the large scale matching approaches and presents a classification. Section 4 describes the large scale matching tools. Section 5 reports some future directions, and Section 6 concludes the chapter.
