Reference Hub1
A Scalable Algorithm for One-to-One, Onto, and Partial Schema Matching with Uninterpreted Column Names and Column Values

A Scalable Algorithm for One-to-One, Onto, and Partial Schema Matching with Uninterpreted Column Names and Column Values

Boris Rabinovich, Mark Last
Copyright: © 2014 |Volume: 25 |Issue: 4 |Pages: 16
ISSN: 1063-8016|EISSN: 1533-8010|EISBN13: 9781466657595|DOI: 10.4018/JDM.2014100101
Cite Article Cite Article

MLA

Rabinovich, Boris, and Mark Last. "A Scalable Algorithm for One-to-One, Onto, and Partial Schema Matching with Uninterpreted Column Names and Column Values." JDM vol.25, no.4 2014: pp.1-16. http://doi.org/10.4018/JDM.2014100101

APA

Rabinovich, B. & Last, M. (2014). A Scalable Algorithm for One-to-One, Onto, and Partial Schema Matching with Uninterpreted Column Names and Column Values. Journal of Database Management (JDM), 25(4), 1-16. http://doi.org/10.4018/JDM.2014100101

Chicago

Rabinovich, Boris, and Mark Last. "A Scalable Algorithm for One-to-One, Onto, and Partial Schema Matching with Uninterpreted Column Names and Column Values," Journal of Database Management (JDM) 25, no.4: 1-16. http://doi.org/10.4018/JDM.2014100101

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

In this paper, the authors propose a five-step approach to the problem of identifying semantic correspondences between attributes of two database schemas. It is one of the key challenges in many database applications such as data integration and data warehousing. The authors' research is focused on uninterpreted schema matching, where the column names and column values are uninterpreted or unreliable. The approach implements Bayesian networks, Pearson's correlation and mutual information to identify inter-attribute dependencies. Additionally, the authors propose an extension to their algorithm that allows the user to manually enter the known mappings to improve the automated matching results. The five-step approach also allows data privacy preservation. The authors' evaluation experiments show that the proposed approach enhances the current set of schema matching techniques.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.