General Strategy for Querying Web Sources in a Data Federation Environment

General Strategy for Querying Web Sources in a Data Federation Environment

Aykut Firat (Northeastern University, USA), Lynn Wu (Massachusetts Institute of Technology, USA) and Stuart Madnick (Massachusetts Institute of Technology, USA)
Copyright: © 2010 |Pages: 18
DOI: 10.4018/978-1-60566-982-3.ch134

Abstract

Modern database management systems are supporting the inclusion and querying of nonrelational sources within a data federation environment via wrappers. Wrapper development for Web sources, however, is a convolution of code with extraction and query planning knowledge and becomes a daunting task. We use IBM DB2 federation engine to demonstrate the challenges of incorporating Web sources into a data federation. We, then, present a practical and general strategy for the inclusion and querying of Web sources without requiring any changes in the underlying data federation technology. This strategy separates the code and knowledge in wrapper development by introducing a general-purpose capabilities-aware mini query-planner and a data extraction engine. As a result, Web sources can be included in a data federation system faster, and maintained easier.

Complete Chapter List

Search this Book:
Reset