Information Retrieval from Deep Web Based on Visual Query Interpretation

Information Retrieval from Deep Web Based on Visual Query Interpretation

Radhouane Boughammoura (Department of Sciences of Data Processing, Faculty of Sciences of Monastir, Research Unit MARS, Monastir, Tunisia), Mohamed Nazih Omri (Department of Sciences of Data Processing, Faculty of Sciences of Monastir, Research Unit MARS, Monastir, Tunisia) and Lobna Hlaoua (Department of Electronic and Computer Science, High School of Sciences and Technology of H. Sousse, Research Unit MARS, H. Sousse, Tunisia)
Copyright: © 2012 |Pages: 15
DOI: 10.4018/ijirr.2012100104
OnDemand PDF Download:
No Current Special Offers


Deep Web is growing rapidly. More than 90% of relevant information in web comes from deep Web. Users are usually interested by products which satisfy their needs at the best prices and quality of service .Hence, user’s needs concerns not only one service but many competitive services at the same time. However, for commercial reasons, there is no way to compare all web services products. Each web service is a black box which accepts queries through its own query interface and returns results. As consequence, users ask separately different web services and spend a lot of time comparing products in order to find the best one. This is a burden for novice users. In this paper, the authors propose a new approach which integrates query interfaces of many web services into one universal web service. The new interface describes visually the universal query and is used to ask many web services at the same time. The authors have evaluated their approach on standard datasets and have proved good performances.
Article Preview


Deep Web is a treasury of products and services. This treasury is hidden behind Web forms which give access to data stored in distant databases. Deep Web is very rich in terms of quantity (more than 90% of Web) and quality of service.

Users are usually searching for new and complementary services at the best prices and quality of service. However they are often disappointed because there is no way to compare products of two competitive deep web services. This drawback is due to two factors: 1) deep web services are offering competitive commercial products, 2) these services are offering query interfaces with different query capabilities. Hence queries submitted to servers have different meaning. In order to find a good product, the burden of information retrieval is the responsibility of users. This is time consuming and need a considerable cognitive effort. For novice users, this charge is the deadline between them and deep Web.

We try to facilitate information retrieval from deep web to novice users. Our goal is to build a universal web form which allows novice users query many web services at the same time. This web form make possible to formulate only one query and obtaining results from all merged web services at the same time. We aim to create this new web form which must be user friendly and easy to understand for novice users (Figure 1).

Figure 1.

Architecture of our deep Web Information retrieval system


2. Architecture Of Proposed System

We have subdivided the main problem (Information Retrieval from deep Web) into three sub-problems (Schema Extraction, Schema Integration, and Schema Visualization) and for each sub-problem we give some response keys.

Deep web services are clustered into several domains of interests, for example Airfare, Automobiles, Books, etc. This is a preprocessing step done by web crawler which classifies each web service into some category based on its query form. Since query forms are public interfaces they are easily reached through search engines (such as Google and Yahoo) from the surface Web. As we focus on deep web this step is beyond our interest.

Complete Article List

Search this Journal:
Open Access Articles: Forthcoming
Volume 12: 4 Issues (2022): Forthcoming, Available for Pre-Order
Volume 11: 4 Issues (2021): 2 Released, 2 Forthcoming
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2013)
Volume 2: 4 Issues (2012)
Volume 1: 4 Issues (2011)
View Complete Journal Contents Listing