Reference Hub2
Optimal Query Generation for Hidden Web Extraction through Response Analysis

Optimal Query Generation for Hidden Web Extraction through Response Analysis

Sonali Gupta, Komal Kumar Bhatia
Copyright: © 2014 |Volume: 4 |Issue: 2 |Pages: 18
ISSN: 2155-6377|EISSN: 2155-6385|EISBN13: 9781466654884|DOI: 10.4018/ijirr.2014040101
Cite Article Cite Article

MLA

Gupta, Sonali, and Komal Kumar Bhatia. "Optimal Query Generation for Hidden Web Extraction through Response Analysis." IJIRR vol.4, no.2 2014: pp.1-18. http://doi.org/10.4018/ijirr.2014040101

APA

Gupta, S. & Bhatia, K. K. (2014). Optimal Query Generation for Hidden Web Extraction through Response Analysis. International Journal of Information Retrieval Research (IJIRR), 4(2), 1-18. http://doi.org/10.4018/ijirr.2014040101

Chicago

Gupta, Sonali, and Komal Kumar Bhatia. "Optimal Query Generation for Hidden Web Extraction through Response Analysis," International Journal of Information Retrieval Research (IJIRR) 4, no.2: 1-18. http://doi.org/10.4018/ijirr.2014040101

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

A huge number of Hidden Web databases exists over the WWW forming a massive source of high quality information. Retrieval of this information for enriching the repository of the search engine is the prime target of a Hidden web crawler. Besides this, the crawler should perform this task at an affordable cost and resource utilization. This paper proposes a Random ranking mechanism whereby the queries to be raised by the hidden web crawler have been ranked. By ranking the queries according to the proposed mechanism, the Hidden Web crawler is able to make an optimal choice among the candidate queries and efficiently retrieve the Hidden web databases. The Hidden Web crawler proposed here also possesses an extensible and scalable framework to improve the efficiency of crawling. The proposed approach has also been compared with other methods of Hidden Web crawling existing in the literature.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.