An Approach for Focused Crawler to Harvest Digital Academic Documents in Online Digital Libraries

Sumita Gupta, Neelam Duhan, Poonam Bansal
Copyright: © 2019 | Volume: 9 | Issue: 3 | Pages: 25
ISSN: 2155-6377 | EISSN: 2155-6385 | EISBN13: 9781522567165 | DOI: 10.4018/IJIRR.2019070103
Cite Article

MLA

Gupta, Sumita, et al. "An Approach for Focused Crawler to Harvest Digital Academic Documents in Online Digital Libraries." IJIRR vol.9, no.3 2019: pp.23-47. http://doi.org/10.4018/IJIRR.2019070103

APA

Gupta, S., Duhan, N., & Bansal, P. (2019). An Approach for Focused Crawler to Harvest Digital Academic Documents in Online Digital Libraries. International Journal of Information Retrieval Research (IJIRR), 9(3), 23-47. http://doi.org/10.4018/IJIRR.2019070103

Chicago

Gupta, Sumita, Neelam Duhan, and Poonam Bansal. "An Approach for Focused Crawler to Harvest Digital Academic Documents in Online Digital Libraries," International Journal of Information Retrieval Research (IJIRR) 9, no.3: 23-47. http://doi.org/10.4018/IJIRR.2019070103

Abstract

With the rapid growth of digital information and user needs, it has become imperative to quickly retrieve relevant domain- or topic-specific documents in response to a user query. A focused crawler plays a vital role in digital libraries: it crawls the web so that researchers can easily explore a domain-specific list of search results and find the desired content for their query. This article proposes a focused crawler for online digital library search engines that uses the metadata of the query to retrieve the corresponding document, or other relevant but missing information (e.g., paid publications from ACM, IEEE, etc.), for the user query. Different query strategies are built from the metadata and submitted to different search engines in order to find more of the relevant information that is missing. The results returned by these search engines are filtered and then used for further crawling of the Web.
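
The pipeline outlined in the abstract (build queries from metadata, submit them to several search engines, filter the results, hand the survivors to the crawler) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The names `Metadata`, `Hit`, `build_queries`, `collect_seeds`, the particular query strategies, the `filetype:pdf` operator, the title-similarity filter, and the 0.8 threshold are all assumptions made for illustration. Search engines are passed in as plain callables, so no real search API is assumed.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Callable, Iterable


@dataclass(frozen=True)
class Metadata:
    """Hypothetical metadata record for the document being searched for."""
    title: str
    authors: tuple[str, ...] = ()
    year: str = ""


@dataclass(frozen=True)
class Hit:
    """Hypothetical search result: a title and the URL it points to."""
    title: str
    url: str


def build_queries(meta: Metadata) -> list[str]:
    """Derive several query strings from the metadata (assumed strategies)."""
    queries = [f'"{meta.title}"']  # exact-title query
    if meta.authors:
        queries.append(f'"{meta.title}" {meta.authors[0]}')  # title + first author
    if meta.year:
        queries.append(f"{meta.title} {meta.year}")  # title + year
    # Assumption: the target engine understands a filetype operator.
    queries.append(f"{meta.title} filetype:pdf")
    return queries


def title_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def collect_seeds(
    meta: Metadata,
    engines: Iterable[Callable[[str], Iterable[Hit]]],
    threshold: float = 0.8,  # assumed cut-off, not taken from the article
) -> list[str]:
    """Submit every query to every engine and keep URLs whose result title
    is close enough to the requested title. The returned URLs are the
    candidates a crawler would visit next."""
    engines = list(engines)
    seen: set[str] = set()
    seeds: list[str] = []
    for query in build_queries(meta):
        for search in engines:
            for hit in search(query):
                if hit.url in seen:
                    continue  # each URL is judged only once
                seen.add(hit.url)
                if title_similarity(hit.title, meta.title) >= threshold:
                    seeds.append(hit.url)
    return seeds


PAPER_TITLE = (
    "An Approach for Focused Crawler to Harvest Digital Academic "
    "Documents in Online Digital Libraries"
)


def fake_engine(query: str) -> list[Hit]:
    """Stand-in for a real search engine; returns canned results."""
    return [
        Hit(PAPER_TITLE, "https://example.org/paper.pdf"),
        Hit("Unrelated page", "https://example.org/other"),
    ]


demo_meta = Metadata(PAPER_TITLE, ("Sumita Gupta",), "2019")
demo_seeds = collect_seeds(demo_meta, [fake_engine])
```

Here `fake_engine` stands in for a real search back end, and `demo_seeds` holds only the URL whose title matches the requested paper. In practice each callable would wrap a different search service, and the filter could use more of the metadata than the title alone.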
