Reference Hub5
On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms

On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms

Claudio Gutiérrez-Soto, Gilles Hubert
Copyright: © 2015 |Volume: 6 |Issue: 2 |Pages: 21
ISSN: 1947-8186|EISSN: 1947-8194|EISBN13: 9781466678583|DOI: 10.4018/IJISMD.2015040103
Cite Article Cite Article

MLA

Gutiérrez-Soto, Claudio, and Gilles Hubert. "On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms." IJISMD vol.6, no.2 2015: pp.72-92. http://doi.org/10.4018/IJISMD.2015040103

APA

Gutiérrez-Soto, C. & Hubert, G. (2015). On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms. International Journal of Information System Modeling and Design (IJISMD), 6(2), 72-92. http://doi.org/10.4018/IJISMD.2015040103

Chicago

Gutiérrez-Soto, Claudio, and Gilles Hubert. "On The Reuse of Past Searches in Information Retrieval: Study of Two Probabilistic Algorithms," International Journal of Information System Modeling and Design (IJISMD) 6, no.2: 72-92. http://doi.org/10.4018/IJISMD.2015040103

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

When using information retrieval systems, information related to searches is typically stored in files, which are well known as log files. By contrast, past search results of previously submitted queries are ignored most of the time. Nevertheless, past search results can be profitable for new searches. Some approaches in Information Retrieval exploit the previous searches in a customizable way for a single user. On the contrary, approaches that deal with past searches collectively are less common. This paper deals with such an approach, by using past results of similar past queries submitted by other users, to build the answers for new submitted queries. It proposes two Monte Carlo algorithms to build the result for a new query by selecting relevant documents associated to the most similar past query. Experiments were carried out to evaluate the effectiveness of the proposed algorithms using several dataset variants. These algorithms were also compared with the baseline approach based on the cosine measure, from which they reuse past results. Simulated datasets were designed for the experiments, following the Cranfield paradigm, well established in the Information Retrieval domain. The empirical results show the interest of our approach.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.