A Study on Effective Measurement of Search Results from Search Engines

Jin Zhang, Xin Cai, Taowen Le, Wei Fei, Feicheng Ma
Copyright: © 2019 | Pages: 26
DOI: 10.4018/JGIM.2019010110

Abstract

As Internet technology continues to change and improve lives and societies worldwide, effective global information management becomes increasingly critical, and effective Internet information retrieval systems become increasingly significant in providing Internet users worldwide with accurate and complete information. Search engine evaluation is an important research field because search engines directly determine the quality of the information that users obtain from Internet searches. The relevance-decrease pattern (or model) plays an important role in search engine result evaluation. This research studies effective measurement of search results by investigating the relevance-decrease patterns of search results from two popular search engines: Google and Bing. The findings can be applied to the relevance evaluation of search results from other information retrieval systems such as online public access catalogs (OPACs), can help make search engine evaluations more accurate and sound, and can provide global information management personnel with valuable insights.

1. Introduction

As more and more people worldwide depend on the Internet to fulfill their information needs (Khatwani & Srivastava, 2017), and as the impact of the Internet on people and societies has become increasingly profound (Teo, 2007; Lane et al., 2017), researchers throughout the world have studied the factors that maximize the success of information technology implementations and global information management (Roztocki & Weistroffer, 2011; Lee et al., 2014; Caprio et al., 2015; Hung et al., 2016; Silic & Back, 2016; Soja, 2016; Chatterjee et al., 2017). One such technological implementation is the use of search engines. Because search engines play a critical role in bridging Internet information resources and information users, it is particularly important to evaluate their effectiveness through effective measurement of their search results: different search engines use different retrieval and ranking algorithms and therefore respond to the same search query with different search results.

Average Internet searchers tend to treat the search results presented by a search engine as a list of decreasing relevance, and they tend to browse only the first 20-30 items on a results list. Moreover, business intelligence systems also seem to base many of their decisions on the results returned by Internet search engines. If the most relevant results are not properly positioned on the result list, important information may be missed and decisions may be impaired. Therefore, precise relevance ranking of the items returned by search engines is extremely important.

However, because the Web is an ever-changing and extremely heterogeneous data collection (Jansen & Pooch, 2001), Web page ranking algorithms have become very complicated and dynamic (Dean, 2016; Barysevich, 2017). The ranking algorithms of different search engines also handle variables differently. Consequently, the degree of search result relevance varies from search engine to search engine. Ideally, if all returned items are ranked by relevance to the search query, and the ranked data are plotted in a two-dimensional chart where the X-axis represents the rank position of each item and the Y-axis represents its relevance score, then a declining curve appears. Understanding this downward curve is critical to evaluating the quality of search results because it serves as a yardstick for measuring the relevance of a search engine's results.
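The declining curve described above can be illustrated with a minimal Python sketch. It assumes that each rank position has been scored by several judges and that the per-position scores are averaged; the function names, the 0-4 scale, and the sample values are hypothetical and are not taken from the study.

```python
from statistics import mean


def relevance_curve(judgments):
    """Average the relevance scores given to each rank position.

    judgments: dict mapping rank position (1 = top result) to a list of
    relevance scores, one score per judge.
    Returns a list of (rank, mean_score) pairs sorted by rank.
    """
    return [(rank, mean(scores)) for rank, scores in sorted(judgments.items())]


def is_non_increasing(curve):
    """True if the mean relevance never rises from one rank to the next."""
    scores = [score for _, score in curve]
    return all(a >= b for a, b in zip(scores, scores[1:]))


# Hypothetical judgments on a 0-4 scale; not data from the study.
sample = {
    1: [4, 4, 3],
    2: [3, 4, 3],
    3: [3, 2, 3],
    4: [2, 2, 1],
    5: [1, 2, 1],
}
curve = relevance_curve(sample)
```

Plotting the pairs in `curve` with rank on the X-axis and mean score on the Y-axis would give the kind of chart the paragraph describes. For a real search engine the curve need not be strictly monotonic, so `is_non_increasing` is only a simple check on the idealized case.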

The primary purpose of this study is to explore effective measurement of search results by investigating the relevance-decrease patterns of search results from two major search engines: Google and Bing. To this end, four domain categories were defined, and 24 search queries, six from each category, were formulated and submitted to both Google and Bing. The retrieved results were then collected, and their relevance was judged independently by 32 subjects. A group of candidate regression models was developed for regression analysis, and the performance of each model was tested. The best-fit regression model was identified through ANOVA. The findings help people better understand the relevance-decrease patterns of search results produced by search engines, and the best-fit regression model identified in this study provides a way to evaluate the relevance of a search engine's results.
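The model-comparison step can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the preview does not name the candidate models, so linear, logarithmic, and exponential curves are assumed here, and the coefficient of determination (R²) is used as a simple stand-in for the ANOVA-based selection the study reports. All function names and sample values are hypothetical.

```python
import math


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def r_squared(ys, preds):
    """Coefficient of determination on the original relevance scale."""
    my = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - my) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


def compare_models(ranks, scores):
    """Fit three assumed candidate curves and return {name: R^2}."""
    results = {}
    # Linear: score = a + b * rank
    a, b = fit_line(ranks, scores)
    results["linear"] = r_squared(scores, [a + b * r for r in ranks])
    # Logarithmic: score = a + b * ln(rank)
    logs = [math.log(r) for r in ranks]
    a, b = fit_line(logs, scores)
    results["logarithmic"] = r_squared(scores, [a + b * x for x in logs])
    # Exponential: score = exp(a + b * rank), fitted on ln(score);
    # this transform requires every score to be positive.
    a, b = fit_line(ranks, [math.log(s) for s in scores])
    results["exponential"] = r_squared(
        scores, [math.exp(a + b * r) for r in ranks]
    )
    return results


# Hypothetical mean relevance scores for ranks 1..10; not data from the study.
ranks = list(range(1, 11))
scores = [4.0 - 1.2 * math.log(r) for r in ranks]
fits = compare_models(ranks, scores)
best = max(fits, key=fits.get)
```

Because the sample scores are generated from a logarithmic curve, the logarithmic model fits them almost exactly. With real judgment data the ranking of the models could differ, and a formal test such as the ANOVA used in the study would be needed to decide whether the differences are significant.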
