Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Fuzzy Rough Set Based Technique for User Specific Information Retrieval: A Case Study on Wikipedia Data

Nidhika Yadav, Niladri Chatterjee

Source Title: International Journal of Rough Sets and Data Analysis (IJRSDA) 5(4)

DOI: 10.4018/IJRSDA.2018100102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Information retrieval is widely used due to extremely large volume of text and image data available on the web and consequently, efficient retrieval is required. Text information retrieval is a branch of information retrieval which deals with text documents. Another key factor is the concern for a retrieval engine, often referred to as user-specific information retrieval, which works according to a specific user. This article performs a preliminary investigation of the proposed fuzzy rough sets-based model for user-specific text information retrieval. The model improves on the computational time required to compute the approximations compared to classical fuzzy rough set model by using Wikipedia as the information source. The technique also improves on the accuracy of clustering obtained for user specified classes.

Article Preview

Top

Introduction

Information retrieval (IR) (Luhn, 1955) concerns with extraction and ranking of required data from a corpus which is most relevant to a particular user specified query. In a broad sense the corpus can be a collection of text documents under consideration, textually tagged image files, the intranet/internet documents or images. When the data under consideration is in the form of text the process of extraction for text files which are important to a query is called text information retrieval. Text information retrieval is crucial in these days of information abundance. This field has gained immense popularity keeping in view the availability of text in the form of documents, blogs, tweets, product reviews, news articles, official documents in an organization and websites to mention few. Another equally important and growing field is IR of images based on textual queries. This field is growing in both research and industry in various applications and is popularly referred to as content-based information retrieval (Datta et al., 2017). Though a relatively new area as compared to its text counterpart, it is of prime concern keeping in view the cheap availability of cameras in phones, laptops and other digital devices. This lead to rise of image data with textual meta data present in social media applications such as, Instagram, Twitter, Facebook etc.

A large number of developments has been made in the field if IR in last fifty years. Luhn (1955) proposed one of the first IR systems wherein exact match were computed between the query and document. Since then various refined and advanced IR techniques have been proposed and in use in modern day IR systems. Salton and Bukley (1988) in their seminal work described various term weighting techniques that could be used for an efficient IR system. The query and document were represented as vectors and similarity between the query and document was computed as dot product between them. IR has matured now to a more sophisticated system using advanced mathematical and Artificial Intelligence techniques, such as Google PageRank algorithm, Deep Learning concepts.

In an IR system it is the user who plays the key role. Given the same query by two different users an unbiased IR engine shall produce the same results. It is worth noting that two different users imply different interests and choices. As a consequence, it is paramount to take into consideration the personal choices, the interests and habits of a particular user. Hence, there is a necessity to develop a client-side filtering system. This system filters the information retrieved by an IR engine, e.g. Google, Bing, Yahoo to name a few. Ones the data has been retrieved by IR engine filtering is performed at client side. The present work is concerned with the process of filtering the text data Information Retrieval which is specific to a particular user. Text IR henceforth will be referred to as IR in this paper. User-specific IR can improve on the search results significantly since not all information retrieved by a search engine is relevant to all users. This can be illustrated with the help of an example. Consider two different users: one who is into fishing business and other who is a musician. Both these users enter the keyword “bass”. We note that both refer to different contexts even though the keyword is same. The first one refers to the “bass fish” while the second one refers to the instrument “bass.” Once a user specific IR system is built at client side only the information that is relevant to this user shall be retrieved. Hence it is of prime importance to learn users' opinion for an efficient IR system.

Complete Article List

Search this Journal:

Reset

Volume 9: 1 Issue (2025): Forthcoming, Available for Pre-Order

Volume 8: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 7: 4 Issues (2021): 1 Released, 3 Forthcoming

Volume 6: 3 Issues (2019)

Volume 5: 4 Issues (2018)

Volume 4: 4 Issues (2017)

Volume 3: 4 Issues (2016)

Volume 2: 2 Issues (2015)

Volume 1: 2 Issues (2014)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Fuzzy Rough Set Based Technique for User Specific Information Retrieval: A Case Study on Wikipedia Data

Abstract

Introduction

Complete Article List