Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Information Retrieval (IR) and Extracting Associative Rules

Asmae Dami, Mohamed Fakir, Belaid Bouikhalene

Source Title: Journal of Information Technology Research (JITR) 7(4)

DOI: 10.4018/jitr.2014100104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

This paper is located in the intersection of two research themes, namely: Information Retrieval and Knowledge Discovery from texts (Text mining). The purpose of this paper is two-fold: first, it focuses on Information Retrieval (IR) whose purpose is to implement a set of models and systems for selecting a set of documents satisfying user needs in terms of information expressed as a query. An information retrieval system is composed mainly of two processes the representation and retrieval process. The process of representation is called indexing, which allows representation of documents and queries by descriptors, or indexes. These descriptors reflect the contents of documents. The retrieval process consists on the comparison between documents representations and query representation. The second aim of this paper is to discover the relationships between terms (keywords) descriptors of documents in a document database. The correlations (relationships) between terms are extracted by using a technique of the Text mining, mainly association rules.

Article Preview

Top

1. Introduction

Information plays a vital role in today's information society and we are witnessing an unprecedented explosion of its volume and its potential users. This rapid increase in the volume of information has created the problem of how to find information that interests us in this great mass of information. To address this problem a whole discipline was born. This discipline is called Information Retrieval (IR). Indeed, the main objective in the field of IR is to provide models, techniques and systems for storing and organizing masses of information and select those that respond to a user query. In general, a process of Information retrieval based on two basic steps, namely:

1. Indexing is a very important step in the process of IR. It is to identify and extract representative terms from the content of a document or query, which cover the most of their semantic content.
2. The step of selecting the relevant information consists of matching the descriptors extracted by the step of indexing with the descriptors of user query, in order to identify the information that respond to the needs of user query.

The information obtained by structuring a textual corpus (extraction of representative terms of document content) is only one facet of the implicit knowledge contained in a corpus. For this reason, one of the goals of text mining is to propose techniques for the extraction of implicit information in the document database.

One of the branches of text mining is concerned with the implications that describe the different correlations between the terms in the documents. These implications are called association rules.

The main problem of this paper is to extract from a textual corpus a set of useful knowledge for information retrieval system.

We defined two major objectives for extracting knowledge from textual corpus.

• The first objective is to study the information retrieval system (IRS), its functioning, its models and techniques used for evaluating information retrieval system.
• The second objective is to extract relationships between the representative terms of informational content of the corpus (index terms) using association rules.

This paper is organized as follows. The first section provides an introduction to the field of information retrieval. First, we introduce the research process, indexing process, models of IR and evaluation of information retrieval system. The second section is devoted to the presentation of the association rules. In the third section, we will introduce the concept of extracting knowledge from texts. Then, we describe a method for extracting associations between terms from a textual indexed collection using the technique of association rules. The last section will be devoted to the experimental part which we will describe the main features of our application illustrated with screenshots.

Top

2. Information Retrieval, Basic Concepts And Models

The Information Retrieval (IR) (Ricardo & Berthier, 2011; Baziz, 2005) is traditionally defined as a set of techniques to select from a collection of documents, those who are likely to respond to the needs of the user. Manage texts involves storing, retrieving and exploring relevant documents.

The operation of IR (Mooers, 1948) is performed by software tools called information retrieval systems (IRS), whose goal is to find documents that satisfy user needs.

In an Information Retrieval System (IRS), the user expresses his information need as a query. The IRS tries to find all relevant documents and reject the documents that are not relevant. In practice, the set of documents returned by a query for a SRI is composed of a subset of relevant documents and a subset of irrelevant documents. These subsets determine the performance of an SRI (Karbasi, 2007; Harrathi, 2009).

Complete Article List

Search this Journal:

Reset

Volume 16: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 15: 6 Issues (2022): 1 Released, 5 Forthcoming

Volume 14: 4 Issues (2021)

Volume 13: 4 Issues (2020)

Volume 12: 4 Issues (2019)

Volume 11: 4 Issues (2018)

Volume 10: 4 Issues (2017)

Volume 9: 4 Issues (2016)

Volume 8: 4 Issues (2015)

Volume 7: 4 Issues (2014)

Volume 6: 4 Issues (2013)

Volume 5: 4 Issues (2012)

Volume 4: 4 Issues (2011)

Volume 3: 4 Issues (2010)

Volume 2: 4 Issues (2009)

Volume 1: 4 Issues (2008)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Information Retrieval (IR) and Extracting Associative Rules

Abstract

1. Introduction

2. Information Retrieval, Basic Concepts And Models

Complete Article List