Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Building CLIA for Resource-Scarce African Languages: A Case Study on Oromo—English CLIR

Kula Kekeba Tune, Vasudeva Varma

Source Title: Information Retrieval and Management: Concepts, Methodologies, Tools, and Applications

DOI: 10.4018/978-1-5225-5191-1.ch048

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Since most of the existing major search engines and commercial Information Retrieval (IR) systems are primarily designed for well-resourced European and Asian languages, they have paid little attention to the development of Cross-Language Information Access (CLIA) technologies for resource-scarce African languages. This paper presents the authors' experience in building CLIA for indigenous African languages, with a special focus on the development and evaluation of Oromo-English-CLIR. The authors have adopted a knowledge-based query translation approach to design and implement their initial Oromo-English CLIR (OMEN-CLIR). Apart from designing and building the first OMEN-CLIR from scratch, another major contribution of this study is assessing the performance of the proposed retrieval system at one of the well-recognized international Cross-Language Evaluation Forums like the CLEF campaign. The overall performance of OMEN-CLIR was found to be very promising and encouraging, given the limited amount of linguistic resources available for severely under-resourced African languages like Afaan Oromo.

Chapter Preview

Top

Introduction

As we move towards an increasingly globalized and knowledge-based economy, the ability to instantly access and share relevant information (Baeza-Yates & Ribeiro-Neto, 1999; Gey, Kando, & Peters, 2005; Nie, 2010) beyond language and cultural boundaries has become more and more crucial. The World Wide Web (WWW) contains massive volumes of multilingual and multimedia information resources that can be explored and exploited to address critical social and economic problems. Unfortunately, in developing and culturally diverse regions like Africa and Asia, the accessibility and usability of online resources are severely constrained by formidable obstacles and challenges such as language barriers, linguistic digital divide and lack of robust CLIA systems (Adegbola, 2009; Gasser, 2006; Varma, Tune, & Pingali, 2007). As pointed out by (Georg & Hans, ‎2013; Oard & Diekema, 1998; Peters, Braschler, & Clough, 2012), language barriers and linguistic digital divide have continued to threaten and undermine the potential of the Internet to deliver universal and equitable access to online information resources and services. This is especially true in highly multicultural developing nations like Ethiopia and India.

Broadly speaking, language barriers can be defined as linguistic and cultural factors that impede the free flow of information across language boundaries. In this article, the term language barriers is more specifically used to describe linguistic and cultural obstacles that discourage or prevent users from seeking and sharing important information across different languages and cultures. Even though the term linguistic digital divide is closely associated with language barriers, it is often used to describe the disparity in technological development between different languages (Gasser, 2006; Scannell, 2007). While the term digital divide is generally used to describe the gap in accessing and using computing devices among various social groups, the term linguistic digital divide is more specifically used to describe the relative advantages of certain languages (or language communities) over the others with respect to modern language resources and information access technologies.

Since most of the existing commercial search engines and Information Retrieval (IR) systems have primarily focused on well-resourced European and Asian languages, they have not paid adequate attention to supporting under-resourced African languages (Adegbola, 2009; Gey, Kando, & Peters, 2005; Osborn, 2010; Pingali, Tune, & Varma, 2008). The need for exploring and developing multilingual information access technologies that permit African communities to search and discover information beyond linguistic and cultural barriers has, therefore, become more urgent today than ever before. In this regard, much attention has been paid to the development of Cross-Language Information Retrieval (CLIR), which is mainly concerned with searching and discovering information beyond language and cultural boundaries (Hedlund, et al., 2004; Nie, 2010). The main purpose of CLIR is to identify documents written in one or more language(s) in response to a query expressed in a different language (Nie, 2010; Peters, Braschler, & Clough, 2012). On the other hand, CLIA deals with much more general and broader issues. CLIA encompasses not only the academic domain of cross-language search or CLIR, but also many aspects of natural language processing and understanding, including text encoding, digitization, content analysis and visualization (Peters, Braschler, & Clough, 2012). In this paper, we use the term CLIA in its narrower sense to refer to the processes of querying, accessing and retrieving information across different languages.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Building CLIA for Resource-Scarce African Languages: A Case Study on Oromo—English CLIR

Abstract

Introduction

Complete Chapter List