Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Word Sense Disambiguation

Pushpak Bhattacharyya, Mitesh Khapra

Source Title: Emerging Applications of Natural Language Processing: Concepts and New Research

DOI: 10.4018/978-1-4666-2169-5.ch002

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

This chapter discusses the basic concepts of Word Sense Disambiguation (WSD) and the approaches to solving this problem. Both general purpose WSD and domain specific WSD are presented. The first part of the discussion focuses on existing approaches for WSD, including knowledge-based, supervised, semi-supervised, unsupervised, hybrid, and bilingual approaches. The accuracy value for general purpose WSD as the current state of affairs seems to be pegged at around 65%. This has motivated investigations into domain specific WSD, which is the current trend in the field. In the latter part of the chapter, we present a greedy neural network inspired algorithm for domain specific WSD and compare its performance with other state-of-the-art algorithms for WSD. Our experiments suggest that for domain-specific WSD, simply selecting the most frequent sense of a word does as well as any state-of-the-art algorithm.

Chapter Preview

Top

1. Introduction

Word Sense Disambiguation (WSD) is the problem of finding the correct sense (i.e., meaning) of a word by looking at the context in which it appears. It is one of the central challenges in NLP and is ubiquitous across all languages. Almost every language that we know has polysemy (poly means “many” and semy means “signs” or “meanings”) to a certain degree. For example, consider the two different meanings of the word bank in English:

I am going to the bank to withdraw money.

I am going to take a walk along the river bank.

In the first sentence, the word ‘bank’ refers to a “financial institution” whereas in the second sentence it refers to a “sloping land beside a water body (river, in this case).” When a human being reads the first sentence, he sees the words “withdraw” and “money” in the context and uses his world knowledge to decide that the word bank here refers to a “financial institution.” Similarly, he sees the word “river” in the second sentence and easily infers that the word bank here refers to a “sloping land near the river.” Identifying the correct meaning of a word can serve as a building block for many Natural Language Processing (NLP) tasks, such as Information Retrieval (IR), Machine Translation (MT), Information Extraction (IE), and more recently for Subjectivity and Sentiment Analysis. In IR, WSD can help in identifying the correct sense of a word in the query and thereby improve the precision of the results fetched (Harman, 2005). In MT, identifying the correct sense of a word in the source language can help in selecting its appropriate translation in the target language (Carpuat & Wu, 2007). Similarly, in IE, knowing the correct sense of every word in a document may help in doing an accurate analysis of the text. More recently, Balamurali et al. (2011) have shown that WSD can help in improving the performance of document level sentiment classifiers.

The above-mentioned applications of WSD suggest that distinguishing between different senses of a word is indeed important, but, how do we train a machine to acquire the necessary world knowledge required to perform such distinction or how do we even make a machine aware that a word has such multiple senses or meanings? The first question brings out the hardness of the problem and it is a commonly accepted notion that WSD is an AI-complete problem, i.e. it is as hard as any other AI problem (Navigli, 2009). In fact, several studies (Snyder & Palmer, 2004) have shown that WSD is a hard problem even for human beings. Specifically, these studies have shown that given the task of assigning senses to a large set of words by looking at their context, the agreement in the senses assigned by two humans is only 78%. Considering the difficulty of the task, its importance, and its ubiquitous nature, much work has been done in this area. In this chapter, we describe some of the popular algorithms, which have been proposed to perform WSD and highlight that in some specific conditions, such as when the corpus is restricted to a specific domain, it is possible to achieve near human performance on WSD.

The second question, i.e., “how do we make a machine aware of the different senses of a word” brings us to the concept of a sense repository or a knowledge base. A sense repository is a lexical resource which lists down the different senses of a word. The most popular sense repository used for WSD is WordNet (Fellbaum, 1998) which is a hierarchical lexical database where the basic unit of storage is a synset (short for synonymy set). As the name suggests, each synset contains a set of words, which together define a concept. From now on, we use the words synset and sense interchangeably. In addition to storing the gloss, examples and members for each synset, a wordnet also stores semantic relations between the synsets, e.g., hypernymy/hyponymy (IS-A), holonymy/meronymy (PART-OF), troponymy (TYPE-OF), etc. Below, we give examples of two synsets from the English wordnet along with their relations.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Word Sense Disambiguation

Abstract

1. Introduction

Complete Chapter List