Building Language Resources for Emotion Analysis in Bengali

Dipankar Das, Sivaji Bandyopadhyay

Source Title: Technical Challenges and Design Issues in Bangla Language Processing

DOI: 10.4018/978-1-4666-3970-6.ch016

Abstract

The rapid growth of Web users from multilingual communities has drawn attention to improving multilingual search engines on the basis of sentiment or emotion, and it creates opportunities to build resources for languages other than English. At present, no such corpus or lexicon is available for emotion analysis in Indian languages, especially for Bengali, the sixth most popular language in the world, the second most popular in India, and the national language of Bangladesh. In this chapter, the authors therefore describe the preparation of an emotion corpus and lexicon in Bengali. The emotion lexicon, termed Bengali WordNet Affect, has been developed from its English equivalent through the steps of expansion, translation, and sense disambiguation. In addition to the emotion lexicon, a Bengali blog corpus for emotion analysis has been developed, in which manual annotators marked detailed linguistic expressions such as emotional phrases, intensities, emotion holder, emotion topic and target span, and sentential emotion tags.

Chapter Preview

Introduction

In recent times, research on opinion, sentiment, and emotion in natural language texts and other media has been gaining ground under the umbrella of subjectivity analysis and affective computing.

Subjectivity analysis is defined as classifying a given text (usually a sentence) into one of two classes, objective or subjective, whereas affective computing is an area of artificial intelligence that focuses on how emotion is expressed, perceived, recognized, processed, and interpreted in text, speech, dialogue, image, video, etc. Text-based emotion analysis relies heavily on Natural Language Processing (NLP), which is mostly focused on understanding the semantics of text. By analyzing texts and obtaining semantic as well as emotional information, the computer can deal with more interpersonal matters, such as understanding the relationships between people. Both affective computing and NLP are needed to reach this goal. NLP algorithms are necessary to understand the semantics, or explicit message, of a text, while affective computing is needed to understand the implicit message manifested through emotion (Minato et al., 2008).

Identifying the emotional state expressed in a text is not an easy task, as emotion is not open to any objective observation or verification (Quirk et al., 1985). Genuine opinion, emotion and sentiment are hard to collect, ambiguous to annotate, and tricky to distribute for privacy reasons. Different forms of modeling exist, and ground truth is never solid because the annotators, usually very few in number, often perceive the same text very differently. Thus, the few available corpora suffer from a number of issues that stem from the peculiarities of these young and emerging fields.

To obtain knowledge and information from emotional text, it is necessary to have reliable linguistic resources, such as tagged emotion corpora and emotion dictionaries. As the study of emotion recognition combined with natural language processing is rather new, such linguistic resources are still difficult to obtain.

Among social media such as e-mail, Weblogs, chat rooms, online forums and even Twitter, the blog is one of the more communicative and informative repositories of text-based emotional content in Web 2.0 (Lin et al., 2007). Thus, we have prepared an emotion-annotated corpus from Bengali blog documents.

The proposed corpus annotation task was carried out at the sentence and document levels. Three annotators manually annotated the blog sentences, which were retrieved from an open source Bengali Web blog archive (www.amarblog.com). Ekman’s (1993) six basic emotion classes (anger, disgust, fear, happy, sad and surprise) were considered for the task. Emotional sentences were annotated with one of three intensities (high, medium or low), and sentences of the non-emotional (neutral) and multiple (mixed) categories were also identified. Emotional words and phrases were marked by fixing the lexical scope of the emotional expressions, and each emoticon was also treated as an individual emotional expression. The emotion holder and the relevant topics associated with the emotional expressions were annotated by considering punctuation marks, conjuncts, rhetorical structures and other discourse information; knowledge of the rhetorical structure helps in removing subjective discrepancies from the writer’s point of view.

The annotation scheme was used to annotate 123 blog posts containing 4,740 emotional sentences with a single emotion tag, 322 emotional sentences with mixed emotion tags, and 7,087 neutral sentences in Bengali. Three standard agreement measures, Cohen’s kappa (κ) (Cohen, 1960), Measure of Agreement on Set-valued Items (MASI) (Passonneau, 2004) and agr (Wiebe et al., 2005), were employed for the annotated emotion-related components. The relaxed agreement schemes, MASI and agr, were considered specifically for fixing the lexical boundaries of emotional expressions and topics in the emotional sentences. Inter-annotator agreement was satisfactory for some emotional components, such as sentential emotions, holders and topics, whereas sentences of mixed emotion and of medium and low intensity showed disagreement. A preliminary experiment on word-level emotion classification, run on a small subset of the corpus, yielded satisfactory results.
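To make the three agreement measures named above concrete, the following is a minimal Python sketch using their commonly cited definitions. It is not the chapter's implementation. The function names, the representation of an emotional expression as a set of token positions (for MASI) or as half-open character spans (for agr), and the toy labels are all assumptions made for illustration. In particular, treating any overlap between two spans as a match in agr is an assumption about the matching criterion.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: share of items given the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each annotator's label distribution.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # convention: both annotators used one identical label
    return (observed - expected) / (1 - expected)


def masi(set_a, set_b):
    """MASI similarity: Jaccard overlap scaled by a monotonicity weight."""
    a, b = set(set_a), set(set_b)
    if not a and not b:
        return 1.0
    inter = a & b
    jaccard = len(inter) / len(a | b)
    if a == b:
        weight = 1.0        # identical sets
    elif a <= b or b <= a:
        weight = 2 / 3      # one set contains the other
    elif inter:
        weight = 1 / 3      # partial overlap only
    else:
        weight = 0.0        # disjoint sets
    return jaccard * weight


def agr(spans_a, spans_b):
    """Directional agr(a || b): share of A's spans overlapped by a span of B."""
    if not spans_a:
        return 0.0

    def overlaps(s, t):
        # Spans are half-open (start, end) pairs; assumed matching rule.
        return s[0] < t[1] and t[0] < s[1]

    matched = sum(any(overlaps(s, t) for t in spans_b) for s in spans_a)
    return matched / len(spans_a)


# Toy sentence-level emotion tags from two hypothetical annotators.
tags_1 = ["happy", "sad", "sad", "anger"]
tags_2 = ["happy", "sad", "happy", "anger"]
kappa = cohen_kappa(tags_1, tags_2)

# Token positions each annotator marked as one emotional expression.
boundary_score = masi({1, 2, 3}, {2, 3})

# Character spans marked by each annotator; agr is not symmetric.
forward = agr([(0, 3), (5, 8)], [(2, 4)])
backward = agr([(2, 4)], [(0, 3), (5, 8)])
```

Kappa rewards only exact label matches, which suits sentence-level emotion tags. MASI and agr give partial credit when annotators mark overlapping but not identical boundaries, which is why relaxed measures of this kind are typically applied to span-like components such as emotional expressions and topics.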
