Exploiting Chi Square Method for Sentiment Analysis of Product Reviews

Exploiting Chi Square Method for Sentiment Analysis of Product Reviews

Nilesh M. Shelke, Shrinivas P. Deshpande
DOI: 10.4018/978-1-7998-0951-7.ch022
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Sentiment analysis is an extension of data mining which employs natural language processing and information extraction task to recognize people's opinion towards entities such as products, services, issues, organizations, individuals, events, topics, and their attributes. It gives the summarized opinion of a writer or speaker. It has received lot of attention due to increasing number of posts/tweets on social sites. The proposed system is meant to classify a given text of review into positive, negative, or the neutral category. Primary objective of this article is to provide a method of exploiting permutation and combination and chi values for sentiment analysis of product reviews. Publicly available freely dictionary SentiWordNet 3.0 has been used for review classification. The proposed system is domain independent and context aware. Another objective of the proposed system is to identify the feature specific intensity with which reviewer has expressed his opinion. Effectiveness of the proposed system has been verified through performance matrix and compared with other research work.
Chapter Preview
Top

Introduction

As the publication of posts/opinion/thoughts/tweets on social sites/blogs/forums became easy, internet sites are flooding with product reviews. Reviews are useful to customers for taking decisions for purchasing, institutes/companies for branding, politicians for making image in the public, etc. This public generated review holds sentiments for various products/persons/movies etc. But their browsing is tedious as they are huge in number. Sentiment Analysis from text is multi-facetted research issue. Even though there are tremendous challenges and many difficulties in sentiment analysis from text; its contribution and applications to industry is not ignorable. Because sentiment analysis bases its results on factors that are so inherently humane, it will become one the major drivers of many business decisions in future. Practical applications of sentiment analysis are huge. Sentiment analysis can be successfully applied to learn user preferences and interests from users’ personal writings. These methods are often studied for consumer feedback analysis. Similarly, e-learning systems can benefit from affective tutoring approaches.

Sentiment Analysis is an extension of data mining which employs Natural Language Processing and Information Extraction task to recognize people’s opinion towards entities such as products, services, issues, organizations, individuals, events, topics and their attributes. Sentiment Analysis analyze each text and detect which part contain opinionated word, which is being opinionated and who has written the opinion. It gives the summarized opinion of a writer or speaker. Sentient analysis can be done at word level, sentence level and document level. In a broader sense, sentiment analysis is focused to detect the attitude of a speaker or a writer with respect to some topic or about overall coverage of a document (Zhao et al., 2015). Sentiment can be defined as

Sentiment:= <oj, fjk, soijkl, hi>, oj is the target: what/whom the sentiment is expressed to. fjk is a feature of the object oj,soijk is the sentiment value of the opinion of the opinion holder hi on feature fjk of object oj; soijkl is +ve,-ve, or neutral, or a more granular rating.hi is holder: who expresses the sentiment (Pawar, 2016).

Identification of sentiments or moods embedded in the (written) natural language is critical and a challenging issue. Researchers are attempting to apply different approaches and techniques and extract useful information from this data.

As such demand of sentiment analysis applications is increasing rapidly. Primary tasks in sentiment analysis is finding the “aspects” or “features”, finding their corresponding sentiments, identifying the intensity of the sentiments and finally calculating the overall opinion of the reviewer. Consider some actual reviews from the dataset.

  • 1.

    The bread is top notch as well.

  • 2.

    In fact, this was not a Nicosia salad and was barely eatable.

  • 3.

    I have to say they have one of the fastestdelivery times in the city.

  • 4.

    The food is always fresh and hot - ready to eat!

  • 5.

    Did I mention that the coffee is outstanding?

  • 6.

    Certainly not the best sushi in New York, however, it is always freshfood, and the place is very clean, sterile.

In the above example reviews, underlined terms are the product features and italic terms indicates the sentiments about those features termed as sentiment features.

Complete Chapter List

Search this Book:
Reset