Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Feature Selection for GUMI Kernel-Based SVM in Speech Emotion Recognition

Imen Trabelsi, Med Salim Bouhlel

Source Title: Artificial Intelligence: Concepts, Methodologies, Tools, and Applications

DOI: 10.4018/978-1-5225-1759-7.ch038

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Speech emotion recognition is the indispensable requirement for efficient human machine interaction. Most modern automatic speech emotion recognition systems use Gaussian mixture models (GMM) and Support Vector Machines (SVM). GMM are known for their performance and scalability in the spectral modeling while SVM are known for their discriminatory power. A GMM-supervector characterizes an emotional style by the GMM parameters (mean vectors, covariance matrices, and mixture weights). GMM-supervector SVM benefits from both GMM and SVM frameworks. In this paper, the GMM-UBM mean interval (GUMI) kernel based on the Bhattacharyya distance is successfully used. CFSSubsetEval combined with Best first algorithm and Greedy stepwise were also utilized on the supervectors space in order to select the most important features. This framework is illustrated using Mel-frequency cepstral (MFCC) coefficients and Perceptual Linear Prediction (PLP) features on two different emotional databases namely the Surrey Audio-Expressed Emotion and the Berlin Emotional speech Database.

Chapter Preview

Top

Introduction

Speech is the natural communication form between humans, provides a great deal of information about speaker, language and emotions. This fact has motivated researchers to find a fast and efficient method of natural interaction between man and machine. Presence of emotions makes speech more natural. This has introduced a relatively new research area, namely speech emotion recognition (SER), which is defined as extracting the emotional state of a speaker from his or her speech. This challenging task has several applications in day-to-day life like agent-customer interactions, call-center applications (Herm, 2008), web movies, on- board car driving systems (Hu et al., 2013), medical diagnostic tool and E-tutoring systems (Trabelsi & Bouhlel, 2016a). As in any pattern recognition problem, the performance of emotion recognition from speech depends on label, organization, representation, and evaluation of training data. A significant challenge for emotional research depends on a sense of what emotion is and is in finding appropriate emotional labels. Three labeling methods can be distinguished: (1) categorical approach, (2) dimensional approach, and (3) appraisal-based approach (Cowie & McKeown & Douglas-Cowie, 2012; Hudlicka, 2011). In the first one, emotion is described as a discrete class that differs explicitly and mutually exclusive from one emotion to another. In the second one, emotion is described as a continuous process that will changes dynamically over time, using the multi-dimensional emotion model. However, the appraisal approach, introduces the role of time into the comprehension of emotions (Mortillaro & Meuleman & Scherer, 2012; De Vries, 2015). A critical research challenge in speech emotion recognition systems is to how to encode the spoken emotion by some suitable features (Maji et al., 2015; Saba et al., 2016). This step, called feature extraction, is of a great importance in SER. However, having a large number of potential features increases the complexity of the system and normally results in longer system training times. Therefore, a popular approach is to start with a larger set of features and then removes irrelevant data to reduce dimensionality of the training data and generate a more compact and robust feature set. Another important issue in the evaluation of an emotional speech system is the choice of emotional corpus. The existing emotional databases could be divided into three classes namely: simulated (actor), elicited (induced) and spontaneous (natural) speech databases. For more detailed description, the reader may refer to (Koolagudi& Rao, 2012).

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Feature Selection for GUMI Kernel-Based SVM in Speech Emotion Recognition

Abstract

Introduction

Complete Chapter List