Facial Muscle Activity Patterns for Recognition of Utterances in Native and Foreign Language: Testing for its Reliability and Flexibility

Sridhar Arjunan, Dinesh Kant Kumar, Hans Weghorn, Ganesh Naik

Source Title: Cross-Disciplinary Applications of Artificial Intelligence and Pattern Recognition: Advancing Technologies

DOI: 10.4018/978-1-61350-429-1.ch012

Abstract

The need for reliable and flexible human computer interfaces (HCI) has increased, and HCI has found applications in virtually every field. Human factors play an important role in these interfaces, so research and development of new HCI techniques that enhance flexibility and reliability for the user are important. Research on new methods of computer control has focused on three types of body functions: speech, bioelectrical activity, and the use of mechanical sensors. Speech operated systems have the advantage that they provide the user with flexibility, and they have the potential to make computer control effortless and natural. This chapter summarizes research conducted to investigate the use of facial muscle activity as a reliable interface for identifying voiceless speech based commands without any audio signal. System performance and reliability were tested to study inter-subject and inter-day variations and the impact of the speaker's native language. The experimental results indicate that such a system has a high degree of inter-subject and inter-day variation. The results also indicate that variations in speaking style are low when the speaker uses the native language but high when the speaker uses a foreign language, and that such a system is suitable for a very small vocabulary. The authors suggest that speech recognition systems based on facial surface electromyography (sEMG) may find only limited applications.

Chapter Preview

1. Introduction

One bottleneck in technological advancement is the interface between the computer and the user. Until recently, the human computer interface (HCI) was largely restricted to the keyboard and the mouse, but recent advances have led to systems operated by voice, biosignals, and gestures. Speech operated systems have the advantage that they give the user flexibility and draw on a time-tested natural ability. Such systems offer a natural and seamless interface with the potential to make computer control almost effortless, and they can provide a richness comparable to human to human interaction. Their success depends on the robustness of the speech recognition system, which is a complex multidisciplinary research area that includes speech and language processing.

In recent years, significant progress has been achieved in advancing speech recognition technology, making speech an effective modality in both telephony and multimodal human-machine interaction. The technology has become increasingly usable and useful. However, speech recognition is currently largely audio based and suffers from three major shortcomings: (i) it is not suitable in noisy environments such as a vehicle or a factory, (ii) it is not suitable for people with a speech impairment, such as people who have had a stroke, and (iii) it is not suitable for giving discrete commands or when other people may be talking loudly in the vicinity.

Chen (2001) demonstrated that speech based human to human communication is multimodal: along with the audio signal, the listener also observes facial and body gestures. When we speak in noisy environments, or with people with hearing loss, lip and facial movements often compensate for the lack of quality audio (Simpson et al., 1990; Stone et al., 1992). Speech can be identified from lip movement using visual sensing, by sensing the movement and shape with mechanical sensors (Manabe et al., 2003), or by relating the movement and shape to muscle activity (Chan et al., 2002; Kumar et al., 2004). To improve speech classification systems, a number of researchers have proposed the use of facial movements and gestures (Dimberg et al., 1997; Edward et al., 2006; Francis et al., 2002). The proposed systems are based on vision, biosignals, and mechanical sensors, and are generally used along with audio speech recognition systems.

Each of these techniques has strengths and limitations. The video based technique is computationally expensive, requires a camera monitoring the lips that is fixed to the user’s head, and is sensitive to lighting conditions. The sensor based technique has the obvious disadvantage that it requires sensors fixed to the user's face, which makes the system less user friendly. The muscle monitoring systems suffer from low reliability, for two possible reasons: (i) people use different muscles even when they make the same sound, and (ii) cross talk between different muscles makes the signal difficult to classify. Harris (1970) studied these issues extensively and reported that the problems suitable for EMG research can be divided into three classes: first, ‘which muscle’ problems; second, ‘which mechanism’ problems; and third, a more vaguely defined class of problems having to do with the general organization of the speech mechanism. A further difficulty is that each of these systems is user dependent and does not transfer well between users. In this chapter we report the use of recorded facial muscle activity to determine the unspoken command from the user. Even Myoelectric Signal (MES) based systems are heavily influenced by user dependencies, such as style of speaking, rate of speaking, and variation in pronunciation.
