Visual Speech and Gesture Coding Using the MPEG-4 Face and Body Animation Standard

Visual Speech and Gesture Coding Using the MPEG-4 Face and Body Animation Standard

Eric Petajan
Copyright: © 2009 |Pages: 21
ISBN13: 9781605661865|ISBN10: 1605661864|ISBN13 Softcover: 9781616925338|EISBN13: 9781605661872
DOI: 10.4018/978-1-60566-186-5.ch004
Cite Chapter Cite Chapter

MLA

Petajan, Eric. "Visual Speech and Gesture Coding Using the MPEG-4 Face and Body Animation Standard." Visual Speech Recognition: Lip Segmentation and Mapping, edited by Alan Wee-Chung Liew and Shilin Wang, IGI Global, 2009, pp. 128-148. https://doi.org/10.4018/978-1-60566-186-5.ch004

APA

Petajan, E. (2009). Visual Speech and Gesture Coding Using the MPEG-4 Face and Body Animation Standard. In A. Liew & S. Wang (Eds.), Visual Speech Recognition: Lip Segmentation and Mapping (pp. 128-148). IGI Global. https://doi.org/10.4018/978-1-60566-186-5.ch004

Chicago

Petajan, Eric. "Visual Speech and Gesture Coding Using the MPEG-4 Face and Body Animation Standard." In Visual Speech Recognition: Lip Segmentation and Mapping, edited by Alan Wee-Chung Liew and Shilin Wang, 128-148. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-60566-186-5.ch004

Export Reference

Mendeley
Favorite

Abstract

Automatic Speech Recognition (ASR) is the most natural input modality from humans to machines. When the hands are busy or a full keyboard is not available, speech input is especially in demand. Since the most compelling application scenarios for ASR include noisy environments (mobile phones, public kiosks, cars), visual speech processing must be incorporated to provide robust performance. This chapter motivates and describes the MPEG-4 Face and Body Animation (FBA) standard for representing visual speech data as part of a whole virtual human specification. The super low bit-rate FBA codec included with the standard enables thin clients to access processing and communication services over any network including enhanced visual communication, animated entertainment, man-machine dialog, and audio/visual speech recognition.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.