Reference Hub1
Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading

Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading

Tsuyoshi Miyazaki, Toyoshiro Nakashima, Naohiro Ishii
Copyright: © 2013 |Volume: 1 |Issue: 1 |Pages: 12
ISSN: 2166-7160|EISSN: 2166-7179|EISBN13: 9781466631878|DOI: 10.4018/ijsi.2013010102
Cite Article Cite Article

MLA

Miyazaki, Tsuyoshi, et al. "Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading." IJSI vol.1, no.1 2013: pp.14-25. http://doi.org/10.4018/ijsi.2013010102

APA

Miyazaki, T., Nakashima, T., & Ishii, N. (2013). Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading. International Journal of Software Innovation (IJSI), 1(1), 14-25. http://doi.org/10.4018/ijsi.2013010102

Chicago

Miyazaki, Tsuyoshi, Toyoshiro Nakashima, and Naohiro Ishii. "Mouth Shape Detection Based on Template Matching and Optical Flow for Machine Lip Reading," International Journal of Software Innovation (IJSI) 1, no.1: 14-25. http://doi.org/10.4018/ijsi.2013010102

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

The authors describe an improved method for detecting distinctive mouth shapes in Japanese utterance image sequences. Their previous method uses template matching. Two types of mouth shapes are formed when a Japanese phone is pronounced: one at the beginning of the utterance (the beginning mouth shape, BeMS) and the other at the end (the ending mouth shape, EMS). The authors’ previous method could detect mouth shapes, but it misdetected some shapes because the time period in which the BeMS was formed was short. Therefore, they predicted that a high-speed camera would be able to capture the BeMS with higher accuracy. Experiments showed that the BeMS could be captured; however, the authors faced another problem. Deformed mouth shapes that appeared in the transition from one shape to another were detected as the BeMS. This study describes the use of optical flow to prevent the detection of such mouth shapes. The time period in which the mouth shape is deformed is detected using optical flow, and the mouth shape during this time is ignored. The authors propose an improved method of detecting the BeMS and EMS in Japanese utterance image sequences by using template matching and optical flow.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.