Acoustic Modeling of Speech Signal using Artificial Neural Network: A Review of Techniques and Current Trends

Acoustic Modeling of Speech Signal using Artificial Neural Network: A Review of Techniques and Current Trends

Mousmita Sarma, Kandarpa Kumar Sarma
ISBN13: 9781522501596|ISBN10: 1522501592|EISBN13: 9781522501602
DOI: 10.4018/978-1-5225-0159-6.ch008
Cite Chapter Cite Chapter

MLA

Sarma, Mousmita, and Kandarpa Kumar Sarma. "Acoustic Modeling of Speech Signal using Artificial Neural Network: A Review of Techniques and Current Trends." Psychology and Mental Health: Concepts, Methodologies, Tools, and Applications, edited by Information Resources Management Association, IGI Global, 2016, pp. 196-212. https://doi.org/10.4018/978-1-5225-0159-6.ch008

APA

Sarma, M. & Sarma, K. K. (2016). Acoustic Modeling of Speech Signal using Artificial Neural Network: A Review of Techniques and Current Trends. In I. Management Association (Ed.), Psychology and Mental Health: Concepts, Methodologies, Tools, and Applications (pp. 196-212). IGI Global. https://doi.org/10.4018/978-1-5225-0159-6.ch008

Chicago

Sarma, Mousmita, and Kandarpa Kumar Sarma. "Acoustic Modeling of Speech Signal using Artificial Neural Network: A Review of Techniques and Current Trends." In Psychology and Mental Health: Concepts, Methodologies, Tools, and Applications, edited by Information Resources Management Association, 196-212. Hershey, PA: IGI Global, 2016. https://doi.org/10.4018/978-1-5225-0159-6.ch008

Export Reference

Mendeley
Favorite

Abstract

Acoustic modeling of the sound unit is a crucial component of Automatic Speech Recognition (ASR) system. This is the process of establishing statistical representations for the feature vector sequences for a particular sound unit so that a classifier for the entire sound unit used in the ASR system can be designed. Current ASR systems use Hidden Markov Model (HMM) to deal with temporal variability and Gaussian Mixture Model (GMM) for acoustic modeling. Recently machine learning paradigms have been explored for application in speech recognition domain. In this regard, Multi Layer Perception (MLP), Recurrent Neural Network (RNN) etc. are extensively used. Artificial Neural Network (ANN)s are trained by back propagating the error derivatives and therefore have the potential to learn much better models of nonlinear data. Recently, Deep Neural Network (DNN)s with many hidden layer have been up voted by the researchers and have been accepted to be suitable for speech signal modeling. In this chapter various techniques and works on the ANN based acoustic modeling are described.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.