Speech Disorders Recognition using Speech Analysis

Khaled Necibi (University of Annaba, Algeria), Halima Bahi (University of Annaba, Algeria) and Toufik Sari (University of Annaba, Algeria)
DOI: 10.4018/978-1-4666-4422-9.ch024

Abstract

Speech disorders are disabilities that are widespread among young people, although adults may also develop them following physical problems. In this context, Automatic Speech Recognition (ASR) technology can support the detection, and subsequently the correction, of such disabilities. Work on speech disorder detection began in the early 1970s and appears to have followed the same evolution as work on ASR. The earliest approaches relied mainly on signal processing techniques. Systems dealing with speech disorders then progressively incorporated more ideas from ASR technology, in particular Hidden Markov Models, the state-of-the-art approach in ASR systems. This chapter reviews systems that use ASR techniques to evaluate the pronunciation of people who suffer from speech or voice impairments. The authors survey the existing systems and present their main innovations and some of the available resources.

ASR in Speech Impairment Detection

Speech disorders are disabilities that are widespread among young people, although adults may also develop them following physical problems. ASR technology can therefore support the detection, and subsequently the correction, of such disabilities. Early work relied essentially on signal processing techniques. In particular, it was based mainly on computing the fundamental frequency and the harmonics of the signal; the principle of detection then consists of looking for dissimilarities between normal and abnormal speech.
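As an illustration of this signal-processing view, the sketch below estimates the fundamental frequency of a voiced frame and measures how far it deviates from a reference value. It is a minimal example under stated assumptions, not a method taken from the systems reviewed here: the use of autocorrelation, the function names `estimate_f0` and `relative_deviation`, and the search range of 60 to 400 Hz are all illustrative choices.

```python
def estimate_f0(samples, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate the fundamental frequency (Hz) of one voiced frame.

    Assumption: plain autocorrelation peak picking, chosen for
    illustration only. Returns None if no positive peak is found.
    """
    n = len(samples)
    mean = sum(samples) / n
    x = [s - mean for s in samples]          # remove the DC offset

    lag_min = max(1, int(sample_rate / f_max))   # shortest period searched
    lag_max = min(int(sample_rate / f_min), n - 1)  # longest period searched

    best_lag, best_score = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Autocorrelation of the frame with itself shifted by `lag` samples.
        score = sum(x[i] * x[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score

    return sample_rate / best_lag if best_lag else None


def relative_deviation(f0, reference_f0):
    """Relative distance between a measured F0 and a reference F0."""
    return abs(f0 - reference_f0) / reference_f0
```

A frame would typically be 20 to 100 ms of voiced speech. The deviation returned by `relative_deviation` is only one possible dissimilarity measure; a real system would compare several features, harmonics included, against norms for the speaker's age and sex.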

Recently, the need for computer-aided speech therapy systems has increased. Such systems are attracting more attention from researchers because a large number of people suffer from speech impairments. Their main purpose is to provide methods for improving the communication skills of people who suffer from speech or voice disorders.

Among the available systems devoted to speech disorders, we first present Vocaliza, a system developed in the context of the National Project TIN in Spain.

Vocaliza (Vaquero et al., 2008a) is a speech-technology-based application for computer-aided speech therapy in Spanish. The software provides a user interface specially designed to be attractive even to the youngest users. It works on three levels of language: phonological, semantic and syntactic. Each level is trained with a different method, presented as a game in order to attract young users. All the games are based on ASR techniques, whose role is to decide whether the user has completed the game successfully. The system includes speech synthesis to show how a word must be pronounced, speaker adaptation to estimate acoustic models adapted to the user, and utterance verification to evaluate the user's pronunciation.

Most of Vocaliza's functionality is provided by Human Language Technologies (HLTs) such as ASR, which is the core module of the application. Each game needs an ASR decoder to decode the user's utterances and decide which word sequence has been pronounced, so that the application can tell the user whether the game has been completed successfully.

The ASR module integrated in Vocaliza uses an utterance verification procedure to decide whether the user has pronounced the requested word or whether some other phoneme sequence is more probable.
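One common way to frame such a decision is a log-likelihood ratio test between two decoding passes: one constrained to the requested word and one over an unconstrained phoneme sequence. The sketch below shows that idea only. It is not Vocaliza's implementation, which this excerpt does not detail; the function name `verify_utterance`, the per-frame normalisation, and the threshold value are hypothetical, and the two log-likelihoods are assumed to come from an external ASR decoder.

```python
def verify_utterance(word_loglik, free_phone_loglik, num_frames, threshold=-0.5):
    """Accept or reject an utterance with a log-likelihood ratio test.

    word_loglik       -- log-likelihood of the audio given the requested word
    free_phone_loglik -- log-likelihood of the best unconstrained phoneme sequence
    num_frames        -- number of acoustic frames in the utterance
    threshold         -- hypothetical acceptance threshold (per-frame ratio)

    Returns (accepted, ratio). All inputs are assumed to be produced by an
    external decoder; this function only makes the decision.
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")

    # Normalise by duration so long and short utterances are comparable.
    ratio = (word_loglik - free_phone_loglik) / num_frames

    # A ratio near zero means the requested word explains the audio almost
    # as well as the best free phoneme sequence; a strongly negative ratio
    # means another phoneme sequence is clearly more probable.
    return ratio >= threshold, ratio
```

In practice the threshold would be tuned on recordings of correct and incorrect pronunciations, since it trades false rejections against false acceptances.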
