The use of a voice interface, along with textual, graphical, video, tactile, and audio interfaces, can improve the experience of the user of a mobile device. Many applications can benefit from voice input and output on a mobile device, including applications that provide travel directions, weather information, restaurant and hotel reservations, appointments and reminders, voice mail, and e-mail. We have developed a prototype system for a mobile device that supports client-side, voice-enabled applications. In fact, the prototype supports multimodal interactions but, here, we focus on voice interaction. The prototype includes six voice-enabled applications and a program manager that manages the applications. In this chapter we describe the prototype, including design issues that we faced, and evaluation methods that we employed in developing a voice-enabled user interface for a mobile device.
Key Terms in this Chapter
Multimodal Interface: The integration of textual, graphical, video, tactile, speech, and other audio interfaces through the use of mouse, stylus, fingers, keyboard, display, camera, microphone, and/or GPS.
Global Positioning System (GPS): A system that is used to obtain geographical coordinates, which includes a GPS satellite and a GPS receiver.
Speech Synthesis: The artificial production of human speech. Speech synthesis technology is also called text-to-speech technology in reference to its ability to convert text into speech.
Hidden Markov Model (HMM): A technique, based on a finite state machine that associates probabilities with phonemes, and pairs of phonemes, that is used in speech recognition systems, to determine the likelihood of an expression spoken by a user of that system.
Web Service: A software application identified by a Uniform Resource Indicator (URI) that is defined, described, and discovered using the eXtensible Markup Language (XML) and that supports direct interactions with other software applications using XML-based messages via an Internet protocol.
Location Aware: An application that is based on a particular physical location, as given by geographical coordinates, physical address, zip code, and so forth, that determines the output of the application.
Mobile Device: For the purposes of this chapter, a handheld device, such as a cell phone or personal digital assistant (PDA), that has an embedded computer and that the user can carry around.
Speech Recognition: The process of interpreting human speech for transcription or as a method of interacting with a computer or a mobile device, using a source of speech input, such as a microphone.
Complete Chapter List
Anxo Cereijo Roibás, Stephen Johnson
Hanna Stelmaszewska, Bob Fields, Ann Blandford
Hyowon Lee, Cathal Gurrin, Gareth J.F. Jones, Alan F. Smeaton
Amy K. Karlson, Benjamin B. Bederson, Jose L. Contreras-Vidal
Martina Ziefle, Susanne Bay
Susanne Bay, Martina Ziefle
Chris Barber, James Knight
Anind K. Dey, Jonna Häkkilä
Bent Schmidt-Nielsen, Bret Harsham, Bhiksha Raj, Clifton Forlines
Nikolaos Tselios, Ioanna Papadimitriou, Dimitrios Raptis, Nikoletta Yiannoutsou, Vassilis Komis, Nikolaos Avouris
Siu Cheung Kong
Hyungsung Park, Young Kyun Baek, David Gibson
Nikola Mitrovic, Eduardo Mena, Jose Alberto Royo
Michael J. O’Grady, Gregory M.P. O’Hare
Yang Li, Scott Klemmer, James A. Landay
Emmanuel Dubois, Wafaa Abou Moussa, Cédric Bach, Nelly de Bonnefoy
Ioannis D. Zaharakis, Achilles D. Kameas
Rafael Ballagas, Michael Rohs, Jennifer G. Sheridan, Jan Borchers
Mark David Dunlop, Michelle Montgomery Masters
Min Lin, Andrew Sears, Steven Herbst, Yanfang Liu
Louise E. Moser, P.M. Melliar-Smith
Dong Yu, Li Deng
Parisa Eslambolchilar, Andrew Crossan, Roderick Murray-Smith, Sara Dalzel-Job, Frank Pollick
Panu Korpipää, Jukka Linjama, Juha Kela, Tapani Rantakokko
Enrico Costanza, Samuel A. Inverso, Rebecca Allen, Pattie Maes
Tolga Capin, Antonio Haro
Andrea Sanna, Fabrizio Lamberti
Rock Leung, Joanna Lumsden
Mark Matthews, Gavin Doherty, David Coyle, John Sharry
Francesco Bellotti, Riccardo Berta, Alessandro De Gloria, Massimiliano Margarone
Shigueo Nomura, Takayuki Shiose, Hiroshi Kawakami, Osamu Katai, Keiji Yamanaka
Florence Gaunet, Xavier Briffault
Julio Abascal, Borja Bonail, Daniel Cagigas, Nestor Garay, Luis Gardeazabal
Regina Bernhaupt, Kristijan Mihalic, Marianna Obrist
Jan Willem Streefkerk, Myra P. van Esch-Bussemakers, Mark A. Neerincx, Rosemarijn Looije
Enrico Bertini, Tiziana Catarci, Alan Dix, Silvia Gabrielli, Stephen Kimani, Giuseppe Santucci
Thomas Alexander, Christopher Schlick, Alexander Sievert, Dieter Leyk
Maria de Fátima Queiroz Vieira Turnell, José Eustáquio Rangel de Queiroz, Danilo de Sousa Ferreira
Jaakko T. Lehikoinen
Dong-Han Ham, Jeongyun Heo, Peter Fossick, William Wong, Sanghyun Park, Chiwon Song, Mike Bradley
Kaikkonen, Kaikkonen, Anne, Anne, Aki Kekäläinen, Mikael Cankar, Titti Kallio
Murray Crease, Robert Longworth
Andrew Crossan, Roderick Murray-Smith, Stephen Brewster, Bojan Musizza
Murray Crease, Joanna Lumsden
Rune T. Høegh, Jesper Kjeldskov, Mikael B. Skov, Jan Stage
Adrian Stoica, Georgios Fiotakis, Dimitrios Raptis, Ioanna Papadimitriou, Vassilis Komis, Nikolaos Avouris
Kater Oakley, Gitte Lindgaard, Peter Kroeger, John Miller, Earl Bryenton, Paul Hébert
Shwetak N. Patel, Khai N. Truong, Gillian R. Hayes, Giovanni Iachello, Julie A. Kientz, Gregory D. Abowd
Saturnino Luz, Masood Masoodian
Jason T. Black, Lois Wright Hawkes
Tiong T. Goh, Kinshuk, Nian-Shing Chen