Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Reading Numbers System for Portuguese Language

João Paulo Teixeira, Carolina Mota, Cátia Sampaio

Source Title: International Journal of Reliable and Quality E-Healthcare (IJRQEH) 4(1)

DOI: 10.4018/IJRQEH.2015010102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

The paper presents an algorithm for read common numbers until one million in Portuguese language. The record and cutting process of the digit speech sounds deserved a special attention to improve the speech sound output. A special attention is required for the correct inclusion of the particle ‘e' (and) to provide better naturalness of the read numbers. The system has the ability of simulate the human biologic speech sound production in the task of reading numbers. The system is based in the concatenation of carefully recorded edited and selected speech segments corresponding to the digits. The naturalness of the system was improved with the use of speech files of read digits in different positions (beginning, middle and end) and using digits concatenated with the particle ‘e' before, after and before and after the digit.

Article Preview

Top

Introduction

The process of automatically read numbers is useful for several types of applications. Watches for visually impaired persons, automatic answers systems, speech interface systems with pre-recorded sentences using numbers and even general purpose TTS (Text-To-Speech) systems (Saraswathi, 2010) and (Sproat, 1997) need at least the algorithm for automatic reading numbers. The organization of the sequence of chunks and the correct insertions of linkers such as ‘e’ (and) require algorithms dependent on the language. Additionally, the position of each digit in the whole number carries its own prosody contour concerning their F0 curve and duration length.

Numbers such as amounts, telephone numbers, date, hours, codes and personal identification card requires different structures of reading. For amounts the sequence of numbers should be converted in hundreds of millions, hundreds of thousands and hundreds of units. However, a telephone number is read in a different way, for instance as the groups of units (example 931720855: nine three one - seven two zero - eight five five). The number of digits in each group depends on the length of the telephone number but always groups of three or two digits. The date numbers have a proper form to be written and read. Among several formats of the date one very common appear such as ‘dd-mm-yyyy’ that requires a system to interpret a date in the sequence of numbers and the corresponding structure of reading. Also the reading can be done in different ways. For instance the month can be read as a number or as the name of the month. For the hours the most common format is ‘hh:mm’ but several variation can be found. The way the hour can be read is very variable. For instance ‘18:50 h’ can be read as ‘eighteen fifty’; ‘six hours and fifty minute PM’ or ‘ten to six PM’, among others. For codes and personal identification different forms can be adopted depending on the number of digits. In these cases a similar strategy as the one mentioned for telephone numbers can be adopted.

An automatic reading system for general application should identify the correct class in order to read them in the correct structure. Then the system has to compose the sequence of words to complete the final sentence. Finally the system has to convert the sentence to sound by a synthesis process or concatenation of the sequence of phonemes or words. Depending on the application different strategies can be used to read the number. A general purpose Text-to-Speech could synthesize the sequence of phonemes that fulfill the complete number, but a reading number system can simple concatenate the sequence of recorded sounds of digits and particles. This last process, although less flexible than a TTS synthesizer, can reach better quality because no segmental processing is required. Anyway, the system can be improved considering several requirements during the record and cut of the speech signal, and also using some post prosodic processing. The systems based in the concatenation of recorded sounds of digits the position of the digit within the whole number must be considered. Two different approaches can be used to convey the prosody adequate to the digit position. The first approach consists in the utilization of the recorded digit in the same position. This approach imposes that several records of the same digit must be saved in the database of speech sounds. The second approach consists in making prosody modifications in the original speech sound files to impose the adequate F0 and duration for the corresponding position. TD-PSOLA algorithms (Charpentier & Moulines, 1990) allow the F0 and durations modifications within some limitations. Namely, it is not recommended to change the F0 and/or durations for 2 times higher or lower the original F0 and/or duration, due the severe lost in speech quality (Teixeira, 2012)(Barros, 2002).

Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech (Klatt, 1987) (Sharman, 1998) (Teixeira, 2013).

Synthesized speech can be created by concatenating segments of recorded/synthesized speech stored in a database of sound or a database of parameters depending on the acoustic processing module (Sproat, 1997).

Complete Article List

Search this Journal:

Reset

Volume 13: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 12: 2 Issues (2023)

Volume 11: 4 Issues (2022)

Volume 10: 4 Issues (2021)

Volume 9: 4 Issues (2020)

Volume 8: 4 Issues (2019)

Volume 7: 4 Issues (2018)

Volume 6: 4 Issues (2017)

Volume 5: 4 Issues (2016)

Volume 4: 4 Issues (2015)

Volume 3: 4 Issues (2014)

Volume 2: 4 Issues (2013)

Volume 1: 4 Issues (2012)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Reading Numbers System for Portuguese Language

Abstract

Introduction

Complete Article List