Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Computational Linguistic Distances and Big Data: Optimising the Speech Recognition Systems

Krishnaveer Abhishek Challa, Jawahar Annabattula

Source Title: Exploring the Convergence of Big Data and the Internet of Things

DOI: 10.4018/978-1-5225-2947-7.ch005

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Linguistic distance has always been an inter-language issue, but English being a interwoven cluster of rhyming words, homophones, tenses etc., has turned linguistic distance into an intra-language unit to measure the similarity of sounds. In many theoretical and applied areas of computational linguistics i.e., Big Data, researchers operate with a notion of linguistic distance or, conversely, linguistic similarity has become the means to optimise speech recognition systems. The present research paper focuses on the mentioned lines as an attempt to turn the existing systems from delivering good performance to perfect performance, especially in the area of Big Data.

Chapter Preview

Top

Introduction

Linguistic distance is how different one language or dialect is from another. Although there is no uniform approach to quantifying linguistic distance between languages, the concept is used in a variety of linguistic situations, such as learning additional languages, historical linguistics, language-based conflicts and the effects of language differences on trade. The proposed measures used for linguistic distance reflect varying understandings of the term itself. One approach is based on mutual intelligibility, i.e. the ability of speakers of one language to understand the other language. With this, the higher the linguistic distance, the lower is the level of mutual intelligibility (Chiswick & Miller, 2005).

The quest among linguists for a scalar measure of linguistic distance has been in vain. There is no yardstick for measuring distances between or among languages, as there is for the geographic distance between countries (e.g., miles). This arises because of the complexity of languages, which differ by vocabulary, grammar, syntax, written form, etc. The distance between two languages may also depend on whether it is in the written or spoken form. For example, the written form of Chinese does not vary among the regions of China, but the spoken languages differ sharply. Alternatively, two languages that may be close in the spoken form may differ more sharply in the written form (for example, if they use different alphabets, as in the case of German and Yiddish).

Perhaps the way to address the distance between languages is not through language trees which trace the evolution of languages, but by asking a simpler question: How difficult is it for individuals who know language A to learn languages B1 through Bi, where there are ‘i’ other languages. If it is more difficult to learn language B1, than it is to learn language B2, it can be said that language B1 is more “distant” from A than language B2. Language B3 may be as difficult to learn as is language B1 for a language A speaker, but that does not mean that language B3 is close to language B1. Indeed, it may be further from B1 than it is from A.

Linguistic distance is a concept that seeks to measure the degree of difference between two languages. Since the linguistic distances between languages are as different and variable as the languages themselves, such a concept cannot be accurately applied in a scientifically precise manner. This concept is important due to the increase in globalization, which has led to international trade between business concerns from different countries with different languages and dialects. It is also relevant as a tool to measure the ability of immigrants learning a new language that is different from their mother tongue. This is because the more removed one language is from another; the more difficult it will be for the immigrant to adapt to the new language. This also has big applications in the field of Computational Linguistics (CL).

Linguistic distance can be measured by measuring the mutual intelligibility of the language to the speakers. Mutual intelligibility determines how easy or difficult it will be for the speakers to grasp the fundamentals of the new language. This may be facilitated by the sharing of some common words or the similarity in the arrangement of grammatical and lexical forms. For instance, different territories or countries may speak the same basic language with only some minor or major differences in intonation, meaning of words, and the application of the language in general.

American and British English, for example, are mostly related with only a few easily surmountable variations. The linguistic distance between the methods of speaking the language is very small. On the other hand, Irish brogue and Cockney accent might prove to be a greater challenge for an American listener even though they are still variations of the same language. For these, the linguistic distance is more than that of British English. Even at that, learning to understand and speak these versions of the English language would not be as challenging as learning to speak Russian, since both versions are more related to American English and have a higher measure of mutual intelligibility.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Computational Linguistic Distances and Big Data: Optimising the Speech Recognition Systems

Abstract

Introduction

Complete Chapter List