Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Improving Audio Spatialization Using Customizable Pinna Based Anthropometric Model of Head-Related Transfer Functions

Navarun Gupta, Armando Barreto

Source Title: Speech, Image, and Language Processing for Human Computer Interaction: Multi-Modal Advancements

DOI: 10.4018/978-1-4666-0954-9.ch005

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

The role of binaural and immersive sound is becoming crucial in virtual reality and HCI related systems. This chapter proposes a structural model for the pinna, to be used as a block within structural models for the synthesis of Head-Related Transfer Functions, needed for digital audio spatialization. An anthropometrically plausible pinna model is presented, justified and verified by comparison with measured Head-Related Impulse Responses (HRIRs). Similarity levels better than 90% are found in this comparison. Further, the relationships between key anthropometric features of the listener and the parameters of the model are established, as sets of predictive equations. Modeled HRIRs are obtained substituting anthropometric features measured from 10 volunteers into the predictive equations to find the model parameters. These modeled HRIRs are used in listening tests by the subjects to assess the elevation of spatialized sound sources. The modeled HRIRs yielded a smaller average elevation error (29.9o) than “generic” HRIRs (31.4o), but higher than the individually measured HRIRs for the subjects (23.7o).

Chapter Preview

Top

Introduction

Immersive sound systems are an essential part of any modern virtual reality system. Computer games, navigation systems, and teleconferencing – all require a life like reproduction of sound that surrounds the user and makes human computer interaction effortless and authentic.

Audio spatialization, the ability to impart a virtual originating location to digital sounds, is gaining increasing relevance due to its continuously expanding applications in the entertainment industry, computer gaming, and also in the broader fields of Virtual Reality (VR) and Human-Computer Interaction (HCI). Current advances in this field are ultimately predicated on an inextricable interplay between physics, signal processing, audiology, cognitive neuroscience and anthropometry.

It is known that we are able to discern the location of the source of a sound, around us, by exploiting localization clues that are part of the sounds that reach our eardrums. If a well-defined sound, such as the buzzer of an alarm clock is originated at a given location relative to the position of a static listener who faces North (e.g., two meters away from the listener, at ear level, from the NE direction), the deflections of the listener’s right eardrum will not be the same as those of the buzzer’s diaphragm. The acoustic signal originated in the buzzer will be transformed as it is propagated to the listener’s right ear, due to multi-path reflections involving the torso, head and (right) outer ear of the listener. Furthermore, the acoustic signal reaching the listener’s left ear will be affected by the same effects and, in addition, it may be affected by diffraction, as the listener’s head is interposed between the buzzer and the his/her left eardrum. From a dynamic system’s point of view, the transformation of sound from source (buzzer) to the destination (e.g., right eardrum) can be modeled as a transfer function, which describes how each frequency component of the buzzer sound (and any other sound originated at the same source location, for that matter), is modified in magnitude and phase, as it travels from source to destination. Evidently, the transfer function that mediates between the buzzer and the left eardrum will be different from that associated with the right eardrum. We can expect a more dramatic attenuation of high frequencies in the sound reaching the left ear, for the situation described. Because these pairs of transfer functions have been known to depend on the shape and size of the listener’s anthropometric features, particularly head measurements, they are termed “Head-Related Transfer Functions”, HRTFs. Evidently, the HRTF pair changes as we consider different sound source locations around the listener. Consider, for example, that the buzzer is now placed two meters away from the static listener, but in a position west from his/her location. Clearly, now the right HRTF will indicate a more severe attenuation of high frequencies than the left HRTF. Perceptually, our brains are capable of discerning the location of a sound source by comparing the transformation suffered by the sound from its origin to each one of our eardrums (“binaural localization clues”), or by learned recognition of characteristic features present in even the sounds reaching a single one of our eardrums (“monoaural localization clues”).

Clearly, if the HRTF pair corresponding to a given source location around a listener can be specified and, later implemented by engineering means, we could process a single-channel sound signal (e.g., a monoaural recording of the buzzer sound collected right next to the buzzer), and process it by both HRTFs in the pair, resulting in a binaural sound that, when delivered through headphones to a listener, should produce a similar sound localization perception to the one experienced when the buzzer was physically placed there. If HRTF pairs are defined for many locations around the listener, one could change the HRTF pair used for the sound spatialization process described and, therefore re-assign the buzzer sound to a different “virtual source location”.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Improving Audio Spatialization Using Customizable Pinna Based Anthropometric Model of Head-Related Transfer Functions

Abstract

Introduction

Complete Chapter List