Perceiving the World With Sound: An Overview to Robot Audition

Usama Saqib, Robin Kerstens
Copyright: © 2023 |Pages: 30
DOI: 10.4018/978-1-6684-5381-0.ch003

Abstract

Robot perception is the ability of a robotic platform to perceive its environment by means of sensor inputs, e.g., laser, IMU, motor encoders, and so on. Much like humans, robots are not limited to perceiving their environment through vision-based sensors, e.g., cameras. Robot perception, within the scope of this chapter, encompasses acoustic signal processing techniques that locate a sound source, e.g., a human speaker, within an environment for human-robot interaction (HRI), a topic that has gained great interest within the scientific community. This chapter serves as an introduction to acoustic signal processing within robotics, starting with passive acoustic localization and building up to contemporary active sensing methods, such as the use of neural networks and spatial map generation. The origins of active acoustic localization, which finds its roots in biomimetics, are also discussed.

Introduction

The detection and localization of acoustic reflectors such as walls, objects, or people within an environment is a popular topic within robotics. Traditionally, camera- and laser-based technologies are used to detect these landmarks, generate a spatial map of a 3D space, and aid robotic platforms in navigating their environment. However, these light-based sensing modalities often face challenges such as a lack of light, overexposure (glare), an inability to detect transparent surfaces such as windows, false reflections, and sensitivity to occlusion. These issues can be addressed by incorporating sound-based sensing modalities. Research into animal auditory systems has inspired researchers to develop technologies that locate sound sources within an environment.

This chapter will serve as an introduction to robot audition, starting with its subdomains and building up to more contemporary active sensing methods, such as applying Artificial Intelligence (AI) to recorded data to detect and track acoustic sources and to generate spatial maps. The origins of active acoustic sensing, which finds its roots in biomimetics, are also discussed. The chapter is written as a reference for people working on robot perception using sound and aims to contribute to future work by bringing new challenges to the field. It will begin with an introduction to biomimicry in robotics, which seeks to mimic an animal's auditory system to localize sound sources in the nearby environment. Biomimicry facilitates intelligent robot designs that achieve high performance and robustness when navigating between, and localizing, acoustic sources in a dynamic environment. Designers of such robots make use of new materials, sensors, and actuators that allow robots to mimic biological processes such as hearing.

Furthermore, this chapter will review techniques in the scientific literature associated with passive acoustic localization and active acoustic localization, the two important sub-domains of robot audition for sound source localization (SSL). Passive acoustic localization involves detecting sound generated by objects present in an environment, while active acoustic localization techniques probe an environment with a known sound to detect the position of objects within it. Both sub-domains have their advantages and disadvantages. For example, active acoustic localization is useful in a quiet environment, which is normally the case when a robot explores an underground environment such as caves, tunnels, and sewers. Bats, rats, and even some aquatic mammals are known to use these techniques to navigate and hunt in complete darkness. These animals probe the environment with a unique sound, or call, and use acoustic echoes to distinguish flora and fauna, different types of animals/prey, and everything needed for their survival. Therefore, a discussion of the different types of probe signals that can be used in robotics to acquire spatial information from the environment is also an important highlight of this chapter. More specifically, additive white Gaussian noise (AWGN), coded emissions, and chirp signals will be analyzed in detail. The application of spatial mapping using echolocation, which incorporates spatial filtering techniques such as beamforming, is another important highlight of this chapter.
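To make the active-sensing idea concrete, the following is a minimal numpy sketch of probing with a linear chirp and recovering the range of a single reflector by matched filtering (pulse compression). All parameters (48 kHz sample rate, 2-10 kHz sweep, a reflector at 1.5 m, the noise and attenuation levels) are illustrative assumptions, not values from the chapter.

```python
import numpy as np

rng = np.random.default_rng(0)       # fixed seed for a reproducible sketch
fs = 48_000                          # sample rate (Hz), assumed
dur = 0.01                           # 10 ms probe signal
t = np.arange(int(fs * dur)) / fs
f0, f1 = 2_000.0, 10_000.0           # chirp sweep band (Hz), assumed

# Linear chirp: instantaneous frequency sweeps from f0 to f1 over dur
probe = np.sin(2 * np.pi * (f0 * t + (f1 - f0) / (2 * dur) * t**2))

# Simulate one acoustic reflector: an attenuated echo after the round-trip delay
c = 343.0                            # speed of sound in air (m/s)
distance = 1.5                       # true reflector range (m), assumed
delay = int(round(2 * distance / c * fs))
recording = np.zeros(delay + len(probe))
recording[delay:] += 0.3 * probe             # echo, attenuated
recording += 0.01 * rng.standard_normal(len(recording))  # measurement noise

# Matched filter: cross-correlate the recording with the known probe;
# the correlation peak marks the echo's time of flight
corr = np.correlate(recording, probe, mode="valid")
est_delay = int(np.argmax(corr))
est_distance = est_delay * c / (2 * fs)
print(f"estimated range: {est_distance:.2f} m")
```

Chirps are favored as probe signals precisely because this correlation step concentrates the echo's energy into a sharp peak, making the time of flight easy to detect even when the echo is weak or noisy.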

Finally, the chapter will review data-driven approaches that use contemporary methods, such as neural networks, to perceive the environment in an artificially intelligent way. This is a relatively new approach that combines physics-based models of sound with machine learning, teaching robotic platforms to classify and predict their surroundings.
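As a toy illustration of combining a physics-based sound model with learning, the sketch below generates interaural time difference (ITD) features from the standard far-field geometry and fits a nearest-centroid classifier over a few candidate directions of arrival. The microphone spacing, angle grid, and noise level are all hypothetical, and the classifier is deliberately the simplest possible stand-in for the neural networks discussed in the chapter.

```python
import numpy as np

rng = np.random.default_rng(0)
c, d = 343.0, 0.2          # speed of sound (m/s); mic spacing (m), assumed

def itd(angle_deg):
    """Physics-based model: ITD of a far-field source at a given azimuth."""
    return d * np.sin(np.deg2rad(angle_deg)) / c

# Synthesize noisy training ITDs from the physical model
angles = np.array([-60, -30, 0, 30, 60])     # candidate DOA classes (deg)
X_train = np.concatenate([itd(a) + 1e-5 * rng.standard_normal(50)
                          for a in angles])
y_train = np.repeat(angles, 50)

# "Learning" step: one centroid per DOA class
centroids = np.array([X_train[y_train == a].mean() for a in angles])

def predict(measured_itd):
    """Classify a measured ITD by its nearest learned centroid."""
    return angles[np.argmin(np.abs(centroids - measured_itd))]

print(predict(itd(30)))    # classify a clean measurement
```

The design point is the division of labor: the physical model supplies cheap, unlimited training data, while the learned component handles the inverse mapping from measurements back to source direction, which is the same pattern the data-driven methods in this chapter follow at much larger scale.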

Key Terms in this Chapter

ML: Machine learning

DOA: Direction of arrival

MISO: Multiple input and single output

ROV: Remotely operated vehicle

MVDR: Minimum variance distortionless response

SONAR: Sound navigation and ranging

RIR: Room impulse response

IPD: Interaural phase difference

ITD: Interaural time difference

CASA: Computational auditory scene analysis

MIMO: Multiple input and multiple output

SIMO: Single input and multiple output

AIR: Acoustic impulse response

CFAR: Constant false alarm rate

DL: Deep learning

DSB: Delay and sum beamformer

RNN: Recurrent neural network