A High Quality Audio Coder Using Proposed Psychoacoustic Model

Source Title: Signal Processing, Perceptual Coding and Watermarking of Digital Audio: Advanced Technologies and Models

DOI: 10.4018/978-1-61520-925-5.ch009

9.1. Structure Of Proposed Perceptual Audio Coder

The structure of the proposed high-quality perceptual audio encoder is shown in Figure 1 (He et al., 2008b). Input PCM audio samples are fed into the encoder. The time-to-frequency mapping creates a sub-sampled representation of the audio samples using the discrete wavelet packet transform (DWPT). The psychoacoustic model calculates the masking thresholds, which are later used to control quantization and coding. A bit allocation strategy assigns bits to each sub-band sample according to its perceptual importance. Typically, more bits are reserved for low-frequency samples, which are perceptually more important. Quantization is performed so that the quantization noise stays below the audible threshold, which is the condition for transparent audio coding. The bit allocation information is transmitted together with the encoded audio as ancillary data (side information), which the audio decoder uses to reconstruct the PCM audio samples. Lossless coding, usually Huffman coding, is employed to further remove redundancy from the quantized values. The frame packing block packs the output of the quantizer and coding block together with the side information and yields the encoded audio stream.
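The requirement that quantization noise stay below the masking threshold can be illustrated with a minimal sketch. It assumes a plain uniform quantizer and the standard approximation that a step size of Δ produces a noise power of Δ²/12; the chapter does not state that the proposed coder uses this particular quantizer, and the sample values and the allowance below are made up for illustration.

```python
import math

def step_for_noise_power(allowed_noise_power):
    """Largest uniform step whose modelled noise power (step**2 / 12) equals the allowance."""
    return math.sqrt(12.0 * allowed_noise_power)

def quantize(values, step):
    """Map each value to the index of the nearest multiple of step."""
    return [round(v / step) for v in values]

def dequantize(indices, step):
    """Rebuild approximate values from quantizer indices."""
    return [i * step for i in indices]

# Hypothetical sub-band samples and a hypothetical allowed noise power.
coeffs = [0.82, -0.31, 0.05, 1.47, -0.66]
allowed = 1e-4

step = step_for_noise_power(allowed)
indices = quantize(coeffs, step)
restored = dequantize(indices, step)

# For a uniform quantizer the error of every sample is at most half a step.
max_error = max(abs(a - b) for a, b in zip(coeffs, restored))
print(step, indices, max_error)
```

A larger allowance gives a larger step and therefore smaller indices, which cost fewer bits to code; this is the lever the masking thresholds provide.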

Figure 1. Structure of perceptual audio encoder

Figure 2 shows the decoder of the proposed audio coding scheme. The encoded audio stream is fed into the frame unpacking block, which separates the compressed stream into the quantized samples and the side information. In the de-quantization and decoding block, Huffman decoding is performed first, followed by de-quantization using the side information extracted by the frame unpacking block. The output is the audio samples in the wavelet domain, which are then transformed back to the time domain by the inverse time/frequency mapping block to form the decoded PCM audio samples.
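The decoding order described above, Huffman decoding first and de-quantization second, can be sketched as follows. This is an illustration under stated assumptions rather than the chapter's implementation: the Huffman code is built from the data itself, whereas MPEG-1 Layer III uses predefined code tables, and the quantized values and the step size (standing in for side information) are made up.

```python
import heapq
from collections import Counter

def build_huffman_code(symbols):
    """Return a {symbol: bitstring} prefix code built from symbol frequencies."""
    freq = Counter(symbols)
    if len(freq) == 1:  # a single distinct symbol still needs one bit
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, unique tie breaker, {symbol: partial code}).
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        w1, _, c1 = heapq.heappop(heap)
        w2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (w1 + w2, tie, merged))
        tie += 1
    return heap[0][2]

def huffman_encode(symbols, code):
    return "".join(code[s] for s in symbols)

def huffman_decode(bits, code):
    """Walk the bitstring and emit a symbol whenever a full codeword is seen."""
    inverse = {c: s for s, c in code.items()}
    out, current = [], ""
    for b in bits:
        current += b
        if current in inverse:
            out.append(inverse[current])
            current = ""
    return out

def dequantize(indices, step):
    return [i * step for i in indices]

# Hypothetical quantized wavelet coefficients and quantizer step.
quantized = [0, 0, 1, -1, 0, 2, 0, 0, -1, 0, 1, 0]
step = 0.05

code = build_huffman_code(quantized)
bitstream = huffman_encode(quantized, code)

# Decoder side: Huffman decoding, then de-quantization.
decoded = huffman_decode(bitstream, code)
wavelet_coeffs = dequantize(decoded, step)
print(len(bitstream), decoded == quantized)
```

Because zeros dominate the quantized values, the variable-length code spends fewer bits than a fixed two-bit code for the same four symbols.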

Figure 2. Structure of perceptual audio decoder

The time/frequency mapping block and the psychoacoustic model block are described in Chapter 7 as part of the proposed psychoacoustic model, so only the quantizer and coding block is explained in the following section.

9.2. Quantization And Huffman Coding

The quantization and Huffman coding employed in the proposed audio codec are similar to those of the MPEG-1 Layer III standard. The inputs to the quantization and Huffman coding block are the spectral values (wavelet coefficients) of the frame, the maximum number of bits available for Huffman coding, the critical band partition table, and the allowed distortion in each critical band (also called a scalefactor band in audio coding).

The maximum number of bits available for Huffman coding for one frame (called a granule in audio coding) is given by Equation (9.1), where bit_rate is the actual bit rate, granul_size is the number of spectral values in one granule (1024 in our case), and the sampling frequency is 44.1 kHz for CD-quality audio.
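Equation (9.1) itself is not reproduced in this text. A form consistent with the quantities it defines is the bit rate multiplied by the duration of one granule, that is, bit_rate × granul_size / sampling_frequency. The sketch below assumes that form; the function name and the rounding down to a whole number of bits are also assumptions.

```python
def max_bits_per_granule(bit_rate, granul_size=1024, sampling_frequency=44100):
    """Assumed form of Equation (9.1): bit rate times granule duration.

    bit_rate is in bits per second and sampling_frequency in Hz.
    Rounding down to whole bits is an assumption of this sketch.
    """
    return (bit_rate * granul_size) // sampling_frequency

# Bit budgets for a few illustrative bit rates.
for rate in (64000, 96000, 128000):
    print(rate, max_bits_per_granule(rate))
```

At 44.1 kHz a 1024-sample granule lasts about 23.2 ms, so the budget grows linearly with the bit rate.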

The allowed distortion in each scalefactor band is given by Equation (9.2), where sb is the scalefactor band index, thrn(sb) is the masking threshold estimated by the proposed psychoacoustic model, and bw(sb) is the bandwidth of the scalefactor band, which can be read from Table 1.

Table 1. Scalefactor band partition (sampling frequency 44.1 kHz)

Scalefactor band (sb)	Bandwidth (bw)	Start index	End index
1	4	1	4
2	4	5	8
3	8	9	16
4	4	17	20
5	4	21	24
6	8	25	32
7	4	33	36
8	4	37	40
9	8	41	48
10	16	49	64
11	8	65	72
12	8	73	80
13	16	81	96
14	16	97	112
15	16	113	128
16	16	129	144
17	16	145	160
18	32	161	192
19	64	193	256
20	32	257	288
21	32	289	320
22	64	321	384
23	128	385	512
24	256	513	768
25	256	769	1024
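Table 1 can be used directly to compute the allowed distortion per band. Equation (9.2) is not reproduced in this text, so the sketch below assumes the MPEG-1 Layer III style form xmin(sb) = thrn(sb) / bw(sb), which matches the symbols defined above. The function names and the threshold values are hypothetical.

```python
# Table 1 rows: (sb, bw, start index, end index), 1-based and inclusive.
SCALEFACTOR_BANDS = [
    (1, 4, 1, 4), (2, 4, 5, 8), (3, 8, 9, 16), (4, 4, 17, 20),
    (5, 4, 21, 24), (6, 8, 25, 32), (7, 4, 33, 36), (8, 4, 37, 40),
    (9, 8, 41, 48), (10, 16, 49, 64), (11, 8, 65, 72), (12, 8, 73, 80),
    (13, 16, 81, 96), (14, 16, 97, 112), (15, 16, 113, 128),
    (16, 16, 129, 144), (17, 16, 145, 160), (18, 32, 161, 192),
    (19, 64, 193, 256), (20, 32, 257, 288), (21, 32, 289, 320),
    (22, 64, 321, 384), (23, 128, 385, 512), (24, 256, 513, 768),
    (25, 256, 769, 1024),
]

def check_partition(bands, granule_size=1024):
    """True if the bands are contiguous, match their bandwidths, and cover the granule."""
    expected_start = 1
    for _, bw, start, end in bands:
        if start != expected_start or end - start + 1 != bw:
            return False
        expected_start = end + 1
    return expected_start == granule_size + 1

def allowed_distortion(thrn, bands=SCALEFACTOR_BANDS):
    """Assumed form of Equation (9.2): xmin(sb) = thrn(sb) / bw(sb)."""
    if len(thrn) != len(bands):
        raise ValueError("one masking threshold per scalefactor band is required")
    return [t / bw for t, (_, bw, _, _) in zip(thrn, bands)]

# Hypothetical masking thresholds, one per band (flat, for illustration only).
thrn = [1.0] * len(SCALEFACTOR_BANDS)
xmin = allowed_distortion(thrn)
print(check_partition(SCALEFACTOR_BANDS), xmin[0], xmin[-1])
```

Dividing by the bandwidth turns a per-band threshold into a per-coefficient allowance, so wide high-frequency bands receive a smaller allowance per coefficient for the same band threshold.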
