0% found this document useful (0 votes)

109 views15 pages

MPEG Audio: Multimedia Communications: Coding, Systems, and Networking

This document provides an overview of MPEG audio coding standards including MPEG-1 audio and MPEG-2 audio. It describes the basics of psychoacoustics and subband coding techniques used in MPEG audio. It then summarizes the layer structures, coding tools, and frame structures of MPEG-1 layers I, II, and III. It also discusses MPEG-2 audio extensions such as multichannel coding, backward compatible coding, and non-backward compatible coding using MPEG-2 AAC.

Uploaded by

luigi-porritiello-uni-6951

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

109 views15 pages

MPEG Audio: Multimedia Communications: Coding, Systems, and Networking

Uploaded by

luigi-porritiello-uni-6951

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

18-796

Multimedia Communications: Coding, Systems, and Networking

Prof. Tsuhan Chen tsuhan@ece.cmu.edu

MPEG Audio

Outline
Basics
Psychoacoustics Subband coding

MPEG-1 audio
Layer I and II Layer III Frame structure and packetization

MPEG-2 audio
Multichannel audio Backward compatible coding Non backward compatible coding
18-796/Spring 1999/Chen

Digital Audio
Telephone Speech Wideband Speech Mediumband Audio Wideband Audio

Frequency Band (Hz) 300~3400 50~7000 10~11000 10~22000

Sampling Rate (kHz) 8 16 24 48

Bits per Sample 8 8 16 16

Raw Bitrate (kbits/s) 64 128 384 768

CD: 44.1 kHz 16 bits 2 channels = 1.411 Mbits/s

18-796/Spring 1999/Chen

Psychoacoustics
Threshold in quiet

26 critical bands 0~24 kHz

Frequency masking in the same critical band

18-796/Spring 1999/Chen

Frequency Masking
SMR (Signal-to-Mask Ratio)

18-796/Spring 1999/Chen

Temporal Masking
Post-Masking: 50~200ms

Pre-Masking: 1/10 of post-masking

18-796/Spring 1999/Chen

Subband Coding
H1 (z) H2 (z) M M

Q Q Q

M M

F1 (z) F2 (z) FM(z)

Synthesis Filterbank

HM(z)
Analysis Filterbank

Maximal downsampling Q should be based on signal-to-masking ratio (SMR) Ear critical bands are not uniform, but logarithmic s
The filter bank should match the critical bands Tree-structure filter bank (to be derived on board)
18-796/Spring 1999/Chen

Subband Coding vs. DCT

M z-1 M z-1 E(z) R(z) M z M z

M Polyphase Representation

When E(z) = DCT matrix, this becomes DCT

No overlap; blocking artifact

Modified DCT (MDCT)

50% overlap; less blocking artifact
18-796/Spring 1999/Chen

MPEG-1 Audio
ISO/IEC 11172-3 (1988~1991)
First high quality audio compression standard Sampling rates: 32, 44.1, 48 kHz CD quality two-channel audio at ~256 kbits/s
CD: 44.1 kHz 16 bits 2 = 1.411 Mbits/s

Quality demonstration (MPEG-1 Layer II)

Stereo 44.1 kHz at 64 kbits/s Stereo 44.1 kHz at 128 kbits/s Stereo 44.1 kHz at 192 kbits/s Stereo 44.1 kHz at 256 kbits/s
18-796/Spring 1999/Chen

Encoder Block Diagram

PCM audio samples 32, 44.1, 48 kHz analysis filterbank encoded bitstream frame packing

quantizer and coding

psychoacoustic model

11172-3 Encoder

ancillary data
18-796/Spring 1999/Chen

Decoder Block Diagram

encoded bits tream

fra m e unpacking

reconstruction

synthesis filte rbank

PCM audio samples 32, 44.1, 48 kHz

11172-3 Decoder
ancillary data

18-796/Spring 1999/Chen

Layers
Increasing complexity, delay, and quality
Layer I: ~384 kbits/s for perceptually lossless quality (4:1) Layer II: ~192 kbits/s for perceptually lossless quality (8:1) Layer III: ~128 kbits/s for perceptually lossless quality (12:1) (for two channels)

100% perceptual lossless

18-796/Spring 1999/Chen

Layer I and II Encoder

32 Analysis Filterbank
512-tap Masking Threshold Generator Dynamic Bit Allocator Coder

Scaler & Quantizer Mux

FFT
512-pt for Layer I 1024-pt for Layer II/III

18-796/Spring 1999/Chen

Block-Based Coding
12 Analysis Filterbank 12 12

...
Block: Layer I Superblock: Layer II/III

12 samples for Layer I, 36 samples for Layer II/III Block companding: Each block normalized by scalefactor For Layer II, up to 3 scalefactors, with 2-bit scalefactor select Each block/superblock receives one bit allocation

Layer III Encoder

6 or 18 with overlap

Analysis Filterbank

MDCT

Scaler & Quantizer

Huffman Coding

Mux
Masking Threshold Generator Coding

FFT

18-796/Spring 1999/Chen

Features in Layer III

Hybrid filterbank
MDCT with filterbank

Long/short window switching

Short for better temporal resolution (to prevent pre-echoes) Long for better frequency resolution

Nonuniform quantization Entropy coding

Run-length and Huffman coding

Bit reservoir (buffer)

18-796/Spring 1999/Chen

Frame Structure
Header Info Side Info Subband Sanples Aux Data

Header info: Sync bits, system info, CRC (cyclic redundancy code) Side info: bit allocation, scalefactor, (and scalefactor select for Layer II and III) Subband samples: 32 12 for Layer I, 32 36 for Layer II and III Packetization: 4-byte header, 184-byte payload

18-796/Spring 1999/Chen

Stereo Redundancy Coding

Four modes: mono, stereo, dual with two separate channel, joint stereo Joint stereo mode
Human stereo perception > 2kHz is based on envelope Intensity stereo coding > 2kHz
Encode (L + R) Assign independent left- and right- scalefactors

Layer III supports (L+R) and (LR) coding

18-796/Spring 1999/Chen

MPEG-2 Audio
ISO/IEC 13818-3
Allows lower sampling rates
16, 22.05, and 24 kHz: about half of MPEG-1

From wideband speech to mediumband audio Higher frequency resolution Layer I, II, and III

Multichannel coding
2~5 channels; surround sound, multilingual, for visual/hearing-impaired

Backward compatible and non-backward compatible coding (13818-7: MPEG-2 AAC)

18-796/Spring 1999/Chen

Multichannel Audio

2/0-stereo

3/0

3/1
Surround

LFE: Low-frequency enhancement (woofer) 15~120 Hz Can be anywhere

3/2

3/2 with woofer (5.1 system)

18-796/Spring 1999/Chen

Compatibility
Forward compatibility
A new decoder can decode an old bitstream Usually simple to achieve

Backward compatibility
An old decoder can decode a new bitstream, at least partially Usually limits the coding efficiency

18-796/Spring 1999/Chen

MPEG-2 Backward Compatible Audio Coding

MPEG-1 Header MPEG-1 Data MPEG-1 Ancillary Data

MPEG-1/2 Frame

MPEG-2 Header

MPEG-2 Data

L C R LS RS Matrix

L0 R0 T3 T4 T5

MPEG-1 Encoder MPEG-2 Extension Encoder Mux

L0 = ( L + C + LS ) 1 1 or = 1; = = 0 = 1+ 2 ; = = 2 R0 = ( R + C + RS )

Backward Compatible Audio Coding (cont.)

L C R LS RS

L0 R0 T3 Matrix T4 T5

MPEG-1 Encoder MPEG-2 Extension Encoder Mux Demux

L0 L R0 C T3 Inverse R MPEG-2 T4 Matrix LS Extension RS Decoder T5 MPEG-1 Decoder

Matrixing

Dematrixing

18-796/Spring 1999/Chen

Non Backward Compatible (NBC) Coding

MPEG-2 Advanced Audio Coding (AAC)
ISO/IEC 13818-7 (April 1997) 320~384 kbits/s for 5 channels, 64kbits/channel NBC at 320 kbits/s as good as BC coding at 640 kbits/s 1~48 audio channels, 0~16 LFEs, 0~16 data streams

Same framework (perceptual subband coding) as MPEG-1, with some enhancements

18-796/Spring 1999/Chen

MPEG-2 AAC
Noiseless Decoding

Enhancements
Preprocessing High resolution filterbanks
1024-line MDCT / 128
Legend Data Control Inverse Quantizer

Scale Factors

Temporal noise shaping (TNS): time-dependent quantization Coupling channel

Intensity multichannel coding

M/S 13818-7 Coded Audio Stream Bitstream Demultiplex

Prediction

Backward adaptive prediction in subbands M/S stereo coding Noiseless coding (entropy coding): Huffman coding

Intensity/ Coupling

TNS

Filter Bank Output Time Signal

Gain Control

Input time signal

Encoder
Perceptual Model Gain Control Legend Filter Bank Data Control

TNS

Intensity/ Coupling Quantized Spectrum Prediction of Previous Frame M/S Iteration Loops Scale Factors

Bitstream Multiplex

13818-7 Coded Audio Stream

Rate/Distortion Control Process

Quantizer

Noiseless Coding

18-796/Spring 1999/Chen

MPEG-2 AAC Profiles

Main Low Complexity Scaleable Sampling Rate 20 kHz 18 kHz 12 kHz 6 kHz

Main profile
Best quality, highest complexity 1024 or 128 MDCT

Low-complexity profile
No temporal noise shaping, no prediction

Scalable sampling-rate profile

Scalable output sampling rates and complexity Uses hybrid filterbanks (like MPEG-1 Layer III) No prediction, no coupling channel
18-796/Spring 1999/Chen

Simcast
To achieve backward compatibility at the cost of higher bitrate
L0 R0 L C R LS RS MPEG-2 AAC Encoder Mux Demux MPEG-2 AAC Decoder MPEG-1 Encoder MPEG-1 Decoder L0 R0 L C R LS RS

18-796/Spring 1999/Chen

References
Peter Noll, MPEG digital audio coding, IEEE Signal Processing Magazine, Sept. 1997, pp. 59-81 D. Pan, A tutorial on MPEG/audio compression, IEEE Multimedia, v. 2, no. 2, 1995, pp. 60-74 http://www.mpeg.org/MPEG/audio.html http://www.cselt.it/mpeg/faq/faq-audio.htm http://www.tnt.uni-hannover.de/project/mpeg/audio/

18-796/Spring 1999/Chen

MPEG-1 Audio Compression Guide
No ratings yet
MPEG-1 Audio Compression Guide
10 pages
ضغط الصوت
No ratings yet
ضغط الصوت
31 pages
MPEG-4 Advanced Audio Coding
No ratings yet
MPEG-4 Advanced Audio Coding
13 pages
Audio Compression Techniques Guide
No ratings yet
Audio Compression Techniques Guide
31 pages
Advanced Audio Coding (Aac)
100% (1)
Advanced Audio Coding (Aac)
33 pages
Digital Audio Coding - Dr. T. Collins: Standard MIDI Files Perceptual Audio Coding MPEG-1 Layers 1, 2 & 3 MPEG-4
No ratings yet
Digital Audio Coding - Dr. T. Collins: Standard MIDI Files Perceptual Audio Coding MPEG-1 Layers 1, 2 & 3 MPEG-4
23 pages
Audio Compression Insights
No ratings yet
Audio Compression Insights
25 pages
Digital TV Compression Guide
No ratings yet
Digital TV Compression Guide
43 pages
Brandenburg Mp3 Aac
No ratings yet
Brandenburg Mp3 Aac
12 pages
AES 17 Conference Mp3 and AAC Explained AES17
No ratings yet
AES 17 Conference Mp3 and AAC Explained AES17
12 pages
Simple Audio Compression Methods: A Udio Com Pression
No ratings yet
Simple Audio Compression Methods: A Udio Com Pression
6 pages
MPEG
No ratings yet
MPEG
12 pages
Audio Compression Standards: James Rodney P. Santiago
No ratings yet
Audio Compression Standards: James Rodney P. Santiago
51 pages
Audio Compression: Usha Sree
No ratings yet
Audio Compression: Usha Sree
23 pages
Audio Compression
No ratings yet
Audio Compression
23 pages
Audio Compression1
No ratings yet
Audio Compression1
22 pages
Low Bit Rate Coding
No ratings yet
Low Bit Rate Coding
4 pages
Huff Man 1
No ratings yet
Huff Man 1
4 pages
MPEG, The MP3 Standard, and Audio Compression
No ratings yet
MPEG, The MP3 Standard, and Audio Compression
12 pages
17 Multimedia Data and Its Encoding E
No ratings yet
17 Multimedia Data and Its Encoding E
19 pages
New Implementation Techniques of An Effi
No ratings yet
New Implementation Techniques of An Effi
11 pages
MPEG Standards For Audio
No ratings yet
MPEG Standards For Audio
46 pages
4 Chapter Audio and Video Compression
No ratings yet
4 Chapter Audio and Video Compression
122 pages
HE-AAC v2 PDF
No ratings yet
HE-AAC v2 PDF
12 pages
Audio Coding: Basics and State of The Art
No ratings yet
Audio Coding: Basics and State of The Art
6 pages
Audio Coding: Basics and State of The Art
No ratings yet
Audio Coding: Basics and State of The Art
6 pages
MPEG Audio - Compression - 2
No ratings yet
MPEG Audio - Compression - 2
5 pages
Multimedia System Design Part - 4
No ratings yet
Multimedia System Design Part - 4
37 pages
Mpeg Audio
No ratings yet
Mpeg Audio
59 pages
Wireless Communication 03 Coding
No ratings yet
Wireless Communication 03 Coding
50 pages
STA013 mp3解壓縮晶片
No ratings yet
STA013 mp3解壓縮晶片
17 pages
Audio Compression
No ratings yet
Audio Compression
53 pages
Digital Representation of Audio Information
No ratings yet
Digital Representation of Audio Information
22 pages
Audio & Video Compression Guide
100% (3)
Audio & Video Compression Guide
59 pages
M I Itai Au Ioc Ing: Dealing With Bit Rates
No ratings yet
M I Itai Au Ioc Ing: Dealing With Bit Rates
23 pages
EE412/CS455 Principles of Digital Audio and Video
No ratings yet
EE412/CS455 Principles of Digital Audio and Video
71 pages
MPEG Layer-3: An Introduction To
No ratings yet
MPEG Layer-3: An Introduction To
15 pages
Multimedia Compression Techniques
No ratings yet
Multimedia Compression Techniques
35 pages
MPEG-7 for Multimedia Professionals
100% (1)
MPEG-7 for Multimedia Professionals
58 pages
MP3 Audio & Introduction To MPEG-4 (Part 6) : Klara Nahrstedt Spring 2011
100% (1)
MP3 Audio & Introduction To MPEG-4 (Part 6) : Klara Nahrstedt Spring 2011
37 pages
Sistem Digital Nirkabel (TM3)
No ratings yet
Sistem Digital Nirkabel (TM3)
64 pages
Most Powerful Audio Available Today: Ig in Al Aa CPL Us MP 3Pr O Aa C Re Al 8 7K H ZL PF WM A 8 MP 3 Re Alg 2 ZL PF
No ratings yet
Most Powerful Audio Available Today: Ig in Al Aa CPL Us MP 3Pr O Aa C Re Al 8 7K H ZL PF WM A 8 MP 3 Re Alg 2 ZL PF
2 pages
Multimedia Exam Solutions
No ratings yet
Multimedia Exam Solutions
13 pages
2015 Chapter 11 MMS IT
No ratings yet
2015 Chapter 11 MMS IT
11 pages
Dts Overview
No ratings yet
Dts Overview
35 pages
Digital Audio
No ratings yet
Digital Audio
29 pages
JPEG, Basic Ideas, Standards H.261, MPEG-1, MPEG-2 AVC, HEVC, Container Formats
No ratings yet
JPEG, Basic Ideas, Standards H.261, MPEG-1, MPEG-2 AVC, HEVC, Container Formats
20 pages
M5 MPEGAudio
No ratings yet
M5 MPEGAudio
60 pages
MP3 Format: Theory of The Standard
No ratings yet
MP3 Format: Theory of The Standard
15 pages
Audio Coding for Engineers
No ratings yet
Audio Coding for Engineers
15 pages
Audio Processing for Engineers
No ratings yet
Audio Processing for Engineers
6 pages
Mpeg 4 1109
No ratings yet
Mpeg 4 1109
38 pages
DAS9T02 - Data Reduction
No ratings yet
DAS9T02 - Data Reduction
36 pages
Dolby AC3 Audio Codec and MPEG-2 Advanced Audio Coding: Recommended by
No ratings yet
Dolby AC3 Audio Codec and MPEG-2 Advanced Audio Coding: Recommended by
4 pages
INT 338 Network-Based Multimedia Lecture
No ratings yet
INT 338 Network-Based Multimedia Lecture
44 pages
Lecture 11 Sound Notes
No ratings yet
Lecture 11 Sound Notes
14 pages
Wireless Communications: Principles and Practice 2 Edition T.S. Rappaport
No ratings yet
Wireless Communications: Principles and Practice 2 Edition T.S. Rappaport
19 pages
Agsc QP
No ratings yet
Agsc QP
15 pages
Wang 等 - 2019 - A Memory-Efficient Sketch Method for Estimating Hi
No ratings yet
Wang 等 - 2019 - A Memory-Efficient Sketch Method for Estimating Hi
10 pages
Emerging Land Policy Issues in India
No ratings yet
Emerging Land Policy Issues in India
20 pages
Grose 2014
No ratings yet
Grose 2014
9 pages
Art & Design Student Assessment
No ratings yet
Art & Design Student Assessment
2 pages
Diversity Models and Dimensions Guide
No ratings yet
Diversity Models and Dimensions Guide
4 pages
Numerical Investigations of Gas-Liquid Two-Phase Flow in A Pump Inducer
No ratings yet
Numerical Investigations of Gas-Liquid Two-Phase Flow in A Pump Inducer
46 pages
WO Albeng Alprod Depo 30
No ratings yet
WO Albeng Alprod Depo 30
3 pages
Packing Machine Operation Instruction
No ratings yet
Packing Machine Operation Instruction
18 pages
Data Types, Variables, and Constants
No ratings yet
Data Types, Variables, and Constants
20 pages
Construction Blueprint Details
100% (1)
Construction Blueprint Details
2 pages
ND II 3rdterm Sum
No ratings yet
ND II 3rdterm Sum
7 pages
Biology Levels for Students
No ratings yet
Biology Levels for Students
3 pages
Cat Red
No ratings yet
Cat Red
5 pages
Standard of Competence
No ratings yet
Standard of Competence
11 pages
GU Student Manual 2 Schemas
No ratings yet
GU Student Manual 2 Schemas
11 pages
Heep 111
0% (1)
Heep 111
7 pages
Data Integration
No ratings yet
Data Integration
4 pages
T-Root Blades in A Steam Turbine Rotor A
No ratings yet
T-Root Blades in A Steam Turbine Rotor A
8 pages
5.1 Chemical Formulae, Equations, Calculations (1C) QP Part 2
No ratings yet
5.1 Chemical Formulae, Equations, Calculations (1C) QP Part 2
12 pages
Error Identification - PT3
No ratings yet
Error Identification - PT3
1 page
Che 560 HW5
No ratings yet
Che 560 HW5
1 page
Endemism: Definition, Types, and Examples
No ratings yet
Endemism: Definition, Types, and Examples
39 pages
Exercise Solutions For Simulation With Arena PDF
0% (1)
Exercise Solutions For Simulation With Arena PDF
2 pages
15.IO Streams Introduction
No ratings yet
15.IO Streams Introduction
27 pages
Ohms Law 14to16 Lesson-Plan
No ratings yet
Ohms Law 14to16 Lesson-Plan
3 pages
The Shiphandlers Guide
No ratings yet
The Shiphandlers Guide
143 pages
Leg Foot Massager 1026 Manual
No ratings yet
Leg Foot Massager 1026 Manual
5 pages

MPEG Audio: Multimedia Communications: Coding, Systems, and Networking

Uploaded by

MPEG Audio: Multimedia Communications: Coding, Systems, and Networking

Uploaded by

18-796

Multimedia Communications: Coding, Systems, and Networking

Prof. Tsuhan Chen tsuhan@ece.cmu.edu

Frequency Band (Hz) 300~3400 50~7000 10~11000 10~22000

Sampling Rate (kHz) 8 16 24 48

Bits per Sample 8 8 16 16

Raw Bitrate (kbits/s) 64 128 384 768

CD: 44.1 kHz 16 bits 2 channels = 1.411 Mbits/s

26 critical bands 0~24 kHz

Frequency masking in the same critical band

Pre-Masking: 1/10 of post-masking

F1 (z) F2 (z) FM(z)

Subband Coding vs. DCT

When E(z) = DCT matrix, this becomes DCT

Modified DCT (MDCT)

Quality demonstration (MPEG-1 Layer II)

Encoder Block Diagram

quantizer and coding

Decoder Block Diagram

encoded bits tream

synthesis filte rbank

PCM audio samples 32, 44.1, 48 kHz

100% perceptual lossless

Layer I and II Encoder

Scaler & Quantizer Mux

Layer III Encoder

Scaler & Quantizer

Features in Layer III

Long/short window switching

Nonuniform quantization Entropy coding

Bit reservoir (buffer)

Stereo Redundancy Coding

Layer III supports (L+R) and (LR) coding

Backward compatible and non-backward compatible coding (13818-7: MPEG-2 AAC)

LFE: Low-frequency enhancement (woofer) 15~120 Hz Can be anywhere

3/2 with woofer (5.1 system)

MPEG-2 Backward Compatible Audio Coding

MPEG-1 Encoder MPEG-2 Extension Encoder Mux

Backward Compatible Audio Coding (cont.)

MPEG-1 Encoder MPEG-2 Extension Encoder Mux Demux

L0 L R0 C T3 Inverse R MPEG-2 T4 Matrix LS Extension RS Decoder T5 MPEG-1 Decoder

Non Backward Compatible (NBC) Coding

Same framework (perceptual subband coding) as MPEG-1, with some enhancements

Temporal noise shaping (TNS): time-dependent quantization Coupling channel

M/S 13818-7 Coded Audio Stream Bitstream Demultiplex

Filter Bank Output Time Signal

Input time signal

13818-7 Coded Audio Stream

Rate/Distortion Control Process

MPEG-2 AAC Profiles

Scalable sampling-rate profile

You might also like