Sie sind auf Seite 1von 31

Audio Compression

P N Bhakta, DDG(E)
Lossless and Lossy Coding

LOSSLESS CODING
REDUNDANCY
SIGNAL INFORMATION REMOVED
CONTENT INFORMATION
LEVEL
CONTENT

REDUNDANCY

FREQUENCY
LOSSY REDUNDANCY
CODING INFORMATION +
CONTENT IRRELEVANCY
REMOVED
Factors affecting Coder Design
• Fidelity

• Data rate

• Complexity

• Delay
The Coding Chain

ANALOG DIGITAL ANALOG

HUMAN EAR
SOURCE TRANSMIT

ENCODER 11010101 OR 11010101 DECODER

SINGER RECORD / PLAY


LISTENER
Human Hearing System

Outer ear Middle Inner ear


ear Oval
window Fluid Basilar
Pinna membrane

Ear canal Cochlea


Round window
Ear Drum Helicotrrema
Eustachian tube
Critical Bands

0 500 Hz 20000 Hz Frequency


Threshold of Hearing
The Masking Phenomenon
• Frequency Masking
• Temporal Masking
Frequency Masking
Frequency Masking
Frequency Masking
Masking Parameters
MASKING SIGNAL

MASKING SNR SMR


THRESHOLD

MNR
SPL
(in db)

NOISE

fm f
FREQUENCY
Temporal Masking
Psychoacoustic Coder

Psychoacoustic
Model

Dynamic Bit Bit Stream


Band Splitting
Allocator Framing
PCM
Encoded
Audio
Bit stream
MPEG Standards
• MPEG-1 standard - coding of synchronized
video and audio at a total data rate of about
1.5 mbps (1992)
• MPEG-2 standard– Total data rate of about 10
Mbps (1994)
• MPEG–3 standard – Total data rate of about
40 Mbps. However, this was dropped in the
year 1993
• MPEG-4 standard- It was finalized in 1998.
MPEG-1 Audio Encoder
Encoded
Time
Allocation Bit stream Bit stream
PCM to
Frequency and
Audio
Mapping Coding Framing

Ancillary
Psychoacoustic
data
Model
MPEG-1 Layer I & II Encoder (Mono)

0
Filter 1
Bank Uniform
PCM
Audio (32 Sub-band) 31
Quantiser

Bit stream
Encoded
Framing Bit stream

Psychoacoustic Coding of side


DFT
Model information
MPEG-1 Layer III Encoder (Mono)

0
0
PCM
Filter 1 Huffman
Audio Non-Uniform
Bank MDCT
(32 Sub-band) Coding
31 575 Quantiser

Bit stream
Encoded
Framing Bit stream

Psychoacoustic Coding of side


DFT
Model information
MPEG-2 Multichannel BC Encoder

Lo
L Lo’
Ro MPEG-1
R Stereo
Ro’
Decoder
Multi-
Matrix Channel
T3
C
T4 Encoder L’
Ls R’
T5
Rs MPEG-2 C’
Multi- Ls’
Channel Rs’
LFE Dematrix
Decoder
LFE’
Lo = L + aC + b Ls
Ro = R + aC + b Rs
a = b = 1/2
c = 1/ (1+2)
MPEG-2 AAC Encoder

Rate / Distortion
Perceptual Control Process
Model

Pre - Filter Intensity/ Pre- Scale Noiseless


Input Processing Bank TNS Coupling Diction M/S Factors Quantiser Coding

Bit-stream Formatter
Control
Quantised spectrum
Data Coded Audio Data of previous frame
MPEG-4 GA Encoder

Quantisation and
Coding Choices

Perceptual
Model

Pre - Filter Intensity/ Pre- Twin


Input Processing Bank TNS LTP Coupling Diction PNS M/S BSAC AAC VQ

Bit-stream Formatter
Control

Data Coded Audio Data


MPEG – 1 Audio
MAIN FEATURES
• Sampling rates – 32, 44.1 and 48 KHz
• Data rates – 32 to 224 kbps per channel
• Channels – Mono, Dual Mono, Stereo, joint
stereo
• Compression ratios – 2.7 to 24:1 (as per
sampling rates)
• Layers – Layer I, II, III
Layer I
• Data rates – 32-224 kbps (preferred above
128 kbps)
• Complexity – Low
• Applications – Digital compact cassette
etc
Layer II
• Data rates 32-192 kbps per channel (224 kbps
or more for stereo modes only)
• Complexity – Medium
• Applications – Digital Audio Broadcasting,
Digital Video Broadcasting, etc.
Layer III
• Data rates – 32-160 kbps per channel
(preferred below 128 kbps)
• Complexity – High
• Applications – ISDN, Internet etc.
MPEG – 2 Audio
• Developed to achieve the quality of
MPEG-1 Audio or better than that with
lower data rates and allow for
multichannel applications.
Different Systems of MPEG-2
Audio
• MPEG– 2 LSF

• MPEG–2 .5

• MPEG–2 MULTICHANNEL BC
MPEG– 2 LSF
• Sampling rates – 24, 22.05 and 16 KHz
• Data rates – 32-128 kbps (Layer I)
8 - 80 kbps (Layer II & III)
• Channels – Mono, Dual, Stereo, joint stereo
• Layer III is useful for low bandwidth Internet
application.
MPEG–2 .5
• Sampling rates – 12, 11.025 and 8 KHz
MPEG–2 Multichannel BC

• Sampling rates – Same as in MPEG– 1 for five main


channels.
For LFE- 1/96 th. of main channels.

• Supported Configurations – 5.1(or 3/2/1), 3/1, 3/0, 2/2,


2/1, 2/0, 1/0.
It also supports seven multilingual audios.

• Data rates (Maximum) – Layer I – 1.13 Mbps


(At sampling rate – 48 kHz) Layer II – 1.066 Mbps
Layer III – 1.002 Mbps

Das könnte Ihnen auch gefallen