4/5/2004
Related specifications
MUSICAM
ASPEC
NICAM 728
Dolby AC-3
Masking
[Figure: encoder block diagram with the stages Sub-band Filter, Bit Allocation, Bit-stream Generation and Output, plus a Compute Masking block]
[Figure: level (dB) for each of 16 bands, numbered 1 to 16]
1:6...1:8
1:10...1:12
[Figure: encoder block diagram. Blocks: PCM input, Analysis filter bank (32 subbands, 0 to 31), Scaler, Quantizer, Psychoacoustic model, Bit-rate allocation, Scale factor encoder, Quantized sample encoder, Bit-rate allocation encoder, Multiplexer, Output. Signals labeled SMRn, Rn and SFn]
The input audio stream passes through a filter bank that divides
the input into multiple subbands of frequency.
The input audio stream simultaneously passes through a
psychoacoustic model that determines the ratio of the signal
energy to the masking threshold for each subband.
The bit or noise allocation block uses the Signal-to-Mask
Ratios (SMRs) to decide how to apportion the total number of code
bits available for the quantization of the subband signals so as to
minimize the audibility of the quantization noise.
Finally, the multiplexer takes the representation of the quantized
subband samples and formats this data and side information into
a coded bitstream.
Ancillary data not necessarily related to the audio stream can
be inserted within the coded bitstream.
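The bit allocation step described above can be sketched as a greedy loop. This is only an illustration, not the normative procedure: the function name `allocate_bits`, the default of 12 samples per subband, the 15-bit cap, and the rule of thumb that each extra quantizer bit buys about 6.02 dB of signal-to-noise ratio are all assumptions made for the example.

```python
def allocate_bits(smr_db, bit_budget, samples_per_band=12, max_bits=15):
    """Greedy sketch: repeatedly give one more bit per sample to the subband
    whose quantization noise is most audible (largest noise-to-mask ratio)."""
    DB_PER_BIT = 6.02  # assumed SNR gain per extra quantizer bit
    bits = [0] * len(smr_db)
    remaining = bit_budget
    while remaining >= samples_per_band:
        # Noise-to-mask ratio: NMR = SMR - SNR, with SNR ~ 6.02 dB per bit.
        nmr = [smr - DB_PER_BIT * b for smr, b in zip(smr_db, bits)]
        candidates = [i for i, b in enumerate(bits) if b < max_bits]
        if not candidates:
            break
        worst = max(candidates, key=lambda i: nmr[i])
        if nmr[worst] <= 0:
            break  # all noise is already below the masking threshold
        bits[worst] += 1
        remaining -= samples_per_band  # one more bit for each sample in the band
    return bits


# Three hypothetical subbands with SMRs of 20, -5 and 7 dB.
print(allocate_bits([20.0, -5.0, 7.0], bit_budget=120))
```

A subband whose SMR is already negative receives no bits, because its signal lies below the masking threshold and does not need to be coded.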
[Figure: the subband filters (up to subband filter 31) each output groups of 12 samples; a Layer I frame takes one group of 12 samples from every subband]
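The frame sizes implied by the Layer I frame figure can be checked with a little arithmetic. The helper names below are hypothetical. The factor of three groups per subband for Layers II and III is taken from the 1,152-sample frame size, and the 48 kHz sampling rate is just one example rate.

```python
SUBBANDS = 32           # filter bank outputs, numbered 0 to 31
SAMPLES_PER_GROUP = 12  # samples taken from each subband per group


def frame_samples(groups_per_subband):
    """PCM samples represented by one frame."""
    return SUBBANDS * SAMPLES_PER_GROUP * groups_per_subband


def frame_duration_ms(n_samples, sample_rate_hz):
    """Playing time of one frame in milliseconds."""
    return 1000.0 * n_samples / sample_rate_hz


layer1 = frame_samples(1)   # one group of 12 samples per subband
layer23 = frame_samples(3)  # three groups of 12 samples per subband
print(layer1, layer23)
print(frame_duration_ms(layer1, 48_000), frame_duration_ms(layer23, 48_000))
```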
[Figure: psychoacoustic model block diagram. Blocks: PCM input, Fast Fourier Transform (FFT), Compute quiet threshold, Tonal/nontonal separator, Compute signal power (Sn), Compute tonal masking threshold function, Compute nontonal masking threshold function, Masking threshold function, Calculate Minimum (Mn). Output: SMRn]
The separator identifies and separates the tonal and noise-like (non-tonal) components of the audio signal because the
masking abilities of the two types of signal differ.
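One way to picture the separator is as a peak picker on the FFT power spectrum. The sketch below is a simplified illustration under stated assumptions, not the model's actual rule: the function name `split_tonal`, the 7 dB margin and the two-bin reach are illustrative choices, and a real model would also group the non-tonal bins rather than just listing them.

```python
def split_tonal(power_db, margin_db=7.0, reach=2):
    """Return (tonal, nontonal) lists of bin indices.

    A bin counts as tonal when it is a local maximum and stands at least
    `margin_db` above the bins `reach` positions away on both sides.
    Both thresholds are illustrative assumptions."""
    tonal = []
    n = len(power_db)
    for k in range(reach, n - reach):
        p = power_db[k]
        is_peak = p > power_db[k - 1] and p >= power_db[k + 1]
        stands_out = (p - power_db[k - reach] >= margin_db
                      and p - power_db[k + reach] >= margin_db)
        if is_peak and stands_out:
            tonal.append(k)
    tonal_set = set(tonal)
    nontonal = [k for k in range(n) if k not in tonal_set]
    return tonal, nontonal


# Hypothetical spectrum in dB: one strong sinusoid at bin 3 over a noise floor.
spectrum = [10, 11, 12, 40, 13, 11, 10, 12, 11, 10]
print(split_tonal(spectrum))
```

The small bump at bin 7 is a local maximum but does not stand out from its surroundings, so it stays with the noise-like components.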
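[Figure: encoder block diagram with MDCT and Huffman coding. Blocks: PCM input, Analysis filter bank, MDCT, Psychoacoustic model (SMRn), Quantizer, Quantized sample Huffman encoder, Scale factor encoder, Side information encoder, Buffer (with buffer fullness feedback), Multiplexer, Output. Signal labeled Side information]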
Frame formats (field lengths in bits)
Layer I:   Header (32) | CRC (0,16) | Bit Allocation (128,256) | Scale factor (0-384) | Samples | Ancillary data
Layer II:  Header (32) | CRC (0,16) | Bit Allocation (128,256) | SCFSI (0-60) | Scale factor (0-384) | Samples | Ancillary data
Layer III: Header (32) | CRC (0,16) | Side information (136, 256) | Main Data | Ancillary data
MP3 frame
The main data section contains the coded scale factor values
and the Huffman-coded frequency lines.
Its length depends on the bitrate and the length of the ancillary
data.
The length of the scale factor part depends on whether scale
factors are reused, and also on the window length (short or long).
The scale factors are used in the requantization of the
samples.
The demand for Huffman code bits varies with time during the
coding process.
The variable bitrate format can be used to handle this, but a fixed
bitrate is often required for applications such as broadcasting.
Therefore there is also a bit reservoir technique that allows
unused main data storage in one frame to be used by up to
two consecutive frames.
The design of the Layer III bitstream better fits the encoder's
time-varying demand for code bits.
As with Layer II, Layer III processes the audio data in frames of
1,152 samples.
Unlike Layer II, the coded data representing these samples do not
necessarily fit into a fixed-length frame in the coded bitstream.
The encoder can donate bits to a reservoir when it needs fewer
than the average number of bits to code a frame.
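The donate-and-borrow behaviour of the bit reservoir can be illustrated with a toy simulation. This is a sketch under assumptions rather than the actual Layer III mechanism: the function name `simulate_reservoir`, the reservoir cap and the frame demands are hypothetical, and the limit on how many later frames may draw on donated bits is not modelled.

```python
def simulate_reservoir(demands, mean_bits, max_reservoir):
    """Return per-frame (bits_granted, reservoir_after) pairs.

    demands       -- bits each frame would like to spend
    mean_bits     -- average bits per frame at the fixed bitrate
    max_reservoir -- assumed upper bound on saved bits
    """
    reservoir = 0
    history = []
    for demand in demands:
        available = mean_bits + reservoir
        granted = min(demand, available)
        # Unused bits are donated to the reservoir, which is capped.
        reservoir = min(available - granted, max_reservoir)
        history.append((granted, reservoir))
    return history


# An easy frame donates 200 bits that the next, harder frame can spend.
print(simulate_reservoir([800, 1200, 1300], mean_bits=1000, max_reservoir=500))
```

The third frame asks for more than is available and has to make do with the average allotment, since the reservoir was emptied by the frame before it.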
Purpose
MP3 Decoder
MP3 Performance
Sound quality    Bandwidth  Mode    Bitrate        Reduction ratio
Telephone sound  2.5 kHz    mono    8 kbps *       96:1
Short wave       4.5 kHz    mono    16 kbps        48:1
AM radio         7.5 kHz    mono    32 kbps        24:1
FM radio         11 kHz     stereo  56...64 kbps   26...24:1
Near-CD          15 kHz     stereo  96 kbps        16:1
CD               >15 kHz    stereo  112..128 kbps  14..12:1
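The reduction ratios in the table can be reproduced as the uncompressed PCM bitrate divided by the coded bitrate. The sketch below assumes the reference signal is 16-bit PCM sampled at 48 kHz. That reference is an assumption, not something the table states, but it matches the mono rows and the 64, 96 and 128 kbps stereo figures. The function name `reduction_ratio` is hypothetical.

```python
def reduction_ratio(bitrate_kbps, channels, sample_rate_hz=48_000, bits_per_sample=16):
    """Uncompressed PCM bitrate divided by the coded bitrate."""
    pcm_kbps = sample_rate_hz * bits_per_sample * channels / 1000.0
    return pcm_kbps / bitrate_kbps


print(reduction_ratio(8, channels=1))    # telephone sound, mono
print(reduction_ratio(128, channels=2))  # CD quality, stereo
```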
MPEG-2 Audio
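[Figure: MPEG-2 multichannel (MC) frame layout. Base fields: CRC, Bit Allocation, SCFSI, Scale factor, Samples. Multichannel fields: MC Header, MC CRC, MC Bit Allocation, MC SCFSI, MC Predictor, MC Samples, Multi-lingual Commentary. Also labeled: Ancillary data 1, Ancillary data 2]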
[Figure: MPEG-2 audio extensions. Low Frequency: Layer II, Layer III. Multi Channel: Layer II, Layer III, 5 channels. 32, 44.1, 48 kHz: Layer I, Layer II, Layer III]
Key Points
MPEG-2 Audio BC
MPEG-2 AAC (NBC)