Beruflich Dokumente
Kultur Dokumente
Chapter 11
MPEG Video Coding I — MPEG-1 and 2
11.1 Overview
11.2 MPEG-1
11.3 MPEG-2
11.4 Further Exploration
1 Li & Drew
c Prentice Hall 2003
11.1 Overview
2 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
11.2 MPEG-1
3 Li & Drew
c Prentice Hall 2003
4 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
The MB containing part of a ball in the Target frame cannot find a good
matching MB in the previous frame because half of the ball was occluded
by another object. A match however can readily be obtained from the next
frame.
5 Li & Drew
c Prentice Hall 2003
6 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
Difference macroblock
Y
Cb
− %
Cr
DCT
Quantization
Motion vectors Entropy coding
0011101…
Fig 11.2: B-frame Coding Based on Bidirectional Motion Compensation.
7 Li & Drew
c Prentice Hall 2003
Time
Display order I B B P B B P B B I
Coding and I P B B P B B I B B
transmission order
8 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
– H.261 only supports CIF (352 × 288) and QCIF (176 × 144) source
formats, MPEG-1 supports SIF (352 × 240 for NTSC, 352 × 288 for
PAL).
– MPEG-1 also allows specification of other formats as long as the
Constrained Parameter Set (CPS) as shown in Table 11.1 is satisfied:
– May also start and end anywhere as long as they fill the
whole picture.
10 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
11 Li & Drew
c Prentice Hall 2003
• Quantization:
8 × DCT [i, j] 8 × DCT [i, j]
QDCT [i, j] = round = round (11.1)
step size[i, j] Q1[i, j] ∗ scale
8 × DCT [i, j] 8 × DCT [i, j]
QDCT [i, j] = = (11.2)
step size[i, j] Q2[i, j] ∗ scale
12 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
13 Li & Drew
c Prentice Hall 2003
14 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
15 Li & Drew
c Prentice Hall 2003
Video sequence
Macroblock Macroblock
header Block 0 Block 1 Block 2 Block 3 Block 4 Block 5 layer
Differential Block
DC coefficient VLC run VLC run ... end_of_block layer
(if intra macroblock)
16 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
11.3 MPEG-2
17 Li & Drew
c Prentice Hall 2003
SNR Spatially
Level Simple Main Scalable Scalable High 4:2:2 Multiview
Profile Profile Profile Profile Profile Profile Profile
High * *
High 1440 * * *
Main * * * * * *
Low * *
18 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
19 Li & Drew
c Prentice Hall 2003
000000000
111111111
111111111
000000000
000000000
111111111 Top−field
000000000
111111111
000000000
111111111
111111111
000000000
000000000
111111111
000000000
111111111
000000000
111111111 111111111
000000000
000000000
111111111 000000000
111111111
000000000
111111111 111111111
000000000
111111111
000000000 111111111
000000000 Bottom−field
000000000
111111111 111111111
000000000
111111111
000000000 111111111
000000000
000000000
111111111 111111111
000000000
111111111
000000000 111111111
000000000
111111111
000000000
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
... 1111
0000
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111
0000
1111 0000
1111
0000
1111 0000
1111
0000
1111
0000
1111
0000
1111
0000
1111 0000
1111
0000
1111
0000
1111 0000
1111
0000
1111
0000
1111
0000
1111
0000
1111 0000
1111
0000
1111 0000
1111
0000
1111
0000
1111
0000
1111
0000
1111 0000
1111
0000
1111
0000
1111 0000
1111
0000
1111
0000
1111
0000
1111 0000
1111 0000
1111
0000
1111 0000
1111 0000
1111
I or P B P
20 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
21 Li & Drew
c Prentice Hall 2003
22 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
23 Li & Drew
c Prentice Hall 2003
– Due to the nature of interlaced video the consecutive rows in the 8×8
blocks are from different fields, there exists less correlation between
them than between the alternate rows.
– Alternate scan recognizes the fact that in interlaced video the verti-
cally higher spatial frequency components may have larger magnitudes
and thus allows them to be scanned earlier in the sequence.
24 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
(a) (b)
25 Li & Drew
c Prentice Hall 2003
MPEG-2 Scalabilities
26 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
27 Li & Drew
c Prentice Hall 2003
SNR Scalability
28 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
Bits_enhance
+ Q VLC
−
Q−1
+
Base Encoder
Current Bits_base
+ DCT Q VLC
Frame
−
Q−1
IDCT
MC−based Frame
Prediction Prediction Memory
Motion vectors
Motion
Estimation
(a) Encoder
29 Li & Drew
c Prentice Hall 2003
Bits_enhance
VLD Q−1
+
Base Decoder
Bits_base Output_base,
VLD Q−1 IDCT +
Output_high
Frame MC−based
Memory Prediction
Motion vectors
(b) Decoder
30 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
Spatial Scalability
31 Li & Drew
c Prentice Hall 2003
Spatial Bits_enhance
Current
frame enhancement layer Predicted MB
encoder from Enh. layer 16 × 16
1−w
+ 16 × 16
Spatial Spatial
decimator interpolator Interpolated MB w
from Base layer 16 × 16
(a) (b)
32 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
Temporal Scalability
33 Li & Drew
c Prentice Hall 2003
Temporal Bits_enhance
enhancement layer
encoder
Current Temporal
frame demultiplexer
Temporal Bits_base
base layer
encoder
34 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
Temporal
enhancement layer
B B B B
Base layer
I B B P ...
Temporal
Enhancement Layer
P B B B
Base Layer
I B B P ...
Hybrid Scalability
36 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
Data Partitioning
37 Li & Drew
c Prentice Hall 2003
38 Li & Drew
c Prentice Hall 2003
Fundamentals of Multimedia, Chapter 11
39 Li & Drew
c Prentice Hall 2003
• Text books:
40 Li & Drew
c Prentice Hall 2003