1 Introduction |
|
1 | (36) |
|
1.1 Popular Video and Audio Standards |
|
|
1 | (2) |
|
1.2 Digital Representation of Video |
|
|
3 | (15) |
|
1.3 Basic Structure of Video Codec |
|
|
18 | (1) |
|
1.4 Performance Comparison Metrics for Video Codec |
|
|
18 | (10) |
|
1.5 Digital Representation of Audio |
|
|
28 | (1) |
|
1.6 Basic Structure of Perceptual Audio Coding |
|
|
29 | (2) |
|
1.7 Performance Comparison Metrics for Audio Codec |
|
|
31 | (5) |
|
|
36 | (1) |
2 Video Coding Standards and Video Formats |
|
37 | (14) |
|
|
37 | (3) |
|
|
40 | (1) |
|
2.3 Video Coding Standards |
|
|
40 | (1) |
|
|
41 | (4) |
|
|
41 | (1) |
|
|
41 | (1) |
|
|
42 | (1) |
|
|
42 | (1) |
|
2.4.5 H.263, H.263+ and H.263++ |
|
|
43 | (1) |
|
|
43 | (1) |
|
2.4.7 H.264/MPEG-4 Part 10/AVC |
|
|
44 | (1) |
|
|
45 | (1) |
|
2.5 Video Formats and Quality |
|
|
45 | (5) |
|
|
45 | (1) |
|
|
46 | (3) |
|
|
49 | (1) |
|
|
49 | (1) |
|
|
50 | (1) |
3 AVS China |
|
51 | (48) |
|
|
51 | (1) |
|
3.2 AVS China Profiles and Levels |
|
|
52 | (3) |
|
3.2.1 AVS-Video Jizhun (Base) Profile |
|
|
52 | (1) |
|
3.2.2 AVS-Video Jiben (Basic) Profile |
|
|
53 | (1) |
|
3.2.3 AVS-Video Shenzhan (Extended) Profile |
|
|
53 | (2) |
|
3.2.4 AVS-Video Jiaqiang (Enhanced) Profile |
|
|
55 | (1) |
|
3.3 Data Formats Used in AVS |
|
|
55 | (4) |
|
3.3.1 AVS Video Layered Structure |
|
|
56 | (3) |
|
|
59 | (10) |
|
3.4.1 Encoder Process Outline |
|
|
60 | (1) |
|
3.4.2 Coding Tools Used in AVS Video Coder |
|
|
61 | (8) |
|
|
69 | (1) |
|
|
69 | (3) |
|
|
70 | (1) |
|
|
70 | (1) |
|
3.6.3 Picture_coding_type |
|
|
71 | (1) |
|
3.7 NAL Unit for AVS Video Stream |
|
|
72 | (2) |
|
3.7.1 NAL Unit Mapping with AVS Video Stream |
|
|
72 | (1) |
|
3.7.2 NAL Unit Header Description |
|
|
72 | (2) |
|
3.8 Introduction to AVS-M (AVS Part 7) |
|
|
74 | (7) |
|
3.8.1 Data Structure of AVS-M |
|
|
75 | (3) |
|
3.8.2 Embodiment of AVS-M |
|
|
78 | (2) |
|
3.8.3 Various Levels in Jiben Profile |
|
|
80 | (1) |
|
3.9 Block Mode Prediction Modes |
|
|
81 | (4) |
|
|
81 | (2) |
|
|
83 | (1) |
|
3.9.3 Skip Mode Prediction |
|
|
84 | (1) |
|
|
85 | (1) |
|
3.10 Transform, Quantization and Entropy Coding |
|
|
85 | (4) |
|
|
85 | (1) |
|
|
86 | (1) |
|
|
86 | (1) |
|
3.10.4 Simplified Deblocking Filter |
|
|
87 | (2) |
|
|
89 | (7) |
|
|
91 | (1) |
|
|
92 | (4) |
|
|
96 | (2) |
|
|
97 | (1) |
|
3.12.2 Profiles and Levels |
|
|
97 | (1) |
|
3.12.3 Overview of the Design Characteristics |
|
|
97 | (1) |
|
|
98 | (1) |
|
|
98 | (1) |
4 H.264/MPEG-4 Advanced Video Coding |
|
99 | (26) |
|
|
99 | (1) |
|
4.2 Profiles and Levels of H.264 |
|
|
100 | (5) |
|
|
100 | (5) |
|
|
105 | (1) |
|
|
105 | (2) |
|
|
107 | (1) |
|
|
108 | (1) |
|
4.6 Inter Prediction of Macroblocks in P-Slices |
|
|
108 | (1) |
|
4.7 Sub-Pixel Motion Vectors |
|
|
109 | (3) |
|
4.8 Transform and Quantization |
|
|
112 | (1) |
|
4.9 In-Loop Deblocking Filter |
|
|
112 | (4) |
|
|
114 | (2) |
|
4.10 B-Slices and Adaptive Weighted Prediction |
|
|
116 | (1) |
|
|
117 | (2) |
|
|
119 | (1) |
|
4.13 Some Applications of H.264 |
|
|
120 | (1) |
|
|
121 | (1) |
|
|
121 | (4) |
5 High Efficiency Video Coding (HEVC) |
|
125 | (34) |
|
|
125 | (1) |
|
5.2 Joint Collaborative Team on Video Coding |
|
|
125 | (7) |
|
5.3 Analysis of Coding Tools in HEVC Test Model, HM 1.0: Intra Prediction |
|
|
132 | (1) |
|
|
132 | (6) |
|
|
135 | (1) |
|
5.4.2 Transform Coefficient Scanning |
|
|
136 | (1) |
|
5.4.3 Luma and Chroma Fractional Pixel Interpolation |
|
|
137 | (1) |
|
5.4.4 Comparison of Coding Tools of HM1 and HEVC Draft 9 |
|
|
137 | (1) |
|
|
138 | (2) |
|
|
140 | (3) |
|
5.7 Performance and Computational Complexity of HEVC Encoders |
|
|
143 | (1) |
|
5.8 System Layer Integration of HEVC |
|
|
144 | (1) |
|
5.9 HEVC Lossless Coding and Improvements |
|
|
144 | (2) |
|
|
146 | (2) |
|
|
148 | (11) |
6 VP6 Video Coding Standard |
|
159 | (40) |
|
|
159 | (1) |
|
6.2 Comparison with Previous Flash Codec MX |
|
|
160 | (5) |
|
6.3 VP6 Algorithm Fundamentals |
|
|
165 | (1) |
|
6.4 Coding Profiles in VP6 |
|
|
166 | (1) |
|
|
167 | (1) |
|
|
168 | (1) |
|
|
168 | (2) |
|
6.6.1 MB Modes in I-Frames (Intra-Mode) |
|
|
168 | (1) |
|
6.6.2 MB Modes in P-Frames (Inter-Modes and Intra-Mode) |
|
|
169 | (1) |
|
6.7 Nearest and Near Blocks |
|
|
170 | (1) |
|
|
171 | (1) |
|
|
172 | (1) |
|
6.8.2 Prediction Loop Filtering |
|
|
172 | (1) |
|
6.9 Filtering for Fractional Pixel Motion Compensation |
|
|
172 | (2) |
|
|
173 | (1) |
|
|
173 | (1) |
|
6.10 Support for Unrestricted Motion Vectors |
|
|
174 | (1) |
|
6.11 Prediction Loop Filtering |
|
|
174 | (1) |
|
6.12 DCT, Scan Orders and Coefficient Token Set |
|
|
174 | (12) |
|
|
180 | (1) |
|
6.12.2 DCT Coding and Coefficient Token Set |
|
|
181 | (5) |
|
|
186 | (1) |
|
|
187 | (2) |
|
6.14.1 Use of Context Information |
|
|
188 | (1) |
|
|
188 | (1) |
|
|
189 | (1) |
|
6.15 An Overview on VP6 Coding |
|
|
189 | (1) |
|
6.16 Performance of VP6 Coding |
|
|
190 | (1) |
|
|
191 | (1) |
|
6.18 Background/Foreground Segmentation |
|
|
191 | (1) |
|
6.19 Context Predictive Entropy Encoding |
|
|
192 | (1) |
|
6.20 Bitstream Partitions |
|
|
192 | (2) |
|
6.21 Dual Mode Arithmetic and VLC Encoding |
|
|
194 | (1) |
|
6.22 Adaptive Sub-Pixel Motion Estimation |
|
|
194 | (1) |
|
6.23 VP6-E and VP6-S Encoder Profiles |
|
|
194 | (1) |
|
6.24 Device Ports and Hardware Implementations |
|
|
195 | (2) |
|
|
197 | (1) |
|
|
197 | (2) |
7 Performance Analysis and Comparison of the Dirac Video Codec with H.264/MPEG-4, Part 10 |
|
199 | (22) |
|
|
199 | (1) |
|
|
200 | (2) |
|
|
200 | (1) |
|
|
201 | (1) |
|
7.3 Stages of Encoding and Decoding in Dirac |
|
|
202 | (7) |
|
|
202 | (2) |
|
7.3.2 Scaling and Quantization |
|
|
204 | (1) |
|
|
205 | (1) |
|
|
206 | (1) |
|
7.3.5 Motion Compensation |
|
|
207 | (1) |
|
|
208 | (1) |
|
|
209 | (2) |
|
7.4.1 Code Structure Overview |
|
|
209 | (1) |
|
7.4.2 Simplicity and Relative Speed of Encoding |
|
|
209 | (2) |
|
|
211 | (6) |
|
7.5.1 Compression Ratio Test |
|
|
211 | (2) |
|
|
213 | (1) |
|
|
214 | (3) |
|
|
217 | (1) |
|
|
218 | (1) |
|
|
218 | (1) |
|
|
218 | (3) |
8 The VC-1 Video Coding |
|
221 | (50) |
|
|
221 | (1) |
|
8.2 Integer Transform Coding |
|
|
222 | (4) |
|
|
222 | (2) |
|
|
224 | (2) |
|
8.3 Motion Estimation/Compensation |
|
|
226 | (6) |
|
|
227 | (1) |
|
|
228 | (1) |
|
8.3.3 Profiles and Levels |
|
|
229 | (3) |
|
|
232 | (12) |
|
8.4.1 Bitstream Structure |
|
|
232 | (1) |
|
8.4.2 Baseline Intra-Frame Compression |
|
|
233 | (1) |
|
8.4.3 Variable-Size Transform Specifications |
|
|
234 | (2) |
|
8.4.4 Overlapped Transform |
|
|
236 | (2) |
|
|
238 | (2) |
|
8.4.6 Quarter-pel MC for Y |
|
|
240 | (4) |
|
|
244 | (10) |
|
8.5.1 Quarter-pel MC for CbCr |
|
|
244 | (1) |
|
|
244 | (2) |
|
|
246 | (1) |
|
|
246 | (1) |
|
8.5.5 Dynamic Resolution Change |
|
|
247 | (2) |
|
|
249 | (1) |
|
8.5.7 Adaptive MB Quantization |
|
|
250 | (3) |
|
8.5.8 Intensity Compensation |
|
|
253 | (1) |
|
|
254 | (1) |
|
|
254 | (5) |
|
8.6.1 Bitstream Structure |
|
|
254 | (1) |
|
|
255 | (2) |
|
8.6.3 Sequence Level User Data |
|
|
257 | (1) |
|
|
258 | (1) |
|
|
258 | (1) |
|
8.7 The H.264 to VC-1 Transcoding |
|
|
259 | (4) |
|
8.7.1 Infra MB Mode Mapping |
|
|
260 | (1) |
|
8.7.2 Inter MB Mode Mapping |
|
|
261 | (1) |
|
8.7.3 Motion Vector Mapping |
|
|
262 | (1) |
|
|
263 | (1) |
|
|
263 | (1) |
|
|
263 | (4) |
|
8.8.1 Encapsulation of VC-1 in TS |
|
|
265 | (1) |
|
8.8.2 Encapsulation of VC-1 in PS |
|
|
265 | (2) |
|
8.9 VC-2 Video Compression |
|
|
267 | (2) |
|
|
267 | (1) |
|
|
268 | (1) |
|
|
269 | (1) |
|
|
269 | (2) |
Appendix A: Investigation of Image Quality of Dirac, H.264 and H.265 |
|
271 | (24) |
|
|
271 | (1) |
|
|
271 | (1) |
|
A.3 Image Quality Assessment Using SSIM and FSIM |
|
|
272 | (5) |
|
|
277 | (1) |
|
A.4.1 Results using Foreman QCIF Sequence |
|
|
277 | (1) |
|
A.4.2 Results using Foreman CIF Sequence |
|
|
277 | (1) |
|
A.4.3 Results using container QCIF Sequence |
|
|
277 | (1) |
|
A.4.4 Results using container CIF Sequence |
|
|
277 | (1) |
|
|
277 | (17) |
|
|
294 | (1) |
Appendix B: PSNR Average for AVSNR Software |
|
295 | (2) |
Appendix C: A Universal Image Quality Index and SSIM Comparison |
|
297 | (28) |
|
|
297 | (5) |
|
C.2 Universal Image Quality Index [ Q8] |
|
|
302 | (2) |
|
C.3 Structural Similarity Index [ Q13] |
|
|
304 | (7) |
|
C.4 Images with Disortions [ G11] |
|
|
311 | (4) |
|
|
315 | (3) |
|
|
318 | (1) |
|
|
319 | (1) |
|
C.8 JVT Document on Video Quality Metrics in the H.264 Reference Software |
|
|
320 | (5) |
Appendix D: Implementation of Mode Dependent DCT/DST in H.264 |
|
325 | (22) |
|
|
325 | (1) |
|
D.2 Transform Implementation in the Reference Software |
|
|
326 | (1) |
|
|
327 | (3) |
|
D.3.1 Mapping from Intra Prediction Modes to DCT/DST |
|
|
327 | (1) |
|
D.3.2 Obtaining DST Matrices for H.264 |
|
|
327 | (2) |
|
D.3.3 Implementation of DCT/DST in the Reference Software for H.264/AVC |
|
|
329 | (1) |
|
D.4 Calculation of BD-PSNR and BD-Bit rate |
|
|
330 | (1) |
|
|
331 | (10) |
|
D.5.1 Results for WQVGA (416x240) Sequences |
|
|
331 | (1) |
|
D.5.2 Results for WVGA (832x480) Sequences |
|
|
332 | (1) |
|
D.5.3 Results for HD (1920 x 1080) Sequences |
|
|
332 | (3) |
|
D.5.4 Results for HD (1080x720) Sequences |
|
|
335 | (1) |
|
D.5.5 Results for different combinations of DCT/DST applied to RaceHorses Sequences |
|
|
335 | (6) |
|
D.6 Conclusions and Future Work |
|
|
341 | (6) |
Appendix E: Performance Analysis and Comparison of JM, Intel IPP and X264 for H.264 Softwares |
|
347 | (22) |
|
|
347 | (2) |
|
|
349 | (1) |
|
|
349 | (1) |
|
|
350 | (1) |
|
E.5 JM (17.2) Performance Analysis |
|
|
351 | (4) |
|
E.6 X264 Performance Analysis |
|
|
355 | (1) |
|
E.7 Intel IPP Performance Analysis |
|
|
356 | (4) |
|
E.8 Comparison of SSIM for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles |
|
|
360 | (2) |
|
E.9 Comparison of PSNR for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles |
|
|
362 | (2) |
|
E.10 Comparison of Encoding Time for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles |
|
|
364 | (2) |
|
E.11 Comparison of Compression Ratio for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles |
|
|
366 | (2) |
|
|
368 | (1) |
|
|
368 | (1) |
Appendix F: Implementation of AIC Based on I-Frame Only Coding in H.264 and Comparison with Other Still Frame Image Coding Standards Such as JPEG, JPEG 2000, JPEG-LS and JPEG-XR |
|
369 | (52) |
|
|
369 | (1) |
|
F.2 Advanced Image Coding |
|
|
370 | (5) |
|
|
375 | (2) |
|
|
377 | (3) |
|
|
380 | (1) |
|
|
381 | (2) |
|
|
383 | (1) |
|
|
384 | (1) |
|
|
385 | (2) |
|
F.10 Main Differences [ AC1, HI1, J22, JX3, JL2, JL4] |
|
|
387 | (1) |
|
F.11 Evaluation Methodology |
|
|
388 | (4) |
|
F.12 Conclusions and Future Work |
|
|
392 | (29) |
Appendix G: Higher Order 2-D ICTs for HD Video Coding |
|
421 | (22) |
|
G.1 Discrete Cosine Transform and Video Compression |
|
|
421 | (2) |
|
G.2 Integer Cosine Transforms |
|
|
423 | (2) |
|
G.3 Simple 2-D Order 16 ICT |
|
|
425 | (4) |
|
G.4 Modified 2-D Order 16 ICT |
|
|
429 | (4) |
|
G.5 2-D Order 16 binDCT Based on Loeffler's Factorization |
|
|
433 | (2) |
|
G.6 Transform Coding Gain |
|
|
435 | (2) |
|
G.7 Implementation in H.264/AVC and Performance Analysis |
|
|
437 | (2) |
|
G.8 Implementation in AVS Video and Performance Analysis |
|
|
439 | (3) |
|
G.9 Conclusions and Future Work |
|
|
442 | (1) |
Appendix H: Comparison of H.264 Codecs |
|
443 | (6) |
Bibliography |
|
449 | (30) |
Index |
|
479 | |