Atjaunināt sīkdatņu piekrišanu

Video coding standards: AVS China, H.264/MPEG-4 PART 10, HEVC, VP6, DIRAC and VC-1 2014 ed. [Hardback]

  • Formāts: Hardback, 499 pages, height x width: 235x155 mm, weight: 9044 g, 335 Illustrations, black and white; XXIII, 499 p. 335 illus., 1 Hardback
  • Sērija : Signals and Communication Technology
  • Izdošanas datums: 21-Oct-2013
  • Izdevniecība: Springer
  • ISBN-10: 9400767412
  • ISBN-13: 9789400767416
  • Hardback
  • Cena: 136,16 €*
  • * ši ir gala cena, t.i., netiek piemērotas nekādas papildus atlaides
  • Standarta cena: 160,19 €
  • Ietaupiet 15%
  • Grāmatu piegādes laiks ir 3-4 nedēļas, ja grāmata ir uz vietas izdevniecības noliktavā. Ja izdevējam nepieciešams publicēt jaunu tirāžu, grāmatas piegāde var aizkavēties.
  • Daudzums:
  • Ielikt grozā
  • Piegādes laiks - 4-6 nedēļas
  • Pievienot vēlmju sarakstam
  • Formāts: Hardback, 499 pages, height x width: 235x155 mm, weight: 9044 g, 335 Illustrations, black and white; XXIII, 499 p. 335 illus., 1 Hardback
  • Sērija : Signals and Communication Technology
  • Izdošanas datums: 21-Oct-2013
  • Izdevniecība: Springer
  • ISBN-10: 9400767412
  • ISBN-13: 9789400767416

The requirements for multimedia (especially video and audio) communications increase rapidly in the last two decades in broad areas such as television, entertainment, interactive services, telecommunications, conference, medicine, security, business, traffic, defense and banking. Video and audio coding standards play most important roles in multimedia communications. In order to meet these requirements, series of video and audio coding standards have been developed such as MPEG-2, MPEG-4, MPEG-21 for audio and video by ISO/IEC, H.26x for video and G.72x for audio by ITU-T, Video Coder 1 (VC-1) for video by the Society of Motion Picture and Television Engineers (SMPTE) and RealVideo (RV) 9 for video by Real Networks.

AVS China is the abbreviation for Audio Video Coding Standard of China. This new standard includes four main technical areas, which are systems, video, audio and digital copyright management (DRM), and some supporting documents such as consistency verification. The second part of the standard known as AVS1-P2 (Video - Jizhun) was approved as the national standard of China in 2006, and several final drafts of the standard have been completed, including AVS1-P1 (System - Broadcast), AVS1-P2 (Video - Zengqiang), AVS1-P3 (Audio - Double track), AVS1-P3 (Audio - 5.1), AVS1-P7 (Mobile Video), AVS-S-P2 (Video) and AVS-S-P3 (Audio). AVS China provides a technical solution for many applications such as digital broadcasting (SDTV and HDTV), high-density storage media, Internet streaming media, and will be used in the domestic IPTV, satellite and possibly the cable TV market. Comparing with other coding standards such as H.264 AVC, the advantages of AVS video standard include similar performance, lower complexity, lower implementation cost and licensing fees. This standard has attracted great deal of attention from industries related to television, multimedia communications and even chip manufacturing from around the world. Also many well known companies have joined the AVS Group to be Full Members or Observing Members. The 163 members of AVS Group include Texas Instruments (TI) Co., Agilent Technologies Co. Ltd., Envivio Inc., NDS, Philips Research East Asia, Aisino Corporation, LG, Alcatel Shanghai Bell Co. Ltd., Nokia (China) Investment (NCIC) Co. Ltd., Sony (China) Ltd., and Toshiba (China) Co. Ltd. as well as some high level universities in China. Thus there is a pressing need from the instructors, students, and engineers for a book dealing with the topic of AVS China and its performance comparisons with similar standards such as H.264, VC-1 and RV-9.



This book consolidates all the major video coding standards and examines developments in video/audio codecs and their applications. It helps the reader understand the concepts, key techniques and performances of AVS China and other video coding standards.
1 Introduction 1(36)
1.1 Popular Video and Audio Standards
1(2)
1.2 Digital Representation of Video
3(15)
1.3 Basic Structure of Video Codec
18(1)
1.4 Performance Comparison Metrics for Video Codec
18(10)
1.5 Digital Representation of Audio
28(1)
1.6 Basic Structure of Perceptual Audio Coding
29(2)
1.7 Performance Comparison Metrics for Audio Codec
31(5)
1.8 Summary
36(1)
2 Video Coding Standards and Video Formats 37(14)
2.1 Introduction
37(3)
2.2 Complexity Reduction
40(1)
2.3 Video Coding Standards
40(1)
2.4 MPEG and H.26x
41(4)
2.4.1 H.120
41(1)
2.4.2 H.261
41(1)
2.4.3 MPEG-l
42(1)
2.4.4 H.262/MPEG-2
42(1)
2.4.5 H.263, H.263+ and H.263++
43(1)
2.4.6 MPEG-4
43(1)
2.4.7 H.264/MPEG-4 Part 10/AVC
44(1)
2.4.8 H.265/HEVC
45(1)
2.5 Video Formats and Quality
45(5)
2.5.1 Frames and Fields
45(1)
2.5.2 Color Spaces
46(3)
2.5.3 Video Formats
49(1)
2.5.4 Quality
49(1)
2.6 Summary
50(1)
3 AVS China 51(48)
3.1 AVS China
51(1)
3.2 AVS China Profiles and Levels
52(3)
3.2.1 AVS-Video Jizhun (Base) Profile
52(1)
3.2.2 AVS-Video Jiben (Basic) Profile
53(1)
3.2.3 AVS-Video Shenzhan (Extended) Profile
53(2)
3.2.4 AVS-Video Jiaqiang (Enhanced) Profile
55(1)
3.3 Data Formats Used in AVS
55(4)
3.3.1 AVS Video Layered Structure
56(3)
3.4 AVS Video Encoder
59(10)
3.4.1 Encoder Process Outline
60(1)
3.4.2 Coding Tools Used in AVS Video Coder
61(8)
3.5 AVS Video Decoder
69(1)
3.6 AVS Video Bit Stream
69(3)
3.6.1 Start Code
70(1)
3.6.2 Start Code Value
70(1)
3.6.3 Picture_coding_type
71(1)
3.7 NAL Unit for AVS Video Stream
72(2)
3.7.1 NAL Unit Mapping with AVS Video Stream
72(1)
3.7.2 NAL Unit Header Description
72(2)
3.8 Introduction to AVS-M (AVS Part 7)
74(7)
3.8.1 Data Structure of AVS-M
75(3)
3.8.2 Embodiment of AVS-M
78(2)
3.8.3 Various Levels in Jiben Profile
80(1)
3.9 Block Mode Prediction Modes
81(4)
3.9.1 Infra Prediction
81(2)
3.9.2 Inter Prediction
83(1)
3.9.3 Skip Mode Prediction
84(1)
3.9.4 RD Optimization
85(1)
3.10 Transform, Quantization and Entropy Coding
85(4)
3.10.1 Transform
85(1)
3.10.2 Quantization
86(1)
3.10.3 Entropy Coding
86(1)
3.10.4 Simplified Deblocking Filter
87(2)
3.11 AVS Part-1: System
89(7)
3.11.1 Program Stream
91(1)
3.11.2 Transport Stream
92(4)
3.12 IEEE AVS
96(2)
3.12.1 Applications
97(1)
3.12.2 Profiles and Levels
97(1)
3.12.3 Overview of the Design Characteristics
97(1)
3.13 Summary
98(1)
3.14 Projects
98(1)
4 H.264/MPEG-4 Advanced Video Coding 99(26)
4.1 Introduction
99(1)
4.2 Profiles and Levels of H.264
100(5)
4.2.1 Profiles in H.264
100(5)
4.2.2 Levels in H.264
105(1)
4.3 H.264 Encoder
105(2)
4.4 Intra-Prediction
107(1)
4.5 Inter-Prediction
108(1)
4.6 Inter Prediction of Macroblocks in P-Slices
108(1)
4.7 Sub-Pixel Motion Vectors
109(3)
4.8 Transform and Quantization
112(1)
4.9 In-Loop Deblocking Filter
112(4)
4.9.1 Filter Strength
114(2)
4.10 B-Slices and Adaptive Weighted Prediction
116(1)
4.11 Entropy Coding
117(2)
4.12 H.264 Decoder
119(1)
4.13 Some Applications of H.264
120(1)
4.14 Summary
121(1)
4.15 Projects
121(4)
5 High Efficiency Video Coding (HEVC) 125(34)
5.1 Introduction
125(1)
5.2 Joint Collaborative Team on Video Coding
125(7)
5.3 Analysis of Coding Tools in HEVC Test Model, HM 1.0: Intra Prediction
132(1)
5.4 HEVC Encoder
132(6)
5.4.1 Infra Prediction
135(1)
5.4.2 Transform Coefficient Scanning
136(1)
5.4.3 Luma and Chroma Fractional Pixel Interpolation
137(1)
5.4.4 Comparison of Coding Tools of HM1 and HEVC Draft 9
137(1)
5.5 Extensions to HEVC
138(2)
5.6 Profiles and Levels
140(3)
5.7 Performance and Computational Complexity of HEVC Encoders
143(1)
5.8 System Layer Integration of HEVC
144(1)
5.9 HEVC Lossless Coding and Improvements
144(2)
5.10 Summary
146(2)
5.11 Projects
148(11)
6 VP6 Video Coding Standard 159(40)
6.1 Introduction
159(1)
6.2 Comparison with Previous Flash Codec MX
160(5)
6.3 VP6 Algorithm Fundamentals
165(1)
6.4 Coding Profiles in VP6
166(1)
6.5 Types of Frames
167(1)
6.5.1 Golden Frames
168(1)
6.6 MB Modes
168(2)
6.6.1 MB Modes in I-Frames (Intra-Mode)
168(1)
6.6.2 MB Modes in P-Frames (Inter-Modes and Intra-Mode)
169(1)
6.7 Nearest and Near Blocks
170(1)
6.8 Motion Vectors
171(1)
6.8.1 Encoding
172(1)
6.8.2 Prediction Loop Filtering
172(1)
6.9 Filtering for Fractional Pixel Motion Compensation
172(2)
6.9.1 Bilinear Filtering
173(1)
6.9.2 Bicubic Filtering
173(1)
6.10 Support for Unrestricted Motion Vectors
174(1)
6.11 Prediction Loop Filtering
174(1)
6.12 DCT, Scan Orders and Coefficient Token Set
174(12)
6.12.1 Scan Orders
180(1)
6.12.2 DCT Coding and Coefficient Token Set
181(5)
6.13 Quantization
186(1)
6.14 Entropy Coding
187(2)
6.14.1 Use of Context Information
188(1)
6.14.2 Huffman Coder
188(1)
6.14.3 BoolCoder
189(1)
6.15 An Overview on VP6 Coding
189(1)
6.16 Performance of VP6 Coding
190(1)
6.17 VP6 Golden Frames
191(1)
6.18 Background/Foreground Segmentation
191(1)
6.19 Context Predictive Entropy Encoding
192(1)
6.20 Bitstream Partitions
192(2)
6.21 Dual Mode Arithmetic and VLC Encoding
194(1)
6.22 Adaptive Sub-Pixel Motion Estimation
194(1)
6.23 VP6-E and VP6-S Encoder Profiles
194(1)
6.24 Device Ports and Hardware Implementations
195(2)
6.25 Summary
197(1)
6.26 Projects
197(2)
7 Performance Analysis and Comparison of the Dirac Video Codec with H.264/MPEG-4, Part 10 199(22)
7.1 Introduction
199(1)
7.2 Dirac Architecture
200(2)
7.2.1 Dirac Encoder
200(1)
7.2.2 Dirac Decoder
201(1)
7.3 Stages of Encoding and Decoding in Dirac
202(7)
7.3.1 Wavelet Transform
202(2)
7.3.2 Scaling and Quantization
204(1)
7.3.3 Entropy Coding
205(1)
7.3.4 Motion Estimation
206(1)
7.3.5 Motion Compensation
207(1)
7.3.6 Decoder
208(1)
7.4 Implementation
209(2)
7.4.1 Code Structure Overview
209(1)
7.4.2 Simplicity and Relative Speed of Encoding
209(2)
7.5 Results
211(6)
7.5.1 Compression Ratio Test
211(2)
7.5.2 SSIM Test
213(1)
7.5.3 PSNR Test
214(3)
7.6 Conclusions
217(1)
7.7 Future Research
218(1)
7.8 Summary
218(1)
7.9 Projects
218(3)
8 The VC-1 Video Coding 221(50)
8.1 The VC-1 Structure
221(1)
8.2 Integer Transform Coding
222(4)
8.2.1 Inverse Transform
222(2)
8.2.2 Forward Transform
224(2)
8.3 Motion Estimation/Compensation
226(6)
8.3.1 Loop Filter
227(1)
8.3.2 Complexity
228(1)
8.3.3 Profiles and Levels
229(3)
8.4 The Simple Profile
232(12)
8.4.1 Bitstream Structure
232(1)
8.4.2 Baseline Intra-Frame Compression
233(1)
8.4.3 Variable-Size Transform Specifications
234(2)
8.4.4 Overlapped Transform
236(2)
8.4.5 4MV per MB
238(2)
8.4.6 Quarter-pel MC for Y
240(4)
8.5 The Main Profile
244(10)
8.5.1 Quarter-pel MC for CbCr
244(1)
8.5.2 Start Codes
244(2)
8.5.3 Extended MV
246(1)
8.5.4 Loop Filter
246(1)
8.5.5 Dynamic Resolution Change
247(2)
8.5.6 B Frames
249(1)
8.5.7 Adaptive MB Quantization
250(3)
8.5.8 Intensity Compensation
253(1)
8.5.9 Range Adjustment
254(1)
8.6 The Advanced Profile
254(5)
8.6.1 Bitstream Structure
254(1)
8.6.2 Interlace
255(2)
8.6.3 Sequence Level User Data
257(1)
8.6.4 Entry Point Layer
258(1)
8.6.5 Display Metadata
258(1)
8.7 The H.264 to VC-1 Transcoding
259(4)
8.7.1 Infra MB Mode Mapping
260(1)
8.7.2 Inter MB Mode Mapping
261(1)
8.7.3 Motion Vector Mapping
262(1)
8.7.4 Reference Pictures
263(1)
8.7.5 Skipped MB
263(1)
8.8 Transport of VC-1
263(4)
8.8.1 Encapsulation of VC-1 in TS
265(1)
8.8.2 Encapsulation of VC-1 in PS
265(2)
8.9 VC-2 Video Compression
267(2)
8.9.1 Introduction
267(1)
8.9.2 Scope
268(1)
8.10 Summary
269(1)
8.11 Projects
269(2)
Appendix A: Investigation of Image Quality of Dirac, H.264 and H.265 271(24)
A.1 Introduction
271(1)
A.2 H.265
271(1)
A.3 Image Quality Assessment Using SSIM and FSIM
272(5)
A.4 Results
277(1)
A.4.1 Results using Foreman QCIF Sequence
277(1)
A.4.2 Results using Foreman CIF Sequence
277(1)
A.4.3 Results using container QCIF Sequence
277(1)
A.4.4 Results using container CIF Sequence
277(1)
A.5 Conclusions
277(17)
A.6 Projects
294(1)
Appendix B: PSNR Average for AVSNR Software 295(2)
Appendix C: A Universal Image Quality Index and SSIM Comparison 297(28)
C.1 Introduction
297(5)
C.2 Universal Image Quality Index [ Q8]
302(2)
C.3 Structural Similarity Index [ Q13]
304(7)
C.4 Images with Disortions [ G11]
311(4)
C.5 Results
315(3)
C.6 Conclusions
318(1)
C.7 Project
319(1)
C.8 JVT Document on Video Quality Metrics in the H.264 Reference Software
320(5)
Appendix D: Implementation of Mode Dependent DCT/DST in H.264 325(22)
D.1 Introduction
325(1)
D.2 Transform Implementation in the Reference Software
326(1)
D.3 Proposed Scheme
327(3)
D.3.1 Mapping from Intra Prediction Modes to DCT/DST
327(1)
D.3.2 Obtaining DST Matrices for H.264
327(2)
D.3.3 Implementation of DCT/DST in the Reference Software for H.264/AVC
329(1)
D.4 Calculation of BD-PSNR and BD-Bit rate
330(1)
D.5 Performance Analysis
331(10)
D.5.1 Results for WQVGA (416x240) Sequences
331(1)
D.5.2 Results for WVGA (832x480) Sequences
332(1)
D.5.3 Results for HD (1920 x 1080) Sequences
332(3)
D.5.4 Results for HD (1080x720) Sequences
335(1)
D.5.5 Results for different combinations of DCT/DST applied to RaceHorses Sequences
335(6)
D.6 Conclusions and Future Work
341(6)
Appendix E: Performance Analysis and Comparison of JM, Intel IPP and X264 for H.264 Softwares 347(22)
E.1 H.264
347(2)
E.2 JM Software [ H30]
349(1)
E.3 X264 [ XI]
349(1)
E.4 Intel IPP [ X3]
350(1)
E.5 JM (17.2) Performance Analysis
351(4)
E.6 X264 Performance Analysis
355(1)
E.7 Intel IPP Performance Analysis
356(4)
E.8 Comparison of SSIM for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles
360(2)
E.9 Comparison of PSNR for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles
362(2)
E.10 Comparison of Encoding Time for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles
364(2)
E.11 Comparison of Compression Ratio for JM, X264 and Intel IPP Softwares in Baseline, Main and High Profiles
366(2)
E.12 Conclusions
368(1)
E.13 Future Work
368(1)
Appendix F: Implementation of AIC Based on I-Frame Only Coding in H.264 and Comparison with Other Still Frame Image Coding Standards Such as JPEG, JPEG 2000, JPEG-LS and JPEG-XR 369(52)
F.1 Introduction
369(1)
F.2 Advanced Image Coding
370(5)
F.3 Modified AIC
375(2)
F.4 H.264 Standard
377(3)
F.5 JPEG
380(1)
F.6 JPEG2000
381(2)
F.7 JPEG XR
383(1)
F.8 JPEG-LS
384(1)
F.9 JPEG-LS Algorithm
385(2)
F.10 Main Differences [ AC1, HI1, J22, JX3, JL2, JL4]
387(1)
F.11 Evaluation Methodology
388(4)
F.12 Conclusions and Future Work
392(29)
Appendix G: Higher Order 2-D ICTs for HD Video Coding 421(22)
G.1 Discrete Cosine Transform and Video Compression
421(2)
G.2 Integer Cosine Transforms
423(2)
G.3 Simple 2-D Order 16 ICT
425(4)
G.4 Modified 2-D Order 16 ICT
429(4)
G.5 2-D Order 16 binDCT Based on Loeffler's Factorization
433(2)
G.6 Transform Coding Gain
435(2)
G.7 Implementation in H.264/AVC and Performance Analysis
437(2)
G.8 Implementation in AVS Video and Performance Analysis
439(3)
G.9 Conclusions and Future Work
442(1)
Appendix H: Comparison of H.264 Codecs 443(6)
Bibliography 449(30)
Index 479
Prof. K. R. Rao received the Ph. D. degree in electrical engineering from The University of New Mexico, Albuquerque in 1966. Since 1966, he has been with the University of Texas at Arlington where he is currently a professor of electrical engineering. He, along with two other researchers, introduced the Discrete Cosine Transform in 1975 which has since become very popular in digital signal processing. He is the co-author of the books "Orthogonal Transforms for Digital Signal Processing" (Springer-Verlag, 1975), "Fast Transforms: Analyses and Applications" (Academic Press, 1982), "Discrete Cosine Transform-Algorithms, Advantages, Applications" (Academic Press, 1990). He has edited a benchmark volume, "Discrete Transforms and Their Applications" (Van Nostrand Reinhold, 1985). He has coedited a benchmark volume, "Teleconferencing" (Van Nostrand Reinhold, 1985). He is co-author of the books, "Techniques and standards for Image/Video/Audio Coding" (Prentice Hall, 1996), "Packet video communications over ATM networks"(Prentice Hall, 2000) and "Multimedia communication Systems" (Prentice Hall, 2002). He is coeditor of the handbooks, "The transform and data compression handbook" (CRC Press, 2001), and "Digital video image quality and perceptual coding" (Marcel Dekker, 2004). He is co-author of the book, "Fast Fourier Transform and Its Applications" (Springer, 2009). Some of his books have been translated into Japanese, Chinese, Korean and Russian. He has conducted workshops/tutorials on video/audio coding/standards worldwide. He has published extensively in refereed journals and has been a consultant to industry, research institutes and academia. He is a Fellow of the IEEE.

Dr. Do Nyeon Kim received the B.E. degree from Kyungpook National University, Daegu, Korea, in 1985 and the M.S. and Ph.D. degrees, both in electrical engineering, both from Yonsei University, Seoul, Korea, in 1988 and 2004, respectively. From 1989 to 1994, he was a senior researcher at ETRI, Daejeon, Korea. He was a lecturer at Yonsei University in 2004. From 2005, he is a visiting professor in the Department of Electrical Engineering, the University of Texas at Arlington. He is a recipient of IT scholarship for 2005 and 2006 for post-doctoral program from the Institute for Information Technology Advancement and the Ministry of Information and Communication, Republic of Korea. He is co-author of the book, "Fast Fourier Transform and Its Applications" (Springer, 2010).  

Prof. J.J. Hwang received the B.S., M.S., and Ph.D. degrees in electronic engineering from the Chonbuk National University in 1983, 1986, and 1992, respectively. He is currently full-professor at the Kunsan National University, Korea, and adjunct professor of RMIT University, Sydney,  Australia. His research interests are digital image/video coding & processing, information theory, object segmentation and tracking, and 2D/3D visual quality assessment and evaluation. He is the co-author of Techniques and standards for image, video and audio coding (Prentice Hall, 1996) and Fast Fourier transform Algorithms and applications (Springer, 2010).