Part 2: Video Coding Techniques

Similar documents
Introduction to Video Compression H.261

CSE 408 Multimedia Information System Yezhou Yang

Image Data Compression

6. H.261 Video Coding Standard

encoding without prediction) (Server) Quantization: Initial Data 0, 1, 2, Quantized Data 0, 1, 2, 3, 4, 8, 16, 32, 64, 128, 256

Real-Time Audio and Video

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5

RATE-DISTORTION ANALYSIS AND TRAFFIC MODELING OF SCALABLE VIDEO CODERS. A Dissertation MIN DAI

Wavelet Scalable Video Codec Part 1: image compression by JPEG2000

Image Compression. Fundamentals: Coding redundancy. The gray level histogram of an image can reveal a great deal of information about the image

Image Compression - JPEG

H.264/MPEG4 Part INTRODUCTION Terminology

Statistical Analysis and Distortion Modeling of MPEG-4 FGS

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

Multimedia Networking ECE 599

Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming

Multimedia. Multimedia Data Compression (Lossless Compression Algorithms)

Modelling of produced bit rate through the percentage of null quantized transform coefficients ( zeros )

CSE 126 Multimedia Systems Midterm Exam (Form A)

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding

BASICS OF COMPRESSION THEORY

Product Obsolete/Under Obsolescence. Quantization. Author: Latha Pillai

New Trends in High Definition Video Compression - Application to Multiple Description Coding

Predictive Coding. Prediction Prediction in Images

Predictive Coding. Prediction

Compression. Encryption. Decryption. Decompression. Presentation of Information to client site

Multimedia Communications Fall 07 Midterm Exam (Close Book)

Image compression. Institute of Engineering & Technology, Ahmedabad University. October 20, 2015

Detailed Review of H.264/AVC

JPEG and JPEG2000 Image Coding Standards

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD

L. Yaroslavsky. Fundamentals of Digital Image Processing. Course

Fast Intra Coding Method of H.264 for Video Surveillance System

The MPEG4/AVC standard: description and basic tasks splitting

Basic Principles of Video Coding

Progressive Wavelet Coding of Images

Fast Progressive Wavelet Coding

Lecture 7 Predictive Coding & Quantization

A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv Techniques

Lec 05 Arithmetic Coding

A L A BA M A L A W R E V IE W

OH BOY! Story. N a r r a t iv e a n d o bj e c t s th ea t e r Fo r a l l a g e s, fr o m th e a ge of 9

Objective: Reduction of data redundancy. Coding redundancy Interpixel redundancy Psychovisual redundancy Fall LIST 2

On Compression Encrypted Data part 2. Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University

A DISTRIBUTED VIDEO CODER BASED ON THE H.264/AVC STANDARD

LORD: LOw-complexity, Rate-controlled, Distributed video coding system

Software Process Models there are many process model s in th e li t e ra t u re, s om e a r e prescriptions and some are descriptions you need to mode

C o r p o r a t e l i f e i n A n c i e n t I n d i a e x p r e s s e d i t s e l f

Multimedia Information Systems

Multimedia & Computer Visualization. Exercise #5. JPEG compression

Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse

MODERN video coding standards, such as H.263, H.264,

Human Visual System Based Adaptive Inter Quantization

IMAGE COMPRESSION IMAGE COMPRESSION-II. Coding Redundancy (contd.) Data Redundancy. Predictive coding. General Model

Executive Committee and Officers ( )

Motion Vector Prediction With Reference Frame Consideration

IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER. Shohei Matsuo, Seishi Takamura, and Hirohisa Jozawa

IMAGE COMPRESSION-II. Week IX. 03/6/2003 Image Compression-II 1

Lesson Ten. What role does energy play in chemical reactions? Grade 8. Science. 90 minutes ENGLISH LANGUAGE ARTS

Coding for loss tolerant systems

Enhanced SATD-based cost function for mode selection of H.264/AVC intra coding

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression

H.264 / MPEG-4 Part 10 : Intra Prediction

DELIVERING video of good quality over the Internet or

Rate-distortion Analysis and Control in DCT-based Scalable Video Coding. Xie Jun

Lec 04 Variable Length Coding (VLC) in JPEG

Image Compression. Qiaoyong Zhong. November 19, CAS-MPG Partner Institute for Computational Biology (PICB)

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders

Digital Image Processing Lectures 25 & 26

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms

176 5 t h Fl oo r. 337 P o ly me r Ma te ri al s

Fast Frame-Based Scene Change Detection in the Compressed Domain for MPEG-4 Video

P a g e 5 1 of R e p o r t P B 4 / 0 9

Converting DCT Coefficients to H.264/AVC

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding

Direction-Adaptive Transforms for Coding Prediction Residuals

I M P O R T A N T S A F E T Y I N S T R U C T I O N S W h e n u s i n g t h i s e l e c t r o n i c d e v i c e, b a s i c p r e c a u t i o n s s h o

Optimization of Selective Enhancement for MPEG-4 Fine Granularity Scalability

Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding

Scalable color image coding with Matching Pursuit

SYDE 575: Introduction to Image Processing. Image Compression Part 2: Variable-rate compression

A Complete Video Coding Chain Based on Multi-Dimensional Discrete Cosine Transform

A Hyper-Trellis based Turbo Decoder for Wyner-Ziv Video Coding

Context-adaptive coded block pattern coding for H.264/AVC

Rate-Distortion Based Temporal Filtering for. Video Compression. Beckman Institute, 405 N. Mathews Ave., Urbana, IL 61801

Compression and Coding

3684 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 23, NO. 8, AUGUST 2014

Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p.

arxiv: v1 [cs.mm] 10 Mar 2016

Phase-Correlation Motion Estimation Yi Liang

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction

Multi-Hypothesis based Distributed Video Coding using LDPC Codes

SCALABLE 3-D WAVELET VIDEO CODING

HARMONIC VECTOR QUANTIZATION

Digital communication system. Shannon s separation principle

2. the basis functions have different symmetries. 1 k = 0. x( t) 1 t 0 x(t) 0 t 1

THE newest video coding standard is known as H.264/AVC

Transcription:

art 2: Video Coding Techniques Outline Vincent Roca and Christoph Neumann {firstname.name}@inrialpes.fr lanète project; INRIA Rhône-Alpes MIS 03, Napoli, November 2003 Copyright 2003, INRIA; all rights reserved R a w I m a g e f o r m a ts Every ixel of an image is represented in RG: Red Green lue or YCbCr: Luminance Chrominance blue Chrominance red, often called YUV Conversion RG to YCbCr: Y Cb Cr = 0. 57 0. 331 0. 419 0. 114 0. 500 0. 01 0. 299 0. 169 R G 0. 500 4:2:2 CIF: resolution 352x2 Horizontal: 352 Y and 176 Cb and 176 Cr Vertical: 2 Y and 144 Cb and 144 Cr QCIF: resolution 176x144 4:2:0: half the number of pixels for Cr and Cb M a c r o b lo c k s Frames are divided into 16X16 pixel areas (4 x blocks). Original Image Akiyo (QCIF) M a c r o b lo c k s, 4 : 2 : 0 f o r m a t Each 16x16 pixel area is a macroblock which contains data: 4 x blocks for luminance Y 1 x block for chrominance Cr 1 x block for chrominance Cb Outline X: luminance O : ch r o minance

source I ntr a c o d ed f r a m e DCT transformation of each Macroblock Quantification VLC / entropy coding with «zigzag» scan Compression com p ressed D C T Transforms image representation in frequencies and amplitudes DCT does not reduce the amount of data, but prepares it for following processes Low frequencies situated on left upper corner, high frequencies on right bottom corner com p ressed Decompression recon st ruct ed DCT Energy is concentrated on some coefficients Q ua ntif ic a tio n Filter on frequencies (adapt frequencies to human eye) Ex.: High frequencies are less significant for human eye Leads to more 0 in representation Reduces amount of data Energy is concentrated on some coefficients Q u a n t i f i c a t i o n Variable len g t h coding ( V L C ) / E nt r op y C oding Many codecs use Huffman coding «zigzag» scan (low frequencies situated on left upper corner, high frequencies on right bottom corner)!" #$" %&&&&& '%&&&&&& &" %(&&&&&& Frame t y p es Three frame types: I,, I: Intracoded frames : redicted frames : idirectional predicted frames M o t i o n c o mp en s at i o n For predicted frames (- or -frames) )+*, - *. 5 3, - 0 4, - *. Reference image Image to be coded I )+*, - *. / * 0 1 2. 3 4, - *. - Display order: I I it-stream order: I I Estimated image Error

e C o mp res s o r arc h i t ec t u re + - O u t l i n e + + E r r or p r op a ga t ion Error-resilient video coding Error concealment E r r o r I E r r o r p r o p a g a t i o n V ide o coding s ol u t ions t o l im it r r s Error-resilient video coding Resync markers within video bitstream RVLCs: reversible variable length codes. No need to jump to next resync marker if error occur. Decoder is able to continue to decode the data. ut is less efficient than VLC Data partitioning: data close to a resync marker are more likely to be accurately decoded than those further away Error concealment Action only taken by the decoder Estimate missing region temporal interpolation spatial interpolation motion-compensated temporal interpolation O u t l i n e S c al ab i l i t y s c h emes AKA hierarchical encoding One base video layer + one or more cumulative enhancement layers Quality of video increases with number of decoded video layers Higher layers need lower layers in order to be decoded Often proposed as a solution for rate control

S c al ab i t y s c h emes Types of scalability: Temporal Frame rate Spatial Resolution SNR SNR Special case FGS: artial reception of enhancement layer possible Quality of video increases proportional to the number of received bits in the enhancement layer FGS SNR Temporal (FGST) T emp o ral s c al ab i l i t y Enhancement layer(s) increases decoded video frame rate (frames per second) Example (other scheduling of frames are possible): T em p o r al en h. v i d eo l ay er I S p at i al s c al ab i l i t y & S N R s c al ab i l i t y Enhancement layer(s) increases decoded video resolution Example: ase layer only: QCIF ase layer + one enh. layer: CIF FG S At the sender E n h v i d eo l ay er I en h. v i d eo l ay er 1 I At the receiver I E n h v i d eo l ay er Same dependencies for SNR scalability Enhancement layer(s) increases decoded video SNR I FG S FGS as layered video FG S T Add temporal refinement Variant, single enh. layer: L ay er n E n h v i d eo l ay er. F G S en h. v i d eo l ay er.. L ay er 2 I Variant 2, FGST separate layer: L ay er 1 E n h v i d eo l ay er 2 L ay er 0 E n h v i d eo l ay er 1 I

I s s u es o f l ay ered s c al ab i l i t y Availability: Most video codecs only support one (!!!) enhancement layer e.g. ISO MEG-4 reference codec Compression: More overhead than single layered stream (less compression) M D C Several independent video layers Each layer can be watched independently from others Increased quality if several layers available More overhead than cumulative layered compression r e c on s t r u c t e d v i d e o v i d e o s t r e a m 1 Low q u a l i t y v i d e o MDC e n c o d e r MDC d e c o d e r H i g h q u a l i t y v i d e o s t r e a m 2 Low q u a l i t y Outline V id eo S ta nd a r d s H.261 Video telephony and teleconferencing over ISDN p*64kb/s MEG-1 Video CD 1.5 Mb/s MEG-2, H.262 Digital Television, DVD 2-20 Mb/s H.263 Video telephony over STN 33.6 kb/s and up MEG-4 Interactive multimedia applications: Visiophonie, Internet Streaming, TV roadcast,... Object-based coding, synthetic content, interactivity itrate variable MEG-4 part 10 (AVC), H.264 Like mpeg-4 with improved compression 10 s to 100 s of kb/s better MEG-7, MEG-21 Like mpeg-4 with content description/indexation S ta nd a r d s Ov er v iew M E G - 4 I T U - T I S O - I E C Scene decomposition Decomposition in Visual Objects (VO), that are coded separately M E G - 1 H.261 H.263 H.262 = M E G - 2 M E G - 4 H.264 = M E G - 4 p a r t 10 = A V C

M E G - 4 Outline VideoSession (VS) VS1 VisualObject (VO) VO1 VO2 VideoObjectLayer (VOL) (ase layer + enh. layer(s)) VOL1 VOL2 GroupOfVOs (GOV) ( GO) GOV1 GOV2 VideoObjectlane (VO) ( frame in H.261) VO1 VOk VO1 VOk ( I n) a d eq ua c y to netw o r k tr a ns m is s io ns roblems due to network transmissions: andwidth Losses Delay Jitter Video coding only solutions: VR, CR Error-resilient video coding Error concealment Not sufficient!!! Solutions based on transmission mechanism needed