Wavelet Scalable Video Codec Part 1: image compression by JPEG2000

Similar documents
EBCOT coding passes explained on a detailed example

Basic Principles of Video Coding

Image Compression. Fundamentals: Coding redundancy. The gray level histogram of an image can reveal a great deal of information about the image

Compression and Coding

Digital communication system. Shannon s separation principle

Objective: Reduction of data redundancy. Coding redundancy Interpixel redundancy Psychovisual redundancy Fall LIST 2

JPEG and JPEG2000 Image Coding Standards

<Outline> JPEG 2000 Standard - Overview. Modes of current JPEG. JPEG Part I. JPEG 2000 Standard

+ (50% contribution by each member)

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments

encoding without prediction) (Server) Quantization: Initial Data 0, 1, 2, Quantized Data 0, 1, 2, 3, 4, 8, 16, 32, 64, 128, 256

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5

- An Image Coding Algorithm

Image Data Compression

Module 5 EMBEDDED WAVELET CODING. Version 2 ECE IIT, Kharagpur

Compression and Coding. Theory and Applications Part 1: Fundamentals

EE67I Multimedia Communication Systems

Compression and Coding. Theory and Applications Part 1: Fundamentals

Embedded Zerotree Wavelet (EZW)

ECE533 Digital Image Processing. Embedded Zerotree Wavelet Image Codec

Information and Entropy

Fast Progressive Wavelet Coding

at Some sort of quantization is necessary to represent continuous signals in digital form

L. Yaroslavsky. Fundamentals of Digital Image Processing. Course

Multimedia Networking ECE 599

SIGNAL COMPRESSION. 8. Lossy image compression: Principle of embedding

BASICS OF COMPRESSION THEORY

Can the sample being transmitted be used to refine its own PDF estimate?

Run-length & Entropy Coding. Redundancy Removal. Sampling. Quantization. Perform inverse operations at the receiver EEE

IMAGE COMPRESSION-II. Week IX. 03/6/2003 Image Compression-II 1

Review of Quantization. Quantization. Bring in Probability Distribution. L-level Quantization. Uniform partition

Image Compression Basis Sebastiano Battiato, Ph.D.

Progressive Wavelet Coding of Images

Waveform-Based Coding: Outline

Overview. Analog capturing device (camera, microphone) PCM encoded or raw signal ( wav, bmp, ) A/D CONVERTER. Compressed bit stream (mp3, jpg, )

State of the art Image Compression Techniques

Statistical Analysis and Distortion Modeling of MPEG-4 FGS

Multimedia Information Systems

Image Compression. Qiaoyong Zhong. November 19, CAS-MPG Partner Institute for Computational Biology (PICB)

RESOLUTION SCALABLE AND RANDOM ACCESS DECODABLE IMAGE CODING WITH LOW TIME COMPLEXITY

Fault Tolerance Technique in Huffman Coding applies to Baseline JPEG

Image Compression - JPEG

On Compression Encrypted Data part 2. Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding

JPEG2000 High-Speed SNR Progressive Decoding Scheme

CHAPTER 3. Transformed Vector Quantization with Orthogonal Polynomials Introduction Vector quantization

Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p.

Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming

Summary of Last Lectures

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Chapter 9 Fundamental Limits in Information Theory

Information Theory. M1 Informatique (parcours recherche et innovation) Aline Roumy. January INRIA Rennes 1/ 73

Multimedia Communications Fall 07 Midterm Exam (Close Book)

SYDE 575: Introduction to Image Processing. Image Compression Part 2: Variable-rate compression

Module 4. Multi-Resolution Analysis. Version 2 ECE IIT, Kharagpur

EMBEDDED ZEROTREE WAVELET COMPRESSION

Multimedia & Computer Visualization. Exercise #5. JPEG compression

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding

QUANTIZATION [1] is a signal processing technique

Scalable color image coding with Matching Pursuit

Multimedia. Multimedia Data Compression (Lossless Compression Algorithms)

Image and Multidimensional Signal Processing

IMAGE COMPRESSION IMAGE COMPRESSION-II. Coding Redundancy (contd.) Data Redundancy. Predictive coding. General Model

Information Theory. Coding and Information Theory. Information Theory Textbooks. Entropy

6. H.261 Video Coding Standard

4. Quantization and Data Compression. ECE 302 Spring 2012 Purdue University, School of ECE Prof. Ilya Pollak

The information loss in quantization

Digital Image Processing Lectures 25 & 26

CSE 408 Multimedia Information System Yezhou Yang

Vector Quantization and Subband Coding

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Introduction to Video Compression H.261

Transform Coding. Transform Coding Principle

Implementation of CCSDS Recommended Standard for Image DC Compression

Real-Time Audio and Video

Lecture 7 Predictive Coding & Quantization

Compression. Encryption. Decryption. Decompression. Presentation of Information to client site

ECE472/572 - Lecture 11. Roadmap. Roadmap. Image Compression Fundamentals and Lossless Compression Techniques 11/03/11.

2018/5/3. YU Xiangyu

EE368B Image and Video Compression

RLE = [ ; ], with compression ratio (CR) = 4/8. RLE actually increases the size of the compressed image.

Grayscale and Colour Image Codec based on Matching Pursuit in the Spatio-Frequency Domain.

Basics of DCT, Quantization and Entropy Coding

Chapter 2: Source coding

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding

Motivation for Arithmetic Coding

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD

RATE-DISTORTION ANALYSIS AND TRAFFIC MODELING OF SCALABLE VIDEO CODERS. A Dissertation MIN DAI

Reduce the amount of data required to represent a given quantity of information Data vs information R = 1 1 C

Compression methods: the 1 st generation

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

A New Pipelined Architecture for JPEG2000 MQ-Coder

A study of image compression techniques, with specific focus on weighted finite automata

Predictive Coding. Prediction Prediction in Images

Predictive Coding. Prediction

QUALITY progressivity is a feature provided by many

Multimedia Communications. Mathematical Preliminaries for Lossless Compression

Basics of DCT, Quantization and Entropy Coding. Nimrod Peleg Update: Dec. 2005

Digital Communications III (ECE 154C) Introduction to Coding and Information Theory

Shannon-Fano-Elias coding

Transcription:

1 Wavelet Scalable Video Codec Part 1: image compression by JPEG2000 Aline Roumy aline.roumy@inria.fr May 2011

2 Motivation for Video Compression Digital video studio standard ITU-R Rec. 601 Y luminance Cb Chrominance Cr Chrominance Sampling Rate 13.5 MHz 6.75 MHz 6.75 MHz Quantization 8 bit 8 bit 8 bit Raw bit rate 216 Mbps Human eyes are less sensitive to the chrominance than to the luminance

3 Plan 1. Still-image compression 1. Typical coding scheme: transform coding 2. Information theoretical justification for the transform coding scheme 2. From Transform coding to JPEG-2000 Wavelet based still-image compression 3. From motion compensation to WSVC (Ronan)

Typical Image Coder and Decoder Transform y= T(x) Quantizer q= Q(y) image x coefficients y indices q Entropy coder c= C(q) 4 Encoder bistream c reconstructed image Inverse transform T ¹ Dequantizer Q ¹ Entropy decoder C ¹ Decoder Transform T(x) usually invertible Quantization Q(y) not invertible, introduces distortion Lossless encoder C(q)

5 Lossless source coding Random process (source) Example: model for the pixels Minimum average length (in bits) to represent a source sample: Discrete-valued i.i.d. source [Shannon] bits/source sample finite-valued stationary ergodic source [Shannon-McMillan-Breiman] bits/source sample

6 Entropy coding for i.i.d. source The way to achieve H(X) is entropy coding, aka Variable length code VLC Example: Huffman For any distribution of X, a VLC code can be found, whose rate R satisfies H(X ) R < H (X )+1 very inefficient for H(X)<<1 bit/symbol Remedy: combine m successive symbols to a new block-symbol H(X ) R < H (X )+1/m - exponentially growing alphabet size Example: block-huffman, arithmetic

7 Sufficiency of binary encoding Consider the entropy coding of the r.v. Example: 2⁸=256 gray levels Assume that X is equivalent to a r.vector No loss by conditionally encoding bits (in any order) Definition: a context is the set of r.v. taken to condition in the entropy encoder. Example: B0,, BK-2 is the context for BK-1

What if the process is not i.i.d.? 8

9 Joint entropy and statistical dependence Theorem (Independence bound on joint entropy) equality iff X1,, Xn are statistically independent Exploiting statistical dependence lowers bit-rate idea 1 Statistically independent components can be coded separately without loss of efficiency idea 2

idea 1 toward context adaptive coding Exploiting statistical dependence lowers bit-rate 10 Break image into blocks combine m neighbor pixels to a new block-pixel ( seen before) Block-by-block entropy coding Statistical dependencies across block boundaries neglected Sliding window across image Interpret image as source with memory Perform conditional entropy coding (spatial context) Context adaptive coding [JPEG-2000]

idea 2 toward transform coding Independent symbols can be coded separately 11 Transform x into an equivalent array of coefficients y invertible transform same sampling rate Min 1 st order entropy min variance transform coding [JPEG-2000] Goal = transform allows to use scalar entropy coding

12 Summary on lossless compression ergodic stationary process: transform and decorrelate the samples: make the entropy coding task easier (scalar is optimal if iid process) transform coding exploit residual statistical context = spatial neighborhood limited context size due to transform context adaptive coding image x Transform y= T(x) coefficients y Entropy coder c= C(y)

13 Lossy compression Lower the bit-rate R by introducing some (acceptable) distortion D of X. Rate Distortion theory gives the optimal curve for a given distribution in theory

Scalar quantization 14

Vector quantization 15

Uniform vector quantization 16

17 High resolution analysis Uniform vector Quantization + entropy coding = optimal (dimension infinity) Uniform scalar Quantization + scalar entropy coding = small loss (1.53 db Mean square error distortion) coefficients y Quantizer q= Q(y) indices q Entropy coder c= C(q) scalar

18 High resolution regime Valid for moderate to high rate Valid for all rates by adding a dither (uniform noise in Quantization cell) [Zamir92]

Typical Image Coder and Decoder Transform y= T(x) Quantizer q= Q(y) image x coefficients y indices q Entropy coder c= C(q) 19 Encoder bistream c reconstructed image Inverse transform T ¹ Dequantizer Q ¹ Entropy decoder C ¹ Decoder Transform T(x) to decorrelate the pixels (to allow scalar C) Quantization Q(y): scalar uniform Lossless encoder C(q): context adaptive: bit+spatial (to deal with remaining redundancy)

2. JPEG 2000 20

21 Features of JPEG2000 ISO/IEC JTC1/SC29/WG1 JPEG2000 international standard since Dec. 2000. What's new? Improved compression efficiency (vs JPEG); Multiple resolution representation; Progressive decoding; Scalability; Error resilience.

22 Transform y= T(x) JPEG scheme? Quantizer q= Q(y) image x coefficients y indices q Encoder bistream c Entropy coder c= C(q) reconstructed image Inverse transform T ¹ Dequantizer Q ¹ Entropy decoder C ¹ Decoder Transform?

23 2-d Discrete Wavelet Transform Optimal orthonormal transform=klt. Not used: source statistics dependent DCT (JPEG) blocking artefacts Subband transform spatial scalability DWT LL LH HL HH

24 Are the DWT-coefficients independent? Not really need for context adaptive entropy coding to remove residual dependencies.

Partitioning of image subbands into code-blocks 25

26 Transform y= T(x) JPEG scheme? Quantizer q= Q(y) image x coefficients y indices q Encoder bistream c Entropy coder c= C(q) reconstructed image Inverse transform T ¹ Dequantizer Q ¹ Entropy decoder C ¹ Decoder Quantizer? Scalar uniform?

Embedded deadzone scalar uniform quantizer 27

28 Transform y= T(x) JPEG scheme? Quantizer q= Q(y) image x coefficients y indices q Encoder bistream c Entropy coder c= C(q) reconstructed image Inverse transform T ¹ Dequantizer Q ¹ Entropy decoder C ¹ Decoder Entropy coder? Bit-plane coding + R-D code-block truncation = EBCOT Embedded Block Coding with Optimized Truncation

29 Bit-plane coding bitplane= a set of bits having the same position. 1 bit/coefficient

30 Achievable rates through bitplane encoding Bitplane encoding leads too few Rate levels Bitplane encoding Generate additional feasible truncation points for each block

31 Fractional bit-planes coefficient =sign / 000 insignificant 1 010011 significant Significance Propagation Pass insignificant coefficient with 1 significant neighbor Magnitude Refinement Pass significant coefficient Cleanup Pass remaining coefficients

32 Entropy coding Independent entropy coding for each pass Context-dimension reduction from 2⁸=256 to 18 contexts by using primitives = almost sufficient statistics of the neighborhood.

33 Optimal bit-stream truncation PCRD, Post Compression Rate-Distortion Minimize Distortion (of the image) under a global rate constraint

JPEG2000 compression results 34 Original Carol Image (512 x 512 Pixels, 24-Bit RGB, Size 786K)

JPEG2000 compression results 35 75:1, 10.6 Kbyte

JPEG2000 compression results 36 150:1, 5.3 Kbyte

JPEG2000 compression results 37 300:1, 5.3 Kbyte

38 Comparison JPEG vs. JPEG2000 Lenna, 256x256 RGB Baseline JPEG: 4572 bytes Lenna, 256x256 RGB JPEG-2000: 4572 bytes

39

40 Sufficiency of binary encoding Consider a r.v. equivalent to a r.v. No loss by conditionally encoding bits (in any order) Supply entropy encoder with symbol/probability pairs Total number of (conditional) probabilities to be estimated per symbol x same as with K-ary encoding of x Definition: a context is the set of r.v. taken to condition in the entropy encoder.

41 Fractional bit-planes coefficient = 000 insignificant 1 010011 significant Partition the bits in the bitplane in 3 classes and encode them separately 1. Significance Propagation Pass insignificant bits with significant neighbors 2. Magnitude Refinement Pass coefficient that were already significant in the previous bit-plane 3. Cleanup Pass: all remaining (insignificant) coefficients Context-dimension reduction by using primitives = interpretation of the coefficient