Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming

Similar documents
Statistical Analysis and Distortion Modeling of MPEG-4 FGS

Analysis of Rate-Distortion Functions and Congestion Control in Scalable Internet Video Streaming

RATE-DISTORTION ANALYSIS AND TRAFFIC MODELING OF SCALABLE VIDEO CODERS. A Dissertation MIN DAI

Min Congestion Control for High- Speed Heterogeneous Networks. JetMax: Scalable Max-Min

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

Rate-distortion Analysis and Control in DCT-based Scalable Video Coding. Xie Jun

Optimization of Selective Enhancement for MPEG-4 Fine Granularity Scalability

LORD: LOw-complexity, Rate-controlled, Distributed video coding system

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding

Fast Progressive Wavelet Coding

PIQI-RCP: Design and Analysis of Rate-Based Explicit Congestion Control

Wavelet Scalable Video Codec Part 1: image compression by JPEG2000

Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences

SIGNAL COMPRESSION. 8. Lossy image compression: Principle of embedding

Compression and Coding

A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv Techniques

On Compression Encrypted Data part 2. Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University

Progressive Wavelet Coding of Images

L. Yaroslavsky. Fundamentals of Digital Image Processing. Course

Converting DCT Coefficients to H.264/AVC

SCALABLE 3-D WAVELET VIDEO CODING

Multiple Description Transform Coding of Images

Image Data Compression

Overview. Analog capturing device (camera, microphone) PCM encoded or raw signal ( wav, bmp, ) A/D CONVERTER. Compressed bit stream (mp3, jpg, )

2. the basis functions have different symmetries. 1 k = 0. x( t) 1 t 0 x(t) 0 t 1

SCALABLE AUDIO CODING USING WATERMARKING

Wyner-Ziv Coding of Video with Unsupervised Motion Vector Learning

Utility, Fairness and Rate Allocation

Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding

Image Compression - JPEG

Scalar and Vector Quantization. National Chiao Tung University Chun-Jen Tsai 11/06/2014

Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources

CODING SAMPLE DIFFERENCES ATTEMPT 1: NAIVE DIFFERENTIAL CODING

Audio Coding. Fundamentals Quantization Waveform Coding Subband Coding P NCTU/CSIE DSPLAB C.M..LIU

A Hyper-Trellis based Turbo Decoder for Wyner-Ziv Video Coding

at Some sort of quantization is necessary to represent continuous signals in digital form

Rate Bounds on SSIM Index of Quantized Image DCT Coefficients

Research Article Compressive-Sensing-Based Video Codec by Autoregressive Prediction and Adaptive Residual Recovery

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

ON SCALABLE CODING OF HIDDEN MARKOV SOURCES. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression

MODERN video coding standards, such as H.263, H.264,

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding

Basic Principles of Video Coding

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments

BASICS OF COMPRESSION THEORY

Modelling of produced bit rate through the percentage of null quantized transform coefficients ( zeros )

Multiple Description Coding for quincunx images.

Resource Allocation and Pricing. R. Srikant University of Illinois

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5

Rate-Distortion Based Temporal Filtering for. Video Compression. Beckman Institute, 405 N. Mathews Ave., Urbana, IL 61801

Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control

Digital Image Processing Lectures 25 & 26

Human Visual System Based Adaptive Inter Quantization

Product Obsolete/Under Obsolescence. Quantization. Author: Latha Pillai

Multimedia Networking ECE 599

Local and Global Stability of Symmetric Heterogeneously-Delayed Control Systems

Can the sample being transmitted be used to refine its own PDF estimate?

Vector Quantization and Subband Coding

Sparse Regression Codes for Multi-terminal Source and Channel Coding

A NEW BASIS SELECTION PARADIGM FOR WAVELET PACKET IMAGE CODING

encoding without prediction) (Server) Quantization: Initial Data 0, 1, 2, Quantized Data 0, 1, 2, 3, 4, 8, 16, 32, 64, 128, 256

A Measurement-Analytic Approach for QoS Estimation in a Network Based on the Dominant Time Scale

Waveform-Based Coding: Outline

Direction-Adaptive Transforms for Coding Prediction Residuals

Multiple Description Coding: Proposed Methods And Video Application

Compression methods: the 1 st generation

Embedded Zerotree Wavelet (EZW)

Image Dependent Log-likelihood Ratio Allocation for Repeat Accumulate Code based Decoding in Data Hiding Channels

Multimedia Communications. Differential Coding

Adaptive Quantization Matrices for HD and UHD Display Resolutions in Scalable HEVC

EE5585 Data Compression April 18, Lecture 23

Multimedia Communications Fall 07 Midterm Exam (Close Book)

ECE 521. Lecture 11 (not on midterm material) 13 February K-means clustering, Dimensionality reduction

Hyper-Trellis Decoding of Pixel-Domain Wyner-Ziv Video Coding

Wavelet and Time-Domain Modeling of Multi-Layer VBR Video Traffic

A MODEL OF JERKINESS FOR TEMPORAL IMPAIRMENTS IN VIDEO TRANSMISSION. S. Borer. SwissQual AG 4528 Zuchwil, Switzerland

Multimedia Communications. Scalar Quantization

ECE Information theory Final

C.M. Liu Perceptual Signal Processing Lab College of Computer Science National Chiao-Tung University

Multi-Hypothesis based Distributed Video Coding using LDPC Codes

384Y Project June 5, Stability of Congestion Control Algorithms Using Control Theory with an application to XCP

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter

Transform Coding. Transform Coding Principle

Introduction to Video Compression H.261

Video Coding with Motion Compensation for Groups of Pictures

18.2 Continuous Alphabet (discrete-time, memoryless) Channel

Sparse Solutions of Linear Systems of Equations and Sparse Modeling of Signals and Images: Final Presentation

16.36 Communication Systems Engineering

Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi

Video Coding With Linear Compensation (VCLC)

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders

Vector Quantization Encoder Decoder Original Form image Minimize distortion Table Channel Image Vectors Look-up (X, X i ) X may be a block of l

Deterministic sampling masks and compressed sensing: Compensating for partial image loss at the pixel level

On Common Information and the Encoding of Sources that are Not Successively Refinable

Utility Maximization in Peer-to-Peer Systems

6. H.261 Video Coding Standard

Efficiency and Braess Paradox under Pricing

Transcription:

Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming Min Dai Electrical Engineering, Texas A&M University Dmitri Loguinov Computer Science, Texas A&M University 1

Motivation Scalable coding is widely applied in Internet streaming. Fine Granular Scalability (FGS) has been chosen in MPEG-4 standard Study the statistical properties of FGS encoder and propose a more accurate statistical model for it The Rate-distortion (R-D) theory is a powerful tool in Internet streaming. Choose appropriate compression schemes Optimally allocate bits in joint source-channel coding Rate adaptation in the Internet

Motivation (cont.) R-D theory (cont.): No current closed-form R-D model has been developed for scalable coding Derive an R-D model for scalable video coding Constant quality control: Another application of R-D model in Internet streaming Human eyes are sensitive to quality fluctuation Many video sequences have severe quality fluctuations 3

Motivation (cont.) Most existing CQ methods in scalable streaming are limited to the constant bitrate (CBR) case. Channel bandwidth is varying in a real network Congestion control is necessary to allow fair and efficient usage of network bandwidth Most existing congestion control methods (e.g., AIMD) are not proven to be asymptotically stable. Kelly s continuous-feedback congestion control Combine our R-D model with Kelly s control in Internet streaming 4

Overview of this Talk Background on rate-distortion theory and FGS scalable coding A big picture of this work R-D modeling of an FGS encoder Kelly s control Constant quality control Experimental results Conclusion 5

Background Rate-distortion (R-D) theory The theoretical discipline that treats data compression from the viewpoint of information theory A typical R-D curve Theoretical R-D model: R Theoretical R(D) A lower bound for any encoder given a statistical distribution of the source Often unachievable in the real world D Operational R(D) Operational R-D model An achievable bound for a practical encoder 6

Background (cont.) Scalable coding: Provides the capability of recovering image or video information by partially decoding the compressed bitstream Fine granular scalability (FGS): One low bitrate base layer (BL) to provide a low but guaranteed level of quality One high bitrate enhancement layer (EL) to provide finer quality improvement EL can be truncated at any codeword 7

Background (cont.) FGS at the Encoder FGS at the Server FGS at the Decoder EL BL Portion of the FGS EL transmitted in the Internet 8

A Big Picture Statistical Model of FGS EL input Distortion Model of FGS EL R-D Model of FGS EL Congestion Control Constant Quality Control 9

Related work on Statistical Models Input to FGS EL: DCT residue between the original image and the reconstructed image from BL The two most popular models for DCT residue: Zero-mean Gaussian distribution: f x 1 σ ( x) = e πσ Laplacian distribution (double exponential): λ f ( x) = e λ x 10

Related work on Statistical Models Probability 0.4 0.3 0. 0.1 real PMF Gaussian Laplacian Probability 1.E+00 1.E-0 1.E-04 real PMF Gaussian Laplacian 0.0-15 -10-5 0 5 10 15 DCT residue 1.E-06 0 10 0 30 40 DCT residue The PMF of DCT residue with Gaussian and Laplacian estimations (left). Logarithmic scale of PMFs for the positive residue (right). All testing sequences shown in this paper are coded at 10fps and 18 kb/s in the base layer 11

Proposed Statistical Model Mixture Laplacian model: f ( x) = p λ e + (1 λ p) 0 λ0 x 1 λ1 x where λ 0 denotes the small variance Laplacian distribution and λ 1 denotes the large variance Laplacian distribution e Use Expectation-Maximization (EM) algorithm to give Maximum-likelihood (ML) estimation for parameters { p, λ 0, λ 1 } 1

Proposed Statistical Model (cont.) Real PMF and mixture Laplacian (left) and Logarithmic scale of the positive part (right) Probability 0.4 0.3 0. 0.1 real PMF mixture Probability 1.E+00 1.E-0 1.E-04 real PMF mixture 0.0-0 -10 0 10 0 DCT residue 1.E-06 0 10 0 30 40 DCT residue 13

More Results The weighted absolute error of estimations in Foreman CIF (left) and Coastguard CIF (right) 0.1 0.1 Gaussian Laplacian mixture 0.06 0.05 Gaussian Laplacian mixture 0.08 0.04 error 0.06 error 0.03 0.04 0.0 0.0 0.01 0 0 90 180 70 frame number 0 0 90 180 70 frame number 14

A Big Picture Statistical Model of FGS EL input Distortion Model of FGS EL R-D Model of FGS EL Congestion Control Constant Quality Control 15

Current Distortion Models Classical model: D = ε σ X where ε is a signal-dependent constant, σ X denotes the signal variance and R is the bitrate A variation of the classical model ( proposed by Chiang et al. in 1997): R R = ad 1 + bd where parameters a, b are obtained empirically 16

Current Distortion Models (cont.) Distortion model for Uniform Quantizer (UQ): D( ) = where is quantization parameter (QP) and β equals 1 β 17

Current Distortion Models (cont.) Performances of current models in frame 0 (left) and frame 5 of Foreman CIF (right) PSNR (db) 70 60 50 Chiang et al. real PSNR UQ classical PSNR (db) 70 60 50 40 Chiang et al. real PSNR UQ classical 40 30 30 0.E+00.E+05 4.E+05 6.E+05 FGS EL bits 0 0.E+00.E+05 4.E+05 6.E+05 8.E+05 FGS EL bits 18

19 A more Accurate Distortion Model For each Laplacian component in the mixture Laplacian model, the distortion is: λ λ + λ + = λ λ 1) ( 1 1 1 ) (1 1 ) ( n n n n n n e e D Final version: ) ( ) (1 ) ( ) ( 1 0 + = D p D p D where is the quantization step of each bitplane in the FGS EL and p is the probability of related Laplacian component

Results of Distortion Model The average absolute errors in Foreman CIF (left) and Coastguard CIF (right) avg abs error (db) 8 6 4 classical UQ model our model aveg abs error(db) 8 6 4 classical UQ model our model 0 0 0 90 180 70 0 90 180 70 frame number frame number 0

A Big Picture Statistical Model of FGS EL input Distortion Model of FGS EL R-D Model of FGS EL Congestion Control Constant Quality Control 1

Operational R-D model Peak Signal-to-Noise Ratio (PSNR) is the most popular quality measurement in video coding PSNR = 10log10 (55 / D) Based on our distortion model, we found that PSNR could be described with a quadratic function of bitplane number z ( z= log ( ) ) PSNR( z) d z + d z + d 1 3

Operational R-D model (cont.) In traditional R-D models, bitrate R is a linear function of the bitplane number z bits per pixel 4 3 1 actual quadratic linear R ( z) = az + Experimental results show that a quadratic function of z is a much better model of R bits per pixel 5 4 3 1 b actual quadratic linear 0 0 1 3 4 5 6 7 bitplane z 0 0 1 3 4 5 6 7 8 bitplane z 3

Operational R-D model (cont.) Distortion model D( ) PSNR ~ log(d) Quality PSNR (z) = = d 1 z + d z + d 3 Rate R(z) = a 1 z + a z + a 3 PSNR = AR + B R + C D( R) = ar+ b R + c 4

Operational R-D model (cont.) The proposed R-D function is: D( R) = ar+ b where a, b, c are constants R + c Notice that the classical model is a special case of our model with a = - and b=0. D R+ log ( ε σ x ( R) = ) 5

Results of R-D models (1) The average absolute errors in Foreman CIF (left) and Coastguard CIF (right) avg abs error(db) 8 7 6 5 4 3 1 classical UQ our model avg abs error (db) 7 6 5 4 3 1 classical UQ our model 0 1 31 61 91 Frame number 0 1 31 61 91 Frame number 6

Results of R-D models () The maximum absolute errors in Foreman CIF (left) and Coastguard CIF (right) max abs error (db) 14 1 10 8 6 4 classical UQ our model max abs error (db) 14 1 10 8 6 4 classical UQ our model 0 0 90 180 70 Frame number 0 0 90 180 70 Frame number 7

A Big Picture Statistical Model of FGS EL input Distortion Model of FGS EL R-D Model of FGS EL Congestion Control Constant Quality Control 8

Constant Quality in CBR 34 33 3 base layer 4 40 Ours Classical JPEG000 PSNR(dB) 31 30 9 8 PSNR (db) 38 36 34 7 6 3 5 0 1 3 4 5 6 7 8 9 10 30 0 1 3 4 5 6 7 8 9 time (s) time (s) Notice that most CQ papers stop here, while the available bandwidth is varying in the real case 9

A Big Picture Statistical Model of FGS EL input Distortion Model of FGS EL R-D Model of FGS EL Congestion Control Constant Quality Control 30

Congestion Control Current status: AIMD, TFRC and binomial algorithms oscillate around the average rate Continuous-feedback controller is proposed by Kelly et al. in 1998 : dr( t) dt = r( αu ( r) β l P pl ), where U ( r) = log( r) where r is the current sending rate and α, β are constants. U is the utility function of the end user and p l is the price that the flow pays for using router l along the end-to-end path P 31

3 Congestion Control (cont.) An application-friendly version: ) ( ) ( ) ( t r t p dt t dr β α = Bottleneck packet loss p is used as the feedback instead of prices = i i i l i l t r C t r t p ) ( ) ( ) ( where r i is the sending rate of the i-th flow passing through the bottleneck router l, C l is the speed of router l

A Big Picture Statistical Model of FGS EL input Distortion Model of FGS EL R-D Model of FGS EL Congestion Control Constant Quality Control 33

Experimental Results (1) Comparison between a single AIMD flow and a single Kelly s flow. Bottleneck bandwidth C is 1mb/s and RTT=100ms 40 38 PSNR(dB) 36 34 3 30 Kelly AIMD 0 1 3 4 5 6 7 8 9 time (s) 34

Experimental Results () Two Kelly flows are sharing the same bottleneck link C under identical delay PSNR (db) 40 35 30 flow1 flow base 5 0 1 3 4 5 6 7 8 9 time (s) 35

Experimental Results (3) Examine the effect of different round-trip delays (fixed and random) on fairness PSNR (db) 40 D=100 ms 40 D=400 ms 35 30 D base 35 PSNR (db) 30 flow1 flow base 5 0 1 3 4 5 6 7 8 9 time (s) 5 0 1 3 4 5 6 7 8 9 time (s) 36

Experimental Results (4) Examine the situation n flows sharing the bottleneck bandwidth and each flow has a random delay 40 flow 500 base 400 PSNR (db) 35 30 delay D (ms) 300 00 100 5 0 1 3 4 5 6 7 8 9 time (s) 0 0 4 6 8 10 time (s) A single-flow PSNR when n = 10 flows share a 10 mb/s bottleneck link (left), Random delay for the flow (right) 37

Conclusion to this Work This paper derives a simple but accurate operational R-D model based on the properties of FGS encoders Based on this R-D model, we show a simple algorithm that can achieve better constant quality in CBR for scalable streaming than many other CQ methods Another contribution of this work is the successful combination of our R-D model with Kelly s congestion control in Internet streaming 38

Thank You! 39