Motion Vector Prediction With Reference Frame Consideration

Size: px
Start display at page:

Download "Motion Vector Prediction With Reference Frame Consideration"

Transcription

1 Motion Vector Prediction With Reference Frame Consideration Alexis M. Tourapis *a, Feng Wu b, Shipeng Li b a Thomson Corporate Research, 2 Independence Way, Princeton, NJ, USA 855 b Microsoft Research Asia, 3F Sigma Center, 49 Zhichun Road, Beijing, 18, China ABSTRACT In this paper, we introduce a new motion vector prediction method that could be used within multiple picture reference codecs, such as the H.264 (MPEG-4 AVC) video coding standard. Our method considers for each candidate motion vector the temporal distance of its corresponding reference picture compared to the current one for the generation of the predictor motion vector. This allows for more accurate motion vector prediction, and better exploitation of the temporal correlation that may exist within a video sequence. Furthermore, we also introduce a modification to the SKIP motion vector macroblock mode, according to which not only the motion vectors but also the reference indices are adaptively generated. Simulation results suggest that our proposed methods, combined with an improved Rate Distortion optimization strategy, if implemented within the existing H.264 codec, can allow for a considerable performance improvement of up to 8.6% bitrate reduction compared to the current H.264 standard. Keywords: H.264, Motion Vector Prediction, reference pictures/frames, skip mode, spatial prediction, temporal prediction 1. INTRODUCTION The new upcoming H.264 standard (or MPEG-4 AVC, JVT,and H.26L) has managed to achieve considerably higher coding efficiency compared to older standards like MPEG-2 [2] and MPEG-4 [3], partly due to the adoption of more refined and complicated motion models and modes. These include increased sub-pixel accuracy down to a quarter (¼) pixel level, multiple-referencing, the introduction of the tree structured macroblock [2] based on a quad-tree (Figure 1) concept according to which different sub areas of a Macroblock (MB) can be assigned to different motion information, and Multiple-frame/picture indexing of the motion vectors (MVs). A macroblock can essentially have up to 16 MVs since the tree macroblock structure enables the macroblock to be coded in 4 different modes and partitions of shape sizes equal to 16 16, 16 8, 8 16, and 8 8, while when in the 8 8 partition mode, each 8 8 partition can be further split into 8 8, 8 4, 4 8, and 4 4 blocks (Figure 1). Considering the high cost of transmitting motion parameters, motion vectors are coded differentially versus a motion vector predictor (MVP or MV pred ). This predictor is calculated by taking in consideration that adjacent blocks/macroblocks and their motion tend to have very high spatial correlation and by using the median value of the MVs of the three adjacent, on the left, top, and top-right (or top-left if top-right is not available) blocks to the current block. This MV pred is basically calculated as: where replaced by ( MV A, MV B MV C ) MV pred Median, = (1) MV A, MV B, and MV C are the three adjacent predictors as shown in Figure 2. In this computation, MV C is MV D if it is not available (edge of a slice or picture). * alexismt@ieee.org; phone +1-(69) ; fax +1-(69)

2 1 macroblock partition of 16*16 luma samples and 2 macroblock partitions of 16*8 luma samples and 2 macroblock partitions of 8*16 luma samples and 4 sub-macroblocks of 8*8 luma samples and Macroblock partitions sub-macroblock partition of 8*8 luma samples and 2 sub-macroblock partitions of 8*4 luma samples and 2 sub-macroblock partitions of 4*8 luma samples and 4 sub-macroblock partitions of 4*4 luma samples and Sub-macroblock partitions Figure 1: Macroblock and sub-macroblock partitions as defined in H.264. Nevertheless, considering that H.264 allows the usage of multiple reference frames (or pictures), which are usually pictures at different time instants and are very likely spatially unrelated, to improve performance of this prediction an additional consideration is made based on the reference pictures of each spatial predictor and that of the current block. Instead of always using the median prediction, if only a single predictor uses the same reference index as the current block, then only this predictor is used for prediction, while all others are immediately discarded. This method, which we will call the single equal reference condition, can strengthen somewhat the correlation between adjacent motion vectors and their references, but nevertheless does not account for the case where two or even none of the reference indices are the same as the current block's. In these cases the median method is still used, which could reduce prediction efficiency. D B C A Current Macroblock Figure 2: Spatial predictors used for Motion Vector Prediction and the generation of the SKIP mode MV parameters Furthermore, the H.264 standard benefits considerably from the adoption of the MotionCopy SKIP macroblock mode which is used within Predictive (P) pictures. This mode is nevertheless strongly related to the efficacy of the motion vector prediction. More precisely, this mode essentially signals that the motion vectors and reference index of an entire MB can be completely derived by the location of the MB within a slice or picture, and from the motion information of its spatial neighbors. To be more exact, if the MB is not on the edge columns or rows of a picture or slice, and both it's top and left spatial predictors are not zero or do not use the zero reference index, then the zero reference and the MVP are used as the actual motion information of this MB. Otherwise the motion vectors for this MB are set equal to zero. This process is called zero partitioning of the SKIP mode. Apparently if the generation of the MVP is not accurate enough, the efficacy also of SKIP mode will also be affected. The zero reference is also always used for SKIP mode. In this paper we present an alternative method for generating the MVP through the consideration of the temporal distances of the reference indices and by accordingly scaling the motion vectors. An additional, very simple process of selecting the reference index, instead of always using zero, within SKIP mode is also introduced. Our methods can lead

3 to further improvement in the motion vector prediction process within the H.264 standard, and thus better coding efficiency. In Section 2 we will first introduce the details of our proposed modifications, while experimental results will then be given in Section 3, followed by our conclusions in Section REFERENCE CONSIDERATION WITHIN THE MOTION VECTOR PREDICTION AND SKIP As we have previously discussed, motion vector prediction within the H.264 standard basically considers the spatial correlation that may exist between adjacent blocks or macroblocks, in an effort to reduce the cost of the motion vectors even further. On the other hand, temporal correlation could also be of some use and could allow us for further benefits. In particular, the temporal direct mode, currently used within Bi-predictive (B) pictures, assumes that there exists a temporal relationship between co-located blocks and allows the prediction of the motion vectors for this mode using simple scaling operations. More specifically, following the assumption that an object is moving with constant speed the motion vectors of a co-located block are scaled according to the temporal distances (Figure 3) of the reference pictures involved to generate two new motion vectors MV L and MV L1 that will be used for the prediction. These motion vectors are calculated as follows compared to the co-located block's motion vector MV : DistScaleF actor = ( TD B 256) / TD D (2) MV L = ( DistScaleFactor MV + 128) >> 8 (3) which are approximations of: MV L1 = MV L MV, (4) TRB MV L = MV (5) TR D ( TRB TRD ) MV L 1 = MV, (6) TR D but can essentially reduce the number of divisions since the variable DistScaleFactor can be precomputed at the Slice/Picture level. In the above TD B and TD D are the temporal distances of the reference pictures used for the prediction compared to the current picture.

4 List Reference Current B List 1 Reference... current block MV... co-located block MV L MVL1 TD D TD B Time Figure 3: Temporal Direct Prediction in B picture coding We observe that a similar scaling approach could be beneficial within the motion vector prediction process as well. As we have previously discussed, the reference indices of the adjacent neighbors are considered only under certain conditions, and are indirectly used as a decision mechanism on whether the median prediction will be used or not. It is, nevertheless, possible to use these reference indices, and more specifically the temporal distances of these references compared to the current picture, with a more direct impact on the motion vector prediction. Similar to temporal direct, we propose scaling the MVs from each predictor according to these temporal distances. More specifically, the MVP is now calculated as MV A MV B MV C MV pred = TD ref Median,,, (7) TDA TDB TDC where MV A, MV B, and MV C are the three predictor MVs, TD A, TD B, and TD C are their corresponding temporal distances, and TD ref is the temporal distance of the current reference. Division can also be very easily replaced with binary shifts without any loss in efficiency, and more specifically with the following equations: Z = ( TD 256) / TD (8) A ref A Z = ( TD 256) / TD (9) B ref B Z = ( TD 256) / TD (1) C MV ref C ( Z A MV A + 128) >> 8, ( Z B MV B + 128) >> 8, ( ZC MV + 128) >> 8) pred = Median C. (11) Z A, Z B, and Z C, can be pre-calculated at the picture/slice level, and thus the increase in complexity compared to the original method without the divisions/scaling is very minor. Obviously the basis of this concept is very similar to that of Temporal Direct, since we are again assuming that adjacent pictures may follow the constant speed rule (Figure 4).

5 Ref 1 Ref Current Predictor block B MV = TD P, MV B TD1 MV = MV TD1 P, 1 B TD1 current block MV B MV P, MV P,1 TD 1 TD 2 Time Figure 4: Predictors are generated according to their corresponding reference picture distances compared to the current reference picture Although it could be argued that by introducing the above scaling method could allow us to remove the single equal reference condition and simplify the prediction process, we have found from experimental results that this rule is still advantageous, especially considering that that the reference pictures involved may not always be temporally correlated. In this sense this condition can be advantageous, since only the most related picture (which in this case is the same as the current reference) is considered, while all other pictures would instead introduce motion noise and could hurt the prediction. Reference indices could also be used within SKIP mode as well. It is well known that the SKIP macroblock mode is probably the most efficient mode within Predictive (P) pictures in H.264. As previously discussed, this mode does not require the transmission of any residual data, and tries to further exploit spatial correlation between motion of adjacent MBs by signaling, under certain conditions, that the current MB has a MV equal to the MVP of the Macroblock type or zero. A major drawback though of this mode is that SKIP always considers the zero reference, and does not consider the case that it is possible that by using other references might be more beneficial. As an example, this mode does not consider the case where all neighbor predictors are not equal to zero reference. In this case the median prediction would still be used without making any other considerations. Although the scaling process discussed previously can indirectly improve the performance of the SKIP mode (in the previous example, the predictors would all be scaled towards the zero reference), a different method can also be used that can improve the prediction. In particular, similar to the spatial direct mode also used in B pictures, we observe that we may also perform a reference index prediction according to the reference indices of the adjacent macroblocks that are already used within the MVP process. Instead of always using the zero reference, the smallest non-negative reference (which usually implies the closest in terms of time) from the three adjacent predictors is selected and used for the prediction process. If no such reference is available, then zero is used by default (e.g. beginning of a slice or all adjacent blocks are intra coded). This method obviously enhances the relationship between adjacent pictures and the current one, while also could be rather useful especially when considering that the H.264 standard allows reordering of the references which may imply that the zero reference may not itself have the highest correlation with the current reference.

6 To summarize, the pseudocode for this scheme is as follows: SKIP_MV_Calculation() // Note that UpRight will be replaced by UpLeft at picture boundaries // if a reference is not available then its value is equal to -1. Skip_Reference=min(reference_fw_Left&255,reference_fw_Up&255, referenceb_fw_upright&255); if (Skip_reference!=255) if ((Skip_reference == ) && (reference_colocated == ) && ((abs(mvpx)>>1) == ) && ((abs(mvpy)>>1) == ) ) Skip_MV = ; Reference_Skip = ; else Skip_MV =SpatialPredictor(16x16,FW, Skip_reference); Reference_Skip=Skip_reference; else Skip_MV = ; Reference_Skip = ; Apart from these two semantic modifications to the H.264 codec, we introduce an additional modification within the mode decision of H.264 to further enhance performance. H.264 is based on a Rate Distortion Optimization (RDO) model using lagrangian (λ) parameters considering that these methods lead to considerably higher performance than other simpler, rate or distortion only methods. Mode decision is instead performed in H.264 by minimizing the equation: J mode ( mode mode s, c, MODE λ ) = SSD( s, c, MODE ) + λ R( s, c, MODE ) (12) where SSD denotes the Sum of Square Difference between the original and reconstructed signals, MODE indicates a mode out of a set of potential macroblock modes and more specifically SKIP, 16 16, 16 8, 8 16, Tree8 8, Intra4 4, Intra16 16, λ mode is the Lagrangian multiplier and is quantizer dependent and R(s,c,MODE) is the number of bits associated with choosing MODE, including the bits for the macroblock header, the motion and all DCT coefficients. As we have previously pointed out, SKIP mode itself can be considered as a special case of the mode but for which no motion and DCT coefficients need to be transmitted, which is basically a coefficient thresholding concept. Obviously, thresholding may be used for all other macroblock modes as well. In our case, we consider, in addition to the original modes, all INTER modes without coefficients (forced Coded Block Pattern equal to ) within the mode decision as well. This now means that we will have to examine 4 additional modes instead of 7. More specifically we will not have to examine modes SKIP, 16 16, 16 16nocoeff, 16 8, 16 8nocoeff, 8 16, 8 16nocoeff, Tree8 8, Tree8 8nocoeff, Intra4 4, Intra16 16.

7 3. SIMULATION RESULTS All of the above concepts were introduced within version 4.3a of the H.264 reference software [7]. For our simulations we have selected 5 sequences, namely QCIF resolution sequences Container and News coded at 1fps, and CIF sequences Mobile, Bus, and Stefan at 3fps. The CAVLC entropy coder was used for all our tests, with quantizer values of 28, 32, 36, and 4, a search range of ±32, and 5 references. Rate Distortion Optimization was enabled in our simulations. To simplify our comparisons we have used average PSNR gain (dpsnr) and bitrate reduction (dbitrate) results, based on the above quantizers, as is also recommended by [8]. This method was also the required comparison method to all proponents to the H.264 standard, since it allows for a quantitative RD performance estimate of a proposed algorithm. We observe that our proposed methods lead to a bitrate reduction of -1.43%, -1.41%, -8.65%, -3.5%, and -4.11% for sequences container, news, mobile, bus, and stefan respectively (Table 1). This equivalently corresponds for each respective sequence to a gain of.75db,.85db,.413db,.168db, and.214db. The Rate Distortion curves for sequences container and mobile are also shown in Figures 5 and 6 respectively. We particularly observe the considerable improvement on the 3 CIF sequences and more specifically in sequence mobile. The result on this sequence is somewhat expected considering that this sequence is well known to receive a considerable improvement benefit from the use of multiple references, and is characterized by relatively smooth and constant motion. Considering also the RD curves, we further observe that the gains are more prominent at higher bitrates, which is to be expected considering that the RDO mode decision tends to be more biased towards lower bitrate when the quantization parameters increase, resulting also to fewer non-zero reference indices. It is of course obvious that our modifications in the motion vector prediction process and the reference index prediction used within SKIP have no impact if a single reference is used. 4. CONCLUSION In this paper, two semantic changes were proposed for usage within the H.264 standard, or other multiple reference codecs, and could improve performance if multiple references are used. More specifically we have introduced an alternative motion vector prediction method that considers the reference indices and the associated temporal distances of the spatial neighbors within the motion vector prediction process, while a reference picture selection process is proposed to be used for the generation of the SKIP macroblock mode parameters. These methods allow for more accurate motion vector prediction, and better exploitation of temporal correlation within a multiple reference motion compensated framework. Combined with a minor modification in the Rate Distortion Optimized Mode Decision of the H.264 codec, our simulation results show that we can achieve considerable improvement compared to the existing H.264 standard. REFERENCES 1. Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, "Joint Video Specification (ITU-T Rec. H.264 ISO/IEC AVC) - Joint Committee Draft", document JVT-E22d3.doc, Sep'2. 2. ISO/IEC Standard :2. Information technology generic coding of moving pictures and associated audio information: Video. 3. ISO/IEC Standard :21. Information technology Coding of audio-visual objects Part 2: Visual 4. Heiko Schwarz and Thomas Wiegand, Tree-structured macroblock partition, document VCEG -O17, 15th VCEG meeting, Pattaya, Dec Hideaki Kimata, GMVC and GMC Switched by MV, document JVT-B46, 2nd JVT meeting, Geneva, Jan Shijun Sun and Shawmin Lei, " Global Motion Vector Coding (GMVC)," document JVT-B19, 2nd JVT Meeting, Geneva, Jan Jani Lainema and Marta Karczewicz, Skip mode motion compensation, document JVT-C27, 3rd JVT Meeting, Fairfax, May A. M. Tourapis, H. Y. Cheong, M. L. Liou, and O. C. Au, "Temporal Interpolation of Video Sequences Using Zonal Based Algorithms," in proceedings of the 21 IEEE International Conference on Image Processing (ICIP'1), WP8-5252, Thessaloniki, Greece, October 21.

8 9. JVT Reference Software unofficial version 4.3a, 1. G. Bjontegaard, Calculation of average PSNR differences between RD-Curves, document VCEG-M33, 13th VCEG meeting, Austin TX, Mar 1 Table 1: Performance Evaluation of the Proposed Scheme Sequences Container News Mobile Bus Stefan δbitrate % δpsnr Container QCIF 1kbps PSNR => db Original Proposed Bitrate => kbps Figure 5: RD performance for sequence Container at 1fps 35 Mobile CIF 3kbps PSNR => db Original Proposed Bitrate => kbps Figure 6: RD performance for sequence Mobile at 3fps

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter 1. Introduction The Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG are finalising a new standard for the coding (compression) of natural

More information

Enhanced SATD-based cost function for mode selection of H.264/AVC intra coding

Enhanced SATD-based cost function for mode selection of H.264/AVC intra coding SIViP (013) 7:777 786 DOI 10.1007/s11760-011-067-z ORIGINAL PAPER Enhanced SATD-based cost function for mode selection of H.6/AVC intra coding Mohammed Golam Sarwer Q. M. Jonathan Wu Xiao-Ping Zhang Received:

More information

THE newest video coding standard is known as H.264/AVC

THE newest video coding standard is known as H.264/AVC IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 17, NO. 6, JUNE 2007 765 Transform-Domain Fast Sum of the Squared Difference Computation for H.264/AVC Rate-Distortion Optimization

More information

Context-adaptive coded block pattern coding for H.264/AVC

Context-adaptive coded block pattern coding for H.264/AVC Context-adaptive coded block pattern coding for H.264/AVC Yangsoo Kim a), Sungjei Kim, Jinwoo Jeong, and Yoonsik Choe b) Department of Electrical and Electronic Engineering, Yonsei University 134, Sinchon-dong,

More information

The MPEG4/AVC standard: description and basic tasks splitting

The MPEG4/AVC standard: description and basic tasks splitting The MPEG/AVC standard: description and basic tasks splitting Isabelle Hurbain 1 Centre de recherche en informatique École des Mines de Paris hurbain@cri.ensmp.fr January 7, 00 1 35, rue Saint-Honoré, 77305

More information

Bit Rate Estimation for Cost Function of H.264/AVC

Bit Rate Estimation for Cost Function of H.264/AVC Bit Rate Estimation for Cost Function of H.264/AVC 257 14 X Bit Rate Estimation for Cost Function of H.264/AVC Mohammed Golam Sarwer 1,2, Lai Man Po 1 and Q. M. Jonathan Wu 2 1 City University of Hong

More information

H.264/MPEG4 Part INTRODUCTION Terminology

H.264/MPEG4 Part INTRODUCTION Terminology 6 H.264/MPEG4 Part 10 6.1 INTRODUCTION The Moving Picture Experts Group and the Video Coding Experts Group (MPEG and VCEG) have developed a new standard that promises to outperform the earlier MPEG-4 and

More information

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD 4th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD Simone

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC9/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC9/WG11 MPEG 98/M3833 July 1998 Source:

More information

MODERN video coding standards, such as H.263, H.264,

MODERN video coding standards, such as H.263, H.264, 146 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 1, JANUARY 2006 Analysis of Multihypothesis Motion Compensated Prediction (MHMCP) for Robust Visual Communication Wei-Ying

More information

Direction-Adaptive Transforms for Coding Prediction Residuals

Direction-Adaptive Transforms for Coding Prediction Residuals MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Direction-Adaptive Transforms for Coding Prediction Residuals Robert Cohen, Sven Klomp, Anthony Vetro, Huifang Sun TR2010-090 November 2010

More information

Fast Intra Coding Method of H.264 for Video Surveillance System

Fast Intra Coding Method of H.264 for Video Surveillance System 76 IJCSNS International Journal of Computer Science and Network Security, OL.7 No.10, October 2007 Fast Intra Coding Method of.264 for ideo Surveillance System Jang-Seon Ryu and Eung-Tea Kim, Korea Polytechnic

More information

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding Aditya Mavlankar, Chuo-Ling Chang, and Bernd Girod Information Systems Laboratory, Department of Electrical

More information

AN ENHANCED EARLY DETECTION METHOD FOR ALL ZERO BLOCK IN H.264

AN ENHANCED EARLY DETECTION METHOD FOR ALL ZERO BLOCK IN H.264 st January 0. Vol. 7 No. 005-0 JATIT & LLS. All rights reserved. ISSN: 99-865 www.jatit.org E-ISSN: 87-95 AN ENHANCED EARLY DETECTION METHOD FOR ALL ZERO BLOCK IN H.6 CONG-DAO HAN School of Electrical

More information

Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences

Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences Jingning Han, Vinay Melkote, and Kenneth Rose Department of Electrical and Computer Engineering University of California, Santa

More information

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction SPIE Conference on Visual Communications and Image Processing, Perth, Australia, June 2000 1 A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction Markus Flierl, Thomas

More information

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL 12, NO 11, NOVEMBER 2002 957 Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression Markus Flierl, Student

More information

MPEG-4 AVC/H.264 Motion Vector Prediction Rev. 4

MPEG-4 AVC/H.264 Motion Vector Prediction Rev. 4 MPEG-4 AVC/H.264 Motion Vector Prediction Rev. 4 Author: Shevach Riabtsev (Israel, Haifa), riabtsev@yahoo.com Reviewer: Mohsen Abdoli, MS.c Student of Computer Engineering, Sharif University of Technology,

More information

6. H.261 Video Coding Standard

6. H.261 Video Coding Standard 6. H.261 Video Coding Standard ITU-T (formerly CCITT) H-Series of Recommendations 1. H.221 - Frame structure for a 64 to 1920 kbits/s channel in audiovisual teleservices 2. H.230 - Frame synchronous control

More information

4x4 Transform and Quantization in H.264/AVC

4x4 Transform and Quantization in H.264/AVC Video compression design, analysis, consulting and research White Paper: 4x4 Transform and Quantization in H.264/AVC Iain Richardson / VCodex Limited Version 1.2 Revised November 2010 H.264 Transform and

More information

Detailed Review of H.264/AVC

Detailed Review of H.264/AVC Detailed Review of H.264/AVC, Ph.D.. abuhajar@digitavid.net (408) 506-2776 P.O. BOX:720998 San Jose, CA 95172 1 Outline Common Terminologies Color Space Macroblock and Slice Type Slice Block Diagram Intra-Prediction

More information

A DISTRIBUTED VIDEO CODER BASED ON THE H.264/AVC STANDARD

A DISTRIBUTED VIDEO CODER BASED ON THE H.264/AVC STANDARD 5th European Signal Processing Conference (EUSIPCO 27), Poznan, Poland, September 3-7, 27, copyright by EURASIP A DISTRIBUTED VIDEO CODER BASED ON THE /AVC STANDARD Simone Milani and Giancarlo Calvagno

More information

HM9: High Efficiency Video Coding (HEVC) Test Model 9 Encoder Description Il-Koo Kim, Ken McCann, Kazuo Sugimoto, Benjamin Bross, Woo-Jin Han

HM9: High Efficiency Video Coding (HEVC) Test Model 9 Encoder Description Il-Koo Kim, Ken McCann, Kazuo Sugimoto, Benjamin Bross, Woo-Jin Han INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 29/WG 11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 MPEG 2012/N13154 October

More information

Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse

Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse Manjanaik.N1, Dr.Manjunath.R2 Electronics and Communication Engineering, Jain University,

More information

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2006 jzhang@cse.unsw.edu.au

More information

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding Wen-Hsiao Peng, Tihao Chiang, Hsueh-Ming Hang, and Chen-Yi Lee National Chiao-Tung University 1001 Ta-Hsueh Rd., HsinChu 30010,

More information

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5 Lecture : Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP959 Multimedia Systems S 006 jzhang@cse.unsw.edu.au Acknowledgement

More information

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders Tomasz Grajek and Marek Domański Poznan University of Technology Chair of Multimedia Telecommunications and Microelectronics ul.

More information

On Compression Encrypted Data part 2. Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University

On Compression Encrypted Data part 2. Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University On Compression Encrypted Data part 2 Prof. Ja-Ling Wu The Graduate Institute of Networking and Multimedia National Taiwan University 1 Brief Summary of Information-theoretic Prescription At a functional

More information

SSIM-Inspired Perceptual Video Coding for HEVC

SSIM-Inspired Perceptual Video Coding for HEVC 2012 IEEE International Conference on Multimedia and Expo SSIM-Inspired Perceptual Video Coding for HEVC Abdul Rehman and Zhou Wang Dept. of Electrical and Computer Engineering, University of Waterloo,

More information

arxiv: v1 [cs.mm] 10 Mar 2016

arxiv: v1 [cs.mm] 10 Mar 2016 Predicting Chroma from Luma with Frequency Domain Intra Prediction Nathan E. Egge and Jean-Marc Valin Mozilla, Mountain View, USA Xiph.Org Foundation arxiv:1603.03482v1 [cs.mm] 10 Mar 2016 ABSTRACT This

More information

Introduction to Video Compression H.261

Introduction to Video Compression H.261 Introduction to Video Compression H.6 Dirk Farin, Contact address: Dirk Farin University of Mannheim Dept. Computer Science IV L 5,6, 683 Mannheim, Germany farin@uni-mannheim.de D.F. YUV-Colorspace Computer

More information

Scalable resource allocation for H.264 video encoder: Frame-level controller

Scalable resource allocation for H.264 video encoder: Frame-level controller Scalable resource allocation for H.264 video encoder: Frame-level controller Michael M. Bronstein Technion Israel Institute of Technology September 7, 2009 Abstract Tradeoff between different resources

More information

Achieving H.264-like compression efficiency with distributed video coding

Achieving H.264-like compression efficiency with distributed video coding Achieving H.264-like compression efficiency with distributed video coding Simone Milani a, Jiajun Wang b and Kannan Ramchandran b a Dept. of Information Engineering, University of Padova, Italy. b Dept.

More information

The Choice of MPEG-4 AAC encoding parameters as a direct function of the perceptual entropy of the audio signal

The Choice of MPEG-4 AAC encoding parameters as a direct function of the perceptual entropy of the audio signal The Choice of MPEG-4 AAC encoding parameters as a direct function of the perceptual entropy of the audio signal Claus Bauer, Mark Vinton Abstract This paper proposes a new procedure of lowcomplexity to

More information

on a per-coecient basis in large images is computationally expensive. Further, the algorithm in [CR95] needs to be rerun, every time a new rate of com

on a per-coecient basis in large images is computationally expensive. Further, the algorithm in [CR95] needs to be rerun, every time a new rate of com Extending RD-OPT with Global Thresholding for JPEG Optimization Viresh Ratnakar University of Wisconsin-Madison Computer Sciences Department Madison, WI 53706 Phone: (608) 262-6627 Email: ratnakar@cs.wisc.edu

More information

Fast Bi-Directional Prediction Selection in H.264/MPEG-4 AVC Temporal Scalable Video Coding

Fast Bi-Directional Prediction Selection in H.264/MPEG-4 AVC Temporal Scalable Video Coding 3508 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 12, DECEMBER 2011 Fast Bi-Directional Prediction Selection in H.264/MPEG-4 AVC Temporal Scalable Video Coding Hung-Chih Lin, Hsueh-Ming Hang, Fellow,

More information

Modelling of produced bit rate through the percentage of null quantized transform coefficients ( zeros )

Modelling of produced bit rate through the percentage of null quantized transform coefficients ( zeros ) Rate control strategies in H264 Simone Milani (simone.milani@dei.unipd.it) with the collaboration of Università degli Studi di adova ST Microelectronics Summary General scheme of the H.264 encoder Rate

More information

Predictive Coding. Prediction Prediction in Images

Predictive Coding. Prediction Prediction in Images Prediction Prediction in Images Predictive Coding Principle of Differential Pulse Code Modulation (DPCM) DPCM and entropy-constrained scalar quantization DPCM and transmission errors Adaptive intra-interframe

More information

IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER. Shohei Matsuo, Seishi Takamura, and Hirohisa Jozawa

IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER. Shohei Matsuo, Seishi Takamura, and Hirohisa Jozawa 2th European Signal Processing Conference (EUSIPCO 212 Bucharest, Romania, August 27-31, 212 IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER Shohei Matsuo, Seishi Takamura, and Hirohisa

More information

Predictive Coding. Prediction

Predictive Coding. Prediction Predictive Coding Prediction Prediction in Images Principle of Differential Pulse Code Modulation (DPCM) DPCM and entropy-constrained scalar quantization DPCM and transmission errors Adaptive intra-interframe

More information

LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST. Fatih Kamisli. Middle East Technical University Ankara, Turkey

LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST. Fatih Kamisli. Middle East Technical University Ankara, Turkey LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST Fatih Kamisli Middle East Technical University Ankara, Turkey ABSTRACT It is desirable to support efficient lossless coding within video coding

More information

Converting DCT Coefficients to H.264/AVC

Converting DCT Coefficients to H.264/AVC MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Converting DCT Coefficients to H.264/AVC Jun Xin, Anthony Vetro, Huifang Sun TR2004-058 June 2004 Abstract Many video coding schemes, including

More information

Video Coding With Linear Compensation (VCLC)

Video Coding With Linear Compensation (VCLC) Coding With Linear Compensation () Arif Mahmood Zartash Afzal Uzmi Sohaib Khan School of Science and Engineering Lahore University of Management Sciences, Lahore, Pakistan {arifm, zartash, sohaib}@lums.edu.pk

More information

Intra Prediction by a linear combination of Template Matching predictors

Intra Prediction by a linear combination of Template Matching predictors Intra Prediction by a linear combination of Template Matching predictors Laurent Guillo, Tangi Poirier, Christine Guillemot To cite this version: Laurent Guillo, Tangi Poirier, Christine Guillemot. Intra

More information

High Throughput Entropy Coding in the HEVC Standard

High Throughput Entropy Coding in the HEVC Standard DOI 10.1007/s11265-014-0900-5 High Throughput Entropy Coding in the HEVC Standard Jung-Ah Choi & Yo-Sung Ho Received: 13 June 2013 /Accepted: 21 April 2014 # Springer Science+Business Media New York 2014

More information

CSE 408 Multimedia Information System Yezhou Yang

CSE 408 Multimedia Information System Yezhou Yang Image and Video Compression CSE 408 Multimedia Information System Yezhou Yang Lots of slides from Hassan Mansour Class plan Today: Project 2 roundup Today: Image and Video compression Nov 10: final project

More information

Multi-Hypothesis based Distributed Video Coding using LDPC Codes

Multi-Hypothesis based Distributed Video Coding using LDPC Codes Multi-Hypothesis based Distributed Video Coding using LDPC Codes Kiran Misra, Shirish Karande, Hayder Radha Department of Electrical and Computer Engineering 2120, Engineering Building Michigan State University

More information

Phase-Correlation Motion Estimation Yi Liang

Phase-Correlation Motion Estimation Yi Liang EE 392J Final Project Abstract Phase-Correlation Motion Estimation Yi Liang yiliang@stanford.edu Phase-correlation motion estimation is studied and implemented in this work, with its performance, efficiency

More information

A VC-1 TO H.264/AVC INTRA TRANSCODING USING ENCODING INFORMATION TO REDUCE RE-QUANTIZATION NOISE

A VC-1 TO H.264/AVC INTRA TRANSCODING USING ENCODING INFORMATION TO REDUCE RE-QUANTIZATION NOISE A VC- TO H.6/AVC INTRA TRANSCODING USING ENCODING INFORMATION TO REDUCE RE-QUANTIZATION NOISE Takeshi Yoshitome, Yoshiyuki Nakajima, and Kazuto Kamikura NTT Cyer Space Laoratories, NTT Corporation, Yokosuka-shi,

More information

Wavelet Scalable Video Codec Part 1: image compression by JPEG2000

Wavelet Scalable Video Codec Part 1: image compression by JPEG2000 1 Wavelet Scalable Video Codec Part 1: image compression by JPEG2000 Aline Roumy aline.roumy@inria.fr May 2011 2 Motivation for Video Compression Digital video studio standard ITU-R Rec. 601 Y luminance

More information

Neural network based intra prediction for video coding

Neural network based intra prediction for video coding Neural network based intra prediction for video coding J. Pfaff, P. Helle, D. Maniry, S. Kaltenstadler, W. Samek, H. Schwarz, D. Marpe, T. Wiegand Video Coding and Analytics Department, Fraunhofer Institute

More information

Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding

Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding 1 Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding Mohammad Kazemi, Razib Iqbal, Shervin Shirmohammadi Abstract Multiple Description Coding (MDC) is a robust

More information

SIGNAL COMPRESSION. 8. Lossy image compression: Principle of embedding

SIGNAL COMPRESSION. 8. Lossy image compression: Principle of embedding SIGNAL COMPRESSION 8. Lossy image compression: Principle of embedding 8.1 Lossy compression 8.2 Embedded Zerotree Coder 161 8.1 Lossy compression - many degrees of freedom and many viewpoints The fundamental

More information

1 Overview. Coding flow

1 Overview. Coding flow Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6) 3 rd Meeting: Fairfax, Virginia, USA, 6-10 May, 2002 Document: JVT-C028 Filename: JVT-C028.doc Title: Status:

More information

Human Visual System Based Adaptive Inter Quantization

Human Visual System Based Adaptive Inter Quantization Human Visual System Based Adaptive Inter Quantization Jin Li 1, Jari Koivusaari 1,Jarma akala 1,Moncef Gabbouj 1 and Hexin Chen 2 Department of Information echnology, ampere University of echnology ampere,

More information

MULTIVIEW video can provide real depth perception,

MULTIVIEW video can provide real depth perception, 390 IEEE TRANSACTIONS ON BROADCASTING, VOL. 59, NO. 2, JUNE 2013 DIRECT Mode Early Decision Optimization Based on Rate Distortion Cost Property and Inter-view Correlation Yun Zhang, Sam Kwong, Senior Member,

More information

H.264 / MPEG-4 Part 10 : Intra Prediction

H.264 / MPEG-4 Part 10 : Intra Prediction 264 / PEG-4 Part 10 : ntra Prediction 264 / PEG-4 Part 10 White Paper Revised April 03 Prediction of ntra acroblocks 1 ntroduction The oint ideo Team (T) of SO/EC PEG and TU-T CEG are finalising a new

More information

CHAPTER 3. Implementation of Transformation, Quantization, Inverse Transformation, Inverse Quantization and CAVLC for H.

CHAPTER 3. Implementation of Transformation, Quantization, Inverse Transformation, Inverse Quantization and CAVLC for H. CHAPTER 3 Implementation of Transformation, Quantization, Inverse Transformation, Inverse Quantization and CAVLC for H.264 Video Encoder 3.1 Introduction The basics of video processing in H.264 Encoder

More information

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms Flierl and Girod: Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms, IEEE DCC, Mar. 007. Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms Markus Flierl and Bernd Girod Max

More information

Multimedia Networking ECE 599

Multimedia Networking ECE 599 Multimedia Networking ECE 599 Prof. Thinh Nguyen School of Electrical Engineering and Computer Science Based on lectures from B. Lee, B. Girod, and A. Mukherjee 1 Outline Digital Signal Representation

More information

Selective Use Of Multiple Entropy Models In Audio Coding

Selective Use Of Multiple Entropy Models In Audio Coding Selective Use Of Multiple Entropy Models In Audio Coding Sanjeev Mehrotra, Wei-ge Chen Microsoft Corporation One Microsoft Way, Redmond, WA 98052 {sanjeevm,wchen}@microsoft.com Abstract The use of multiple

More information

Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming

Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming Analysis of Rate-distortion Functions and Congestion Control in Scalable Internet Video Streaming Min Dai Electrical Engineering, Texas A&M University Dmitri Loguinov Computer Science, Texas A&M University

More information

A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv Techniques

A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv Techniques Hindawi Publishing Corporation EURASIP Journal on Applied Signal Processing Volume 26, Article ID 6971, Pages 1 18 DOI 1.1155/ASP/26/6971 A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv

More information

Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control

Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control by Yueming Gao A thesis presented to the University of Waterloo in fulfilment of the thesis requirement

More information

Image Data Compression

Image Data Compression Image Data Compression Image data compression is important for - image archiving e.g. satellite data - image transmission e.g. web data - multimedia applications e.g. desk-top editing Image data compression

More information

HARMONIC VECTOR QUANTIZATION

HARMONIC VECTOR QUANTIZATION HARMONIC VECTOR QUANTIZATION Volodya Grancharov, Sigurdur Sverrisson, Erik Norvell, Tomas Toftgård, Jonas Svedberg, and Harald Pobloth SMN, Ericsson Research, Ericsson AB 64 8, Stockholm, Sweden ABSTRACT

More information

Fast Progressive Wavelet Coding

Fast Progressive Wavelet Coding PRESENTED AT THE IEEE DCC 99 CONFERENCE SNOWBIRD, UTAH, MARCH/APRIL 1999 Fast Progressive Wavelet Coding Henrique S. Malvar Microsoft Research One Microsoft Way, Redmond, WA 98052 E-mail: malvar@microsoft.com

More information

A NEW BASIS SELECTION PARADIGM FOR WAVELET PACKET IMAGE CODING

A NEW BASIS SELECTION PARADIGM FOR WAVELET PACKET IMAGE CODING A NEW BASIS SELECTION PARADIGM FOR WAVELET PACKET IMAGE CODING Nasir M. Rajpoot, Roland G. Wilson, François G. Meyer, Ronald R. Coifman Corresponding Author: nasir@dcs.warwick.ac.uk ABSTRACT In this paper,

More information

Thor update. High Efficiency, Moderate Complexity Video Codec using only RF IPR

Thor update. High Efficiency, Moderate Complexity Video Codec using only RF IPR Thor update High Efficiency, Moderate Complexity Video Codec using only RF IPR draft-fuldseth-netvc-thor-01 Steinar Midtskogen (Cisco) IETF 94 Yokohama, JP November 2015 1 IPR note https://datatracker.ietf.org/ipr/2636/

More information

Vector Quantization Encoder Decoder Original Form image Minimize distortion Table Channel Image Vectors Look-up (X, X i ) X may be a block of l

Vector Quantization Encoder Decoder Original Form image Minimize distortion Table Channel Image Vectors Look-up (X, X i ) X may be a block of l Vector Quantization Encoder Decoder Original Image Form image Vectors X Minimize distortion k k Table X^ k Channel d(x, X^ Look-up i ) X may be a block of l m image or X=( r, g, b ), or a block of DCT

More information

Product Obsolete/Under Obsolescence. Quantization. Author: Latha Pillai

Product Obsolete/Under Obsolescence. Quantization. Author: Latha Pillai Application Note: Virtex and Virtex-II Series XAPP615 (v1.1) June 25, 2003 R Quantization Author: Latha Pillai Summary This application note describes a reference design to do a quantization and inverse

More information

Progressive Wavelet Coding of Images

Progressive Wavelet Coding of Images Progressive Wavelet Coding of Images Henrique Malvar May 1999 Technical Report MSR-TR-99-26 Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052 1999 IEEE. Published in the IEEE

More information

Original citation: Prangnell, Lee, Sanchez Silva, Victor and Vanam, Rahul (05) Adaptive quantization by soft thresholding in HEVC. In: IEEE Picture Coding Symposium, Queensland, Australia, 3 May 03 Jun

More information

Statistical Analysis and Distortion Modeling of MPEG-4 FGS

Statistical Analysis and Distortion Modeling of MPEG-4 FGS Statistical Analysis and Distortion Modeling of MPEG-4 FGS Min Dai Electrical Engineering Texas A&M University, TX 77843 Dmitri Loguinov Computer Science Texas A&M University, TX 77843 Hayder Radha Hayder

More information

Lecture 7 Predictive Coding & Quantization

Lecture 7 Predictive Coding & Quantization Shujun LI (李树钧): INF-10845-20091 Multimedia Coding Lecture 7 Predictive Coding & Quantization June 3, 2009 Outline Predictive Coding Motion Estimation and Compensation Context-Based Coding Quantization

More information

Transform-Domain Rate-Distortion Optimization Accelerator for H.264/AVC Video Encoding

Transform-Domain Rate-Distortion Optimization Accelerator for H.264/AVC Video Encoding Transorm-Domain Rate-Distortion Optimization Accelerator or H.64/AVC Video Encoding Mohammed Golam Sarwer, Lai Man Po, Kai Guo and Q.M. Jonathan Wu Abstract In H.64/AVC video encoding, rate-distortion

More information

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding Beibei Wang, Yao Wang, Ivan Selesnick and Anthony Vetro TR2004-132 December

More information

CSE 126 Multimedia Systems Midterm Exam (Form A)

CSE 126 Multimedia Systems Midterm Exam (Form A) University of California, San Diego Inst: Prof P. V. Rangan CSE 126 Multimedia Systems Midterm Exam (Form A) Spring 2003 Solution Assume the following input (before encoding) frame sequence (note that

More information

EE368B Image and Video Compression

EE368B Image and Video Compression EE368B Image and Video Compression Homework Set #2 due Friday, October 20, 2000, 9 a.m. Introduction The Lloyd-Max quantizer is a scalar quantizer which can be seen as a special case of a vector quantizer

More information

MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING

MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING Yannick Morvan, Dirk Farin University of Technology Eindhoven 5600 MB Eindhoven, The Netherlands email: {y.morvan;d.s.farin}@tue.nl Peter

More information

Prediction-Guided Quantization for Video Tone Mapping

Prediction-Guided Quantization for Video Tone Mapping Prediction-Guided Quantization for Video Tone Mapping Agnès Le Dauphin a, Ronan Boitard a,b, Dominique Thoreau a, Yannick Olivier a, Edouard Francois a and Fabrice LeLéannec a a Technicolor, 975 Av. des

More information

Rate-distortion Analysis and Control in DCT-based Scalable Video Coding. Xie Jun

Rate-distortion Analysis and Control in DCT-based Scalable Video Coding. Xie Jun Rate-distortion Analysis and Control in DCT-based Scalable Video Coding Xie Jun School of Computer Engineering A thesis submitted to the Nanyang Technological University in fulfillment of the requirement

More information

Relationship Between λ and Q in RDO

Relationship Between λ and Q in RDO Jmspeex Journal of Dubious Theoretical Results July 6, 015 Abstract This is a log of theoretical calculations and approximations that are used in some of the Daala code. Some approximations are likely

More information

OPTIMAL INTRA CODING OF HEVC BY STRUCTURED SET PREDICTION MODE WITH DISCRIMINATIVE LEARNING. Wenrui Dai, Hongkai Xiong

OPTIMAL INTRA CODING OF HEVC BY STRUCTURED SET PREDICTION MODE WITH DISCRIMINATIVE LEARNING. Wenrui Dai, Hongkai Xiong OPTIMAL INTRA CODING OF HEVC BY STRUCTURED SET PREDICTION MODE WITH DISCRIMINATIVE LEARNING Wenrui Dai, Hongkai Xiong Department of Electronic Engineering, Shanghai Jiao Tong Universit, Shanghai 200240,

More information

3drs e3drs fs e3drs fs Rate (kbps) Mother and Daughter (b) Miss America (a) 140.

3drs e3drs fs e3drs fs Rate (kbps) Mother and Daughter (b) Miss America (a) 140. oise{robust Recursive otion Estimation for H.263{based videoconferencing systems Stefano Olivieriy, Gerard de Haan z, and Luigi Albaniy yphilips S.p.A, Philips Research onza Via Philips, 12, 252 onza (I,

More information

VIDEO CODING USING A SELF-ADAPTIVE REDUNDANT DICTIONARY CONSISTING OF SPATIAL AND TEMPORAL PREDICTION CANDIDATES. Author 1 and Author 2

VIDEO CODING USING A SELF-ADAPTIVE REDUNDANT DICTIONARY CONSISTING OF SPATIAL AND TEMPORAL PREDICTION CANDIDATES. Author 1 and Author 2 VIDEO CODING USING A SELF-ADAPTIVE REDUNDANT DICTIONARY CONSISTING OF SPATIAL AND TEMPORAL PREDICTION CANDIDATES Author 1 and Author 2 Address - Line 1 Address - Line 2 Address - Line 3 ABSTRACT All standard

More information

(12) Patent Application Publication (10) Pub. No.: US 2009/ A1

(12) Patent Application Publication (10) Pub. No.: US 2009/ A1 US 2009.0245.349A1 (19) United States (12) Patent Application Publication (10) Pub. No.: US 2009/0245349 A1 Zhao et al. (43) Pub. Date: Oct. 1, 2009 (54) METHODS AND SYSTEMS FOR PARALLEL (21) Appl. No.:

More information

Waveform-Based Coding: Outline

Waveform-Based Coding: Outline Waveform-Based Coding: Transform and Predictive Coding Yao Wang Polytechnic University, Brooklyn, NY11201 http://eeweb.poly.edu/~yao Based on: Y. Wang, J. Ostermann, and Y.-Q. Zhang, Video Processing and

More information

Rate-Distortion Based Temporal Filtering for. Video Compression. Beckman Institute, 405 N. Mathews Ave., Urbana, IL 61801

Rate-Distortion Based Temporal Filtering for. Video Compression. Beckman Institute, 405 N. Mathews Ave., Urbana, IL 61801 Rate-Distortion Based Temporal Filtering for Video Compression Onur G. Guleryuz?, Michael T. Orchard y? University of Illinois at Urbana-Champaign Beckman Institute, 45 N. Mathews Ave., Urbana, IL 68 y

More information

Proc. of NCC 2010, Chennai, India

Proc. of NCC 2010, Chennai, India Proc. of NCC 2010, Chennai, India Trajectory and surface modeling of LSF for low rate speech coding M. Deepak and Preeti Rao Department of Electrical Engineering Indian Institute of Technology, Bombay

More information

The training of Karhunen Loève transform matrix and its application for H.264 intra coding

The training of Karhunen Loève transform matrix and its application for H.264 intra coding DOI 10.1007/s11042-008-0221-4 The training of Karhunen Loève transform matrix and its application for H.264 intra coding Yi Gao & Jiazhong Chen & Shengsheng Yu & Jingli Zhou & Lai-Man Po # Springer Science

More information

DELIVERING video of good quality over the Internet or

DELIVERING video of good quality over the Internet or 1448 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 10, OCTOBER 2009 Error Resilient Video Coding Using B Pictures in H.264 Mengyao Ma, Student Member, IEEE, Oscar C. Au,

More information

State of the art Image Compression Techniques

State of the art Image Compression Techniques Chapter 4 State of the art Image Compression Techniques In this thesis we focus mainly on the adaption of state of the art wavelet based image compression techniques to programmable hardware. Thus, an

More information

Hyper-Trellis Decoding of Pixel-Domain Wyner-Ziv Video Coding

Hyper-Trellis Decoding of Pixel-Domain Wyner-Ziv Video Coding 1 Hyper-Trellis Decoding of Pixel-Domain Wyner-Ziv Video Coding Arun Avudainayagam, John M. Shea, and Dapeng Wu Wireless Information Networking Group (WING) Department of Electrical and Computer Engineering

More information

Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources

Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources Russell M. Mersereau Center for Signal and Image Processing Georgia Institute of Technology Outline Cache vector quantization Lossless

More information

A Novel Multi-Symbol Curve Fit based CABAC Framework for Hybrid Video Codec s with Improved Coding Efficiency and Throughput

A Novel Multi-Symbol Curve Fit based CABAC Framework for Hybrid Video Codec s with Improved Coding Efficiency and Throughput A Novel Multi-Symbol Curve Fit based CABAC Framework for Hybrid Video Codec s with Improved Coding Efficiency and Throughput by Krishnakanth Rapaka A thesis presented to the University of Waterloo in fulfilment

More information

Information and Entropy

Information and Entropy Information and Entropy Shannon s Separation Principle Source Coding Principles Entropy Variable Length Codes Huffman Codes Joint Sources Arithmetic Codes Adaptive Codes Thomas Wiegand: Digital Image Communication

More information

Optimization of Selective Enhancement for MPEG-4 Fine Granularity Scalability

Optimization of Selective Enhancement for MPEG-4 Fine Granularity Scalability Optimization of Selective Enhancement for MPEG-4 Fine Granularity Scalability Wen-Shiaw Peng, H.C. Huang and Tihao Chiang Dept. of Electronics Engineering, National Chiao Tung University, 1001, University

More information

New Trends in High Definition Video Compression - Application to Multiple Description Coding

New Trends in High Definition Video Compression - Application to Multiple Description Coding New Trends in High Definition Video Compression - Application to Multiple Description Coding Marie Andrée Agostini To cite this version: Marie Andrée Agostini. New Trends in High Definition Video Compression

More information