Transform-Domain Rate-Distortion Optimization Accelerator for H.264/AVC Video Encoding

Size: px
Start display at page:

Download "Transform-Domain Rate-Distortion Optimization Accelerator for H.264/AVC Video Encoding"

Transcription

1 Transorm-Domain Rate-Distortion Optimization Accelerator or H.64/AVC Video Encoding Mohammed Golam Sarwer, Lai Man Po, Kai Guo and Q.M. Jonathan Wu Abstract In H.64/AVC video encoding, rate-distortion optimization or mode selection plays a signiicant role to achieve outstanding perormance in compression eiciency and video quality. However, this mode selection process also makes the encoding process extremely complex, especially in the computation o the ratedistortion cost unction, which includes the computations o the sum o squared dierence (SSD) between the original and reconstructed image blocks and context-based entropy coding o the block. In this paper, a transorm-domain rate-distortion optimization accelerator based on ast SSD (FSSD) and VLC-based rate estimation algorithm is proposed. This algorithm could signiicantly simpliy the hardware architecture or the rate-distortion cost computation with only ignorable perormance degradation. An eicient hardware structure or implementing the proposed transorm-domain rate-distortion optimization accelerator is also proposed. Simulation results demonstrated that the proposed algorithm reduces about 47% o total encoding time with negligible degradation o coding perormance. The proposed method can be easily applied to many mobile video application areas such as a digital camera and a DMB (Digital Multimedia Broadcasting) phone. Keywords Context-adaptive variable length coding (CAVLC), H.64/AVC, rate-distortion optimization (RDO), sum o squared dierence (SSD). H I. INTRODUCTION.64/AVC is the newest hybrid video coding standard developed by ITU-T Video Coding Expert Group (VCEG) and ISO/IEC MPEG Video Group named Joint Video Group (JVT) [1, ]. H.64/AVC has proved its superiority in coding eiciency over its precedents, e.g., it shows a more than 40% rate reduction over H.63 [3]. The rate-distortion optimization (RDO) is one o the essential parts o the H.64/AVC encoder to achieve the much better coding perormance. However, the computational complexity o the RDO technique is extremely high. Hence, the cost unction Mohammed Golam Sarwer was with City University o Hong Kong, Hong Kong. He is now with the University o Windsor, Windsor, ON, Canada ( sarwer@uwindsor.ca). Lai Man Po and Kai guo are with the Department o Electronic Engineering, City University o Hong Kong, Kowloon, Hong Kong, ( eelmpo@cityu.edu.hk). Q. M. Jonathan Wu is with the Department o Electrical and Computer Engineering, University o Windsor, Windsor, ON, Canada( jwu@uwindsor.ca). computation makes H.64/AVC impossible to be realized in real time applications without high computing hardware. In order to accelerate H.64/AVC video coding, a number o researches have been made to explore ast algorithms in motion estimation [4-5] and mode decision [6-7]. To reduce complexity, a new cost unction or intra 4x4 mode decisions was proposed in [9]. In this cost unction, sum o absolute integer transorm dierence (SAITD) is used in distortion part and a rate prediction algorithm is used in rate part. A new transorm domain rate estimation and distortion measure was proposed to reduce the rate-distortion cost unction computation with ignorable coding perormance degradation [10]. But the computation reduction is low. In order to avoid the entropy coding method during mode decision process, several methods [11, 1, 13] are introduced to predict the bit rate o 4x4 block. To reduce the complexity o distortion part o RDO technique, a new ast sum o squared dierence (FSSD) computation algorithm with use o an iterative tablelookup quantization process was proposed in [14]. This FSSD algorithm was based on the theoretical equivalent o the SSDs in spatial and transorm domain. This approach avoided the inverse quantization/transorm and pixel reconstruction processes with nearby no rate-distortion perormance degradation. In this paper, a transorm-domain rate-distortion optimization accelerator is developed based on ast sum o squared dierence (FSSD) and rate estimation algorithm. A method to estimate the rate o context-adaptive variable length coding (CAVLC) is proposed. An eicient hardware structure or implementing the proposed transorm-domain RDO accelerator is also introduced. The remainder o this paper is organized as ollows. Section provides the review o rate distortion optimized mode decision technique. In section 3, transorm domain ast sum o squared dierence (FSSD) calculation is described. Section 4 introduces the ast rate estimation method. In section 5, algorithm o proposed rate distortion optimization accelerator is presented. Hardware architecture o proposed RDO accelerator is proposed in section 6. The simulation results o the proposed method are presented in section 7. Finally, section 8 concludes the paper. 38

2 II. RATE DISTORTION OPTIMIZED MODE DECISION OF H.64/AVC To take the ull advantages o all modes, the H.64/AVC encoder can determine the mode that meets the best RD tradeo using RDO mode decision scheme. The best mode is the one with minimum rate-distortion (RD) cost and this cost is deined as J RD = SSD( S, C) λ R (1) where λ is the Lagrange multiplier, R is the number o coded bits or each macroblock which can be written as ollows: R = Rheader Rmotion Rres () where R header, R motion and Rres means the number o bits need to encode the header inormation, motion vectors and quantized residual block, respectively. The SSD(S,C) in (1) is the sum o the squared dierence (SSD) between the original blocks S and the reconstructed block C, and it can be expressed as N 1 N 1 SSD(, ) = S C ( S C ) (3) i= 0 j= 0 where S and C are the (i, j)th elements o the current original block S and the reconstructed block C, respectively. In order to compute RD cost or each mode, same operation o orward and inverse transorm/quantization and entropy coding is repetitively perormed. All o these processing explains the high complexity o RD cost computation. International Journal o Signal Processing 5:3 009 From (4), it is shown that to calculate the SSD we need 16 multipliers and 16 square operators. Actually the q is the constant value. The value o q at dierent pixel position is given in Table 1. In the irst region o Table 1 multiplication by q can easily by implemented by shiting right by two bits. To avoid the multiplication operation in second region q is 1 1 approximated as. Hence multiplication in this region 10 3 is replaced by shiting right by three bits. Similarly, or region III, q is approximated as ( ) From simulations, it is shown that these approximations do not signiicantly aect the SSD calculation as well as rate distortion perormance. It should be noted that no computation is necessary or shiting operator. In order to avoid the square operation, we have used a square table. It should be noted again that is the coeicients beore quantization and ˆ is the coeicients ater inverse quantization. So the sign o and ˆ is always same. Additionally, the dierences between size and. Mathematically, ˆ must be smaller than quantization step ˆ ˆ < < q III. FAST SUM OF ABSOLUTE DIFFERENCE CALCULATION The cause o dierence between original block and reconstructed block is due to the quantization error. Hence the spatial domain SSD and transorm domain SSD is theoretically equivalent [14]. Mathematically, where, block, N = 1 N 1 SSD ( S, C) ( q [ ˆ ]) (4) i= 0 j= 0 is the (i,j)th element o integer cosine transormed ˆ is the (i,j)th element o inverse quantized transorm block and q is the (i,j)th element o scaled transorm matrix. The relationship between deined as [14] ˆ 1 Q ( Q( )) = round( and = ˆ is = / ) Z (5) where = / q represents the new quantization step size deine in [14] and is the quantization step size o original H.64 encoder. In order to avoid the multiplication and division operation in (5), we proposed an iterative tablelookup quantization process in [14]. ˆ q ( ) < (6) TABLE I q AT DIFFERENT POSITION Region Position (i, j) I (0,0),(0,),(,0),(,) II (1,1),(1,3),(3,1),(3,3) III Others For example, the value o is 10, 16, 6, 40 and 64 or QP is 4, 8, 3, 36, and 40, respectively. I we choose QP=4, the let hand side o (6) is always less that 10. Additionally, the values o let hand sides are always integer. I we store the squares o all integer values which are less than 10 (or QP=4), then 16 square operation can be saved. In this way, we need additional memory requirement. But the size o memory is not large. We have to store only 10 elements in case o QP=4. Thus, the computational intensive multiplication and square operations can be avoided. q 1 a = 4 b = ab 1 = 4 5

3 IV. FAST RATE ESTIMATION Context adaptive variable length coding (CAVLC) uses ive syntax elements to encode the 4x4 block. To estimate the bits or quantized transorm coeicients, we estimate the number o bits or each o ive dierent types o symbols o CAVLC separately. A. Coeicient Token Both the total number o nonzero coeicients (Ncoe) and the number o trailing /- 1s ( T L ) are coded as combined event. For all elements, except chroma DC, a choice between three tables and one ixed length codeword is made [15]. N T is a parameter used or table selection. Assume Nu and N L is the number o non-zero coeicients o upper and let block, respectively. I both upper and let blocks are available then N T = ( Nu N L )/. I only upper block available N T =Nu; i only let block available N T =N L ; i neither is available N T =0. Since to encode coeicient token, three dierent VLC tables are used, it needs more memory to store all the possible symbols. This is not eicient or hardware implementation in terms o memory requirement. It is well known that neighboring blocks are highly spatially correlated. Hence there is high correlation between parameter N T and number o nonzero coeicients o current block (Ncoe). Thereore, we can replace N T by Ncoe or selection o VLC tables. The algorithm or table selection o proposed method given below: i ( 0<= N coe <): Num-VLC0 i ( <= N coe <4): Num-VLC1 i ( 4<= N coe <=7): Num-VLC i (N coe >7): 6 bit ixed length codeword TABLE II PROBABILITY OF WRONG TABLE SELECTION Sequence QP Akiyo Claire Foreman Container Stean Miss_America Silent Salesman In order to justiy the above algorithm, we have done several experiments and calculated the probability o wrong table selection o dierent video sequences with dierent QP values. Table shows the probability o wrong table selection. From Table, we have seen that this probability is very low or low motion sequences such as Akiyo, Claire, and Miss America. Although this probability or medium and high motion sequences is large, it does not greatly inluence the rate-distortion perormance. This is because number o bits or a particular coeicient token is almost similar in all VLC tables. Let us assume a symbol with number o non-zero coeicient is 4 and number o trailing ones is 3. The number o bits using VLC tables 0 is 6, while the number o bits is 4 or both VLC tables 1 and [15]. Similarly when the number o non-zero coeicients is 3 and number o trailing one is 0, International Journal o Signal Processing 5: number o bits to encode this symbol are 9, 7and 6 or VLC tables 0, 1 and, respectively. By combing the mentioned algorithm with VLC tables, we have developed a rate table to encode the coeicient token. Rate table is generated by collecting the length o codeword rom the dierent tables. For example, when Ncoe is either 0 or 1, data is collected rom VLC table0 and while the value o Ncoe is either or 3, data is collected rom VLC table 1. From observation o VLC tables, we have seen that the rate is exactly equal to 6 while number o non-zero coeicient o current block (Ncoe ) is greater than 7. Otherwise, rate o 4x4 block ( Rcoe) is estimated based on a new rate table deined in Table 3. Thereore, algorithm or proposed rate estimation or coeicient token o 4x4 block is given as ollows: i (Number o nonzero coeicients> 7) R coe =6; else search rate rom Table 3; TABLE III RATE TABLE FOR COEFFICIENT TOKEN OF 4X4 BLOCK Trailing ones ( T L) Total Coeicients ( N Coe) B. Sign o Trailing ones One bit is used to signal sign inormation o trailing /- 1s. 0 is used or positive and 1 is used or negative. So bit consumption to encode the sign o trailing ones ( R trail1 ) is exactly equal to the number o trailing ones (N L ). C. Encode the Level CAVLC makes the choice o the VLC look-up table or the level adaptive in a way where the choice depends on the recently coded levels [16]. One out o 7 level VLC tables is used to encode the level inormation [15]. Level VLC0 has its own structure while the other tables, level VLCN, N=1 to 6, share common structure. From the observation o VLC tables, we have seen that bit consumption to encode a coeicient is increasing with increasing the absolute value o that coeicient. Additionally, the bit consumption also depends on the absolute value o previously coded level coeicient. From the observation, rate or level is estimated as ollows R = L level( i) i or highest requency component (7.a) α β R or other component (7.b) level( i) = 1 Li 1 Li 1 α1 β1 where L i-1 is the recently encoded level o i th level. In order to set the weighting actors, we have done several experiments or dierent video sequences (Akiyo, Foreman, Stean, and Mobile) with QCIF ormat at dierent QP values. We have observed RD perormance o these video sequences at

4 dierent combinations o weighting actor. Better RD perormance was ound at α 1 = 3 and β 1 = 1. D. Encode Total Zeros The codeword Total_zeros is the number o zeros between the last non-zero coeicient o the zig-zag scan and its start. In case o mode decision process, only number o bits to encode the symbol is necessary. I only the length o codeword (instead o codeword) is retrieved rom the total_zero table, then we can avoid the extra calculation or codeword generation o CAVLC during mode decision process. E. Encode Runs Since ater transormation and quantization, blocks typically contain mostly zeros, CAVLC algorithm uses run-level coding to represent strings o zeros compactly. Run means the number o preceding zeros beore each non-zero coeicient and Zeros_let called the number o zeros let beore each non-zero coeicient. Based on Run and Zeros_let, a run VLC tables is used to encode this element [15]. From observation o run VLC table, it is shown that no o bits to encode this coeicient is increasing with the value o run. The estimated rate or run is proposed as R run( i) = α Runi β (8) Here α and β are the weighting actors. From observation o run table, we have seen that value o α and β are depends on the Run. Table 4 shows the value o α and β with dierent value o run. TABLE IV VALUE OF α AND β R run α β 0,1,and 1 1 3,4,5,and >6 1-3 International Journal o Signal Processing 5:3 009 Mathematically, P Rres( est) = Rcoe Rtrail1 Rlevel( i) Rzero Rrun( i) i= 1 i= 1 (9) where P and Q are the number o level and run, respectively. Fig. 1 shows the comparison o our proposed method with actual rate or the Foreman sequence. Data is collected rom the irst 100 4x4 block. QP actor is set as 8. It is shown that the proposed method is very closely matched with actual rate. V. RATE-DISTORTION OPTIMIZATION ACCELERATOR The overall rate-distortion cost computation process using the proposed method combined with FSSD [14] can be summarized as: Step 1: Compute the predicted block using inter or intra rame prediction: P Step : Compute the residual (dierence) block: D = S P Step 3: ICT transorm the residual block: F = ICT (D) Step 4: or i=0 to N-1 and j=0 to N-1 determine the quantized coeicients Z by the ollowing iterative tablelookup quantization process: Step I: Set k=0 and i < 0 then set sign=-1, otherwise sign=1. Step II: I k, then k=k1 and goto Step II; Step III: Set Z = sign k and ˆ = sign k. ; Step 5: Calculate SSD; Step 6: Estimate the number o bits R e = Rheader Rmotion Rres(est) Step 7: Calculate the R-D cost : J RD = SSD R e Q Actual Rate Estimated Rate Fig. 1 Estimated and the actual rates o Foreman (X-axis: Block number, Y-axis: number o bits) Thereore, the number o bits to encode the 4x4 block is calculated by summing the estimated bits o all symbols. Fig. Block diagram o proposed RDO The block diagram o proposed RDO accelerator is shown in Fig.. The major dierence is that the conventional quantization process is replaced by the iterative table-lookup quantization and the SSD calculation is preormed in the ICT transorm domain. Additionally, entropy coding technique is replaced by ast rate estimation algorithm. We can obtain the rate-distortion cost without computations o two scale transorms, quantization, inverse quantization, inverse ICT transorm and entropy coding. 41

5 ˆ 00 ˆ 01 ˆ Fig. 3. Hardware architecture or iterative table-lookup quantization process VI. HARDWARE ARCHITECTURE OF PROPOSED RDO ACCELERATOR Fig. 3 describes a possible hardware structure or the iterative table-lookup quantization implementation [14]. As each coeicient does not depend on the others, the quantization process can be perormed in parallel. In the other words, we can quantize all the coeicients o simultaneously. At irst, the quantization points and boundary points are generated by quantization table generator at the beginning o encoding process and then stored in the RAM, in which there is a pointer that points to the current boundary point. Then the current boundary point and quantization point are loaded in a register, which will compare with ater the sign o judges that is abstracted by a sign selector. I the comparator is larger, then Loop signal takes eect which tells counter to add one and the pointer to shit orward and point to the next boundary point. When is smaller than the current boundary point, Out signal takes eect which resets the pointer to the initial position, asks the counter to output the accumulated value z and asks the register to output the current quantization value, ˆ. From the structure in Fig. 3, we can ind that the proposed table-lookup quantization process is very suitable or hardware implementation, since it does not require complicated operation modules and can execute in the parallel mode. Fig. 4 shows the architecture o SSD calculator. It is shown that 16 multiplication operations are replaced by 4 shiting operators. It should be noted that no computation is necessary or shiting operator. In order to avoid the square operator, we have used a square table. In the square table, square o all integers within quantization step size are stored. In this way, we need additional memory requirement. But the size o memory (ROM) is not large because o the limited value o. For example encoder has to store 64 values i the quantization parameter QP is 40. Architecture o shiter or region III is given in Fig. 5. Here multiplier is replaced by two shit right operators. 4

6 00 ˆ 00 0 ˆ 0 0 ˆ 0 ˆ 11 ˆ ˆ 13 ˆ ˆ 01 ˆ ˆ ˆ 10 1 ˆ 1 1 ˆ 1 3 ˆ 3 30 ˆ 30 3 ˆ >> >> >> >> >>3 >>3 >>3 >>3 ROM (Square Table) SSD Fig. 4. Architecture o SSD calculator >> 3 >> 5 Fig. 5 Architecture o shiter or region III The hardware architecture or proposed ast rate estimation algorithm is shown in Fig. 6. The proposed architecture contains a number o counters and register to store the inormation or the block. Non-zero coeicients counter is used to store the number o non-zero coeicients ( N coe ). Trailing one counter is used to store the number o trailing /- 1s. Total zero counter is used to store the total number o zeros beore the last non-zero coeicient. Run beore register is used to store the number o zeros preceding each non-zero coeicient. The level detector checks i the coeicient is either level or not. The proposed architecture begins by reading the coeicients rom the input buer in reverse zig-zag order. In each cycle, it reads one coeicient rom the input buer analyzes the coeicient and updates the inormation stored in the related counter and register. Reverse zig-zag scanning enables us to determine the necessary inormation or encoding 4x4 block. It takes 16 cycles to read the coeicients and stored the corresponding inormation in the counter and register ile or each 4x4 block. Based on the number o non-zero coeicients, comparator 1 select either the rate table or coeicient token (ROM1) should be used or not. Comparator 1 compares the output o non-zero coeicient counter with content o register 4. Register 4 always store a ixed value 7. Comparator1 sent a signal 1 when the number o non-zero coeicients is greater than the ixed value 7. Then the register 1 is enabled. I comparator 1 sent the signal 0, rate table or coeicient token is activated. In this case, rate is ound by searching the rate table based on the number o non-zero coeicients and number o trailing ones. Similarly number o bits to encode the total zero is calculated by rate table or total zeros (ROM). The level detector checks i the coeicient is zero or not. I the level is non-zero, it will be stored into the level-irst-in-irst-out buer (FIFO), and the corresponding run inormation will be also stored to the run-fifo. Based on the content o FIFO, level and run rate estimators predict the rate o level and run, respectively. 43

7 Register 4 Load: 7 Register 1 Load: 6 Zig-zag scan address generator Non-zero coeicient counter Trailing one counter Comparator 1 Rate table or coeicient token (ROM1) M U X Input register Total-zero counter Run beore register FIFO Rate-table or total zero (ROM) Run rate estimator A D D E R Estimated rate Level Detector FIFO Level rate estimator Fig. 6. Hardware structure o proposed rate estimator VII. SIMULATION RESULTS To evaluate the perormance o the proposed method, JM 9.6 [8] reerence sotware is used in simulation. Dierent types o video sequences are used as test materials. A group o experiments were carried out on the test sequences with dierent quantization parameters. All the simulations are conducted under Windows Vista operating system, with Pentium 4. G CPU and 1 G RAM. The conditions o the simulation are listed in Table 5. The comparison results are produced and tabulated based on the average dierence o total encoding time ( T %), the average PSNR dierences ( PSNR ), and the average bit rate dierences ( Rate %). PSNR and bit rate dierences are calculated according to the numerical averages between RD curves derived rom the original and proposed algorithm, respectively. The detail procedure to calculate these dierences can be ound in [17]. In order to evaluate the complexity reduction, T (%) is deined as ollows Toriginal T proposed T % = 100% (10) Toriginal where, T original denotes the total encoding time o the JM 9.6 encoder and T proposed is total encoding time o the encoder with proposed method. TABLE V SIMULATION CONDITIONS GOP Structure IPPP Quantization Parameters 4/8/3/36 Number o Reerence Frame 1 Hadamard Transorm OFF Search range 16 Symbol mode CAVLC/CABAC RDO optimization ON Frame rate 30 ps Number o rames 50 Fast Motion estimation Enabled A. Results with CAVLC entropy coding method In this experiment, ater selecting the best mode by proposed technique, CAVLC is used or inal entropy coding o best mode. The rate-distortion perormance comparison o the conventional RDO and proposed RDO in terms o PSNR, bitrate, and encoding time are shown in Tables 6. When only rate estimation is used, the degradation o RD perormance is very negligible. We have seen that while only rate estimation is used, the average PSNR degradation is db and the average bit rate increment is 0.615%. When both two algorithms are used, the rate-distortion perormance dierences are always less than. % and 0.07 db in terms o bit rate and PSNR, respectively. Fig. 7 shows the comparison o rate-distortion curves o proposed accelerator with original RD optimization method. The perormance o proposed 44

8 TABLE VI EXPERIMENTAL OF PROPOSED METHOD WHILE CAVLC IS USED AS ENTROPY CODING Sequence only FSSD Rate Estimation PSNR Rate% T % PSNR Rate% T % Foreman ( CIF) Flower ( CIF) Bus (CIF) Akiyo (QCIF) Mobile (QCIF) Salesman (QCIF) Stean ( QCIF) Average TABLE VII EXPERIMENTAL OF PROPOSED METHOD WHILE CABAC IS USED AS ENTROPY CODING Sequence only FSSD Rate Estimation PSNR Rate% T % PSNR Rate% T % Foreman ( CIF) Flower ( CIF) Bus (CIF) Akiyo (QCIF) Mobile (QCIF) Salesman (QCIF) Stean ( QCIF) Average Sequence QP PSNR in db TABLE VIII COMPARISON WITH OTHER RATE ESTIMATION METHODS Method in [13] Method in [11] Method in [9] Bit rate (%) Complexity Reducti on (%) PSNR in db Bit rate (%) Complexity Reduction (%) PSNR in db Bit rate (%) Complexity Reduction (%) Flower (CIF) Bus (CIF) Foreman (CIF) Average method is similar with actual RD curves. As shown in Table 6, the proposed accelerator can reduce the encoding time by around 47% (on average), in which about 14% reduction is achieved by the FSSD and about % reduction is achieved by rate estimation. B. Results with CABAC entropy coding method Although the proposed rate-distortion accelerator is developed based on CAVLC, it is also suitable during mode decision when Content Adaptive Binary Arithmetic Coding (CABAC) is used as entropy coding method. This is because the proposed algorithm is used only or RD optimized mode selection process. Ater selection o best mode, true entropy coding is used. As shown in Table 7, PSNR reduction and bit rate increment are about 0.08 db and.5% on average, respectively. Fig. 8 shows the comparison o RD optimized curves. As shown in Fig. 8, the rate distortion perormance o proposed method is much closed with actual RD optimized curve. C. Comparison with other methods In this experiment, the proposed rate estimation method is compared with other rate estimation methods. Only rate estimation method is enabled and CAVLC is used as inal entropy coding method. Complexity reduction is calculated by (11). Tre T proposed Complexity reduction (%) = 100% (11) Tre 45

9 where, T re is the total encoding time o the other conventional method and T proposed is the total encoding time with proposed rate estimation method only. International Journal o Signal Processing 5: FSSDRate Estimation (a) Akiyo ( QCIF) 34 FSSDRate Estimation (b) Foreman (CIF) 9 FSSDRate Estimation FSSDRate Estimation (c) Mobile (QCIF) (d) Flower (CIF) FSSDRate Estimation FSSDRate Estimation (e) Stean (QCIF) () Bus (CIF) Fig.7. RD perormance o proposed accelerator with CAVLC ( X-axis: Bit rate in kbps, Y-axis: PSNR in db) 46

10 FSSDRate Estimation (a) Akiyo ( QCIF) 34 FSSDRate Estimation (b) Foreman (CIF) 9 FSSDRate Estimation (c) Mobile (QCIF) FSSDRate Estimation (d) Flower (CIF) FSSDRate Estimation FSSDRate Estimation (e) Stean (QCIF) () Bus (CIF) Fig. 8. RD perormance o proposed accelerator with CABAC ( X-axis: Bit rate in kbps, Y-axis: PSNR in db) 47

11 Comparison results are presented in Table 8. Positive values or the PSNR and bit rate dierences indicate increments, and negative values indicate decrements. Negative values o complexity reductions indicate the increment o complexity o proposed method. We have seen that the RD perormance o proposed method is always better than the other methods mentioned in Table 8. As compared to [11], the proposed method increases the video quality by 0.09 db and reduces the bit rate o 1.05% on average with more similar complexity reduction. With compared to [13], the proposed algorithm achieves a better complexity saving which is about 5% and improves the RD perormance slightly. The proposed method achieves more PSNR increment (about 0.10dB) and bit rate reduction (about 3.10%) o [9] with the expanse o a little computational load. VIII. CONCLUSION In this paper, ast rate-distortion optimization method or mode decision o H.64/AVC is proposed. FSSD algorithm avoids conventional quantization, inverse quantization and inverse cosine transorm. Additionally, entropy coding is replaced by ast rate estimation method. The proposed rate distortion accelerator can be used or both CAVLC and CABAC in H.64 encoding. O course, the accuracy or CAVLC should be better but our simulation results show that the perormance or CABAC with use o proposed accelerator in mode selection is similar to ull RD cost encoding using CABAC. The proposed technique reduces total encoding time by about 47 % on average with negligible degradation o coding perormance. The proposed method is suitable or many video application areas such as a digital camera and a mobile phone. ACKNOWLEDGMENT This work was substantially supported by a CERG grant rom Hong Kong Government, Hong Kong SAR, China. [Project No ]. REFERENCES [1] ISO/IEC , Inormation Technology-Coding o audio-visual objects-part:10: Advanced Video Coding. ISO/IEC JTC1/SC9/WG11 (004) [] Weigand T., Sullivan, G. Bjontegaard, and G.,Luthra, A., Overview o H.64/AVC Video Coding Standard, IEEE Trans. Circuits and Syst. Video Technol., vol. 13, pp , July 003. [3] T.Wiegand, H.Schwarz, A.Joch, F. Kossentini, and G.J. Sullivan, Rateconstrained coder control and comparison o video coding standards, IEEE Trans. Circuits and Syst. Video Technol., vol 13, no. 7, pp , July 003 [4] Zhibo Chen, Peng Zhou, Yun He, Fast Integer Pel and Fractional Pel Motion Estimation or JVT, Joint Video Team (JVT) Docs, JVT-F017, Dec. 00. [5] Yang L., Yu K., Li J., Li S. An eective variable block size early termination algorithm or H.64 video coding, IEEE Trans. Circuits and Syst. Video Technol., vol. 15, no. 6, pp , June 005. [6] Feng Pan, Xiao Lin, Susanto Rahardja, Keng Pang Lim, Z.G. Li, Dajun Wu, and Si WU, Fast Mode Decision Algorithm or Intra-prediction in H.64/AVC Video Coding, IEEE Trans. Circuits and Syst. Video Technol., vol. 15, no. 7, pp-813-8, July 005. [7] K.P.Lim, S. Wu, D. J. Wu, S. Rahardja, X.Lin, F.Pan, and Z.G.Li, Fast inter mode selection, Joint Video Team(JVT), Doc. JVT-I00, Sep International Journal o Signal Processing 5: [8] Joint Video Team (JVT) reerence sotware version 9.6, [9] C.H. Tseng, H.M. Wang, and J. F. Yang, Enhanced Intra 4x4 Mode Decision or H.64/AVC Coders, IEEE Trans. Circuits and Syst. Video Technol., vol. 16, no. 8, pp , August 006. [10] Yu-Kuang Tu, Jar-Ferr Yang and Ming-Ting Sun, Eicient Rate- Distortion Estimation or H.64/AVC Coders, IEEE Trans. Circuits and Syst. Video Technol., vol, 16, pp , May 006. [11] Mohammed Golam Sarwer, and Lai Man Po, Fast bit rate estimation or mode decision o H.64/AVC, IEEE Trans. Circuits and Syst. Video Technol., vol, 17, no. 10, October 007, pp [1] Q. Chen and Y. He, A ast bit estimation method or rate distortion optimization in H.64/AVC, in Proceeding o Picture Coding Symp., 004. [13] Zhenyu Wei and King Ngi Ngan, A ast rate-distortion optimization algorithm or H.64/AVC, IEEE ICASSP 007, pp. I-1157-I [14] Lai-Man Po and Kai Guo, Transorm-Domain ast sum o the squared dierence computation or H.64/AVC Rate-Distortion Optimization, IEEE Trans. Circuits and Syst. Video Technol., vol, 17, no 66, pp , June 007. [15] ITU-T Rec. H.64/ISO/IEC , Advanced Video Coding, Final Committee Drat, Documents JVT-E0, September 00. [16] Iain E. G. Richardson, H.64 and MPEG-4 Video Compression video coding or next generation multimedia, John Wily & Sons, pp , 003. [17] G. Bjontegaard, Calculation o average PSNR dierences between RDcurves, presented at the 13 th VCEG-M Meeting, Austin, TX, April 001. Mohammed Golam Sarwer was born in Comilla, Bangladesh in He received B.Sc. and M.Phil degree both in electronic engineering rom Khulna University o Engineering and Technology and City University o Hong Kong in 001 and 007, respectively. Mr. Sarwer was working as a ull time aculty member in Khulna University o Engineering and Technology rom 001 to 005. Currently, he is PhD candidate in the department o electrical and computer engineering, University o Windsor, Canada. He is the member o International Association o Engineers (IAENG). In 003, he won the prime minister gold medal due to outstanding academic perormance o undergraduate level Lai-Man Po (M 9) received the B.Sc. and Ph.D. degrees, both in electronic Engineering, rom the City University o Hong Kong in 1988 and 1991, respectively. Since November 1991, he has been with the Department o Electronic Engineering, City University o Hong Kong, and is currently Associate Proessor. His research interests are in vector quantization, motion estimation and image/video compression. Dr. Po was awarded First Prize (1988) in the Paper Contest or Students and Non-corporate Members organized by the Institute o Electronics and Radio Engineers o Hong Kong, and a Postgraduate Fellowship o the Sir Edward Youde Memorial Council (1988 to 1991). Q.M. Jonathan Wu (M 9) received his Ph.D. degree in electrical engineering rom the University o Wales, Swansea, U.K., in From 1995, he worked at the National Research Council o Canada (NRC) or 10 years where he became a senior research oicer and group leader. He is currently a Proessor in the Department o Electrical and Computer Engineering at the University o Windsor, Canada. Dr. Wu holds the Tier 1 Canada Research Chair (CRC) in Automotive Sensors and Sensing Systems. He has published more than 100 peer-reviewed papers in areas o computer vision, image processing, intelligent systems, robotics, micro-sensors and actuators, and integrated micro-systems. His current research interests include 3D image analysis, active video object tracking and extraction, vision-guided robotics, sensor analysis and usion, wireless sensor network, and integrated micro-systems. Dr. Wu is an Associate Editor or IEEE Transaction on Systems, Man, and Cybernetics (part A). Dr. Wu has served on the Technical Program Committees and International Advisory Committees or many prestigious conerences.

THE newest video coding standard is known as H.264/AVC

THE newest video coding standard is known as H.264/AVC IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 17, NO. 6, JUNE 2007 765 Transform-Domain Fast Sum of the Squared Difference Computation for H.264/AVC Rate-Distortion Optimization

More information

Bit Rate Estimation for Cost Function of H.264/AVC

Bit Rate Estimation for Cost Function of H.264/AVC Bit Rate Estimation for Cost Function of H.264/AVC 257 14 X Bit Rate Estimation for Cost Function of H.264/AVC Mohammed Golam Sarwer 1,2, Lai Man Po 1 and Q. M. Jonathan Wu 2 1 City University of Hong

More information

Enhanced SATD-based cost function for mode selection of H.264/AVC intra coding

Enhanced SATD-based cost function for mode selection of H.264/AVC intra coding SIViP (013) 7:777 786 DOI 10.1007/s11760-011-067-z ORIGINAL PAPER Enhanced SATD-based cost function for mode selection of H.6/AVC intra coding Mohammed Golam Sarwer Q. M. Jonathan Wu Xiao-Ping Zhang Received:

More information

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD 4th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD Simone

More information

Context-adaptive coded block pattern coding for H.264/AVC

Context-adaptive coded block pattern coding for H.264/AVC Context-adaptive coded block pattern coding for H.264/AVC Yangsoo Kim a), Sungjei Kim, Jinwoo Jeong, and Yoonsik Choe b) Department of Electrical and Electronic Engineering, Yonsei University 134, Sinchon-dong,

More information

Motion Vector Prediction With Reference Frame Consideration

Motion Vector Prediction With Reference Frame Consideration Motion Vector Prediction With Reference Frame Consideration Alexis M. Tourapis *a, Feng Wu b, Shipeng Li b a Thomson Corporate Research, 2 Independence Way, Princeton, NJ, USA 855 b Microsoft Research

More information

Analysis of Integer Transformation and Quantization Blocks using H.264 Standard and the Conventional DCT Techniques

Analysis of Integer Transformation and Quantization Blocks using H.264 Standard and the Conventional DCT Techniques Priyanka P James et al, International Journal o Computer Science and Mobile Computing, Vol.3 Issue.3, March- 2014, pg. 873-878 Available Online at www.ijcsmc.com International Journal o Computer Science

More information

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter 1. Introduction The Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG are finalising a new standard for the coding (compression) of natural

More information

AN ENHANCED EARLY DETECTION METHOD FOR ALL ZERO BLOCK IN H.264

AN ENHANCED EARLY DETECTION METHOD FOR ALL ZERO BLOCK IN H.264 st January 0. Vol. 7 No. 005-0 JATIT & LLS. All rights reserved. ISSN: 99-865 www.jatit.org E-ISSN: 87-95 AN ENHANCED EARLY DETECTION METHOD FOR ALL ZERO BLOCK IN H.6 CONG-DAO HAN School of Electrical

More information

4x4 Transform and Quantization in H.264/AVC

4x4 Transform and Quantization in H.264/AVC Video compression design, analysis, consulting and research White Paper: 4x4 Transform and Quantization in H.264/AVC Iain Richardson / VCodex Limited Version 1.2 Revised November 2010 H.264 Transform and

More information

1 Overview. Coding flow

1 Overview. Coding flow Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6) 3 rd Meeting: Fairfax, Virginia, USA, 6-10 May, 2002 Document: JVT-C028 Filename: JVT-C028.doc Title: Status:

More information

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders Tomasz Grajek and Marek Domański Poznan University of Technology Chair of Multimedia Telecommunications and Microelectronics ul.

More information

CHAPTER 3. Implementation of Transformation, Quantization, Inverse Transformation, Inverse Quantization and CAVLC for H.

CHAPTER 3. Implementation of Transformation, Quantization, Inverse Transformation, Inverse Quantization and CAVLC for H. CHAPTER 3 Implementation of Transformation, Quantization, Inverse Transformation, Inverse Quantization and CAVLC for H.264 Video Encoder 3.1 Introduction The basics of video processing in H.264 Encoder

More information

Direction-Adaptive Transforms for Coding Prediction Residuals

Direction-Adaptive Transforms for Coding Prediction Residuals MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Direction-Adaptive Transforms for Coding Prediction Residuals Robert Cohen, Sven Klomp, Anthony Vetro, Huifang Sun TR2010-090 November 2010

More information

Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse

Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse Intra Frame Coding for Advanced Video Coding Standard to reduce Bitrate and obtain consistent PSNR Using Gaussian Pulse Manjanaik.N1, Dr.Manjunath.R2 Electronics and Communication Engineering, Jain University,

More information

SSIM-Inspired Perceptual Video Coding for HEVC

SSIM-Inspired Perceptual Video Coding for HEVC 2012 IEEE International Conference on Multimedia and Expo SSIM-Inspired Perceptual Video Coding for HEVC Abdul Rehman and Zhou Wang Dept. of Electrical and Computer Engineering, University of Waterloo,

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC9/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC9/WG11 MPEG 98/M3833 July 1998 Source:

More information

The training of Karhunen Loève transform matrix and its application for H.264 intra coding

The training of Karhunen Loève transform matrix and its application for H.264 intra coding DOI 10.1007/s11042-008-0221-4 The training of Karhunen Loève transform matrix and its application for H.264 intra coding Yi Gao & Jiazhong Chen & Shengsheng Yu & Jingli Zhou & Lai-Man Po # Springer Science

More information

Converting DCT Coefficients to H.264/AVC

Converting DCT Coefficients to H.264/AVC MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Converting DCT Coefficients to H.264/AVC Jun Xin, Anthony Vetro, Huifang Sun TR2004-058 June 2004 Abstract Many video coding schemes, including

More information

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL 12, NO 11, NOVEMBER 2002 957 Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression Markus Flierl, Student

More information

Fast Intra Coding Method of H.264 for Video Surveillance System

Fast Intra Coding Method of H.264 for Video Surveillance System 76 IJCSNS International Journal of Computer Science and Network Security, OL.7 No.10, October 2007 Fast Intra Coding Method of.264 for ideo Surveillance System Jang-Seon Ryu and Eung-Tea Kim, Korea Polytechnic

More information

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding

Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding Enhanced Stochastic Bit Reshuffling for Fine Granular Scalable Video Coding Wen-Hsiao Peng, Tihao Chiang, Hsueh-Ming Hang, and Chen-Yi Lee National Chiao-Tung University 1001 Ta-Hsueh Rd., HsinChu 30010,

More information

Modelling of produced bit rate through the percentage of null quantized transform coefficients ( zeros )

Modelling of produced bit rate through the percentage of null quantized transform coefficients ( zeros ) Rate control strategies in H264 Simone Milani (simone.milani@dei.unipd.it) with the collaboration of Università degli Studi di adova ST Microelectronics Summary General scheme of the H.264 encoder Rate

More information

Introduction to Video Compression H.261

Introduction to Video Compression H.261 Introduction to Video Compression H.6 Dirk Farin, Contact address: Dirk Farin University of Mannheim Dept. Computer Science IV L 5,6, 683 Mannheim, Germany farin@uni-mannheim.de D.F. YUV-Colorspace Computer

More information

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding Aditya Mavlankar, Chuo-Ling Chang, and Bernd Girod Information Systems Laboratory, Department of Electrical

More information

Numerical Methods - Lecture 2. Numerical Methods. Lecture 2. Analysis of errors in numerical methods

Numerical Methods - Lecture 2. Numerical Methods. Lecture 2. Analysis of errors in numerical methods Numerical Methods - Lecture 1 Numerical Methods Lecture. Analysis o errors in numerical methods Numerical Methods - Lecture Why represent numbers in loating point ormat? Eample 1. How a number 56.78 can

More information

LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST. Fatih Kamisli. Middle East Technical University Ankara, Turkey

LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST. Fatih Kamisli. Middle East Technical University Ankara, Turkey LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST Fatih Kamisli Middle East Technical University Ankara, Turkey ABSTRACT It is desirable to support efficient lossless coding within video coding

More information

RATIONAL FUNCTIONS. Finding Asymptotes..347 The Domain Finding Intercepts Graphing Rational Functions

RATIONAL FUNCTIONS. Finding Asymptotes..347 The Domain Finding Intercepts Graphing Rational Functions RATIONAL FUNCTIONS Finding Asymptotes..347 The Domain....350 Finding Intercepts.....35 Graphing Rational Functions... 35 345 Objectives The ollowing is a list o objectives or this section o the workbook.

More information

Detailed Review of H.264/AVC

Detailed Review of H.264/AVC Detailed Review of H.264/AVC, Ph.D.. abuhajar@digitavid.net (408) 506-2776 P.O. BOX:720998 San Jose, CA 95172 1 Outline Common Terminologies Color Space Macroblock and Slice Type Slice Block Diagram Intra-Prediction

More information

Improvement of Sparse Computation Application in Power System Short Circuit Study

Improvement of Sparse Computation Application in Power System Short Circuit Study Volume 44, Number 1, 2003 3 Improvement o Sparse Computation Application in Power System Short Circuit Study A. MEGA *, M. BELKACEMI * and J.M. KAUFFMANN ** * Research Laboratory LEB, L2ES Department o

More information

MODERN video coding standards, such as H.263, H.264,

MODERN video coding standards, such as H.263, H.264, 146 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 1, JANUARY 2006 Analysis of Multihypothesis Motion Compensated Prediction (MHMCP) for Robust Visual Communication Wei-Ying

More information

Fast Bi-Directional Prediction Selection in H.264/MPEG-4 AVC Temporal Scalable Video Coding

Fast Bi-Directional Prediction Selection in H.264/MPEG-4 AVC Temporal Scalable Video Coding 3508 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 12, DECEMBER 2011 Fast Bi-Directional Prediction Selection in H.264/MPEG-4 AVC Temporal Scalable Video Coding Hung-Chih Lin, Hsueh-Ming Hang, Fellow,

More information

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2006 jzhang@cse.unsw.edu.au

More information

H.264/MPEG4 Part INTRODUCTION Terminology

H.264/MPEG4 Part INTRODUCTION Terminology 6 H.264/MPEG4 Part 10 6.1 INTRODUCTION The Moving Picture Experts Group and the Video Coding Experts Group (MPEG and VCEG) have developed a new standard that promises to outperform the earlier MPEG-4 and

More information

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5 Lecture : Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP959 Multimedia Systems S 006 jzhang@cse.unsw.edu.au Acknowledgement

More information

Achieving H.264-like compression efficiency with distributed video coding

Achieving H.264-like compression efficiency with distributed video coding Achieving H.264-like compression efficiency with distributed video coding Simone Milani a, Jiajun Wang b and Kannan Ramchandran b a Dept. of Information Engineering, University of Padova, Italy. b Dept.

More information

SCALABLE AUDIO CODING USING WATERMARKING

SCALABLE AUDIO CODING USING WATERMARKING SCALABLE AUDIO CODING USING WATERMARKING Mahmood Movassagh Peter Kabal Department of Electrical and Computer Engineering McGill University, Montreal, Canada Email: {mahmood.movassagh@mail.mcgill.ca, peter.kabal@mcgill.ca}

More information

6. H.261 Video Coding Standard

6. H.261 Video Coding Standard 6. H.261 Video Coding Standard ITU-T (formerly CCITT) H-Series of Recommendations 1. H.221 - Frame structure for a 64 to 1920 kbits/s channel in audiovisual teleservices 2. H.230 - Frame synchronous control

More information

3drs e3drs fs e3drs fs Rate (kbps) Mother and Daughter (b) Miss America (a) 140.

3drs e3drs fs e3drs fs Rate (kbps) Mother and Daughter (b) Miss America (a) 140. oise{robust Recursive otion Estimation for H.263{based videoconferencing systems Stefano Olivieriy, Gerard de Haan z, and Luigi Albaniy yphilips S.p.A, Philips Research onza Via Philips, 12, 252 onza (I,

More information

FREQUENCY DOMAIN SINGULAR VALUE DECOMPOSITION FOR EFFICIENT SPATIAL AUDIO CODING. Sina Zamani, Tejaswi Nanjundaswamy, Kenneth Rose

FREQUENCY DOMAIN SINGULAR VALUE DECOMPOSITION FOR EFFICIENT SPATIAL AUDIO CODING. Sina Zamani, Tejaswi Nanjundaswamy, Kenneth Rose FREQUENCY DOMAIN SINGULAR VALUE DECOMPOSITION FOR EFFICIENT SPATIAL AUDIO CODING Sina Zamani, Tejaswi Nanjundaswamy, Kenneth Rose Department o Electrical and Computer Engineering, University o Caliornia

More information

Probabilistic Model of Error in Fixed-Point Arithmetic Gaussian Pyramid

Probabilistic Model of Error in Fixed-Point Arithmetic Gaussian Pyramid Probabilistic Model o Error in Fixed-Point Arithmetic Gaussian Pyramid Antoine Méler John A. Ruiz-Hernandez James L. Crowley INRIA Grenoble - Rhône-Alpes 655 avenue de l Europe 38 334 Saint Ismier Cedex

More information

The MPEG4/AVC standard: description and basic tasks splitting

The MPEG4/AVC standard: description and basic tasks splitting The MPEG/AVC standard: description and basic tasks splitting Isabelle Hurbain 1 Centre de recherche en informatique École des Mines de Paris hurbain@cri.ensmp.fr January 7, 00 1 35, rue Saint-Honoré, 77305

More information

arxiv: v2 [cs.sd] 15 May 2017

arxiv: v2 [cs.sd] 15 May 2017 FREQUENCY DOMAIN SINGULAR VALUE DECOMPOSITION FOR EFFICIENT SPATIAL AUDIO CODING Sina Zamani, Tejaswi Nanjundaswamy, Kenneth Rose Department o Electrical and Computer Engineering, University o Caliornia

More information

A Particle Swarm Optimization Algorithm for Neighbor Selection in Peer-to-Peer Networks

A Particle Swarm Optimization Algorithm for Neighbor Selection in Peer-to-Peer Networks A Particle Swarm Optimization Algorithm or Neighbor Selection in Peer-to-Peer Networks Shichang Sun 1,3, Ajith Abraham 2,4, Guiyong Zhang 3, Hongbo Liu 3,4 1 School o Computer Science and Engineering,

More information

OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION

OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION OPTIMAL PLACEMENT AND UTILIZATION OF PHASOR MEASUREMENTS FOR STATE ESTIMATION Xu Bei, Yeo Jun Yoon and Ali Abur Teas A&M University College Station, Teas, U.S.A. abur@ee.tamu.edu Abstract This paper presents

More information

High Throughput Entropy Coding in the HEVC Standard

High Throughput Entropy Coding in the HEVC Standard DOI 10.1007/s11265-014-0900-5 High Throughput Entropy Coding in the HEVC Standard Jung-Ah Choi & Yo-Sung Ho Received: 13 June 2013 /Accepted: 21 April 2014 # Springer Science+Business Media New York 2014

More information

Finger Search in the Implicit Model

Finger Search in the Implicit Model Finger Search in the Implicit Model Gerth Stølting Brodal, Jesper Sindahl Nielsen, Jakob Truelsen MADALGO, Department o Computer Science, Aarhus University, Denmark. {gerth,jasn,jakobt}@madalgo.au.dk Abstract.

More information

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction SPIE Conference on Visual Communications and Image Processing, Perth, Australia, June 2000 1 A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction Markus Flierl, Thomas

More information

Equidistant Polarizing Transforms

Equidistant Polarizing Transforms DRAFT 1 Equidistant Polarizing Transorms Sinan Kahraman Abstract arxiv:1708.0133v1 [cs.it] 3 Aug 017 This paper presents a non-binary polar coding scheme that can reach the equidistant distant spectrum

More information

Objectives. By the time the student is finished with this section of the workbook, he/she should be able

Objectives. By the time the student is finished with this section of the workbook, he/she should be able FUNCTIONS Quadratic Functions......8 Absolute Value Functions.....48 Translations o Functions..57 Radical Functions...61 Eponential Functions...7 Logarithmic Functions......8 Cubic Functions......91 Piece-Wise

More information

Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding

Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding 1 Redundancy Allocation Based on Weighted Mismatch-Rate Slope for Multiple Description Video Coding Mohammad Kazemi, Razib Iqbal, Shervin Shirmohammadi Abstract Multiple Description Coding (MDC) is a robust

More information

on a per-coecient basis in large images is computationally expensive. Further, the algorithm in [CR95] needs to be rerun, every time a new rate of com

on a per-coecient basis in large images is computationally expensive. Further, the algorithm in [CR95] needs to be rerun, every time a new rate of com Extending RD-OPT with Global Thresholding for JPEG Optimization Viresh Ratnakar University of Wisconsin-Madison Computer Sciences Department Madison, WI 53706 Phone: (608) 262-6627 Email: ratnakar@cs.wisc.edu

More information

A VC-1 TO H.264/AVC INTRA TRANSCODING USING ENCODING INFORMATION TO REDUCE RE-QUANTIZATION NOISE

A VC-1 TO H.264/AVC INTRA TRANSCODING USING ENCODING INFORMATION TO REDUCE RE-QUANTIZATION NOISE A VC- TO H.6/AVC INTRA TRANSCODING USING ENCODING INFORMATION TO REDUCE RE-QUANTIZATION NOISE Takeshi Yoshitome, Yoshiyuki Nakajima, and Kazuto Kamikura NTT Cyer Space Laoratories, NTT Corporation, Yokosuka-shi,

More information

Diversity-Fidelity Tradeoff in Transmission of Analog Sources over MIMO Fading Channels

Diversity-Fidelity Tradeoff in Transmission of Analog Sources over MIMO Fading Channels Diversity-Fidelity Tradeo in Transmission o Analog Sources over MIMO Fading Channels Mahmoud Taherzadeh, Kamyar Moshksar and Amir K. Khandani Coding & Signal Transmission Laboratory www.cst.uwaterloo.ca

More information

Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources

Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources Russell M. Mersereau Center for Signal and Image Processing Georgia Institute of Technology Outline Cache vector quantization Lossless

More information

MULTIVIEW video can provide real depth perception,

MULTIVIEW video can provide real depth perception, 390 IEEE TRANSACTIONS ON BROADCASTING, VOL. 59, NO. 2, JUNE 2013 DIRECT Mode Early Decision Optimization Based on Rate Distortion Cost Property and Inter-view Correlation Yun Zhang, Sam Kwong, Senior Member,

More information

Least-Squares Spectral Analysis Theory Summary

Least-Squares Spectral Analysis Theory Summary Least-Squares Spectral Analysis Theory Summary Reerence: Mtamakaya, J. D. (2012). Assessment o Atmospheric Pressure Loading on the International GNSS REPRO1 Solutions Periodic Signatures. Ph.D. dissertation,

More information

A DISTRIBUTED VIDEO CODER BASED ON THE H.264/AVC STANDARD

A DISTRIBUTED VIDEO CODER BASED ON THE H.264/AVC STANDARD 5th European Signal Processing Conference (EUSIPCO 27), Poznan, Poland, September 3-7, 27, copyright by EURASIP A DISTRIBUTED VIDEO CODER BASED ON THE /AVC STANDARD Simone Milani and Giancarlo Calvagno

More information

CHAPTER 8 ANALYSIS OF AVERAGE SQUARED DIFFERENCE SURFACES

CHAPTER 8 ANALYSIS OF AVERAGE SQUARED DIFFERENCE SURFACES CAPTER 8 ANALYSS O AVERAGE SQUARED DERENCE SURACES n Chapters 5, 6, and 7, the Spectral it algorithm was used to estimate both scatterer size and total attenuation rom the backscattered waveorms by minimizing

More information

CSE 408 Multimedia Information System Yezhou Yang

CSE 408 Multimedia Information System Yezhou Yang Image and Video Compression CSE 408 Multimedia Information System Yezhou Yang Lots of slides from Hassan Mansour Class plan Today: Project 2 roundup Today: Image and Video compression Nov 10: final project

More information

TLT-5200/5206 COMMUNICATION THEORY, Exercise 3, Fall TLT-5200/5206 COMMUNICATION THEORY, Exercise 3, Fall Problem 1.

TLT-5200/5206 COMMUNICATION THEORY, Exercise 3, Fall TLT-5200/5206 COMMUNICATION THEORY, Exercise 3, Fall Problem 1. TLT-5/56 COMMUNICATION THEORY, Exercise 3, Fall Problem. The "random walk" was modelled as a random sequence [ n] where W[i] are binary i.i.d. random variables with P[W[i] = s] = p (orward step with probability

More information

Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences

Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences Estimation-Theoretic Delayed Decoding of Predictively Encoded Video Sequences Jingning Han, Vinay Melkote, and Kenneth Rose Department of Electrical and Computer Engineering University of California, Santa

More information

The achievable limits of operational modal analysis. * Siu-Kui Au 1)

The achievable limits of operational modal analysis. * Siu-Kui Au 1) The achievable limits o operational modal analysis * Siu-Kui Au 1) 1) Center or Engineering Dynamics and Institute or Risk and Uncertainty, University o Liverpool, Liverpool L69 3GH, United Kingdom 1)

More information

Physics 5153 Classical Mechanics. Solution by Quadrature-1

Physics 5153 Classical Mechanics. Solution by Quadrature-1 October 14, 003 11:47:49 1 Introduction Physics 5153 Classical Mechanics Solution by Quadrature In the previous lectures, we have reduced the number o eective degrees o reedom that are needed to solve

More information

Predictive Coding. Prediction

Predictive Coding. Prediction Predictive Coding Prediction Prediction in Images Principle of Differential Pulse Code Modulation (DPCM) DPCM and entropy-constrained scalar quantization DPCM and transmission errors Adaptive intra-interframe

More information

Detecting Doubly Compressed JPEG Images by Factor Histogram

Detecting Doubly Compressed JPEG Images by Factor Histogram APSIPA ASC 2 Xi an Detecting Doubly Compressed JPEG Images by Factor Histogram Jianquan Yang a, Guopu Zhu a, and Jiwu Huang b a Shenzhen Institutes o Advanced Technology, Chinese Academy o Sciences, Shenzhen,

More information

Computer Engineering Mekelweg 4, 2628 CD Delft The Netherlands MSc THESIS

Computer Engineering Mekelweg 4, 2628 CD Delft The Netherlands  MSc THESIS Computer Engineering Mekelweg 4, 2628 CD Delft The Netherlands http://ce.et.tudelft.nl/ 2010 MSc THESIS Analysis and Implementation of the H.264 CABAC entropy decoding engine Martinus Johannes Pieter Berkhoff

More information

1462 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 10, OCTOBER 2009

1462 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 10, OCTOBER 2009 1462 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 10, OCTOBER 2009 2-D Order-16 Integer Transforms for HD Video Coding Jie Dong, Student Member, IEEE, King Ngi Ngan, Fellow,

More information

VIDEO CODING USING A SELF-ADAPTIVE REDUNDANT DICTIONARY CONSISTING OF SPATIAL AND TEMPORAL PREDICTION CANDIDATES. Author 1 and Author 2

VIDEO CODING USING A SELF-ADAPTIVE REDUNDANT DICTIONARY CONSISTING OF SPATIAL AND TEMPORAL PREDICTION CANDIDATES. Author 1 and Author 2 VIDEO CODING USING A SELF-ADAPTIVE REDUNDANT DICTIONARY CONSISTING OF SPATIAL AND TEMPORAL PREDICTION CANDIDATES Author 1 and Author 2 Address - Line 1 Address - Line 2 Address - Line 3 ABSTRACT All standard

More information

RESOLUTION MSC.362(92) (Adopted on 14 June 2013) REVISED RECOMMENDATION ON A STANDARD METHOD FOR EVALUATING CROSS-FLOODING ARRANGEMENTS

RESOLUTION MSC.362(92) (Adopted on 14 June 2013) REVISED RECOMMENDATION ON A STANDARD METHOD FOR EVALUATING CROSS-FLOODING ARRANGEMENTS (Adopted on 4 June 203) (Adopted on 4 June 203) ANNEX 8 (Adopted on 4 June 203) MSC 92/26/Add. Annex 8, page THE MARITIME SAFETY COMMITTEE, RECALLING Article 28(b) o the Convention on the International

More information

Two-step self-tuning phase-shifting interferometry

Two-step self-tuning phase-shifting interferometry Two-step sel-tuning phase-shiting intererometry J. Vargas, 1,* J. Antonio Quiroga, T. Belenguer, 1 M. Servín, 3 J. C. Estrada 3 1 Laboratorio de Instrumentación Espacial, Instituto Nacional de Técnica

More information

Compression and Coding

Compression and Coding Compression and Coding Theory and Applications Part 1: Fundamentals Gloria Menegaz 1 Transmitter (Encoder) What is the problem? Receiver (Decoder) Transformation information unit Channel Ordering (significance)

More information

Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control

Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control Application of a Bi-Geometric Transparent Composite Model to HEVC: Residual Data Modelling and Rate Control by Yueming Gao A thesis presented to the University of Waterloo in fulfilment of the thesis requirement

More information

Phase-Correlation Motion Estimation Yi Liang

Phase-Correlation Motion Estimation Yi Liang EE 392J Final Project Abstract Phase-Correlation Motion Estimation Yi Liang yiliang@stanford.edu Phase-correlation motion estimation is studied and implemented in this work, with its performance, efficiency

More information

A Comparative Study of Non-separable Wavelet and Tensor-product. Wavelet; Image Compression

A Comparative Study of Non-separable Wavelet and Tensor-product. Wavelet; Image Compression Copyright c 007 Tech Science Press CMES, vol., no., pp.91-96, 007 A Comparative Study o Non-separable Wavelet and Tensor-product Wavelet in Image Compression Jun Zhang 1 Abstract: The most commonly used

More information

Introduction. Outline. CSC/ECE 574 Computer and Network Security. Secret Keys or Secret Algorithms? Secrets? (Cont d) Secret Key Cryptography

Introduction. Outline. CSC/ECE 574 Computer and Network Security. Secret Keys or Secret Algorithms? Secrets? (Cont d) Secret Key Cryptography Outline CSC/ECE 574 Computer and Network Security Introductory Remarks Feistel Cipher DES AES Topic 3.1 Secret Key Cryptography Algorithms CSC/ECE 574 Dr. Peng Ning 1 CSC/ECE 574 Dr. Peng Ning 2 Secret

More information

Definition: Let f(x) be a function of one variable with continuous derivatives of all orders at a the point x 0, then the series.

Definition: Let f(x) be a function of one variable with continuous derivatives of all orders at a the point x 0, then the series. 2.4 Local properties o unctions o several variables In this section we will learn how to address three kinds o problems which are o great importance in the ield o applied mathematics: how to obtain the

More information

Image Coding Algorithm Based on All Phase Walsh Biorthogonal Transform

Image Coding Algorithm Based on All Phase Walsh Biorthogonal Transform Image Coding Algorithm Based on All Phase Walsh Biorthogonal ransform Chengyou Wang, Zhengxin Hou, Aiping Yang (chool of Electronic Information Engineering, ianin University, ianin 72 China) wangchengyou@tu.edu.cn,

More information

New Results on Boomerang and Rectangle Attacks

New Results on Boomerang and Rectangle Attacks New Results on Boomerang and Rectangle Attacks Eli Biham, 1 Orr Dunkelman, 1 Nathan Keller 2 1 Computer Science Department, Technion. Haia 32000, Israel {biham,orrd}@cs.technion.ac.il 2 Mathematics Department,

More information

Driving Point Impedance Computation Applying Parallel Processing Techniques

Driving Point Impedance Computation Applying Parallel Processing Techniques 7th WSEAS Int. Con. on MATHEMATICAL METHODS and COMPUTATIONAL TECHNIQUES IN ELECTRICAL ENGINEERING, Soia, 27-29/10/05 (pp229-234) Driving Point Impedance Computation Applying Parallel Processing Techniques

More information

A VLSI Algorithm for Modular Multiplication/Division

A VLSI Algorithm for Modular Multiplication/Division A VLSI Algorithm for Modular Multiplication/Division Marcelo E. Kaihara and Naofumi Takagi Department of Information Engineering Nagoya University Nagoya, 464-8603, Japan mkaihara@takagi.nuie.nagoya-u.ac.jp

More information

A TWO-STAGE VIDEO CODING FRAMEWORK WITH BOTH SELF-ADAPTIVE REDUNDANT DICTIONARY AND ADAPTIVELY ORTHONORMALIZED DCT BASIS

A TWO-STAGE VIDEO CODING FRAMEWORK WITH BOTH SELF-ADAPTIVE REDUNDANT DICTIONARY AND ADAPTIVELY ORTHONORMALIZED DCT BASIS A TWO-STAGE VIDEO CODING FRAMEWORK WITH BOTH SELF-ADAPTIVE REDUNDANT DICTIONARY AND ADAPTIVELY ORTHONORMALIZED DCT BASIS Yuanyi Xue, Yi Zhou, and Yao Wang Department of Electrical and Computer Engineering

More information

DELIVERING video of good quality over the Internet or

DELIVERING video of good quality over the Internet or 1448 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 10, OCTOBER 2009 Error Resilient Video Coding Using B Pictures in H.264 Mengyao Ma, Student Member, IEEE, Oscar C. Au,

More information

Bayesian Technique for Reducing Uncertainty in Fatigue Failure Model

Bayesian Technique for Reducing Uncertainty in Fatigue Failure Model 9IDM- Bayesian Technique or Reducing Uncertainty in Fatigue Failure Model Sriram Pattabhiraman and Nam H. Kim University o Florida, Gainesville, FL, 36 Copyright 8 SAE International ABSTRACT In this paper,

More information

Analysis on the Strengthening Buffer Operator Based on the Strictly Monotone Function

Analysis on the Strengthening Buffer Operator Based on the Strictly Monotone Function International Journal o Applied Physics and Mathematics Vol. 3 No. March 03 Analysis on the Strengthening Buer Operator Based on the Strictly Monotone Function Hu Xiao-Li Wu Zheng-Peng and Han Ran Abstract

More information

Lec 04 Variable Length Coding (VLC) in JPEG

Lec 04 Variable Length Coding (VLC) in JPEG ECE 5578 Multimedia Communication Lec 04 Variable Length Coding (VLC) in JPEG Zhu Li Dept of CSEE, UMKC Z. Li Multimedia Communciation, 2018 p.1 Outline Lecture 03 ReCap VLC JPEG Image Coding Framework

More information

A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv Techniques

A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv Techniques Hindawi Publishing Corporation EURASIP Journal on Applied Signal Processing Volume 26, Article ID 6971, Pages 1 18 DOI 1.1155/ASP/26/6971 A Framework for Adaptive Scalable Video Coding Using Wyner-Ziv

More information

Mitsuru Matsui , Ofuna, Kamakura, Kanagawa, 247, Japan. which are block ciphers with a 128-bit key, a 64-bit block and a variable

Mitsuru Matsui , Ofuna, Kamakura, Kanagawa, 247, Japan. which are block ciphers with a 128-bit key, a 64-bit block and a variable New Block Encryption Algorithm MISTY Mitsuru Matsui Inormation Technology R&D Center Mitsubishi Electric Corporation 5-1-1, Ouna, Kamakura, Kanagawa, 247, Japan matsui@iss.isl.melco.co.jp Abstract. We

More information

Curve Sketching. The process of curve sketching can be performed in the following steps:

Curve Sketching. The process of curve sketching can be performed in the following steps: Curve Sketching So ar you have learned how to ind st and nd derivatives o unctions and use these derivatives to determine where a unction is:. Increasing/decreasing. Relative extrema 3. Concavity 4. Points

More information

Predictive Coding. Prediction Prediction in Images

Predictive Coding. Prediction Prediction in Images Prediction Prediction in Images Predictive Coding Principle of Differential Pulse Code Modulation (DPCM) DPCM and entropy-constrained scalar quantization DPCM and transmission errors Adaptive intra-interframe

More information

A New Pipelined Architecture for JPEG2000 MQ-Coder

A New Pipelined Architecture for JPEG2000 MQ-Coder , October 24-26, 2012, San Francisco, USA A New Pipelined Architecture for JPEG2000 MQ-Coder M. Ahmadvand, and A. Ezhdehakosh Abstract JPEG2000 has become one of the most rewarding image coding standards.

More information

IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER. Shohei Matsuo, Seishi Takamura, and Hirohisa Jozawa

IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER. Shohei Matsuo, Seishi Takamura, and Hirohisa Jozawa 2th European Signal Processing Conference (EUSIPCO 212 Bucharest, Romania, August 27-31, 212 IMPROVED INTRA ANGULAR PREDICTION BY DCT-BASED INTERPOLATION FILTER Shohei Matsuo, Seishi Takamura, and Hirohisa

More information

Intra Prediction by a linear combination of Template Matching predictors

Intra Prediction by a linear combination of Template Matching predictors Intra Prediction by a linear combination of Template Matching predictors Laurent Guillo, Tangi Poirier, Christine Guillemot To cite this version: Laurent Guillo, Tangi Poirier, Christine Guillemot. Intra

More information

LATTICE VECTOR QUANTIZATION FOR IMAGE CODING USING EXPANSION OF CODEBOOK

LATTICE VECTOR QUANTIZATION FOR IMAGE CODING USING EXPANSION OF CODEBOOK LATTICE VECTOR QUANTIZATION FOR IMAGE CODING USING EXPANSION OF CODEBOOK R. R. Khandelwal 1, P. K. Purohit 2 and S. K. Shriwastava 3 1 Shri Ramdeobaba College Of Engineering and Management, Nagpur richareema@rediffmail.com

More information

Image Compression - JPEG

Image Compression - JPEG Overview of JPEG CpSc 86: Multimedia Systems and Applications Image Compression - JPEG What is JPEG? "Joint Photographic Expert Group". Voted as international standard in 99. Works with colour and greyscale

More information

ROBUST STABILITY AND PERFORMANCE ANALYSIS OF UNSTABLE PROCESS WITH DEAD TIME USING Mu SYNTHESIS

ROBUST STABILITY AND PERFORMANCE ANALYSIS OF UNSTABLE PROCESS WITH DEAD TIME USING Mu SYNTHESIS ROBUST STABILITY AND PERFORMANCE ANALYSIS OF UNSTABLE PROCESS WITH DEAD TIME USING Mu SYNTHESIS I. Thirunavukkarasu 1, V. I. George 1, G. Saravana Kumar 1 and A. Ramakalyan 2 1 Department o Instrumentation

More information

L. Yaroslavsky. Fundamentals of Digital Image Processing. Course

L. Yaroslavsky. Fundamentals of Digital Image Processing. Course L. Yaroslavsky. Fundamentals of Digital Image Processing. Course 0555.330 Lec. 6. Principles of image coding The term image coding or image compression refers to processing image digital data aimed at

More information

Supplementary Information Reconstructing propagation networks with temporal similarity

Supplementary Information Reconstructing propagation networks with temporal similarity Supplementary Inormation Reconstructing propagation networks with temporal similarity Hao Liao and An Zeng I. SI NOTES A. Special range. The special range is actually due to two reasons: the similarity

More information

Bond strength model for interfaces between nearsurface mounted (NSM) CFRP strips and concrete

Bond strength model for interfaces between nearsurface mounted (NSM) CFRP strips and concrete University o Wollongong Research Online Faculty o Engineering and Inormation Sciences - Papers: Part A Faculty o Engineering and Inormation Sciences 2014 Bond strength model or interaces between nearsurace

More information

The Impact of Carries on the Complexity of Collision Attacks on SHA-1

The Impact of Carries on the Complexity of Collision Attacks on SHA-1 The Impact o Carries on the Complexity o Collision Attacks on SHA-1 Florian Mendel, Norbert Pramstaller, Christian Rechberger and Vincent Rijmen Norbert.Pramstaller@iaik.tugraz.at Institute or Applied

More information