Ragav Venkatesan, 2 Christine Zwart, 2,3 David Frakes, 1 Baoxin Li

Similar documents
Image information measurement for video retrieval

Single-Image-Based Rain and Snow Removal Using Multi-guided Filter

Truncation Strategy of Tensor Compressive Sensing for Noisy Video Sequences

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Using Entropy and 2-D Correlation Coefficient as Measuring Indices for Impulsive Noise Reduction Techniques

Intraframe Prediction with Intraframe Update Step for Motion-Compensated Lifted Wavelet Video Coding

Lecture 7 Predictive Coding & Quantization

MODERN video coding standards, such as H.263, H.264,

CS4495/6495 Introduction to Computer Vision. 6B-L1 Dense flow: Brightness constraint

arxiv: v1 [cs.cv] 10 Feb 2016

h 8x8 chroma a b c d Boundary filtering: 16x16 luma H.264 / MPEG-4 Part 10 : Intra Prediction H.264 / MPEG-4 Part 10 White Paper Reconstruction Filter

Robust Motion Segmentation by Spectral Clustering

A Video Codec Incorporating Block-Based Multi-Hypothesis Motion-Compensated Prediction

Reading. 3. Image processing. Pixel movement. Image processing Y R I G Q

The New Graphic Description of the Haar Wavelet Transform

Conjugate gradient acceleration of non-linear smoothing filters Iterated edge-preserving smoothing

Stability of Recursive Gaussian Filtering for Piecewise Linear Bilateral Filtering

Multi-scale Geometric Summaries for Similarity-based Upstream S

Luminance, disparity and range statistics in 3D natural scenes

METRIC. Ming-Te WU and Shiunn-Jang CHERN

Analysis of Redundant-Wavelet Multihypothesis for Motion Compensation

Covariance Tracking Algorithm on Bilateral Filtering under Lie Group Structure Yinghong Xie 1,2,a Chengdong Wu 1,b

LORD: LOw-complexity, Rate-controlled, Distributed video coding system

Low-Complexity FPGA Implementation of Compressive Sensing Reconstruction

GRAPH SIGNAL PROCESSING: A STATISTICAL VIEWPOINT

MRI Reconstruction via Fourier Frames on Interleaving Spirals

Space-Time Kernels. Dr. Jiaqiu Wang, Dr. Tao Cheng James Haworth University College London

1. INTRODUCTION. Yen-Chung Chiu. Wen-Hsiang Tsai.

Prediction-Guided Quantization for Video Tone Mapping

Fast Nonnegative Matrix Factorization with Rank-one ADMM

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms

Vector Quantizers for Reduced Bit-Rate Coding of Correlated Sources

Enhanced SATD-based cost function for mode selection of H.264/AVC intra coding

Lecture 04 Image Filtering

Rate-Constrained Multihypothesis Prediction for Motion-Compensated Video Compression

Shape Outlier Detection Using Pose Preserving Dynamic Shape Models

A TWO-STAGE VIDEO CODING FRAMEWORK WITH BOTH SELF-ADAPTIVE REDUNDANT DICTIONARY AND ADAPTIVELY ORTHONORMALIZED DCT BASIS

SPEECH ANALYSIS AND SYNTHESIS

A DELAY-DEPENDENT APPROACH TO DESIGN STATE ESTIMATOR FOR DISCRETE STOCHASTIC RECURRENT NEURAL NETWORK WITH INTERVAL TIME-VARYING DELAYS

A Contrario Detection of False Matches in Iris Recognition

Review: Learning Bimodal Structures in Audio-Visual Data

Detailed Review of H.264/AVC

Image Recognition Using Modified Zernike Moments

Part III Super-Resolution with Sparsity

Tutorial on Methods for Interpreting and Understanding Deep Neural Networks. Part 3: Applications & Discussion

Light Field Image Compression with Sub-apertures Reordering and Adaptive Reconstruction

Visual Selection and Attention Shifting Based on FitzHugh-Nagumo Equations

SVAN 2016 Mini Course: Stochastic Convex Optimization Methods in Machine Learning

Deep Reinforcement Learning SISL. Jeremy Morton (jmorton2) November 7, Stanford Intelligent Systems Laboratory

Density Propagation for Continuous Temporal Chains Generative and Discriminative Models

Asaf Bar Zvi Adi Hayat. Semantic Segmentation

Pose tracking of magnetic objects

Video Coding with Motion Compensation for Groups of Pictures

Nonlinear reverse-correlation with synthesized naturalistic noise

RESTORATION OF VIDEO BY REMOVING RAIN

NOISE ENHANCED ANISOTROPIC DIFFUSION FOR SCALAR IMAGE RESTORATION. 6, avenue du Ponceau Cergy-Pontoise, France.

A 600bps Vocoder Algorithm Based on MELP. Lan ZHU and Qiang LI*

Singlets. Multi-resolution Motion Singularities for Soccer Video Abstraction. Katy Blanc, Diane Lingrand, Frederic Precioso

Compression methods: the 1 st generation

LOSSLESS INTRA CODING IN HEVC WITH INTEGER-TO-INTEGER DST. Fatih Kamisli. Middle East Technical University Ankara, Turkey

A Study of Source Controlled Channel Decoding for GSM AMR Vocoder

6.869 Advances in Computer Vision. Bill Freeman, Antonio Torralba and Phillip Isola MIT Oct. 3, 2018

Deep Learning: Approximation of Functions by Composition

Gaussian Process Based Image Segmentation and Object Detection in Pathology Slides

A Background Layer Model for Object Tracking through Occlusion

Benchmarking Functional Link Expansions for Audio Classification Tasks

Wavelet Packet Based Digital Image Watermarking

Statistical Filters for Crowd Image Analysis

Recent Advances in Structured Sparse Models

Suppression of impulse noise in Track-Before-Detect Algorithms

LATTICE VECTOR QUANTIZATION FOR IMAGE CODING USING EXPANSION OF CODEBOOK

Human Pose Tracking I: Basics. David Fleet University of Toronto

ANALYSIS AND PREDICTION OF JND-BASED VIDEO QUALITY MODEL

Various signal sampling and reconstruction methods

Continuous State MRF s

Histogram of multi-directional Gabor filter bank for motion trajectory feature extraction

Scale Space Smoothing, Image Feature Extraction and Bessel Filters

Automated Segmentation of Low Light Level Imagery using Poisson MAP- MRF Labelling

SYDE 575: Introduction to Image Processing. Image Compression Part 2: Variable-rate compression

Frog Sound Identification System for Frog Species Recognition

A COMPARATIVE STUDY OF -DISTORTION LIMITED IMAGE COMPRESSION ALGORITHMS. Hannes Hartenstein, Ralf Herz and Dietmar Saupe

TUTORIAL PART 1 Unsupervised Learning

Two-stage Pedestrian Detection Based on Multiple Features and Machine Learning

arxiv: v1 [cs.mm] 10 Mar 2016

FPGA Implementation of a HOG-based Pedestrian Recognition System


Hou, Ch. et al. IEEE Transactions on Neural Networks March 2011

Case Studies of Logical Computation on Stochastic Bit Streams

FINGERPRINT INFORMATION MAXIMIZATION FOR CONTENT IDENTIFICATION 1. Rohit Naini, Pierre Moulin

Bootstrapping for Approximate Homomorphic Encryption

Logarithmic quantisation of wavelet coefficients for improved texture classification performance

Neuromorphic Sensing and Control of Autonomous Micro-Aerial Vehicles

Apprentissage, réseaux de neurones et modèles graphiques (RCP209) Neural Networks and Deep Learning

A Modified Moment-Based Image Watermarking Method Robust to Cropping Attack

A Method of HVAC Process Object Identification Based on PSO

Surveying, Mapping and Remote Sensing (LIESMARS), Wuhan University, China

Motion Vector Prediction With Reference Frame Consideration

CSE 591: Introduction to Deep Learning in Visual Computing. - Parag S. Chandakkar - Instructors: Dr. Baoxin Li and Ragav Venkatesan

Predictive Coding. Prediction Prediction in Images

CSC 576: Variants of Sparse Learning

Transcription:

1,3 Ragav Venkatesan, 2 Christine Zwart, 2,3 David Frakes, 1 Baoxin Li 1 School of Computing Informatics and Decision Systems Engineering, Arizona State University, Tempe, AZ, USA 2 School of Biological and Health Systems Engineering, Arizona State University, Tempe, AZ, USA 3 School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, AZ, USA

3/2/2014

OUTLINE Introduction Literature review Proposed deinterlacing system Results Conclusions

OUTLINE Introduction Literature review Proposed deinterlacing system Results Conclusions

INTRODUCTION Interlaced Videos:- Given frame of N rows, only N/2 alternate rows are present in one field. Remaining rows are present in the next field

INTRODUCTION Deinterlacing :- Convert Interlaced Videos to Deinterlaced Videos Convert Fields to Frames.

INTRODUCTION Deinterlacing :- F 0 x, n = F x, n, y mod2 = n mod2 F i x, n, otherwise

INTRODUCTION Static region of video Temporal deinterlacing High motion in the video Spatial deinterlacing Sports is broadcast over 1080i rather than the usual 720p or 1080p. This is followed by ESPN/Star, Sky, BBC sport.

OUTLINE Introduction Literature review Proposed deinterlacing system Results Conclusions

Literature Review Linear Deinterlacers Spatial Deinterlacers Temporal Deinterlacers Spatio-Temporal Deinterlacers Non-Linear Deinterlacers Inter-Frame Deinterlacers Intra-Frame Deinterlacers Adaptive Deinterlacers

Literature Review Linear Deinterlacers:- F n i, j = m F n i, j, j mod2 = n mod2 F n+m i, j + k h m k, otherwise k

Literature Review Line Averaging (LA):-

Literature Review Spatial Linear Deinterlacers:- All pass on the temporal. Even after low pass, aliasing components are preserved. Temporal Linear Deinterlacers:- All pass on the vertical. This is the best solution for a static image as all the vertical frequencies are preserved

Literature Review Temporal Linear Deinterlacers:-

Literature Review Vertical Temporal Filter (VTF) [Wes98]:- F n i, j, j mod2 = n mod2 F n i, j = F n+m i, j + k h m k, otherwise m k h m k = 1 2, 1 2 1 16, 1 8, 1 16, k = 1,1 m = 0, k = 2,0,2 m = 1,1

Literature Review Non- Linear Deinterlacers:- Inter-Frame Deinterlacers Intra-Frame Deinterlacers Adaptive Deinterlacers

Literature Review Non- Linear Deinterlacers:- Inter-Frame Deinterlacers Edge based Line Averaging Intra-Frame Deinterlacers Spatio-Temporal edge based median filtering Adaptive Deinterlacers Content adaptive vertical temporal filtering.

Literature Review Method switching algorithms (MSAs) Method switching ELA [Hon11] Choses between LA, ELA and STELA. Content adaptive VTF CAVTF [Lee13] Adaptively learn the filter weights of VTF based on video content Uses Adaptive dynamic range encoding (ADRC) Each code gets a different weight Latest paper there is on deinterlacing Considered to be benchmark for our tests.

OUTLINE Introduction Literature review Proposed deinterlacing system Results Conclusions

Deinterlacing System Region of video that has no motion or a static region. This requires a purely temporal deinterlacer. Region of video that has moderate motion. This requires a spatio-temporal deinterlacer Region of video that has very high motion. This requires a purely spatial deinterlacer.

Deinterlacing System F n i, j = F n i, j, j mod2 = n mod2 a, A b, B

Deinterlacing System Solving for Region A :- Difference image will give static regions d n = F n + 1 F(n 1) Threshold to 1 bit-depth. Condition a: d n i, j < t 1 where t 1 is 1 bit.

Deinterlacing System F n i, j = F n i, j, j mod2 = n mod2 F n 1 i, j, d n i, j < t 1 b, B c, C

Deinterlacing System Solving for Region B :- Spectral residue. q n = Ch 1 n + Ch 2 n μ 1 + Ch 3 n μ 2 + Ch 4 n μ 3 Q i n u, v = q i n x, y = W 1 1 WH y=0 H 1 yv e μ 2π 1 W +xuh x=0 W 1 H 1 yv e μ 2π 1 W +xuh 1 WH v=0 u=0 q 1 n x, y Q 1 n [u, v]

Deinterlacing System Solving for Region B :- Q p = Q Q Phase spectrum of the transform.

Deinterlacing System Solving for Region B :- Q p = Q Q Gaussian smooth the Phase spectrum of the transform. S n i, j = g q p

Deinterlacing System 2D Control Grid Interpolation : 2DI[i, j, k] = I(i + d 1 [i, j, k], j + d 2 [i, j, k], k + k) [Frakes08]

Deinterlacing System 1D control grid interpolation: I i, j = I i + α, j + 1 Two uni-directional estimates:- I i, j = I(i + 2α, j + 2) I i, j + 2 = I i + 2α, j 1DI = I i + α, j + 1 = 1 2 I i, j + I i + 2α, j + 2

Deinterlacing System F n i, j = F n i, j, j mod2 = n mod2 F n 1 i, j, d n i, j < t 1 2DI[i, j, k](1 S n i, j ) + 1D[i, j]s n i, j, d n i, j t 1

OUTLINE Introduction Literature review Proposed deinterlacing system Results Conclusions

Results 60 PSNR 50 40 30 20 10 0 STELA VTF SRVTF Proposed

Results 60 VSNR 50 40 30 20 10 0 VTF SRVTF SDD

Original ELA STELA VTF SRVTF Proposed

CAVTF [Lee11] Proposed

Thank You.

References [Gates97]Microsoft, "Broadcast-enabled computer hardware requirements," in WinHec'97, 1997. [Haa98] G. Haan and E. Bellers, "Deinterlacing-an overview," Proceedings of the IEEE, vol. 86, pp. 1839-1857, September 1998. [Wes98] M. Weston, Interpolating lines of video signals, US-patent 4,789,893, December 1988. [Sal93] J. Salonen and S. Kalli, "Edge adaptive interpolation for scanning rate conversion," Signal processing for HDTV IV, pp. 757-764, 1993.

References [Kuo96] C. J. Kuo, C. Liao and C. Lin, "Adaptive interpolation technique for scanning rate conversion," IEEE Transactions on circuit systems and video technology, vol. 6, pp. 317-321, 1996. [Oh00] H. S. Oh, Y. Kim, Y. Y. Jung, A. W. Morales and S. J. Ko, "Spatio-temporal edge-based median filtering for deinterlacing," in International conference on consumer electronics, 2000 [Hon11] S.-M. Hong, S.-J. Park, J. Jang and J. Jeong, "Method switching algorithm for intra-field deinterlacing," in IEEE 15th International symposium on consumer electronics, 2011.

References [Lee11] K. Lee and C. Lee, "High quality deinterlacing using content adaptive vertical temporal filtering," IEEE Transactions on consumer electronics, pp. 2469-2474, 2011. [Lu10] T. Lu, Z. Yuan, Y. Huang, D. Wu and H. Yu, "Video retargetting with nonlinear spatial-temporal saliency fusion," in International conference on image processing (ICIP), 2010. [Itt98] L. Itti, C. Koch and E. Niebur, "A model of saliency-based visual attention for rapid scene analysis," IEEE Transactions on pattern analysis and machine intelligence, vol. 20, pp. 1254-1259, 1998. [Frakes08] Frakes, David H., et al. "A new method for registration-based medical image interpolation." Medical Imaging, IEEE Transactions on 27.3 (2008): 370-377.

References [Hou07] X. Hou and L. Zhang, "Saliency Detection: A Spectral residue appraoch," in Computer vision and pattern analysis (CVPR), 2007. [San01] S. Sangwine and T. Ell, "Hypercomplex Fourier transforms of color images," IEEE Transactions on image processing, pp. 22-35, 2001. [Zwa12] C. Zwart and D. H. Frakes, "Soft adaptive gradient angle interpolation of grayscale images," in IEEE International conference on Acoustics, Speech and Signal Processing (ICASSP), 2012. [See07] P. Seeling, F. Fitzek and M. Reisslein, Video traces for network performance evaluation, Springer, 2007.