Content-based Decomposition of Gesture Videos

Size: px
Start display at page:

Download "Content-based Decomposition of Gesture Videos"

Transcription

1 Content-based Decomposition of Gesture Videos Nikolaos D. Doulamis National Technical University of Athens, Dept. of Electrical and Computer Engineering Anastasios D. Doulamis National Technical University of Athens, Dept. of Electrical and Computer Engineering Dimitrios I. Kosmopoulos National Centre for Scientific Research Demokritos, Institute of Informatics & Telecommunications Abstract - In this paper e present a novel method for gesture video decomposition based on the depicted content. From the initial content the key-frames are extracted and the neighboring frames are assigned to key frames of similar content. The resulting frame groups are decomposed to binary trees, based on the energy of the depicted gestures. In case of reduced bandidth e keep the original timeline but e send only a dynamically adapted video summary. The respective frames are obtained by moving appropriately across the hierarchy layers of the constructed tree. The hierarchically structured video can be used for purposes such as efficient video brosing and transmission of dynamically generated summaries over lo bandidth netorks, for communication or human-computer interface applications. I INTRODUCTION Many interesting approaches have been proposed in the past to improve the quality of life of disabled people and aid their inclusion in the society. This applies also to people ith hearing and speech impairement, ho use sign language to communicate. The video depicting communicative signs requires special analysis (compared ith conventional video content) to derive the semantic meaning of the content. On the other hand, there is a tremendous demand for multimedia data transmitted over netorks and this demand is expected to rapidly increase in the forthcoming years due to the development of lo-cost devices for capturing and generating multimedia information. Despite the recent process in coding and compression algorithms, such as the MPEG-4 standards [], transmission of multimedia information, and especially of digital video in a cost effective and quality guaranteeing manner over lo bandidth netorks, such as the Internet, still remains one of the most challenging problems. This is due to the fact that digital video, even in compressed domain, imposes very large bandidth requirements. There is difficulty in navigating and accessing huge video databases. Despite the recent advances in video coding and the proposal of hierarchical coding schemes, such as the Fine Granularity Scalability (FGS) [, 3] scheme. This concept can be extended to video sequences by initially transmitting a coarse (lo) resolution and then, decomposing particular video segments into a higher (finer) resolution. In this ay and taking into account bandidth constrains, the respective video content is transmitted. Conventional video adaptation algorithms can be considered the orks of [4-8]. The goal of these algorithms is to extract a small "video abstract" by discarding visual information, similarly to text summaries used in document files. On the contrary, in a video delivery system the user is alloed to see visual content at various resolutions. Toards this direction, algorithms, dealing ith progressive retrieval of images or video sequences, such as the ones presented in [9, 0] can be considered. These techniques initially transmit a coarse (lo) "resolution vie" of an image, folloed by additional residual information. Hoever, these approaches, decompose linearly (donsample) visual information on spatial and temporal direction at fixed units of pixels or frames. Hierarchical video summarization has been also adopted in the frameork of the MPEG-7 standard [], through the HierarchicalSummary Description scheme. The technique uses only key-frame organization and clusters video segments according to the visual content and temporal coherency. In [], a content-based video decomposition scheme has been presented to organize visual information at different levels of content hierarchy. The method represents video information in a tree structure, the level of hich corresponds to a content resolution, hile the tree nodes correspond to the video segments that are partitioned at this level. Shots and frames are considered as the basic elements for the hierarchical video decomposition. The main draback, hoever, of all the above mentioned methods, is that a) generic lo level features are used for modeling video content and b) content decomposition is not constrained ith respect to the current bandidth requirements. The second issue means that the amount of the transmitted video content is not restricted by the netork channel characteristics, hich dynamically

2 changes through time. For this reason, a video content decomposition scheme is required to update the transmitted video content to netork characteristics (adaptive video transmission). In this paper, e propose a novel video decomposition scheme for applications such as efficient brosing and adaptive delivery and transmitting of gesture videos such as sign language streams. Gesture videos are also useful for human-computer interaction, or even robot guidance over netorks. In these types of videos, the semantic content is mainly defined as the gesture variation, i.e., the movement of hands and head. Adaptation of gesture videos allos for efficient streaming of such types of videos over lo bandidth netorks, for navigation of sign language streams in huge media databases, for delivery of the salient content in robotic guidance scenarios, and for searching and indexing of particular content of interest of gesture videos. The proposed architecture consists of the gesture segmentation-representation module and the adaptive hierarchical decomposition module. Gesture segmentation is performed in our case taking into account skin color information. Then, the segmented regions are represented using the Zernike moments [3]. The adoption of Zernike moments, instead of other types of moments, such as Cartesian or Hu moments, is due to the fact that they are orthogonal and thus they have stronger representation capabilities. Due to orthogonality, the sum of the squared coefficients of the Zernike moments expresses the energy of the gesture shape [3]. The next step of the proposed architecture is the adaptive hierarchical content decomposition algorithm. More specifically, the shape energy is used for estimating the key-frames based on the curvature of the shape energy trajectory. In the folloing, the remaining video frames are temporally classified ith respect to the estimated key-frames. Then, an adaptive decomposition algorithm is proposed, hich dynamically calculates the number of frames required to be transmitted ith respect to content variations and the bandidth constraints. The algorithm cluster video frames using a binary tree classification scheme. II GESTURE SEGMENTATION AND SUMMARIZATION The goal of gesture segmentation and summarization procedure is to segment the head and hand regions in the image and create a content based representation for them; then based on this representation to select only those frames that are able to describe adequately the performed gestures. As described in the folloing, the segmentation is performed using skin color, the head and hand areas are described by Zernike moments and the key frames are extracted by identifying the maximum temporal variations of the gesture energy. We assume that the depicted person (target) faces the camera ith his/her upper body part captured in the image. The hands may disappear from the image, they may be occluded by each other and they may occlude the head, but e assume that the target is dressed. To perform color modeling, e firstly locate the face in the image using a face detector from a readily available vision library. We train a color classifier based on the face color and then based on the modeled color e find the hand regions in the image. For solving this target-specific and thus reduced skin color modeling problem, e assume a single multivariate Gaussian model. The color probability density function in the Hue, Saturation and Intensity color space (selected due to similarity to human perception) is given by: p( c skin) = e 3 (π ) Σ Τ ( c µ ) Σ ( c µ ) here c refers to the color vector and µ, Σ are the mean vector and the covariance matrix respectively. The skin regions are obtained after calculating the skin probability for all pixels. Depending on the application context the Intensity (and sometimes also the Saturation) channel may be excluded from the color probability calculation. Using the color model, e are able to segment the image into skin and non-skin regions resulting in a binary mask (filtered to remove noise and after extraction of the up to three biggest connected components). The mask is applied to the Intensity channel of the image to obtain a masked gray-level image including only the head and the to hands. The gray-level image provides a richer representation of the current gesture than the binary masks; the employment of the latter can lead to loss of information especially in the case of different gestures ith similar silhouettes, e.g., front and back image of the hand. We use the Zernike moments to represent the activity state as it is expressed by the relative position and shape of the head and the to hands due to their noise resiliency, their reduced information redundancy and their reconstruction capability. The complex Zernike moments of order p are defined as [3]: () p + π jqθ Α pq = Rpq ( r) e f ( r, θ ) r drdθ (a) π 0 π = x y, θ=tan - (y/x), -<x,y< (b) r + p q s m s R pq ( r) ( ) r (c) = s= 0 p-q = even and 0 q m. ( m s)! m + n m n s!( s)!( s)!

3 For the purposes of this ork e chose to normalize the moments around the center of gravity because it is actually the relative pose of the head to the hands that signifies change of activity. On the contrary, normalization to scale is undesired due to the fact that scale variations ithin the same shot signify interesting gestural events. For representing the gesture state e use as signature the sum of the squared coefficients of the Zernike moments hich correspond to the gesture energy [3]: J = Q p= 0 q p p, q even Apq here Q is the selected order of the moments and the L norm. The energy is plotted for each frame of the shot forming a trajectory, hich expresses the temporal variation of the energy shape through time. The second derivative of J for all frames ithin a shot ith respect to time is used as a curvature measure. Local maxima correspond to time instances of peak shape variation, hile local minima indicate lo shape variation. Let us denote as J( the energy of shape coefficients to the kth frame of the examined shot. Initially, the first derivative of signal J(, say J (, is evaluated, approximated as the difference of feature vectors beteen to successive frames J(=J(k+)-J(. To minimize noise influence, a eighted average of the first derivative, say J, over a indo, is used, given by: (4) J ( = β( α( k ) l k ( J ( l + ) J ( l) ), k=0,,m- here α ( = max(0, k N ), and β ( k ) = min( M, k + N ) and *N + is the length of the indo, centered at frame k. Variable M indicates the number of frames of the shot. The eights l are defined for l { N, N } ; in the simple case, all eights l are considered equal to each other, meaning that the derivatives of all frame feature vectors ithin the indo interval present the same importance, (3) l =, -N,,N (5) (N + ) Similarly, the second eighted derivative, J (, for the k-th frame is defined as: J ( = β( k ) α( k ) l k J ( (6) here J ( = J ( k + ) J (, k=0,,m-3 (7) and α ( = min(0, k N ), β ( k ) = min( M 3, k + N ) As explained previously, the local maxima and minima of J are considered as appropriate curve points, i.e., as time instances for the selected key-frames. Note that J is a discrete time sequence. Hence, the local maxima and minima can be estimated as the union of to sets X = X M X m ; the X M contains the time instances of frames corresponding to the local maxima of J, hile the X m the time instances of local minima of J. The sets X M and X m are estimated as follos X M = { k : J ( k ) < J ( & J ( > J ( k + )} X m = { k : J ( k ) > J ( & J ( > D( k + )} (8) III VIDEO DECOMPOSITION AND ADAPTIVE TRANSMISSION The key-frames extracted as described in section, preserve the semantics of the gesture. Thus, video content decomposition is performed in our case by exploiting the key-frame information. In particular, the remaining frames are temporally grouped around the extracted key-frames. These groups are composed in a spatiotemporal fashion considering both gesture energy (spatial) and time proximity (temporal). It is actually a problem of dividing the frames that belong to the timeline beteen to adjacent key-frames into to groups. One group is represented by the first key-frame, hile the other by the second one. This is equivalent to finding the optimal value frame index, say mˆ, that minimizes the folloing objective function. m N m ˆ = arg min{ Jk g + Jk g } (9) m k = k= m+ here g, g the shape energy of the to key-frames and J k the shape energy of the frames belonging in the timeline beteen the to key-frames. Then the frames ith time index k m are assigned to the group of the first key-frame hile the rest ones are assigned to the second group. Especially, for the first (last) key-frame of the video sequence, the frames that are located in time beteen the beginning (end) of the timeline and the key-frame, are directly assigned to the group of that key-frame. After defining the groups, say G i i=,, K, the frames in each group G i are processed to compose a binary tree based on the difference of their shape energies. Each node of the tree corresponds to a video frame. In addition each frame is decomposed into to frames, say f r, f s hose respective shape energy, J r, J s, is selected so that:

4 J r J s > J u J v for all fu f v Gi, (0) All the remaining nodes of G i are then grouped around the to frames f u, f v by minimizing (9). The frames f u, f v, summarize (represent) at a more detailed level all the frames belonging into their groups. The video decomposition procedure is repeated for all tree branches until the tree can not be further decomposed, i.e. the tree nodes represent single frames. In our representation, each node n in the tree is characterized by a vector of three integers (a,b,c) here: a is the minimum frame index in the group summarized by node n. b is the maximum frame index in the group summarized by node n. c is the index of the frame represented by node n In the sequel, video decomposition is used for transmitting visual information over lo bandidth netorks. Let us denote as C(n) the set of all children of node n. Let us also denote as K(t) the root node of the tree that corresponds to the key-frame at time instance t. If N is the total number of frames, t the current timeline counted in frame-period, and B the current bandidth, then an adaptive algorithm for video transmission is proposed defined as follos: for t = :N { GetBandidth(B); n = K (t); hile (B - <n(b)-n(a) ) /*a single node*/ n = { n C(n) : t [a,b]}; if n (c) = = t transmit frame(n) else transmit control signal } In the previous algorithm e recall that a, b are the minimum/maximum frame indices of the group represented by node n. In addition, B - refers to the inverse of the bandidth and n(a), n(b), n(c) the a, b, c fields of the node vector. In each time slot the bandidth is read to find the ne netork/client media device state. In case that the bandidth is unable to transmit even the key-frames then some higher level clustering has to be performed but in that case significant visual info ill be lost. IV EXPERIMENTAL RESULTS The applicability of the method has been verified through content-based video decomposition of sign language videos. The presented scheme for gesture-based video decomposition is evaluated using video files of sign language. Figure illustrates a shot of the assume ord. In Figure a, one every 5 frames is presented for clarity. Figure b shos the second derivative of the shape energy and the respective key-frames. As is observed, the salient parts of the shots are detected as key-frames, i.e., the 8, 3, 39, 60, 89,, 7 frame indices. The remaining frames are grouped together ith respect to these key-frames. For example, the first group includes the frames from -0, the second -3, etc. For each group, the to most dissimilar frames are selected for further decomposition, e.g., the second group represented by the key-frame 3 is further decomposed to groups represented by frames 6 and 3, hich in the sequel are decomposed to groups represented by frames ith indices of and 6 and 7 and 3 respectively. This process is repeated until all the tree nodes represent single frames. This is illustrated in Figure. The frames that are transmitted for time-variant bandidth are displayed in the same figure. It becomes clear that the frame transmission is adaptable to netork channel capacity and variations in its characteristics. It is also evident that time the frame sequence, hich is important for content such as gestures, is preserved. V CONCLUSIONS In this paper e present an adaptive algorithm for gesture video decomposition over lo-bandidth netorks. Initially, key-frames are extracted using the time variation of the gesture shape energy. This energy is calculated by the sum of he squared coefficients o the Zernike moments. In case of reduced bandidth e keep the original timeline but e send only a dynamically adapted video summary. The respective frames are obtained by moving appropriately across the hierarchy layers of the constructed tree. Experimental results on sign language videos indicate that the proposed scheme a) decomposes video based on content characteristics and b) provides adaptive content delivery in time varied netorks. VI REFERENCES [] ISO/IEC JTC/SC9/WG N356, "MPEG-4 Overvie," Doc. N356, Maui, Haaii, December 999. [] W. Li, Overvie of Fine Granularity Scalability in MPEG-4 Video Standard, IEEE Trans. CSVT, vol., no. 3, pp , Mar. 00 [3] Coding of Audio-Visual Objects, Part- Visual, Amendment 4: Streaming Video Profile, ISO/IEC /FPDAM4, July 000. [4] N. Doulamis, A. Doulamis, Y. Avrithis, K. Ntalianis and S. Kollias, " Efficient summarization of stereoscopic video sequences," IEEE Trans. on CSVT, Vol. 0, No. 4, pp , June 000. [5] A. Hanjalic and H. Zhang, "An integrated scheme for automated abstraction based on unsupervised cluster-

5 validity analysis," IEEE Trans. on CSV T, Vol. 9, No. 8, pp , December 999. [6] A. Ekin, A.M. Tekalp, and R. Mehrotra, Automatic soccer video analysis and summarization, IEEE Trans. on Image Processing, Vol., pp , July 003. [7] Tianming Liu, Hong-Jiang Zhang and Feihu Qi, A novel video key-frame-extraction algorithm based on perceived motion energy model, IEEE Trans. on CSVT, Vol., 3,pp , Oct [8] Jae-Ho Lee, Gang-Gook Lee, and Whoi-Yul Kim, Automatic video summarizing tool using MPEG-7 descriptors for personal video recorder, IEEE Trans. on Cons. Elect., Vol. 49,pp , Aug [9] J. R. Smith, "VideoZoom: Spatio-temporal video broser," IEEE Trans. on Multimedia, vol., No., pp. 57-7, June 999. [0] J. R. Smith, V. Castelli and C.-S. Li, "Adaptive storage and retrieval of large compressed images," in Storage & Retrieval for Image and Video Databases, VII, M.M Yeung, B.L. Yeo and C. A. Bouman Eds. Proc. SPIE, vol. 3656, pp , Jan [] ISO/IEC JTC /SC 9/WG /N3964,N3966, Multimedia Description Schemes (MDS) Group, March 00, Singapore. [] A. Doulamis and N. Doulamis, Optimal Contentbased Video Decomposition for Interactive Video Navigation over IP-based Netorks, IEEE Trans. on Circuits and Systems for Video Technology, Vοl. 4, No. 6, pp , June 004. [3] R.Mukundan,K.R.Ramakrishnan, Moment Functions in Image Analysis: Theory and Applications, World Scientific, Singapore, x (a) Magnitude of the Second Derivative (,0,8) (,3, 3) (3,49,39) (50, 68, 60) Frame Index (69,0,89) (03,,) (3,37,7) Second Derivative/ local Min-Max Key-frames of the assume sign (b) Figure. Some summarization results of sign language images. (a) One every 5 frames for the assume sign. (b) The second derivative and the local maxima/minima for the assume sign and the key-frames

6 Figure. The binary tree decomposition for the group of frames -3 and the frames on the media device for time-variable bandidth.

A Novel Activity Detection Method

A Novel Activity Detection Method A Novel Activity Detection Method Gismy George P.G. Student, Department of ECE, Ilahia College of,muvattupuzha, Kerala, India ABSTRACT: This paper presents an approach for activity state recognition of

More information

Dominant Feature Vectors Based Audio Similarity Measure

Dominant Feature Vectors Based Audio Similarity Measure Dominant Feature Vectors Based Audio Similarity Measure Jing Gu 1, Lie Lu 2, Rui Cai 3, Hong-Jiang Zhang 2, and Jian Yang 1 1 Dept. of Electronic Engineering, Tsinghua Univ., Beijing, 100084, China 2 Microsoft

More information

ECE533 Digital Image Processing. Embedded Zerotree Wavelet Image Codec

ECE533 Digital Image Processing. Embedded Zerotree Wavelet Image Codec University of Wisconsin Madison Electrical Computer Engineering ECE533 Digital Image Processing Embedded Zerotree Wavelet Image Codec Team members Hongyu Sun Yi Zhang December 12, 2003 Table of Contents

More information

A RAIN PIXEL RESTORATION ALGORITHM FOR VIDEOS WITH DYNAMIC SCENES

A RAIN PIXEL RESTORATION ALGORITHM FOR VIDEOS WITH DYNAMIC SCENES A RAIN PIXEL RESTORATION ALGORITHM FOR VIDEOS WITH DYNAMIC SCENES V.Sridevi, P.Malarvizhi, P.Mathivannan Abstract Rain removal from a video is a challenging problem due to random spatial distribution and

More information

Image Data Compression

Image Data Compression Image Data Compression Image data compression is important for - image archiving e.g. satellite data - image transmission e.g. web data - multimedia applications e.g. desk-top editing Image data compression

More information

Enhancing Generalization Capability of SVM Classifiers with Feature Weight Adjustment

Enhancing Generalization Capability of SVM Classifiers with Feature Weight Adjustment Enhancing Generalization Capability of SVM Classifiers ith Feature Weight Adjustment Xizhao Wang and Qiang He College of Mathematics and Computer Science, Hebei University, Baoding 07002, Hebei, China

More information

ON SCALABLE CODING OF HIDDEN MARKOV SOURCES. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

ON SCALABLE CODING OF HIDDEN MARKOV SOURCES. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose ON SCALABLE CODING OF HIDDEN MARKOV SOURCES Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California, Santa Barbara, CA, 93106

More information

Sound Recognition in Mixtures

Sound Recognition in Mixtures Sound Recognition in Mixtures Juhan Nam, Gautham J. Mysore 2, and Paris Smaragdis 2,3 Center for Computer Research in Music and Acoustics, Stanford University, 2 Advanced Technology Labs, Adobe Systems

More information

Multimedia Networking ECE 599

Multimedia Networking ECE 599 Multimedia Networking ECE 599 Prof. Thinh Nguyen School of Electrical Engineering and Computer Science Based on lectures from B. Lee, B. Girod, and A. Mukherjee 1 Outline Digital Signal Representation

More information

Neural Networks. Associative memory 12/30/2015. Associative memories. Associative memories

Neural Networks. Associative memory 12/30/2015. Associative memories. Associative memories //5 Neural Netors Associative memory Lecture Associative memories Associative memories The massively parallel models of associative or content associative memory have been developed. Some of these models

More information

Statistical Filters for Crowd Image Analysis

Statistical Filters for Crowd Image Analysis Statistical Filters for Crowd Image Analysis Ákos Utasi, Ákos Kiss and Tamás Szirányi Distributed Events Analysis Research Group, Computer and Automation Research Institute H-1111 Budapest, Kende utca

More information

Lecture 3a: The Origin of Variational Bayes

Lecture 3a: The Origin of Variational Bayes CSC535: 013 Advanced Machine Learning Lecture 3a: The Origin of Variational Bayes Geoffrey Hinton The origin of variational Bayes In variational Bayes, e approximate the true posterior across parameters

More information

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD

AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD 4th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP AN IMPROVED CONTEXT ADAPTIVE BINARY ARITHMETIC CODER FOR THE H.264/AVC STANDARD Simone

More information

Information Retrieval and Web Search

Information Retrieval and Web Search Information Retrieval and Web Search IR models: Vector Space Model IR Models Set Theoretic Classic Models Fuzzy Extended Boolean U s e r T a s k Retrieval: Adhoc Filtering Brosing boolean vector probabilistic

More information

Selective Use Of Multiple Entropy Models In Audio Coding

Selective Use Of Multiple Entropy Models In Audio Coding Selective Use Of Multiple Entropy Models In Audio Coding Sanjeev Mehrotra, Wei-ge Chen Microsoft Corporation One Microsoft Way, Redmond, WA 98052 {sanjeevm,wchen}@microsoft.com Abstract The use of multiple

More information

Wavelet Packet Based Digital Image Watermarking

Wavelet Packet Based Digital Image Watermarking Wavelet Packet Based Digital Image ing A.Adhipathi Reddy, B.N.Chatterji Department of Electronics and Electrical Communication Engg. Indian Institute of Technology, Kharagpur 72 32 {aar, bnc}@ece.iitkgp.ernet.in

More information

Brief Introduction of Machine Learning Techniques for Content Analysis

Brief Introduction of Machine Learning Techniques for Content Analysis 1 Brief Introduction of Machine Learning Techniques for Content Analysis Wei-Ta Chu 2008/11/20 Outline 2 Overview Gaussian Mixture Model (GMM) Hidden Markov Model (HMM) Support Vector Machine (SVM) Overview

More information

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. This document is donloaded from DR-NTU, Nanyang Technological University Library, Singapore. Title Amplify-and-forard based to-ay relay ARQ system ith relay combination Author(s) Luo, Sheng; Teh, Kah Chan

More information

Artificial Neural Networks. Part 2

Artificial Neural Networks. Part 2 Artificial Neural Netorks Part Artificial Neuron Model Folloing simplified model of real neurons is also knon as a Threshold Logic Unit x McCullouch-Pitts neuron (943) x x n n Body of neuron f out Biological

More information

Optimum Relay Position for Differential Amplify-and-Forward Cooperative Communications

Optimum Relay Position for Differential Amplify-and-Forward Cooperative Communications Optimum Relay Position for Differential Amplify-and-Forard Cooperative Communications Kazunori Hayashi #1, Kengo Shirai #, Thanongsak Himsoon 1, W Pam Siriongpairat, Ahmed K Sadek 3,KJRayLiu 4, and Hideaki

More information

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms

Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms Flierl and Girod: Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms, IEEE DCC, Mar. 007. Half-Pel Accurate Motion-Compensated Orthogonal Video Transforms Markus Flierl and Bernd Girod Max

More information

A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT

A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT Progress In Electromagnetics Research Letters, Vol. 16, 53 60, 2010 A ROBUST BEAMFORMER BASED ON WEIGHTED SPARSE CONSTRAINT Y. P. Liu and Q. Wan School of Electronic Engineering University of Electronic

More information

Feature Extraction Using Zernike Moments

Feature Extraction Using Zernike Moments Feature Extraction Using Zernike Moments P. Bhaskara Rao Department of Electronics and Communication Engineering ST.Peter's Engineeing college,hyderabad,andhra Pradesh,India D.Vara Prasad Department of

More information

University of Genova - DITEN. Smart Patrolling. video and SIgnal Processing for Telecommunications ISIP40

University of Genova - DITEN. Smart Patrolling. video and SIgnal Processing for Telecommunications ISIP40 University of Genova - DITEN Smart Patrolling 1 Smart Patrolling Detection of the intruder Tracking of the intruder A cognitive node will active an operator, describing on his mobile terminal the characteristic

More information

An Improved Driving Scheme in an Electrophoretic Display

An Improved Driving Scheme in an Electrophoretic Display International Journal of Engineering and Technology Volume 3 No. 4, April, 2013 An Improved Driving Scheme in an Electrophoretic Display Pengfei Bai 1, Zichuan Yi 1, Guofu Zhou 1,2 1 Electronic Paper Displays

More information

Early & Quick COSMIC-FFP Analysis using Analytic Hierarchy Process

Early & Quick COSMIC-FFP Analysis using Analytic Hierarchy Process Early & Quick COSMIC-FFP Analysis using Analytic Hierarchy Process uca Santillo (luca.santillo@gmail.com) Abstract COSMIC-FFP is a rigorous measurement method that makes possible to measure the functional

More information

State of the art Image Compression Techniques

State of the art Image Compression Techniques Chapter 4 State of the art Image Compression Techniques In this thesis we focus mainly on the adaption of state of the art wavelet based image compression techniques to programmable hardware. Thus, an

More information

Adaptive Noise Cancellation

Adaptive Noise Cancellation Adaptive Noise Cancellation P. Comon and V. Zarzoso January 5, 2010 1 Introduction In numerous application areas, including biomedical engineering, radar, sonar and digital communications, the goal is

More information

Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p.

Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p. Preface p. xvii Introduction p. 1 Compression Techniques p. 3 Lossless Compression p. 4 Lossy Compression p. 5 Measures of Performance p. 5 Modeling and Coding p. 6 Summary p. 10 Projects and Problems

More information

Multimedia communications

Multimedia communications Multimedia communications Comunicazione multimediale G. Menegaz gloria.menegaz@univr.it Prologue Context Context Scale Scale Scale Course overview Goal The course is about wavelets and multiresolution

More information

Study of Wavelet Functions of Discrete Wavelet Transformation in Image Watermarking

Study of Wavelet Functions of Discrete Wavelet Transformation in Image Watermarking Study of Wavelet Functions of Discrete Wavelet Transformation in Image Watermarking Navdeep Goel 1,a, Gurwinder Singh 2,b 1ECE Section, Yadavindra College of Engineering, Talwandi Sabo 2Research Scholar,

More information

Single Channel Signal Separation Using MAP-based Subspace Decomposition

Single Channel Signal Separation Using MAP-based Subspace Decomposition Single Channel Signal Separation Using MAP-based Subspace Decomposition Gil-Jin Jang, Te-Won Lee, and Yung-Hwan Oh 1 Spoken Language Laboratory, Department of Computer Science, KAIST 373-1 Gusong-dong,

More information

Original Article Design Approach for Content-Based Image Retrieval Using Gabor-Zernike Features

Original Article Design Approach for Content-Based Image Retrieval Using Gabor-Zernike Features International Archive of Applied Sciences and Technology Volume 3 [2] June 2012: 42-46 ISSN: 0976-4828 Society of Education, India Website: www.soeagra.com/iaast/iaast.htm Original Article Design Approach

More information

Empirical Analysis of Invariance of Transform Coefficients under Rotation

Empirical Analysis of Invariance of Transform Coefficients under Rotation International Journal of Engineering Research and Development e-issn: 2278-67X, p-issn: 2278-8X, www.ijerd.com Volume, Issue 5 (May 25), PP.43-5 Empirical Analysis of Invariance of Transform Coefficients

More information

Lecture VIII Dim. Reduction (I)

Lecture VIII Dim. Reduction (I) Lecture VIII Dim. Reduction (I) Contents: Subset Selection & Shrinkage Ridge regression, Lasso PCA, PCR, PLS Lecture VIII: MLSC - Dr. Sethu Viayakumar Data From Human Movement Measure arm movement and

More information

Science Insights: An International Journal

Science Insights: An International Journal Available online at http://www.urpjournals.com Science Insights: An International Journal Universal Research Publications. All rights reserved ISSN 2277 3835 Original Article Object Recognition using Zernike

More information

Preliminaries. Definition: The Euclidean dot product between two vectors is the expression. i=1

Preliminaries. Definition: The Euclidean dot product between two vectors is the expression. i=1 90 8 80 7 70 6 60 0 8/7/ Preliminaries Preliminaries Linear models and the perceptron algorithm Chapters, T x + b < 0 T x + b > 0 Definition: The Euclidean dot product beteen to vectors is the expression

More information

On the Energy-Delay Trade-off of a Two-Way Relay Network

On the Energy-Delay Trade-off of a Two-Way Relay Network On the Energy-Delay Trade-off of a To-Way Relay Netork Xiang He Aylin Yener Wireless Communications and Netorking Laboratory Electrical Engineering Department The Pennsylvania State University University

More information

Color Reproduction Stabilization with Time-Sequential Sampling

Color Reproduction Stabilization with Time-Sequential Sampling Color Reproduction Stabilization ith Time-Sequential Sampling Teck-Ping Sim, and Perry Y. Li Department of Mechanical Engineering, University of Minnesota Church Street S.E., MN 55455, USA Abstract Color

More information

The peculiarities of the model: - it is allowed to use by attacker only noisy version of SG C w (n), - CO can be known exactly for attacker.

The peculiarities of the model: - it is allowed to use by attacker only noisy version of SG C w (n), - CO can be known exactly for attacker. Lecture 6. SG based on noisy channels [4]. CO Detection of SG The peculiarities of the model: - it is alloed to use by attacker only noisy version of SG C (n), - CO can be knon exactly for attacker. Practical

More information

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders

Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders Single Frame Rate-Quantization Model for MPEG-4 AVC/H.264 Video Encoders Tomasz Grajek and Marek Domański Poznan University of Technology Chair of Multimedia Telecommunications and Microelectronics ul.

More information

The Analytic Hierarchy Process for the Reservoir Evaluation in Chaoyanggou Oilfield

The Analytic Hierarchy Process for the Reservoir Evaluation in Chaoyanggou Oilfield Advances in Petroleum Exploration and Development Vol. 6, No. 2, 213, pp. 46-5 DOI:1.3968/j.aped.1925543821362.1812 ISSN 1925-542X [Print] ISSN 1925-5438 [Online].cscanada.net.cscanada.org The Analytic

More information

Max-Margin Ratio Machine

Max-Margin Ratio Machine JMLR: Workshop and Conference Proceedings 25:1 13, 2012 Asian Conference on Machine Learning Max-Margin Ratio Machine Suicheng Gu and Yuhong Guo Department of Computer and Information Sciences Temple University,

More information

On A Comparison between Two Measures of Spatial Association

On A Comparison between Two Measures of Spatial Association Journal of Modern Applied Statistical Methods Volume 9 Issue Article 3 5--00 On A Comparison beteen To Measures of Spatial Association Faisal G. Khamis Al-Zaytoonah University of Jordan, faisal_alshamari@yahoo.com

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC9/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC9/WG11 MPEG 98/M3833 July 1998 Source:

More information

On Common Information and the Encoding of Sources that are Not Successively Refinable

On Common Information and the Encoding of Sources that are Not Successively Refinable On Common Information and the Encoding of Sources that are Not Successively Refinable Kumar Viswanatha, Emrah Akyol, Tejaswi Nanjundaswamy and Kenneth Rose ECE Department, University of California - Santa

More information

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2006 jzhang@cse.unsw.edu.au

More information

encoding without prediction) (Server) Quantization: Initial Data 0, 1, 2, Quantized Data 0, 1, 2, 3, 4, 8, 16, 32, 64, 128, 256

encoding without prediction) (Server) Quantization: Initial Data 0, 1, 2, Quantized Data 0, 1, 2, 3, 4, 8, 16, 32, 64, 128, 256 General Models for Compression / Decompression -they apply to symbols data, text, and to image but not video 1. Simplest model (Lossless ( encoding without prediction) (server) Signal Encode Transmit (client)

More information

Multimedia Databases. Previous Lecture. 4.1 Multiresolution Analysis. 4 Shape-based Features. 4.1 Multiresolution Analysis

Multimedia Databases. Previous Lecture. 4.1 Multiresolution Analysis. 4 Shape-based Features. 4.1 Multiresolution Analysis Previous Lecture Multimedia Databases Texture-Based Image Retrieval Low Level Features Tamura Measure, Random Field Model High-Level Features Fourier-Transform, Wavelets Wolf-Tilo Balke Silviu Homoceanu

More information

TIME DOMAIN ACOUSTIC CONTRAST CONTROL IMPLEMENTATION OF SOUND ZONES FOR LOW-FREQUENCY INPUT SIGNALS

TIME DOMAIN ACOUSTIC CONTRAST CONTROL IMPLEMENTATION OF SOUND ZONES FOR LOW-FREQUENCY INPUT SIGNALS TIME DOMAIN ACOUSTIC CONTRAST CONTROL IMPLEMENTATION OF SOUND ZONES FOR LOW-FREQUENCY INPUT SIGNALS Daan H. M. Schellekens 12, Martin B. Møller 13, and Martin Olsen 4 1 Bang & Olufsen A/S, Struer, Denmark

More information

Linear models and the perceptron algorithm

Linear models and the perceptron algorithm 8/5/6 Preliminaries Linear models and the perceptron algorithm Chapters, 3 Definition: The Euclidean dot product beteen to vectors is the expression dx T x = i x i The dot product is also referred to as

More information

A Generalization of a result of Catlin: 2-factors in line graphs

A Generalization of a result of Catlin: 2-factors in line graphs AUSTRALASIAN JOURNAL OF COMBINATORICS Volume 72(2) (2018), Pages 164 184 A Generalization of a result of Catlin: 2-factors in line graphs Ronald J. Gould Emory University Atlanta, Georgia U.S.A. rg@mathcs.emory.edu

More information

Single-Image-Based Rain and Snow Removal Using Multi-guided Filter

Single-Image-Based Rain and Snow Removal Using Multi-guided Filter Single-Image-Based Rain and Snow Removal Using Multi-guided Filter Xianhui Zheng 1, Yinghao Liao 1,,WeiGuo 2, Xueyang Fu 2, and Xinghao Ding 2 1 Department of Electronic Engineering, Xiamen University,

More information

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig Multimedia Databases Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 4 Previous Lecture Texture-Based Image Retrieval Low

More information

Basic Principles of Video Coding

Basic Principles of Video Coding Basic Principles of Video Coding Introduction Categories of Video Coding Schemes Information Theory Overview of Video Coding Techniques Predictive coding Transform coding Quantization Entropy coding Motion

More information

BASICS OF COMPRESSION THEORY

BASICS OF COMPRESSION THEORY BASICS OF COMPRESSION THEORY Why Compression? Task: storage and transport of multimedia information. E.g.: non-interlaced HDTV: 0x0x0x = Mb/s!! Solutions: Develop technologies for higher bandwidth Find

More information

Some Classes of Invertible Matrices in GF(2)

Some Classes of Invertible Matrices in GF(2) Some Classes of Invertible Matrices in GF() James S. Plank Adam L. Buchsbaum Technical Report UT-CS-07-599 Department of Electrical Engineering and Computer Science University of Tennessee August 16, 007

More information

SCALABLE AUDIO CODING USING WATERMARKING

SCALABLE AUDIO CODING USING WATERMARKING SCALABLE AUDIO CODING USING WATERMARKING Mahmood Movassagh Peter Kabal Department of Electrical and Computer Engineering McGill University, Montreal, Canada Email: {mahmood.movassagh@mail.mcgill.ca, peter.kabal@mcgill.ca}

More information

Lecture 8: Interest Point Detection. Saad J Bedros

Lecture 8: Interest Point Detection. Saad J Bedros #1 Lecture 8: Interest Point Detection Saad J Bedros sbedros@umn.edu Last Lecture : Edge Detection Preprocessing of image is desired to eliminate or at least minimize noise effects There is always tradeoff

More information

Multimedia Databases. 4 Shape-based Features. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis

Multimedia Databases. 4 Shape-based Features. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis 4 Shape-based Features Multimedia Databases Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 4 Multiresolution Analysis

More information

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials Philipp Krähenbühl and Vladlen Koltun Stanford University Presenter: Yuan-Ting Hu 1 Conditional Random Field (CRF) E x I = φ u

More information

TUTORIAL PART 1 Unsupervised Learning

TUTORIAL PART 1 Unsupervised Learning TUTORIAL PART 1 Unsupervised Learning Marc'Aurelio Ranzato Department of Computer Science Univ. of Toronto ranzato@cs.toronto.edu Co-organizers: Honglak Lee, Yoshua Bengio, Geoff Hinton, Yann LeCun, Andrew

More information

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5

Lecture 2: Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments. Tutorial 1. Acknowledgement and References for lectures 1 to 5 Lecture : Introduction to Audio, Video & Image Coding Techniques (I) -- Fundaments Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP959 Multimedia Systems S 006 jzhang@cse.unsw.edu.au Acknowledgement

More information

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu Dimension Reduction Techniques Presented by Jie (Jerry) Yu Outline Problem Modeling Review of PCA and MDS Isomap Local Linear Embedding (LLE) Charting Background Advances in data collection and storage

More information

+ (50% contribution by each member)

+ (50% contribution by each member) Image Coding using EZW and QM coder ECE 533 Project Report Ahuja, Alok + Singh, Aarti + + (50% contribution by each member) Abstract This project involves Matlab implementation of the Embedded Zerotree

More information

Hu and Zernike Moments for Sign Language Recognition

Hu and Zernike Moments for Sign Language Recognition Hu and Zernike Moments for Sign Language Recognition K. C. Otiniano-Rodríguez 1, G. Cámara-Chávez 1, and D. Menotti 1 1 Computing Department, Federal University of Ouro Preto, Ouro Preto, MG, Brazil Abstract

More information

Unsupervised Anomaly Detection for High Dimensional Data

Unsupervised Anomaly Detection for High Dimensional Data Unsupervised Anomaly Detection for High Dimensional Data Department of Mathematics, Rowan University. July 19th, 2013 International Workshop in Sequential Methodologies (IWSM-2013) Outline of Talk Motivation

More information

Tennis player segmentation for semantic behavior analysis

Tennis player segmentation for semantic behavior analysis Proposta di Tennis player segmentation for semantic behavior analysis Architettura Software per Robot Mobili Vito Renò, Nicola Mosca, Massimiliano Nitti, Tiziana D Orazio, Donato Campagnoli, Andrea Prati,

More information

RESTORATION OF VIDEO BY REMOVING RAIN

RESTORATION OF VIDEO BY REMOVING RAIN RESTORATION OF VIDEO BY REMOVING RAIN Sajitha Krishnan 1 and D.Venkataraman 1 1 Computer Vision and Image Processing, Department of Computer Science, Amrita Vishwa Vidyapeetham University, Coimbatore,

More information

CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum

CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum 1997 19 CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION 3.0. Introduction

More information

International Research Journal of Engineering and Technology (IRJET) e-issn: Volume: 03 Issue: 11 Nov p-issn:

International Research Journal of Engineering and Technology (IRJET) e-issn: Volume: 03 Issue: 11 Nov p-issn: Analysis of Document using Approach Sahinur Rahman Laskar 1, Bhagaban Swain 2 1,2Department of Computer Science & Engineering Assam University, Silchar, India ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

Review: Learning Bimodal Structures in Audio-Visual Data

Review: Learning Bimodal Structures in Audio-Visual Data Review: Learning Bimodal Structures in Audio-Visual Data CSE 704 : Readings in Joint Visual, Lingual and Physical Models and Inference Algorithms Suren Kumar Vision and Perceptual Machines Lab 106 Davis

More information

CITS 4402 Computer Vision

CITS 4402 Computer Vision CITS 4402 Computer Vision A/Prof Ajmal Mian Adj/A/Prof Mehdi Ravanbakhsh Lecture 06 Object Recognition Objectives To understand the concept of image based object recognition To learn how to match images

More information

Role of Assembling Invariant Moments and SVM in Fingerprint Recognition

Role of Assembling Invariant Moments and SVM in Fingerprint Recognition 56 Role of Assembling Invariant Moments SVM in Fingerprint Recognition 1 Supriya Wable, 2 Chaitali Laulkar 1, 2 Department of Computer Engineering, University of Pune Sinhgad College of Engineering, Pune-411

More information

Data Informatics. Seon Ho Kim, Ph.D.

Data Informatics. Seon Ho Kim, Ph.D. Data Informatics Seon Ho Kim, Ph.D. seonkim@usc.edu What is Machine Learning? Overview slides by ETHEM ALPAYDIN Why Learn? Learn: programming computers to optimize a performance criterion using example

More information

An Efficient Algorithm for Fast Computation of Pseudo-Zernike Moments

An Efficient Algorithm for Fast Computation of Pseudo-Zernike Moments An Efficient Algorithm for Fast Computation of Pseudo-Zernike Moments Chong, Chee-Way Fac. of Engg & Tech., Multimedia University, Jalan Air Keroh Lama, 75450 Melaka, Malaysia. (Email:cwayc@pc.jaring.my)

More information

- An Image Coding Algorithm

- An Image Coding Algorithm - An Image Coding Algorithm Shufang Wu http://www.sfu.ca/~vswu vswu@cs.sfu.ca Friday, June 14, 2002 22-1 Agenda Overview Discrete Wavelet Transform Zerotree Coding of Wavelet Coefficients Successive-Approximation

More information

Frequency offset and I/Q imbalance compensation for Direct-Conversion receivers

Frequency offset and I/Q imbalance compensation for Direct-Conversion receivers Frequency offset and I/Q imbalance compensation for Direct-Conversion receivers Hung Tao Hsieh Department of Communication Engineering National Chiao Tung University May 12th, 2009 1 Reference M.Valkama,

More information

Digital Image Processing

Digital Image Processing Digital Image Processing, 2nd ed. Digital Image Processing Chapter 7 Wavelets and Multiresolution Processing Dr. Kai Shuang Department of Electronic Engineering China University of Petroleum shuangkai@cup.edu.cn

More information

Efficient Spatial Data Structure for Multiversion Management of Engineering Drawings

Efficient Spatial Data Structure for Multiversion Management of Engineering Drawings Efficient Structure for Multiversion Management of Engineering Drawings Yasuaki Nakamura Department of Computer and Media Technologies, Hiroshima City University Hiroshima, 731-3194, Japan and Hiroyuki

More information

Lecture 8: Interest Point Detection. Saad J Bedros

Lecture 8: Interest Point Detection. Saad J Bedros #1 Lecture 8: Interest Point Detection Saad J Bedros sbedros@umn.edu Review of Edge Detectors #2 Today s Lecture Interest Points Detection What do we mean with Interest Point Detection in an Image Goal:

More information

SYDE 575: Introduction to Image Processing. Image Compression Part 2: Variable-rate compression

SYDE 575: Introduction to Image Processing. Image Compression Part 2: Variable-rate compression SYDE 575: Introduction to Image Processing Image Compression Part 2: Variable-rate compression Variable-rate Compression: Transform-based compression As mentioned earlier, we wish to transform image data

More information

MODERN video coding standards, such as H.263, H.264,

MODERN video coding standards, such as H.263, H.264, 146 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 1, JANUARY 2006 Analysis of Multihypothesis Motion Compensated Prediction (MHMCP) for Robust Visual Communication Wei-Ying

More information

Simultaneous Multi-frame MAP Super-Resolution Video Enhancement using Spatio-temporal Priors

Simultaneous Multi-frame MAP Super-Resolution Video Enhancement using Spatio-temporal Priors Simultaneous Multi-frame MAP Super-Resolution Video Enhancement using Spatio-temporal Priors Sean Borman and Robert L. Stevenson Department of Electrical Engineering, University of Notre Dame Notre Dame,

More information

A Human Behavior Recognition Method Based on Latent Semantic Analysis

A Human Behavior Recognition Method Based on Latent Semantic Analysis Journal of Information Hiding and Multimedia Signal Processing c 2016 ISSN 2073-4212 Ubiquitous International Volume 7, Number 3, May 2016 A Human Behavior Recognition Method Based on Latent Semantic Analysis

More information

4. Quantization and Data Compression. ECE 302 Spring 2012 Purdue University, School of ECE Prof. Ilya Pollak

4. Quantization and Data Compression. ECE 302 Spring 2012 Purdue University, School of ECE Prof. Ilya Pollak 4. Quantization and Data Compression ECE 32 Spring 22 Purdue University, School of ECE Prof. What is data compression? Reducing the file size without compromising the quality of the data stored in the

More information

Global Scene Representations. Tilke Judd

Global Scene Representations. Tilke Judd Global Scene Representations Tilke Judd Papers Oliva and Torralba [2001] Fei Fei and Perona [2005] Labzebnik, Schmid and Ponce [2006] Commonalities Goal: Recognize natural scene categories Extract features

More information

Linear models: the perceptron and closest centroid algorithms. D = {(x i,y i )} n i=1. x i 2 R d 9/3/13. Preliminaries. Chapter 1, 7.

Linear models: the perceptron and closest centroid algorithms. D = {(x i,y i )} n i=1. x i 2 R d 9/3/13. Preliminaries. Chapter 1, 7. Preliminaries Linear models: the perceptron and closest centroid algorithms Chapter 1, 7 Definition: The Euclidean dot product beteen to vectors is the expression d T x = i x i The dot product is also

More information

CALCULATION OF STEAM AND WATER RELATIVE PERMEABILITIES USING FIELD PRODUCTION DATA, WITH LABORATORY VERIFICATION

CALCULATION OF STEAM AND WATER RELATIVE PERMEABILITIES USING FIELD PRODUCTION DATA, WITH LABORATORY VERIFICATION CALCULATION OF STEAM AND WATER RELATIVE PERMEABILITIES USING FIELD PRODUCTION DATA, WITH LABORATORY VERIFICATION Jericho L. P. Reyes, Chih-Ying Chen, Keen Li and Roland N. Horne Stanford Geothermal Program,

More information

Image Compression. 1. Introduction. Greg Ames Dec 07, 2002

Image Compression. 1. Introduction. Greg Ames Dec 07, 2002 Image Compression Greg Ames Dec 07, 2002 Abstract Digital images require large amounts of memory to store and, when retrieved from the internet, can take a considerable amount of time to download. The

More information

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18,   ISSN International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, www.ijcea.com ISSN 2321-3469 CONTENT-BASED IMAGE RETRIEVAL USING ZERNIKE MOMENTS AND SURF Priyanka

More information

VIDEO SYNCHRONIZATION VIA SPACE-TIME INTEREST POINT DISTRIBUTION. Jingyu Yan and Marc Pollefeys

VIDEO SYNCHRONIZATION VIA SPACE-TIME INTEREST POINT DISTRIBUTION. Jingyu Yan and Marc Pollefeys VIDEO SYNCHRONIZATION VIA SPACE-TIME INTEREST POINT DISTRIBUTION Jingyu Yan and Marc Pollefeys {yan,marc}@cs.unc.edu The University of North Carolina at Chapel Hill Department of Computer Science Chapel

More information

URL: < >

URL:   < > Citation: Wu, Yongle, Qu, Meijun, Liu, Yuanan and Ghassemlooy, Zabih (217) A Broadband Graphene-Based THz Coupler ith Wide-Range Tunable Poer-Dividing Ratios. Plasmonics, (5). pp. 17-192. ISSN 1557-1955

More information

A Modified Moment-Based Image Watermarking Method Robust to Cropping Attack

A Modified Moment-Based Image Watermarking Method Robust to Cropping Attack A Modified Moment-Based Watermarking Method Robust to Cropping Attack Tianrui Zong, Yong Xiang, and Suzan Elbadry School of Information Technology Deakin University, Burwood Campus Melbourne, Australia

More information

ROBUST LINEAR DISCRIMINANT ANALYSIS WITH A LAPLACIAN ASSUMPTION ON PROJECTION DISTRIBUTION

ROBUST LINEAR DISCRIMINANT ANALYSIS WITH A LAPLACIAN ASSUMPTION ON PROJECTION DISTRIBUTION ROBUST LINEAR DISCRIMINANT ANALYSIS WITH A LAPLACIAN ASSUMPTION ON PROJECTION DISTRIBUTION Shujian Yu, Zheng Cao, Xiubao Jiang Department of Electrical and Computer Engineering, University of Florida 0

More information

MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING

MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING MATCHING-PURSUIT DICTIONARY PRUNING FOR MPEG-4 VIDEO OBJECT CODING Yannick Morvan, Dirk Farin University of Technology Eindhoven 5600 MB Eindhoven, The Netherlands email: {y.morvan;d.s.farin}@tue.nl Peter

More information

Wavelets and Multiresolution Processing

Wavelets and Multiresolution Processing Wavelets and Multiresolution Processing Wavelets Fourier transform has it basis functions in sinusoids Wavelets based on small waves of varying frequency and limited duration In addition to frequency,

More information

Robust Network Codes for Unicast Connections: A Case Study

Robust Network Codes for Unicast Connections: A Case Study Robust Network Codes for Unicast Connections: A Case Study Salim Y. El Rouayheb, Alex Sprintson, and Costas Georghiades Department of Electrical and Computer Engineering Texas A&M University College Station,

More information

Multimedia analysis and retrieval

Multimedia analysis and retrieval Multimedia analysis and retrieval from the masses, and for the masses Lexing Xie IBM T J Watson Research Center xlx@us.ibm.com ICIP-MIR workshop, Oct 12, 2008 2008 IBM Research multimedia information analysis

More information

Boosting: Algorithms and Applications

Boosting: Algorithms and Applications Boosting: Algorithms and Applications Lecture 11, ENGN 4522/6520, Statistical Pattern Recognition and Its Applications in Computer Vision ANU 2 nd Semester, 2008 Chunhua Shen, NICTA/RSISE Boosting Definition

More information