Singlets. Multi-resolution Motion Singularities for Soccer Video Abstraction. Katy Blanc, Diane Lingrand, Frederic Precioso


1 Singlets: Multi-resolution Motion Singularities for Soccer Video Abstraction. Katy Blanc, Diane Lingrand, Frederic Precioso. I3S Laboratory, Sophia Antipolis, France.

2 Overview: Video & Motion; Singularities & Singlets; Soccer Salient Moments.

3 Video Analysis. Burst of video content production: new sources of videos and big databases such as YouTube-8M [1]. Much-analyzed types: meetings/conferences, movies, news and sports. Diverse applications: browsing in databases, automatic video surveillance, driverless cars, ... Exponential amount of information: a soccer match consists of [...] HDTV images of [...] pixels each. Collaboration with Wildmoka, itself in collaboration with INA and BeIN.

4 Related works: video description. Modelling human motion [2]; handcrafted features: STIP [3], iDT [4]; deep learning representations [5].

5 Related works: sport abstraction. Clue detection to detect highlights: ground color, jersey color, shot segmentation and view classification. Ye et al. [6]: line mark positions and shot detection. Zawbaa et al. [7]: goals, attacks and other events using logo and score appearances and goal mouth position. Raventos et al. [8]: face and skin detection, whistle detector and user specifications.

6 Overview: Video & Motion; Singularities & Singlets; Soccer Salient Moments.

7 Inspiration: fluid movement. Inspired by the work of Druon et al. [9] and the further work of Kihl et al. [10].

8 Optical Flow Approximation. Optical flow = discrete bivariate vector field $F: \Omega \to \mathbb{R}^2$, $(x_1, x_2) \mapsto (U(x_1, x_2), V(x_1, x_2))$, with $\Omega = [-1, 1]^2$. Polynomial subspace and the Legendre basis: $P_{K,L}(x_1, x_2) = \sum_{k=0}^{K} \sum_{l=0}^{L} \gamma_{k,l}\, x_1^k x_2^l$. Projection on the Legendre basis with $K + L \le D$; for example, keeping the terms of degree at most 1: $U = u_{0,0} P_{0,0} + u_{0,1} P_{0,1} + u_{1,0} P_{1,0}$ and $V = v_{0,0} P_{0,0} + v_{0,1} P_{0,1} + v_{1,0} P_{1,0}$.
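The degree-1 projection above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the flow component is given as a function on $\Omega = [-1, 1]^2$, the scalar product is approximated with a midpoint rule, $P_{k,l}(x_1, x_2)$ is taken as the product of orthonormal 1D Legendre polynomials in $x_1$ and $x_2$, and the names `legendre_1d`, `project_degree1` and `reconstruct` are hypothetical.

```python
import math

def legendre_1d(n, x):
    """Orthonormal Legendre polynomial of degree n (0 or 1) on [-1, 1]."""
    if n == 0:
        return math.sqrt(0.5)
    if n == 1:
        return math.sqrt(1.5) * x
    raise ValueError("this sketch only covers degrees 0 and 1")

def project_degree1(field, size=50):
    """Project one flow component (U or V) onto P00, P01 and P10.

    field(x1, x2) returns the component value at a point of [-1, 1]^2.
    size is the number of midpoint samples per axis (an assumption).
    Returns a dict {(k, l): coefficient}.
    """
    h = 2.0 / size
    pts = [-1.0 + (i + 0.5) * h for i in range(size)]
    coeffs = {}
    for k, l in [(0, 0), (0, 1), (1, 0)]:
        total = 0.0
        for x1 in pts:
            for x2 in pts:
                total += field(x1, x2) * legendre_1d(k, x1) * legendre_1d(l, x2)
        coeffs[(k, l)] = total * h * h  # midpoint approximation of the integral
    return coeffs

def reconstruct(coeffs, x1, x2):
    """Evaluate the projected (approximated) component at (x1, x2)."""
    return sum(c * legendre_1d(k, x1) * legendre_1d(l, x2)
               for (k, l), c in coeffs.items())

# An affine component is recovered almost exactly by the degree-1 projection.
u_coeffs = project_degree1(lambda x1, x2: 2.0 + 3.0 * x1 - x2)
approx = reconstruct(u_coeffs, 0.5, -0.5)
```

With this normalisation, the coefficient on $P_{0,0}$ is proportional to the mean of the component over the window, which is why it can be read as a global displacement.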

9 Polynomial projection. [Figure: original flow F = (U, V), shown as Flow U and Flow V, with the projection of each component.]

10 Approximation and coefficient analysis. From a simple analysis example to production, scenarization and event semantization. [Plot: coefficient value against frame number during a counterattack. Curves: $u_{0,0}$ (horizontal global displacement), $v_{0,0}$ (vertical global displacement), and the horizontal and vertical global positions (also labelled $u_{0,0}$ and $v_{0,0}$).]

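11 Singularity. First, projection on the Legendre basis: $U = u_{0,0} P_{0,0} + u_{0,1} P_{0,1} + u_{1,0} P_{1,0}$ and $V = v_{0,0} P_{0,0} + v_{0,1} P_{0,1} + v_{1,0} P_{1,0}$. Then on the canonical basis: $\begin{pmatrix} U \\ V \end{pmatrix} = A \begin{pmatrix} x_1 \\ x_2 \end{pmatrix} + b$ with $A \in M_{2,2}$ and $b \in \mathbb{R}^2$. 6 types of singularities, determined by $\Delta_A = \mathrm{tr}(A)^2 - 4 \det(A)$ and by $\lambda_1$ and $\lambda_2$, the eigenvalues of $A$.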
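A minimal sketch of how the affine model could be classified from $\Delta_A$, the trace and the determinant. The slides only name the star node and the improper node; the other four labels used here (node, saddle, center, spiral) are the usual phase-portrait names and are an assumption, as are the function name `classify_singularity` and the tolerance `tol`.

```python
def classify_singularity(A, b, tol=1e-9):
    """Classify the singular point of the affine flow x -> A x + b.

    A is ((a11, a12), (a21, a22)) and b is (b1, b2).
    Returns (kind, (x1, x2)), or None when A is singular and there is
    no isolated point where the flow vanishes.
    """
    (a11, a12), (a21, a22) = A
    tr = a11 + a22
    det = a11 * a22 - a12 * a21
    if abs(det) <= tol:
        return None
    # Singular point: solve A x + b = 0, i.e. x = -A^{-1} b.
    x1 = (-a22 * b[0] + a12 * b[1]) / det
    x2 = (a21 * b[0] - a11 * b[1]) / det
    delta = tr * tr - 4.0 * det  # discriminant of the characteristic polynomial
    if delta > tol:
        # Two distinct real eigenvalues.
        kind = "saddle" if det < 0 else "node"
    elif delta < -tol:
        # Complex conjugate eigenvalues.
        kind = "center" if abs(tr) <= tol else "spiral"
    else:
        # Double real eigenvalue: star node only if A is a scalar matrix.
        is_scalar = abs(a12) <= tol and abs(a21) <= tol
        kind = "star node" if is_scalar else "improper node"
    return kind, (x1, x2)

star = classify_singularity(((1.0, 0.0), (0.0, 1.0)), (0.0, 0.0))
node = classify_singularity(((2.0, 0.0), (0.0, 1.0)), (-1.0, 0.5))
```

A caller would typically also check that the returned point lies inside $\Omega = [-1, 1]^2$ before keeping it as a singularity of the current window.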

12 Singularities extraction. From a multi-resolution analysis of the optical flow: $\begin{pmatrix} U \\ V \end{pmatrix} = A \begin{pmatrix} x_1 \\ x_2 \end{pmatrix} + b$.

13 Singlets: matching singularities over time. From singularities on individual optical-flow frames to tracks of singularities: singlets.
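The slides do not say how singularities are matched from one frame to the next. As a purely illustrative sketch, the following links each track to the nearest singularity of the same type in the next frame, within a maximum distance; the greedy strategy, the `max_dist` value and the name `link_singlets` are all assumptions.

```python
import math

def link_singlets(frames, max_dist=0.1):
    """Link per-frame singularities into tracks.

    frames: list (one entry per frame) of lists of (x1, x2, kind).
    Returns a list of tracks, each a list of (frame_index, x1, x2, kind).
    """
    finished, active = [], []
    for t, detections in enumerate(frames):
        unused = list(detections)
        still_active = []
        for track in active:
            _, px, py, pkind = track[-1]
            best, best_d = None, max_dist
            for det in unused:
                d = math.hypot(det[0] - px, det[1] - py)
                if det[2] == pkind and d <= best_d:
                    best, best_d = det, d
            if best is None:
                finished.append(track)  # the track ends here
            else:
                unused.remove(best)
                track.append((t, best[0], best[1], best[2]))
                still_active.append(track)
        # Every unmatched detection starts a new track.
        for det in unused:
            still_active.append([(t, det[0], det[1], det[2])])
        active = still_active
    return finished + active

tracks = link_singlets([
    [(0.00, 0.0, "star node")],
    [(0.02, 0.0, "star node"), (0.8, 0.8, "saddle")],
    [(0.05, 0.0, "star node")],
])
track_lengths = sorted(len(track) for track in tracks)
```

The track lengths computed at the end are the kind of quantity a length histogram would be built from.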

14 Overview: Video & Motion; Singularities & Singlets; Soccer Salient Moments.

15 Application on soccer abstraction. Our database; zoom; slow motion; global excitement; soccer salient moment.

16 Soccer Summarization: our database. Lack of a standard benchmark for comparison's sake. Matches: Germany vs Portugal, Nigeria vs Argentina, France vs Honduras, Switzerland vs France.

17 Soccer Summarization: our database.

18 Zoom detection. Zoom motion is a pure star node singularity; zoom combined with a translation is an improper node. Conditions: there is a singularity; it is a star or improper node ($\Delta_A = 0$); it is time consistent (lasts for one second). Positive eigenvalues -> zoom out; negative eigenvalues -> zoom in. Determine the zoom direction and center.

19 Zoom detection. In practice the condition $\Delta_A = 0$ is relaxed: the average of $\Delta_A$ over one second is thresholded, with the threshold set to 0.2. Comparison: - global motion estimation [6, 11]; - Duan et al. method [12]: two histograms, one on magnitudes and one on angles, to detect diagonal patterns (DL).
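The zoom conditions above can be sketched as a small decision function. Assumptions not taken from the slides: a frame rate of 25 frames per second, a test of the form |mean of $\Delta_A$| <= 0.2 (the slides give the threshold but not the exact comparison), and the use of the mean trace to read the sign of the (nearly double) eigenvalue. The name `detect_zoom` is hypothetical.

```python
def detect_zoom(matrices, fps=25, threshold=0.2):
    """Decide whether the last second of frames shows a zoom.

    matrices: one entry per frame, either ((a11, a12), (a21, a22)) for the
    affine matrix A of the detected singularity, or None when no
    singularity was found in that frame.
    Returns "zoom in", "zoom out" or None.
    """
    if len(matrices) < fps:
        return None
    window = matrices[-fps:]
    if any(A is None for A in window):
        return None  # the singularity must be present during the whole second
    deltas, traces = [], []
    for (a11, a12), (a21, a22) in window:
        tr = a11 + a22
        det = a11 * a22 - a12 * a21
        deltas.append(tr * tr - 4.0 * det)
        traces.append(tr)
    if abs(sum(deltas) / fps) > threshold:
        return None  # not close enough to a star or improper node
    mean_tr = sum(traces) / fps
    if mean_tr == 0:
        return None
    # When Delta_A is near 0 both eigenvalues are near tr / 2.
    # Sign convention from the slides: positive -> zoom out, negative -> zoom in.
    return "zoom out" if mean_tr > 0 else "zoom in"

result_in = detect_zoom([((-0.5, 0.0), (0.0, -0.5))] * 25)
result_out = detect_zoom([((0.5, 0.0), (0.0, 0.5))] * 25)
```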

20 Slow Motion Detection. How to differentiate a fast motion that has been artificially slowed down from a genuinely slow motion?

21 Slow Motion Detection. [Figure: two panels, Fast Motion and Slow Motion.]

22 Global excitement. Spatial histogram of 3x3 per frame; sum of the spatial histograms over 10 frames; threshold set to 1500 to select the most agitated moments.
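A rough sketch of this accumulation. The slides do not state what is counted in each cell or whether the threshold applies to a single bin or to the whole histogram; here each frame is assumed to provide weighted points in $\Omega = [-1, 1]^2$ and the total of the summed 3x3 histogram is compared with the threshold. The function names are hypothetical.

```python
def spatial_histogram(points, bins=3):
    """Accumulate weighted points (x1, x2, weight) of [-1, 1]^2 in a bins x bins grid."""
    hist = [[0.0] * bins for _ in range(bins)]
    for x1, x2, weight in points:
        col = min(int((x1 + 1.0) / 2.0 * bins), bins - 1)
        row = min(int((x2 + 1.0) / 2.0 * bins), bins - 1)
        hist[row][col] += weight
    return hist

def agitated_windows(frames, window=10, threshold=1500.0, bins=3):
    """Return (start_frame, summed_histogram) for each agitated window.

    frames: one list of weighted points per frame.
    A window is kept when the total of its summed histogram exceeds threshold.
    """
    per_frame = [spatial_histogram(points, bins) for points in frames]
    flagged = []
    for start in range(len(per_frame) - window + 1):
        chunk = per_frame[start:start + window]
        summed = [[sum(h[r][c] for h in chunk) for c in range(bins)]
                  for r in range(bins)]
        if sum(sum(row) for row in summed) > threshold:
            flagged.append((start, summed))
    return flagged

busy = agitated_windows([[(0.0, 0.0, 200.0)]] * 10)
calm = agitated_windows([[(0.0, 0.0, 100.0)]] * 10)
```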

23 Soccer Salient Moment. Within 30 seconds: - at least two zoom direction changes; - an activity peak higher than 1500 in the farthest view; - a slow motion replay in a close-up view.
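The three conditions can be combined as in the sketch below. The event representation (dictionaries with `time`, `type`, `direction`, `view` and `level` keys) and the name `is_salient` are assumptions made for the illustration; the detectors that would produce these events are the ones described on the previous slides.

```python
def is_salient(events, start, duration=30.0, activity_threshold=1500.0):
    """Check the salient-moment rule on the window [start, start + duration).

    events: list of dicts with a "time" (seconds) and a "type" among
    "zoom" (with "direction": "in" or "out"),
    "activity" (with "level" and "view": "far" or "close"),
    "slow_motion" (with "view").
    """
    window = sorted(
        (e for e in events if start <= e["time"] < start + duration),
        key=lambda e: e["time"],
    )
    # At least two changes of zoom direction.
    zooms = [e["direction"] for e in window if e["type"] == "zoom"]
    changes = sum(1 for a, b in zip(zooms, zooms[1:]) if a != b)
    # An activity peak above the threshold in the farthest view.
    peak = any(
        e["type"] == "activity" and e["view"] == "far"
        and e["level"] > activity_threshold
        for e in window
    )
    # A slow motion replay in a close-up view.
    replay = any(
        e["type"] == "slow_motion" and e["view"] == "close" for e in window
    )
    return changes >= 2 and peak and replay

sample_events = [
    {"time": 1.0, "type": "zoom", "direction": "in"},
    {"time": 3.0, "type": "activity", "view": "far", "level": 1800.0},
    {"time": 5.0, "type": "zoom", "direction": "out"},
    {"time": 9.0, "type": "zoom", "direction": "in"},
    {"time": 20.0, "type": "slow_motion", "view": "close"},
]
salient_at_0 = is_salient(sample_events, start=0.0)
salient_at_15 = is_salient(sample_events, start=15.0)
```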

24 Example of Summarization. [Figure: zooms, agitation, slow motion.]

25 Extension to other sports: set and trained on soccer, tested on handball. All hyperparameters, parameters and SVMs were trained and set on soccer videos, and we test our framework on 5 minutes of the Qatar Handball World Championship final without any adjustment or retraining.

26 Conclusion. Optical flow approximation: possibility to use - a different basis, - a different degree, - a space other than the polynomial one. Singularity extraction: vanishing point; 6 types for the affine approximation; motion description; global distribution. Singlets: track singularities; length histograms; compute a description as for iDT. Sport video abstraction: zoom detection; slow motion detection; global excitement.

27 Because there is not only soccer in life: abstraction of concerts, facial emotion, action recognition.

28 Vision in Sports: CVSports

29 References
[1] S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, and S. Vijayanarasimhan. YouTube-8M: A large-scale video classification benchmark. CoRR.
[2] L. Gorelick et al. Actions as space-time shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007.
[3] I. Laptev. On space-time interest points. Int. J. Comput. Vision, 64(2-3), Sept.
[4] H. Wang, A. Kläser, C. Schmid, and C.-L. Liu. Action recognition by dense trajectories. In IEEE Conference on Computer Vision & Pattern Recognition, Colorado Springs, United States, June.
[5] D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Learning spatiotemporal features with 3D convolutional networks. In Proceedings of the IEEE International Conference on Computer Vision.
[6] Q. Ye, Q. Huang, W. Gao, and S. Jiang. Exciting event detection in broadcast soccer video with mid-level description and incremental learning. In Proceedings of the 13th annual ACM international conference on Multimedia.
[7] H. M. Zawbaa, N. El-Bendary, A. E. Hassanien, and T. Kim. Event detection based approach for soccer video summarization using machine learning. International Journal of Multimedia and Ubiquitous Engineering, 2012.
[8] A. Raventos, R. Quijada, L. Torres, and F. Tarres. Automatic summarization of soccer highlights using audio-visual descriptors. arXiv preprint.
[9] M. Druon. Modélisation du mouvement par polynômes orthogonaux: application à l'étude d'écoulements fluides. PhD dissertation, Université de Poitiers.
[10] O. Kihl, B. Tremblais, and B. Augereau. Multivariate orthogonal polynomials to extract singular points. In IEEE International Conference on Image Processing (ICIP 2008), San Diego, CA, United States, Oct.
[11] X. Qian. Global Motion Estimation and Its Applications. INTECH Open Access Publisher.
[12] L.-Y. Duan, M. Xu, Q. Tian, C.-S. Xu, and J. S. Jin. A unified framework for semantic shot classification in sports video. IEEE Transactions on Multimedia.

30 Optical Flow Extraction: Gunnar Farnebäck method. Pixel neighborhood approximation; approximation translation; relations; speed vector estimation.

31 Polynomial Space Projection. Scalar product with weight $w$; Legendre basis: $w(x_1, x_2) = 1$ and $\Omega = [-1, 1]^2$. Polynomial degree $D$: $n_D = \frac{(D+1)(D+2)}{2}$ polynomials in the basis, and so $2 n_D$ coefficients (one set for $U$ and one for $V$). Optical flow blurring (decreasing with the degree).
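As a quick check of the basis size, the sketch below enumerates the index pairs $(k, l)$ under the assumption that the basis holds one polynomial per pair with $k + l \le D$, which gives $(D+1)(D+2)/2$ polynomials. The name `basis_indices` is hypothetical.

```python
def basis_indices(D):
    """Index pairs (k, l) of the bivariate polynomials of total degree <= D."""
    return [(k, l) for k in range(D + 1) for l in range(D + 1 - k)]

# Degree 1 gives the three polynomials P00, P01, P10 of the affine model.
affine_basis = basis_indices(1)
sizes = {D: len(basis_indices(D)) for D in range(5)}
```

Two coefficient sets of that size are needed, one for $U$ and one for $V$.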

32 Multi-Resolution Singularity. One sliding window -> possibly one singularity. The same singularity can appear at different sizes and approximately the same position: we keep the one with the least angular deviation from the original flow, $\mathrm{dev} = \sum_{\omega \in \Omega} \frac{1}{2} \sin(\theta_\omega - \hat{\theta}_\omega)$, with $\theta_\omega$ the angle of the motion vector at the pixel position $\omega$ for the original flow and $\hat{\theta}_\omega$ for the polynomial approximation.


Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features G. Y. Chen and B. Kégl Department of Computer Science and Operations Research, University of Montreal, CP 6128 succ.

More information

Scale-Invariance of Support Vector Machines based on the Triangular Kernel. Abstract

Scale-Invariance of Support Vector Machines based on the Triangular Kernel. Abstract Scale-Invariance of Support Vector Machines based on the Triangular Kernel François Fleuret Hichem Sahbi IMEDIA Research Group INRIA Domaine de Voluceau 78150 Le Chesnay, France Abstract This paper focuses

More information

Affine Structure From Motion

Affine Structure From Motion EECS43-Advanced Computer Vision Notes Series 9 Affine Structure From Motion Ying Wu Electrical Engineering & Computer Science Northwestern University Evanston, IL 68 yingwu@ece.northwestern.edu Contents

More information

Machine Learning for Computer Vision 8. Neural Networks and Deep Learning. Vladimir Golkov Technical University of Munich Computer Vision Group

Machine Learning for Computer Vision 8. Neural Networks and Deep Learning. Vladimir Golkov Technical University of Munich Computer Vision Group Machine Learning for Computer Vision 8. Neural Networks and Deep Learning Vladimir Golkov Technical University of Munich Computer Vision Group INTRODUCTION Nonlinear Coordinate Transformation http://cs.stanford.edu/people/karpathy/convnetjs/

More information

Linear Algebra & Geometry why is linear algebra useful in computer vision?

Linear Algebra & Geometry why is linear algebra useful in computer vision? Linear Algebra & Geometry why is linear algebra useful in computer vision? References: -Any book on linear algebra! -[HZ] chapters 2, 4 Some of the slides in this lecture are courtesy to Prof. Octavia

More information

Machine vision, spring 2018 Summary 4

Machine vision, spring 2018 Summary 4 Machine vision Summary # 4 The mask for Laplacian is given L = 4 (6) Another Laplacian mask that gives more importance to the center element is given by L = 8 (7) Note that the sum of the elements in the

More information

Ragav Venkatesan, 2 Christine Zwart, 2,3 David Frakes, 1 Baoxin Li

Ragav Venkatesan, 2 Christine Zwart, 2,3 David Frakes, 1 Baoxin Li 1,3 Ragav Venkatesan, 2 Christine Zwart, 2,3 David Frakes, 1 Baoxin Li 1 School of Computing Informatics and Decision Systems Engineering, Arizona State University, Tempe, AZ, USA 2 School of Biological

More information

Wavelet-based Salient Points with Scale Information for Classification

Wavelet-based Salient Points with Scale Information for Classification Wavelet-based Salient Points with Scale Information for Classification Alexandra Teynor and Hans Burkhardt Department of Computer Science, Albert-Ludwigs-Universität Freiburg, Germany {teynor, Hans.Burkhardt}@informatik.uni-freiburg.de

More information

FPGA Implementation of a HOG-based Pedestrian Recognition System

FPGA Implementation of a HOG-based Pedestrian Recognition System MPC Workshop Karlsruhe 10/7/2009 FPGA Implementation of a HOG-based Pedestrian Recognition System Sebastian Bauer sebastian.bauer@fh-aschaffenburg.de Laboratory for Pattern Recognition and Computational

More information

On The Role Of Head Motion In Affective Expression

On The Role Of Head Motion In Affective Expression On The Role Of Head Motion In Affective Expression Atanu Samanta, Tanaya Guha March 9, 2017 Department of Electrical Engineering Indian Institute of Technology, Kanpur, India Introduction Applications

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

Conjugate gradient acceleration of non-linear smoothing filters Iterated edge-preserving smoothing

Conjugate gradient acceleration of non-linear smoothing filters Iterated edge-preserving smoothing Cambridge, Massachusetts Conjugate gradient acceleration of non-linear smoothing filters Iterated edge-preserving smoothing Andrew Knyazev (knyazev@merl.com) (speaker) Alexander Malyshev (malyshev@merl.com)

More information

38 1 Vol. 38, No ACTA AUTOMATICA SINICA January, Bag-of-phrases.. Image Representation Using Bag-of-phrases

38 1 Vol. 38, No ACTA AUTOMATICA SINICA January, Bag-of-phrases.. Image Representation Using Bag-of-phrases 38 1 Vol. 38, No. 1 2012 1 ACTA AUTOMATICA SINICA January, 2012 Bag-of-phrases 1, 2 1 1 1, Bag-of-words,,, Bag-of-words, Bag-of-phrases, Bag-of-words DOI,, Bag-of-words, Bag-of-phrases, SIFT 10.3724/SP.J.1004.2012.00046

More information

CITS 4402 Computer Vision

CITS 4402 Computer Vision CITS 4402 Computer Vision A/Prof Ajmal Mian Adj/A/Prof Mehdi Ravanbakhsh Lecture 06 Object Recognition Objectives To understand the concept of image based object recognition To learn how to match images

More information

Science Insights: An International Journal

Science Insights: An International Journal Available online at http://www.urpjournals.com Science Insights: An International Journal Universal Research Publications. All rights reserved ISSN 2277 3835 Original Article Object Recognition using Zernike

More information

Digital Trimulus Color Image Enhancing and Quantitative Information Measuring

Digital Trimulus Color Image Enhancing and Quantitative Information Measuring th WSEAS Int. Conference on Computational Intelligence, Man-Machine Systems and Cybernetics, Tenerife, Spain, December -, 007 33 Digital Trimulus Color Enhancing and Quantitative Information Measuring

More information

Face Recognition Using Multi-viewpoint Patterns for Robot Vision

Face Recognition Using Multi-viewpoint Patterns for Robot Vision 11th International Symposium of Robotics Research (ISRR2003), pp.192-201, 2003 Face Recognition Using Multi-viewpoint Patterns for Robot Vision Kazuhiro Fukui and Osamu Yamaguchi Corporate Research and

More information

Anomaly Localization in Topic-based Analysis of Surveillance Videos

Anomaly Localization in Topic-based Analysis of Surveillance Videos Anomaly Localization in Topic-based Analysis of Surveillance Videos Deepak Pathak IIT Kanpur Dept. of Computer Science deepakp@iitk.ac.in Abstract Abhijit Sharang IIT Kanpur Dept. of Computer Science abhisg@iitk.ac.in

More information

A Factorization Method for 3D Multi-body Motion Estimation and Segmentation

A Factorization Method for 3D Multi-body Motion Estimation and Segmentation 1 A Factorization Method for 3D Multi-body Motion Estimation and Segmentation René Vidal Department of EECS University of California Berkeley CA 94710 rvidal@eecs.berkeley.edu Stefano Soatto Dept. of Computer

More information

Representing Visual Appearance by Video Brownian Covariance Descriptor for Human Action Recognition

Representing Visual Appearance by Video Brownian Covariance Descriptor for Human Action Recognition Representing Visual Appearance by Video Brownian Covariance Descriptor for Human Action Recognition Piotr Bilinski, Michal Koperski, Slawomir Bak, Francois Bremond INRIA 2004 Route des Lucioles, BP 93,

More information

Object Detection Grammars

Object Detection Grammars Object Detection Grammars Pedro F. Felzenszwalb and David McAllester February 11, 2010 1 Introduction We formulate a general grammar model motivated by the problem of object detection in computer vision.

More information

Robust Motion Segmentation by Spectral Clustering

Robust Motion Segmentation by Spectral Clustering Robust Motion Segmentation by Spectral Clustering Hongbin Wang and Phil F. Culverhouse Centre for Robotics Intelligent Systems University of Plymouth Plymouth, PL4 8AA, UK {hongbin.wang, P.Culverhouse}@plymouth.ac.uk

More information

Online Appearance Model Learning for Video-Based Face Recognition

Online Appearance Model Learning for Video-Based Face Recognition Online Appearance Model Learning for Video-Based Face Recognition Liang Liu 1, Yunhong Wang 2,TieniuTan 1 1 National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences,

More information

CS5670: Computer Vision

CS5670: Computer Vision CS5670: Computer Vision Noah Snavely Lecture 5: Feature descriptors and matching Szeliski: 4.1 Reading Announcements Project 1 Artifacts due tomorrow, Friday 2/17, at 11:59pm Project 2 will be released

More information

Estimating the Rotation of a Ball Using High Speed Camera

Estimating the Rotation of a Ball Using High Speed Camera 1 2 1 1 1 2 Hough ICP 3DCG 3 6% 3% 6% Estimating the Rotation of a Ball Using High Speed Camera Hiroki Kamada, 1 Yoshinori Takeuchi, 2 Tetsuya Matsumoto, 1 Hiroaki Kudo 1 and Noburu Ohnishi 1 In table

More information

Multi-Layer Boosting for Pattern Recognition

Multi-Layer Boosting for Pattern Recognition Multi-Layer Boosting for Pattern Recognition François Fleuret IDIAP Research Institute, Centre du Parc, P.O. Box 592 1920 Martigny, Switzerland fleuret@idiap.ch Abstract We extend the standard boosting

More information

Artificial Neural Networks D B M G. Data Base and Data Mining Group of Politecnico di Torino. Elena Baralis. Politecnico di Torino

Artificial Neural Networks D B M G. Data Base and Data Mining Group of Politecnico di Torino. Elena Baralis. Politecnico di Torino Artificial Neural Networks Data Base and Data Mining Group of Politecnico di Torino Elena Baralis Politecnico di Torino Artificial Neural Networks Inspired to the structure of the human brain Neurons as

More information

Temporal Factorization Vs. Spatial Factorization

Temporal Factorization Vs. Spatial Factorization Temporal Factorization Vs. Spatial Factorization Lihi Zelnik-Manor 1 and Michal Irani 2 1 California Institute of Technology, Pasadena CA, USA, lihi@caltech.edu, WWW home page: http://www.vision.caltech.edu/lihi

More information

Optical flow. Subhransu Maji. CMPSCI 670: Computer Vision. October 20, 2016

Optical flow. Subhransu Maji. CMPSCI 670: Computer Vision. October 20, 2016 Optical flow Subhransu Maji CMPSC 670: Computer Vision October 20, 2016 Visual motion Man slides adapted from S. Seitz, R. Szeliski, M. Pollefes CMPSC 670 2 Motion and perceptual organization Sometimes,

More information

arxiv: v2 [cs.sd] 7 Feb 2018

arxiv: v2 [cs.sd] 7 Feb 2018 AUDIO SET CLASSIFICATION WITH ATTENTION MODEL: A PROBABILISTIC PERSPECTIVE Qiuqiang ong*, Yong Xu*, Wenwu Wang, Mark D. Plumbley Center for Vision, Speech and Signal Processing, University of Surrey, U

More information