Image Representation Using Bag-of-phrases


ACTA AUTOMATICA SINICA, Vol. 38, No. 1, January 2012

ZHANG Lin-Bo 1, 2   WANG Chun-Heng 1   XIAO Bai-Hua 1   SHAO Yun-Xue 1

Abstract  Bag-of-words representation, in which an image is represented as a histogram of the numbers of occurrences of particular visual words, has demonstrated impressive levels of performance in the past few years. However, the relative position information between the visual words is almost entirely ignored. In this paper, the potential strength of this relative position information is investigated, and a new kind of representation named Bag-of-phrases is proposed. The effectiveness of this strategy is validated on two benchmark databases. The classification results demonstrate that the Bag-of-phrases strategy achieves better results than the Bag-of-words method.

Key words  Image representation, spatial layout, Bag-of-words, Bag-of-phrases, scale-invariant feature transform (SIFT) descriptor

DOI  [...]/SP.J[...]

Manuscript received July 28, 2011; accepted October 9, 2011
Supported by National Natural Science Foundation of China
Recommended by Associate Editor LIU Yi-Jun
1. Key Laboratory of Complex Systems and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing  2. China Academy of Transportation Sciences, Beijing

[...] LBP (local binary pattern) [1], HOG (histogram of oriented gradients) [2], [...] [3]. [...] Neisser [4]: (preattentive stage) and (attentive stage). [...] (pop-out) [...]

[...] [5]. [...] [6-7]. [...]: [...] [8-9], (part-structure) [10-11], [...] [12], [...] [13], [...] [14]. [...] Bag-of-words [8] [...] Zhu [15] [...] Csurka [8] [...] (visual words); Zhu [15] [...] (keyblock). Bag-of-words [...]: [...] (K-means, mean-shift [16]) [...]; [...] (soft assignment [17]) [...]; [...] Bag-of-words [...] (Fig. 1).

Fig. 1  Illustration of Bag-of-words model

[...] Bag-of-words [...] Lazebnik [18] [...] (spatial pyramid) Bag-of-words [...]. [...]: Section 1 [...] Bag-of-words; Section 2 [...] Bag-of-words [...]; Section 3 [...] Bag-of-phrases; Section 4 [...].

1 [...] Bag-of-words [...]

[...] Grauman [...] (spatial pyramid matching, SPM) [19]. [...] (histogram pyramids) [...] Lazebnik [18] [...] Bag-of-words [...] (Fig. 2) [...] Bag-of-words [...] 4 4 [...] 2 [...] L = 1 [...] 2 2 [...]

Fig. 2  Illustration of spatial pyramid Bag-of-words models

[...] Fig. 3(a) [...] Fig. 3(b) [...] Fig. 3(a) [...] Fig. 3(b) [...] Fig. 3(c) [...] Bag-of-words [...] Fig. 3(c) [...] Fig. 3(b) [...] 2 2 [...] Bag-of-words [...]

[...] Bag-of-words [...] [13] [...] (high-level) [...] Bag-of-words [...]; [...] Bag-of-words [...]? [4] [...] Bag-of-words [...]? [13] [...] Bag-of-words [...]

Fig. 3  Spatial pyramid Bag-of-words vectors of different image contents

[...] Bag-of-words [...]: 1) [...]; 2) [...]. [...] Bag-of-words [...] (document) [...] (word) [...] (Fig. 4) [...] Bag-of-words [...]

Fig. 4  The relationship between local features in image and words in document

[...]

3 Bag-of-phrases

[...] (visual words) [...] (visual phrase). [...] Bag-of-words [...] Bag-of-phrases [...]

3.1 Bag-of-phrases [...]

1) [...] $I_i$ [...] $N_i$ [...], $P_i = \{p_{i1}, p_{i2}, \ldots, p_{iN_i}\}$;
2) [...] $p_{ij}$ [...] $f_{ij}$ [...] (visual word) [...] $f_{ij}$ [...] $p_{ij}$ [...];
3) [...] ($f_{ij}$) [...] $s_{ij}$;
4) [...] $p_{ij}$ [...] $f_{ij}$ [...] $s_{ij}$ [...] $v_{ij}$;
5) [...] Bag-of-words [...] (code phrase) [...];
6) [...] Bag-of-phrases [...] visual phrase [...] code phrases [...] code phrase [...] Bag-of-phrases [...] Bag-of-words [...]

3.2 [...]

[...] shape context [20] [...] $I_i$ (Fig. 5(a)) [...]
1) [...] $I_i$ [...] $p_{ij}$ [...] $R$ [...] $O$ (Fig. 5(b)) [...] $O$ [...];
2) [...] $s_{ij}$;
3) [...] $s_{ij}$: [...] $p_{ij}$ [...];
4) [...] $s_{ij}$: [...] $t$-th [...] $1/\delta_t$. [...] $s_{ij}$ [...] $p_{ij}$ [...]

(a) Local features on original image
(b) Spatial layout of neighbor visual words around the center of concentric circles in 4 radius bins and 12 orientation bins
Fig. 5  Modeling spatial layout of visual words

[...] $p_{ij}$ [...] $f_{ij}$ [...] $s_{ij}$ [...] $v_{ij}$. [...] Bag-of-words [...] (code word) [...] Bag-of-words [...] (K-means, mean-shift) [...] Bag-of-phrases [...] Bag-of-words [...] [17].

4 [...]

[...] Bag-of-phrases [...] Caltech-101 [...] PASCAL Visual object challenge (VOC) 2006 [...]:
1) [...] [21] [...] $N = 1\,000$;
2) [...] $p_{ij}$ [...] $f_{ij}$ [...] SIFT [22];
3) [...] Section 3.2 [...] $p_{ij}$ [...] $M$ [...] $O$ [...] 5 [...] 12 [...] 5 [...]: 4, 8, 14, 22, 32;
4) [...] $s_{ij}$ [...] $\delta_t$ [...] [0, 255] [...] SIFT (scale-invariant feature transform) [...] $\delta_t$ [...] $\delta_1 < \delta_2 < \delta_3 < \delta_4 < \delta_5$ [...] $\delta_t = t$ [...];
5) [...] K-means [...] Csurka [8] [...] Bag-of-words [...];
6) [...] Bag-of-phrases [...] LibSVM [...] SVM [...]

$$k(x_1, x_2) = \exp\{-d_{\chi^2}(x_1, x_2)/\sigma\}$$

[...] with $\sigma$ set to 100. $d_{\chi^2}(x_1, x_2)$ is defined as

$$d_{\chi^2}(x_1, x_2) = \frac{1}{2} \sum_{q=1}^{Q} \frac{[x_1(q) - x_2(q)]^2}{x_1(q) + x_2(q)}$$

where $x_1(q)$ and $x_2(q)$ are the $q$-th components of the $Q$-dimensional vectors $x_1$ and $x_2$.

4.1 Caltech-101

Caltech-101 [...] [23]. [...] 30 [...] ($101 \times 30 = 3\,030$) [...] Bag-of-phrases [...] (support vector machine, SVM) [...] 101 [...]: $\zeta_1, \zeta_2, \ldots, \zeta_{101}$. [...] $I_\tau$ [...] 101 [...] $p_\tau^1, p_\tau^2, \ldots, p_\tau^{101}$. [...] $I_\tau$ [...] $l_\tau$ [...]:

$$l_\tau = \arg\max_{t = 1, \ldots, 101} p_\tau^t$$

[...] $l_\tau$ [...] $I_\tau$ [...]; [...]. Table 1 [...] Caltech-101 [...] Bag-of-words [8] [...] Bag-of-words [18]. [...] Table 1 [...] Bag-of-phrases [...] Bag-of-words [...] Caltech-101 [...] Bag-of-words [...] PASCAL VOC 2006 [...]

Table 1  Comparison of precisions on Caltech-101 dataset (%)
Method                        Precision (%)
[...] Bag-of-words, K = [...]       [...] ± [...]
[...] Bag-of-words, K = [...]       [...] ± [...]
Bag-of-phrases, K = [...]           [...] ± [...]
[...] Bag-of-words, K = [...]       [...] ± [...]
[...] Bag-of-words, K = [...]       [...] ± [...]
Bag-of-phrases, K = [...]           [...] ± [...]

4.2 PASCAL VOC 2006

[...] PASCAL VOC 2006 (Visual object challenge 2006) [...] PASCAL VOC 2010 [...]: 1) [...] PASCAL VOC [...]; 2) [...] PASCAL VOC 2006 [...] Bag-of-words [...] PASCAL VOC 2010 [...] Bag-of-words [...]; 3) [...] PASCAL VOC [...] PASCAL VOC 2006 [...] PASCAL VOC 2010 [...]

[...] PASCAL VOC 2006 [...] 10 [...]: (Bicycle), (Bus), (Car), (Cat), (Cow), (Dog), (Horse), (Motorbike), (Person), (Sheep). PASCAL VOC 2006 [...] 2 618 [...] 2 686 [...] INRIA Larlus [24] [...] Bag-of-words [...] INRIA Larlus [...] K = [...] Bag-of-words [...] PASCAL VOC 2006 [...] Bag-of-words [...] PASCAL VOC 2006 [...] ROC (receiver operating characteristic) [...] ROC [...] (Fig. 6) [...] Bag-of-words [...] 5 [...] spatial bag-of-words.

Fig. 6  The relationship between words in document and local features in image

[...] Fig. 6 [...] Bag-of-phrases [...] 1 000 [...] INRIA Larlus [24] [...] Bag-of-words [...] K = [...] K = 1 000 [...]

4.3 [...]

[...] Bag-of-phrases [...] Bag-of-words [...] $p_{ij}$ [...] Bag-of-phrases [...]: [...] Section 4 [...] Bag-of-words [...]; [...] Bag-of-phrases [...]; [...] (visual sentences), Bag-of-sentences [...]

[...] [25-26] [...] visual phrase [...] [25] [...] (visual words) [...] visual phrase [...] [25] [...] [26] [...] visual word [...] visual phrase [...]
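Section 3.2 models the spatial layout around each interest point with concentric circles divided into radius and orientation bins, in the spirit of shape context [20]. The following Python sketch (using NumPy) is one possible reading of that construction, not the authors' implementation. The function name `spatial_layout`, the decision to simply count neighbouring points per bin, and the interpretation of the ring weight as $1/\delta_t$ with $\delta_t = t$ are assumptions made for illustration; the radii 4, 8, 14, 22, 32 and the 12 orientation bins are the values listed in Section 4.

```python
import numpy as np

# Radii of the concentric circles and number of orientation bins from Section 4.
RADII = np.array([4.0, 8.0, 14.0, 22.0, 32.0])
N_ORIENT = 12


def spatial_layout(points, center_idx, radii=RADII, n_orient=N_ORIENT):
    """Hypothetical s_ij: weighted neighbour counts per (ring, sector) bin.

    points     : (N, 2) array of interest-point coordinates (x, y).
    center_idx : index j of the point p_ij whose neighbourhood is described.
    Returns a flat vector of length len(radii) * n_orient.
    """
    points = np.asarray(points, dtype=float)
    # Offsets of all other points relative to the centre point.
    offsets = np.delete(points, center_idx, axis=0) - points[center_idx]
    dist = np.hypot(offsets[:, 0], offsets[:, 1])
    angle = np.arctan2(offsets[:, 1], offsets[:, 0]) % (2 * np.pi)

    # Keep only neighbours inside the outermost circle.
    inside = dist <= radii[-1]
    # Ring index (0-based): smallest ring whose radius covers the neighbour.
    ring = np.searchsorted(radii, dist[inside], side="left")
    sector = (angle[inside] / (2 * np.pi) * n_orient).astype(int)
    sector = np.minimum(sector, n_orient - 1)  # guard against rounding at 2*pi

    # Assumption: delta_t = t, so the t-th ring contributes with weight 1/t.
    weights = 1.0 / (ring + 1)

    hist = np.zeros((len(radii), n_orient))
    np.add.at(hist, (ring, sector), weights)
    return hist.ravel()
```

In a fuller pipeline, each bin would more plausibly accumulate information about the neighbouring visual words rather than a bare count, and the resulting vector would be combined with the SIFT descriptor $f_{ij}$ to form $v_{ij}$; how exactly the paper does this cannot be recovered from the surviving text.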
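The classifier in Section 4 uses the kernel $k(x_1, x_2) = \exp\{-d_{\chi^2}(x_1, x_2)/\sigma\}$ with $\sigma = 100$, and Section 4.1 assigns a test image to the class whose SVM output is largest. A minimal Python sketch of both formulas follows. The function names are illustrative, and skipping bins where $x_1(q) + x_2(q) = 0$ is an assumption added here to avoid division by zero; the source does not say how such bins are handled.

```python
import numpy as np


def chi2_distance(x1, x2):
    """d_chi2(x1, x2) = 0.5 * sum_q (x1(q) - x2(q))^2 / (x1(q) + x2(q))."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    num = (x1 - x2) ** 2
    den = x1 + x2
    # Assumption: bins that are empty in both histograms contribute nothing.
    safe = den > 0
    return 0.5 * np.sum(num[safe] / den[safe])


def chi2_kernel(x1, x2, sigma=100.0):
    """k(x1, x2) = exp(-d_chi2(x1, x2) / sigma), with sigma = 100 as in Section 4."""
    return np.exp(-chi2_distance(x1, x2) / sigma)


def predict_label(scores):
    """One-vs-rest decision: l = argmax_t p^t, returned as a 1-based class index."""
    return int(np.argmax(scores)) + 1
```

Evaluating `chi2_kernel` over all pairs of training histograms gives a Gram matrix, which could then be supplied to an SVM implementation that accepts precomputed kernels.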

5 [...]

[...] Bag-of-words [...] Bag-of-words [...] Bag-of-phrases [...] Bag-of-phrases [...]

References

1 Ojala T, Pietikainen M, Maenpaa T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7)
2 Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, USA: IEEE
3 Swain M J, Ballard D H. Color indexing. International Journal of Computer Vision, 1991, 7(1)
4 Neisser U. Visual search. Scientific American, 1964, 210(6)
5 Tuytelaars T, Mikolajczyk K. Local invariant feature detectors: a survey. Foundations and Trends in Computer Graphics and Vision, 2008, 3(3)
6 Mikolajczyk K, Schmid C. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(10)
7 Li J, Allinson N. A comprehensive review of current local features for computer vision. Neurocomputing, 2008, 71(10-12)
8 Csurka G, Dance C R, Fan L, Willamowski J, Bray C. Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision. Prague, Czech Republic: ECCV
9 Yang J C, Yu K, Gong Y H, Huang T. Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Miami, USA: IEEE
10 Fergus R, Perona P, Zisserman A. Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Madison, USA: IEEE
11 Felzenszwalb P F, Huttenlocher D P. Pictorial structures for object recognition. International Journal of Computer Vision, 2005, 61(1)
12 Shotton J, Blake A, Cipolla R. Multiscale categorical object recognition using contour fragments. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(7)
13 Serre T, Wolf L, Bileschi S, Riesenhuber M, Poggio T. Robust object recognition with cortex-like mechanisms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(3)
14 Torralba A, Murphy K P, Freeman W T. Contextual models for object detection using boosted random fields. In: Proceedings of the Neural Information Processing Systems. Vancouver, Canada: NIPS
15 Zhu L, Rao A B, Zhang A D. Theory of keyblock-based image retrieval. ACM Transactions on Information Systems, 2002, 20(2)
16 Comaniciu D, Meer P. Mean shift: a robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(5)
17 Gemert J C, Geusebroek J M, Veenman C J, Smeulders A W M. Kernel codebooks for scene categorization. In: Proceedings of the European Conference on Computer Vision. Berlin, Germany: Springer
18 Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE Computer Vision and Pattern Recognition. New York, USA: IEEE
19 Grauman K, Darrell T. The pyramid match kernel: discriminative classification with sets of image features. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. Beijing, China: IEEE
20 Belongie S, Malik J, Puzicha J. Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(4)
21 Loupias E, Sebe N, Bres S, Jolion J M. Wavelet-based salient points for image retrieval. In: Proceedings of the International Conference on Image Processing. Vancouver, Canada: IEEE
22 Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2)
23 Li F F, Fergus R, Perona P. Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: Proceedings of the Conference on Computer Vision and Pattern Recognition Workshop. Washington D. C., USA: IEEE, 2004
24 Everingham M, Zisserman A, Williams C, Gool L V. The PASCAL visual object classes challenge 2006 (VOC2006) results [Online], available: [...], July 20, [...]
25 Yuan J, Wu Y, Yang M. Discovery of collocation patterns: from visual words to visual phrases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Minneapolis, USA: IEEE
26 Sadeghi M A, Farhadi A. Recognition using visual phrases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Providence, USA: IEEE

ZHANG Lin-Bo  Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, machine learning, and content-based image classification. Corresponding author of this paper. E-mail: linbo.zhang@ia.ac.cn

WANG Chun-Heng  Professor at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, intelligent systems, image processing, character recognition, and artificial intelligence. E-mail: chunheng.wang@ia.ac.cn

XIAO Bai-Hua  Professor at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, intelligent systems, and multimedia information processing and retrieval. E-mail: baihua.xiao@ia.ac.cn

SHAO Yun-Xue  Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, machine learning, and handwritten Chinese character recognition. E-mail: yunxue.shao@ia.ac.cn

Wavelet-based Salient Points with Scale Information for Classification

Wavelet-based Salient Points with Scale Information for Classification Wavelet-based Salient Points with Scale Information for Classification Alexandra Teynor and Hans Burkhardt Department of Computer Science, Albert-Ludwigs-Universität Freiburg, Germany {teynor, Hans.Burkhardt}@informatik.uni-freiburg.de

More information

Maximally Stable Local Description for Scale Selection

Maximally Stable Local Description for Scale Selection Maximally Stable Local Description for Scale Selection Gyuri Dorkó and Cordelia Schmid INRIA Rhône-Alpes, 655 Avenue de l Europe, 38334 Montbonnot, France {gyuri.dorko,cordelia.schmid}@inrialpes.fr Abstract.

More information

Kai Yu NEC Laboratories America, Cupertino, California, USA

Kai Yu NEC Laboratories America, Cupertino, California, USA Kai Yu NEC Laboratories America, Cupertino, California, USA Joint work with Jinjun Wang, Fengjun Lv, Wei Xu, Yihong Gong Xi Zhou, Jianchao Yang, Thomas Huang, Tong Zhang Chen Wu NEC Laboratories America

More information

Lecture 13 Visual recognition

Lecture 13 Visual recognition Lecture 13 Visual recognition Announcements Silvio Savarese Lecture 13-20-Feb-14 Lecture 13 Visual recognition Object classification bag of words models Discriminative methods Generative methods Object

More information

Use Bin-Ratio Information for Category and Scene Classification

Use Bin-Ratio Information for Category and Scene Classification Use Bin-Ratio Information for Category and Scene Classification Nianhua Xie 1,2, Haibin Ling 2, Weiming Hu 1, Xiaoqin Zhang 1 1 National Laboratory of Pattern Recognition, Institute of Automation, CAS,

More information

Two-stage Pedestrian Detection Based on Multiple Features and Machine Learning

Two-stage Pedestrian Detection Based on Multiple Features and Machine Learning 38 3 Vol. 38, No. 3 2012 3 ACTA AUTOMATICA SINICA March, 2012 1 1 1, (Adaboost) (Support vector machine, SVM). (Four direction features, FDF) GAB (Gentle Adaboost) (Entropy-histograms of oriented gradients,

More information

Kernel Density Topic Models: Visual Topics Without Visual Words

Kernel Density Topic Models: Visual Topics Without Visual Words Kernel Density Topic Models: Visual Topics Without Visual Words Konstantinos Rematas K.U. Leuven ESAT-iMinds krematas@esat.kuleuven.be Mario Fritz Max Planck Institute for Informatics mfrtiz@mpi-inf.mpg.de

More information

The state of the art and beyond

The state of the art and beyond Feature Detectors and Descriptors The state of the art and beyond Local covariant detectors and descriptors have been successful in many applications Registration Stereo vision Motion estimation Matching

More information

Shape of Gaussians as Feature Descriptors

Shape of Gaussians as Feature Descriptors Shape of Gaussians as Feature Descriptors Liyu Gong, Tianjiang Wang and Fang Liu Intelligent and Distributed Computing Lab, School of Computer Science and Technology Huazhong University of Science and

More information

Distinguish between different types of scenes. Matching human perception Understanding the environment

Distinguish between different types of scenes. Matching human perception Understanding the environment Scene Recognition Adriana Kovashka UTCS, PhD student Problem Statement Distinguish between different types of scenes Applications Matching human perception Understanding the environment Indexing of images

More information

Pedestrian Density Estimation by a Weighted Bag of Visual Words Model

Pedestrian Density Estimation by a Weighted Bag of Visual Words Model Pedestrian Density Estimation by a Weighted Bag of Visual Words Model Shilin Zhang and Xunyuan Zhang image representation termed bag of visual words integrating weighting scheme and spatial pyramid co-occurrence,

More information

A Discriminatively Trained, Multiscale, Deformable Part Model

A Discriminatively Trained, Multiscale, Deformable Part Model A Discriminatively Trained, Multiscale, Deformable Part Model P. Felzenszwalb, D. McAllester, and D. Ramanan Edward Hsiao 16-721 Learning Based Methods in Vision February 16, 2009 Images taken from P.

More information

Fisher Vector image representation

Fisher Vector image representation Fisher Vector image representation Machine Learning and Category Representation 2014-2015 Jakob Verbeek, January 9, 2015 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.14.15 A brief recap on kernel

More information

SUBJECTIVE EVALUATION OF IMAGE UNDERSTANDING RESULTS

SUBJECTIVE EVALUATION OF IMAGE UNDERSTANDING RESULTS 18th European Signal Processing Conference (EUSIPCO-2010) Aalborg, Denmark, August 23-27, 2010 SUBJECTIVE EVALUATION OF IMAGE UNDERSTANDING RESULTS Baptiste Hemery 1, Hélène Laurent 2, and Christophe Rosenberger

More information

Global Scene Representations. Tilke Judd

Global Scene Representations. Tilke Judd Global Scene Representations Tilke Judd Papers Oliva and Torralba [2001] Fei Fei and Perona [2005] Labzebnik, Schmid and Ponce [2006] Commonalities Goal: Recognize natural scene categories Extract features

More information

Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context

Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Silvio Savarese Lecture 10-16-Feb-15 From the 3D to 2D & vice versa

More information

Compressed Fisher vectors for LSVR

Compressed Fisher vectors for LSVR XRCE@ILSVRC2011 Compressed Fisher vectors for LSVR Florent Perronnin and Jorge Sánchez* Xerox Research Centre Europe (XRCE) *Now with CIII, Cordoba University, Argentina Our system in a nutshell High-dimensional

More information

Beyond Spatial Pyramids

Beyond Spatial Pyramids Beyond Spatial Pyramids Receptive Field Learning for Pooled Image Features Yangqing Jia 1 Chang Huang 2 Trevor Darrell 1 1 UC Berkeley EECS 2 NEC Labs America Goal coding pooling Bear Analysis of the pooling

More information

Kernel Methods in Computer Vision

Kernel Methods in Computer Vision Kernel Methods in Computer Vision Christoph Lampert Max Planck Institute for Biological Cybernetics, Tübingen Matthew Blaschko MPI Tübingen and University of Oxford June 2, 29 Overview... 14: 15: Introduction

More information

Lie Algebrized Gaussians for Image Representation

Lie Algebrized Gaussians for Image Representation Lie Algebrized Gaussians for Image Representation Liyu Gong, Meng Chen and Chunlong Hu School of CS, Huazhong University of Science and Technology {gongliyu,chenmenghust,huchunlong.hust}@gmail.com Abstract

More information

Space-time Zernike Moments and Pyramid Kernel Descriptors for Action Classification

Space-time Zernike Moments and Pyramid Kernel Descriptors for Action Classification Space-time Zernike Moments and Pyramid Kernel Descriptors for Action Classification Luca Costantini 2, Lorenzo Seidenari 1, Giuseppe Serra 1, Licia Capodiferro 2, and Alberto Del Bimbo 1 1 Media Integration

More information

Improving Image Similarity With Vectors of Locally Aggregated Tensors (VLAT)

Improving Image Similarity With Vectors of Locally Aggregated Tensors (VLAT) Improving Image Similarity With Vectors of Locally Aggregated Tensors (VLAT) David Picard and Philippe-Henri Gosselin picard@ensea.fr, gosselin@ensea.fr ETIS / ENSEA - University of Cergy-Pontoise - CNRS

More information

DYNAMIC TEXTURE RECOGNITION USING ENHANCED LBP FEATURES

DYNAMIC TEXTURE RECOGNITION USING ENHANCED LBP FEATURES DYNAMIC TEXTURE RECOGNITION USING ENHANCED FEATURES Jianfeng Ren BeingThere Centre Institute of Media Innovation Nanyang Technological University 50 Nanyang Drive, Singapore 637553. Xudong Jiang, Junsong

More information

Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition

Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition Kihyuk Sohn Dae Yon Jung Honglak Lee Alfred O. Hero III Dept. of Electrical Engineering and Computer

More information

Introduction to Discriminative Machine Learning

Introduction to Discriminative Machine Learning Introduction to Discriminative Machine Learning Yang Wang Vision & Media Lab Simon Fraser University CRV Tutorial, Kelowna May 24, 2009 Hand-written Digit Recognition [Belongie et al. PAMI 2002] 2 Hand-written

More information

Overview. Harris interest points. Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points

Overview. Harris interest points. Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Overview Harris interest points Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Evaluation and comparison of different detectors Region descriptors and their performance

More information

Large-scale classification of traffic signs under real-world conditions

Large-scale classification of traffic signs under real-world conditions Large-scale classification of traffic signs under real-world conditions Lykele Hazelhoff a,b, Ivo Creusen a,b, Dennis van de Wouw a,b and Peter H.N. de With a,b a CycloMedia Technology B.V., Achterweg

More information

Multiple Similarities Based Kernel Subspace Learning for Image Classification

Multiple Similarities Based Kernel Subspace Learning for Image Classification Multiple Similarities Based Kernel Subspace Learning for Image Classification Wang Yan, Qingshan Liu, Hanqing Lu, and Songde Ma National Laboratory of Pattern Recognition, Institute of Automation, Chinese

More information

Detectors part II Descriptors

Detectors part II Descriptors EECS 442 Computer vision Detectors part II Descriptors Blob detectors Invariance Descriptors Some slides of this lectures are courtesy of prof F. Li, prof S. Lazebnik, and various other lecturers Goal:

More information

EE 6882 Visual Search Engine

EE 6882 Visual Search Engine EE 6882 Visual Search Engine Prof. Shih Fu Chang, Feb. 13 th 2012 Lecture #4 Local Feature Matching Bag of Word image representation: coding and pooling (Many slides from A. Efors, W. Freeman, C. Kambhamettu,

More information

Urban land use information retrieval based on scene classification of Google Street View images

Urban land use information retrieval based on scene classification of Google Street View images Urban land use information retrieval based on scene classification of Google Street View images Xiaojiang Li 1, Chuanrong Zhang 1 1 Department of Geography, University of Connecticut, Storrs Email: {xiaojiang.li;chuanrong.zhang}@uconn.edu

More information

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and

More information

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and

More information

Invariant local features. Invariant Local Features. Classes of transformations. (Good) invariant local features. Case study: panorama stitching

Invariant local features. Invariant Local Features. Classes of transformations. (Good) invariant local features. Case study: panorama stitching Invariant local eatures Invariant Local Features Tuesday, February 6 Subset o local eature types designed to be invariant to Scale Translation Rotation Aine transormations Illumination 1) Detect distinctive

More information

CS4495/6495 Introduction to Computer Vision. 8C-L3 Support Vector Machines

CS4495/6495 Introduction to Computer Vision. 8C-L3 Support Vector Machines CS4495/6495 Introduction to Computer Vision 8C-L3 Support Vector Machines Discriminative classifiers Discriminative classifiers find a division (surface) in feature space that separates the classes Several

More information

Discriminative part-based models. Many slides based on P. Felzenszwalb

Discriminative part-based models. Many slides based on P. Felzenszwalb More sliding window detection: ti Discriminative part-based models Many slides based on P. Felzenszwalb Challenge: Generic object detection Pedestrian detection Features: Histograms of oriented gradients

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Dan Oneaţă 1 Introduction Probabilistic Latent Semantic Analysis (plsa) is a technique from the category of topic models. Its main goal is to model cooccurrence information

More information

Orientation Map Based Palmprint Recognition

Orientation Map Based Palmprint Recognition Orientation Map Based Palmprint Recognition (BM) 45 Orientation Map Based Palmprint Recognition B. H. Shekar, N. Harivinod bhshekar@gmail.com, harivinodn@gmail.com India, Mangalore University, Department

More information

Representing Sets of Instances for Visual Recognition

Representing Sets of Instances for Visual Recognition Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) Representing Sets of Instances for Visual Recognition Jianxin Wu, 1 Bin-Bin Gao, 1 Guoqing Liu 2 1 National Key Laboratory

More information

Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions

Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao Li Fei-Fei Computer Science Department, Stanford University, USA {bangpeng,feifeili}@cs.stanford.edu

More information

SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES

SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES Tetsu Matsukawa and Takio Kurita Department of Computer Science, University of Tsukuba, Tennodai 1-1-1 Tsukuba, Japan

More information

LoG Blob Finding and Scale. Scale Selection. Blobs (and scale selection) Achieving scale covariance. Blob detection in 2D. Blob detection in 2D

LoG Blob Finding and Scale. Scale Selection. Blobs (and scale selection) Achieving scale covariance. Blob detection in 2D. Blob detection in 2D Achieving scale covariance Blobs (and scale selection) Goal: independently detect corresponding regions in scaled versions of the same image Need scale selection mechanism for finding characteristic region

More information

Achieving scale covariance

Achieving scale covariance Achieving scale covariance Goal: independently detect corresponding regions in scaled versions of the same image Need scale selection mechanism for finding characteristic region size that is covariant

More information

A biologically plausible network for the computation of orientation dominance

A biologically plausible network for the computation of orientation dominance A biologically plausible network for the computation of orientation dominance Kritika Muralidharan Statistical Visual Computing Laboratory University of California San Diego La Jolla, CA 9239 krmurali@ucsd.edu

More information

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015 CS 3710: Visual Recognition Describing Images with Features Adriana Kovashka Department of Computer Science January 8, 2015 Plan for Today Presentation assignments + schedule changes Image filtering Feature

More information

Analysis on a local approach to 3D object recognition

Analysis on a local approach to 3D object recognition Analysis on a local approach to 3D object recognition Elisabetta Delponte, Elise Arnaud, Francesca Odone, and Alessandro Verri DISI - Università degli Studi di Genova - Italy Abstract. We present a method

More information

LOCALITY PRESERVING HASHING. Electrical Engineering and Computer Science University of California, Merced Merced, CA 95344, USA

LOCALITY PRESERVING HASHING. Electrical Engineering and Computer Science University of California, Merced Merced, CA 95344, USA LOCALITY PRESERVING HASHING Yi-Hsuan Tsai Ming-Hsuan Yang Electrical Engineering and Computer Science University of California, Merced Merced, CA 95344, USA ABSTRACT The spectral hashing algorithm relaxes

More information

Learning Sparse Covariance Patterns for Natural Scenes

Learning Sparse Covariance Patterns for Natural Scenes Learning Sparse Covariance Patterns for Natural Scenes Liwei Wang Yin Li Jiaya Jia Jian Sun David Wipf James M. Rehg The Chinese University of Hong Kong Georgia Institute of Technology Microsoft Research

More information

Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II

Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II Noah Snavely Cornell James Hays CMU MIT (Fall 2009) Brown (Spring 2010 ) Complexity of the Visual World

More information

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning Sangdoo Yun 1 Jongwon Choi 1 Youngjoon Yoo 2 Kimin Yun 3 and Jin Young Choi 1 1 ASRI, Dept. of Electrical and Computer Eng.,

More information

SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES

SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES Tetsu Matsukawa and Takio Kurita :Department of Computer Science, University of Tsukuba, Tennodai 1-1-1 Tsukuba, Japan

More information

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18,   ISSN International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, www.ijcea.com ISSN 2321-3469 CONTENT-BASED IMAGE RETRIEVAL USING ZERNIKE MOMENTS AND SURF Priyanka

More information

Higher-order Statistical Modeling based Deep CNNs (Part-I)

Higher-order Statistical Modeling based Deep CNNs (Part-I) Higher-order Statistical Modeling based Deep CNNs (Part-I) Classical Methods & From Shallow to Deep Qilong Wang 2018-11-23 Context 1 2 3 4 Higher-order Statistics in Bag-of-Visual-Words (BoVW) Higher-order

More information

Hilbert-Huang Transform-based Local Regions Descriptors

Hilbert-Huang Transform-based Local Regions Descriptors Hilbert-Huang Transform-based Local Regions Descriptors Dongfeng Han, Wenhui Li, Wu Guo Computer Science and Technology, Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of

More information

DEIM Forum 04 E-3 43 80 3 5 / DC 43 80 3 5 90065 6-6 43 80 3 5 E-mail: gs3007@s.inf.shizuoka.ac.jp, dgs538@s.inf.shizuoka.ac.jp, ishikawa-hiroshi@sd.tmu.ac.jp, yokoyama@inf.shizuoka.ac.jp Flickr Exif OpenLayers

Feature detectors and descriptors. Fei-Fei Li

Feature Detection e.g. DoG detected points (~300) coordinates, neighbourhoods Feature Description e.g. SIFT local descriptors (invariant) vectors database of

Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification

Mandar Dixit, Nikhil Rasiwasia, Nuno Vasconcelos Department of Electrical and Computer Engineering University of California,

Dirichlet-based Histogram Feature Transform for Image Classification

Takumi Kobayashi National Institute of Advanced Industrial Science and Technology Umezono --, Tsukuba, Japan takumi.kobayashi@aist.go.jp

Workshop on Web-scale Vision and Social Media, ECCV 2012, Firenze, Italy, October 2012. Linearized Smooth Additive Classifiers

Subhransu Maji Research Assistant Professor Toyota Technological Institute at

Object Detection Grammars

Pedro F. Felzenszwalb and David McAllester February 11, 2010 1 Introduction We formulate a general grammar model motivated by the problem of object detection in computer vision.

CS5670: Computer Vision

Noah Snavely Lecture 5: Feature descriptors and matching Szeliski: 4.1 Reading Announcements Project 1 Artifacts due tomorrow, Friday 2/17, at 11:59pm Project 2 will be released

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros

Motivation: Build a Panorama M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 How do we build panorama?

Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales

Tsun-Yi Yang,2 Yen-Yu Lin Yung-Yu Chuang 2 Academia Sinica, Taiwan 2 National Taiwan University, Taiwan {shamangary,yylin}@citi.sinica.edu.tw

Human Action Recognition under Log-Euclidean Riemannian Metric

Chunfeng Yuan, Weiming Hu, Xi Li, Stephen Maybank 2, Guan Luo, National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing,

MIL-UT at ILSVRC2014

IIT Guwahati (undergrad) -> Virginia Tech (intern) Senthil Purushwalkam, Yuichiro Tsuchiya, Atsushi Kanehira, Asako Kanezaki and *Tatsuya Harada The University of Tokyo Pipeline of

TRAFFIC SCENE RECOGNITION BASED ON DEEP CNN AND VLAD SPATIAL PYRAMIDS

FANG-YU WU 1, SHI-YANG YAN 1, JEREMY S. SMITH 2, BAI-LING ZHANG 1 1 Department of Computer Science and Software Engineering, Xi'an

Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification

3 IEEE International Conference on Computer Vision Mandar Dixit, Nikhil Rasiwasia, Nuno Vasconcelos Department of Electrical

Semi-supervised Dictionary Learning Based on Hilbert-Schmidt Independence Criterion

Mehrdad J. Gangeh 1, Safaa M.A. Bedawi 2, Ali Ghodsi 3, and Fakhri Karray 2 1 Departments of Medical Biophysics, and Radiation

Multi-Layer Boosting for Pattern Recognition

François Fleuret IDIAP Research Institute, Centre du Parc, P.O. Box 592 1920 Martigny, Switzerland fleuret@idiap.ch Abstract We extend the standard boosting

Fine-grained Classification

Marcel Simon Department of Mathematics and Computer Science, Germany marcel.simon@uni-jena.de http://www.inf-cv.uni-jena.de/ Seminar Talk 23.06.2015 Outline 1 Motivation 2 3

Improved Local Coordinate Coding using Local Tangents

Kai Yu NEC Laboratories America, 10081 N. Wolfe Road, Cupertino, CA 95129 Tong Zhang Rutgers University, 110 Frelinghuysen Road, Piscataway, NJ 08854

Graphical Object Models for Detection and Tracking

(ls@cs.brown.edu) Department of Computer Science Brown University Joint work with: - Ying Zhu, Siemens Corporate Research, Princeton, NJ - Dorin Comaniciu,

Towards Good Practices for Action Video Encoding

Jianxin Wu National Key Laboratory for Novel Software Technology Nanjing University, China wujx21@nju.edu.cn Yu Zhang Nanyang Technological University Singapore

Asaf Bar Zvi Adi Hayat. Semantic Segmentation

Today's Topics: Fully Convolutional Networks (FCN) (CVPR 2015) Conditional Random Fields as Recurrent Neural Networks (ICCV 2015) Gaussian Conditional random

Comparative study of global invariant descriptors for object recognition

Author manuscript, published in "Journal of Electronic Imaging (2008) 1-35". A. Choksuriwong, B. Emile, C. Rosenberger, H. Laurent

Loss Functions and Optimization. Lecture 3-1

Lecture 3: Loss Functions and Optimization. Administrative: Assignment 1 is released: http://cs231n.github.io/assignments2017/assignment1/ Due Thursday April 20, 11:59pm on Canvas (Extending

INTEREST POINTS AT DIFFERENT SCALES

Thank you for the slides. They come mostly from the following sources. Dan Huttenlocher Cornell U David Lowe U. of British Columbia Martial Hebert CMU Intuitively, junctions

Shared Segmentation of Natural Scenes using Dependent Pitman-Yor Processes

Erik Sudderth & Michael Jordan University of California, Berkeley Parsing Visual Scenes sky skyscraper sky dome buildings trees

Gaussian Process Style Transfer Mapping for Historical Chinese Character Recognition

Best Student Paper Award. Jixiong Feng a, Liangrui Peng a and Franck Lebourgeois b a Tsinghua National Laboratory for

Edges and Scale. Image Features. Detecting edges. Origin of Edges. Solution: smooth first. Effects of noise

Edges and Scale Image Features From Sandlot Science Slides revised from S. Seitz, R. Szeliski, S. Lazebnik, etc. Origin of Edges surface normal discontinuity depth discontinuity surface color discontinuity

Object Recognition Using Local Characterisation and Zernike Moments

A. Choksuriwong, H. Laurent, C. Rosenberger, and C. Maaoui Laboratoire Vision et Robotique - UPRES EA 2078, ENSI de Bourges - Université

Region Moments: Fast invariant descriptors for detecting small image structures

Gianfranco Doretto Yi Yao Visualization and Computer Vision Lab, GE Global Research, Niskayuna, NY 39 doretto@research.ge.com

Lecture 8: Interest Point Detection. Saad J Bedros

Saad J Bedros sbedros@umn.edu Review of Edge Detectors. Today's Lecture: Interest Points Detection. What do we mean with Interest Point Detection in an Image Goal:

Face Recognition Using Global Gabor Filter in Small Sample Case *

ISSN 1673-9418 CODEN JKYTA8 E-mail: fcst@public2.bta.net.cn Journal of Frontiers of Computer Science and Technology http://www.ceaj.org 1673-9418/2010/04(05)-0420-06 Tel: +86-10-51616056 DOI: 10.3778/j.issn.1673-9418.2010.05.004

Advances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al.

6.869 Advances in Computer Vision Prof. Bill Freeman March 3, 2005 Image and shape descriptors Affine invariant features Comparison of feature descriptors Shape context Readings: Mikolajczyk and Schmid;

Robust Detection, Classification and Positioning of Traffic Signs from Street-Level Panoramic Images for Inventory Purposes

Lykele Hazelhoff and Ivo Creusen CycloMedia Technology B.V. Achterweg 38, 4181

A METHOD OF FINDING IMAGE SIMILAR PATCHES BASED ON GRADIENT-COVARIANCE SIMILARITY

IJAMML 3:1 (015) 69-78 September 015 ISSN: 394-58 Available at http://scientificadvances.co.in DOI: http://dx.doi.org/10.1864/ijamml_710011547

39 Mutual Component Analysis for Heterogeneous Face Recognition

ZHIFENG LI, Chinese Academy of Sciences DIHONG GONG, Chinese Academy of Sciences QIANG LI, University of Technology Sydney DACHENG TAO, University

Self-Adaptable Templates for Feature Coding

Xavier Boix 1,2 Gemma Roig 1,2 Salomon Diether 1 Luc Van Gool 1 1 Computer Vision Laboratory, ETH Zurich, Switzerland 2 LCSL, Massachusetts Institute of Technology

Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time

Experiment presentation for CS3710: Visual Recognition Presenter: Zitao Liu University of Pittsburgh ztliu@cs.pitt.edu

TUTORIAL PART 1 Unsupervised Learning

Marc'Aurelio Ranzato Department of Computer Science Univ. of Toronto ranzato@cs.toronto.edu Co-organizers: Honglak Lee, Yoshua Bengio, Geoff Hinton, Yann LeCun, Andrew

Enhanced Local Binary Covariance Matrices (ELBCM) for texture analysis and object tracking

Andrés Romero Laboratoire de Recherche en Informatique Bât 650, Université Paris Sud 91405 Orsay Cedex France andres.romero@upsud.fr

CSE 473/573 Computer Vision and Image Processing (CVIP)

Ifeoma Nwogu inwogu@buffalo.edu Lecture 11 Local Features 1 Schedule Last class We started local features Today More on local features Readings for

Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net

Supplementary Material Xingang Pan 1, Ping Luo 1, Jianping Shi 2, and Xiaoou Tang 1 1 CUHK-SenseTime Joint Lab, The Chinese University

Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features

G. Y. Chen and B. Kégl Department of Computer Science and Operations Research, University of Montreal, CP 6128 succ.

Scale-space image processing

Corresponding image features can appear at different scales. Like shift-invariance, scale-invariance of image processing algorithms is often desirable. Scale-space representation

Detecting Humans via Their Pose

Alessandro Bissacco Computer Science Department University of California, Los Angeles Los Angeles, CA 90095 bissacco@cs.ucla.edu Ming-Hsuan Yang Honda Research Institute

Invariant Scattering Convolution Networks

Joan Bruna and Stephane Mallat Submitted to PAMI, Feb. 2012 Presented by Bo Chen Other important related papers: [1] S. Mallat, A Theory for Multiresolution Signal

HeadNet: Pedestrian Head Detection Utilizing Body in Context

Gang Chen 1,2, Xufen Cai 1, Hu Han,1, Shiguang Shan 1,2,3 and Xilin Chen 1,2 1 Key Laboratory of Intelligent Information Processing of Chinese