38 1 Vol. 38, No ACTA AUTOMATICA SINICA January, Bag-of-phrases.. Image Representation Using Bag-of-phrases
|
|
- Primrose Martin
- 5 years ago
- Views:
Transcription
1 38 1 Vol. 38, No ACTA AUTOMATICA SINICA January, 2012 Bag-of-phrases 1, , Bag-of-words,,, Bag-of-words, Bag-of-phrases, Bag-of-words DOI,, Bag-of-words, Bag-of-phrases, SIFT /SP.J Image Representation Using Bag-of-phrases ZHANG Lin-Bo 1, 2 WANG Chun-Heng 1 XIAO Bai-Hua 1 SHAO Yun-Xue 1 Abstract Bag-of-words representation, with which an image is represented as a histogram of the numbers of occurrences of particular visual words, has demonstrated impressive levels of performance in the past few years. However, the relative position information between the visual words are almost entirely ignored. In this paper, the potential strength of this relative position information is investigated and a new kind of representation named Bag-of-phrases is proposed. The effectiveness of this strategy is validated on two benchmark databases. The classification results demonstrate that our Bag-of-phrases strategy can achieve better results compared to Bag-of-words method. Key words descriptor Image representation, spatial layout, Bag-of-words, Bag-of-phrases, scale-invariant feature transform (SIFT),,,, :,, (, ),, Manuscript received July 28, 2011; accepted October 9, 2011 ( , ) Supported by National Natural Science Foundation of China ( , ) Recommended by Associate Editor LIU Yi-Jun 1. ( ) Key Laboratory of Complex Systems and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing China Academy of Transportation Sciences, Beijing , :, : LBP (Local binary pattern) [1] HOG (Histogram of oriented gradients) [2] [3],,,,,,, Neisser [4] : (Preattentive stage) (Attentive stage)., (pop-out)
2 1 : Bag-of-phrases 47 ;,,, : [5]. [6 7]., ( ),, : [8 9] (Part-structure) [10 11] [12] [13] [14], Bag-of-words [8] Bagof-words, Bag-of-words Zhu [15] Csurka [8] (Visual words) ; Zhu [15] (Keyblock) Bag-of-words :, ;,, (K-, Mean-shift [16] ), ; ( (Soft assignment) [17] ) ;,, Bag-of-words,, 1 Fig. 1 1 Bag-of-words Illustration of Bag-of-words model Bag-of-words, ; Bag-of-words,, Lazebnik [18] (Spatial pyramid) Bag-of-words : 1 Bag-of-words ; 2 Bag-of-words Bag-of-words, Bag-of-words ; 3 Bagof-phrases ; 4 ; 1 Bag-of-words,, Grauman (Spatial pyramid matching, SPM) [19]. (Histogram pyramids),, Lazebnik [18] Bag-of-words Bag-of-words, Bag-of-words,, 2 Bag-of-words,,,, Bag-of-words,, 4 4, 2, L = 1, 2 2
3 48 38 Fig. 2 2 Bag-of-words Illustration of spatial pyramid Bag-of-words models,,,,,, 3 (a) 3 (b), 3 (a) 3 (b), 3 (c) Bag-of-words, 3 (c) 3 (b) 2 2 Bag-of-words Bag-of-words Bag-of-words,,,, Bag-of-words [13], (High-level), Bag-of-words ; Bag-of-words, Bag-of-words Bag-of-words? [4], Bag-of-words? [13],,,,, Bag-ofwords, Bag-of-words Bag-of-words Bag-of-words, Bag-of-words, 2 3 Fig. 3 Bag-of-words Spatial pyramid Bag-of-words vectors of different image contents Bag-of-words, Bag-of-words : 1) ; 2), Bag-of-words, (Document) (Word),,,,,,,,.,,, 4, ( );, Bag-
4 1 : Bag-of-phrases 49 of-words ( ), Bag-of-words Fig. 4 4 The relationship between local features in image and words in document, ( ) (, ).,,, (, ), (, ),,, ( ), 3 Bag-of-phrases (Visual words), (Visual phrase)., Bag-ofwords Bag-of-phrases 3.1 Bag-of-phrases : 1), I i N i, P i = {p i1, p i2,, p ini }; 2) p ij, p ij, f ij. : (Visual word) f ij p ij 3) ( f ij ), s ij ; 4) p ij f ij s ij,, v ij ; 5), Bag-of-words (Code phrase), ; 6) Bag-of-phrases Visual phrase Code phrases, Code phrase, Bag-ofphrases, Bag-of-phrases,, Bag-of-words 3.2, Shape-context [20] I i ( 5 (a) ), 1) I i p ij, R, O ( 5 (b) )., O,, ;,, ;, 2),
5 50 38, s ij. 3), s ij : p ij, 4),, s ij : t th 1/δ t., s ij p ij (a) (a) Local features on original image (b) (4, 12 ) (b) Spatial layout of neighbor visual words around the center of concentric circles in 4 radius bins and Fig orientation bins Modeling spatial layout of visual words,,,,,,, p ij, f ij s ij v ij.,,,, Bagof-words Bag-of-words (Code word), Bag-of-words (, K- Mean-shift ) Bag-of-phrases, Bag-of-phrases, Bag-of-words, :, [17] 4 Bag-of-phrases,, Caltech-101 PASCAL Visual object challenge (VOC) 2006 : 1), [21]., N=1 000; 2) p ij f ij SIFT [22] ; 3) 3.2 p ij, M O 5 12, 5 : 4, 8, 14, 22, 32; 4) s ij δ t, [0, 255], SIFT (Scaleinvariant feature transform) δ t δ 1 < δ 2 < δ 3 < δ 4 < δ 5. δ t = t,, ; 5), K-, Csurka [8]
6 1 : Bag-of-phrases 51 Bag-of-words ; 6) Bag-of-phrases, Lib- SVM SVM k(x 1, x 2 ) = exp{ d χ 2(x 1, x 2 )/σ},, σ 100. d χ 2(x 1, x 2 ) : d χ 2(x 1, x 2 ) = 1 2 Q q=1 [x 1 (q) x 2 (q)] 2 x 1 (q) + x 2 (q), x 1 (q), x 2 (q) Q- x 1 x 2 q 4.1 Caltech-101 Caltech , [23]., 30,, = 3 030,, Bag-of-phrases 1 (Support vector machine, SVM), 101, : ζ 1, ζ 2,, ζ 101. Iτ, 101, p 1 τ, p 2 τ,, p 101 τ., I τ l τ, : l τ = arg t 101 max t=1 pt τ l τ I τ l τ, I τ ;, 1 Caltech-101, Bag-of-words [8] Bag-of-words [18]. 1, Bag-of-phrases Bag-of-words, Bag-of-words Caltech-101, Bag-of-words PASCAL VOC 2006, 1 Caltech-101 (%) Table 1 Comparison of precisions on Caltech-101 dataset (%) (%) Bag-of-words, K = ± Bag-of-words, K = ± Bag-of-phrases, K = ± Bag-of-words, K = ± Bag-of-words, K = ± Bag-of-phrases, K = ± PASCAL VOC 2006 PASCAL VOC (Visual object challenge 2006) PASCAL VOC 2010 : 1) PASCAL VOC ) PAS- CAL VOC 2006, Bag-of-words Bagof-words, PASCAL VOC 2010,, Bag-of-words ;, 3) PASCAL VOC , PASCAL VOC , PASCAL VOC 2006, PASCAL VOC 2010,,,, PAS- CAL VOC 2006 PASCAL VOC , 10 : (Bicycle) (Bus) (Car) (Cat) (Cow) (Dog) (Horse) (Motorbike) (Person) (Sheep). PASCAL VOC 2006, 2 618, 2 686, INRIA Larlus [24] Bag-of-words, ;, INRIA Larlus
7 K = , Bagof-words PASCAL VOC 2006 Bag-of-words, PASCAL VOC 2006, ROC (Receiver operating characteristic), ROC, 6, Bagof-words 5 Spatial bag-of-words. 6 Fig. 6 The relationship between words in document and local features in image 6, Bag-of-phrases 1 000, INRIA Larlus [24],, Bag-of-words Bag-of-words, ;, Bag-of-words Bag-of-words,, K = K = 1 000, 4.3, Bag-ofphrases,,, Bag-ofphrases, Bag-of-words,, ( p ij ), p ij ( ), p ij, p ij,,,,,, Bag-of-phrases : ( ),, 4,,, :,, ;,,,, :,,, Bag-ofwords ;,, Bag-of-phrases ; (Visual sentences), Bag-ofsentences,,,,,, [25 26] Visual phrase, [25] (Visual words) Visual phrase, Visual phrase, [25], [26] Visual word, Visual phrase,
8 1 : Bag-of-phrases 53 5 Bag-of-words Bag-of-words,, ;, Bag-of-words Bag-of-phrases, Bag-of-phrases, : ( ), ;,, References 1 Ojala T, Pietikainen M, Maenpaa T. Multiresolution grayscale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7): Dalal N, Triggs B, Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, USA: IEEE, Swain M J, Ballard D H. Color indexing. International Journal of Computer Vision, 1991, 7(1): Neisser U. Visual search. Scientic American, 1964, 210(6): Tuytelaars T, Mikolajczyk K. Local invariant feature detectors: a survey. Foundations and Trends in Computer Graphics and Vision, 2008, 3(3): Mikolajczyk K, Schmid C. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(10): Li J, Allinson N. A comprehensive review of current local features for computer vision. Neurocomputing, 2008, 71(10 12): Csurka G, Dance C R, Fan L, Willamowski J, Bray C. Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision. Prague Czech Republic: ECCV, Yang J C, Yu K, Gong Y H, Huang T. Linear spatial pyramid matching using sparse coding for image classication. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Mimi, USA: IEEE, Fergus R, Perona D, Zisserman A. Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Madison, USA: IEEE, Felzenszwalb P F, Huttenlocher D P. Pictorial structures for object recognition. International Journal of Computer Vision, 2005, 61(1): Shotton J, Blake A, Cipolla R. Multiscale categorical object recognition using contour fragments. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 30(7): Serre T, Wolf L, Bileschi S, Riesenhuber M, Poggio T. Robust object recognition with cortex-like mechanisms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(3): Torralba A, Murphy K P, Freeman W T. Contextual models for object detection using boosted random fields. In: Proceedings of the Neural Information Processing Systems. Vancouver, Canada: NIPS, Zhu L, Rao A B, Zhang A D. Theory of keyblock-based image retrieval. ACM Transactions on Information Systems, 2002, 20(2): Comaniciu D, Meer P. Mean shift: a robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(5): Gemert J C, Geusebroek J M, Veenman C J, Smeulders A W M. Kernel codebooks for scene categorization. In: Proceedings of the European Conference on Computer Vision. Berlin, Germany: Springer, Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE Computer Vision and Pattern Recognition. New York, USA: IEEE, Grauman K, Darrell T. The pyramid match kernel: discriminative classication with sets of image features. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. Beijing, China: IEEE, Belongie S, Malik J, Puzicha J. Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(4): Loupias E, Sebe N, Bres S, Jolion J M. Wavelet-based salient points for image retrieval. In: Proceedings of the International Conference on Image Processing. Vancouver, Canada: IEEE, Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2):
9 Li F F, Fergus F, Perona P. Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. In: Proceedings of the Conference on Computer Vision and Pattern Recognition Workshop. Washington D. C., USA: IEEE, 2004, Everingham M, Zisserman A, Williams C, Gool L V. The PASCAL visual object classes challenge 2006 (VOC2006) results [online], available: July 20, Yuan J, Wu Y, Yang M. Discovery of collocation patterns: from visual words to visual phrases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Minneapolis, USA: IEEE, Sadeghi M A, Farhadi A. Recognition using visual phrases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Providence, USA: IEEE, , linbo.zhang@ia.ac.cn (ZHANG Lin-Bo Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, machine learning, and content based image classification. Corresponding author of this paper.),,, chunheng.wang@ia.ac.cn (WANG Chun-Heng Professor at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, intelligent systems, image processing, and character recognition, and artificial intelligence.), baihua.xiao@ia.ac.cn (XIAO Bai-Hua Professor at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, intelligent systems, and multimedia information processing and retrieval.), yunxue.shao@ia.ac.cn (SHAO Yun-Xue Ph. D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interest covers pattern recognition, machine learning, and handwritten Chinese character recognition.)
Wavelet-based Salient Points with Scale Information for Classification
Wavelet-based Salient Points with Scale Information for Classification Alexandra Teynor and Hans Burkhardt Department of Computer Science, Albert-Ludwigs-Universität Freiburg, Germany {teynor, Hans.Burkhardt}@informatik.uni-freiburg.de
More informationMaximally Stable Local Description for Scale Selection
Maximally Stable Local Description for Scale Selection Gyuri Dorkó and Cordelia Schmid INRIA Rhône-Alpes, 655 Avenue de l Europe, 38334 Montbonnot, France {gyuri.dorko,cordelia.schmid}@inrialpes.fr Abstract.
More informationKai Yu NEC Laboratories America, Cupertino, California, USA
Kai Yu NEC Laboratories America, Cupertino, California, USA Joint work with Jinjun Wang, Fengjun Lv, Wei Xu, Yihong Gong Xi Zhou, Jianchao Yang, Thomas Huang, Tong Zhang Chen Wu NEC Laboratories America
More informationLecture 13 Visual recognition
Lecture 13 Visual recognition Announcements Silvio Savarese Lecture 13-20-Feb-14 Lecture 13 Visual recognition Object classification bag of words models Discriminative methods Generative methods Object
More informationUse Bin-Ratio Information for Category and Scene Classification
Use Bin-Ratio Information for Category and Scene Classification Nianhua Xie 1,2, Haibin Ling 2, Weiming Hu 1, Xiaoqin Zhang 1 1 National Laboratory of Pattern Recognition, Institute of Automation, CAS,
More informationTwo-stage Pedestrian Detection Based on Multiple Features and Machine Learning
38 3 Vol. 38, No. 3 2012 3 ACTA AUTOMATICA SINICA March, 2012 1 1 1, (Adaboost) (Support vector machine, SVM). (Four direction features, FDF) GAB (Gentle Adaboost) (Entropy-histograms of oriented gradients,
More informationKernel Density Topic Models: Visual Topics Without Visual Words
Kernel Density Topic Models: Visual Topics Without Visual Words Konstantinos Rematas K.U. Leuven ESAT-iMinds krematas@esat.kuleuven.be Mario Fritz Max Planck Institute for Informatics mfrtiz@mpi-inf.mpg.de
More informationThe state of the art and beyond
Feature Detectors and Descriptors The state of the art and beyond Local covariant detectors and descriptors have been successful in many applications Registration Stereo vision Motion estimation Matching
More informationShape of Gaussians as Feature Descriptors
Shape of Gaussians as Feature Descriptors Liyu Gong, Tianjiang Wang and Fang Liu Intelligent and Distributed Computing Lab, School of Computer Science and Technology Huazhong University of Science and
More informationDistinguish between different types of scenes. Matching human perception Understanding the environment
Scene Recognition Adriana Kovashka UTCS, PhD student Problem Statement Distinguish between different types of scenes Applications Matching human perception Understanding the environment Indexing of images
More informationPedestrian Density Estimation by a Weighted Bag of Visual Words Model
Pedestrian Density Estimation by a Weighted Bag of Visual Words Model Shilin Zhang and Xunyuan Zhang image representation termed bag of visual words integrating weighting scheme and spatial pyramid co-occurrence,
More informationA Discriminatively Trained, Multiscale, Deformable Part Model
A Discriminatively Trained, Multiscale, Deformable Part Model P. Felzenszwalb, D. McAllester, and D. Ramanan Edward Hsiao 16-721 Learning Based Methods in Vision February 16, 2009 Images taken from P.
More informationFisher Vector image representation
Fisher Vector image representation Machine Learning and Category Representation 2014-2015 Jakob Verbeek, January 9, 2015 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.14.15 A brief recap on kernel
More informationSUBJECTIVE EVALUATION OF IMAGE UNDERSTANDING RESULTS
18th European Signal Processing Conference (EUSIPCO-2010) Aalborg, Denmark, August 23-27, 2010 SUBJECTIVE EVALUATION OF IMAGE UNDERSTANDING RESULTS Baptiste Hemery 1, Hélène Laurent 2, and Christophe Rosenberger
More informationGlobal Scene Representations. Tilke Judd
Global Scene Representations Tilke Judd Papers Oliva and Torralba [2001] Fei Fei and Perona [2005] Labzebnik, Schmid and Ponce [2006] Commonalities Goal: Recognize natural scene categories Extract features
More informationProperties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context
Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Silvio Savarese Lecture 10-16-Feb-15 From the 3D to 2D & vice versa
More informationCompressed Fisher vectors for LSVR
XRCE@ILSVRC2011 Compressed Fisher vectors for LSVR Florent Perronnin and Jorge Sánchez* Xerox Research Centre Europe (XRCE) *Now with CIII, Cordoba University, Argentina Our system in a nutshell High-dimensional
More informationBeyond Spatial Pyramids
Beyond Spatial Pyramids Receptive Field Learning for Pooled Image Features Yangqing Jia 1 Chang Huang 2 Trevor Darrell 1 1 UC Berkeley EECS 2 NEC Labs America Goal coding pooling Bear Analysis of the pooling
More informationKernel Methods in Computer Vision
Kernel Methods in Computer Vision Christoph Lampert Max Planck Institute for Biological Cybernetics, Tübingen Matthew Blaschko MPI Tübingen and University of Oxford June 2, 29 Overview... 14: 15: Introduction
More informationLie Algebrized Gaussians for Image Representation
Lie Algebrized Gaussians for Image Representation Liyu Gong, Meng Chen and Chunlong Hu School of CS, Huazhong University of Science and Technology {gongliyu,chenmenghust,huchunlong.hust}@gmail.com Abstract
More informationSpace-time Zernike Moments and Pyramid Kernel Descriptors for Action Classification
Space-time Zernike Moments and Pyramid Kernel Descriptors for Action Classification Luca Costantini 2, Lorenzo Seidenari 1, Giuseppe Serra 1, Licia Capodiferro 2, and Alberto Del Bimbo 1 1 Media Integration
More informationImproving Image Similarity With Vectors of Locally Aggregated Tensors (VLAT)
Improving Image Similarity With Vectors of Locally Aggregated Tensors (VLAT) David Picard and Philippe-Henri Gosselin picard@ensea.fr, gosselin@ensea.fr ETIS / ENSEA - University of Cergy-Pontoise - CNRS
More informationDYNAMIC TEXTURE RECOGNITION USING ENHANCED LBP FEATURES
DYNAMIC TEXTURE RECOGNITION USING ENHANCED FEATURES Jianfeng Ren BeingThere Centre Institute of Media Innovation Nanyang Technological University 50 Nanyang Drive, Singapore 637553. Xudong Jiang, Junsong
More informationEfficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition
Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition Kihyuk Sohn Dae Yon Jung Honglak Lee Alfred O. Hero III Dept. of Electrical Engineering and Computer
More informationIntroduction to Discriminative Machine Learning
Introduction to Discriminative Machine Learning Yang Wang Vision & Media Lab Simon Fraser University CRV Tutorial, Kelowna May 24, 2009 Hand-written Digit Recognition [Belongie et al. PAMI 2002] 2 Hand-written
More informationOverview. Harris interest points. Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points
Overview Harris interest points Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points Evaluation and comparison of different detectors Region descriptors and their performance
More informationLarge-scale classification of traffic signs under real-world conditions
Large-scale classification of traffic signs under real-world conditions Lykele Hazelhoff a,b, Ivo Creusen a,b, Dennis van de Wouw a,b and Peter H.N. de With a,b a CycloMedia Technology B.V., Achterweg
More informationMultiple Similarities Based Kernel Subspace Learning for Image Classification
Multiple Similarities Based Kernel Subspace Learning for Image Classification Wang Yan, Qingshan Liu, Hanqing Lu, and Songde Ma National Laboratory of Pattern Recognition, Institute of Automation, Chinese
More informationDetectors part II Descriptors
EECS 442 Computer vision Detectors part II Descriptors Blob detectors Invariance Descriptors Some slides of this lectures are courtesy of prof F. Li, prof S. Lazebnik, and various other lecturers Goal:
More informationEE 6882 Visual Search Engine
EE 6882 Visual Search Engine Prof. Shih Fu Chang, Feb. 13 th 2012 Lecture #4 Local Feature Matching Bag of Word image representation: coding and pooling (Many slides from A. Efors, W. Freeman, C. Kambhamettu,
More informationUrban land use information retrieval based on scene classification of Google Street View images
Urban land use information retrieval based on scene classification of Google Street View images Xiaojiang Li 1, Chuanrong Zhang 1 1 Department of Geography, University of Connecticut, Storrs Email: {xiaojiang.li;chuanrong.zhang}@uconn.edu
More informationOverview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors
Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and
More informationOverview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors
Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and
More informationInvariant local features. Invariant Local Features. Classes of transformations. (Good) invariant local features. Case study: panorama stitching
Invariant local eatures Invariant Local Features Tuesday, February 6 Subset o local eature types designed to be invariant to Scale Translation Rotation Aine transormations Illumination 1) Detect distinctive
More informationCS4495/6495 Introduction to Computer Vision. 8C-L3 Support Vector Machines
CS4495/6495 Introduction to Computer Vision 8C-L3 Support Vector Machines Discriminative classifiers Discriminative classifiers find a division (surface) in feature space that separates the classes Several
More informationDiscriminative part-based models. Many slides based on P. Felzenszwalb
More sliding window detection: ti Discriminative part-based models Many slides based on P. Felzenszwalb Challenge: Generic object detection Pedestrian detection Features: Histograms of oriented gradients
More informationProbabilistic Latent Semantic Analysis
Probabilistic Latent Semantic Analysis Dan Oneaţă 1 Introduction Probabilistic Latent Semantic Analysis (plsa) is a technique from the category of topic models. Its main goal is to model cooccurrence information
More informationOrientation Map Based Palmprint Recognition
Orientation Map Based Palmprint Recognition (BM) 45 Orientation Map Based Palmprint Recognition B. H. Shekar, N. Harivinod bhshekar@gmail.com, harivinodn@gmail.com India, Mangalore University, Department
More informationRepresenting Sets of Instances for Visual Recognition
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) Representing Sets of Instances for Visual Recognition Jianxin Wu, 1 Bin-Bin Gao, 1 Guoqing Liu 2 1 National Key Laboratory
More informationGrouplet: A Structured Image Representation for Recognizing Human and Object Interactions
Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao Li Fei-Fei Computer Science Department, Stanford University, USA {bangpeng,feifeili}@cs.stanford.edu
More informationSCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES
SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES Tetsu Matsukawa and Takio Kurita Department of Computer Science, University of Tsukuba, Tennodai 1-1-1 Tsukuba, Japan
More informationLoG Blob Finding and Scale. Scale Selection. Blobs (and scale selection) Achieving scale covariance. Blob detection in 2D. Blob detection in 2D
Achieving scale covariance Blobs (and scale selection) Goal: independently detect corresponding regions in scaled versions of the same image Need scale selection mechanism for finding characteristic region
More informationAchieving scale covariance
Achieving scale covariance Goal: independently detect corresponding regions in scaled versions of the same image Need scale selection mechanism for finding characteristic region size that is covariant
More informationA biologically plausible network for the computation of orientation dominance
A biologically plausible network for the computation of orientation dominance Kritika Muralidharan Statistical Visual Computing Laboratory University of California San Diego La Jolla, CA 9239 krmurali@ucsd.edu
More informationCS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015
CS 3710: Visual Recognition Describing Images with Features Adriana Kovashka Department of Computer Science January 8, 2015 Plan for Today Presentation assignments + schedule changes Image filtering Feature
More informationAnalysis on a local approach to 3D object recognition
Analysis on a local approach to 3D object recognition Elisabetta Delponte, Elise Arnaud, Francesca Odone, and Alessandro Verri DISI - Università degli Studi di Genova - Italy Abstract. We present a method
More informationLOCALITY PRESERVING HASHING. Electrical Engineering and Computer Science University of California, Merced Merced, CA 95344, USA
LOCALITY PRESERVING HASHING Yi-Hsuan Tsai Ming-Hsuan Yang Electrical Engineering and Computer Science University of California, Merced Merced, CA 95344, USA ABSTRACT The spectral hashing algorithm relaxes
More informationLearning Sparse Covariance Patterns for Natural Scenes
Learning Sparse Covariance Patterns for Natural Scenes Liwei Wang Yin Li Jiaya Jia Jian Sun David Wipf James M. Rehg The Chinese University of Hong Kong Georgia Institute of Technology Microsoft Research
More informationPhoto Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II
Photo Tourism and im2gps: 3D Reconstruction and Geolocation of Internet Photo Collections Part II Noah Snavely Cornell James Hays CMU MIT (Fall 2009) Brown (Spring 2010 ) Complexity of the Visual World
More informationAction-Decision Networks for Visual Tracking with Deep Reinforcement Learning
Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning Sangdoo Yun 1 Jongwon Choi 1 Youngjoon Yoo 2 Kimin Yun 3 and Jin Young Choi 1 1 ASRI, Dept. of Electrical and Computer Eng.,
More informationSCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES
SCENE CLASSIFICATION USING SPATIAL RELATIONSHIP BETWEEN LOCAL POSTERIOR PROBABILITIES Tetsu Matsukawa and Takio Kurita :Department of Computer Science, University of Tsukuba, Tennodai 1-1-1 Tsukuba, Japan
More informationInternational Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, ISSN
International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, www.ijcea.com ISSN 2321-3469 CONTENT-BASED IMAGE RETRIEVAL USING ZERNIKE MOMENTS AND SURF Priyanka
More informationHigher-order Statistical Modeling based Deep CNNs (Part-I)
Higher-order Statistical Modeling based Deep CNNs (Part-I) Classical Methods & From Shallow to Deep Qilong Wang 2018-11-23 Context 1 2 3 4 Higher-order Statistics in Bag-of-Visual-Words (BoVW) Higher-order
More informationHilbert-Huang Transform-based Local Regions Descriptors
Hilbert-Huang Transform-based Local Regions Descriptors Dongfeng Han, Wenhui Li, Wu Guo Computer Science and Technology, Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of
More informationDEIM Forum 04 E-3 43 80 3 5 / DC 43 80 3 5 90065 6-6 43 80 3 5 E-mail: gs3007@s.inf.shizuoka.ac.jp, dgs538@s.inf.shizuoka.ac.jp, ishikawa-hiroshi@sd.tmu.ac.jp, yokoyama@inf.shizuoka.ac.jp Flickr Exif OpenLayers
More informationFeature detectors and descriptors. Fei-Fei Li
Feature detectors and descriptors Fei-Fei Li Feature Detection e.g. DoG detected points (~300) coordinates, neighbourhoods Feature Description e.g. SIFT local descriptors (invariant) vectors database of
More informationClass-Specific Simplex-Latent Dirichlet Allocation for Image Classification
Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification Mandar Dixit, Nikhil Rasiwasia, Nuno Vasconcelos Department of Electrical and Computer Engineering University of California,
More informationDirichlet-based Histogram Feature Transform for Image Classification
Dirichlet-based Histogram Feature Transform for Image Classification Takumi Kobayashi National Institute of Advanced Industrial Science and Technology Umezono --, Tsukuba, Japan takumi.kobayashi@aist.go.jp
More informationWorkshop on Web- scale Vision and Social Media ECCV 2012, Firenze, Italy October, 2012 Linearized Smooth Additive Classifiers
Worshop on Web- scale Vision and Social Media ECCV 2012, Firenze, Italy October, 2012 Linearized Smooth Additive Classifiers Subhransu Maji Research Assistant Professor Toyota Technological Institute at
More informationObject Detection Grammars
Object Detection Grammars Pedro F. Felzenszwalb and David McAllester February 11, 2010 1 Introduction We formulate a general grammar model motivated by the problem of object detection in computer vision.
More informationFeature detectors and descriptors. Fei-Fei Li
Feature detectors and descriptors Fei-Fei Li Feature Detection e.g. DoG detected points (~300) coordinates, neighbourhoods Feature Description e.g. SIFT local descriptors (invariant) vectors database of
More informationCS5670: Computer Vision
CS5670: Computer Vision Noah Snavely Lecture 5: Feature descriptors and matching Szeliski: 4.1 Reading Announcements Project 1 Artifacts due tomorrow, Friday 2/17, at 11:59pm Project 2 will be released
More informationCorners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros
Corners, Blobs & Descriptors With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros Motivation: Build a Panorama M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 How do we build panorama?
More informationAccumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales
Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales Tsun-Yi Yang,2 Yen-Yu Lin Yung-Yu Chuang 2 Academia Sinica, Taiwan 2 National Taiwan University, Taiwan {shamangary,yylin}@citi.sinica.edu.tw
More informationHuman Action Recognition under Log-Euclidean Riemannian Metric
Human Action Recognition under Log-Euclidean Riemannian Metric Chunfeng Yuan, Weiming Hu, Xi Li, Stephen Maybank 2, Guan Luo, National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing,
More informationMIL-UT at ILSVRC2014
MIL-UT at ILSVRC2014 IIT Guwahati (undergrad) -> Virginia Tech (intern) Senthil Purushwalkam, Yuichiro Tsuchiya, Atsushi Kanehira, Asako Kanezaki and *Tatsuya Harada The University of Tokyo Pipeline of
More informationTRAFFIC SCENE RECOGNITION BASED ON DEEP CNN AND VLAD SPATIAL PYRAMIDS
TRAFFIC SCENE RECOGNITION BASED ON DEEP CNN AND VLAD SPATIAL PYRAMIDS FANG-YU WU 1, SHI-YANG YAN 1, JEREMY S. SMITH 2, BAI-LING ZHANG 1 1 Department of Computer Science and Software Engineering, Xi an
More informationClass-Specific Simplex-Latent Dirichlet Allocation for Image Classification
3 IEEE International Conference on Computer Vision Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification Mandar Dixit, Nikhil Rasiwasia, Nuno Vasconcelos Department of Electrical
More informationSemi-supervised Dictionary Learning Based on Hilbert-Schmidt Independence Criterion
Semi-supervised ictionary Learning Based on Hilbert-Schmidt Independence Criterion Mehrdad J. Gangeh 1, Safaa M.A. Bedawi 2, Ali Ghodsi 3, and Fakhri Karray 2 1 epartments of Medical Biophysics, and Radiation
More informationMulti-Layer Boosting for Pattern Recognition
Multi-Layer Boosting for Pattern Recognition François Fleuret IDIAP Research Institute, Centre du Parc, P.O. Box 592 1920 Martigny, Switzerland fleuret@idiap.ch Abstract We extend the standard boosting
More informationFine-grained Classification
Fine-grained Classification Marcel Simon Department of Mathematics and Computer Science, Germany marcel.simon@uni-jena.de http://www.inf-cv.uni-jena.de/ Seminar Talk 23.06.2015 Outline 1 Motivation 2 3
More informationImproved Local Coordinate Coding using Local Tangents
Improved Local Coordinate Coding using Local Tangents Kai Yu NEC Laboratories America, 10081 N. Wolfe Road, Cupertino, CA 95129 Tong Zhang Rutgers University, 110 Frelinghuysen Road, Piscataway, NJ 08854
More informationGraphical Object Models for Detection and Tracking
Graphical Object Models for Detection and Tracking (ls@cs.brown.edu) Department of Computer Science Brown University Joined work with: -Ying Zhu, Siemens Corporate Research, Princeton, NJ -DorinComaniciu,
More informationTowards Good Practices for Action Video Encoding
Towards Good Practices for Action Video Encoding Jianxin Wu National Key Laboratory for Novel Software Technology Nanjing University, China wujx21@nju.edu.cn Yu Zhang Nanyang Technological University Singapore
More informationAsaf Bar Zvi Adi Hayat. Semantic Segmentation
Asaf Bar Zvi Adi Hayat Semantic Segmentation Today s Topics Fully Convolutional Networks (FCN) (CVPR 2015) Conditional Random Fields as Recurrent Neural Networks (ICCV 2015) Gaussian Conditional random
More informationComparative study of global invariant. descriptors for object recognition
Author manuscript, published in "Journal of Electronic Imaging (2008) 1-35" Comparative study of global invariant descriptors for object recognition A. Choksuriwong, B. Emile, C. Rosenberger, H. Laurent
More informationLoss Functions and Optimization. Lecture 3-1
Lecture 3: Loss Functions and Optimization Lecture 3-1 Administrative Assignment 1 is released: http://cs231n.github.io/assignments2017/assignment1/ Due Thursday April 20, 11:59pm on Canvas (Extending
More informationINTEREST POINTS AT DIFFERENT SCALES
INTEREST POINTS AT DIFFERENT SCALES Thank you for the slides. They come mostly from the following sources. Dan Huttenlocher Cornell U David Lowe U. of British Columbia Martial Hebert CMU Intuitively, junctions
More informationShared Segmentation of Natural Scenes. Dependent Pitman-Yor Processes
Shared Segmentation of Natural Scenes using Dependent Pitman-Yor Processes Erik Sudderth & Michael Jordan University of California, Berkeley Parsing Visual Scenes sky skyscraper sky dome buildings trees
More informationGaussian Process Style Transfer Mapping for Historical Chinese Character Recognition
Best Student Paper Award Gaussian Process Style Transfer Mapping for Historical Chinese Character Recognition Jixiong Feng a, Liangrui Peng a and Franck Lebourgeois b a Tsinghua National Laboratory for
More informationEdges and Scale. Image Features. Detecting edges. Origin of Edges. Solution: smooth first. Effects of noise
Edges and Scale Image Features From Sandlot Science Slides revised from S. Seitz, R. Szeliski, S. Lazebnik, etc. Origin of Edges surface normal discontinuity depth discontinuity surface color discontinuity
More informationObject Recognition Using Local Characterisation and Zernike Moments
Object Recognition Using Local Characterisation and Zernike Moments A. Choksuriwong, H. Laurent, C. Rosenberger, and C. Maaoui Laboratoire Vision et Robotique - UPRES EA 2078, ENSI de Bourges - Université
More informationRegion Moments: Fast invariant descriptors for detecting small image structures
Region Moments: Fast invariant descriptors for detecting small image structures Gianfranco Doretto Yi Yao Visualization and Computer Vision Lab, GE Global Research, Niskayuna, NY 39 doretto@research.ge.com
More informationLecture 8: Interest Point Detection. Saad J Bedros
#1 Lecture 8: Interest Point Detection Saad J Bedros sbedros@umn.edu Review of Edge Detectors #2 Today s Lecture Interest Points Detection What do we mean with Interest Point Detection in an Image Goal:
More informationFace Recognition Using Global Gabor Filter in Small Sample Case *
ISSN 1673-9418 CODEN JKYTA8 E-mail: fcst@public2.bta.net.cn Journal of Frontiers of Computer Science and Technology http://www.ceaj.org 1673-9418/2010/04(05)-0420-06 Tel: +86-10-51616056 DOI: 10.3778/j.issn.1673-9418.2010.05.004
More informationAdvances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al.
6.869 Advances in Computer Vision Prof. Bill Freeman March 3, 2005 Image and shape descriptors Affine invariant features Comparison of feature descriptors Shape context Readings: Mikolajczyk and Schmid;
More informationRobust Detection, Classification and Positioning of Traffic Signs from Street-Level Panoramic Images for Inventory Purposes
Robust Detection, Classification and Positioning of Traffic Signs from Street-Level Panoramic Images for Inventory Purposes Lykele Hazelhoff and Ivo Creusen CycloMedia Technology B.V. Achterweg 38, 4181
More informationA METHOD OF FINDING IMAGE SIMILAR PATCHES BASED ON GRADIENT-COVARIANCE SIMILARITY
IJAMML 3:1 (015) 69-78 September 015 ISSN: 394-58 Available at http://scientificadvances.co.in DOI: http://dx.doi.org/10.1864/ijamml_710011547 A METHOD OF FINDING IMAGE SIMILAR PATCHES BASED ON GRADIENT-COVARIANCE
More information39 Mutual Component Analysis for Heterogeneous Face Recognition
39 Mutual Component Analysis for Heterogeneous Face Recognition ZHIFENG LI, Chinese Academy of Sciences DIHONG GONG, Chinese Academy of Sciences QIANG LI, University of Technology Sydney DACHENG TAO, University
More informationSelf-Adaptable Templates for Feature Coding
Self-Adaptable Templates for Feature Coding Xavier Boix 1,2 Gemma Roig 1,2 Salomon Diether 1 Luc Van Gool 1 1 Computer Vision Laboratory, ETH Zurich, Switzerland 2 LCSL, Massachusetts Institute of Technology
More informationStyle-aware Mid-level Representation for Discovering Visual Connections in Space and Time
Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time Experiment presentation for CS3710:Visual Recognition Presenter: Zitao Liu University of Pittsburgh ztliu@cs.pitt.edu
More informationTUTORIAL PART 1 Unsupervised Learning
TUTORIAL PART 1 Unsupervised Learning Marc'Aurelio Ranzato Department of Computer Science Univ. of Toronto ranzato@cs.toronto.edu Co-organizers: Honglak Lee, Yoshua Bengio, Geoff Hinton, Yann LeCun, Andrew
More informationEnhanced Local Binary Covariance Matrices (ELBCM) for texture analysis and object tracking
Enhanced Local Binary Covariance Matrices () for texture analysis and object tracking Andrés Romero Laboratoire de Recherche en Informatique Bât 650, Université Paris Sud 91405 Orsay Cedex France andres.romero@upsud.fr
More informationCSE 473/573 Computer Vision and Image Processing (CVIP)
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu inwogu@buffalo.edu Lecture 11 Local Features 1 Schedule Last class We started local features Today More on local features Readings for
More informationTwo at Once: Enhancing Learning and Generalization Capacities via IBN-Net
Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net Supplementary Material Xingang Pan 1, Ping Luo 1, Jianping Shi 2, and Xiaoou Tang 1 1 CUHK-SenseTime Joint Lab, The Chinese University
More informationInvariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features
Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features G. Y. Chen and B. Kégl Department of Computer Science and Operations Research, University of Montreal, CP 6128 succ.
More informationScale-space image processing
Scale-space image processing Corresponding image features can appear at different scales Like shift-invariance, scale-invariance of image processing algorithms is often desirable. Scale-space representation
More informationDetecting Humans via Their Pose
Detecting Humans via Their Pose Alessandro Bissacco Computer Science Department University of California, Los Angeles Los Angeles, CA 90095 bissacco@cs.ucla.edu Ming-Hsuan Yang Honda Research Institute
More informationInvariant Scattering Convolution Networks
Invariant Scattering Convolution Networks Joan Bruna and Stephane Mallat Submitted to PAMI, Feb. 2012 Presented by Bo Chen Other important related papers: [1] S. Mallat, A Theory for Multiresolution Signal
More informationHeadNet: Pedestrian Head Detection Utilizing Body in Context
HeadNet: Pedestrian Head Detection Utilizing Body in Context Gang Chen 1,2, Xufen Cai 1, Hu Han,1, Shiguang Shan 1,2,3 and Xilin Chen 1,2 1 Key Laboratory of Intelligent Information Processing of Chinese
More information