Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

Similar documents
Overview. Harris interest points. Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

Detectors part II Descriptors

SIFT keypoint detection. D. Lowe, Distinctive image features from scale-invariant keypoints, IJCV 60 (2), pp , 2004.

Blobs & Scale Invariance

Instance-level l recognition. Cordelia Schmid & Josef Sivic INRIA

Blob Detection CSC 767

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros

Feature extraction: Corners and blobs

Lecture 8: Interest Point Detection. Saad J Bedros

Feature detectors and descriptors. Fei-Fei Li

Scale & Affine Invariant Interest Point Detectors

Feature detectors and descriptors. Fei-Fei Li

Edges and Scale. Image Features. Detecting edges. Origin of Edges. Solution: smooth first. Effects of noise

CSE 473/573 Computer Vision and Image Processing (CVIP)

Extract useful building blocks: blobs. the same image like for the corners

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context

Advances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al.

INTEREST POINTS AT DIFFERENT SCALES

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

SURF Features. Jacky Baltes Dept. of Computer Science University of Manitoba WWW:

Image Analysis. Feature extraction: corners and blobs

Maximally Stable Local Description for Scale Selection

Recap: edge detection. Source: D. Lowe, L. Fei-Fei

LoG Blob Finding and Scale. Scale Selection. Blobs (and scale selection) Achieving scale covariance. Blob detection in 2D. Blob detection in 2D

Achieving scale covariance

Scale-space image processing

CS5670: Computer Vision

Feature Vector Similarity Based on Local Structure

SURVEY OF APPEARANCE-BASED METHODS FOR OBJECT RECOGNITION

Wavelet-based Salient Points with Scale Information for Classification

The state of the art and beyond

Invariant local features. Invariant Local Features. Classes of transformations. (Good) invariant local features. Case study: panorama stitching

Given a feature in I 1, how to find the best match in I 2?

Hilbert-Huang Transform-based Local Regions Descriptors

Affine Adaptation of Local Image Features Using the Hessian Matrix

Vlad Estivill-Castro (2016) Robots for People --- A project for intelligent integrated systems

SIFT: SCALE INVARIANT FEATURE TRANSFORM BY DAVID LOWE

Lecture 12. Local Feature Detection. Matching with Invariant Features. Why extract features? Why extract features? Why extract features?

Lecture 8: Interest Point Detection. Saad J Bedros

Scale & Affine Invariant Interest Point Detectors

Orientation Map Based Palmprint Recognition

Lecture 6: Finding Features (part 1/2)

Lecture 7: Finding Features (part 2/2)

arxiv: v1 [cs.cv] 10 Feb 2016

Local Features (contd.)

Lecture 7: Finding Features (part 2/2)

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Instance-level l recognition. Cordelia Schmid INRIA

A New Shape Adaptation Scheme to Affine Invariant Detector

SIFT, GLOH, SURF descriptors. Dipartimento di Sistemi e Informatica

Image matching. by Diva Sian. by swashford

Corner detection: the basic idea

Feature detection.

Advanced Features. Advanced Features: Topics. Jana Kosecka. Slides from: S. Thurn, D. Lowe, Forsyth and Ponce. Advanced features and feature matching

KAZE Features. 1 Introduction. Pablo Fernández Alcantarilla 1, Adrien Bartoli 1, and Andrew J. Davison 2

Is the Scale Invariant Feature Transform (SIFT) really Scale Invariant?

Rotational Invariants for Wide-baseline Stereo

2. The wave equation. u tt (x i,y j,t n ) ( u n+1. 2u n i,j + u n 1 ) (2) 2 u(x i,y j,t n ) ( u n i 1,j 1 +2u n i 1,j + u n i 1,j+1+ (3)

Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales

On the Completeness of Coding with Image Features

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015

Lecture 05 Point Feature Detection and Matching

6.869 Advances in Computer Vision. Prof. Bill Freeman March 1, 2005

Harris Corner Detector

On the consistency of the SIFT Method

Edge Detection. CS 650: Computer Vision

SIFT: Scale Invariant Feature Transform

Learning Discriminative and Transformation Covariant Local Feature Detectors

Lecture 6: Edge Detection. CAP 5415: Computer Vision Fall 2008

Image Processing 1 (IP1) Bildverarbeitung 1

ECE 468: Digital Image Processing. Lecture 8

Visual Object Recognition

CS4670: Computer Vision Kavita Bala. Lecture 7: Harris Corner Detec=on

An Improved Harris-Affine Invariant Interest Point Detector

EE 6882 Visual Search Engine

Lecture 7: Edge Detection

Is SIFT Scale Invariant?

Affine Differential Invariants for Invariant Feature Point Detection

Interpreting the Ratio Criterion for Matching SIFT Descriptors

Lesson 04. KAZE, Non-linear diffusion filtering, ORB, MSER. Ing. Marek Hrúz, Ph.D.

Feature extraction: Corners and blobs

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, August 18, ISSN

Scale-space image processing

Interest Operators. All lectures are from posted research papers. Harris Corner Detector: the first and most basic interest operator

Scale Space Smoothing, Image Feature Extraction and Bessel Filters

Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces

SCALE-SPACE - Theory and Applications

Filtering and Edge Detection

Linear Diffusion. E9 242 STIP- R. Venkatesh Babu IISc

Roadmap. Introduction to image analysis (computer vision) Theory of edge detection. Applications

Region Moments: Fast invariant descriptors for detecting small image structures

Interest Point Detection with Wavelet Maxima Lines

Coding Images with Local Features

Equi-Affine Differential Invariants for Invariant Feature Point Detection

Extracting Scale and Illuminant Invariant Regions through Color

Laplacian Filters. Sobel Filters. Laplacian Filters. Laplacian Filters. Laplacian Filters. Laplacian Filters

Perception III: Filtering, Edges, and Point-features

Transcription:

Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and their performance

Scale invariance - motivation Description regions have to be adapted to scale changes Interest points have to be repeatable for scale changes

Harris detector + scale changes Repeatability rate R( ε ) = {( a i, b i ) dist( H ( ai ), b max( a, b ) i i i ) < ε}

Scale adaptation Scale change between two images I 1 x y 1 1 = I 2 x y 2 2 = I 2 sx sy 1 1 Scale adapted derivative calculation

Scale adaptation = = 1 1 2 2 2 2 1 1 1 sy sx I y x I y x I Scale change between two images ) ( ) ( 1 1 2 2 2 1 1 1 σ σ s G y x I s G y x I n n i i n i i K K = Scale adapted derivative calculation sσ n s

Scale adaptation (σ ) L i where are the derivatives with Gaussian convolution ) ( ) ( ) ( ) ( ~ ) ( 2 2 σ σ σ σ σ y y x y x x L L L L L L G

Scale adaptation (σ ) L i where are the derivatives with Gaussian convolution ) ( ) ( ) ( ) ( ~ ) ( 2 2 σ σ σ σ σ y y x y x x L L L L L L G ) ( ) ( ) ( ) ( ~ ) ( 2 2 2 σ σ σ σ σ s L s L L s L L s L s G s y y x y x x Scale adapted auto-correlation matrix

Harris detector adaptation to scale R( ε ) {( a, b ) dist( H ( a ), b ) < ε} = i i i i

Multi-scale matching algorithm s = 1 s = 3 s = 5

Multi-scale matching algorithm s = 1 8 matches

Multi-scale matching algorithm Robust estimation of a global affine transformation s = 1 3 matches

Multi-scale matching algorithm s = 1 3 matches s = 3 4 matches

Multi-scale matching algorithm s = 1 3 matches s = 3 4 matches highest number of matches correct scale s = 5 16 matches

Matching results Scale change of 5.7

Matching results 100% correct matches (13 matches)

Scale selection We want to find the characteristic scale of the blob by convolving it with Laplacians at several scales and looking for the maximum response However, Laplacian response decays as scale increases: original signal (radius=8) increasing σ Why does this happen?

Scale normalization The response of a derivative of Gaussian filter to a perfect step edge decreases as σ increases 1 σ 2π

Scale normalization The response of a derivative of Gaussian filter to a perfect step edge decreases as σ increases To keep response the same (scale-invariant), must multiply Gaussian derivative by σ Laplacian is the second Gaussian derivative, so it must be Laplacian is the second Gaussian derivative, so it must be multiplied by σ 2

Effect of scale normalization Original signal Unnormalized Laplacian response Scale-normalized Laplacian response maximum

Blob detection in 2D Laplacian of Gaussian: Circularly symmetric operator for blob detection in 2D 2 g = 2 x g 2 + 2 y g 2

Blob detection in 2D Laplacian of Gaussian: Circularly symmetric operator for blob detection in 2D Scale-normalized: g g 2 g = σ 2 + norm x 2 y 2 2 2

Scale selection The 2D Laplacian is given by ( x 2 2 2 2 2 2 ( )/ 2 2σ x + y σ + y ) e (up to scale) For a binary circle of radius r, the Laplacian achieves a maximum at σ = r / 2 r Laplacian response image r / 2 scale (σ)

Characteristic scale We define the characteristic scale as the scale that produces peak of Laplacian response characteristic scale T. Lindeberg (1998). Feature detection with automatic scale selection. International Journal of Computer Vision 30 (2): pp 77--116.

Scale selection For a point compute a value (gradient, Laplacian etc.) at several scales Normalization of the values with the scale factor 2 e.g. Laplacian s ( L xx + L yy ) Select scale s at the maximum characteristic scale 2 s ( L xx + L yy ) scale Exp. results show that the Laplacian gives best results

Scale selection Scale invariance of the characteristic scale s norm. Lap. scale

Scale selection Scale invariance of the characteristic scale s norm. Lap. norm. Lap. scale scale Relation between characteristic scales s s 1 = s2

Scale-invariant detectors Harris-Laplace (Mikolajczyk & Schmid 01) Laplacian detector (Lindeberg 98) Difference of Gaussian (Lowe 99) Harris-Laplace Laplacian

Harris-Laplace multi-scale Harris points selection of points at maximum of Laplacian invariant points + associated regions [Mikolajczyk & Schmid 01]

Matching results 213 / 190 detected interest points

Matching results 58 points are initially matched

Matching results 32 points are matched after verification all correct

LOG detector Convolve image with scalenormalized Laplacian at several scales 2 LOG = s ( G xx ( σ ) + G yy ( σ )) Detection of maxima and minima of Laplacian in scale space

Hessian detector Hessian matrix H ( x) = L L xx xy L L xy yy Determinant of Hessian matrix DET = L xx L yy L 2 xy Penalizes/eliminates long structures with small derivative in a single direction

Efficient implementation Difference of Gaussian (DOG) approximates the Laplacian DOG = G ) ( ) ( kσ G σ Error due to the approximation

DOG detector Fast computation, scale space processed one octave at a time David G. Lowe. "Distinctive image features from scale-invariant keypoints. IJCV 60 (2).

Local features - overview Scale invariant interest points Affine invariant interest points Evaluation of interest points Descriptors and their evaluation

Affine invariant regions - Motivation Scale invariance is not sufficient for large baseline changes detected scale invariant region A projected regions, viewpoint changes can locally be approximated by an affine transformation A

Affine invariant regions - Motivation

Affine invariant regions - Example

Harris/Hessian/Laplacian-Affine Initialize with scale-invariant Harris/Hessian/Laplacian points Estimation of the affine neighbourhood with the second moment matrix [Lindeberg 94] Apply affine neighbourhood estimation to the scaleinvariant interest points [Mikolajczyk & Schmid 02, Schaffalitzky & Zisserman 02] Excellent results in a comparison [Mikolajczyk et al. 05]

Affine invariant regions Based on the second moment matrix (Lindeberg 94) = = ), ( ), ( ), ( ), ( ) ( ),, ( 2 2 2 D y D y x D y x D x I D D I L L L L L L G M σ σ σ σ σ σ σ σ µ x x x x x x 2 x 1 = M Normalization with eigenvalues/eigenvectors

Affine invariant regions x = A R x L x L = M 1 2 L x L x R = M 1 2 R x R x R = Rx L Isotropic neighborhoods related by image rotation

Affine invariant regions - Estimation Iterative estimation initial points

Affine invariant regions - Estimation Iterative estimation iteration #1

Affine invariant regions - Estimation Iterative estimation iteration #2

Affine invariant regions - Estimation Iterative estimation iteration #3, #4

Harris-Affine versus Harris-Laplace Harris-Affine Harris-Laplace

Harris/Hessian-Affine Harris-Affine Hessian-Affine

Harris-Affine

Hessian-Affine

Matches 22 correct matches

Matches 33 correct matches

Maximally stable extremal regions (MSER) [Matas 02] Extremal regions: connected components in a thresholded image (all pixels above/below a threshold) Maximally stable: minimal change of the component (area) for a change of the threshold, i.e. region remains stable for a change of threshold Excellent results in a recent comparison

Maximally stable extremal regions (MSER) Examples of thresholded images high threshold low threshold

MSER

Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and their performance

Evaluation of interest points Quantitative evaluation of interest point/region detectors points / regions at the same relative location and area Repeatability rate : percentage of corresponding points Two points/regions are corresponding if location error small area intersection large [K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir & L. Van Gool 05]

Evaluation criterion H repeatability = # corresponding regions # detected regions 100%

Evaluation criterion H # corresponding regions repeatability = # detected regions intersection overlap error = ( 1 ) 100% union 100% 2% 10% 20% 30% 40% 50% 60%

Dataset Different types of transformation Viewpoint change Scale change Image blur JPEG compression Light change Two scene types Structured Textured Transformations within the sequence (homographies) Independent estimation

Viewpoint change (0-60 degrees ) structured scene textured scene

Zoom + rotation (zoom of 1-4) structured scene textured scene

Blur, compression, illumination blur - structured scene blur - textured scene light change - structured scene jpeg compression - structured scene

Comparison of affine invariant detectors repeatabilit ty % 100 90 80 70 60 50 40 30 20 10 Viewpoint change - structured scene repeatability % Harris Affine Hessian Affine MSER IBR EBR Salient number of corresp pondences 1400 1200 # correspondences IBR 1000 EBR Salient 800 600 400 200 Harris Affine Hessian Affine MSER 0 15 20 25 30 35 40 45 50 55 60 65 viewpoint angle 0 15 20 25 30 35 40 45 50 55 60 65 viewpoint angle reference image 20 40 60

Comparison of affine invariant detectors Scale change repeatability % repeatability % reference image 2.8 reference image 4

Conclusion - detectors Good performance for large viewpoint and scale changes Results depend on transformation and scene type, no one best detector Detectors are complementary MSER adapted to structured scenes Harris and Hessian adapted to textured scenes Performance of the different scale invariant detectors is very similar (Harris-Laplace, Hessian, LoG and DOG) Scale-invariant detector sufficient up to 40 degrees of viewpoint change

Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison of different detectors Region descriptors and their performance

Region descriptors Normalized regions are invariant to geometric transformations except rotation not invariant to photometric transformations

Descriptors Regions invariant to geometric transformations except rotation rotation invariant descriptors normalization with dominant gradient direction Regions not invariant to photometric transformations invariance to affine photometric transformations normalization with mean and standard deviation of the image patch

Descriptors Extract affine regions Normalize regions Eliminate rotational + illumination Compute appearance descriptors SIFT (Lowe 04)

Descriptors Gaussian derivative-based descriptors Differential invariants (Koenderink and van Doorn 87) Steerable filters (Freeman and Adelson 91) SIFT (Lowe 99) Moment invariants [Van Gool et al. 96] Shape context [Belongie et al. 02] SIFT with PCA dimensionality reduction Gradient PCA [Ke and Sukthankar 04] SURF descriptor [Bay et al. 08] DAISY descriptor [Tola et al. 08, Windler et al 09]

Descriptors should be Distinctive Comparison criterion Robust to changes on viewing conditions as well as to errors of the detector Detection rate (recall) #correct matches / #correspondences False positive rate #false matches / #all matches Variation of the distance threshold distance (d1, d2) < threshold [K. Mikolajczyk & C. Schmid, PAMI 05] 1 1

Viewpoint change (60 degrees) * * sift esift gradient pca shape context cross correlation har aff esift steerable filters gradient moments complex filters 1 0.9 0.8 0.7 #correct / 210 01 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 precision

Scale change (factor 2.8) * * sift esift gradient pca shape context cross correlation har aff esift steerable filters gradient moments complex filters 1 0.9 0.8 0.7 #correct / 208 86 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 precision

Conclusion - descriptors SIFT based descriptors perform best Significant difference between SIFT and low dimension descriptors as well as cross-correlation Robust region descriptors better than point-wise descriptors Performance of the descriptor is relatively independent of the detector

Available on the internet http://lear.inrialpes.fr/software Binaries for detectors and descriptors Building blocks for recognition systems Carefully designed test setup Dataset with transformations Evaluation code in matlab Benchmark for new detectors and descriptors