INTEREST POINTS AT DIFFERENT SCALES

Similar documents
Corner detection: the basic idea

Advances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al.

Feature detectors and descriptors. Fei-Fei Li

Feature detectors and descriptors. Fei-Fei Li

Lecture 8: Interest Point Detection. Saad J Bedros

Lecture 12. Local Feature Detection. Matching with Invariant Features. Why extract features? Why extract features? Why extract features?

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros

Blobs & Scale Invariance

Feature extraction: Corners and blobs

Edges and Scale. Image Features. Detecting edges. Origin of Edges. Solution: smooth first. Effects of noise

Invariant local features. Invariant Local Features. Classes of transformations. (Good) invariant local features. Case study: panorama stitching

Extract useful building blocks: blobs. the same image like for the corners

LoG Blob Finding and Scale. Scale Selection. Blobs (and scale selection) Achieving scale covariance. Blob detection in 2D. Blob detection in 2D

Local Features (contd.)

SIFT keypoint detection. D. Lowe, Distinctive image features from scale-invariant keypoints, IJCV 60 (2), pp , 2004.

Lecture 8: Interest Point Detection. Saad J Bedros

Achieving scale covariance

Detectors part II Descriptors

CSE 473/573 Computer Vision and Image Processing (CVIP)

Blob Detection CSC 767

Overview. Harris interest points. Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points

Lecture 7: Finding Features (part 2/2)

Lecture 7: Finding Features (part 2/2)

CS5670: Computer Vision

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

SIFT: Scale Invariant Feature Transform

Image Analysis. Feature extraction: corners and blobs

Scale & Affine Invariant Interest Point Detectors

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015

Recap: edge detection. Source: D. Lowe, L. Fei-Fei

Vlad Estivill-Castro (2016) Robots for People --- A project for intelligent integrated systems

SIFT: SCALE INVARIANT FEATURE TRANSFORM BY DAVID LOWE

Scale-space image processing

Advanced Features. Advanced Features: Topics. Jana Kosecka. Slides from: S. Thurn, D. Lowe, Forsyth and Ponce. Advanced features and feature matching

SIFT, GLOH, SURF descriptors. Dipartimento di Sistemi e Informatica

6.869 Advances in Computer Vision. Prof. Bill Freeman March 1, 2005

Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

Harris Corner Detector

EE 6882 Visual Search Engine

Scale & Affine Invariant Interest Point Detectors

CS4670: Computer Vision Kavita Bala. Lecture 7: Harris Corner Detec=on

Image matching. by Diva Sian. by swashford

EECS150 - Digital Design Lecture 15 SIFT2 + FSM. Recap and Outline

SURF Features. Jacky Baltes Dept. of Computer Science University of Manitoba WWW:

Lecture 6: Finding Features (part 1/2)

Edge Detection. CS 650: Computer Vision

Interest Operators. All lectures are from posted research papers. Harris Corner Detector: the first and most basic interest operator

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Instance-level l recognition. Cordelia Schmid INRIA

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Keypoint extraction: Corners Harris Corners Pkwy, Charlotte, NC

Image Alignment and Mosaicing

SURVEY OF APPEARANCE-BASED METHODS FOR OBJECT RECOGNITION

Maximally Stable Local Description for Scale Selection

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Instance-level l recognition. Cordelia Schmid & Josef Sivic INRIA

Given a feature in I 1, how to find the best match in I 2?

Feature detection.

Image Processing 1 (IP1) Bildverarbeitung 1

Machine vision. Summary # 4. The mask for Laplacian is given

Optical flow. Subhransu Maji. CMPSCI 670: Computer Vision. October 20, 2016

The state of the art and beyond

Wavelet-based Salient Points with Scale Information for Classification

Face detection and recognition. Detection Recognition Sally

Machine vision, spring 2018 Summary 4

Motion Estimation (I) Ce Liu Microsoft Research New England

Edge Detection PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005

Perception III: Filtering, Edges, and Point-features

Robert Collins CSE598G Mean-Shift Blob Tracking through Scale Space

Lesson 04. KAZE, Non-linear diffusion filtering, ORB, MSER. Ing. Marek Hrúz, Ph.D.

Affine Adaptation of Local Image Features Using the Hessian Matrix

Optical Flow, Motion Segmentation, Feature Tracking

Orientation Map Based Palmprint Recognition

Rotational Invariants for Wide-baseline Stereo

Slide a window along the input arc sequence S. Least-squares estimate. σ 2. σ Estimate 1. Statistically test the difference between θ 1 and θ 2

Motion Estimation (I)

A New Shape Adaptation Scheme to Affine Invariant Detector

Introduction to motion correspondence

Hilbert-Huang Transform-based Local Regions Descriptors

arxiv: v1 [cs.cv] 10 Feb 2016

2D Image Processing Face Detection and Recognition

Visual Object Recognition

Feature extraction: Corners and blobs

Image Alignment and Mosaicing Feature Tracking and the Kalman Filter

Edge Detection. Computer Vision P. Schrater Spring 2003

Feature Extraction and Image Processing

Corner. Corners are the intersections of two edges of sufficiently different orientations.

Object Recognition Using Local Characterisation and Zernike Moments

ECE Digital Image Processing and Introduction to Computer Vision

Templates, Image Pyramids, and Filter Banks

Feature Vector Similarity Based on Local Structure

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Region Moments: Fast invariant descriptors for detecting small image structures

Galilean-diagonalized spatio-temporal interest operators

Is the Scale Invariant Feature Transform (SIFT) really Scale Invariant?

Feature Tracking. 2/27/12 ECEn 631

Image Stitching II. Linda Shapiro CSE 455

Image Stitching II. Linda Shapiro EE/CSE 576

Transcription:

INTEREST POINTS AT DIFFERENT SCALES

Thank you for the slides. They come mostly from the following sources. Dan Huttenlocher Cornell U David Lowe U. of British Columbia Martial Hebert CMU

Intuitively, junctions of contours. Generally more stable features over changes of viewpoint Intuitively, large variations in the neighborhood of the point in all directions 2

The distribution of the x and y derivatives is very different for all three types of patches The distribution of x and y derivatives can be characterized by the shape and size of the principal component ellipse 3

5

Corner detection: the basic idea At a corner, shifting a window in any direction should give a large change in intensity flat region: no change in all directions edge : no change along the edge direction corner : significant change in all directions 6

A simple corner detector Define the sum squared difference (SSD) between an image patch and a patch shifted by offset (x,y): where w(u,v) = or 1 in window, 0 outside Gaussian If s(x,y) is high for shifts in all 8 directions, declare a corner. Problem: not isotropic 7

Harris corner detector derivation Second-order Taylor series approximation: where A is defined in terms of partial derivatives I x = I/ x and I y = I/ y summed over (u,v): For constant t, S(x,y) < t is an ellipse 8

Eigenvector analysis The eigenvectors v 1, v 2 of A give an orthogonal basis for the ellipse I.e. directions of fastest and slowest change for λ 2 > λ 1, v 1 is the direction of fastest change (minor axis of ellipse) and v 2 is the direction of slowest change (major axis) direction of the fastest change v 2 direction of the slowest change (λ 2 ) -1/2 (λ 1 ) -1/2 v 1 9

Classify points based on eigenvalues Classification of image points using eigenvalues of M: λ 2 Edge λ 2 >> λ 1 Corner λ 1 and λ 2 are large, λ 1 ~ λ 2 ; E increases in all directions λ 1 and λ 2 are small; E is almost constant in all directions Flat region Edge λ 1 >> λ 2 λ 1 10

Harris corner detection But square roots are expensive Approximate corner response function that avoids square roots: R ( ) 2 = λ λ k λ + λ 1 2 1 2 with k is set empirically After thresholding, keep only local maxima of R as corners prevents multiple detections of the same corner 13

Harris detector, step-by-step 14

Harris detector, step-by-step Compute corner response R 15

Harris detector, step-by-step Threshold on corner response R 16

Harris detector, step-by-step Take only local maxima of R 17

Harris detector properties Invariant to intensity shift: I = I + b only derivatives are used, not original intensity values Insensitive to intensity scaling: I = a I R R threshold x (image coordinate) x (image coordinate) So Harris is insensitive to affine intensity changes I.e. linear scaling plus a constant offset, I = a I + b 21

Rotation invariance Harris detector properties Ellipse (eigenvectors) rotate but shape (eigenvalues) remain the same Corner response R is invariant to image rotation 20

Rotation Invariance Cordelia Schmid & Roger Mohr (97) Apply Harris corner detector Use rotational invariants at corner points However, not scale invariant. Sensitive to viewpoint and illumination change.

Experimental evaluation C.Schmid et.al. Evaluation of Interest Point Detectors. IJCV 2000 24

Harris detector properties But Harris is not invariant to image scale All points will be classified as edges Corner! 22

Experimental evaluation Quality of Harris detector for different scale changes Repeatability rate: # correspondences # possible correspondences C.Schmid et.al. Evaluation of Interest Point Detectors. IJCV 2000 23

Scale invariant interest point detection Consider regions (e.g. circles) of different sizes around a point Regions of corresponding sizes will look the same in both images 25

Scale invariant detection The problem: how do we choose corresponding circles independently in each image? 26

A solution Design a function which is scale invariant I.e. value is the same for two corresponding regions, even if they are at different scales Example: average intensity is the same for corresponding regions, even of different sizes For a given point in an image, consider the value of f as a function of region size (circle radius) f scale = 1/2 f region size region 27 size

A solution Take a local maximum of this function The region size at which maximum is achieved should be invariant to image scale This scale invariant region size is determined independently in each image f scale = 1/2 f s 1 region size s 2 region 28 size

Choosing a function A good function for scale detection has one sharp peak f f f Good! bad bad region size region size region size A function that responds to image contrast is a good choice e.g. convolve with a kernel like the Laplacian or the Difference of Gaussians 29

ML = µ ( xl, ΣL) MR = µ ( xr, ΣR) x Ax 1 x M 2 x L 1 x M 2 x R whitening 1 2 R 2 ( M x ) = R( M x R 1 L L ) Isotropic neighborhoods related by rotation Localization: Compute location of interest point using local maximum of the Harris measure Iterate Scale: Compute the scale at that point using Laplacian Affine neighborhood: Normalize neighborhood with second moment matrix Yes, it converges 5

Why invariance? Reminder: Finding scale invariant features Laplacian Characteristic scale : - maximum in scale space - scale invariant 4

Affine invariant detection Harris-Affine [Mikolajczyk & Schmid, IJCV04]: Use Harris moment matrix to select dominant directions and anisotropy 23

Affine invariant detection Matching Widely Separated Views Based on Affine Invariant Regions, T. TUYTELAARS and L. VAN GOOL, IJCV 2004 24

Affine invariant detection The regions found may not exactly correspond, so we approximate them with ellipses Geometric Moments: mpq p q = x y f( x, y) dxdy Fact: moments m pq uniquely 2 determine the function f Taking f to be the characteristic function of a region (1 inside, 0 outside), moments of orders up to 2 allow to approximate the region by an ellipse This ellipse will have the same moments of orders up to 2 as the original region 26

Affine invariant detection Algorithm summary (detection of affine invariant region): Start from a local intensity extremum point Go in every direction until the point of extremum of some function f Curve connecting the points is the region boundary Compute geometric moments of orders up to 2 for this region Replace the region with ellipse 27

Scale Invariant Detection: Summary Given: two images of the same scene with a scale difference between them Goal: find the same interest points independently in each image Solution: search for maxima of suitable functions in scale and in space (over the image) Methods: 1. Harris-Laplacian [Mikolajczyk, Schmid]: maximize Laplacian over scale, Harris measure of corner response over the image 2. SIFT [Lowe]: maximize Difference of Gaussians over scale and space 14

Scale-Invariant Local Features Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging parameters SIFT Features

Advantages of invariant local features Locality: features are local, so robust to occlusion and clutter (no prior segmentation) Distinctiveness: individual features can be matched to a large database of objects Quantity: many features can be generated for even small objects Efficiency: close to real-time performance Extensibility: can easily be extended to wide range of differing feature types, with each adding robustness

Resam Bl ple ur Sub tract Build Scale-Space Pyramid All scales must be examined to identify scale-invariant features An efficient function is to compute the Difference of Gaussian (DOG) pyramid (Burt & Adelson, 1983) Resample Blur Subtract

Scale space processed one octave at a time

Sampling frequency for scale More points are found as sampling frequency increases, but accuracy of matching decreases after 3 scales/octave

Resam ple ur Sub tract Key point localization Detect maxima and minima of difference-of-gaussian in scale space Bl

Example of keypoint detection Threshold on value at DOG peak and on ratio of principle curvatures (Harris approach) (a) 233x189 image (b) 832 DOG extrema (c) 729 left after peak value threshold (d) 536 left after testing ratio of principle curvatures

Select canonical orientation Create histogram of local gradient directions computed at selected scale Assign canonical orientation at peak of smoothed histogram Each key specifies stable 2D coordinates (x, y, scale, orientation) 0 2π

SIFT Orientation Assignment Orientation determined by largest peak Assign second orientation if second peak at least.8 height About 15% of keypoints have two orientations Parabolic fit to three values around peak to interpolate orientation 0 2π 17

SIFT Descriptor 8 orientations and 4x4 array computed by histogramming over 16x16 image region 128 dimensional feature vector Illustration shows 2x2 array and 8x8 region 19

SIFT Matching Example Example objects in cluttered environment Rectangle around detected objects based on model images at left 20

Location recognition

Scale Invariant Detectors Experimental evaluation of detectors w.r.t. scale change Repeatability rate: # correspondences # possible correspondences K.Mikolajczyk, C.Schmid. Indexing Based on Scale Invariant Interest Points. ICCV 2001 13

Feature Matching & Geometric Relations Features from training image(s) Features from input image Feature Matching & Geometric Relations Features from training image(s) Features from input image Aspect is different between training and test images invariant features Local feature similarity is not sufficient use global geometric consistency Large number of features define distance in feature space + efficient indexing 3

Lucas-Kanade (1981) is an image alignment algorithm in a optical motion routines. Align a (2D) template T(x) to a (2D) image I(x) where the image have a parameterized warps, e.g., affine with six parameters, or homography with eight. See the reading list: Lucas-Kanade 20 years on: A unifying framework.