Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context

Similar documents
Detectors part II Descriptors

Feature extraction: Corners and blobs

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros

Edges and Scale. Image Features. Detecting edges. Origin of Edges. Solution: smooth first. Effects of noise

CSE 473/573 Computer Vision and Image Processing (CVIP)

Extract useful building blocks: blobs. the same image like for the corners

SIFT keypoint detection. D. Lowe, Distinctive image features from scale-invariant keypoints, IJCV 60 (2), pp , 2004.

CS5670: Computer Vision

Blobs & Scale Invariance

Overview. Harris interest points. Comparing interest points (SSD, ZNCC, SIFT) Scale & affine invariant interest points

Blob Detection CSC 767

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

CS 3710: Visual Recognition Describing Images with Features. Adriana Kovashka Department of Computer Science January 8, 2015

LoG Blob Finding and Scale. Scale Selection. Blobs (and scale selection) Achieving scale covariance. Blob detection in 2D. Blob detection in 2D

Recap: edge detection. Source: D. Lowe, L. Fei-Fei

Advances in Computer Vision. Prof. Bill Freeman. Image and shape descriptors. Readings: Mikolajczyk and Schmid; Belongie et al.

Achieving scale covariance

Feature detectors and descriptors. Fei-Fei Li

Feature detectors and descriptors. Fei-Fei Li

Scale & Affine Invariant Interest Point Detectors

Lecture 8: Interest Point Detection. Saad J Bedros

Image Analysis. Feature extraction: corners and blobs

SURF Features. Jacky Baltes Dept. of Computer Science University of Manitoba WWW:

INTEREST POINTS AT DIFFERENT SCALES

Lecture 6: Finding Features (part 1/2)

Overview. Introduction to local features. Harris interest points + SSD, ZNCC, SIFT. Evaluation and comparison of different detectors

Wavelet-based Salient Points with Scale Information for Classification

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Scale-space image processing

Invariant local features. Invariant Local Features. Classes of transformations. (Good) invariant local features. Case study: panorama stitching

Maximally Stable Local Description for Scale Selection

Lecture 12. Local Feature Detection. Matching with Invariant Features. Why extract features? Why extract features? Why extract features?

Image matching. by Diva Sian. by swashford

Lecture 7: Finding Features (part 2/2)

Vlad Estivill-Castro (2016) Robots for People --- A project for intelligent integrated systems

arxiv: v1 [cs.cv] 10 Feb 2016

Lecture 7: Finding Features (part 2/2)

Lecture 8: Interest Point Detection. Saad J Bedros

Affine Adaptation of Local Image Features Using the Hessian Matrix

The state of the art and beyond

Orientation Map Based Palmprint Recognition

Given a feature in I 1, how to find the best match in I 2?

SIFT: SCALE INVARIANT FEATURE TRANSFORM BY DAVID LOWE

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Edge Detection. CS 650: Computer Vision

Corner detection: the basic idea

Filtering and Edge Detection

Local Features (contd.)

Lecture 6: Edge Detection. CAP 5415: Computer Vision Fall 2008

Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales

Machine vision. Summary # 4. The mask for Laplacian is given

SIFT: Scale Invariant Feature Transform

Lecture 7: Edge Detection

Machine vision, spring 2018 Summary 4

EE 6882 Visual Search Engine

Lesson 04. KAZE, Non-linear diffusion filtering, ORB, MSER. Ing. Marek Hrúz, Ph.D.

Roadmap. Introduction to image analysis (computer vision) Theory of edge detection. Applications

Instance-level l recognition. Cordelia Schmid INRIA

Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces

SURVEY OF APPEARANCE-BASED METHODS FOR OBJECT RECOGNITION

Robert Collins CSE598G Mean-Shift Blob Tracking through Scale Space

KAZE Features. 1 Introduction. Pablo Fernández Alcantarilla 1, Adrien Bartoli 1, and Andrew J. Davison 2

Keypoint extraction: Corners Harris Corners Pkwy, Charlotte, NC

CS4670: Computer Vision Kavita Bala. Lecture 7: Harris Corner Detec=on

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Edge Detection. Introduction to Computer Vision. Useful Mathematics Funcs. The bad news

Instance-level l recognition. Cordelia Schmid & Josef Sivic INRIA

Scale Space Smoothing, Image Feature Extraction and Bessel Filters

Feature extraction: Corners and blobs

Feature Vector Similarity Based on Local Structure

Face detection and recognition. Detection Recognition Sally

Hilbert-Huang Transform-based Local Regions Descriptors

Edge Detection PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005

ECE Digital Image Processing and Introduction to Computer Vision. Outline

Visual Object Recognition

SIFT, GLOH, SURF descriptors. Dipartimento di Sistemi e Informatica

38 1 Vol. 38, No ACTA AUTOMATICA SINICA January, Bag-of-phrases.. Image Representation Using Bag-of-phrases

Instance-level recognition: Local invariant features. Cordelia Schmid INRIA, Grenoble

Scale & Affine Invariant Interest Point Detectors

Global Scene Representations. Tilke Judd

Advanced Features. Advanced Features: Topics. Jana Kosecka. Slides from: S. Thurn, D. Lowe, Forsyth and Ponce. Advanced features and feature matching

Feature detection.

On the Completeness of Coding with Image Features

Templates, Image Pyramids, and Filter Banks

ECE 468: Digital Image Processing. Lecture 8

Large-scale classification of traffic signs under real-world conditions

Region Moments: Fast invariant descriptors for detecting small image structures

Harris Corner Detector

Image Processing 1 (IP1) Bildverarbeitung 1

Edge Detection. Computer Vision P. Schrater Spring 2003

Shape of Gaussians as Feature Descriptors

Learning Discriminative and Transformation Covariant Local Feature Detectors

Fisher Vector image representation

Urban land use information retrieval based on scene classification of Google Street View images

Space-time Zernike Moments and Pyramid Kernel Descriptors for Action Classification

Introduction to Computer Vision

Edge Detection in Computer Vision Systems

Distinguish between different types of scenes. Matching human perception Understanding the environment

Discriminative part-based models. Many slides based on P. Felzenszwalb

Transcription:

Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Silvio Savarese Lecture 10-16-Feb-15

From the 3D to 2D & vice versa P = [x,y,z] p = [x,y] 3D world Image Let s now focus on 2D

How to represent images?

The big picture Feature Detection e.g. DoG Feature Description e.g. SIFT Estimation Matching Indexing Detection

Estimation Courtesy of TKK Automation Technology Laboratory

Estimation H

Estimation

Matching Image 1 Image 2

Object modeling and detection A

Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Silvio Savarese Lecture 10-16-Feb-15

Edge detection

Depth discontinuity Surface orientation discontinuity What causes an edge? Identifies sudden changes in an image Reflectance discontinuity (i.e., change in surface material properties) Illumination discontinuity (e.g., highlights; shadows)

Good detection accuracy: Edge Detection Criteria for optimal edge detection (Canny 86): minimize the probability of false positives (detecting spurious edges caused by noise), false negatives (missing real edges) Good localization: edges must be detected as close as possible to the true edges. Single response constraint: minimize the number of local maxima around the true edge (i.e. detector must return single point for each true edge point)

Examples: Edge Detection True edge Poor robustness to noise Poor localization Too many responses

Designing an edge detector Two ingredients: Use derivatives (in x and y direction) to define a location with high gradient. Need smoothing to reduce noise prior to taking derivative

Designing an edge detector f g f * g d dx ( f g) [Eq. 1] [Eq. 2] = dg dx f = derivative of Gaussian filter Source: S. Seitz

Edge detector in 2D Smoothing I' 1 2π σ 2 e 2σ 2 [Eq. 3] [Eq. 4] = g(x, y) I g(x, y) = x 2 +y 2 Derivative ( g I) = ( g) = S = I ' g x $ & g x I # = % I g " = $! & y # % g y I " g & g # $! & $ g x = g x! = $ $! % g y $ % y!" =! " S x S y # $ = gradient vector [Eq. 5] #! " [Eq. 6]

Canny Edge Detection (Canny 86): See CS131A for details original Canny with Canny with The choice of σ depends on desired behavior large σ detects large scale edges small σ detects fine features

Other edge detectors: - Sobel - Canny-Deriche - Differential

Corner/blob detectors

Corner/blob detectors Repeatability The same feature can be found in several images despite geometric and photometric transformations Saliency Each feature is found at an interesting region of the image Locality A feature occupies a relatively small area of the image;

Repeatability Illumination invariance Scale invariance Pose invariance Rotation Affine

Saliency Locality

Corners detectors

Harris corner detector C.Harris and M.Stephens. "A Combined Corner and Edge Detector. Proceedings of the 4th Alvey Vision Conference: pages 147--151. See CS131A for details

Harris Detector: Basic Idea Explore intensity changes within a window as the window changes location flat region: no change in all directions edge : no change along the edge direction corner : significant change in all directions

Results

Blob detectors

Edge detection f g f * g d dx ( f g) Source: S. Seitz

Edge detection f g f * g [Eq. 7] d dx 2 2 ( f g) [Eq. 8] d dx 2 f 2 g = second derivative of Gaussian filter = Laplacian of the gaussian

Edge detection as zero crossing f Edge d dx 2 2 g Laplacian d dx 2 f 2 [Eq. 8] g Edge = zero crossing of second derivative

Edge detection as zero crossing * = edge edge

From edges to blobs Can we use the laplacian to find a blob (RECT function)? * * * = = = maximum Magnitude of the Laplacian response achieves a maximum at the center of the blob, provided the scale of the Laplacian is matched to the scale of the blob

From edges to blobs Can we use the laplacian to find a blob (RECT function)? * * * = = = What if the blob is slightly thicker or slimmer? maximum

Scale selection Convolve signal with Laplacians at several sizes and looking for the maximum response increasing σ

Scale normalization To keep the energy of the response the same, must multiply Gaussian derivative by σ Laplacian is the second Gaussian derivative, so it must be multiplied by σ 2 Normalized Laplacian g(x) = 1 2π σ e x 2 2σ 2 σ 2 d 2 dx 2 g

Characteristic scale We define the characteristic scale as the scale that produces peak of Laplacian response Original signal Scale-normalized Laplacian response σ = 1 σ = 2 σ = 4 σ = 8 σ = 16 Maximum T. Lindeberg (1998). "Feature detection with automatic scale selection." International Journal of Computer Vision 30 (2): pp 77--116.

Characteristic scale Here is what happens if we don t normalize the Laplacian: Original signal σ = 1 σ = 2 σ = 4 σ = 8 σ = 16 This should give the max response

Blob detection in 2D Laplacian of Gaussian: Circularly symmetric operator for blob detection in 2D Scale-normalized: [Eq. 9] & g g 2 g = σ 2 $ + norm % x 2 y 2 2 2 #! "

Scale selection For a binary circle of radius r, the Laplacian achieves a maximum at σ = r / 2 r Laplacian response image r / 2 scale (σ)

Scale-space blob detector 1. Convolve image with scale-normalized Laplacian at several scales 2. Find maxima of squared Laplacian response in scale-space The maxima indicate that a blob has been detected and what s its intrinsic scale

Scale-space blob detector: Example

Scale-space blob detector: Example

Scale-space blob detector: Example

Difference of Gaussians (DoG) David G. Lowe. "Distinctive image features from scale-invariant keypoints. IJCV 60 (2), 04 Approximating the Laplacian with a difference of Gaussians: ( (,, ) (,, )) 2 L = Gxx x y + Gyy x y σ σ σ (Laplacian) DoG = G(x, y,2σ ) G(x, y,σ ) Difference of gaussian with scales 2 σ and σ [Eq. 10] [Eq. 11] In general: DoG = G(x, y,kσ ) G(x, y,σ ) (k 1)σ 2 L [Eq. 12]

Affine invariant detectors K. Mikolajczyk and C. Schmid, Scale and Affine invariant interest point detectors, IJCV 60(1):63-86, 2004. Similarly to characteristic scale, we can define the characteristic shape of a blob

Detector Illumination Rotation Scale View point Lowe 99 (DoG) Properties of detectors Yes Yes Yes No Scale-normalized: 2 2 2 2& g g = $ normg σ + 2 2 % x y #! "

Detector Illumination Rotation Scale View point Lowe 99 (DoG) Properties of detectors Yes Yes Yes No Harris corner Yes Yes No No Mikolajczyk & Schmid 01, 02 Yes Yes Yes Yes Tuytelaars, 00 Yes Yes No (Yes 04 ) Yes Kadir & Brady, 01 Yes Yes Yes no Matas, 02 Yes Yes Yes no

Lecture 10 Detectors and descriptors Properties of detectors Edge detectors Harris DoG Properties of descriptors SIFT HOG Shape context Silvio Savarese Lecture 10-16-Feb-15

The big picture Feature Detection e.g. DoG Feature Description e.g. SIFT Estimation Matching Indexing Detection

Invariant w.r.t: Properties Depending on the application a descriptor must incorporate information that is: Illumination Pose Scale Intraclass variability A a Highly distinctive (allows a single feature to find its correct match with good probability in a large database of features)

The simplest descriptor M N 1 x NM vector of pixel intensities w= [ ]

Normalized vector of intensities M 1 x NM vector of pixel intensities w= [ ] w n = (w (w w) w) Makes the descriptor invariant with respect to affine transformation of the illumination condition [Eq. 13] N

Illumination normalization Affine intensity change: w w + b a w + b [Eq. 14] w n = (w (w w) w) w Make each patch zero mean: remove b Make unit variance: remove a Index of w

Why can t we just use this? Sensitive to small variation of: location Pose Scale intra-class variability Poorly distinctive

Sensitive to pose variations Normalized Correlation: NCC w n w! n = (w (w w)(w! w)(w! w!) w!) u

Properties of descriptors Descriptor Illumination Pose Intra-class variab. PATCH Good Poor Poor

Bank of filters * = image filter bank filter responses More robust but still quite sensitive to pose variations descriptor http://people.csail.mit.edu/billf/papers/steerpaper91freemanadelson.pdf A. Oliva and A. Torralba. Modeling the shape of the scene: a holistic representation of the spatial envelope. IJCV, 2001.

Properties of descriptors Descriptor Illumination Pose Intra-class variab. PATCH Good Poor Poor FILTERS Good Medium Medium

SIFT descriptor David G. Lowe. "Distinctive image features from scale-invariant keypoints. IJCV 60 (2), 04 Alternative representation for image regions Location and characteristic scale s given by DoG detector s Image window

SIFT descriptor Alternative representation for image regions Location and characteristic scale s given by DoG detector s Compute gradient at each pixel N x N spatial bins Compute an histogram h i of M orientations for each bin i θ 1 θ 2 θ M-1 θ M

SIFT descriptor Alternative representation for image regions Location and characteristic scale s given by DoG detector s Compute gradient at each pixel N x N spatial bins Compute an histogram h i of M orientations for each bin i Concatenate h i for i=1 to N 2 to form a 1xMN 2 vector H

SIFT descriptor Alternative representation for image regions Location and characteristic scale s given by DoG detector s Compute gradient at each pixel N x N spatial bins Compute an histogram h i of M orientations for each bin i Concatenate h i for i=1 to N 2 to form a 1xMN 2 vector H Normalize to unit norm Gaussian center-weighting Typically M = 8; N= 4 H = 1 x 128 descriptor

SIFT descriptor Robust w.r.t. small variation in: Illumination (thanks to gradient & normalization) Pose (small affine variation thanks to orientation histogram ) Scale (scale is fixed by DOG) Intra-class variability (small variations thanks to histograms)

Rotational invariance Find dominant orientation by building a orientation histogram Rotate all orientations by the dominant orientation 0 2 π This makes the SIFT descriptor rotational invariant

Properties of descriptors Descriptor Illumination Pose Intra-class variab. PATCH Good Poor Poor FILTERS Good Medium Medium SIFT Good Good Medium

HoG = Histogram of Oriented Gradients Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05 Like SIFT, but Sampled on a dense, regular grid around the object Gradients are contrast normalized in overlapping blocks

Shape context descriptor Belongie et al. 2002 Histogram (occurrences within each bin) A 3 1 0 0 1 2 3 4 5 // 10 11 12 13 14. Bin # 13 th

Shape context descriptor Courtesy of S. Belongie and J. Malik descriptor 1 descriptor 2 descriptor 3

Other detectors/descriptors HOG: Histogram of oriented gradients Dalal & Triggs, 2005 SURF: Speeded Up Robust Features Herbert Bay, Andreas Ess, Tinne Tuytelaars, Luc Van Gool, "SURF: Speeded Up Robust Features", Computer Vision and Image Understanding (CVIU), Vol. 110, No. 3, pp. 346- - 359, 2008 FAST (corner detector) Rosten. Machine Learning for High- speed Corner Detection, 2006. ORB: an efficient alternative to SIFT or SURF Ethan Rublee, Vincent Rabaud, Kurt Konolige, Gary R. Bradski: ORB: An efficient alternative to SIFT or SURF. ICCV 2011 Fast Retina Key- point (FREAK) A. Alahi, R. Ortiz, and P. Vandergheynst. FREAK: Fast Retina Keypoint. In IEEE Conference on Computer Vision and Pattern Recognition, 2012. CVPR 2012 Open Source Award Winner.

Next lecture: Image Classification by Deep Networks by Ronan Collobert (Facebook)