Vectors [and more on masks] Vector space theory applies directly to several image processing/ representation problems 1
Image as a sum of basic images What if every person s portrait photo could be expressed as a sum of 20 special images? è We would only need 20 numbers to model any photo è sparse rep on our Smart card. 2
Efaces 100 x 100 images of faces are approximated by a subspace of only 4 100 x 100 images, the mean image plus a linear combination of the 3 most important eigenimages 3
The image as an expansion 4
Different bases, different properties revealed 5
Fundamental expansion 6
Basis gives structural parts 7
Vector space review, part 1 8
Vector space review, Part 2 2 9
A space of images in a vector space n M x N image of real intensity values has dimension D = M x N n Can concatenate all M rows to interpret an image as a D dimensional 1D vector n The vector space properties apply n The 2D structure of the image is NOT lost 10
Orthonormal basis vectors help 11
Represent S = [10, 15, 20] 12
Projection of vector U onto V 13
Normalized dot product Can now think about the angle between two signals, two faces, two text documents, 14
Every 2x2 neighborhood has some constant, some edge, and some line component Confirm that basis vectors are orthonormal 15
Roberts basis cont. If a neighborhood N has large dot product with a basis vector (image), then N is similar to that basis image. 16
Standard 3x3 image basis Structureless and relatively useless! 17
Frie-Chen basis Confirm that bases vectors are orthonormal 18
Structure from Frie-Chen expansion Expand N using Frie- Chen basis 19
Sinusoids provide a good basis 20
Sinusoids also model well in images 21
Operations using the Fourier basis 22
A few properties of 1D sinusoids They are orthogonal Are they orthonormal? 23
F(x,y) as a sum of sinusoids 24
Continuous 2D Fourier Transform To compute F(u,v) we do a dot product of our image f(x,y) with a specific sinusoid with frequencies u and v 26
Power spectrum from FT 27
Examples from images Done with HIPS in 1997 28
Descriptions of former spectra 29
Discrete Fourier Transform Do N x N dot products and determine where the energy is. High energy in parameters u and v means original image has similarity to those sinusoids. 30
Bandpass filtering 31
Convolution of two functions in the spatial domain is equivalent to pointwise multiplication in the frequency domain 32
LOG or DOG filter Laplacian of Gaussian Approx Difference of Gaussians 33
LOG filter properties 34
Mathematical model 35
1D model; rotate to create 2D model 36
1D Gaussian and 1 st derivative 37
2 nd derivative; then all 3 curves 38
Laplacian of Gaussian as 3x3 39
G(x,y): Mexican hat filter 40
Convolving LOG with region boundary creates a zero-crossing Mask h(x,y) Input f(x,y) Output f(x,y) * h(x,y) 41
42
LOG relates to animal vision 43
1D EX. Artificial Neural Network (ANN) for computing g(x) = f(x) * h(x) level 1 cells feed 3 level 2 cells level 2 cells integrate 3 level 1 input cells using weights [-1,2,-1] 44
Experience the Mach band effect 45
Simple model of a neuron 46
Output conditioning: threshold versus smoother output signal 47
Canny edge detector 51
Summary of LOG filter n Convenient filter shape n Boundaries detected as 0-crossings n Psychophysical evidence that animal visual systems might work this way (your testimony) n Physiological evidence that real NNs work as the ANNs 53