which the theoretical results were obtained. Image Signals The geometry of visual scenes under perspective projection generally yields complex image s

Similar documents
ing a minimal set of layers, each describing a coherent motion from the scene, which yield a complete description of motion when superposed. Jepson an

Elec4621 Advanced Digital Signal Processing Chapter 11: Time-Frequency Analysis

Hypergeometric Filters For Optical Flow and Ane Matching. Carnegie Mellon University. Pittsburgh, PA matching problems.

CS-9645 Introduction to Computer Vision Techniques Winter 2018

(Refer Slide Time: 01:30)

measurements. These computations are straightforward and local in space-time, yielding ecient implementations on both serial and parallel machines. Th

Lie Groups for 2D and 3D Transformations

ROYAL INSTITUTE OF TECHNOLOGY KUNGL TEKNISKA HÖGSKOLAN. Department of Signals, Sensors & Systems

Ensemble averaged dynamic modeling. By D. Carati 1,A.Wray 2 AND W. Cabot 3

Performance of Optical Flow Techniques. Abstract. While dierent optical ow techniques continue to appear, there has been a

Two motion perception mechanisms revealed through distancedriven reversal of apparent motion

CHAPTER 4 FOURIER SERIES S A B A R I N A I S M A I L

Sequence Quantization y. C. B. Atkins, T. J. Flohr, D. P. Hilgenberg, C. A. Bouman, and J. P. Allebach

Model of human visual-motion sensing

A Novel Approach to the 2D Analytic Signal? Thomas Bulow and Gerald Sommer. Christian{Albrechts{Universitat zu Kiel

Functional form of motion priors in human motion perception

How big is a Gabor patch, and why should we care?

Absolutely indecomposable symmetric matrices

DESIGN OF MULTI-DIMENSIONAL DERIVATIVE FILTERS. Eero P. Simoncelli

Domain and range of exponential and logarithmic function *

Eero P. Simoncelli. Vision and Modeling Group Technical Report #209. January 1993 y. MIT Media Laboratory, E Ames Street

Surround effects on the shape of the temporal contrast-sensitivity function

Representation and Learning of. Klas Nordberg Gosta Granlund Hans Knutsson

SYSTEM RECONSTRUCTION FROM SELECTED HOS REGIONS. Haralambos Pozidis and Athina P. Petropulu. Drexel University, Philadelphia, PA 19104

Review of Linear Time-Invariant Network Analysis

Sum of Coherent Systems Decomposition by SVD. University of California at Berkeley. Berkeley, CA September 21, 1995.

W. Kozek and H.G. Feichtinger. NUHAG, Dept. of Mathematics, University of Vienna. J. Scharinger. Altenberger Str. 69, A-4040 Linz, Austria ABSTRACT

Towards a Mathematical Theory of Super-resolution

ADDENDUM B: CONSTRUCTION OF R AND THE COMPLETION OF A METRIC SPACE

Multidimensional digital signal processing

Boxlets: a Fast Convolution Algorithm for. Signal Processing and Neural Networks. Patrice Y. Simard, Leon Bottou, Patrick Haner and Yann LeCun

A general theory of discrete ltering. for LES in complex geometry. By Oleg V. Vasilyev AND Thomas S. Lund

Novel determination of dierential-equation solutions: universal approximation method

Examples. 2-input, 1-output discrete-time systems: 1-input, 1-output discrete-time systems:

An Attempt To Understand Tilling Approach In proving The Littlewood Conjecture

Discrete Wavelet Analysis for Fast Optic Flow Computation

Chapter 2 Fourier Series Phase Object Spectra

2800 Lyngby, Denmark, Abstract

arxiv: v3 [q-bio.nc] 1 Sep 2016

Jean Morlet and the Continuous Wavelet Transform

Using Skew Gabor Filter in Source Signal Separation and Local Spectral Multi-Orientation Analysis

The Discrete Fourier transform

f(x) cos dx L L f(x) sin L + b n sin a n cos

Homework 1 Solutions ECEn 670, Fall 2013

Networks and Systems Prof V.G K. Murti Department of Electrical Engineering Indian Institute of Technology, Madras Lecture - 10 Fourier Series (10)

Gabor wavelet analysis and the fractional Hilbert transform

October 7, :8 WSPC/WS-IJWMIP paper. Polynomial functions are renable

Relations and Functions

Observed Brain Dynamics

MATTHIAS P ATZOLD, MEMBER, IEEE, ULRICH KILLAT, MEMBER, IEEE, FRANK LAUE, YINGCHUN LI. Technical University of Hamburg-Harburg.

Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space

Anisotropic grid-based formulas. for subgrid-scale models. By G.-H. Cottet 1 AND A. A. Wray

Vectors [and more on masks] Vector space theory applies directly to several image processing/ representation problems

THE LARGEST INTERSECTION LATTICE OF A CHRISTOS A. ATHANASIADIS. Abstract. We prove a conjecture of Bayer and Brandt [J. Alg. Combin.

Nonlinearity optimization in nonlinear joint transform correlators

Chirp Transform for FFT

Therefore the new Fourier coefficients are. Module 2 : Signals in Frequency Domain Problem Set 2. Problem 1

CONCEPTUAL DEVELOPMENT OF AN ASSISTANT FOR CHANGE DETECTION AND ANALYSIS BASED ON REMOTELY SENSED SCENES

!) + log(t) # n i. The last two terms on the right hand side (RHS) are clearly independent of θ and can be

Probabilistic Models of the Brain: Perception and Neural Function

arxiv: v1 [cs.cv] 10 Feb 2016

Numerical Schemes from the Perspective of Consensus

First Eigenvector Second Eigenvector Third Eigenvector

Vision Modules and Cue Combination

FOURIER SERIES. Chapter Introduction

Speed encoding in correlation motion detectors as a consequence of spatial structure

A Biologically-Inspired Model for Recognition of Overlapped Patterns

Algebra 2 and Mathematics 3 Critical Areas of Focus

k-degenerate Graphs Allan Bickle Date Western Michigan University

The recovery of motion information from visual input is an important task for both natural

First Order Linear Ordinary Differential Equations

Homework 7 Solutions to Selected Problems

Lecture Notes 5: Multiresolution Analysis

Instituto de Sistemas e Robótica. Pólo de Lisboa

When does a planar bipartite framework admit a continuous deformation?

Motion Estimation (I) Ce Liu Microsoft Research New England

Optical Systems Program of Studies Version 1.0 April 2012

Adaptation to a 'spatial-frequency doubled' stimulus

Optical Flow, Motion Segmentation, Feature Tracking

1 Introduction Spectral analysis of (wide-sense) stationary random processes by means of the power spectral density (PSD) is a useful concept. owever,

Filtering in the Frequency Domain

Introduction Wavelet shrinage methods have been very successful in nonparametric regression. But so far most of the wavelet regression methods have be

An intuitive introduction to the concept of spatial coherence

Tyler Anderson 1, Murat Tanyel 2

A Multivariate Time-Frequency Based Phase Synchrony Measure for Quantifying Functional Connectivity in the Brain

Review of Linear System Theory

2.1 Basic Concepts Basic operations on signals Classication of signals

The Uniformity Principle: A New Tool for. Probabilistic Robustness Analysis. B. R. Barmish and C. M. Lagoa. further discussion.

Minimum and maximum values *

Physical Optics 2018 Dr. Muwafaq Fadhil Al-Mishlab Third lecture [ Huygens Principle, Interference of light]

Introduction to Signal Processing

Chapter One Hilbert s 7th Problem: It s statement and origins

Ole Christensen 3. October 20, Abstract. We point out some connections between the existing theories for

A NOTE ON MATRIX REFINEMENT EQUATIONS. Abstract. Renement equations involving matrix masks are receiving a lot of attention these days.

QM1 - Tutorial 1 The Bohr Atom and Mathematical Introduction

Absolute Equilibrium Entropy

Atomic emission spectra experiment

Jean Morlet and the Continuous Wavelet Transform (CWT)

The Wave Nature of Matter Causes Quantization *

Algorithmic Lie Symmetry Analysis and Group Classication for Ordinary Dierential Equations

Transcription:

NonFourier Motion in The Fourier Spectrum Steven S. Beauchemin GRASP Laboratory Department of Computer and Information Science University of Pennsylvania Philadelphia PA 191048 Abstract Typically, image motion analysis in the frequency domain is performed according to the Motion From Fourier Coecients, or MFFC principle. However, this principle excludes a class of motion that does not generate power spectra containing the origin of the frequency domain, which is usually referred to as Non Fourier motion. This type of motion includes phenomena relevant to motion analysis such as translucency, sinusoidal beats, occlusion, Theta motion, and lacks a denitive theoretical framework. We address this problem by deriving exact mathematical expressions for NonFourier motion, thus providing a basis for computational models for velocity extraction in the frequency domain. Numerical experiments demonstrating the validity of the approach are also presented. Keywords: Image motion, Optical ow, Fourier transform, NonFourier motion 1 Introduction Traditionally, motion perception has been equated with orientation of power in the frequency domain. The many optical ow methods use what Chubb and Sperling term the MotionFromFourierComponents (MFFC) principle [1] in which the orientation of the plane or line through the origin of the frequency space that contains most of the spectral power gives the rate of image translation. The MFFC principle states that for a moving stimulus, its Fourier transform has substantial power over some regions of the frequency domain whose points spatiotemporally correspond to sinusoidal gratings with drift direction consonant with the perceived motion [1]. In addition, current models of human perception involve some frequency analysis of the imagery, such as bandpass ltering and similar processes. However, some classes of moving stimuli which elicit a strong percept in subjects fail to show a coherent spatiotemporal frequency distribution of their power and cannot be understood in terms of the MFFC principle. Examples include driftbalanced visual stimuli [1], Fourier and NonFourier plaid superpositions [4], amplitude envelopes, sinusoidal beats and various multiplicative phenomena []. By driftbalanced it is meant that a visual stimulus with two (leftward and rightward, for example) or more dierent motions shows identical contents of Fourier power for each motion and therefore, according to the MFFC principle, should not elicit a coherent motion percept. However, some classes of driftbalanced stimuli dened by Chubb and Sperling do elicit strong coherent motion percepts, contrary to the predictions of the usual MFFC model. Sources of NonFourier motion also include the motion of texture boundaries and the motion of motion boundaries. For instance, transparency as considered by Fleet and Langley [] is an example of NonFourier motion, as transparency causes the relative scattering of Fourier components away from the spectrum of the moving stimuli. In addition, occlusion is another example of NonFourier motion which is closely related to the Theta motion stimuli of Zanker [5], where the occlusion window moves independently from both the foreground and the background, thus involving three independent velocities. It has been observed by Fleet and Langley that many NonFourier motion stimuli have simple characterizations in the frequency domain, namely power distributions located along lines or planes which do not contain the origin of the frequency space, as required by the MFFC idealization []. We develop exact frequency representations for several nonfourier motions and state their properties with respect to image motion. 1.1 Methodology To analyze the frequency structure of image signals while preserving representations that are as general as possible, an eort is made to only pose those hypotheses that preserve the generality of the analysis to follow. We describe the assumptions and the techniques with

which the theoretical results were obtained. Image Signals The geometry of visual scenes under perspective projection generally yields complex image signals. Conceptually, assumptions concerning scene structure should not be made, as they constrain the geometry of observable scenes. In addition, any measured physical signal, such as image intensities, satises Dirichlet conditions. Such signals admit a nite number of nite discontinuities, are absolutely integrable and may be expanded into complex exponential series. Dirichlet conditions constitute the sum of assumptions made on image signals. Mathematical Technique The results established in this analysis emanate from a general approach to modeling visual scenes exhibiting nonfourier motion. In the case of occlusion, an equation which describes the spatiotemporal pattern of the superposition of a background and an occluding signal is established, in which a characteristic function describing the position of an occluding signal within the imaging space of the visual sensor is dened: (x) = 1 if x within occluding signal 0 otherwise. (1) Two image signals I 1 (x) and I (x), corresponding to the occluding and occluded signals respectively, are dened to form the complete signal pattern: I(x; t) = (v1(x; t))i1(v1(x; t)) + [1? (v1(x; t))] I(v(x; t)); () where v i (x; t) is velocity. Alternatively, scenes of translucency are modeled as I(x; t) = f(1)(v1(x; t))i(v(x; t)); (3) where f( 1 ) is a function of the density of the translucent material. Hypotheses made on the components of () and (3) are inserted and the structure of the corresponding nonfourier motions in frequency space are developed. That is to say, signal structures are expanded into complex exponential series, such as: Ii(x) = c ine ixt N k i ; (4) where I i (x) is the i th intensity pattern, c in are complex coecients, k i are fundamental frequencies, n T = (n 1 ; n ; : : : n n ) are integers and N = n T I. Relevance of Fourier Analysis It has been conjectured that NonFourier spectra have mathematically simple characterizations in Fourier space []. Consequently, the use of Fourier analysis as a local tool is justied as long as one realizes that it constitutes a global idealization of local phenomena. In that sense, Fourier analysis is used as a local tool whenever Gabor lters, wavelets or local Discrete Fourier Transforms are employed for signal analysis. Experimental Technique Given the theoretical nature of this research, the purpose of the numerical experiments is to verify the validity of the theoretical results. In order to accomplish this, the frequency content of the image signals used in the experiments must be entirely known to the experimenter, thus forbidding the use of natural image sequences. In addition, image signals with single frequency components are used in order to facilitate the interpretation of experiments involving 3D Fast Fourier transforms. The use of more complex signals impedes a careful examination of the numerical results and do not extend the understanding of the phenomena under study in any particular way. Sinusoidal Beats and Translucency Sinusoidal beats are sums of sinusoidal patterns, each moving with a possibly dierent velocity, and constitute one of the simplest forms of NonFourier motion. A simple beat involving two 1D sinusoids can be expressed as I(x; t) = cos(k1x +!1t) + cos(kx +!t); (5) which is recognized as a case of additive translucency involving two patterns. The Fourier transform of sinusoidal beats is straightforward yet does not comply with the MFFC principle as the frequencies of each pattern do not align to represent an unambiguous motion pattern. These types of stimuli have been used to study velocity detection thresholds and lead to the hypothesis that the human visual system may use a nonfourier channel of motion detection [3]. Figure 1 shows a superposition of two 1D sinusoidals and the corresponding Fourier transform. In the context of sinusoidal beats, k and! are beat frequencies and dene group velocity as?!. Average frequencies k and! are carrier frequencies and dene carrier k velocity as?!. k In the case of additive translucency, we express the translucent event as (3) and assume a spatially constant

t Figure 1: The composition of an additive transparency scene. a): First sinusoidal signal with frequency k 1 = 8 and velocity a 1 = 1:0. b): Second sinusoidal signal with frequency k = and velocity 1 a =?1:0. c): Transparency created with the superposition of rst and second sinusoidal signal. d): Frequency spectrum of transparency. f( 1 ) with translucency factor ', leading to a weighted superposition of intensity patterns, written as I(x; t) = 'I1(v1(x; t)) + (1? ')I(v(x; t)); () where I 1 (v 1 (x; t)) is the intensity prole of the translucent material and I (v (x; t)) is the intensity prole of the background. With I 1 (v 1 (x; t)) and I (v (x; t)) satisfying Dirichlet conditions, the frequency spectrum of () is written as ^I(k;!) = ' (1? ') c1n(k? N k1;! + a T 1 N k1) + cn(k? N k;! + a T N k): In this case, the velocities of interest are those of the individual signals. Each of them is predicted by the MFFC principle but the sum of their frequencies does not allow a direct measurement. 3 Occlusion At an occlusion, the occluding and occluded velocities can always be identied as such and, a degenerate occluding signal exhibiting a linear spectrum is supplemented by the linear orientation of its occluding boundary, allowing to determine the full velocity of the occluding signal. Let I 1 (x) and I (x) be D functions satisfying Dirichlet conditions such that they may be expressed as complex exponential series expansions (3). Also let I 1 (x; t) = I 1 (v 1 (x; t)), I (x; t) = I (v (x; t)) and the (7) The fre occluding boundary be represented by (4). quency spectrum of the occlusion is ^I(k;!) =?i c1n(k? N k1;! + a T 1 N k1) +(1? ) ~1; X c1n((k? N k1) T n? 1 ; k T a1 +!) cn(k? N k;! + a T N k) (k? N k1) T n1 + cn((k? N k)t n? 1 ; k T a1 +!? a T N k) (k? N k) T n1 where a = a 1?a. Figure shows the composition of a simple occlusion scene with D sinusoidal signals. The Fourier transform of its various components show that the linear spectrum of the occlusion boudary is oriented in a manner that is consonant with the velocity of the occluding signal. Detecting this orientation amounts to identifying the occluding velocity. In addition, since both the occluding signal and its boundary translate with identical full velocity, one can disambiguate the normal velocity of the occluding signal. 4 Generalized Boundaries Typically, occlusion boundaries are unconstrained in shape, yielding a variety of occluding situations. Under the hypothesis that the motion of the occluding boundary is rigid on the image plane, we can derive the frequency structure of such events. For instance, consider a generalized occlusion boundary represented by the characteristic function (x) in the coordinates (8)

y! ) P Pq k x k y e) f) g) h) Figure : (top): The composition of a simple D occlusion scene. a) The occluding sinusoidal signal with frequency ( 1 ; ) and velocity 1 (?1:0;?1:0). b) The occluded sinusoidal signal with frequency ( 8 ; ) and velocity 8 (1:0; 1:0). c) p p The step function used to create the occlusion scene with normal vector ( ; ). d) The occlusion as a combination of a), b) and c). (bottom) e) through h): Image plots of corresponding amplitude spectra. of the image plane and the Fourier transforms of the complex exponential series expansions of both the occluding and occluded signals I 1 and I. Substituting these terms into the Fourier transform of () yields the following Fourier spectrum ^I(k;!) = c1n ^(k? N k1)(a T 1 k +!)? cn ^(k? N k)(a T k +!? at N k) + cn(k? N k;! + a T N k); from which it is observed that the spectrum of the occluding boundary is repeated at every nonzero frequency of both signals. The spectrum occupies planes descriptive of full velocity which can be used to perform such measurements. 5 Theta Motion Sources of NonFourier motion include such phenomena as translucency, occlusion and, in particular, Zanker's (9) Theta motion stimuli which are examples of Non Fourier motion involving occlusion with various forms of occlusion windows[5]. This category of motion is described by a window that translates with a velocity uncorrelated with those of the occluding and occluded signals. For 1D image signals, such an occlusion scene can be expressed as I(x; t) = (x? v3t)i1(x? v1t)? (x? v3t)i(x? vt) + I(x? vt): (10) As Zanker and Fleet [5, ], we model the occlusion window with a rectangle function in the spatial coordinate as 0 if x?x0 b > 1 x? 8>< x0 = 1 b if x?x0 b = 1 (11) >: < 1 : 1 if x?x0 b Such a function is nonzero in the interval ]x 0? b ; x 0 + b [ and zero otherwise. We can then write the Fourier transform of the occlusion scene (10) as I(k;!) = K 1X n=?1 sinc(k? nk1)c1n(kv3? nk1v3)?

y Figure 3: Generalized occluding boundaries. a), b) and c): Images from a sequence in which the occluding pattern moves with velocity a T 1 = (?1:0;?1:0). Spatial frequency of the sinusoidal texture within the circular boundary is k T 1 = ( 1 ; ). d): The frequency spectrum of the sequence, where the plane contains the spectrum of the 1 boundary convolved with the frequency of the texture. t Figure 4: Examples of Theta motion. a): Velocities of occlusion window, occluding and occluded signals are v 3 = 0:5, v 1 = 1:0 and v =?:0 respectively. b): Frequency spectrum of a). c): Velocities of occlusion window, occluding and occluded signals are v 3 =?0:5, v 1 =?:0 and v = 1:0 respectively. d): Frequency spectrum of c). K 1X 1X n=?1 n=?1 sinc(k? nk)cn(kv3? nkv) + cn(k? nk;! + nkv); (1) where sinc(k) = sin k k, v 3 = v 3? v1, v = v? v 1, and K = b?1 e?ikx0b?1. The spectra (kv 3 +!?nk 1 v 3 ) and (kv 3 +!?nk v ) are consonant with the motion of the occluding window and represent a case of Non Fourier motion, as they do not contain the origin. We performed two experiments with Theta motions as pictured in Figure 4. It is observed that the spectrum of the sinc function is convolved with each frequency of both signals and that its orientation is descriptive of the velocity of the window. As expected, the visible peaks represent the motions of both signals in the MFFC sense. Conclusion We presented a framework for the Fourier analysis of nonfourier motions which yields exact mathematical expressions and forms a basis for computational models for velocity extraction in the frequency domain beyond the MFFC principle. References [1] C. Chubb and G. Sperling. Driftbalanced random stimuli: A general basis for studying nonfourier motion perception. J. Opt. Soc. Am. A, 5(11):198{ 007, 1988. [] D. J. Fleet and K. Langley. Computational analysis of nonfourier motion. Vision Research, 34():3057{3079, 1995.

[3] K. Turano and A. Pantle. On the mechanism that encodes the movement of contrast variations. Vision Research, 9:07{1, 1989. [4] J. D. Victor and M. M. Conte. Coherence and transparency of moving plaids composed of fourier and nonfourier gratings. Perception & Psychophysics, 5(4):403{414, 1988. [5] M. J. Zanker. Theta motion: A paradoxical stimulus to explore higherorder motion extraction. Vision Research, 33:553{59, 1993.