An Introduction to Wavelets with Applications in Environmental Science

Similar documents
Wavelet Methods for Time Series Analysis. Part IV: Wavelet-Based Decorrelation of Time Series

Wavelet Methods for Time Series Analysis. Motivating Question

Wavelet Methods for Time Series Analysis. Part IX: Wavelet-Based Bootstrapping

Testing for Homogeneity of Variance in Time Series: Long Memory, Wavelets and the Nile River

Testing for homogeneity of variance in time series: Long memory, wavelets, and the Nile River

A Tutorial on Stochastic Models and Statistical Analysis for Frequency Stability Measurements

Wavestrapping Time Series: Adaptive Wavelet-Based Bootstrapping

Wavelet-Based Bootstrapping for Non-Gaussian Time Series. Don Percival

Wavelet Methods for Time Series Analysis

Using wavelet tools to estimate and assess trends in atmospheric data

Wavelet Analysis of Discrete Time Series

A Tutorial on Stochastic Models and Statistical Analysis for Frequency Stability Measurements

Testing for Homogeneity of Variance in Time Series: Long Memory, Wavelets and the Nile River NRCSE. T e c h n i c a l R e p o r t S e r i e s

Wavelet Methods for Time Series Analysis. Part II: Wavelet-Based Statistical Analysis of Time Series

Defining the Discrete Wavelet Transform (DWT)

Wavelet-based estimation for Gaussian time series and spatio-temporal processes

Wavelet Methods for Time Series Analysis. Quick Comparison of the MODWT to the DWT

Wavelet Methods for Time Series Analysis. Quick Comparison of the MODWT to the DWT

Asymptotic Decorrelation of Between-Scale Wavelet Coefficients

Nontechnical introduction to wavelets Continuous wavelet transforms Fourier versus wavelets - examples

Examples of DWT & MODWT Analysis: Overview

Wavelet domain test for long range dependence in the presence of a trend

Multiscale Detection and Location of Multiple Variance Changes in the Presence of Long Memory Brandon Whitcher EURANDOM Peter

Multiresolution Models of Time Series

Differencing Revisited: I ARIMA(p,d,q) processes predicated on notion of dth order differencing of a time series {X t }: for d = 1 and 2, have X t

Serial Correlation Tests with Wavelets

Inference. Undercoverage of Wavelet-Based Resampling Confidence Intervals

FGN 1.0 PPL 1.0 FD

Decline of Arctic Sea-Ice Thickness as Evidenced by Submarine Measurements. Don Percival

Classic Time Series Analysis

Quantitative Finance II Lecture 10

If we want to analyze experimental or simulated data we might encounter the following tasks:

Wavelet-Generalized Least Squares: A New BLU Estimator of Linear Regression Models with 1/f Errors

[y i α βx i ] 2 (2) Q = i=1

Detection of Anomalous Observations Using Wavelet Analysis in Frequency Domain

Econ 424 Time Series Concepts

EXAMINATION OF RELATIONSHIPS BETWEEN TIME SERIES BY WAVELETS: ILLUSTRATION WITH CZECH STOCK AND INDUSTRIAL PRODUCTION DATA

Statistical signal processing

Cointegrated VARIMA models: specification and. simulation

Calculating wavelet variance associated with discrete wavelet transform (DWT) 1

Trend Assessment of Correlated Data

Simple and Multiple Linear Regression

Econometrics 2, Class 1

Wavelet Analysis of Variance for Time Series with Missing Values. Technical Report no. 535 Department of Statistics, University of Washington

Wavelet Method for Locally Stationary Seasonal Long Memory Processes

Wavelet methods and null models for spatial pattern analysis

STAT 520: Forecasting and Time Series. David B. Hitchcock University of South Carolina Department of Statistics

Wavelet-Based Test for Time Series Non-Stationarity 1

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

Wavelet methods for the detection of anomalies and their application to network traffic analysis

FE570 Financial Markets and Trading. Stevens Institute of Technology

Nonstationary Time Series:

STAT 536: Genetic Statistics

Week 5 Quantitative Analysis of Financial Markets Characterizing Cycles

Non-Stationary Time Series and Unit Root Testing

AN INTRODUCTION TO SPECTRAL ANALYSIS AND WAVELETS

Computationally efficient banding of large covariance matrices for ordered data and connections to banding the inverse Cholesky factor

Review of Concepts from Fourier & Filtering Theory. Fourier theory for finite sequences. convolution/filtering of infinite sequences filter cascades

TMA4285 December 2015 Time series models, solution.

MA Advanced Econometrics: Applying Least Squares to Time Series

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

Robust Backtesting Tests for Value-at-Risk Models

MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2

ECON/FIN 250: Forecasting in Finance and Economics: Section 7: Unit Roots & Dickey-Fuller Tests

Issues in estimating past climate change at local scale.

Econometrics of Panel Data

CHANGE DETECTION IN TIME SERIES

Wavelet-based confidence intervals for the self-similarity parameter

Linear models and their mathematical foundations: Simple linear regression

MULTI-SCALE TESTS FOR SERIAL CORRELATION. October 2013, First Version: April 2010

Booth School of Business, University of Chicago Business 41914, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Midterm

Practical Statistics

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem

MEI Exam Review. June 7, 2002

7. MULTIVARATE STATIONARY PROCESSES

Nonstationary time series models

Non-Stationary Time Series and Unit Root Testing

It is easily seen that in general a linear combination of y t and x t is I(1). However, in particular cases, it can be I(0), i.e. stationary.

Time Series 4. Robert Almgren. Oct. 5, 2009

Single Equation Linear GMM with Serially Correlated Moment Conditions

Final Exam November 24, Problem-1: Consider random walk with drift plus a linear time trend: ( t

Utility of Correlation Functions

Asymptotically sufficient statistics in nonparametric regression experiments with correlated noise

Stratified Market Equilibria: An Application of Maximal Overlap Discrete Wavelet Analysis

Econometric Methods for Panel Data

Introduction to Wavelets and Wavelet Transforms

Wavelet-Based Multiresolution Analysis of Wivenhoe Dam Water Temperatures

Christopher Dougherty London School of Economics and Political Science

1 Linear Difference Equations

Applied Time. Series Analysis. Wayne A. Woodward. Henry L. Gray. Alan C. Elliott. Dallas, Texas, USA

3 Time Series Regression

Wavelets and Multiresolution Processing. Thinh Nguyen

ECE 541 Stochastic Signals and Systems Problem Set 11 Solution

Hydrological statistics for engineering design in a varying climate

Heteroskedasticity; Step Changes; VARMA models; Likelihood ratio test statistic; Cusum statistic.

Trending Models in the Data

Locally Self-Similar Processes and Their Wavelet Analysis

Ch 2: Simple Linear Regression

Testing for Panel Unit Roots in the Presence of an Unknown Structural Break and Cross-Sectional Dependency

EC408 Topics in Applied Econometrics. B Fingleton, Dept of Economics, Strathclyde University

Transcription:

An Introduction to Wavelets with Applications in Environmental Science Don Percival Applied Physics Lab, University of Washington Data Analysis Products Division, MathSoft overheads for talk available at http://weber.u.washington.edu/~dbp/talks.html joint work with: B. Whitcher, S. Byers & P. Guttorp (UW) S. Sardy (UW, MathSoft, EPFL) P. Craigmile (UW)

Overview of Talk overview of discrete wavelet transform (DWT) wavelet coefficients and their interpretation DWT as a time series decorrelator three uses for wavelets 1. testing for variance changes 2. bootstrapping auto/cross-correlation estimates 3. estimating d for stationary/nonstationary fractional difference processes with trend warning: results for 2& 3 are preliminary

Overview of DWT let X =[X 0,X 1,...,X N 1 ] T be observed time series (for convenience, assume N integer multiple of 2 J 0) let W be N N orthonormal DWT matrix (more precisely: partial DWT of level J 0 ) W = WX is vector of DWT coefficients can partition W as follows: W = W 1. W J0 V J0 W j contains N j = N/2 j wavelet coefficients related to changes of averages at scale τ j =2 j 1 (τ j is jth dyadic scale) related to times spaced 2 j units apart V J0 contains N J0 = N/2 J 0 scaling coefficients related to averages at scale λ J0 =2 J 0 related to times spaced 2 J 0 units apart

Example: DWT of FDP X t called fractional difference process (FDP) if it has a spectral density function (SDF) given by S X (f) = where σ 2 > 0 and 1 2 d<1 2 σ 2 2 sin(πf) 2d, note: for small f, have S X (f) C/ f 2d ; i.e., 1/f type process if d = 0, FDP is white noise if 0 <d< 1 2, FDP stationary with long memory can extend definition to d 1 2 nonstationary 1/f type process also called FARIMA(0,d,0) process example: DWT of FDP, d =0.4

Two Consequences of Orthonormality multiresolution analysis (MRA) X = W T W = J 0 j=1 W T j W j + V T J 0 V J0 J 0 j=1 (W j partitions W commensurate with W j ) scale-based additive decomposition D j s & S J0 called details & smooth example: Nile River minimum flood levels analysis of variance: ˆσ 2 X 1 N N 1 t=0 (X t X) 2 = 1 N J 0 j=1 D j + S J0 W j 2 + V J0 2 X 2 scale-based decomposition (cf. frequency-based) can define wavelet variance νx(τ 2 j ) for FDP, can deduce d from log/log plots since ν 2 X(τ j ) Cτ 2d 1 j example: Nile River minimum flood levels

DWT in Terms of Filters filter X 0,X 1,...,X N 1 to obtain 2 j/2 W j,t L j 1 l=0 h j,l X t l mod N, t =0, 1,...,N 1 where h j,l is jth level wavelet filter note: circular filtering subsample to obtain wavelet coefficients: W j,t =2 j/2 W j,2 j (t+1) 1, t =0, 1,...,N j 1, where W j,t is tth element of W j examples: Haar, D(4), C(6) & LA(8) wavelet filters 1 jth wavelet filter is band-pass with pass-band [, 1 ] 2 j+1 2 j note: jth scale related to interval of frequencies similarly, scaling filters yield V J0 examples: Haar, D(4), C(6) & LA(8) scaling filters J 0 th scaling filter is low-pass with pass-band [0, 1 2 J 0 +1 ]

Wavelets as Whitening Filters recall DWT of FDP for d =0.4 since FDP is stationary process, W j is also (ignoring terms influenced by circularity) can compute SDFs for each W j see figure DWT acts as whitening filter requires SDF of X to be flat over pass-band 1 [, 1 ] 2 j+1 2 j if not true, can use wavelet packet transform (DWPT) used by Flandrin, Tewfik & Kim, Wornell, McCoy & Walden three examples built on whitening property 1. testing for variance changes 2. bootstrapping auto/cross-correlation estimates 3. estimating d for stationary/nonstationary fractional difference processes with trend whitening property should help with other problems

Homogeneity of Variance: I claim: DWT approximately decorrelates FDPs implication: W j should resemble white noise (ignoring coefficients influenced by circularity) cov {W j,t,w j,t } 0 when t t var {W j,t } should not vary with t (homogeneity of variance) can test for homogeneity of variance using W j suppose Y 0,...,Y N 1 independent normal RVs with E{Y t } = 0 and var {Y t } = σ 2 t want to test null hypothesis H 0 : σ 2 0 = σ 2 1 = = σ 2 N 1 can test H 0 versus a variety of alternatives, e.g., H 1 : σ 2 0 = = σ 2 k σ 2 k+1 = = σ 2 N 1 using normalized cumulative sum of squares

Homogeneity of Variance: II D + to define test statistic D, start with P k and then compute max k +1 0 k N 2 k j=0 Yj 2 N 1 j=0 Yj 2, k =0,...,N 2 N 1 P k & D max 0 k N 2 from which we form D max (D +,D ) can reject H 0 if observed D is too large can quantify too large by considering distribution of D under H 0 need to find critical value x α such that P[D x α ]=α for, e.g., α =0.01, 0.05 or 0.1 P k once determined, can perform α level test of H 0 : compute D statistic from data Y 0,...,Y N 1 reject H 0 at level α if D x α k N 1

Homogeneity of Variance: III can determine critical values x α in two ways Monte Carlo simulations large sample approximation to distribution of D: P[(N/2) 1/2 D x] 1+2 ( 1) l e 2l2 x 2 l=1 (reasonable approximation for N 128) idea: given time series X, compute D using W j,t =2 j/2 W j,2 j (t+1) 1, (L 2) 1 1 2 j t N 2 1, j where L is length of j = 1 level wavelet filter and 2 j/2 W j,t L j 1 l=0 h j,l X t l mod N results in level by level tests above formulation allows for general N (i.e., N need not be multiple of 2 J 0) Q: is DWT decorrelation of FDPs good enough? Fig. 9 says yes!

Example: Nile River Minima recall MRA & wavelet variance plots application of homogeneity of variance test: scale D x 0.1 x 0.05 x 0.01 1 year 0.1559 0.0945 0.1051 0.1262 2 years 0.1754 0.1320 0.1469 0.1765 4 years 0.1000 0.1855 0.2068 0.2474 8 years 0.2313 0.2572 0.2864 0.3436 if H 0 rejected, use nondecimated DWT to detect change point: compute rotated cumulative variance curve look for time of largest excursion from 0 Fig. 10: change point detection 720 AD for level j =1 722 AD for level j =2 agrees well with mosque construction in 715 AD interpretation differs from Beran & Terrin (1996)

Wavelet-Based Bootstrapping: I Davison & Hinkley, 1998, Chapter 8, discusses bootstrapping in context of time series analysis whitening allows wavelet-based bootstrapping for certain statistics (but not all) first example: lag 1 autocorrelation estimate ˆr 1 N 2 t=0 X t X t+1 N 1 t=0 X 2 t idea: to get standard error of ˆr 1, compute DWT of X sample with replacement from W j to form W (b) j (do same with V J0 ) synthesize X (b) using W (b) j s & V (b) J 0 compute ˆr (b) 1 for X (b) repeat until computer gets tired use standard error of ˆr (b) 1 s for X (b) s to assess standard error of ˆr 1 for X

Wavelet-Based Bootstrapping: II to test scheme, did Monte Carlo study involving AR(1) process: X t =0.9X t 1 + ɛ t MA(1) process: X t = ɛ t + ɛ t 1 FDP with d =0.45 average ˆr (b) 1 s have negligible bias comparison of standard errors, N = 128: LA(8) DWT LA(8) DWPT true AR(1) 0.043 ±0.003 0.057 ±0.005 0.039 MA(1) 0.073 ±0.003 0.068 ±0.002 0.063 FDP 0.107 ±0.005 0.093 ±0.005 0.108 comparison of standard errors, N = 1024: LA(8) DWT LA(8) DWPT true AR(1) 0.019 ±0.001 0.018 ±0.001 0.014 MA(1) 0.032 ±0.002 0.024 ±0.001 0.022 FDP 0.054 ±0.004 0.046 ±0.003 0.051 handles both short & long memory models

Wavelet-Based Bootstrapping: III second example: cross-correlation estimate ˆr (XY ) 0 N 1 ( N 1 t=0 X 2 t t=0 X t Y t N 1 t=0 Yt 2 ) 1/2 to assess null hypothesis r (XY ) 0 =0, separately generate X (b) & Y (b) bootstrapped ˆr (XY ) 0 should reflect variability in ˆr (XY ) 0 under null example: cross-correlation ˆr (XY ) 0 = 0.26 between maximum annual snow-pack level at Mt. Rainier Pacific decadel oscillation index can reject null at critical level p =0.01 lots of questions to be addressed! what is class of applicable statistics?

Estimation for FDPs: I extension of work by Wornell, McCoy & Walden problem: estimate d from time series U t such that where U t = T t + X t T t r j=0 a j t j is polynomial trend X t is FDP process, but can have d 1 2 DWT wavelet filter of width L has embedded differencing operation of order L/2 if L 2 r + 1, reduces polynomial trend to 0 can partition DWT coefficients as W = W s + W b + W w where W s has scaling coefficients and 0s elsewhere W s has boundary-dependent wavelet coefficients W w has boundary-independent wavelet coefficients

Estimation for FDPs: II since U = W T W, can write U = W T (W s + W b )+W T W w T + X example: microbolometric camera data can use values in W w to form likelihood: L(d, σ 2 ɛ ) J 0 N j j=1 t=1 where σ 2 ɛ is innovations variance; 1 ( 2πσ 2 j ) 1/2 e W 2 j,t+l j 1/(2σ2 j ) σj 2 1/2 H 1/2 j(f) df ; 2 sin(πf) 2d and H j ( ) is squared gain for {h j,l } σ 2 ɛ works well in Monte Carlo simulations for Hansen Lebedeff series, get ˆd. =0.40 ± 0.08 lots of questions to be addressed! what are properties of T? how to assess significance of T? (extension of Brillinger, 1994)