Conditional-Sum-of-Squares Estimation of Models for Stationary Time Series with Long Memory

Similar documents
Conditional-sum-of-squares estimation of models for stationary time series with long memory

Regression with an Evaporating Logarithmic Trend

LECTURE 8: ASYMPTOTICS I

LECTURE 11 LINEAR PROCESSES III: ASYMPTOTIC RESULTS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

1 Convergence in Probability and the Weak Law of Large Numbers

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

Asymptotic distribution of products of sums of independent random variables

Statistical Inference Based on Extremum Estimators

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Lecture 19: Convergence

Notes 27 : Brownian motion: path properties

Math Solutions to homework 6

Bull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung

Self-normalized deviation inequalities with application to t-statistic

7.1 Convergence of sequences of random variables

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Sequences and Series of Functions

On the convergence rates of Gladyshev s Hurst index estimator

CSE 527, Additional notes on MLE & EM

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Study the bias (due to the nite dimensional approximation) and variance of the estimators

A Note on the Kolmogorov-Feller Weak Law of Large Numbers

A Weak Law of Large Numbers Under Weak Mixing

The log-behavior of n p(n) and n p(n)/n

A note on self-normalized Dickey-Fuller test for unit root in autoregressive time series with GARCH errors

Gaussian Pseudo-Maximum Likelihood Estimation of Fractional Time Series Models

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

6.3 Testing Series With Positive Terms

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization

Distribution of Random Samples & Limit theorems

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

Limit distributions for products of sums

1 Covariance Estimation

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker

Advanced Stochastic Processes.

2 Banach spaces and Hilbert spaces

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Introduction to Probability. Ariel Yadin

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons

Notes 19 : Martingale CLT

Liudas Giraitis, Masanobu Taniguchi, Murad S. Taqqu Asymptotic normality of quadratic forms of martingale differences

Stochastic Simulation

Berry-Esseen bounds for self-normalized martingales

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

2.2. Central limit theorem.

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data

A Proof of Birkhoff s Ergodic Theorem

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

Lecture 2: Monte Carlo Simulation

The natural exponential function

Bayesian Methods: Introduction to Multi-parameter Models

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

Solutions: Homework 3

Entropy Rates and Asymptotic Equipartition

for all x ; ;x R. A ifiite sequece fx ; g is said to be ND if every fiite subset X ; ;X is ND. The coditios (.) ad (.3) are equivalet for =, but these

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT

This section is optional.

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Random Variables, Sampling and Estimation

Maximum Likelihood Estimation and Complexity Regularization

A Central Limit Theorem for Belief Functions

STA Object Data Analysis - A List of Projects. January 18, 2018

The Central Limit Theorem

STAT331. Example of Martingale CLT with Cox s Model

Rates of Convergence by Moduli of Continuity

Topics in Probability Theory and Stochastic Processes Steven R. Dunbar. Stirling s Formula Derived from the Gamma Function

An Introduction to Randomized Algorithms

ECE 330:541, Stochastic Signals and Systems Lecture Notes on Limit Theorems from Probability Fall 2002

Chapter 6 Infinite Series

ki, X(n) lj (n) = (ϱ (n) ij ) 1 i,j d.

Chapter 2 The Monte Carlo Method

Math 341 Lecture #31 6.5: Power Series

Sieve Estimators: Consistency and Rates of Convergence

Estimation of the Mean and the ACVF

Asymptotic Results for the Linear Regression Model

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Detailed proofs of Propositions 3.1 and 3.2

Lecture Stat Maximum Likelihood Estimation

THE STRONG LAW OF LARGE NUMBERS FOR STATIONARY SEQUENCES

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology

Exponential Families and Bayesian Inference

Ma 4121: Introduction to Lebesgue Integration Solutions to Homework Assignment 5

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

CHAPTER 10 INFINITE SEQUENCES AND SERIES

The random version of Dvoretzky s theorem in l n

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Transcription:

Coditioal-Sum-of-Squares Estimatio of Models for Statioary Time Series with Log Memory.M. Robiso Lodo School of Ecoomics The Sutory Cetre Sutory ad Toyota Iteratioal Cetres for Ecoomics ad Related Disciplies Lodo School of Ecoomics ad olitical Sciece Discussio paper Houghto Street No. EM/2006/505 Lodo WC2A 2AE September 2006 Tel: 020 7955 6679 Research supported by ESRC Grat R000239936.

Abstract Employig recet results of Robiso (2005) we cosider the asymptotic properties of coditioal-sum-of-squares (CSS) estimates of parametric models for statioary time series with log memory. CSS estimatio has bee cosidered as a rival to Gaussia maximum likelihood ad Whittle estimatio of time series models. The latter kids of estimate have bee rigorously show to be asymptotically ormally distributed i case of log memory. However, CSS estimates, which should have the same asymptotic distributioal properties uder similar coditios, have ot received comparable treatmet: the trucatio of the ifiite autoregressive represetatio iheret i CSS estimatio has bee essetially igored i proofs of asymptotic ormality. Ulike i short memory models it is ot straightforward to show the trucatio has egligible effect. AMS 2000 subject classificatio: 62M0. Keywords ad phrases: Log memory, coditioal-sum-of-squares estimatio, cetral limit theorem, almost sure covergece. The author. All rights reserved. Short sectios of text, ot to exceed two paragraphs, may be quoted without explicit permissio provided that full credit, icludig otice, is give to the source.

Itroductio Cosider a real-valued, strictly ad covariace statioary time series x t, t 2 Z. It is believed that x t has a parametric autoregressive (AR) represetatio j ( 0 )x t j = " t ; t 2 Z: (.) j=0 Here " t is a sequece of zero-mea, ucorrelated ad homoscedastic radom variables, with variace 2 0, the j () are give fuctios with p vector argumet, 0 is a ukow p vector, ad 0 () for all. The rage of structures f j ()g covered by (.) is very broad, but of iterest to us are oes which allow x t to have log memory. Usually, these are "fractioal", where it is assumed that the fuctio (s; ) = j ()s j ; (.2) j=0 with complex-valued argumet s o the uit circle, is of form where () is a scalar fuctio of such that (s; ) = ( s) () (s; ); (.3) 0 < ( 0 ) < 2 (.4) ad It follows that x t has spectral desity of form f() = 0 < j (s; 0 )j < ; jsj = : (.5) 2 0 j (e i ; 0 )j 2 = e i 2(0) 2 0 j (e i ; 0 )j 2 : (.6) The leadig choice of is a ratioal fuctio of s, i which case x t is said to be a fractioal autoregressive itegrated movig average (FARIMA) model; ( 0 ) is caled the memory parameter. Leadig methods of estimatio of 0, give observatios x ; :::; x, are Gaussia maximum likelihood (ML), ad approximatios thereto. They are "approximatios" i the sese that uder similar coditios they have the same asymptotic ormal distributio as ML, ad are thus asymptotically e ciet uder Gaussiaity. At the same time, uder may departures from Gaussiaity, though the e ciecy is lost the limit ormal distributio of all these estimates is ua ected. Assumig Gaussiaity, asymptotic ormality of oe form of approximatio, a Whittle estimate ivolvig itegratio over frequecy, was rst established by Fox ad Taqqu (986), ad the by Dahlhaus (989) i case of ML estimatio. Giraitis ad Surgailis (990) established asymptotic ormality for the estimate 2

cosidered by Fox ad Taqqu (986) whe " t eed ot be Gaussia but is idepedet ad idetically distributed with ite fourth momet. Due to the pole i the spectral desity at = 0 (see (.6)), the asymptotic ormality proofs are cosiderably more challegig tha those of Haa (973) for short memory time series models, icisive though these were for such models. A alterative estimate that has bee cosidered i the literature is coditioalsum-of-squares (CSS) estimatio, which was previously employed by Box ad Jekis (970) for short memory time series models. De e ad estimate 0 by e t () = t j=0 s () = j ()x t j ; (.7) e 2 t (); (.8) ^ = arg mi 2 s (); (.9) where R p is a compact set. Oe ca motivate ^ by the hope that s ( 0 ) is a good approximatio to "2 t, which is itself proportioal to the expoet i the desity fuctio of idepedet idetically distributed zero-mea ormal variates. Thus oe hopes that (after ceterig at 0 ad 2 ormig) ^ has the same limit distributioal properties as the Gaussia ML ad Whittle estimates metioed previously. Give a iitial cosistecy proof of ^, a stadard approach to provig asymptotic ormality etails applyig the mea value theorem to r (^ ) about 0, where r () = @s () = 2 h t ()e t (); (.0) @ for h t () = @e t() : (.) @ The mai part of the proof the ivolves establishig that 2 r ( 0 ) coverges i distributio to a zero-mea ormal vector. If the " t are assumed to be coditioally homoscedastic martigale di ereces, ad coditios esurig that h t () has ite variace are imposed, such covergece is easily see to hold (see e.g. Brow (97)) for r ( 0 ) = 2 h t " t ; (.2) where h t = h t ( 0 ). However this is oly useful if also r( 0 ) r ( 0 ) = o p 2 ; (.3) 3

i other words, if the e ect of replacig e t = e t ( 0 ) by " t is su cietly small. Ulike the h t " t, the h t e t ad h t (e t " t ) are ot zero-mea, orthogoal radom variables. We ca employ the Schwarz iequality: E jr ( 0 ) r ( 0 )j 2 h E(e t " t ) 2 E kh t ( t ( 0 )k 2i 2 : (.4) The if, say, it were true that E(e t " t ) 2 = O p (t ) for some > 0, the right had side of (.4) would be O p 2 2, ad (.3) established. For short memory models E(e t " t ) 2 typically decays fast eough, ideed eve expoetially. But uder quite geeral coditios permittig log memory (see Robiso (2005)), E(e t " t ) 2 Kt (.5) oly, where K is a arbitrarily large geeric costat, which is isu ciet to establish (.3) usig (.4). A more delicate proof of (.3) is required, ad this was give by Robiso (2005). As discussed there, this delicacy relates to that see i the proofs of Fox ad Taqqu (986) ad others for alterative estimates of 0. Ideed, give that these estimates ad CSS should have the same limit distributioal properties, it would be extraordiary if the proof for CSS were very much easier tha for the other estimates. A cetral limit theorem for ^ is give i Sectio 3. rior to that, i the followig sectio, we provide the almost covergece of ^ (uder somewhat more geeral coditios). Haa (973) proved this for various estimates, assumig strict statioarity ad ergodicity, which is cosistet with log memory. However, he did ot cover CSS estimatio. 2 Almost Sure Covergece I the preset sectio we do ot require that x t ecessarily has spectral desity of form (.6), with (.5) holdig, but simply that it is a zero-mea strictly statioary, ergodic process with AR represetatio (.), with the setece after (.) holdig, ad also 0 2 ; for all 2 f 0 g (s; ) 6= (s; 0 ) (2.) o a subset of jsj = of positive measure, j(s; )j is cotiuous i for all s : jsj = ; ad sup j j ()j < : (2.2) j=0 2 Coditio (2.) is a stadard ideti ability coditio, ad (2.2) is reasoable i that log memory models (e.g. (.6), such as FARIMAs) typically have AR represetatios with summable coe ciets. Note that this setup allows the 4

spectral desity to have poles at o-zero frequecies (as i certai cyclic ad seasoal models), whereas (.6) does ot, i view of (.5). Theorem Uder the above coditios lim ^ = 0 ; a:s: (2.3)! roof. Theorem of Haa (973) ad Theorem of Fox ad Taqqu (986) cover the estimate ~ = arg mi sy (); (2.4) where s y () is the objective fuctio for the itegral form of Whittle estimate, i.e. 2 N () of Haa (973) or 2 N () of Fox ad Taqqu (986). We ca write where s y () = c (0) 0 () + 2 c (j) j (); (2.5) c (j) = j x t x t+j ; 0 j ; (2.6) j () = k () k+j (): (2.7) k=0 From Theorem of Haa (973), ad its proof, it is clear that it su ces to show that lim s y () s () = 0; a:s: (2.8) Now sup! s y () s () = + 2 x 2 t k= t+ j x t x t+j 2 k() k= t j+ k () k+j () = 4 a i (); (2.9) i= where ( a () = (0) j 2 j() + a 2 () = x 2 t (0) a 3 () = 2 a 4 () = 2 (j) j ( j= k= t+ k= t j+ 2 j() j (x t x t+j (j)) ) ; (2.0) 2 k(); (2.) k () k+j (); (2.2) k= t j+ k () k+j () ) ;(2.3) 5

where It remais to prove (j) = cov(x 0 ; x j ): (2.4) lim sup ja i ()j = 0; i = ; 2; 3; 4: (2.5)! 2 As the proofs for i = ; 2 are similar to but simpler tha those for i = 3; 4, we give oly the latter. We have ( 2 sup ja 3 ()j 2 j(j)j sup j j ()j) : (2.6) j=0 2 The quatity i braces is ite ad sice, by the Riema-Lebesgue theorem, existece of a spectral desity implies lim j! (j) = 0, it follows from the Toeplitz lemma that (2.6)! 0 as!. Next, by summatio-by-parts a 4 () = 2 +2 j t fc t(j) (j)g t j+ () t+ () j fx t x t+j (j)g k () k+j (): (2.7) The modulus of the rst term o the right has supremum, over, bouded by k= K sup j jc t (j) (j)j sup j t+ ()j (2.8) usig (2.2). Usig (2.2) agai, ad Theorem of Haa (974) ad the Toeplitz lemma, it follows that (2.8) is o() a.s. The secod term i (2.7) ca be similarly hadled. 3 Asymptotic Normality We assume ow i additio that x t has spectral desity (.6), with (.4), (.5) satis ed, that 0 is a iterior poit of, that the " t i (.) are idepedet with zero mea, variace 2 0 ad uiformly bouded fourth momet, that (s; ) is twice cotiuously di eretiable i, ad that the matrix = R 2 is positive de ite. log e i 2 2 @ @ log e i ; 0 log e i 2 2 @ @ log e i ; 0 0 d (3.) 6

Theorem 2 Uder the above coditios, as! 2 (^ 0 ) coverges i distributio to a p-variate ormal vector with zero mea ad covariace matrix. roof. As discussed i Sectio, we have 0 = r (^ ) = r ( 0 ) + ~ T (^ 0 ); (3.2) where T ~ is the matrix formed by evaluatig, for i = ; :::; p, the i-th row of the matrix T () = @ 2 =@@ 0 s () at = ~ i, where ~ i 0 ^ 0 ; k:k deotig Euclidea orm. De e j = @ @ j(; ); (3.3) so that ad de e also Write r ( 0 ) h t = t j x t j ; (3.4) t = j x t j ; (3.5) r = t " t : (3.6) r = r + r 2 + r 3, where r = 2 (h t t )" t ; (3.7) r 2 = 2 t (e t " t ); (3.8) r 3 = 2 (h t t )(e t " t ): (3.9) We show that r i = o p ( 2 ), i = ; 2; 3. To deal with r, we may write h t t = j x t j = jt " j ; (3.0) j=t where Sice jt = E kh t t k 2 = 2 0 j k=0 k+j j k : (3.) jt 2 (log t) 2 K t (3.2) 7

as oted o p.824 of Robiso (2005), ad " t is idepedet of h t t, it follows that E kr k 2 K 2 t K log 2 : (3.3) Next, we ca write where e t " t = jt " j ; (3.4) jt = j k=0 Thus, from Lemma 6 of Robiso (2005), Fially, E kr 3 k k+j t k : (3.5) E kr 2 k 2 (log )3 K 2 : (3.6) K K E kh t t k 2 E (e t " t ) 2 2 log t t (log )2 ; (3.7) usig (3.2) ad also Lemma 4 of Robiso (2005). This completes the proof that r i = o p ( 2 ), i = ; 2; 3. The remaider of the proof is easier, ad more stadard, ad is omitted. ACKNOWLEDGEMENT I thak a referee for a careful readig of the paper. REFERENCES BOX, G.E.. ad JENKINS, G.M. (970). Time Series Aalysis, Forecastig ad Cotrol. Holde-Day, Sa Fracisco. BROWN, B.M. (97). Martigale cetral limit theorems. A. Math. Statist. 42, 59-66. DAHLHAUS, R. (989). E ciet parameter estimatio for self-similar processes. A. Statist. 7 749-766. (Correctio A. Statist., forthcomig.) FOX, R. ad TAQQU, M.S. (986). Large-sample properties of parameter estimates for strogly depedet statioary Gaussia time series. A. Statist. 4 57-532. GIRAITIS, L. ad SURGAILIS, D. (990). A cetral limit theorem for quadratic forms i strogly depedet liear variables ad its applicatio to asymptotic ormality of Whittle s estimate. robab. Theory Related Fields 86 87-04. 8

HANNAN, E.J. (973). The asymptotic theory of liear time series models. J. Appl. robability 0 30-45. (Correctio: J. Appl. robability 0, 93.) HANNAN, E.J. (974). The uiform covergece of autocovariaces. A. Statist. 2 803-806. ROBINSON,.M. (2005). E ciecy improvemets i iferece o statioary ad ostatioary fractioal time series. A. Statist. 33, 800-842. 9