Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535

Similar documents
FOURIER TRANSFORMS OF STATIONARY PROCESSES 1. k=1

CENTRAL LIMIT THEOREMS FOR FUNCTIONALS OF LINEAR PROCESSES AND THEIR APPLICATIONS

Nonparametric Density Estimation fo Title Processes with Infinite Variance.

UNIT ROOT TESTING FOR FUNCTIONALS OF LINEAR PROCESSES

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

QED. Queen s Economics Department Working Paper No. 1244

Pointwise convergence rates and central limit theorems for kernel density estimators in linear processes

LARGE DEVIATION PROBABILITIES FOR SUMS OF HEAVY-TAILED DEPENDENT RANDOM VECTORS*

Submitted to the Brazilian Journal of Probability and Statistics

MODERATE DEVIATIONS FOR STATIONARY PROCESSES

MOMENT CONVERGENCE RATES OF LIL FOR NEGATIVELY ASSOCIATED SEQUENCES

On variable bandwidth kernel density estimation

Mathematical Institute, University of Utrecht. The problem of estimating the mean of an observed Gaussian innite-dimensional vector

Soo Hak Sung and Andrei I. Volodin

Density estimators for the convolution of discrete and continuous random variables

PROBABILITY AND MOMENT INEQUALITIES UNDER DEPENDENCE

A note on the growth rate in the Fazekas Klesov general law of large numbers and on the weak law of large numbers for tail series

Some Contra-Arguments for the Use of Stable Distributions in Financial Modeling

Stable Process. 2. Multivariate Stable Distributions. July, 2006

Estimation of a quadratic regression functional using the sinc kernel

Does k-th Moment Exist?

On the asymptotic normality of frequency polygons for strongly mixing spatial processes. Mohamed EL MACHKOURI

Consistency of the maximum likelihood estimator for general hidden Markov models

The Convergence Rate for the Normal Approximation of Extreme Sums

BRANCHING PROCESSES 1. GALTON-WATSON PROCESSES

Central Limit Theorem for Non-stationary Markov Chains

Research Article Almost Periodic Solutions of Prey-Predator Discrete Models with Delay

Complete moment convergence of weighted sums for processes under asymptotically almost negatively associated assumptions

Regularity of the density for the stochastic heat equation

arxiv:math.pr/ v1 17 May 2004

On uniqueness of moving average representations of heavy-tailed stationary processes

Weighted Sums of Orthogonal Polynomials Related to Birth-Death Processes with Killing

5. Some theorems on continuous functions

On Reparametrization and the Gibbs Sampler

Optimal global rates of convergence for interpolation problems with random design

ON THE COMPLETE CONVERGENCE FOR WEIGHTED SUMS OF DEPENDENT RANDOM VARIABLES UNDER CONDITION OF WEIGHTED INTEGRABILITY

A Single-Pass Algorithm for Spectrum Estimation With Fast Convergence Han Xiao and Wei Biao Wu

Wald for non-stopping times: The rewards of impatient prophets

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

NOTES AND PROBLEMS IMPULSE RESPONSES OF FRACTIONALLY INTEGRATED PROCESSES WITH LONG MEMORY

L p Spaces and Convexity

P i [B k ] = lim. n=1 p(n) ii <. n=1. V i :=

Complete q-moment Convergence of Moving Average Processes under ϕ-mixing Assumption

March 1, Florida State University. Concentration Inequalities: Martingale. Approach and Entropy Method. Lizhe Sun and Boning Yang.

arxiv: v1 [math.pr] 1 May 2014

Nonparametric regression with martingale increment errors

Minimax Rate of Convergence for an Estimator of the Functional Component in a Semiparametric Multivariate Partially Linear Model.

Kernel estimation for time series: An asymptotic theory

Regular Variation and Extreme Events for Stochastic Processes

ON THE STRONG LIMIT THEOREMS FOR DOUBLE ARRAYS OF BLOCKWISE M-DEPENDENT RANDOM VARIABLES

Bahadur representations for bootstrap quantiles 1

3 Integration and Expectation

A NOTE ON A DISTRIBUTION OF WEIGHTED SUMS OF I.I.D. RAYLEIGH RANDOM VARIABLES

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Some functional (Hölderian) limit theorems and their applications (II)

On the singular values of random matrices

Alternate Characterizations of Markov Processes

The Stein and Chen-Stein methods for functionals of non-symmetric Bernoulli processes

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables

Random Bernstein-Markov factors

Introduction and Preliminaries

Estimation of the functional Weibull-tail coefficient

Conditional independence, conditional mixing and conditional association

Logarithmic scaling of planar random walk s local times

Empirical Processes of Dependent Random Variables

Probability and Measure

MATH 131A: REAL ANALYSIS (BIG IDEAS)

Almost sure limit theorems for U-statistics

Research Reports on Mathematical and Computing Sciences

Refining the Central Limit Theorem Approximation via Extreme Value Theory

Spring 2012 Math 541B Exam 1

Asymptotic behavior for sums of non-identically distributed random variables

ON A DISTANCE DEFINED BY THE LENGTH SPECTRUM ON TEICHMÜLLER SPACE

STOCHASTIC GEOMETRY BIOIMAGING

Relative growth of the partial sums of certain random Fibonacci-like sequences

A regeneration proof of the central limit theorem for uniformly ergodic Markov chains

Convergence of generalized entropy minimizers in sequences of convex problems

Asymptotic Analysis of Exceedance Probability with Stationary Stable Steps Generated by Dissipative Flows

EXACT CONVERGENCE RATE AND LEADING TERM IN THE CENTRAL LIMIT THEOREM FOR U-STATISTICS

Independence of some multiple Poisson stochastic integrals with variable-sign kernels

Size properties of wavelet packets generated using finite filters

THE DVORETZKY KIEFER WOLFOWITZ INEQUALITY WITH SHARP CONSTANT: MASSART S 1990 PROOF SEMINAR, SEPT. 28, R. M. Dudley

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

Gaussian Processes. 1. Basic Notions

The main results about probability measures are the following two facts:

The Codimension of the Zeros of a Stable Process in Random Scenery

Mi-Hwa Ko. t=1 Z t is true. j=0

Asymptotic distributions of some scale estimators in nonlinear models with long memory errors having infinite variance 1

On large deviations for combinatorial sums

On Least Squares Linear Regression Without Second Moment

Weak invariance principles for sums of dependent random functions

Conditional moment representations for dependent random variables

Real Variables # 10 : Hilbert Spaces II

A CLT FOR MULTI-DIMENSIONAL MARTINGALE DIFFERENCES IN A LEXICOGRAPHIC ORDER GUY COHEN. Dedicated to the memory of Mikhail Gordin

Tail bound inequalities and empirical likelihood for the mean

CHAPTER 3: LARGE SAMPLE THEORY

Exercises in Extreme value theory

Lectures 2 3 : Wigner s semicircle law

STATIONARY RANDOM WALKS AND ISOTONIC REGRESSION

f(x) f(z) c x z > 0 1

Transcription:

Additive functionals of infinite-variance moving averages Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535 Departments of Statistics The University of Chicago Chicago, Illinois 60637 June 12, 2003

Additive functionals of infinite-variance moving averages June 12, 2003 By Wei Biao Wu University of Chicago Abstract We consider the asymptotic behavior of additive functionals of linear processes with infinite variance innovations. Applying the central limit theory for Markov chains, we establish asymptotic normality for short-range dependent processes. A non-central limit theorem is obtained when the processes are long-range dependent and the innovations are in the domain of attraction of stable laws. Key words and phrases. Central limit theorem, empirical process, level crossings, linear process, long- and short-range dependence, Markov chain, martingale, stable distribution. 1 Introduction Let {ε i } i Z be iid c 1 random vectors such that E( ε 1 α ) < for some 0 < α 2 and E(ε 1 ) = 0 if 1 α 2; let {a i } i 0 be r c matrices such that k=1 a k α <, where r, c 1 are fixed integers and a = ( k,l a2 kl )1/2 for matrix a = (a kl ). Then the multiple linear process X n = i=0 a iε n i is well-defined (cf Corollary 5.1.3 in Chow and Teicher (1978)). In case r = c = 1 and ε i is stable, then X n is also stable. The important fractional ARIMA models with infinite variance innovations fall within this framework. Let K be a measurable function for which E[K(X 1 )] = 0. Our goal is to investigate the limiting behavior of S n (K) = n i=1 K(X i). This problem has received much attention recently; see Hsing (1999), Koul and Surgailis (2001), Surgailis (2002) and references therein for some historical developments. In Hsing (1999), the central limit problem is considered. 1

When K belongs to certain classes of bounded functions, Koul and Surgailis (2001) and Surgailis (2002) obtained non-central limit theorems for univariate processes. As pointed in Koul and Surgailis (2001), the treatment of the asymptotic normality problem in Hsing (1999) is not rigorous. The central limit problem for multiple linear processes with heavy tails remains unsolved. Here we shall establish a central limit theorem under mild conditions. The method adopted in this paper is based on the limit theory for additive functionals of Markov chains. Gordin and Lifsic (1978) introduced the method and an important generalization was given in Woodroofe (1992). The main idea is to write K(X n ) = g(x n ) and use the fact that the shift process X n = (..., ε n 1, ε n ) is a Markov chain. Then the general limit theory developed for the additive functional S n (g) = n i=1 g(ξ i) for Markov chains ξ n is applicable. This theory can be applied to linear processes and enables one to launch a systematic study of their asymptotic behavior. See Wu and Mielniczuk (2001) and Wu (2002, 2003), where certain open problems in linear processes are circumvented. In this article, we apply this technique to processes with infinite variance innovations to establish a central limit theorem for short-range dependent processes and a non-central limit theorem for long-range dependent sequences when the innovations are in the domain of attraction of certain stable laws. Derivations of limit theorems based on such methods seem to be quite simple at a technical level. By comparison, the treatment in Hsing (1999) appears formidable. This paper is organized as follows. Main results are stated in Section 2 and proved in Section 3. In Section 4, an open problem is proposed. 2 Main results Unless otherwise specified, we assume that E( ε 1 α ) < holds for a fixed 0 < α 2 and E(ε 1 ) = 0 if 1 α 2. For a random vector ξ, let ξ p = [E( ξ p )] 1/p and ξ = 2

ξ 2. Recall the shift process X n = (..., ε n 1, ε n ) and consider the projection operator P k ξ = E[ξ X k ] E[ξ X k 1 ], k Z. Define the truncated processes X n,k = k i= a n iε i and X n,k = X n X n,k 1. As in Ho and Hsing (1997), let K n (w) = E[K(w + X n,1 )] and K (w) = E[K(w + X n )]. (1) Theorem 1 and Corollary 1 assert central and non-central limit theorems for S n (K), respectively. In Corollary 1 we assume that ε i D(d), the domain of attraction of stable law with index d (0, 2). Theorem 2 provides an approximation of S n (K) by the linear functional n i=1 X i. An interesting application is given in Section 2.3, where a limit theorem for level crossings is derived. 2.1 A Central Limit Theorem For a function f define the local Lipschitz constant by L f (x) = sup y x: y x 1 f(y) f(x). y x Theorem 1. Assume that E[K 2 (X 0 )] < and that either (a) there exist q > 1 and κ N for which a n α 2q <, (2) n=1 F q (κ) <, where F q (n) = E[ L Kn (X n,0 ) 2p + K n (X n,0 ) 2p ], p = q/(q 1), (3) or (b) there exists κ N for which F 1 (κ) <, where F 1 (n) = sup[ L Kn (x) + K n (x) ] <, (4) x a n α 2 <. (5) n=1 Then S n (K)/ n N(0, σ 2 ) for some σ <. 3

Remark 1. Case (b) can be viewed as a limit of (a) when q 1. Note that F q defined in (3) is a non-increasing function in n. To see this, by the smoothing property, K n+1 (x) = EK n (x + a n ε 1 ), L Kn+1 (x) = sup { y x: y x 1 E sup y x: y x 1 E[K n (y + a n ε 1 ) K n (x + a n ε 1 )] y x K n (y + a n ε 1 ) K n (x + a n ε 1 ) y x } = EL Kn (x + a n ε 1 ) which, in conjunction with K n+1 (x) E K n (x + a n ε 1 ) and the fact that X n+1,0 + a n ε 1 and X n,0 are identically distributed, implies F q (n + 1) F q (n) by Jensen s inequality. Similarly, F 1 ( ) defined in (4) is non-increasing. Remark 2. In Theorem 1, we need not assume a n = n γ l(n) in the univariate case r = c = 1, l a slowly varying function, and the innovations ε i need not be in D(d). If r = c = 1, a n = n γ l(n) and ε i D(d), then (5) holds for 2/γ < α < d provided dγ > 2, an assumption used in Hsing (1999) to obtain a central limit theorem for S n (K). 2.2 A Non-Central Limit Theorem Suppose K (0) := ( K (w)/ w 1,..., K (w)/ w r ) w=0 exists. Let R n = S n (K) T n, where the linear functional T n = K (0) n i=1 X i. Set A n (α) = i=n a i α. Condition 1. For sufficiently large n, K n ( ) is differentiable and there exist q 1, 1 ν < α/q such that K n 1 (X n,1 ) K n (X n,0 ) K (0)a n 1 ε 1 ν = O[ a n 1 α qν + an 1 A 1 qν n (α)]. (6) Intuitively, (6) asserts a first order expansion of Taylor s type. Theorem 2. Assume Condition 1 and a n = n γ l(n), where 1 > γ > 1/α. Then for all β < β 0 = min {1, γα/(qν), γ + (γα 1)/(qν)}, R n ν = O[n 1 β +1/ν ] (7) 4

To state Corollary 1, we assume that ε i D(d). Namely, there exists a slowly varying function l ε such that n i=1 ε i/[n 1 d l ε (n)] Z, where Z is a stable law with index d. Corollary 1. Let r = c = 1. Assume Condition 1, a n = n γ l(n) and β 0 +1/ν < γ+1/d. Then S n (K)/D n K (0)Z, where D n = n1 γ+ 1 d l(n)l ε (n) 1 γ { [t 1 γ (max(t 1, 0)) 1 γ ] d dt 0 } 1 d. Proof. By Theorem 2, R n = o P (D n ). Hence the Corollary follows from the classical result that n i=1 X i/d n Z (Avram and Taqqu (1986)). Proposition 1. Assume that there exist ν [1, α), q [1, α/ν) and n N such that F q (n) := E[ L K n (X n,0 ) νp + K n(x n,0 ) νp + K n (X n,0 ) νp + K n (X n,1 ) νp ] <, (8) where p = q/(q 1). Here F 1 (n) < is interpreted as sup x L K n (x) + K n(x) + K n (x) ] <. Then Condition 1 holds. By Remark 1, F q ( ) is non-increasing. As illustrated in Example 2, Corollary 1 goes beyond earlier ones by allowing unbounded functionals K; see Koul and Surgailis (2001) and Surgailis (2002) where boundedness are required. Example 1. Let α = d y 2, ν = d y and q = 1 + cy, where c < (γd 1)/(γd 2 ) is a nonnegative number. Then as y 0, ( γ + 1/d) ( β 0 + 1/ν) = ( γc + γ/d 1/d 2 )y + O(y 2 ) > 0 since γc + γ/d 1/d 2 > 0. Corollary 1 is applicable if y > 0 is sufficiently small. Example 2. Let a 0 = 1, r = c = 1 and let ε i be iid symmetric-α-stable random variables with index d (1, 2) and characteristic function φ(t) = E[exp(t 1ε 1 )] = exp( t d ) for t R. Assume that there exists C > 0 such that for all u R, K(u) C(1 + u ) α/(νp). (9) 5

Then (8) holds with n = 1. Notice that (9) allows unbounded and discontinuous functions if p <. To see that (9) implies (8), let f be the density of ε i. By Theorem 2.4.2 in Ibragimov and Linnik (1971), f(t) c d t 1 d as t, where c d = π 1 Γ(1+d) sin(πd/2) and Γ is the Gamma function. It is easily seen that a similar argument also implies f (t) c d (1 + d) t 2 d and f (t) c d (1 + d)(2 + d) t 3 d (10) as t. Observe that K 1 (x) = R K(w + x)f(x)dx = R K(u)f(u x)du and K 1(x) = R K(u)f (u x)du. Let g(x; 1) = sup y 1 g(x +y) be the maximal function of g. Using 1 + x + v (1 + x )(1 + v ), we have f (u y) f (u x) L K 1 (x) K(u) sup du R y x 1 x y R K(x + v) f (v; 1) dv C(1 + x ) α/(νp) R (1 + v )α/(νp) f (v; 1) dv C 1 (1 + x ) α/(νp) in view of (10), where C 1 is a constant. Similarly, K 1(x) C 2 (1+ x ) α/(νp) and K 1 (x) C 3 (1 + x ) α/(νp) hold for some constants C 2 and C 3. Thus (8) holds since α < d. 2.3 Level Crossing Analysis Consider the one dimensional process X n = i=0 a iε n i, where ε i D(d) and a n = n γ l(n) with γ > 1/α and 1 < d < 2. Corollary 1 gives the asymptotic distribution for the instantaneous filter S n (K) = n i=1 K(X i). As a simple non-instantaneous filter, we discuss the level crossing statistics N n = n t=1 1 (X t 1 l)(x t l) 0 and M n (τ) = n t=1 1 ( τ X t )( τ X t 1 ) 0, where X t = X t X t 1 is the difference operator and τ X t =... X t for τ N. Note that N n counts how many times the process {X t } n t=1 crosses level l, and M (τ) n the τth order crossing count. In particular, M (1) n is the number of local extremes of the process {X t } n t=1. In certain engineering problems, it is popular to conduct analysis based on crossing counts; see Kedem (1994) for more details. Limit theorems for N n and M (τ) n are useful for related statistical inference. A central limit theorem for N n is derived in Wu (2002), where l = 0 and {X n } is a short-range dependent linear process with finite variance. 6 is

For r 1, (X n, X n 1,..., X n r+1 ) can be viewed as a multiple linear process Y n = i=0 a iε n i, where a i = (a i, a i 1,..., a i r+1 ) and a i = 0 for i < 0. Let F n (u, v) be the joint distribution function of (X n,0, X n 1,0 ) and f n (u, v) its density; let F (u) and F (u, v) be the distribution functions of X n and (X n, X n 1 ) respectively. Write f (10) (u, v) = f(u, v)/ u, f (01) (u, v) = f(u, v)/ v and f(u) = df (u)/du. Recall Corollary 1 for the definition of D n. Corollary 2. Let γ > 1/α and 1 < d < 2. (i) If there exists β > 1 such that a n a n+1 = O(n β ), then for all τ 1, [M (τ) n EM (τ) n ]/ n N(0, σ 2 τ) for some σ 2 τ <. (ii) If γ > 2/α, then (N n EN n )/ n N(0, σ 2 ). (iii) If γ < 1, then N n EN n D n 2[f (01) (l, l) + f (10) (l, l) f(l)]z. (11) Proof. Consider (iii) first. Let f n (10) (u, v) = f n (u, v)/ u and f n (01) (u, v) = f n (u, v)/ v. By properties of the characteristic function of ε i formula that there exist n 0 N and C < for which D(d), it follows from the inversion sup sup u,v R [f n(u, v) + f n (01) (u, v) + f n (10) (u, v) ] < C. (12) n n 0 By Proposition 1, Theorem 2 can be applied to the functional K(x 1, x 2 ) = 1 (x1 l)(x 2 l) 0 E1 (X0 l)(x 1 l) 0. Observe that K (x 1, x 2 ) = F (l x 1 ) + F (l x 2 ) 2F (l x 1, l x 2 ) 2[F (l) F (l, l)]. Then K (0) = ( f(l) + 2f (10) (l, l), f(l) + 2f (01) (l, l)). Using the same argument as in Corollary 1, (11) follows from K (0) n i=1 Y i/d n 2[f (01) (l, l) + f (10) (l, l) f(l)]z. The cases (i) and (ii) similarly follow from Theorem 1 by establishing inequalities like (12). We omit the details since they only involve elementary computations based on the inversion formula between density functions and characteristic functions. 3 Proofs By the smoothing property, E[K(X n ) X k ] = K n k (X n,k ) holds for k n. Let {ε i, i Z} be an independent copy of {ε i, i Z} and X n,1 = a n 1 ε 1+X n,0. Then E[K n 1 (X n,1) X 1 ] = 7

K n (X n,0 ) almost surely. In the proofs below, we will use these claims. We omit the proof of the following obvious lemma. Lemma 1. Assume E( Z α ) <. Then for q 1 α q 2 0, as M, E[ Z/M q 1 1 Z M ] + E[ Z/M q 2 1 Z >M ] = O(1/M α ). Proof of Theorem 1. First we show that, under (a), P 1 K(X n ) <. (13) By Cauchy s inequality, n=1 P 1 K(X n ) = E[K n 1 (X n,1) K n 1 (X n,1 ) X 1 ] K n 1 (X n,1) K n 1 (X n,1 ) [K n 1 (X n,1) K n 1 (X n,1 )] 1 X n,1 X n,1 1 + [K n 1 (X n,1) K n 1 (X n,1 )] 1 X n,1 X n,1 >1. Let = X n,1 X n,1 = a n 1 (ε 1 ε 1 ). Then by Hölder s inequality and Lemma 1, [K n 1 (X n,1) K n 1 (X n,1 )] 1 1 2 E{L 2 K n 1 (X n,1 ) 2 1 1 } L 2 K n 1 (X n,1 ) p 2 1 1 q = O[E( α 1 1 )] 1 q = O( an 1 α q ). Similarly, K n 1 (X n,1 ) 1 >1 2 K 2 n 1(X n,1 ) p 1 >1 q = O( a n 1 α q ). Thus (13) follows from (2), and n=1 P 1K(X n ) converges to ξ (say) in L 2. By Theorem 1 in Woodroofe (1992), {S n (K) E[S n (K) X 0 ]}/ n N(0, ξ 2 ). To conclude the proof, it remains to establish E[S n (K) X 0 ] 2 = o(n). To this end notice that P j S n (K), j = 0, 1,..., are orthogonal and E[S n (K) X 0 ] 2 = P j S n (K) 2 j=0 = O [ j=0 [ n P j K(X i ) j=0 i=1 ] n P 1 K(X i+j+1 ) = o(n) i=1 ] 2 by (13). Since arguments for (b) are similar, we omit the details. 8

Proof of Theorem 2. Let V i = K(X i ) K (0)X i. Clearly P j R n, j =..., 1, 0,..., n, are martingale differences. Then by Bahr-Esseen s (cf. Avram and Taqqu, 1986) and Minkowski s inequalities, R n ν ν 2 n j= P j R n ν ν 2 n j= [ n ] ν P j V i ν = 2 i=1 Write θ n = a n 1 α qν + an 1 A 1 qν n (α) and Θ n n j= [ n ] ν P 1 V i j+1 ν. i=1 = n k=0 θ n. By Karamata s theorem, Θ n = O(n 1 β ) and j=n θν j = O(n 1 νβ ). Observe that P k V i = 0 for k > i. Then (6) implies [ n n ] ν [ n n ] ν [ n ] ν P 1 V i j+1 ν = P 1 V i j+1 ν n P 1 V i ν = O(nΘ ν n). j=1 i=1 j=1 i=j On the other hand, since l(n) is slowly varying, [ 0 n ] ν [ 0 P 1 V i j+1 ν = j= i=1 j= n+1 + n j= ] i=1 = O(nΘ ν 2n) + n ν O(θj ν ), j=n (Θ n j+1 Θ 1 j ) ν which clearly implies (7). Proof of Proposition 1. Let U = K n (X n,1 ) K n (X n,0 ) K n(x n,0 )a n 1 ε 1. Then 1 2 ν E( U ν ) E( U1 an 1 ε 1 1 ν ) + E( U1 an 1 ε 1 >1 ν ) E( L K n (X n,0 ) a n 1 ε 1 2 1 an 1 ε 1 1 ν ) + 3 ν E[ K n (X n,1 )1 an 1 ε 1 >1 ν ] + 3 ν E[ K n (X n,0 )1 an 1 ε 1 >1 ν ] + 3 ν E[ K n(x n,0 ) a n 1 ε 1 1 an 1 ε 1 >1 ν ]. By Lemma 1 and (8), the first, third and fourth terms in the proceeding display are of order O( a n 1 α ). The second one, by Hölder s inequality, has order K n (X n,1 ) ν p 1 an 1 ε 1 >1 q = O( a n 1 α q ). So, U ν = O( a n 1 α νq ). Let V = Kn 1 (X n,1 + a n 1 ε 1) K n 1 (X n,1 ) K n 1(X n,1 )a n 1 ε 1. Similarly, V ν = O( a n 1 α νq ), which implies Kn (X n,1 ) K n 1 (X n,1 ) ν = O( a n 1 α νq ) in view of the fact that Kn (x) K n 1 (x) = E[K n 1 (x + a n 1 ε 1) K n 1 (x) a n 1 ε 1K n 1(x)]. Hence K n 1 (X n,1 ) K n (X n,0 ) K n(x n,0 )a n 1 ε 1 ν = 9

O( a n 1 α νq ). To conclude the proof, it suffices to verify that K n (X n,0 ) K (0) ν = O[A 1 νq n (α)]. Since K (ι) = EK n (ι + X n,0 ), K (ι) K (0) ι K EK n(x n,0 ) E n (ι + X n,0 ) K n (X n,0 ) K ι n(x n,0 ) ι E L Kn (X n,0 ), implying that EK n(x n,0 ) = K (0) for sufficiently large n by letting ι 0. Let X n,0 = i=n a iε n i and = X n,0 X n,0. Then E( α ) = O[A n (α)]. By Jensen s inequality, K n(x n,0 ) K (0) ν = E[K n(x n,0 ) K n(x n,0) X 0 ] ν K n(x n,0 ) K n(x n,0) ν [K n(x n,0 ) K n(x n,0)] 1 1 ν + [K n(x n,0 ) K n(x n,0)] 1 >1 ν which, by (8) and Lemma 1, is of order O[A 1 νq n (α)] by using the same arguments as in the proof of Theorem 1. 4 An Open Problem Theorem 2 can be viewed as a first order expansion of S n (K); namely it is approximated by a linear functional n i=1 X i. Do higher order expansions exist? The issue is well understood when the innovations have finite fourth moment, see Ho and Hsing (1997). Such expansions would enable one to obtain finer results than Corollary 1 in the degenerate case K (0) = 0. Acknowledgment. The author is grateful to Tailen Hsing and the referees for helpful suggestions. References Avram, F. and Taqqu, M.S. (1986), Weak convergence of moving averages with infinite variance. In Dependence in Probability and Statistics: A Survey of Recent Results (Eberlein and Taqqu, eds.), 399-416. Birkhauser, Boston. Chow, Y. S. and Teicher, H. (1988). Probability Theory. Springer Verlag, New York. 10

Gordin, M. I. and Lifsic, B. (1978). The central limit theorem for stationary Markov processes. Doklady 19, 392 394. Ho, H.-C. and Hsing, T. (1997). Limit theorems for functionals of moving averages. Ann. Probab. 25, 1636-1669. Hsing, T. (1999). On the asymptotic distributions of partial sums of functionals of infinitevariance moving averages. Ann. Probab. 27, 1579-1599. Ibragimov, I. A. and Yu. V. Linnik (1971). Independent and stationary sequences of random variables. Wolters-Noordhoff, Groningen. Kedem, B. (1994). Time series analysis by higher order crossings. IEEE Press, Piscataway, NJ. Koul, H. and Surgailis, D. (2001). Asymptotics of empirical processes of long memory moving averages with infinite variance. Stochastic Process. Appl. 91, 309 336. Surgailis, D., (2002). Stable limits of empirical processes of moving averages with infinite variance. Stochastic Process. Appl. 100, 255 274. Woodroofe, M. (1992). A central limit theorem for functions of a Markov chain with applications to shifts. Stochastic Process. Appl. 41, 33 44. Wu, W. B. and J. Mielniczuk (2002). Kernel density estimation for linear processes. Ann. Statist. 30, 1441-1559. Wu, W. B. (2002). Central limit theorems for functionals of linear processes and their applications. Statist. Sinica 12, 635 649. Wu, W. B. (2003). Empirical Processes of Long-memory Sequences. To appear, Bernoulli. Department of Statistics, The University of Chicago, 5734 S. University Avenue, Chicago, IL 60637, U.S.A. E-mail: wbwu@galton.uchicago.edu 11