A generalization of Strassen s functional LIL

Similar documents
arxiv:math/ v2 [math.pr] 16 Mar 2007

The Codimension of the Zeros of a Stable Process in Random Scenery

LARGE DEVIATION PROBABILITIES FOR SUMS OF HEAVY-TAILED DEPENDENT RANDOM VECTORS*

Self-normalized Cramér-Type Large Deviations for Independent Random Variables

STA205 Probability: Week 8 R. Wolpert

On the Kolmogorov exponential inequality for negatively dependent random variables

Weighted uniform consistency of kernel density estimators with general bandwidth sequences

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS

On the Set of Limit Points of Normed Sums of Geometrically Weighted I.I.D. Bounded Random Variables

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University

UNIFORM IN BANDWIDTH CONSISTENCY OF KERNEL REGRESSION ESTIMATORS AT A FIXED POINT

A Law of the Iterated Logarithm. for Grenander s Estimator

Large deviations of empirical processes

On the law of the iterated logarithm for Gaussian processes 1

Krzysztof Burdzy University of Washington. = X(Y (t)), t 0}

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

A Note on the Central Limit Theorem for a Class of Linear Systems 1

Almost sure limit theorems for U-statistics

Estimates for probabilities of independent events and infinite series

Limit Thoerems and Finitiely Additive Probability

Brownian motion. Samy Tindel. Purdue University. Probability Theory 2 - MA 539

Ergodic Theorems. Samy Tindel. Purdue University. Probability Theory 2 - MA 539. Taken from Probability: Theory and examples by R.

Probability and Measure

4 Sums of Independent Random Variables

Self-normalized laws of the iterated logarithm

Empirical Processes: General Weak Convergence Theory

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

A PECULIAR COIN-TOSSING MODEL

Weighted uniform consistency of kernel density estimators with general bandwidth sequences

Lecture 9. d N(0, 1). Now we fix n and think of a SRW on [0,1]. We take the k th step at time k n. and our increments are ± 1

On almost sure rates of convergence for sample average approximations

Random Bernstein-Markov factors

Characterizations of Weibull Geometric Distribution

IEOR 6711: Stochastic Models I Fall 2013, Professor Whitt Lecture Notes, Thursday, September 5 Modes of Convergence

Limit Theorems for Quantile and Depth Regions for Stochastic Processes

Introduction to Bases in Banach Spaces

Problem set 1, Real Analysis I, Spring, 2015.

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

Introduction to Self-normalized Limit Theory

NOTES ON FRAMES. Damir Bakić University of Zagreb. June 6, 2017

A SKOROHOD REPRESENTATION AND AN INVARIANCE PRINCIPLE FOR SUMS OF WEIGHTED i.i.d. RANDOM VARIABLES

Notes 9 : Infinitely divisible and stable laws

Chapter 5. Weak convergence

STAT 7032 Probability Spring Wlodek Bryc

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535

Introduction and Preliminaries

The Convergence Rate for the Normal Approximation of Extreme Sums

The strictly 1/2-stable example

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

New Versions of Some Classical Stochastic Inequalities

The Lindeberg central limit theorem

The Central Limit Theorem: More of the Story

On pathwise stochastic integration

A Note on the Strong Law of Large Numbers

Research Article Exponential Inequalities for Positively Associated Random Variables and Applications

Part II Probability and Measure

Prime numbers and Gaussian random walks

HOPF S DECOMPOSITION AND RECURRENT SEMIGROUPS. Josef Teichmann

Chapter 4. Measure Theory. 1. Measure Spaces

Lecture Notes 3 Convergence (Chapter 5)

Laws of the iterated logarithm for α-time Brownian motion

COMPOSITION SEMIGROUPS AND RANDOM STABILITY. By John Bunge Cornell University

On the local time of random walk on the 2-dimensional comb

ON THE ZERO-ONE LAW AND THE LAW OF LARGE NUMBERS FOR RANDOM WALK IN MIXING RAN- DOM ENVIRONMENT

Lecture 12. F o s, (1.1) F t := s>t

n E(X t T n = lim X s Tn = X s

Weighted Sums of Orthogonal Polynomials Related to Birth-Death Processes with Killing

Risk Bounds for Lévy Processes in the PAC-Learning Framework

A NOTE ON A DISTRIBUTION OF WEIGHTED SUMS OF I.I.D. RAYLEIGH RANDOM VARIABLES

Course 212: Academic Year Section 1: Metric Spaces

Asymptotic Inference for High Dimensional Data

Rare Events in Random Walks and Queueing Networks in the Presence of Heavy-Tailed Distributions

1 Sequences of events and their limits

RATIO ERGODIC THEOREMS: FROM HOPF TO BIRKHOFF AND KINGMAN

Some Background Math Notes on Limsups, Sets, and Convexity

Locally convex spaces, the hyperplane separation theorem, and the Krein-Milman theorem

Probability and Measure

Entropy dimensions and a class of constructive examples

A Concise Course on Stochastic Partial Differential Equations

Lyapunov optimizing measures for C 1 expanding maps of the circle

Lecture I: Asymptotics for large GUE random matrices

{σ x >t}p x. (σ x >t)=e at.

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications.

Brownian Motion and Conditional Probability

Integral Jensen inequality

P (A G) dp G P (A G)

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

Useful Probability Theorems

On the expected diameter of planar Brownian motion

The Favorite Point of a Poisson Process

THE ASYMPTOTICS OF L-STATISTICS FOR NON I.I.D. VARIABLES WITH HEAVY TAILS

Probability and Measure

ONE DIMENSIONAL MARGINALS OF OPERATOR STABLE LAWS AND THEIR DOMAINS OF ATTRACTION

Exponential martingales: uniform integrability results and applications to point processes

Large Sample Theory. Consider a sequence of random variables Z 1, Z 2,..., Z n. Convergence in probability: Z n

(A n + B n + 1) A n + B n

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

MATH 6605: SUMMARY LECTURE NOTES

Sharpness of second moment criteria for branching and tree-indexed processes

Transcription:

A generalization of Strassen s functional LIL Uwe Einmahl Departement Wiskunde Vrije Universiteit Brussel Pleinlaan 2 B-1050 Brussel, Belgium E-mail: ueinmahl@vub.ac.be Abstract Let X 1, X 2,... be a sequence of i.i.d. mean zero random variables and let S n denote the sum of the first n random variables. We show that whenever we have with probability one, lim sup n S n /c n = α 0 < for a regular normalizing sequence {c n }, the corresponding normalized partial sum process sequence is relatively compact in C[0, 1] with canonical cluster set. Combining this result with some LIL type results in the infinite variance case, we obtain Strassen type results in this setting. Short title: General Strassen-type results AMS 2000 Subject Classifications: 60F15, 60G50. Keywords: Hartman-Wintner LIL, Strassen- type results, infinite variance, very slowly varying functions, sums of i.i.d. random variables, strong invariance principle. Research partially supported by an FWO-Vlaanderen Grant 1

1 Introduction Let {X, X n ; n 1} be a sequence of real-valued independent and identically distributed (i.i.d.) random variables, and let S n = n i=1 X i, n 1. Define Lx = log e max{e, x} and LLx = L(Lx) for x IR. The classical Hartman-Wintner law of the iterated logarithm (LIL) states that (1.1) lim sup ±S n /(2nLLn) 1/2 = σ < a.s. n if and only if (1.2) EX = 0 and σ 2 = EX 2 <. Starting with the work of Feller (1968) there has been quite some interest in finding extensions of the Hartman-Wintner LIL to the infinite variance case, especially for variables X in the domain of attraction to the normal distribution. Recall that this means that one can find a centering sequence {β n } IR and a normalizing sequence α n so that S n β n α n d Y normal(0, 1), where d stands for convergence in distribution. Moreover, it is known that in this case, one has E X p < for all 0 < p < 2 and a possible choice for the centering sequence is given by β n = nex. The normalizing sequence α n can be chosen of the form ng(n) where g : [0, [ [0, [ is non-decreasing and slowly varying at infinity, that is we have lim t g(et)/g(t) = 1. Under an additional symmetry assumption (which turned out to be unnecessary later) Feller (1968) proved in this case that (1.3) lim sup ±S n /α [nlln] = 2 a.s. n if and only if (1.4) EX = 0 and P{ X α [nlln] } <. It is easy to see that this result implies the Hartman-Wintner LIL since random variables satisfying condition (1.2) are trivially in the domain of attraction to the normal distribution with α n = σ n and β n = nex = 0. Though this result is more general than the Hartman-Wintner LIL it was not the final word in the infinite variance case. The main problem is condition (1.4) which is not satisfied for many distributions of interest, most notably it cannot be applied for the distribution determined by the density f(x) = 1/ x 3, x 1. Another shortcoming is that it is essentially restricted to 2

the domain of attraction case. This latter problem has been addressed by Klass (1976, 1977), Pruitt (1981), Li and Tomkins (2003) among other authors, but finding a normalizing sequence for the aforementioned example remained an open problem. Einmahl and Li (2005) looked at the following modification of the LIL problem: Given a sequence, a n = nh(n), where h is a slowly varying non-decreasing function, when does one have with probability one, 0 < lim sup S n /a n <? n Somewhat surprisingly, it turned out that the Hartman-Wintner LIL could be generalized to a law of the very slowly varying function, which is general enough to also give a normalizing sequence for the above example, that is for random variables with the distribution determined by the density f(x) = 1/ x 3, x 1. A natural question is now whether one can also find functional Strassen type versions of these results. All the results of this type known to us are restricted to the domain of attraction case (see, for instance, Kuelbs (1985), Einmahl (1989)). We shall show that whenever one has with probability one lim sup n S n /c n = α 0 < and the normalizing sequence {c n } is sufficiently regular, a functional LIL type result with canonical cluster set holds. Our result implies among other things that the recent results of Einmahl and Li (2005) and also the two-sided version of the Klass (1976) LIL can be improved to functional LIL type results. 2 Statement of results To formulate our main result we need some extra notation. Let S (n) : Ω C[0, 1] the partial sum process of order n, that is, S (n) (t) = S [nt] + (nt [nt])x [nt]+1, 0 t 1. Strassen (1964) proved under condition (1.2) that with probability one, {S (n) / 2nLLn; n 1} is relatively compact in C[0, 1] and C({S (n) / 2nLLn; n 1}) = σk, where C({x n : n 1}) denotes the set of limit points of the sequence {x n } C[0, 1] which we shall call the cluster set of {x n } and K = {f C[0, 1] : f(t) = t 0 g(u)du, 0 t 1, where 1 0 g 2 (u)du 1}. 3

Recall that the Strassen LIL implies for any continuous functional ψ : C[0, 1] IR: With probability one, {ψ(s (n) / 2nLLn); n 1} is bounded and the corresponding cluster set is equal to ψ(σk). Choosing ψ(f) = f(1), f C[0, 1] one re-obtains the Hartman-Wintner LIL, but this is of course just one of many possible applications of the Strassen LIL. Let c n be a sequence of real numbers satisfying the following two conditions, (2.1) c n / n and (2.2) ɛ > 0 m ɛ 1 : c n /c m (1 + ɛ)(n/m), m ɛ m < n. Note that condition (2.2) is satisfied if c n /n is non-increasing or if c n = c(n), where c : [0, ) [0, ) is regularly varying at infinity with exponent γ < 1. Let H be the truncated second moment function of the random variable X, that is Set α 0 = sup { H(t) = EX 2 I{ X t}, t 0. α 0 : ( ) } n 1 exp α2 c 2 n =. 2nH(c n ) Then the main result of the present paper is as follows, Theorem 1 Let X, X 1, X 2,... be i.i.d. mean zero random variables. Assume that (2.3) P{ X c n } <, where c n is a sequence of positive real numbers satisfying conditions (2.1) and (2.2). If α 0 < we have with probability one, (2.4) {S (n) /c n ; n 1} is relatively compact in C[0, 1] and (2.5) C({S (n) /c n ; n 1}) = α 0 K. Remark 1 From relations (2.4) and (2.5) it follows that one has under assumption (2.3) with probability one, (2.6) lim sup S n /c n = α 0 <. n Moreover we have C({S n /c n ; n 1}) = [ α 0, α 0 ]. Therefore, the above result implies Theorem 3 of Einmahl and Li (2005) if α 0 <. It is easy to see that condition (2.3) is necessary for (2.6) and thus also for (2.4) and (2.5) to hold. 4

One of the main difficulties in applying (2.6) is the calculation of the parameter α 0. A key result is Theorem 4, Einmahl and Li (2005) which makes it possible to calculate α 0 (or finding at least bounds) in many cases of interest. Since we now know that the very same parameter determines the cluster set, we immediately have the functional versions of all LIL type results given in Einmahl and Li (2005) once Theorem 1 has been proved. To formulate some of the possible corollaries, we need some extra notation. Let H be the set of all continuous, non-decreasing functions h : [0, [ ]0, [, which are slowly varying at infinity. Set f τ (t) := exp((lt) τ ), 0 τ 1. Let H 0 H be the class of all functions so that lim h(tf τ(t))/h(t) = 1, 0 < τ < 1. t As this condition is more restrictive than slow variation, we call these functions very slowly varying. Examples for functions in H 0 are h(t) = (Lt) r, r 0 and h(t) = (LLt) p, p 0. Combining Theorem 2, Einmahl and Li (2005) and Theorem 1 we obtain the following corollary which could be called the functional law of the very slowly varying function. Recall that H is the truncated second moment function of the random variable X. Corollary 1 Let X, X 1, X 2,... be i.i.d. random variables, and let S n = n i=1 X i, n 1. Given a function h H 0 set Ψ(x) = xh(x) and a n = Ψ(n), n 1. If there exists a constant 0 < λ < such that (2.7) EX = 0, EΨ 1 ( X ) < and lim sup x then we have with probability one, Ψ 1 (xllx) H(x) = λ2 x 2 LLx 2, (2.8) {S (n) /a n ; n 1} is relatively compact in C[0, 1] and (2.9) C({S (n) /a n ; n 1}) = λk Conversely, relations (2.8) and (2.9) imply condition (2.7). From the above result we can infer, Corollary 2 Let p 1. For any constant 0 < λ < we have with probability one, {S (n) / 2n(LLn) p ; n 1} is relatively compact in C[0, 1] and if and only if C({S (n) / 2n(LLn) p ; n 1}) = λk (2.10) EX = 0 and lim sup(llx) 1 p H(x) = λ 2. x 5

Remark 2 If p = 1 then condition (2.10) is equivalent to EX = 0 and EX 2 = λ 2. We see that the Strassen LIL is a special case of Corollary 2. Note that the condition EΨ 1 ( X ) < has disappeared. This is possible since the condition on H implies it in this case. In fact, we can show the following: Let h H 0 be function satisfying for some 0 < α < 1: h(t) = O(exp((LLt) α ) as t. Then one has: Ψ 1 (xllx) lim sup H(x) < EΨ 1 ( X ) <. x x 2 LLx To see that we note that under the additional assumption on h we have Ψ 1 (y) y 2 /h(y 2 ) as y and therefore we need only to show that E[X 2 /h(x 2 )] <. The assumption on H implies then that H(t) = H(t) 1 = O(h(t 2 )/LLt) as t. Thus, it is sufficient to show that E[X 2 /( H( X )LL X )] <. But given that log( H(t)) = O(log(h(t 2 ))) = O((LLt) α ), we have for some positive constant C that E[X 2 /( H( X )LL X )] CE[X 2 /( H( X ){L H( X )} 1/α )], which is finite since 1/α > 1. ( A related result for random variables in the domain of attraction of the normal distribution is Lemma 7, Einmahl (1995).) Corollary 3 Let r > 0. For any constant 0 < λ < we have with probability one, and if and only if {S (n) / 2n(Ln) r ; n 1} is relatively compact in C[0, 1] C({S (n) / 2n(Ln) r ; n 1}) = λk (2.11) EX = 0, EX 2 /(L X ) r < and lim sup x LLx (Lx) r H(x) = 2r λ 2. We finally show how the two-sided version of the Klass LIL can be improved to a functional LIL. We need a certain function K which is defined for any random variable X : Ω IR with 0 < E X <. Set M(t) := E X I{ X > t}, t 0. Then it is easy to see that the function G(t) := t 2 /(H(t) + tm(t)), t > 0 is continuous and increasing and the function K is defined as its inverse function. Moreover, one has for this function K that as x (2.12) K(x)/ x ( EX 2) 1/2 ]0, ] and, (2.13) K(x)/x 0. Set γ n = 2K(n/LLn)LLn, n 1. Then γ n obviously satisfies conditions (2.1) and (2.2) and we can infer from our theorem, 6

Corollary 4 (Klass LIL) Let X, X 1, X 2,... be iid mean zero random variables. Then we have for the partial sum process sequence S (n) based on these random variables, (2.14) {S (n) /γ n ; n 1} is relatively compact in C[0, 1] and (2.15) C({S (n) /γ n ; n 1}) = K a.s. if and only if (2.16) P{ X γ n } < Our proof of Theorem 1 is based on a new strong invariance principle for sums of i.i.d. random variables with possibly infinite variance which will be proved in Section 3. We shall show that up to an almost sure error term of order o(c n ), the partial sum process sequence S (n) can be approximated by σ n W (n) where σn 2 = H(c n ) EX 2 [0, ] and W (n) (t) = W (nt), 0 t 1 with {W (s), s 0} being a standard Brownian motion. This way we can reduce the investigation of the asymptotic behavior of S (n) /c n to that one of σ n W (n) /c n where some general results on Brownian motion due to Talagrand (1992) and Csáki (1980), respectively, will come in handy. 3 Proof of Theorem 1 Throughout the whole section we shall assume that {c n } is a sequence of positive real numbers satisfying conditions (2.1) and (2.2). Moreover, X, X 1, X 2,... will always be a sequence of i.i.d. mean zero random variables satisfying (3.1) P{ X c n } <. The following lemma collects a number of useful facts which we will need later on. Lemma 1 Assuming (3.1) we have, (3.2) (3.3) (3.4) (3.5) E X 3 I{ X c n }/c 3 n < H(c n ) = EX 2 I{ X c n } = o(c 2 n/n) as n M(c n ) = E X I{ X > c n } = o(c n /n) as n n EXI{ X c k } = o(c n ) as n k=1 7

Proof The first three facts are already stated and proven in Lemma 1 of Einmahl and Li (2005). The last fact follows by the same argument as on page 2037, Einmahl (1993). Just replace γ n by c n and use conditions (2.1) and (2.2). The crucial tool for our proof is the following infinite variance version of a strong invariance principle given in Einmahl and Mason (1993). Theorem 2 Let X be a mean zero random variable satisfying condition (3.1). If the underlying p-space (Ω, F, P) is rich enough, one can define a sequence {X n } of independent copies of X and a standard Brownian motion {W (t); t 0} such that we have as n (3.6) S (n) σ n W (n) = o(c n ) a.s. where and W (n) (t) = W (nt), 0 t 1. σ 2 n = H(c n ), n 1 Proof (STEP 1) We first note that using Theorem 3 of Einmahl (1988) in conjunction with fact (3.5), we can construct a sequence {X n } of independent copies of X and independent standard normal random variables {Y n } such that as n (3.7) n (X j τ j Y j ) = o(c n ) a.s. j=1 where τ 2 n = Var(XI{ X c n }), n 1. (STEP 2) We next claim that (3.8) n σ n τ n Y n converges a.s. c n This implies via the Kronecker lemma that as n (3.9) n (σ j τ j )Y j = o(c n ) a.s. j=1 To prove (3.8) we note that on account of EX = 0, (σ n τ n ) 2 c 2 n = σn 2 τn 2 c 2 n (EXI{ X > c n }) 2 c 2 n ɛ 2 n n 2 < 8

where ɛ n = ne X I{ X > c n }/c n 0 (see relation (3.4)). (STEP 3) Set We show that as k k n := max (σ n σ j )Y j, n 1 1 k n j=1 (3.10) 2 k/c 2 k 0 a.s. To that end we note that by Kolmogorov s maximal inequality, 2l P{ 2 l ɛc 2 l} ɛ 2 (σ 2 l σ k ) 2 k=1 c 2 2 l 2 l ɛ 2 k=1 σ 2 2 l σ 2 k c 2 2 l Set p j = P{c j 1 < X c j }, j 1 (c 0 = 0) Then we have: 2 l l=1 k=1 σ 2 2 l σ 2 k c 2 2 l = = 2 l 2 l l=1 k=1 j=k+1 p j c 2 j/c 2 2 l 2 l j 1 ( 1)p j c 2 j/c 2 2 l l=1 j=2 jp j c 2 j j=2 k=1 l:2 l j j=2 2 l jp j c 2 j/c 2 2 l l=1 j=2 c 2 j 2 p 2 l j 2 l 2 jp j < l:2 l j j=2 Here we have used the fact that c 2 j/c 2 2 l j/2 l if j 2 l which follows from condition (2.1). We now obtain relation (3.10) via the Borel-Cantelli lemma. (STEP 4) Note that max n 2 l+1 + (σ 2 l+1 σ 2 l) max T m, 2 l <n 2 l+1 1 m 2 l+1 where T m = m j=1 Y j, m 1. Moreover, we have by Lévy s inequality, { } P (σ 2 l+1 σ 2 l) max T m ɛc 2 l 1 m 2 l+1 2P {(σ 2 l+1 σ 2 l) T 2 l+1 ɛc 2 l} 2ɛ 2 (σ 2 l+1 σ 2 l) 2 2 l+1 c 2 2 l 2ɛ 2 (σ 2 2 σ 2 l+1 2 )2 l+1 c 2 l 2 l 9

Next observe that due to the monotonicity of σ n and in view of STEP 3 we have l=1 2 l σ2 2 l+1 σ 2 2 l c 2 2 l 2 l l=1 k=1 σ 2 2 l+1 σ 2 k c 2 2 l+1 c2 2 l+1 c 2 2 l <. Here we have used the fact that c 2 l+1/c 2 l = O(1) due to assumption (2.2). We can infer via Borel-Cantelli that as l, (3.11) (σ 2 l+1 σ 2 l) max 1 m 2 l+1 T m = o(c 2 l) a.s. In view of relation (3.10) this means that as n (3.12) n /c n 0 a.s. (STEP 5) Combining relations (3.7), (3.9) and (3.12) we conclude that as n k max (X j σ n Y j ) 1 k n = o(c n) a.s. j=1 Let T (n) be the partial sum process (of order n) determined by the sums T k, 1 k n. Then we can infer from the above relation that as n (3.13) S (n) σ n T (n) = o(c n ) a.s. W.l.o.g. we can assume that there exists a standard Brownian motion {W (t) : t 0} satisfying W (n) = T n, n 1 which of course implies that T (n) (k/n) = W (n) (k/n) = T k, 1 k n. By a standard argument (see, for instance, p. 485 of Einmahl and Mason (1993)), it follows that (3.14) T (n) W (n) = O( log n) a.s. Recalling fact (3.3), we find that (3.15) σ n T (n) σ n W (n) = O(σ n log n) = o(cn log n/n) = o(cn ) a.s. Combining relations (3.13) and (3.15) we obtain the assertion. Remark 3 After some modification one can prove the above theorem more generally for sequences {c n } satisfying condition (2.2) and instead of condition (2.1): α 1/3 : c n /n α is noncreasing. Then this result also includes the strong invariance principle given in Einmahl and Mason (1993) so that the above theorem is in fact an extension of the earlier result. For our purposes the version above, however, is more than sufficient and it should be clear to the interested reader how to get the more general version. 10

Due to invariance, it is now sufficient to prove the functional LIL type results for σ n W (n) /c n. Here we proceed similarly as in the classical proof by Strassen (1964). We first show for ɛ > 0, (3.16) P{σ n W (n) /c n (α 0 K) ɛ eventually} = 1. This relation combined with the compactness of K already implies that with probability 1, {σ n W (n) /c n ; n 1} is relatively compact in C[0, 1]. Moreover we can infer that with probability one, (3.17) C({σ n W (n) /c n ; n 1}) α 0 K. Thus it remains to prove that with probability one, (3.18) C({σ n W (n) /c n ; n 1}) α 0 K. Given that C({σ n W (n) /c n ; n 1}) is a closed subset of C[0, 1] as a cluster set, it suffices to show that we have for any function f in the interior of α 0 K and δ > 0, (3.19) P{ f σ n W (n) /c n < δ infinitely often} = 1 To prove (3.16) and (3.19) we need two further lemmas. We first show that the parameter α 0 remains unchanged if we replace H(c n ) by H(δc n ), where 0 < δ < 1. Though this also follows from Theorem 3, Einmahl and Li (2005), we prefer to give the direct (simple) proof since this makes our proof of Theorem 1 completely independent of the proofs given by Einmahl and Li (2005) who used among other tools a non-trivial result of Kesten (1970) on general cluster sets. Lemma 2 Let α 0 be defined as in Theorem 1. Set { ( ) } α 0 = sup α 0 : n 1 exp α2 c 2 n =, 2nH(δc n ) where 0 < δ < 1. Then we have: α 0 = α 0. Proof Note that by monotonicity of H we obviously have: α 0 α 0. Set n = H(c n ) H(δc n ). Then we trivially have n /c 2 n δ 1 E X 3 I{ X c n }/c 3 n and fact (3.2) implies that (3.20) c 2 n n < Next observe that for any 0 < ɛ < 1, ( ) 1 n exp α2 c 2 n = 1 ) ( 2nH(c n ) n exp α 2 c 2 n 2n(H(δc n ) + n 1 ) ( n exp α 2 c 2 n + 1n ( ) 2n(1 + ɛ)h(δc n ) exp α 2 c 2 n 2n(1 + ɛ 1 ) n 1 ) ( n exp α 2 c 2 n + 2(1 + ɛ 1 ) n. 2n(1 + ɛ)h(δc n ) α 2 c 2 n 11

In view of relation (3.20) this means that α 0 1 + ɛ α 0. Since we can choose ɛ arbitrarily small, we find that α 0 α 0. Lemma 3 Let n j a subsequence satisfying for large enough j, 1 < a 1 < n j+1 /n j a 2 < Then we have: (3.21) ( exp j=1 α2 c 2 n j 2n 2 j σ2 n j ) { = if α < α 0 < if α > α 0, where σ 2 n = H(c n ). Proof. Let σ 2 n = H(δc n ), where 0 < δ < 1 will be specified later. Let α < α 0 = α 0. (See Lemma 2.) Since c n / n is non-decreasing, we have = n j+1 j=j 0 n=n j +1 (a 2 1) (a 2 1) ( 1 n exp α2 c 2 n 2n σ n 2 ( j=j 0 exp j=j 0 exp ) α2 c 2 n j 2n j σ n 2 j+1 ( ) α2 c 2 n j 2n j σ 2 n j provided we have chosen δ small enough so that σ n 2 j+1 = H(δc nj+1 ) H(c nj ) = σn 2 j. This is possible since c nj+1 /c nj = O(n j+1 /n j ) = O(1) (recall condition (2.2)). The other part of the lemma follows similarly. Proof of (3.16) Our proof is based on the following inequality due to Talagrand (1992): Let U be the unit ball in (C[0, 1], ). There exists an absolute constant C > 0 so that for all t > 0 and η > 0, ( (3.22) P{W tk + ηu} exp Cη 2 ηt ) 2 t2 2 Recall that σn 2 = H(c n ). Using the fact that W d (n) = nw, we get: { P{σ n W (n) /c n α 0 K + ɛu} = P 12 ) W α 0c n K + nσ 2 n ɛc n nσ 2 n U ( exp C nσ2 n α ) 0(α 0 + ɛ)c 2 n ɛ 2 c 2 n 2nσn 2 }

From fact (3.3) we can infer that if n is large enough, these probabilities are bounded above by ( ) 2 exp α2 1c 2 n, 2nσn 2 where α 1 = α 0 (α 0 + ɛ) > α 0. Let ɛ > 0 be fixed and set n j = [(1 + ɛ) j ], j 1. Using (3.21) in conjunction with the above bound we find that P{σ nj W (nj )/c nj α 0 K + ɛu} < j=1 In view of the Borel-Cantelli lemma this means that and consequently P{σ nj W (nj )/c nj α 0 K + ɛu eventually} = 1 (3.23) lim sup d(σ nj W (nj )/c nj, α 0 K) ɛ a.s. j where as usual d(f, α 0 K) = inf g α0 K f g, f C[0, 1]. Consider now n j n n j+1. From the definition of K it is clear that As we trivially have f α 0 K f(nt/n j+1 ) α 0 K. W (n) (t) = W (nj+1 )(nt/n j+1 ), 0 t 1 we obtain for n j n n j+1 and large enough j ( σn W (n) (3.24) d, c ) n j+1 α 0 K 2ɛ a.s. c n c n Recall that σ n is non-decreasing and use the fact that tk K, t 1. From assumption (2.2) and the definition of the subsequence {n j } we get for n j n n j+1 and large j, c nj+1 /c n c nj+1 /c nj 1 + 2ɛ. Therefore, we have with probability one for large j, ( ) σn W (n) (3.25) d, (1 + 2ɛ)α 0 K 2ɛ, n j n n j+1. c n Since (1 + a)α 0 K α 0 K + aα 0 U, a > 0, we can conclude that for large enough n ( ) σn W (n) (3.26) d, α 0 K 2(1 + α 0 )ɛ a.s. c n 13

and we have proven relation (3.16). Proof of (3.19) To simplify notation, we set for f C[0, 1]: ( 1/2 1 I(f) = (f (u)) du) 2 if f exists 0 otherwise We use the following inequality given in Lemma 2 of Csáki (1980). (3.27) P{ W f z} exp( I 2 (f)/2)p{ W z}, f C[0, 1], z > 0. W.l.o.g. we shall assume that 0 < α 0 <. Let δ > 0 be fixed. Set n j = m j, j 1, where m 2(1 + α0/δ 2 2 ) is a natural number. First note that if j 2 { } ( ) P sup σ nj W (n j t) δc nj 2 exp δ2 c 2 n j 0 t 1/m 2n j 1 σn 2 j ( ) ( ) = 2 exp mδ2 c 2 n j 2n j σ 2 n j 2 exp since m 2α0/δ 2 2. Recalling relation (3.21) we find that { } (3.28) P sup σ nj W (n j t) δc nj < 0 t 1/m j=2 Therefore by the Borel-Cantelli lemma, 2α2 0c 2 n j 2n j σ 2 n j, (3.29) lim sup j sup 0 t 1/m σ nj W (n j t) c nj δ a.s. Moreover, it is easy to see that (3.30) sup f(t) α 0 / m δ, f α 0 K. 0 t 1/m Combining relations (3.29) and (3.30), we can conclude for f α 0 K that (3.31) lim inf j f σ n j W (nj )/c nj 2δ + lim inf j which in turn (again by relation (3.29)) is 3δ + lim inf j Z j, 14 sup f(t) σ nj W (n j t)/c nj 1/m t 1

where Z j := sup f(t) σ nj {W (n j t) W (n j 1 )}/c nj, j 2. 1/m t 1 These variables are independent so that in view of the second Borel-Cantelli lemma we have lim inf j Z j 2δ a.s. and consequently (3.32) lim inf j if we can show, f σ n j W (nj )/c nj 5δ a.s. (3.33) P{Z j 2δ} =. j=2 It remains to prove (3.33) for functions f in the interior of α 0 K that is under the assumption I(f) < α 0. To that end we first note that { } P{Z j 2δ} P sup f(t) σ nj W (n j t)/c nj δ, σ nj W (n j 1 ) δc nj 1/m t 1 P{ f σ nj W (nj )/c nj < δ} P{σ nj W (n j 1 ) δc nj } Since j P{σ n j W (n j 1 ) δc nj } < (use (3.28)), we see that (3.33) is proven once we have shown that (3.34) I(f) < α 0 P{ f σ nj W (nj )/c nj < δ} = j=1 Applying inequality (3.27), we find that P{ f σ nj W (nj )/c nj < δ} = P{ W c nj f/( n j σ nj ) δc nj /( n j σ nj )} ( ) exp I2 (f)c 2 n j 2n j σ 2 n j P{ W δc nj /( n j σ nj )} where we have used the trivial fact that I(af) = ai(f), a > 0. Recalling that c n / nσ 2 n (see (3.3)), we can conclude that for large j P{ f σ nj W (nj )/c nj < δ} 1 2 exp ( I2 (f)c 2 n j 2n j σ 2 n j In view of (3.21) it is now evident that (3.34) holds which in turn implies (3.32). We now see that relation (3.19) holds. ). 15

REFERENCES Csáki, E. (1980) A relation between Chung s and Strassen s laws of the iterated logarithm. Z.Wahrsch. Verw. Gebiete 54 287 301. Einmahl, U. (1988) Strong approximations for partial sums of i.i.d. B-valued r.v. s in the domain of attraction of a Gaussian law. Probab. Theory Related Fields 77 65 85. Einmahl, U. (1989) Stability results and strong invariance principles for partial sums of Banach space valued random variables. Ann.Probab. 17 332 352. Einmahl, U. (1993). Toward a general law of the iterated logarithm in Banach space. Ann. Probab. 21 2012-2045. Einmahl, U. and Li, D. (2005) Some results on two-sided LIL behavior Ann. 33 1601 1624. Probab. Einmahl, U. and Mason, D. (1993) Rates of clustering in Strassen s LIL for partial sum processes Probab. Th. Rel. Fields 97 479 487. Feller, W. (1968). An extension of the law of the iterated logarithm to variables without variance. J. Math. Mech. 18 343-355. Kesten, H. (1970). The limit points of a normalized random walk. Ann. Math. Statist. 41 1173-1205. Klass, M. (1976). Toward a universal law of the iterated logarithm I. Z. Wahrsch. Verw. Gebiete 36 165-178. Klass, M. (1977). Toward a universal law of the iterated logarithm II. Z. Wahrsch. Verw. Gebiete 39 151-165. Kuelbs, J. (1985). The LIL when X is in the domain of attraction of a Gaussian law. Ann. Probab. 13 825 859. Li, D. L. and Tomkins, R. J. (2003). The law of the logarithm for weighted sums of independent random variables. J. Theoretical Probab. 16 519 542. Pruitt, W. (1981). General one-sided laws of the iterated logarithm. Ann. Probab. 9 1-48. Strassen, V. (1964). An invariance principle for the law of the iterated logarithm. Z. Wahrsch. Verw. Geb. 3 211-226. Talagrand, M. (1992). On the rate of clustering in Strassen s LIL for Brownian motion. In: Probability in Banach spaces 8 (R. Dudley, M. Hahn, J. Kuelbs, eds.) 339 347. Birkhäuser, Boston. 16