The Mann-Whitney U-statistic for α-dependent sequences

Size: px
Start display at page:

Download "The Mann-Whitney U-statistic for α-dependent sequences"

Transcription

1 The Ma-Whitey U-statistic for α-depedet sequeces Jérôme Dedecker a ad Guillaume Saulière b arxiv: v [math.st] 2 Nov 206 November 22, 206 a Uiversité Paris Descartes, Sorboe Paris Cité, Laboratoire MAP5 (UMR 845). jerome.dedecker@parisdescartes.fr b Uiversité Paris Sud, Istitut de Recherche biomédicale et d Epidémiologie du Sport (IRMES), Istitut Natioal du Sport de l Expertise et de la Performace (INSEP). Guillaume.SAULIERE@isep.fr Abstract We give the asymptotic behavior of the Ma-Whitey U-statistic for two idepedet statioary sequeces. The result applies to a large class of short-rage depedet sequeces, icludig may o-mixig processes i the sese of Roseblatt [5]. We also give some partial results i the lograge depedet case, ad we ivestigate other related questios. Based o the theoretical results, we propose some simple correctios of the usual tests for stochastic domiatio; ext we simulate differet (o-mixig) statioary processes to see that the corrected tests perform well. Itroductio ad mai result Let (X i ) i Z ad (Y i ) i Z be two idepedet statioary sequeces of real-valued radom variables. Let m = m be a sequece of positive itegers such that m as. The Ma- Whitey U-statistic may be defied as U = U,m = m m Xi <Y j. We shall also use the otatios π = P(X < Y ), H Y (t) = P(Y > t) ad G X (t) = P(X < t). It is well kow that, if (X i ) i Z ad (Y i ) i Z are two idepedet sequeces of idepedet ad idetically (iid) radom variables, the U may be used to test π = /2 agaist π /2 (or π > /2, π < /2). Whe π > /2 we shall say that Y stochastically domiates X i a j=

2 weak sese. This property of weak domiatio is true for istace if Y stochastically domiates X, that is if H Y (t) H X (t) for ay real t, with a strict iequality for some t 0. But it also holds i may other situatios, for istace if X ad Y are two Gaussia radom variables with E(X) < E(Y ), whatever the variaces of X ad Y. I this paper, we study the asymptotic behavior of U uder some coditios o the α- depedece coefficiets of the sequeces (X i ) i Z ad (Y i ) i Z. Let us recall the defiitio of these coefficiets. Defiitio.. Let (X i ) i Z be a strictly statioary sequece, ad let F 0 = σ(x k, k 0). For ay o-egative iteger, let where F is the distributio fuctio of X 0. α,x () = sup E ( X x F 0 ) F (x), x R We shall also always require that the sequeces are 2-ergodic i the followig sese. Defiitio.2. A strictly statioary sequece (X i ) i Z is 2-ergodic if, for ay bouded mesurable fuctio f from R 2 to R, ad ay o-egative iteger k lim f(x i, X i+k ) = E(f(X 0, X k )) almost surely. (.) Note that the almost sure limit i (.) always exists, by the ergodic theorem. The property of 2-ergodicity meas oly that this limit is costat. This property is weaker tha usual ergodicity, which would implies the same property for ay bouded fuctio f from R l to R, l N {0}. Our mai result is the followig theorem. Theorem.. Let (X i ) i Z ad (Y i ) i Z be two idepedet statioary 2-ergodic sequeces of real-valued radom variables. Assume that α,x (k) < ad Let (m ) be a sequece of positive itegers such that lim α,y (k) <. (.2) m = c, for some c [0, ). (.3) The the radom variables (U π) coverge i distributio to N (0, V ), with V = Var(H Y (X 0 )) + 2 Cov(H Y (X 0 ), H Y (X k )) + c ( Var(G X (Y 0 )) ) Cov(G X (Y 0 ), G X (Y k )). (.4)

3 Note that the result of Theorem. has bee obtaied by Serflig [6], provided that the more restrictive coditio k 0 ( ) α(k) < ad α(k) = O k log k is satisfied by the two sequeces (X i ) i Z ad (Y i ) i Z, where the α(k) s are the usual α-mixig coefficiets of Roseblatt [5]. (.5) The otio of α-depedece that we cosider i the preset paper is much weaker tha α- mixig i the sese of Roseblatt [5]. Some properties of these α-depedece coefficiets are stated i the book by Rio [4], ad may examples are give i the two papers [7], [8]. Let us recall oe of these examples. Let X i = k 0 a k ε i k (.6) where (a k ) k 0 l 2, ad the sequece (ε i ) i Z is a sequece of idepedet ad idetically distributed real-valued radom variables, with mea zero ad fiite variace. If the cumulative distributio fuctio F of X 0 is γ-hölder for some γ (0, ], the α,x () C ( k a 2 k ) γ γ+2 for some positive costat C (this follows from the computatios i Sectio 6. of [8]). particular, if a k is geometrically decreasig, the so is α,x () (whatever the idex γ). Note that, without extra assumptios o the distributio of ε 0, such liear processes have o reasos to be mixig i the sese of Roseblatt. For istace, it is well kow that the liear process X i = k 0 is ot α-mixig (see for istace []). geometrically decreasig. ε i k 2 k+, where P(ε = /2) = P(ε = /2) = /2, By cotrast, for this particular example, α,x () is To coclude, let us also metio the paper [5] where it is proved that the coefficiets α,x () ca be computed for a large class of dyamical systems o the uit iterval (see Subsectio 6.2 of the preset paper for more details). Remark.. Other types of depedece are cosidered i the papers by Dewa ad Prakasa Rao [] ad Dehlig ad Fried [9]. The paper [] deals with the U-statistics U for two idepedet samples of associated radom variables. The secod paper [9] deals with geeral U-statistics of fuctios of β-mixig sequeces, for either two idepedet samples, or for two I 3

4 adjacet samples from the same time series (see Subsectio 5.2 for a extesio of our results to this more difficult cotext). The class of uctios of β-mixig sequeces cotais also may examples, with a large itersectio with the class of α-depedet sequeces. The advatage of our approach is that it leads to coditios that are optimal i a precise sese (see Sectio 3). O aother had, it seems more difficult to deal with geeral U-statistics i our cotext (some kid of regularity is eeded, for istace we could certaily deal with U-statistics based o fuctios which have bouded variatio i each coordiates). 2 Testig the stochastic domiatio from two idepedet time series The statistic (U /2) caot be used to test H 0 : π = /2 versus oe of the possible alteratives, because uder H 0 the asymptotic distributio of (U /2) depeds of the ukow quatity V defied i (.4). I the ext Propositio, we propose a estimator of V uder some slightly stroger coditios o the depedece coefficiets. Defiitio 2.. Let (X i ) i Z be a strictly statioary sequece of real-valued radom variables, ad let F 0 = σ(x k, k 0). Let f x (z) = z x F (x), where F is the cumulative distributio fuctio of X 0. For ay o-egative iteger, let { } α 2,X () = max α,x (), sup E (f x (X i )f y (X j ) F 0 ) E (f x (X i )f y (X j )). x,y R,j i Propositio 2.. Let (X i ) i Z ad (Y i ) i Z be two idepedet statioary sequeces, ad let (m ) be a sequece of positive ietegers satisfyig (.3). Assume that α 2,X (k) < ad α 2,Y (k) <. (2.) Let ad H m (t) = m m Yj >t ad G (t) = j= Xi <t, ˆγ X (k) = k (H m (X i ) H m )(H m (X i+k ) H m ), ˆγ Y (l) = m l (G (Y j ) Ḡ)(G (Y j+l ) Ḡ), m where H m = H m(x i ) ad mḡ = m j= G (Y j ). Let also (a ) ad (b ) be two sequeces of positive itegers tedig to ifiity as teds to ifiity, such that a = o( ) ad b = 4 j=

5 o( m ). The V = ˆγ X (0) + 2 a coverges i L 2 to the quatity V defied i (.4). ˆγ X (k) + m (ˆγ Y (0) + 2 b l= ˆγ Y (l) Combiig Theorem. ad Propositio 2., we obtai that, uder H 0 : π = /2, if V > 0, the radom variables coverge i distributio to N (0, ). T = (U /2) max{v, 0} Remark 2.. The coditio (2.) is ot restrictive: i all the atural examples we kow, the coefficiets α 2,X (k) behave as α,x (k) (this is the case for istace for the liear process (.6)). Note that, if we assume (2.) istead of (.2), the ergodicity is o loger required i Theorem.. Fially, followig the proof of Theorem 4. page 89 i [2], oe ca also build a cosistet estimator of V uder the coditio α,x (k) < ad k ) (2.2) α,y (k) <, (2.3) k which is ot directly comparable to (2.) (for istace, the first part of (2.) is satisfied if α 2,X (k) = O(k (log k) a ) for some a >, while the first part of (2.3) requires α,x (k) = O(k (log k) a ) for some a > 2). Remark 2.2. The choice of the two sequeces (a ) ad (b ) is a delicate matter. If the coefficiets α 2,X (k) decrease very quickly, the a should icrease very slowly (it suffices to take a 0 i the iid settig). O the cotrary, if α 2,X (k) = O(k (log k) a ) for some a >, the the terms i the covariace series have o reaso to be small, ad oe should take a close to to estimate may of these covariace terms. A data-drive criterio for choosig a ad b is a iterestig (but probably difficult) questio, which is far beyod the scope of the preset paper. However, from a practical poit of view, there is a easy way to proceed: oe ca plot the estimated covariaces ˆγ X (k) s ad choose a (ot too large) i such a way that ˆγ X (0) + 2 a ˆγ X (k) should represet a importat part of the covariace series Var(H Y (X 0 )) + 2 Cov(H Y (X 0 ), H Y (X k )). 5

6 Similarly, oe may choose b from the graph of the ˆγ Y (l) s. As we shall see i the simulatios (Sectio 6), if the decays of the true covariaces γ X (k) ad γ Y (l) are ot too slow, this provides a easy ad reasoable choice for a ad b. 3 Log rage depedece I this sectio, we shall see that the coditio (.2) caot be weakeed for the validity of Theorem.. More precisely, we shall give some examples where (.2) fails, ad the asymptotic distributio of a (U π) (if ay) ca be o Gaussia. Let us begi with the boudary case, where α 2,X (k) = O((k + ) ) ad α 2,Y (k) = O((k + ) ). (3.) I that case, oe ca prove the followig result. Propositio 3.. Let (X i ) i Z ad (Y i ) i Z be two idepedet statioary sequeces. Assume that (3.) is satisfied, ad let (m ) be a sequece a positive itegers satisfyig (.3). The:. For ay r (2, 4), there exists a positive costat C r such that, for ay x > 0, ( ) lim sup P / log (U π) > x C r x. r 2. There are some examples of statioary Markov chais (X i ) i Z ad (Y i ) i Z for which the sequece / log (U π) coverges i distributio to a o-degeerate ormal distributio. We cosider ow the case where α,x (k) < ad α,y (k) = O((k + ) (p ) ), for some p (, 2). (3.2) This meas that (X i ) i Z is short-rage depedet, while (Y i ) i Z is possibly log-rage depedet. I that case, we shall prove the followig result. Propositio 3.2. Let (X i ) i Z ad (Y i ) i Z be two idepedet statioary sequeces. Assume that (3.2) is satisfied for some p (, 2), ad that (X i ) i Z is 2-ergodic. Let (m ) be a sequece a positive itegers such that The: lim m 2(p )/p = c, for some c [0, ). (3.3) 6

7 . There exists a positive costat C p such that, for ay x > 0, lim sup P ( (U π) > x ) C p x. p 2. There are some examples of statioary Markov chais (X i ) i Z ad (Y i ) i Z for which the sequece (U π) coverges i distributio to Z + cs, where Z is idepedet of S, Z is a Gaussia radom variable, ad S is a stable radom variable of order p. Remark 3.. Note that we do ot deal with the case where the two samples are log-rage depedet; i that case the degeerate U-statistic from the Hoeffdig decompositio (see (4.2)) is o loger egligible, which gives rise to other possible limits. See the paper [0] for a treatmet of this difficult questio i the case of fuctios of Gaussia processes. 4 Proofs 4. Proof of Theorem. The mai igrediet of the proof is the followig propositio: Propositio 4.. Let (X i ) i Z ad (Y i ) i Z be two idepedet statioary sequeces. Let also f(x, y) = x<y H Y (x) G X (y) + π. (4.) The ( E m ) 2 m f(x i, Y j ) 6 m j= i=0 m j=0 mi{α,x (i), α,y (j)}. Let us admit this propositio for a while, ad see how it ca be used to prove the theorem. Startig from (4.), we easily see that (U π) = m m j= f(x i, Y j ) + (H Y (X i ) π) + m m (G X (Y j ) π). (4.2) j= This is the well kow Hoeffdig decompositio of U-statistics; the first term o the right-had side of (4.2) is called a degeerate U-statistic, ad is hadled via Propositio 4.. Ideed, it follows easily from this propositio that ( ) 2 E m f(x i, Y j ) 6 m m j= i=0 (i + )α,x (i) + 6 m 7 m (j + )α,y (j). (4.3) j=0

8 Sice (.2) holds, the α,x () = o( ) ad α,y () = o(m ) (because these coefficiets decrease) ad by Cesaro s lemma lim i=0 (i + )α,x (i) = 0 ad lim m m m (j + )α,y (j) = 0. From (4.3) ad Coditio (.3), we ifer that m f(x i, Y j ) coverges to 0 i L 2 as. (4.4) m j= It remais to cotrol the two last terms o the right-had side of (4.2). We first ote that these two terms are idepedet, so it is eough to prove the covergece i distributio for both terms. The fuctios H Y ad G X beig mootoic, the cetral limit theorem for statioary ad 2-ergodic α-depedet sequeces (see for istace [5]) implies that ad (H Y (X i ) π) coverges i distributio to N (0, v ), where j=0 v = Var(H Y (X 0 )) + 2 m (G X (Y j ) π) coverges i distributio to N (0, v 2 ), where m j= v 2 = Var(G X (Y 0 )) + 2 Combiig (4.2), (4.4), (4.5) ad (4.6), the proof of Theorem. is complete. Cov(H Y (X 0 ), H Y (X k )), (4.5) Cov(G X (Y 0 ), G X (Y k )). (4.6) It remais to prove Propositio 4.. We start from the elemetary iequality ( ) 2 E m f(x i, Y j ) 4 m m E(f(X m 2 m 2 i, Y j )f(x k, Y l )). (4.7) j= To cotrol the term E(f(X i, Y j )f(x k, Y l )), we first work coditioally o Y = (Y i ) i Z. Note first that, sice Y is idepedet of X, E(f(X i, Y j ) Y) = 0. Sice f(x, y) 2, j= k=i E(f(X i, Y j )f(x k, Y l ) Y = y) 2 E(f(X k, y l ) F i ), (4.8) where F i = σ(x k, k i). Now, by defiitio of f, E(f(X k, y l ) F i ) E( Xk <y l F i ) G X (y l ) + E(H Y (X k ) F i ) π 2α,X (k i). (4.9) 8 l=j

9 From (4.8) ad (4.9), it follows that E(f(X i, Y j )f(x k, Y l )) E(f(X i, Y j )f(x k, Y l ) Y) 4α,X (k i). (4.0) I the same way E(f(X i, Y j )f(x k, Y l )) 4α,Y (l j). (4.) Propositio 4. follows from (4.7), (4.0) ad (4.). 4.2 Proof of Propositio 2. We proceed i two steps. Step. Let γx(k) = k (H Y (X i ) π)(h Y (X i+k ) π), γy (l) = m l (G X (Y j ) π)(g X (Y i+k ) π), m ad ote that E(γ X(k)) = k Cov(H Y (X 0 ), H Y (X k )) ad E(γ Y (l)) = k Cov(G X(Y 0 ), G X (Y l )). (4.2) I this first step, we shall prove that V = γ X(0) + 2 a j= γ X(k) + m (γ Y (0) + 2 b l= γ Y (l) coverges i L 2 to V. Clearly, by (4.2), it suffices to show that Var(V ) coverges to 0 as. We oly deal with the secod term i the expressio of V, the other oes may be hadled i the same way. Lettig Z i = H Y (X i ) π, ad γ k = Cov(H Y (X 0 ), H Y (X k )) we have ( a ) Var γx(k) = a a k l E((Z 2 i Z i+k γ k )(Z j Z j+l γ l )). (4.3) We must examie several cases. O Γ = {j i + k}, l= j= E((Z i Z i+k γ k )(Z j Z j+l γ l )) 2α 2,X (j i k). ), Cosequetly 2 a E((Z i Z i+k γ k )(Z j Z j+l γ l )) Γ 2 2 a k l l= j= a a l= ( ) α 2,X (j) j 0 Ca2 (4.4) 9

10 for some positive costat C. O Γ 2 = {i j < i + k} {i + k j + l}, E((Z i Z i+k γ k )(Z j Z j+l γ l )) = E((Z i Z i+k γ k )Z j Z j+l ) 2α,X (j + l i k). Cosequetly 2 a a k l l= j= E((Z i Z i+k γ k )(Z j Z j+l γ l )) Γ2 2 2 O Γ 3 = {i j} {i + k > j + l}, a a l= ( ) α,x (j) j 0 Ca2. (4.5) E((Z i Z i+k γ k )(Z j Z j+l γ l )) = E((Z j Z j+l γ l )Z i Z i+k ) 2α,X (i + k j l). Cosequetly 2 a E((Z i Z i+k γ k )(Z j Z j+l γ l )) Γ3 2 2 a k l l= j= a a l= j= ( ) α,x (i) From (4.4), (4.5) ad (4.6), settig Γ = {i j} = Γ Γ 2 Γ 3, oe gets 2 a a k l l= j= i 0 E((Z i Z i+k γ k )(Z j Z j+l γ l )) Γ 3Ca2 Of course, iterchagig i ad j, the same is true o Γ c = {i > j}, ad fially Ca2. (4.6) 2 a a k l l= j= E((Z i Z i+k γ k )(Z j Z j+l γ l )) 6Ca2 (4.7) From (4.3), (4.7) ad the fact that a = o( ), we ifer that a γ X (k) coverges i L2 to k>0 Cov(H Y (X 0 ), H Y (X k )). Sice the other terms i the defiitio of V ca be hadled i the same way, it follows that V coverges i L 2 to V. Step 2. I this secod step, i the expressio of γ X (k), we replace H Y (X i ) by H m (X i ), H Y (X i+k ) by H m (X i+k ), ad π by H m. Similarly, i the expressio of γ Y (l), we replace G X(Y j ) by G (Y j ), G X (Y j+l ) by G (Y j+l ), ad π by Ḡ. If we ca prove that these replacemets i the expressio of V Let are egligible i L 2, the coclusio of Propositio 2. will follow. γ,x (k) = k (H m (X i ) π)(h Y (X i+k ) π). 0

11 The ( γx(k) γ,x (k) 2 H m (X i ) H Y (X i ) 2 = H m (x) H Y (x) 2 2P X (dx)) /2, (4.8) the last equality beig true because (X i ) i Z ad (Y i ) i Z are idepedet. Now, ( ) H m (x) H Y (x) 2 2 Var( Y0 >x) + 2 Cov( Y0 >x, Yk >x) 2 k=0 α,y(k). (4.9) m m From (4.8) ad (4.9), we ifer that there exist a positive costat C such that a γ X(0) γ,x (0) + 2 (γx(k) γ,x (k)) C a. m By coditio (.3) ad the fact that a = o( ), this last quatity teds to 0 as. The Let ow 2 γ 2,X (k) = k (H m (X i ) H m )(H Y (X i+k ) π). γ,x (k) γ 2,X (k) 2 H m H Y (X i ) + 2 H Y (X i ) π. (4.20) 2 Proceedig as i (4.9) for the first term o the right-had side of (4.20), ad otig that H Y (X i ) π k=0 α,x(k), we get that there exist a positive costat C 2 such that γ a,x(0) γ 2,X (0) + 2 (γ,x (k) γ 2,X (k)) Agai, this last quatity teds to 0 as. 2 C 2a + C 2a. m Fially, all the other replacemets may be doe i the same way, ad lead to egligible cotributios i L 2. Hece ad the result follows from Step. lim V V 2 = 0

12 4.3 Proof of Propositio 3. We start from (4.2) agai, but with the ormalizatio / log istead of. Sice (3.) holds, lim sup i=0 (i + )α,x (i) < ad lim sup m From Iequality (4.3) ad Coditio (.3), we ifer that m log m m (j + )α,y (j) <. j=0 m f(x i, Y j ) coverges to 0 i L 2 as. (4.2) j= Let us prove Item. From (4.2) ad (4.2), ( ( ) lim sup P / log (U π) > x lim sup P (H Y (X i ) π) > x ) log /2 ( m + lim sup P (G X (Y i ) π) > x ) log /2. (4.22) m Sice (3.) holds, it follows from Propositio 5. i [6] that: for ay r (2, 4), there exists a positive costat c,r such that, for ay x > 0 ( lim sup P (H Y (X i ) π) > x ) log /2 j= c,r x r. Sice (.3) holds, the same is true for the secod term o the right-had side of (4.22) (for a positive costat c 2,r ), ad Item follows. Let us prove Item 2. We use a result by Gouëzel [2] about itermittet maps. Let θ /2 be the map described i Subsectio 6.2 of the preset paper. As explaied i Subsectio 6.2, there exists a uique absolutely cotiuous θ /2 -ivariat measure ν /2 o [0, ], ad (see the commets o the case α = /2 i Sectio.3 of Gouëzel s paper), o the probability space ([0, ], ν /2 ), ( HY (θ/2) k ν /2 (H Y ) ) coverges i distributio to a o-degeerate ormal distributio provided H Y (0) ν(h Y ) (which is true for istace if Y has distributio ν /2, sice i that case H Y (0) = ad ν(h Y ) = /2). As fully explaied i Subsectio 6.2, there exists a statioary Markov chai (X i ) i Z such 2

13 that the distributio of (X,..., X ) is the same as (θ /2,..., θ /2) o the probability space ([0, ], ν /2 ). Take (Y i ) i Z a idepedet copy of the chai (X i ) i Z. For such chais, both (H Y (X i ) π) ad m (G X (Y j ) π) m j= coverge to the same o-degeerate ormal distributio (ote that π = /2 i that case). Moreover, as recalled i Subsectio 6.2, there exist two positive costats A, B such that This completes the proof of Item 2. A k + α 2,X(k) = α 2,Y (k) B k Proof of Propositio 3.2 We start from (4.2) agai. At the ed of the proof, we shall prove that if (3.2) ad (3.3) are satisfied, the (4.4) holds. Let us ow prove Item. From (4.2) ad (4.4), lim sup P ( (U π) > x ) lim sup P ( (H Y (X i ) π) > x ) /2 + lim sup P ( m m (G X (Y i ) π) > x ) /2. (4.23) Sice the first part of (3.2) is satisfied, it follows from the cetral limit theorem for statioary ad 2-ergodic α-depedet sequeces that ( lim P (H Y (X i ) π) > x ) /2 = P(Z > x/2), (4.24) where Z is a Gaussia radom variable. From Coditio (3.2) ad Remark 3.3 i [6], we ifer that if (3.3) holds, the there exists a positive costat κ p such that, for ay x > 0, ( m lim sup P (G X (Y i ) π) > x ) /2 κ p m x. (4.25) p j= Item follows from (4.23), (4.24) ad (4.25). Let us prove Item 2. j= For the sequece (X i ) i Z, take a sequece of iid radom variables uiformly distributed over [0, ]. To fid the sequece (Y i ) i Z we use agai a result by Gouëzel [2] about itermittet maps. Let θ /p be the map described i Subsectio 6.2 of the preset 3

14 paper. As explaied is Subsectio 6.2, there exists a uique absolutely cotiuous θ /p -ivariat measure ν /p o [0, ], ad (see Theorem.3 i [2]), o the probability space ([0, ], ν /p ), m /p m ( GX (θ/p) k ν /p (G X ) ) coverges i distributio to a o-degeerate stable distributio of order p provided G X (0) ν(g X ) (this coditio is satisfied here, because G X (0) = 0 ad ν /p (G X ) > 0). As fully explaied i Subsectio 6.2, there exists a statioary Markov chai (Y i ) i Z such that the distributio of (Y,..., Y ) is the same as (θ /p,..., θ /p) o the probability space ([0, ], ν /p ). For such a chai, m /p m (G X (Y j ) π) j= coverges i distributio to a o-degeerate stable distributio of order p. Moreover, as recalled i Subsectio 6.2, there exist two positive costats A, B such that This completes the proof of Item 2. A (k + ) p α,y(k) B (k + ) p. It remais to prove (4.4). From Propositio 4. we get that ( ) 2 ( ) E m m f(x i, Y j ) 6 mi{α,x (i), α,y (j)}. (4.26) m m Note that j= i=0 j=0 m 6 mi{α,x (i), α,y (j)} 6α,X (i), (4.27) m j=0 ad that, for ay positive iteger i, sice α,y (j) 0 as j, by Cesaro s lemma. m 6 lim mi{α,x (i), α,y (j)} = 0, (4.28) m m j=0 domiated covergece theorem that ( lim E m provig (4.4). Sice i>0 α,x(i) <, it follows from (4.26), (4.27), (4.28), ad the ) 2 m f(x i, Y j ) = 0, j= 4

15 5 Other related results 5. Testig the stochastic domiatio by a kow distributio We briefly describes here a procedure for testig the (weak) domiatio by a kow distributio µ. Let H(t) = µ((t, )), ad let (X i ) i Z be a statioary 2-ergodic sequece of real-valued radom variables. Let π = E(H(X 0 )), ad defie If H = H(X i ). α,x (k) <, (5.) the the radom variables ( H π) coverge i distributio to N (0, V ), where V = Var(H(X 0 )) + 2 Cov(H(X 0 ), H(X k )). (5.2) Now, we wat to test H 0 : π = /2 (o relatio of weak domiatio betwee µ ad P X ) agaist oe of the possible alteratives: H : π /2 (oe of the two distributios weakly domiates the other oe), H : π > /2 (µ weakly domiates P X ), H : π < /2 (P X weakly domiates µ). Agai, oe caot use directly the statistics ( H /2) to test H 0 : π = /2 versus oe of the possibles alteratives, because uder H 0 the asymptotic distributio of ( H /2) depeds of the ukow quatity V defied i (5.2). To estimate V, we take for some sequece a = o( ), where V, = ˆγ(0) + 2 a ˆγ(k) ˆγ(k) = k (H(X i ) H )(H(X i+k ) H ). Now, as i Propositio 2., if (5.) holds for α 2,X (k) istead of α,x (k), the v coverges i L 2 to v. It follows that, uder H 0 : π = /2, if v > 0, the radom variables ( T H /2) = max{v,, 0} coverges i distributio to N (0, ). 5

16 5.2 The case of two adjacet samples from the same time series Let (X i ) i Z be a statioary sequece of real-valued radom variables. Let m = m be a sequece of positive itegers such that m as. We cosider here the U-statistic U = U,m = m +m j=+ Xi <Y j, where Y j = f(x j ), f beig a mootoic fuctio. This statistic ca be used to test if there is a chage i the time series after the time, ad more precisely if Y + weakly domiates X. Let π = P(X < f(x )), where X is a idepedet copy of X. Let also H Y (t) = P(f(X ) > t) ad G X (t) = P(X < t). Fidig the asymptotic behavior of U is a slightly more difficult problem tha for two idepedet samples, because we caot use the idepedece to cotrol the degeerate U- statistic as i Propositio 4.. However, it ca be doe by usig a more restrictive coefficiet tha α 2,X. Defiitio 5.. Let (X i ) i Z be a strictly statioary sequece of real-valued radom variables, ad let F 0 = σ(x k, k 0). Let P be the law of X 0 ad P (Xi,X j ) be the law of (X i, X j ). Let P Xk F 0 be the coditioal distributio of X k give F 0, ad let P (Xi,X j ) F 0 be the coditioal distributio of (X i, X j ) give F 0. Defie the radom variables b(k) = sup PXk F 0 (f x ), b(i, j) = x R sup P (Xi,X j ) F 0 (f x f y ) P (Xi,X j ) (f x f y ), (x,y) R 2 where the fuctio f x has bee defied i Defiitio 2.. Defie ow the coefficiets { } β,x (k) = E(b(k)) ad β 2,X (k) = max β,x (k), sup E((b(i, j))) i>j k. Of course, by defiitio of the coefficiets, we always have that α 2,X (k) β 2,X (k). coefficiets β 2,X are weaker tha the usual β-mixig coefficiets of (X i ) i Z. May examples of o-mixig process for which β 2,X ca be computed are give i [8]. For all the examples that we metio i the preset paper, the coefficiet β 2,X (k) is of the same order tha the coefficiet α 2,X (k) (see the paper [4] for the example of the itermittet maps described i Subsectio 6.2). Theorem 5.. Let (X i ) i Z be a statioary sequece of real-valued radom variables. Assume that The β 2,X (k) <. (5.3) 6

17 Let (m ) be a sequece of positive itegers satisfyig (.3). distributio to N (0, V ), with The (U π) coverge i V = Var(H Y (X 0 )) + 2 Cov(H Y (X 0 ), H Y (X k )) + c ( Var(G X (f(x 0 ))) + 2 ) Cov(G X (f(x 0 )), G X (f(x k ))). (5.4) For the estimatio of V, oe ca prove the followig aalogue of Propositio 2.. Propositio 5.2. Let (X i ) i Z be a idepedet statioary sequece, ad let Y j = f(x j ) where f is a mootoic fuctio. Let (m ) be a sequece of positive itegers satisfyig (.3). Assume that (5.3) is satisfied. Let ad H m (t) = m +m j=+ Yj >t ad G (t) = ˆγ X (k) = k (H m (X i ) H m )(H m (X i+k ) H m ), ˆγ Y (l) = m Xi <t, +m l j=+ (G (Y j ) Ḡ)(G (Y j+l ) Ḡ), where H m = H m(x i ) ad mḡ = +m j=+ G (Y j ). Let also (a ) ad (b ) be two sequeces of positive itegers tedig to ifiity as teds to ifiity, such that a = o( / log()) ad b = o( m / log(m )). The V = ˆγ X (0) + 2 a coverges i L 2 to the quatity V defied i (5.4). ˆγ X (k) + m (ˆγ Y (0) + 2 b l= ˆγ Y (l) Combiig Theorem 5. ad Propositio 5.2, we obtai that, uder H 0 : π = /2, if V > 0, the radom variables coverge i distributio to N (0, ). T = (U /2) max{v, 0} ) (5.5) The proof of Propositio 5.2 follows the same lies as that of Propositio 2., so we do ot give the details here. The oly mai chage cocers Step 2, Iequalities (4.8) ad (4.9), where we used the idepedece betwee the samples. Here istead, oe ca use the fact that 2 sup m H (x) H(x) C (log(m)) 2 k=0 β,x(k). x R m 2 7

18 This ca be easily proved by followig the proof of Propositio 7. i Rio [4] up to (7.8) ad by usig the computatios i the proof of Theorem 2. i [3]. The proof of Theorem 5. follows also the lie of that of Theorem.; the oly o-trivial chage cocers Propositio 4.. Here, we have to deal with L = +m f(x i, Y j ) m j=+ where f is defied i (4.). We wat to prove that E(L 2 ) teds to 0 as teds to ifiity. As i the proof of Propositio 4., we start from the elemetary iequality E(L 2 ) 4 +m +m E(f(X 2 m 2 i, Y j )f(x k, Y l )). (5.6) j=+ k=i l=j Let (X i ) i Z be a idepedet copy of (X i ) i Z ad Y i = f(x i ), ad write E(f(X i, Y j )f(x k, Y l )) = E(f(X i, Y j )f(x k, Y l )) E(f(X i, Y j )f(x k, Y l )) + E(f(X i, Y j )f(x k, Y l )). (5.7) Now, usig the idepedece, the secod term o the right-had side of (5.7) ca be hadled exactly as i Propositio 4.. It remais to deal with the first term i the right-had side of (5.7). Note that E(f(X i, Y j )f(x k, Y l )) E(f(X i, Yj )f(x k, Yl )) E(f(x i, Y j )f(x k, Y l ) X i = x i, X k = x k ) E(f(x i, Y j )f(x k, Y l )) P (Xi,X k )(dx i, dx k ) (5.8) For ay fixed x, the fuctio y h x (y) = f(x, y) is a bouded variatio fuctio whose variatio is bouded by 2, ad E(h x (X i )) = 0. Proceedig as i Lemma of [7], oe ca the prove that E(f(x i, Y j )f(x k, Y l ) X i = x i, X k = x k ) E(f(x i, Y j )f(x k, Y l )) P (Xi,X k )(dx i, dx k ) 4β 2,X (j k). (5.9) Cosiderig separately the two cases j k > /4 or j k /4, we get the upper boud 2 m 2 +m +m j=+ k=i l=j E(f(X i, Y j )f(x k, Y l ) E(f(X i, Y j )f(x k, Y l )) 4 m + 4 m k> /4 β 2,X (k) (5.0) from which we easily derive that E(L 2 ) teds to 0 as teds to ifiity (sice (.3) ad (5.3) are satisfied). 8

19 6 Simulatios 6. Example : fuctios of a auto-regressive process I this sectio, we first simulate (Z,..., Z ), accordig to the simple AR() equatio Z k+ = 2 (Z k + ε k+ ), where Z is uiformly distributed over [0, ], ad (ε i ) i 2 is a sequece of iid radom variables with distributio B(/2), idepedet of Z. Oe ca check that the trasitio Kerel of the chai (Z i ) i is K(f)(x) = ( ( x ) ( )) x + f + f, ad that the uiform distributio o [0, ] is the uique ivariat distributio by K. Hece, the chai (Z i ) i is strictly statioary. It is well kow that the chai (Z i ) i is ot α-mixig i the sese of Roseblatt [5] (see for istace []). However, oe ca prove that the coefficiets α 2,X of the chai (Z i ) i are such that α 2,Z (k) 2 k (ad the same upper boud is true for β 2,Z (k), see for istace Sectio 6. i [8]). the Let ow Q µ,σ 2 be the iverse of the cumulative distributio fuctio of the law N (µ, σ 2 ). Let X i = Q µ,σ 2 (Z i). The sequece (X i ) i is also a statioary Markov chai (as a ivertible fuctio of a statioary Markov chai), ad oe ca easily check that α 2,X (k) = α 2,Z (k). By costructio, X i is N (µ, σ 2 )- distributed, but the sequece (X i ) i is ot a Gaussia process (otherwise it would be mixig i the sese of Roseblatt). Next, we simulate iid radom variables (Y,..., Y m ) with commo distributio N (µ 2, σ 2 2). Note that, if µ = µ 2, the hypothesis H 0 : π = /2 is satisfied ad the radom variables T defied i (2.2) coverge i distributio to N (0, ). We shall study the behavior of T for σ 2 = 4, σ 2 2 = ad differet choices of µ, µ 2, a ad b. As explaied i Sectio 2, the quatities a ad b may be chose by aalyzig the graph of the estimated auto-covariaces ˆγ X (k) ad ˆγ Y (l). For this examples, these graphs suggest a choice of a = 3 or 4 ad b = 0 (see Figure ). 9

20 We shall compute T for differet choices of (, m) (from (50, 00) to (750, 500) with /m =.5). We estimate the three quatities Var(T ), P(T >.645) ad P( T >.96) via a classical Mote-Carlo procedure, by averagig over N = 2000 idepedet trials. If µ = µ 2, we say that P(T >.645) is level (the level of the oe-sided test) ad P( T >.96) is level 2 (the level of the two-sided test). If a, b are well chose, the estimated variace should be close to ad the estimated levels ad 2 should be close to If µ µ 2, we say that P(T >.645) is power (the power of the oe-sided test) ad P( T >.96) is power 2 (the power of the two-sided test). Case µ = µ 2 = 0 ad a = 0, b = 0 (o correctio): ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level Here, sice a = 0, we do ot estimate ay of the covariace terms γ X (l); hece, we compute T as if both series (X i ) i ad (Y j ) j were ucorrelated. The result is that the estimated variace is too large (aroud 2. whatever (, m)) ad so are the estimated levels ad 2 (aroud 0.3 for level ad 0.8 for level 2, whatever (, m)). This meas that the-oe sided or two-sided tests build with T will reject the ull hypothesis too ofte. Case µ = µ 2 = 0 ad a = 3, b = 0: ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level As suggested by the graphs of the estimated auto-covariaces (see Figure ), the choice a = 3, b = 0 should give a reasoable estimatio of V. This is ideed the case: the estimated variace is ow aroud., the estimated level is aroud 0.06 ad the estimated level 2 aroud Case µ = µ 2 = 0 ad a = 4, b = 0: 20

21 Figure : Estimatio of the γ X (k) s (top) ad γ Y (l) s (bottom) for Example, with µ = µ 2 = 0, = 300 ad m = 200. ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level Here, we see that the choice a = 4, b = 0 works well also, ad seems eve slightly better tha the choice a = 3, b = 0. Case µ = 0, µ 2 = 0.2 ad a = 4, b = 0 : I this example, H 0 is ot satisfied, ad π > /2. However, µ 2 is close to µ, ad sice σ = 2 ad σ 2 =, Y does ot stochastically domiate X (there is oly stochastic domiatio 2

22 i a weak sese). We shall see how the two tests are able to see this weak domiatio, by givig some estimatio of the powers. ; m 50 ; ; ; ; ; 500 Estimated variace Estimated power Estimated power As oe ca see, the powers of the oe sided ad two-sided tests are always greater tha 0.05, as expected. Still as expected, these powers icrease with the size of the samples. For = 750, m = 500, the power of the oe-sided test is aroud 0.47 ad that of the two-sided test is aroud Example 2: Itermittet maps For γ i ]0, [, we cosider the itermittet map θ γ from [0, ] to [0, ], itroduced by Liverai, Saussol ad Vaieti [3]: x( + 2 γ x γ ) if x [0, /2[ θ γ (x) = 2x if x [/2, ]. It follows from [3] that there exists a uique absolutely cotiuous θ γ -ivariat probability measure ν γ, with desity h γ. Let us briefly describe the Markov chai associated with θ γ, ad its properties. Let first K γ be the Perro-Frobeius operator of θ γ with respect to ν γ, defied as follows: for ay fuctios u, v i L 2 ([0, ], ν γ ) (6.) ν γ (u v θ γ ) = ν γ (K γ (u) v). (6.2) The relatio (6.2) states that K γ is the adjoit operator of the isometry U : u u θ γ actig o L 2 ([0, ], ν γ ). It is easy to see that the operator K γ is a trasitio kerel, ad that ν γ is ivariat by K γ. Let ow (ξ i ) i be a statioary Markov chai with ivariat measure ν γ ad trasitio kerel K γ. It is well kow that o the probability space ([0, ], ν γ ), the radom vector (θ γ, θ 2 γ,..., θ γ ) is distributed as (ξ, ξ,..., ξ ). Now it is proved i [5] that there exist two positive costats A, B such that A ( + ) ( γ)/γ α 2,ξ() B ( + ) ( γ)/γ (this is also true for the coefficiet β 2,ξ (), see [4]). Moreover, the chai (ξ i ) i is ot α-mixig i the sese of Roseblatt [5]. 22

23 Let X i (x) = θ i γ (x) ad Y j (y) = θ j γ 2 (y). O the probability space ([0, ] [0, ], ν γ ν γ2 ) the two sequeces (X i ) i ad (Y j ) j are idepedet. Note that, if γ = γ 2, the hypothesis H 0 : π = /2 is satisfied ad the radom variables T defied i (2.2) coverge i distributio to N (0, ). We shall study the behavior of T for differet choices of γ, γ 2, a ad b. As explaied i Sectio 2, the quatities a ad b may be chose by aalyzig the graph of the estimated auto-covariaces ˆγ X (k) ad ˆγ Y (l). As i Subsectio 6., we shall compute T for differet choices of (, m) (from (50, 00) to (750, 500) with /m =.5). We estimate the three quatities Var(T ), P(T >.645) ad P( T >.96) via a classical Mote-Carlo procedure, by averagig over N = 2000 idepedet trials. If γ = γ 2, we say that P(T >.645) is level (the level of the oe-sided test) ad P( T >.96) is level 2 (the level of the two-sided test). If a, b are well chose, the estimated variace should be close to ad the estimated levels ad 2 should be close to If γ γ 2, we say that P(T >.645) is power (the power of the oe-sided test) ad P( T >.96) is power 2 (the power of the two-sided test). Note that we caot simulate exactly (X i ) i ad (Y j ) j, sice the distributio ν /4 is ukow. We start by pickig X ad Y idepedetly over [0, 0.05] (i order to start ear the eutral fixed poit 0) ad the geerate X i = θγ i (X ) ad Y j = θγ j 2 (Y ) for i >, j >. We shall see that the tests are robust to this (slight) lack of statioarity. Case γ = γ 2 = /4 ad a = 0, b = 0 (o correctio): ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level Here agai, sice a = b = 0, we compute T as if both series (X i ) i ad (Y j ) j were ucorrelated. It leads to a disastrous result, with a estimated variace aroud 4.8, a estimated level aroud 0.2, ad a estimated level 2 aroud This meas that the-oe sided or twosided tests build with T will reject the ull hypothesis much too ofte. Case γ = γ 2 = /4 ad a = 4, b = 4: 23

24 Figure 2: Estimatio of the γ X (k) s (top) ad γ Y (l) s (bottom) for Example 2, with γ = γ 2 = /4, = 300 ad m = 200. ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level As suggested by the graphs of the estimated auto-covariaces (see Figure 2), the choice a = 4, b = 4 should give a reasoable estimatio of V. We see that the procedure is much better tha if o correctio is made: the estimated variace is ow aroud.3, the estimated level is aroud ad the estimated level 2 is aroud Case γ = γ 2 = /4 ad a = 5, b = 5: 24

25 ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level For a = 5, b = 5, the results seem slightly better tha for a = 4, b = 4. Eve for large samples, the estimated level 2 is still larger tha the expected level: aroud 0.07 istead of 0.05 (but 0.07 has to be compared to 0.36 which is the estimated level 2 without correctio). Recall that the coefficiets α 2,X () ad α 2,Y () are exactly of order /3, which is quite slow. The procedure ca certaily be improved for larger, m by estimatig more covariace terms (i accordace with Propositio 2.). Case γ = 0.25, γ 2 = 0. ad a = 5, b = 4. We first estimate the quatity π = P(X < Y ) via a realizatio of m m Xi <Y j. j= For = m = 30000, this gives a estimatio aroud for π, meaig that Y weakly domiates X. Let us see how the tests based o T ca see this departure from H 0 : π = /2. Based o the estimated covariaces, a reasoable choice is a = 5, b = 4. This choice leads to the followig estimated powers (based o N = 2000 idepedet trials). ; m 50 ; ; ; ; ; 500 Estimated variace Estimated power Estimated power As oe ca see, the powers of the oe sided ad two-sided tests are always greater tha 0.05, as expected. Still as expected, these powers icrease with the size of the samples. For = 750, m = 500, the power of the oe-sided test is aroud 0.24 ad that of the two-sided test is aroud 0.7. This has to be compared with the estimated levels whe T is cetered o istead of 0.5 (recall that is the previous estimatio of π o very large samples): i that case, for = 750, m = 500 ad a = 5, b = 4, oe gets a estimated level = 0.059, ad a estimated level 2 = 0.068, which are both close to 0.05 as expected. 25

26 6.3 Itermittets maps: the case of two adjacet samples Here, we shall illustrate the result of Subsectio 5.2. As i Subsectio 6.2, we first simulate X accordig to the uiform distributio over [0, 0.05], ad the geerate X i = θγ i (Z ), but ow for the idexes i {2,..., + m}. For j { +,..., + m}, we take Y j = f(x j ) for a give mootoic fuctio f. I our first experimet, we simply take f =Id, so that Y j = X j. Hece the assumptio H 0 : π = /2 is satisfied. For the secod experimet, we take f(x) = x 4/5, so that f(x ) > X almost surely. Hece π > /2. Let T be defied i (5.5). We follow the same scheme as i Subsectio 6.2, ad we estimate the three quatities Var(T ), P(T >.645) ad P( T >.96) via a classical Mote-Carlo procedure, by averagig over N = 2000 idepedet trials. Case γ = /4, f =Id ad a = 0, b = 0 (o correctio): ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level Here, as i the correspodig two-sample case (see the table for γ = γ 2 = /4), the ucorrected tests lead to a disastrous result, with a estimated level aroud 0.25 ad a estimated level 2 aroud Case γ = /4, f =Id ad a = 5, b = 5: ; m 50 ; ; ; ; ; 500 Estimated variace Estimated level Estimated level The corrected tests with the choice a = b = 5 give a much better result tha if o correctio is made. Oe ca observe, however, that the estimated levels are still too high ( 8%) for moderately large samples (up to = 450, m = 300). The results seem less good tha i the correspodig two-sample case (see the table for γ = γ 2 = /4), which is of course ot surprisig. Case γ = /4, f(x) = x 4/5 ad a = 5, b = 5: 26

27 ; m 50 ; ; ; ; ; 500 Estimated variace Estimated power Estimated power As oe ca see, the powers of the oe sided ad two-sided tests are always greater tha 0.05, as expected. Still as expected, these powers icrease with the size of the samples. For = 750, m = 500, the power of the oe-sided test is aroud 0.55 ad that of the two-sided test is aroud Ackowledgmets. We thak Herold Dehlig for helpful commets o a prelimiary versio of the paper. Refereces [] R. C. Bradley (986), Basic properties of strog mixig coditios, Depedece i probability ad statistics. A survey of recet results. Oberwolfach, 985. E. Eberlei ad M. S. Taquu editors, Birkäuser, [2] S. Dédé, Théorèmes limites foctioels et estimatio de la desité spectrale pour des suites statioaires. Phd thesis, Uiversité Pierre et Marie Curie, [3] J. Dedecker, A empirical cetral limit theorem for itermittet maps, Probab. Theory Related Fields 48 (200) [4] J. Dedecker, H. Dehlig, ad M. S. Taqqu, Weak covergece of the empirical process of itermittet maps i L 2 uder log-rage depedece, Stochastics ad Dyamics 5 (205) 29 pages. [5] J. Dedecker, S. Gouëzel ad F. Merlevède, Some almost sure results for ubouded fuctios of itermittet maps ad their associated Markov chais, A. Ist. Heri Poicaré Probab. Stat. 46 (200) [6] J. Dedecker ad F. Merlevède, A deviatio boud for α-depedet sequeces with applicatios to itermittet maps, to appear i Stochastics ad Dyamics (206). [7] J. Dedecker ad C. Prieur, New depedece coefficiets. Examples ad applicatios to statistics, Probab. Theory Related Fields 32 (2005) [8] J. Dedecker ad C. Prieur, A empirical cetral limit theorem for depedet sequeces, Stochastic Process. Appl. 7 (2007)

28 [9] H. Dehlig ad R. Fried, Asymptotic distributio of two-sample empirical U-quatiles with applicatios to robust tests for shifts i locatio. J. Multivariate Aal. 05 (202), [0] H. Dehlig, A. Rooch ad M. S. Taqqu, No-parametric chage-poit tests for log-rage depedet data. Scad. J. Stat. 40 (203) [] I. Dewa ad B. L. S. Prakasa Rao, Ma-Whitey test for associated sequeces, A. Ist. Statist. Math. 55 (2003) -9. [2] S. Gouëzel, Cetral limit theorems ad stable laws for itermittet maps, Probab. Theory Related Fields 28 (2004) [3] C. Liverai, B. Saussol ad S. Vaieti (999), A probabilistic approach to itermittecy, Ergodic Theory Dyam. Systems [4] E. Rio, Théorie asymptotique des processus aléatoires faiblemet dépedats, Mathématiques et Applicatios 3 (Spriger-Verlag, 2000). [5] M. Roseblatt, A cetral limit theorem ad a strog mixig coditio, Proc. Nat. Acad. Sci. U. S. A. 42 (956) [6] R. J. Serflig, The Wilcoxo two-sample statistic o strogly mixig processes, A. Math. Statist. 39 (968)

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Erratum to: An empirical central limit theorem for intermittent maps

Erratum to: An empirical central limit theorem for intermittent maps Probab. Theory Relat. Fields (2013) 155:487 491 DOI 10.1007/s00440-011-0393-0 ERRATUM Erratum to: A empirical cetral limit theorem for itermittet maps J. Dedecker Published olie: 25 October 2011 Spriger-Verlag

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Sequences and Series of Functions

Sequences and Series of Functions Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

On the optimality of McLeish s conditions for the central limit theorem

On the optimality of McLeish s conditions for the central limit theorem O the optimality of McLeish s coditios for the cetral limit theorem Jérôme Dedecker a a Laboratoire MAP5, CNRS UMR 845, Uiversité Paris-Descartes, Sorboe Paris Cité, 45 rue des Saits Pères, 7570 Paris

More information

Math Solutions to homework 6

Math Solutions to homework 6 Math 175 - Solutios to homework 6 Cédric De Groote November 16, 2017 Problem 1 (8.11 i the book): Let K be a compact Hermitia operator o a Hilbert space H ad let the kerel of K be {0}. Show that there

More information

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15 17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Lecture 10 October Minimaxity and least favorable prior sequences

Lecture 10 October Minimaxity and least favorable prior sequences STATS 300A: Theory of Statistics Fall 205 Lecture 0 October 22 Lecturer: Lester Mackey Scribe: Brya He, Rahul Makhijai Warig: These otes may cotai factual ad/or typographic errors. 0. Miimaxity ad least

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Law of the sum of Bernoulli random variables

Law of the sum of Bernoulli random variables Law of the sum of Beroulli radom variables Nicolas Chevallier Uiversité de Haute Alsace, 4, rue des frères Lumière 68093 Mulhouse icolas.chevallier@uha.fr December 006 Abstract Let be the set of all possible

More information

Notes 19 : Martingale CLT

Notes 19 : Martingale CLT Notes 9 : Martigale CLT Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: [Bil95, Chapter 35], [Roc, Chapter 3]. Sice we have ot ecoutered weak covergece i some time, we first recall

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

Notes 27 : Brownian motion: path properties

Notes 27 : Brownian motion: path properties Notes 27 : Browia motio: path properties Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces:[Dur10, Sectio 8.1], [MP10, Sectio 1.1, 1.2, 1.3]. Recall: DEF 27.1 (Covariace) Let X = (X

More information

Lecture 3 The Lebesgue Integral

Lecture 3 The Lebesgue Integral Lecture 3: The Lebesgue Itegral 1 of 14 Course: Theory of Probability I Term: Fall 2013 Istructor: Gorda Zitkovic Lecture 3 The Lebesgue Itegral The costructio of the itegral Uless expressly specified

More information

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3 MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special

More information

1 Convergence in Probability and the Weak Law of Large Numbers

1 Convergence in Probability and the Weak Law of Large Numbers 36-752 Advaced Probability Overview Sprig 2018 8. Covergece Cocepts: i Probability, i L p ad Almost Surely Istructor: Alessadro Rialdo Associated readig: Sec 2.4, 2.5, ad 4.11 of Ash ad Doléas-Dade; Sec

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013 Large Deviatios for i.i.d. Radom Variables Cotet. Cheroff boud usig expoetial momet geeratig fuctios. Properties of a momet

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Asymptotic distribution of products of sums of independent random variables

Asymptotic distribution of products of sums of independent random variables Proc. Idia Acad. Sci. Math. Sci. Vol. 3, No., May 03, pp. 83 9. c Idia Academy of Scieces Asymptotic distributio of products of sums of idepedet radom variables YANLING WANG, SUXIA YAO ad HONGXIA DU ollege

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

The Mann Whitney U-Statistic for α-dependent Sequences

The Mann Whitney U-Statistic for α-dependent Sequences ISSN 066-5307, Matheatical Methods of Statistics, 207, Vol. 26, No. 2, pp. 33. c Allerto Press, Ic., 207. The Ma Whitey U-Statistic for α-depedet Sequeces J. Dedecker * ad G. Saulière 2** Uiv. Paris Descartes,

More information

Entropy Rates and Asymptotic Equipartition

Entropy Rates and Asymptotic Equipartition Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013 MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013 Fuctioal Law of Large Numbers. Costructio of the Wieer Measure Cotet. 1. Additioal techical results o weak covergece

More information

Asymptotic Results for the Linear Regression Model

Asymptotic Results for the Linear Regression Model Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is

More information

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences Commuicatios of the Korea Statistical Society 29, Vol. 16, No. 5, 841 849 Precise Rates i Complete Momet Covergece for Negatively Associated Sequeces Dae-Hee Ryu 1,a a Departmet of Computer Sciece, ChugWoo

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog

More information

LECTURE 8: ASYMPTOTICS I

LECTURE 8: ASYMPTOTICS I LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece

More information

Lecture Chapter 6: Convergence of Random Sequences

Lecture Chapter 6: Convergence of Random Sequences ECE5: Aalysis of Radom Sigals Fall 6 Lecture Chapter 6: Covergece of Radom Sequeces Dr Salim El Rouayheb Scribe: Abhay Ashutosh Doel, Qibo Zhag, Peiwe Tia, Pegzhe Wag, Lu Liu Radom sequece Defiitio A ifiite

More information

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices Radom Matrices with Blocks of Itermediate Scale Strogly Correlated Bad Matrices Jiayi Tog Advisor: Dr. Todd Kemp May 30, 07 Departmet of Mathematics Uiversity of Califoria, Sa Diego Cotets Itroductio Notatio

More information

5 Birkhoff s Ergodic Theorem

5 Birkhoff s Ergodic Theorem 5 Birkhoff s Ergodic Theorem Amog the most useful of the various geeralizatios of KolmogorovâĂŹs strog law of large umbers are the ergodic theorems of Birkhoff ad Kigma, which exted the validity of the

More information

Dimension-free PAC-Bayesian bounds for the estimation of the mean of a random vector

Dimension-free PAC-Bayesian bounds for the estimation of the mean of a random vector Dimesio-free PAC-Bayesia bouds for the estimatio of the mea of a radom vector Olivier Catoi CREST CNRS UMR 9194 Uiversité Paris Saclay olivier.catoi@esae.fr Ilaria Giulii Laboratoire de Probabilités et

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

The log-behavior of n p(n) and n p(n)/n

The log-behavior of n p(n) and n p(n)/n Ramauja J. 44 017, 81-99 The log-behavior of p ad p/ William Y.C. Che 1 ad Ke Y. Zheg 1 Ceter for Applied Mathematics Tiaji Uiversity Tiaji 0007, P. R. Chia Ceter for Combiatorics, LPMC Nakai Uivercity

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Notes 5 : More on the a.s. convergence of sums

Notes 5 : More on the a.s. convergence of sums Notes 5 : More o the a.s. covergece of sums Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: Dur0, Sectios.5; Wil9, Sectio 4.7, Shi96, Sectio IV.4, Dur0, Sectio.. Radom series. Three-series

More information

Lecture 20: Multivariate convergence and the Central Limit Theorem

Lecture 20: Multivariate convergence and the Central Limit Theorem Lecture 20: Multivariate covergece ad the Cetral Limit Theorem Covergece i distributio for radom vectors Let Z,Z 1,Z 2,... be radom vectors o R k. If the cdf of Z is cotiuous, the we ca defie covergece

More information

MAT1026 Calculus II Basic Convergence Tests for Series

MAT1026 Calculus II Basic Convergence Tests for Series MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real

More information

A Weak Law of Large Numbers Under Weak Mixing

A Weak Law of Large Numbers Under Weak Mixing A Weak Law of Large Numbers Uder Weak Mixig Bruce E. Hase Uiversity of Wiscosi Jauary 209 Abstract This paper presets a ew weak law of large umbers (WLLN) for heterogeous depedet processes ad arrays. The

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

7 Sequences of real numbers

7 Sequences of real numbers 40 7 Sequeces of real umbers 7. Defiitios ad examples Defiitio 7... A sequece of real umbers is a real fuctio whose domai is the set N of atural umbers. Let s : N R be a sequece. The the values of s are

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A. Radom Walks o Discrete ad Cotiuous Circles by Jeffrey S. Rosethal School of Mathematics, Uiversity of Miesota, Mieapolis, MN, U.S.A. 55455 (Appeared i Joural of Applied Probability 30 (1993), 780 789.)

More information

Lecture 11 October 27

Lecture 11 October 27 STATS 300A: Theory of Statistics Fall 205 Lecture October 27 Lecturer: Lester Mackey Scribe: Viswajith Veugopal, Vivek Bagaria, Steve Yadlowsky Warig: These otes may cotai factual ad/or typographic errors..

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Kernel density estimator

Kernel density estimator Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I

More information

STAT Homework 1 - Solutions

STAT Homework 1 - Solutions STAT-36700 Homework 1 - Solutios Fall 018 September 11, 018 This cotais solutios for Homework 1. Please ote that we have icluded several additioal commets ad approaches to the problems to give you better

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Sequences. Notation. Convergence of a Sequence

Sequences. Notation. Convergence of a Sequence Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

Math 113, Calculus II Winter 2007 Final Exam Solutions

Math 113, Calculus II Winter 2007 Final Exam Solutions Math, Calculus II Witer 7 Fial Exam Solutios (5 poits) Use the limit defiitio of the defiite itegral ad the sum formulas to compute x x + dx The check your aswer usig the Evaluatio Theorem Solutio: I this

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Detailed proofs of Propositions 3.1 and 3.2

Detailed proofs of Propositions 3.1 and 3.2 Detailed proofs of Propositios 3. ad 3. Proof of Propositio 3. NB: itegratio sets are geerally omitted for itegrals defied over a uit hypercube [0, s with ay s d. We first give four lemmas. The proof of

More information

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data Proceedigs 59th ISI World Statistics Cogress, 5-30 August 013, Hog Kog (Sessio STS046) p.09 Kolmogorov-Smirov type Tests for Local Gaussiaity i High-Frequecy Data George Tauche, Duke Uiversity Viktor Todorov,

More information

BIRKHOFF ERGODIC THEOREM

BIRKHOFF ERGODIC THEOREM BIRKHOFF ERGODIC THEOREM Abstract. We will give a proof of the poitwise ergodic theorem, which was first proved by Birkhoff. May improvemets have bee made sice Birkhoff s orgial proof. The versio we give

More information

Berry-Esseen bounds for self-normalized martingales

Berry-Esseen bounds for self-normalized martingales Berry-Essee bouds for self-ormalized martigales Xiequa Fa a, Qi-Ma Shao b a Ceter for Applied Mathematics, Tiaji Uiversity, Tiaji 30007, Chia b Departmet of Statistics, The Chiese Uiversity of Hog Kog,

More information

Lecture 27. Capacity of additive Gaussian noise channel and the sphere packing bound

Lecture 27. Capacity of additive Gaussian noise channel and the sphere packing bound Lecture 7 Ageda for the lecture Gaussia chael with average power costraits Capacity of additive Gaussia oise chael ad the sphere packig boud 7. Additive Gaussia oise chael Up to this poit, we have bee

More information

On the convergence rates of Gladyshev s Hurst index estimator

On the convergence rates of Gladyshev s Hurst index estimator Noliear Aalysis: Modellig ad Cotrol, 2010, Vol 15, No 4, 445 450 O the covergece rates of Gladyshev s Hurst idex estimator K Kubilius 1, D Melichov 2 1 Istitute of Mathematics ad Iformatics, Vilius Uiversity

More information

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >

More information

Elements of Statistical Methods Lots of Data or Large Samples (Ch 8)

Elements of Statistical Methods Lots of Data or Large Samples (Ch 8) Elemets of Statistical Methods Lots of Data or Large Samples (Ch 8) Fritz Scholz Sprig Quarter 2010 February 26, 2010 x ad X We itroduced the sample mea x as the average of the observed sample values x

More information

On forward improvement iteration for stopping problems

On forward improvement iteration for stopping problems O forward improvemet iteratio for stoppig problems Mathematical Istitute, Uiversity of Kiel, Ludewig-Mey-Str. 4, D-24098 Kiel, Germay irle@math.ui-iel.de Albrecht Irle Abstract. We cosider the optimal

More information

Sequences I. Chapter Introduction

Sequences I. Chapter Introduction Chapter 2 Sequeces I 2. Itroductio A sequece is a list of umbers i a defiite order so that we kow which umber is i the first place, which umber is i the secod place ad, for ay atural umber, we kow which

More information

Surveying the Variance Reduction Methods

Surveying the Variance Reduction Methods Iteratioal Research Joural of Applied ad Basic Scieces 2013 Available olie at www.irjabs.com ISSN 2251-838X / Vol, 7 (7): 427-432 Sciece Explorer Publicatios Surveyig the Variace Reductio Methods Arash

More information

Integrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number

Integrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number MATH 532 Itegrable Fuctios Dr. Neal, WKU We ow shall defie what it meas for a measurable fuctio to be itegrable, show that all itegral properties of simple fuctios still hold, ad the give some coditios

More information

ACO Comprehensive Exam 9 October 2007 Student code A. 1. Graph Theory

ACO Comprehensive Exam 9 October 2007 Student code A. 1. Graph Theory 1. Graph Theory Prove that there exist o simple plaar triagulatio T ad two distict adjacet vertices x, y V (T ) such that x ad y are the oly vertices of T of odd degree. Do ot use the Four-Color Theorem.

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

1 Approximating Integrals using Taylor Polynomials

1 Approximating Integrals using Taylor Polynomials Seughee Ye Ma 8: Week 7 Nov Week 7 Summary This week, we will lear how we ca approximate itegrals usig Taylor series ad umerical methods. Topics Page Approximatig Itegrals usig Taylor Polyomials. Defiitios................................................

More information

Lecture 2. The Lovász Local Lemma

Lecture 2. The Lovász Local Lemma Staford Uiversity Sprig 208 Math 233A: No-costructive methods i combiatorics Istructor: Ja Vodrák Lecture date: Jauary 0, 208 Origial scribe: Apoorva Khare Lecture 2. The Lovász Local Lemma 2. Itroductio

More information

1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1

1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1 8. The cetral limit theorems 8.1. The cetral limit theorem for i.i.d. sequeces. ecall that C ( is N -separatig. Theorem 8.1. Let X 1, X,... be i.i.d. radom variables with EX 1 = ad EX 1 = σ (,. Suppose

More information

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS Jauary 25, 207 INTRODUCTION TO MATHEMATICAL STATISTICS Abstract. A basic itroductio to statistics assumig kowledge of probability theory.. Probability I a typical udergraduate problem i probability, we

More information

A Hilbert Space Central Limit Theorem for Geometrically Ergodic Markov Chains

A Hilbert Space Central Limit Theorem for Geometrically Ergodic Markov Chains A Hilbert Space Cetral Limit Theorem for Geometrically Ergodic Marov Chais Joh Stachursi Research School of Ecoomics, Australia Natioal Uiversity Abstract This ote proves a simple but useful cetral limit

More information

This section is optional.

This section is optional. 4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore

More information

Self-normalized deviation inequalities with application to t-statistic

Self-normalized deviation inequalities with application to t-statistic Self-ormalized deviatio iequalities with applicatio to t-statistic Xiequa Fa Ceter for Applied Mathematics, Tiaji Uiversity, 30007 Tiaji, Chia Abstract Let ξ i i 1 be a sequece of idepedet ad symmetric

More information

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker CHAPTER 9. POINT ESTIMATION 9. Covergece i Probability. The bases of poit estimatio have already bee laid out i previous chapters. I chapter 5

More information

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT TR/46 OCTOBER 974 THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION by A. TALBOT .. Itroductio. A problem i approximatio theory o which I have recetly worked [] required for its solutio a proof that the

More information

Introduction to Probability. Ariel Yadin

Introduction to Probability. Ariel Yadin Itroductio to robability Ariel Yadi Lecture 2 *** Ja. 7 ***. Covergece of Radom Variables As i the case of sequeces of umbers, we would like to talk about covergece of radom variables. There are may ways

More information