arxiv: v2 [math.st] 22 Nov 2013

Size: px
Start display at page:

Download "arxiv: v2 [math.st] 22 Nov 2013"

Transcription

1 The Annals of Statistics 203, Vol 4, No 5, DOI: 024/3-AOS54 c Institute of Mathematical Statistics, 203 arxiv:35000v2 [mathst] 22 Nov 203 CONVERGENCE RATES OF EIGENVECTOR EMPIRICAL SPECTRAL DISTRIBUTION OF LARGE DIMENSIONAL SAMPLE COVARIANCE MATRIX By Ningning Xia, Yingli Qin 2 and Zhidong Bai 3 Northeast Normal University and National University of Singapore, University of Waterloo, and Northeast Normal University and National University of Singapore The eigenvector Empirical Spectral Distribution(VESD) is adopted to investigate the limiting behavior of eigenvectors and eigenvalues of covariance matrices In this paper, we shall show that the Kolmogorov distance between the expected VESD of sample covariance matrix and the Marčenko Pastur distribution function is of order O(N /2 ) Given that data dimension n to sample size N ratio is bounded between 0 and, this convergence rate is established under finite 0th moment condition of the underlying distribution It is also shown that, for any fixed η >0, the convergence rates of VESD are O(N /4 ) in probability and O(N /4+η ) almost surely, requiring finite 8th moment of the underlying distribution Introductionandmainresults LetX i =(X i,x 2i,,X ni ) T andx= (X,,X N ) be an n N matrix of iid (independent and identically distributed) complex random variables with mean 0 and variance We consider, a class of sample covariance matrices S n = N N X k X k = N XX, k= Received February 203; revised June 203 Supported by the Fundamental Research Funds for the Central Universities SSXT3 2 Supported by University of Waterloo Start-up Grant 3 Supported by NSF China Grant 7057, PCSIRT Fundamental Research Funds for the Central Universities, and NUS Grant R AMS 2000 subect classifications Primary 5A52, 60F5, 62E20; secondary 60F7, 62H99 Key words and phrases Eigenvector empirical spectral distribution, empirical spectral distribution, Marčenko Pastur distribution, sample covariance matrix, Stieltes transform This is an electronic reprint of the original article published by the Institute of Mathematical Statistics in The Annals of Statistics, 203, Vol 4, No 5, This reprint differs from the original in pagination and typographic detail

2 2 N XIA, Y QIN AND Z D BAI where X denotes the conugate transpose of the data matrix X The Empirical Spectral Distribution (ESD) F Sn (x) of S n is then defined as F Sn (x)= n () I(λ i x), n i= where λ λ n are the eigenvalues of S n in ascending order and I( ) is the conventional indicator function Marčenko and Pastur in [6] proved that with probability, F Sn (x) converges weakly to the standard Marčenko Pastur distribution F y (x) with density function (2) p y (x)= df y(x) dx = (x a)(b x)i(a x b), 2πxy where a=( y) 2 and b=(+ y) 2 Here the positive constant y is the limit of dimension to sample size ratio when both n and N tend to infinity In applications of asymptotic theorems of spectral analysis of large dimensional random matrices, one of the important problems is the convergence rate of the ESD The Kolmogorov distance between the expected ESD of S n and the Marčenko Pastur distribution F y (x) is defined as = EF Sn F y =sup EF Sn (x) F y (x) x as well as the distance between two distributions F Sn (x) and F y (x), p = F Sn F y =sup F Sn (x) F y (x) x Notice that, for any constant C >0, { } P( p C)=P sup F Sn (x) F y (x) C C E p x Thus, p measures the rate of convergence in probability Bai in[2, 3] firstly tackled the problem of convergence rate and established three Berry Esseen type inequalities for the difference of two distributions in terms of their Stieltes transforms Götze and Tikhomirov in [] further improved the Berry Esseen type inequlatiy and showed the convergence rate of F Sn (x) is O(N /2 ) in probability under finite 8th moment condition More recently, a sharper bound is obtained by Pillai and Yin in [8], under a stronger condition, that is, the sub-exponential decay assumption It is shown that the difference between eigenvalues of S n and the Marčenko Pastur distribution is of order O(N (logn) O(loglogN) ) in probability In the literature, research on limiting properties of eigenvectors of large dimensional sample covariance matrices is much less developed than that of eigenvalues, due to the cumbersome formulation of the eigenvectors Some great achievements have been made in proving the properties of eigenvectors

3 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 3 for large dimensional sample covariance matrices, such as[4, 9 22], and that for Wigner matrices, such as [0, 3, 25] However, the eigenvectors of large sample covariance matrices play an important role in high-dimensional statistical analysis In particular, due to the increasing availability of high-dimensional data, principal component analysis (PCA) has been favorably recognized as a powerful technique to reduce dimensionality The eigenvectors corresponding to the leading eigenvalues are the directions of the principal components Johnstone [2] proposed the spiked eigenvalue model to test the existence of principal component Paul [7] discussed the length of the eigenvector corresponding to the spiked eigenvalue In PCA, the eigenvectors (ν 0,,ν0 n) of population covariance matrix Σ determine the directions in which we proect the observed data and the corresponding eigenvalues (λ 0,,λ0 n ) determine the proportion of total variability loaded on each direction of proections In practice, the (sample) eigenvalues (λ,,λ n ) and eigenvectors (ν,,ν n ) of the sample covariance matrix S n are used in PCA In [], Anderson has shown the following asymptotic distribution for the sample eigenvectors ν,,ν n when the observations are from a multivariate normal distribution of covariance matrix Σ with distinct eigenvalues: N(νi ν 0 i ) N d n (0,D i ), where D i =λ 0 i n k=,k i λ 0 k (λ 0 k λ0 i )2ν0 k ν0 T k However, this is a large sample result when the dimension n is fixed and low Inparticular, if Σ=σ 2 I n, then theeigenmatrix (matrix of eigenvectors) should be asymptotically isotropic when the sample size is large That is, the eigenmatrix should be asymptotically Haar, under some minor moment conditions However, when the dimension is large (increasing), the Haar property is not easy to formulate Motivated by the orthogonal iteration method, [5] proposed an iterative thresholding method to estimate sparse principal subspaces (spanned by the leading eigenvectors of Σ) in high dimensional and spiked covariance matrix setting The convergence rates of the proposed estimators are provided By reducing the sparse PCA problem to a high-dimensional regression problem, [9] established the optimal rates of convergence for estimating the principal subspace with respect to a large collection of spiked covariance matrices See the reference therein for more literature on sparse PCA and spiked covariance matrices To perform the test of existence of spiked eigenvalues, one has to investigate the null properties of the eigenmatrices, that is, when Σ=σ 2 I n (ie,

4 4 N XIA, Y QIN AND Z D BAI nonspiked) Then the eigenmatrix should be asymptotically isotropic, when the sample size is large That is, the eigenmatrix should be asymptotically Haar However, when the dimension is large, the Haar property is not easy to formulate The recent development in random matrix theory can help us investigate the large dimension and large sample properties of eigenvectors We will adopt the VESD, defined later in the paper, to characterize the asymptotical Haar property so that if the eigenmatrix is Haar, then the process defined the VESD tends to a Brownian bridge Conversely, if the process defined by the VESD tends to a Brownian bridge, then it indicates a similarity between the Haar distribution and that of the eigenmatrix Therefore, studying the large sample and large dimensional results of the VESD can assist us in better examining spiked covariance matrix as assumed by [5] and [9] among many others Let U n Λ n U n denote the spectral decomposition of S n, where Λ n = diag(λ,λ 2,,λ n ) and U n = (u i ) n n is a unitary matrix consisting of the corresponding orthonormal eigenvectors of S n For each n, let x n C n, x n = be nonrandom and let d n =U n x n =(d,,d n ), where x n denotes Euclidean norm of x n Define a stochastic process X n (t) by X n (t)= [nt] ( n/2 d 2 ), [a] denotes the greatest integer a n = If U n is Haar distributed over the orthogonal matrices, then d n would be uniformly distributed over the unit sphere in R n, and the limiting distribution of X n (t) is a unique Brownian bridge B(t) when n tends to infinity In this paper, we use the behavior of X n (t) for all x n to reflect the uniformity of U n The process X n (t) is considerably important for us to understand the behavior of the eigenvectors of S n Motivated by Silverstein s ideas in [9 22], we want to examine the limiting properties of U n through stochastic process X n (t) We claim that U n is asymptotically Haar distributed, which means X n (t) converges to a Brownian bridge B(t) In [2], it showed that the weak convergence of X n (t) converging to a Brownian bridge B(t) is equivalent to X n (F Sn (x)) converging to B(F y (x)) We therefore consider transforming X n (t) to X n (F Sn (x)) where F Sn (x) is the ESD of S n Wedefinetheeigenvector EmpiricalSpectralDistribution(VESD)H Sn (x) of S n as follows: n (3) H Sn (x)= d i 2 I(λ i x) i= Between H Sn (x) in (3) and F Sn (x) in (), we notice that there is no difference except the coefficient associated with each indicator function such

5 that (4) ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 5 X n (F Sn (x))= n/2(h Sn (x) F Sn (x)) Henceforth, the investigation of X n (t) is converted to that of the difference between two empirical distributions H Sn (x) and F Sn (x) The authors in [4] proved that H Sn (x) and F Sn (x) have the same limiting distribution, the Marčenko Pastur distribution F y (x), where y n = n/n and y = lim n,n y n (0,) Before we present the main theorems, let us introduce the following notation: and H = EH Sn F yn =sup EH Sn (x) F yn (x) x H p = H Sn F yn =sup H Sn (x) F yn (x) x We denote ξ n = O p (a n ) and η n = O as (b n ) if, for any ǫ > 0, there exist a large positive constant c and a positive random variable c 2, such that P(ξ n /a n c ) ǫ and P(η n /b n c 2 )=, respectively In this paper, we follow the work in [4] and establish three types of convergence rates of H Sn (x) to F yn (x) in the following theorems Theorem Suppose that X i, i =,,n, =,,N are iid complex random variables with EX =0, E X 2 = and E X 0 < For any fixed unit vector x n C n ={x Cn : x =}, and y n =n/n, it then follows that { H O(N = EH Sn F yn = /2 a 3/4 ), if N /2 a<, O(N /8 ), if a<n /2, where a=( y n ) 2 as it is defined in (2) and F yn denotes the Marčenko Pastur distribution function with an index y n Remark 2 From the proof of Theorem, it is clear that the condition E X 0 < is required only in thetruncation step in the next section We therefore believe that the condition E X 0 < can be replaced by E X 8 < in Theorems 6 and 8 Remark 3 Because the convergence rate of EH Sn F y depends on the convergence rate of y n y, we only consider the convergence rate of EH Sn F yn Remark 4 As a = ( y n ) 2, we can characterize the closeness between y n and through a In particular, when y n is away from (or

6 6 N XIA, Y QIN AND Z D BAI a N /2 ), the convergence rate of EH Sn F yn is O(N /2 ), which we believe is the optimal convergence rate This is because we observe in [4] that for an analytic function f, Y n (f)= (5) n f(x)d(h Sn (x) F yn (x)) converges to a Gaussian distribution While in[6], Bai and Silverstein proved that the limiting distribution of n f(x)d(f Sn (x) F yn (x)) is also a Gaussian distribution We therefore conecture that the optimal rate of H Sn (x) should be O(N /2 ) and O(N ) for F Sn (x) Although F Sn (x) and H Sn (x) converge to the same limiting distribution, there exists a substantial difference between F Sn (x) and H Sn (x) Remark 5 Notice that two matrices XX and X X share the same set of nonzero eigenvalues However, these two matrices do not always share the same set of eigenvectors Especially when y n, the eigenvectors of S n corresponding to 0 eigenvalues can be arbitrary As a result, the limit of H Sn may not exist or heavily depends on the choice of unit vector x n Therefore, we only consider the case of y n in this paper and leave the case of y n as a future research problem The rates of convergence in probability and almost sure convergence of the VESD are provided in the next two theorems Theorem 6 Under the assumptions in Theorem except that we now only require E X 8 <, we have { H O p = H Sn p (N /4 a /2 ), if N /4 a<, F yn = O p (N /8 ), if a<n /4 Remark 7 As an application of Theorem 6, in [8] we extended the CLT of the linear spectral statistics Y n (f) established in [4] to the case where the kernel function f is continuously twice differentiable provided that the sample covariance matrix S n satisfies the assumptions of Theorem 6 This result is useful in testing Johnstone s hypothesis when normality is not assumed Theorem 8 Under the assumptions in Theorem 6, for any η >0, we have { H Oas (N p = H Sn F yn = /4+η a /2 ), if N /4 a<, O as (N /8+η ), if a<n /4

7 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 7 Remark 9 In this paper, we will use the following notation: X denote the conugate transpose of a matrix (or vector) X; X T denote the (ordinary) transpose of a matrix (or vector) X; x denote the Euclidean norm for any vector x; A = λ max (AA ), the spectral norm; F =sup x F(x) for any function F; z denote the conugate of a complex number z The rest of the paper is organized as follows In Section 2, we introduce the main tools used to prove Theorems, 6 and 8, including Stieltes transform and a Berry Esseen type inequality The proofs of these three theorems are presented in Sections 3 6 Several important results which are repeatedly employed throughout Sections 3 6 are proved in Appendix A Appendix B contains some existing results in the literature Finally, preliminaries on truncation, centralization and rescaling are postponed to the last section 2 Main tools 2 Stieltes transform The Stieltes transform is an essential tool in random matrix theory and our paper Let us now briefly review the Stieltes transform and some important and relevant results For a cumulative distribution function G(x), its Stieltes transform m G (z) is defined as m G (z)= λ z dg(λ), z C+ ={z C,I(z)>0}, where I( ) denotes the imaginary part of a complex number The Stieltes transforms of the ESD F Sn (x) and the VESD H Sn (x) are and m F Sn(z)= n tr(s n zi n ) m H Sn(z)=x n(s n zi n ) x n, respectively Here I n denotes the n n identity matrix For simplicity of notation, we use m n (z) and m H n (z) to denote m F Sn(z) and m H Sn(z), respectively Remark 2 Notice that although the eigenmatrix U n may not be unique, the Stieltes transform m H n (z) of HSn depends on S n for any x n rather than U n

8 8 N XIA, Y QIN AND Z D BAI Let S n = X X/N denote the companion matrix of S n As S n and S n share the same set of nonzero eigenvalues, it can be shown that Stieltes transforms of F Sn (x) and F S n (x) satisfy the following equality: (2) m n (z)= y n z +y n m n (z), where m n (z) denotes the Stieltes transform of F S n (x) Moreover, [5] and [24] claimed that F S n converges, almost surely, to a nonrandom distribution function F y (x) with Stieltes transform m(z) such that (22) m(z)= y z +ym y (z), where m y (z) denotes the Stieltes transform of the Marčenko Pastur distribution with index y Using (64) in [7], we also obtain the relationship between two limits m y (z) and m(z) as follows: (23) m y (z)= 22 A Berry Esseen type inequality z(+m(z)) Lemma 22 Let H Sn (x) and F yn (x) be the VESDof S n and the Marčenko Pastur distribution with index y n, respectively Denote their corresponding Stieltes transforms by m H n (z) and m yn (z), respectively Then there exist large positive constants A,B,K,K 2 and K 3, such that for A>B >5, H = EH Sn (x) F yn (x) K Em H n (z) m y n (z) du+k 2 v +K 3 v sup x t <v F yn (x+t) F yn (x) dt, x >B EH Sn (x) F yn (x) dx where z = u + iv is a complex number with positive imaginary part (ie, v >0) Remark 23 Lemma 22 can be proved using Lemma B To prove Theorem, we apply Lemma 22 In addition, we prove Theorems 6 and 8 by replacing EH Sn (x), Em H n (z) with H Sn (x) and m H n (z), respectively 3 Proof of Theorem Under the condition of E X 0 <, we can choose a sequence of η N with η N 0 and η N N /4 as N, such that (3) lim N η 0 N E( X 0 I( X >η N N /4 ))=0

9 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 9 Furthermore, without loss of generality, we can assume that every X i is bounded by η N N /4 and has mean 0 and variance See Appendix C for details on truncation, centralization and rescaling We introduce some notation before start proving Theorem Throughout the paper, we use C and C i for i=0,,2, to denote positive constant numbers which are independent of N and may take different values at different appearances Let X denote the th column of the data matrix X Let r =X / N so that S n = N = r r and let (32) v y = a+ v= y n + v, B =S n r r, A(z)=S n zi n, A (z)=b zi n, α (z)=r A (z)x n x n r N x n A (z)x n, ξ (z)=r A (z)r N EtrA (z), ˆξ (z)=r A (z)r N tra (z), b(z)= It is easy to show that +(/N)EtrA (z), b (z)= +(/N)EtrA (z), β (z)= +r A (z)r β (z) b (z)= b (z)β (z)ξ (z) For any =,2,,N, we can also show that (33) due to the fact that r A (z)=β (z)r A (z) (B zi n ) (B +r r zi n) =(B zi n ) r r (B +r r zi n) From (22) in [23], we can write m n (z) in terms of β (z) as follows: (34) m n (z)= zn N β (z) =

10 0 N XIA, Y QIN AND Z D BAI We proceed with the proof of Theorem : δ=:em H n (z) m y (z) = x n [EA (z) ( zm(z)i n zi n ) ]x n = (zm(z)+z) x ne[(zm(z)+z)a (z)+i n ]x n = (zm(z)+z) x ne[(zi n +A(z))A (z)+zm(z)a (z)]x n [ N = (zm(z)+z) x ne r r A (z)+z(em n (z))a (z) = (zm(z)+z) x ne = z(em n (z))a (z)+zm(z)a (z) [ N = = (zm(z)+z) x n E [ N = ] x n β (z)r r A (z) ( zem n (z))a (z) (zem n (z) zm(z))a (z) β (z)r r A (z) ( N ] x n ) ] N Eβ (z) A (z) (zm(z)+z) x n(zem n (z) zm(z))(ea (z))x n [ N ( = (zm(z)+z) x n Eβ (z) r r A (z) (z))] N EA x n = +m(z)(zem n (z) zm(z))em H n (z) =:δ +δ 2, where δ =(zm(z)+z) x n [ N = δ 2 =m y (z)(zem n (z) zm(z))em H n (z) = ( Eβ (z) r r A (z) (z))] N EA x n, x n Lemma 3 If δ C ( ) Nvv y v y v

11 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES holds for some constants C 0 and C, when v 2 v y C 0 N, under the conditions of Theorem, there exists a constant C such that H Cv/v y Proof According to Lemma 22, H K Em H n (z) m y (z) du +K 2 v +K 3 v sup x x >B EH Sn (x) F yn (x) dx t <v F yn (x+t) F yn (x) dt From Lemmas B2 and A8, we know that there exists a positive constant C, such that K 2 v EH Sn (x) F yn (x) dx (35) Notice that x >B +K 3 v sup x Cv/v y Em H n (z) m y(z) du + δ du+ δ du+ δ du+ δ 2 du t <v F yn (x+t) F yn (x) dt zm y (z) Em H n (z) Em n (z) m(z) du zm y (z) Em H n (z) m y(z) Em n (z) m(z) du zm y (z) m y (z) Em n (z) m(z) du Lemma A, (2) and (A3) imply that and C Em n (z) m(z) = y n Em n (z) m y (z) Nv 3/2 vy 2 Em n (z) m(z) du= y n Em n (z) m y (z) du Cv

12 2 N XIA, Y QIN AND Z D BAI From (23) in [3], we have zm y (z)= y z+ (+y z) 2 4y 2y = ( z y 2 y m semi ), y where m semi ( ) denotes the Stieltes transform of the semicircle law, see (32) in [2] Therefore zm y (z) is bounded by a constant, for m semi ( ), see (33) in [2] Combined with Lemma B7, there exist constants C 2, C 3, such that Em H n (z) m y(z) du δ du+ C 2 Nv 3/2 vy 2 Em H n (z) m y (z) du+ C 3v v y Given v 2 v y C 0 N, for v y v, we have v 3/2 vy 2 v2 v y C 0 N For a large enough C 0 such that C 2 /C 0 /2, we have Em H n (z) m y (z) du δ du+ Cv v y As δ C Nvv y ( v y v ) and v2 v y C 0 N, if C 0 4AC K, we then have (36) Em H n (z) m y(z) du 2AC Nv 2 v y v v y + 2AC Nv 2 v y H + Cv v y H 2K + Cv v y Thus, from Lemma 22, equations (36) and (35), we conclude that there exists a constant C, such that The proof is complete H Cv v y C0 N To finish the proof of Theorem, we choose v = a+n such that /4 v 2 v y =C 0 N v y a+n /4 C 0 N According to Lemma 3, we know that H Cv v y CN /2 ( a+n /4 ) 3/2

13 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 3 If a<n /4, H CN /2 (N /4 ) 3/2 =O(N /8 ) If a N /4, H CN /2 ( a) 3/2 =O(N /2 a 3/4 ) Thus, the proof of Theorem is complete 4 The bound for δ In this section, we are going to show that when v 2 v y C 0 N, δ is indeed bounded by C Nvv y ( v y v ), as required by Lemma 3 From δ, we can further write δ =δ +δ 2 +δ 3, where [ ( δ =N(zm(z)+z) E β (z) r A (z)x nx nr )] N x na (z)x n =N(zm(z)+z) E(β (z)α (z)), δ 2 =(zm(z)+z) E[β (z)x n (A (z) (z))x n ], δ 3 =(zm(z)+z) E[β (z)x n (A (z) EA (z))x n ] According to (23) and Lemma B7, (4) (zm(z)+z) = m y (z) C v y for some constant C Using identity (32) three times, we have β (z)=b (z) b 2 (z)ξ (z)+b 3 (z)ξ2 (z) b3 (z)β (z)ξ 3 (z) Notice that Eα (z) =0 and b (z) is bounded by a constant (due to Lemma A3), we then have (42) δ CN v y Eβ (z)α (z) CN v y ( Eξ (z)α (z) + Eξ 2 (z)α (z) + Eβ (z)ξ 3 (z)α (z) ) Let us start with the first term in the above upper bound of δ as in (42) Note that r and A (z) are independent Therefore, for any integer p>0 we have and E(trA (z) EtrA (z))α (z)=0 E(trA (z))p α (z)=0 Denote A (z)=(a i) n n, A (z)x nx n =(b i) n n, and e i bethe ith canonical basis vector, that is, the n-vector whose coordinates are all 0 except

14 4 N XIA, Y QIN AND Z D BAI that the ith coordinate is Then Lemmas B3, A5, A6 and the inequality (z) /v imply that A Eξ (z)α (z) = Eˆξ (z)α (z) C N 2E ( tr(a (z)a ( z)x nx n)+ i= ) n a ii b ii i= { ( C N N 2 v v y v +v E x n A ( z)e i 2 C ( N 2 v v y v ) ) /2 } N E a ii 2 x n e i 2 In the above, we use the following two results, which can be proved by applying Lemmas A5 and A6: ( ) E a =E e A (z)e C, v y v i= E a 2 E e A (z)a ( z)e C ( v v y v Hence, we have shown that Eξ (z)α (z) C ( ) (43) N 2 v v y v Let us denote X the conugate transpose of X, that is, X =( X, X 2,, X n ) Then we can rewrite the second term in the upper bound of δ as Eξ(z)α 2 (z)= [( N 3E a i Xi X + i i ( i, ) ) 2 (a ii Xi Ea 2 ii ) )] b i ( X i X δ i ) = [{( ) 2 ( ) N 3E a i Xi X +2 (a ii Xi 2 Ea ii) i i ( ( ) 2 } a i Xi X )+ a ii Xi 2 Ea ii i i

15 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 5 { b i Xi X + }] b ii ( Xi ) 2 i i C N 3E ( i a 2 iib ii + i Ea ii 2 b ii + i a 2 ib ii + a 2 ib i + a i a ii b + i i i + C N 3 Ea ι i aτ k b ik, ι,τ i k ) a ii a i b i where a ι i and aτ i denote a i or ā i By following the similar proofs in establishing (43), we are able to show that E i a 2 ii b ii E a ii b ii v i E i v (E a2 Ex na (z)a ( z)x n) /2 C v 2 ( v y v ), Ea ii 2 b ii Ea 2 (Ex na (z)a ( z)x n) /2 [ ( )] C 3/2 v v y v The Cauchy Schwarz inequality implies that E ( ( )) a 2 i b ii E b ii a 2 i i i ( ) /2 ( E x na (z)e i 2 i E i i =(Ex n (A C v 3/2 [ a 2 i b i i ( x ne i 2 E a 2 i ) 2 ) /2 (z)a ( z))x n) /2 (E (A (z)a ( z) ) 2 ) /2 ( ) 3/2, v y v E a i x n A (z)e i 2 i E a i x n e 2 ] /2

16 6 N XIA, Y QIN AND Z D BAI [ E (A (z)a ( z)) ii (x na (z)e i) 2 i ] /2 E (A ( z)a (z)) (x n e ) 2 E i E i [ a ii a i b C ( ) 3/2 v 2, v y v i [ i N C v 2 [ a ii a i b i C ι,τ i N v 2 i k E a ii x n A (z)e 2 i E a 2 ii x n A (z)a ( z)x n E a i x n e 2 ] /2 ] /2 E(A (z)a ( z)) x n e 2 ( ), v y v E a ii x na e 2 i ( ) v y v Finally, we establish Ea ι i aτ k b N ik C E a i x ne i 2 ] /2 v 2 ( ) v y v By inclusive exclusive principle and what we have ust proved, it remains to show that ( ) Ea ι i aτ k b N (44) ik C v 2 v y v ι,τ i,,k Notice that γ ik = a ia k is the (i,k)-element of A 2 (z) We obtain Ea i a k b ik = γ ik b ik i,,k i,k ( E γ ik 2 i,k i,k E b ik 2 ) /2

17 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 7 =(Etr(A 2 (z)a 2 ( z))ex na (z)a ( z)x n) /2 ( ) N C v 2 v y v Similarly, one can prove the other terms of (44) share this common bound In summary, when v 2 v y C 0 N it holds that Eξ(z)α 2 (z) C ( ) (45) N 2 v v y v For the last term in the upper bound of δ, we apply Lemma A4 and the Cauchy Schwarz inequality again In particular, for any fixed t > 0, we have that Eβ (z)ξ 3 (z)α (z) CE ξ 3 (z)α (z) +o(n t ) (46) C(E ξ (z) 6 ) /2 (E α (z) 2 ) /2 +o(n t ) C N 5/2 v 2 v 3/2 y ( v y v ) /2 The last inequality in (46) is due to Lemmas A2 and A7 Therefore, for any v 2 v y C 0 N, (43), (45) and (46) lead us to δ C ( ) (47) Nvv y v y v To establish the upper bound for δ 2, we will make use of the following equality: (48) A (z) (z)=β (z)a (z)r r A (z) Note that (4) implies that δ 2 C v y Eβ (z)x n(a (z) (z))x n C v y Eβ 2 (z)x na (z)r r A (z)x n (49) C Nv y E X A (z)x nx n A (z)x +o(n t ) (see Lemma A4) C Etr(A Nv (z)x nx na (z)) (see Lemma B5) y = C Nv y Ex na 2 (z)x n C Nvv y ( ) v y v

18 8 N XIA, Y QIN AND Z D BAI At last, we establish the upper bound for δ 3 By (4), Lemma A3, and the fact (40) we obtain β (z)=b (z) b (z)β (z)ξ (z), δ 3 C v y E{β (z)x n (A (z) EA (z))x n } = C v y E{b(z)β (z)ξ (z)x n(a (z) EA (z))x n } C v y E{ξ (z)x n(a (z) EA (z))x n } + C v y E{β (z)ξ 2 (z)x n (A (z) EA (z))x n } From C r inequality (see Loève (author?) [4]), we have δ 3 C v y ( II + II 2 + II 3 ), where II =E{ξ (z)x n (A (z) EA (z))x n}, II 2 =E{ξ (z)x n (A (z) (z))x n}, II 3 =E{β (z)ξ (z)x n E(A (z) (z))x n} It should be noted that E(ξ (z) A (z)) = 0, and r and A (z) are independent Then we have II =E[E{ξ (z)x n(a (z) EA (z))x n A (z)}]=0 By the results in (48) and (40), we have II 2 = Eξ (z)β (z)x na (z)r r A (z)x n b (z) Eξ (z)r A (z)x nx na (z)r + b (z) Eβ (z)ξ 2 (z)r A (z)x nx n A (z)r =:III +III 2, where ( III = b (z) Eξ (z) r A (z)x nx n A (z)r N x n A 2 n) (z)x C N 2 Etr(x n A 3 (z)x n)+e i = C ( ) N 2 v 2 v y v (A (z)x nx n A (z)) ii a ii

19 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 9 The above inequality follows from Lemmas B3 and A3 By Lemma A4 and the Cauchy Schwarz inequality, it holds that III 2 CE ξ 2 (z)r A (z)x nx n A (z)r C(E ξ (z) 4 ) /2 (E r A (z)x nx n A (z)r 2 ) /2 ( ) /2 C N 2 v 2 vy 2 N 2E x na 2 (z)x n 2 (see Lemmas A2 and B5) ( ) C N 2 v 2 v y v y v Hence, we have shown that C II 2 N 2 v 2 v y ( ) v y v Moreover, Lemmas A2 and A5, and the Cauchy Schwarz inequality lead us to the following: II 3 C(E ξ (z) 2 E x n(a (z) (z))x n 2 ) /2 ( ) C C N /2 v /2 vy /2 Nv v y v = C N 3/2 v 3/2 v /2 y Therefore, it follows that (4) ( v y v ) δ 3 C C ( II + II 2 + II 3 ) v y N 3/2 v 3/2 v y ( ) v y v As it has been shown in (47), (49) and (4), we conclude that δ C ( ) (42) Nvv y v y v 5 Proof of Theorem 6 From Lemma 22, and by replacing EH Sn (x) and Em H n (z) by H Sn (x) and m H n (z), respectively, we have E H p =:E H Sn (x) F yn (x) K E m H n (z) EmH n (z) du+k +K 2 v x >B EH Sn (x) F yn (x) dx Em H n (z) m y(z) du

20 20 N XIA, Y QIN AND Z D BAI +K 3 v sup F yn (x+t) F yn (x) dt x t <v K E m H n (z) EmH n (z) du+ H As the convergence rate of H has already been established in Theorem, we only focus on the convergence rate of E m H n (z) Em H n (z) By Lemma A6 and the Cauchy Schwarz inequality, it follows that E m H n (z) EmH n (z) (E mh n (z) EmH n (z) 2 ) /2 C ( Nv v y v Together with Lemma 3 that H Cv/v y, when v 2 v y O(N ), we have ( ) C E H Sn (x) F yn (x) +v Nv v y By choosing v=o(n /4 ), we obtain { O(N E H Sn (x) F yn (x) /4 a /2 ), when a N /4, O(N /8 ), otherwise The proof of Theorem 6 is complete 6 Proof of Theorem 8 Notice that the proof of Theorem 8 is almost the same as that of Theorem 6 By Lemma 22, choosing v=o(n /4 ), H Sn F yn By Lemma A6, we have ) m H n (z) Em H n (z) du+cv/v y E m H n (z) Em H n (z) 2l CN l v 2l ( v y v When a<n /4, with v=o(n /4 ), N 2l(/8 η) E m H n (z) EmH n (z) 2l CN 2lη, which implies that if we choose an l such that 2lη>, m H n (z) EmH n (z) du=o as(n /8+η ) ) 2l When a N /4, in this case, by choosing v=o(n /4 ), we have a l N 2l(/4 η) E m H n (z) Em H n (z) 2l CN 2lη Theorem 8 then follows by setting l > 2η This completes the proof of Theorem 8

21 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 2 APPENDIX A In this section, we establish some lemmas which are used in the proofs of the main theorems Lemma A Under the conditions of Theorem 6, for all z <A and v 2 v y C 0 N, we have C Em n (z) m y (z) Nv 3/2 vy 2, where C 0 is a constant and v y = y n + v Proof Since Em n (z)= n Etr(S n zi n ) (A) where s kk = N = n = n n E s kk z N 2 α k (S nk zi n ) α k k= n E ǫ+ y n z y n zem n (z) k= = z+y n +y n zem n (z) +δ n, N X k 2, = S nk = N X (k)x (k), α k =X (k) X k, ǫ k =(s kk )+y n +y n zem n (z) N 2α k (S nk zi n ) α k, δ n = n b n Eβ k ǫ k, n k= b n =b n (z)= β k =β k (z)= z+y n +y n zem n (z), z+y n +y n zem n (z) ǫ k,

22 22 N XIA, Y QIN AND Z D BAI where X (k) is the (n ) N matrix obtained from X with its kth row removed and X k is the kth row of X It has proved that one of the roots of equation (A) is (see (37) in [3]) Em n (z)= 2y n z (z+y n y n zδ n (z+y n +y n zδ n ) 2 4y n z) The Stieltes transform of the Marčenko Pastur distribution with index y is given by (see (23) in [3]) Thus, Em n (z) m y (z) δ [ n + 2 Let us define by convention R( z)= m y (z)= y n+z (+y n z) 2 4y n 2y n z 2(z+y n ) y n zδ n (z+y n ) 2 4y n z+ (z+y n +y n zδ n ) 2 4y n z I(z) 2( z R(z)), I( z)= I(z) 2( z +R(z)) If u y n 5(A+), then the real parts of (z+y n ) 2 4y n z and (z+yn +y n zδ n ) 2 4y n z have the same sign Since they both have positive imaginary parts, it follows that (z+y n ) 2 4y n z+ (z+y n +y n zδ n ) 2 4y n z I((z+y n ) 2 4y n z) = ( ) 2v /2 2v(u y n ) 5(A+) Thus, (A2) Em n (z) m y (z) δ n (+ C ) v C δ n 2 v If u y n < 5(A+), we have Em n(z) m y (z) C δ n In [7] [see the inequality above (836)], we have δ n C Nv 3 Combined with (A2), we get The proof of the lemma is complete ( + v ) 2 C v y Nvvy 2 C Em n (z) m y (z) Nv 3/2 vy 2 ]

23 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 23 In addition, the following relevant result which is involved in the proof of Lemma 3 is presented here: (A3) Em n (z) m y (z) du = Em n (z) m y (z) du u y n /(5(A+)), u A + Em n (z) m y (z) du Cv u y n </(5(A+)), u A Lemma A2 Under the conditions of Theorem 6, for v 2 v y C 0 N and l 3, there exists a constant C, such that E ξ (z) 2l C N l v l vy l Proof By the C r -inequality (see Loève (author?) [4]), it follows that ( E ξ (z) 2l C E N tra (z) 2l ) N EtrA (z) +E ˆξ (z) 2l =:I +I 2 From Lemmas B6, B8, B0 and the C r -inequality with v 2 v y C 0 N, we have I E N tra (z) 2l N tra (z) +E N tra (z) 2l N EtrA (z) + N EtrA (z) 2l N EtrA (z) {( ) 2l C + ( Nv N 2l v 4l + v ) l ( ) 2l } + v y Nv C N 2l v 3l vy l Under finite 8th moment assumption, for l 2 we have E X 4l =E{ X 8 X 4l 8 I( X η N N /4 )} CN l 2 It can be shown that B zi n =A, (B zi n ) /v, and tr((b zi n ) (B zi n ) )=v I(tr(B zi n ) )

24 24 N XIA, Y QIN AND Z D BAI Hence, by Lemma B5, I 2 = N 2lE X A (z)x tra (z) 2l C N 2lE{CNl 2 tr(a (z)a ( z))l +(Ctr(A (z)a ( z)))l } = C N 2lE{CNl 2 v 2l+ I(tr(B zi n ) ) +C l v l (I(tr(B zi n ) )) l } = C N 2l{CNl v 2l+ E(I(m F B (z))) +C l N l v l E(I(m F B (z))) l } C N l+ v 2l + C v y N l v l vy l The last inequality is due to Lemmas B6, B7, B8 and E(I(m F B (z))) l E m F B (z) m n (z) l +E m n (z) Em n (z) l + Em n (z) m y (z) l + m y (z) l Cv l y Here let = EF Sn F yn, by integration by parts and Lemma B0, we have (A4) Em n (z) m y (z) C v C v y Therefore, for l 3 and v 2 v y C 0 N, it follows that This completes the proof E ξ (z) l I +I 2 C N l v l vy l Lemma A3 Under the conditions of Theorem 6, for all z <A and v 2 v y C 0 N, we have b (z) C Proof From (23) in [3], we have m y (z)= y n z+ ( y n z) 2 4y n z, 2y n z

25 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 25 wherethesquarerootofacomplexnumberisdefinedastheonewithpositive imaginary part It then can be verified that b 0 (z)=: +y n m y (z) =+ 2 (z y n (z y n ) 2 4y n ) = + y n m semi ( z yn yn ), where m semi denotes the Stieltes transform of the semicircular law As m semi, we conclude that (A5) b 0 (z) + y n By the relationship between b 0 and b, we have b (z)= b 0 (z) +y n b 0 (z)((/n)etra (z) m y(z)) When C 0 is chosen large enough, by Lemma A, for all large N we have n EtrA (z) m y(z) n Etr(A (z) (z)) + Em n (z) m y (z) and consequently we obtain Thus, the proof is complete nv + 3(+ y n ) 2 3(+ y n ) b (z) 3(+ y n ) C Lemma A4 If b (z) C, then for any fixed t>0, P( β (z) >2C)=o(N t ) Proof Note that if b (z)ξ (z) /2, by Lemma A3, we get β (z) = b (z) +b (z)ξ (z) b (z) b (z)ξ (z) 2 b (z) 2C As a result, ( P( β (z) >2C) P b (z)ξ (z) > ) 2 ( P ξ (z) > ) 2C (2C) p E ξ (z) p (see Lemma A3)

26 26 N XIA, Y QIN AND Z D BAI By the C r -inequality, Lemmas B8 and B9, for some η = η N N /4 and p logn, we have E ξ (z) p =E ˆξ (z) p +E N tra (z) p N EtrA (z) C(Nη 4 NN ) (v η 2 NN /2 ) p + Cη 2p 4 N Cη p N C N p v 3p/2 v p/2 y For any fixed t>0, when N is large enough so that logη N >t+, it can be shown that We finish the proof E ξ (z) p Ce plogη N Ce p(t+) Ce (t+)logn =CN t =o(n t ) Lemma A5 If v 2 v y C 0 N, C 0 is a large constant For l, it holds that Proof Recall that E m H B (z) 2l CE m H n (z) 2l A (z) (z)=β (z)a (z)r r A (z) By Lemmas A4 and B5, it holds that E m H B (z) m H n (z) 2l =E x n(a (z) (z))x n 2l =E x n β (z)a (z)r r A (z)x n 2l =E x nβ (z)a (z)r r A (z)x n 2l I( β (z) C) +E x n β (z)a (z)r r A (z)x n 2l I( β (z) >C) C N 2lE X A (z)x nx na (z)x 2l +o(n t ) C N 2lE X A (z)x nx na (z)x x na 2 (z)x n 2l + C N 2lE x n A 2 (z)x n 2l

27 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 27 C N 2l[E(trA (z)x nx n A (z)a ( z)x nx n A ( z))l +N l 2 Etr(A (z)x nx n A (z)a ( z)x nx n A ( z))l ] + C N 2lE x na 2 (z)x n 2l C N l+2 v 2lE m H B (z) 2l The last step follows the fact that x na (z)a ( z)x n =v I(x na (z)x n)=v I(m H B (z)) For v 2 v y C 0 N, which implies that v C 0 N /2 Choose C 0 and N C large enough, such that N l+2 v 2l 2 Further, by C r-inequality, we obtain E m H B (z) 2l CE m H B (z) m H n (z) 2l +CE m H n (z) 2l 2 E m H B (z) 2l +CE m H n (z) 2l That is, E m H B (z) 2l CE m H n (z) 2l, for some constant C This finishes the proof Lemma A6 Under the conditions of Theorem 6, for v 2 v y C 0 N, we have E m H n (z) EmH n (z) 2l C ( ) 2l N l v 2l v y v Proof Write E ( ) as the conditional expectation given {r,,r } It can then be shown that m H n (z) EmH n (z)= N = γ, where γ =:E (x na (z)x n ) E (x na (z)x n ) = (E E ){x n(a (z) (z))x n } = (E E ){β (z)x na (z)r r A (z)x n } Therefore, by Lemmas B4(b), we have ( N ) l N E m H n (z) Em H n (z) 2l CE E γ 2 +C E γ 2l Using Lemmas A4 and B5, we have = = E =E (E E )β (z)x n A (z)r r A (z)x n 2 C N 2E X A (z)x n x n A (z)x x n A 2 (z)x n 2

28 28 N XIA, Y QIN AND Z D BAI + C N 2E x n A 2 (z)x n 2 C N 2E tr(a (z)x n x n A (z)a ( z)x n x n A ( z)) + C N 2E x na 2 (z)x n 2 By the fact that x na (z)a ( z)x n =v I(x na (z)x n ) and A (z) v, we have On the other side, E γ 2 C N 2 v 2E m H B (z) 2 E γ 2l = N 2lE X A (z)x nx na (z)x 2l Thus, we obtain C N 2lE X A (z)x nx na (z)x x na 2 (z)x n 2l + C N 2lE x na 2 (z)x n 2l C Nl 2 N2l v 2l E(I(x na (z)x n)) 2l + C N 2lE x na (z)x n 2l C N l+2 v 2lE m H B (z) 2l E m H n (z) Em H n (z) 2l C N l v 2lE m H B (z) 2l C + N l+ v 2lE m H B (z) 2l Further C N l v 2lE mh n (z) 2l (by Lemma A5) E m H n (z) 2l E m H n (z) Em H n (z) 2l + Em H n (z) m y (z) 2l + m y (z) 2l For v C 0 N /2 C, choose C 0 large enough, such that integration by parts, it is easy to find that (A6) Em H n (z) m y (z) C H, v where H = EH Sn F yn Besides, from Lemma B7, we know that m y (z) C v y N l v 2l 2 And using

29 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 29 Therefore, we obtain E m H n (z) Em H n (z) 2l C ( ) 2l N l v 2l v y v The proof is then complete Lemma A7 E α (z) 2 C ( ) N 2 v v y v Proof Lemma B5 implies that E α (z) 2 = N 2E X A (z)x nx n X x n A (z)x n 2 C N 2E(x na ( z)a (z)x n) C N 2 v Em H B (z) Using Lemmas A5 and A6 and integration by parts, we have E m H B (z) 2 E m H n (z) 2 This finishes the proof E m H n (z) EmH n (z) 2 + Em H n (z) m y(z) 2 + m y (z) 2 ( ) 2 C v y v Lemma A8 Under the conditions in Theorem, for any fixed t>0, B EH Sn (x) F yn (x) dx=o(n t ) Proof For any fixed t>0, by Lemma B2, it follows that P(λ max (S n ) B+x) CN t (B+x ε) 2 and EH Sn (x) F yn (x) dx B = B B ( EH Sn (x))dx ( ) N y i 2 P(λ i x) dx i=

30 30 N XIA, Y QIN AND Z D BAI The proof is complete B B ( N ) N y i 2 y i 2 P(λ max x) dx i= i= N t (B+x ε) 2 dx=o(n t ) APPENDIX B In what follows, we will present some existing results which are of substantial importance in proving the main theorems Lemma B (Theorem 22 in [2]) Let F be a distribution function and let G be a function of bounded variation satisfying F(x) G(x) dx< Denote their Stieltes transforms by f(z) and g(z), respectively Then F G =sup F(x) G(x) x π( κ)(2γ ) ( f(z) g(z) du+ 2π v + v sup x y 2vτ x >B F(x) G(x) dx ) G(x+y) G(x) dy, where z =u+iv is a complex variable, γ, κ, τ, A and B are positive constants such that A>B, and κ= γ = π 4B π(a B)(2γ ) < u <τ u 2 + du> 2 Lemma B2 (Lemma 85 in [7]) For any v>0, we have sup F yn (x+u) F yn (x) du< 2(+y n ) v 2 /v y, x 3πy n u <v where F yn is the cdf of the Marčenko Pastur distribution with index y n, and v y = y n + v

31 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 3 Lemma B3 ((5) in [6]) Let A=(a i ) n n and B=(b i ) n n be two nonrandom matrices Let X = (X,,X n ) be a random vector of independent complex entries Assume that EX i = 0 and E X i 2 = Then we have E(X AX tra)(x BX trb) n = (E X i 4 EXi 2 2 2)a ii b ii + EXi 2 2 trab T +trab i= Lemma B4 (Burkholder inequalities (Lemmas 2 and 22 in [5])) Let {X k } be a complex martingale difference sequence with respect to the increasing σ-field {F k }, and let E k denote the conditional expectation with respect to F k Then we have: (a) for p>, ( n n ) p/2, E X k p K p E X k 2 k= k= (b) for p 2, n p ( n ) p/2+ E X k K p (E E k X k 2 E k= k= where K p is a constant which depends on p only n X k ), p Lemma B5 (Lemma 27 in [5]) Let A=(a i ) be an n n nonrandom matrix and X=(X,,X n ) be random vector of independent complex entries Assume that EX i =0, E X i 2 = and E X i l V l Then for any p 2, k= E X AX tra p K p ((V 4 tr(aa )) p/2 +V 2p tr(aa ) p/2 ), where K p is a constant depending on p only Lemma B6 (Lemma 26 in [24]) Let z C + with v =I(z), A and B n n with B Hermitian, τ R, and q C n Then tr((b zi n ) (B+τqq zi n ) )A A v, where A denotes spectral norm on matrices Lemma B7 ((849) in[7]) For the Stieltes transform of the Marčenko Pastur distribution, we have 2 m y (z), yvy where v y = a+ v= y n + v

32 32 N XIA, Y QIN AND Z D BAI Lemma B8 (Lemma 820 in [7]) If z <A, v 2 v y C 0 N and l, then ( E m n (z) Em n (z) 2l C N 2l v 4l yn 2l + v ) l, v y where A is a positive constant, v y = y n + v and := EF Sn F yn Lemma B9 (Lemma 9 in [7]) Suppose that X i, i=,,n, are independent, with EX i =0, E X i 2 =, supe X i 4 =ν < and X i η n with η>0 Assume that A is a complex matrix Then for any given p such that 2 p blog(nν η 4 ) and b>, we have where α=(x,,x n ) T E α Aα tr(a) p νn p (nη 4 ) (40b 2 A η 2 ) p, Lemma B0 (Theorem 80 in [7]) Let S n = XX /N, where X = (X i (n)) n N Assume that the following conditions hold: () For each n, X i (n) are independent, (2) EX i (n)=0, E X i (n) 2 =, for all i,, (3) sup n sup i, E X i (n) 6 < Then we have { O(N =: EF Sn F yn = /2 a ), if a>n /3, O(N /6 ), otherwise, where y n =n/n and a is defined in the Marčenko Pastur distribution Lemma B (Theorem 5 in [7]) Assume that the entries of {X i } is a double array of iid complex random variables with mean zero, variance σ 2 and finite 4th moment Let X=(X i ) n N be the n N matrix of the upperleft corner of the double array If n/n y (0,), then, with probability one, we have and lim λ min(s n )=σ 2 ( y) 2 n lim λ max(s n )=σ 2 (+ y) 2 n Lemma B2 (Theorem 59 in[7]) Suppose that the entries of the matrix X=(X i ) n N are independent (not necessarily identically distributed) and satisfy: () EX i =0, (2) X i Nδ N,

33 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 33 (3) max i E X i 2 σ 2 0 as N and (4) E X i l b( Nδ N ) l 3 for all l 3, where δ N 0 and b > 0 Let S n =XX /N Then, for any x>ǫ>0, n/n y, and fixed integer l 2, we have P(λ max (S n ) σ 2 (+ y) 2 +x) CN l (σ 2 (+ y) 2 +x ǫ) l for some constant C >0 APPENDIX C Notethat thedatamatrix X=(X i ) n N consists ofiid complex random variables withmean0andvariance Inwhatfollows, wewillfurtherassume that every X i is bounded by η N N /4 for some carefully selected η N The proofs presented in the following three steps ointly ustify such a convenient assumption C Truncation for Theorem Choose η N 0 and η N N /4 as N such that lim N η 0 N E X 0 I( X >η N N /4 )=0 Let X n denote the truncated data matrix whose entry on the ith row and th column is X i I( X i η N N /4 ), i=,,n, =,,N Define Ŝn = X n X n/n Then P(S n Ŝn) nnp( X i >η N N /4 ) nn 3/2 η 0 N E X 0 I( X >η N N /4 ) =o(n /2 ) C2 Truncation for Theorems 6 and 8 Choose η N 0 and η N N /4 as N such that (C) lim N η 8 N E X 8 I( X >η N N /4 )=0 Let X n denote the truncated data matrix whose entry on the ith row and th column is X i I( X i η N N /4 ), i=,,n, =,,N Define Ŝn = X n X n /N Then ( ) P(S n Ŝn, io)= lim P n N X i >η N N /4 k N=ki==

34 34 N XIA, Y QIN AND Z D BAI ( ) = lim P n N X i >η N N /4 k t=kn [2 t,2 t+ ) i== ( (yn+)2 ) t+ 2 t+ lim P X i >η 2 t2 t/4 k t=k i= = C lim (2 t+ ) 2 P( X >η 2 t2 t/4 ) k t=k C lim 4 t P(η 2 l2 l/4 < X η 2 l+2 (l+)/4 ) k t=k l=t l =C lim 4 t P(η 2 l2 l/4 < X η 2 l+2 (l+)/4 ) k l=k t=k lim Cη 8 E X k 2 l 8 I(η 2 l2 l/4 < X η 2 l+2 (l+)/4 ) l=k =0 The last equality is due to (C) C3 Centralization The centralization procedures for three theorems are identical, only 8th moment is required and thus we treat them uniformly Let X n denote the centralized version of X n More explicitly, on the ith row and th column of X n, the entry is X i I( X i η N N /4 ) E(X i I( X i η N N /4 )) Notice that according to Theorem 3 of [26], (S n zi n ) is bounded by /v, where denotes the spectral norm for a matrix Define S n = X n X n /N Suppose that v C 0 N /2, we obtain m H Ŝn (z) m H Sn (z) = x n(ŝn zi n ) x n x n( S n zi n ) x n (Ŝn zi n ) Ŝn S n ( S n zi n ) v 2 Ŝn S n Nv 2( X n X n X n + X n X n X n ) by Lemma B2 C Nv 2 X n X n as

35 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 35 = C Nv 2 E{X I( X η N N /4 )} n N C Nv 2 η 7 N N 7/4 E( X 8 I( X >η N N /4 )) =o(n /4 ) To establish both the weak and the strong convergence rates of the VESD to the Marčenko Pastur distribution, this o(n /4 ) suffices Moreover, for the convergence rate presented in Theorem, we shall prove the following Let m y (z) denotes the Stieltes transform of the Marčenko Pastur distribution, thus m y (z) is bounded by 2 yn( y n+ v) C v in Lemma B7 Then Em H n (z) Em H n (z) m y (z) + m y (z) C m y (z) C v Besides x n (Ŝn zi n ) x n can be considered as a Stieltes transform of some VESD function So, we have Thus, E x n (Ŝn zi n ) 2 =v Ex n (Ŝn zi n ) x n C v E m H Ŝn (z) m H Sn (z) E Ŝn S n x n (Ŝn zi n ) ( S n zi n ) x n C Nη 7 N N 7/4 E( X 8 I( X η N N /4 )) (E x n(ŝn zi) 2 ) /2 (E ( S n zi) x n 2 ) /2 C Nv 3/2 η 7 N N 7/4 E( X 8 I( X η N N /4 )) o(n /2 ) C4 Rescaling The rescaling procedures for the three theorems are exactly the same, and only 8th moment is required Thus, we treat them uniformly Write Y n = X n /σ, where σ 2 =E X I( X η N N /4 ) E(X I( X η N N /4 )) 2 Notice that σ tends to as N goes to Define G n =Y n Y n/n, which is the sample covariance matrix of Y n We shall show that G n and S n are asymptotically equivalent, that is, the VESD of G n and S n have the same limit if either one limit exists For v C 0 N /2, m H Gn(z) m H Sn (z) = x n( S n zi n ) ( S n G n )(G n zi n ) x n

36 36 N XIA, Y QIN AND Z D BAI v 2 ( σ ) S n (see Lemma B2) C v 2( σ2 ) as Cv 2 η 6 N N 3/2 E( X 8 I( X >η N N /4 )) o(n /2 ) Hence, we shall without loss of generality assume that every X i is bounded by η N N /4, and every X i has mean 0 and variance as REFERENCES [] Anderson, T W (963) Asymptotic theory for principal component analysis Ann Inst Statist Math MR [2] Bai, Z D (993) Convergence rate of expected spectral distributions of large random matrices I Wigner matrices Ann Probab MR27559 [3] Bai, Z D (993) Convergence rate of expected spectral distributions of large random matrices II Sample covariance matrices Ann Probab MR27560 [4] Bai, Z D, Miao, B Q and Pan, G M (2007) On asymptotics of eigenvectors of large sample covariance matrix Ann Probab MR [5] Bai, Z D and Silverstein, J W (998) No eigenvalues outside the support of the limiting spectral distribution of large-dimensional sample covariance matrices Ann Probab MR6705 [6] Bai, Z D and Silverstein, J W (2004) CLT for linear spectral statistics of largedimensional sample covariance matrices Ann Probab MR [7] Bai, Z D and Silverstein, J W (200) Spectral Analysis of Large Dimensional Random Matrices, 2nd ed Springer, New York MR [8] Bai, Z D and Xia, N (203) Functional CLT of eigenvectors for large sample covariance matrices Statist Papers To appear [9] Cai, T, Ma, Z M and Wu, Y H (203) Sparse PCA: Optimal rates and adaptive estimation Available at arxiv:2309 [0] Erdős, L, Schlein, B and Yau, H-T (2009) Semicircle law on short scales and delocalization of eigenvectors for Wigner random matrices Ann Probab MR [] Götze, F and Tikhomirov, A (2004) Rate of convergence in probability to the Marchenko Pastur law Bernoulli MR [2] Johnstone, I M (200) On the distribution of the largest eigenvalue in principal components analysis Ann Statist MR86396 [3] Knowles, A and Yin, J (203) Eigenvector distribution of Wigner matrices Probab Theory Related Fields MR [4] Loève, M (977) Probability Theory I, 4th ed Springer, New York Graduate Texts in Mathematics 45 MR06507 [5] Ma, Z M (203) Sparse principal component analysis and iterative thresholding Ann Statist [6] Marčenko, V A and Pastur, L A (967) Distribution of eigenvalues for some sets of random matrices Math USSR-Sb

37 ASYMPTOTICS OF EIGENVECTORS AND EIGENVALUES 37 [7] Paul, D (2007) Asymptotics of sample eigenstructure for a large dimensional spiked covariance model Statist Sinica MR [8] Pillai, N S and Yin, J (203) Universality of covariance matrices Available at arxiv:0250v6 [9] Silverstein, J W (98) Describing the behavior of eigenvectors of random matrices using sequences of measures on orthogonal groups SIAM J Math Anal MR [20] Silverstein, J W (984) Some limit theorems on the eigenvectors of largedimensional sample covariance matrices J Multivariate Anal MR [2] Silverstein, J W (989) On the eigenvectors of large-dimensional sample covariance matrices J Multivariate Anal 30 6 MR [22] Silverstein, J W (990) Weak convergence of random functions defined by the eigenvectors of sample covariance matrices Ann Probab MR [23] Silverstein, J W (995) Strong convergence of the empirical distribution of eigenvalues of large-dimensional random matrices J Multivariate Anal MR [24] Silverstein, J W and Bai, Z D (995) On the empirical distribution of eigenvalues of a class of large-dimensional random matrices J Multivariate Anal MR [25] Tao, T and Vu, V (20) Universal properties of eigenvectors Available at arxiv:03280v2 [26] Yin, Y Q, Bai, Z D and Krishnaiah, P R (988) On the limit of the largest eigenvalue of the large-dimensional sample covariance matrix Probab Theory Related Fields MR N Xia Z D Bai KLAS and School of Mathematics and Statistics Northeast Normal University Changchun China and Department of Statistics and Applied Probability National University of Singapore Singapore 7546 Singapore xiann664@gmailcom baizd@nenueducn Y Qin Department of Statistics and Actuarial Science University of Waterloo Waterloo, ON N2L 3G Canada yingliqin@uwaterlooca

arxiv: v1 [math.pr] 13 Aug 2007

arxiv: v1 [math.pr] 13 Aug 2007 The Annals of Probability 2007, Vol. 35, No. 4, 532 572 DOI: 0.24/009790600000079 c Institute of Mathematical Statistics, 2007 arxiv:0708.720v [math.pr] 3 Aug 2007 ON ASYMPTOTICS OF EIGENVECTORS OF LARGE

More information

Large sample covariance matrices and the T 2 statistic

Large sample covariance matrices and the T 2 statistic Large sample covariance matrices and the T 2 statistic EURANDOM, the Netherlands Joint work with W. Zhou Outline 1 2 Basic setting Let {X ij }, i, j =, be i.i.d. r.v. Write n s j = (X 1j,, X pj ) T and

More information

A new method to bound rate of convergence

A new method to bound rate of convergence A new method to bound rate of convergence Arup Bose Indian Statistical Institute, Kolkata, abose@isical.ac.in Sourav Chatterjee University of California, Berkeley Empirical Spectral Distribution Distribution

More information

Comparison Method in Random Matrix Theory

Comparison Method in Random Matrix Theory Comparison Method in Random Matrix Theory Jun Yin UW-Madison Valparaíso, Chile, July - 2015 Joint work with A. Knowles. 1 Some random matrices Wigner Matrix: H is N N square matrix, H : H ij = H ji, EH

More information

Isotropic local laws for random matrices

Isotropic local laws for random matrices Isotropic local laws for random matrices Antti Knowles University of Geneva With Y. He and R. Rosenthal Random matrices Let H C N N be a large Hermitian random matrix, normalized so that H. Some motivations:

More information

Non white sample covariance matrices.

Non white sample covariance matrices. Non white sample covariance matrices. S. Péché, Université Grenoble 1, joint work with O. Ledoit, Uni. Zurich 17-21/05/2010, Université Marne la Vallée Workshop Probability and Geometry in High Dimensions

More information

Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices

Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices by Jack W. Silverstein* Department of Mathematics Box 8205 North Carolina State University Raleigh,

More information

arxiv: v3 [math-ph] 21 Jun 2012

arxiv: v3 [math-ph] 21 Jun 2012 LOCAL MARCHKO-PASTUR LAW AT TH HARD DG OF SAMPL COVARIAC MATRICS CLAUDIO CACCIAPUOTI, AA MALTSV, AD BJAMI SCHLI arxiv:206.730v3 [math-ph] 2 Jun 202 Abstract. Let X be a matrix whose entries are i.i.d.

More information

Applications of random matrix theory to principal component analysis(pca)

Applications of random matrix theory to principal component analysis(pca) Applications of random matrix theory to principal component analysis(pca) Jun Yin IAS, UW-Madison IAS, April-2014 Joint work with A. Knowles and H. T Yau. 1 Basic picture: Let H be a Wigner (symmetric)

More information

arxiv: v5 [math.na] 16 Nov 2017

arxiv: v5 [math.na] 16 Nov 2017 RANDOM PERTURBATION OF LOW RANK MATRICES: IMPROVING CLASSICAL BOUNDS arxiv:3.657v5 [math.na] 6 Nov 07 SEAN O ROURKE, VAN VU, AND KE WANG Abstract. Matrix perturbation inequalities, such as Weyl s theorem

More information

On corrections of classical multivariate tests for high-dimensional data

On corrections of classical multivariate tests for high-dimensional data On corrections of classical multivariate tests for high-dimensional data Jian-feng Yao with Zhidong Bai, Dandan Jiang, Shurong Zheng Overview Introduction High-dimensional data and new challenge in statistics

More information

On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime. Title

On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime. Title itle On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime Authors Wang, Q; Yao, JJ Citation Random Matrices: heory and Applications, 205, v. 4, p. article no.

More information

Convergence of empirical spectral distributions of large dimensional quaternion sample covariance matrices

Convergence of empirical spectral distributions of large dimensional quaternion sample covariance matrices Ann Inst Stat Math 206 68:765 785 DOI 0007/s0463-05-054-0 Convergence of empirical spectral distributions of large dimensional quaternion sample covariance matrices Huiqin Li Zhi Dong Bai Jiang Hu Received:

More information

Invertibility of random matrices

Invertibility of random matrices University of Michigan February 2011, Princeton University Origins of Random Matrix Theory Statistics (Wishart matrices) PCA of a multivariate Gaussian distribution. [Gaël Varoquaux s blog gael-varoquaux.info]

More information

Small Ball Probability, Arithmetic Structure and Random Matrices

Small Ball Probability, Arithmetic Structure and Random Matrices Small Ball Probability, Arithmetic Structure and Random Matrices Roman Vershynin University of California, Davis April 23, 2008 Distance Problems How far is a random vector X from a given subspace H in

More information

Local semicircle law, Wegner estimate and level repulsion for Wigner random matrices

Local semicircle law, Wegner estimate and level repulsion for Wigner random matrices Local semicircle law, Wegner estimate and level repulsion for Wigner random matrices László Erdős University of Munich Oberwolfach, 2008 Dec Joint work with H.T. Yau (Harvard), B. Schlein (Cambrigde) Goal:

More information

Eigenvalue variance bounds for Wigner and covariance random matrices

Eigenvalue variance bounds for Wigner and covariance random matrices Eigenvalue variance bounds for Wigner and covariance random matrices S. Dallaporta University of Toulouse, France Abstract. This work is concerned with finite range bounds on the variance of individual

More information

Convergence rates of spectral distributions of large sample covariance matrices

Convergence rates of spectral distributions of large sample covariance matrices Title Convergence rates of spectral distributions of large sample covariance matrices Authors) Bai, ZD; Miao, B; Yao, JF Citation SIAM Journal On Matrix Analysis And Applications, 2004, v. 25 n., p. 05-27

More information

Random matrices: Distribution of the least singular value (via Property Testing)

Random matrices: Distribution of the least singular value (via Property Testing) Random matrices: Distribution of the least singular value (via Property Testing) Van H. Vu Department of Mathematics Rutgers vanvu@math.rutgers.edu (joint work with T. Tao, UCLA) 1 Let ξ be a real or complex-valued

More information

arxiv: v1 [math.pr] 13 Dec 2013

arxiv: v1 [math.pr] 13 Dec 2013 arxiv:131.3747v1 [math.pr] 13 Dec 013 CONVERGENCE RATES OF THE SPECTRAL DISTRIBUTIONS OF LARGE RANDOM QUATERNION SELF-DUAL HERMITIAN MATRICES YANQING YIN, ZHIDONG BAI Abstract. In this paper, convergence

More information

Semicircle law on short scales and delocalization for Wigner random matrices

Semicircle law on short scales and delocalization for Wigner random matrices Semicircle law on short scales and delocalization for Wigner random matrices László Erdős University of Munich Weizmann Institute, December 2007 Joint work with H.T. Yau (Harvard), B. Schlein (Munich)

More information

On corrections of classical multivariate tests for high-dimensional data. Jian-feng. Yao Université de Rennes 1, IRMAR

On corrections of classical multivariate tests for high-dimensional data. Jian-feng. Yao Université de Rennes 1, IRMAR Introduction a two sample problem Marčenko-Pastur distributions and one-sample problems Random Fisher matrices and two-sample problems Testing cova On corrections of classical multivariate tests for high-dimensional

More information

Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction

Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction Random Matrix Theory and its applications to Statistics and Wireless Communications Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction Sergio Verdú Princeton University National

More information

The circular law. Lewis Memorial Lecture / DIMACS minicourse March 19, Terence Tao (UCLA)

The circular law. Lewis Memorial Lecture / DIMACS minicourse March 19, Terence Tao (UCLA) The circular law Lewis Memorial Lecture / DIMACS minicourse March 19, 2008 Terence Tao (UCLA) 1 Eigenvalue distributions Let M = (a ij ) 1 i n;1 j n be a square matrix. Then one has n (generalised) eigenvalues

More information

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws Symeon Chatzinotas February 11, 2013 Luxembourg Outline 1. Random Matrix Theory 1. Definition 2. Applications 3. Asymptotics 2. Ensembles

More information

Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix

Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix Taras Bodnar a, Holger Dette b and Nestor Parolya c a Department of Mathematics, Stockholm University, SE-069

More information

arxiv: v2 [math.pr] 16 Aug 2014

arxiv: v2 [math.pr] 16 Aug 2014 RANDOM WEIGHTED PROJECTIONS, RANDOM QUADRATIC FORMS AND RANDOM EIGENVECTORS VAN VU DEPARTMENT OF MATHEMATICS, YALE UNIVERSITY arxiv:306.3099v2 [math.pr] 6 Aug 204 KE WANG INSTITUTE FOR MATHEMATICS AND

More information

Local law of addition of random matrices

Local law of addition of random matrices Local law of addition of random matrices Kevin Schnelli IST Austria Joint work with Zhigang Bao and László Erdős Supported by ERC Advanced Grant RANMAT No. 338804 Spectrum of sum of random matrices Question:

More information

Random Matrices: Invertibility, Structure, and Applications

Random Matrices: Invertibility, Structure, and Applications Random Matrices: Invertibility, Structure, and Applications Roman Vershynin University of Michigan Colloquium, October 11, 2011 Roman Vershynin (University of Michigan) Random Matrices Colloquium 1 / 37

More information

Operator-Valued Free Probability Theory and Block Random Matrices. Roland Speicher Queen s University Kingston

Operator-Valued Free Probability Theory and Block Random Matrices. Roland Speicher Queen s University Kingston Operator-Valued Free Probability Theory and Block Random Matrices Roland Speicher Queen s University Kingston I. Operator-valued semicircular elements and block random matrices II. General operator-valued

More information

Random regular digraphs: singularity and spectrum

Random regular digraphs: singularity and spectrum Random regular digraphs: singularity and spectrum Nick Cook, UCLA Probability Seminar, Stanford University November 2, 2015 Universality Circular law Singularity probability Talk outline 1 Universality

More information

A Note on the Central Limit Theorem for the Eigenvalue Counting Function of Wigner and Covariance Matrices

A Note on the Central Limit Theorem for the Eigenvalue Counting Function of Wigner and Covariance Matrices A Note on the Central Limit Theorem for the Eigenvalue Counting Function of Wigner and Covariance Matrices S. Dallaporta University of Toulouse, France Abstract. This note presents some central limit theorems

More information

Stein s Method and Characteristic Functions

Stein s Method and Characteristic Functions Stein s Method and Characteristic Functions Alexander Tikhomirov Komi Science Center of Ural Division of RAS, Syktyvkar, Russia; Singapore, NUS, 18-29 May 2015 Workshop New Directions in Stein s method

More information

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535 Additive functionals of infinite-variance moving averages Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535 Departments of Statistics The University of Chicago Chicago, Illinois 60637 June

More information

Selected Exercises on Expectations and Some Probability Inequalities

Selected Exercises on Expectations and Some Probability Inequalities Selected Exercises on Expectations and Some Probability Inequalities # If E(X 2 ) = and E X a > 0, then P( X λa) ( λ) 2 a 2 for 0 < λ

More information

Wiener Measure and Brownian Motion

Wiener Measure and Brownian Motion Chapter 16 Wiener Measure and Brownian Motion Diffusion of particles is a product of their apparently random motion. The density u(t, x) of diffusing particles satisfies the diffusion equation (16.1) u

More information

Inhomogeneous circular laws for random matrices with non-identically distributed entries

Inhomogeneous circular laws for random matrices with non-identically distributed entries Inhomogeneous circular laws for random matrices with non-identically distributed entries Nick Cook with Walid Hachem (Telecom ParisTech), Jamal Najim (Paris-Est) and David Renfrew (SUNY Binghamton/IST

More information

High-dimensional two-sample tests under strongly spiked eigenvalue models

High-dimensional two-sample tests under strongly spiked eigenvalue models 1 High-dimensional two-sample tests under strongly spiked eigenvalue models Makoto Aoshima and Kazuyoshi Yata University of Tsukuba Abstract: We consider a new two-sample test for high-dimensional data

More information

1 Intro to RMT (Gene)

1 Intro to RMT (Gene) M705 Spring 2013 Summary for Week 2 1 Intro to RMT (Gene) (Also see the Anderson - Guionnet - Zeitouni book, pp.6-11(?) ) We start with two independent families of R.V.s, {Z i,j } 1 i

More information

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations Assessing the dependence of high-dimensional time series via sample autocovariances and correlations Johannes Heiny University of Aarhus Joint work with Thomas Mikosch (Copenhagen), Richard Davis (Columbia),

More information

Asymptotic Statistics-III. Changliang Zou

Asymptotic Statistics-III. Changliang Zou Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (

More information

Exponential tail inequalities for eigenvalues of random matrices

Exponential tail inequalities for eigenvalues of random matrices Exponential tail inequalities for eigenvalues of random matrices M. Ledoux Institut de Mathématiques de Toulouse, France exponential tail inequalities classical theme in probability and statistics quantify

More information

On the singular values of random matrices

On the singular values of random matrices On the singular values of random matrices Shahar Mendelson Grigoris Paouris Abstract We present an approach that allows one to bound the largest and smallest singular values of an N n random matrix with

More information

Asymptotic Statistics-VI. Changliang Zou

Asymptotic Statistics-VI. Changliang Zou Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous

More information

Preface to the Second Edition...vii Preface to the First Edition... ix

Preface to the Second Edition...vii Preface to the First Edition... ix Contents Preface to the Second Edition...vii Preface to the First Edition........... ix 1 Introduction.............................................. 1 1.1 Large Dimensional Data Analysis.........................

More information

On the principal components of sample covariance matrices

On the principal components of sample covariance matrices On the principal components of sample covariance matrices Alex Bloemendal Antti Knowles Horng-Tzer Yau Jun Yin February 4, 205 We introduce a class of M M sample covariance matrices Q which subsumes and

More information

The Matrix Dyson Equation in random matrix theory

The Matrix Dyson Equation in random matrix theory The Matrix Dyson Equation in random matrix theory László Erdős IST, Austria Mathematical Physics seminar University of Bristol, Feb 3, 207 Joint work with O. Ajanki, T. Krüger Partially supported by ERC

More information

Kernel Method: Data Analysis with Positive Definite Kernels

Kernel Method: Data Analysis with Positive Definite Kernels Kernel Method: Data Analysis with Positive Definite Kernels 2. Positive Definite Kernel and Reproducing Kernel Hilbert Space Kenji Fukumizu The Institute of Statistical Mathematics. Graduate University

More information

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Alea 4, 117 129 (2008) Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Anton Schick and Wolfgang Wefelmeyer Anton Schick, Department of Mathematical Sciences,

More information

Random Matrix: From Wigner to Quantum Chaos

Random Matrix: From Wigner to Quantum Chaos Random Matrix: From Wigner to Quantum Chaos Horng-Tzer Yau Harvard University Joint work with P. Bourgade, L. Erdős, B. Schlein and J. Yin 1 Perhaps I am now too courageous when I try to guess the distribution

More information

Random Bernstein-Markov factors

Random Bernstein-Markov factors Random Bernstein-Markov factors Igor Pritsker and Koushik Ramachandran October 20, 208 Abstract For a polynomial P n of degree n, Bernstein s inequality states that P n n P n for all L p norms on the unit

More information

Invertibility of symmetric random matrices

Invertibility of symmetric random matrices Invertibility of symmetric random matrices Roman Vershynin University of Michigan romanv@umich.edu February 1, 2011; last revised March 16, 2012 Abstract We study n n symmetric random matrices H, possibly

More information

arxiv: v4 [math.pr] 10 Sep 2018

arxiv: v4 [math.pr] 10 Sep 2018 Lectures on the local semicircle law for Wigner matrices arxiv:60.04055v4 [math.pr] 0 Sep 208 Florent Benaych-Georges Antti Knowles September, 208 These notes provide an introduction to the local semicircle

More information

On large deviations for combinatorial sums

On large deviations for combinatorial sums arxiv:1901.0444v1 [math.pr] 14 Jan 019 On large deviations for combinatorial sums Andrei N. Frolov Dept. of Mathematics and Mechanics St. Petersburg State University St. Petersburg, Russia E-mail address:

More information

Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations of High-Dimension, Low-Sample-Size Data

Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations of High-Dimension, Low-Sample-Size Data Sri Lankan Journal of Applied Statistics (Special Issue) Modern Statistical Methodologies in the Cutting Edge of Science Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations

More information

Eigenvectors of some large sample covariance matrix ensembles

Eigenvectors of some large sample covariance matrix ensembles Probab. Theory Relat. Fields DOI 0.007/s00440-00-0298-3 Eigenvectors of some large sample covariance matrix ensembles Olivier Ledoit Sandrine Péché Received: 24 March 2009 / Revised: 0 March 200 Springer-Verlag

More information

Anti-concentration Inequalities

Anti-concentration Inequalities Anti-concentration Inequalities Roman Vershynin Mark Rudelson University of California, Davis University of Missouri-Columbia Phenomena in High Dimensions Third Annual Conference Samos, Greece June 2007

More information

Local Kesten McKay law for random regular graphs

Local Kesten McKay law for random regular graphs Local Kesten McKay law for random regular graphs Roland Bauerschmidt (with Jiaoyang Huang and Horng-Tzer Yau) University of Cambridge Weizmann Institute, January 2017 Random regular graph G N,d is the

More information

Extreme inference in stationary time series

Extreme inference in stationary time series Extreme inference in stationary time series Moritz Jirak FOR 1735 February 8, 2013 1 / 30 Outline 1 Outline 2 Motivation The multivariate CLT Measuring discrepancies 3 Some theory and problems The problem

More information

Estimation of the Global Minimum Variance Portfolio in High Dimensions

Estimation of the Global Minimum Variance Portfolio in High Dimensions Estimation of the Global Minimum Variance Portfolio in High Dimensions Taras Bodnar, Nestor Parolya and Wolfgang Schmid 07.FEBRUARY 2014 1 / 25 Outline Introduction Random Matrix Theory: Preliminary Results

More information

ESTIMATION IN A SEMI-PARAMETRIC TWO-STAGE RENEWAL REGRESSION MODEL

ESTIMATION IN A SEMI-PARAMETRIC TWO-STAGE RENEWAL REGRESSION MODEL Statistica Sinica 19(29): Supplement S1-S1 ESTIMATION IN A SEMI-PARAMETRIC TWO-STAGE RENEWAL REGRESSION MODEL Dorota M. Dabrowska University of California, Los Angeles Supplementary Material This supplement

More information

Homework # , Spring Due 14 May Convergence of the empirical CDF, uniform samples

Homework # , Spring Due 14 May Convergence of the empirical CDF, uniform samples Homework #3 36-754, Spring 27 Due 14 May 27 1 Convergence of the empirical CDF, uniform samples In this problem and the next, X i are IID samples on the real line, with cumulative distribution function

More information

A NOTE ON THE COMPLETE MOMENT CONVERGENCE FOR ARRAYS OF B-VALUED RANDOM VARIABLES

A NOTE ON THE COMPLETE MOMENT CONVERGENCE FOR ARRAYS OF B-VALUED RANDOM VARIABLES Bull. Korean Math. Soc. 52 (205), No. 3, pp. 825 836 http://dx.doi.org/0.434/bkms.205.52.3.825 A NOTE ON THE COMPLETE MOMENT CONVERGENCE FOR ARRAYS OF B-VALUED RANDOM VARIABLES Yongfeng Wu and Mingzhu

More information

RANDOM MATRICES: LAW OF THE DETERMINANT

RANDOM MATRICES: LAW OF THE DETERMINANT RANDOM MATRICES: LAW OF THE DETERMINANT HOI H. NGUYEN AND VAN H. VU Abstract. Let A n be an n by n random matrix whose entries are independent real random variables with mean zero and variance one. We

More information

DISCUSSION OF INFLUENTIAL FEATURE PCA FOR HIGH DIMENSIONAL CLUSTERING. By T. Tony Cai and Linjun Zhang University of Pennsylvania

DISCUSSION OF INFLUENTIAL FEATURE PCA FOR HIGH DIMENSIONAL CLUSTERING. By T. Tony Cai and Linjun Zhang University of Pennsylvania Submitted to the Annals of Statistics DISCUSSION OF INFLUENTIAL FEATURE PCA FOR HIGH DIMENSIONAL CLUSTERING By T. Tony Cai and Linjun Zhang University of Pennsylvania We would like to congratulate the

More information

Common-Knowledge / Cheat Sheet

Common-Knowledge / Cheat Sheet CSE 521: Design and Analysis of Algorithms I Fall 2018 Common-Knowledge / Cheat Sheet 1 Randomized Algorithm Expectation: For a random variable X with domain, the discrete set S, E [X] = s S P [X = s]

More information

w T 1 w T 2. w T n 0 if i j 1 if i = j

w T 1 w T 2. w T n 0 if i j 1 if i = j Lyapunov Operator Let A F n n be given, and define a linear operator L A : C n n C n n as L A (X) := A X + XA Suppose A is diagonalizable (what follows can be generalized even if this is not possible -

More information

j=1 x j p, if 1 p <, x i ξ : x i < ξ} 0 as p.

j=1 x j p, if 1 p <, x i ξ : x i < ξ} 0 as p. LINEAR ALGEBRA Fall 203 The final exam Almost all of the problems solved Exercise Let (V, ) be a normed vector space. Prove x y x y for all x, y V. Everybody knows how to do this! Exercise 2 If V is a

More information

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012 Instructions Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012 The exam consists of four problems, each having multiple parts. You should attempt to solve all four problems. 1.

More information

Lectures 2 3 : Wigner s semicircle law

Lectures 2 3 : Wigner s semicircle law Fall 009 MATH 833 Random Matrices B. Való Lectures 3 : Wigner s semicircle law Notes prepared by: M. Koyama As we set up last wee, let M n = [X ij ] n i,j=1 be a symmetric n n matrix with Random entries

More information

Fluctuations of Random Matrices and Second Order Freeness

Fluctuations of Random Matrices and Second Order Freeness Fluctuations of Random Matrices and Second Order Freeness james mingo with b. collins p. śniady r. speicher SEA 06 Workshop Massachusetts Institute of Technology July 9-14, 2006 1 0.4 0.2 0-2 -1 0 1 2-2

More information

A note on a Marčenko-Pastur type theorem for time series. Jianfeng. Yao

A note on a Marčenko-Pastur type theorem for time series. Jianfeng. Yao A note on a Marčenko-Pastur type theorem for time series Jianfeng Yao Workshop on High-dimensional statistics The University of Hong Kong, October 2011 Overview 1 High-dimensional data and the sample covariance

More information

1. General Vector Spaces

1. General Vector Spaces 1.1. Vector space axioms. 1. General Vector Spaces Definition 1.1. Let V be a nonempty set of objects on which the operations of addition and scalar multiplication are defined. By addition we mean a rule

More information

Quantile Processes for Semi and Nonparametric Regression

Quantile Processes for Semi and Nonparametric Regression Quantile Processes for Semi and Nonparametric Regression Shih-Kang Chao Department of Statistics Purdue University IMS-APRM 2016 A joint work with Stanislav Volgushev and Guang Cheng Quantile Response

More information

Zeros of lacunary random polynomials

Zeros of lacunary random polynomials Zeros of lacunary random polynomials Igor E. Pritsker Dedicated to Norm Levenberg on his 60th birthday Abstract We study the asymptotic distribution of zeros for the lacunary random polynomials. It is

More information

Universality of local spectral statistics of random matrices

Universality of local spectral statistics of random matrices Universality of local spectral statistics of random matrices László Erdős Ludwig-Maximilians-Universität, Munich, Germany CRM, Montreal, Mar 19, 2012 Joint with P. Bourgade, B. Schlein, H.T. Yau, and J.

More information

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University PCA with random noise Van Ha Vu Department of Mathematics Yale University An important problem that appears in various areas of applied mathematics (in particular statistics, computer science and numerical

More information

Stat 206: Linear algebra

Stat 206: Linear algebra Stat 206: Linear algebra James Johndrow (adapted from Iain Johnstone s notes) 2016-11-02 Vectors We have already been working with vectors, but let s review a few more concepts. The inner product of two

More information

5.1 Consistency of least squares estimates. We begin with a few consistency results that stand on their own and do not depend on normality.

5.1 Consistency of least squares estimates. We begin with a few consistency results that stand on their own and do not depend on normality. 88 Chapter 5 Distribution Theory In this chapter, we summarize the distributions related to the normal distribution that occur in linear models. Before turning to this general problem that assumes normal

More information

2. Matrix Algebra and Random Vectors

2. Matrix Algebra and Random Vectors 2. Matrix Algebra and Random Vectors 2.1 Introduction Multivariate data can be conveniently display as array of numbers. In general, a rectangular array of numbers with, for instance, n rows and p columns

More information

Fluctuations from the Semicircle Law Lecture 4

Fluctuations from the Semicircle Law Lecture 4 Fluctuations from the Semicircle Law Lecture 4 Ioana Dumitriu University of Washington Women and Math, IAS 2014 May 23, 2014 Ioana Dumitriu (UW) Fluctuations from the Semicircle Law Lecture 4 May 23, 2014

More information

Chapter 3. Matrices. 3.1 Matrices

Chapter 3. Matrices. 3.1 Matrices 40 Chapter 3 Matrices 3.1 Matrices Definition 3.1 Matrix) A matrix A is a rectangular array of m n real numbers {a ij } written as a 11 a 12 a 1n a 21 a 22 a 2n A =.... a m1 a m2 a mn The array has m rows

More information

Brownian Motion and Conditional Probability

Brownian Motion and Conditional Probability Math 561: Theory of Probability (Spring 2018) Week 10 Brownian Motion and Conditional Probability 10.1 Standard Brownian Motion (SBM) Brownian motion is a stochastic process with both practical and theoretical

More information

An Introduction to Large Sample covariance matrices and Their Applications

An Introduction to Large Sample covariance matrices and Their Applications An Introduction to Large Sample covariance matrices and Their Applications (June 208) Jianfeng Yao Department of Statistics and Actuarial Science The University of Hong Kong. These notes are designed for

More information

On the distinguishability of random quantum states

On the distinguishability of random quantum states 1 1 Department of Computer Science University of Bristol Bristol, UK quant-ph/0607011 Distinguishing quantum states Distinguishing quantum states This talk Question Consider a known ensemble E of n quantum

More information

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces. Math 350 Fall 2011 Notes about inner product spaces In this notes we state and prove some important properties of inner product spaces. First, recall the dot product on R n : if x, y R n, say x = (x 1,...,

More information

EIGENVECTORS OF RANDOM MATRICES OF SYMMETRIC ENTRY DISTRIBUTIONS. 1. introduction

EIGENVECTORS OF RANDOM MATRICES OF SYMMETRIC ENTRY DISTRIBUTIONS. 1. introduction EIGENVECTORS OF RANDOM MATRICES OF SYMMETRIC ENTRY DISTRIBUTIONS SEAN MEEHAN AND HOI NGUYEN Abstract. In this short note we study a non-degeneration property of eigenvectors of symmetric random matrices

More information

Han-Ying Liang, Dong-Xia Zhang, and Jong-Il Baek

Han-Ying Liang, Dong-Xia Zhang, and Jong-Il Baek J. Korean Math. Soc. 41 (2004), No. 5, pp. 883 894 CONVERGENCE OF WEIGHTED SUMS FOR DEPENDENT RANDOM VARIABLES Han-Ying Liang, Dong-Xia Zhang, and Jong-Il Baek Abstract. We discuss in this paper the strong

More information

Mathematics Department Stanford University Math 61CM/DM Inner products

Mathematics Department Stanford University Math 61CM/DM Inner products Mathematics Department Stanford University Math 61CM/DM Inner products Recall the definition of an inner product space; see Appendix A.8 of the textbook. Definition 1 An inner product space V is a vector

More information

SPECTRAL ANALYSIS OF A SYMMETRIZED AUTO-CROSS COVARIANCE MATRIX

SPECTRAL ANALYSIS OF A SYMMETRIZED AUTO-CROSS COVARIANCE MATRIX SPECTRAL ANALYSIS OF A SYMMETRIZED AUTO-CROSS COVARIANCE MATRIX WANG CHEN NATIONAL UNIVERSITY OF SINGAPORE 203 SPECTRAL ANALYSIS OF A SYMMETRIZED AUTO-CROSS COVARIANCE MATRIX WANG CHEN (B.Sc.(Hons) National

More information

Karhunen-Loève decomposition of Gaussian measures on Banach spaces

Karhunen-Loève decomposition of Gaussian measures on Banach spaces Karhunen-Loève decomposition of Gaussian measures on Banach spaces Jean-Charles Croix jean-charles.croix@emse.fr Génie Mathématique et Industriel (GMI) First workshop on Gaussian processes at Saint-Etienne

More information

Review of Linear Algebra

Review of Linear Algebra Review of Linear Algebra Definitions An m n (read "m by n") matrix, is a rectangular array of entries, where m is the number of rows and n the number of columns. 2 Definitions (Con t) A is square if m=

More information

Self-normalized Cramér-Type Large Deviations for Independent Random Variables

Self-normalized Cramér-Type Large Deviations for Independent Random Variables Self-normalized Cramér-Type Large Deviations for Independent Random Variables Qi-Man Shao National University of Singapore and University of Oregon qmshao@darkwing.uoregon.edu 1. Introduction Let X, X

More information

Universality for random matrices and log-gases

Universality for random matrices and log-gases Universality for random matrices and log-gases László Erdős IST, Austria Ludwig-Maximilians-Universität, Munich, Germany Encounters Between Discrete and Continuous Mathematics Eötvös Loránd University,

More information

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that Lecture 28 28.1 Kolmogorov-Smirnov test. Suppose that we have an i.i.d. sample X 1,..., X n with some unknown distribution and we would like to test the hypothesis that is equal to a particular distribution

More information

Almost Sure Convergence of the General Jamison Weighted Sum of B-Valued Random Variables

Almost Sure Convergence of the General Jamison Weighted Sum of B-Valued Random Variables Acta Mathematica Sinica, English Series Feb., 24, Vol.2, No., pp. 8 92 Almost Sure Convergence of the General Jamison Weighted Sum of B-Valued Random Variables Chun SU Tie Jun TONG Department of Statistics

More information

Part IA. Vectors and Matrices. Year

Part IA. Vectors and Matrices. Year Part IA Vectors and Matrices Year 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2018 Paper 1, Section I 1C Vectors and Matrices For z, w C define the principal value of z w. State de Moivre s

More information

Math Linear Algebra II. 1. Inner Products and Norms

Math Linear Algebra II. 1. Inner Products and Norms Math 342 - Linear Algebra II Notes 1. Inner Products and Norms One knows from a basic introduction to vectors in R n Math 254 at OSU) that the length of a vector x = x 1 x 2... x n ) T R n, denoted x,

More information

Absolute value equations

Absolute value equations Linear Algebra and its Applications 419 (2006) 359 367 www.elsevier.com/locate/laa Absolute value equations O.L. Mangasarian, R.R. Meyer Computer Sciences Department, University of Wisconsin, 1210 West

More information

Convergence of Eigenspaces in Kernel Principal Component Analysis

Convergence of Eigenspaces in Kernel Principal Component Analysis Convergence of Eigenspaces in Kernel Principal Component Analysis Shixin Wang Advanced machine learning April 19, 2016 Shixin Wang Convergence of Eigenspaces April 19, 2016 1 / 18 Outline 1 Motivation

More information