Independence Test For High Dimensional Random Vectors

Size: px
Start display at page:

Download "Independence Test For High Dimensional Random Vectors"

Transcription

1 Independence Test For High Dimensional Random Vectors Guangming Pan, Jiti Gao, Yanrong Yang, Meihui Guo June 6, 2 Abstract This paper proposes a new test statistic for the hypothesis testing of cross sectional independence. This test is based on the empirical spectral distribution of the sample covariance matrix. We study the limiting distributions of this test under both null and alternative hypotheses, as dimensionality and the sample size of the data go to infinity together, with their ratio converging to a finite nonzero limit. Finite sample properties are investigated through simulations on some classical time series. As an empirical study, the proposed test is applied to test the random walk hypothesis for closed stock prices of 62 companies in New York Stock Exchange(NYSE) market from the date to the date , to detect whether the observed closed stock prices of each company follow a random walk. The results illustrate that most stocks in NYSE do not follow random walks, in agreement with the conclusions of some modern literature research works. Keywords: Independence test, empirical spectral distribution, asymptotic distribution, characteristic function, Marcenko-Pastur Law. Introduction Suppose that X ij are real valued random variables. For j p, let x j = (X j,, X jn ) T denote the j th time series and X = ( x T,, ) xt p be a panel of p time series, where n usually denotes the sample size in each of the time series data. In both theory and practice, it is not uncommon to assume that each of the time series (X j, X j2,, X jp ) is statistically independent, but it may be unrealistic to assume that x, x 2,, x p are independent and even uncorrelated. This is because there is no natural ordering for cross sectional indices. There are such cases in various disciplines. In economics and finance, for example, it is not unreasonable to expect that there is significant evidence of cross sectional dependence in output innovations across p countries and regions in the World. In the field of climatology, there is also some evidence to show that climatic variables in different stations may be cross sectionally dependent and the level of cross sectional dependence may be determined by some kind of physical distance. Moreover, one would expect that climatic variables, such as temperature and rainfall variables, in a station in Australia have higher level dependence with the same type of climatic variables in a station in New Zealand than those in the United States. In such situations, it may be necessary to test whether x, x 2,, x p are independent before a statistical model is used to model such data. In the econometrics and statistics literature, several papers have considered testing for cross sectional independence for the residuals involved in some specific regression models. Such studies include (38) for the parametric linear model case, (28) for the parametric nonlinear case, and (23) for the nonparametric nonlinear case.

2 As the main motivation of this paper, we will propose using an empirical spectral distribution function based test statistic for cross sectional independence of x, x 2,, x p. Our aim is to test H : x,, x p are independent; against H : x,, x p are not independent. () In this paper a new test for hypothesis () is proposed allowing p proportional to n. Our approach relies on large random matrix theory. The reminder of the paper is organized as follows. Section 2 introduces the test statistic and some related random matrix results. Section 3 shows the central limit theorem for the proposed test statistic. Section 4 reports simulation results, including the sizes and powers of the proposed test for normal and gamma distributed data respectively, and the powers for testing some classical time series, such as ARCH() process, MA() process and nonlinear MA process. The goal is to check the efficiency of our test in application for time series possessing different kinds of dependence. A specific random matrix called Vandermonde matrix is also investigated, which has important applications in engineering. Section 5 provides an empirical study, the proposed test is applied to the random walk hypothesis for the closed stock prices of 62 companies from New York Stock Exchange. Section 6 contains the conclusions. Proofs and some lemmas are deferred to the Appendix. 2 Methodology We first make the following assumption: Assumption. For each j, X j,, X jn are i.i.d random variables with mean zero and variance one. Denote the sample covariance matrix by A n = n XH X where X = (x,..., x p ) = (y,..., y n ) T and the superscript H stands for the complex conjugate transpose. The empirical spectral distribution (ESD) of the sample covariance matrix A n is defined as F An (x) = p I(λ i x), (2) p i= where λ λ 2... λ p are eigenvalues of A n. It is well-known that if x,, x p are independent and p/n c (, ) then F A p (x) converges with probability one to the Marcenko- Pastur Law (M-P Law) F c (x) ( see Marcenko-Pastur (967) and (6)) whose density has an explicit expression { p c (x) = 2πxc (b x)(x a), a x b; (3), otherwise; and a point mass /c at the origin if c >, where a = ( c) 2 and b = ( + c) 2. When there is some correlation among x,, x p, denote by T p the covariance matrix of the first row, y T, of X. Then, under some structure of each x j, when F T p D (x) H(x), F An converges with probability one to a non random distribution function F c,h whose Stieltjes transform satisfies (see (42)) m = dh(x). (4) x( c czm) z 2

3 Here the Stieltjes transformation m G for any c.d.f G is defined as m G (z) = dg(λ), Imz > (5) λ z and G can be recovered by the inversion formula G{[x, x 2 ]} = π lim ε + x2 x Im(m G (x + iε))dx where x and x 2 are continuity points of G. Moreover, the equation (4) takes a simpler form when F c,h is replaced by F c,h = ( c)i [, ] + cf c,h, (6) which is the limiting ESD of A p = n XXH. Its Stieltjes transform has an inverse m(z) = c z z = z(m) = m + c + cm(z) (7) x dh(x). (8) + xm Our test statistics relies on the following observation: the limit of the ESD of the sample covariance matrix A n is the M-P law by (3) when x,, x p are independent, while, the limit of the ESD is determined from (4) when T p is different from the identity matrix; Moreover, preliminary investigations indicate that when x,, x p are only uncorrelated (without any further assumptions) the limit of the ESD of A n is not the M-P law either (see (4)). So, these motivate us to use the ESD of A n, F An (x), as a test statistic. However, there is no central limit theorem for (F A n (x) F c,h (x)), as argued by (5). Therefore, instead, we consider the characteristic function of F A n (x). The characteristic function of F A n (x) is s n (t) Our test statistic is then proposed as follows M n = e itx df A n (x) = p e itλ i. i= s n (t) s(t) 2 du(t), (9) where s(t) is the characteristic function of F c n (x), obtained from the M-P law F c (x) with c replaced by c n = p/n, and U(t) is a distribution function with its support on a compact interval. To develop the asymptotic distribution of M n the following assumptions are needed. Assumption 2. let T p be a p p random Hermitian nonnegative definite matrix and (T /2 ) 2 = T. let y T j = ut j T/2 with u j being i.i.d random vectors. The empirical spectral distribution F Tp of T p converges weakly to a distribution H on [, ) as n ; the jth entry of u i is denoted as y ij, for each p, y ij, i n, j p are i.i.d., i.d. for all p, i, j, Ey =, E y 2 = and E y 4 < ; Assumption 3. p = p(n) with p/n c (, ). 3

4 3 Main results Write G n (x) = p[f An (x) F cn (x)]. () Then p(s n (t) s(t)) can be decomposed as the sum of random part and non-random part as follows: p(s n (t) s(t)) = e itx dg n (x) = e itx dp(f A n (x) F c n,h n (x)) + e itx dp(f c n,h n (x) F c n (x)), () where F c n,h n is obtained from F c,h with c and H replaced by c n = p/n and H n = F T n. (5) established the remarkable central limit theorem for functional of eigenvalues of A n under the additional assumption that E X 4 = 3 while (37) provided a supplement to this theorem by eliminating the condition to some extent. From Theorem 4 of (37) and () we can directly obtain the following results. Theorem. In addition to Assumption?? and 2, suppose that n e i T /2 n (m(z )T n + I) T /2 n e i e i T /2 n (m(z 2 )T n + I) T /2 n e i h (z, z 2 ) (2) n and i= n n i= e i T /2 n (m(z)t n + I) T /2 n e i e i T /2 n (m(z)t n + I) 2 T /2 n e i h 2 (z), (3) where e i is the n-dimensional row vector with the i-th element being and others. Then, for any positive integer k, ( ) cos(t x)dg n (x),..., cos(t k x)dg n (x), sin(t x)dg n (x),..., sin(t k x)dg n (x) converges in distribution to Gaussian vectors (V + δ (t ),..., V k + δ (t k ), Z + δ 2 (t ),..., Z k + δ 2 (t k )), where δ (t) = lim cos(tx)dp(f c n,h n (x) F c n (x)), δ 2 (t) = lim sin(tx)dp(f c n,h n (x) F c n (x)). n n The means covariances of V j and Z j are specified as follows: If X is real, then for any j =,..., k, EV j = c m 3 (z)τ 2 ( + τm(z)) 3 dh(τ) cos(t j z) 2πi ( c m 2 (z)τ 2 ( + τm(z)) 2 dh(τ)) 2 dz EX4 3 cm 3 (z)h 2 (z) cos(t j z) 2πi c m 2 (z)τ 2 ( + τm(z)) 2 dh(τ) dz. Replacing cos(t j z) in EV j by sin(t j z) yields the expression of EZ j. For any j, h =,..., k, Cov(V j, Z h ) = cos(tj z )sin(t h z 2 ) d 2π 2 (m(z ) m(z 2 )) 2 m(z 2 ) d m(z )dz dz 2 dz 2 dz c(ex4 3) 4π 2 d 2 (4) cos(t j z )sin(t h z 2 ) [m(z )m(z 2 )h (z, z 2 )]dz dz 2. dz dz 2 4

5 The covariances cov(v j, V h ) and cov(z j, Z h ) are similar except replacing cos(t j z) and (or) sin(t j z) by sin(t j z) and (or) sin(t j z) correspondingly. If X is complex with EX 2 =, then EV j = E X 4 2 cm 3 (z)h 2 (z) cos(t j z) 2πi c m 2 (z)τ 2 dh(τ)/( + τm(z)) 2 dz, and the covariance Cov(V j, Z h ) = cos(tj z )sin(t h z 2 ) d 4π 2 (m(z ) m(z 2 )) 2 m(z 2 ) d m(z )dz dz 2 dz 2 dz c(e X 4 2) d 2 4π 2 cos(t j z )sin(t h z 2 ) [m(z )m(z 2 )h (z, z 2 )]dz dz 2. dz dz 2 Remark. When T n = I, (5) calculated the asymptotic expectations and covariances for power functions f(x) = x r, r Z + for real X when EX 4 = 3 or complex X when E X 4 = 2. (37) eliminated this fourth moment condition and provided the expectation and covariance for f(x) = x r. These are helpful in simulations because the asymptotic expectations and covariances for f (x) = sintx and f 2 (x) = costx can be derived by Taylor series of sintx and costx. Remark 2. Assumption?? assumes that all the entries of the time series y are identical distributed. This applies to stationary time series. For a non-stationary time series, we can normalize it so that it has zero mean and unit variance. But the fourth moments of all the entries of the time series are not necessarily identical after normalization. In this case we only need to impose the additional assumptions that sup EYij 4 < and for any η >, i,j η 4 np E( Y jk 4 I Yjk η n). (5) jk A careful checking on the arguments of Theorem. of (5) and Theorem.4 of (37) indicates that Theorem still holds. For the expressions of the expectation and covariance of asymptotic distribution, we substitute the fourth moment E Y 4 with the average of all the fourth moments p j= E Y ji 4 of all the entries, i.e. p for any i. Therefore the proposed test can be applied to detect the independence for non-stationary time series with finite fourth moment. Based on Theorem, we can derive the central limit theorem for the proposed test statistic S n. Theorem 2. With the same assumptions as in Theorem, the proposed test statistic M n converges in distribution to a random variable R, where R = t2 t ( V (t) + δ (t) 2 + Z(t) + δ 2 (t) 2 )du(t), (6) (V (t), Z(t)) is the Gaussian limit of ( costxdg n (x), sintxdg n (x)) as n, p and δ (t), δ 2 (t) are defined in the above theorem. Particularly, under the null hypothesis, the statistic M n converges in distribution to + ( V (t) 2 + Z(t) 2 )du(t). The proof is delegated to Appendix. 5

6 Remark 3. There exists a problem how to choose the distribution U(t) appearing in the proposed test statistic S n. A practical consideration is that the choice of U(t) guarantees that the expression of the proposed statistic S n is easy to calculate in practice. In our simulation, we choose U(t) as the uniform distribution. Another consideration is how the values of t influence the values of the asymptotic random variables appearing in the central limit theorem of S n. From (5), the expectation and covariance of the asymptotic random variable for ( f (x)dp(f A n (x) F c n,h n (x)), f 2 (x)dp(f A n (x) F c n,h n (x))) are and EX f = 2π Cov(X f, X f2 ) = 2π 2 f (x)arg( c t 2 m 2 (x) dh(t))dx, (7) ( + tm(x)) 2 f (x)f 2(y)ln( m + 4 i (x)m i (y) )dxdy. (8) m(x) m i (y) 2 If f (x) = costx and f 2 (x) = sintx, then the limiting expectations and variances include t and t 2 respectively. From this point, large t will cause large covariance while small t that approximates to zero will incur small covariances. In practice, this will incur a larger empirical size than the significant level. But for the proposed test statistic, t will not have large influences on the the value of the test statistic S n, since t only appears in the function sintx or costx. So we should not choose large t or small t. Therefore, in our simulation, we choose the interval t [.2,.7]. 4 Simulations This section reports the simulation results about the finite sample properties of the proposed test. Simulations are used to find the size and power of the proposed independence test and hence evaluate the performance of this test. Comparison with LRT test conducted by (2) is provided for normal distributed time series. Some classical time series such as MA() process, uncorrelated nonlinear MA process and ARCH() process are tested by the proposed test. A specific matrix called Vandermonde matrix is also detected. 4. Empirical size and empirical power First we introduce the calculation of empirical size and empirical power. Let z α be the ( α)% quantile of the asymptotic null distribution of the test statistic M n. With K replications of the data set simulated under the null hypothesis, we calculate the empirical size as ˆα = { of M n H z α }, (9) K where Mn H represents the values of the test statistic M n based on the data simulated under the null hypothesis. In our simulation, we choose K = as the number of repeated simulations. The significance level is α =.5. Since the asymptotic null distribution of the test statistic is not a classical distribution, we need to estimate the quantile z α. Naturally, we do as follows: generate K replications of the asymptotic distributed random variable and then select the (Kα)-th largest value as the estimated ( α) quantile of the asymptotic distributed random variable. To compute the empirical power, we use the estimated critical point ẑ α under the null hypothesis. ẑ α is the Kα-th largest value of the test statistic by K repeated simulations under 6

7 the null hypothesis. Then the empirical power is ˆβ = { of M n A ẑ α }, (2) K where Mn A represents the values of the test statistic M n based on the data simulated under the alternative hypothesis. For the proposed independence test, we generate the number n of p-dimensional independent and identical distributed random vectors {x k } n k=, each with means p and covariance matrix Σ = I p. Under Assumption?? and the null hypothesis, {x k } n k= are generated in two scenarios:. Each y k is p-dimensional normal random vector with mean p and covariance matrix Σ = I p ; T p = I p ; 2. Each y k consists of i.i.d. random variables with standardized Gamma(4,2) distribution, so they have zero mean and unit variance; T p = I p. Under the alternative hypothesis, we consider the case: T /2 p = (.95I p,.5 p ), where p is p-dimensional vector with as entries. In this case, the population covariance matrix of x is Σ =.95I p +.5 p p, which is called compound symmetric covariance matrix. In our simulation, we choose p = 2 : 2 : 2 and n = 2 : 2 : 2. The significant level is.5. In each case, we run simulations. For normal distributed data, the fourth moment E X 4 = 3; for standardized Gamma distributional data, E X 4 = 4.5. This is calculated by a formula as follows: the k-th moment of X which is Gamma(α, λ) is E X k = (α + k ) (α + )α λ k. (2) Table and Table 2 show the empirical sizes and empirical powers of our proposed test and LRT((2)) test for normal distributed random vectors respectively. From Table and Table 2, we can see that LRT test does not work when p is large while the proposed test possesses good performance when p and n are both large. The LRT test is only applicable to the case that p is fixed and n goes to infinity. When the difference between p and n are large, the sizes and the powers of the proposed test become worse. This is the reason that our test is proposed under the case that p and n are in the same order when they approximate to infinity. The proposed test also works well for gamma random vectors while LRT test is not applicable to gamma case, since, in theory, LRT test is given under normal random vectors. Table 3 provides the empirical sizes and empirical powers of the proposed test for the gamma case. The empirical powers are all under the alternative hypothesis that Σ =.5I p +.95 p p. Reports for empirical powers show that the proposed test can check independence for both normal and gamma vectors well. Moreover, the empirical powers converge to as n, p. Since independence test is important in time series, we apply the proposed test to several time series, such as MA() model, uncorrelated nonlinear MA model and ARCH() model. Furthermore, a special kind matrix called Vandermonde matrix is also tested. 7

8 4.2 Some time series and Vandermonde matrix 4.2. MA() model and nonlinear MA model Moving average of order (MA()) model with length p is a p-dimensional random vector which is defined as v t = z t +.5z t, t =,..., p; (22) where z t W N(, ), i.e. white noise process with mean and variance σ 2. The correlation matrix is ρ() ρ() ρ() ρ() , (23) ρ() ρ() ρ() ρ() where ρ() = cov(vt,v t ). var(vt ) Since the entries of v = (v,..., v p ) are correlated and satisfy Assumption??. Hence, from Theorem 2, our test can capture the dependence of MA() process as n and p go to infinity in the same order. The simulation results in Table 4 show that the proposed test can indeed test the dependence of MA() process. Moreover, the proposed test can also distinguish the dependence for non-martingale difference sequences with zero autocorrelations. Consider, for example, the following uncorrelated nonlinear MA process: r t = z t z t 2 (z t 2 + z t + ), t =,..., p; (24) where z t W N(, ), i.e. white noise process with mean and variance σ 2. Its correlation matrix is a diagonal matrix. This model is provided by (32) which tests the martingale difference hypothesis. Our proposed independence test can be applied to this nonlinear MA model, and the powers in Table 5 show that this test performs well for this model. This result also states that the empirical spectral distribution of nonlinear MA model is not M-P law since the proposed test statistic is established on the characteristic function of M-P law ARCH() model This section considers autoregressive conditional heteroscedastic(arch()) model: w t = z t α + α wt 2, t =,..., p; (25) where z t W N(, ), i.e. white noise process with mean and variance. ARCH() model is a martingale difference sequence and it is a strictly stationary stochastic process. There exists no theoritical results stating that the LSD of sample covariance matrix for ARCH() model is M-P Law, but from Figure, we can see that the LSD of the sample covariance matrix for ARCH() model is M-P Law. For ARCH() model, the proposed test can not capture its dependence directly, but we can use our test to test the dependence of {w 2,..., w2 p}. Since this test can tell us that {w 2,..., w2 p} is not an independent sequence, 8

9 naturally it can be concluded that ARCH() is not an independent sequence. This idea is similar to (27) who use the second derivative of empirical spectral density function to capture the pairwise dependence for ARCH() model. Here we take α =.9 and α =.. Table 6 shows the powers of our test for testing dependence of ARCH() model. Figure : Empirical Spectral Distribution of ARCH() and I.I.D Normal Vector n=,p=2 n=45,p= Empirical Spectral Distribution ARCH() i.i.d Normal Empirical Spectral Distribution ARCH() i.i.d Normal x x n=2,p= n=5,p= Empirical Spectral Distribution ARCH() i.i.d Normal Empirical Spectral Distribution ARCH() i.i.d Normal x x Vandermonde matrix Consider n p vandermonde matrix V of the form V = e iω e iω p n..... e i(n )ω e i(n )ω p (26) where ω i, i =,..., p are called phased distributions and are assumed i.i.d on [, 2π). Then the entries of V lie on the unit circle. Obviously, all the elements of each column of V are not independent while the elements from different columns are independent. Denote the sample covariance matrix of V as D = V H V. Vandermonde matrices play an important role in signal processing and wireless applications such as direction of arrival estimation, precoding or sparse sampling theory, etc.. (4) established that as n, p go to, the limiting spectral distribution of D = V H V is not the Marcenko-Pastur Law. We use the proposed test statistic S to test the dependence of every column of the matrix V. Since for any k =,..., n and j =,..., p, E(e ikω j ) 2 = and E e ikω j 4 =, Theorem 9

10 2 can be applied directly without normalize process. The empirical powers in table 7 show that the proposed test works well for Vandermonde matrix. 5 Empirical study: random walk hypothesis The random walk hypothesis is defined by x t = µ + x t + ε t, t =,..., p; (27) where x t is the stock price at time t, µ is a drift parameter and ε t is a random disturbance term. All ε t, t =,..., p are assumed to be independent. To investigate whether the stock prices {x t } p t= follow a random walk is equal to testing independence of the first difference {ε t = x t x t } p t=. We apply our method to test random walk hypothesis for financial data in stock markets: daily closed stock prices of 62 companies from New York Stock Exchange(NYSE) from the date to the date For each company, there are 28 closed stock prices x t, t =,..., 28. Then we can make difference and get the random disturbance series {ε t } with length 28, i.e. t =,..., 28. In order to capture the independence of the time series {ε t, t =,..., 28}, we reshape the time series as a 4 7 matrix W and apply the proposed test to the matrix Q = n W W T. Fig.4 provides the histograms of the p-values of the proposed test for 62 stocks. The significance level is.5. The figure shows that there are 445 stocks whose p-values are below.5 and others are above.5. These results show that 76 stocks are accepted to follow random walk while 445 stocks are rejected. To be more clear, we plot autocorrelation functions(acf) of the random disturbance time series for 8 companies, of which 4 time series are accepted and 4 time series are rejected by our test. The ACF graphs of Figure 2 and Figure 3 illustrate that the proposed test can relatively accurately distinguish dependence from independence, since we accept the time series with relatively smaller sample ACFs and reject those with larger sample ACFs. Our test states that 445 stocks of the total 62 stocks are rejected as random walk hypothesis, which constitute a high proportion of the total investigated stocks. This result is in conformity with many research works on random walk hypothesis for various stock markets, e.g.(5), (2), (36). Some stock markets follow random walk hypothesis well while some not. From our proposed test, we can conclude that closed stock prices in New York Stock Exchange do not follow random walk hypothesis. Moreover, more and more professional traders and researchers find that random walk hypothesis is bogus. Many complex and more accurate models are proposed to describe stock markets or other financial markets. 6 Conclusions This paper provides an independence test based on the characteristic function of the empirical spectral distribution of the sample covariance matrix of a time series. The proposed test can test independence of all the entries of time series with length p, where p goes to infinity. Finite sample simulations show that our test also performs well when applying to time series with relatively small length. We do not assume any specific distribution on the time series and moreover, the proposed test can be applied to non-stationary time series with finite fourth moment condition, while most literatures impose stationary condition on the tested time series, such as (2). The proposed test is consistent for a class of random vectors with the type X = TY, where Y is a

11 Sample Autocorrelation Figure 2: ACF Graphs HONEYWELL INTERNATIONAL INC ARCHER DANIELS MIDLAND CO..8.6 Sample Autocorrelation Lag Lag. COCA COLA CO. EATON CORP Sample Autocorrelation Sample Autocorrelation Lag Lag *Autocorrelation function graphs of random disturbance series of stock prices from four companies in NYSE, whose stock prices are accepted as random walks by the proposed test. The time series are all length of 28, from the date 99.. to the date All the closed stock prices are from WRDS database. random vector with independent entries and T is a nonnegative definite matrix. Many literatures investigate independence by autocorrelation functions, e.g.(), (26),etc.. This method can not capture the dependence of some time series with uncorrelated entries. (27) offered a generalized spectral density approach and his test can apply to many uncorrelated time series. But Hong s method is based on testing mutually independence of any two entries of a time series and can not test for a time series with length of more than 2. Moreover, his test is also restricted on stationary time series. For uncorrelated time series, such as nonlinear MA model investigated in the paper, which is not a martingale difference sequence, simulation results tell that our test can capture the dependence between its entries. For ARCH() model, which is a martingale difference sequence, we propose a modification to the proposed test and the modified version performs well on ARCH() model. This paper is contributed to study independence test for large-dimensional random vectors, which case is missed by classical test methods, such as LRT test come up by (2). LRT test includes the sample covariance matrix as the estimator of the population covariance matrix. When the dimensionality p is larger than the sample size n, sample covariance matrix is singular. Hence larger dimensionality causes lesser efficiency in classical statistics. Nowadays, the theory of random matrix is applied to deal with large dimensional data((4)). This paper contributes to this work.

12 Figure 3: ACF Graphs. EXXON MOBIL CORP. IDACORP INC Sample Autocorrelation Sample Autocorrelation Lag Lag. SCHAWK INC. AMERON INTERNATIONAL CORP DEL Sample Autocorrelation Sample Autocorrelation Lag Lag *Autocorrelation function graphs of random disturbance series of stock prices from four companies in NYSE, whose stock prices are rejected to follow random walks by the proposed test. The time series are all length of 28, from the date 99.. to the date All the closed stock prices are from WRDS database. 7 Appendix Lemma (Theorem 8. of (9)). Let P n, P be probability measures on (C, ϕ). If the finite dimensional distributions of P n converge weakly to those of P, and if {P n } is tight, then P n P. Lemma 2 (Theorem 2.3 of (9)). The sequence {X n } is tight if it satisfies these two conditions (I) The sequence {X n ()} is tight. (II) There exist constants γ, α >, and a nondecreasing, continuous function F on [, ] such that holds for all t, t 2, and n. E{ X n (t 2 ) X n (t ) γ } F (t 2 ) F (t ) α (28) Lemma 3 (Continuous Theorem). Let X n and X be random elements defined on a metric space S. Suppose g : S S has a set of discontinuous points D g such that P (X D g ) =. Then X n d X g(x n ) d g(x). (29) Lemma 4 (Lemma 2.4 of (24)). Let Ω be an open convex set in C. If f : Ω C is an analytic function and a, b are distinct points in Ω, then there exist points u, v on L(a, b) such that Re( f(a) f(b) ) = Re(f f(a) f(b) (u)), Im( ) = Im(f (v)), (3) a b a b where Re(z) and Im(z) are the real and imaginary parts of z respectively; and L(a, b) {a + t(b a) : t (, )}. 2

13 Figure 4: P-value of the proposed test P values for the proposed test number of stocks p value *These are p-values for 62 stocks from NYSE from the date 99.. to the date All the closed stock prices are from WRDS database. Proof of Theorem 2. Let φ n (t) = costxdg n (x) and ψ n (t) = sintxdg n (x). To finish Theorem 2, by Lemma 3 it is enough to prove that (φ n, ψ n ) D (V + δ, Z + δ 2 ), where φ n = {φ n (t)}, ψ n = {ψ n (t)}, V = {V (t)}, Z = {Z(t)} and δ = {δ (t)}, δ 2 = {δ 2 (t)}. To this end, in view of Lemma and Theorem, we need to prove the tightness of (φ n (t)}, ψ n (t)) for t [t, t 2 ]. Repeating the same truncation and centralization steps as those in (5), we may assume that X ij < δ n n, EXij =, E X ij 2 =, E X ij 4 <. (3) Set M n (z) = n[m F An (z) m F cn,hn (z)]. By the Cauchy theorem f(x)dg(x) = f(z)m G (z)dz, (32) 2πi where G is a c.d.f and f is an analytic function on an open set containing the support of G. Therefore, with probability one, for all n large, φ n (t) can be written as φ n (t) = cos(tz)m n (z)dz. (33) 2πi The contour C involved in the above integral is specified as follows. Let C u = {x + iv : x [x l, x r ]}, (34) where v >, x r is any number greater than lim sup T n ( + c) 2, x l is any negative number if n c and otherwise choose x l (, lim sup T n ( c) 2 ). Then the contour C is defined by n C = {x l + iv : v [, v ]} C u {x r + iv : v [, v ]}. (35) We consider ˆM n (z), a truncated version of M n (z), as in (5). Let C l = { {xl + iv : v [n ε n, v ]}, x l >, {x l + iv : v [, v ]}, x l <, (36) 3

14 and C r = {x r + iv : v [n ε n, v ]} (37) where ε n is a decreasing sequence to zero satisfying for some α (, ), ε n n α. (38) Set C n = C l C u C r. The process ˆM n (z) is defined as M n (z), z C n, ˆM n (z) = M n (x r + in ε n ), x = x r, v [, n ε] and if x l >, M n (x l + in ε n ), x = x l, v [, n ε]. (39) Then Write φ () n (t) = 2πi M n (z) = M n (z) ˆM n (z) + ˆM n (z). (4) cos(tz)[m n (z) ˆM n (z)]dz, φ (2) n (t) = 2πi (costz) ˆM n (z)dz. Then φ n (t) can be decomposed as φ n (t) = φ () n (t) + φ (2) n (t). (4) There is a similar decomposition for ψ n (t). For concreteness, we only prove the tightness for the part {φ n (t)}. For ( ˆM n (z) M n (z)), the argument in page 563 of (5) ensures that cos(tz)(m n (z) ˆM n (z))dz a.s., uniformly in t [t, t 2 ]. Obviously, this implies for any ε >, lim lim sup P [ sup (cost z cost 2 z)[m n (z) ˆM n (z)]dz ε] =. (42) δ n t t 2 <δ By Theorem 7.3 of (9), {φ () n (t)} is tight. For ˆM n (z), write ˆM () n (z) = ˆM n (z) + (2) ˆM n (z), where and Based on these we may further write ˆM () n (z) = n[ ˆM F An (z) E ˆM F An (z)] (43) ˆM (2) n (z) = n[ ˆM EF An (z) ˆM F cn,hn (z)]. (44) φ (2) n (t) = φ 2, n (t) + φ 2,2 n (t), (45) where φn 2, (t) = 2πi cos(tz) () ˆM n (z)dz, φ 2,2 n (t) = 2πi cos(tz) (2) ˆM n (z)dz. 4

15 In order to prove the tightness of {φ 2, n (t)}, we use a tightness criteria, i.e. Theorem 2.3 of (8). It follows from Lemma 4 that sup E φ2, n (t ) φ 2, n (t 2 ) 2 t,t 2,n t t 2 ( = sup t,t 2,n 2π E C Re[zsin(t 3 z)] + i Im[zsin(t 4 z)]) ˆM n (z)dz 2, where t 3 and t 4 belong to L[t, t 2 ]. Since the integral contour C is bounded, by Cauchy s inequality, E (Re[zsin(t 3 z)] + i Im[zsin(t 4 z)]) ˆM [ n(z)dz 2 K E C C ] () ˆM n (z) 2 dz, where K is a constant number which may be different from line to line. From the argument in Page 569, (2.5) and Section 3 in (5) we know that on the contour C E ˆM () n (z) 2 K. This implies that {φ 2, n (t)} is tight in t. For the non-random part {φn 2,2 (t)}, we only need to prove that it is bounded and equicontinuous with respect to t. Since (5) has proved that ˆM n(z) 2 is uniformly bounded with respect to z and the integral contour is also bounded, we obtain φ 2,2 n (t) K. (46) By Lemma 4 and the fact that ˆM n(z) 2 is uniformly bounded, {φ 2,2 n (t)} is equicontinuous. Hence the tightness of {φn 2,2 (t)} is also completed. Therefore {φ n (t)} is tight. References [] W.L. Andrew, A.C. Mackinlay, stock market prices do not follow random walks: evidence from a simple specification test, Rev. Financ. Stud., ()(988), [2] T.W. Anderson, An introduction to multivariate statistical analysis, New York: Wiley, 2th edtion(984). [4] Z.D. Bai, H. Saranadasa, Effect of high dimension comparison of significance tests for a high dimensional two sample problem, Statist. Sinica, 6(996), [5] Z.D. Bai, J.W. Silverstein, CLT for linear spectral statistics of large-dimensional sample covariance matrices, Ann. Probab., 32(A)(24), [6] Z.D. Bai, J.W. Silverstein, Spectral analysis of large dimensional random matrices, Sringer, second edition(29). [7] Z.D. Bai, D.D. Jiang, J.F. Yao, S.R. Zheng, Corrections to LRT on large-dimensional covariance matrix by RMT, Ann. Statist., 37(6B)(29), [8] P. Billingsley, Convergence of Probability Measures, Wiley, New York first edition (968). [9] P. Billingsley, Convergence of probability measures, Wiley-Interscience Publication, second edition(999). 5

16 [] G. Box and D. Pierce, Distribution of residual autocorrelations in autoregressive integrated moving average time series models, J. Am. Statist. Ass., 65(97), [] N.H. Chan, L.T. Tran, Nonparametric tests for serial dependence, J. Time. Ser. Anal., 3(992), 2-3. [2] J.Y. Campbell, W.L. Andrew and A.C. MacKinlay, The econometrics of financial markets, Princeton University Press,(997). [3] D.L. Donoho, High-dimensional data analysis: The curses and blessings of dimensionality, American Math. Society Lecture, Math. Challenges of the 2st Century, (2). [4] T.W. EPPS, A test for normality based on the empirical characteristic function, Biometrika, 7(3)(983), [5] E.F. Fama, The Behavior of Stock Market Prices, J. Bus., 38(965), [6] E.F. Fama, Efficient capital markets: a review of theory and empirical work, J. Financ., 25(97), [7] E.F. Fama, Efficient capital markets: II, J. Financ., 46(99), [2] M.D. Godfrey, C.W.J. Granger, O. Morgenstern, The random-walk hypothesis of stock market behavior, Kyklos, 7()(964), -3. [22] C. Granger, T. Terasvirta, Modelling nonlinear economic relationships, New York: Oxford University Press, (993). [23] Chen, J., Gao, J. and Li, D. (29). A new diagnostic test for cross sectional uncorrelatedness in nonparametric panel data models. Working paper available at [24] C.H. Guo, N.J. Higham, A Schur-Newton method for the matrix p-th root and its inverse, SIAM Journal on Matrix Analysis and Applications, 28(3)(26). [25] R.E. Hall, Stochastic implications of the life cycle-permanent income hypothesis: theory and evidence, J. Polit. Econ., 86(978), [26] Y.M. Hong, Testing for pairwise serial independence via the empirical distribution function, J. R. Statist. Soc. B, 6(2)(998), [27] Y.M. Hong, Hypothesis testing in time series via the empirical characteristic function: a generalized spectral density approach, J. Am. Statist. Ass., 94(448)(999), [28] Hsiao, C., Pesaran, M. H. and Pick, A. (27). Diagnostic tests of cross section independence for nonlinear panel data models. IZA discussion paper No. 27 [29] S. John, Some optimal multivariate tests, Biometrika, 58(97), [3] I. Johnstone, on the distribution of the largest principal component, Ann. Statist., 29(2), [3] I. Johnstone, D.M. Titterington, Statistical challenges of high-dimensional data, Phil. Trans. R. Soc. A, 367(29), [32] C.M. Kuan and W.M. Lee, A new test for the martingale difference hypothesis, Studies in Nonlinear Dynamics and Econometrics, 8(4)(24),

17 [33] G.M. Ljung, G. Box, on a measure of lack of fit in time series models, Biometrika, 65(978), [34] V.A. Marcenko, L.A. Pastur, Distribution for some sets of random matrices, Math. USSR-Sb., (967), [35] H. Nagao, On some test criteria for covariance matrix, Ann. Statist., (973), [36] T. Nakamura, M. Small, Tests for the random walk hypothesis for financial data, Phys. A, 377(27), [37] G.M. Pan, W. Zhou, Central limit theorem for signal-to-interference ratio of reduced rank linear receiver, Ann. Appl. Probab., 8(3)(28), [38] Pesaran, M. H. (24). General diagnostic tests for cross section dependence in panels. Cambridge Working Paper in Economics No [39] V. Plerou, P. Gopikrishnan and B. Rosenow, A random matrix approach to cross-correlations in financial data, Phys. Rev. E: Stat., 65(6)(22), 6626(-8). [4] P.M. Robinson, Consistent nonparametric entropy-based testing, Rev. Econ. Stud., 58(99), [4]. Ryan, M. Debbah, Asymptotic behaviour of random vandermonde matrices with entries on the unit circle, IEEE Trans. on Information Theory, 55(7)(29), [42] J.W. Silverstein, Strong convergence of the empirical distribution of eigenvalues of large dimensional random matrices, J. Multivariate Anal. 55 (995), [43] H.J. Skaug, D. Tjøstheim, Nonparametric test of serial independence based on the empirical distribution function, Biometrika, 8(993), [44] H. Tong, Nonlinear time series: A dynamic system approach, Oxford, U.K.: Clarendon Press, (99). Table : Empirical sizes of the proposed test and the LRT test at significant level.5 for standardized normal distributed random vectors. p n The proposed test The LRT test

18 Table 2: Empirical powers of the proposed test and the LRT test at significant level.5 for standardized normal distributed random vectors. p n The proposed test The LRT test *The powers are under the alternative hypothesis that the population covariance matrix is Σ =.5I p +.95 p p. Table 3: Empirical sizes and powers of the proposed test at.5 significance level for standardized gamma distributed random vectors. p n Empirical sizes Empirical powers *The powers are under the alternative hypothesis that the population covariance matrix is Σ =.5I p +.95 p p. 8

19 Table 4: Empirical powers of the proposed test at.5 significance level for MA() model. p n Table 5: Empirical powers of the proposed test at.5 significance level for nonlinear MA model. p n Table 6: Empirical powers of the proposed test at.5 significance level for ARCH() model. p n Table 7: Empirical powers of the proposed test at.5 significance level for Vandermonde Matrix. p n

A note on a Marčenko-Pastur type theorem for time series. Jianfeng. Yao

A note on a Marčenko-Pastur type theorem for time series. Jianfeng. Yao A note on a Marčenko-Pastur type theorem for time series Jianfeng Yao Workshop on High-dimensional statistics The University of Hong Kong, October 2011 Overview 1 High-dimensional data and the sample covariance

More information

On corrections of classical multivariate tests for high-dimensional data. Jian-feng. Yao Université de Rennes 1, IRMAR

On corrections of classical multivariate tests for high-dimensional data. Jian-feng. Yao Université de Rennes 1, IRMAR Introduction a two sample problem Marčenko-Pastur distributions and one-sample problems Random Fisher matrices and two-sample problems Testing cova On corrections of classical multivariate tests for high-dimensional

More information

Large sample covariance matrices and the T 2 statistic

Large sample covariance matrices and the T 2 statistic Large sample covariance matrices and the T 2 statistic EURANDOM, the Netherlands Joint work with W. Zhou Outline 1 2 Basic setting Let {X ij }, i, j =, be i.i.d. r.v. Write n s j = (X 1j,, X pj ) T and

More information

On corrections of classical multivariate tests for high-dimensional data

On corrections of classical multivariate tests for high-dimensional data On corrections of classical multivariate tests for high-dimensional data Jian-feng Yao with Zhidong Bai, Dandan Jiang, Shurong Zheng Overview Introduction High-dimensional data and new challenge in statistics

More information

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach By Shiqing Ling Department of Mathematics Hong Kong University of Science and Technology Let {y t : t = 0, ±1, ±2,

More information

Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices

Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices by Jack W. Silverstein* Department of Mathematics Box 8205 North Carolina State University Raleigh,

More information

Diagnostic Test for GARCH Models Based on Absolute Residual Autocorrelations

Diagnostic Test for GARCH Models Based on Absolute Residual Autocorrelations Diagnostic Test for GARCH Models Based on Absolute Residual Autocorrelations Farhat Iqbal Department of Statistics, University of Balochistan Quetta-Pakistan farhatiqb@gmail.com Abstract In this paper

More information

Cross-sectional Independence Test for a Class of Parametric Panel Data Models

Cross-sectional Independence Test for a Class of Parametric Panel Data Models ISS 440-77X Australia Department of Econometrics and Business Statistics http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/ ross-sectional Indepence est for a lass of Parametric Panel Data Models

More information

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley

More information

Lecture 2: Univariate Time Series

Lecture 2: Univariate Time Series Lecture 2: Univariate Time Series Analysis: Conditional and Unconditional Densities, Stationarity, ARMA Processes Prof. Massimo Guidolin 20192 Financial Econometrics Spring/Winter 2017 Overview Motivation:

More information

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications Yongmiao Hong Department of Economics & Department of Statistical Sciences Cornell University Spring 2019 Time and uncertainty

More information

Estimation of the Global Minimum Variance Portfolio in High Dimensions

Estimation of the Global Minimum Variance Portfolio in High Dimensions Estimation of the Global Minimum Variance Portfolio in High Dimensions Taras Bodnar, Nestor Parolya and Wolfgang Schmid 07.FEBRUARY 2014 1 / 25 Outline Introduction Random Matrix Theory: Preliminary Results

More information

A new method to bound rate of convergence

A new method to bound rate of convergence A new method to bound rate of convergence Arup Bose Indian Statistical Institute, Kolkata, abose@isical.ac.in Sourav Chatterjee University of California, Berkeley Empirical Spectral Distribution Distribution

More information

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis Introduction to Time Series Analysis 1 Contents: I. Basics of Time Series Analysis... 4 I.1 Stationarity... 5 I.2 Autocorrelation Function... 9 I.3 Partial Autocorrelation Function (PACF)... 14 I.4 Transformation

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

Thomas J. Fisher. Research Statement. Preliminary Results

Thomas J. Fisher. Research Statement. Preliminary Results Thomas J. Fisher Research Statement Preliminary Results Many applications of modern statistics involve a large number of measurements and can be considered in a linear algebra framework. In many of these

More information

11. Further Issues in Using OLS with TS Data

11. Further Issues in Using OLS with TS Data 11. Further Issues in Using OLS with TS Data With TS, including lags of the dependent variable often allow us to fit much better the variation in y Exact distribution theory is rarely available in TS applications,

More information

A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors

A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors A New Test in Parametric Linear Models with Nonparametric Autoregressive Errors By Jiti Gao 1 and Maxwell King The University of Western Australia and Monash University Abstract: This paper considers a

More information

Lesson 4: Stationary stochastic processes

Lesson 4: Stationary stochastic processes Dipartimento di Ingegneria e Scienze dell Informazione e Matematica Università dell Aquila, umberto.triacca@univaq.it Stationary stochastic processes Stationarity is a rather intuitive concept, it means

More information

Non white sample covariance matrices.

Non white sample covariance matrices. Non white sample covariance matrices. S. Péché, Université Grenoble 1, joint work with O. Ledoit, Uni. Zurich 17-21/05/2010, Université Marne la Vallée Workshop Probability and Geometry in High Dimensions

More information

Understanding Regressions with Observations Collected at High Frequency over Long Span

Understanding Regressions with Observations Collected at High Frequency over Long Span Understanding Regressions with Observations Collected at High Frequency over Long Span Yoosoon Chang Department of Economics, Indiana University Joon Y. Park Department of Economics, Indiana University

More information

HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH DIMENSIONAL COVARIANCE MATRIX

HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH DIMENSIONAL COVARIANCE MATRIX Submitted to the Annals of Statistics HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH DIMENSIONAL COVARIANCE MATRIX By Shurong Zheng, Zhao Chen, Hengjian Cui and Runze Li Northeast Normal University, Fudan

More information

On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime. Title

On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime. Title itle On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime Authors Wang, Q; Yao, JJ Citation Random Matrices: heory and Applications, 205, v. 4, p. article no.

More information

ECON 4160, Spring term Lecture 12

ECON 4160, Spring term Lecture 12 ECON 4160, Spring term 2013. Lecture 12 Non-stationarity and co-integration 2/2 Ragnar Nymoen Department of Economics 13 Nov 2013 1 / 53 Introduction I So far we have considered: Stationary VAR, with deterministic

More information

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws Symeon Chatzinotas February 11, 2013 Luxembourg Outline 1. Random Matrix Theory 1. Definition 2. Applications 3. Asymptotics 2. Ensembles

More information

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017 Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION By Degui Li, Peter C. B. Phillips, and Jiti Gao September 017 COWLES FOUNDATION DISCUSSION PAPER NO.

More information

Preface to the Second Edition...vii Preface to the First Edition... ix

Preface to the Second Edition...vii Preface to the First Edition... ix Contents Preface to the Second Edition...vii Preface to the First Edition........... ix 1 Introduction.............................................. 1 1.1 Large Dimensional Data Analysis.........................

More information

EXTENDED GLRT DETECTORS OF CORRELATION AND SPHERICITY: THE UNDERSAMPLED REGIME. Xavier Mestre 1, Pascal Vallet 2

EXTENDED GLRT DETECTORS OF CORRELATION AND SPHERICITY: THE UNDERSAMPLED REGIME. Xavier Mestre 1, Pascal Vallet 2 EXTENDED GLRT DETECTORS OF CORRELATION AND SPHERICITY: THE UNDERSAMPLED REGIME Xavier Mestre, Pascal Vallet 2 Centre Tecnològic de Telecomunicacions de Catalunya, Castelldefels, Barcelona (Spain) 2 Institut

More information

Financial Econometrics

Financial Econometrics Financial Econometrics Nonlinear time series analysis Gerald P. Dwyer Trinity College, Dublin January 2016 Outline 1 Nonlinearity Does nonlinearity matter? Nonlinear models Tests for nonlinearity Forecasting

More information

Volatility. Gerald P. Dwyer. February Clemson University

Volatility. Gerald P. Dwyer. February Clemson University Volatility Gerald P. Dwyer Clemson University February 2016 Outline 1 Volatility Characteristics of Time Series Heteroskedasticity Simpler Estimation Strategies Exponentially Weighted Moving Average Use

More information

Large Sample Properties of Estimators in the Classical Linear Regression Model

Large Sample Properties of Estimators in the Classical Linear Regression Model Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in

More information

ECON 4160, Lecture 11 and 12

ECON 4160, Lecture 11 and 12 ECON 4160, 2016. Lecture 11 and 12 Co-integration Ragnar Nymoen Department of Economics 9 November 2017 1 / 43 Introduction I So far we have considered: Stationary VAR ( no unit roots ) Standard inference

More information

INVERSE EIGENVALUE STATISTICS FOR RAYLEIGH AND RICIAN MIMO CHANNELS

INVERSE EIGENVALUE STATISTICS FOR RAYLEIGH AND RICIAN MIMO CHANNELS INVERSE EIGENVALUE STATISTICS FOR RAYLEIGH AND RICIAN MIMO CHANNELS E. Jorswieck, G. Wunder, V. Jungnickel, T. Haustein Abstract Recently reclaimed importance of the empirical distribution function of

More information

Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix

Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix Spectral analysis of the Moore-Penrose inverse of a large dimensional sample covariance matrix Taras Bodnar a, Holger Dette b and Nestor Parolya c a Department of Mathematics, Stockholm University, SE-069

More information

Central Limit Theorems for Classical Likelihood Ratio Tests for High-Dimensional Normal Distributions

Central Limit Theorems for Classical Likelihood Ratio Tests for High-Dimensional Normal Distributions Central Limit Theorems for Classical Likelihood Ratio Tests for High-Dimensional Normal Distributions Tiefeng Jiang 1 and Fan Yang 1, University of Minnesota Abstract For random samples of size n obtained

More information

The autocorrelation and autocovariance functions - helpful tools in the modelling problem

The autocorrelation and autocovariance functions - helpful tools in the modelling problem The autocorrelation and autocovariance functions - helpful tools in the modelling problem J. Nowicka-Zagrajek A. Wy lomańska Institute of Mathematics and Computer Science Wroc law University of Technology,

More information

A test for improved forecasting performance at higher lead times

A test for improved forecasting performance at higher lead times A test for improved forecasting performance at higher lead times John Haywood and Granville Tunnicliffe Wilson September 3 Abstract Tiao and Xu (1993) proposed a test of whether a time series model, estimated

More information

Signal and noise in financial correlation matrices

Signal and noise in financial correlation matrices Physica A 344 (2004) 67 72 www.elsevier.com/locate/physa Signal and noise in financial correlation matrices Zdzis"aw Burda, Jerzy Jurkiewicz 1 M. Smoluchowski Institute of Physics, Jagiellonian University,

More information

Multivariate Time Series

Multivariate Time Series Multivariate Time Series Notation: I do not use boldface (or anything else) to distinguish vectors from scalars. Tsay (and many other writers) do. I denote a multivariate stochastic process in the form

More information

Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations of High-Dimension, Low-Sample-Size Data

Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations of High-Dimension, Low-Sample-Size Data Sri Lankan Journal of Applied Statistics (Special Issue) Modern Statistical Methodologies in the Cutting Edge of Science Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations

More information

Notes on Time Series Modeling

Notes on Time Series Modeling Notes on Time Series Modeling Garey Ramey University of California, San Diego January 17 1 Stationary processes De nition A stochastic process is any set of random variables y t indexed by t T : fy t g

More information

Testing High-Dimensional Covariance Matrices under the Elliptical Distribution and Beyond

Testing High-Dimensional Covariance Matrices under the Elliptical Distribution and Beyond Testing High-Dimensional Covariance Matrices under the Elliptical Distribution and Beyond arxiv:1707.04010v2 [math.st] 17 Apr 2018 Xinxin Yang Department of ISOM, Hong Kong University of Science and Technology

More information

Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction

Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction Random Matrix Theory and its applications to Statistics and Wireless Communications Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction Sergio Verdú Princeton University National

More information

Introduction to Stochastic processes

Introduction to Stochastic processes Università di Pavia Introduction to Stochastic processes Eduardo Rossi Stochastic Process Stochastic Process: A stochastic process is an ordered sequence of random variables defined on a probability space

More information

Empirical Market Microstructure Analysis (EMMA)

Empirical Market Microstructure Analysis (EMMA) Empirical Market Microstructure Analysis (EMMA) Lecture 3: Statistical Building Blocks and Econometric Basics Prof. Dr. Michael Stein michael.stein@vwl.uni-freiburg.de Albert-Ludwigs-University of Freiburg

More information

On a Nonparametric Notion of Residual and its Applications

On a Nonparametric Notion of Residual and its Applications On a Nonparametric Notion of Residual and its Applications Bodhisattva Sen and Gábor Székely arxiv:1409.3886v1 [stat.me] 12 Sep 2014 Columbia University and National Science Foundation September 16, 2014

More information

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 3 Feb 2004

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 3 Feb 2004 arxiv:cond-mat/0312496v2 [cond-mat.stat-mech] 3 Feb 2004 Signal and Noise in Financial Correlation Matrices Zdzis law Burda, Jerzy Jurkiewicz 1 M. Smoluchowski Institute of Physics, Jagiellonian University,

More information

SPRING 2006 PRELIMINARY EXAMINATION SOLUTIONS

SPRING 2006 PRELIMINARY EXAMINATION SOLUTIONS SPRING 006 PRELIMINARY EXAMINATION SOLUTIONS 1A. Let G be the subgroup of the free abelian group Z 4 consisting of all integer vectors (x, y, z, w) such that x + 3y + 5z + 7w = 0. (a) Determine a linearly

More information

Testing for Unit Roots with Cointegrated Data

Testing for Unit Roots with Cointegrated Data Discussion Paper No. 2015-57 August 19, 2015 http://www.economics-ejournal.org/economics/discussionpapers/2015-57 Testing for Unit Roots with Cointegrated Data W. Robert Reed Abstract This paper demonstrates

More information

Applications of random matrix theory to principal component analysis(pca)

Applications of random matrix theory to principal component analysis(pca) Applications of random matrix theory to principal component analysis(pca) Jun Yin IAS, UW-Madison IAS, April-2014 Joint work with A. Knowles and H. T Yau. 1 Basic picture: Let H be a Wigner (symmetric)

More information

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Alea 4, 117 129 (2008) Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes Anton Schick and Wolfgang Wefelmeyer Anton Schick, Department of Mathematical Sciences,

More information

GARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50

GARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50 GARCH Models Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 50 Outline 1 Stylized Facts ARCH model: definition 3 GARCH model 4 EGARCH 5 Asymmetric Models 6

More information

Estimation of large dimensional sparse covariance matrices

Estimation of large dimensional sparse covariance matrices Estimation of large dimensional sparse covariance matrices Department of Statistics UC, Berkeley May 5, 2009 Sample covariance matrix and its eigenvalues Data: n p matrix X n (independent identically distributed)

More information

Extreme inference in stationary time series

Extreme inference in stationary time series Extreme inference in stationary time series Moritz Jirak FOR 1735 February 8, 2013 1 / 30 Outline 1 Outline 2 Motivation The multivariate CLT Measuring discrepancies 3 Some theory and problems The problem

More information

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Econ 423 Lecture Notes: Additional Topics in Time Series 1 Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes

More information

Stochastic Processes

Stochastic Processes Stochastic Processes Stochastic Process Non Formal Definition: Non formal: A stochastic process (random process) is the opposite of a deterministic process such as one defined by a differential equation.

More information

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED by W. Robert Reed Department of Economics and Finance University of Canterbury, New Zealand Email: bob.reed@canterbury.ac.nz

More information

On the smallest eigenvalues of covariance matrices of multivariate spatial processes

On the smallest eigenvalues of covariance matrices of multivariate spatial processes On the smallest eigenvalues of covariance matrices of multivariate spatial processes François Bachoc, Reinhard Furrer Toulouse Mathematics Institute, University Paul Sabatier, France Institute of Mathematics

More information

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University Topic 4 Unit Roots Gerald P. Dwyer Clemson University February 2016 Outline 1 Unit Roots Introduction Trend and Difference Stationary Autocorrelations of Series That Have Deterministic or Stochastic Trends

More information

Weiming Li and Jianfeng Yao

Weiming Li and Jianfeng Yao LOCAL MOMENT ESTIMATION OF POPULATION SPECTRUM 1 A LOCAL MOMENT ESTIMATOR OF THE SPECTRUM OF A LARGE DIMENSIONAL COVARIANCE MATRIX Weiming Li and Jianfeng Yao arxiv:1302.0356v1 [stat.me] 2 Feb 2013 Abstract:

More information

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations Assessing the dependence of high-dimensional time series via sample autocovariances and correlations Johannes Heiny University of Aarhus Joint work with Thomas Mikosch (Copenhagen), Richard Davis (Columbia),

More information

Mi-Hwa Ko. t=1 Z t is true. j=0

Mi-Hwa Ko. t=1 Z t is true. j=0 Commun. Korean Math. Soc. 21 (2006), No. 4, pp. 779 786 FUNCTIONAL CENTRAL LIMIT THEOREMS FOR MULTIVARIATE LINEAR PROCESSES GENERATED BY DEPENDENT RANDOM VECTORS Mi-Hwa Ko Abstract. Let X t be an m-dimensional

More information

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis Sílvia Gonçalves and Benoit Perron Département de sciences économiques,

More information

TESTING SERIAL CORRELATION IN SEMIPARAMETRIC VARYING COEFFICIENT PARTIALLY LINEAR ERRORS-IN-VARIABLES MODEL

TESTING SERIAL CORRELATION IN SEMIPARAMETRIC VARYING COEFFICIENT PARTIALLY LINEAR ERRORS-IN-VARIABLES MODEL Jrl Syst Sci & Complexity (2009) 22: 483 494 TESTIG SERIAL CORRELATIO I SEMIPARAMETRIC VARYIG COEFFICIET PARTIALLY LIEAR ERRORS-I-VARIABLES MODEL Xuemei HU Feng LIU Zhizhong WAG Received: 19 September

More information

Eigenvector stability: Random Matrix Theory and Financial Applications

Eigenvector stability: Random Matrix Theory and Financial Applications Eigenvector stability: Random Matrix Theory and Financial Applications J.P Bouchaud with: R. Allez, M. Potters http://www.cfm.fr Portfolio theory: Basics Portfolio weights w i, Asset returns X t i If expected/predicted

More information

6. The econometrics of Financial Markets: Empirical Analysis of Financial Time Series. MA6622, Ernesto Mordecki, CityU, HK, 2006.

6. The econometrics of Financial Markets: Empirical Analysis of Financial Time Series. MA6622, Ernesto Mordecki, CityU, HK, 2006. 6. The econometrics of Financial Markets: Empirical Analysis of Financial Time Series MA6622, Ernesto Mordecki, CityU, HK, 2006. References for Lecture 5: Quantitative Risk Management. A. McNeil, R. Frey,

More information

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14 Introduction to Econometrics (3 rd Updated Edition) by James H. Stock and Mark W. Watson Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14 (This version July 0, 014) 015 Pearson Education,

More information

9) Time series econometrics

9) Time series econometrics 30C00200 Econometrics 9) Time series econometrics Timo Kuosmanen Professor Management Science http://nomepre.net/index.php/timokuosmanen 1 Macroeconomic data: GDP Inflation rate Examples of time series

More information

Random regular digraphs: singularity and spectrum

Random regular digraphs: singularity and spectrum Random regular digraphs: singularity and spectrum Nick Cook, UCLA Probability Seminar, Stanford University November 2, 2015 Universality Circular law Singularity probability Talk outline 1 Universality

More information

Bootstrapping high dimensional vector: interplay between dependence and dimensionality

Bootstrapping high dimensional vector: interplay between dependence and dimensionality Bootstrapping high dimensional vector: interplay between dependence and dimensionality Xianyang Zhang Joint work with Guang Cheng University of Missouri-Columbia LDHD: Transition Workshop, 2014 Xianyang

More information

Econ 623 Econometrics II Topic 2: Stationary Time Series

Econ 623 Econometrics II Topic 2: Stationary Time Series 1 Introduction Econ 623 Econometrics II Topic 2: Stationary Time Series In the regression model we can model the error term as an autoregression AR(1) process. That is, we can use the past value of the

More information

Research Article Least Squares Estimators for Unit Root Processes with Locally Stationary Disturbance

Research Article Least Squares Estimators for Unit Root Processes with Locally Stationary Disturbance Advances in Decision Sciences Volume, Article ID 893497, 6 pages doi:.55//893497 Research Article Least Squares Estimators for Unit Root Processes with Locally Stationary Disturbance Junichi Hirukawa and

More information

TESTS FOR LARGE DIMENSIONAL COVARIANCE STRUCTURE BASED ON RAO S SCORE TEST

TESTS FOR LARGE DIMENSIONAL COVARIANCE STRUCTURE BASED ON RAO S SCORE TEST Submitted to the Annals of Statistics arxiv:. TESTS FOR LARGE DIMENSIONAL COVARIANCE STRUCTURE BASED ON RAO S SCORE TEST By Dandan Jiang Jilin University arxiv:151.398v1 [stat.me] 11 Oct 15 This paper

More information

arxiv:physics/ v2 [physics.soc-ph] 8 Dec 2006

arxiv:physics/ v2 [physics.soc-ph] 8 Dec 2006 Non-Stationary Correlation Matrices And Noise André C. R. Martins GRIFE Escola de Artes, Ciências e Humanidades USP arxiv:physics/61165v2 [physics.soc-ph] 8 Dec 26 The exact meaning of the noise spectrum

More information

Numerical Solutions to the General Marcenko Pastur Equation

Numerical Solutions to the General Marcenko Pastur Equation Numerical Solutions to the General Marcenko Pastur Equation Miriam Huntley May 2, 23 Motivation: Data Analysis A frequent goal in real world data analysis is estimation of the covariance matrix of some

More information

Questions and Answers on Unit Roots, Cointegration, VARs and VECMs

Questions and Answers on Unit Roots, Cointegration, VARs and VECMs Questions and Answers on Unit Roots, Cointegration, VARs and VECMs L. Magee Winter, 2012 1. Let ɛ t, t = 1,..., T be a series of independent draws from a N[0,1] distribution. Let w t, t = 1,..., T, be

More information

Regression and Statistical Inference

Regression and Statistical Inference Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF

More information

The Generalized Cochrane-Orcutt Transformation Estimation For Spurious and Fractional Spurious Regressions

The Generalized Cochrane-Orcutt Transformation Estimation For Spurious and Fractional Spurious Regressions The Generalized Cochrane-Orcutt Transformation Estimation For Spurious and Fractional Spurious Regressions Shin-Huei Wang and Cheng Hsiao Jan 31, 2010 Abstract This paper proposes a highly consistent estimation,

More information

Supermodular ordering of Poisson arrays

Supermodular ordering of Poisson arrays Supermodular ordering of Poisson arrays Bünyamin Kızıldemir Nicolas Privault Division of Mathematical Sciences School of Physical and Mathematical Sciences Nanyang Technological University 637371 Singapore

More information

Financial Econometrics Return Predictability

Financial Econometrics Return Predictability Financial Econometrics Return Predictability Eric Zivot March 30, 2011 Lecture Outline Market Efficiency The Forms of the Random Walk Hypothesis Testing the Random Walk Hypothesis Reading FMUND, chapter

More information

Applications and fundamental results on random Vandermon

Applications and fundamental results on random Vandermon Applications and fundamental results on random Vandermonde matrices May 2008 Some important concepts from classical probability Random variables are functions (i.e. they commute w.r.t. multiplication)

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

Eigenvectors of some large sample covariance matrix ensembles

Eigenvectors of some large sample covariance matrix ensembles Probab. Theory Relat. Fields DOI 0.007/s00440-00-0298-3 Eigenvectors of some large sample covariance matrix ensembles Olivier Ledoit Sandrine Péché Received: 24 March 2009 / Revised: 0 March 200 Springer-Verlag

More information

arxiv: v1 [math.pr] 13 Aug 2007

arxiv: v1 [math.pr] 13 Aug 2007 The Annals of Probability 2007, Vol. 35, No. 4, 532 572 DOI: 0.24/009790600000079 c Institute of Mathematical Statistics, 2007 arxiv:0708.720v [math.pr] 3 Aug 2007 ON ASYMPTOTICS OF EIGENVECTORS OF LARGE

More information

1 Class Organization. 2 Introduction

1 Class Organization. 2 Introduction Time Series Analysis, Lecture 1, 2018 1 1 Class Organization Course Description Prerequisite Homework and Grading Readings and Lecture Notes Course Website: http://www.nanlifinance.org/teaching.html wechat

More information

LARGE DIMENSIONAL RANDOM MATRIX THEORY FOR SIGNAL DETECTION AND ESTIMATION IN ARRAY PROCESSING

LARGE DIMENSIONAL RANDOM MATRIX THEORY FOR SIGNAL DETECTION AND ESTIMATION IN ARRAY PROCESSING LARGE DIMENSIONAL RANDOM MATRIX THEORY FOR SIGNAL DETECTION AND ESTIMATION IN ARRAY PROCESSING J. W. Silverstein and P. L. Combettes Department of Mathematics, North Carolina State University, Raleigh,

More information

Fractal functional regression for classification of gene expression data by wavelets

Fractal functional regression for classification of gene expression data by wavelets Fractal functional regression for classification of gene expression data by wavelets Margarita María Rincón 1 and María Dolores Ruiz-Medina 2 1 University of Granada Campus Fuente Nueva 18071 Granada,

More information

WLLN for arrays of nonnegative random variables

WLLN for arrays of nonnegative random variables WLLN for arrays of nonnegative random variables Stefan Ankirchner Thomas Kruse Mikhail Urusov November 8, 26 We provide a weak law of large numbers for arrays of nonnegative and pairwise negatively associated

More information

A NOVEL OPTIMAL PROBABILITY DENSITY FUNCTION TRACKING FILTER DESIGN 1

A NOVEL OPTIMAL PROBABILITY DENSITY FUNCTION TRACKING FILTER DESIGN 1 A NOVEL OPTIMAL PROBABILITY DENSITY FUNCTION TRACKING FILTER DESIGN 1 Jinglin Zhou Hong Wang, Donghua Zhou Department of Automation, Tsinghua University, Beijing 100084, P. R. China Control Systems Centre,

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

DEPARTMENT OF ECONOMICS

DEPARTMENT OF ECONOMICS ISSN 0819-64 ISBN 0 7340 616 1 THE UNIVERSITY OF MELBOURNE DEPARTMENT OF ECONOMICS RESEARCH PAPER NUMBER 959 FEBRUARY 006 TESTING FOR RATE-DEPENDENCE AND ASYMMETRY IN INFLATION UNCERTAINTY: EVIDENCE FROM

More information

Multivariate Time Series: VAR(p) Processes and Models

Multivariate Time Series: VAR(p) Processes and Models Multivariate Time Series: VAR(p) Processes and Models A VAR(p) model, for p > 0 is X t = φ 0 + Φ 1 X t 1 + + Φ p X t p + A t, where X t, φ 0, and X t i are k-vectors, Φ 1,..., Φ p are k k matrices, with

More information

Asymptotic inference for a nonstationary double ar(1) model

Asymptotic inference for a nonstationary double ar(1) model Asymptotic inference for a nonstationary double ar() model By SHIQING LING and DONG LI Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong maling@ust.hk malidong@ust.hk

More information

Asymptotic Statistics-III. Changliang Zou

Asymptotic Statistics-III. Changliang Zou Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (

More information

Stochastic process for macro

Stochastic process for macro Stochastic process for macro Tianxiao Zheng SAIF 1. Stochastic process The state of a system {X t } evolves probabilistically in time. The joint probability distribution is given by Pr(X t1, t 1 ; X t2,

More information

Cointegration Lecture I: Introduction

Cointegration Lecture I: Introduction 1 Cointegration Lecture I: Introduction Julia Giese Nuffield College julia.giese@economics.ox.ac.uk Hilary Term 2008 2 Outline Introduction Estimation of unrestricted VAR Non-stationarity Deterministic

More information

Lecture 6: Univariate Volatility Modelling: ARCH and GARCH Models

Lecture 6: Univariate Volatility Modelling: ARCH and GARCH Models Lecture 6: Univariate Volatility Modelling: ARCH and GARCH Models Prof. Massimo Guidolin 019 Financial Econometrics Winter/Spring 018 Overview ARCH models and their limitations Generalized ARCH models

More information

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND Testing For Unit Roots With Cointegrated Data NOTE: This paper is a revision of

More information

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem Prof. Massimo Guidolin 20192 Financial Econometrics Winter/Spring 2018 Overview Stochastic vs. deterministic

More information

Robustní monitorování stability v modelu CAPM

Robustní monitorování stability v modelu CAPM Robustní monitorování stability v modelu CAPM Ondřej Chochola, Marie Hušková, Zuzana Prášková (MFF UK) Josef Steinebach (University of Cologne) ROBUST 2012, Němčičky, 10.-14.9. 2012 Contents Introduction

More information