Empirical Likelihood Tests for High-dimensional Data
|
|
- Wilfred Jack Walton
- 5 years ago
- Views:
Transcription
1 Empirical Likelihood Tests for High-dimensional Data Department of Statistics and Actuarial Science University of Waterloo, Canada ICSA - Canada Chapter 2013 Symposium Toronto, August 2-3, 2013 Based on joint work with Liang Peng and Rongmao Zhang
2 Contents 1 Introduction
3 Introduction In this talk we discuss empirical likelihood ratio tests for high-dimensional data. Let X i = (X i1,..., X ip ), i = 1, 2,, n be iid random vectors with mean µ = (µ 1,, µ p ) and covariance Σ = (σ ij ) 1 i,j p. Here p may depend on n. If p is fixed, then it is a traditional statistical setting. If p, then it is high-dimensional setting.
4 Typical testing questions: Testing µ = µ 0 (one sample mean test). Two sample means testing. Testing Σ = Σ 0 (covariance matrix test) Two sample covariance matrices testing. We focus on testing covariance matrices.
5 Problem setup. Let X i = (X i1,..., X ip ), i = 1, 2,, n be iid random vectors with mean µ = (µ 1,, µ p ) and covariance Σ = (σ ij ) 1 i,j p.
6 Problem setup. Let X i = (X i1,..., X ip ), i = 1, 2,, n be iid random vectors with mean µ = (µ 1,, µ p ) and covariance Σ = (σ ij ) 1 i,j p. Testing covariance matrix H 0 : Σ = Σ 0 against H 1 : Σ Σ 0. (1)
7 Problem setup. Let X i = (X i1,..., X ip ), i = 1, 2,, n be iid random vectors with mean µ = (µ 1,, µ p ) and covariance Σ = (σ ij ) 1 i,j p. Testing covariance matrix H 0 : Σ = Σ 0 against H 1 : Σ Σ 0. (1) Testing bandedness H 0 : σ ij = 0 for all i j τ. (2)
8 Problem setup. Let X i = (X i1,..., X ip ), i = 1, 2,, n be iid random vectors with mean µ = (µ 1,, µ p ) and covariance Σ = (σ ij ) 1 i,j p. Testing covariance matrix H 0 : Σ = Σ 0 against H 1 : Σ Σ 0. (1) Testing bandedness H 0 : σ ij = 0 for all i j τ. (2) Non-parametric. No information about sparsity.
9 Literature. Testing (1) for fixed p: traditional likelihood ratio test; scaled distance measure test (John (1971, 1972) and Nagao (1973)).
10 Literature. Testing (1) for fixed p: traditional likelihood ratio test; scaled distance measure test (John (1971, 1972) and Nagao (1973)). Testing (1) for divergent p: Ledoit and Wolf (2002) for normal X i and Chen, Zhang and Zhong (2010) for a factor model. Specific models are imposed. Restrictions are put on p. p has to go to infinity as n approaches infinity.
11 Testing (2) for divergent p: Cai and Jiang (2011). The test statistic: the coherence converges slowly. Normality are assumed. Restrictions are put on p and τ.
12 Testing (2) for divergent p: Cai and Jiang (2011). The test statistic: the coherence converges slowly. Normality are assumed. Restrictions are put on p and τ. Testing (2) for divergent p: Qiu and Chen (2012). Similar to Chen, Zhang and Zhong (2010), specific models; restrictions.
13 Our goal: build up a test statistic that works for both (1) and (2); loose the condition on p; get rid of specific models.
14 Our goal: build up a test statistic that works for both (1) and (2); loose the condition on p; get rid of specific models. First we assume µ is known. The case when µ is unknown is very similar.
15 Basic observations. Σ = Σ 0 is equivalent to D 2 := Σ Σ 0 2 F = tr((σ Σ 0) 2 ) = 0.
16 Basic observations. Σ = Σ 0 is equivalent to D 2 := Σ Σ 0 2 F = tr((σ Σ 0) 2 ) = 0. We can construct our test based on an estimator of D 2.
17 A natural estimator. For i = 1,..., n, define the p p matrix Y i = (X i µ)(x i µ) T, and estimator e(γ) = tr((y 1 Γ)(Y 2 Γ)). E[Y 1 ] = Σ and E[e(Σ 0 )] = D 2. E[e(Σ 0 )] = 0 is equivalent to Σ = Σ 0.
18 We need independent copies of (Y 1, Y 2 ).
19 We need independent copies of (Y 1, Y 2 ). Splitting the sample. Let N = [n/2]. For i = 1, 2,..., N, we define e i (Σ) = tr((y i Σ)(Y N+i Σ)).
20 We need independent copies of (Y 1, Y 2 ). Splitting the sample. Let N = [n/2]. For i = 1, 2,..., N, we define e i (Σ) = tr((y i Σ)(Y N+i Σ)). Very difficult to estimate the variance of e i.
21 We need independent copies of (Y 1, Y 2 ). Splitting the sample. Let N = [n/2]. For i = 1, 2,..., N, we define e i (Σ) = tr((y i Σ)(Y N+i Σ)). Very difficult to estimate the variance of e i. Empirical likelihood method automatically catches the asymptotic variance.
22 Define the empirical likelihood ratio function with constraint (estimating equation) E[e 1 (Σ)] = 0: N L 0 (Σ) = sup{ (Np i ) : i=1 N N p i = 1, p i e i (Σ) = 0, p i 0}. i=1 i=1
23 Define the empirical likelihood ratio function with constraint (estimating equation) E[e 1 (Σ)] = 0: N L 0 (Σ) = sup{ (Np i ) : i=1 N N p i = 1, p i e i (Σ) = 0, p i 0}. i=1 i=1 Under some regularity conditions and H 0, 2 log L 0 (Σ 0 ) converges weakly to χ 2 1. This seems good but...
24 Define the empirical likelihood ratio function with constraint (estimating equation) E[e 1 (Σ)] = 0: N L 0 (Σ) = sup{ (Np i ) : i=1 N N p i = 1, p i e i (Σ) = 0, p i 0}. i=1 i=1 Under some regularity conditions and H 0, 2 log L 0 (Σ 0 ) converges weakly to χ 2 1. This seems good but... Shortfall of the test based on L 0. When Σ Σ 0 2 F is small, E[e 1(Σ 0 )] will be very close to 0 (in a rate of Σ Σ 0 2 F ). In this case the test based on L 0 has a poor power. Later we will see this in power analysis.
25 Secondary constraint. We add one more constraint which is easier to break under H 1.
26 Secondary constraint. We add one more constraint which is easier to break under H 1. The choice of the second linear constraint can be arbitrary.
27 Secondary constraint. We add one more constraint which is easier to break under H 1. The choice of the second linear constraint can be arbitrary. With no prior information, the following statistics v i (Σ): v i (Σ) = 1 T p (Y i Y N+i 2Σ)1 p can be used in a constraint E[v 1 (Σ 0 )] = 0.
28 We define the empirical likelihood function with two constraints as N L 1 (Σ 0 ) = sup{ (Np i ) : i=1 N N p i = 1, i=1 i=1 ( ) ei (Σ 0 ) p i = v i (Σ 0 ) ( ) 0, p i 0}. 0
29 We define the empirical likelihood function with two constraints as N L 1 (Σ 0 ) = sup{ (Np i ) : i=1 N N p i = 1, i=1 i=1 ( ) ei (Σ 0 ) p i = v i (Σ 0 ) ( ) 0, p i 0}. 0 Theorem 1 Suppose e 1 (Σ 0 ) and v 1 (Σ 0 ) satisfy a regularity condition (P). Then under H 0, 2 log L 1 (Σ 0 ) converges in distribution to χ 2 2 as n.
30 CLT condition (similar to the Lyapunov condition). (P) We say a statistic T with size n satisfies condition (P) if ET 2 > 0 and for some δ > 0, E T 2+δ δ+min(δ,2) (ET 2 = o(n 4 ). ) 1+δ/2 For example, if E(T 4 )/(E(T 2 )) 2 = o(n), then T satisfies (P) with δ = 2.
31 Remark 1 In order to prove Theorem 1, it is sufficient to prove condition (P) e guarantees that the sample t i = ( i (Σ 0 ), v i (Σ 0 ) Var(ei (Σ 0 )) Var(vi (Σ 0 )) )T satisfies CLT and ti 2 satisfies LLN, with a controlled maximum.
32 Remark 1 In order to prove Theorem 1, it is sufficient to prove condition (P) e guarantees that the sample t i = ( i (Σ 0 ), v i (Σ 0 ) Var(ei (Σ 0 )) Var(vi (Σ 0 )) )T satisfies CLT and t 2 i satisfies LLN, with a controlled maximum. Remark 2 When µ is unknown, just replace µ in Y i by the sample means and the theorem still holds with one extra moment condition.
33 Remark 3 In the factor model considered by Chen, Zhang and Zhong (2010), e 1 (Σ 0 ) and v 1 (Σ 0 ) satisfy (P). With this model, our test allows p to diverge arbitrarily fast or stay finite.
34 The problem is testing Here we consider µ is known. H 0 : σ ij = 0 for all i j τ. (3)
35 We are interested the information of the black squares in Σ and we will ignore the stars. Basic observation. H 0 is equivalent to the black squares of Σ being 0.
36 Define the τ-off-diagonal upper triangular matrix M (τ) of a matrix M: { (M (τ) Mij j i + τ; ) ij = 0 j < i + τ. H 0 is equivalent to tr((σ (τ) ) T Σ (τ) ) = 0.
37 For i = 1,..., N, Let e i = tr((y (τ) i ) T Y (τ) N+i ), v i = 1 T p (Y (τ) i + Y (τ) N+i )1 p. We define the empirical likelihood function as N L 2 = sup{ (Np i ) : i=1 N N p i = 1, i=1 Here we omit the Σ 0 in e i and v i. i=1 ( ) e p i i v i = ( ) 0, p i 0}. 0
38 Theorem 2 Suppose that e 1 and v 1 satisfy (P). Then under H 0 in (3), 2 log L 2 converges in distribution to χ 2 2 as n.
39 Theorem 2 Suppose that e 1 and v 1 satisfy (P). Then under H 0 in (3), 2 log L 2 converges in distribution to χ 2 2 as n. The method can be used to test some other structures. One interesting application is to test the assumption or estimation of the sparsity.
40 Remark 4 (1) In the Gaussian model used by Cai ( and Jiang (2011), e 1 and v 1 satisfy (P) provided that τ = o 1 i,j p σ ij. ( 1 i,j p σ ij ) 1/2 )
41 Remark 4 (1) In the Gaussian model used by Cai ( and Jiang (2011), e 1 and v 1 satisfy (P) provided that τ = o 1 i,j p σ ij. ( 1 i,j p σ ij ) 1/2 ) (2) With moment or boundedness conditions, the Gaussian assumption can be removed.
42 Denote π 11 = E(e 1 (Σ) 2 ), π 22 = E(v 1 (Σ) 2 ), ζ 1 = tr((σ Σ 0 ) 2 )/ π 11 and ζ 2 = 21 T p (Σ Σ 0 )1 p / π 22. For most models we discuss, ( tr((σ Σ0 ) 2 ) ) ζ 1 = O tr(σ 2 ) and ( ) 1 T p (Σ Σ 0 )1 p ζ 2 = O 1 T p Σ 2. 1 p
43 Theorem 3 In addition to the conditions of Theorem 1, if H 1 : Σ Σ 0 holds, then P{ 2 log L 1 (Σ 0 ) > ξ 1 α } = P{χ 2 2,ν > ξ 1 α } + o(1) for any level α as n, where χ 2 2,ν is a noncentral chi-square distribution with two degrees of freedom and noncentrality parameter ν = N(ζ1 2 + ζ2 2 ),
44 Remark 5 From the above power analysis, the new test rejects the null hypothesis with probability tending to one when max( nζ 1, n ζ 2 ).
45 Remark 5 From the above power analysis, the new test rejects the null hypothesis with probability tending to one when max( nζ 1, n ζ 2 ). Note that the test given in Chen, Zhang and Zhong (2010) requires nζ 1.
46 Remark 5 From the above power analysis, the new test rejects the null hypothesis with probability tending to one when max( nζ 1, n ζ 2 ). Note that the test given in Chen, Zhang and Zhong (2010) requires nζ 1. Our test may have a better power or a worse power in different settings.
47 Remark 5 From the above power analysis, the new test rejects the null hypothesis with probability tending to one when max( nζ 1, n ζ 2 ). Note that the test given in Chen, Zhang and Zhong (2010) requires nζ 1. Our test may have a better power or a worse power in different settings. Same results for the test in Theorem 2.
48 It is clear that our tests is powerful when Σ Σ 0 is dense, and not powerful when Σ Σ 0 is sparse.
49 It is clear that our tests is powerful when Σ Σ 0 is dense, and not powerful when Σ Σ 0 is sparse. With only the first constraint E(e 1 (Σ 0 )) = 0, the test power (requires nζ 1 ) is worse than the test in Chen, Zhang and Zhong (2010).
50 It is clear that our tests is powerful when Σ Σ 0 is dense, and not powerful when Σ Σ 0 is sparse. With only the first constraint E(e 1 (Σ 0 )) = 0, the test power (requires nζ 1 ) is worse than the test in Chen, Zhang and Zhong (2010). The test in Cai and Jiang (2011) is good when Σ Σ 0 is sparse but is powerless when Σ Σ 0 is dense, since their test power depends on Σ Σ 0 max.
51 Testing covariance matrices We assume a dense model and a local alternative. We compare with Chen, Zhang and Zhong (2010) for testing covariance matrices and Cai and Jiang (2011) for testing bandedness. The ELT has biased size for small n, so we also give a bootstrap calibrated version of ELT.
52 Table : Testing covariance matrices (n, p) EL(0.05) BCEL(0.05) CZZ(0.05) EL(0.05) BCEL(0.05) CZZ(0.05) δ = 0 δ = 0 δ = 0 δ = 1 δ = 1 δ = 1 (50, 25) (50, 50) (50, 100) (50, 200) (50, 400) (50, 800) (200, 25) (200, 50) (200, 100) (200, 200) (200, 400) (200, 800)
53 Table : Testing bandedness (n, p) EL(0.05) BCEL(0.05) CJ(0.05) EL(0.05) BCEL(0.05) CJ(0.05) δ = 0 δ = 0 δ = 0 δ = 1 δ = 1 δ = 1 (50, 25) (50, 50) (50, 100) (50, 200) (50, 400) (50, 800) (200, 25) (200, 50) (200, 100) (200, 200) (200, 400) (200, 800)
54 Conclusion. The new technique works for non-parametric models; allows arbitrary p; requires only moment conditions; avoids to estimate asymptotic variance; the limiting distribution is always χ 2 2 ; can be applied to testing sample mean, two-sample means, and two-sample covariance matrices under the HD framework.
55 Shortfalls: the number of observations is reduced by half; the power is good in the dense setting but not in the sparse setting. The optimal choice of the second constraint is unknown.
56 References I Cai, T. and Jiang, T. (2011). Limiting laws of coherence of random matrices with applications to testing covariance structure and construction of compressed sensing matrices. Ann. Statist. 39, Chen, S., Zhang, L. and Zhong, P. (2010). Tests for High-Dimensional Covariance Matrices. J. Amer. Statist. Assoc. 105, Ledoit, O., and Wolf, M. (2002). Some Hypothesis Tests for the Covariance Matrix When the Dimension Is Large Compare to the Sample Size. Ann. Statist., 30,
57 References II Nagao, H. (1973). On some test criteria for covariance matrix. Ann. Statist., 1, Qin, J. and Lawless, J.F. (1994). Empirical likelihood and general estimating equations. Ann. Statist. 22, Qiu, Y. and Chen, S. (2012). Test for bandedness of high-dimensional covariance matrices and bandwidth estimation. Ann. Statist. 40,
58 Thank you!
Jackknife Empirical Likelihood Test for Equality of Two High Dimensional Means
Jackknife Empirical Likelihood est for Equality of wo High Dimensional Means Ruodu Wang, Liang Peng and Yongcheng Qi 2 Abstract It has been a long history to test the equality of two multivariate means.
More informationCentral Limit Theorems for Classical Likelihood Ratio Tests for High-Dimensional Normal Distributions
Central Limit Theorems for Classical Likelihood Ratio Tests for High-Dimensional Normal Distributions Tiefeng Jiang 1 and Fan Yang 1, University of Minnesota Abstract For random samples of size n obtained
More informationBootstrapping high dimensional vector: interplay between dependence and dimensionality
Bootstrapping high dimensional vector: interplay between dependence and dimensionality Xianyang Zhang Joint work with Guang Cheng University of Missouri-Columbia LDHD: Transition Workshop, 2014 Xianyang
More informationHYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH DIMENSIONAL COVARIANCE MATRIX
Submitted to the Annals of Statistics HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH DIMENSIONAL COVARIANCE MATRIX By Shurong Zheng, Zhao Chen, Hengjian Cui and Runze Li Northeast Normal University, Fudan
More informationSummary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)
Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ
More informationStatistical Inference On the High-dimensional Gaussian Covarianc
Statistical Inference On the High-dimensional Gaussian Covariance Matrix Department of Mathematical Sciences, Clemson University June 6, 2011 Outline Introduction Problem Setup Statistical Inference High-Dimensional
More informationAsymptotic Statistics-VI. Changliang Zou
Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous
More informationLecture 3. Inference about multivariate normal distribution
Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates
More informationNonconcave Penalized Likelihood with A Diverging Number of Parameters
Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized
More informationOn corrections of classical multivariate tests for high-dimensional data
On corrections of classical multivariate tests for high-dimensional data Jian-feng Yao with Zhidong Bai, Dandan Jiang, Shurong Zheng Overview Introduction High-dimensional data and new challenge in statistics
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationHypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA
Hypothesis Testing Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA An Example Mardia et al. (979, p. ) reprint data from Frets (9) giving the length and breadth (in
More informationSHOTA KATAYAMA AND YUTAKA KANO. Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka , Japan
A New Test on High-Dimensional Mean Vector Without Any Assumption on Population Covariance Matrix SHOTA KATAYAMA AND YUTAKA KANO Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama,
More informationA General Framework for High-Dimensional Inference and Multiple Testing
A General Framework for High-Dimensional Inference and Multiple Testing Yang Ning Department of Statistical Science Joint work with Han Liu 1 Overview Goal: Control false scientific discoveries in high-dimensional
More informationCENTRAL LIMIT THEOREMS FOR CLASSICAL LIKELIHOOD RATIO TESTS FOR HIGH-DIMENSIONAL NORMAL DISTRIBUTIONS
The Annals of Statistics 013, Vol. 41, No. 4, 09 074 DOI: 10.114/13-AOS1134 Institute of Mathematical Statistics, 013 CENTRAL LIMIT THEOREMS FOR CLASSICAL LIKELIHOOD RATIO TESTS FOR HIGH-DIMENSIONAL NORMAL
More informationICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood
ICSA Applied Statistics Symposium 1 Balanced adjusted empirical likelihood Art B. Owen Stanford University Sarah Emerson Oregon State University ICSA Applied Statistics Symposium 2 Empirical likelihood
More informationHigh Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data
High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department
More informationExtreme inference in stationary time series
Extreme inference in stationary time series Moritz Jirak FOR 1735 February 8, 2013 1 / 30 Outline 1 Outline 2 Motivation The multivariate CLT Measuring discrepancies 3 Some theory and problems The problem
More informationKRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS
Bull. Korean Math. Soc. 5 (24), No. 3, pp. 7 76 http://dx.doi.org/34/bkms.24.5.3.7 KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Yicheng Hong and Sungchul Lee Abstract. The limiting
More informationHigh-dimensional covariance estimation based on Gaussian graphical models
High-dimensional covariance estimation based on Gaussian graphical models Shuheng Zhou Department of Statistics, The University of Michigan, Ann Arbor IMA workshop on High Dimensional Phenomena Sept. 26,
More informationSTAT 461/561- Assignments, Year 2015
STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and
More informationhigh-dimensional inference robust to the lack of model sparsity
high-dimensional inference robust to the lack of model sparsity Jelena Bradic (joint with a PhD student Yinchu Zhu) www.jelenabradic.net Assistant Professor Department of Mathematics University of California,
More informationTesting Some Covariance Structures under a Growth Curve Model in High Dimension
Department of Mathematics Testing Some Covariance Structures under a Growth Curve Model in High Dimension Muni S. Srivastava and Martin Singull LiTH-MAT-R--2015/03--SE Department of Mathematics Linköping
More informationInferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data
Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone
More informationSTT 843 Key to Homework 1 Spring 2018
STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ
More informationHigh-dimensional asymptotic expansions for the distributions of canonical correlations
Journal of Multivariate Analysis 100 2009) 231 242 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva High-dimensional asymptotic
More informationBivariate Paired Numerical Data
Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationSTA442/2101: Assignment 5
STA442/2101: Assignment 5 Craig Burkett Quiz on: Oct 23 rd, 2015 The questions are practice for the quiz next week, and are not to be handed in. I would like you to bring in all of the code you used to
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan
More informationMonte Carlo Integration I [RC] Chapter 3
Aula 3. Monte Carlo Integration I 0 Monte Carlo Integration I [RC] Chapter 3 Anatoli Iambartsev IME-USP Aula 3. Monte Carlo Integration I 1 There is no exact definition of the Monte Carlo methods. In the
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More informationTESTS FOR LARGE DIMENSIONAL COVARIANCE STRUCTURE BASED ON RAO S SCORE TEST
Submitted to the Annals of Statistics arxiv:. TESTS FOR LARGE DIMENSIONAL COVARIANCE STRUCTURE BASED ON RAO S SCORE TEST By Dandan Jiang Jilin University arxiv:151.398v1 [stat.me] 11 Oct 15 This paper
More informationNonparametric Inference via Bootstrapping the Debiased Estimator
Nonparametric Inference via Bootstrapping the Debiased Estimator Yen-Chi Chen Department of Statistics, University of Washington ICSA-Canada Chapter Symposium 2017 1 / 21 Problem Setup Let X 1,, X n be
More informationEcon 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE
Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE Eric Zivot Winter 013 1 Wald, LR and LM statistics based on generalized method of moments estimation Let 1 be an iid sample
More informationδ -method and M-estimation
Econ 2110, fall 2016, Part IVb Asymptotic Theory: δ -method and M-estimation Maximilian Kasy Department of Economics, Harvard University 1 / 40 Example Suppose we estimate the average effect of class size
More informationNon-Stationary Time Series and Unit Root Testing
Econometrics II Non-Stationary Time Series and Unit Root Testing Morten Nyboe Tabor Course Outline: Non-Stationary Time Series and Unit Root Testing 1 Stationarity and Deviation from Stationarity Trend-Stationarity
More informationEstimation for two-phase designs: semiparametric models and Z theorems
Estimation for two-phase designs:semiparametric models and Z theorems p. 1/27 Estimation for two-phase designs: semiparametric models and Z theorems Jon A. Wellner University of Washington Estimation for
More informationDepartment of Economics, Vanderbilt University While it is known that pseudo-out-of-sample methods are not optimal for
Comment Atsushi Inoue Department of Economics, Vanderbilt University (atsushi.inoue@vanderbilt.edu) While it is known that pseudo-out-of-sample methods are not optimal for comparing models, they are nevertheless
More informationExploring Granger Causality for Time series via Wald Test on Estimated Models with Guaranteed Stability
Exploring Granger Causality for Time series via Wald Test on Estimated Models with Guaranteed Stability Nuntanut Raksasri Jitkomut Songsiri Department of Electrical Engineering, Faculty of Engineering,
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the
More information1 Glivenko-Cantelli type theorems
STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then
More informationMultivariate-sign-based high-dimensional tests for sphericity
Biometrika (2013, xx, x, pp. 1 8 C 2012 Biometrika Trust Printed in Great Britain Multivariate-sign-based high-dimensional tests for sphericity BY CHANGLIANG ZOU, LIUHUA PENG, LONG FENG AND ZHAOJUN WANG
More informationAsymptotic Normality under Two-Phase Sampling Designs
Asymptotic Normality under Two-Phase Sampling Designs Jiahua Chen and J. N. K. Rao University of Waterloo and University of Carleton Abstract Large sample properties of statistical inferences in the context
More informationSTA205 Probability: Week 8 R. Wolpert
INFINITE COIN-TOSS AND THE LAWS OF LARGE NUMBERS The traditional interpretation of the probability of an event E is its asymptotic frequency: the limit as n of the fraction of n repeated, similar, and
More informationTesting Homogeneity Of A Large Data Set By Bootstrapping
Testing Homogeneity Of A Large Data Set By Bootstrapping 1 Morimune, K and 2 Hoshino, Y 1 Graduate School of Economics, Kyoto University Yoshida Honcho Sakyo Kyoto 606-8501, Japan. E-Mail: morimune@econ.kyoto-u.ac.jp
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationChapter 2: Fundamentals of Statistics Lecture 15: Models and statistics
Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Data from one or a series of random experiments are collected. Planning experiments and collecting data (not discussed here). Analysis:
More informationHypothesis Testing for Var-Cov Components
Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output
More informationarxiv: v1 [stat.me] 28 May 2018
High-Dimensional Statistical Inferences with Over-identification: Confidence Set Estimation and Specification Test Jinyuan Chang, Cheng Yong Tang, Tong Tong Wu arxiv:1805.10742v1 [stat.me] 28 May 2018
More informationEstimation of the Bivariate and Marginal Distributions with Censored Data
Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new
More informationOn testing the equality of mean vectors in high dimension
ACTA ET COMMENTATIONES UNIVERSITATIS TARTUENSIS DE MATHEMATICA Volume 17, Number 1, June 2013 Available online at www.math.ut.ee/acta/ On testing the equality of mean vectors in high dimension Muni S.
More information11. Further Issues in Using OLS with TS Data
11. Further Issues in Using OLS with TS Data With TS, including lags of the dependent variable often allow us to fit much better the variation in y Exact distribution theory is rarely available in TS applications,
More informationMore powerful tests for sparse high-dimensional covariances matrices
Accepted Manuscript More powerful tests for sparse high-dimensional covariances matrices Liu-hua Peng, Song Xi Chen, Wen Zhou PII: S0047-259X(16)30004-5 DOI: http://dx.doi.org/10.1016/j.jmva.2016.03.008
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationEmpirical Likelihood Inference for Two-Sample Problems
Empirical Likelihood Inference for Two-Sample Problems by Ying Yan A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Master of Mathematics in Statistics
More informationStat 710: Mathematical Statistics Lecture 31
Stat 710: Mathematical Statistics Lecture 31 Jun Shao Department of Statistics University of Wisconsin Madison, WI 53706, USA Jun Shao (UW-Madison) Stat 710, Lecture 31 April 13, 2009 1 / 13 Lecture 31:
More informationON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES
ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES Hisashi Tanizaki Graduate School of Economics, Kobe University, Kobe 657-8501, Japan e-mail: tanizaki@kobe-u.ac.jp Abstract:
More information1 Mixed effect models and longitudinal data analysis
1 Mixed effect models and longitudinal data analysis Mixed effects models provide a flexible approach to any situation where data have a grouping structure which introduces some kind of correlation between
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More informationGlobal Testing and Large-Scale Multiple Testing for High-Dimensional Covariance Structures
Global Testing and Large-Scale Multiple Testing for High-Dimensional Covariance Structures T. Tony Cai 1 1 Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, USA, 19104;
More informationEmpirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm
Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier
More informationAppendix of the paper: Are interest rate options important for the assessment of interest rate risk?
Appendix of the paper: Are interest rate options important for the assessment of interest rate risk? Caio Almeida,a, José Vicente b a Graduate School of Economics, Getulio Vargas Foundation b Research
More informationTesting Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information
Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Budi Pratikno 1 and Shahjahan Khan 2 1 Department of Mathematics and Natural Science Jenderal Soedirman
More informationCOMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS
Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department
More informationQuantitative trait evolution with mutations of large effect
Quantitative trait evolution with mutations of large effect May 1, 2014 Quantitative traits Traits that vary continuously in populations - Mass - Height - Bristle number (approx) Adaption - Low oxygen
More informationStatistica Sinica Preprint No: SS
Statistica Sinica Preprint No: SS-2017-0275 Title Testing homogeneity of high-dimensional covariance matrices Manuscript ID SS-2017-0275 URL http://www.stat.sinica.edu.tw/statistica/ DOI 10.5705/ss.202017.0275
More informationTesting for a unit root in an ar(1) model using three and four moment approximations: symmetric distributions
Hong Kong Baptist University HKBU Institutional Repository Department of Economics Journal Articles Department of Economics 1998 Testing for a unit root in an ar(1) model using three and four moment approximations:
More informationA Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data
A Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data Yujun Wu, Marc G. Genton, 1 and Leonard A. Stefanski 2 Department of Biostatistics, School of Public Health, University of Medicine
More informationHigh-dimensional two-sample tests under strongly spiked eigenvalue models
1 High-dimensional two-sample tests under strongly spiked eigenvalue models Makoto Aoshima and Kazuyoshi Yata University of Tsukuba Abstract: We consider a new two-sample test for high-dimensional data
More informationand Comparison with NPMLE
NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/
More informationA Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints
Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note
More informationTESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST
Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department
More informationUnit roots in vector time series. Scalar autoregression True model: y t 1 y t1 2 y t2 p y tp t Estimated model: y t c y t1 1 y t1 2 y t2
Unit roots in vector time series A. Vector autoregressions with unit roots Scalar autoregression True model: y t y t y t p y tp t Estimated model: y t c y t y t y t p y tp t Results: T j j is asymptotically
More informationHypothesis Testing One Sample Tests
STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal
More informationSTRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013
STRUCTURAL EQUATION MODELING Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 Introduction: Path analysis Path Analysis is used to estimate a system of equations in which all of the
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationThomas J. Fisher. Research Statement. Preliminary Results
Thomas J. Fisher Research Statement Preliminary Results Many applications of modern statistics involve a large number of measurements and can be considered in a linear algebra framework. In many of these
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationThe purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.
Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That
More informationPattern correlation matrices and their properties
Linear Algebra and its Applications 327 (2001) 105 114 www.elsevier.com/locate/laa Pattern correlation matrices and their properties Andrew L. Rukhin Department of Mathematics and Statistics, University
More informationMath 423/533: The Main Theoretical Topics
Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)
More informationComposite Hypotheses and Generalized Likelihood Ratio Tests
Composite Hypotheses and Generalized Likelihood Ratio Tests Rebecca Willett, 06 In many real world problems, it is difficult to precisely specify probability distributions. Our models for data may involve
More informationStatistical Inference of Covariate-Adjusted Randomized Experiments
1 Statistical Inference of Covariate-Adjusted Randomized Experiments Feifang Hu Department of Statistics George Washington University Joint research with Wei Ma, Yichen Qin and Yang Li Email: feifang@gwu.edu
More informationQuasi-likelihood Scan Statistics for Detection of
for Quasi-likelihood for Division of Biostatistics and Bioinformatics, National Health Research Institutes & Department of Mathematics, National Chung Cheng University 17 December 2011 1 / 25 Outline for
More informationUnderstanding Regressions with Observations Collected at High Frequency over Long Span
Understanding Regressions with Observations Collected at High Frequency over Long Span Yoosoon Chang Department of Economics, Indiana University Joon Y. Park Department of Economics, Indiana University
More informationPenalized Empirical Likelihood and Growing Dimensional General Estimating Equations
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 Biometrika (0000), 00, 0, pp. 1 15 C 0000 Biometrika Trust Printed
More informationLikelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science
1 Likelihood Ratio Tests that Certain Variance Components Are Zero Ciprian M. Crainiceanu Department of Statistical Science www.people.cornell.edu/pages/cmc59 Work done jointly with David Ruppert, School
More informationAdjusted Empirical Likelihood for Long-memory Time Series Models
Adjusted Empirical Likelihood for Long-memory Time Series Models arxiv:1604.06170v1 [stat.me] 21 Apr 2016 Ramadha D. Piyadi Gamage, Wei Ning and Arjun K. Gupta Department of Mathematics and Statistics
More informationHypothesis testing:power, test statistic CMS:
Hypothesis testing:power, test statistic The more sensitive the test, the better it can discriminate between the null and the alternative hypothesis, quantitatively, maximal power In order to achieve this
More informationPsychology 454: Latent Variable Modeling How do you know if a model works?
Psychology 454: Latent Variable Modeling How do you know if a model works? William Revelle Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 18 Outline 1 Goodness
More informationTopics in statistical inference for massive data and high-dimensional data
Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2017 Topics in statistical inference for massive data and high-dimensional data Liuhua Peng Iowa State University
More informationMODIFIED LIKELIHOOD RATIO TEST FOR HOMOGENEITY IN A TWO-SAMPLE PROBLEM
Statistica Sinica 19 (009), 1603-1619 MODIFIED LIKELIHOOD RATIO TEST FOR HOMOGENEITY IN A TWO-SAMPLE PROBLEM Yuejiao Fu, Jiahua Chen and John D. Kalbfleisch York University, University of British Columbia
More informationAnalysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems
Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA
More informationSUPPLEMENTARY SIMULATIONS & FIGURES
Supplementary Material: Supplementary Material for Mixed Effects Models for Resampled Network Statistics Improve Statistical Power to Find Differences in Multi-Subject Functional Connectivity Manjari Narayan,
More informationSecond Order Cone Programming, Missing or Uncertain Data, and Sparse SVMs
Second Order Cone Programming, Missing or Uncertain Data, and Sparse SVMs Ammon Washburn University of Arizona September 25, 2015 1 / 28 Introduction We will begin with basic Support Vector Machines (SVMs)
More informationEXTENDED GLRT DETECTORS OF CORRELATION AND SPHERICITY: THE UNDERSAMPLED REGIME. Xavier Mestre 1, Pascal Vallet 2
EXTENDED GLRT DETECTORS OF CORRELATION AND SPHERICITY: THE UNDERSAMPLED REGIME Xavier Mestre, Pascal Vallet 2 Centre Tecnològic de Telecomunicacions de Catalunya, Castelldefels, Barcelona (Spain) 2 Institut
More informationBTRY 4090: Spring 2009 Theory of Statistics
BTRY 4090: Spring 2009 Theory of Statistics Guozhang Wang September 25, 2010 1 Review of Probability We begin with a real example of using probability to solve computationally intensive (or infeasible)
More informationNotes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770
Notes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770 Jonathan B. Hill Dept. of Economics University of North Carolina - Chapel Hill November
More informationThe Multinomial Model
The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient
More informationStat 502 Design and Analysis of Experiments General Linear Model
1 Stat 502 Design and Analysis of Experiments General Linear Model Fritz Scholz Department of Statistics, University of Washington December 6, 2013 2 General Linear Hypothesis We assume the data vector
More information