Testing Hypotheses Of Covariance Structure In Multivariate Data

Similar documents
Optimal estimation for doubly multivariate data in blocked compound symmetric covariance structure

Hypothesis testing in multilevel models with block circular covariance structures

Testing Some Covariance Structures under a Growth Curve Model in High Dimension

On Projection of a Positive Definite Matrix on a Cone of Nonnegative Definite Toeplitz Matrices

Extremal graphs for the sum of the two largest signless Laplacian eigenvalues

Estimation of Variance Components in Linear Mixed Models with Commutative Orthogonal Block Structure

Linear Models for Multivariate Repeated Measures Data

Block Circular Symmetry in Multilevel Models

Monday, September 25, 2017

Short proofs of theorems of Mirsky and Horn on diagonals and eigenvalues of matrices

Canonical Correlation Analysis of Longitudinal Data

The symmetric minimal rank solution of the matrix equation AX=B and the optimal approximation

Statistical Inference On the High-dimensional Gaussian Covarianc

Pairs of matrices, one of which commutes with their commutator

Department of Statistics

Orthogonal decompositions in growth curve models

INTEGER POWERS OF ANTI-BIDIAGONAL HANKEL MATRICES

A Dimension Reduction Technique for Estimation in Linear Mixed Models

Generalized left and right Weyl spectra of upper triangular operator matrices

A note on simultaneous block diagonally stable matrices

Lower bounds for the Estrada index

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

A method for computing quadratic Brunovsky forms

An angle metric through the notion of Grassmann representative

A note on a conjecture for the distance Laplacian matrix

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

On EP elements, normal elements and partial isometries in rings with involution

z = β βσβ Statistical Analysis of MV Data Example : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) test statistic for H 0β is

Note on deleting a vertex and weak interlacing of the Laplacian spectrum

Recognition of hidden positive row diagonally dominant matrices

A Box-Type Approximation for General Two-Sample Repeated Measures - Technical Report -

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is

Psychometrika, 24,

On nonnegative realization of partitioned spectra

STT 843 Key to Homework 1 Spring 2018

On the maximum positive semi-definite nullity and the cycle matroid of graphs

Randomized Complete Block Designs

Applied Multivariate Analysis

Analysis of variance, multivariate (MANOVA)

On (T,f )-connections of matrices and generalized inverses of linear operators

More calculations on determinant evaluations

On the M-matrix inverse problem for singular and symmetric Jacobi matrices

Matrix functions that preserve the strong Perron- Frobenius property

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation

Structured eigenvalue/eigenvector backward errors of matrix pencils arising in optimal control

On the reduction of matrix polynomials to Hessenberg form

Ratio of Linear Function of Parameters and Testing Hypothesis of the Combination Two Split Plot Designs

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

Exercises Chapter 4 Statistical Hypothesis Testing

An Introduction to Multivariate Statistical Analysis

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007

One-Way Repeated Measures Contrasts

Generalization of Gracia's Results

Potentially nilpotent tridiagonal sign patterns of order 4

A note on estimates for the spectral radius of a nonnegative matrix

A Simple Proof of Fiedler's Conjecture Concerning Orthogonal Matrices

Proof of Atiyah's conjecture for two special types of configurations

An improved characterisation of the interior of the completely positive cone

Dihedral groups of automorphisms of compact Riemann surfaces of genus two

Vector Cross Product Differential and Difference Equations in R^3 and in R^7

Non-trivial solutions to certain matrix equations

A factorization of the inverse of the shifted companion matrix

Partial isometries and EP elements in rings with involution

Multivariate statistical methods and data mining in particle physics

Statistics and econometrics

Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics

Singular Value and Norm Inequalities Associated with 2 x 2 Positive Semidefinite Block Matrices

STAT 536: Genetic Statistics

On Selecting Tests for Equality of Two Normal Mean Vectors

A CLASS OF THE BEST LINEAR UNBIASED ESTIMATORS IN THE GENERAL LINEAR MODEL

The spectrum of the edge corona of two graphs

[y i α βx i ] 2 (2) Q = i=1

Chapter 9: Hypothesis Testing Sections

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30

Next is material on matrix rank. Please see the handout

A new upper bound for the eigenvalues of the continuous algebraic Riccati equation

Summary of Chapters 7-9

Products of commuting nilpotent operators

SOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Interior points of the completely positive cone

Large Sample Properties of Estimators in the Classical Linear Regression Model

A norm inequality for pairs of commuting positive semidefinite matrices

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

A Likelihood Ratio Test

On cardinality of Pareto spectra

Fundamental Probability and Statistics

Jackknife Euclidean Likelihood-Based Inference for Spearman s Rho

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Operator norms of words formed from positivedefinite

Group inverse for the block matrix with two identical subblocks over skew fields

Some Results on the Multivariate Truncated Normal Distribution

Estimation, Detection, and Identification CMU 18752

T 2 Type Test Statistic and Simultaneous Confidence Intervals for Sub-mean Vectors in k-sample Problem

Multivariate Distributions

Testing Block-Diagonal Covariance Structure for High-Dimensional Data

A Statistical Analysis of Fukunaga Koontz Transform

Infinite products and paracontracting matrices

Transcription:

Electronic Journal of Linear Algebra Volume 33 Volume 33: Special Issue for the International Conference on Matrix Analysis and its Applications, MAT TRIAD 2017 Article 6 2018 Testing Hypotheses Of Covariance Structure In Multivariate Data Miguel Fonseca NOVA University of Lisbon, fmig@fct.unl.pt Arkadiusz Koziol University of Zielona Gora, a.koziol@wmie.uz.zgora.pl Roman Zmyslony University of Zielona Gora, r.zmyslony@wmie.uz.zgora.pl Follow this and additional works at: Part of the Multivariate Analysis Commons, Statistical Methodology Commons, and the Statistical Models Commons Recommended Citation Fonseca, Miguel; Koziol, Arkadiusz; and Zmyslony, Roman. (2018), "Testing Hypotheses Of Covariance Structure In Multivariate Data", Electronic Journal of Linear Algebra, Volume 33, pp. 53-62. DOI: https://doi.org/10.13001/1081-3810, 1537-9582.3743 This Article is brought to you for free and open access by Wyoming Scholars Repository. It has been accepted for inclusion in Electronic Journal of Linear Algebra by an authorized editor of Wyoming Scholars Repository. For more information, please contact scholcom@uwyo.edu.

TESTING HYPOTHESES OF COVARIANCE STRUCTURE IN MULTIVARIATE DATA MIGUEL FONSECA, ARKADIUSZ KOZIO l, AND ROMAN ZMYŚLONY Abstract. In this paper there is given a new approach for testing hypotheses on the structure of covariance matrices in double multivariate data. It is proved that ratio of positive and negative parts of best unbiased estimators (BUE) provide an F-test for independence of blocks variables in double multivariate models. Key words. Quadratic subspace, Testing hypotheses, Structure of covariance matrices, Positive and negative part of estimator, Block compound symmetric covariance structure, Double multivariate data. AMS subject classifications. 62J10, 62F03 62F05, 62H15. 1. Introduction. Blocked compound symmetric (BCS) covariance structure for doubly multivariate observations (m dimensional observation vector repeatedly measured over u locations or time points), which is a multivariate generalization of compound symmetry covariance structure for multivariate observations, was introduced by [8, 9] while classifying genetically different groups, and then [7] studied BCS covariance structure while developing general linear model with exchangeable and jointly normally distributed error vectors. We consider the following multivariate data: ( (1.1) y num 1 = vec Y um n where µ and Γ are unknown, µ is um 1 vector and ) N ( ) (1 n I um )µ, I n Γ um, Γ = I u Γ 0 + (J u I u ) Γ 1. Here I u is an identity matrix of size u and J u = 1 u 1 u is the matrix with all elements equal to one, of size u. Γ 0 and Γ 1 are m m unknown parameters. Equivalently the structure of covariance can be written as the sum of strong orthogonal matrices i.e. product of matrices is equal zero: (1.2) Γ = ( I u 1 ) u J u (Γ 0 Γ 1 ) + 1 u J u (Γ 0 + (u 1)Γ 1 ). The paper is divided into five sections: the present introduction, sections dedicated to best unbiased and maximum likelihood estimation for the considered model, and to the proposed hypothesis test, followed by the simulation study and by final remarks. Received by the editors on February 24, 2018. Accepted for publication on July 30, 2018. Handling Editor: Natalia Bebiano. Corresponding Author: Miguel Fonseca. Center of Mathematics and Applications, Faculty of Sciences and Technology, NOVA University of Lisbon, Monte da Caparica, 2829-516 Caparica, Portugal (fmig@fct.unl.pt). Faculty of Mathematics, Computer Science and Econometrics, University of Zielona Góra, Szafrana 4a, 65-516 Zielona Góra, Poland (a.koziol@wmie.uz.zgora.pl, r.zmyslony@wmie.uz.zgora.pl).

Miguel Fonseca, Arkadiusz Kozio l, Roman Zmyślony 54 2. Best unbiased estimators and Maximum Likelihood Estimators of µ, Γ 0 and Γ 1. In [2] the best unbiased estimators (BUE) for Γ 0 and Γ 1 are given. It follows that BUE for 0 = Γ 0 Γ 1 and 1 = Γ 0 + (u 1)Γ 1 are given by: 0 = Γ 0 Γ 1, 1 = Γ 0 + (u 1) Γ 1. On the other hand the maximum likelihood (ML) estimators for 0 and 1 are (see [3]) given by (n 1)(u 1) 0 = n(u 1) 1 = n 1 n 1. Moreover, from [4] 0 and 1 are independent and 0, n(u 1) 0 = (n 1)(u 1) 0 W m ( 0, (n 1)(u 1) ), n 1 = (n 1) 1 W m ( 1, n 1 ), where W m ( Σ, n ) stands for Wishart distribution with m m scale matrix Σ and degrees of freedom parameter n. 3. Testing hypotheses about structure of covariance. The maximum likelihood for the distribution of (1.1) is given by l 1 = 0 n(u 1) 2 1 n 2 e num 2. Considering H 0 : Γ 1 = 0 vs H 1 : Γ 1 0 H 0 : 1 = 0 vs H 1 : 1 0 : From definitions of 0 and 1, it is clear that under H 0 we must necessarily have 0 = 1. Moreover, the maximum likelihood under H 0 is given by Finally, the likelihood ratio test is given by l 0 = (nu) 1( n(u 1) 0 + n ) nu 2 1 e num 2. n(u 1) n 0 2 1 2 (3.3) Λ = (nu) 1( n(u 1) 0 + n ) nu. 2 1 Now we prove the following: Lemma 3.1. If W 1 W m (Σ, n 1 ) and W 2 W m (Σ, n 2 ) (independent), then, for every fixed vector x 0 R m : T = n 2x W 1 x n 1 x W 2 x F n 1,n 2.

55 Testing Hypotheses Of Covariance Structure In Multivariate Data Proof. According to the theorem in [1], if W W m (Σ, n) then for every x 0 R m : Now if we calculate ratio of x W 1x n 1 x W 1 x n 1 x W 2 x n 2 and x W 2x n 2 = x W x x Σx χ2 n. we get: x W 1 x n 1 x Σx x W 2 x n 2 x Σx χ 2 n 1 n 1 χ 2 n 2 n 2 F n1,n 2. Under the framework of [6] and [10], the positive part of Γ 1 is given by Γ 1+ = 1 u and negative part is given by Γ 1 = 0 u. To build a test for the nullity of Γ 1 we can use Lemma 3.1. To do this, first we find the test statistic. Theorem 3.2. Noting that the estimator of Γ 1 is given by The test statistic Γ 1 = Γ 1+ Γ 1 = 1 0. u (3.4) T = v Γ1+ v v Γ 1 v, with v 0, is distributed as an F random variable with (n 1) and (n 1)(u 1) degrees of freedom under H 0 : Γ 1+ = Γ 1. Proof. Unbiased estimator of Γ 1 can be expressed as Under null hypothesis Γ 1 = 0: Γ 1 = (n 1)(u 1) 1 (n 1)(u 1) 0 (n 1)u(u 1) = 1 0. u Now from Lemma 3.1 it follows that: (n 1)u Γ 1+ = (n 1) 1 W m (Γ 0, n 1), (n 1)u(u 1) Γ 1 = (n 1)(u 1) 0 W m (Γ 0, (n 1)(u 1)). T = (n 1)uv Γ1+ v (n 1) (n 1)u(u 1)v Γ1 v (u 1)(n 1) = v Γ1+ v v Γ 1 v. Remark 3.3. Under the null hypothesis and using v = 1, the expectation of the numerator and the denominator of the statistic in (3.4) are equal, while under the alternative hypothesis and assuming that all elements of Γ 1 are non-negative, the expectation of the numerator is greater than the expectation of the denominator. If the elements in Γ 1 are non-positive then we reject the null hypothesis when the value of the test statistic is small enough. The next section will show simulation results using v = 1, which is to take the sum of the elements of Γ 1 as null.

Miguel Fonseca, Arkadiusz Kozio l, Roman Zmyślony 56 4. Simulation Study. To test the hypothesis described earlier, a simulation study using the R statistics software was performed and we compare the power function of F test,one and two-sided, with the likelihood ratio test (LRT). The chosen parameters were m = 3, u = 2 and 0.01221 0.02172 0.00901 0.01038 0.01931 0.00824 Γ 0 = 0.02172 0.07492 0.01682, Γ 1 = 0.01931 0.06678 0.01529, 0.00901 0.01682 0.01108 0.00824 0.01529 0.00807 where λ is a multiplier such that Γ = I u Γ 0 +(J u I u ) λγ 1 is positive definite. In this case, 0.280274 < λ < 1.12096. For this instance, we take v = 1, which means that the sum of the elements of Γ 1 is equal to 0 under the alternative hypothesis. Generating 10000 observation vectors using sample sizes of n = 5, 10, 15, 20 and taking the significance level as 5%, the power of the LRT and one and two sided versions of the F test were compared. The results are shown in figures 1, 2, 3 and 4. Consider another case, a very special one, presented in [5], where Γ 0 and Γ 1 are scalars, with m = 1. Let Γ 0 = 2 and Γ 1 = 1. Additionally, it is assumed that u = 2, and parameter n will be one of the values from the set {3, 5, 10, 25}. Matrix Γ has the following form: Γ = [ 2 1 1 2 From conditions of positive definiteness of matrix Γ it is easy to show that values of multiplier λ should be from interval [ 2, 2]. The results are shown in figures 5, 6, which are included after the bibliography. 5. Final Remarks. The test presented in this paper provides a valid alternative to the likelihood ratio test for hypothesis tests on covariance components in multivariate models with BCS covariance structure, given that some previous knowledge on the covariance components is provided in the form of a linear restriction. Given this condition, the presented test is more powerful than the LRT, as it is shown by the simulation study. It also offers computational advantages over the computationally expensive LRT distribution. ]. Acknowledgment. Miguel Fonseca acknowledges that his work was partially supported by the Fundação para a Ciência e a Tecnologia (Portuguese Foundation for Science and Technology) through the project UID/MAT/00297/2013 (Centro de Matemática e Aplicações). REFERENCES [1] Tõnu Kollo and Dietrich von Rosen. Advanced Multivariate Statistics with Matrices. Springer, 2010. [2] Anuradha Roy, Roman Zmyślony, Miguel Fonseca and Ricardo Leiva. Optimal estimation for doubly multivariate data in blocked compound symmetric covariance structure. Journal of Multivariate Analysis, 144:81 90, 2016. [3] Anuradha Roy and Miguel Fonseca. Linear Models with Doubly Exchangeable Distributed Errors. Communications in Statistics - Theory and Methods, 41:2545 2569, 2012. [4] Anuradha Roy, Ricardo Leiva, Ivan Žežula and Daniel Klein. Testing the equality of mean vectors for paired doubly multivariate observations in blocked compound symmetric covariance matrix setup. Journal of Multivariate Analysis, 137:50 60, 2015. [5] Edward Gasiorek, Andrzej Michalski and Roman Zmyślony. Tests of independence of normal random variables with known and unknown variance ratio. Discussiones Mathematicae Probability and Statistics, 20:233 247, 2000.

57 Testing Hypotheses Of Covariance Structure In Multivariate Data [6] Andrzej Michalski and Roman Zmyślony. Testing Hypotheses for Variance Components in Mixed Linear Models. Statistics: A Journal of Theoretical and Applied Statistics, 27:297 310, 1996. [7] Steven F. Arnold. Linear-Models with Exchangeably Distributed Errors. Journal of the American Statistical Association, 74:194 199, 1979. [8] Calyampudi Radhakrishna Rao. Familial correlations or the multivariate generalizations of the intraclass correlation. Current Science, 14:66 67, 1945 [9] Calyampudi Radhakrishna Rao. Discriminant Functions for Genetic Differentiation and Selection (Part IV of Statistical Inference Applied to Classificatory Problems). Sankhyā: the Indian Journal of Statistics, 12:229 246, 1953. [10] Dário Ferreira, Sandra S. Ferreira and Célia Nunes. Linear-Models with Exchangeably Distributed Errors. 10th International Conference on Numerical Analysis and Applied Mathematics. AIP Conf. Proc., 1479:1682 1685, 2012.

Miguel Fonseca, Arkadiusz Kozio l, Roman Zmyślony 58 Figure 1. n = 5

59 Testing Hypotheses Of Covariance Structure In Multivariate Data Figure 2. n = 10

Miguel Fonseca, Arkadiusz Kozio l, Roman Zmyślony 60 Figure 3. n = 15

61 Testing Hypotheses Of Covariance Structure In Multivariate Data Figure 4. n = 20

Miguel Fonseca, Arkadiusz Kozio l, Roman Zmyślony 62 Figure 5. n = 3 and n = 5 Figure 6. n = 10 and n = 25