EXPLICIT EXPRESSIONS OF PROJECTORS ON CANONICAL VARIABLES AND DISTANCES BETWEEN CENTROIDS OF GROUPS. Haruo Yanai*

Similar documents
Statistics for Social and Behavioral Sciences

On V-orthogonal projectors associated with a semi-norm

An Introduction to Multivariate Statistical Analysis

Consistency of test based method for selection of variables in high dimensional two group discriminant analysis

Consistent Bivariate Distribution

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis

ABOUT PRINCIPAL COMPONENTS UNDER SINGULARITY

Multivariate Statistical Analysis

A User's Guide To Principal Components

A note on the equality of the BLUPs for new observations under two linear models

MULTIVARIATE ANALYSIS OF VARIANCE UNDER MULTIPLICITY José A. Díaz-García. Comunicación Técnica No I-07-13/ (PE/CIMAT)

REGRESSION, DISCRIMINANT ANALYSIS, AND CANONICAL CORRELATION ANALYSIS WITH HOMALS 1. MORALS

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

Mean squared error matrix comparison of least aquares and Stein-rule estimators for regression coefficients under non-normal disturbances

High-dimensional asymptotic expansions for the distributions of canonical correlations

Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition

Analysis of Microtubules using. for Growth Curve modeling.

Consistency of Distance-based Criterion for Selection of Variables in High-dimensional Two-Group Discriminant Analysis

ON VARIANCE COVARIANCE COMPONENTS ESTIMATION IN LINEAR MODELS WITH AR(1) DISTURBANCES. 1. Introduction

Mohsen Pourahmadi. 1. A sampling theorem for multivariate stationary processes. J. of Multivariate Analysis, Vol. 13, No. 1 (1983),

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

The Third International Workshop in Sequential Methodologies

Principal component analysis

Discriminant Analysis with High Dimensional. von Mises-Fisher distribution and

On Extreme Bernoulli and Dependent Families of Bivariate Distributions

Researchers often record several characters in their research experiments where each character has a special significance to the experimenter.

Two remarks on normality preserving Borel automorphisms of R n

New insights into best linear unbiased estimation and the optimality of least-squares

Regularized Discriminant Analysis and Reduced-Rank LDA

On the necessary and sufficient condition for the extended Wedderburn-Guttman theorem

Sociedad de Estadística e Investigación Operativa

General structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models

Statistical Inference On the High-dimensional Gaussian Covarianc

314 IEEE TRANSACTIONS ON RELIABILITY, VOL. 55, NO. 2, JUNE 2006

Chapter 2 Canonical Correlation Analysis

Huang method for solving fully fuzzy linear system of equations

CURRICULUM VITAE. Yoshihiko KONNO. August 31, Personal. Japan Women s University, Mejirodai, Bunkyo-ku, Tokyo , Japan

Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA

~ g-inverses are indeed an integral part of linear algebra and should be treated as such even at an elementary level.

460 HOLGER DETTE AND WILLIAM J STUDDEN order to examine how a given design behaves in the model g` with respect to the D-optimality criterion one uses

ORTHOGONALLY INVARIANT ESTIMATION OF THE SKEW-SYMMETRIC NORMAL MEAN MATRIX SATOSHI KURIKI

Testing Some Covariance Structures under a Growth Curve Model in High Dimension

arxiv: v4 [math.oc] 26 May 2009

Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC

Asymptotic Distribution of the Largest Eigenvalue via Geometric Representations of High-Dimension, Low-Sample-Size Data

Useful Numerical Statistics of Some Response Surface Methodology Designs

Measures and Jacobians of Singular Random Matrices. José A. Díaz-Garcia. Comunicación de CIMAT No. I-07-12/ (PE/CIMAT)

Journal of Multivariate Analysis. Sphericity test in a GMANOVA MANOVA model with normal error

1 A factor can be considered to be an underlying latent variable: (a) on which people differ. (b) that is explained by unknown variables

MULTIVARIATE ANALYSIS OF VARIANCE

MOORE-PENROSE INVERSE IN AN INDEFINITE INNER PRODUCT SPACE

Diagonalizing Matrices

Illustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives

More on generalized inverses of partitioned matrices with Banachiewicz-Schur forms

Estimation of S-wave scattering coefficient in the mantle from envelope characteristics before and after the ScS arrival

Statistics for Applications. Chapter 9: Principal Component Analysis (PCA) 1/16

The DMP Inverse for Rectangular Matrices

Analysis of variance, multivariate (MANOVA)

An Alternative Proof of the Greville Formula

STATISTICS 407 METHODS OF MULTIVARIATE ANALYSIS TOPICS

Chapter 3 ANALYSIS OF RESPONSE PROFILES

GRAPHICAL REPRESENTATION OF CORRELATION ANALYSIS OF ORDERED DATA BY LINKED VECTOR PATTERN

A ROBUST METHOD OF ESTIMATING COVARIANCE MATRIX IN MULTIVARIATE DATA ANALYSIS G.M. OYEYEMI *, R.A. IPINYOMI **

Number of cases (objects) Number of variables Number of dimensions. n-vector with categorical observations

Department of Econometrics and Business Statistics

Small sample size in high dimensional space - minimum distance based classification.

A Generalized Uncertainty Principle and Sparse Representation in Pairs of Bases

Inverse of a Square Matrix. For an N N square matrix A, the inverse of A, 1

On Hadamard and Kronecker Products Over Matrix of Matrices

X -1 -balance of some partially balanced experimental designs with particular emphasis on block and row-column designs

THE GRADIENT PROJECTION ALGORITHM FOR ORTHOGONAL ROTATION. 2 The gradient projection algorithm

CHAPTER 2 -idempotent matrices

Canonical Correlation & Principle Components Analysis

Matrix Differential Calculus with Applications in Statistics and Econometrics

Robustness of the Quadratic Discriminant Function to correlated and uncorrelated normal training samples

Wolfgang Karl Härdle Leopold Simar. Applied Multivariate. Statistical Analysis. Fourth Edition. ö Springer

Moore-Penrose s inverse and solutions of linear systems

Principal Component Analysis (PCA) Theory, Practice, and Examples

On the construction of asymmetric orthogonal arrays

INFORMATION THEORY AND STATISTICS

Response Surface Methodology

HISTORICAL PERSPECTIVE OF SURVEY SAMPLING

ECON 3150/4150, Spring term Lecture 7

Principal Component Analysis

EIGENVALUES AND SINGULAR VALUE DECOMPOSITION

FACTOR ANALYSIS AS MATRIX DECOMPOSITION 1. INTRODUCTION

Multivariate Gaussian Analysis

on climate and its links with Arctic sea ice cover

Dragan S. Djordjević. 1. Introduction and preliminaries

Diagonal and Monomial Solutions of the Matrix Equation AXB = C

Causal Inference Using Nonnormality Yutaka Kano and Shohei Shimizu 1

The Precise Effect of Multicollinearity on Classification Prediction

Regularized Common Factor Analysis

Financial Econometrics

Response Surface Methodology:

Large Sample Properties of Estimators in the Classical Linear Regression Model

Supervised Learning: Linear Methods (1/2) Applied Multivariate Statistics Spring 2012

Finite Population Sampling and Inference

Properties of Matrices and Operations on Matrices

Transcription:

J. Japan Statist. Soc. Vol. 11 No. 1 1981 43-53 EXPLICIT EXPRESSIONS OF PROJECTORS ON CANONICAL VARIABLES AND DISTANCES BETWEEN CENTROIDS OF GROUPS Haruo Yanai* Generalized expressions of canonical correlation analysis and partial canonical correlation analysis are introduced, in which the sum of the squared canonical and partial canonical correlation coefficients for each are given as the traces of the product of two orthogonal projectors and that of two oblique projectors respectively. Following the result, some explicit expressions of projectors are obtained in connection with the product of two projectors defined in terms of canonical variables arising in canonical correlation analysis, partial canonical correlation analysis and part canonical correlation analysis, and the results are applied for showing that the Euclidian distance based on canonical variables turns out to be Mahalanobis' generalized distance with a slight modification. 1. Introduction Canonical correlation analysis established by Hotelling (1935) as a method of analysing the relationship between two sets of variables is a generalization of regression analysis. It also subsumes multiple regression analysis, discriminant analysis, canonical analysis based on discrete variables and canonical factor analysis. Recently, Khatri (1976) showed that the theory can be developed, by using generalized inverse defined by Rao (1962), without assuming the non-singularity of the covariance matrix associated with the joint distribution of the explanatory and criterion variables. The purpose of the present article is to further extend the theory to the case where the two sets of variables are not necessarily linearly independent and a third set of variables is available in addition to the two sets. For this purpose, we give some explicit expressions of projectors defined in terms of canonical variables obtained from canonical correlation analysis, partial canonical correlation analysis and multiple discriminant analysis, using the general theory of the projector developed by Rao and Mitra (1971), Takeuchi and Yanai (1972), Rao (1974) and Rao and Yanai (1979). Furthermore, we shall show that the obtained results can be used effectively for clarifying the form of Euclidian distance based on the canonical variables arising from canonical correlation analysis and multiple discriminant analysis. In the next section, we give some preliminary results, and in the subsequent sections we shall show that they can be used to clarify the relations among the projectors defined in terms of canonical variables arising in canonical correlation analysis and partial canonical correlation analysis. Received Nov. 1, 1979, Revised Dec. 4, 1980. * Chiba University.

46 J. JAPAN STATIST. SOC. Vol. 11 No.1 1981

EXPLICIT EXPRESSIONS OF PROJECTORS ON CANONICAL VARIABLES 51 which is an estimate of Mahalanobis' generalized distance between the group 1 and 2 (see Yanai and Takane (1977, p. 81)). From the above result, it follows that the Euclidian distance based on canonical variables turns out to be Mahalanobis' generalized distance with a slight modification. We prove a stronger COROLLARY 6. result.

52 J. JAPAN STATIST. SOC. Vol. 11 No.1 1981 A similar reasoning as in the proof of Corollary 6 leads to the following Corollary. COROLLARY 7. Finally, we consider the Euclidian distances based on the canonical variables of canonical correlation analysis. In this case, the following theorem is established. (34) and The proofs follows immediately, using the result (10) and Corollary 2. In order to get explicit expressions of d2yb(gi, gj), we may replace X by Y and Y by X in both equations of (34). From Corollary 5, we have the following theorem, which is a generalization of Theorem 7. and The theorem follows from the result of Corollary 3 and an explicit expression of PQ2x which is QzX(X'QzX)-X'Qz. Acknowledgement Part of the result of this paper was obtained while the author was visiting the Indian Statistical Institute, New Delhi, from December, 1977 to January, 1978. I should like to express my sincere gratitude to Dr. C. R. Rao for his useful comments on this work. Thanks are also due to Dr. M. Sibuya, IBM Japan and Dr. S. Iwatsubo, National Center for Entrance Examination of Universities, for their suggestions, which led me to Corollaries 1 and 6, respectively. Furthermore, the author deeply expresses his thanks to referees for giving useful suggestions for revising the manuscript and for leading me to incorporate Corollary 4 in this paper.

EXPLICIT EXPRESSIONS OF PROJECTORS ON CANONICAL VARIABLES 53 REFERENCES [1] Cooley, W. W. and Lohnes, P. R. (1962). Multivariate Procedure for the Behavioral Science, John Wiley & Sons, New York. [2] Gnanadesikan, R. (1977). Method for Statistical Data Analysis of Multivariate Observations, John Wiley & Sons, New York. [3] Hotelling, H. (1936). Relations between two sets of variates, Biometrika, 28, 321-377. [4] Khatri, C. G. (1976). A note on multiple and canonical correlation for a singular matrix, Psychometrika, Vol. 41, No.4, 465-470. [5] Rao, C. R. (1962). A note on a generalized inverse of a matrix with applications to problems in mathematical statistics, J. of Royal Statistical Society, B 24, 152-158. [6] Rao, C. R. and Mitra, S. K. (1971). Generalized Inverse of Matrices and its Applications, John Wiley & Sons, New York. [7] Rao, C. R. (1974). Projectors, generalized inverse and BLUES, J. of Royal Statistical Society, B 36, 442-448. [8] Rao, C. R. and Yanai, H. (1979). General definition of a projector, its decomposition, and application to statistical problems, J. of Statistical Planning and Inference, Vol. 3, No.1, 1-17. [9] Timm, N. H. and Carlson, J. E. (1976). Part and bipartial canonical correlation analysis, Psychometrika, Vol. 41, 159-176. [10] Takeuchi, K. and Yanai, H. (1972). Tahenryokaiseki no Kiso (Foundations of Multivariate Analysis), Toyo Keizai Press. [11] Yanai, H. and Takane, Y. (1977). Tahenryokaiseki (Multivariate Analysis), Asakura Publishing Company, Tokyo.