Elliptically Contoured Distributions

Similar documents
x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ).

Multivariate Distributions

A Probability Review

MAS223 Statistical Inference and Modelling Exercises

The Multivariate Gaussian Distribution

Physics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester

18 Bivariate normal distribution I

Next is material on matrix rank. Please see the handout

Statistics 351 Probability I Fall 2006 (200630) Final Exam Solutions. θ α β Γ(α)Γ(β) (uv)α 1 (v uv) β 1 exp v }

Chapter 5. The multivariate normal distribution. Probability Theory. Linear transformations. The mean vector and the covariance matrix

Massachusetts Institute of Technology Instrumentation Laboratory Cambridge, Massachusetts

Multivariate Statistics

Basic Concepts in Matrix Algebra

component risk analysis

Multivariate Distributions (Hogg Chapter Two)

MLES & Multivariate Normal Theory

Lecture 3. Inference about multivariate normal distribution

01 Probability Theory and Statistics Review

Matrices and Deformation

MATH5745 Multivariate Methods Lecture 07

22.3. Repeated Eigenvalues and Symmetric Matrices. Introduction. Prerequisites. Learning Outcomes

8 - Continuous random vectors

2. Matrix Algebra and Random Vectors

Random Variables and Their Distributions

2 Functions of random variables

Week Quadratic forms. Principal axes theorem. Text reference: this material corresponds to parts of sections 5.5, 8.2,

Fundamentals of Matrices

Whitening and Coloring Transformations for Multivariate Gaussian Data. A Slecture for ECE 662 by Maliha Hossain

3. Probability and Statistics

ANOVA: Analysis of Variance - Part I

Statistics for scientists and engineers

STAT 501 Assignment 1 Name Spring 2005

The Multivariate Normal Distribution. In this case according to our theorem

Chapter 4 Multiple Random Variables

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Uncorrelatedness and Independence

1 Data Arrays and Decompositions

Principal Components Theory Notes

The Multivariate Normal Distribution 1

Contents 1. Contents

Multiple Random Variables

The Multivariate Normal Distribution

Lecture 5: Hypothesis tests for more than one sample

Exam 2. Jeremy Morris. March 23, 2006

Multivariate Gaussians. Sargur Srihari

Stat 5101 Lecture Slides: Deck 8 Dirichlet Distribution. Charles J. Geyer School of Statistics University of Minnesota

(Multivariate) Gaussian (Normal) Probability Densities

E X A M. Probability Theory and Stochastic Processes Date: December 13, 2016 Duration: 4 hours. Number of pages incl.

Chapter 4. The First Fundamental Form (Induced Metric)

Multivariate Distribution Models

Probability Background

Notes on Random Vectors and Multivariate Normal

9.1 Mean and Gaussian Curvatures of Surfaces

CS 195-5: Machine Learning Problem Set 1

Bayesian Decision Theory

Chapter 17: Undirected Graphical Models

Gaussian Models (9/9/13)

j=1 u 1jv 1j. 1/ 2 Lemma 1. An orthogonal set of vectors must be linearly independent.

Continuous Random Variables

A Few Special Distributions and Their Properties

STA 4322 Exam I Name: Introduction to Statistics Theory

From Lay, 5.4. If we always treat a matrix as defining a linear transformation, what role does diagonalisation play?

Lecture 2. Spring Quarter Statistical Optics. Lecture 2. Characteristic Functions. Transformation of RVs. Sums of RVs

A Note on Hilbertian Elliptically Contoured Distributions

Joint Distributions. (a) Scalar multiplication: k = c d. (b) Product of two matrices: c d. (c) The transpose of a matrix:

Lecture 3. Probability - Part 2. Luigi Freda. ALCOR Lab DIAG University of Rome La Sapienza. October 19, 2016

Multivariate Random Variable

Multiple Random Variables

OHSx XM521 Multivariable Differential Calculus: Homework Solutions 13.1

Lecture Semidefinite Programming and Graph Partitioning

Concentration Ellipsoids

Lie Groups for 2D and 3D Transformations

Linear Algebra: Matrix Eigenvalue Problems

ELLIPSES. Problem: Find the points on the locus. Q(x, y) = 865x 2 294xy + 585y 2 = closest to, and farthest from, the origin. Answer.

IEOR 4701: Stochastic Models in Financial Engineering. Summer 2007, Professor Whitt. SOLUTIONS to Homework Assignment 9: Brownian motion

Vectors and Matrices Statistics with Vectors and Matrices

STA 2201/442 Assignment 2

2.3. The Gaussian Distribution

Gaussian random variables inr n

8 Eigenvectors and the Anisotropic Multivariate Gaussian Distribution

3 2 6 Solve the initial value problem u ( t) 3. a- If A has eigenvalues λ =, λ = 1 and corresponding eigenvectors 1

On prediction and density estimation Peter McCullagh University of Chicago December 2004

Repeated Eigenvalues and Symmetric Matrices

Numerical Optimization

1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4.

We introduce methods that are useful in:

Change Of Variable Theorem: Multiple Dimensions

Economics 240A, Section 3: Short and Long Regression (Ch. 17) and the Multivariate Normal Distribution (Ch. 18)

More Spectral Clustering and an Introduction to Conjugacy

Physics 403 Probability Distributions II: More Properties of PDFs and PMFs

Chapter 5 continued. Chapter 5 sections

Multivariate Analysis and Likelihood Inference

5. Random Vectors. probabilities. characteristic function. cross correlation, cross covariance. Gaussian random vectors. functions of random vectors

4. Distributions of Functions of Random Variables

Random Vectors and Multivariate Normal Distributions

Random Matrix Eigenvalue Problems in Probabilistic Structural Mechanics

5.1 Consistency of least squares estimates. We begin with a few consistency results that stand on their own and do not depend on normality.

Multivariate Distributions

16.584: Random Vectors

MAS223 Statistical Inference and Modelling Exercises and Solutions

Transcription:

Elliptically Contoured Distributions Recall: if X N p µ, Σ), then { 1 f X x) = exp 1 } det πσ x µ) Σ 1 x µ) So f X x) depends on x only through x µ) Σ 1 x µ), and is therefore constant on the ellipsoidal surfaces x µ) Σ 1 x µ) = k. Each ellipsoid is centered at µ and its principal axes are in the directions of the eigenvectors of Σ, with lengths proportional to eigenvalues of Σ. NC STATE UNIVERSITY 1 / 8

The general elliptically contoured distribution has pdf 1 g [ x ν) Λ 1 x ν) ] det Λ where: Λ is positive definite; g ) 0, with... g y y) dy 1 dy... dy p = 1. If X has this distribution and Y = L 1 X ν), where LL = Λ, then Y has pdf g y y), spherically contoured. NC STATE UNIVERSITY / 8

Polar Coordinates Every y can be written in polar coordinates as y 1 = r sin θ 1, y = r cos θ 1 sin θ, y 3 = r cos θ 1 cos θ sin θ 3,. y p 1 = r cos θ 1 cos θ... cos θ p sin θ p 1, y p = r cos θ 1 cos θ... cos θ p cos θ p 1 where: r 0; 1 π < θ i 1 π, 1 i p 1, and π < θ p 1 π. NC STATE UNIVERSITY 3 / 8

The Jacobian Jr, θ) = r p 1 [cos θ 1 )] p [cos θ )] p 3... [cos θ p 3 )] cos θ p ) If the pdf of Y is g y y), then the pdf of R, Θ is Jr, θ)g r ) = r p 1 g r ) [cos θ 1 )] p [cos θ )] p 3... [cos θ p 3 )] cos θ p ) R and Θ are independent, and the marginal pdf of R is π 1 p Γ 1 p) r p 1 g r ) = Cp) r p 1 g r ). NC STATE UNIVERSITY 4 / 8

Here Cp) is the surface area of the p-dimensional unit sphere. The vector U = Y/R is uniformly distributed on that surface, and, because it is a function of Θ, is independent of R. So Y can be written as RU, where R and U are independent, R has density Cp) r p 1 g r ), and U is uniformly distributed on the surface of the p-dimensional unit sphere. NC STATE UNIVERSITY 5 / 8

Moments If O is an orthogonal matrix O O = I p ), and V = OU then V V = U U = 1, so V also takes values on the surface of the unit sphere. Also the Jacobian is det O = 1, so V is uniformly distributed on the surface of the p-dimensional unit sphere, the same as U. So EU) = EOU) = OEU) for every orthogonal O EU) = 0 take O = I p, for instance). NC STATE UNIVERSITY 6 / 8

Similarly, if E UU ) = Σ, then Σ = E VV ) = OΣO for every orthogonal O Σ = ki p for some k not so trivial... )... and U U = 1 kp = traceσ) = E [ trace UU )] = E U U ) = 1 Σ = E UU ) = p 1 I p. NC STATE UNIVERSITY 7 / 8

Finally: if ER) <, then EY) = ERU) = ER)EU) = 0; if E R ) <, then E YY ) = E R UU ) = E R ) E UU ) = 1 p E R ) I p and since X = LY + ν, under the same conditions EX) = ν and CX) = LCY)L = 1 p E R ) LL = 1 p E R ) Λ NC STATE UNIVERSITY 8 / 8

Marginal Distributions If Y is spherically contoured with density g y y), and Y is partitioned as ) Y 1) Y = Y ) where Y 1) contains Y 1, Y,..., Y q, and Y ) contains Y q+1, Y q+,..., Y p, the marginal density of Y ) is g q+1):p y q+1, y q+,..., y p ) =... g u u + y y ) du = Cq) 0 = g y y ), say. g r 1 + y y ) r q 1 1 dr 1 NC STATE UNIVERSITY 9 / 8

So the marginal distribution of Y ) is spherically contoured. Now suppose that X has the elliptically contoured density 1 g [ x ν) Λ 1 x ν) ] det Λ and is similarly partitioned, and as for the multivariate normal distribution we let Z 1) = X 1) Σ 1 Σ 1 X) = X 1) Λ 1 Λ 1 X) Z ) = X ) NC STATE UNIVERSITY 10 / 8

Then the density of Z is [ 1 g z 1) τ 1)) Λ 1 det 11 z 1) τ 1)) Λ11 det Λ + z ) ν )) Λ 1 z ) ν ))] where τ 1) = ν 1) Λ 1 Λ 1 ν) Note that Z 1) and Z ) are uncorrelated, but independent only for the multivariate normal distribution. NC STATE UNIVERSITY 11 / 8

The marginal density of Z ) = X ) is [ 1 g x ) ν )) Λ 1 x ) ν ))] det Λ = Cq) 0 g [ r1 + x ) ν )) Λ 1 x ) ν ))] r q 1 1 dr 1 So the marginal density of X ) is also elliptically contoured. NC STATE UNIVERSITY 1 / 8

Uniform Distribution on a Sphere Consider the special case Y = pu, uniformly distributed on the sphere of radius p. Then Y 1 = pu 1 = p sin Θ 1 ), and the pdf of Θ 1 is proportional to [cos θ 1 )] p 1. So the pdf of Y 1 and hence of every Y i ) is proportional to 1 y p ) p 3 NC STATE UNIVERSITY 13 / 8

This is the Beta density with parameters α = β = p 1)/, scaled to the interval [ p, p]. Special cases: p = 1: the density isn t integrable, so the argument fails. Clearly Y 1 takes the values ±1 with probability 1. p = : the density is U-shaped, unbounded at ±. p = 3: the density is uniform on ± 3. p : the density converges to N0, 1). NC STATE UNIVERSITY 14 / 8

More generally, for any fixed q, the marginal density of Y 1) consisting of Y 1, Y,..., Y q ) converges to N q 0, I q ) as p. If the radius is a random R p instead of the fixed p, the density of Y 1) is the corresponding mixture of scaled distributions, and as p converges to the corresponding mixture of normal distributions. So if Y is spherically contoured because it is part of a larger random vector that is also spherically contoured, of arbitrarily large dimension, its distribution must be a mixture of spherical normal distributions. NC STATE UNIVERSITY 15 / 8

Conditional Distributions The conditional density of Y 1) given Y ) = y is g y 1 y 1 + y y ) g y y = g y 1 y 1 + r ) ) ) g r where r = y y. This is a spherically contoured density, so Y 1) has conditional mean zero and conditional covariance matrix C Y 1) Y ) = y )) = 1 q E Y 1) Y 1) ) R = r I q NC STATE UNIVERSITY 16 / 8

After simplification, E Y 1) Y 1) ) R = r = 0 0 r q+1 1 g r 1 + r ) dr1 r q 1 1 g r 1 + r ) dr1 In general, this depends on the conditional value r. It does not depend on r for the multivariate normal, because in that case g r1 + r ) g r ) ) = 1 g r g0) NC STATE UNIVERSITY 17 / 8

The conditional density of X 1) given X ) = x ) is { 1 [ ) g x 1) ν 1) B x ) ν ))] g r Λ 1 det 11 Λ11 [ x 1) ν 1) B x ) ν ))] } + r where r = B = Λ 1 Λ 1 x ) ν )) Λ 1 x ) ν )), NC STATE UNIVERSITY 18 / 8

This conditional density is also elliptically contoured, with E X 1) X ) = x )) = ν 1) + B x ) ν )) and C X 1) X ) = x )) = 1 q E R 1 R = r ) Λ11 The linearity of the conditional mean is shared with the multivariate normal distribution. The conditional covariance matrix is in general not constant, the exception being the multivariate normal distribution. NC STATE UNIVERSITY 19 / 8

Example: Multivariate t Suppose that Z N p 0, I p ), and ms χ m, independent of Z. Then Y = 1/s)Z has the multivariate t density with m degrees of freedom, and Γ m+p ) Γ ) m mπ) p/ R p = Y Y p ) 1 + y y m+p m F p,m = χ p/p χ m/m Every normalized linear combination c Y with c c = 1 has the univariate t-distribution with m degrees of freedom. NC STATE UNIVERSITY 0 / 8

Note: if m >, E R /p ) = m/m ), so CY) = m m I p. The standardized multivariate t-distribution is defined by Y s = m m m Y = ms so that E Y sy s /p) = 1 and CY s ) = I p. The standardized multivariate t-distribution is convenient for simulating long-tailed data with a given mean vector and covariance matrix, but it is not the familiar version. Z NC STATE UNIVERSITY 1 / 8

For the multivariate t-distribution, the conditional density of Y 1) given Y ) = y is also multivariate t, with m = m + p q) degrees of freedom, but with a scale factor that depends on r = y y. Conditionallly on Y ) = y, Y 1) = 1 Y 1) 1 + r p q) m has the conventional multivariate t-distribution in q dimensions with m degrees of freedom. Since this distribution does not depend on y, Y 1) is independent of Y ). NC STATE UNIVERSITY / 8

So if m >, C ) Y 1) Y ) = y = = [ 1 + r [ 1 + r = m + r m = m + r m I q. ] p q) C m ] p q) m p q) Y 1) m m I q I q ) Y ) = y That is, large values of Y ) in magnitude) increase the conditional covariance matrix of Y 1), and small values decrease it. NC STATE UNIVERSITY 3 / 8

The reason is that the magnitude of Y ) provides information about s : conditionally on Y ) = y, m + r ) s χ m. Heuristically, a large value of Y ) suggests a small value of s, which in turn implies a large value of Y 1). Similarly, a small value of Y ) suggests a large value of s, which in turn implies a small value of Y 1). NC STATE UNIVERSITY 4 / 8

For the general elliptically contoured multivariate t-distribution with m degrees of freedom, the density function is Γ m Γ m+p ) ) mπ) p/ det Λ [ 1 + x µ) Λ 1 ] m+p x µ) m The conditional density of X 1) given X ) = x is correspondingly multivariate t-distribution with m degrees of freedom. NC STATE UNIVERSITY 5 / 8

The conditional distribution is centered at µ 1) + B x ) µ )) where B = Λ 1 Λ 1. If m > 1 as it must be), this is the conditional mean. NC STATE UNIVERSITY 6 / 8

The matrix of the quadratic form in the conditional density is where now m + r m Λ 11 r = x ) µ )) Λ 1 If m >, the conditional covariance matrix is x ) µ )) m + r m Λ 11 NC STATE UNIVERSITY 7 / 8

The conditional structure of the multivariate t-distribution differs from that of the multivariate normal distribution in a surprisingly simple way: The conditional mean is the same linear function of x ) ; The conditional covariance matrix is multiplied by a scalar) quadratic function of x ), instead of being constant. NC STATE UNIVERSITY 8 / 8