1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4.
|
|
- Madeleine Gloria Knight
- 5 years ago
- Views:
Transcription
1 Multivariate normal distribution Reading: AMSA: pages Multivariate Analysis, Spring 2016 Institute of Statistics, National Chiao Tung University March 1, 2016
2 1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4. Assessing the assumption of normality 5. Detecting outliers and cleaning data 6. Transformations to near normality 2 / 26
3 X 1. X p Density = X (p 1) : random vector X N p (µ (p 1), Σ (p p) ) (i.e., follow a p-dimensional normal distribution) f (x) = 1 (2π) p 2 Σ 1 2 e (x µ)t Σ 1 (x µ)/2 where < x i <, i = 1, 2,, p 3 / 26
4 p = 2, Bivariate normal density f (x 1, x 2 ) = 1 2π σ 11 σ 22 (1 ρ 212 ) { exp [ ( ) 1 x1 µ 2 ( ) 1 x2 µ 2 2 2(1 ρ 2 12 ) + σ11 σ22 ( ) ( ) ]} x1 µ 1 x2 µ 2 2ρ 12 σ11 σ22 [ ] µ = (µ 1, µ 2 ) T σ11 σ, Σ = 12, ρ σ 12 σ 12 = σ 12 σ11 σ / 26
5 Properties Result: For a p-dimensional normal dist. X N p (µ, Σ), 1. µ is the point of maximum density (the mode). 2. µ is the expected value of X (the mean). Result: 1. If X N p (µ, Σ), then for any a (p 1) vector, a T X N(a T µ, a T Σa). 2. Also if a T X N(a T µ, a T Σa) for every a, then X N p (µ, Σ). 5 / 26
6 Result: If X N p (µ, Σ), 1. the q linear combination a 11 X a 1p X p A (q p) X =. a q1 X a qp X p are distributed as N q (Aµ, AΣA T ). 2. Also X + d, where d is a vector of constant, is distributed as N p (µ + d, Σ). 6 / 26
7 Result: X N p (µ, Σ), and X (p 1) = [ X1 (q 1) X 2 ((p q) 1) ] µ (p 1) = [ µ1 (q 1) [ Σ11 Σ (p p) = (q q) Σ 12 (q (p q)) then X 1 N q (µ 1, Σ 11 ). Σ 21 ((p q) q) µ 2 ((p q) 1) Σ 22 ((p q) (p q)) proof: Let A q p = [I (q q) 0 (q (p q)) ], then AX = X 1 N q ( Aµ, AΣA }{{}}{{ T }). µ Σ 1 11 Note: This implies that, for multivariate normal, all marginal dist. are normal. ] ] 7 / 26
8 Result: (a) If X 1 (q1 1) and X 2 (q2 1) are independent, then Cov(X [ ] 1, X 2 ) q1 q 2 = 0. [ ] [ ] ) X1 µ1 Σ11 Σ (b) If N X (q1 +q 2 )(, 12, then X 2 µ 2 Σ 21 Σ 1 and 22 X 2 are independent if and only if Σ 12 = 0. (c) If X 1 N q1 (µ 1, Σ 11 ), X 2 N q2 (µ 2, Σ 22 ), and X 1 and X 2 are independent, then [ X1 X 2 ] N (q1 +q 2 )( [ µ1 µ 2 ], [ Σ11 0 ] ) 0 Σ 22 (Note: marginal normals do not imply that their combined vector is normal) 8 / 26
9 [ ] X1 Result: Let X = N X p (µ, Σ) with [ ] [ 2 ] µ1 Σ11 Σ µ =, Σ = 12, and Σ µ 2 Σ 21 Σ 22 > 0. Then the 22 conditional dist. of X 1 given X 2 = x 2 is normal and has mean = µ 1 + Σ 12 Σ 1 22 (x 2 µ 2 ) covariance = Σ 11 Σ 12 Σ 1 22 Σ 21. Note: Above result implies that, for multivariate normal, 1. All conditional dist. are normal. 2. The conditional mean is the form µ 1 + β 1,q+1 (x q+1 µ q+1 ) + + β 1,p (x p µ p ). µ q + β q,q+1 (x q+1 µ q+1 ) + + β q,p (x p µ p ) where β s are defined by Σ 12 Σ The conditional covariance, Σ 11 Σ 12 Σ 1 22 Σ 21, dose not dependent on the value of the conditional variable, x 2. 9 / 26
10 Result: Let X N p (µ, Σ) with Σ > 0. Then (a) (X µ) T Σ 1 (X µ) is distributed as χ 2 p, where χ 2 p denotes the Chi-Square dist. with p degrees of freedom. (b) The N p (µ, Σ) dist. assigns prob. 1 α to the solid ellipsoid {x : (x µ) T Σ 1 (x µ) χ 2 p(α)}, where χ 2 p(α) denotes the upper (100α)th percentile of χ 2 p dist. proof: Note: Z = Σ 1 2 (X µ) has Np (0, I) and (X µ) T Σ 1 (X µ) = Z T Z. 10 / 26
11 Result: Let X 1,, X n be mutually independent with X j N p (µ j, Σ). (Note, X j s have the same covariance matrix Σ). Then V 1 = c 1 X c n X n N p ( n j=1 c jµ j, ( n j=1 c j 2)Σ). Moreover, V 1 and V 2 = b 1 X b n X n are jointly multivariate normal with covariance matrix [ ( n j=1 c j 2)Σ ] (bt c)σ (b T c)σ ( n j=1 b2 j )Σ. Consequently, V 1 and V 2 are independent if b T c = n j=1 c jb j = 0. proof: 11 / 26
12 Sampling from multivariate normal and MLE X 1,, X n : random sample from N p (µ, Σ) = = {Joint density of X 1,, X n } { n { 1 exp 1 [ ] }} (x j=1 (2π) p 2 Σ 1 j µ) T Σ 1 (x j µ) n [ ] exp (2π) np 2 Σ n 2 1 (x j µ) T Σ 1 (x j µ) 2 j=1 Result: Let X 1,, X n be random sample from N p (µ, Σ). Then µ = 1 n X j = X n Σ = 1 n (X j X)(X j X) T = n 1 n n S j=1 are MLE of µ and Σ. j=1 12 / 26
13 Result: Let X 1,, X n be random sample from N p (µ, Σ). Then X and S are sufficient statistics. proof: The joint density depends on the whole set of observations x 1,, x n only through x and n j=1 (x j x)(x j x) T = (n 1)S. Note: Many multivariate techniques begin with sample mean and covariance, it is prudent to check on the adequacy of multivariate normal assumption. If data cannot be regarded as multivariate normal, techniques that depends solely on x and S may ignore useful information. 13 / 26
14 Sampling dist. and large sample behavior of X and S For univariate case (p = 1), X 1,, X n are random sample from N(µ, σ 2 ), then 1. X N(µ, 1 n σ2 ) 2. (n 1)S 2 (= n j=1 (X j X ) 2 ) σ 2 χ 2 n 1 3. X and S 2 are independent. Wishart distribution (multivariate analogy of χ 2 ) Let Z 1,, Z m be independently distributed as N p (0, Σ), then W m ( Σ) = Wishart dist. with m d.f. = distribution of m j=1 Z jz T j 14 / 26
15 Result: Let X 1,, X n be a random sample of size n from a p-variate normal dist. with mean µ and covariance matrix Σ. Then 1. X N p (µ, 1 n Σ) 2. (n 1)S W n 1 ( Σ) 3. X and S are independent Law of larger number: X 1,, X n be a random sample with mean µ and covariance Σ. Then, X p µ (converge in probability to µ) n proof: S (or S n ) p Σ n 15 / 26
16 Central limit theorem: Let X 1,, X n be independent observations from a population with mean µ and covariance Σ. Then and n(x µ) n(x µ) T S 1 (X µ) d (n p) N p(0, Σ) d χ 2 p (n p) Note: From previously, n(x µ) T Σ 1 (X µ) χ 2 p if X N p (µ, 1 n Σ). The χ2 p is approximately the sampling dist. of n(x µ) T Σ 1 (X µ) when X is approximately normally distributed. Replacing Σ 1 by S 1 does not seriously affect this approximation for n is much larger than p. 16 / 26
17 Assessing the assumption of normality In practical work, the investigation of normality in one-dimension and two-dimension are ordinarily sufficient. We want address these questions: 1. Do the marginal distribution of X = (X 1,, X p ) appear to be normal? What about a few linear combinations of the components X i? 2. Do the scatter plots of pairs of observations on different characteristics give the elliptical appearance expected form normal populations? 3. Are there any wild observation that should be checked for accuracy? 17 / 26
18 Evaluating the normality of the univariate normal distribution Histogram of each X i, and observed proportion of observations lying in Q-Q plot against normal ( x i s ii, x i + s ii ) ( x i 2 s ii, x i + 2 s ii ) Shapiro and Wilk test for normality 18 / 26
19 Evaluating bivariate normality If the observations were generated from a multivariate normal distribution, each bivariate distribution should be normal, and the contours of constant density would be ellipses. The scatter plots for pairs of characteristics should exhibit an overall pattern that is nearly elliptical. Since X N p (µ, Σ) (X µ) T Σ 1 (X µ) χ 2 p so { all x such that (x x) T S 1 (x x) χ 2 p(1 r) } should be roughly r 100%. 19 / 26
20 For the sample observations x 1, x 2,, x n, can do Q-Q plot of d 2 j = (x j x) T S 1 (x j x), j = 1,, n versus quantiles of χ 2 p (the chi-square plot). Should be a line with slope 1 if x j s are from multivariate normal. 1. Order the dj 2 s from smallest to largest as d(1) 2 d (1) 2 d (n) 2 (. ) 2. Graph the pairs q c,p ( j 1 2 n ), d (j) 2, where q c,p ( j 1 2 n ) is the ( j 1 2 n ) 100 quantile of χ2 p. 20 / 26
21 Detecting outliers and cleaning data 1. Make a dot plot for each variable. 2. Make a scatter plot for each pair of variables. 3. Calculate the standardized values z jk = x jk x k, j = 1,, n, k = 1,, p skk Examine for large or small values (usually for z jk 3.5). 4. Calculate d 2 j = (x j x) T S 1 (x j x) In a chi-square plot (d 2 j s versus quantiles of χ2 p), examine those points that are farthest from the origin (usually for exceed (larger than) χ 2 p(0.05)). 21 / 26
22 Transformations to normality Method: To make non-normal data more normal looking by replacing x with g(x) For non-continuous data Original scale count, y proportion, ˆp Transformed scale y ( logit(ˆp) = log ˆp 1 ˆp correlation, r Fisher s z(r) = 1 2 log ( 1+r 1 r ) ) 22 / 26
23 Transformations to normality: continuous data Skewed data 23 / 26
24 Power transformation 24 / 26
25 Box-Cox transformation: for x > 0 { x λ x (λ) 1 λ 0 = λ ln(x) λ = 0 The choice of λ is the solution that maximize [ ] l(λ) = n 2 ln 1 n (x (λ) j x n (λ) ) 2 + (λ 1) j=1 where x (λ) = 1 n n j=1 x (λ) j. Why does it work? n ln(x j ), j=1 25 / 26
26 Transforming multivariate data Method 1: make each marginal distribution approximately normal by, e.g., Box-Cox transformation Method 2: Multivariate Box-Cox transformation Choose λ = (λ 1,, λ p ) which maximizes l(λ 1,, λ p ) = n ln S(λ) 2 n + (λ 1 1) ln(x j1 ). + (λ p 1) j=1 n ln(x jp ) where S(λ) is the sample covariance matrix computed by j=1 x (λ) j = [ λ x 1 j1 1 λ 1 x λp jp 1 ] T, j = 1,, n λ p 26 / 26
STT 843 Key to Homework 1 Spring 2018
STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ
More informationMultivariate Analysis Homework 1
Multivariate Analysis Homework A490970 Yi-Chen Zhang March 6, 08 4.. Consider a bivariate normal population with µ = 0, µ =, σ =, σ =, and ρ = 0.5. a Write out the bivariate normal density. b Write out
More informationExam 2. Jeremy Morris. March 23, 2006
Exam Jeremy Morris March 3, 006 4. Consider a bivariate normal population with µ 0, µ, σ, σ and ρ.5. a Write out the bivariate normal density. The multivariate normal density is defined by the following
More informationTUTORIAL 2 STA437 WINTER 2015
TUTORIAL 2 STA437 WINTER 2015 AL NOSEDAL Contents 1. Assessing the assumption of Normality 1 1.1. Evaluating the Normality of the Univariate Marginal Distributions 1 1.2. Evaluating Bivariate Normality
More informationInferences about a Mean Vector
Inferences about a Mean Vector Edps/Soc 584, Psych 594 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University
More informationYou can compute the maximum likelihood estimate for the correlation
Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute
More informationMultivariate Capability Analysis Using Statgraphics. Presented by Dr. Neil W. Polhemus
Multivariate Capability Analysis Using Statgraphics Presented by Dr. Neil W. Polhemus Multivariate Capability Analysis Used to demonstrate conformance of a process to requirements or specifications that
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationSTA 2201/442 Assignment 2
STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution
More informationThe Multivariate Normal Distribution 1
The Multivariate Normal Distribution 1 STA 302 Fall 2017 1 See last slide for copyright information. 1 / 40 Overview 1 Moment-generating Functions 2 Definition 3 Properties 4 χ 2 and t distributions 2
More information5 Inferences about a Mean Vector
5 Inferences about a Mean Vector In this chapter we use the results from Chapter 2 through Chapter 4 to develop techniques for analyzing data. A large part of any analysis is concerned with inference that
More informationThe Multivariate Normal Distribution 1
The Multivariate Normal Distribution 1 STA 302 Fall 2014 1 See last slide for copyright information. 1 / 37 Overview 1 Moment-generating Functions 2 Definition 3 Properties 4 χ 2 and t distributions 2
More informationApplied Multivariate and Longitudinal Data Analysis
Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference
More informationRandom Vectors 1. STA442/2101 Fall See last slide for copyright information. 1 / 30
Random Vectors 1 STA442/2101 Fall 2017 1 See last slide for copyright information. 1 / 30 Background Reading: Renscher and Schaalje s Linear models in statistics Chapter 3 on Random Vectors and Matrices
More informationIntroduction to Normal Distribution
Introduction to Normal Distribution Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 17-Jan-2017 Nathaniel E. Helwig (U of Minnesota) Introduction
More informationLecture 11. Multivariate Normal theory
10. Lecture 11. Multivariate Normal theory Lecture 11. Multivariate Normal theory 1 (1 1) 11. Multivariate Normal theory 11.1. Properties of means and covariances of vectors Properties of means and covariances
More informationBIOS 2083 Linear Models Abdus S. Wahed. Chapter 2 84
Chapter 2 84 Chapter 3 Random Vectors and Multivariate Normal Distributions 3.1 Random vectors Definition 3.1.1. Random vector. Random vectors are vectors of random variables. For instance, X = X 1 X 2.
More informationMultivariate Statistics
Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical
More informationx. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ).
.8.6 µ =, σ = 1 µ = 1, σ = 1 / µ =, σ =.. 3 1 1 3 x Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ ). The Gaussian distribution Probably the most-important distribution in all of statistics
More informationMultivariate Analysis and Likelihood Inference
Multivariate Analysis and Likelihood Inference Outline 1 Joint Distribution of Random Variables 2 Principal Component Analysis (PCA) 3 Multivariate Normal Distribution 4 Likelihood Inference Joint density
More informationTables Table A Table B Table C Table D Table E 675
BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values
More informationRandom Vectors and Multivariate Normal Distributions
Chapter 3 Random Vectors and Multivariate Normal Distributions 3.1 Random vectors Definition 3.1.1. Random vector. Random vectors are vectors of random 75 variables. For instance, X = X 1 X 2., where each
More informationIntroduction to Computational Finance and Financial Econometrics Probability Review - Part 2
You can t see this text! Introduction to Computational Finance and Financial Econometrics Probability Review - Part 2 Eric Zivot Spring 2015 Eric Zivot (Copyright 2015) Probability Review - Part 2 1 /
More informationLecture 3. Inference about multivariate normal distribution
Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates
More informationMultivariate Linear Models
Multivariate Linear Models Stanley Sawyer Washington University November 7, 2001 1. Introduction. Suppose that we have n observations, each of which has d components. For example, we may have d measurements
More informationMIT Spring 2015
Assessing Goodness Of Fit MIT 8.443 Dr. Kempthorne Spring 205 Outline 2 Poisson Distribution Counts of events that occur at constant rate Counts in disjoint intervals/regions are independent If intervals/regions
More information(Multivariate) Gaussian (Normal) Probability Densities
(Multivariate) Gaussian (Normal) Probability Densities Carl Edward Rasmussen, José Miguel Hernández-Lobato & Richard Turner April 20th, 2018 Rasmussen, Hernàndez-Lobato & Turner Gaussian Densities April
More informationRandom Variables and Their Distributions
Chapter 3 Random Variables and Their Distributions A random variable (r.v.) is a function that assigns one and only one numerical value to each simple event in an experiment. We will denote r.vs by capital
More informationA Probability Review
A Probability Review Outline: A probability review Shorthand notation: RV stands for random variable EE 527, Detection and Estimation Theory, # 0b 1 A Probability Review Reading: Go over handouts 2 5 in
More informationNotes on Random Vectors and Multivariate Normal
MATH 590 Spring 06 Notes on Random Vectors and Multivariate Normal Properties of Random Vectors If X,, X n are random variables, then X = X,, X n ) is a random vector, with the cumulative distribution
More informationSTAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.
STAT 01 Assignment NAME Spring 00 Reading Assignment: Written Assignment: Chapter, and Sections 6.1-6.3 in Johnson & Wichern. Due Monday, February 1, in class. You should be able to do the first four problems
More informationSTAT 512 sp 2018 Summary Sheet
STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}
More informationStat 206: the Multivariate Normal distribution
Stat 6: the Multivariate Normal distribution James Johndrow (adapted from Iain Johnstone s notes) 16-11- Introduction The multivariate normal distribution plays a central role in multivariate statistics
More information7 Multivariate Statistical Models
7 Multivariate Statistical Models 7.1 Introduction Often we are not interested merely in a single random variable but rather in the joint behavior of several random variables, for example, returns on several
More informationCHAPTER 5. Outlier Detection in Multivariate Data
CHAPTER 5 Outlier Detection in Multivariate Data 5.1 Introduction Multivariate outlier detection is the important task of statistical analysis of multivariate data. Many methods have been proposed for
More informationThis does not cover everything on the final. Look at the posted practice problems for other topics.
Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry
More informationRandom vectors X 1 X 2. Recall that a random vector X = is made up of, say, k. X k. random variables.
Random vectors Recall that a random vector X = X X 2 is made up of, say, k random variables X k A random vector has a joint distribution, eg a density f(x), that gives probabilities P(X A) = f(x)dx Just
More informationMultivariate Gaussians. Sargur Srihari
Multivariate Gaussians Sargur srihari@cedar.buffalo.edu 1 Topics 1. Multivariate Gaussian: Basic Parameterization 2. Covariance and Information Form 3. Operations on Gaussians 4. Independencies in Gaussians
More information8 - Continuous random vectors
8-1 Continuous random vectors S. Lall, Stanford 2011.01.25.01 8 - Continuous random vectors Mean-square deviation Mean-variance decomposition Gaussian random vectors The Gamma function The χ 2 distribution
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Comparisons of Two Means Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c
More information2. Multivariate Distributions
2. Multivariate Distributions Random vectors: mean, covariance matrix, linear transformations, dependence measures (a short introduction on the probability tools for multivariate statistics). Multidimensional
More informationPart 6: Multivariate Normal and Linear Models
Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of
More informationSTAT 4385 Topic 01: Introduction & Review
STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics
More informationChapter 5 continued. Chapter 5 sections
Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions
More informationEEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as
L30-1 EEL 5544 Noise in Linear Systems Lecture 30 OTHER TRANSFORMS For a continuous, nonnegative RV X, the Laplace transform of X is X (s) = E [ e sx] = 0 f X (x)e sx dx. For a nonnegative RV, the Laplace
More informationPattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions
Pattern Recognition and Machine Learning Chapter 2: Probability Distributions Cécile Amblard Alex Kläser Jakob Verbeek October 11, 27 Probability Distributions: General Density Estimation: given a finite
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Introduction Edps/Psych/Stat/ 584 Applied Multivariate Statistics Carolyn J Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c Board of Trustees,
More informationMath 152. Rumbos Fall Solutions to Assignment #12
Math 52. umbos Fall 2009 Solutions to Assignment #2. Suppose that you observe n iid Bernoulli(p) random variables, denoted by X, X 2,..., X n. Find the LT rejection region for the test of H o : p p o versus
More informationMA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems
MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Review of Basic Probability The fundamentals, random variables, probability distributions Probability mass/density functions
More information401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.
401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis
More information18 Bivariate normal distribution I
8 Bivariate normal distribution I 8 Example Imagine firing arrows at a target Hopefully they will fall close to the target centre As we fire more arrows we find a high density near the centre and fewer
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationThe purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.
Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That
More informationElliptically Contoured Distributions
Elliptically Contoured Distributions Recall: if X N p µ, Σ), then { 1 f X x) = exp 1 } det πσ x µ) Σ 1 x µ) So f X x) depends on x only through x µ) Σ 1 x µ), and is therefore constant on the ellipsoidal
More informationThe Delta Method and Applications
Chapter 5 The Delta Method and Applications 5.1 Local linear approximations Suppose that a particular random sequence converges in distribution to a particular constant. The idea of using a first-order
More informationExercises and Answers to Chapter 1
Exercises and Answers to Chapter The continuous type of random variable X has the following density function: a x, if < x < a, f (x), otherwise. Answer the following questions. () Find a. () Obtain mean
More information1 Exercises for lecture 1
1 Exercises for lecture 1 Exercise 1 a) Show that if F is symmetric with respect to µ, and E( X )
More informationBASICS OF PROBABILITY
October 10, 2018 BASICS OF PROBABILITY Randomness, sample space and probability Probability is concerned with random experiments. That is, an experiment, the outcome of which cannot be predicted with certainty,
More information01 Probability Theory and Statistics Review
NAVARCH/EECS 568, ROB 530 - Winter 2018 01 Probability Theory and Statistics Review Maani Ghaffari January 08, 2018 Last Time: Bayes Filters Given: Stream of observations z 1:t and action data u 1:t Sensor/measurement
More informationAppendix 2. The Multivariate Normal. Thus surfaces of equal probability for MVN distributed vectors satisfy
Appendix 2 The Multivariate Normal Draft Version 1 December 2000, c Dec. 2000, B. Walsh and M. Lynch Please email any comments/corrections to: jbwalsh@u.arizona.edu THE MULTIVARIATE NORMAL DISTRIBUTION
More informationACM 116: Lectures 3 4
1 ACM 116: Lectures 3 4 Joint distributions The multivariate normal distribution Conditional distributions Independent random variables Conditional distributions and Monte Carlo: Rejection sampling Variance
More information5.1 Consistency of least squares estimates. We begin with a few consistency results that stand on their own and do not depend on normality.
88 Chapter 5 Distribution Theory In this chapter, we summarize the distributions related to the normal distribution that occur in linear models. Before turning to this general problem that assumes normal
More informationunadjusted model for baseline cholesterol 22:31 Monday, April 19,
unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol
More informationCourse: ESO-209 Home Work: 1 Instructor: Debasis Kundu
Home Work: 1 1. Describe the sample space when a coin is tossed (a) once, (b) three times, (c) n times, (d) an infinite number of times. 2. A coin is tossed until for the first time the same result appear
More informationBivariate Distributions. Discrete Bivariate Distribution Example
Spring 7 Geog C: Phaedon C. Kyriakidis Bivariate Distributions Definition: class of multivariate probability distributions describing joint variation of outcomes of two random variables (discrete or continuous),
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Two sample T 2 test 1 Two sample T 2 test 2 Analogous to the univariate context, we
More informationLecture 3. Probability - Part 2. Luigi Freda. ALCOR Lab DIAG University of Rome La Sapienza. October 19, 2016
Lecture 3 Probability - Part 2 Luigi Freda ALCOR Lab DIAG University of Rome La Sapienza October 19, 2016 Luigi Freda ( La Sapienza University) Lecture 3 October 19, 2016 1 / 46 Outline 1 Common Continuous
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More informationIII - MULTIVARIATE RANDOM VARIABLES
Computational Methods and advanced Statistics Tools III - MULTIVARIATE RANDOM VARIABLES A random vector, or multivariate random variable, is a vector of n scalar random variables. The random vector is
More informationMULTIVARIATE DISTRIBUTIONS
Chapter 9 MULTIVARIATE DISTRIBUTIONS John Wishart (1898-1956) British statistician. Wishart was an assistant to Pearson at University College and to Fisher at Rothamsted. In 1928 he derived the distribution
More informationNotes on the Multivariate Normal and Related Topics
Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions
More informationWLS and BLUE (prelude to BLUP) Prediction
WLS and BLUE (prelude to BLUP) Prediction Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark April 21, 2018 Suppose that Y has mean X β and known covariance matrix V (but Y need
More informationContinuous Distributions
Chapter 3 Continuous Distributions 3.1 Continuous-Type Data In Chapter 2, we discuss random variables whose space S contains a countable number of outcomes (i.e. of discrete type). In Chapter 3, we study
More informationLearning Objectives for Stat 225
Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:
More informationChapter 3: Maximum Likelihood Theory
Chapter 3: Maximum Likelihood Theory Florian Pelgrin HEC September-December, 2010 Florian Pelgrin (HEC) Maximum Likelihood Theory September-December, 2010 1 / 40 1 Introduction Example 2 Maximum likelihood
More informationThe generative approach to classification. A classification problem. Generative models CSE 250B
The generative approach to classification The generative approach to classification CSE 250B The learning process: Fit a probability distribution to each class, individually To classify a new point: Which
More information3. Probability and Statistics
FE661 - Statistical Methods for Financial Engineering 3. Probability and Statistics Jitkomut Songsiri definitions, probability measures conditional expectations correlation and covariance some important
More informationIn most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up
THE UNIVERSITY OF MINNESOTA Statistics 5401 September 17, 2005 Chi-Squared Q-Q plots to Assess Multivariate Normality Suppose x 1, x 2,..., x n is a random sample from a p-dimensional multivariate distribution
More information2. Matrix Algebra and Random Vectors
2. Matrix Algebra and Random Vectors 2.1 Introduction Multivariate data can be conveniently display as array of numbers. In general, a rectangular array of numbers with, for instance, n rows and p columns
More information4 Statistics of Normally Distributed Data
4 Statistics of Normally Distributed Data 4.1 One Sample a The Three Basic Questions of Inferential Statistics. Inferential statistics form the bridge between the probability models that structure our
More informationBayes Decision Theory
Bayes Decision Theory Minimum-Error-Rate Classification Classifiers, Discriminant Functions and Decision Surfaces The Normal Density 0 Minimum-Error-Rate Classification Actions are decisions on classes
More informationMAS223 Statistical Inference and Modelling Exercises
MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,
More informationAsymptotic Statistics-VI. Changliang Zou
Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous
More information2.3. The Gaussian Distribution
78 2. PROBABILITY DISTRIBUTIONS Figure 2.5 Plots of the Dirichlet distribution over three variables, where the two horizontal axes are coordinates in the plane of the simplex and the vertical axis corresponds
More informationStat 5101 Notes: Brand Name Distributions
Stat 5101 Notes: Brand Name Distributions Charles J. Geyer September 5, 2012 Contents 1 Discrete Uniform Distribution 2 2 General Discrete Uniform Distribution 2 3 Uniform Distribution 3 4 General Uniform
More informationProbability Background
Probability Background Namrata Vaswani, Iowa State University August 24, 2015 Probability recap 1: EE 322 notes Quick test of concepts: Given random variables X 1, X 2,... X n. Compute the PDF of the second
More informationStatistics Ph.D. Qualifying Exam: Part II November 9, 2002
Statistics Ph.D. Qualifying Exam: Part II November 9, 2002 Student Name: 1. Answer 8 out of 12 problems. Mark the problems you selected in the following table. 1 2 3 4 5 6 7 8 9 10 11 12 2. Write your
More informationFinancial Econometrics and Volatility Models Copulas
Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns
More information2 Functions of random variables
2 Functions of random variables A basic statistical model for sample data is a collection of random variables X 1,..., X n. The data are summarised in terms of certain sample statistics, calculated as
More informationThe Multivariate Gaussian Distribution
The Multivariate Gaussian Distribution Chuong B. Do October, 8 A vector-valued random variable X = T X X n is said to have a multivariate normal or Gaussian) distribution with mean µ R n and covariance
More informationPrincipal Component Analysis for a Spiked Covariance Model with Largest Eigenvalues of the Same Asymptotic Order of Magnitude
Principal Component Analysis for a Spiked Covariance Model with Largest Eigenvalues of the Same Asymptotic Order of Magnitude Addy M. Boĺıvar Cimé Centro de Investigación en Matemáticas A.C. May 1, 2010
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationRandom Eigenvalue Problems Revisited
Random Eigenvalue Problems Revisited S Adhikari Department of Aerospace Engineering, University of Bristol, Bristol, U.K. Email: S.Adhikari@bristol.ac.uk URL: http://www.aer.bris.ac.uk/contact/academic/adhikari/home.html
More informationSTA 2101/442 Assignment 3 1
STA 2101/442 Assignment 3 1 These questions are practice for the midterm and final exam, and are not to be handed in. 1. Suppose X 1,..., X n are a random sample from a distribution with mean µ and variance
More informationMachine learning - HT Maximum Likelihood
Machine learning - HT 2016 3. Maximum Likelihood Varun Kanade University of Oxford January 27, 2016 Outline Probabilistic Framework Formulate linear regression in the language of probability Introduce
More informationMath 181B Homework 1 Solution
Math 181B Homework 1 Solution 1. Write down the likelihood: L(λ = n λ X i e λ X i! (a One-sided test: H 0 : λ = 1 vs H 1 : λ = 0.1 The likelihood ratio: where LR = L(1 L(0.1 = 1 X i e n 1 = λ n X i e nλ
More informationRandom Matrices and Multivariate Statistical Analysis
Random Matrices and Multivariate Statistical Analysis Iain Johnstone, Statistics, Stanford imj@stanford.edu SEA 06@MIT p.1 Agenda Classical multivariate techniques Principal Component Analysis Canonical
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More informationProbability Distributions
Probability Distributions Seungjin Choi Department of Computer Science Pohang University of Science and Technology, Korea seungjin@postech.ac.kr 1 / 25 Outline Summarize the main properties of some of
More informationLecture 6: Special probability distributions. Summarizing probability distributions. Let X be a random variable with probability distribution
Econ 514: Probability and Statistics Lecture 6: Special probability distributions Summarizing probability distributions Let X be a random variable with probability distribution P X. We consider two types
More information