MATH5745 Multivariate Methods Lecture 07
|
|
- Chrystal Louisa Bailey
- 5 years ago
- Views:
Transcription
1 MATH5745 Multivariate Methods Lecture 07 Tests of hypothesis on covariance matrix March 16, 2018 MATH5745 Multivariate Methods Lecture 07 March 16, / 39
2 Test on covariance matrices: Introduction example I Frets (1921) 1 gave the head length and breadth (in mm.) for first and second sons in a number of families. First son Second son Head length Head breadth Head length Head breadth y 1 y 2 y 3 y Frets, G.P. (1921) Heredity of head form in man. Genetica, 3, MATH5745 Multivariate Methods Lecture 07 March 16, / 39
3 Test on covariance matrices: Introduction example II The sample covariance matrix for the Frets (1921) data is S = To test the hypothesis that all variables are independent we would test a hypothesis that the covariance matrix Σ satisfies: Σ = σ σ σ σ 44. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
4 Test on covariance matrices: Introduction example III The sample covariance matrix for the Frets (1921) data is S = To test the hypothesis that the two sons are independent we would test a hypothesis that the covariance matrix Σ satisfies: Σ = σ 11 σ σ 21 σ σ 33 σ σ 43 σ 44. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
5 Test on covariance matrices: Introduction example IV The sample covariance matrix for the Frets (1921) data is S = To test the hypothesis that the two sons have common variances and covariances we might test a hypothesis that the covariance matrix Σ satisfies: Σ = σ 11 σ 12 σ 13 σ 14 σ 21 σ 22 σ 23 σ 24 σ 31 σ 32 σ 11 σ 12 σ 41 σ 42 σ 21 σ 22. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
6 Test on covariance matrices: Introduction example V The sample covariance matrix for the Frets (1921) data is S = We might want to test a specific proposal about the covariance matrix: Σ = MATH5745 Multivariate Methods Lecture 07 March 16, / 39
7 Test on covariance matrices Consider now some tests on covariance matrices. Like tests on mean vectors, can perform tests on covariance matrices. These are the null hypotheses we are interested in: H 0 : Σ = Σ 0 (where Σ 0 is a given known matrix) H 0 : Σ = σ 2 I (special case) H 0 : Σ = σ 2 {(1 ρ)i + ρj} (special case) H 0 : Σ = diag(σ 11, σ 22,..., σ pp ) (special case) H 0 : Σ 1 = Σ 2 (do two populations have same covariance structure?) MATH5745 Multivariate Methods Lecture 07 March 16, / 39
8 Likelihood ratio tests: Introduction I Have observations y i and wish to test hypotheses about parameters θ. The likelihood function is L(θ; Y). We test a null hypothesis H 0 against an alternative hypothesis H 1. The likelihood ratio test statistic is Clearly 0 < λ < 1. λ = Small values of λ imply H 0 false. Large values λ 1 suggest H 0 true. sup L(θ; Y) H 0 sup L(θ; Y). H 0 H 1 Can be shown that 2 log λ χ 2 r where r = dim H 0 H 1 dim H 0. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
9 Likelihood ratio tests: Introduction II The likelihood function is L(θ; Y). The log-likelihood function is l(θ; Y) = log L(θ; Y). sup L(θ; Y) H Recall: Likelihood ratio statistic is λ = 0 sup L(θ; Y). H 0 H 1 Can write 2 log λ = 2l 0 + 2l 1. Here l 0 is the maximised log-likelihood under H 0,. l 1 is the maximised log-likelihood under H 0 H 1. Reject H 0 for large values of U = 2 log λ. Recall: 2 log λ χ 2 r where r = dim H 0 H 1 dim H 0. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
10 Likelihood ratio tests: Multivariate normal distribution I Suppose we have y 1, y 2,..., y n ind N p (µ, Σ). Likelihood is L(µ, Σ) = 2πΣ n 2 exp { 1 2 The log-likelihood is } n (y i µ) Σ 1 (y i µ). l(µ, Σ) = 1 2 n log 2πΣ 1 2 n (y i µ) Σ 1 (y i µ). MATH5745 Multivariate Methods Lecture 07 March 16, / 39
11 Likelihood ratio tests: Multivariate normal distribution II Recall last slide: For y 1, y 2,..., y n ind N p (µ, Σ) the log-likelihood is l(µ, Σ) = 1 2 n log 2πΣ 1 2 n (y i µ) Σ 1 (y i µ). Recall lecture 05: we can write the log-likelihood as l(µ, Σ) = 1 2 n log 2πΣ 1 2 n trace(σ 1 V) 1 2 n(y µ) Σ 1 (y µ) where V = 1 n 1 n (y i y)(y i y). Can show that maximum likelihood estimators are: ˆµ = y and ˆΣ = V. The maximised log-likelihood is then l(ˆµ, ˆΣ) = 1 2 n log 2πV 1 2 np. [QUESTION] Why is trace( ˆΣ 1 V) = p? MATH5745 Multivariate Methods Lecture 07 March 16, / 39
12 Testing H 0 : Σ = Σ 0 Suppose we observe data of n observation vectors y 1, y 2,..., y n. The y i assumed to form a random sample from N p (µ, Σ). We want to test H 0 : Σ = Σ 0. Notice that µ is not specified in H 0. Suppose we observe the sample covariance matrix S (and equivalently we observe V). Idea: How far is V (assumed sampled from Σ) from Σ 0? MATH5745 Multivariate Methods Lecture 07 March 16, / 39
13 Testing H 0 : Σ = Σ 0 : Likelihood ratio test We want to test H 0 : Σ = Σ 0. The maximised log-likelihood under H 0 is, with ˆµ = y, l 0 = 1 2 n log 2πΣ n trace(σ 1 0 V). The maximised log-likelihood under H 1 is, with ˆµ = y and ˆΣ = V, The likelihood ratio test statistic is U = 2 log λ = 2l 0 + 2l 1 l 1 = 1 2 n log 2πV 1 2 np. = n log 2πΣ 0 + n trace(σ 1 0 V) n log 2πV np = n(trace(σ 1 0 V) log Σ 1 0 V p). MATH5745 Multivariate Methods Lecture 07 March 16, / 39
14 Testing H 0 : Σ = Σ 0 The test statistic is given by U = n(trace(σ 1 0 V) log Σ 1 0 V p). Notice that alternative representations are possible as trace(σ 1 0 V) = trace(vσ 1 0 ) and log Σ 1 0 V = log Σ 0 + log V. If V = Σ 0, then U = 0. dim H 0 = p, dim H 0 H 1 = p p(p + 1) so r = 1 2p(p + 1). Under H 0, the test statistic U satisfies U χ p(p+1) Reject H 0 if U > χ p(p+1)(5%). The χ 2 approximation is OK for large n. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
15 Testing H 0 : Σ = Σ 0 : Re-expression of U Suppose λ 1, λ 2,..., λ p are the eigenvalues of Σ 1 0 V. Then trace ( p Σ 1 0 V) = λ i. Also Σ 1 ( p p ) 0 V = λ i so log Σ 1 0 V = log λ i = p log λ i. The test statistic is U = n(trace(σ 1 0 V) log Σ 1 0 V p). This can be written as [ ] [ p p p ] U = n + λ i log λ i p = n (λ i log λ i ) p. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
16 Testing H 0 : Σ = Σ 0 : Re-expression of U Alternatively: Suppose λ 1, λ 2,..., λ p are the eigenvalues of Σ 1 0 ( V. Write: a = 1 p p ) 1/p λ i and g = λ i. p Then trace ( Σ 1 0 V) = p p λ i = ap and Σ 1 0 V = λ i = g p. The test statistic is U = n(trace(σ 1 0 V) log Σ 1 0 V p). This gives U = np(a log g 1). MATH5745 Multivariate Methods Lecture 07 March 16, / 39
17 Testing H 0 : Σ = Σ 0 : Rencher results. Rencher gives the formula U R = (n 1) ( trace(σ 1 0 using the unbiased covariance matrix S. This is NOT the same as U. S) log Σ 1S p) For moderate n, Rencher gives the following approximation [ UR = 1 1 ( 2p )] U R 6n 7 p + 1 as giving a better fit to χ p(p+1) under H 0. I would use U based upon the maximum likelihood estimate V. 0 MATH5745 Multivariate Methods Lecture 07 March 16, / 39
18 Testing H 0 : Σ = Σ 0. Special cases Test whether the variables are independent with common variance σ 2. σ σ H 0 : Σ = = σ2 I σ 2 Test whether the variables have common correlation (dependencies) and common variance. σ 2 σ 2 ρ... σ 2 ρ σ 2 ρ σ 2... σ 2 ρ H 0 : Σ = = σ2 {(1 ρ)i + ρj}. σ 2 ρ σ 2 ρ... σ 2 MATH5745 Multivariate Methods Lecture 07 March 16, / 39
19 Testing H 0 : Σ = Σ 0. Example. Consider the probe word example in Lecture 04. Timm (1975) reported response time (in ms) to probe words in five positions in a sentence from 11 individuals y 1 y 2 y 3 y 4 y The five variables look commensurate, so we are interested in testing H 0 : Σ = σ 2 I. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
20 Testing H 0 : Σ = Σ 0. Example. The sample covariance matrix is given by S = The estimate ˆΣ = V is given by V = n 1 n S = MATH5745 Multivariate Methods Lecture 07 March 16, / 39
21 Testing H 0 : Σ = Σ 0. Example 1. Suppose we are interested in the null hypothesis H 0 : Σ = MATH5745 Multivariate Methods Lecture 07 March 16, / 39
22 Testing H 0 : Σ = Σ 0. Example 1. We have n = 11, p = 5, trace(σ 1 0 V) = , log Σ 0 = , log V = U = 11( ) = Degrees of freedom 1 2p(p + 1) = 15. χ 2 15 (5%) = so reject H 0 at 5% level χ 2 15 pdf Reject H 0 at 5% level Pr(χ 2 15 > ) = 0.05 U= U MATH5745 Multivariate Methods Lecture 07 March 16, / 39
23 Testing H 0 : Σ = Σ 0. Example 1 (Rencher statistic). We have n = 11, p = 5, trace(σ 1 0 S) = , log Σ 0 = , log S = U R = 10( ) = Degrees of freedom 1 2p(p + 1) = 15. χ 2 15 (5%) = so reject H 0 at 5% level. Because of moderate ( size of n, consider )] the approximation: UR [1 = 1 6(10) 7 2(5) U R = Compare this to χ 2 15 (5%) = Again reject H 0 at 5% level. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
24 Testing H 0 : Σ = Σ 0. Example 2. Suppose we are interested in the null hypothesis H 0 : Σ = MATH5745 Multivariate Methods Lecture 07 March 16, / 39
25 Testing H 0 : Σ = Σ 0. Example 2. We have n = 11, p = 5, log V = (the same). trace(σ 1 0 V) = , log Σ 0 = U = 11( ) = Degrees of freedom 1 2p(p + 1) = 15. χ 2 15 (5%) = so do not reject H 0 at 5% level χ 2 15 pdf U= Do not reject H 0 at 5% level Pr(χ 2 15 > ) = U MATH5745 Multivariate Methods Lecture 07 March 16, / 39
26 Testing H 0 : Σ = Σ 0. Example 2 (Rencher statistic). We have n = 11, p = 5, log S = (the same). trace(σ 1 0 S) = , log Σ 0 = , log S = U R = 10( ) = Degrees of freedom 1 2p(p + 1) = 15. χ 2 15 (5%) = so do not reject H 0 at 5% level. Because of moderate ( size, we consider )] the approximation UR [1 = 1 6(10) 1 2(5) U R = Compare this to χ 2 15 (5%) = Again do not reject H 0 at 5% level. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
27 Testing H 0 : Σ = Σ 0. Example 3. Suppose we are interested in the null hypothesis H 0 : Σ = MATH5745 Multivariate Methods Lecture 07 March 16, / 39
28 Testing H 0 : Σ = Σ 0. Example 3. We have n = 11, p = 5, log V = (the same). trace(σ 1 0 V) = , log Σ 0 = U = 11( ) = Degrees of freedom 1 2p(p + 1) = 15. χ 2 15 (5%) = so reject H 0 at 5% level χ 2 15 pdf Reject H 0 at 5% level Pr(χ 2 15 > ) = 0.05 U= U MATH5745 Multivariate Methods Lecture 07 March 16, / 39
29 Testing H 0 : Σ = Σ 0. Example 3 (Rencher statistic). We have n = 11, p = 5, log S = (the same). trace(σ 1 0 S) = , log Σ 0 = , log S = U R = 10( ) = Degrees of freedom 1 2p(p + 1) = 15. χ 2 15 (5%) = so reject H 0 at 5% level. Because of moderate ( size, we consider )] the approximation UR [1 = 1 6(10) 1 2(5) U R = Compare this to χ 2 15 (5%) = Again (just) reject H 0 at 5% level. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
30 Testing H 0 : Σ 1 = Σ 2 = = Σ k. Imagine at time 1 (say) we have n 1 observations on p variables. Suppose at time 2 (say) we obtain a different sample with n 2 observations from the same p variables. Notice: we have different units of observations, but the same variables measured. We are interested in whether the covariances come from the same population. We want to test (here) H 0 : Σ 1 = Σ 2. More generally we have k samples and wish to test the null hypothesis H 0 : Σ 1 = Σ 2 = = Σ k. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
31 Testing H 0 : Σ 1 = Σ 2 = = Σ k. Suppose we observe n i observation vectors y ij taken from N p (µ i, Σ i ), for j = 1, 2,..., n i and i = 1, 2,..., k. We want to test H 0 : Σ 1 = Σ 2 = = Σ k. Notice that µ i is not specified in H 0. Sample i with n i observations has sample mean y i and sample covariance matrix S i (and equivalently V i ) where Let n = k n i, V = 1 n V i = (n i 1) n i S i. k n i V i, and S = 1 n k S is the pooled unbiased estimator of the common Σ. k (n i 1)S i. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
32 Testing H 0 : Σ 1 = Σ 2 = = Σ k : Likelihood ratio test We want to test H 0 : Σ 1 = Σ 2 = = Σ k. The maximised log-likelihood under H 0 is, with ˆµ i = y i and ˆΣ i = V, l 0 = 1 2 n log 2πV 1 2 np. The maximised log-likelihood under H 1 is, with ˆµ i = y i and ˆΣ i = V i, l 1 = 1 2 k n i log 2πV i 1 2 np. The likelihood ratio test statistic is U = 2 log λ = 2l 0 + 2l 1 = n log 2πV = k k n i log 2πV i n i log V 1 i V. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
33 Testing H 0 : Σ 1 = Σ 2 = = Σ k : Likelihood ratio test The likelihood ratio test statistic is given by U = k n i log V 1 i V. Notice that this can be written as k { V n U = n log V n i log V i or U = log V 1 n 1 V2 n 2 Vk n k dim H 0 = pk p(p + 1), dim H 0 H 1 = pk + 1 2p(p + 1)k so degrees of freedom of likelihood ratio test is r = 1 2 (k 1)p(p + 1). Under H 0, the test statistic U satisfies U χ (k 1)p(p+1) Reject H 0 if U > χ 2 1 (k 1)p(p+1)(5%). 2 The χ 2 approximation is OK for large n. }. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
34 Testing H 0 : Σ 1 = Σ 2 = = Σ k. Box M statistic. The likelihood ratio test statistic is U = k n i log V 1 i V. If n i is small, this gives too much weight to the contribution of V. Box (1949) proposed an alternative test statistic M using the unbiased estimators S i and S. Here S i is the i-th sample covariance matrix, and S is the pooled sample covariance matrix S = 1 n k k (n i 1)S i where n = k n i. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
35 Testing H 0 : Σ 1 = Σ 2 = = Σ k. Box M statistic. The Box M statistic is M = c k (n i 1) log S 1 i c is a constant which gives M χ (k 1)p(p+1) if H 0 is true. The Box M statistic can be alternatively written as { } k M = c (n k) log S (n i 1) log S i. S. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
36 Testing H 0 : Σ 1 = Σ 2 = = Σ k. Box M statistic. Box s constant c is given by { ( 2p 2 k )} + 3p 1 1 c = 1 6(k 1)(p + 1) n i 1 1. n k If H 0 is true, then M χ (k 1)p(p+1). Reject H 0 if M > χ (k 1)p(p+1)(5%). Notice that if n 1 = n 2 = = n k = n/k, then c = 1 { (k + 1)(2p 2 } + 3p 1). 6(n k)(p + 1) Box s test is OK for n i at least 20 and k and p not exceeding 5. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
37 Testing H 0 : Σ 1 = Σ 2. Example. Consider the data on four psychological test scores among 32 males and 32 females from lecture 06. We consider them as two samples, (1) males and (2) females. We test H 0 : Σ 1 = Σ 2. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
38 Testing H 0 : Σ 1 = Σ 2. Example. Recall 16 y 1 = 16 27, S 1 = y 2 = 14 17, S 2 = 22 S = (32 1)S 1 + (32 1)S = MATH5745 Multivariate Methods Lecture 07 March 16, / 39
39 Testing H 0 : Σ 1 = Σ 2. Example. Here n 1 = 32, n 2 = 32, n = 64, k = 2, p = 4. Also log S 1 = , log S 2 = , log S = { (k + 1)(2p 2 } { } + 3p 1) (5)(43) c = 1 = 1 = (n k)(p + 1) 6(62)(5) Box M statistic is M = c {62 log S 31 log S 1 31 log S 2 } = { } = { } = M has r = 1 2 (k 1)p(p + 1) = 10 degrees of freedom. χ 2 10 (5%) = Do not reject H 0 at 5% level. MATH5745 Multivariate Methods Lecture 07 March 16, / 39
Hypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA
Hypothesis Testing Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA An Example Mardia et al. (979, p. ) reprint data from Frets (9) giving the length and breadth (in
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More informationNotes on the Multivariate Normal and Related Topics
Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions
More informationLecture 3. Inference about multivariate normal distribution
Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates
More informationI i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.
Serik Sagitov, Chalmers and GU, February, 08 Solutions chapter Matlab commands: x = data matrix boxplot(x) anova(x) anova(x) Problem.3 Consider one-way ANOVA test statistic For I = and = n, put F = MS
More informationCBA4 is live in practice mode this week exam mode from Saturday!
Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as
More informationExam 2. Jeremy Morris. March 23, 2006
Exam Jeremy Morris March 3, 006 4. Consider a bivariate normal population with µ 0, µ, σ, σ and ρ.5. a Write out the bivariate normal density. The multivariate normal density is defined by the following
More informationTopic 19 Extensions on the Likelihood Ratio
Topic 19 Extensions on the Likelihood Ratio Two-Sided Tests 1 / 12 Outline Overview Normal Observations Power Analysis 2 / 12 Overview The likelihood ratio test is a popular choice for composite hypothesis
More information2.6.3 Generalized likelihood ratio tests
26 HYPOTHESIS TESTING 113 263 Generalized likelihood ratio tests When a UMP test does not exist, we usually use a generalized likelihood ratio test to verify H 0 : θ Θ against H 1 : θ Θ\Θ It can be used
More informationLikelihood Ratio tests
Likelihood Ratio tests For general composite hypotheses optimality theory is not usually successful in producing an optimal test. instead we look for heuristics to guide our choices. The simplest approach
More informationMath 152. Rumbos Fall Solutions to Assignment #12
Math 52. umbos Fall 2009 Solutions to Assignment #2. Suppose that you observe n iid Bernoulli(p) random variables, denoted by X, X 2,..., X n. Find the LT rejection region for the test of H o : p p o versus
More informationSTT 843 Key to Homework 1 Spring 2018
STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ
More informationEconomics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,
Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationSTAT 730 Chapter 5: Hypothesis Testing
STAT 730 Chapter 5: Hypothesis Testing Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Analysis 1 / 28 Likelihood ratio test def n: Data X depend on θ. The
More informationA Likelihood Ratio Test
A Likelihood Ratio Test David Allen University of Kentucky February 23, 2012 1 Introduction Earlier presentations gave a procedure for finding an estimate and its standard error of a single linear combination
More informationTopic 15: Simple Hypotheses
Topic 15: November 10, 2009 In the simplest set-up for a statistical hypothesis, we consider two values θ 0, θ 1 in the parameter space. We write the test as H 0 : θ = θ 0 versus H 1 : θ = θ 1. H 0 is
More informationIntroduction to Normal Distribution
Introduction to Normal Distribution Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 17-Jan-2017 Nathaniel E. Helwig (U of Minnesota) Introduction
More informationLecture 15. Hypothesis testing in the linear model
14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma
More informationRobust Wilks' Statistic based on RMCD for One-Way Multivariate Analysis of Variance (MANOVA)
ISSN 2224-584 (Paper) ISSN 2225-522 (Online) Vol.7, No.2, 27 Robust Wils' Statistic based on RMCD for One-Way Multivariate Analysis of Variance (MANOVA) Abdullah A. Ameen and Osama H. Abbas Department
More informationExample 1 describes the results from analyzing these data for three groups and two variables contained in test file manova1.tf3.
Simfit Tutorials and worked examples for simulation, curve fitting, statistical analysis, and plotting. http://www.simfit.org.uk MANOVA examples From the main SimFIT menu choose [Statistcs], [Multivariate],
More informationChapter 7, continued: MANOVA
Chapter 7, continued: MANOVA The Multivariate Analysis of Variance (MANOVA) technique extends Hotelling T 2 test that compares two mean vectors to the setting in which there are m 2 groups. We wish to
More informationwhere x and ȳ are the sample means of x 1,, x n
y y Animal Studies of Side Effects Simple Linear Regression Basic Ideas In simple linear regression there is an approximately linear relation between two variables say y = pressure in the pancreas x =
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationBIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation
BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)
More informationModel comparison and selection
BS2 Statistical Inference, Lectures 9 and 10, Hilary Term 2008 March 2, 2008 Hypothesis testing Consider two alternative models M 1 = {f (x; θ), θ Θ 1 } and M 2 = {f (x; θ), θ Θ 2 } for a sample (X = x)
More informationMultilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2
Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do
More informationUniversity of Cambridge Engineering Part IIB Module 4F10: Statistical Pattern Processing Handout 2: Multivariate Gaussians
Engineering Part IIB: Module F Statistical Pattern Processing University of Cambridge Engineering Part IIB Module F: Statistical Pattern Processing Handout : Multivariate Gaussians. Generative Model Decision
More informationStatistics and econometrics
1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample
More informationRegression #5: Confidence Intervals and Hypothesis Testing (Part 1)
Regression #5: Confidence Intervals and Hypothesis Testing (Part 1) Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #5 1 / 24 Introduction What is a confidence interval? To fix ideas, suppose
More informationF & B Approaches to a simple model
A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 215 http://www.astro.cornell.edu/~cordes/a6523 Lecture 11 Applications: Model comparison Challenges in large-scale surveys
More informationCOMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS
Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationGraphical Models with Symmetry
Wald Lecture, World Meeting on Probability and Statistics Istanbul 2012 Sparse graphical models with few parameters can describe complex phenomena. Introduce symmetry to obtain further parsimony so models
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationThis paper is not to be removed from the Examination Halls
~~ST104B ZA d0 This paper is not to be removed from the Examination Halls UNIVERSITY OF LONDON ST104B ZB BSc degrees and Diplomas for Graduates in Economics, Management, Finance and the Social Sciences,
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Mixed models Yan Lu March, 2018, week 8 1 / 32 Restricted Maximum Likelihood (REML) REML: uses a likelihood function calculated from the transformed set
More informationWe know from STAT.1030 that the relevant test statistic for equality of proportions is:
2. Chi 2 -tests for equality of proportions Introduction: Two Samples Consider comparing the sample proportions p 1 and p 2 in independent random samples of size n 1 and n 2 out of two populations which
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will
More informationHypothesis Testing. A rule for making the required choice can be described in two ways: called the rejection or critical region of the test.
Hypothesis Testing Hypothesis testing is a statistical problem where you must choose, on the basis of data X, between two alternatives. We formalize this as the problem of choosing between two hypotheses:
More informationHypothesis Testing: The Generalized Likelihood Ratio Test
Hypothesis Testing: The Generalized Likelihood Ratio Test Consider testing the hypotheses H 0 : θ Θ 0 H 1 : θ Θ \ Θ 0 Definition: The Generalized Likelihood Ratio (GLR Let L(θ be a likelihood for a random
More informationMultiple Linear Regression
Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there
More informationSTAT 730 Chapter 4: Estimation
STAT 730 Chapter 4: Estimation Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Analysis 1 / 23 The likelihood We have iid data, at least initially. Each datum
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More informationElliptically Contoured Distributions
Elliptically Contoured Distributions Recall: if X N p µ, Σ), then { 1 f X x) = exp 1 } det πσ x µ) Σ 1 x µ) So f X x) depends on x only through x µ) Σ 1 x µ), and is therefore constant on the ellipsoidal
More informationChapter 7. Hypothesis Testing
Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Two sample T 2 test 1 Two sample T 2 test 2 Analogous to the univariate context, we
More informationMISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30
MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD Copyright c 2012 (Iowa State University) Statistics 511 1 / 30 INFORMATION CRITERIA Akaike s Information criterion is given by AIC = 2l(ˆθ) + 2k, where l(ˆθ)
More informationDirection: This test is worth 250 points and each problem worth points. DO ANY SIX
Term Test 3 December 5, 2003 Name Math 52 Student Number Direction: This test is worth 250 points and each problem worth 4 points DO ANY SIX PROBLEMS You are required to complete this test within 50 minutes
More informationInferences about a Mean Vector
Inferences about a Mean Vector Edps/Soc 584, Psych 594 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University
More informationHypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3
Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest
More information3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is
Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are
More informationSTA 437: Applied Multivariate Statistics
Al Nosedal. University of Toronto. Winter 2015 1 Chapter 5. Tests on One or Two Mean Vectors If you can t explain it simply, you don t understand it well enough Albert Einstein. Definition Chapter 5. Tests
More informationSummer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University.
Summer School in Statistics for Astronomers V June 1 - June 6, 2009 Regression Mosuk Chow Statistics Department Penn State University. Adapted from notes prepared by RL Karandikar Mean and variance Recall
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationDefinition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.
Hypothesis Testing Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution. Suppose the family of population distributions is indexed
More informationUniversity of Cambridge Engineering Part IIB Module 4F10: Statistical Pattern Processing Handout 2: Multivariate Gaussians
University of Cambridge Engineering Part IIB Module 4F: Statistical Pattern Processing Handout 2: Multivariate Gaussians.2.5..5 8 6 4 2 2 4 6 8 Mark Gales mjfg@eng.cam.ac.uk Michaelmas 2 2 Engineering
More informationSTAT 513 fa 2018 hw 5
STAT 513 fa 2018 hw 5 assigned: Thursday, Oct 4th, 2018 due: Thursday, Oct 11th, 2018 1. Let X 11,..., X 1n1 and X 21,..., X 2n2 be independent random samples from the Normal(µ 1, σ 2 ) and Normal(µ 1,
More informationParameter Estimation and Fitting to Data
Parameter Estimation and Fitting to Data Parameter estimation Maximum likelihood Least squares Goodness-of-fit Examples Elton S. Smith, Jefferson Lab 1 Parameter estimation Properties of estimators 3 An
More informationSTAT 501 EXAM I NAME Spring 1999
STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationChapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests
Chapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests Throughout this chapter we consider a sample X taken from a population indexed by θ Θ R k. Instead of estimating the unknown parameter, we
More informationCanonical Correlation Analysis of Longitudinal Data
Biometrics Section JSM 2008 Canonical Correlation Analysis of Longitudinal Data Jayesh Srivastava Dayanand N Naik Abstract Studying the relationship between two sets of variables is an important multivariate
More information8. Hypothesis Testing
FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements
More informationExercises and Answers to Chapter 1
Exercises and Answers to Chapter The continuous type of random variable X has the following density function: a x, if < x < a, f (x), otherwise. Answer the following questions. () Find a. () Obtain mean
More informationChapter 12 - Lecture 2 Inferences about regression coefficient
Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous
More informationHypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes
Neyman-Pearson paradigm. Suppose that a researcher is interested in whether the new drug works. The process of determining whether the outcome of the experiment points to yes or no is called hypothesis
More informationMath 181B Homework 1 Solution
Math 181B Homework 1 Solution 1. Write down the likelihood: L(λ = n λ X i e λ X i! (a One-sided test: H 0 : λ = 1 vs H 1 : λ = 0.1 The likelihood ratio: where LR = L(1 L(0.1 = 1 X i e n 1 = λ n X i e nλ
More informationMean Vector Inferences
Mean Vector Inferences Lecture 5 September 21, 2005 Multivariate Analysis Lecture #5-9/21/2005 Slide 1 of 34 Today s Lecture Inferences about a Mean Vector (Chapter 5). Univariate versions of mean vector
More informationUniversity of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout 2:. The Multivariate Gaussian & Decision Boundaries
University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout :. The Multivariate Gaussian & Decision Boundaries..15.1.5 1 8 6 6 8 1 Mark Gales mjfg@eng.cam.ac.uk Lent
More informationHypothesis testing: theory and methods
Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable
More informationRejection regions for the bivariate case
Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test
More informationChapter 11. Hypothesis Testing (II)
Chapter 11. Hypothesis Testing (II) 11.1 Likelihood Ratio Tests one of the most popular ways of constructing tests when both null and alternative hypotheses are composite (i.e. not a single point). Let
More informationCointegration Lecture I: Introduction
1 Cointegration Lecture I: Introduction Julia Giese Nuffield College julia.giese@economics.ox.ac.uk Hilary Term 2008 2 Outline Introduction Estimation of unrestricted VAR Non-stationarity Deterministic
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationNominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers
Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain
More informationLecture 11. Multivariate Normal theory
10. Lecture 11. Multivariate Normal theory Lecture 11. Multivariate Normal theory 1 (1 1) 11. Multivariate Normal theory 11.1. Properties of means and covariances of vectors Properties of means and covariances
More information2014/2015 Smester II ST5224 Final Exam Solution
014/015 Smester II ST54 Final Exam Solution 1 Suppose that (X 1,, X n ) is a random sample from a distribution with probability density function f(x; θ) = e (x θ) I [θ, ) (x) (i) Show that the family of
More informationORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing
ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing Robert Vanderbei Fall 2014 Slides last edited on November 24, 2014 http://www.princeton.edu/ rvdb Coin Tossing Example Consider two coins.
More information4.5.1 The use of 2 log Λ when θ is scalar
4.5. ASYMPTOTIC FORM OF THE G.L.R.T. 97 4.5.1 The use of 2 log Λ when θ is scalar Suppose we wish to test the hypothesis NH : θ = θ where θ is a given value against the alternative AH : θ θ on the basis
More informationTHE UNIVERSITY OF CHICAGO Graduate School of Business Business 41912, Spring Quarter 2008, Mr. Ruey S. Tsay. Solutions to Final Exam
THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41912, Spring Quarter 2008, Mr. Ruey S. Tsay Solutions to Final Exam 1. (13 pts) Consider the monthly log returns, in percentages, of five
More informationMore on nuisance parameters
BS2 Statistical Inference, Lecture 3, Hilary Term 2009 January 30, 2009 Suppose that there is a minimal sufficient statistic T = t(x ) partitioned as T = (S, C) = (s(x ), c(x )) where: C1: the distribution
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationIntroduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution
Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis
More informationComputational functional genomics
Computational functional genomics (Spring 2005: Lecture 8) David K. Gifford (Adapted from a lecture by Tommi S. Jaakkola) MIT CSAIL Basic clustering methods hierarchical k means mixture models Multi variate
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth
More informationSpace Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses
Space Telescope Science Institute statistics mini-course October 2011 Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses James L Rosenberger Acknowledgements: Donald Richards, William
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More information1 Mixed effect models and longitudinal data analysis
1 Mixed effect models and longitudinal data analysis Mixed effects models provide a flexible approach to any situation where data have a grouping structure which introduces some kind of correlation between
More informationTUTORIAL 8 SOLUTIONS #
TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level
More informationPrimer on statistics:
Primer on statistics: MLE, Confidence Intervals, and Hypothesis Testing ryan.reece@gmail.com http://rreece.github.io/ Insight Data Science - AI Fellows Workshop Feb 16, 018 Outline 1. Maximum likelihood
More informationSTAT 514 Solutions to Assignment #6
STAT 514 Solutions to Assignment #6 Question 1: Suppose that X 1,..., X n are a simple random sample from a Weibull distribution with density function f θ x) = θcx c 1 exp{ θx c }I{x > 0} for some fixed
More informationGeneralized Linear Models Introduction
Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,
More informationMultivariate Regression (Chapter 10)
Multivariate Regression (Chapter 10) This week we ll cover multivariate regression and maybe a bit of canonical correlation. Today we ll mostly review univariate multivariate regression. With multivariate
More informationChapter 17: Undirected Graphical Models
Chapter 17: Undirected Graphical Models The Elements of Statistical Learning Biaobin Jiang Department of Biological Sciences Purdue University bjiang@purdue.edu October 30, 2014 Biaobin Jiang (Purdue)
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationTAMS39 Lecture 2 Multivariate normal distribution
TAMS39 Lecture 2 Multivariate normal distribution Martin Singull Department of Mathematics Mathematical Statistics Linköping University, Sweden Content Lecture Random vectors Multivariate normal distribution
More information