Mean Vector Inferences
|
|
- Patience Nash
- 6 years ago
- Views:
Transcription
1 Mean Vector Inferences Lecture 5 September 21, 2005 Multivariate Analysis Lecture #5-9/21/2005 Slide 1 of 34
2 Today s Lecture Inferences about a Mean Vector (Chapter 5). Univariate versions of mean vector inferences. Hypothesis tests. Confidence intervals. Inferences about multivariate means (mean vector). Today s Lecture Multivariate hypothesis tests. Confidence ellipsoids. Note: Homework # 3 due in my mailbox on Thursday (by 5pm). Lecture #5-9/21/2005 Slide 2 of 34
3 /Recurring Themes Inference - reaching valid conclusions concerning a population on the basis of information obtained from a sample. Univariate statistics perform inferences for a single variable at a time. Multivariate statistics perform inferences for a set of variables simultaneously. A recurring theme of multivariate statistics is that correlated variables (for instance, p of them) should be analyzed jointly. Data Collection You will discover why as today s lecture progresses. Lecture #5-9/21/2005 Slide 3 of 34
4 Data Collection Please pull out a piece of paper and write down: Your height in inches. An estimate of the number of inches from your elbow to your wrist. Data Collection Please omit any identifying remarks. I am compelled to mention that you can choose to not participate and yet still get credit for this course. I am also compelled to mention that this example may end up being very bad... Lecture #5-9/21/2005 Slide 4 of 34
5 A Wager For whatever reason, you believe the average height of graduate students at KU is 68 inches. Your friend, the contrarian, thinks otherwise and makes you a bet that the average height of graduate students is not 68 inches. Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals You collect data on a single variable: a person s height. How can you tell who wins the bet? Lecture #5-9/21/2005 Slide 5 of 34
6 Univariate Hypothesis Testing Imagine that you recall hypothesis testing and formulate your wager so as to gain statistical evidence either in support for or against your conjecture that graduate students at KU are, on average, 68 inches tall. The statistical null hypothesis would look a little something like: Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals H 0 : µ = 68 inches And, because your friend said that, on average, KU grad students were not 68 inches (as opposed to saying they were taller than 68 inches or shorter than 68 inches), the statistical null hypothesis test would look like: H 1 : µ 68 inches Lecture #5-9/21/2005 Slide 6 of 34
7 Univariate Hypothesis Testing More formally, hypothesis tests consist of a null hypothesis which is compared with an alternative hypothesis: H 0 : µ = µ 0 Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals H 1 : µ µ 0 The alternative hypothesis here is for a two-sided test (stemming from the ). One-sided hypothesis tests are equally valid given sound substantive reason for using such tests. Lecture #5-9/21/2005 Slide 7 of 34
8 Hypothesis Evaluation To evaluate the hypothesis, a test statistic must be created, and then compared with the distribution assumed under the null hypothesis. If the sample X 1, X 2,, X n come from a normal population, then the appropriate test statistic is: Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals where: t = ( X µ 0 ) s/ n X = 1 n s 2 = 1 n 1 n j=1 X j n ( Xj X ) 2 j=1 Lecture #5-9/21/2005 Slide 8 of 34
9 Hypothesis Evaluation The t statistic has a t-distribution with n 1 degrees of freedom (df). H 0 is rejected if t is greater than a specified percentage point of a t-distribution with n 1 df. Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals For instance, lets look at the values for the means of our data. SAS Example #1... Lecture #5-9/21/2005 Slide 9 of 34
10 Hypothesis Evaluation Note that rejecting H 0 when t is greater than t n 1 (α/2) is equivalent to rejecting H 0 when t 2 is greater than t 2 n 1(α/2): t 2 = ( X µ 0 ) 2 s 2 /n = n( X µ 0 )(s 2 ) 1 ( X µ 0 ) Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals t 2 is the square of the distance from the sample mean X to the test value µ 0, expressed in standard deviations of X (s/ n). H 0 is rejected at significance level α if: n( X µ 0 )(s 2 ) 1 ( X µ 0 ) > t 2 n 1(α/2) Lecture #5-9/21/2005 Slide 10 of 34
11 H 0 Not Rejected Imagine, for instance, that for your wager you conclude that H 0 is not rejected, or that there is not enough statistical evidence to conclude that the average height of KU grad students is 68 inches. Does this mean that you were correct, that 68 inches is the average height of KU grad students? Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals Are there other values of µ that would be also be consistent with the data? Could you pick another µ 0 such that you would fail to reject H 0? Lecture #5-9/21/2005 Slide 11 of 34
12 Confidence Intervals For the hypothesis test you just constructed, the reality is that there could be a range of values µ where you could not detect differences from your sample mean. This region is at the heart of the concept of confidence intervals. Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals Recall that a confidence interval gives us a range around the sample mean ( X) where the population mean (µ) lies with a given probability 1 α. Specifically, we say that with 100 (1 α)% confidence that µ 0 lies within: x ± t n 1 (α/2) s n Lecture #5-9/21/2005 Slide 12 of 34
13 Confidence Interval Example In reality, what does statistical confidence translate into? The statistical confidence level relays the probability that the mean µ falls within a specified interval. This is akin to having α% of confidence intervals not contain the true population mean. Imagine you set α = 0.05, then take a set of 100 random samples from a population. About 5 of those 100 samples would have confidence intervals where µ is not contained in the interval. Hypothesis Testing Hypothesis Evaluation H 0 Not Rejected Confidence Intervals In class example... Lecture #5-9/21/2005 Slide 13 of 34
14 In univariate statistics we were concerned......with making hypotheses based on a single mean µ 0....determining the location of the mean on a single interval. Multivariate statistics, however, shift the focus from a single mean (or single interval) to making inferences about multiple means (a mean vector). The problem is phrased as determining the whether a p 1 vector µ 0 is a plausible value for the true mean of a multivariate distribution. Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information Lecture #5-9/21/2005 Slide 14 of 34
15 A Wager, Part II Imagine, for instance, that you had a feeling that: The average height of KU grad students was 68 inches. -and- The average distance from the elbow to the wrist for KU grad students was 10 inches. Your other friend, another contrarian, thinks otherwise and makes you a bet that the average height of graduate students is not 68 inches and that the average forearm distance is not 10 inches. Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information You collect data on a both variables. How can you tell who wins the bet? Lecture #5-9/21/2005 Slide 15 of 34
16 Multivariate Hypotheses As with the univariate analog of the wager, a multivariate hypothesis test can be constructed. In multivariate, however, each hypothesis would be phrased in terms of mean vectors rather than a single mean value (scalar) as was done with univariate statistics: Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information H 0 : µ = µ 0 H 1 : µ µ 0 Or, phrased in terms of your wager: H 0 : µ = H 1 : µ [ [ ] ] Lecture #5-9/21/2005 Slide 16 of 34
17 Why Use Multivariate? But why would we want to do use a multivariate test for a mean when we already know how to test each variable individually Controls for Type I error Univariate tests ignore associations between the variables (i.e., ignore the multivariate structure). Multivariate tests can be more powerful (i.e. there may be cases where each univariate test would not be significant, BUT the multivariate test is). Multivariate tests can reveal more about the variables (i.e., provide insight about why we rejected H 0 ). Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information Lecture #5-9/21/2005 Slide 17 of 34
18 Hotelling s T 2 To evaluate the multivariate hypothesis test, a test statistic must be constructed. This test statistic is called Hotelling s T 2 : where: T 2 = n ( X µ0 ) S 1 ( X µ0 ) S p p = 1 n 1 X p 1 = 1 n µ 0 = n i=1 X i n (X i X) (X i X) i=1 µ 10 µ 20. Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information µ p0 Lecture #5-9/21/2005 Slide 18 of 34
19 Multivariate Hypothesis Evaluation The T 2 statistic has a scaled F-distribution: (n 1)p (n p) F p,n p H 0 is rejected if T 2 is greater than a specified percentage point of the F distribution with p df numerator and n p degrees of freedom denominator. For instance, lets look at the values for the means of our data. SAS Example #2... Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information Lecture #5-9/21/2005 Slide 19 of 34
20 Invariance An appealing property of the T 2 statistic is that it is invariant (unchanged) under changes in the units of measurement for X of the form: Y p 1 = C p p X p 1 + d p 1 Multivariate Hypotheses Why Multivariate? Hotelling s T 2 MV Hypothesis Evaluation T 2 Information In our example, this means that even if we changed our units of measure from inches to meters (1 IN = M), our T 2 statistic will be unchanged. SAS Example #3... Lecture #5-9/21/2005 Slide 20 of 34
21 Note: This section is developed only for purposes of introducing new statistical terminology. A common method for conducting hypothesis tests in multivariate statistics is by use of likelihood ratios. A likelihood ratio is formed by comparing the likelihood of the data under the the null hypothesis with the likelihood of the data overall. For large samples (with frequently used distributions), LR tests have some very appealing statistical properties (known distributions, most powerful tests). T 2 LR The hypothesis test for the multivariate mean vector can be expressed in terms of a likelihood ratio. As we generalize from a mean vector to multiple mean vectors (think going from t-tests to ANOVA), LR tests will become our predominant way of evaluating multivariate hypotheses. Lecture #5-9/21/2005 Slide 21 of 34
22 The LR for comparing multivariate means is computed by: where Λ = max Σ L(µ 0,Σ) max µ,σ L(µ,Σ) = ˆΣ = 1 n ˆΣ 0 = 1 n ( ˆΣ ˆΣ 0 n (x i x) (x i x) i=1 n (x i µ 0 ) (x i µ 0 ) i=1 Λ can be expressed as a function of T 2 : ) (n/2) T 2 LR Λ 2/n = (1 + T 2 (n 1) ) 1 Lecture #5-9/21/2005 Slide 22 of 34
23 Typically, when the sample size is large, the distribution of a function Λ ( 2 lnλ) under the null hypothesis is approximately χ 2 where the degrees of freedom are reflected by the dimensionality of the hypotheses under study. 2 lnλ is something you may get used to seeing when using multivariate statistics. T 2 LR Lecture #5-9/21/2005 Slide 23 of 34
24 Just as with univariate statistics, we can construct confidence intervals for the mean vector for multivariate inference. These intervals are no longer for a single number, but for a set of numbers contained by the mean vector. The term Confidence Region is used to describe the multivariate confidence intervals. In general, a 100 (1 α)% confidence region for the mean vector of a p-dimensional normal distribution is the ellipsoid determined by all µ such that: Building CRs n ( X µ ) S 1 ( X µ ) = p(n 1) (n p) F p,n p(α) Lecture #5-9/21/2005 Slide 24 of 34
25 Building CRs - Population To build confidence regions, recall our last lecture about the multivariate normal distribution... Specifically: (x µ)σ 1 (x µ) = χ 2 p (α) Building CRs provides the confidence region containing 1 α of the probability mass of the MVN distribution. We then calculated the axes of the ellipsoid by computing the eigenvalues and eigenvectors of the covariance matrix Σ: Specifically: (x µ)σ 1 (x µ) = c 2 has ellipsoids centered at µ, and has axes ±c λ i e i. Lecture #5-9/21/2005 Slide 25 of 34
26 Building CRs - Sample A similar function is used to develop the confidence region for the multivariate mean vector based on the sample mean ( x) and covariance matrix ( S). Note that because we are taking a sample rather than the population, the distribution of the squared statistical distance is no longer χ 2 p(α) but rather p(n 1) (n p) F p,n p(α) Building CRs This means that the confidence region is centered at ( x), and has axes ± λ i p(n 1) (n p) F p,n p(α)e i. SAS Example #4... Lecture #5-9/21/2005 Slide 26 of 34
27 An advantage to constructing confidence regions lies in the reduction (projections) of such regions to marginal (univariate) variables. For instance, in our wager example, we already know what the confidence region would look like for both height and forearm length, but is there any way we could use the multivariate power to produce a set of univariate confidence intervals for height and forearm length that controls the overall error rate? The answer, of course, is yes (done in concert with both variables)... The method lies in creating linear combinations of the multivariate variables. Lecture #5-9/21/2005 Slide 27 of 34
28 Imagine you collect a multivariate random sample: x 1, x 2,...,x p (from N p (µ,σ). Imagine then that you combine all multivariate variables into a new composite variable z j : z j = a 1 x j1 + a 2 x j a p x jp Using what we learned about linear combinations of variables that were jointly distributed multivariate normal, we know that Z has a normal distribution with: z = a x s 2 z = a S x a Lecture #5-9/21/2005 Slide 28 of 34
29 The 100 (1 α)% confidence interval for µ Z = aµ x is given by: a x ± p(n 2) (n p) F p,n p(α)a S x a Lecture #5-9/21/2005 Slide 29 of 34
30 Reasonable Combinations? What are the coefficients of a? Consider: A = Here we are constructing simultaneous univariate confidence intervals for each variable p in our data set. These will capitalize on the multivariate distribution of the variables (if correlated). SAS Example # 5... Lecture #5-9/21/2005 Slide 30 of 34
31 Large Sample Inferences Recall that the t statistic approaches normality as n gets large. Similarly, as n gets large, the distribution of T 2 approximates the χ 2 formulated by the exponent of the multivariate normal distribution. Practically speaking, the formula for the value of F will approximate the χ 2 distribution value, although you can change the critical value if you feel the need to. Large Samples For more information see Section 5.5. Lecture #5-9/21/2005 Slide 31 of 34
32 Multivariate quality control charts are discussed in Section 5.6. The methods used to inspect such charts are based on T 2 and linear combinations of data. Inferences about the mean vector when some observations are missing are discussed in Section 5.7. This discussion quickly moves past the scope of this course, and for purposes of practicality, I am omitting it from lecture. Large Samples Corrections for serially correlated observations are discussed in Section 5.8. Lecture #5-9/21/2005 Slide 32 of 34
33 Final Thought We have begun our discussion of multivariate inferential statistics with methods analogous to the univariate t-test. Today we only discussed testing the mean vector versus some stable quantity. Final Thought Next Class As we move forward, recall how your univariate courses moved from t-tests to ANOVA. Such transitions will become apparent in the next two weeks. Lecture #5-9/21/2005 Slide 33 of 34
34 Next Time Inferences for Multiple Mean Vectors. Paired comparisons. MANOVA. Profile analysis. Growth Curves. More simultaneous confidence intervals. Final Thought Next Class Lecture #5-9/21/2005 Slide 34 of 34
Inferences about a Mean Vector
Inferences about a Mean Vector Edps/Soc 584, Psych 594 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Comparisons of Two Means Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c
More informationProfile Analysis Multivariate Regression
Lecture 8 October 12, 2005 Analysis Lecture #8-10/12/2005 Slide 1 of 68 Today s Lecture Profile analysis Today s Lecture Schedule : regression review multiple regression is due Thursday, October 27th,
More informationRepeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models
Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models
More informationIntroduction to Matrix Algebra and the Multivariate Normal Distribution
Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Structural Equation Modeling Lecture #2 January 18, 2012 ERSH 8750: Lecture 2 Motivation for Learning the Multivariate
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Two sample T 2 test 1 Two sample T 2 test 2 Analogous to the univariate context, we
More informationPsych Jan. 5, 2005
Psych 124 1 Wee 1: Introductory Notes on Variables and Probability Distributions (1/5/05) (Reading: Aron & Aron, Chaps. 1, 14, and this Handout.) All handouts are available outside Mija s office. Lecture
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationMultilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2
Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do
More informationMaximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood
Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An
More informationApplied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur
Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 29 Multivariate Linear Regression- Model
More information5 Inferences about a Mean Vector
5 Inferences about a Mean Vector In this chapter we use the results from Chapter 2 through Chapter 4 to develop techniques for analyzing data. A large part of any analysis is concerned with inference that
More informationApplied Multivariate and Longitudinal Data Analysis
Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference
More informationLecture 5: Hypothesis tests for more than one sample
1/23 Lecture 5: Hypothesis tests for more than one sample Måns Thulin Department of Mathematics, Uppsala University thulin@math.uu.se Multivariate Methods 8/4 2011 2/23 Outline Paired comparisons Repeated
More informationVectors and Matrices Statistics with Vectors and Matrices
Vectors and Matrices Statistics with Vectors and Matrices Lecture 3 September 7, 005 Analysis Lecture #3-9/7/005 Slide 1 of 55 Today s Lecture Vectors and Matrices (Supplement A - augmented with SAS proc
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationA Introduction to Matrix Algebra and the Multivariate Normal Distribution
A Introduction to Matrix Algebra and the Multivariate Normal Distribution PRE 905: Multivariate Analysis Spring 2014 Lecture 6 PRE 905: Lecture 7 Matrix Algebra and the MVN Distribution Today s Class An
More informationSurvey on Population Mean
MATH 203 Survey on Population Mean Dr. Neal, Spring 2009 The first part of this project is on the analysis of a population mean. You will obtain data on a specific measurement X by performing a random
More informationLecture 3. Inference about multivariate normal distribution
Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationINTRODUCTION TO ANALYSIS OF VARIANCE
CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two
More informationMultiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions
Introduction to Analysis of Variance 1 Experiments with More than 2 Conditions Often the research that psychologists perform has more conditions than just the control and experimental conditions You might
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationQuestions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
More informationMATH5745 Multivariate Methods Lecture 07
MATH5745 Multivariate Methods Lecture 07 Tests of hypothesis on covariance matrix March 16, 2018 MATH5745 Multivariate Methods Lecture 07 March 16, 2018 1 / 39 Test on covariance matrices: Introduction
More informationST505/S697R: Fall Homework 2 Solution.
ST505/S69R: Fall 2012. Homework 2 Solution. 1. 1a; problem 1.22 Below is the summary information (edited) from the regression (using R output); code at end of solution as is code and output for SAS. a)
More informationUnivariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation
Univariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation PRE 905: Multivariate Analysis Spring 2014 Lecture 4 Today s Class The building blocks: The basics of mathematical
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationSTAT 501 EXAM I NAME Spring 1999
STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your
More informationSTT 843 Key to Homework 1 Spring 2018
STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More informationPOLI 443 Applied Political Research
POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College
More informationNotes on the Multivariate Normal and Related Topics
Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions
More informationCourse Review. Kin 304W Week 14: April 9, 2013
Course Review Kin 304W Week 14: April 9, 2013 1 Today s Outline Format of Kin 304W Final Exam Course Review Hand back marked Project Part II 2 Kin 304W Final Exam Saturday, Thursday, April 18, 3:30-6:30
More informationAn Introduction to Path Analysis
An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationEigenvalues and eigenvectors
Roberto s Notes on Linear Algebra Chapter 0: Eigenvalues and diagonalization Section Eigenvalues and eigenvectors What you need to know already: Basic properties of linear transformations. Linear systems
More informationRegression With a Categorical Independent Variable: Mean Comparisons
Regression With a Categorical Independent Variable: Mean Lecture 16 March 29, 2005 Applied Regression Analysis Lecture #16-3/29/2005 Slide 1 of 43 Today s Lecture comparisons among means. Today s Lecture
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -
More informationThe Multinomial Model
The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationRejection regions for the bivariate case
Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test
More informationConfidence Interval Estimation
Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 4 5 Relationship to the 2-Tailed Hypothesis Test Relationship to the 1-Tailed Hypothesis Test 6 7 Introduction In
More informationAn Introduction to Mplus and Path Analysis
An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationWooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics
Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).
More informationLeast Squares Estimation
Least Squares Estimation Using the least squares estimator for β we can obtain predicted values and compute residuals: Ŷ = Z ˆβ = Z(Z Z) 1 Z Y ˆɛ = Y Ŷ = Y Z(Z Z) 1 Z Y = [I Z(Z Z) 1 Z ]Y. The usual decomposition
More informationChapter 12 - Lecture 2 Inferences about regression coefficient
Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous
More informationChapter 11. Hypothesis Testing (II)
Chapter 11. Hypothesis Testing (II) 11.1 Likelihood Ratio Tests one of the most popular ways of constructing tests when both null and alternative hypotheses are composite (i.e. not a single point). Let
More informationST 732, Midterm Solutions Spring 2019
ST 732, Midterm Solutions Spring 2019 Please sign the following pledge certifying that the work on this test is your own: I have neither given nor received aid on this test. Signature: Printed Name: There
More informationRandomized Complete Block Designs
Randomized Complete Block Designs David Allen University of Kentucky February 23, 2016 1 Randomized Complete Block Design There are many situations where it is impossible to use a completely randomized
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationChapter 7, continued: MANOVA
Chapter 7, continued: MANOVA The Multivariate Analysis of Variance (MANOVA) technique extends Hotelling T 2 test that compares two mean vectors to the setting in which there are m 2 groups. We wish to
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationHotelling s One- Sample T2
Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationPSYC 331 STATISTICS FOR PSYCHOLOGISTS
PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationMultivariate Statistics
Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical
More informationElementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing
(Section 8-2) Hypotheses testing is not all that different from confidence intervals, so let s do a quick review of the theory behind the latter. If it s our goal to estimate the mean of a population,
More informationEcon 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias
Contact Information Elena Llaudet Sections are voluntary. My office hours are Thursdays 5pm-7pm in Littauer Mezzanine 34-36 (Note room change) You can email me administrative questions to ellaudet@gmail.com.
More informationMultivariate Linear Regression Models
Multivariate Linear Regression Models Regression analysis is used to predict the value of one or more responses from a set of predictors. It can also be used to estimate the linear association between
More informationCOMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS
Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department
More informationLab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d
BIOS 4120: Introduction to Biostatistics Breheny Lab #11 We will explore observational studies in today s lab and review how to make inferences on contingency tables. We will only use 2x2 tables for today
More informationStatistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong
Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data
More informationProbability and Statistics
Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT
More informationVariance Estimates and the F Ratio. ERSH 8310 Lecture 3 September 2, 2009
Variance Estimates and the F Ratio ERSH 8310 Lecture 3 September 2, 2009 Today s Class Completing the analysis (the ANOVA table) Evaluating the F ratio Errors in hypothesis testing A complete numerical
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationInferences About Two Proportions
Inferences About Two Proportions Quantitative Methods II Plan for Today Sampling two populations Confidence intervals for differences of two proportions Testing the difference of proportions Examples 1
More informationProbability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur
Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation
More informationChi square test of independence
Chi square test of independence We can eyeball differences between percentages to determine whether they seem large enough to be important Better: Are differences in percentages statistically significant?
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationChapter 3 ANALYSIS OF RESPONSE PROFILES
Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Introduction Edps/Psych/Stat/ 584 Applied Multivariate Statistics Carolyn J Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c Board of Trustees,
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationMANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:
MULTIVARIATE ANALYSIS OF VARIANCE MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: 1. Cell sizes : o
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationStatistics Introductory Correlation
Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.
More informationPSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing
Page Title PSY 305 Module 3 Introduction to Hypothesis Testing Z-tests Five steps in hypothesis testing State the research and null hypothesis Determine characteristics of comparison distribution Five
More informationACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.
ACE 564 Spring 2006 Lecture 8 Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information by Professor Scott H. Irwin Readings: Griffiths, Hill and Judge. "Collinear Economic Variables,
More informationDifference between means - t-test /25
Difference between means - t-test 1 Discussion Question p492 Ex 9-4 p492 1-3, 6-8, 12 Assume all variances are not equal. Ignore the test for variance. 2 Students will perform hypothesis tests for two
More informationIn most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up
THE UNIVERSITY OF MINNESOTA Statistics 5401 September 17, 2005 Chi-Squared Q-Q plots to Assess Multivariate Normality Suppose x 1, x 2,..., x n is a random sample from a p-dimensional multivariate distribution
More informationA proportion is the fraction of individuals having a particular attribute. Can range from 0 to 1!
Proportions A proportion is the fraction of individuals having a particular attribute. It is also the probability that an individual randomly sampled from the population will have that attribute Can range
More informationHOW TO WRITE PROOFS. Dr. Min Ru, University of Houston
HOW TO WRITE PROOFS Dr. Min Ru, University of Houston One of the most difficult things you will attempt in this course is to write proofs. A proof is to give a legal (logical) argument or justification
More informationA Re-Introduction to General Linear Models (GLM)
A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing
More informationLecture 41 Sections Mon, Apr 7, 2008
Lecture 41 Sections 14.1-14.3 Hampden-Sydney College Mon, Apr 7, 2008 Outline 1 2 3 4 5 one-proportion test that we just studied allows us to test a hypothesis concerning one proportion, or two categories,
More informationMultiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD
Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Introduction When you perform statistical inference, you are primarily doing one of two things: Estimating the boundaries
More informationIndependent Samples ANOVA
Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationMeasurement & Lab Equipment
Measurement & Lab Equipment Abstract This lab reviews the concept of scientific measurement, which you will employ weekly throughout this course. Specifically, we will review the metric system so that
More informationCENTRAL LIMIT THEOREM (CLT)
CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic
More information