Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):
|
|
- Thomas French
- 5 years ago
- Views:
Transcription
1 Lecture Three Normal theory null distributions Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): A random variable which is a sum of many independent random variables will have an (approximately) normal distribution Examples (1) Many natural responses may be modelled as the additive effect of many factors eg crop yield: where y 1 = a 1 x seed1 + a 2 x soil1 + a 3 x water1 + y 2 = a 1 x seed2 + a 2 x soil2 + a 3 x water2 + y n = a 1 x seedn + a 2 x soiln + a 3 x watern + x seed1,, x seedn are independent samples from a (not necessarily normal) distribution with mean µ seed and variance σ 2 seed ; x soil1,, x soiln are independent samples from a distribution with mean µ soil and variance σ 2 soil ; x water1,, x watern are independent samples from a distribution with mean µ water and variance σ water 2 ; it then follows that (y 1,, y n ) will be independent samples from an approximately normal joint distribution with µ Y = a 1 µ seed + a 2 µ soil + a 3 µ water + σ 2 Y = a 2 1σ 2 seed + a2 2σ 2 soil + a2 3σ 2 water + additive effects normally distributed data (2) The sampling distribution for Ȳ from independent samples from a population Recap: sampling distribution: 1
2 Population A sample (y (1) 1,, y(1) n ) ȳ (1) Population A sample (y (2) 1,, y(2) n ) ȳ (2) Population A sample (y (N) 1,, y n (N) ) ȳ (N) Distribution of (ȳ (1),, ȳ (N) ) is called the sampling distribution of the sample mean Ȳ For reasonable distributions (finite mean µ and variance σ 2 ) and non-tiny sample sizes (n > 30), Ȳ will have an approximately normal distribution, with mean µ, variance σ 2 /n Why do we care about the sampling distribution of Ȳ? Consider H 0 : E(Y A ) = E(Y B ) = µ (treatment has no effect) Then regardless of the distribution of the data, under H 0 we have hence Ȳ A N(µ, σ 2 /n A ) Ȳ B N(µ, σ 2 /n B ) Ȳ A ȲB N(0, (σ 2 /n A ) + (σ 2 /n B )) represents a distribution of hypothetical results of a sampling experiment that it could have occurred under H 0 (Review of) properties of the Normal Distribution (I) (1) If Y N(µ, σ 2 ), then ay +b N(aµ+b, a 2 σ 2 ); in particular, (Y µ)/σ N(0, 1) N(0, 1) is called the standard Normal distribution (2) If Y 1 N(µ 1, σ 2 1) and Y 2 N(µ 2, σ 2 2) and Y 1, Y 2 independent then Y 1 + Y 2 N(µ 1 + µ 2, σ σ 2 2) (3) If Y 1,, Y n are an iid sample from N(µ, σ 2 ) then independent of S 2 = 1 n (Y i n 1 Ȳ )2 Ȳ is statistically 2
3 Normal theory Null Distributions Consider the simple hypothesis test: H 0 : E(Y ) = µ 0 H 1 : E(Y ) µ 0 (two-sided) obtain data y 1,, y n, evaluate H 0, H 1 with test statistic: d(y) = d(y 1,, y n ) = ȳ µ 0 σ/ n 1 If data are from a normal population N(µ, σ 2 ) then d(y) = (Ȳ µ 0) (σ/ n) Ȳ µ 0 N(µ µ 0, σ 2 /n) and thus if H 0 is true then d(y) N(0, 1) N( µ µ 0 σ/ n, 1) Thus if the null hypothesis is true then we would expect d(y) to have a standard normal distribution, eg in 95% of samples taking a value between 196 and +196 Equivalently: the null distribution is standard normal 2 If data are not normal, but H 0 is true, the variance is σ 2, and n is fairly big (eg n > 30), then by the CLT we have: Hypothesis Test d(y) = (Ȳ µ 0) (σ/ n) N(0, 1) Large values of d(y) provide strong evidence against H 0 Reject H 0 for larger values of d(y) p-value = P r( d(y) d(y obs ) ) = P r(z d(y obs ) ) + P r(z d(y obs ) ) = 2P r(z d(y obs ) ) (here we use Z to indicate a standard normal RV) Problem: σ 2 is usually unknown = 2 (1 pnorm( d(y obs ), 0, 1)) Thus we cannot compute d(y) (in this sense it is not a genuine test-statistic) 3
4 Solution: approximate σ 2 with s 2, the sample variance, and use t(y) = ȳ µ 0 s/ n Q: What is the distribution of t(y) under H 0 : E(Y ) = µ 0? Well, s σ, hence we might hope that: ȳ µ 0 s/ n ȳ µ 0 σ/ N(0, 1) n For very large samples (> 100) this is a reasonable approximation, but for less large samples we need to take into account that S varies around σ Properties of the Normal distribution (2): χ 2 and t-distributions (4) If Z 1,, Z n iid N(0, 1) then n Zi 2 χ 2 n the χ 2 ( Chi-squared ) distribution with n df (degrees of freedom) In particular, if Y 1,, Y n iidn(µ, σ 2 ), then Y 1 µ, Y 2 µ, Y n µ σ σ σ iid N(0, 1) 1 n σ 2 (Y i µ) 2 χ 2 n n 1 σ 2 S 2 = n 1 1 σ 2 n 1 n (Y i Ȳ )2 χ 2 n 1 Note: the df is the number of independent normal RVs that are summed and squared, or equivalently, the dimension of the space in which these variables live: Clearly Yi µ σ is independent of Yj µ σ ; But Yj Ȳ σ is not independent of Y j Ȳ σ, for j j However, if we define W i = Y i Ȳ then 1 σ 2 n (Y i Ȳ )2 = 1 n σ 2 W 2 i = 1 n 1 σ 2 U 2 i 4
5 where for i = 1,, n 1: U i = i i + 1 W i i(i + 1) i j=1 W j and U j, U j are independent for j j Thus we see that n 1 is indeed the correct df for S 2 (5) If Z N(0, 1), X χ 2 k and Z and X are independent then Z X/k t k Student s t-distribution on k df ( Student was a pseudonym used by W Gosset) Note that as k t k N(0, 1) distribution The t-distribution has heavier tails than the standard normal, ie for large values of x: where Z N(0, 1) and T t k Back to the hypothesis test: P ( Z > x) < P ( T > x) We now return to the question: if E(Y ) = µ 0, so H 0 is true, and in addition, Y 1, Y n iid N(µ 0, σ 2 ), what is the distribution of t(y) = Ȳ µ 0 S/ n? We already showed that if Y 1, Y n iid N(µ 0, σ 2 ) (Ȳ µ 0) (σ/ n) Further, from (4) it follows that N(0, 1) Z n 1 σ 2 S 2 χ 2 n 1 X with k = n 1 Finally from (3) we have that Ȳ and S2 are independent Hence Z and X are independent Thus it follows that Ȳ µ 0 S/ n = Ȳ µ 0 σ/ n n 1 σ 2 S2 n 1 = Z X /(n 1) t n 1 Note that neither X nor Z are test statistics, since (unless we know σ 2 ) they cannot be computed from the sample 5
6 One sample t-test H 0 : E(Y ) = µ 0 H 1 : E(Y ) µ 0 (two-sided) Assumption: if H 0 is true then Y 1,, Y n iid N(µ 0, σ 2 ) Data y 1,, y n Test statistic: t obs = t(y) = (ȳ ) µ0 n s Null distribution: from the assumptions, if H 0 is true then t(y) t n 1, ie the population of hypothetical values of t(y) that might have been sampled is a t-distribution on n 1 df p-value: measuring evidence against H 0 : p = P r ( t(y) t obs H 0 ) = 2 (1 pt( tobs, n 1)) You can also try ttest(datavector, mu=µ 0 ) Basic Decision Theory Goal: Reject H 0 when false, accept H 0 when true Method: Judge evidence provided by data against H 0 Truth Decision H 0 true H 0 false Accept H 0 : Correct Type II error Reject H 0 : Type I error Correct Alternate terminology: H 0 : no treatment effect / you don t have the disease H 1 : treatment effect / you do have the disease then Type I errors correspond to False positives; Type II errors correspond to False negatives Note: A type I error can only occur when the null hypothesis is true Conversely, a type II error can only occur when the alternative is true 6
7 p-value Decision Procedure: (i) Select a level α (0 < α < 1) (ii) Compute the observed value of the statistic, and hence the p-value (iii) Reject H 0 if p-value α; accept H 0 if p-value> α Note: A large p-value does not imply that the alternative hypothesis is false, merely that there is little evidence against the null hypothesis For this reason some people speak of failing to reject H 0, rather than accepting H 0 ; the implication being that the null hypothesis has escaped rejection this time, but might not be so lucky next time Example: One sample t-test H 0 : E(Y ) = µ 0 H 1 : E(Y ) µ 0 (two-sided) Let t obs be the observed value of the t-statistic t(y) = n(ȳ µ 0 )/s Let T n 1 be a random variable with a Student t-distribution on n 1 df reject H 0 iff p-value α P r( T n 1 t obs ) α 2 P r(t n 1 t obs ) α P r(t n 1 t obs ) α/2 1 P r(t n 1 t obs ) α/2 P r(t n 1 t obs ) 1 α/2 Let t n 1,1 α/2 be the 1 α/2 quantile of the t-distribution with n 1 df ie P r(t n 1 < t n 1,1 α/2 ) = 1 α/2 Let x 1, x 2, x 3 be three numbers such that (picture) x 1 < t n 1,1 α/2 < x 2 < t n 1,1 α/2 < x 3 If t obs = x 1 then P r(t n 1 x 1 ) > 1 α/2, hence p-value < α, so we reject H 0 If t obs = x 2 then P r(t n 1 x 2 ) < 1 α/2, hence p-value > α, so we accept H 0 If t obs = x 3 then P r(t n 1 x 3 ) < 1 α/2, hence p-value < α, so we reject H 0 Thus we see that for a one sample t-test, the p-value decision procedure is equivalent to 7
8 t-statistic Decision Procedure: (i) Select a level α (0 < α < 1) (ii) Compute the observed t-statistic t obs = t(y) (iii) Reject H 0 if t obs t n 1,1 α/2 ; Accept H 0 if t obs < t n 1,1 α/2 Terminology The range of values of the test statistic where (for a given α) the null hypothesis is rejected is called the rejection region; conversely the range of values for which the null is not rejected is called the acceptance region It should now be obvious that for both decision procedures that P (type I error H 0 is true ) = P (reject H 0 H 0 is true ) = P ( t(y) t n 1,1 α/2 H 0 is true) = 2 α/2 = α Such a procedure is called a level-α test α controls the pre-experimental type-i error rate Note that we can construct a decision procedure to give any specified type I error rate Note: (Interpretation) If every experimenter used, for example, α = 005 then The null hypothesis would be falsely rejected for 5% of those experiments in which H 0 is true, and would correctly not be rejected for 95% of these experiments (where H 0 is true) What about those experiments where H 1 is true? For what proportion of these will we incorrectly accept H 0, and for what proportion will we correctly reject H 0? ie for such experiments what will our Type II error rate be? We will return to this question when we discuss power and the control of type II errors (It will depend on how H 1 is true, and our sample size) 8
Confidence intervals
Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationBIO5312 Biostatistics Lecture 6: Statistical hypothesis testings
BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical
More informationHypothesis Testing One Sample Tests
STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal
More informationIntroductory Econometrics
Session 4 - Testing hypotheses Roland Sciences Po July 2011 Motivation After estimation, delivering information involves testing hypotheses Did this drug had any effect on the survival rate? Is this drug
More information1 Hypothesis testing for a single mean
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationGov 2000: 6. Hypothesis Testing
Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test
More informationPopulation Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8
Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research
More informationStat 427/527: Advanced Data Analysis I
Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationSummary: the confidence interval for the mean (σ 2 known) with gaussian assumption
Summary: the confidence interval for the mean (σ known) with gaussian assumption on X Let X be a Gaussian r.v. with mean µ and variance σ. If X 1, X,..., X n is a random sample drawn from X then the confidence
More informationAnalysis of Variance
Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationLecture 12: Small Sample Intervals Based on a Normal Population Distribution
Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)
More informationSummary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)
Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationCH.9 Tests of Hypotheses for a Single Sample
CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More information280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses
280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 9-1.2 Tests of Statistical Hypotheses To illustrate the general concepts, consider the propellant burning rate problem introduced earlier. The null
More informationWhat p values really mean (and why I should care) Francis C. Dane, PhD
What p values really mean (and why I should care) Francis C. Dane, PhD Session Objectives Understand the statistical decision process Appreciate the limitations of interpreting p values Value the use of
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS 1a. Under the null hypothesis X has the binomial (100,.5) distribution with E(X) = 50 and SE(X) = 5. So P ( X 50 > 10) is (approximately) two tails
More information1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments
/4/008 Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University C. A Sample of Data C. An Econometric Model C.3 Estimating the Mean of a Population C.4 Estimating the Population
More informationWooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics
Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).
More informationEC2001 Econometrics 1 Dr. Jose Olmo Room D309
EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationEconomics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing
Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing Eric Zivot October 12, 2011 Hypothesis Testing 1. Specify hypothesis to be tested H 0 : null hypothesis versus. H 1 : alternative
More informationSTA442/2101: Assignment 5
STA442/2101: Assignment 5 Craig Burkett Quiz on: Oct 23 rd, 2015 The questions are practice for the quiz next week, and are not to be handed in. I would like you to bring in all of the code you used to
More informationSTT 843 Key to Homework 1 Spring 2018
STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationChapter 10. Hypothesis Testing (I)
Chapter 10. Hypothesis Testing (I) Hypothesis Testing, together with statistical estimation, are the two most frequently used statistical inference methods. It addresses a different type of practical problems
More informationChapter 5: HYPOTHESIS TESTING
MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate
More information[Chapter 6. Functions of Random Variables]
[Chapter 6. Functions of Random Variables] 6.1 Introduction 6.2 Finding the probability distribution of a function of random variables 6.3 The method of distribution functions 6.5 The method of Moment-generating
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationIntroductory Econometrics. Review of statistics (Part II: Inference)
Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationCHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:
CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationGov Univariate Inference II: Interval Estimation and Testing
Gov 2000-5. Univariate Inference II: Interval Estimation and Testing Matthew Blackwell October 13, 2015 1 / 68 Large Sample Confidence Intervals Confidence Intervals Example Hypothesis Tests Hypothesis
More informationAs an example, consider the Bond Strength data in Table 2.1, atop page 26 of y1 y 1j/ n , S 1 (y1j y 1) 0.
INSY 7300 6 F01 Reference: Chapter of Montgomery s 8 th Edition Point Estimation As an example, consider the Bond Strength data in Table.1, atop page 6 of By S. Maghsoodloo Montgomery s 8 th edition, on
More informationThe t-statistic. Student s t Test
The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very
More informationStatistics and Sampling distributions
Statistics and Sampling distributions a statistic is a numerical summary of sample data. It is a rv. The distribution of a statistic is called its sampling distribution. The rv s X 1, X 2,, X n are said
More informationJames H. Steiger. Department of Psychology and Human Development Vanderbilt University. Introduction Factors Influencing Power
Department of Psychology and Human Development Vanderbilt University 1 2 3 4 5 In the preceding lecture, we examined hypothesis testing as a general strategy. Then, we examined how to calculate critical
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More information1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests
Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationIntroduction to the Analysis of Variance (ANOVA)
Introduction to the Analysis of Variance (ANOVA) The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique for testing for differences between the means of multiple (more
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS 1a) The model is cw i = β 0 + β 1 el i + ɛ i, where cw i is the weight of the ith chick, el i the length of the egg from which it hatched, and ɛ i
More informationDescriptive Statistics
Descriptive Statistics Once an experiment is carried out and the results are measured, the researcher has to decide whether the results of the treatments are different. This would be easy if the results
More informationQuantitative Analysis and Empirical Methods
Hypothesis testing Sciences Po, Paris, CEE / LIEPP Introduction Hypotheses Procedure of hypothesis testing Two-tailed and one-tailed tests Statistical tests with categorical variables A hypothesis A testable
More informationCIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8
CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval
More informationIntroduction to Statistical Inference
Introduction to Statistical Inference Dr. Fatima Sanchez-Cabo f.sanchezcabo@tugraz.at http://www.genome.tugraz.at Institute for Genomics and Bioinformatics, Graz University of Technology, Austria Introduction
More informationLecture 13: p-values and union intersection tests
Lecture 13: p-values and union intersection tests p-values After a hypothesis test is done, one method of reporting the result is to report the size α of the test used to reject H 0 or accept H 0. If α
More informationSimple Linear Regression
Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More informationTopic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing
Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing ECO22Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit
More informationTwo Sample Hypothesis Tests
Note Packet #21 Two Sample Hypothesis Tests CEE 3710 November 13, 2017 Review Possible states of nature: H o and H a (Null vs. Alternative Hypothesis) Possible decisions: accept or reject Ho (rejecting
More informationChapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression
Chapter 7 Hypothesis Tests and Confidence Intervals in Multiple Regression Outline 1. Hypothesis tests and confidence intervals for a single coefficie. Joint hypothesis tests on multiple coefficients 3.
More informationFinansiell Statistik, GN, 15 hp, VT2008 Lecture 10-11: Statistical Inference: Hypothesis Testing
Finansiell Statistik, GN, 15 hp, VT008 Lecture 10-11: Statistical Inference: Hypothesis Testing Gebrenegus Ghilagaber, PhD, Associate Professor April 1, 008 1 1 Statistical Inferences: Introduction Recall:
More informationSTAT 4385 Topic 01: Introduction & Review
STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics
More informationNominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers
Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain
More informationMathematical Statistics
Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics
More informationLecture 15. Hypothesis testing in the linear model
14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma
More informationProbability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!
Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive
More informationIENG581 Design and Analysis of Experiments INTRODUCTION
Experimental Design IENG581 Design and Analysis of Experiments INTRODUCTION Experiments are performed by investigators in virtually all fields of inquiry, usually to discover something about a particular
More informationComposite Hypotheses and Generalized Likelihood Ratio Tests
Composite Hypotheses and Generalized Likelihood Ratio Tests Rebecca Willett, 06 In many real world problems, it is difficult to precisely specify probability distributions. Our models for data may involve
More informationChapter 9: Hypothesis Testing Sections
Chapter 9: Hypothesis Testing Sections 9.1 Problems of Testing Hypotheses 9.2 Testing Simple Hypotheses 9.3 Uniformly Most Powerful Tests Skip: 9.4 Two-Sided Alternatives 9.6 Comparing the Means of Two
More informationProbability Theory and Statistics. Peter Jochumzen
Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................
More informationQuantitative Methods for Economics, Finance and Management (A86050 F86050)
Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge
More informationBusiness Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology
More informationThe goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.
The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More informationSTAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)
STAT40 Midterm Exam University of Illinois Urbana-Champaign October 19 (Friday), 018 3:00 4:15p SOLUTIONS (Yellow) Question 1 (15 points) (10 points) 3 (50 points) extra ( points) Total (77 points) Points
More informationOne sample problem. sample mean: ȳ = . sample variance: s 2 = sample standard deviation: s = s 2. y i n. i=1. i=1 (y i ȳ) 2 n 1
One sample problem Population mean E(y) = µ, variance: Var(y) = σ 2 = E(y µ) 2, standard deviation: σ = σ 2. Normal distribution: y N(µ, σ 2 ). Standard normal distribution: z N(0, 1). If y N(µ, σ 2 ),
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationPOLI 443 Applied Political Research
POLI 443 Applied Political Research Session 6: Tests of Hypotheses Contingency Analysis Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College
More informationHYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC
1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare
More information7 Random samples and sampling distributions
7 Random samples and sampling distributions 7.1 Introduction - random samples We will use the term experiment in a very general way to refer to some process, procedure or natural phenomena that produces
More informationEXAMINERS REPORT & SOLUTIONS STATISTICS 1 (MATH 11400) May-June 2009
EAMINERS REPORT & SOLUTIONS STATISTICS (MATH 400) May-June 2009 Examiners Report A. Most plots were well done. Some candidates muddled hinges and quartiles and gave the wrong one. Generally candidates
More informationOutline. Unit 3: Inferential Statistics for Continuous Data. Outline. Inferential statistics for continuous data. Inferential statistics Preliminaries
Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 1 Center for Mind/Brain Sciences (CIMeC) University of Trento,
More informationHypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima
Applied Statistics Lecturer: Serena Arima Hypothesis testing for the linear model Under the Gauss-Markov assumptions and the normality of the error terms, we saw that β N(β, σ 2 (X X ) 1 ) and hence s
More informationIntroduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test
Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists
More information18.05 Practice Final Exam
No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For
More informationClass 19. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 19 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 8.3-8.4 Lecture Chapter 8.5 Go over Exam. Problem Solving
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationLecture 11. Multivariate Normal theory
10. Lecture 11. Multivariate Normal theory Lecture 11. Multivariate Normal theory 1 (1 1) 11. Multivariate Normal theory 11.1. Properties of means and covariances of vectors Properties of means and covariances
More informationStatistics 251: Statistical Methods
Statistics 251: Statistical Methods 1-sample Hypothesis Tests Module 9 2018 Introduction We have learned about estimating parameters by point estimation and interval estimation (specifically confidence
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More information