STAT Chapter 8: Hypothesis Tests

Similar documents
280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

hypothesis a claim about the value of some parameter (like p)

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

Single Sample Means. SOCY601 Alan Neustadtl

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

CHAPTER 9: HYPOTHESIS TESTING

Econ 325: Introduction to Empirical Economics

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Statistics 251: Statistical Methods

Chapter 9 Inferences from Two Samples

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

Chapter 5: HYPOTHESIS TESTING

Introduction to Statistics

First we look at some terms to be used in this section.

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CHAPTER EIGHT TESTS OF HYPOTHESES

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

The Purpose of Hypothesis Testing

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

Chapter 9. Hypothesis testing. 9.1 Introduction

Tests about a population mean

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Lecture 3: Inference in SLR

MATH 240. Chapter 8 Outlines of Hypothesis Tests

Multiple Regression Analysis

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Harvard University. Rigorous Research in Engineering Education

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

example: An observation X comes from a normal distribution with

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

HYPOTHESIS TESTING. Hypothesis Testing

STA Module 10 Comparing Two Proportions

Statistical Inference. Section 9.1 Significance Tests: The Basics. Significance Test. The Reasoning of Significance Tests.

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Chapter 7: Hypothesis Testing

Chapter 5 Confidence Intervals

Introductory Econometrics. Review of statistics (Part II: Inference)

Chapter 3 Multiple Regression Complete Example

Chapter Three. Hypothesis Testing

The Components of a Statistical Hypothesis Testing Problem

EXAM 3 Math 1342 Elementary Statistics 6-7

Example. χ 2 = Continued on the next page. All cells

A3. Statistical Inference

(8 One- and Two-Sample Test Of Hypothesis)

23. MORE HYPOTHESIS TESTING

Hypothesis for Means and Proportions

Psychology 282 Lecture #4 Outline Inferences in SLR

CH.9 Tests of Hypotheses for a Single Sample

8.1-4 Test of Hypotheses Based on a Single Sample

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Inferences About Two Population Proportions

UCLA STAT 251. Statistical Methods for the Life and Health Sciences. Hypothesis Testing. Instructor: Ivo Dinov,

Homework Exercises. 1. You want to conduct a test of significance for p the population proportion.

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Inferences About Two Proportions

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Statistics 301: Probability and Statistics 1-sample Hypothesis Tests Module

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

5.2 Tests of Significance

Lecture 7: Hypothesis Testing and ANOVA

Hypothesis testing. Data to decisions

Difference between means - t-test /25

10.4 Hypothesis Testing: Two Independent Samples Proportion

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

F79SM STATISTICAL METHODS

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

Chapter 12: Inference about One Population

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions.

20 Hypothesis Testing, Part I

Chapter 23. Inference About Means

Chapter 8 of Devore , H 1 :

On Assumptions. On Assumptions

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

Chapter 7 Class Notes Comparison of Two Independent Samples

Econometrics. 4) Statistical inference

Hypotheses Test Procedures. Is the claim wrong?

Statistics for IT Managers

Business Statistics. Lecture 10: Course Review

Statistical Process Control (contd... )

Inferences about central values (.)

9.5 t test: one μ, σ unknown

Sampling Distributions

Two-Sample Inferential Statistics

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

What is a Hypothesis?

Comparing Means from Two-Sample

1 Hypothesis testing for a single mean

Section 5.4: Hypothesis testing for μ

Basic Concepts of Inference

Statistical Inference. Hypothesis Testing

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Transcription:

STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular value for a parameter is reasonable. In this case, a popular form of inference is the hypothesis test. We use data to test a claim (about a parameter) called the null hypothesis. Example 1: We claim the proportion of USC students who travel home for Christmas is 0.95. Example 2: We claim the mean nightly hotel price for hotels in SC is no more than $65. Null hypothesis (denoted H 0 ) often represents status quo, previous belief or no effect. Alternative hypothesis (denoted H a ) is usually what we seek evidence for. We will reject H 0 and conclude H a if the data provide convincing evidence that H a is true. Evidence in the data is measured by a test statistic.

A test statistic measures how far away the corresponding sample statistic is from the parameter value(s) specified by H 0. If the sample statistic is extremely far from the value(s) in H 0, we say the test statistic falls in the rejection region and we reject H 0 in favor of H a. Example 2: We assumed the mean nightly hotel price in SC is no more than $65, but we seek evidence that the mean price is actually greater than $65. We randomly sample 64 hotels and calculate the sample mean price X 65 X. Let Z = be our test statistic here. σ / n Note: If this Z value is much bigger than zero, then we have evidence against H 0 : μ 65 and in favor of H a : μ > 65. Suppose we ll reject H 0 if Z > 1.645. If μ really is 65, then Z has a standard normal distribution. (Why?) Picture:

If we reject H 0 whenever Z > 1.645, what is the probability we reject H 0 when H 0 really is true? P(Z > 1.645 μ = 65) = This is the probability of making a Type I error (rejecting H 0 when it is actually true). P(Type I error) = level of significance of the test (denoted α). We don t want to make a Type I error very often, so we choose α to be small: The α we choose will determine our rejection region (determines how strong the sample evidence must be to reject H 0 ). In the previous example, if we choose α =.05, then Z > 1.645 is our rejection region.

Hypothesis Tests of the Population Mean In practice, we don t know σ, so we don t use the Z- statistic for our tests about μ. Use the t-statistic: the null hypothesis. t X μ = 0, where μ s / n 0 is the value in This has a t-distribution (with n 1 d.f.) if H 0 is true (if μ really equals μ 0 ). Example 2: Hotel prices: H 0 : μ = 65 H a : μ > 65 Sample 64 hotels, get X _ $ $ = 67 and s = 10. Let s set α =.05. Rejection region: Reject H 0 if t is bigger than 1.67. Conclusion:

We never accept H 0 ; we simply fail to reject H 0. This example is a one-tailed test, since the rejection region was in one tail of the t-distribution. Only very large values of t provided evidence against H 0 and for H a. Suppose we had sought evidence that the mean price was less than $72. The hypotheses would have been: H 0 : μ = 72 H a : μ < 72 Now very small values of against H 0 and for H a. t X μ = 0 would be evidence s / n Rejection region would be in left tail:

Rules for one-tailed tests about population mean H 0 : μ = μ 0 H 0 : μ = μ 0 H a : μ < μ 0 or H a : μ > μ 0 Test statistic: t = X μ 0 s / n Rejection t < -t α t > t α Region: (where t α is based on n 1 d.f.) Rules for two-tailed tests about population mean H 0 : μ = μ 0 H a : μ μ 0 Test statistic: t = X μ 0 s / n Rejection t < -t α/2 or t > t α/2 (both tails) Region: (where t α/2 is based on n 1 d.f.)

Example: We want to test (using α =.05) whether or not the true mean height of male USC students is 70 inches. Sample 26 male USC students. Sample data: X _ = 68.5 inches, s = 3.3 inches. Assumptions of t-test (and CI) about μ We assume the data come from a population that is approximately normal. If this is not true, our conclusions from the hypothesis test may not be accurate (and our true level of confidence for the CI may not be what we specify). How to check this assumption? The t-procedures are robust: If the data are close to normal, the t-test and t CIs will be quite reliable.

Hypothesis Tests about a Population Proportion We often wish to test whether a population proportion p equals a specified value. Example 1: We suspect a theater is letting underage viewers into R-rated movies. Question: Is the proportion of R-rated movie viewers at this theater greater than 0.25? We test: Recall: The sample proportion pq N p, n H 0 : p = p 0 pˆ is approximately for large n, so our test statistic for testing has a standard normal distribution when H 0 is true (when p really is p 0 ).

Rules for one-tailed tests about population proportion H 0 : p = p 0 H 0 : p = p 0 H a : p < p 0 or H a : p > p 0 Test statistic: z = pˆ p p0q n 0 0 Rejection z < -z α z > z α Region: Rules for two-tailed tests about population proportion H 0 : p = p 0 H a : p p 0 Test statistic: z = pˆ p p0q n 0 0 Rejection Region: z < -z α/2 or z > z α/2 (both tails) Assumptions of test (need large sample): Need:

Example 1: Test H 0 : p = 0.25 vs. H a : p > 0.25 using α =.01. We randomly select 60 viewers of R-rated movies, and 23 of those are underage. Example 1(a): What if we had wanted to test whether the proportion of underage viewers was different from 0.25?

P-values Recall that the significance level α is the desired P(Type I error) that we specify before the test. The P-value (or observed significance level ) of a test is the probability of observing as extreme (or more extreme) of a value of the test statistic than we did observe, if H 0 was in fact true. The P-value gives us an indication of the strength of evidence against H 0 (and for H a ) in the sample. This is a different (yet equivalent) way to decide whether to reject the null hypothesis: A small p-value (less than α) = strong evidence against the null => Reject H 0 A large p-value (greater than α) = little evidence against the null => Fail to reject H 0 How do we calculate the P-value? It depends on the alternative hypothesis.

Alternative H a : < One-tailed tests P-value Area to the left of the test statistic value in the appropriate distribution (t or z). H a : > Area to the right of the test statistic value in the appropriate distribution (t or z). Alternative H a : Two-tailed test P-value 2 times the tail area outside the test statistic value in the appropriate distribution (t or z). Double the tail area to get the P-value!

P-values for Previous Examples Hotel Price Example: H 0 : μ = 65 vs. H a : μ > 65 Test statistic value: Student height example: H 0 : μ = 70 vs. H a : μ 70 Test statistic value:

Movie theater example: H 0 : p = 0.25 vs. H a : p > 0.25 Test statistic value: What if we had done a two-tailed test of H 0 : p = 0.25 vs. H a : p 0.25 at α =.01?

Relationship between a CI and a (two-sided) hypothesis test: A test of H 0 : μ = m* vs. H a : μ m* will reject H 0 if and only if a corresponding CI for μ does not contain the number m*. Example: A 95% CI for μ is (2.7, 5.5). (1) At α = 0.05, would we reject H 0 : μ = 3 in favor of H a : μ 3? (2) At α = 0.05, would we reject H 0 : μ = 2 in favor of H a : μ 2? (3) At α = 0.10, would we reject H 0 : μ = 2 in favor of H a : μ 2? (4) At α = 0.01, would we reject H 0 : μ = 3 in favor of H a : μ 3?

Power of a Hypothesis Test Recall the significance level α is our desired P(Type I error) = P(Reject H 0 H 0 true) The other type of error in hypothesis testing: Type II error = P(Type II error) = β The power of a test is High power is desirable, but we have little control over it (different from α) Calculating Power: The power of a test about μ depends on several things: α, n, σ, and the true μ. Example 1: Suppose we test whether the true mean nicotine contents in a population of cigarettes is greater than 1.5 mg, using α = 0.01. H 0 : H a : We take a random sample of 36 cigarettes. Suppose we know σ = 0.20 mg. Our test statistic is

We reject H 0 if: Now, suppose μ is actually 1.6 (implying that H 0 is false). Let s calculate the power of our test if μ = 1.6: This is just a normal probability problem! What if the true mean were 1.65? Verify: The farther the true mean is into the alternative region, the more likely we are to correctly reject H 0.

Example 2: Testing H 0 : p = 0.9 vs. H a : p < 0.9 at α = 0.01 using a sample of size 225. Suppose the true p is 0.8. Then our power is: