First we look at some terms to be used in this section.

Similar documents
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

7 Estimation. 7.1 Population and Sample (P.91-92)

Chapter 5: HYPOTHESIS TESTING

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

STAT Chapter 8: Hypothesis Tests

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

Chapter Three. Hypothesis Testing

Sampling Distributions: Central Limit Theorem

Mathematical statistics

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Unit 19 Formulating Hypotheses and Making Decisions

Econ 325: Introduction to Empirical Economics

The Purpose of Hypothesis Testing

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

LECTURE 5. Introduction to Econometrics. Hypothesis testing

23. MORE HYPOTHESIS TESTING

Section 5.4: Hypothesis testing for μ

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Mathematical Statistics

5.2 Tests of Significance

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

EXAM 3 Math 1342 Elementary Statistics 6-7

Hypothesis tests

One-sample categorical data: approximate inference

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

8.1-4 Test of Hypotheses Based on a Single Sample

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Statistical Inference. Section 9.1 Significance Tests: The Basics. Significance Test. The Reasoning of Significance Tests.

Tests about a population mean

Two-Sample Inferential Statistics

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

CONTINUOUS RANDOM VARIABLES

Chapter 9. Hypothesis testing. 9.1 Introduction

Statistics for IT Managers

Hypothesis testing. Data to decisions

Preliminary Statistics. Lecture 5: Hypothesis Testing

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

With our knowledge of interval estimation, we can consider hypothesis tests

CHAPTER 9: HYPOTHESIS TESTING

Practice Questions: Statistics W1111, Fall Solutions

Chapter 3 Multiple Regression Complete Example

Hypothesis Testing The basic ingredients of a hypothesis test are

Hypothesis testing: Steps

Single Sample Means. SOCY601 Alan Neustadtl

CHAPTER 10 Comparing Two Populations or Groups

Inferences for Correlation

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

Math 101: Elementary Statistics Tests of Hypothesis

HYPOTHESIS TESTING. Hypothesis Testing

Two-sample Categorical data: Testing

An inferential procedure to use sample data to understand a population Procedures

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

STA Module 10 Comparing Two Proportions

Introductory Econometrics. Review of statistics (Part II: Inference)

CH.9 Tests of Hypotheses for a Single Sample

Chapter 16. Simple Linear Regression and Correlation

Inference About Two Means: Independent Samples

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Chapter 16. Simple Linear Regression and dcorrelation

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

Chapter 12: Inference about One Population

Business Statistics. Lecture 5: Confidence Intervals

Review of Statistics 101

Hypotheses Testing. 1-Single Mean

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Hypothesis Testing: One Sample

Two-sample inference: Continuous data

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Hypothesis for Means and Proportions

Hypothesis testing: Steps

STAT 201 Assignment 6

Solutions to Practice Test 2 Math 4753 Summer 2005

Statistical Process Control (contd... )

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Comparing Means from Two-Sample

AP Statistics Ch 12 Inference for Proportions

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Chapter 3. Comparing two populations

Inferences About Two Population Proportions

Multiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes

POLI 443 Applied Political Research

hypothesis a claim about the value of some parameter (like p)

Harvard University. Rigorous Research in Engineering Education

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Introduction to Statistics

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

Transcription:

8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling distribution of X using CLT. In this chapter, we look at the problem of making decision about the population parameters, especially the population mean, µ based on sample information. For example, we wish to confirm if the mean of a particular population is 90 (a hypothetical value based on our belief and/or experience) or more than 90. This statistical inference problem is known as hypothesis testing. It is very important in statistics because it helps us to draw conclusions about the population parameters. 8.2 General Concepts (P.111-114) First we look at some terms to be used in this section. Statistical hypothesis is a claim about a population parameter. This claim may or may not be true. Null hypothesis: A default or unverified statement about the population parameter is called null hypothesis. Many authors prefer to use the notation H 0 to represent a null hypothesis. Alternative hypothesis: A claim or statement about the population parameter that contradicts or against the null hypothesis. A useful notation is H 1. SydU MATH1015 (2013) First semester 1

Motivating example: Suppose that a university reports that the average UAI for 2012 entry was 90. However, an education agency claims that the university s report is incorrect and its average entry UAI was in fact greater than 90. Note: It clear that the university s claim is a default statement and the agency wants to verify it. Therefore, it is the null hypothesis about the population mean, H 0 : µ = 90, where µ = average UAI entry of all accepted students in 2012. The alternative hypothesis is the claim from the education agency against the university s claim. In our example, H 1 : µ > 90. Clearly, there is a dispute between this two groups. How can we resolve this dispute or test this claim using a good statistical argument? Method: Take a random sample of students and calculate their average UAI. Draw your conclusion based on sample information and your statistics knowledge. The procedure to decide whether the sample data are agreeable or consistent with the null hypothesis is called statistical hypothesis testing or simply hypothesis testing or test of significance. It is clear now that the decision includes some randomness. Possible outcomes in hypothesis testing: Decide H 0 is correct when in fact it is true. This is the correct decision. Decide H 0 is not correct when in fact it is true. This is an SydU MATH1015 (2013) First semester 2

incorrect decision or an error has been made in the process. Another example: A suspected person in a court is assumed to be innocent (H 0 : innocence) and a judge may decide that the suspected person is guilty only when there is strong evidence that argues again H 0 of innocence, in favor of H 1 of guilty. Every person including the judge may draw incorrect decision. But this error of drawing incorrect decision must be made as small as possible because the error of charging an innocent person guilty can be fatal. This error of deciding H 0 as incorrect when H 0 is in fact true is known as the Type I error in practice. Definition: The probability of Type I error is called the level of significance for the test. The letter α is used to represent the level of significance in hypothesis testing. That is P(Type I error) = P(Reject H 0 H 0 is true) = α. Back to the example: Take a random sample of say 25 students and calculate the sample average UAI to be x = 93 and sample s.d. to be s = 7.5. Assuming the distribution of X = individual UAI, is normal with certain variance σ 2, that is, X N(90, σ 2 ) under H 0, find the probability of getting the observed and more extreme values; that is, find P (X > 93). Solution: Since X is normal, X N(µ, σ 2 ). Then the distribution of the standardized difference between sample mean X and n hypothesized mean µ or called the test statistics is T = X µ s/ n t n 1 SydU MATH1015 (2013) First semester 3

t 24 Hence P ( X 93) = P ( T 24 93 90 7.5/ 25 ) = P (T 24 2.0) 0.025 0 2 This is the observed level of significance and it is a small probability. Therefore, either 1. the null hypothesis is correct but we observe a very rare event ( 2.5% chance), or 2. the null hypothesis is incorrect. Notes: 1. Throughout this course, unless stated otherwise, we will assume that α = 0.05 or 5%. 2. The above probability of getting the observed and more extreme values under the null hypothesis is less than 5% or very small. Therefore, it is very unlikely to observe a sample mean of 93 or higher if the true mean is 90 under H 0. This shows that the data are inconsistent with or against our assumption of H 0 : µ = 90. Conclusion: the data provides strong evidence against the null hypothesis, H 0. Definition: The probability of getting the observed and more extreme values under the condition of null hypothesis is called the P -value or observed significance level of the test. SydU MATH1015 (2013) First semester 4

Remark: MATH1015 Biostatistics Week 8 According to the previous argument, 1. small P -values (i.e., P -value < α) argue against H 0. Similarly, 2. large P -values (i.e., P -value > α) support H 0 and we say that the data are consistent with H 0. Note: When we say the data are consistent with H 0, it doesn t prove H 0 to be true because the P -value is calculated assuming H 0 is true. What we can say is that there is not sufficient evident from the data to argue against H 0. In other words, we are inconclusive about H 0 based on the data. Read pages 116-118. Now we look at hypothesis tests using the t distribution. 8.3 One-sample t-test for the mean of a Normal population (P.114-119) Recall the test discussed before for testing the UAI of 2012 entrants at a university. This test was based on a single sample and therefore such a test is called a one-sample test. Hypotheses: H 0 : µ = 90 (null hypothesis) against H 1 : µ > 90 (alternative hypothesis) Note that this alternative hypothesis is one-sided and the test is said to be one-sided as we test for values only on one side of 90. Also since n is small and σ is unknown, we use the t distribution and this kind of test is called a t-test. SydU MATH1015 (2013) First semester 5

8.4 Steps in Hypothesis Testing: t-test The steps of any hypothesis test are very similar. Since this is the first test we study, we will list them in detail for the t-test: 1. Identify the null and alternative hypotheses. 2. Write down the test statistic, t = X µ S/ n t n 1 and evaluate it using the sample mean, x, the sample standard deviation, s and the value of µ assuming H 0 is true. 3. Write down a statement for the P -value to calculate the extreme probability under H 0. 4. Draw your conclusion based on the P -value in step 3: a large P -value (> α): the data are consistent with H 0 ; a small P -value (< α): the data provide evidence against H 0. Remark: All statistical tests are based on test statistics which have different formulas, depending on parameters to be tested and assumptions on data. Note: Some authors replace the decision data are consistent with H 0 by accept H 0 and there is evidence against H 0 by reject H 0. We prefer to use the former expressions. SydU MATH1015 (2013) First semester 6

Example 1: A city health department wishes to determine the mean bacteria count per unit volume of water at a lake. The regulation says that the bacteria count per unit volume of water should be less than 200 for safety use. To test the claim that the water is safe, a researcher collected 10 water samples of unit volume and found the bacteria count to be: 175, 190, 215, 198, 184, 207, 210, 193, 196, 180 Do these data indicate that there is no cause for concern? Assume that the measurements constitute a sample from a normal population. Solution: Let µ be the current population mean bacteria count per unit volume. 1. Hypotheses: Clearly, the null hypothesis is H 0 : µ = 200. No cause for concern if the bacteria count is less than 200 and therefore, the alternative hypothesis is H 1 : µ < 200. 2. Test statistic: Since the population is normal and the sd is not known, we employ the t-test with (n 1) df using t = X µ S/ n. Now calculate the sample mean, x = 194.8 and sd, s = 13.14 from the data and calculate t using these sample values under the null hypothesis H 0. The observed test statistic is t obs = 194.8 200 13.14/ 10 = 1.25 SydU MATH1015 (2013) First semester 7

3. P -value: Since the df = 9, the P -value is t 9 P ( X 194.8) = P(T 9 1.25) = P(T 9 > 1.25) > 0.1 from the t-table with 9 df. 1.25 0 4. Conclusion: The data are consistent with H 0. In other words, we don t have sufficient evidence to show that the true mean level of bacteria is within the safety level of below 200. Example 2: It is believed that the mean plasma aluminium level for population of healthy infants is 4.13 micro gram/l and this level increases for infants receiving antacid containing aluminium. To test this claim, a random sample of 11 infants receiving antacid containing aluminium is examined and gave a mean of x = 5.20 and s = 1.13 (both are in micro gram/l). Test this claim assuming normality. Solution: Let µ be the current mean plasma aluminium level for population of infants. 1. Hypotheses: By the default, µ should be 4.13, then the null hypothesis is H 0 : µ = 4.13 It is believed that the mean plasma aluminium level for infants receiving antacid containing aluminium is higher and so the corresponding alternative hypothesis is H 1 : µ > 4.13. SydU MATH1015 (2013) First semester 8

2. Test statistic: Since the population is normal and the sd is not known, we employ the t-test with (n 1) df, using t = X µ S/ n. Now use the given sample mean and sd to calculate t under the null hypothesis H 0. That is, t obs = 5.2 4.13 1.13/ 11 = 3.14 t 10 3. P -value: Since the df = 10, the P -value is P(T 10 3.14) lies in (0.01,0.005) from the t-table with 10 df. 0 3.14 4. Conclusion: The data provide strong evidence against H 0. In other words, the true mean plasma aluminium level for infants receiving antacid containing aluminium is higher than the normal infants. Note: Clearly understand this procedure for calculating the associated P -value according to the alternative hypothesis, H 1. SydU MATH1015 (2013) First semester 9

8.5 Two-Sided Alternatives (P.120-121) In our previous examples on hypothesis testing, the alternative hypotheses considered only one direction (with > or <). For example in (1) H 1 : µ < 200 and in (2) H 1 : µ > 4.13. These H 1 look for either larger or smaller values of µ. Therefore it is reasonable call these tests as one-sided. However, in many practical problems we need to look for a specific (exact) value for the parameter, µ. We cannot accept either larger or smaller values than this specific value. In other words, we will not accept any claim if the value is different (even slightly) from the specified value. Example 3: Suppose that a manufacturing company of surgical tubes claims that its production meets the specified average diameter of 8.5 mm. A medical research team believes that the manufacturer s claim is incorrect. To test this claim they took a random sample of 16 tubes and found that the sample mean, x = 8.3 and s = 0.8 (both are in mm). Set up the null hypothesis H 0 and the alternative hypothesis H 1 in this problem. Solution: Let µ be the population mean diameter of all such potential surgical tubes. 1. Hypotheses: We need to investigate the manufacturer s claim against the medical research team. Therefore, the null hypothesis is H 0 : µ = 8.5. SydU MATH1015 (2013) First semester 10

Since any large or small tubes cannot be used in a surgery, the alternative hypothesis is H 1 : µ 8.5. 2. Test statistic: Since the population is normal and the sd is not known, we employ the t-test with (n 1) df using t = X µ S/ n. Use the given sample mean and sd to calculate t under the null hypothesis H 0. That is, t obs = 8.3 8.5 0.8/ 16 = 1.0 Note that the sample mean is below the null hypothetical value of 8.5 and therefore expect a negative value for t obs. 3. P -value: Note that the df = 15. Since we need to allow both the large and small values to argue against H 0, the P -value is calculated by 2 P (t 15 1.0) > 2 0.10 = 0.2 which is very large from the t table with 15 df. t 15 1 0 1 SydU MATH1015 (2013) First semester 11

Note: we multiply P (t 15 1.0) by 2 to accommodate both large and small values for this two-sided alternative. Remark: In general, the P -value for a two-sided t-test is: P -value = 2P(T n 1 t obs ) 4. Conclusion: The data are consistent with H 0. In other words, the true mean diameter of the tube is not significantly different from 8.5mm or the data agree with the manufacturer s claim. SydU MATH1015 (2013) First semester 12

8.6 Relationship between Hypothesis Testing and CI s (P.122-125) Suppose we are testing H 0 : µ = µ 0 vs. H 1 : µ µ 0 with a two-sided alternative. Then finding evidence against H 0 at significance level α is equivalent to µ 0 to be outside the (1 α)100% CI for µ, that is, against H 0 : µ = µ 0 µ 0 lies outside CI for µ or, consistent with H 0 : µ = µ 0 µ 0 lies inside CI for µ Remarks: Make sure that the α used for the hypothesis test is matching the confidence level of the CI. For example, we can not draw a conclusion for testing hypothesis at significance level α = 0.05, based on a 98% CI. The duality between hypothesis testing and CI is valid only for two-sided alternatives. We do not study one-sided CI in this unit, therefore we can not apply the duality for one-sided alternative hypothesis. The same relationship between hypothesis testing and CI holds for any test. That is, we can apply it for tests other than the one-sample t-test for µ. SydU MATH1015 (2013) First semester 13

Example 4: Suppose that we want to know how many hours of TV per day (on average) a university student watches. We take a sample and find that the 99% CI for the true mean µ is (3.0, 3.4). Assume that the common belief is that the mean number of hours of TV is 3.5 per day. Hence, H 0 : µ = 3.5. We disagree with this claim, so H 1 : µ 3.5. Use the 99% CI to test the hypotheses. Solution: Because the hypothesised value falls outside the CI (3.5 / (3.0, 3.4)), we can say that our data support the alternative hypothesis H 1 at the 1% significance level (α = 0.01). On the other hand, if the null hypothesis were H 0 : µ = 3.1 with H 1 : µ 3.1, then we don t have enough evidence against the null hypothesis here for α = 0.01, because 3.1 lies within the 99% CI. Exercises: Book P.134 Q1-2, Q3 Should be repeat Q2(a), Q4-5. SydU MATH1015 (2013) First semester 14