Hypothesis Testing The basic ingredients of a hypothesis test are

Similar documents
Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Unit 19 Formulating Hypotheses and Making Decisions

Preliminary Statistics. Lecture 5: Hypothesis Testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

Mathematical statistics

Introductory Econometrics. Review of statistics (Part II: Inference)

Topic 17 Simple Hypotheses

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Introduction to Statistics

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Introductory Econometrics

Probability and Statistics Notes

LECTURE 5 HYPOTHESIS TESTING

What is a Hypothesis?

Topic 17: Simple Hypotheses

8.1-4 Test of Hypotheses Based on a Single Sample

Section 5.4: Hypothesis testing for μ

STAT 3A03 Applied Regression With SAS Fall 2017

Hypotheses Test Procedures. Is the claim wrong?

Mathematical Statistics

Simple and Multiple Linear Regression

MTMS Mathematical Statistics

Visual interpretation with normal approximation

In any hypothesis testing problem, there are two contradictory hypotheses under consideration.

First we look at some terms to be used in this section.

MAT2377. Rafa l Kulik. Version 2015/November/26. Rafa l Kulik

School of Mathematical Sciences. Question 1

Inference for Regression

CHAPTER 9: HYPOTHESIS TESTING

Business Statistics. Lecture 10: Correlation and Linear Regression

14.30 Introduction to Statistical Methods in Economics Spring 2009

The Purpose of Hypothesis Testing

Econ 325: Introduction to Empirical Economics

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Inferences for Regression

Lectures 5 & 6: Hypothesis Testing

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Hypothesis tests

UNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017

HYPOTHESIS TESTING. Hypothesis Testing

Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.

20 Hypothesis Testing, Part I

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Chapter 3 Multiple Regression Complete Example

Bias Variance Trade-off

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing

Ch 2: Simple Linear Regression

Statistical Tests. Matthieu de Lapparent

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

CENTRAL LIMIT THEOREM (CLT)

23. MORE HYPOTHESIS TESTING

Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing

Sampling and Sample Size. Shawn Cole Harvard Business School

Scatter plot of data from the study. Linear Regression

Statistical Inference. Hypothesis Testing

Multiple Regression Analysis

Multiple Linear Regression

Simple Linear Regression

Statistical Inference

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Finansiell Statistik, GN, 15 hp, VT2008 Lecture 10-11: Statistical Inference: Hypothesis Testing

MS&E 226: Small Data

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

Hypothesis testing. Data to decisions

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

1 Least Squares Estimation - multiple regression.

Burden of Proof: Economic Analysis

Scatter plot of data from the study. Linear Regression

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

Chapter 9. Hypothesis testing. 9.1 Introduction

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing

Tests about a population mean

Psychology 282 Lecture #4 Outline Inferences in SLR

STAT Chapter 8: Hypothesis Tests

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

5.2 Tests of Significance

Variance Estimates and the F Ratio. ERSH 8310 Lecture 3 September 2, 2009

Linear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.

Simple Linear Regression

Introduction to the Analysis of Variance (ANOVA)

Math 101: Elementary Statistics Tests of Hypothesis

The regression model with one fixed regressor cont d

Lecture 6 Multiple Linear Regression, cont.

Hypotheses and Errors

Statistical Process Control (contd... )

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Midterm 2 - Solutions

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

WISE International Masters

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

Some General Types of Tests

Transcription:

Hypothesis Testing The basic ingredients of a hypothesis test are 1 the null hypothesis, denoted as H o 2 the alternative hypothesis, denoted as H a 3 the test statistic 4 the data 5 the conclusion.

The hypotheses are usually statements about the values of one or more unknown parameters, denoted as θ here. The null hypothesis is usually a more restrictive statement than the alternative hypothesis, e.g. H o : θ = θ o, H a : θ θ o. The burden of proof is on the alternative hypothesis. We will continue to believe in the null hypothesis unless there is very strong evidence in the data to refute it. The test statistic measures agreement of the data with the null hypothesis. It is a reasonable combination of the data and the hypothesized value of the parameter. It gets bigger when the data agrees less with the null hypothesis.

When ˆθ is an estimator for θ with standard error sˆθ, a common test statistic has the form z = ˆθ θ o. sˆθ When the data agrees perfectly with the null hypothesis, z = 0. When the estimated and hypothesized values for θ become farther apart, z increases in magnitude.

There are two closely related approaches to testing. 1 One weighs the evidence against H o. 2 The other ends in a decision to reject or not to reject H o.

Weighing the evidence This approach uses the significance probability or P-value, the probability of obtaining a value of the test statistic as or more extreme than the value actually observed, assuming that H o is true. This requires knowledge of the distribution of the test statistic under the assumption that H o is true, the null distribution.

For the two-sided alternative and test statistic mentioned above, the P-value is P = 2Pr(z z observed ) The factor 2 is required because a priori the sign of z observed is not known, and large (in magnitude) negative and positive values of z give evidence against H o. Sometimes we use a one-sided alternative, H a : θ > θ o or H a : θ < θ o. In these cases P = Pr(z z observed ) and P = Pr(z z observed ) respectively.

The strength of the evidence against H o is determined by the size of the P-value. A smaller value for P gives stronger evidence. The logic is that if H o is true, extreme values for the test statistic are unlikely, and therefore a possible indication that H o is not true. By convention we draw the following conclusions P value Strength of evidence against H o >.10 none (.05,.10] weak (.01,.05] strong <.01 very strong

When P <.01, for example, we could say that the results are statistically significant at the.01 level, or we have very strong evidence against the null hypothesis.

Decision approach I The second approach to hypothesis testing requires a decision be made whether or not to reject H o. One way to do this is to compare the P value to a small cut-off called the significance level α and to reject H o if P α. Another way is to choose a rejection region and to reject H o if the test statistic falls in this region. Two types of error are possible with this approach: 1 A type I error occurs if H o is rejected when it is true. 2 A type II error occurs if H o is not rejected when it is false. The type I error is considered to be much more important than the type II error.

Decision approach II A common analogy is with a court of law. In murder cases the presumption of innocence (H o ) is rejected only when the jury is convinced beyond a shadow of a doubt by very strong evidence (an extreme value for the test statistic). The type I error would be to convict and hang the accused (reject H o ) when he is innocent (H o is true). The type II error, considered less serious, would be to let a guilty man go free (don t reject H o when it is false). Recognizing the seriousness of the type I error, the rejection region is chosen so that the probability of rejecting H o when it is true is a small value α. For example, the test statistic z discussed above frequently has an approximate normal distribution. For the two-sided alternative, with α =.05, the rejection region consists of the values z z α/2 = 1.96.

Decision approach III When the data is assumed to be normally distributed and the variance is unknown and estimated by a sample variance, we use the t distribution. Finally, the data is collected and the test statistic is computed. If the test statistic falls in the rejection region we reject H o at level α. Otherwise we do not reject H o at level α.

Remember that A rejected H o may in fact be true. An H o which is not rejected is probably not true either (This is why I never say H o is accepted ). A result which is statistically significant (i.e. we have rejected H o ) may have no practical significance. With a very large sample size almost any H o will be rejected.

Hypothesis testing in simple linear regression I The most common and useful test is whether or not the relationship between the response and predictor is significant. H 0 : β 1 = 0, there is no linear relationship H a : β 1 0, there is a linear relationship The alternative is usually two sided. The test statistic is T = ˆβ 1 se( ˆβ 1 ) and this is compared to the t n 2 distribution. Here se( ˆβ 1 ) = s/ S XX. On occasion, we specify a value β 1,0 other than 0 in the null hypothesis.

Hypothesis testing in simple linear regression II Then the test statistic becomes T = ˆβ 1 β 1,0 se( ˆβ 1 ). One can also test hypotheses about the intercept H 0 : β 0 = β 0,0, H a : β 0 β 0,0. Often we are interested in whether the intercept is zero, so β 0,0 = 0. The test statistic is T = ˆβ 0 β 0,0 se( ˆβ 0 ) and this is compared to the t n 2 distribution.

Tests on the mean of Y at x 0 H 0 : µ x0 = µ x0,0, H a : µ x0 µ x0,0 The test statistic is T = ˆµ x 0 µ x0,0 se(ˆµ x0 ) and this is compared to the t n 2 distribution.