LECTURE 5. Introduction to Econometrics. Hypothesis testing

Similar documents
LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

Lectures 5 & 6: Hypothesis Testing

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

Applied Quantitative Methods II

Econometrics. 4) Statistical inference

Hypothesis Testing The basic ingredients of a hypothesis test are

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Hypotheses and Errors

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Bias Variance Trade-off

Mathematical statistics

Econ 325: Introduction to Empirical Economics

LECTURE 5 HYPOTHESIS TESTING

Chapter 12 - Lecture 2 Inferences about regression coefficient

Lecture 30. DATA 8 Summer Regression Inference

Statistical Process Control (contd... )

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

LECTURE 11. Introduction to Econometrics. Autocorrelation

Applied Econometrics (QEM)

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Sampling Distributions: Central Limit Theorem

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK

Preliminary Statistics. Lecture 5: Hypothesis Testing

POLI 443 Applied Political Research

Chapter 3 Multiple Regression Complete Example

Multiple Regression Analysis

Multiple Regression Analysis

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Hypothesis testing. Data to decisions

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

Business Statistics. Lecture 10: Course Review

coefficients n 2 are the residuals obtained when we estimate the regression on y equals the (simple regression) estimated effect of the part of x 1

Inferences for Regression

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

Statistical Inference with Regression Analysis

MS&E 226: Small Data

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Statistical inference

What is a Hypothesis?

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Introduction to the Analysis of Variance (ANOVA)

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Econometrics - 30C00200

14.30 Introduction to Statistical Methods in Economics Spring 2009

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Review of probability and statistics 1 / 31

Review of Statistics

First we look at some terms to be used in this section.

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin

ECON3150/4150 Spring 2015

Introductory Econometrics. Review of statistics (Part II: Inference)

F3: Classical normal linear rgression model distribution, interval estimation and hypothesis testing

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Chapter 14 Simple Linear Regression (A)

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima

Introduction to Statistics

Chapter Three. Hypothesis Testing

Math 101: Elementary Statistics Tests of Hypothesis

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Tests about a population mean

CIVL 7012/8012. Simple Linear Regression. Lecture 3

Mathematical Statistics

The problem of base rates

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Using regression to study economic relationships is called econometrics. econo = of or pertaining to the economy. metrics = measurement

Performance Evaluation and Comparison

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

EC402 - Problem Set 3

LECTURE 1. Introduction to Econometrics

ECON Introductory Econometrics. Lecture 2: Review of Statistics

Motivation for multiple regression

Probability and Statistics Notes

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

AP Statistics Ch 12 Inference for Proportions

Unit 19 Formulating Hypotheses and Making Decisions

Sociology 6Z03 Review II

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

CBA4 is live in practice mode this week exam mode from Saturday!

Introduction to Statistical Inference Lecture 8: Linear regression, Tests and confidence intervals

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

HYPOTHESIS TESTING. Hypothesis Testing

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

The regression model with one fixed regressor cont d

ECO375 Tutorial 4 Introduction to Statistical Inference

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

An overview of applied econometrics

Inference in Regression Model

Transcription:

LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26

ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will explain what significance of coefficients means We will learn how to read regression output Readings for this week: Studenmund, Chapter 5.1-5.4 Wooldridge, Chapter 4 2 / 26

QUESTIONS WE ASK What conclusions can we draw from our regression? What can we learn about the real world from a sample? Is it likely that our results could have been obtained by chance? If our theory is correct, what are the odds that this particular outcome would have been observed? 3 / 26

HYPOTHESIS TESTING We cannot prove that a given hypothesis is correct using hypothesis testing All that can be done is to state that a particular sample conforms to a particular hypothesis We can often reject a given hypothesis with a certain degree of confidence In such a case, we conclude that it is very unlikely the sample result would have been observed if the hypothesized theory were correct 4 / 26

NULL AND ALTERNATIVE HYPOTHESES First step in hypothesis testing: state explicitly the hypothesis to be tested Null hypothesis: statement of the range of values of the regression coefficient that would be expected to occur if the researcher s theory were not correct Alternative hypothesis: specification of the range of values of the coefficient that would be expected to occur if the researcher s theory were correct In other words: we define the null hypothesis as the result we do not expect 5 / 26

NULL AND ALTERNATIVE HYPOTHESES Notation: H0... null hypothesis H A... alternative hypothesis Examples: One-sided test H 0 : β 0 H A : β > 0 Two-sided test H 0 : β = 0 H A : β 0 6 / 26

TYPE I AND TYPE II ERRORS It would be unrealistic to think that conclusions drawn from regression analysis will always be right There are two types of errors we can make Type I : We reject a true null hypothesis Type II : We do not reject a false null hypothesis Example: H0 : β = 0 HA : β 0 Type I error: it holds that β = 0, we conclude that β 0 Type II error: it holds that β 0, we conclude that β = 0 7 / 26

TYPE I AND TYPE II ERRORS Example: H0 : The defendant is innocent HA : The defendant is guilty Type I error = Sending an innocent person to jail Type II error = Freeing a guilty person Obviously, lowering the probability of Type I error means increasing the probability of Type II error In hypothesis testing, we focus on Type I error and we ensure that its probability is not unreasonably large 8 / 26

DECISION RULE 1. Calculate sample statistic 2. Compare sample statistic with the critical value (from the statistical tables) The critical value divides the range of possible values of the statistic into two regions: acceptance region and rejection region If the sample statistic falls into the rejection region, we reject H 0 If the sample statistic falls into the acceptance region, we do not reject H 0 The idea is that if the value of the coefficient does not support H 0, the sample statistic should fall into the rejection region 9 / 26

ONE-SIDED REJECTION REGION H 0 : β 0 vs H A : β > 0 Distribution of β: Probability of Type I error Acceptance region Rejection region 10 / 26

TWO-SIDED REJECTION REGION H 0 : β = 0 vs H A : β 0 Distribution of β: Probability of Type I error Rejection region Acceptance region Rejection region 11 / 26

THE t-test We use t-test to test hypothesis about individual regression slope coefficients Test of more than one coefficient at a time (joint hypotheses) are typically done with the F-test (see next lecture) The t-test is appropriate to use when the stochastic error term is normally distributed and when the variance of that distribution is unknown These are the usual assumptions in regression analyses The t-test accounts for differences in the units of measurement of the variables 12 / 26

THE t-test Consider the model y = β 0 + β 1 x 1 + β 2 x 2 + ε Suppose we want to test (b is some constant) H 0 : β 1 = b vs H A : β 1 b We know that ( ) β 1 N β 1, Var( β 1 ) β 1 β 1 N(0, 1) Var( β 1 ) 13 / 26

THE t-test Problem: Var( β 1 ) depends on the variance of error term σ 2, which is unobservable and therefore unknown It has to be estimated as ˆσ 2 := s 2 = e e n k, k is the number of regression coefficients (here k = 3) e is the vector of residuals We denote standard error of β 1 (sample ) counterpart of standard deviation ) as s.e. ( β1 σ β1 14 / 26

THE t-test We define the t-statistic t := β 1 β 1 ) s.e. ( β1 t n k where β 1 is the estimated coefficient and β 1 is the value of the coefficient that is stated in our hypothesis This statistic depends only on the estimate β 1, our hypothesis about β 1, and it has a known distribution 15 / 26

TWO-SIDED t-test Our hypothesis is Hence, our t-statistic is H 0 : β 1 = b vs H A : β 1 b t = β 1 b ) s.e. ( β1 where β1 is the estimated regression coefficient of β 1 b is the constant from our null hypothesis ) s.e. ( β1 is the estimated standard error of β 1 16 / 26

TWO-SIDED t-test How to determine the critical value for this test statistic? The critical value is the value that distinguishes the acceptance region from the rejection region 1. We set the probability of Type I error Let s set the Type I. error to 5% We say the p-value of the test is 5% or that we have a test at 95% confidence level 2. We find the critical values in the statistical tables: t n k,0.975 and t n k,0.025 The critical value depends on the chosen level of Type I error and n k Note that t n k,0.975 = t n k,0.025 17 / 26

TWO-SIDED t-test Distribution tn-k Rejection region : p-value = 5% 2.5 % 2.5 % tn-k,0.025 tn-k,0.975 We reject H 0 if t > t n k,0.975 18 / 26

ONE-SIDED t-test Suppose our hypothesis is Our t-statistic still is H 0 : β 1 b vs H A : β 1 > b t = β 1 b ) s.e. ( β1 We set the probability of Type I error to 5% We compare our statistic to the critical value t n k,0.95 19 / 26

ONE-SIDED t-test Distribution tn-k Rejection region : p-value = 5% 5% tn-k,0.95 We reject H 0 if t > t n k,0.95 20 / 26

SIGNIFICANCE OF THE COEFFICIENT The most common test performed in regression is with the t-statistic H 0 : β = 0 vs H A : β 0 t = β ) t n k s.e. ( β If we reject H 0 : significant β = 0, we say the coefficient β is This t-statistic is displayed in most regression outputs 21 / 26

THE p-value Classical approach to hypothesis testing: first choose the significance level, then test the hypothesis at the given level of significance (e.g. 5%) However, there is no correct significance level. Or we can ask a more informative question: What is the smallest significance level at which the null hypothesis would still be rejected? This level of significance is known as the p-value. Remember that the significance level describes the probability of type I. error. The smaller the p-value, the smaller the probability of rejecting the true null hypothesis (the bigger the confidence the null hypothesis is indeed correctly rejected). The p-value for H0 : β = 0 is displayed in most regression outputs 22 / 26

EXAMPLE Let us study the impact of years of education on wages: wage = β 0 + β 1 education + β 2 experience + ε Output from Gretl: 23 / 26

CONFIDENCE INTERVAL A 95% confidence interval of β is an interval centered around β such that β falls into this interval with probability 95% ) P ( β c < β < β + c = c = P ) < s.e. ( β β β c ) < ) = 0.95 s.e. ( β s.e. ( β Since β β s.e.( β) t n k, we derive the confidence interval: ) β ± t n k,0.975 s.e. ( β 24 / 26

CONFIDENCE INTERVAL Output from Gretl (wage regression): Confidence interval ) for coefficient on education: β ± t n k,0.975 s.e. ( β = 0.644 ± 1.960 0.054 β [0.538; 0.750] with 95% probability 25 / 26

SUMMARY We discussed the principle of hypothesis testing We derived the t-statistic We defined the concept of the p-value We explained what significance of a coefficient means We observed a regression output on an example 26 / 26