Lecture notes to Stock and Watson chapter 4

Similar documents
Environmental Econometrics

Quantitative Analysis of Financial Markets. Summary of Part II. Key Concepts & Formulas. Christopher Ting. November 11, 2017

Lecture notes to Stock and Watson chapter 8

Lecture notes to Stock and Watson chapter 12

Econometrics A. Simple linear model (2) Keio University, Faculty of Economics. Simon Clinet (Keio University) Econometrics A October 16, / 11

1. The Multivariate Classical Linear Regression Model

Lecture 5. In the last lecture, we covered. This lecture introduces you to

Simple Linear Regression: The Model

2 Regression Analysis

Two-Variable Regression Model: The Problem of Estimation

Testing Linear Restrictions: cont.

Föreläsning /31

Microeconometria Day # 5 L. Cembalo. Regressione con due variabili e ipotesi dell OLS

STAT5044: Regression and Anova. Inyoung Kim

Least Squares Estimation-Finite-Sample Properties

ECON 4230 Intermediate Econometric Theory Exam

INTRODUCTORY ECONOMETRICS

Lecture 3: Multiple Regression

Math 1710 Class 20. V2u. Last Time. Graphs and Association. Correlation. Regression. Association, Correlation, Regression Dr. Back. Oct.

ECON3150/4150 Spring 2015

Basic Econometrics - rewiev

Simple Linear Regression Analysis

MBF1923 Econometrics Prepared by Dr Khairul Anuar

Regression. ECO 312 Fall 2013 Chris Sims. January 12, 2014

Lecture 4: Multivariate Regression, Part 2

9. Linear Regression and Correlation

Simple Linear Regression

SF2930: REGRESION ANALYSIS LECTURE 1 SIMPLE LINEAR REGRESSION.

Sample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them.

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

A time series plot: a variable Y t on the vertical axis is plotted against time on the horizontal axis

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM

(c) i) In ation (INFL) is regressed on the unemployment rate (UNR):

Regression. Econometrics II. Douglas G. Steigerwald. UC Santa Barbara. D. Steigerwald (UCSB) Regression 1 / 17

Introduction to Estimation Methods for Time Series models. Lecture 1

Econometrics. 9) Heteroscedasticity and autocorrelation

Association studies and regression

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Lectures on Simple Linear Regression Stat 431, Summer 2012

Statistical Inference. Part IV. Statistical Inference

F9 F10: Autocorrelation

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests

Violation of OLS assumption - Heteroscedasticity

Economics 308: Econometrics Professor Moody

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

AUTOCORRELATION. Phung Thanh Binh

ECON Introductory Econometrics. Lecture 4: Linear Regression with One Regressor

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

11. Bootstrap Methods

Lecture 4: Multivariate Regression, Part 2

Essential of Simple regression

Multivariate Regression: Part I

Simple linear regression

Economics 620, Lecture 2: Regression Mechanics (Simple Regression)

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Linear Regression. y» F; Ey = + x Vary = ¾ 2. ) y = + x + u. Eu = 0 Varu = ¾ 2 Exu = 0:

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Econometrics. 4) Statistical inference

L2: Two-variable regression model

Estadística II Chapter 4: Simple linear regression

P1.T2. Stock & Watson Chapters 4 & 5. Bionic Turtle FRM Video Tutorials. By: David Harper CFA, FRM, CIPM

Introductory Econometrics

Econometrics II Censoring & Truncation. May 5, 2011

MFin Econometrics I Session 4: t-distribution, Simple Linear Regression, OLS assumptions and properties of OLS estimators

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects

Multiple Regression Analysis

Empirical Market Microstructure Analysis (EMMA)

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope

The Simple Regression Model. Part II. The Simple Regression Model

7.0 Lesson Plan. Regression. Residuals

Generalized Least Squares

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

FENG CHIA UNIVERSITY ECONOMETRICS I: HOMEWORK 4. Prof. Mei-Yuan Chen Spring 2008


The Simple Linear Regression Model

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

Heteroscedasticity and Autocorrelation

Regression of Time Series

Review of Econometrics

Practice Questions for the Final Exam. Theoretical Part

REVIEW (MULTIVARIATE LINEAR REGRESSION) Explain/Obtain the LS estimator () of the vector of coe cients (b)

Lecture 15: Chapter 10

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

Linear Regression Spring 2014

The Simple Regression Model. Simple Regression Model 1

Econometrics Summary Algebraic and Statistical Preliminaries

Multiple Regression Analysis

Introductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1

ECON3150/4150 Spring 2016

The OLS Estimation of a basic gravity model. Dr. Selim Raihan Executive Director, SANEM Professor, Department of Economics, University of Dhaka

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

The regression model with one fixed regressor cont d

Simple Regression Model (Assumptions)

15.0 Linear Regression

CHAPTER 6: SPECIFICATION VARIABLES

Econometrics I Lecture 3: The Simple Linear Regression Model

CIVL 7012/8012. Simple Linear Regression. Lecture 3

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94

Transcription:

Lecture notes to Stock and Watson chapter 4 Introductory linear regression Tore Schweder Sept 2009 TS () LN3 03/09 1 / 15

Regression "Regression" is due to Francis Galton (1822-1911): how is a son s height related to his father s height? TS () LN3 03/09 2 / 15

Regression towards the mean Figure: Galton s original diagram. Parents-child pairs by heigth. TS () LN3 03/09 3 / 15

The geometry of linear regression Figure: The regression curve is y = f (x) = E [Y jx = x]. TS () LN3 03/09 4 / 15

The linear regression model Question: how is a response variable Y related to a stimulus variable X (explanatory/control/explanatory)? Assuming a linear relation, 1 what is the slope? TS () LN3 03/09 5 / 15

The linear regression model Question: how is a response variable Y related to a stimulus variable X (explanatory/control/explanatory)? Assuming a linear relation, 1 what is the slope? 2 is the slope positive? TS () LN3 03/09 5 / 15

The linear regression model Question: how is a response variable Y related to a stimulus variable X (explanatory/control/explanatory)? Assuming a linear relation, 1 what is the slope? 2 is the slope positive? 3 how good does the estimated line t the observed data? TS () LN3 03/09 5 / 15

The linear regression model Question: how is a response variable Y related to a stimulus variable X (explanatory/control/explanatory)? Assuming a linear relation, 1 what is the slope? 2 is the slope positive? 3 how good does the estimated line t the observed data? Example: Y = growth in BNP, X = in ation previous year. TS () LN3 03/09 5 / 15

The linear regression model Question: how is a response variable Y related to a stimulus variable X (explanatory/control/explanatory)? Assuming a linear relation, 1 what is the slope? 2 is the slope positive? 3 how good does the estimated line t the observed data? Example: Y = growth in BNP, X = in ation previous year. D : A sample of size n of pairs explanatory variables X and response variable Y. TS () LN3 03/09 5 / 15

The linear regression model Question: how is a response variable Y related to a stimulus variable X (explanatory/control/explanatory)? Assuming a linear relation, 1 what is the slope? 2 is the slope positive? 3 how good does the estimated line t the observed data? Example: Y = growth in BNP, X = in ation previous year. D : A sample of size n of pairs explanatory variables X and response variable Y. M : (X 1, Y 1 ),, (X n, Y n ) is an iid random sample from an in nite population. Y = β 0 + β 1 X + u; E [Y jx = x] = β 0 + β 1 x, Eu = 0, cov(u, X ) = 0. β 0, β 1 are parameters. TS () LN3 03/09 5 / 15

OLS Least squares and regression was known by Laplace and Gauss (80 years before Galton). b β 0, bβ 1 = arg min b 0,b 1 bβ 1 = s XY s 2 X n i=1 (Y i (b 0 + b 1 X i )) 2 = r s Y s X, bβ 0 = Y bβ 1 X Predicted response given the stimuli: by i = bβ 0 + bβ 1 X i is the tted value. TS () LN3 03/09 6 / 15

OLS Least squares and regression was known by Laplace and Gauss (80 years before Galton). b β 0, bβ 1 = arg min b 0,b 1 bβ 1 = s XY s 2 X n i=1 (Y i (b 0 + b 1 X i )) 2 = r s Y s X, bβ 0 = Y bβ 1 X Predicted response given the stimuli: by i = bβ 0 + bβ 1 X i is the tted value. (Empirical) residual: bu i = Y i by i is the vertical distance from the tted value to the observed value of Y. bu = 0, r bux = 0 TS () LN3 03/09 6 / 15

Measures of t Homoscedasticity when var(y jx = x) = σ 2 is independent of x. TS () LN3 03/09 7 / 15

Measures of t Homoscedasticity when var(y jx = x) = σ 2 is independent of x. 2 Then bσ 2 = 1 n 2 n i=1 Y i by i = 1 n 2 RSS is unbiased. TS () LN3 03/09 7 / 15

Measures of t Homoscedasticity when var(y jx = x) = σ 2 is independent of x. 2 Then bσ 2 = 1 n 2 n i=1 Y i by i = 1 n 2 RSS is unbiased. TSS = n i=1 Y i Y 2 = n i=1 Y i 2 by i + n i=1 Y bi 2 Y = RSS + ESS TS () LN3 03/09 7 / 15

Measures of t Homoscedasticity when var(y jx = x) = σ 2 is independent of x. 2 Then bσ 2 = 1 n 2 n i=1 Y i by i = 1 n 2 RSS is unbiased. TSS = n i=1 Y i Y 2 = n i=1 Y i by i 2 + n i=1 b Y i Y 2 = RSS + ESS R 2 = ESS TSS = 1 RSS TSS = r 2 XY TS () LN3 03/09 7 / 15

Be aware of extreme observations! Figure: The OLS line hinges on one point. Figure: Heavy tails in the conditional distribution of Y jx distorts the OLS line. TS () LN3 03/09 8 / 15

Desired number of children and age for beginning master students n Children Age OLS ρ R 2 2008 38 1.92 23.8 C = 1.88 + 0.002A 0.004 0.00002 2009 29 2.13 25.4 C = 1.61 + 0.021A 0.06 0.004 Children + rnorm(30, 0, 0.1) 0 1 2 3 4 5 Children 0 1 2 3 4 5 25 30 35 A ge 2009 20 22 24 26 28 30 32 34 A ge 2008 TS () LN3 03/09 9 / 15

Desired number of children and number of siblings, 2009 TS () LN3 03/09 10 / 15

0 1 2 3 4 5 0 2 4 6 Siblings Children Fitted values Figure: STATA scatter diagram of Children vs. Siblings, with OLS line. Some points represent several observations. Is the (slightly) positive relation between Age and Children due to Siblings being an intermediate variable? Age!Siblings!Children, but controling for siblings, no e ect of Age on Siblings? TS () LN3 03/09 11 / 15

Constructed model and simulated data y = number of children 0 1 2 3 4 x = age (and probability) 20 25 30 35 40 y 20 25 30 35 40 x = age (and probability) 0 1 2 3 4 y = number of children 0 1 2 3 4 var(y1x=x) 0.0 0.4 0.8 1.2 20 25 30 35 x 20 25 30 35 40 x = age Figure: UL: E [Y = yjx = x] = 2.02 + 0.101x, and P [Y = yjx = x] shown by horizontal thick lines for some values of x; UR: the same, but axes interchanged; LL the scatter plot of a simulated sample of size n = 43, the points are slightly jittered, with the OLS line y = 1.9 + 0.097x; LR: var [Y jx = x]. TS () LN3 03/09 12 / 15

Constructed model - repeated simulations density 0.0 0.05 0.10 0.15 0.20 density 0 1 2 3 4 5 10 5 0 5 beta0 0.2 0.0 0.2 0.4 beta1 density 0.0 0.1 0.2 0.3 0.4 density 0.0 0.1 0.2 0.3 0.4 4 2 0 2 4 4 2 0 2 4 T0 Figure: Approximate densities of bβ 0 (UL), bβ 1 (UR), T 0 = b β 0 β 0 /SE b β 0 (LL), and T 1 = b β 1 β 1 /SE b β 1 (LL), based on 1000 replicates of simulated data of size n = 43 in constructed model for (X, Y ).Dotted curve is the normal density. TS () LN3 03/09 13 / 15 T1

Constructed model - repeated simulations (cont) beta0.hat 10 5 0 5 beta1.hat 0.1 0.1 0.3 2 0 2 Quantiles of Standard Normal 2 0 2 Quantiles of Standard Normal T0 2 0 2 4 T1 4 2 0 2 4 2 0 2 Quantiles of Standard Normal 2 0 2 Quantiles of Standard Normal Figure: Normal probability plots (QQ-plots against N(0,1)) for the same simulated material as in previous gure. TS () LN3 03/09 14 / 15

Problems to be done in class SW: 4.3 TS () LN3 03/09 15 / 15