2SLS Estimates ECON In this case, begin with the assumption that E[ i

Similar documents
ε. Therefore, the estimate

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Multiple Choice Test. Chapter Adequacy of Models for Regression

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

Wu-Hausman Test: But if X and ε are independent, βˆ. ECON 324 Page 1

Econometric Methods. Review of Estimation

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

ρ < 1 be five real numbers. The

CHAPTER 6. d. With success = observation greater than 10, x = # of successes = 4, and

Training Sample Model: Given n observations, [[( Yi, x i the sample model can be expressed as (1) where, zero and variance σ

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Introduction to Econometrics (3 rd Updated Edition, Global Edition) Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 9

Simulation Output Analysis

Chapter 4 Multiple Random Variables

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

X ε ) = 0, or equivalently, lim

Linear Regression with One Regressor

Random Variables. ECE 313 Probability with Engineering Applications Lecture 8 Professor Ravi K. Iyer University of Illinois

Third handout: On the Gini Index

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

MATH 371 Homework assignment 1 August 29, 2013

Functions of Random Variables

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

ESS Line Fitting

STK3100 and STK4100 Autumn 2017

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

Sampling Theory MODULE V LECTURE - 14 RATIO AND PRODUCT METHODS OF ESTIMATION

STA302/1001-Fall 2008 Midterm Test October 21, 2008

Lecture 2: Linear Least Squares Regression

Simple Linear Regression

Can we take the Mysticism Out of the Pearson Coefficient of Linear Correlation?

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

Summary of the lecture in Biostatistics

TESTS BASED ON MAXIMUM LIKELIHOOD

Lecture Notes Types of economic variables

STK4011 and STK9011 Autumn 2016

Chapter Two. An Introduction to Regression ( )

ENGI 4421 Propagation of Error Page 8-01

ECON 5360 Class Notes GMM

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Multiple Linear Regression Analysis

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

MEASURES OF DISPERSION

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Lecture 3 Probability review (cont d)

STRONG CONSISTENCY FOR SIMPLE LINEAR EV MODEL WITH v/ -MIXING

Lecture 2: The Simple Regression Model

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)

Chapter 5 Properties of a Random Sample

Lecture Notes Forecasting the process of estimating or predicting unknown situations

Simple Linear Regression

COV. Violation of constant variance of ε i s but they are still independent. The error term (ε) is said to be heteroscedastic.

Class 13,14 June 17, 19, 2015

Lecture Note to Rice Chapter 8

D KL (P Q) := p i ln p i q i

Chapter 5 Properties of a Random Sample

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

2. Independence and Bernoulli Trials

STK3100 and STK4100 Autumn 2018

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

ENGI 3423 Simple Linear Regression Page 12-01

Correlation and Simple Linear Regression

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

Econometrics. 3) Statistical properties of the OLS estimator

Qualifying Exam Statistical Theory Problem Solutions August 2005

Chapter 9 Jordan Block Matrices

Chapter 11 Systematic Sampling

Multivariate Transformation of Variables and Maximum Likelihood Estimation

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek

Lecture 02: Bounding tail distributions of a random variable

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

Logistic regression (continued)

Module 7. Lecture 7: Statistical parameter estimation

Idea is to sample from a different distribution that picks points in important regions of the sample space. Want ( ) ( ) ( ) E f X = f x g x dx

Mu Sequences/Series Solutions National Convention 2014

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

CS 2750 Machine Learning Lecture 5. Density estimation. Density estimation

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

Chapter Statistics Background of Regression Analysis


Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b

CHAPTER 4 RADICAL EXPRESSIONS

Lecture 8: Linear Regression

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Sampling Theory MODULE X LECTURE - 35 TWO STAGE SAMPLING (SUB SAMPLING)

Maximum Likelihood Estimation

Chapter 4 Multiple Random Variables

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

Lecture 9. Some Useful Discrete Distributions. Some Useful Discrete Distributions. The observations generated by different experiments have

Chain Rules for Entropy

1 Onto functions and bijections Applications to Counting

Lecture 3. Sampling, sampling distributions, and parameter estimation

Transcription:

SLS Estmates ECON 3033 Bll Evas Fall 05 Two-Stage Least Squares (SLS Cosder a stadard lear bvarate regresso model y 0 x. I ths case, beg wth the assumto that E[ x] 0 whch meas that OLS estmates of wll roduced based ad cosstet estmates. Suose however that there s a varable z that drectly macts x but t s ucorrelated wth. Ths ca be exressed as E[ z] 0. The varable z s kow as a strumet the model. The mlct assumto s that z macts x ad x macts y. Therefore, f we were to somehow shock x wth z, we ca fer the relatosh betwee x ad y. Ths s doe by cosderg the relatosh betwee x ad z as a smle lear model. Ths frst-stage relatosh betwee x ad z s gve by the equato x z u 0 Where we assume z ad u are ucorrelated. Note that x has two comoets. Oe s a determstc oe that s a fucto of z, x 0 z. Because z s ucorrelated wth ε the ths redctve comoet of x s also ucorrelated wth ε. Sce x s correlated wth ε ad x s ot, t must be that what s drvg the correlato betwee x ad ε s the u. The suggesto the s stead of usg x the regresso y 0 x -- whch we kow would roduce based estmates -- why ot use x stead. The x comoet of x cotas formato about x, but t s NOT correlated wth ε so t should roduce estmates wth ce roertes. That s a great dea excet that we do ot kow 0 or ad therefore, we caot use x a regresso. We ca however use the ext best thg: a ubased estmate of x whch s roduced from a frst-stage regresso of x o z. Ths two-stage least squares rocedure s very straght forward. Ru a regresso of the form x z u. It should be o surrse that the OLS estmates for the arameters 0 ad are 0 x z ad 0 x. ( z z ( x x. Wth these estmates we ca costruct a redcted value for ( z z 0 z Istead of usg x the estmates for -- whch roduces ols ( x x( y y ( x x

use the redcted value of x, x 0 z, stead. The redcted value s a lear fucto of z whch s ucorrelated wth so the SLS estmate for should have ce roertes. The SLS estmate for s therefore sls x( y y. x Workg wth the SLS estmate Gve that x 0 z t s clear that x 0 zad hece x ( x z z. Substtute ths to the sls equato for ad we obta x( y y ( z z ( y y ( z z ( y y ( z z ( y y sls x ( z z ( z z ( z z Substtute the defto of to ths equato ad we obta ( z z ( y y ( z z ( y y ( z z ( y y sls ( z z ( x x ( z z ( z z ( z z ( z z ( x x Dvde the umerator ad deomator by (- ad we ote the SLS estmte s therefore the rato of the covarate betwee y ad z dvded by the covarace betwee x ad z. ( z z ( y y ( z z ( y y / ( ( z z ( x x ( z z ( x x / ( sls yz sls Is a ubased estmate?

Aytme we have a ew radom varable, we always ask what s the exected value ad the varace? Ths s a dffcult questo. I the ed, we are uable to show that sls s a ubased estmate ths cotext? Why? I order to demostrate the roertes of the estmate, we always substtute the truth back to the model. Start wth the estmate for sls ad ote that we ca dro oe of the meas both the umeatror ad deomator. ( z z ( y y ( z z y sls ( z z ( x x ( z z x At ths ot, we usually substtute the defto of y to the umerator. However, x s also a radom varable that s a fucto of the error v so we would eed to substtute that defto as well. ( z z y ( z z ( x 0 sls ( z z x ( z z ( z u 0 The roblem s that we ow have radom varables the umerator ad the deomator. If a ad b are radom varables E[a/b] E[a]/E[b]. Wthout kowg the jot dstrbuto of the umerator ad deomator, we caot fgure out the exected value of the estmate. Our oly oto s the to cosder whether the estmate s cosstet. The cosstecy of SLS estmates ( z z ( y y To show whether the SLS estmate s cosstet, start wth the defto sls dro ( x x( z z y ad substtute the defto of y. Ths roduces sls ( z z ( 0 x. Exad the ( x x( z z umerator, ad we get ( z z ( z z x ( z z. The frst ( x x( z z ( x x( z z ( x x( z z 0 sls 3

term equals zero because sums of devato from meas equal zero. The secod term equals ad the ( z z whole term reduces to sls. Dvde the umerator ad deomator of the rght ( x x( z z ( z z / ( had term by (- ad you get ( x x( z z / ( sls ze. Now we ask what haes to ths estmate as the samle szes creases to fty. Note that f a ad b are estmates, the lm(a/b=lm(a/lm(b. Note also that the lm( ad lm(, ad therefore, t s z z o surrse that SLS ze lm(. I ths case, the estmate SLS wll oly be cosstet f lm( 0. To roduce cosstet estmates, t must be the case that z oly roduces a chage z z y through a tal chage z. If z has a drect effect o y the lm( 0 ad the model wll z z roduce cosstet estmates. A alteratve terretato of SLS estmates The structural equato of terest s y 0 x. We atcate that x s correlated wth ε so we caot estmate ths model by OLS. We do however have a varable z that we beleve s redctve of x but t s ot drectly correlated wth ε. The frst-stage equato that relatos x to z s gve by the equato x z u. Substtute ths equato to the equato for y ad oe obtas the equato 0 y ( z u ( z u 0 0 0 0 I ths model, defe ( 0 0 0 let ad let u v. Ths allows us to wrte the equato for y as y 0 z v. Ths equato s referred to as the reduced-form ad t reresets the y correlato betwee z ad y. The coeffcet s. Note that the coeffcet has a terestg z terretato. The model we are suggestg s that y s a fucto of x ad x s a fucto of z. Therefore y=f(x(z. If we take the dervatve of y wth resect to z, because we assume z oly chages y through a 4

y y x chage x, we obta z x z. Note that yx =, whch s exactly. Therefore, f we xz yx take the reduced form estmate for whch equals xz = ad dvde t by the st stage coeffcet we get. It s also the case that sls. ( z z ( y y ( z z ( z z ( y y sls ( z z ( x x ( z z ( x x ( z z whch s exactly the same estmate we got above. To reca, we call the tal equato the structural equato of terest : y 0 x To estmate ths va SLS or drect least squares, we eed a frst-stage relatosh x 0 z u The drect relatosh betwee the strumet ad the outcome of terest s called the reduced-form. y z v 0 sls The varace of Note that f we were to estmate the regresso y 0 x by OLS, the varace for would be ols Var( ols ( x x. The varace s roortoal to how much varace x s used to roduce the estmate. I the case of SLS, we are ot use x but stead, x. Therefore Recall that a regresso, SST=SSM+SSE ad ths case result, by costructo, ols sls ( x ( x x x Var( sls ( x ( x x x v x. As a whch meas also that by costructo Var( Var(. The beeft of SLS s that we are oly usg a art of the varato x that art. 5

that s ucorrelated wth -- so our estmates are cosstet. However, because we are usg smaller varato x that the OLS case, the cost of SLS s a reducto recso the value sls Var( creases cosderably. What wll roduce smaller values for ( sls Var? Start wth the defto Var( sls z z remember that x x ( z z. Substtute ths ad we get sls Var( ( ( z z ( x x ad substtute ths to the varace equato ad you get ( z z x ad. Note that Var( ( z z sls ( z z ( x x ( ( z z x x ( z z ( z z The deomator ths term ( z z ( x x s othg more that (- xz whle the umerator ca be thought of as ( z z (. Therefore, z Var( ( z z sls ( z z ( ( xz xz ( z z ( x x ad we wll reduce the ( sls Var s f we ca fd a varable z that s strogly correlated wth x. At the other ed of the sectrum, suose xz aroaches zero ths meas z does ot exla much of x. What s the cost? I ths case, f z does ot exla much of x, the we caot lear x s mact o y ad the sls Var( wll exlode. 6