Instrumental Variables and the Problem of Endogeneity

Similar documents
IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Ordinary Least Squares Regression

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

8. Instrumental variables regression

Fixed Effects Models for Panel Data. December 1, 2014

Dealing With Endogeneity

Instrumental Variables Estimation in Stata

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Instrumental Variables, Simultaneous and Systems of Equations

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour

Handout 12. Endogeneity & Simultaneous Equation Models

An overview of applied econometrics

Applied Health Economics (for B.Sc.)

1 Motivation for Instrumental Variable (IV) Regression

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Specification testing in panel data models estimated by fixed effects with instrumental variables

Econometrics. 8) Instrumental variables

Click to edit Master title style

EC402 - Problem Set 3

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Motivation for multiple regression

Econometrics of Panel Data

IV and IV-GMM. Christopher F Baum. EC 823: Applied Econometrics. Boston College, Spring 2014

Ec1123 Section 7 Instrumental Variables

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

ECO375 Tutorial 8 Instrumental Variables

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Mgmt 469. Causality and Identification

ECO375 Tutorial 9 2SLS Applications and Endogeneity Tests

We begin by thinking about population relationships.

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Linear IV and Simultaneous Equations

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Session IV Instrumental Variables

Introduction to Panel Data Analysis

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

IV estimators and forbidden regressions

Rockefeller College University at Albany

Instrumental Variables


Econometrics for PhDs

Economics 308: Econometrics Professor Moody

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Instrumental variables estimation using heteroskedasticity-based instruments

Linear Panel Data Models

Applied Statistics and Econometrics

Lecture Module 8. Agenda. 1 Endogeneity. 2 Instrumental Variables. 3 Two-stage least squares. 4 Panel Data: First Differencing

Multiple Linear Regression CIVL 7012/8012

Econometrics Midterm Examination Answers

Instrumental variables: Overview and advances

Economics 241B Estimation with Instruments

Econometrics - 30C00200

Birkbeck Working Papers in Economics & Finance

08 Endogenous Right-Hand-Side Variables. Andrius Buteikis,

An example to start off with

Missing dependent variables in panel data models

Topic 10: Panel Data Analysis

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Chapter 2: simple regression model

Autocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time

Endogeneity. Tom Smith

Homework Set 2, ECO 311, Fall 2014

Econ 510 B. Brown Spring 2014 Final Exam Answers

Applied Statistics and Econometrics

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Simultaneous Equation Models

Instrumental Variables

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Linear Models in Econometrics

Greene, Econometric Analysis (7th ed, 2012)

Applied Quantitative Methods II

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

What s New in Econometrics. Lecture 13

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

The Simple Linear Regression Model

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

5. Erroneous Selection of Exogenous Variables (Violation of Assumption #A1)

Dynamic Panel Data Models

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

4.8 Instrumental Variables

On Variance Estimation for 2SLS When Instruments Identify Different LATEs

Greene, Econometric Analysis (5th ed, 2003)

SEM with observed variables: parameterization and identification. Psychology 588: Covariance structure and factor models

the error term could vary over the observations, in ways that are related

Econ 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5.

Linear Regression. Junhui Qian. October 27, 2014

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

POL 681 Lecture Notes: Statistical Interactions

1 Correlation between an independent variable and the error

Short Questions (Do two out of three) 15 points each

Transcription:

Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38

Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] = 0 [K 1] (2) 2 / 38

Endogeneity Defined In a standard OLS framework, y = xβ + ɛ (3) b is biased since E[b] β. This happens because E[x ɛ] 0 (4) 3 / 38

The Problem If ɛ imparts some effect on x, then we can t disentangle the direct impact of x on y (what we really want to know) with the indirect effect of ɛ on y via x. Examples Simultaneity: Aids funding in Africa and Aids incidence Missing Variable Bias: Wage equation With biased estimates of β our model gives poor policy guidance. 4 / 38

Approaches Proxy Variables Need to find a proxy correlated with the missing variable (or problematic part of the error). Difficulties interpreting results, since the scale of estimated coefficient not necessarily informative for β Lagging x (ad hoc) Instrumental Variables Control Function 5 / 38

The Instrumental Variable Approach: Setup Let x be an N K matrix of the following form: 1 x 12... x 1,K 1 x 1,K..... x = 1 x i2... x i,k 1 x i,k = [ ] 1 x 2... x K 1 x K..... 1 x N2... x N,K 1 x N,K (5) We believe that the K th column is endogenous (it could be any column, 2 through K). Columns 1 to K-1 are exogenous (x K = [ 1 x 2... x K 1 ] ) 6 / 38

Pick an Instrumental Variable (IV) Find an instrumental Variable (z K ) having the following properties: 1 E(z K ɛ) = 0 2 Relevant: x K = δ 0 + δ 1 x 1 +... + δ K 1 x K 1 + θ K z K + r K (6) Estimate the relevancy equation and test H 0 : z K not relevant: θ K = 0 H 1 : z K is relevant: θ K 0 3 z k does not directly impact y 7 / 38

Endogeneity and Instrumental Variables: An illustration These two conditions ensure the causality is running in this direction: In particular: 1 z k is uncorrelated with ɛ 2 z k has no direct impact on y (sometimes called the exclusion restriction) 8 / 38

Examples Demand estimation: price is endogenously determined by demand shifts (and random shocks to demand) Look for something correlated (+/-) with price but not directly related to quantity demanded or error. Things only impacting supply have this property (for food commodities: weather) Returns to education: educational attainment likely correlated with errors in wage equation Look for something correlated with educational attainment but not related to wage or error Distance from school/university Month of birth 9 / 38

Implementing the IV Approach: Step 1 Define the matrix z as 1 x 12... x 1,K 1 z 1,K..... z = 1 x i2... x i,k 1 z i,k = [ ] 1 x 2... x K 1 z K..... 1 x N2... x N,K 1 z N,K (7) 10 / 38

How do we use the IV (found in z) to estimate β Idea: Define b iv as (z z) 1 z y Substitute the Relevancy equation into our original estimating equation: y =β 1 + β 2 x 2 +... + β K 1 x K 1 + β K (δ 1 + δ 2 x 2 +... + δ K 1 x K 1 + θ K z K + r) + ɛ =(β 1 + β K δ 1 ) + (β 2 + β K δ 2 )x 2 +... + (β K 1 + β K δ K 1 )x K 1 + (β K θ K )z k + (β K r k + ɛ) =α 1 + α 2 x 2 +... + α K 1 x K 1 + α K z k + v If we run this regression, we obtain a = (z z) 1 z y 11 / 38

So merely substitute our instrument in for x K and recovering parameters a will give you estimates where: α k β k for every parameter you estimate, not just the endogenous one, β K The variance/covariance matrix of the errors (v) is not N(0, σ 2 I) E[a] β, so this is not a good IV estimator. Given an estimate for a, we can t solve for the K estimates for β because we have K equations in 2 K unknowns. 12 / 38

Implementing the IV Approach: Step 2 Given an estimate using the instrumental variables approach (b iv ), we can define the predicted model error as This error must have the property that 1 e iv = y xb iv (8) E[z ɛ] z e iv = 0 (9) 1 So long as it can be shown that b iv is a consistent estimate of β 13 / 38

Implementing the IV Approach: Step 2 Simplify 0 =z (y xb iv ) (10) =z y z xb iv (11) b iv = (z x) 1 z y (12) 14 / 38

Implementing the IV Approach: Step 2a (alternative way) An equivalent way to think about IV regression: 1 Run the following regression (relevancy equation): x K = δ 1 + δ 2 x 2 +... + δ K 1 x K 1 + θ K z K + r (13) 2 With parameter estimates (d and t k ) calculate ˆx K : ˆx K = d 1 + d 2 x 2 +... + d K 1 x K 1 + t K z K (14) 15 / 38

Implementing the IV Approach: Step 2a (alternative way),cont. Defining 1 x 12... x 1,K 1 ˆx 1,K..... ˆx = 1 x i2... x i,k 1 ˆx i,k = [ ] 1 x 2... x K 1 ˆx K..... 1 x N2... x N,K 1 ˆx N,K (15) We can write the IV estimator as b iv = (ˆx ˆx) 1ˆx y (16) 16 / 38

Intuition of IV estimators Intuition ˆx contains only exogenous information. By using the predicted value we take the part of x k not correlated with ɛ and use it to estimate β. It can also be shown that for 1 IV and 1 endogenous variable: b iv = (z x) 1 z y = (ˆx ˆx) 1ˆx y (17) 17 / 38

We can identify the β parameters using IV Regression Assumptions 1 z is not correlated with the true model error, or E[z ɛ] = 0 (we can t test this) 2 z imparts an effect on y only via x, not directly (difficult to test) 3 It is relevant and strong (we can test this) 4 rank(z)=k If these conditions hold, then we have consistent estimates for β. In this case, since we have 1 IV and 1 endogenous variable, we say the model is exactly identified. 18 / 38

b iv is LATE The instrumental variable estimator gives us estimates of locally averaged treatment (causal effect) effects (LATE). This means it doesn t help identify behavioral changes that may occur for non-treatment groups. Consider using an earthquake event (=1,0) to instrument for price of agricultrual commodities. The b iv we get gives us the correct estimate of the estimate on price (β p ) telling us about those affected by price experiencing the earthquake very little about those not affected by the earthquake These sorts of problems loom larger when your instrument is a dummy variable. So long as your treatment group is representative of everyone, no issues. 19 / 38

Relevancy Test Run the relevancy regression: and test: x K = δ 1 + δ 2 x 2 +... + δ K 1 x K 1 + θ K z K + r (18) Relevancy Test H 0 : θ k = 0 : The instrument z K is not relevant H 1 : θ k 0 : The instrument z K is relevant Furthermore, the F-test with one degree of freedom can tell us about strong instruments. Rejecting the NULL hypothesis, z K is strong if the F-statistic exceeds 10. 20 / 38

Testing for the endogeneity of x K Another standard test is to see if, in fact x K is endogenous. This test proceeds from the observation that if x K is endogenous and we have a strong and relevant IV meeting the assumptions above, then the OLS estimate is biased and inconsistent whereas the IV estimate is consistent: E[b OLS ] E[b IV ] = β (19) Consequently, in the case of an endogenous x K, we can test for a meaningful difference between the two sets of estimates. 21 / 38

Hausman Test The Hausman test is widely used for testing differences in parameter estimates. In an IV setting, this is called the Hausman-Wu test, having Hausman-Wu Endogeneity Test H 0 : b IV b OLS = 0 : x K is exogenous H 1 : b IV b OLS 0 : x K is endogenous Where the test statistic is distributed F with 1 degree of freedom. 22 / 38

Implementing the Hausman-Wu Test 1 Run the Relevancy Equation: x K = δ 1 + δ 2 x 2 +... + δ K 1 x K 1 + θ K z K + r (20) 2 Recover the predicted residuals from this regression (ˆr) 3 Run this regression: y = xτ + µˆr + u (21) Hausman-Wu Endogeneity Test H 0 : b IV b OLS = 0 : µ = 0 : x K is exogenous H 1 : b IV b OLS 0 : µ 0: x K is endogenous Where the test statistic is distributed F with 1 degree of freedom. 23 / 38

Intuition of the Test The residual ˆr should only include the endogenous part of x K (if any exists), since we have controlled for all exogenous information at our disposal (x K and z K ). If this endogenous part of x K is useful for predicting y after we control for our full set of original regressors (x), then this provides evidence of significance differences in our regressors because there is a part of x K that is correlated with y via the error term. 24 / 38

Variance/Covariance Matrix of Parameters It can be shown that Var(b IV ) is ( E[Var(b IV )] = σ 2 (z x) 1 z z(z x) 1 ) (22) With the robust version: E[Var(b IV )] Robust = (z x) 1 z Vz(z x) 1 (23) 25 / 38

More IV s than endogenous variables Now consider a case where for our population regression, column K continues to be suspected as endogenous and we want to have M instrumental Variables rather than only 1. Redefine z as: z = = 1 x 12... x 1,K 1 z 1,1 z 1,2... z 1,M....... 1 x i2... x i,k 1 z i,1 z i,2... z i,m....... 1 x N2... x N,K 1 z N,1 z N,2... z N,M [ ] 1 x2... x K 1 z 1 z 2... z M (24) (25) 26 / 38

Some Terminology Stata uses a slightly different terminology: x K z k x k My Terminology Exogenous Instrumental Endogenous Independent Variable(s) Independent Variables Variable(s) Stata s Instruments Instruments Instrumented Terminology 27 / 38

Rationale for more than one IV We put alot of hopes on our lone IV (z K ) It is uncorrelated with ɛ It is correlated with y only via x K It is relevant. While relevancy is a good thing, it doesn t ensure the best IV With this limitation in mind, extend the IV model to include more instruments for x K 28 / 38

Rationale for more than one IV: The dark side But including more IV s risks violating the exogeneity assumption: E[z ɛ] 0 (26) Or, we might have redundant or near redundant information in z. Combined, or taken separately this complicates pinning down a unique and consistent estimate for β. 29 / 38

Deriving the IV Estimator Using the Method of Moments approach outlined above, we need to find an estimate for b IV satisfying E[z ɛ] = 0 (27) If β IV is a consistent estimate for β, then this condition becomes 0 = z e (28) 30 / 38

Deriving the IV Estimator Using the Method of Moments approach outlined above, we need to find an estimate for b IV satisfying E[z ɛ] = 0 (27) If β IV is a consistent estimate for β, then this condition becomes 0 = z e (28) 0 = z (y xb IV ) (29) 0 = z y z xb IV (30) Rearranging yields b IV = (z x) 1 z y as before...or does it. Check dimensionality of (z x) 1 z y versus what we expect the dimensionality of b IV to be. 31 / 38

Deriving the Estimator, 2SLS As in the case above, from the relevancy equation calculate predicted x K as a function of our exogenous variables: ˆx K = d 0 + d 2 x 2 + d 3 x 3 +... + d K 1 x K 1 + t 1 z 1 +... + t M z M (31) by running an OLS regression. Denoting ˆx = [ 1 x 2... x K 1 ˆx K ], the two stage least squares estimator (2SLS) is ˆβ 2SLS = (ˆx ˆx) 1ˆx y (32) Note: In a more general setting where there are more than 1 endogenous variables, define ˆx as ˆx = z(z z) 1 z x (33) 32 / 38

Deriving the Estimator, GMM We have more equations than unknowns making it unlikely that z e = 0 for every column of z. For estimation purposes b IV is found by minimizing min b IV e zwz e N (34) which is a scalar value. W is a weighting matrix. Typically W contains similar information to V, the matrix we used to correct for robust standard errors in the OLS chapter. Setting W = I restricts the GMM estimate for b IV to be equal to the 2SLS estimate. 33 / 38

Deriving the Estimator, GMM While it is almost always possible to find a b IV that minimizes this condition, it does not impose the orthogonality condition for each column of z. Thus there is the possibility of overidentification. 34 / 38

GMM versus 2SLS, which to use? In addition to GMM and 2SLS, there are additional methods one could use to estimate our estimate for β in an IV framework. Which to use? Stata Name Description Notes 2SLS Two Stage Least Squares Useful for understanding classical IV regression GMM Generalized Method of Moments Probably most useful method for most settings. Must understand implications for various choices of W you might use. Defaults not bad. LIML Limited Information Maximum Likelihood Uses ML methods, has the best small sample properties 3SLS Three Stage Least Squares More efficient than 2SLS but not used as often in an IV framework because not robust to specification error In most cases you want to use GMM with default weighting matrix unless you have good reasons to do otherwise. 35 / 38

More IV s than endogenous columns of x: Steps 1 Contemplate endogeneity problem for each regressor in x 2 If some elements of x could be endogenous, contemplate instrumental variables, noting that the number of instruments must be greater than or equal to the number of endogenous variables. 3 Test for the relevance of your instruments 4 Test for overidentification 5 Test for the endogeneity of your suspect columns of x 36 / 38

The Overidentification Test (Sargon Test): Manual Method Steps for 2SLS: 1 Recover predicted errors from IV regression (e iv = y xb iv ) 2 Regress the predicted errors e iv on all exogenous regressors and instruments (x K and the instrumental variables z 1 through z M ), which we defined previously as z e iv = zµ + ψ (35) 3 Conduct the joint test of R 2 N distributed χ 2 (M 1): H 0 = µ = 0 = (z e iv = 0) H 1 = µ 0 = (z e iv 0) Note: The manual steps outlined above are not recommended, instead use the stata command overid. 37 / 38

Variance/Covariance Matrix for b iv (standard errors) Standard errors in the multiple IV 2SLS framework can be calculated as above, except they are inefficient (and will not match what stata reports). Why? Two stage least squares involves estimation of ˆx k used to estimate b iv The correct standard errors take into acccount that ˆx k is a random variable. Bottom Line: Let stata calculate standard errors, and don t use manually calculated se s. 38 / 38