Single-Equation GMM: Endogeneity Bias

Similar documents
Regression. Econometrics II. Douglas G. Steigerwald. UC Santa Barbara. D. Steigerwald (UCSB) Regression 1 / 17

Introduction: structural econometrics. Jean-Marc Robin

Regression Discontinuity

Speci cation of Conditional Expectation Functions

Economics 241B Estimation with Instruments

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Controlling for Time Invariant Heterogeneity

EC402 - Problem Set 3

Single-Equation GMM: Estimation

Environmental Econometrics

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Chapter 2. Dynamic panel data models

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Instrumental Variables

Problem Set # 1. Master in Business and Quantitative Methods

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

Nonparametric Density Estimation

Dealing With Endogeneity

ECO375 Tutorial 8 Instrumental Variables

Lecture notes to Stock and Watson chapter 12

We begin by thinking about population relationships.

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

1 Correlation between an independent variable and the error

ECON Introductory Econometrics. Lecture 16: Instrumental variables

The regression model with one stochastic regressor (part II)

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 18th Class 7/2/10

Introductory Econometrics

Instrumental Variables

Estimation of a Local-Aggregate Network Model with. Sampled Networks

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

Lecture 4: Linear panel models

Instrumental Variables

Birkbeck Economics MSc Economics, PGCert Econometrics MSc Financial Economics Autumn 2009 ECONOMETRICS Ron Smith :

Econometrics. 7) Endogeneity

A Note on the Correlated Random Coefficient Model. Christophe Kolodziejczyk

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market

More on Specification and Data Issues

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

4.8 Instrumental Variables

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation

Applied Quantitative Methods II

Final Exam. Economics 835: Econometrics. Fall 2010

Lecture Notes on Measurement Error

5. Erroneous Selection of Exogenous Variables (Violation of Assumption #A1)

Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

GMM Estimation with Noncausal Instruments

Returns to Tenure. Christopher Taber. March 31, Department of Economics University of Wisconsin-Madison

Testing for Regime Switching: A Comment

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models

Recitation 1: Regression Review. Christina Patterson

Microeconometrics: Clustering. Ethan Kaplan

Instrumental Variables. Ethan Kaplan

Econometrics Lecture 1 Introduction and Review on Statistics

I 1. 1 Introduction 2. 2 Examples Example: growth, GDP, and schooling California test scores Chandra et al. (2008)...

Föreläsning /31

The returns to schooling, ability bias, and regression

AGEC 661 Note Fourteen

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Econometrics. 8) Instrumental variables

Handout 12. Endogeneity & Simultaneous Equation Models

When is it really justifiable to ignore explanatory variable endogeneity in a regression model?

On the Power of Tests for Regime Switching

The Augmented Solow Model Revisited

Ultra High Dimensional Variable Selection with Endogenous Variables

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression

Contents. University of York Department of Economics PhD Course 2006 VAR ANALYSIS IN MACROECONOMICS. Lecturer: Professor Mike Wickens.

Lecture 8: Instrumental Variables Estimation

Applied Quantitative Methods II

Pseudo panels and repeated cross-sections

Lab 07 Introduction to Econometrics

Club Convergence: Some Empirical Issues

What Accounts for the Growing Fluctuations in FamilyOECD Income March in the US? / 32

1 Static (one period) model

Empirical Methods in Applied Microeconomics

Applied Econometrics Lecture 1

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Exam D0M61A Advanced econometrics

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

Regression with time series

Lecture Notes Part 7: Systems of Equations

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 9: Multiple equation models II

Endogeneity. Tom Smith

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

The College Premium in the Eighties: Returns to College or Returns to Ability

Exercise sheet 3 The Multiple Regression Model

Simultaneous Equation Models

Chapter 1. GMM: Basic Concepts

Econometrics Problem Set 4

Multiple Choice Questions (circle one part) 1: a b c d e 2: a b c d e 3: a b c d e 4: a b c d e 5: a b c d e

Econ 582 Fixed Effects Estimation of Panel Data

Recitation Notes 5. Konrad Menzel. October 13, 2006

Estimation of Dynamic Nonlinear Random E ects Models with Unbalanced Panels.

Dynamic Regression Models (Lect 15)

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

ow variables (sections A1. A3.); 2) state-level average earnings (section A4.) and rents (section

Transcription:

Single-Equation GMM: Lecture for Economics 241B Douglas G. Steigerwald UC Santa Barbara January 2012

Initial Question Initial Question How valuable is investment in college education? economics - measure value in terms of wage How would you determine the return on investment in college education?

Framework Stochastic Model What are the returns to a college education? Random variables of interest W - log of worker wage S - years of schooling A - age M - indicator for male R - indicator for white U - other factors that a ect wages Stochastic Model W = β 0 + β 1 S + β 2 A + β 3 A 2 + β 4 M + β 5 R + U

Initial Question Answered? Estimates Stochastic Model W = β 0 + β 1 S + β 2 A + β 3 A 2 + β 4 M + β 5 R + U ˆβ 1.084 (and signi cantly di erent from zero) each additional year of schooling is worth an additional 8.4% in wages 4 years of college would increase wages by 38% (1.084 4 ) the median full time worker earns about $550 per week in 2000 wage increase of 38% is $210 per week over 30 year work-life, earnings increase by $170,000 in present value (5% interest), makes public universities a good deal

Initial Question Answered? Potential Endogeneity W = β 0 + β 1 S + β 2 A + β 3 A 2 + β 4 M + β 5 R + U β 1 may not capture a causal impact on wages workers who obtained more education may have attributes that would have led to higher earnings even without additional education S is endogenous ) Cov (S, U) 6= 0 ˆβ 1 is biased and inconsistent What is the direction of bias in ˆβ 1? ˆβ 1 is biased upward, does not provide a helpful bound to argue for bene ts of education

Background Sources of Covariate-Error Correlation Focus on Cor (S, U) Endogeneity workers who would otherwise have high wage rates are more likely to obtain higher education U = µ + e µ - ability e - random shock S = αµ + w describes how schooling is correlated with ability in this application, likely that Cor (S, U) > 0 Measurement Error S = S + v S - actual schooling S - reported v - measurement error in all applications, Cor (S, U) < 0

Background Detail: Measurement Error Correlation Simplify population model W t = βst + U t S t = St + v t estimated model W t = βs t + (U t βv t ) v t is a component of S t ) Cor [S t, (U t βv t )] < 0 in large samples ˆβ tends to Cov (S t, v t ) β 1 Var (S t ) ˆβ = β + n t=1 S t [U t βv t ] n t=1 S 2 t Var (S = t ) β Var (St ) + Var (v t ) where Var (S t ) = Var (S t ) + Var (v t ) Cov (S t, v t ) = Var (v t ) Iron Law of Econometrics - measurement error leads to attenuation bias

Background Solutions Instrument z is a (valid) instrument if Cov (S, z) 6= 0 and Cov (U, z) = 0 instruments can address both sources of covariate-error correlation issue - instruments can be di cult to nd Measurement error assumption S = S + v assumptions regard v example: v is symmetric around 0 issue - does not address endogeneity

Background Instrument Solutions Standard Instrument Solution implicit model of endogeneity no speci ed model linking endogenous covariates to error yields classic instrumental variable (IV) estimator Model-Based Selection (Endogeneity) Correction explicit model of endogeneity clearly speci ed model linking endogenous covariates to error yields selection-corrected IV estimator

Background Standard Instrument Solution : Identi cation X (K 1) covariate vector Z (L1) instrument vector Identi cation Assumption (Rank Condition) The L K matrix E ZX T has rank K. Example X T = (1, S) Z T = (1, z) E ZX T = 1 E (S) E (z) E (Sz) Rank is K if determinant is not zero, Cov (S, z) 6= 0

Background Identi cation Identi cation Assumption (Order Condition) There are at least as many instruments as endogenous covariates: L K. Over identi cation rank condition satis ed and L > K Exact identi cation rank condition satis ed and L = K No identi cation L < K (rank condition cannot hold)

Initial Question Revisited Selection (Endogeneity) Correction Key - construct E [UjX, Z ] add to regression, remaining error uncorrelated with covariates Wage Regression Application data on twins (indexed by i) who share family characteristics Selection (Endogeneity) Model U i = µ + ε i µ = γs 1 + γs 2 + ω µ - latent family characteristics, correlated with S could relax assumption that γ is constant (use equation for twin 1 to identify γ 2 ) γ - selection e ect : γ > 0 ) families that would otherwise have high wages are more likely to educate their children

Initial Question Revisited Selection Correction wage regression (twin 1 C 0 1 = A 1, A 2 1, M 1, R 1 ) W 1 = β 0 + β 1 S 1 + C 0 1δ + (µ + ε 1 ) = β 0 + β 1 S 1 + C 0 1δ + (γs 1 + γs 2 + ω + ε 1 ) identi cation assumption E [U 1 jx, Z ] = γs 1 + γs 2 selection-corrected regression W 1 = β 0 + (β 1 + γ) S 1 + γs 2 + C 0 1δ + (ε 1 + ω) Variable OLS Include S 2 Own education 0.084 0.088 (0.014) (0.015) Sibling s education - 0.007 (0.015) Twins data - endogeneity bias is negative!

Review Review Stochastic Model W i = β 0 + β 1 S i + U i What two issues lead to correlation between S i and U i? 1) endogeneity - latent ability that impacts both S i and U i 2) measurement error in S i What is the impact of the correlation on B OLS? biased and inconsistent What is needed to construct a consistent estimator? 1) instrument Z i Cor (Z i, S i ) 6= 0 Cor (Z i, U i ) = 0 2) assumption about measurement error