INSTRUMENTAL VARIABLES

Similar documents
CAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank

Evaluating enterprise support: state of the art and future challenges. Dirk Czarnitzki KU Leuven, Belgium, and ZEW Mannheim, Germany

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

MATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank

MATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank

REGRESSION DISCONTINUITY (RD) Technical Track Session V. Dhushyanth Raju Julieta Trias The World Bank

Getting Involved O. Responsibilities of a Member. People Are Depending On You. Participation Is Important. Think It Through

SIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST. Mark C. Otto Statistics Research Division, Bureau of the Census Washington, D.C , U.S.A.

Section 11 Simultaneous Equations

PSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa

CHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.

A Matrix Representation of Panel Data

Hypothesis Tests for One Population Mean

Bootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >

Differentiation Applications 1: Related Rates

CS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007

4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression

Kinetic Model Completeness

Modelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA

Lesson Plan. Recode: They will do a graphic organizer to sequence the steps of scientific method.

Turing Machines. Human-aware Robotics. 2017/10/17 & 19 Chapter 3.2 & 3.3 in Sipser Ø Announcement:

Introduction to Spacetime Geometry

LHS Mathematics Department Honors Pre-Calculus Final Exam 2002 Answers

Five Whys How To Do It Better

Resampling Methods. Chapter 5. Chapter 5 1 / 52

Resampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017

A Regression Solution to the Problem of Criterion Score Comparability

Chapter 3: Cluster Analysis

An example to start off with

Section 11 Simultaneous Equations

Functional Form and Nonlinearities

Writing Guidelines. (Updated: November 25, 2009) Forwards

A New Evaluation Measure. J. Joiner and L. Werner. The problems of evaluation and the needed criteria of evaluation

First Survey. Carried out by IPR feedback

Simple Linear Regression (single variable)

Lab 1 The Scientific Method

AP Statistics Practice Test Unit Three Exploring Relationships Between Variables. Name Period Date

Eric Klein and Ning Sa

In the OLG model, agents live for two periods. they work and divide their labour income between consumption and

THE LIFE OF AN OBJECT IT SYSTEMS

Collocation Map for Overcoming Data Sparseness

End of Course Algebra I ~ Practice Test #2

Large Sample Hypothesis Tests for a Population Proportion

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

CONSTRUCTING STATECHART DIAGRAMS

Excessive Social Imbalances and the Performance of Welfare States in the EU. Frank Vandenbroucke, Ron Diris and Gerlinde Verbist

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

ENSC Discrete Time Systems. Project Outline. Semester

What is Statistical Learning?

Admin. MDP Search Trees. Optimal Quantities. Reinforcement Learning

READING STATECHART DIAGRAMS

Pattern Recognition 2014 Support Vector Machines

Determining the Accuracy of Modal Parameter Estimation Methods

You need to be able to define the following terms and answer basic questions about them:

This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement number

How do scientists measure trees? What is DBH?

Chapter 5: The Keynesian System (I): The Role of Aggregate Demand

Maximum A Posteriori (MAP) CS 109 Lecture 22 May 16th, 2016

We can see from the graph above that the intersection is, i.e., [ ).

WRITING THE REPORT. Organizing the report. Title Page. Table of Contents

CHAPTER 3 INEQUALITIES. Copyright -The Institute of Chartered Accountants of India

This section is primarily focused on tools to aid us in finding roots/zeros/ -intercepts of polynomials. Essentially, our focus turns to solving.

SAMPLING DYNAMICAL SYSTEMS

k-nearest Neighbor How to choose k Average of k points more reliable when: Large k: noise in attributes +o o noise in class labels

COMP 551 Applied Machine Learning Lecture 11: Support Vector Machines

Section 5.8 Notes Page Exponential Growth and Decay Models; Newton s Law

IN a recent article, Geary [1972] discussed the merit of taking first differences

Computational modeling techniques

Physics 2010 Motion with Constant Acceleration Experiment 1

The general linear model and Statistical Parametric Mapping I: Introduction to the GLM

Instructional Plan. Representational/Drawing Level

Department of Economics, University of California, Davis Ecn 200C Micro Theory Professor Giacomo Bonanno. Insurance Markets

How T o Start A n Objective Evaluation O f Your Training Program

Assessment Primer: Writing Instructional Objectives

SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis

Session IV Instrumental Variables

AP Statistics Notes Unit Two: The Normal Distributions

Internship Programme of German Business for the Countries of Western Balkans. How to complete your application?

Reinforcement Learning" CMPSCI 383 Nov 29, 2011!

Comparing Several Means: ANOVA. Group Means and Grand Mean

Fall 2013 Physics 172 Recitation 3 Momentum and Springs

Why Don t They Get It??

CHM112 Lab Graphing with Excel Grading Rubric

Click to edit Master title style

Exam #1. A. Answer any 1 of the following 2 questions. CEE 371 October 8, Please grade the following questions: 1 or 2

Math Foundations 20 Work Plan

Activity Guide Loops and Random Numbers

Pipetting 101 Developed by BSU CityLab

The standards are taught in the following sequence.

Causal inference using regression on the treatment variable

Please Stop Laughing at Me and Pay it Forward Final Writing Assignment

the results to larger systems due to prop'erties of the projection algorithm. First, the number of hidden nodes must

Lifting a Lion: Using Proportions

AP Statistics Notes Unit Five: Randomness and Probability

**DO NOT ONLY RELY ON THIS STUDY GUIDE!!!**

NAME: Prof. Ruiz. 1. [5 points] What is the difference between simple random sampling and stratified random sampling?

Lyapunov Stability Stability of Equilibrium Points

UN Committee of Experts on Environmental Accounting New York, June Peter Cosier Wentworth Group of Concerned Scientists.

Churn Prediction using Dynamic RFM-Augmented node2vec

MODULE FOUR. This module addresses functions. SC Academic Elementary Algebra Standards:

Transcription:

INSTRUMENTAL VARIABLES Technical Track Sessin IV Sergi Urzua University f Maryland

Instrumental Variables and IE Tw main uses f IV in impact evaluatin: 1. Crrect fr difference between assignment f treatment and actual treatment E.g. Randmized Assignment with nn-cmpliers E.g. Fuzzy Regressin Discntinuity 2. Lk fr exgenus variatin (ex-pst) t evaluate the impact f a prgram in absence f a prspective design. Here: General Principles behind IV and an example with a fcus n use (1)

An example t start ff with Say we wish t evaluate a vluntary jb training prgram Any unemplyed persn is eligible (Universal eligibility) Sme peple chse t register (Participants) Other peple chse nt t register (Nn-participants) Sme simple ways t evaluate the prgram: Randm sample cntaining treatment status (P), exgenus cntrls (X) and utcme (Y). First alternative: T cmpare situatin f participants and nn-participants after the interventin. We already learned this estimatr wuld be biased!

Vluntary jb training prgram Say we decide t cmpare utcmes fr thse wh participate t the utcmes f thse wh d nt participate: A simple mdel t d this: y = α + β 1 P + β 2 x + ε P = 1 If persn participates in training 0 If persn des nt participate in training x = Cntrl variables (exgenus & bserved) Why wuld this nt be crrect? 2 prblems: Decisin t participate in training is endgenus (e.g. based n an unmeasurable characteristic). Variables that we mit (e.g. unmeasured) but that are imprtant P and ε are crrelated

What can we d t slve this prblem? We estimate: y = β 0 + β 1 x + β 2 P + ε S the prblem is the crrelatin between P and ε Intuitin f IV: Hw abut we replace P with smething else that is similar t P but is nt crrelated with ε

Back t the jb training prgram P = participatin ε = that part f utcmes that is nt explained by prgram participatin r by bserved characteristics Instrumental variable will be a variable Z that is: (1) Clsely related t participatin P. [i.e. Crr ( Z, P ) > 0] (2) but desn t directly affect peple s utcmes Y, except thrugh its effect n participatin. [i.e. Crr ( Z, ε ) = 0 ] Hard t cme up with such a variable ex-pst but if we anticipate this prblem, we can plan fr it

Generating an instrumental variable Encuragement design: - Say that a scial wrker visits persns t encurage them t participate. She nly visits 50% f persns n her rster, and She randmly chses whm she will visit If she is effective, many peple she visits will enrll. There will be a crrelatin between receiving a visit and enrlling. - But visit des nt have direct effect n utcmes (e.g. incme) except frm its effect thrugh enrllment in the training prgram. Randmized encuragement r prmtin visits can be a useful instrumental variable.

Characteristics f an instrumental variable Define a new variable Z Z = 1 If persn was randmly chsen t receive the encuragement visit frm the scial wrker 0 If persn was randmly chsen nt t receive the encuragement visit frm the scial wrker Crr ( Z, P ) > 0 Peple wh receive the encuragement visit are mre likely t participate than thse wh dn t Crr ( Z, ε ) = 0 N crrelatin between receiving a visit and benefit t the prgram apart frm the effect f the visit n participatin. Z therefre satisfies the cnditins fr being an instrumental variable

Tw-stage least squares (2SLS) Remember the riginal mdel with endgenus P: Step 1 y = β 0 + β 1 x + β 2 P + ε Regress the endgenus variable P n the instrumental variable(s) Z and ther exgenus variables P = δ 0 + δ 1 x + δ 2 Z + τ Calculate the predicted value f P fr each bservatin: P^ Since Z and x are nt crrelated with ε, neither will be Yu will need ne instrumental variable fr each ptentially endgenus regressr. P^

Tw-stage least squares (2SLS) Step 2 Regress y n the predicted variable P and the ther exgenus variables y = β 0 + β 1 x + β 2 + ε Nte: The standard errrs f the secnd stage OLS need t be crrected because P^ is a generated regressr. In Practice: Use STATA ivreg cmmand, which des the tw steps at nce and reprts crrect standard errrs. Intuitin: By using Z t predict P, we cleaned P f its crrelatin with η It can be shwn that (under certain cnditins) β 2,IV yields a cnsistent estimatr f γ 2 (large sample thery) P^

Example: Training & Earnings Cnsider the mdel: y = β 0 + β 2 P + ε Randm Sample f 10,000 bservatins Data cntains (y, P, Z ) 6,328 individuals with D=1 & 3,618 with D=0.

Example: Training & Earnings Cnsider the mdel: y = β 0 + β 2 P + ε First Strategy (Participants vs. Nn-participants) E(Y1 D=1) = -0.227 E(Y0 D=0) = 0.996 Thus, δ = E(Y1 D=1) - E(Y0 D=0) = -1.223*** Yu might cnclude then that the effect f the prgram is negative. Selectin bias?

Example: Training & Earnings Cnsider the mdel: y = β 0 + β 2 P + ε Let intrduce the instrument Z: Crr(Z,D)=0.37*** Pr(D=1 Z=1)=0.82 Pr(D=1 Z=0)= 0.45 Cv(y,Z) Cv(P,Z) E(Y Z 1) E(Y Z 0) E(P Z 1) E(P Z 0) 0.210

Example: Was it real? Cnsider the mdel: y = β 0 + β 2 P + ε I generated the data: Y1(u)=0.1 + 0.2 + ε1(u) Y0(u)=0.1 + + ε0(u) P = 1 if Z(u) Y0(u)>0, =0 therwise Y(u) = Y1(u) * P(u) + Y0(u) * (1-P(u)) THUS, I KNOW THE TRUE AVERAGE TREATMENT EFFECT

Example: Was it real? Cnsider the mdel: y = β 0 + β 2 P + ε In ur fake data, we bserve (D,Z,Y1,Y0,Y) Treatment Effect= E(Y1 D=1)-E(Y0 D=1) = 0.2 Selectin Bias = E(Y0 D=1)-E(Y0 D=0) =-1.423 δ = E(Y1 D=1)-E(Y0 D=0) = 0.2+(-1.423) = -1.22 IV gt it right (IV=0.21) This is nt rcket science!

Nn ecnmetric intuitin: Illustratin frm vluntary jb training prgram Ppulatin eligible fr jb training prgram Randm Sample Randmized assignment Standard Infrmatin nly Mnthly incme 1 year later = 700 Standard Infrmatin + Encuragement visit Mnthly incme 1 year later = 850 25% take-up 75% take-up Questin: what is the impact f the jb training prgram n the mnthly incme f participants?

Standard Infrmatin Package nly Mnthly incme 1 year later = 700 Standard + Additinal Infrmatin Package Mnthly incme 1 year later = 850 25% take-up 75% take-up Questin: what is the impact f the jb training prgram? Stage 1: Take-up difference between well infrmed and nt well infrmed :... Stage 2a: Incme difference between the well infrmed and nt well infrmed grup:.. Stage 2b: Impact f participatin: Incme difference scaled by take-up difference:

Reminder and a wrd f cautin crr (Z,ε) =0 If crr (Z, ε) 0, Bad instrument Finding a naturally gd instrument is hard! But yu can build ne yurself with a randmized encuragement design crr (Z,P) 0 If crr (Z, P) 0 Weak instruments : the crrelatin between Z and P needs t be sufficiently strng. If nt, the bias stays large even fr large sample sizes.

Reminder and a wrd f cautin: Hetergeneity It is pssible t shw that, in the cntext f hetergeneus effects, the IV apprach might NOT prvide meaningful results. Hwever, we can still evaluate using structural mdels. Example: Evaluating the impact f financial intermediatin

References Heckman, J., E. Vytlacil, S. Urzua (2006). Understanding instrumental Variables in Mdels with Essential Hetergeneity, Review f Ecnmics and Statistics, v88, n3. Heckman, J., S. Urzua(2010) Cmparing IV With Structural Mdels: What Simple IV Can and Cannt. Jurnal f Ecnmetrics, Vl. 156(1), 2010 Angrist, J. D. and A. Krueger (2001). Instrumental Variables and the Search fr Identificatin: Frm Supply and Demand t Natural Experiments, Jurnal f Ecnmic Perspectives, 15(4). Angrist, J. D., G. W. Imbens and D. B. Rubin (1996). Identificatin f Causal Effects Using Instrumental Variables, Jurnal f the American Statistical Assciatin, Vl. 91, 434. Angrist, J., Bettinger, E., Blm, E., King, E. and M. Kremer (2002). Vuchers fr Private Schling in Clmbia: Evidence frm a Randmized Natural Experiment, American Ecnmic Review, 92, 5. Imbens, G. W. and J. D. Angrist, (1994). Identificatin and Estimatin f Lcal Average Treatment Effects. Ecnmetrica, 62(2). Newman, J., M. Pradhan, L. B. Rawlings, G. Ridder, R. Ca, J. L. Evia, (2002). An Impact Evaluatin f Educatin, Health, and Water Supply Investments by the Blivian Scial Investment Fund., Wrld Bank Ecnmic Review, vl. 16(2).