Econometrics of Policy Evaluation (Geneva summer school)

Similar documents
Sensitivity checks for the local average treatment effect

CEPA Working Paper No

EMERGING MARKETS - Lecture 2: Methodology refresher

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Regression Discontinuity Designs.

Empirical approaches in public economics

Instrumental Variables

Quantitative Economics for the Evaluation of the European Policy

ECO Class 6 Nonparametric Econometrics

Principles Underlying Evaluation Estimators

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Applied Health Economics (for B.Sc.)

WISE International Masters

Econometric Causality

Introduction to Causal Inference. Solutions to Quiz 4

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Groupe de lecture. Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings. Abadie, Angrist, Imbens

Selection on Observables: Propensity Score Matching.

Chapter Course notes. Experiments and Quasi-Experiments. Michael Ash CPPA. Main point of experiments: convincing test of how X affects Y.

Flexible Estimation of Treatment Effect Parameters

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

150C Causal Inference

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Potential Outcomes Model (POM)

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes

Final Exam. Economics 835: Econometrics. Fall 2010

Causal Inference with Big Data Sets

The Economics of European Regions: Theory, Empirics, and Policy

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Introduction to Econometrics. Assessing Studies Based on Multiple Regression

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation

Exam ECON5106/9106 Fall 2018

Causal Inference Basics

Experimental Designs for Identifying Causal Mechanisms

Propensity Score Matching

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Sharp IV bounds on average treatment effects on the treated and other populations under endogeneity and noncompliance

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

Controlling for Time Invariant Heterogeneity

Recitation Notes 6. Konrad Menzel. October 22, 2006

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

KNES Geography Course Outline. Year 9

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN

Probability and Statistics

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Econometrics Multiple Regression Analysis with Qualitative Information: Binary (or Dummy) Variables

Matching Problems. Roberto Lucchetti. Politecnico di Milano

Differences-in- Differences. November 10 Clair

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata


Lecture: Difference-in-Difference (DID)

Comparing Treatments across Labor Markets: An Assessment of Nonexperimental Multiple-Treatment Strategies

14.74 Lecture 10: The returns to human capital: education

Applied Microeconometrics (L5): Panel Data-Basics

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Recitation Notes 5. Konrad Menzel. October 13, 2006

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

Sharp Bounds on Causal Effects under Sample Selection*

Empirical Methods in Applied Microeconomics

Exploring Marginal Treatment Effects

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

Predicting the Treatment Status

AGEC 661 Note Fourteen

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

The Supervised Learning Approach To Estimating Heterogeneous Causal Regime Effects

ECON Introductory Econometrics. Lecture 17: Experiments

Technical Track Session I:

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Field Course Descriptions

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments

Treatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Policy-Relevant Treatment Effects

Causal Inference. Prediction and causation are very different. Typical questions are:

The problem of causality in microeconometrics.

What s New in Econometrics. Lecture 1

Guanglei Hong. University of Chicago. Presentation at the 2012 Atlantic Causal Inference Conference

Causality and Experiments

The Estimation of Causal Effects by Difference-in-Difference Methods

Statistical Analysis of Causal Mechanisms

Technical Track Session I: Causal Inference

causal inference at hulu

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

Package causalweight

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Experiments and Quasi-Experiments

1 Impact Evaluation: Randomized Controlled Trial (RCT)

Dealing with a Technological Bias: The Difference-in-Difference Approach

PSC 504: Instrumental Variables

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Transcription:

Michael Lechner, Slide 1 Econometrics of Policy Evaluation (Geneva summer school) Michael Lechner Swiss Institute for Empirical Economic Research (SEW) University of St. Gallen Switzerland June 2016 Overview over causality and research designs

Michael Lechner, Slide 2 1 The causal problem 2 Matching 3 Instrumental variables 4 Regression discontuity design (RDD) 5 Differences-in-differences methods (DiD)

1 The causal problem Michael Lechner, Slide 3

Michael Lechner, Slide 4 The causal problem 1 How does an outcome change (Y) if some variable (D) changes Example: Effect of participation in university sports (D) on individual educational outcomes (grades, time to degree) of students (Y) There will be a positive correlation (corr(y,d) > 0)! Does this mean that better students are doing sports (selection effect due to confounding variables) or doing sports makes students better (causal effect) Policy relevance for (non-marketing oriented) university: causal effect! One possible question to answer for a researcher: How much did the participants in unversity sport benefit on average from participating? Key ingredient for knowing the answer: Which grades would the participants have obtained, had they NOT participated?

Michael Lechner, Slide 5 The causal problem 2 Data alone can never answer a causal question Y 1 : Grade if participating in university sports Y 0 : Grade if NOT participating in university sports Average causal effect for participants: ATET=E(Y 1 - Y 0 D=1)=E(Y 1 D=1) E(Y 0 D=1) Use data to estimate E(Y 1 D=1) as mean grade of participants Assume mean grade is 4.5 BUT: No information about E(Y 0 D=1) [may be between 1 and 6] Thus, without further information (assumptions), causal effect only known to be larger than -1.5 (=4.5-6) and smaller than 3.5 (=4.5-1)

Michael Lechner, Slide 6 The causal problem 3 Data alone can never answer a causal question The causal effect is not identified from the data alone Therefore, so-called identifying assumptions are required that cannot be tested by the data Whether such assumptions make sense or not needs to be decided about substantive knowledge of the phenomenon under investigation Thus, chosing a convincing research design is a most important task in any empirical analysis (and different from the more technical aspects of chosing a suitable econometric estimator with such a research design)

Michael Lechner, Slide 7 The causal problem 4 An ideally designed and perfectly implemented experiment solves the problem because it removes any confounding Thus it provides a sufficiently strong set of assumptions that identify the causal effect Example: Randomly selected students must be forced to do sports In this case, participants and non-participants differ only w.r.t. participation and, thus, there cannot be any confounding In this course, we discuss alternative approaches that might be used with experimental and non-experimental data

Michael Lechner, Slide 8 1 The causal problem 2 Matching

Michael Lechner, Slide 9 Matching 1 Matching solves the identification problem by assuming that a sufficiently rich data set is available Remove influence of all confounders by conditioning on them Example ATET: Use outcomes of non-treated appearing 'identical in all relevant dimensions' to the treated ATET =E(Y 1 D=1) E(Y 0 D=1)= E(Y 1 D=1) E[E(Y X,D=0) D=1] Can be viewed as similar to regression methods but without (almost) any functional form assumption allowing for flexible effect heterogeneity

Michael Lechner, Slide 10 1 The causal problem 2 Matching 3 Instrumental variable

Michael Lechner, Slide 11 IV 1 Confounding cannot be removed by conditioning on observables But there is another variable (Z) for which the causal effect on D and Y can be estimated by conditioning on observables there is only a causal effect Z Y because there is a causal effect of Z D and D Y. There is no direct effect of Z on Y. If Z is binary: (Z Y) / (Z D) gives causal effect for compliers 2SLS, IV are parametric versions of this. Can be generalized.

Michael Lechner, Slide 12 IV 2 Back to example: Use randomized incentives for sports Fulfills the conditions Causal effect identified for the subgroup of people who react to the financial incentive by changing their sports behaviour LATE( Z) = E [ EY ( Z= 1, X= x) ] E[ EY ( Z= 0, X= x) ] [ EDZ ( = 1, X= x) ] [ ( = 0, = )] E E EDZ X x

Michael Lechner, Slide 13 1 The causal problem 2 Matching 3 Instrumental variable 4 Regression discontuity design (RDD)

Michael Lechner, Slide 14 RDD 1 Interest is in the effect of repeating the first year in university on overall grades Those who fail and those who don't fail differ in many dimensions, which are not observable in your data (matching likely to fail) Suppose that there is An almost continuous scale that determines the grade Examiners are not perfect. Thus, there is some small randomness in the values of the underlying scale (and thus the grades)

Michael Lechner, Slide 15 RDD 2 Therefore, just around the cut-off, students are approx. identical No confounding in the neighbourhood of the cut-off Thus, the identification problem is solved Works also in cases in which cut-off is not strict but influences the probability of participation to some extend Estimation: Same as LATE but local around the cut-off

Michael Lechner, Slide 16 1 The causal problem 2 Matching 3 Instrumental variable 4 Regression discontuity design (RDD) 5 Differences-in-differences methods (DiD)

Michael Lechner, Slide 17 DiD and panel methods 1 Have observations before and after D happened (no panel needed) Example about university sports Compare grades of participants before and after participation Concern: Grades changes generally during study (trend) Use before-after-comparison of non-participants to remove this trend Non-parametric identification impossible Essentially similar to matching. Not much discussed here, see my survey (2010) for methods appropriate in a non-linear or semiparametric setting. See also panel section of Econometric Methods

Michael Lechner, Slide 18 Next we consider approaches that can be used to estimate the various conditional expectations that play a role for the different research designs