Exploring Marginal Treatment Effects

Similar documents
Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 11 Roy model, MTE, PRTE

Lecture 11/12. Roy Model, MTE, Structural Estimation

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas

Policy-Relevant Treatment Effects

Estimation of Treatment Effects under Essential Heterogeneity

The relationship between treatment parameters within a latent variable framework

Marginal Treatment Effects from a Propensity Score Perspective

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1

Instrumental Variables: Then and Now

The Econometric Evaluation of Policy Design: Part III: Selection Models and the MTE

Estimating Heterogeneous Treatment Effects in the Presence of Self-Selection: A Propensity Score Perspective*

Flexible Estimation of Treatment Effect Parameters

Principles Underlying Evaluation Estimators

Heterogeneous Treatment Effects in the Presence of. Self-Selection: A Propensity Score Perspective

Identification and Extrapolation with Instrumental Variables

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Estimating Marginal and Average Returns to Education

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

Econometric Causality

David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata

The Generalized Roy Model and Treatment Effects

Estimating Marginal Returns to Education

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Estimating marginal returns to education

Empirical approaches in public economics

STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1. By James J. Heckman and Edward Vytlacil

Haavelmo, Marschak, and Structural Econometrics

TREATMENT HETEROGENEITY PAUL SCHRIMPF NOVEMBER 2, 2011 UNIVERSITY OF BRITISH COLUMBIA ECONOMICS 628: TOPICS IN ECONOMETRICS

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

A simple alternative to the linear probability model for binary choice models with endogenous regressors

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Problem set - Selection and Diff-in-Diff

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Propensity-Score-Based Methods versus MTE-Based Methods. in Causal Inference

Nonlinear Regression Functions

Implementing Matching Estimators for. Average Treatment Effects in STATA

Impact Evaluation Technical Workshop:

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Generated Covariates in Nonparametric Estimation: A Short Review.

Quantitative Economics for the Evaluation of the European Policy

Econometric Analysis of Cross Section and Panel Data

Applied Health Economics (for B.Sc.)

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments

NBER WORKING PAPER SERIES STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION. James J. Heckman Edward Vytlacil

Exam ECON5106/9106 Fall 2018

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Estimating Marginal and Average Returns to Education

Marginal effects and extending the Blinder-Oaxaca. decomposition to nonlinear models. Tamás Bartus

NBER WORKING PAPER SERIES

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Noncompliance in Randomized Experiments

Extended regression models using Stata 15

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

GMM Estimation in Stata

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Thoughts on Heterogeneity in Econometric Models

Understanding What Instrumental Variables Estimate: Estimating Marginal and Average Returns to Education

Homework Solutions Applied Logistic Regression

Dealing With and Understanding Endogeneity

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Selection on Observables: Propensity Score Matching.

Econometrics, Harmless and Otherwise

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case

Lab 6 - Simple Regression

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation

Testing for Essential Heterogeneity

Estimation of pre and post treatment Average Treatment Effects (ATEs) with binary time-varying treatment using Stata

ECON 594: Lecture #6

Propensity Score Methods for Causal Inference

Identification of Regression Models with Misclassified and Endogenous Binary Regressor

ECON Introductory Econometrics. Lecture 16: Instrumental variables

PSC 504: Instrumental Variables

Identifying Effects of Multivalued Treatments

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Notes on causal effects

SUPPOSE a policy is proposed for adoption in a country.

AGEC 661 Note Fourteen

Gov 2002: 9. Differences in Differences

dqd: A command for treatment effect estimation under alternative assumptions

STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1

( ) : : (CUHIES2000), Heckman Vytlacil (1999, 2000, 2001),Carneiro, Heckman Vytlacil (2001) Carneiro (2002) Carneiro, Heckman Vytlacil (2001) :

SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS

What s New in Econometrics? Lecture 14 Quantile Methods

The Generalized Roy Model and the Cost-Benefit Analysis of Social Programs

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling

crreg: A New command for Generalized Continuation Ratio Models

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1

Transcription:

Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25

Introduction Motivation Instrumental variables (IV) estimators solve endogeneity problems When there is heterogenous returns, IV estimate LATE: Average treatment effect among compliers Not always of interest! Marginal Treatment Effects allows you to Go beyond LATE in settings with essential heterogeneity Capture the full distribution of treatment effects Allow us to back out commonly used treatment effect parameters Unify IV methods, selection models and control function approaches Martin Andresen (SSB) Exploring MTEs Oslo, 2018 2 / 25

Introduction This paper Presents the theory of Marginal Treatment Effects aimed at the applied empiricist Highlights similarities to selection models and control function approaches Introduces the new Stata package mtefe for estimating MTEs Performs Monte Carlo simulations to investigate the robustness of the estimators Martin Andresen (SSB) Exploring MTEs Oslo, 2018 3 / 25

Introduction A motivating example: College and wages unobserved {}}{ D = a+bx + cz + dability + ez ability + µ w = f +gx + hd + iability + jd ability + ɛ }{{} unobserved If d 0 i: Selection problem If j = 0: IV recovers the ATE with a valid instrument Z If j 0: IV recovers a local average treatment effect Relative size of LATE vs. ATE depends on what individuals are shifted into treatment by the instrument - e what individuals have higher or lower treatment effects - j Martin Andresen (SSB) Exploring MTEs Oslo, 2018 4 / 25

Marginal treatment effects A generalized Roy model Y j = µ j (X ) + U j for j = 0, 1 (1) Y = DY 1 + (1 D)Y 0 (2) D = 1 [Zγ > V ] where Z = X, Z (3) Without loss of generality normalize the scale of V D = 1 γz > V F V (Zγ) > F V (V ) P(Z) > U D U D U(0, 1): Percentiles of the unobserved resistance Treatment effect: β = Y 1 Y 0 = µ 1 (X ) µ 0 (X ) + U 1 U 0 With essential heterogeneity: Sorting on unobserved gains cov(β, D X ) 0 Treatment decision made with knowledge of unobserved gains Martin Andresen (SSB) Exploring MTEs Oslo, 2018 5 / 25

Marginal treatment effects Marginal Treatment Effects MTE(x, u) E(Y 1 Y 0 U d = u, X = x) = µ 1(x) µ 0(x) + E(U 1 U 0 U D = u) Average β for people with a particular distaste for treatment and x Björklund and Moffitt (1987), Heckman and coauthors (1997; 1999; 2005; 2007), Cornelissen et al. (2016); Brinch et al. (2015). Martin Andresen (SSB) Exploring MTEs Oslo, 2018 6 / 25

Marginal treatment effects LATE vs MTE With two particular values of an instrument, z and z, the Wald estimator is LATE(x) = E(Y X = x, Z = z ) E(Y X = x, Z = z) E(D X = x, Z = z ) E(D X = x, Z = z) This is a Local Average Treatment Effect People who choose treatment when Z = z, but not when Z = z In the choice model: People with P(x, z ) < U D P(x, z): LATE(x, z, z ) = µ 1(x) µ 0(x) + E(U 1 U 0 P(x, z ) < U D P(x, z)) MTE(x, u) = µ 1(x) µ 0(x) + E(U 1 U 0 U D = u) MTE is a limit form of LATE (Heckman, 1997) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 7 / 25

Marginal treatment effects Back to the motivating example unobserved {}}{ D = a+bx + cz + dability + ez ability + µ w = f +gx + hd + iability + jd ability + ɛ }{{} unobserved In the choice model, every omitted variable will enter U 0, U 1, U D. High-ability people will have lower U D if d > 0...and higher unobserved treatment effects (U 1 U 0 ) if j > 0 Should lead to a downward sloping MTE - cov(u 1 U 0, U D ) < 0 This selection pattern is precisely what MTE estimates Martin Andresen (SSB) Exploring MTEs Oslo, 2018 8 / 25

Marginal treatment effects An example MTE curve Marginal Treatment Effects Treatment effect 1 0 1 2 0.1.2.3.4.5.6.7.8.9 1 Unobserved resistance to treatment MTE 95% CI ATE Martin Andresen (SSB) Exploring MTEs Oslo, 2018 9 / 25

Estimating MTEs Standard IV assumptions Interpreting IV as LATE (Imbens and Angrist, 1994) requires: Exclusion Y j Z X. The instrument affect outcomes only through the probability of treatment X Relevance P(z) P(z ). Treatment is a nontrivial function of the instrument Monotonicity P(z) P(z ) i two values of the instrument cannot shift some people in and others out Monotonicity should hold between all possible pairs z, z These assumptions imply and are implied by the model in Eq. 1-3 (Vytlacil, 2002) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 10 / 25

Estimating MTEs The separability assumption Best case scenario: Estimate MTEs with no more assumptions than IV Estimate MTE within each cell of X, aggregate In practice: Limited data and support. Instead assume Separability E(U j X, U D ) = E(U j U D ) Implied by, but weaker than, full independence All X do is shift the MTE curve up or down Same assumption as in selection models Usually also work with linear version of µ j (x) = xβ j Martin Andresen (SSB) Exploring MTEs Oslo, 2018 11 / 25

Estimating MTEs Estimation methods First estimate the propensity scores P(Z) Local Instrumental Variables The derivative of the conditional expectation of Y wrt. p MTE(x, u) = E(Y x,p) p u=p Separate approach Estimate outcome given x, p separately controlling selection MTE(x, u) = E(Y 1 x, U D = u) E(Y 0 x, U D = u) Control selection via control function - similar to selection models Maximum likelihood (joint normal model only) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 12 / 25

Estimating MTEs Functional forms (U 0, U 1, V ) joint normal: Heckman selection E(U j U D = u) = K 1 π k(u k 1 k+1 ): polynomial model Polynomial model with splines Semiparametric model Estimate partial linear model of E(Y X, p) = X β 0 + X (β 1 β 0 )p + K(p) Using double residual regression (Robinson, 1988) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 13 / 25

The mtefe package The mtefe package Acceps fixed effects in all independent varlists Supports weights (pweights, fweights) Supports Local IV, separate approach and maximum likelihood estimation More flexible MTE models, including spline functions Calculates treatment effect parameters from results Analytic standard errors and bootstrap including first stage Improved graphical output (mtefeplot) Brave and Walstrum (2014) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 14 / 25

The mtefe package The mtefe command mtefe depvar [ indepvars ] (depvar t = varlist iv ) [ if ] [ in ] [ weight ] [, polynomial(#) splines(numlist) semiparametric restricted(varlist r ) separate mlikelihood link(string) + other options ] Follows Stata s IV syntax Accepts fixed effects (i.varname) Several options follow similar options in margte Martin Andresen (SSB) Exploring MTEs Oslo, 2018 15 / 25

The mtefe package Example output I. mtefe_gendata, obs(10000) districts(10).. mtefe lwage exp exp2 i.district (col=distcol) Parametric normal MTE model Observations : 10000 Treatment model: Probit Estimation method: Local IV beta0 lwage Coef. Std. Err. t P> t [95% Conf. Interval] exp.0358398.0064408 5.56 0.000.0232145.0484651 exp2 -.0008453.0002019-4.19 0.000 -.0012411 -.0004496 district 2.2352456.0680412 3.46 0.001.1018712.36862 3.6294914.0701091 8.98 0.000.4920634.7669194 4.0131179.0597721 0.22 0.826 -.1040474.1302832 5.0338606.0705835 0.48 0.631 -.1044974.1722186 6.1699366.0605086 2.81 0.005.0513275.2885458 7 -.1899241.060115-3.16 0.002 -.3077617 -.0720865 8 -.1842254.0676843-2.72 0.007 -.3169003 -.0515504 9 -.7908301.0578436-13.67 0.000 -.9042153 -.677445 10 -.4432749.0597237-7.42 0.000 -.5603455 -.3262044 beta1-beta0 _cons 3.164706.0650331 48.66 0.000 3.037228 3.292184 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 16 / 25

The mtefe package Example output II k exp -.0386384.010241-3.77 0.000 -.0587128 -.018564 exp2.0012967.0003288 3.94 0.000.0006523.0019412 district 2.265112.107039 2.48 0.013.0552939.4749301 (output omitted ) 10.3143661.1072555 2.93 0.003.1041237.5246085 _cons.4255863.0983572 4.33 0.000.2327863.6183863 mills -.4790282.0611081-7.84 0.000 -.5988124 -.359244 effects ate.3283373.0242932 13.52 0.000.2807177.3759568 att.5369432.0388809 13.81 0.000.4607287.6131576 atut.1195067.0384691 3.11 0.002.0440995.194914 late.3279726.0245142 13.38 0.000.2799198.3760254 mprte1.3463148.0256971 13.48 0.000.2959433.3966862 mprte2.3309428.024298 13.62 0.000.2833137.3785719 mprte3 -.016257.0498984-0.33 0.745 -.1140679.0815538 Test of observable heterogeneity, p-value 0.0000 Test of essential heterogeneity, p-value 0.0000 Note: Analytical standard errors ignore the facts that the propensity score, (output omitted ) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 17 / 25

The mtefe package Marginal Treatment Effects of college Density 0 2 4 6 8 10 Common support Treatment effect 1 0 1 2 Marginal Treatment Effects 0.2.4.6.8 1 Propensity score Treated Untreated 0.1.2.3.4.5.6.7.8.9 1 Unobserved resistance to treatment MTE 95% CI ATE Martin Andresen (SSB) Exploring MTEs Oslo, 2018 18 / 25

The mtefe package Interpreting heterogeneity The unobserved dimension Depends on what is observed! Positive selection on unobserved gains: MTE is downward sloping In line with predictions from a simple Roy model Consistent with treatment decisions made with knowledge of unobserved gains The observed dimension Positive selection if γ (β 1 β 0 ) > 0 X that leads to more treatment also leads to higher treatment effects Negative selection if γ (β 1 β 0 ) < 0 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 19 / 25

The mtefe package MTEs unify treatment effect parameters Using MTE(x, u), we can calculate any treatment effect parameter as 1 0 ω(u)mte( x, u)du = x(β 1 β 0 ) + 1 0 ω(u)k(u)du ω(u) is the density of U D in the population of interest x is the average x in the population of interest Where the population of interest depends on the parameter: ATE: Everyone, ω(u) = 1, x is average x ATT: Population has D = 1 U D p, x is average among treated ATUT: Population has D = 0 U D > p, x is average among untreated LATE/IV: Population is compliers PRTE/MPRTE: Population is people shifted by policy Martin Andresen (SSB) Exploring MTEs Oslo, 2018 20 / 25

The mtefe package Local Average Treatment Effects Treatment effect ATE LATE 0.5 1 1.5 Marginal Treatment Effects 0.005.01.015.02.025 Weights 0.1.2.3.4.5.6.7.8.9 1 Unobserved resistance to treatment MTE 2SLS MTE (LATE) LATE weights Martin Andresen (SSB) Exploring MTEs Oslo, 2018 21 / 25

Conclusion Practical advice for users Interpret the unobserved dimension in light of observables Know your setting, argue explicitly for what U D could pick up Use this to defend the separability assumption Use semiparametric methods to guide your choice of functional form Show robustness to Choice of functional form Use of estimation method Martin Andresen (SSB) Exploring MTEs Oslo, 2018 22 / 25

Conclusion Conclusion Marginal treatment effects should be in your toolbox Heterogeneous returns is the more reasonable baseline case MTE analysis estimate the full distribution of treatment effects and thus go beyond LATE But usually at the cost of stricter assumptions Unless you have an instrument that work without covariates and generate full support...but MTE aren t all that new - closely related to selection models. The mtefe package does the work for you (please report bugs) Martin Andresen (SSB) Exploring MTEs Oslo, 2018 23 / 25

References References I Björklund, A. and R. Moffitt (1987): The Estimation of Wage Gains and Welfare Gains in Self-selection, The Review of Economics and Statistics, 69, 42 49. Brave, S. and T. Walstrum (2014): Estimating marginal treatment effects using parametric and semiparametric methods, Stata Journal, 14, 191 217(27). Brinch, C., M. Mogstad, and M. Wiswall (2015): Beyond LATE with a discrete instrument, Forthcoming in Journal of Political Economy. Cornelissen, T., C. Dustmann, A. Raute, and U. Schönberg (2016): From LATE to MTE: Alternative methods for the evaluation of policy interventions, Labour Economics, 41, 47 60, sole/eale conference issue 2015. Heckman, J. (1997): Instrumental Variables: A Study of Implicit Behavioral Assumptions Used in Making Program Evaluations, Journal of Human Resources, 32, 441 462. Heckman, J. J. and E. J. Vytlacil (1999): Local instrumental variables and latent variable models for identifying and bounding treatment effects, Proceedings of the National Academy of Sciences of the United States of America, 96(8), 4730 4734. (2005): Structural Equations, Treatment Effects, and Econometric Policy Evaluation, Econometrica, 73, 669 738. Martin Andresen (SSB) Exploring MTEs Oslo, 2018 24 / 25

References References II (2007): Chapter 71 Econometric Evaluation of Social Programs, Part II: Using the Marginal Treatment Effect to Organize Alternative Econometric Estimators to Evaluate Social Programs, and to Forecast their Effects in New Environments, Elsevier, vol. 6, Part B of Handbook of Econometrics, 4875 5143. Imbens, G. W. and J. D. Angrist (1994): Identification and Estimation of Local Average Treatment Effects, Econometrica, 62, pp. 467 475. Robinson, P. (1988): Root- N-Consistent Semiparametric Regression, Econometrica, 56, 931 54. Vytlacil, E. (2002): Independence, Monotonicity, and Latent Index Models: An Equivalence Result, Econometrica, 70, 331 341. Martin Andresen (SSB) Exploring MTEs Oslo, 2018 25 / 25