NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

Similar documents
A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

Policy-Relevant Treatment Effects

Course Description. Course Requirements

The relationship between treatment parameters within a latent variable framework

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

Estimation of Treatment Effects under Essential Heterogeneity

By Marcel Voia. February Abstract

NBER WORKING PAPER SERIES

What s New in Econometrics. Lecture 1

Sensitivity of Propensity Score Methods to the Specifications

Econ 273B Advanced Econometrics Spring

Course Description. Course Requirements

Matching using Semiparametric Propensity Scores

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Course Description. Course Requirements

( ) : : (CUHIES2000), Heckman Vytlacil (1999, 2000, 2001),Carneiro, Heckman Vytlacil (2001) Carneiro (2002) Carneiro, Heckman Vytlacil (2001) :

Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand

Blinder-Oaxaca as a Reweighting Estimator

Implementing Matching Estimators for. Average Treatment Effects in STATA

Principles Underlying Evaluation Estimators

Flexible Estimation of Treatment Effect Parameters

APEC 8212: Econometric Analysis II

Chapter 60 Evaluating Social Programs with Endogenous Program Placement and Selection of the Treated

Econometrics, Harmless and Otherwise

Selection on Observables: Propensity Score Matching.

CALIFORNIA INSTITUTE OF TECHNOLOGY

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ

Generated Covariates in Nonparametric Estimation: A Short Review.

IDENTIFICATION OF THE BINARY CHOICE MODEL WITH MISCLASSIFICATION

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Covariate selection and propensity score specification in causal inference

NBER WORKING PAPER SERIES IS THE SPURIOUS REGRESSION PROBLEM SPURIOUS? Bennett T. McCallum. Working Paper

Marginal Treatment Effects from a Propensity Score Perspective

Notes on causal effects

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Evaluating Social Programs with Endogenous Program Placement and Selection of the Treated 1

Jinyong Hahn. Department of Economics Tel: (310) Bunche Hall Fax: (310) Professional Positions

Controlling for overlap in matching

EC 709: Advanced Econometrics II Fall 2009

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1

A Distinction between Causal Effects in Structural and Rubin Causal Models

Parametric Identification of Multiplicative Exponential Heteroskedasticity

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

NBER WORKING PAPER SERIES STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION. James J. Heckman Edward Vytlacil

Dummy Endogenous Variables in Weakly Separable Models

Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata

A Simple Way to Calculate Confidence Intervals for Partially Identified Parameters. By Tiemen Woutersen. Draft, September

DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS. A SAS macro to estimate Average Treatment Effects with Matching Estimators

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Trimming for Bounds on Treatment Effects with Missing Outcomes *

NONPARAMETRIC ESTIMATION OF AVERAGE TREATMENT EFFECTS UNDER EXOGENEITY: A REVIEW*

EVALUATING EDUCATION POLICIES WHEN THE RATE OF RETURN VARIES ACROSS INDIVIDUALS PEDRO CARNEIRO * Take the standard model of potential outcomes:

Causal Inference with and without Experiments I

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Econometric Causality

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

12E016. Econometric Methods II 6 ECTS. Overview and Objectives

studies, situations (like an experiment) in which a group of units is exposed to a

Causal Inference Basics

The New Palgrave Dictionary of Economics Online

A Measure of Robustness to Misspecification

Using matching, instrumental variables and control functions to estimate economic choice models

Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Finite Sample Properties of Semiparametric Estimators of Average Treatment Effects

Endogenous binary choice models with median restrictions: A comment

SUPPOSE a policy is proposed for adoption in a country.

The Information Basis of Matching with Propensity Score

Exploring Marginal Treatment Effects

INVERSE PROBABILITY WEIGHTED ESTIMATION FOR GENERAL MISSING DATA PROBLEMS

Econometric Methods for Ex Post Social Program Evaluation

Bayesian Interpretations of Heteroskedastic Consistent Covariance Estimators Using the Informed Bayesian Bootstrap

Efficient Semiparametric Estimation of Quantile Treatment Effects

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil

Econ 673: Microeconometrics

Randomized trials for policy

Comparing IV with Structural Models: What Simple IV Can and Cannot Identify

Counterfactual worlds

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Random Coefficients on Endogenous Variables in Simultaneous Equations Models

Estimation of the Conditional Variance in Paired Experiments

Conditions for the Existence of Control Functions in Nonseparable Simultaneous Equations Models 1

A Course in Applied Econometrics. Lecture 2 Outline. Estimation of Average Treatment Effects. Under Unconfoundedness, Part II

The Evaluation of Social Programs: Some Practical Advice

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

NBER WORKING PAPER SERIES USE OF PROPENSITY SCORES IN NON-LINEAR RESPONSE MODELS: THE CASE FOR HEALTH CARE EXPENDITURES

Large Sample Properties of Matching Estimators for Average Treatment Effects

Understanding Instrumental Variables in Models with Essential Heterogeneity

Evaluating Nonexperimental Estimators for Multiple Treatments: Evidence from a Randomized Experiment

TECHNICAL WORKING PAPER SERIES USING WEIGHTS TO ADJUST FOR SAMPLE SELECTION WHEN AUXILIARY INFORMATION IS AVAILABLE. Aviv Nevo

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Transcription:

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES James J. Heckman Petra E. Todd Working Paper 15179 http://www.nber.org/papers/w15179 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue Cambridge, MA 02138 July 2009 This research was supported by NSF SBR 93-21-048 and NSF 97-09-873 and NICHD 40-4043-000-85-261. The views expressed herein are those of the author(s) and do not necessarily reflect the views of the National Bureau of Economic Research. NBER working papers are circulated for discussion and comment purposes. They have not been peerreviewed or been subject to the review by the NBER Board of Directors that accompanies official NBER publications. 2009 by James J. Heckman and Petra E. Todd. All rights reserved. Short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including notice, is given to the source.

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples James J. Heckman and Petra E. Todd NBER Working Paper No. 15179 July 2009 JEL No. C13,C51 ABSTRACT The probability of selection into treatment plays an important role in matching and selection models. However, this probability can often not be consistently estimated, because of choice-based sampling designs with unknown sampling weights. This note establishes that the selection and matching procedures can be implemented using propensity scores fit on choice-based samples with misspecified weights, because the odds ratio of the propensity score fit on the choice-based sample is monotonically related to the odds ratio of the true propensity scores. James J. Heckman Department of Economics The University of Chicago 1126 E. 59th Street Chicago, IL 60637 and NBER jjh@uchicago.edu Petra E. Todd Department of Economics University of Pennsylvania 3718 Locust Walk Philadelphia, PA 19104 and NBER ptodd@econ.upenn.edu

1 Introduction The probability of selection into a treatment, also called the propensity score, plays a central role in classical selection models and in matching models (see, e.g., Heckman, 1980; Heckman and Navarro, 2004; Heckman and Vytlacil, 1 2007; Hirano et al., 2003; Rosenbaum and Rubin, 1983). Heckman and Robb (1986, reprinted 2000), Heckman and Navarro (2004) and Heckman and Vytlacil (2007) show how the propensity score is used differently in matching and selection models. They also show that, given the propensity score, both matching and selection models are robust to choice-based sampling, which occurs when treatment group members are over- or under-represented relative to their frequency in the population. Choice-based sampling designs are frequently chosen in evaluation studies to reduce the costs of data collection and to obtain more observations on treated individuals. Given a consistent estimate of the propensity score, matching and classical selection methods are robust to choice-based sampling, because both are defined conditional on treatment and comparison group status. This note extends the analysis of Heckman and Robb (1985),Heckman and Robb (1986, reprinted 2000) to consider the case where population weights are unknown so that the propensity score cannot be consistently estimated. In evaluation settings, the population weights are often unknown or cannot 1 It also plays a key role in instrumental variables models (see Heckman et al., 2006). Heckman and Vytlacil (2007) discuss the different role played by the propensity score in matching IV and selection models. 2

easily be estimated. 2 For example, for the National Supported Work training program studied in LaLonde (1986), Dehejia and Wahba (1999, 2002) and in Smith and Todd (2005), the population consists of all persons eligible for the program, which was targeted at drug addicts, ex-convicts, and welfare recipients. Few datasets have the information necessary to determine whether a person is eligible for the program, so it would be difficult to estimate the population weights needed to consistently estimate propensity scores. In this note, we establish that matching and selection procedures can still be applied when the propensity score is estimated on unweighted choice based samples. The idea is simple. To implement both matching and classical selection models, only a monotonic transformation of the propensity score is required. In choice based samples, the odds ratio of the propensity score estimated using misspecified weights is monotonically related to the odds ratio of the true propensity scores. Thus, selection and matching procedures can identify population treatment effects using misspecified estimates of propensity scores fit on choice-based samples. 2 Discussion of the Proposition Let D = 1 if a person is a treatment group member; D = 0 if the person is a member of the comparison group. X = x is a realization of X. In the 2 The methods of Manski and Lerman (1977) and Manski (1986) for adjusting for choicebased sampling in estimating the discrete choice probabilities cannot be applied when the weights are unknown and cannot be identified from the data. 3

population generated from random sampling, the joint density is g(d, x) = [Pr(D = 1 x)] d [P r(d = 0 x)] 1 d g(x) for D = d, d {0, 1}, where g is the density of the data. By Bayes s theorem, we have, letting P r(d = 1) = P, (1a) g(x D = 1)P = g(x)p r(d = 1 x) and (1b) g(x D = 0)(1 P ) = g(x)p r(d = 0 x). Take the ratio of (1a) to (1b) (2) g(x D=1) g(x D=0) ( P 1 P ) = P r(d=1 x) P r(d=0 x). Assume 0 < P r(d = 1 x) < 1. From knowledge of the densities of the data in the two samples, g(x D = 1) and g(x D = 0), one can form a scalar multiple of the ratio of the propensity score without knowing P. The odds ratio is a monotonic function of the propensity score that does not require knowledge of the true sample weights. In a choice-based sample, both the numerator and denominator of the first term in (2) can be consistently estimated. This monotonic function can replace P (x) in implementing both matching and nonparametric selection models. However, estimating g(x D = d) is demanding of the data when X is of high dimension. Instead of estimating these densities, we can substitute for the left hand side of (2) the odds ratio of the estimated conditional probabilities obtained using the choice-based sample with the wrong weights. 4

(i.e. for example, ignoring the fact that the data are a choice based sample). The odds ratio of the estimated probabilities is a scalar multiple of the true odds ratio. It can therefore be used instead of P r(d = 1 X) to match or construct nonparametric control functions in selection bias models. In the choice-based sample, let P r(d = 1 x) be the conditional probability that D = 1 and P be the unconditional probability of sampling D = 1, where P P, the true population proportion. The joint density of the data from the sampled population is [g(x D = 1)P ] d [g(x D = 0)(1 P )] 1 d. Using (1a) and (1b) to solve for g(x D = 1) and g(x D = 0) one may write the data density as so [ P r(d=1 x)g(x) P ] d [ ] 1 d P P r(d=0 x)g(x) (1 P ) (1 P ) P r(d=1 x)g(x) (3a) P r(d = 1 x) = P P g(x D=1)P +g(x D=0)(1 P ) and P r(d=0 x)g(x) (3b) P r(d = 0 x) = 1 P 1 P. g(x D=1)P +g(x D=0)(1 P ) Under random sampling, the right-hand sides of (3a) and (3b) are the limits to which the choice-based probabilities converge. Taking the ratio of (3a) to (3b), assuming the latter is not zero, one obtains P r(d=1 x) (4) P = P r(d=1 x) ( P ) ( 1 P ) r(d=0 x) P r(d=0 x) 1 P P. Thus, one can estimate the ratio of the propensity score up to scale (the scale is the product of the two terms on the right-hand side of (4)). Instead 5

of estimating matching or semiparametric selection models using Pr(D = 1 x) (as in, for example, Ahn and Powell (1993); Heckman (1980); Heckman and Hotz (1989); Heckman et al. (1998); Heckman and Robb (1986); Powell (2001), one can, instead, use the odds ratio of the estimate P r(d = 1 x), which is monotonically related to the true P r(d = 1 x). In the case of a logit P (x), P (x) = exp(xβ)/(1 + exp(xβ)), the log of this ratio becomes ln P r(d=1 x) P r(d=0 x) = x β where the slope coefficients are the true values and the intercept β 0 = β 0 + ln(p /(1 P )) + ln((1 P )/P )), where β 0 is the true value. 3 In implementing nearest-neighbor matching estimators, matching on the log odds ratio gives identical estimates to matching on the (unknown) Pr(D = 1 x), because the odds ratio preserves the ranking of the neighbors. In application of either matching or classical selection bias correction methods, one must account for the usual problems of using estimated log odds ratios instead of true values. 4 3 See Manski and McFadden (1981, p. 26). 4 For discussion related to using estimated propensity scores, see Hahn (1998); Heckman et al. (1998); Heckman et al. (1998); Hirano et al. (2003). 6

References Ahn, H. and J. Powell (1993, July). Semiparametric estimation of censored selection models with a nonparametric selection mechanism. Journal of Econometrics 58 (1-2), 3 29. Dehejia, R. and S. Wahba (1999, December). Causal effects in nonexperimental studies: Reevaluating the evaluation of training programs. Journal of the American Statistical Association 94 (448), 1053 1062. Dehejia, R. and S. Wahba (2002, February). methods for nonexperimental causal studies. Propensity score matching Review of Economics and Statistics 84 (1), 151 161. Hahn, J. (1998, March). On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica 66 (2), 315 31. Heckman, J. J. (1980). Addendum to sample selection bias as a specification error. In E. Stromsdorfer and G. Farkas (Eds.), Evaluation Studies Review Annual, Volume 5. Beverly Hills: Sage Publications. Heckman, J. J. and V. J. Hotz (1989, December). Choosing among alternative nonexperimental methods for estimating the impact of social programs: The case of Manpower Training. Journal of the American Statistical Association 84 (408), 862 874. Rejoinder also published in Vol. 84, No. 408, (Dec. 1989). 7

Heckman, J. J., H. Ichimura, J. Smith, and P. E. Todd (1998, September). Characterizing selection bias using experimental data. Econometrica 66 (5), 1017 1098. Heckman, J. J., H. Ichimura, and P. E. Todd (1998, April). Matching as an econometric evaluation estimator. Review of Economic Studies 65 (223), 261 294. Heckman, J. J. and S. Navarro (2004, February). Using matching, instrumental variables, and control functions to estimate economic choice models. Review of Economics and Statistics 86 (1), 30 57. Heckman, J. J. and R. Robb (1985, October-November). Alternative methods for evaluating the impact of interventions: An overview. Journal of Econometrics 30 (1-2), 239 267. Heckman, J. J. and R. Robb (1986). Alternative methods for solving the problem of selection bias in evaluating the impact of treatments on outcomes. In H. Wainer (Ed.), Drawing Inferences from Self-Selected Samples, pp. 63 107. New York: Springer-Verlag. Reprinted in 2000, Mahwah, NJ: Lawrence Erlbaum Associates. Heckman, J. J., S. Urzua, and E. J. Vytlacil (2006). Understanding instrumental variables in models with essential heterogeneity. Review of Economics and Statistics 88 (3), 389 432. 8

Heckman, J. J. and E. J. Vytlacil (2007). Econometric evaluation of social programs, part II: Using the marginal treatment effect to organize alternative economic estimators to evaluate social programs and to forecast their effects in new environments. In J. Heckman and E. Leamer (Eds.), Handbook of Econometrics, Volume 6B, pp. 4875 5144. Amsterdam: Elsevier. Hirano, K., G. W. Imbens, and G. Ridder (2003, July). Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 71 (4), 1161 1189. LaLonde, R. J. (1986, September). Evaluating the econometric evaluations of training programs with experimental data. American Economic Review 76 (4), 604 620. Manski, C. F. (1986, February). Semiparametric analysis of binary response from response-based samples. Journal of Econometrics 31 (1), 31 40. Manski, C. F. and S. R. Lerman (1977, November). The estimation of choice probabilities from choice based samples. Econometrica 45 (8), 1977 1988. Manski, C. F. and D. McFadden (1981). Statistical analysis of discrete probability models. In C. F. Manski and D. McFadden (Eds.), Structural Analysis of Discrete Data with Econometric Applications, pp. 2 49. Cambridge, MA: MIT Press. Powell, J. L. (2001). Semiparametric estimation of bivariate latent variable models. In C. Hsiao, K. Morimune, and J. L. Powell (Eds.), Nonlinear 9

Statistical Modeling: Proceedings of the Thirteenth International Symposium in Economic Theory and Econometrics: Essays in Honor of Takeshi Amemiya. New York: Cambridge University Press. Rosenbaum, P. R. and D. B. Rubin (1983, April). The central role of the propensity score in observational studies for causal effects. Biometrika 70 (1), 41 55. Smith, J. A. and P. E. Todd (2005, March-April). Does matching overcome LaLonde s critique of nonexperimental estimators? Journal of Econometrics 125 (1-2), 305 353. 10