WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

Similar documents
SREE WORKSHOP ON PRINCIPAL STRATIFICATION MARCH Avi Feller & Lindsay C. Page

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

arxiv: v1 [stat.me] 8 Jun 2016

Modeling Mediation: Causes, Markers, and Mechanisms

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small

Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications

Survivor Bias and Effect Heterogeneity

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University

Instrumental Variables

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

The problem of causality in microeconometrics.

AGEC 661 Note Fourteen

Sharp Bounds on Causal Effects under Sample Selection*

Noncompliance in Randomized Experiments

Recitation Notes 6. Konrad Menzel. October 22, 2006

Potential Outcomes Model (POM)

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

Introduction to Causal Inference. Solutions to Quiz 4

Relaxing Latent Ignorability in the ITT Analysis of Randomized Studies with Missing Data and Noncompliance

The problem of causality in microeconometrics.

Identification and Estimation of Causal Mediation Effects with Treatment Noncompliance

The Causal Inference Problem and the Rubin Causal Model

Introduction to Statistical Inference

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures

Recitation Notes 5. Konrad Menzel. October 13, 2006

Statistical Models for Causal Analysis

Gov 2002: 4. Observational Studies and Confounding

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance

Using Post Outcome Measurement Information in Censoring by Death Problems

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

Harvard University. Harvard University Biostatistics Working Paper Series

Bounds on Average and Quantile Treatment Effects of Job Corps Training on Wages*

Bounds on Average and Quantile Treatment Effects on Duration Outcomes under Censoring, Selection, and Noncompliance 1

150C Causal Inference

Causal Inference with Counterfactuals

Potential Outcomes and Causal Inference I

Front-Door Adjustment

Propensity Score Methods for Causal Inference

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Treatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments

Evaluating the Effect of Training on Wages in the Presence of Noncompliance, Nonemployment, and Missing Outcome Data

Sensitivity checks for the local average treatment effect

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

CompSci Understanding Data: Theory and Applications

arxiv: v1 [stat.me] 3 Feb 2016

Bayesian Inference for Causal Mediation Effects Using Principal Stratification with Dichotomous Mediators and Outcomes

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials

Modeling Log Data from an Intelligent Tutor Experiment

Econometric Causality

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Estimating Post-Treatment Effect Modification With Generalized Structural Mean Models

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

Identification of Causal Parameters in Randomized Studies with Mediating Variables

Estimating causal effects in trials involving multitreatment arms subject to non-compliance: a Bayesian framework

PSC 504: Instrumental Variables

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1

Quantitative Economics for the Evaluation of the European Policy

Sharp IV bounds on average treatment effects on the treated and other populations under endogeneity and noncompliance

Impact Evaluation Technical Workshop:

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.

Instrumental Variables

CEPA Working Paper No

Supplemental Material for Policy Deliberation and Voter Persuasion: Experimental Evidence from an Election in the Philippines

Bounds on Average and Quantile Treatment E ects of Job Corps Training on Participants Wages

Instrumental Variables

Instrumental Variables in Action: Sometimes You get What You Need

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized. Clinical Trials Subject to Noncompliance.

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

Nested Markov Compliance Class Model in the Presence of Time-Varying Noncompliance

ECON Introductory Econometrics. Lecture 17: Experiments

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

On Variance Estimation for 2SLS When Instruments Identify Different LATEs

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation

Instrumental Variables

PSC 504: Differences-in-differeces estimators

arxiv: v1 [stat.ap] 6 Mar 2018

Bounds on Average and Quantile Treatment Effects of Job Corps Training on Wages*

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Experiments and Quasi-Experiments

Going Beyond LATE: Bounding Average Treatment Effects of Job Corps Training

Empirical approaches in public economics

Comparative effectiveness of dynamic treatment regimes

Potential Outcomes and Causal Inference

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Ratio-of-Mediator-Probability Weighting for Causal Mediation Analysis. in the Presence of Treatment-by-Mediator Interaction

Propensity Score Weighting with Multilevel Data

Experimental Designs for Identifying Causal Mechanisms

Final Exam. Economics 835: Econometrics. Fall 2010

Identification and Estimation of Causal Mechanisms in Clustered Encouragement Designs: Disentangling Bed Nets Using Bayesian Principal Stratification

Transcription:

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, 2016 Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

Our team! 2 Avi Feller (Berkeley) Jane Furey (Abt Associates) Todd Grindal (Abt Associates) Luke Miratrix (Harvard) Lindsay Page (University of Pittsburgh) Marie-Andree Somers (MDRC) Rebecca Unterman (MDRC)

Agenda 3 Conceptual framework & key assumptions Break Small group exercise Principal stratification set up Defining estimands of interest Lunch Estimation & bounds Break Wrap up

Potential outcomes framework 4 ID Y i (Z i =1) Y i (Z i =0) Z i 1 105? 1 2 90? 1 3 87? 1 4 111? 1 5? 92 0 6? 80 0 Each individual has the potential to experience assignment to the treatment condition or assignment to the control condition. Therefore, the potential outcomes for an individual are what we would see under each condition. 7? 103 0 8? 97 0

Potential outcomes framework 5 ID Y i (Z i =1) Y i (Z i =0) Z i 1 105? 1 2 90? 1 3 87? 1 4 111? 1 5? 92 0 6? 80 0 7? 103 0 8? 97 0 For outcome Y, the causal effect of the treatment (Z) for individual i is: Y i (1) Y i (0) We can never compute this directly, but it can be estimated. The Average Treatment Effect (ATE) for the sample is: 1 n i Y i 1 Y i 0 This also cannot be computed directly but can be estimated.

Potential outcomes framework: Key assumptions 6 Assumption 1: Ignorability. Whether we observe Y i (1) or Y i (0) for individual i is unrelated to the values of the potential outcomes. Random assignment allows us to make this assumption. E Y i 1 Y i 0 =E Y i 1 Z i = 1] E[Y i 0 Z i = 0 Assumption 2: Stable Unit Treatment Value Assumption (SUTVA). The observed outcome for each individual is a function of the potential outcomes and random assignment and is not dependent, for example, on the assignment of other individuals. The potential outcomes are fixed pre-treatment variables. Therefore, Y obs,i = Z i Y i 1 + (1 Z i ) Y i 0 E Y i 1 Z i = 1] E[Y i 0 Z i = 0 = E Y obs,i Z i = 1] E[Y obs,i Z i = 0

Potential outcomes framework: ITT 7 ITT = E Y obs,i Z i = 1] E[Y obs,i Z i = 0 The Intent to Treat (ITT) estimand is the causal impact on outcome Y of being assigned to the treatment. Of course, we may be interested in assessing impacts among those who would actually take up the treatment. Therefore, we need to think about the post-treatment or intermediate behaviors (such as treatment compliance) that we care about.

Potential values of intermediate behavior 8 Potential outcomes Potential intermediate behavior ID Y i (Z i =1) Y i (Z i =0) D i (Z i =1) D i (Z i =0) Z i 1 105?? 1 2 90?? 1 3 87?? 1 4 111?? 1 5? 92? 0 6? 80? 0 7? 103? 0 8? 97? 0

Potential values of intermediate behavior 9 ID D i (Z i =1) D i (Z i =0) Z i 1 1? 1 2 1? 1 3 0? 1 1. Suppose the intermediate behavior is binary treatment take up. 4 0? 1 5? 0 0 6? 1 0 7? 0 0 8? 1 0

Potential values of intermediate behavior 10 ID D i (Z i =1) D i (Z i =0) Z i 1 1 0 1 2 1 1 1 3 0 0 1 4 0 1 1 5 1 0 0 6 1 1 0 7 0 0 0 8 0 1 0 1. Suppose the intermediate behavior is binary treatment take up. 2. Suppose that we are able to observe both potential values of the intermediate behavior for each person. Then, we could classify individuals according to the potential values of the intermediate behavior of interest (such as exposure to the treatment).

Potential values of intermediate behavior 11 ID D i (Z i =1) D i (Z i =0) Z i 1 1 0 1 2 1 1 1 Compliers: Take up treatment under assignment to treatment but not under assignment to control 3 0 0 1 4 0 1 1 5 1 0 0 6 1 1 0 7 0 0 0 8 0 1 0

Potential values of intermediate behavior 12 ID D i (Z i =1) D i (Z i =0) Z i 1 1 0 1 2 1 1 1 3 0 0 1 4 0 1 1 Compliers: Take up treatment under assignment to treatment but not under assignment to control Always-takers: Always take up treatment 5 1 0 0 6 1 1 0 7 0 0 0 8 0 1 0

Potential values of intermediate behavior 13 ID D i (Z i =1) D i (Z i =0) Z i 1 1 0 1 2 1 1 1 3 0 0 1 4 0 1 1 5 1 0 0 6 1 1 0 Compliers: Take up treatment under assignment to treatment but not under assignment to control Always-takers: Always take up treatment Never-takers: Never take up treatment 7 0 0 0 8 0 1 0

Potential values of intermediate behavior 14 ID D i (Z i =1) D i (Z i =0) Z i 1 1 0 1 2 1 1 1 3 0 0 1 4 0 1 1 5 1 0 0 6 1 1 0 7 0 0 0 8 0 1 0 Compliers: Take up treatment under assignment to treatment but not under assignment to control Always-takers: Always take up treatment Never-takers: Never take up treatment Defiers: Take up treatment under assignment to control but not under assignment to treatment

Potential values of intermediate behavior 15 Principal Strata (Frangakis & Rubin, 2002) Stratum membership is defined by answers to two questions: 1. What were individuals intermediate behaviors, given their randomized treatment assignments? 2. What would their intermediate behaviors have been, had they been assigned to the counterfactual experimental condition? Compliers: Comply with treatment under assignment to treatment but not under assignment to control Always-takers: Always take up treatment Never-takers: Never take up treatment Defiers: Take up treatment under assignment to control but not under assignment to treatment

Potential values of intermediate behavior 16 Recall Stable Unit Treatment Value Assumption (SUTVA). The observed outcome for each individual is a function of the potential outcomes and random assignment and are not dependent, for example, on the assignment of other individuals. Y obs,i = Z i Y i 1 + (1 Z i ) Y i 0 The same assumption applies to values of intermediate outcomes: D obs,i = Z i D i 1 + (1 Z i ) D i 0 Another way of saying this is that how an individual responds to the offer or lack thereof of a particular treatment is a characteristic of that individual and is not, for example, influenced by the random assignment of others. Compliance type, as defined by the joint values of the intermediate behaviors, is a pre-treatment covariate (even though it is not fully observed).

Subgroup-specific ITT estimates 17 Overall ITT = E Y obs,i Z i = 1] E[Y obs,i Z i = 0 Because treatment assignment is uncorrelated with baseline characteristics, we can consider subgroup-specific ITT effects. For example: Overall ITT for males = E Y obs,i Z i = 1, male i = 1] E[Y obs,i Z i = 0, male i = 1 Overall ITT for females = E Y obs,i Z i = 1, male i = 0] E[Y obs,i Z i = 0, male i = 0 The ITT effect within each subgroup is a causal effect of treatment. HOWEVER, the comparison of causal effects across the two subgroups is a descriptive exercise. We could say descriptively, for example, that the treatment effect was larger for males than it was for females, but we can t say that gender was the cause of the differential in treatment effect, because we have not experimentally manipulated gender.

Subgroup-specific ITT estimates 18 Overall ITT = E Y obs,i Z i = 1] E[Y obs,i Z i = 0 Just as we can express ITT effects for fully observed subgroups, we can also express ITT effects for partially observed (or latent) subgroups. ITT C = E Y obs,i Z i = 1, s i = c] E[Y obs,i Z i = 0, s i = c ITT AT = E Y obs,i Z i = 1, s i = at] E[Y obs,i Z i = 0, s i = at ITT NT = E Y obs,i Z i = 1, s i = nt] E[Y obs,i Z i = 0, s i = nt ITT D = E Y obs,i Z i = 1, s i = d] E[Y obs,i Z i = 0, s i = d These are causal effects. Comparison of ITT effects across subgroups is a descriptive exercise, as we have not randomly manipulated compliance type.

Z = 1 Principal strata 19 Z = 0 D = 1 D = 0 D = 1 Always takers Compliers D = 0 Defiers Never takers

Z = 1 Principal strata 20 While the defined strata are the subgroups of interest, they are not fully observed. Z = 0 D = 1 D = 0 D = 1 Always takers Compliers D = 0 Defiers Never takers And so, outcome distributions for groups defined by treatment assignment and treatment take up (e.g., what we are actually able to observe) are mixtures of distributions across our strata of interest. Therefore, we apply sets of assumptions (justified by science, field knowledge and/or theory) to make progress and simplify an otherwise very hard estimation problem.

Z = 1 Removing boxes: the Monotonicity Assumption 21 D i 1 D i 0 Z = 0 D = 1 D = 0 D = 1 Always takers Compliers D = 0 Defiers Never takers In IV, monotonicity is a standard assumption which states that the potential value of the intermediate variable under assignment to treatment is at least as large as the potential value of the intermediate variable under assignment to control. For example, assignment to treatment can t decrease tendency to take treatment. Therefore, applying the monotonicity assumption leads us to assume away defiers.

Z = 1 Constraining the outcomes: the Exclusion Restriction 22 ITT AT = 0 ITT NT = 0 Z = 0 D = 1 D = 0 D = 1 Always takers Compliers D = 0 Defiers Never takers In IV, the exclusion restriction states that random assignment can only impact the outcome via change induced in the level of the intermediate variable (i.e., that this intermediate variable is the only causal pathway between randomization and the outcome.) IV, therefore, relies on monotonicity and the exclusion restriction, but we could imagine circumstances where we would want to relax one or both of these assumptions.

Generalizing beyond compliance 23 We can use the same framework and process for articulating a variety of types of research questions: Naturally occurring variation in control setting (ex. Impact of ECHS by quality of high school otherwise attended) Truncation-due-to-death (ex. Quality of life indicator only observable for subjects who remain alive) Surrogate outcome (ex. Long run impacts of career academy high school opportunity on earnings realized primarily by those who experienced a significant rise in labor market exposure during high school) Process to formulate and set up the estimation problem is similar in each case

24 Setting up principal strata & defining treatment effects of interest To utilize the principal stratification framework, ask: 1. What is the treatment that was randomized? 2. What is the intermediate behavior of interest? 3. What are the strata? 4. What is / are the outcome(s) of interest? 5. In which strata is the treatment effect well defined? 6. What are justifiable assumptions that can reasonably be applied to: a. reduce the number of strata, and/or b. constrain the treatment effects within certain strata? in order to make the problem more tractable?

Agenda 25 Conceptual framework & key assumptions Break Small group exercise Principal stratification set up Defining estimands of interest Lunch Estimation & bounds Break Wrap up

Agenda 26 Conceptual framework & key assumptions Break Small group exercise Principal stratification set up Defining estimands of interest Lunch Estimation & bounds Break Wrap up

Application 1: Truncation-by-Death 27 Outcome of interest is not defined for all sample members Not censoring: there is no underlying value to the outcome Examples: Causal effect of post-stroke procedure on a quality-of-life metric can only be assessed for those who would remain alive with or without procedure Lee (2009): In the study of a given job training program, is the positive effect on total earnings a function of increased employment (extensive margin) or an increased wage rate (intensive margin) or both? Any improvement in wage rate can only be assessed for those who would be employed with or without the intervention.

Application 1: Truncation-by-Death 28 Research Question: In the study of a given job training program, is the positive effect on total earnings a function of increased employment or an increased wage rate or both? Treatment Offer of job training program Outcome(s) Hourly wage Key intermediate behavior Employment Lee (2009): Wages are only observed for individuals who are employed. Thus, even if there is random assignment of the treatment of a training program, there may not only be an effect on wages but also on the probability that a person s wage will even be observed. Even a randomized experiment cannot guarantee that treatment and control individuals will be comparable conditional on being employed.

Job training offer = 1 Application 1: Truncation-by-Death 29 Defining all possible principal strata Job training offer = 0 Employed Not employed Employed Not employed

Job training offer = 1 Application 1: Truncation-by-Death 30 Defining all possible principal strata Job training offer = 0 Employed Not employed Employed Always employed Employed if treated Not employed Employed if not treated Never employed

Job training offer = 1 Application 1: Truncation-by-Death 31 What assumptions can be applied to reduce the number of strata? Job training offer = 0 Employed Not employed Employed Always employed Employed if treated Not employed Employed if not treated Never employed Is it a reasonable assumption that individuals are more likely to be employed under treatment than under control? Or, might the training raise the lowest wage certain workers are willing to accept, thereby potentially lowering employment for some if it is not possible to attain a job with this higher acceptable wage?

Job training offer = 1 Application 1: Truncation-by-Death 32 In which stratum / strata is the treatment effect well defined? Job training offer = 0 Employed Not employed Employed Always employed Employed if treated Not employed Employed if not treated Never employed In these strata, the treatment effect on hourly wages is not defined, because hourly wage is not defined for those who are not employed. In the employed if treated stratum, we observe hourly wages only under treatment and in the never employed stratum, we observe wages under neither treatment nor control.

Job training offer = 1 Application 1: Truncation-by-Death 33 In which stratum / strata is the treatment effect well defined? Job training offer = 0 Employed Not employed Employed Always employed Employed if treated Not employed Employed if not treated Never employed Individuals in this stratum are employed regardless of the intervention, so for this stratum only can we estimate the causal effect of the intervention on hourly wages. In the language of truncation-by-death, the treatment effect in this stratum is referred to as the Survivor Average Causal Effect (SACE).

34 Small group exercise

Role of assumptions 35 Common assumptions Exclusion restriction(s) constraining outcomes Monotonicity removing boxes Irrelevant alternatives removing boxes Key take-away regarding assumptions: Critical part of analytic set up The more groups we can eliminate, the better / more precise our impact estimates will be Assumptions are unverifiable, so they must be substantively justified

36 Questions / Discussion / Wrap-up

Agenda 37 Conceptual framework & key assumptions Break Small group exercise Principal stratification set up Defining estimands of interest Lunch Estimation & bounds Break Wrap up