Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Similar documents
Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Experimental Designs for Identifying Causal Mechanisms

Causal Mechanisms Short Course Part II:

Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Statistical Analysis of Causal Mechanisms

Statistical Analysis of Causal Mechanisms

Causal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms

Identification and Inference in Causal Mediation Analysis

Statistical Analysis of Causal Mechanisms for Randomized Experiments

Statistical Analysis of Causal Mechanisms

Identification and Sensitivity Analysis for Multiple Causal Mechanisms: Revisiting Evidence from Framing Experiments

Experimental Designs for Identifying Causal Mechanisms

Identification, Inference, and Sensitivity Analysis for Causal Mediation Effects

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Introduction to Statistical Inference

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Telescope Matching: A Flexible Approach to Estimating Direct Effects

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

mediation: R Package for Causal Mediation Analysis

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Bayesian Causal Mediation Analysis for Group Randomized Designs with Homogeneous and Heterogeneous Effects: Simulation and Case Study

R and Stata for Causal Mechanisms

Identification, Inference and Sensitivity Analysis for Causal Mediation Effects

Regression Discontinuity Designs

Telescope Matching: A Flexible Approach to Estimating Direct Effects *

Discussion of Papers on the Extensions of Propensity Score

Statistical Models for Causal Analysis

On the Use of Linear Fixed Effects Regression Models for Causal Inference

Published online: 19 Jun 2015.

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Difference-in-Differences Methods

Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Noncompliance in Randomized Experiments

Statistical Analysis of List Experiments

Mediation: Background, Motivation, and Methodology

Regression Discontinuity Designs.

Help! Statistics! Mediation Analysis

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Gov 2002: 13. Dynamic Causal Inference

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Causal Directed Acyclic Graphs

Identification and Estimation of Causal Mediation Effects with Treatment Noncompliance

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

AGEC 661 Note Fourteen

Covariate Balancing Propensity Score for General Treatment Regimes

1 Motivation for Instrumental Variable (IV) Regression

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Item Response Theory for Conjoint Survey Experiments

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula.

Casual Mediation Analysis

arxiv: v3 [stat.me] 4 Apr 2018

Causal mediation analysis: Definition of effects and common identification assumptions

Discussion: Can We Get More Out of Experiments?

ECO Class 6 Nonparametric Econometrics

Econometrics of Policy Evaluation (Geneva summer school)

Statistical Analysis of the Item Count Technique

ONLINE APPENDICES Emotions and the Micro-Foundations of Commitment Problems

Introduction to Causal Inference. Solutions to Quiz 4

Flexible Estimation of Treatment Effect Parameters

Combining Difference-in-difference and Matching for Panel Data Analysis

EMERGING MARKETS - Lecture 2: Methodology refresher

150C Causal Inference

Gov 2002: 4. Observational Studies and Confounding

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Conceptual overview: Techniques for establishing causal pathways in programs and policies

Modeling Mediation: Causes, Markers, and Mechanisms

High Dimensional Propensity Score Estimation via Covariate Balancing

Two-Variable Regression Model: The Problem of Estimation

Sensitivity checks for the local average treatment effect

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE

Principles Underlying Evaluation Estimators

What s New in Econometrics. Lecture 1

14.74 Lecture 10: The returns to human capital: education

Gov 2002: 3. Randomization Inference

Advanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

PSC 504: Instrumental Variables

Instrumental Variables

Combining multiple observational data sources to estimate causal eects

mediation: R Package for Causal Mediation Analysis

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Econometric Causality

Front-Door Adjustment

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

The Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

An Experimental Evaluation of High-Dimensional Multi-Armed Bandits

Robust Estimation of Inverse Probability Weights for Marginal Structural Models

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Regression Discontinuity

Transcription:

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University Joint work with Keele (Ohio State), Tingley (Harvard), Yamamoto (Princeton) January 19, 2011 Harris School of Public Policy The University of Chicago Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 1 / 31 Identification of Causal Mechanisms Causal inference is a central goal of scientific research Scientists care about causal mechanisms, not just about causal effects Randomized experiments often only determine whether the treatment causes changes in the outcome Not how and why the treatment affects the outcome Common criticism of experiments and statistics: black box view of causality Question: How can we learn about causal mechanisms from experimental and observational studies? Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 2 / 31

Goals of the Talk Present a general framework for statistical design and analysis of causal mechanisms: 1 Show that the sequential ignorability assumption is required to identify mechanisms even in experiments 2 Offer a flexible estimation strategy under this assumption 3 Propose a sensitivity analysis to probe this assumption 4 Propose new experimental designs that do not rely on sequential ignorability 5 Extend these methods to observational studies Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 3 / 31 Project Reference Project Website: http://imai.princeton.edu/projects/mechanisms.html Papers: Unpacking the Black Box: Learning about Causal Mechanisms from Experimental and Observational Studies. Identification, Inference, and Sensitivity Analysis for Causal Mediation Effects. Statistical Science A General Approach to Causal Mediation Analysis. Psychological Methods Experimental Identification of Causal Mechanisms. Causal Mediation Analysis Using R. Advances in Social Science Research Using R Software: R package mediation implements all methods Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 4 / 31

What Is a Causal Mechanism? Mechanisms as alternative causal pathways Causal mediation analysis Mediator, M Treatment, T Outcome, Y Quantities of interest: Direct and indirect effects Fast growing methodological literature Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 5 / 31 Two American Politics Examples 1 Media priming Media cues referencing ethnic groups effectively affect attitudes towards immigration policy Brader, Valentino, and Suhay (2008, AJPS): Anxiety transmits the effect of cues on attitudes Experimental study with randomized treatment 2 Incumbency advantage 1 Imcumbency advantage has been positive and growing 2 Cox and Katz (1996, AJPS): incumbents deter high-quality challengers from entering the race 3 Observational study with non-random treatment Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 6 / 31

Potential Outcomes Framework Framework: Potential outcomes model of causal inference Binary treatment: T i {0, 1} Mediator: M i M Outcome: Y i Y Observed pre-treatment covariates: X i X Potential mediators: M i (t), where M i = M i (T i ) observed Potential outcomes: Y i (t, m), where Y i = Y i (T i, M i (T i )) observed In a standard experiment, only one potential outcome can be observed for each i Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 7 / 31 Causal Mediation Effects Total causal effect: τ i Y i (1, M i (1)) Y i (0, M i (0)) Causal mediation (Indirect) effects: δ i (t) Y i (t, M i (1)) Y i (t, M i (0)) Causal effect of the change in M i on Y i that would be induced by treatment Change the mediator from M i (0) to M i (1) while holding the treatment constant at t Represents the mechanism through M i Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 8 / 31

Total Effect = Indirect Effect + Direct Effect Direct effects: ζ i (t) Y i (1, M i (t)) Y i (0, M i (t)) Causal effect of T i on Y i, holding mediator constant at its potential value that would realize when T i = t Change the treatment from 0 to 1 while holding the mediator constant at M i (t) Represents all mechanisms other than through M i Total effect = mediation (indirect) effect + direct effect: τ i = δ i (t) + ζ i (1 t) = 1 2 {δ i(0) + δ i (1) + ζ i (0) + ζ i (1)} Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 9 / 31 What Does the Observed Data Tell Us? Quantity of Interest: Average causal mediation effects δ(t) E(δ i (t)) = E{Y i (t, M i (1)) Y i (t, M i (0))} Average direct effects ( ζ(t)) are defined similarly Problem: Y i (t, M i (t)) is observed but Y i (t, M i (t )) can never be observed We have an identification problem = Need additional assumptions to make progress Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 10 / 31

Identification under Sequential Ignorability Proposed identification assumption: Sequential Ignorability {Y i (t, m), M i (t)} T i X i = x (1) Y i (t, m) M i (t) T i = t, X i = x (2) (1) is guaranteed to hold in a standard experiment (2) does not hold unless X i includes all confounders Theorem: Under sequential ignorability, ACME and average direct effects are nonparametrically identified (= consistently estimated from observed data) Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 11 / 31 Nonparametric Identification Theorem: Under SI, both ACME and average direct effects are given by, ACME δ(t) E(Y i M i, T i = t, X i ) {dp(m i T i = 1, X i ) dp(m i T i = 0, X i )} dp(x i ) Average direct effects ζ(t) {E(Y i M i, T i = 1, X i ) E(Y i M i, T i = 0, X i )} dp(m i T i = t, X i ) dp(x i ) Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 12 / 31

Traditional Estimation Method Linear structural equation model (LSEM): together implying M i = α 2 + β 2 T i + ξ 2 X i + ɛ i2, Y i = α 3 + β 3 T i + γm i + ξ 3 X i + ɛ i3. Y i = α 1 + β 1 T i + ɛ i1 Fit two least squares regressions separately Use product of coefficients ( ˆβ 2ˆγ) to estimate ACME Use asymptotic variance to test significance (Sobel test) Under SI and the no-interaction assumption ( δ(1) δ(0)), ˆβ 2ˆγ consistently estimates ACME Can be extended to LSEM with interaction terms Problem: Only valid for the simplest LSEM Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 13 / 31 Proposed General Estimation Algorithm 1 Model outcome and mediator Outcome model: p(y i T i, M i, X i ) Mediator model: p(m i T i, X i ) These models can be of any form (linear or nonlinear, semi- or nonparametric, with or without interactions) 2 Predict mediator for both treatment values (M i (1), M i (0)) 3 Predict outcome by first setting T i = 1 and M i = M i (0), and then T i = 1 and M i = M i (1) 4 Compute the average difference between two outcomes to obtain a consistent estimate of ACME 5 Monte Carlo simulation or bootstrap to estimate uncertainty Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 14 / 31

Reanalysis of Brader et al. (2008) ACME = Average difference in immigration attitudes due to the change in anxiety induced by the media cue treatment Sequential ignorability = No unobserved covariate affecting both anxiety and immigration attitudes Original method: Product of coefficients with the Sobel test Valid only when both models are linear w/o T M interaction (which they are not) Our method: Calculate ACME using our general algorithm Product of Coefficients Average Causal Mediation Effect Decrease Immigration Request Anti-Immigration Info Send Anti-Immigration Message.399.089 [0.066,.732] [0.023,.178].295.049 [0.023, 0.567] [0.007, 0.121].303.105 [0.046,.561] [0.021, 0.191] Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 15 / 31 Reanalysis of Cox and Katz (1996) Democratic Incumbents Republican Incumbents Effect of Incumbency on Incumbent % of Two Party Vote 5 0 5 10 15 20 Average Mediation Effect: δ(0) Average Total Effect: τ 5 0 5 10 15 20 1956 1960 1966 1970 1976 1980 1986 1990 1958 1964 1968 1974 1978 1984 1988 Original findings: Year 1956 1960 1966 1970 1976 1980 1986 1990 1958 1964 1968 1974 1978 1984 1988 1 Incumbency advantage has increased over time 2 This increase is attributed to increase in scare-off/quality effect Our findings: Increasing incumbency advantage may be attributable to mechanisms other than the scare-off/quality effect Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 16 / 31 Year

Need for Sensitivity Analysis Standard experiments require sequential ignorability to identify mechanisms The sequential ignorability assumption is often too strong Need to assess the robustness of findings via sensitivity analysis Question: How large a departure from the key assumption must occur for the conclusions to no longer hold? Parametric sensitivity analysis by assuming {Y i (t, m), M i (t)} T i X i = x but not Y i (t, m) M i (t) T i = t, X i = x Possible existence of unobserved pre-treatment confounder Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 17 / 31 Parametric Sensitivity Analysis Sensitivity parameter: ρ Corr(ɛ i2, ɛ i3 ) Sequential ignorability implies ρ = 0 Set ρ to different values and see how ACME changes Interpreting ρ: how small is too small? An unobserved (pre-treatment) confounder formulation: ɛ i2 = λ 2 U i + ɛ i2 and ɛ i3 = λ 3 U i + ɛ i3 How much does U i have to explain for our results to go away? Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 18 / 31

Sensitivity Analysis of Brader et al. (2008) I Average Mediation Effect: δ(t) 0.4 0.2 0.0 0.2 0.4 1.0 0.5 0.0 0.5 1.0 Sensitivity Parameter: ρ ACME > 0 as long as the error correlation is less than 0.39 (0.30 with 95% CI) Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 19 / 31 Sensitivity Analysis of Brader et al. (2008) II Proportion of Total Variance in Y Explained by Confounder 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.15 0.1 0.05 0 0.05 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Proportion of Total Variance in M Explained by Confounder An unobserved confounder can account for up to 26.5% of the variation in both Y i and M i before ACME becomes zero Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 20 / 31

Sensitivity Analysis of Cox and Katz (1996) 1976 1980 Average Causal Mediation Effect: δ 5 0 5 10 5 0 5 10 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 Sensitivity Parameter: ρ 1976 1980 Proportion of Total Variance in Y Explained by Confounder 0.0 0.1 0.2 0.3 0.4 0.5 0.6 1 0.5 0.5 0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0 1 3 2 1 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Proportion of Total Variance in M Explained by Confounder Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 21 / 31 Beyond Sequential Ignorability Without sequential ignorability, standard experimental design lacks identification power Even the sign of ACME is not identified Need to develop alternative experimental designs for more credible inference Possible when the mediator can be directly or indirectly manipulated Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 22 / 31

Parallel Design Must assume no direct effect of manipulation on outcome More informative than standard single experiment If we assume no T M interaction, ACME is point identified Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 23 / 31 Example from Behavioral Neuroscience Why study brain?: Social scientists search for causal mechanisms underlying human behavior Psychologists, economists, and even political scientists Question: What mechanism links low offers in an ultimatum game with irrational" rejections? A brain region known to be related to fairness becomes more active when unfair offer received (single experiment design) Design solution: manipulate mechanisms with TMS Knoch et al. use TMS to manipulate turn off one of these regions, and then observes choices (parallel design) Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 24 / 31

Limitations Difference between manipulation and mechanism Prop. M i (1) M i (0) Y i (t, 1) Y i (t, 0) δ i (t) 0.3 1 0 0 1 1 0.3 0 0 1 0 0 0.1 0 1 0 1 1 0.3 1 1 1 0 0 Here, E(M i (1) M i (0)) = E(Y i (t, 1) Y i (t, 0)) = 0.2, but δ(t) = 0.2 Limitations: Direct manipulation of the mediator is often impossible Even if possible, manipulation can directly affect outcome Need to allow for subtle and indirect manipulations Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 25 / 31 Encouragement Design Randomly encourage subjects to take particular values of the mediator M i Standard instrumental variable assumptions (Angrist et al.) Use a 2 3 factorial design: 1 Randomly assign T i 2 Also randomly decide whether to positively encourage, negatively encourage, or do nothing 3 Measure mediator and outcome Informative inference about the complier ACME Reduces to the parallel design if encouragement is perfect Possible application to the immigration experiment: Use autobiographical writing tasks to encourage anxiety Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 26 / 31

Crossover Design Recall ACME can be identified if we observe Y i (t, M i (t)) Get M i (t), then switch T i to t while holding M i = M i (t) Crossover design: 1 Round 1: Conduct a standard experiment 2 Round 2: Change the treatment to the opposite status but fix the mediator to the value observed in the first round Very powerful identifies mediation effects for each subject Must assume no carryover effect: Round 1 must not affect Round 2 Can be made plausible by design Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 27 / 31 Example from Labor Economics Bertrand & Mullainathan (2004, AER) Treatment: Black vs. White names on CVs Mediator: Perceived qualifications of applicants Outcome: Callback from employers Quantity of interest: Direct effects of (perceived) race Would Jamal get a callback if his name were Greg but his qualifications stayed the same? Round 1: Send Jamal s actual CV and record the outcome Round 2: Send his CV as Greg and record the outcome No carryover effect: send two CVs to randomly selected, different potential employers Assumption: perceived qualifications don t depend on applicant s race Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 28 / 31

Designing Observational Studies Key difference between experimental and observational studies: treatment assignment Sequential ignorability: 1 Ignorability of treatment given covariates 2 Ignorability of mediator given treatment and covariates Both (1) and (2) are suspect in observational studies Statistical control: matching, regressions, etc. Search for quasi-randomized treatments: natural experiments How can we design observational studies? Experiments can serve as templates for observational studies Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 29 / 31 Example from Imcumbency Advantage Use of cross-over design (Levitt and Wolfram) 1 1st Round: two non-incumbents in an open seat 2 2nd Round: same candidates with one being an incumbent Assume challenger quality (mediator) stays the same Estimation of direct effect is possible Redistricting as natural experiments (Ansolabehere et al.) 1 1st Round : incumbent in the old part of the district 2 2nd Round : incumbent in the new part of the district Challenger quality is the same but treatment is different Estimation of direct effect is possible Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 30 / 31

Concluding Remarks Even in a randomized experiment, a strong assumption is needed to identify causal mechanisms However, progress can be made toward this fundamental goal of scientific research with modern statistical tools A general, flexible estimation method is available once we assume sequential ignorability Sequential ignorability can be probed via sensitivity analysis More credible inferences are possible using clever experimental designs Insights from new experimental designs can be directly applied when designing observational studies Kosuke Imai (Princeton) Causal Mechanisms Chicago (January 2011) 31 / 31