Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE

Similar documents
Causal mediation analysis: Definition of effects and common identification assumptions

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

The Causal Mediation Formula A Guide to the Assessment of Pathways and Mechanisms

Casual Mediation Analysis

Causal Mechanisms Short Course Part II:

The Causal Mediation Formula A Guide to the Assessment of Pathways and Mechanisms

Causal Mechanisms and Process Tracing

Path analysis for discrete variables: The role of education in social mobility

Mediation for the 21st Century

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula.

Statistical Analysis of Causal Mechanisms

13.1 Causal effects with continuous mediator and. predictors in their equations. The definitions for the direct, total indirect,

Statistical Methods for Causal Mediation Analysis

Identifying Linear Causal Effects

New developments in structural equation modeling

SEM REX B KLINE CONCORDIA D. MODERATION, MEDIATION

Modern Mediation Analysis Methods in the Social Sciences

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (

arxiv: v2 [math.st] 4 Mar 2013

Modeling Mediation: Causes, Markers, and Mechanisms

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

The Mediation Formula: A guide to the assessment of causal pathways in nonlinear models

Introduction to Causal Calculus

Estimating direct effects in cohort and case-control studies

UCLA Department of Statistics Papers

Sensitivity analysis and distributional assumptions

Outline

Statistical Analysis of Causal Mechanisms for Randomized Experiments

Causal Inference for Mediation Effects

Estimation of direct causal effects.

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables

A Linear Microscope for Interventions and Counterfactuals

CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING

Discussion of Papers on the Extensions of Propensity Score

A Linear Microscope for Interventions and Counterfactuals

Help! Statistics! Mediation Analysis

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (

Mediation and Interaction Analysis

The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA

Estimating complex causal effects from incomplete observational data

Conceptual overview: Techniques for establishing causal pathways in programs and policies

Identification and Estimation of Causal Effects from Dependent Data

Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Identification and Inference in Causal Mediation Analysis

Non-parametric Mediation Analysis for direct effect with categorial outcomes

Statistical Analysis of Causal Mechanisms

Causal Inference Using Nonnormality Yutaka Kano and Shohei Shimizu 1

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (

From Causality, Second edition, Contents

Ratio-of-Mediator-Probability Weighting for Causal Mediation Analysis. in the Presence of Treatment-by-Mediator Interaction

An Introduction to Causal Analysis on Observational Data using Propensity Scores

The Markov equivalence class of the mediation model. Felix Thoemmes, Wang Liao, & Marina Yamasaki Cornell University

Causal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms

arxiv: v1 [math.st] 28 Nov 2017

CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS

Identification of Conditional Interventional Distributions

PANEL ON MEDIATION AND OTHER MIRACLES OF THE 21 ST CENTURY

Flexible Mediation Analysis in the Presence of Nonlinear Relations: Beyond the Mediation Formula

New developments in structural equation modeling

The Mathematics of Causal Inference

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

The Effects of Interventions

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Methods for Integrating Moderation and Mediation: A General Analytical Framework Using Moderated Path Analysis

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele

The International Journal of Biostatistics

Statistical Models for Causal Analysis

Introduction. Consider a variable X that is assumed to affect another variable Y. The variable X is called the causal variable and the

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015

Identification, Inference, and Sensitivity Analysis for Causal Mediation Effects

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

arxiv: v1 [stat.me] 1 Nov 2018

CAUSAL MEDIATION ANALYSIS FOR NON-LINEAR MODELS WEI WANG. for the degree of Doctor of Philosophy. Thesis Adviser: Dr. Jeffrey M.

A Decision Theoretic Approach to Causality

Identification and Sensitivity Analysis for Multiple Causal Mechanisms: Revisiting Evidence from Framing Experiments

Statistical Analysis of Causal Mechanisms

A Distinction between Causal Effects in Structural and Rubin Causal Models

Estimating the Marginal Odds Ratio in Observational Studies

5.3 Graphs and Identifiability

On the Identification of a Class of Linear Models

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

Ignoring the matching variables in cohort studies - when is it valid, and why?

Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

External Validity and Transportability: A Formal Approach

Mediation Analysis: A Practitioner s Guide

Causal Mediation Analysis Short Course

University of California, Berkeley

Structure learning in human causal induction

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Natural direct and indirect effects on the exposed: effect decomposition under. weaker assumptions

Guanglei Hong. University of Chicago. Presentation at the 2012 Atlantic Causal Inference Conference

Causal mediation analysis in the context of clinical research

Front-Door Adjustment

Transcription:

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE insert p. 90: in graphical terms or plain causal language. The mediation problem of Section 6 illustrates how such symbiosis clarifies the definition and identification of direct and indirect effects, a task deemed insurmountable, deceptive and ill-defined by advocates of the structureless potential-outcome approach (Rubin, 2004, 2005). remove paragraph pp. 90-91 starting: In contrast, when the mediation problem is approached... including printed footnote 29. new footnote p. 99: The need to control for mediator-outcome confounders (e.g., W 2 in Figure 6(b)) was evidently overlooked in the classical paper of Baron and Kenny (1986), and has subsequently been ignored by most social science researchers. revised paragraph p. 100: Graphical identification conditions for multi-action expressions of the type E(Y do(x), do(z 1 ), do(z 2 ),..., do(z k )) in the presence of unmeasured confounders were derived by Pearl and Robins (1995) (see Pearl, 2000, ch. 4) using sequential application of the back-door conditions discussed in Section 3.2. new footnote p. 101 Pearl (2001) used the acronym NDE to denote the natural direct effect. We will delete the letter N from the acronyms of both the direct and indirect effect, and use DE and IE, respectively. best break for equation 48 p. 104: DE x,x (Y ) = P(w 1 )[E(Y x, z, w 1 )) E(Y x, z, w 1 ))] z w 1 P(z x, w 2 )P(w 2 ). (48) w 2

revised pp. 110 11 linear version of Figure 6(a) (equation 45), which reads x = u X z = b 0 + b x x + u Z (54) y = c 0 + c x x + c z z + u Y with u X, u Y, and u Z uncorrelated, zero-mean error terms. Computing the conditional expectation in (53) gives and yields E(Y x, z) = E(c 0 + c x x + c z z + u Y ) = c 0 + c x x + c z z IE x,x (Y ) = z (c x x + c z z)[p(z x ) P(z x)]. where b is the total effect coefficient, = c z [E(Z x ) E(Z x)] (55) = (x x)(c z b x ) (56) = (x x)(b c x ) (57) b = (E(Y x ) E(Y x))/(x x) = c x + c z b x. We thus obtained the standard expressions for indirect effects in linear systems, which can be estimated either as a difference in two regression coefficients (equation 57) or a product of two regression coefficients (equation 56), with Y regressed on both X and Z. (see MacKinnon, Lockwood, et al., 2007). These two strategies do not generalize to nonlinear systems as shown in Pearl (2010a); direct application of (53) is necessary. To understand the difficulty, consider adding an interaction term c xz xz to the model in equation (54), yielding y = c 0 + c x x + c z z + c xz xz + u Y Now assume that, through elaborate regression analysis, we obtain accurate estimates of all parameters in the model. It is still not clear what combinations of parameters measure the direct and indirect effects of X on Y, or, more specifically, how to assess the fraction of the total effect that is explained by mediation and the fraction that is owed to mediation. In linear

analysis, the former fraction is captured by the product c z b x /b (equation 56), the latter by the difference (b c x )/b (equation 57) and the two quantities coincide. In the presence of interaction, however, each fraction demands a separate analysis, as dictated by the Mediation Formula. To witness, substituting the nonlinear equation in (49), (52) and (53) and assuming x = 0 and x = 1, yields the following decomposition: DE = c x + b 0 c xz IE = b x c z TE = c x + b 0 c xz + b x (c z + c xz ) = DE + IE + b x c xz We therefore conclude that the fraction of output change for which mediation would be sufficient is IE/TE = b x c z /(c x + b 0 c xz + b x (c z + c xz )) while the fraction for which mediation would be necessary is 1 DE/TE = b x (c z + c xz )/(c x + b 0 c xz + b x (c z + c xz )) We note that, due to interaction, a direct effect can be sustained even when the parameter c x vanishes and, moreover, a total effect can be sustained even when both the direct and indirect effects vanish. This illustrates that estimating parameters in isolation tells us little about the effect of mediation and, more generally, mediation and moderation are intertwined and cannot be assessed separately. If the policy evaluated aims to prevent the outcome Y by ways of weakening the mediating pathways, the target of analysis should be the difference T E DE, which measures the highest prevention effect of any such policy. If, on the other hand, the policy aims to prevent the outcome by weakening the direct pathway, the target of analysis should shift to IE, for TE IE measures the highest preventive impact of this type of policies. The main power of the Mediation Formula shines in studies involving categorical variables, especially when we have no parametric model of the data generating process. To illustrate, consider the case where [...continue]

new footnote p. 105: Some authors (e.g., VanderWeele 2009), define the natural indirect effect as the difference TE DE. This renders the additive formula a tautology of definition, rather then a theorem, conditioned upon the anti-symmetry IE x,x (Y ) = IE x,x(y ). Violation of (52) will be demonstrated in the next section. corrected Figure 7 p. 112 (number of samples Number of Samples) Number of Samples X Z Y E(Y x, z) = g xz E(Z x) = h x n 1 0 0 0 n 2 n 2 0 0 1 n 3 0 1 0 n 4 n 3 +n 4 = g 01 n 4 0 1 1 n 5 1 0 0 n 6 n 6 1 0 1 n 7 1 1 0 n 8 n 7 +n 8 = g 11 n 8 1 1 1 n 1 +n 2 = g 00 n 3 +n 4 n 1 +n 2 +n 3 +n 4 = h 0 n 5 +n 6 = g 10 n 7 +n 8 n 5 +n 6 +n 7 +n 8 = h 1 Figure 7: Computing the Mediation Formula for the model in Figure 6(a), with X, Y, Z binary.

References Baron, R. and Kenny, D. (1986). The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology 51 1173 1182. MacKinnon, D., Lockwood, C., Brown, C., Wang, W. and Hoffman, J. (2007). The intermediate endpoint effect in logistic and probit regression. Clinical Trials 4 499 513. Pearl, J. (2000). Causality: Models, Reasoning, and Inference. Cambridge University Press, New York. 2nd edition, 2009. Pearl, J. and Robins, J. (1995). Probabilistic evaluation of sequential plans from causal models with hidden variables. In Uncertainty in Artificial Intelligence 11 (P. Besnard and S. Hanks, eds.). Morgan Kaufmann, San Francisco, 444 453. Rubin, D. (2004). Direct and indirect causal effects via potential outcomes. Scandinavian Journal of Statistics 31 161 170. Rubin, D. (2005). Causal inference using potential outcomes: Design, modeling, decisions. Journal of the American Statistical Association 100 322 331.