Causal inference in epidemiological practice

Similar documents
Generating survival data for fitting marginal structural Cox models using Stata Stata Conference in San Diego, California

Journal of Biostatistics and Epidemiology

Comparative effectiveness of dynamic treatment regimes

Estimating direct effects in cohort and case-control studies

Casual Mediation Analysis

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Causal Effect Models for Realistic Individualized Treatment and Intention to Treat Rules

An Introduction to Causal Inference, with Extensions to Longitudinal Data

Estimation of direct causal effects.

Integrated approaches for analysis of cluster randomised trials

Simulating from Marginal Structural Models with Time-Dependent Confounding

Ignoring the matching variables in cohort studies - when is it valid, and why?

University of California, Berkeley

Mediation and Interaction Analysis

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

University of California, Berkeley

Causal Inference Basics

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures

Extending causal inferences from a randomized trial to a target population

Combining multiple observational data sources to estimate causal eects

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Marginal Structural Models and Causal Inference in Epidemiology

A comparison of 5 software implementations of mediation analysis

Estimating the Marginal Odds Ratio in Observational Studies

This paper revisits certain issues concerning differences

University of California, Berkeley

Marginal Structural Cox Model for Survival Data with Treatment-Confounder Feedback

Modern Statistical Learning Methods for Observational Biomedical Data. Chapter 2: Basic identification and estimation of an average treatment effect

Propensity Score Methods, Models and Adjustment

Mediation Analysis for Health Disparities Research

Confounding, mediation and colliding

A Sampling of IMPACT Research:

University of California, Berkeley

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula.

Targeted Maximum Likelihood Estimation in Safety Analysis

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

University of California, Berkeley

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

MARGINAL STRUCTURAL COX MODELS WITH CASE-COHORT SAMPLING. Hana Lee

Effects of multiple interventions

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

A Decision Theoretic Approach to Causality

Estimating and contextualizing the attenuation of odds ratios due to non-collapsibility

Unbiased estimation of exposure odds ratios in complete records logistic regression

Diagnosing and responding to violations in the positivity assumption.

Sensitivity analysis and distributional assumptions

University of California, Berkeley

arxiv: v1 [stat.me] 15 May 2011

Simple Sensitivity Analysis for Differential Measurement Error. By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A.

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Department of Biostatistics University of Copenhagen

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina

Supplementary Materials for Residual Balancing: A Method of Constructing Weights for Marginal Structural Models

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Instrumental variables estimation in the Cox Proportional Hazard regression model

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele

Estimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian

Causal Inference with Measurement Error

Effect Modification and Interaction

University of California, Berkeley

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Statistical Methods for Causal Mediation Analysis

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution

Bayesian Inference for Causal Mediation Effects Using Principal Stratification with Dichotomous Mediators and Outcomes

CAUSAL MEDIATION ANALYSIS FOR NON-LINEAR MODELS WEI WANG. for the degree of Doctor of Philosophy. Thesis Adviser: Dr. Jeffrey M.

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

University of California, Berkeley

Observational Studies 4 (2018) Submitted 12/17; Published 6/18

Harvard University. Harvard University Biostatistics Working Paper Series. Sheng-Hsuan Lin Jessica G. Young Roger Logan

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Longitudinal analysis of ordinal data

Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations)

Robust Estimation of Inverse Probability Weights for Marginal Structural Models

Mediation Analysis: A Practitioner s Guide

Targeted Maximum Likelihood Estimation of the Parameter of a Marginal Structural Model

Causal Inference. Prediction and causation are very different. Typical questions are:

I. Partial identification of causal parameters with multiple proposed instruments

arxiv: v5 [stat.me] 13 Feb 2018

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

Targeted Learning with Daily EHR Data

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal modelling in Medical Research

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

University of California, Berkeley

Technical Track Session I: Causal Inference

arxiv: v2 [math.st] 4 Mar 2013

The distinction between a biologic interaction or synergism

More Statistics tutorial at Logistic Regression and the new:

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters

Four types of e ect modi cation - a classi cation based on directed acyclic graphs

Propensity Score Methods for Causal Inference

Statistics in medicine

Transcription:

Causal inference in epidemiological practice Willem van der Wal Biostatistics, Julius Center UMC Utrecht June 5, 2

Overview Introduction to causal inference Marginal causal effects Estimating marginal causal effects More detail: inverse probability weighting (IPW) Efficiency & small sample bias Possible solution

Causal inference in epidemiological practice

Counterfactual histories: individual level Dichotomous treatment outcome for individual i when not treated: Y i,a= outcome for individual i when treated: Y i,a= Treatment with varying dosage outcome for individual i with dosis : Y i,a= outcome for individual i with dosis 2: Y i,a=2 outcome for individual i with dosis 5: Y i,a=5 outcome for individual i with dosis : Y i,a=

Counterfactual histories: population level Dichotomous treatment distribution of outcomes, nobody treated: P(Y a= ) distribution of outcomes, everybody treated: P(Y a= ) Treatment with varying dosage distribution of outcomes, everybody dosis : P(Y a= ) distribution of outcomes, everybody dosis 2: P(Y a=2 ) distribution of outcomes, everybody dosis 5: P(Y a=5 ) distribution of outcomes, everybody dosis : P(Y a= )

Marginal Structural Models Potential outcomes distribution: P(Y a ) Marginal Structural Models Model the population distribution of potential outcomes Examples dichotomous treatment logitp(y a = ) = β + β a exp(β ) is causal odds ratio P(Y a==)/( P(Y a= =)) P(Y a= =)/( P(Y a= =)) continous treatment E(Y a ) = β + β a β is causal mean difference E(Y a= ) E(Y a= )

Conditional Effects Causal questions What is on average the difference in outcome when patients are treated compared to when patients are not treated? Often too unspecific More specific causal questions: For patients with poor health, what is on average the difference in outcome between treating and not treating? And for patients with good health, what is on average the difference in outcome between treating and not treating? P(Y a V ) e.g. logitp(y a = V ) = β + β a + β 2 V + β 3 av

Trials Randomized experiments Example: randomized treatment (placebo/treatment) Both treatment arms are comparable Both treatment arms are comparable to the whole sample P(Y a= ) = P(Y A = ) P(Y a= ) = P(Y A = ) Therefore, we can give a causal interpretation to the treatment effect estimated from this sample e.g. P(Y = A = ) P(Y = A = ) = P(Y a= = ) P(Y a= = )

Observational studies Confounding L Y β r A Treatment arms are not comparable (could also happen by chance) r is not an estimate of β

Adjusting for confounding: conditioning Conditioning Stratifying Including confounders in regression model

Adjusting for confounding: conditioning Conditioning Stratifying Including confounders in regression model

Adjusting for confounding: conditioning Conditioning Stratifying Including confounders in regression model

Adjusting for confounding: conditioning Conditioning Stratifying Including confounders in regression model Estimate effect within each stratum Pool estimates together

Conditioning, caveat : Adjusting away the effect Time dependent confounder and time dependent treatment Confounder also intermediary for treatment Conditioning: indirect effect is adjusted away L t A t Y t L t A t Y t widely known in Epidemiology, see e.g. Rothman (22)

Conditioning, caveat 2: Non-collapsibility Simulation: confounder L, exposure A and outcome Y all dichotomous logitp(l = ) = logitp(a = ) = + 2L logitp(y = ) = + A + βl True conditional causal OR: e True marginal causal OR: logitp[y a = L] = + a + βl = f (a, L, β) P[Y a = ] = l {expitf (a, l, β)p(l = l)} = g(a, β) true marginal cor = g(, β)/( g(, β)) g(, β)/( g(, β))

Conditioning, caveat 2: Non-collapsibility True marginal causal OR 3. True conditional causal OR 2.5 2..5 5 5 β

Conditioning, caveat 3: Berkson s bias Berkson s bias A L A Y U Condition on L An association between A and U is introduced Therefore, an association between A and Y is introduced see e.g. Hernán e.a. (24)

Adjusting for confounding: IPW Inverse probability weighting (IPW) P(Exposure = Blue) = /3 weight by (/3) = 3 3 2 = 6 P(Exposure = Blue) = 2/3 weight by (2/3) =.5.5 4 = 6 P(Exposure = Red) = 2/3 weight by (2/3) =.5.5 4 = 6 P(Exposure = Red) = /3 weight by (/3) = 3 3 2 = 6

Adjusting for confounding: IPW Result of IPW Take into account dependency within patients

IPW, formally Weight each observation i by Stabilized weight: Continuous exposure: w i = sw i = P(A i = a i L i = l i ) P(A i = a i ) P(A i = a i L i = l i ) sw i = f (a i) f (a i l i )

HIV example Effect of active tuberculosis on mortality in HIV patients (n = 276) Active tuberculosis (TB) status: A(t) changes from to Outcome: mortality Confounder CD4 count: L(t)

HIV example: inverse probability weighting Observations: {A ij, L ij, Y ij } Weight observations by: sw ij = j k= P[A ik = a ik Āik = ā ik ] P[A ik = a ik Ā ik = ā ik, L ik = l ik ] () Elements in numerator and denominator of () estimated using regression models Fit MSM on weighted observations: λ Tā = λ (t)e θa(t)

Adjusting for confounding: G-computation G-computation RR = 3/6 4/6 =.75 f (y a ) = f (y a L = l)f (l)dl l

Adjusting for confounding: G-computation G-computation O O O O O O O RR = 3/6 4/6 =.75 f (y a ) = f (y a L = l)f (l)dl l

Adjusting for confounding: G-computation G-computation O O O O O O O RR = 3/6 4/6 =.75 f (y a ) = l f (y a L = l)f (l)dl

Adjusting for confounding: G-computation G-computation O O O O O O O O O O O O O O O O O O O O O O RR = 3/6 4/6 =.75 f (y a ) = l f (y a L = l)f (l)dl

Adjusting for confounding: G-computation G-computation O O O O O O O O O O O O O O O RR = 6/2 9/2 =.67 f (y a ) = f (y a L = l)f (l)dl l

Adjusting for confounding: G-computation G-computation O O O O O O O O O O O O O O O RR = 6/2 9/2 =.67 f (y a ) = f (y a, L = l)f (l)dl l

Van der Wal e.a. (29)

MSMs: other methods Doubly robust estimation Targeted maximum likelihood estimation see Van der Laan (2)

Assumptions No unmeasured confounding Consistency subjects counterfactual outcome given his observed exposure level is precisely his or her observed outcome Positivity every exposure level has a positive probability of being allocated in every stratum defined by confounders (IPW especially sensitive) Correct model specification

Implementation SPSS weighting possible no robust standard errors bootstrap rather difficult SAS macro, binary exposure, point treatment: Jonsson e.a. (27) longitudinal IPW example: Hernán e.a. (2) STATA longitudinal IPW example: Fewell e.a. (24) R causalgam, binary exposure, point treatment cvdsa, point treatment, data adaptive algorithm http://www.stat.berkeley.edu/~laan/software/ tmlelite, binary point treatment, targeted maximum likelihood estimation http://www.stat.berkeley.edu/ ~laan/software/ ipw, point treatment and longitudinal, flexible models

IPW problems Inefficient compared to likelihood-based methods Small sample bias when strata defined by confounder have low response probabilities

Simulation study Continuous confounder L N (, 5), 2 Dichotomous exposure logitp(a = ) = + L, 3 Continuous outcome Y = A +.5L + N (, 5).

Density plots (sample of n=) A= A= 5 5 5 2 25 L Difficult to reweight

Models Uncorrected: 2 Conditional: E(Y A) = β + β A 3 IPW MSM: E(Y A, L) = β + β A + β 2 L E(Y a ) = β + β a observations weighted by sw i = P(A i =a i ) P(A i =a i L i =l i )

n=, 5 runs uncorrected conditional IPW 6 8 2 4 6 Parameter estimates IPW: correct model specification!

n=, 5 runs uncorrected conditional IPW IPW (n=) 6 8 2 4 6 Parameter estimates IPW: correct model specification!

Iterative IPW algorithm Compute sw i, = 2 For k =,..., k max repeat the following steps { In the weighted dataset, estimate new inverse probability weights sw i,k 2 Convergence is reached when VAR(ln( sw i,k )) c STOP. 3 Update the inverse probability weights by multiplying sw i,k = sw i,k sw i,k. 4 Truncate sw i,k to percentiles p trunc % and p trunc %. 5 Center the weights at mean of : sw i,k = sw i,k /MEAN(sw i,k ). } Use weights sw i,k obtained after convergence (or when k max was reached).

n=, 5 runs uncorrected conditional IIPW % IIPW 5% IIPW % IIPW % IPW 6 8 2 4 6 Parameter estimates k max = 5, c STOP = 7

Simulation results (5 runs) Condition/Method Estimate 95% C.I. IIPW Iterations Mean bias RMSE Coverage Average Median Converged length Sample size Uncorrected 3.75 3.76..3 Conditional..48.95.88 IPW.89.92.58 3.93 IIPW, p trunc = % -.3.59.88 4.89 4 99.9% IIPW, p trunc =.%..2.93 4.47 5 99.98% IIPW, p trunc = %..83.95 3.27 6 % IIPW, p trunc = 5%..67.95 2.63 9 % IIPW, p trunc = %..63.95 2.47 4 % IIPW, p trunc = 2% -..87.94 2.95 68 99.82%

Discussion Is the resulting IIPW estimator consistent? Implementation for longitudinal data Robustness against misspecification Or maybe we should all just use TMLE

Literature Cole SR, Hernán MA (28). Constructing inverse probability weights for marginal structural models. American Journal of Epidemiology, pp. 68(6), 656 664. Fewell Z, Hernán MA, Wolfe F, Tilling K, Choi H, Sterne JAC (24). Controlling for time-dependent confounding using marginal structural models. The Stata Journal, pp. 4(4), 42 42. Hernán MA (24). A definition of causal effect for epidemiological research. Journal of Epidemiology & Community Health, pp. 58, 265 27. Hernán MA, Brumback BA, Robins JM (2). Marginal structural models to estimate the causal effect of Zidovudine on the survival of HIV-positive men. Epidemiology, pp. (5), 56 57. Hernán MA, Hernández-Díaz S, Robins JM (24). A structural approach to selection bias. Epidemiology, pp. 5(5), 56 57. Jonsson Funk M, Westreich D, Davidian M, Weisen C (27). Introducing a SAS macro for doubly robust estimation. SAS Global Forum 27, paper 89 27. Pearl J (2). An introduction to causal inference. The International Journal of Biostatistics, pp. 6(2), Article 7. Petersen ML, Deeks SG, Martin JN, Van der Laan MJ (27). History-adjusted marginal structural models for estimating time-varying effect modifcation. American Journal of Epidemiology, pp. 66(9), 985 993. Rothman KJ (22). Epidemiology: An Introduction. New York: Oxford University Press. Van der Laan, MJ (2). Targeted maximum likelihood based causal inference: Part I. The International Journal of Biostatistics, pp. 6(2), Article 2. Van der Wal WM, Prins M, Lumbreras B, Geskus RB (29). A simple G-computation algorithm to quantify the causal effect of a secondary illness on the progression of a chronic disease. Statistics in Medicine, pp. 28(8), 2325 2337.