Comparative effectiveness of dynamic treatment regimes

Similar documents
Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Extending causal inferences from a randomized trial to a target population

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal inference in epidemiological practice

Casual Mediation Analysis

University of California, Berkeley

Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous

Combining multiple observational data sources to estimate causal eects

Estimation of direct causal effects.

Causal Effect Models for Realistic Individualized Treatment and Intention to Treat Rules

Instrumental variables estimation in the Cox Proportional Hazard regression model

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Effects of multiple interventions

Gov 2002: 13. Dynamic Causal Inference

Harvard University. Harvard University Biostatistics Working Paper Series

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Estimating Causal Effects of Organ Transplantation Treatment Regimes

Estimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Journal of Biostatistics and Epidemiology

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Lecture 7 Time-dependent Covariates in Cox Regression

Estimating direct effects in cohort and case-control studies

Discussion of Papers on the Extensions of Propensity Score

Longitudinal + Reliability = Joint Modeling

Simulating from Marginal Structural Models with Time-Dependent Confounding

Harvard University. Harvard University Biostatistics Working Paper Series. Sheng-Hsuan Lin Jessica G. Young Roger Logan

Extending the results of clinical trials using data from a target population

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Ignoring the matching variables in cohort studies - when is it valid, and why?

Propensity Score Methods, Models and Adjustment

Propensity Score Methods for Causal Inference

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

An Introduction to Causal Inference, with Extensions to Longitudinal Data

Marginal Structural Models and Causal Inference in Epidemiology

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Studies. Frank E Harrell Jr. NIH AHRQ Methodologic Challenges in CER 2 December 2010

Estimating Optimal Dynamic Treatment Regimes from Clustered Data

Estimating the treatment effect on the treated under time-dependent confounding in an application to the Swiss HIV Cohort Study

Marginal Structural Cox Model for Survival Data with Treatment-Confounder Feedback

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Targeted Maximum Likelihood Estimation in Safety Analysis

Core Courses for Students Who Enrolled Prior to Fall 2018

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

A Sampling of IMPACT Research:

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

Selection on Observables: Propensity Score Matching.

PSC 504: Dynamic Causal Inference

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

Distributed analysis in multi-center studies

Survival Analysis for Case-Cohort Studies

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

University of California, Berkeley

Sensitivity analysis and distributional assumptions

Q learning. A data analysis method for constructing adaptive interventions

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causal Inference Basics

Propensity Score Weighting with Multilevel Data

Modern Statistical Learning Methods for Observational Biomedical Data. Chapter 2: Basic identification and estimation of an average treatment effect

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

eappendix: Description of mgformula SAS macro for parametric mediational g-formula

Semiparametric Regression and Machine Learning Methods for Estimating Optimal Dynamic Treatment Regimes

ECO Class 6 Nonparametric Econometrics

The Use of Spatial Exposure Predictions in Health Effects Models: An Application to PM Epidemiology

Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Propensity score modelling in observational studies using dimension reduction methods

Technical Track Session I: Causal Inference

Gov 2002: 4. Observational Studies and Confounding

University of California, Berkeley

STAT331. Cox s Proportional Hazards Model

Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

arxiv: v2 [math.st] 4 Mar 2013

Bounding the Probability of Causation in Mediation Analysis

E(Y ij b i ) = f(x ijβ i ), (13.1) β i = A i β + B i b i. (13.2)

Measurement Error in Spatial Modeling of Environmental Exposures

Causal Inference with Big Data Sets

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

Dynamic Prediction of Disease Progression Using Longitudinal Biomarker Data

arxiv: v1 [stat.me] 15 May 2011

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Multistate models and recurrent event models

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small

Sampling bias in logistic models

Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

Transcription:

Comparative effectiveness of dynamic treatment regimes An application of the parametric g- formula Miguel Hernán Departments of Epidemiology and Biostatistics Harvard School of Public Health www.hsph.harvard.edu/causal

What is comparative effectiveness research (CER)? The generation and synthesis of evidence that compares the benefits and harms of alternative methods to prevent, diagnose, treat, and monitor a clinical condition or to improve the delivery of care IOM 2009 Comparative effectiveness or comparative safety 8 June 2012 G-formula for CER 2

What is the purpose of CER? To assist consumers, clinicians, purchasers, and policy makers to make informed decisions that will improve health care at both the individual and population levels IOM 2009 To help decision-makers 8 June 2012 G-formula for CER 3

CER is about causal inference The generation and synthesis of evidence that compares the benefits and harms to make informed decisions that will improve health In other words, CER is about comparing the effects of well-defined interventions or strategies to answer policy, public health, and clinical questions 8 June 2012 G-formula for CER 4

3 requirements for CER 1. A relevant, well-defined question A key prerequisite 2. High-quality data Appropriate for the question above 3. Valid statistical methods To be applied to the high-quality data to answer the well-defined question 8 June 2012 G-formula for CER 5

CER questions are becoming increasingly complex En epidemiology, clinical medicine, comparative effectiveness research Often not about the average treatment effect of a non-timevarying treatment Often involving dynamic strategies or regimes i.e., treatment strategies that depend on the individual s evolving covariates 8 June 2012 G-formula for CER 6

Example: antiretroviral therapy (ART) for HIV-infected patients RCTs have been successful in showing that assignment to combined ART is best However, key questions still unanswered what to start when to start when to switch when to monitor Dozens of relevant strategies with potentially very wide variation in effectiveness 8 June 2012 G-formula for CER 7

Examples of complex questions involving dynamic regimes When to start treatment? What would be the 5-year mortality risk when cart is initiated within 6 months of either an AIDS diagnosis or CD4 cell count first dropping below 500 cells/mm 3 Below 350? Within 3 months? Within 6 months of CD4 first dropping below 500 and viral load above 1000? 8 June 2012 G-formula for CER 8

Examples of complex questions involving dynamic regimes When to change treatment? What would be the 5-year mortality risk when the cart regime is switched within 6 months of immunologic failure? What if we wait until viral load is greater than 1000? Until CD4<350? When to monitor treated patients? Should we measure viral load every 3, 6, 12 months? Before/After treatment? 8 June 2012 G-formula for CER 9

CER questions are complex because clinical decisions are Classical RCTs, however, provide simple information Typically an intention-to-treat analysis will say that assignment to A is better than assignment to B That s fine for one-time treatments e.g., surgery, a one-dose vaccination, CABG But what about long-term clinical strategies? e.g., therapeutics for chronic diseases with no cure (diabetes, HIV disease, chronic kidney disease, cardiovascular disease ) 8 June 2012 G-formula for CER 10

One option: RCTs Design RCTs that compare the effectiveness of explicitly defined clinical strategies on clinically relevant outcomes over a clinically relevant (i.e., long) period An example of strategy would be start treatment A within 3 months of the CD4 cell count first dropping below 500 or of an AIDS diagnosis, whichever occurs first; switch to treatment B within 3 months of the viral load going over 1000 or immediately if toxicity occurs, unless the patient is pregnant or has a liver disorder, in which case switch to treatment C; if C fails then 8 June 2012 G-formula for CER 11

CER and personalized medicine Complex clinical strategies need to be compared if we get serious about comparative effectiveness research for individualized treatment decisions But once we get this serious, RCTs suddenly look less attractive 8 June 2012 G-formula for CER 12

Less attractive RCTs Adherence to complex strategies over a long period may be low intention-to-treat analysis a black box if common deviations from protocol as treated analyses: time-varying confounding like in observational studies Long follow-up drop-out an intention-to-treat analysis not even possible selection bias like in observational studies For a discussion on RCTs for CER, see Hernán and Hernández-Díaz. Clinical Trials 2012. 8 June 2012 G-formula for CER 13

The next best option Use observational data to emulate an RCT Need prospective collection of data on time-varying data on treatments, outcomes, and confounders Not all data sources include this information Therefore not all data sources can be used 8 June 2012 G-formula for CER 14

Enter observational registries and electronic medical records These data sources are our best chance to compare effectiveness of clinical strategies in a timely way in unselected populations They include prospective data on timevarying treatments, outcomes, and confounders RCTs do not typically collect this information They can be used to emulate trials without baseline randomization 8 June 2012 G-formula for CER 15

Observational data for CER are becoming increasingly complex Increasingly high-dimensional Longitudinal studies with timevarying treatments and confounders Measurement of treatment and confounders may occur at subjectspecific (random) times 8 June 2012 G-formula for CER 16

Examples of complex longitudinal data Large populations followed for long periods with frequent measurement of treatments, outcomes, and confounders At prespecified intervals: e.g., the Nurses Health Study, MACS/WIHS At random intervals: e.g., clinical cohorts like the HIV-CAUSAL Collaboration, USRDS claims, electronic medical records 8 June 2012 G-formula for CER 17

Many methods originally developed for less complex questions Conventional regression Bias if confounders affected by previous treatment Propensity score matching For time-varying treatments? Instrumental variable estimation Who are the compliers when treatment is given every month? And the never takers? Principal stratification How are principal strata defined? 8 June 2012 G-formula for CER 18

G-methods Robins, 1986 and onwards G stands for Generalized causal comparisons Including the contrast of dynamic regimes in the presence of time-varying confounders G-formula (Robins 1986) g-estimation of structural nested models (Robins 1989) inverse probability weighting of marginal structural models (Robins 1998) 8 June 2012 G-formula for CER 19

Aside: Time-varying confounders affected by exposure A t : Antiretroviral therapy (0: no, 1: yes) at time t Y: Viral load (1 if detectable, 0: otherwise) L: CD4 count (0: high, 1: low) U: True immunosupression level (Unknown to data analyst: No effect of A t on Y) 8 June 2012 G-formula for CER 20

Aside: Stratification to compute the causal effect of A Is the conditional risk ratio equal to the causal risk ratio (i.e., one)? Pr[Y=1 A=2, L 1 =l] / Pr[Y=1 A=0, L 1 =l] NO Conditioning on L 1 eliminates confounding (blocks the back-door path) for one component of A, i.e., A 1 creates selection bias for the other component of A, i.e., A 0 As long as one component of A is associated with Y, A is associated with Y 8 June 2012 G-formula for CER 21

Aside: To stratify or not to stratify Not stratifying is bad because there is confounding Stratifying is bad because stratification eliminates confounding at the cost of introducing selection bias Because the confounder for part of the exposure is affected by another part of the exposure 8 June 2012 G-formula for CER 22

Aside: More generally Bias if either the confounder is affected by the exposure or shares a common cause with it Bias even if the confounder is not on the causal pathway from exposure to outcome 8 June 2012 G-formula for CER 23

What if the confounder is on the causal pathway? Conditioning on the confounder not only creates selection bias but also prevents identification of the total effect of exposure In general, stratification by intermediate variables to identify direct effects is dangerous 8 June 2012 G-formula for CER 24

In summary Methods that estimate association measure ignoring data on L 1 Association measure does not have a causal interpretation if there is confounding by L 1 Methods that estimate association measure within levels of L 1 Association measure does not have a causal interpretation if L 1 affected by exposure (or a cause of the exposure) Need for other methods 8 June 2012 G-formula for CER 25

G-formula, IP weighting, G-estimation Appropriately adjust for confounding when time-dependent confounders are affected by exposure (or by causes of exposure) For example, in IP weighting adjustment is achieved by eliminating the arrow from confounder to subsequent exposure (in the pseudopopulation) Not by conditioning on the confounder 8 June 2012 G-formula for CER 26

Identifying assumptions Exchangeability Sequential ignorability, no unmeasured confounding Positivity Experimental treatment assumption Well-defined interventions See Robins and Hernán (2008) Estimation of the causal effects of time-varying exposures. In: Longitudinal Data Analysis. Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G, eds. 8 June 2012 G-formula for CER 27

The g-formula aka the g-computation algorithm formula (Robins 1986) Later rediscovered by computer scientists (Pearl et al) Under its assumptions, the g-formula can be used to estimate any average causal effect Not a causal method but the causal method 8 June 2012 G-formula for CER 28

The g-formula Solution to causal inference from complex longitudinal data? Well, there is no solution to causal inference from observational data Unmeasured confounding always a possibility Lack of exchangeability given the measured covariates The g-formula adjusts appropriately for time-varying measured confounding Like IP weighting, unlike conventional methods 8 June 2012 G-formula for CER 29

The g-formula Nonparametric method!! Need to estimate the conditional distribution of outcome, confounders For longitudinal data and/or continuous covariates it requires essentially infinite data and computing time Not a trivial problem More later Let s describe the g-formula for the when to start question 8 June 2012 G-formula for CER 30

Data and notation (I) Time k measured in months k=0,1,2 150 A k indicator of treatment initiation before end of month k V vector of variables measured at or before k=0 sex, geographic origin, mode of transmission, race, cohort, calendar year years since HIV diagnosis, age 8 June 2012 G-formula for CER 31

Data and notation (II) L k vector of time-varying variables during period k indicators for measurement of viral load (L 1,k ) and CD4 (L 2,k ), most recently measured viral load (L 3,k ) and CD4 (L 4,k ), AIDS diagnosis (L 5,k ) Y k+1 indicator of death by end of month k+1 C k+1 indicator of censoring by end of month k+1 8 June 2012 G-formula for CER 32

Dynamic regimes of interest "start cart when CD4 cell count first drops below x or there is a diagnosis of an AIDS-defining illness, whichever happens first" Regimes indexed by x Formalization: Let g k a k 1, l k 1ifeither 1. g k 1 a k 2, l k 1 0andl 4,k x or l 5,k 1or 2. g k 1 a k 2, l k 1 1 and let g k a k 1, l k 0otherwisefork 0. 8 June 2012 G-formula for CER 33

Estimand of interest Pr Yx k 1 1 counterfactual risk of death by month k+1 for k=0,,60 under treatment regime x and with censoring by loss to follow-up abolished The g-formula expresses this counterfactual risk in terms of only the observed data distribution Under the identifying assumptions 8 June 2012 G-formula for CER 34

The g-formula for Pr Yx k 1 1 l k k j 0 Pr Y j 1 1 L j l j, A j a j,y j C j 1 0 j Pr Y s 0 L s 1 l s 1, A s 1 a s 1, Y s 1 C s 0 s 0 f l s l s 1,a s 1, Y s C s 0 General form of epidemiologic standardization for time-varying treatments and confounders 8 June 2012 G-formula for CER 35

Two nontrivial problems 1. Need to estimate the entire likelihood Nonparametric estimation is out of the question except marginal distribution of baseline covariates V can be empirically estimated Parametric estimation perceived as too dangerous 2. Need to compute a huge integral Approximated via Monte Carlo simulation 8 June 2012 G-formula for CER 36

Should we consider parametric estimation of the g-formula? A big chunk of Robins et al s careers devoted to semiparametric methods IP weighting of MSMs, g-estimation of SNMs Model misspecification viewed as particularly bad for longitudinal data Can lead to propagation of errors Now we are talking about going fully parametric? Really? Well, you never know until you try 8 June 2012 G-formula for CER 37

Parametric vs. semiparametric IP weighting of dynamic MSMs Model for treatment initiation Structural model to smooth over regimes Cain et al. Int J Biostat 2010. Parametric g-formula Parametric models for every single density in the likelihood of the observed data Young et al. Stat Biosci 2011, Westreich et al. Stats Med 2012. 8 June 2012 G-formula for CER 38

Implementation of parametric g-formula 3 steps 1. Parametric modeling to estimate factors of the g-formula 2. Monte Carlo simulation to approximate the integral 3. Computation of risk Nonparametric bootstrap for variance estimation Technical details, including on random dynamic regimes, in Young et al. Stat Biosci 2011 Publicly available SAS macro http://www.hsph.harvard.edu/causal 8 June 2012 G-formula for CER 39

Step 1: Parametric modeling of conditional densities 1. Fit models for the conditional densities of the covariates f l s l s 1,a s 1, Y s C s 0 e.g., logistic for discrete, log-linear for continuous Use empirical distribution for baseline covariates 2. Fit a logistic model for the conditional density of the outcome Pr Y k 1 1 L k l k,a k a k,y k C k 1 0 8 June 2012 G-formula for CER 40

Step 2: Monte Carlo simulation under regime x 8 June 2012 G-formula for CER 41

Step 2: Monte Carlo simulation under regime x 8 June 2012 G-formula for CER 42

Step 3: Computation of risk under regime x For k=0, 1, K, estimate the g-formula as n 1 n v 1 k j 0 Pr Y j 1 1 L j l j,v,a j a j,v,y j C j 1 0 j 1 Pr Y s 1 L s 1 l s 1,v,A s 1 a s 1,v,Y s 1 C s 0 s 0 8 June 2012 G-formula for CER 43

Point estimates and 95% CIs x Risk (%) Risk Ratio 500 2.65 (2.15, 3.43) 1.00 (ref) 450 2.77 (2.28, 3.47) 1.05 (0.99, 1.09) 400 2.92 (2.43, 3.55) 1.10 (0.99, 1.18) 350 3.06 (2.59, 3.67) 1.16 (1.00, 1.29) 300 3.22 (2.73, 3.81) 1.22 (1.02, 1.41) 250 3.44 (2.87, 4.07) 1.30 (1.03, 1.55) 200 3.65 (3.01, 4.44) 1.38 (1.05, 1.71) 8 June 2012 G-formula for CER 44

Survival under 3 dynamic regimes 8 June 2012 G-formula for CER 45

Stability of results Results did not materially change under different modeling strategies functional forms for continuous variables, e.g., polynomial, restricted cubic splines inclusion of interaction terms modeling of measurement process More formal model selection and goodness of fit tests are ongoing 8 June 2012 G-formula for CER 46

The natural course One necessary condition is that our approach predicts the observed data correctly That is, we need to be able to predict what would had happened under no treatment intervention 8 June 2012 G-formula for CER 47

Natural course Observed vs. predicted Top: absolute Bottom: difference (95% CI) 8 June 2012 G-formula for CER 48

Natural course Observed vs. predicted Top: absolute Bottom: difference (95% CI) 8 June 2012 G-formula for CER 49

Conclusions (I) Parametric estimation more efficient than semiparametric estimation, duh! Compared with IP weighted estimates in Cain et al. Annals of Internal Medicine 2011 Parametric g-formula is more flexible than any of the other g-methods and is computationally tractable 8 June 2012 G-formula for CER 50

Conclusions (II) Most shockingly, so far the parametric g-formula yields reasonable/stable estimates More applications coming Weight loss and diabetes, fish and heart disease... 8 June 2012 G-formula for CER 51

Acknowledgements Co-investigators At HSPH: Jessica Young, Lauren Cain, Jamie Robins, Roger Logan Members of the HIV-CAUSAL Collaboration Funding Sources NIH grants: R01-AI073127 and U10- AA013566 8 June 2012 G-formula for CER 52