Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Similar documents
Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Inference on Treatment Effects from a Randomized Clinical Trial in the Presence of Premature Treatment Discontinuation: The SYNERGY Trial

A Sampling of IMPACT Research:

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Estimating the Marginal Odds Ratio in Observational Studies

Inference on treatment effects from a randomized clinical trial in the presence of premature treatment discontinuation: the SYNERGY trial

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Causal Inference Basics

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Estimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Estimating Causal Effects of Organ Transplantation Treatment Regimes

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Subgroup analysis using regression modeling multiple regression. Aeilko H Zwinderman

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Independent Increments in Group Sequential Tests: A Review

Power and Sample Size Calculations with the Additive Hazards Model

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Instrumental variables estimation in the Cox Proportional Hazard regression model

Targeted Maximum Likelihood Estimation in Safety Analysis

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

BIOS 2083: Linear Models

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Adaptive Trial Designs

Gov 2002: 4. Observational Studies and Confounding

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Lecture 1 January 18

Group Sequential Designs: Theory, Computation and Optimisation

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Causal modelling in Medical Research

Causal Effect Models for Realistic Individualized Treatment and Intention to Treat Rules

A Decision Theoretic Approach to Causality

Sensitivity analysis and distributional assumptions

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Survival Analysis for Case-Cohort Studies

Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous

Propensity Score Weighting with Multilevel Data

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

CompSci Understanding Data: Theory and Applications

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

Implementing Precision Medicine: Optimal Treatment Regimes and SMARTs. Anastasios (Butch) Tsiatis and Marie Davidian

Variable selection and machine learning methods in causal inference

Lecture 3. Truncation, length-bias and prevalence sampling

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Lecture 7 Time-dependent Covariates in Cox Regression

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Combining multiple observational data sources to estimate causal eects

Bios 6648: Design & conduct of clinical research

Ph.D. course: Regression models. Introduction. 19 April 2012

Comparative effectiveness of dynamic treatment regimes

Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer

Multistate models and recurrent event models

What s New in Econometrics. Lecture 1

DATA-ADAPTIVE VARIABLE SELECTION FOR

Lecture 5 Models and methods for recurrent event data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Statistical Methods for Alzheimer s Disease Studies

An Introduction to Causal Inference, with Extensions to Longitudinal Data

Sample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018

Stat 642, Lecture notes for 04/12/05 96

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Construction and statistical analysis of adaptive group sequential designs for randomized clinical trials

Web-based Supplementary Materials for A Robust Method for Estimating. Optimal Treatment Regimes

Propensity Score Methods for Causal Inference

Multistate models and recurrent event models

Extending causal inferences from a randomized trial to a target population

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study

Rerandomization to Balance Covariates

SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Semiparametric Regression and Machine Learning Methods for Estimating Optimal Dynamic Treatment Regimes

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

EMERGING MARKETS - Lecture 2: Methodology refresher

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

Effects of multiple interventions

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

arxiv: v1 [stat.me] 15 May 2011

Harvard University. Robust alternatives to ANCOVA for estimating the treatment effect via a randomized comparative study

A Gate-keeping Approach for Selecting Adaptive Interventions under General SMART Designs

Bayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects

Statistics and Probability Letters. Using randomization tests to preserve type I error with response adaptive and covariate adaptive randomization

Building a Prognostic Biomarker

Extending the results of clinical trials using data from a target population

Balancing Covariates via Propensity Score Weighting

Transcription:

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1

Outline Description of Motivating Example Formulating the Statistical Question Solution Application to Example Introduction to Dynamic Treatment Regimes 2

The ESPRIT Infusion Trial Study of the effect of Integrilin therapy on outcome for patients undergoing coronary stent implantation Study Design and Features: Randomized, placebo-controlled trial - 1024 on placebo and 1040 on experimental treatment Experimental Integrilin infusion for 18-24 hours Outcome: composite endpoint of death, MI, or urgent target revascularization at 30 days Conclusion: Outcome (failure rate) reduced from 10.5% (on placebo) to 6.8% (on treatment) which was significant (p=0.0034). Introduction to Dynamic Treatment Regimes 3

After the study concluded that Integrilin therapy was effective in reducing the failure rate, attention focused on determining a recommended treatment duration. Treatment duration decision: Infusion length ends when attending physician deems it appropriate (physician discretion); generally between 18-24 hours Bailout - infusion immediately stopped if patient experiences adverse event severe bleeding abrupt closure no reflow coronary thrombosis failure outcome Thus, actual infusion is stopped either by physician discretion or the occurrence of a treatment-terminating adverse event. Introduction to Dynamic Treatment Regimes 4

Research Goal (Dynamic Treatment Regime) What should be the recommended infusion length? More precisely, an infusion length recommendation for t hours really means to infuse for t hours or until a treatment-terminating event occurs, whichever comes first. Such a recommendation is an example of a dynamic treatment regime because the treatment will vary according to the outcome of the patient. We refer to such a dynamic treatment regime as a Treatment Duration Regime of t Hours To address our goal, we must be able to estimate the mean response (probability of a failure) associated with different treatment duration regimes. Introduction to Dynamic Treatment Regimes 5

Integrilin Duration for Patients with No Adverse Events hours no. of pts 16 ( < 17) 61 18 (17-19) 479 20 (19-21) 194 22 (21-23) 85 24 ( > 23) 111 930 Table 1: Table of Treatment Completion versus Outcome Introduction to Dynamic Treatment Regimes 6

Response Summary for Integrilin Patients With Adverse Events Status no. of pts no. of failures Early Term. 106 20 18.9% ( < 17 ) 89 19 21.3% ( 17-19 ) 11 0 0.0% ( 19-21 ) 6 1 16.7% Completed 930 51 5.5% 1036 71 6.8% Table 2: Table of Early Treatment Termination versus Outcome Introduction to Dynamic Treatment Regimes 7

Summarizing Adverse events occurred early Poor outcome associated with adverse events Infusion duration was between 16-24 hours Introduction to Dynamic Treatment Regimes 8

Ideal experiment: Randomized study Randomize patients to different treatment-duration regimes Know patient treatment assignment Patients are, on average, similar across treatment regimes Compare failure rates among randomized regimes (Intent-to-treat analysis) Introduction to Dynamic Treatment Regimes 9

In an Observational Study Patients are not randomly assigned to regimes. The study of the duration of Integrilin infusion on outcome is an example of an observational study embedded in a randomized study. This creates two difficulties: No longer reasonable to assume patients are similar across assigned treatment durations Unlike the randomized study, the intended treatment duration regime is not known (censored) for a patient whose infusion is terminated because of an adverse event Introduction to Dynamic Treatment Regimes 10

Observed Data The data in an observational study can be summarized as Y = binary 30-day outcome (failure indicator) U = observed infusion length = 1 (infusion stopped by physician discretion) = 0 (infusion stopped by adverse event) Z(U) = covariate history through time U For simplicity, we consider a finite number of treatment duration regimes t 1,...,t k by discretizing the duration data With such data how do we answer the research question of interest? For that matter what is the research question? Introduction to Dynamic Treatment Regimes 11

Causal Models Conceptualize the problem and define the parameter of interest using potential outcomes (counterfactuals) as defined by Neyman, Rubin, Holland and Robins Model the treatment decision (infusion duration) process Make assumptions. Are they feasible? Introduction to Dynamic Treatment Regimes 12

Framework, Methods and Assumptions The Conceptualization of the Problem using Potential Outcomes (What if Outcomes) For a randomly selected individual in our population, let {C, Y (C)} = time to adverse event if continuously treated and potential response if treatment continued to that point {Y (t 1 ),...,Y (t k )} = potential responses for treatment regimes t 1,...,t k. Note that Y (t j ) = Y (C) for t j C Z(C) = covariate history Let us denote the all the potential outcomes by V = {C, Y (t 1 ),...,Y (t k ), Z(C)} Introduction to Dynamic Treatment Regimes 13

Goal: Estimate µ j = E{Y (t j )}, j = 1,...,k Easy if we had the potential outcomes, but, of course, we don t We only get to observe the responses according to the actual infusion duration each patient received Introduction to Dynamic Treatment Regimes 14

Assumptions What assumptions on the treatment decision (infusion duration) process will allow us to identify the distribution of Y (t j ) through the observed data? Introduction to Dynamic Treatment Regimes 15

Observational Study Assumption No Unmeasured confounders { } λ j (V ) = P U = t j, = 1 U t j, V = P { U = t j, = 1 U t j, Z(t j ) } = λ j ( Z j ), for j = 1,...,k, where λ j ( Z j ) denotes the discrete cause-specific hazard for stopping infusion at t j with physician discretion. In words, this assumption implies that the conditional probability to stop or continue treatment at time t j, given that the patient has continuously received treatment without experiencing an adverse event, only depends on the observed covariate data up through time t j and not additionally on other potential outcomes. Introduction to Dynamic Treatment Regimes 16

Question Using the above assumptions, can we construct a consistent estimator for µ j? Answer A weighted estimator using importance weights defined via the inverse propensity score Introduction to Dynamic Treatment Regimes 17

Propensity Score Let R ij denote the indicator of whether the treatment that patient i receives is consistent with the regime t j That is, R ij = 1 if (U i = t j, i = 1) or (U i < t j, i = 0) = 0 otherwise Define the propensity score by π j (V i ) = P(R ij = 1 V i ) Introduction to Dynamic Treatment Regimes 18

Importance Weights The importance weight for treatment duration regime t j is defined by w ij = R ij π j (V i ) That is, when R ij = 0 then w ij = 0 R ij = 1 then w ij = 1/π j (V i ) The weighted estimator is then given by ˆµ j = n i=1 w ijy i n i=1 w ij Introduction to Dynamic Treatment Regimes 19

Motivation Suppose everyone in the sample was assigned to regime t j ; hence Y i = Yi (t j), then ˆµ j = n 1 n i=1 Y i However, not everyone is assigned treatment consistent with regime t j Therefore, take every one in the sample that is consistent with the regime t j ; i.e. {i : R ij = 1}, and weight their response by 1/π j (V i ), so that they not only represent their response but also the response of others that didn t receive this treatment regime Introduction to Dynamic Treatment Regimes 20

Propensity scores What is the propensity score π j (V ) = P(R j = 1 V )? Since V is not observed, can the propensity score be written as a function of the observed data {U,, Z(U)}? Introduction to Dynamic Treatment Regimes 21

Propensity scores The propensity score π j (V ) equals {1 λ 1 (V )}... {1 λ j 1 (V )} λ j (V ) if C > t j {1 λ 1 (V )}... {1 λ [C] (V )} if C < t j Because of the assumption of No Unmeasured Confounders {1 λ 1 ( Z 1 )}... {1 λ j 1 ( Z j 1 )} λ j ( Z j ) if C > t j {1 λ 1 ( Z 1 )}... {1 λ [C] ( Z [C] )} if C < t j where λ j (V ) = Pr(U = t j, = 1 U t j, V ) λ j ( Z j ) = Pr(U = t j, = 1 U t j, Z j ) Introduction to Dynamic Treatment Regimes 22

Weighted estimator Define f j ( Z j ) = λ j ( Z j ) K j ( Z j ) = j m=1 j 1 m=1 {1 λ m ( Z m )} {1 λ m ( Z m )} Consequently, w ij = R ij π j (V i ) = I(U i = t j, i = 1) f j ( Z ij ) + I(U i < t j, i = 0) K [Ui ]( Z i[ui ]) Hence, the weighted estimator is ˆµ j = n i=1 w ijy i n i=1 w ij Introduction to Dynamic Treatment Regimes 23

Estimating f j ( Z j ) and K [U] ( Z [U] ) The discrete hazards λ j ( Z j ), j = 1,...,k, needed to compute f j ( Z j ) and K [U] ( Z [U] ), were estimated using conditional logistic regression models which were then used to estimate the inverse propensity score estimators ˆµ j, j = 1,...,k. Using theory of m-estimation we show that, under suitable regularity conditions, ˆµ j will be consistent and asymptotically normal with asymptotic variance which can be estimated using a sandwich estimator. Introduction to Dynamic Treatment Regimes 24

Analysis of ESPRIT Infusion Trial The key assumption driving this methodology is that of No Unmeasured Confounders We emphasized this point with the investigators General belief was that, other than the adverse events that necessitated termination of treatment, the infusion duration was a matter of physician preference and convenience Potential Confounders: Time Independent Covariates: Diabetes, PTCA, Angina, Heparin, Weight, Time Dependent Covariate: Enzyme levels (recorded every six hours) Introduction to Dynamic Treatment Regimes 25

Table 3: T 1j denotes an overall event rate, T 2j denotes an uncensored event rate, ˆµ (0) j is the proposed estimator assuming no confounding is present, ˆµ (1) j is the proposed estimator assuming confounding is present through baseline factors only, and ˆµ (2) j time-dependent confounding. is the proposed estimator assuming t j (hrs) T 1j T 2j ˆµ (0) j ˆµ (1) j ˆµ (2) j 16.133.016.047(.021).040(.016).044(.018) 18.045.046.066(.010).066(.010).070(.011) 20.065.062.079(.017).078(.017).078(.017) 22.047.047.071(.024).071(.024).067(.022) 24.108.108.116(.027).121(.035).109(.032) Introduction to Dynamic Treatment Regimes 26

Placebo patients t j (hrs) T 1j T 2j ˆµ (0) j ˆµ (1) j ˆµ (2) j 16.187.097.106(.036).116(.040).131(.045) 18.073.071.083(.012).083(.012).091(.013) 20.116.116.125(.022).125(.022).126(.022) 22.070.070.081(.026).079(.026).079(.026) 24.187.187.191(.033).170(.031).124(.024) Introduction to Dynamic Treatment Regimes 27

Discussion Through potential outcomes and importance weighting, we proposed a method to estimate mean response for treatment duration regime of t hours where only naive and ad hoc methods had been used before Estimator is consistent and asymptotically normal under the proposed assumptions Simulation studies show that our estimator performs well in realistic sample sizes Introduction to Dynamic Treatment Regimes 28

References and Future Direction For discrete time approximations, results can be found in Johnson and Tsiatis (2004) Biometrics 60: 315-323. Clinical paper in Rebeiz et al. (2004) American Journal of Cardiology 94: 926-929. Generalized to estimate duration-response relationships in continuous time Johnson and Tsiatis (2005) Biometrika 92: 605-618. Optimal duration regimes, where timing may additionally be influenced by covariate history in a way to obtain best overall average outcome may be considered Introduction to Dynamic Treatment Regimes 29