Extending the results of clinical trials using data from a target population

Similar documents
Extending causal inferences from a randomized trial to a target population

Targeted Maximum Likelihood Estimation in Safety Analysis

Propensity Score Weighting with Multilevel Data

Comparative effectiveness of dynamic treatment regimes

Combining multiple observational data sources to estimate causal eects

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Covariate Balancing Propensity Score for General Treatment Regimes

Propensity Score Analysis with Hierarchical Data

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

What s New in Econometrics. Lecture 1

Graphical Representation of Causal Effects. November 10, 2016

Statistical Testing I. De gustibus non est disputandum

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Unbiased estimation of exposure odds ratios in complete records logistic regression

Estimating direct effects in cohort and case-control studies

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Deep Temporal Generative Models of. Rahul Krishnan, Uri Shalit, David Sontag

Spatial Variation in Hospitalizations for Cardiometabolic Ambulatory Care Sensitive Conditions Across Canada

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Statistical inference in Mendelian randomization: From genetic association to epidemiological causation

Propensity Score Methods for Causal Inference

Journal of Biostatistics and Epidemiology

Instrumental variables estimation in the Cox Proportional Hazard regression model

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with Measurement Error

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

A Sampling of IMPACT Research:

Marginal Structural Cox Model for Survival Data with Treatment-Confounder Feedback

Causal Inference with Big Data Sets

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Probability and Probability Distributions. Dr. Mohammed Alahmed

Using Instrumental Variables to Find Causal Effects in Public Health

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

Lecture 1 January 18

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Evidence synthesis for a single randomized controlled trial and observational data in small populations

A Decision Theoretic Approach to Causality

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Causal inference in epidemiological practice

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Advanced Statistical Methods for Observational Studies L E C T U R E 0 6

Robustness to Parametric Assumptions in Missing Data Models

An introduction to biostatistics: part 1

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Causality and Experiments

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

The Lady Tasting Tea. How to deal with multiple testing. Need to explore many models. More Predictive Modeling

Lecture 4 Multiple linear regression

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

Gov 2002: 4. Observational Studies and Confounding

Sensitivity analysis and distributional assumptions

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (

Distributed analysis in multi-center studies

Estimating Optimal Dynamic Treatment Regimes from Clustered Data

Survival Analysis with Time- Dependent Covariates: A Practical Example. October 28, 2016 SAS Health Users Group Maria Eberg

6.3 How the Associational Criterion Fails

Multiple linear regression S6

ST 732, Midterm Solutions Spring 2019

Lecture 3: Causal inference

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

Causal Inference Basics

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Multi-state Models: An Overview

High Dimensional Propensity Score Estimation via Covariate Balancing

DATA-ADAPTIVE VARIABLE SELECTION FOR

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Estimating the Marginal Odds Ratio in Observational Studies

arxiv: v1 [stat.me] 3 Feb 2017

Variable selection and machine learning methods in causal inference

University of Michigan School of Public Health

Doubly Robust Estimation in Missing Data and Causal Inference Models

Mediation Analysis for Health Disparities Research

Econometric Causality

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Lecture 01: Introduction

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

STA6938-Logistic Regression Model

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

arxiv: v1 [stat.me] 15 May 2011

Construction and statistical analysis of adaptive group sequential designs for randomized clinical trials

Introduction to Statistical Inference

University of California, Berkeley

Transcription:

Extending the results of clinical trials using data from a target population Issa Dahabreh Center for Evidence-Based Medicine, Brown School of Public Health

Disclaimer Partly supported through PCORI Methods Research Awards ME-1306-03758 & ME-1502-27794. All statements are solely mine and do not necessarily represent the views of the PCORI, its Board of Governors, or the Methodology Committee. Work in progress 1

Outline Applicability, external validity Concepts & study design Extending trial findings to the target population Structural assumptions Statistical methods Simulation study Applications small data: revascularization for coronary artery disease Big data: anti-hypertensives for chronic renal failure Extensions Concluding remarks 2

Applicability, external validity Concepts and design-based solutions 3

Applicability and external validity essential for imparting significance to scientific research beyond the confines of the original investigation Evidence interpretation: obtain à assess validity à synthesize à assess applicability Unaided subjective judgment Not transparent Subject to cognitive biases No conceptual framework Key Question: How can we aid our judgment using causal assumptions & statistical methods? 4

Assessing applicability Based on similarity judgments Populations Interventions Outcomes Applicability is tied to representativeness of trial participants Target population where the trial results will be applied 5

What is a representative sample? Kruskal & Mosteller, Int Stat Review 1980 Representativeness: main preoccupation in survey research 6

Clinical trials and target populations Target population 1,319 patients who were eligible for randomization [ ] were not randomized OR 23,467 patients seen at participating centers during the study period Experimentally accessible population Population that gave rise to the trial participants Randomized clinical trial sample Myocardial infarction and mortality in the Coronary Artery Surgery Study randomized trial, n = 780 CASS investigators NEJM 1984 7

Should we worry about representativeness? Participants in the trials have different distributions of covariates than the target Treatment effects are heterogeneous over these covariates Effect modification or heterogeneity of treatment effects Key idea: applicability requires similar distribution of effect modifiers across populations 8

Is this a real problem? Systematic review, 52 studies (cardiology = 20; mental health = 17; oncology = 15) a high proportion of the general disease population was often excluded from trials. highly selected and have a lower risk profile than real-world populations, with the frequent exclusion of elderly patients and patients with comorbidities. 71.2 % [of studies] concluded that RCT samples were not broadly representative of real-world patients Kennedy-Martin et al., Trials 9 2015

Design-based strategy (ideal) : survey experiments Randomly select individuals from the target population (survey) Randomly assign the sampled individuals (experiment) Analyze the results using survey weights for inference to the target population 10

Limitations of survey experiments The patients invited are different from those who agree to participate Exceedingly difficult to do in medical or health services research E.g.: Oregon Health Insurance Experiment Applying the results of a survey experiment to different target population raises the same problem as for typical trials 11

Extending trial results to a target population Structural assumptions and statistical methods 12

Counterfactual causality Causal effects = contrasts between potential outcomes under different interventions For the i th individual, i = 1,...,N Y 1 i is the potential outcome under treatment Y 0 i is the potential outcome under control Individual treatment effect = Y 1 i Y 0 i Average treatment effect, Δ = E[Y 1 i Y 0 i] = E[Y 1 i] Ε[Y 0 i] 13

Causal inference and missing data Foundamental problem of causal inference: we can never know Y 1 i and Y 0 i Causal problem à missing data problem Deep connection Causal inference problems are missing data problems Missing data problems are causal inference problems Rubin, J Ed Psych 1974; Rubin, JASA 2008 14

Missing data in the trial We have observed outcomes but know nothing about potential outcomes S A Y Y 1 Y 0 X 1 1 0 15

Identification in the trial S A Y Y 1 Y 0 X 1 y 1 0 y 16

Identification in the trial S A 1 Y y Y 1 y 1 Y 0 X Well defined interventions, consistency Y obs = A Y 1 + (1 - A) Y 0 1 0 y y 0 17

Identification in the trial S A 1 Y y Y 1 y 1 Y 0 X Well defined interventions, consistency Y obs = A Y 1 + (1 - A) Y 0 1 0 y y 0 Positivity 0 < P(A = 1 X) < 1 18

Identification in the trial S A 1 Y y Y 1 Y 0 X Well defined interventions, consistency Y obs = A Y 1 + (1 - A) Y 0 1 0 y y 1 y 0 Positivity 0 < P(A = 1 X) < 1 Exchangeability Marginal: E[Y a A = a, S = 1] = E[Y a S = 1] 19

Identification in the trial S A 1 Y y Y 1 Y 0 X Well defined interventions, consistency Y obs = A Y 1 + (1 - A) Y 0 Positivity 0 < P(A = 1 X) < 1 Exchangeability Marginal: E[Y a A = a, S = 1] = E[Y a S = 1] Conditional: E[Y a A = a, X, S = 1] = E[Y a X, S = 1] 1 0 y y 1 y 0 Will also assume that the trial is otherwise flawless: no dropouts, no missing data, no measurement error, This is only done for clarity of exposition 20

Extending the trial to the target population S A 1 Y y Y 1 Y 0 X All we know is who is in and who is out of the trial 1 0 y y 1 y 0 0 21

Identification in the target S A 1 Y y Y 1 Y 0 X Often we have information on the covariate distribution in the target 1 0 y y 1 y 0 0 22

Identification in the target S A 1 Y y Y 1 Y 0 X Often we have information on the covariate distribution in the target 1 0 y y 1 y 0 Positivity of trial participation P(S = 1 X) > 0 0 23

Identification in the target S A 1 Y y Y 1 Y 0 X Often we have information on the covariate distribution in the target 1 0 y Positivity of trial participation P(S = 1 X) > 0 Conditional mean transportability E[Y a X, S = 1] = E[Y a X] 0 y 1 y 0 24

Positivity of trial participation No set of covariates predicts lack of participation perfectly No systematically excluded subgroups of patients Required for principled extrapolation Sometimes implausible, e.g., Tails of the age distribution Disease severity Socioeconomic status 25

Conditional mean transportability We know enough factors that determine the outcome so that trial participation itself is unimportant Trial participation itself does not have a direct effect on the outcome Often implausible Hawthorne effect Incomplete knowledge of the outcome mechanisms Lack of measurements 26

Using a model for the outcome Treat the trial as an investigation into the mechanism of the outcome What statistics professors tell us not to do extrapolation How to do it: 1. Estimate 2 models for the outcome, one in each of the treatment arms 2. Use the models to predict outcomes under treatment and no treatment for everyone in the target 3. Compare the average values to estimate the causal effect Imai & Ratkovic, Ann Applied Stat 2013 27

Outcome-model-based estimator 28

Using a model for the probability of participation Treat the trial as a survey where we lost the sampling weights Same structural, different modeling assumptions extrapolation How to do it: 1. Estimate a model for the probability of participation 2. Weight the outcomes in each trial arm by the inverse of the probability of participation 3. Compare the average weighted values to estimate the causal effect Stuart & Cole, Am J Epi 2010 29

Probability of participation estimator 30

Extreme model reliance Need for correct specification of the outcome or participation models Otherwise estimators are inconsistent Ideally, we want a method that Protects us from getting the wrong answer if we could get one of the models right (and did not know which) Is as efficient as the outcome model-based approach Provides a way to assess model specification 31

Doubly robust estimation Such a method exists! Smart way to combine the outcome model and the probability of participation Make the weighted estimator more efficient and gain robustness Analogous to de-confounding methods for combining weighting and regression modeling Scharfstein et al. JASA 1999; Robins & Rotnitzky, Stat Sinica 2001; Robins et al., Stat Science 2007 32

Doubly robust estimators Unbounded, Horwitz-Thompson 1. Model participation in the trial and calculate inverse probability of participation weights 2. Estimate a model for the outcome using regression and predict outcomes 3. Combine observed outcomes, predicted outcomes, and the probability of participation in a smart way ( ˆδ DR = 1 NX S i A i Y i N i=1 w 1 (X i, ˆβ) + w 1(X i, ˆβ) S i A i w 1 (X i, ˆβ) g 1 (X i, ˆγ 1 ) S i(1 A i )Y i w 0 (X i, ˆβ) w 0(X i, ˆβ) ) S i (1 A i ) w 0 (X i, ˆβ) g 0 (X i, ˆγ 0 ). Bounded, regression-based 1. Model participation in the trial and calculate inverse probability of participation weights 2. Estimate a model for the outcome using weighted regression and predict outcomes 3. Use the predictions to estimate the causal effect 33

3 claims Consistent if one of the two models (or both) are correctly specified In large samples, if both models are correctly specified, as efficient as the outcome model based estimator Reasonable performance in finite samples Key idea: 2 opportunities to make a modeling mistake for the price of 1 34

Simulation study (selected results) 35

Simulation setup N = 1,000,000 X j,i ~ N(0,1); j = 1,2,3; i= 1,,N; X it =(1, X 1,i, X 2,i, X 3,i ) S i ~ Bernoulli(π i ) A i ~ Bernoulli(0.5) if S i = 1 π i = exp(x it α) / (1 + exp(x it α)); α T = (α 0, 1,1,1) α 0 to obtain expected trial sample sizes of 1000, 5000, or 10,000 μ i = β 0 + Σβ j Χ j,i + ψx 1,i A i + γa i Y i ~ N(μ i,1) 36

Bias Bias 0.1 0.0 n = 1000, ψ = 0 0.1 0.0 n = 5000, ψ = 0 0.1 0.0 n = 10,000, ψ = 0-0.1 0 1 2 3-0.1 0 1 2 3-0.1 0 1 2 3 0.1 n = 1000, ψ = -1 0.1 n = 5000, ψ = -1 0.1 n = 10,000, ψ = -1 Bias 0.0 0.0 0.0-0.1 0 1 2 3-0.1 0 1 2 3-0.1 0 1 2 3 0.1 n = 1000, ψ = 1 0.1 n = 5000, ψ = 1 0.1 n = 10,000, ψ = 1 Bias 0.0 0.0 0.0-0.1 0 1 2 3 γ -0.1 0 1 2 3 γ -0.1 0 1 2 3 γ 37

Variance Variance 0.60 0.40 0.20 n = 1000, ψ = 0 0.60 0.40 0.20 n = 5000, ψ = 0 0.60 0.40 0.20 n = 10,000, ψ = 0 0.00 0 1 2 3 0.00 0 1 2 3 0.00 0 1 2 3 0.60 n = 1000, ψ = -1 0.60 n = 5000, ψ = -1 0.60 n = 10,000, ψ = -1 Variance 0.40 0.20 0.40 0.20 0.40 0.20 0.00 0 1 2 3 0.00 0 1 2 3 0.00 0 1 2 3 0.60 n = 1000, ψ = 1 0.60 n = 5000, ψ = 1 0.60 n = 10,000, ψ = 1 Variance 0.40 0.20 0.40 0.20 0.40 0.20 0.00 0 1 2 3 γ 0.00 0 1 2 3 γ 0.00 0 1 2 3 γ 38

Applications Data, Big and small 39

Small data: randomized preference design Chronic coronary artery disease Coronary artery bypass grafting (CABG) vs. medical therapy 2099 met trial criteria: 780 randomized (390 CABG; 390 medical) 1319 self-selected (570 CABG; 745 medical) Outcome: death by year 24 (no dropout) 40

Treatment effects CASS study Randomizable (crude) 1.169 (0.930 to 1.469) DR (regression) 1.038 (0.788 to 1.395) Total population DR (HT) Outcome model 1.015 (0.743 to 1.387) 1.037 (0.784 to 1.385) Weighted 1.035 (0.770 to 1.388) Randomized 1.011 (0.754 to 1.356) 0.5 1 Odds Ratio 2 41

Big data: combined claims and EMR Non-diabetic chronic kidney disease Angiotensin-converting inhibitors (ACE) vs. other anti-hypertensives (control) 9 trials, 791 ACEi vs 765 control Outcome: dialysis Optum Labs, chronic kidney disease cohort 93,160 non-diabetic kidney disease patients met trial inclusion-exclusion criteria (e.g., no diabetes, cancer, dementia) Data on Claims: age, sex, race, hypertension + comorbidities EMR: age, sex, race, hypertension, BMI, height, weight, creatinine values, systolic + diastolic blood pressure, estimated glomerular filtration rate + comorbidities 42

Causally interpretable meta-analysis All trials re-standardized to a common target population trial data claims 0.659 (0.486 to 0.893) 0.593 (0.406 to 0.865) 0.659 (0.486 to 0.893) 0.593 (0.406 to 0.865) 0.667 (0.298 to 1.492) EMR 0.667 (0.298 to 1.492) 0.01 0.1 1 10 100 43

How seriously should we take these results? Mean transportability and positivity are strong assumptions Results can be thought as attempts to approximate the causal effect Tools for informing judgments Input to the design of a future study Transparent assumptions No no confounding assumption in the target (triangulation) Better covariate collection in trials (and observational studies) 44

Extensions Assessing applicability Assessing whether the trial and target distribution are close enough High dimensions (visualization + testing) Methods for extending trial findings Additional data available in the target population Behavior under model misspecification Sensitivity analysis Implications for study design and economic modeling 45

Concluding remarks Elements of a framework for assessing applicability and extending trial findings Heroic assumptions needed (made explicit) Transparency Methods for extending trial findings, with a view towards robustness 2 real world applications first causally interpretable meta-analysis ever Causal inference and evidence interpretation: extrapolations are normally components of a complex web of interrelated evidence that must be considered together in assessing a hypothesis 46

a highly standardized experiment supplies direct information only in respect to the narrow range of conditions achieved by standardization. Standardization, therefore, weakens rather than strengthens our ground for inferring a like result, when, as is invariably the case in practice, these conditions are somewhat varied. R.A. Fisher The design of experiments, 1935 47

Thank you! BSPH Chris Schmid Sarah Robertson Iman Saeed HSPH Miguel Hernan James Robins John s Hopkins Liz Stuart Tufts MC David Kent John Wong Jason Nelson Lesley Inker Duke Liz Delong ICER Moira Kapral PCORI Emily Evans 48

Misspecified outcome model

Misspecified probability of participation model