Causal-inference and Meta-analytic Approaches to Surrogate Endpoint Validation

Similar documents
Practical issues related to the use of biomarkers in a seamless Phase II/III design

Package Surrogate. January 20, 2018

1 Introduction. Keywords: clinical trial, principal stratification, surrogate paradox, vaccine efficacy trial, Zoster

Surrogate marker evaluation when data are small, large, or very large Geert Molenberghs

arxiv: v1 [stat.ap] 6 Mar 2018

Evaluating Causal Effect Predictiveness of Candidate Surrogate Endpoints

Causal Effect Evaluation and Causal Network Learning

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

A Unified Approach to the Statistical Evaluation of Differential Vaccine Efficacy

Module 16: Evaluating Vaccine Efficacy

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small

Modeling of Correlated Data and Multivariate Survival Data

A proof of Bell s inequality in quantum mechanics using causal interactions

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University

arxiv: v1 [stat.me] 3 Feb 2016

Evaluation of the predictive capacity of a biomarker

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Survival Analysis for Case-Cohort Studies

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Group Sequential Trial with a Biomarker Subpopulation

DISIA WORKING PAPER 2015/06

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Accounting for Baseline Observations in Randomized Clinical Trials

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

A Measure of the Proportion of Treatment Effect Explained by a Surrogate Marker

Bayesian concept for combined Phase 2a/b trials

Causal modelling in Medical Research

Using Post Outcome Measurement Information in Censoring by Death Problems

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables

Rubin s Potential Outcome Framework. Motivation. Example: Crossfit versus boot camp 2/20/2012. Paul L. Hebert, PhD

Statistical Models for Causal Analysis

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Choice of units of analysis and modeling strategies in multilevel hierarchical models

CTDL-Positive Stable Frailty Model

Represent processes and observations that span multiple levels (aka multi level models) R 2

Harvard University. Harvard University Biostatistics Working Paper Series

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Introduction to Statistical Inference

Bios 6648: Design & conduct of clinical research

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Modeling Mediation: Causes, Markers, and Mechanisms

Accounting for Baseline Observations in Randomized Clinical Trials

A Decision Theoretic Approach to Causality

Single World Intervention Graphs (SWIGs):

University of California, Berkeley

Bayesian Analysis for Markers and Degradation

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Bayesian Inference for Causal Mediation Effects Using Principal Stratification with Dichotomous Mediators and Outcomes

Estimation of Optimally-Combined-Biomarker Accuracy in the Absence of a Gold-Standard Reference Test

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized. Clinical Trials Subject to Noncompliance.

Valida&on of Predic&ve Classifiers

arxiv: v1 [stat.me] 8 Jun 2016

Multivariate Survival Analysis

Variable selection and machine learning methods in causal inference

Bayesian Drug Disease Model with Stan Using published longitudinal data summaries in population models

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance

Discussion of "Principal stratification designs..." James Robins, Andrea Rotnitzky and Stijn Vansteelandt

Sensitivity analysis and distributional assumptions

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Causal Inference with Counterfactuals

The Causal Inference Problem and the Rubin Causal Model

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

Journal of Biostatistics and Epidemiology

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation

Inference in Randomized Trials with Death and Missingness

Analysing Survival Endpoints in Randomized Clinical Trials using Generalized Pairwise Comparisons

Dose-response modeling with bivariate binary data under model uncertainty

External Validity and Transportability: A Formal Approach

A Clinical Trial Simulation System, Its Applications, and Future Challenges. Peter Westfall, Texas Tech University Kuenhi Tsai, Merck Research Lab

arxiv: v2 [stat.me] 24 Apr 2018

Pubh 8482: Sequential Analysis

Causal Mechanisms Short Course Part II:

SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN

Harvard University. Harvard University Biostatistics Working Paper Series

Two-stage k-sample designs for the ordered alternative problem

A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes

Lecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

A spatial causal analysis of wildfire-contributed PM 2.5 using numerical model output. Brian Reich, NC State

Substantiation of Disease and Health-Related Claims in Advertising. Washington, D.C.

The Design of a Survival Study

Estimating causal effects in trials involving multitreatment arms subject to non-compliance: a Bayesian framework

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Modeling Log Data from an Intelligent Tutor Experiment

Research Design: Causal inference and counterfactuals

Estimation of direct causal effects.

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Evidence synthesis for a single randomized controlled trial and observational data in small populations

Adaptive Designs: Why, How and When?

Models for Multivariate Panel Count Data

Transcription:

Causal-inference and Meta-analytic Approaches to Surrogate Endpoint Validation Tomasz Burzykowski I-BioStat, Hasselt University, Belgium, and International Drug Development Institute (IDDI), Belgium tomasz.burzykowski@uhasselt.be 1

Outline Motivation Meta-analytic approach Causal paradigms Principal surrogacy Causal-association & the meta-analytic approach 2

Terminology Clinical endpoint: a characteristic or variable that reflects how a patient feels, functions, or survives Biomarker: a characteristic that is objectively measured and evaluated as an indicator of normal biological processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention Surrogate endpoint: a biomarker that is intended to substitute for a clinical endpoint. A surrogate endpoint is expected to predict clinical benefit (or harm or lack of benefit or harm) Refs: Biomarkers Definition Working Group, Clin Pharmacol Ther 2001, 69: 89; De Gruttola et al, Controlled Clin Trials 2001, 22: 485.

Why Do We Need Surrogates? Practicality of studies: Shorter duration Smaller sample size (?) Availability of biomarkers: Tissue, cellular, hormonal factors, etc. Imaging techniques Genomics, proteomics, other -omics Ref: Schatzkin and Gail, Nature Reviews (Cancer) 2001, 3 4

How To Define a Surrogate? Key point: A correlate does not a surrogate make correlation between S(urrogate) and T(rue endpoint) is not a sufficient condition for validity Ref: Fleming and DeMets, Ann Intern Med 1996, 125: 605. 5

A Correlate Does not a Surrogate Make Ref: Korn, Albert & McShane, Statist Med 2005;24:163 6

Validation Based on the Precision of Prediction The effect of treatment on a surrogate endpoint must be reasonably likely to predict clinical benefit Ref: Biomarkers Definition Working Group, Clin Pharmacol Ther 2001, 69: 89. 7

Meta-analytic Approach Assume (ε Tij, ε Sij ) ~ N 2 (0, Σ e ), with Σ e = Let (μ Si, μ Ti, β i,α i ) ~ normal. Define μ Δi =(β i,α i ) ~ N 2 (μ * Δ,D Δ ), with Ref: Buyse et al, Biostatistics 2000 8

Prediction of True Endpoint From Surrogate R² (or R) indicates quality of regression Endpoints observed on individual patients True Endpoint Surrogate Endpoint 9

Prediction of Treatment Effect: Multiple Trials Treatment Effect on True Endpoint (β) 1.5 0 -.5 R² (or R) indicates quality of regression Treatment effects observed in all trials -1-1 0 1 Treatment Effect on Surrogate Endpoint (α) 10

Meta-analytic Approach (ε Tij, ε Sij ) ~ N 2 (0, Σ e ), with Σ e = Define R ind =σ TS /(σ TT σ SS ) 0.5 ; if R ind ±1, surrogate valid at the individual-level μ Δi =(β i,α i ) ~ N 2 (μ * Δ,D Δ ), with Define R trial =d ab /(d aa d bb ) 0.5 ; if R trial ± 1, surrogate valid at the trial-level Ref: Buyse et al, Biostatistics 2000 11

Example: Age-related Macular Degeneration 181 patients form 36 centers; placebo vs. interferon-α S: change in visual acuity at 6 mths; T: change at 1 year Ref: Buyse et al, Biostatistics 2000; Alonso et al, Biometrics 2015 12

Example: Age-related Macular Degeneration Ref: Alonso et al, Biometrics 2015 13

Causal-inference and Validation of Surrogate Endpoints Principal stratification (Frangakis & Rubin, Biometrics 2002) Causal inference links for proportion explained (Taylor et al, Biometrics 2005) Causal necessity and sufficiency (Gilbert & Hudgens, Biometrics 2008) Causal paradigms (Joffe & Greene, Biometrics 2009) Bayesian model, single trial (Li et al., Biometrics 2010) Bayesian model, multiple trials (Li et al., Biostatistics 2011) Principal stratification, normal endpoints (Conlon et al., Biostatistics 2013) Surrogate paradox (VanderWeele, Biometrics 2013) Link to the meta-analytic approach (Alonso et al., Biometrics 2015) 14

Causal Inference: Potential Outcomes and Counterfactuals Each subject has two potential outcomes for T: T 0j and T 1j, corresponding to the two treatments (Z = 0,1). Likewise for S. Only one of these outcomes is observed, the other is counterfactual. Example: binary data Subject j Z j T 0j T 1j S 0j S 1j 1 0 1? 1? 2 1? 1? 1 3 1? 0? 0 4 0 0? 1? 5 1? 1? 1 15

Causal Inference: Individual Causal Effects The causal treatment effect ont for subject j is Δ Tj =T 1j T 0j. The causal treatment effect on S for subject j is Δ Sj =S 1j S 0j. The effects are unobservable, unless special designs, e.g., a cross-over trial, are considered. 16

Causal Inference: Expected (Average) Causal Effects The expected (average) causal effect on T is β = E(T 1j -T 0j ) = E(Δ Tj ). The expected (average) causal effect on S is α = E(S 1j -S 0j ) = E(Δ Sj ). Under Stable Unit Treatment Value Assumption (SUTVA), and if Z j (T 0j,T 1j,S 0j,S 1j ) (plausible under randomization), then β = E(T j Z j =1) - E(T j Z j =0) and α = E(S j Z j =1) - E(S j Z j =0) E(T j Z j =1), etc., can be estimated by the sample means. Hence, the meta-analytic approach can be seen as the analysis of the association of the expected causal effects. 17

Causal-inference Paradigms for Validation of Surrogate Endpoints Two paradigms: -The causal-effects (CE) paradigm, in which knowledge of the effects of the treatment on the surrogate and of the surrogate on the clinical outcome is used to predict the effect of the treatment on the clinical outcome. Examples: Prentice (1989); Taylor et al (2005) -The causal-association (CA) paradigm, in which the effect of treatment on the surrogate is associated, across studies or population subgroups, with its effect on the clinical outcome, so allowing prediction of the effect on the clinical outcome from the effect on the surrogate Examples: Frangakis & Rubin (2002); Buyse et al (2000) Ref: Joffee & Greene, Biometrics 2009 18

The Causal-effects Paradigm Uses concepts of the direct and indirect effects. Applicable in a single-trial setting. Explains the treatment effect on T mechanistically. Drawbacks: -Requires (untestable) assumptions. -Lacks power. - Effects defined in terms of manipulation of the putative surrogate, but in pracitce interventions do not directly manipulate the surrogate. Ref: Joffee & Greene, Biometrics 2009 19

The Causal-association Paradigm Considers association between expected causal effects. Natural for the meta-analytic seting, though can be considered in a single-trial setting (principal surrogacy). In the meta-analytic setting, applicable using the observable quantities. More powerful than the CE paradigm. Drawbacks: - Does not explain the treatment effect on T; rather, evaluates its covariation with the effect on S. Ref: Joffee & Greene, Biometrics 2009 20

Principal Surrogate (1) Frangakis & Rubin (2002): if no causal effect on S, then no effect on T. Average causal necessity: E(Δ Tj Δ Sj = 0) = 0 Note: values of Δ Sj define unobservable principal strata Example: binary data (T 0j, T 1j ) (S 0j, S 1j ) (0,0) (0,1) (1,1) (1,0) Principal stratification (0,0) p 11 p 12 p 13 p 14 Never responders (0,1) p 21 p 22 p 23 p 24 Improved (1,1) p 31 p 32 p 33 p 34 Always responders (1,0) p 41 p 42 p 43 p 44 Harmed Ref: Frangakis & Rubin, Biometrics 2002; Gilbert and Hudgens, Biometrics 2008; Joffe & Greene, 2009 21

Principal Surrogate (2) Frangakis & Rubin (2002): if no causal effect on S, then no effect on T. Average causal necessity: E(Δ Tj Δ Sj = 0) = 0 Note: values of Δ Sj define unobservable principal strata Example: binary data (T 0j, T 1j ) (S 0j, S 1j ) (0,0) (0,1) (1,1) (1,0) Principal stratification (0,0) p 11 0 p 13 0 Never responders (0,1) p 21 p 22 p 23 p 24 Improved (1,1) p 31 0 p 33 0 Always responders (1,0) p 41 p 42 p 43 p 44 Harmed Ref: Frangakis & Rubin, Biometrics 2002; Gilbert and Hudgens, Biometrics 2008; Joffe & Greene, 2009 22

Principal Surrogate (3) Frangakis & Rubin (2002): if no causal effect on S, then no effect on T. Average causal necessity: E(Δ Tj Δ Sj = 0) = 0 Gilbert & Hudgens (2008): PS if average causal necessity and Average causal sufficiency: ω such that E(Δ Tj Δ Sj >ω) >0 Ref: Frangakis & Rubin, Biometrics 2002; Gilbert and Hudgens, Biometrics 2008; 23

Dissociative & Associative Proportion The causal effect on T is CET = p +2 p +4 The dissociative proportion is defined as DP=((p 12 -p 14 ) + (p 32 -p 34 )) / CET The associative proportion is defined as AP=((p 22 -p 24 ) + (p 42 -p 44 )) / CET (T 0j, T 1j ) (S 0j, S 1j ) (0,0) (0,1) (1,1) (1,0) Principal stratification (0,0) p 11 p 12 p 13 p 14 Never responders (0,1) p 21 p 22 p 23 p 24 Improved (1,1) p 31 p 32 p 33 p 34 Always responders (1,0) p 41 p 42 p 43 p 44 Harmed Note : the proportions are ratios of differences; their values span [, + ] which is undesirable for a proportion Ref: Li et al, Biostatistics 2011;12:478. 24

Dissociative & Associative Proportion for a Principal Surrogate The dissociative proportion should be 0. Additionally, for a perfect principal surrogate, we would like p 24 and p 42 to be 0. Hence, the associative proportion should be 1. (T 0j, T 1j ) (S 0j, S 1j ) (0,0) (0,1) (1,1) (1,0) Principal stratification (0,0) p 11 0 p 13 0 Never responders (0,1) p 21 p 22 p 23 p 24 Improved (1,1) p 31 0 p 33 0 Always responders (1,0) p 41 p 42 p 43 p 44 Harmed 25

What Is Observed? T Z S 0 1 0 0 p 11 +p 12 +p 21 +p 22 p 13 +p 14 +p 23 +p 24 1 p 31 +p 32 +p 41 +p 42 p 33 +p 34 +p 43 +p 44 1 0 p 11 +p 14 +p 41 +p 44 p 12 +p 13 +p 42 +p 43 1 p 21 +p 24 +p 31 +p 34 p 22 +p 23 +p 32 +p 33 15 free parameters, 6 supported by data Estimation of, e.g., DP & AP requires (untestable) identifying restrictions. For instance, montonicity (p 41 =p 42 =p 43 =p 44 =p 14 =p 24 =p 34 =0) reduces the number of free parameters to 8 Extra assumptions in a form of, e.g., priors in a Bayesian approach 26

Normally-distributed Endpoints, Single-trial Setting Consider Y j =(T 0j,T 1j,S 0j,S 1j ) ~ N 4 (μ, Σ) with μ=(μ T0, μ T1, μ S0, μ S1 ) and Consider individual-causal effects: with Σ Δ =A Σ A, μ Δ =(β, α), β = E(Δ Tj ) = μ T1 -μ T0, α =E(Δ Sj )= μ S1 -μ S0. Define Individual Causal Association (ICA), ρ Δ = Corr(Δ Tj, Δ sj ). Then: Ref: Alonso et al, Biometrics 2015 27

Non-identifiability of ICA Assume σ T0 = σ T1 = σ T and σ S0 = σ S1 = σ S. Then Only ρ T0S0 and ρ T1S1 are identifiable. Hence, ICA is non-identifiable without making untestable assumptions. The assumptions influence the value of ρ Δ. Ref: Alonso et al, Biometrics 2015 28

Average Causal Necessity Average causal necessity: E(Δ Tj Δ Sj = 0) = 0 But Thus, even if ρ Δ2 =1, we get Hence, causal necessity is not satisfied, unless we assume β=α=0. Ref: Alonso et al, Biometrics 2015 29

Average Causal Sufficiency Average causal sufficiency: ω such that E(Δ Tj Δ Sj >ω) >0 For truncated bivariate-normal with λ(u)=φ(u)/(1-φ(u)), λ(u)>0. The average causal sufficiency is satisfied iff ρ Δ >0. Ref: Alonso et al, Biometrics 2015 30

Individual Causal Effects in a Meta-analytic Setting Conider the individual causal effects in a multiple-trial setting. Let Then Δ ij =(T 1ij -T 0ij, S 1ij -S 0ij ) ~ N 2 (μ Δi,Σ Δ ) and μ Δi ~ N 2 (μ * Δ,D Δ ). with (ε ΔTij, ε ΔSij ) ~ N 2 (0, Σ Δ ). Define the meta-analytic individual causal association MICA = Corr(Δ Tij, Δ Sij ). It is equal to with λ T =d bb /σ TT, λ S =d bb /σ SS Ref: Alonso et al, Biometrics 2015 31

Some Properties of MICA (1) If λ T + and λ S +, then ρ M =R trial (large between-trial variation association between expected CA s captured) If λ T 0 and λ S 0, then ρ M = ρ Δ (small between-trial variation association between individual CA s captured) Hence, MICA (ρ M ) evaluates validity of a surrogate across similar, but different populations (external validity). ICA (ρ Δ ) does it only for a single, fixed population (internal validity). Ref: Alonso et al, Biometrics 2015 32

Some Properties of MICA (2) Let ρ T0T1 = ρ S0S1 =0. Then If R ind large, then ρ Δ R ind. Hence, choosing a surrogate with large R trial >0 and R ind >0 likely leads to a large ρ M. Thus, surrogates valid in the meta-analytic approach may also be valid in the causal-inference framework. Ref: Alonso et al, Biometrics 2015 33

Some Properties of MICA (3) Large ρ Δ (single-trial) may still give a low ρ M (meta-analytic) if between-trial heterogeneity is larger than the within-trial one and expected causal association is low (small R trial ). Note that, as ICA, MICA is also not identifiable, unless untestable assumptions are made. Ref: Alonso et al, Biometrics 2015 34

Example: Age-related Macular Degeneration ρ S0T0, ρ S1T1, σ S0S0, σ S1S1, σ T0T0, σ T1T1 fixed at the estimated values Correlations in Σ fixed at values from {-1,-0.95,...,0, 0.05,...,1} 41 4 matrices, leading to ρ M Substantial impact of the unidentified correlations In 50% of the cases, ρ M >0.77. In <10% of the cases, ρ M >0.9. Similar (inconclusive) results as for the MA approach. Ref: Alonso et al, Biometrics 2015 35

Causal-association & Meta-analytic Approach Attention shifted to the association of expected causal effects. Surrogate successfully evaluated at the trial and individual-level in a meta-analytic context, may likely also successfully pass a validation exercise based on individual causal effects. A surrogate successfully evaluated in a single trial using ICEs may fail to pass a similar evaluation in a meta-analytic context. Three main advantages of the MA approach : (1) it has a causal interpretation (Joffe and Greene (2009)), (2) it is identifiable under randomization; (3) it may be more appealing to regulatory authorities and, therefore, more useful for drug approval. Ref: Alonso et al, Biometrics 2015 36