Whether to use MMRM as primary estimand.

Similar documents
A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

Time-Invariant Predictors in Longitudinal Models

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Time-Invariant Predictors in Longitudinal Models

2 Naïve Methods. 2.1 Complete or available case analysis

Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach

Time Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models

Unbiased estimation of exposure odds ratios in complete records logistic regression

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Case Study in the Use of Bayesian Hierarchical Modeling and Simulation for Design and Analysis of a Clinical Trial

6 Pattern Mixture Models

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

7 Sensitivity Analysis

Time-Invariant Predictors in Longitudinal Models

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute

A comparison of fully Bayesian and two-stage imputation strategies for missing covariate data

Basics of Modern Missing Data Analysis

Methods for Handling Missing Data

Analyzing Pilot Studies with Missing Observations

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Alexina Mason. Department of Epidemiology and Biostatistics Imperial College, London. 16 February 2010

Shared Random Parameter Models for Informative Missing Data

Comparing Adaptive Designs and the. Classical Group Sequential Approach. to Clinical Trial Design

Reconstruction of individual patient data for meta analysis via Bayesian approach

A causal modelling framework for reference-based imputation and tipping point analysis

MISSING or INCOMPLETE DATA

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina

MISSING or INCOMPLETE DATA

Adaptive Designs: Why, How and When?

Describing Change over Time: Adding Linear Trends

A comparison of arm-based and contrast-based approaches to network meta-analysis (NMA)

Comment on Tests of Certain Types of Ignorable Nonresponse in Surveys Subject to Item Nonresponse or Attrition

Data Assimilation Research Testbed Tutorial

1 Impact Evaluation: Randomized Controlled Trial (RCT)

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA. Belfast 9 th June to 10 th June, 2011

Some methods for handling missing values in outcome variables. Roderick J. Little

Lecture 21: October 19

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Covariance modelling for longitudinal randomised controlled trials

CTDL-Positive Stable Frailty Model

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures

A weighted simulation-based estimator for incomplete longitudinal data models

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

36 th Annual Meeting Preconference Workshop P3 Handout

Group Sequential Designs: Theory, Computation and Optimisation

Multivariate Survival Analysis

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

Fully Bayesian inference under ignorable missingness in the presence of auxiliary covariates

E(Y ij b i ) = f(x ijβ i ), (13.1) β i = A i β + B i b i. (13.2)

Pattern Mixture Models for the Analysis of Repeated Attempt Designs

Workshop 12: ARE MISSING DATA PROPERLY ACCOUNTED FOR IN HEALTH ECONOMICS AND OUTCOMES RESEARCH?

Comparison of multiple imputation methods for systematically and sporadically missing multilevel data

REGRESSION ANALYSIS IN LONGITUDINAL STUDIES WITH NON-IGNORABLE MISSING OUTCOMES

Statistical Distribution Assumptions of General Linear Models

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

Timevarying VARs. Wouter J. Den Haan London School of Economics. c Wouter J. Den Haan

Ages of stellar populations from color-magnitude diagrams. Paul Baines. September 30, 2008

L applicazione dei metodi Bayesiani nella Farmacoeconomia

Comparison of methods for repeated measures binary data with missing values. Farhood Mohammadi. A thesis submitted in partial fulfillment of the

Computational Techniques Prof. Dr. Niket Kaisare Department of Chemical Engineering Indian Institute of Technology, Madras

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Interim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming. and Optimal Stopping

Numerical Optimization Prof. Shirish K. Shevade Department of Computer Science and Automation Indian Institute of Science, Bangalore

Eco517 Fall 2014 C. Sims FINAL EXAM

TUTORIAL IN BIOSTATISTICS Handling drop-out in longitudinal studies

Comparison of several analysis methods for recurrent event data for dierent estimands

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Univariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation

CSC 2541: Bayesian Methods for Machine Learning

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Variational Bayesian Inference for Parametric and Non-Parametric Regression with Missing Predictor Data

Introduction to Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

(Refer Slide Time: 01:11) So, in this particular module series we will focus on finite difference methods.

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Estimating Single-Agent Dynamic Models

Lecture 3 - More Conditional Probability

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Bayes Nets III: Inference

Efficient Algorithms for Bayesian Network Parameter Learning from Incomplete Data

Statistical Analysis of Binary Composite Endpoints with Missing Data in Components

Bayesian Nonparametric Rasch Modeling: Methods and Software

Reminder of some Markov Chain properties:

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

Bayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects

Behavioral Data Mining. Lecture 19 Regression and Causal Effects

CPSC 540: Machine Learning

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Analysis of repeated measurements (KLMED8008)

Construction and statistical analysis of adaptive group sequential designs for randomized clinical trials

Transcription:

Whether to use MMRM as primary estimand. James Roger London School of Hygiene & Tropical Medicine, London. PSI/EFSPI European Statistical Meeting on Estimands. Stevenage, UK: 28 September 2015. 1 / 38

Acknowledgements Mouna Akacha, Michael O Kelly, James Carpenter & Mike Kenward for many discussions. All those project teams at GSK and elsewhere who have allowed me into their stories. But all the views expressed here are my own. These slides are designed to be self explanatory (so rather extensive). [Declaration: James Roger and Livedata (UK) Ltd have a consultancy agreement with AstraZeneca. James Roger was employed by Johnson & Johnson and GlaxoSmithKlyne, and is a director of Livedata (UK) Ltd.] 2 / 38

Outline The salient features of MMRM. How the estimand is usually explained. Alternative estimands that it can possibly estimate. A personal view of the way forward. 3 / 38

What is MMRM Mixed Models Repeated Measures. Not really a Mixed model but a useful acronym. 4 / 38

What is MMRM Mixed Models Repeated Measures. Not really a Mixed model but a useful acronym. Multivariate Normal distribution across visits within patient. Fully parameterised Unstructured covariance matrix. Usually the same for each arm. Fixed effects linear model Treatment by Visit interaction. Other baseline covariates that are often crossed with visit. 4 / 38

MMRM: Why is MMRM important? (1) Flexible model for quantitative data measured repeatedly across multiple visits. Allows for correlation between observations within a subject. If data are complete and all covariates are crossed with visit (saturated model) then it is equivalent to a univariate analysis at each visit. 5 / 38

MMRM: Why is MMRM important? (2) Easy to fit in conventional software: Fast iterative maximum likelihood solution. Simple conjugate priors for Bayesian modelling. But issues if there are more parameters than data (at last visit)! 6 / 38

MMRM: Why is MMRM important? (2) Easy to fit in conventional software: Fast iterative maximum likelihood solution. Simple conjugate priors for Bayesian modelling. But issues if there are more parameters than data (at last visit)! Nice distributional properties: Conditional model is MV Normal. e.g. distribution at second visit given known value at first visit. Marginal model is MV Normal. e.g. distribution at second visit when value at first visit is not known. But margin over mixture of Normals is not Normal. e.g. Marginal distribution from Pattern mixture models is not Normal. 6 / 38

MMRM: Why is MMRM important? (2) Easy to fit in conventional software: Fast iterative maximum likelihood solution. Simple conjugate priors for Bayesian modelling. But issues if there are more parameters than data (at last visit)! Nice distributional properties: Conditional model is MV Normal. e.g. distribution at second visit given known value at first visit. Marginal model is MV Normal. e.g. distribution at second visit when value at first visit is not known. But margin over mixture of Normals is not Normal. e.g. Marginal distribution from Pattern mixture models is not Normal. Last but not least... Popular way to handle missing data. 6 / 38

So what does it estimate? The conventional answer 7 / 38

Illustration To illustrate the issues I am going to assume,... an active treatment with primarily AE related withdrawals a control treatment with withdrawals mostly due to lack of efficacy.... and see the implication. 8 / 38

Early withdrawal of randomized treatment There are two different important impacts of early withdrawal from treatment. 1 Selection: Those who withdraw in one arm are a different population from... those who do not withdraw in this arm those who do withdraw in the other arm those who do not withdraw in the other arm This messes up any analysis simply based on completers. 9 / 38

Early withdrawal of randomized treatment There are two different important impacts of early withdrawal from treatment. 1 Selection: Those who withdraw in one arm are a different population from... those who do not withdraw in this arm those who do withdraw in the other arm those who do not withdraw in the other arm This messes up any analysis simply based on completers. 2 Switching: After withdrawal patients receive some alternative treatment, which may or may not be closely aligned with the estimand, and may or may not be recorded within the trial datbase. 9 / 38

Early withdrawal of randomized treatment 1 Selection: We can often observe the impact on this and other outcome measures before withdrawal and can try to correct for it. 10 / 38

Early withdrawal of randomized treatment 1 Selection: We can often observe the impact on this and other outcome measures before withdrawal and can try to correct for it. 2 Switching: We can decide whether to include the impact of patients receiving some alternative treatment within the estimand. De jure: We ignore it and are interested in either... 1 outcome if carry on taking treatment, 2 or perhaps restrict to population that can take either treatment. De facto: We allow for impact of alternative treatments in an ITT fashion. Need to... either observe outcome after withdrawal and use it, or hypothesise some pattern of treatment profile after withdrawal. 10 / 38

Early withdrawal of randomized treatment 1 Selection: We can often observe the impact on this and other outcome measures before withdrawal and can try to correct for it. 2 Switching: We can decide whether to include the impact of patients receiving some alternative treatment within the estimand. De jure: We ignore it and are interested in either... 1 outcome if carry on taking treatment, 2 or perhaps restrict to population that can take either treatment. De facto: We allow for impact of alternative treatments in an ITT fashion. Need to... either observe outcome after withdrawal and use it, or hypothesise some pattern of treatment profile after withdrawal. Here we ignore switching and go down the two de jure routes. 10 / 38

Missing Data: MMRM is just a model MMRM is a sensible model for... what happens to a quantiative outcome on randomized treatment in a homogeneous population before something goes wrong and the patient is withdrawn from treatment. 11 / 38

Missing Data: MMRM is just a model MMRM is a sensible model for... what happens to a quantiative outcome on randomized treatment in a homogeneous population before something goes wrong and the patient is withdrawn from treatment. It is not an estimand in itself. Under a Missing at Random (MAR) assumption it allows one to make statements about what would have happened if patients had in some sense remained in the study. 11 / 38

Missing Data: MMRM is just a model MMRM is a sensible model for... what happens to a quantiative outcome on randomized treatment in a homogeneous population before something goes wrong and the patient is withdrawn from treatment. It is not an estimand in itself. Under a Missing at Random (MAR) assumption it allows one to make statements about what would have happened if patients had in some sense remained in the study. But this is a Survey Statistician s approach to missing data (the data is out there somewhere if only I could find it). 11 / 38

Missing Data: MMRM is just a model MMRM is a sensible model for... what happens to a quantiative outcome on randomized treatment in a homogeneous population before something goes wrong and the patient is withdrawn from treatment. It is not an estimand in itself. Under a Missing at Random (MAR) assumption it allows one to make statements about what would have happened if patients had in some sense remained in the study. But this is a Survey Statistician s approach to missing data (the data is out there somewhere if only I could find it). Here such data are hypothetical. Not only may it not have happened it may even be impossible. 11 / 38

Missing at Random (MAR) Missing at Random is best seen in terms of Selection models p(y obs, R, y mis θ, φ) = p(r y obs, y mis, φ) p(y obs, y mis θ) where R is a vector indicating whether data are observed (1) or not (0) at visit i. Here R is constrained by monotonicity. Under MAR the probability of withdrawal at visit i only depends on baseline covariates and previous observed values. p(y obs, R, y mis θ, φ) = p(r y obs, φ) p(y obs, y mis θ) 12 / 38

Missing at Random (MAR) Missing at Random is best seen in terms of Selection models p(y obs, R, y mis θ, φ) = p(r y obs, y mis, φ) p(y obs, y mis θ) where R is a vector indicating whether data are observed (1) or not (0) at visit i. Here R is constrained by monotonicity. Under MAR the probability of withdrawal at visit i only depends on baseline covariates and previous observed values. p(y obs, R, y mis θ, φ) = p(r y obs, φ) p(y obs, y mis θ) Then we simply estimate parameters based on p(y obs, y mis θ) ignoring the missingness process. [θ and φ are parameters.] 12 / 38

Missing at Random (MAR) By conditioning on the baseline covariates and previous observed values can we remove the impact of the selective aspect of patient withdrawal? If so, then we have MAR. Then we can correct for any difference between those withdrawing and those who do not. 13 / 38

How does it work It uses baseline covariates and pre-withdrawal observed values to correct the treatment difference we see between those who complete. 14 / 38

How does it work It uses baseline covariates and pre-withdrawal observed values to correct the treatment difference we see between those who complete. First consider just baseline data with no post-randomization data before outcome. 14 / 38

Complete data: Plot versus baseline. Randomization means baseline means should be similar. 15 / 38

Complete data: Adjust for baseline diff. Estimated treatment difference is slightly smaller. 16 / 38

Missing data: Observed are +. Removed low baseline/outcome from Placebo Removed high baseline/outcome from Active. 17 / 38

Missing data. MAR is also MCAR. Nothing post-randomization to condition on. 18 / 38

Baseline correction. Baseline values for those withdrawn are ignored. 19 / 38

Two visits and no baseline. Use the same data layout but now the X-axis is Outcome at first visit rather than baseline. This means that we expect X-axis mean to be different between arms. Rather than adjust to the same (mean) X-axis values, now correct each arm to mean of the observed X-axis values within each arm. 20 / 38

Previous outcome data. Difference between Outcome values at visit 1 is important. 21 / 38

MMRM analysis MMRM sinply applies these two ideas with several baseline covariates and several post-randomization observed values. 22 / 38

MMRM analysis MMRM sinply applies these two ideas with several baseline covariates and several post-randomization observed values. Note how this links directly to how baseline covariates can be included as part of the Repeated Measures series. No treatment effect at extra visit. 22 / 38

The conventional answer summarised Under MAR the MMRM model estimates the mean treatment effect assuming that... after withdrawal subjects would have continued just like their peers in the same arm who have... the same covariates and same observed data (so far). 23 / 38

The conventional answer summarised Under MAR the MMRM model estimates the mean treatment effect assuming that... after withdrawal subjects would have continued just like their peers in the same arm who have... the same covariates and same observed data (so far). This links with equivalent ideas of Multiple Imputation (MI) under MAR, where we impute as if patients continue in the same way within arm. 23 / 38

A better way forward? How can we develop easily interpreted Repeated Measures models?... Rather than use selection models or pattern mixture models we need to model the outcome and missingness processes as they evolve across visits... 24 / 38

Extrapolation Factorizaton (EF) In contrast to both pattern mixture p(y obs, R, y mis ω) = p(y obs, y mis R, ω) p(r ω) and selection models... p(y obs, R, y mis ω) = p(r y obs, y mis, ω) p(y obs, y mis ω)... the distribution of the full data (observed and unobserved) is factored into 1 joint distribution of the observed values y obs and the Response pattern R, 2 that for unobserved or potential values y pot conditional on the previous [effectively same as y mis ]. p(y obs, R, y pot ω) = p(y pot y obs, R, ω E ) p(y obs, R ω O ) [The Extrapolation Factorizaton (EF). See Daniels & Hogan. Chapter 9, section 9.1.1. ω = (θ, φ)] 25 / 38

Sequential distribution - 3 visits Visit1 Withdraw? Visit2 Withdraw? Visit3 Y 1 R 2 Y 2 R 3 Y 3 f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1, R 2 = 1) Y 1, Y 2 ) Y 1, Y 2, R 3 = 1) f (Y 3 Y 1, Y 2, R 3 = 0) f (Y 2 f (Y 3 Y 1, R 2 = 0) Y 1, Y 2, R 2 = 0) f (Y i Y 1,..., Y (i 1), R i = k) is distribution of outcome conditonal upon history. g(r i Y 1,..., Y (i 1) ) is probability of continuing to observe at visit i conditional upon observing at vist (i-1). Both will depend on baseline and possibly other previous post-randomization observations - especially on treatment regime. [Here we ignore possible withdrawal before first visit (No R 1 ).] 26 / 38

Possible flows Visit1 Withdraw? Visit2 Withdraw? Visit3 Y 1 R 2 Y 2 R 3 Y 3 f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1, R 2 = 1) Y 1, Y 2 ) Y 1, Y 2, R 3 = 1) f (Y 3 Y 1, Y 2, R 3 = 0) f (Y 2 f (Y 3 Y 1, R 2 = 0) Y 1, Y 2, R 2 = 0) f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1, R 2 = 1) Y 1, Y 2 ) Y 1, Y 2, R 3 = 1) f (Y 3 Y 1, Y 2, R 3 = 0) f (Y 2 f (Y 3 Y 1, R 2 = 0) Y 1, Y 2, R 2 = 0) f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1, R 2 = 1) Y 1, Y 2 ) Y 1, Y 2, R 3 = 1) f (Y 3 Y 1, Y 2, R 3 = 0) f (Y 2 f (Y 3 Y 1, R 2 = 0) Y 1, Y 2, R 2 = 0) 27 / 38

The part we always observe Visit1 Withdraw? Visit2 Withdraw? Visit3 Y 1 R 2 Y 2 R 3 Y 3 f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1, R 2 = 1) Y 1, Y 2 ) Y 1, Y 2, R 3 = 1) f (Y 3 Y 1, Y 2, R 3 = 0) f (Y 2 f (Y 3 Y 1, R 2 = 0) Y 1, Y 2, R 2 = 0) We can model these distributions in red and estimate parameters based on data. 28 / 38

The margin distribution for final visit Visit1 Withdraw? Visit2 Withdraw? Visit3 Y 1 R 2 Y 2 R 3 Y 3 Margin f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1, R 2 = 1) Y 1, Y 2 ) Y 1, Y 2, R 3 = 1) f (Y 3 Y 1, Y 2, R 3 = 0) f (Y 2 f (Y 3 Y 1, R 2 = 0) Y 1, Y 2, R 2 = 0) f (Y 3, R 3 = 1) f (Y 3, R 3 = 0) f (Y 3, R 2 = 0) f (Y 3 ) If we know all these intermediate distributions then we can derive least squares means for the margin for some predefined set of covariate values. That is we can get the treatment effect by differencing between means based on one arm or the other. The other terms we may base on post-withdrawal data or by sharing parameters from elsewhere. 29 / 38

Missing at Random (MAR) Visit1 Withdraw? Visit2 Withdraw? Visit3 Y 1 R 2 Y 2 R 3 Y 3 Margin f (Y 1 ) g(r 2 f (Y 2 g(r 3 f (Y 3 Y 1 ) Y 1 ) Y 1, Y 2 ) Y 1, Y 2 ) f (Y 3 Y 1, Y 2 ) f (Y 2 f (Y 3 Y 1 ) Y 1, Y 2 ) f (Y 3 ) g(r 3 = 1) f (Y 3 ) g(r 3 = 0) f (Y 3 ) g(r 2 = 0) f (Y 3 ) MAR indicates that f (Y 2 Y 1, R 2 ) is the same irrespective of R 2 being 0 or 1. So margin is simply f (Y 1 ) f (Y 2 Y 1 ) f (Y 3 Y 1, Y 2 )dy 1 dy 2. Simply ignore the missingness part of the distribution. 30 / 38

A general framework Extrapolation Factoristaion (EF) provides a general framework for deriving estimands based on a flexible approach to the selectiveness of the withdrawal process. Can also incorporate de facto style estimands. As such it is a suitable way forward for handling the switching issue. An MCMC approach can be applied to the model leading to... an overall Bayesian approach, or imputation of complete data and a summary across multiple analyses (MI). It could possibly allow an estimand to be defined in a subpopulation based on withdrawal process.... those who would reamin on treatment in either arm. 31 / 38

So what does MMRM + MAR estimate? The title of the talk should have been Whether to use MMRM + MAR as primary estimand. MMRM has tried to answer the selection issue but ignored the switching issue. 32 / 38

So what does MMRM + MAR estimate? The title of the talk should have been Whether to use MMRM + MAR as primary estimand. MMRM has tried to answer the selection issue but ignored the switching issue. As a result it answers a hypothetical question. Can it answer more (de jure option 2)? 32 / 38

Circumventing the hypothetical nature One might restrict to those subjects who can tolerate the treatment. 33 / 38

Circumventing the hypothetical nature One might restrict to those subjects who can tolerate the treatment. Restrict to those who can tolerate... either treatment. or the new treatment, as this is our focus (while retaining counterfactual for Control). 33 / 38

Circumventing the hypothetical nature One might restrict to those subjects who can tolerate the treatment. Restrict to those who can tolerate... either treatment. or the new treatment, as this is our focus (while retaining counterfactual for Control). [ If the patient stops treatment then their additional cost may be small! ] 33 / 38

Circumventing the hypothetical nature One might restrict to those subjects who can tolerate the treatment. Restrict to those who can tolerate... either treatment. or the new treatment, as this is our focus (while retaining counterfactual for Control). [ If the patient stops treatment then their additional cost may be small! ] MMRM + MAR does not directly answer this question. 33 / 38

How it is often interpreted MMRM is sometimes thought of as the treatment effect in those who can accept either treatment regime. What additional assumptions do we need to interpret it in this way? A predefined baseline covariate condition that guarantees trial completion in both arms. OR There is effectively no difference between arms in the way previous outcome impacts on the probability of withdrawal. 34 / 38

How it is often interpreted MMRM is sometimes thought of as the treatment effect in those who can accept either treatment regime. What additional assumptions do we need to interpret it in this way? A predefined baseline covariate condition that guarantees trial completion in both arms. OR There is effectively no difference between arms in the way previous outcome impacts on the probability of withdrawal. If not, then we could model the whole thing and draw conclusion. 34 / 38

Conclusions MMRM under MAR defines a classic de jure estimand. MAR before withdrawal and MNAR after withdrawal using modified post-withdrawal RM distribution defines a series of possible de facto estimands. A full EF sequential specification of the model allows the derivation of a wide range of possible estimands. 35 / 38

Conclusions MMRM under MAR defines a classic de jure estimand. MAR before withdrawal and MNAR after withdrawal using modified post-withdrawal RM distribution defines a series of possible de facto estimands. A full EF sequential specification of the model allows the derivation of a wide range of possible estimands. The Regulators need to state the kinds of estimand they want. This does not need to be limited by currently available statistical methods. Nearly anything is possible. 35 / 38

Abstract (1) There are two main impacts of early withdrawal on study results; first the potential selection bias caused by those withdrawing being different from the remaining patients, and second the fact that patients may receive alternative treatments after withdrawal. The most common method for handling early withdrawal in clinical studies is MMRM or some other form of missing at random (MAR) based analysis. The motivation behind MMRM is to solve the first issue while addressing an on-treatment question, i.e. what happens if a typical patient completes their assigned treatment. It does this by conditioning on the previous observations and other covariates that may inform on both missingness and outcome. 36 / 38

Abstract (2) So what is the scientific question of interest, i.e. the estimand that it targets? This cannot be answered without considering the second aforementioned issue which is related to treatment switching or modification after withdrawal. If the design of the study, in terms of treatment after withdrawal from randomized treatment, matches the estimand, then collection of data after treatment withdrawal allows direct analysis. Then later absolute study termination after switching (truly missing data) can be handled via a modified MMRM approach. But when the design of the study after treatment withdrawal does not match the estimand any analysis must depend upon additional unverifiable information or assumptions. This is a whole new area of potential statistical research. Several of the more recent proposals ignore the first issue of selection bias. Any coherent approach must address both issues. 37 / 38

The proposal from Tom Permutt One might restrict to those subjects who can tolerate the treatment. Restrict to those who can tolerate... either treatment the new treatment, as this is our focus (while retaining counterfactual for Control). The proposal from Tom Permutt effectively does neither of these. Withdrawals receive worst outcome and then he discards within each arm the worst x% of subjects. So discarded subjects are different sorts of subject in each arm. Withdrawn for AE versus withdrawn for lack of efficacy, say. So this can only test if there is any difference between treatments. It does not define an estimand to compare between arms. 38 / 38