On the efficiency of two-stage adaptive designs

Similar documents
ADAPTIVE EXPERIMENTAL DESIGNS. Maciej Patan and Barbara Bogacka. University of Zielona Góra, Poland and Queen Mary, University of London

Model Selection versus Model Averaging in Dose Finding Studies

Designing dose finding studies with an active control for exponential families

Generalizing the MCPMod methodology beyond normal, independent data

Model identification for dose response signal detection

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Generalizing the MCPMod methodology beyond normal, independent data

Information in a Two-Stage Adaptive Optimal Design

Estimation MLE-Pandemic data MLE-Financial crisis data Evaluating estimators. Estimation. September 24, STAT 151 Class 6 Slide 1

Response-adaptive dose-finding under model uncertainty

Likelihood Ratio Tests for a Dose-Response Effect using. Multiple Nonlinear Regression Models

Regression Estimation - Least Squares and Maximum Likelihood. Dr. Frank Wood

Penalized D-optimal design for dose finding

Estimation in Flexible Adaptive Designs

Efficient algorithms for calculating optimal designs in pharmacokinetics and dose finding studies

Introduction to Estimation Methods for Time Series models Lecture 2

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Methods of evaluating estimators and best unbiased estimators Hamid R. Rabiee

Bayesian concept for combined Phase 2a/b trials

AP-Optimum Designs for Minimizing the Average Variance and Probability-Based Optimality

(1) Introduction to Bayesian statistics

Implementing Response-Adaptive Randomization in Multi-Armed Survival Trials

Review and continuation from last week Properties of MLEs

Estimation Theory. as Θ = (Θ 1,Θ 2,...,Θ m ) T. An estimator

Master s Written Examination

Mathematical statistics

EIE6207: Estimation Theory

SGN Advanced Signal Processing: Lecture 8 Parameter estimation for AR and MA models. Model order selection

Discussion Paper SFB 823. Nearly universally optimal designs for models with correlated observations

Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim

Sparse Linear Models (10/7/13)

Regression Estimation Least Squares and Maximum Likelihood

Adaptive Designs: Why, How and When?

Estimation Tasks. Short Course on Image Quality. Matthew A. Kupinski. Introduction

LINEAR MODELS FOR CLASSIFICATION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Statistics II Lesson 1. Inference on one population. Year 2009/10

SYDE 372 Introduction to Pattern Recognition. Probability Measures for Classification: Part I

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Elements of statistics (MATH0487-1)

Mathematical statistics

F & B Approaches to a simple model

ECE531 Lecture 10b: Maximum Likelihood Estimation

10-704: Information Processing and Learning Fall Lecture 24: Dec 7

Chapters 9. Properties of Point Estimators

Comparing Adaptive Designs and the. Classical Group Sequential Approach. to Clinical Trial Design

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Introduction to Simple Linear Regression

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Rerandomization to Balance Covariates

Beyond GLM and likelihood

Linear Models for Classification

Graduate Econometrics I: Unbiased Estimation

Lecture 3 September 1

ELEG 5633 Detection and Estimation Minimum Variance Unbiased Estimators (MVUE)

Statistics & Data Sciences: First Year Prelim Exam May 2018

Artificial Neural Networks

Pubh 8482: Sequential Analysis

Compound Optimal Designs for Percentile Estimation in Dose-Response Models with Restricted Design Intervals

Seminar über Statistik FS2008: Model Selection

ST440/540: Applied Bayesian Statistics. (9) Model selection and goodness-of-fit checks

Optimal designs for the EMAX, log-linear and exponential model

Locally optimal designs for errors-in-variables models

Regression I: Mean Squared Error and Measuring Quality of Fit

Lecture 7 Introduction to Statistical Decision Theory

Discussion Paper SFB 823. Optimal designs for the EMAX, log-linear linear and exponential model

STA 4273H: Statistical Machine Learning

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Lecture 2: Statistical Decision Theory (Part I)

EXAMINERS REPORT & SOLUTIONS STATISTICS 1 (MATH 11400) May-June 2009

9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures

Lecture 1: Supervised Learning

Brief Review on Estimation Theory

Optimum designs for model. discrimination and estimation. in Binary Response Models

Linear Classification. CSE 6363 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington

Hierarchical Models & Bayesian Model Selection

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Theory of Maximum Likelihood Estimation. Konstantin Kashin

Parametric Models. Dr. Shuang LIANG. School of Software Engineering TongJi University Fall, 2012

Ch 4. Linear Models for Classification

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

LECTURE NOTE #3 PROF. ALAN YUILLE

Likelihood and p-value functions in the composite likelihood context

Lecture 2 Machine Learning Review

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Adaptive Dunnett Tests for Treatment Selection

13. Parameter Estimation. ECE 830, Spring 2014

Covariance function estimation in Gaussian process regression

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

OPTIMAL DESIGN INPUTS FOR EXPERIMENTAL CHAPTER 17. Organization of chapter in ISSO. Background. Linear models

Linear Classification

Last updated: Oct 22, 2012 LINEAR CLASSIFIERS. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

MULTIPLE-OBJECTIVE DESIGNS IN A DOSE-RESPONSE EXPERIMENT

F. Combes (1,2,3) S. Retout (2), N. Frey (2) and F. Mentré (1) PODE 2012

Theory of Statistics.

Randomized dose-escalation design for drug combination cancer trials with immunotherapy

Functional Latent Feature Models. With Single-Index Interaction

Maximum Likelihood Tests and Quasi-Maximum-Likelihood

ML estimation: Random-intercepts logistic model. and z

Individual bioequivalence testing under 2 3 designs

Power and Sample Size Calculations with the Additive Hazards Model

Transcription:

On the efficiency of two-stage adaptive designs Björn Bornkamp (Novartis Pharma AG) Based on: Dette, H., Bornkamp, B. and Bretz F. (2010): On the efficiency of adaptive designs www.statistik.tu-dortmund.de/sfb823-dp2010.html

Adaptive Designs Adaptive Designs: Phase I (CRM) Phase II (adaptive dose-ranging) Phase II/III (combination of Phase II and III results) Past 5-6 years increased interest in adaptive/model-based dose-ranging designs FDA critical path initiative ASTIN dose-finding trial (Bayesian highly adaptive) PhRMA working group on adaptive dose-finding studies (José Pinheiro, Michael Krams, Vlad Dragalin & many others) 1

Adaptive Dose-Ranging Designs Allocate patients adaptively within the trial, so that information on the dose-response curve (or target dose) is maximized. Stop the trial, when enough information has been gathered. Some examples: Target function, dose-response model: Müller, Berry, Grieve, Krams (2006) variance of the ED95, semiparametric Bayesian dose-response model Dragalin et al. (2007) covariance matrix of parameters (D-optimality), flexible nonlinear dose-response model Bornkamp et al. (2011) variance of threshold dose (MED), set of candidate nonlinear candidate dose-response models 2

Adaptive dose-finding designs: Merits From FDA adaptive design draft guidance: Adaptive designs may lead to studies that more efficiently provide the same information (shorter duration, fewer patients) more likely to demonstrate an effect of the drug (if one exists) more informative on the treatment s effect (broader and better dose-response information, subgroup effects), which may lead to more efficient subsequent studies My perspective (and for adaptive dose-ranging trials): Flexibility allows to robustify the design of the study. Refine initial assumptions within the trial, e.g., on dose-response relationship (where to place doses to learn most, dose-range) expected treatment effect (stop early) 3

Adaptive Designs: Challenges Logistically more challenging to plan and implement: simulations needed to assess operating characteristics drug supply/logistical questions recruitement rate (study duration might increase) more institutional hurdles (internal and external) This talk: Focus on a specific aspect regarding adaptive designs statistical estimation efficiency 4

Motivation Experience from a number of simulation studies performed with adaptive dose-finding methods: moderate gains for most simulation scenarios sometimes substantial gains particularly, when initial assumptions are dramatically wrong but sometimes also slightly worse (!) Question at that time: Can analytical considerations confirm these findings? Simulations typically performed under realistic settings, i.e., a number of potentially interfering parameters (logistical constraints, complex models, programming bugs etc). Use simplified/idealized setting to identify key factors 5

Considered designs Target: Estimate the parameters or parameter function Use the design that maximizes φ(m(ξ, θ)) θ model parameters, M(ξ, θ) Fisher information matrix, ξ experimental design (specifying doses and allocations), φ differentiable design criterion Compare designs: 1. ξ F Fixed design: Take all N observations at the locally optimal design for θ 0 (parameter guess) 2. ξ A Two-stage adaptive design: Split study into two parts, take N 0 = p 0 N samples in first part, p 0 (0, 1), calculate ML estimate θ 1 and allocate remaining N N 0 samples to the optimal design obtained by maximizing φ(m(ξ, θ 1 )) 6

Main idea for analysis At the end calculate maximum likelihood estimate θ using both parts. Measure estimation precision in terms of the mean-squared error MSE( θ) = E[( θ θ)( θ θ) T ] Var( θ), because the bias is of order 1/N 2 Main idea for analytical considerations: Variance decomposition V ar( θ) = E(V ar( θ Y 1,..., Y N0 )) + V ar(e( θ Y 1,..., Y N0 )) Derive approximations for the terms in the sum (assume N, but p 0 = N 0 /N constant) 7

Analysis Defining: I(θ, θ 0 ) = M(ξ θ0, θ) (ξ θ0 is the local optimal design for θ 0 ) and H(θ, θ 0 ) = p 0 I(θ, θ 0 ) + p 1 I(θ, θ) One obtains for the approximate covariance matrices: M(ξ F, θ) I(θ, θ 0 ) + 1 N0 K(θ, θ 0 ) + 1 N 0 L(θ, θ 0 ) M(ξ A, θ) H(θ, θ 0 ) + 1 N0 K(θ, θ0 ) + 1 N 0 L(θ, θ0 ) General observation: Because H(θ, θ 0 ) I(θ, θ 0 ) the adaptive design is asymptotically better For finite sample size the terms of order 1/N 0 and 1/ N 0 are non-negligible, hence need to consider each specific model separately 8

General design criteria For differentiable optimality criterion φ one obtains, when comparing the efficiency eff φ (ξ F, ξ A ) = φ(m(ξ F, θ)) φ(m(ξ A, θ)) φ(i(θ, θ 0 )) φ(p 0 I(θ, θ 0 ) + p 1 I(θ, θ)) + no information regarding the sign of the constants c and d is available in general. c + d, N0 N 0 9

One parameter models For simplicity concentrate on one parameter models in what follows eff(ξ F, ξ A ) = MSE(ˆθ F ) MSE(ˆθ A ) Var(ˆθ F ) Var(ˆθ A ) { I(θ, θ0 ) H(θ, θ 0 ) p 1 where g(θ) := 2 I(θ, τ) g(θ)(5p 0 I(θ, θ 0 ) + p 1 I(θ, θ)) 2N 0 H 3 (θ, θ 0 ) τ=θ is always negative } 1 The dominating term I(θ,θ 0) H(θ,θ 0 ) 1 But for finite sample sizes, the relationship is not clear due to the second summand 10

Exponential regression model Simple example Exponential model E[Y x] = η(x, θ) = e θx, Var (Y x) = σ 2 > 0 with unknown parameter θ and initial guess θ 0 ξ F : Fixed design N observations according to optimal design based on θ 0 ξ A : Two-stage adaptive design Stage 1: N 0 observations with design based on θ 0 Interim: Estimate θ, resulting in ˆθ 1 Stage 2: N N 0 observations with design based on ˆθ 1 Which design is more efficient and estimates θ more precisely? 11

Exponential Regression Exponential model with unknown parameter θ = 1 and different initial guesses 12

Exponential Regression Relative efficiency of adaptive versus non-adaptive design for N = 100, θ = 1. Efficiency > 1 indicates that the adaptive design is better. 13

Efficiency > 1 indicates that the adaptive design is better. Exponential Regression Main factors: Variability, adequacy of initial guess θ 0 14

Exponential Regression θ 0 = 2, θ = 1, σ 2 = 1 Simulated Approximation 0.0 0.2 0.4 0.6 0.8 1.0 θ 0 = 2, θ = 1, σ 2 = 0.1 θ 0 = 2, θ = 1, σ 2 = 0.01 2.0 1.5 1.0 0.5 0.0 θ 0 = 3, θ = 1, σ 2 = 1 θ 0 = 3, θ = 1, σ 2 = 0.1 θ 0 = 3, θ = 1, σ 2 = 0.01 2.0 Relative Efficiency 1.5 1.0 0.5 θ 0 = 3, θ = 2, σ 2 = 1 θ 0 = 3, θ = 2, σ 2 = 0.1 θ 0 = 3, θ = 2, σ 2 = 0.01 0.0 2.0 1.5 1.0 0.5 0.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 p 0 15

Logistic regression p(x, θ) = E[Y x] = 1 1+e x θ θ 0 = 1, θ = 0, N=500 Simulated Approximation 0.0 0.2 0.4 0.6 0.8 1.0 θ 0 = 1, θ = 0, N=100 θ 0 = 1, θ = 0, N=50 2.0 1.5 1.0 0.5 Relative Efficiency 0.0 θ 0 = 2, θ = 0, N=200 θ 0 = 1, θ = 0, N=200 θ 0 = 0.5, θ = 0, N=200 2.0 1.5 1.0 0.5 0.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 p 0 16

Emax model E[Y d] = η(d, θ) = θ 0 + θ 1 d θ 2 + d, Var (Y d) = σ2 > 0. Focus on estimating the ED 90 parameter with small variability for a trial with 100 patients. The optimal design allocates 1/4 of the patients on each of 0 and d max the maximum dose and the remaining 1/2 of the patients on the intermediate dose d max θ 2 d max + 2θ 2. 17

Emax model 0.0 0.2 0.4 0.6 0.8 1.0 θ 2 = 25, θ 2 (0) = 50, σ = 0.1 θ 2 = 25, θ 2 (0) = 50, σ = 0.25 θ2 = 25, θ 2 (0) = 50, σ = 0.5 1.2 1.0 0.8 0.6 0.4 0.2 θ 2 = 10, θ 2 (0) = 25, σ = 0.1 θ2 = 10, θ 2 (0) = 25, σ = 0.25 θ 2 = 10, θ 2 (0) = 25, σ = 0.5 Relative Efficiency 1.2 1.0 0.8 0.6 0.4 θ 2 = 25, θ 2 (0) = 10, σ = 0.1 θ2 = 25, θ 2 (0) = 10, σ = 0.25 θ 2 = 25, θ 2 (0) = 10, σ = 0.5 0.2 1.2 1.0 0.8 0.6 0.4 0.2 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 p 0 18

Summary Despite several limitations (restriction to simple models, idealized situation, asymptotic approximations): Analytical considerations confirm simulation results The adaptive design will dominate the non-adaptive design (for large enough sample size) But there can be situations, where adaptation makes things worse (in terms of statistical estimation efficiency) Results strongly depends on the specific model and situation (apparently no general results possible) Simulations need to be performed to evaluate adaptive designs In addition: An advantage of adaptive designs is robustification (things can be different than expected), this flexibility is hard to quantify numerically 19

References Mller, P., Berry, D., Grieve, A. and Krams, M. (2006) A Bayesian Decision-Theoretic Dose-Finding Trial, Decision Analysis, 3, 197 207 Bornkamp, B., Bretz, F., Dette, H., and Pinheiro, J. (2011) -adaptive dose-finding under model uncertainty, Annals of Applied Statistics, 5, 1611 1631 Dragalin, V., Hsuan, F. and Padmanabhan, S.K. (2007) Adaptive Designs for dose-finding studies based on the sigmoid Emax model, Journal of Biopharmaceutical Statistics, 17, 1051 1070 20