Stochastic approximation EM algorithm in nonlinear mixed effects model for viral load decrease during anti-hiv treatment

Similar documents
Extension of the SAEM algorithm to left-censored data in nonlinear mixed-effects model: application to HIV dynamics model

Extension of the SAEM algorithm for nonlinear mixed models with 2 levels of random effects Panhard Xavière

A comparison of estimation methods in nonlinear mixed effects models using a blind analysis

Extension of the SAEM algorithm for nonlinear mixed. models with two levels of random effects

Estimation and Model Selection in Mixed Effects Models Part I. Adeline Samson 1

DESIGN EVALUATION AND OPTIMISATION IN CROSSOVER PHARMACOKINETIC STUDIES ANALYSED BY NONLINEAR MIXED EFFECTS MODELS

Robust design in model-based analysis of longitudinal clinical data

1Non Linear mixed effects ordinary differential equations models. M. Prague - SISTM - NLME-ODE September 27,

F. Combes (1,2,3) S. Retout (2), N. Frey (2) and F. Mentré (1) PODE 2012

Evaluation of the Fisher information matrix in nonlinear mixed eect models without linearization

Between-Subject and Within-Subject Model Mixtures for Classifying HIV Treatment Response

dans les modèles à vraisemblance non explicite par des algorithmes gradient-proximaux perturbés

Joint modeling of longitudinal and repeated time-to-event data using nonlinear mixed-effects models and the SAEM algorithm

EM Algorithm II. September 11, 2018

Estimation, Model Selection and Optimal Design in Mixed Eects Models Applications to pharmacometrics. Marc Lavielle 1. Cemracs CIRM logo

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

An efficient stochastic approximation EM algorithm using conditional particle filters

Comparison of Three Calculation Methods for a Bayesian Inference of Two Poisson Parameters

SAEMIX, an R version of the SAEM algorithm for parameter estimation in nonlinear mixed effect models

Sebastian Ueckert, France Mentré. To cite this version: HAL Id: inserm

Mixed-Effects Biological Models: Estimation and Inference

Pharmacometrics : Nonlinear mixed effect models in Statistics. Department of Statistics Ewha Womans University Eun-Kyung Lee

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

An Introduction to mixture models

Hmms with variable dimension structures and extensions

Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Approximate Bayesian Computation

Parametric fractional imputation for missing data analysis

Controlled sequential Monte Carlo

Computer Intensive Methods in Mathematical Statistics

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Generative Models and Stochastic Algorithms for Population Average Estimation and Image Analysis

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

MCMC Sampling for Bayesian Inference using L1-type Priors

Kazuhiko Kakamu Department of Economics Finance, Institute for Advanced Studies. Abstract

Correction of the likelihood function as an alternative for imputing missing covariates. Wojciech Krzyzanski and An Vermeulen PAGE 2017 Budapest

Alternative implementations of Monte Carlo EM algorithms for likelihood inferences

Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level

Econometrics I, Estimation

Hidden Markov models for time series of counts with excess zeros

Statistical Methods for Bridging Experimental Data and Dynamic Models with Biomedical Applications

Parametric inference for mixed models defined by stochastic differential equations

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

Reminder of some Markov Chain properties:

Principles of Bayesian Inference

Down by the Bayes, where the Watermelons Grow

Likelihood-based inference with missing data under missing-at-random

Principles of Bayesian Inference

Pseudo-marginal MCMC methods for inference in latent variable models

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Supplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements

Monte Carlo methods for sampling-based Stochastic Optimization

Stochastic Approximation Methods for Latent Regression Item Response Models

Adaptive Population Monte Carlo

CONVERGENCE OF A STOCHASTIC APPROXIMATION VERSION OF THE EM ALGORITHM

Pharmacokinetics Introduction to

A Review of Pseudo-Marginal Markov Chain Monte Carlo

Heterogeneous shedding of influenza by human subjects and. its implications for epidemiology and control

Design of HIV Dynamic Experiments: A Case Study

Monte Carlo Markov Chains: A Brief Introduction and Implementation. Jennifer Helsby Astro 321

Nonlinear random effects mixture models: Maximum likelihood estimation via the EM algorithm

Bayesian Regression Linear and Logistic Regression

Accounting for Population Uncertainty in Covariance Structure Analysis

Sparse Stochastic Inference for Latent Dirichlet Allocation

POMP inference via iterated filtering

On Markov Chain Monte Carlo

Chapter 4 Multi-factor Treatment Designs with Multiple Error Terms 93

Sensitivity study of dose-finding methods

Riemann Manifold Methods in Bayesian Statistics

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.

A Novel Screening Method Using Score Test for Efficient Covariate Selection in Population Pharmacokinetic Analysis

Multivariate Survival Analysis

Reconstruction of individual patient data for meta analysis via Bayesian approach

Bayesian Inference and MCMC

Lecture 2 Part 1 Optimization

Biostat 2065 Analysis of Incomplete Data

Latent Class Analysis

Principles of Bayesian Inference

Lecture 7 and 8: Markov Chain Monte Carlo

Yaming Yu Department of Statistics, University of California, Irvine Xiao-Li Meng Department of Statistics, Harvard University.

Minimum Message Length Analysis of the Behrens Fisher Problem

Strong Consistency of MLE in Nonlinear Mixed-effects Models with large cluster size

Ages of stellar populations from color-magnitude diagrams. Paul Baines. September 30, 2008

Multivariate Normal & Wishart

Monte Carlo Methods. Leon Gu CSD, CMU

MH I. Metropolis-Hastings (MH) algorithm is the most popular method of getting dependent samples from a probability distribution

Probabilistic Machine Learning

Markov Chain Monte Carlo Methods

Harvard University. Harvard University Biostatistics Working Paper Series

Markov Chain Monte Carlo (MCMC) and Model Evaluation. August 15, 2017

Default Priors and Effcient Posterior Computation in Bayesian

Mixed effect model for the spatiotemporal analysis of longitudinal manifold value data

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

Tutorial on Approximate Bayesian Computation

Bayesian Methods for Machine Learning

Exercises Tutorial at ICASSP 2016 Learning Nonlinear Dynamical Models Using Particle Filters

Monte Carlo in Bayesian Statistics

MCMC for big data. Geir Storvik. BigInsight lunch - May Geir Storvik MCMC for big data BigInsight lunch - May / 17

Transcription:

Stochastic approximation EM algorithm in nonlinear mixed effects model for viral load decrease during anti-hiv treatment Adeline Samson 1, Marc Lavielle and France Mentré 1 1 INSERM E0357, Department of Epidemiology, Biostatistics and Clinical Research, University Hospital Bichat-Claude Bernard, Paris, France University Paris-Sud, Orsay, France A. Samson, PAGE, 17/06/04 p. 1/1

Estimation in nonlinear mixed effects models (NLMEM) by maximum likelihood No close form of the likelihood because of the non linearity of the model estimation by maximum likelihood rather delicate Usual estimation software: approximation of the likelihood by linearization of the model (FO, FOCE,...) NONMEM software, nlme in Splus/R, Proc SAS NLMIXED,... Inconsistent estimates produced by these algorithms based on linearization except if both N and n i (Vonesh, Biometrika, 1996) Inflation of the type I error of the Likelihood Ratio Test and Wald Test using the algorithms based on linearization (Wählby et al., J Pharmacokinet Pharmacodyn, 001; Comets and Mentré, J Biopharm Stat, 001; Ding and Wu, Stat Med, 001; Panhard and Mentré, Stat Med, 004) Hypothesis: problems due to linearization A. Samson, PAGE, 17/06/04 p. /1

EM algorithms Alternative method for maximum likelihood estimation to avoid the linearization (Dempster et al., JRSSB, 1977) Individual parameters considered as missing data Iterative algorithm 1. E step: expectation of the missing data No analytical form of the distribution of the individual parameters conditionally to the observations and the hyperparameters Conditional expectation of the log-likelihood of the complete data difficult to compute, even when the individual parameters are known. M step: maximization of the likelihood of the complete data Analytically Newton-Raphson algorithm A. Samson, PAGE, 17/06/04 p. 3/1

E step for the NLMEM Linearization (Mentré and Gomeni, J Biopharm Stat, 1995 ) Simulation of the individual parameters in their unknown distribution Monte Carlo method (MCEM Algorithm) (Walker, Biometrics, 1996) Metropolis Hastings procedure (Quintana, Liu and Del Pino, CSDA, 1999) Monte Carlo Markov Chain procedure (MCMC) (Gu and Kong, PNAS, 1998, Kuhn and Lavielle PAGE 003, ESAIM PS, 004) Approximation of the conditional expectation Monte Carlo EM (Wei and Tanner, JASA, 1990) Stochastic Approximation EM (SAEM) (Delyon, Lavielle and Moulines, Annals Stat, 1999) Kuhn and Lavielle (PAGE 003, ESAIM PS 004) proposed to combine SAEM and MCMC A. Samson, PAGE, 17/06/04 p. 4/1

Contents 1. Evaluation of the SAEM algorithm by simulation. Estimation of the likelihood by importance sampling 3. Evaluation of the type I error of a LRT 4. Illustration on real data : TRIANON A. Samson, PAGE, 17/06/04 p. 5/1

Models and notations Statistical model: y ij = f(θ i, t ij ) + ε ij y ij : measurement of subject i (i = 1,..., N) at time t ij (j = 1,..., n i ) θ i : p-vector of the parameters of subject i ε ij : measurement error of subject i at time t ij ε ij θ i homoscedastic or heteroscedastic model θ i = µ + b i with b i N (0, Ω) ψ: vector of the hyperparameters of the model Log-likelihood: L obs (ψ y) = N i=1 log (p(y i ψ )) with p(y i ψ ) = p( y i θ i, ψ )p( θ i ψ ) dθ i A. Samson, PAGE, 17/06/04 p. 6/1

1.1 Evaluation of SAEM: simulation example Model of the decrease of viral load after beginning of anti HIV treatment (Ding and Wu, Stat Med, 001) f(θ i, t ij ) = log 10 (P 1,i exp( λ 1,i t ij ) + P,i exp( λ,i t ij )) Four parameters: lnp 1, lnp, lnλ 1, lnλ Additive random effects (ω k = 0.3, k = 1,..., 4) Additive error with constant variance (σ = 0.065) Six identical sampling times: 1, 3, 7, 14, 8 and 56 days N = 40 and N = 00 A. Samson, PAGE, 17/06/04 p. 7/1

1. Example of simulated data, N=40 log(vl) (cp/ml) 0 1 3 4 5 6 0 10 0 30 40 50 60 time (days) A. Samson, PAGE, 17/06/04 p. 8/1

1.3 Evaluation settings Simulation of 100 trials with N = 40 or N = 00 subjects nlme function of R 1.7 software SAEM function implemented in R 1.7 software Evaluation of the estimation properties for both algorithms Relative Biases Relative RMSE Test whether the biases are significantly different from zero by a Student s test on the 100 replications A. Samson, PAGE, 17/06/04 p. 9/1

1.4 Results N=40, fixed effects Relative Biases (%) Relative RMSE (%) 4 3 1 0 1 * * nlme SAEM * * * 0 4 6 8 10 1 14 nlme SAEM P 1 P λ 1 λ P 1 P λ 1 λ (* p<0.05) A. Samson, PAGE, 17/06/04 p. 10/1

1.5 Results N=40, variance parameters Relative Biases (%) Relative RMSE (%) 10 5 0 5 10 ω 1 ω ω 3 ω 4 nlme SAEM σ 0 10 0 30 40 50 60 70 ω 1 ω ω 3 ω 4 nlme SAEM σ ( * p<0.05) A. Samson, PAGE, 17/06/04 p. 11/1

1.6 Results N=00 Relative Biases (%) Relative RMSE (%) 4 3 1 0 1 nlme SAEM * * * 0 4 6 8 10 1 14 nlme SAEM P 1 P λ 1 λ P 1 P λ 1 λ 10 5 0 5 10 ω 1 ω ω 3 * ω 4 * nlme SAEM σ 0 10 0 30 40 50 60 70 ω 1 ω ω 3 ω 4 nlme SAEM σ A. Samson, PAGE, 17/06/04 p. 1/1

.1 Evaluation of the likelihood without linearization Approximation of each p(y i ψ ) For t = 1,..., T, simulation of sample θ (t) i Monte Carlo integration with θ (t) i p(y i ˆψ ) 1 T T t=1 p(y i θ (t) i, ˆψ) sampled in the current population distribution p(., ˆψ) No stability of the evaluation even with a very large T Importance sampling procedure p(y i ˆψ ) 1 T T t=1 p(y i θ (t) i, h i (θ (t) i (t) ˆψ)p(θ i, ˆψ), ˆψ) with θ (t) i sampled in an instrumental distribution h i (., ˆψ) A. Samson, PAGE, 17/06/04 p. 13/1

. Importance sampling method Choice of the instrumental distribution Gaussian approximation of the individual posterior distribution of θ i given y i and ψ ˆµ post i ˆΩpost i h i (., ˆψ) = N (ˆµ post i, ˆΩ post i ) the posterior individual mean the posterior individual variance Estimation of ˆµ post i ˆΩ post i and by the empirical mean and variance of the 50 last θ i simulated during the MCMC procedure Choice of T: the number of simulated samples A. Samson, PAGE, 17/06/04 p. 14/1

.3 Study of the influence of T Repeated evaluations of the log likelihood for one trial (N=40) with different seeds and values of T Log Likelihood 0 4 6 8 1000 000 5000 10000 0000 50000 T Using the R software: T=50,000 is a good choice A. Samson, PAGE, 17/06/04 p. 15/1

3.1 LRT LRT for a treatment effect on the first viral decay rate lnλ 1 Two groups of treatment of same size Evaluation of the likelihood SAEM: likelihood estimated by importance sampling without linearization nlme: - linearized likelihood obtained by nlme - likelihood estimated by importance sampling without linearization Three LRT evaluated Evaluation of the type I error with a critical value of 3.84 (level of 5% for a χ with 1 d.f.) A. Samson, PAGE, 17/06/04 p. 16/1

3. Evaluation of the type I error of the LRT (p=5 %) N=40 N=00 nlme 13 % 18 % nlme IS 1 % 16 % SAEM 7 % 5 % Inflation of the type I error with nlme Same inflation with the use of a likelihood estimated without linearization Accurate level with SAEM Inflation can be explained by the use of algorithms based on the linearization A. Samson, PAGE, 17/06/04 p. 17/1

4.1 Illustration of SAEM on real data (TRIANON) AIDS clinical trial supported by the Agence Nationale de Recherche sur le Sida in France (ANRS81) 144 HIV-1 infected patients treated during 7 weeks Treatment A : Lamivudine + d4t + Indinavir (71 patients) Treatment B : Nevirapine + d4t + Indinavir (73 patients) Analysis of the initial decrease of the viral load (D0, D8, D56, D11) Data below the LOQ fixed to LOQ/ A. Samson, PAGE, 17/06/04 p. 18/1

4. Results group A group B log(cv) (cp/ml) 0 4 6 8 0 0 40 60 80 100 time (days) No convergence of the nlme function No significant differences with SAEM between the two treatments (LRT on λ1 and λ) A. Samson, PAGE, 17/06/04 p. 19/1

4.3 Estimated parameters with SAEM Parameter Estimates (SE %) lnp 1 10.9 (3.1 %) lnp 6.35 (13 %) lnλ 1-0.78 (5.3 %) lnλ -3.38 (0.7 %) ω 1 0.6 (16 %) ω 1.47 (13 %) ω 3 0.31 (18 %) ω 4 0.09 (17 %) σ 0.6 (6.5 %) A. Samson, PAGE, 17/06/04 p. 0/1

Conclusion Rapidity and stability of the convergence of the SAEM algorithm Robust to the choice of the initial values Less biased than nlme (especially for small number of subjects) Evaluation of the likelihood by importance sampling Stable with a large T Application to the LRT with no inflation of the type I error (in this example) Application to the evaluation of the SE (not shown here) SAEM is a good alternative for maximum likelihood estimation in NLMEM SAEM now implemented in a MATLAB function developped by Marc Lavielle Demonstration during the Software demonstration by Marc Lavielle on several examples (simulated and real data sets) A. Samson, PAGE, 17/06/04 p. 1/1