Performance of INLA analysing bivariate meta-regression and age-period-cohort models

Similar documents
Implementing Approximate Bayesian Inference using Integrated Nested Laplace Approximation: a manual for theinla program

Beyond MCMC in fitting complex Bayesian models: The INLA method

Spatial modelling using INLA

Disease mapping with Gaussian processes

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Bayesian bivariate meta-analysis of diagnostic test studies with interpretable priors

Bayesian Hierarchical Models

R-INLA. Sam Clifford Su-Yun Kang Jeff Hsieh. 30 August Bayesian Research and Analysis Group 1 / 14

Meta-Analysis for Diagnostic Test Data: a Bayesian Approach

Aggregated cancer incidence data: spatial models

Richard D Riley was supported by funding from a multivariate meta-analysis grant from

Multilevel Statistical Models: 3 rd edition, 2003 Contents

NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET

NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET. Fitting dynamic models using integrated nested Laplace approximations INLA

Analysing geoadditive regression data: a mixed model approach

A short introduction to INLA and R-INLA

CTDL-Positive Stable Frailty Model

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

Multivariate Count Time Series Modeling of Surveillance Data

Bayesian spatial hierarchical modeling for temperature extremes

Longitudinal breast density as a marker of breast cancer risk

Community Health Needs Assessment through Spatial Regression Modeling

A spatio-temporal model for extreme precipitation simulated by a climate model

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition

arxiv: v1 [stat.me] 13 May 2017

Centre for Biodiversity Dynamics, Department of Biology, NTNU, NO-7491 Trondheim, Norway,

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Departamento de Economía Universidad de Chile

Stat 5101 Lecture Notes

Subject CS1 Actuarial Statistics 1 Core Principles

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET

Downloaded from:

FACTORIZATION MACHINES AS A TOOL FOR HEALTHCARE CASE STUDY ON TYPE 2 DIABETES DETECTION

Integrated Non-Factorized Variational Inference

Joint longitudinal and time-to-event models via Stan

Bayesian Adjustment for Exposure Misclassification in. Case-Control Studies

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.

3 Joint Distributions 71

Multivariate Survival Analysis

Generalized common spatial factor model

Physician Performance Assessment / Spatial Inference of Pollutant Concentrations

Fully Bayesian Spatial Analysis of Homicide Rates.

Forecasting with the age-period-cohort model and the extended chain-ladder model

Markov Chain Monte Carlo in Practice

Estimating the marginal likelihood with Integrated nested Laplace approximation (INLA)

A Bayesian multi-dimensional couple-based latent risk model for infertility

Nonparametric meta-analysis for diagnostic accuracy studies Antonia Zapf

Disease mapping with Gaussian processes

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

A discrete semi-markov model for the effect of need-based treatments on the disease states Ekkehard Glimm Novartis Pharma AG

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

STA 4273H: Statistical Machine Learning

2017 SISG Module 1: Bayesian Statistics for Genetics Lecture 7: Generalized Linear Modeling

Bayesian variable selection and classification with control of predictive values

SAS macro to obtain reference values based on estimation of the lower and upper percentiles via quantile regression.

Scaling Neighbourhood Methods

MCMC Sampling for Bayesian Inference using L1-type Priors

Gaussian Process Regression Model in Spatial Logistic Regression

Default Priors and Effcient Posterior Computation in Bayesian

Why Bayesian approaches? The average height of a rare plant

Inference for latent variable models with many hyperparameters

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

Chapter 4 - Spatial processes R packages and software Lecture notes

Reconstruction of individual patient data for meta analysis via Bayesian approach

A general mixed model approach for spatio-temporal regression data

Fast algorithms that utilise the sparsity of Q exist (c-package ( Q 1. GMRFlib) Seattle February 3, 2009

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Measurement Error in Spatial Modeling of Environmental Exposures

Chapter 4 - Fundamentals of spatial processes Lecture notes

arxiv: v1 [stat.co] 16 May 2011

Empirical Bayes methods for the transformed Gaussian random field model with additive measurement errors

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

Bayesian Inference of Multiple Gaussian Graphical Models

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013

Spatial Analysis of Incidence Rates: A Bayesian Approach

Constructing Confidence Intervals of the Summary Statistics in the Least-Squares SROC Model

Surveillance of Infectious Disease Data using Cumulative Sum Methods

Andrew B. Lawson 2019 BMTRY 763

Bayesian Multivariate Logistic Regression

A tutorial in spatial and spatio-temporal models with R-INLA

Weakly informative priors

ANALYSING BINARY DATA IN A REPEATED MEASUREMENTS SETTING USING SAS

Joint Modeling of Longitudinal Item Response Data and Survival

Modelling geoadditive survival data

Efficient adaptive covariate modelling for extremes

The STS Surgeon Composite Technical Appendix

Bayesian Methods for Estimating the Reliability of Complex Systems Using Heterogeneous Multilevel Information

Bayesian Spatial Health Surveillance

Kazuhiko Kakamu Department of Economics Finance, Institute for Advanced Studies. Abstract

A TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL

Transcription:

Performance of INLA analysing bivariate meta-regression and age-period-cohort models Andrea Riebler Biostatistics Unit, Institute of Social and Preventive Medicine University of Zurich INLA workshop, May 2009 Joint work with Lucas Bachmann, Leonhard Held, Michaela Paul and Håvard Rue

Andrea Riebler 2/ 29 Outline 1. Introduction 2. Bivariate meta-analysis 3. Age-period-cohort model 4. Summary

Andrea Riebler 3/ 29 1. Introduction Bivariate meta-analysis Comparison of the performance of inla and the performance obtained by the maximum likelihood procedure SAS PROC NLMIXED (Paul et al., 2009). Age-period-cohort models Comparison of the performance of inla and an MCMC algorithm implemented in C using the GMRFLib library (Rue and Held, 2005, Appendix). All analyses were run under Kubuntu 8.04 on a laptop with Intel(R) Core(TM) 2 Duo T7200 processor with 2.00 GHz.

Andrea Riebler 4/ 29 Bivariate meta-analysis Meta-analyses are used to summarise the results of separately performed studies, here diagnostic studies. Diagnostic studies often report two-by-two tables Sensitivity Se = TP TP + FN and specificity Sp = TN TN + FP. Bivariate meta-analysis: Models the relationship between sensitivity and specificity (after logit transformation), including random effects for both and allowing for correlation between them. Focus: Estimation of the expected sensitivity and specificity TP = true positives, FP = false positives, TN = true negatives, FN = false negatives.

Model formulation 1. Level TP i Se i Binomial(TP i + FN i, Se i ) TN i Sp i Binomial(TN i + FP i, Sp i ) 2. Level logit(se i ) = µ + U i α + φ i, logit(sp i ) = ν + V i β + ψ i, with ( φi ψ i ) N [( ) ( 0 1/τφ ρ/ )] τ, φ τ ψ 0 ρ/, τ φ τ ψ 1/τ ψ where i = 1,..., I is the study index (Chu and Cole, 2006). Andrea Riebler 5/ 29

Andrea Riebler 6/ 29 Inference Likelihood approaches Numerical maximisation might fail in complex problems. Construction of confidence intervals is problematic. Bayesian approaches Markov chain Monte Carlo (MCMC) is very time-consuming. Credible intervals are obtained as the quantiles of the samples. Comparison of inla and SAS PROC NLMIXED using an extensive simulation study.

Andrea Riebler 7/ 29 Simulation study 72 different scenarios where each scenario contains 1000 meta-analyses sampled from the model. We varied the number of studies per meta-analysis. the overall sensitivity and specificity. the between-studies precisions. the correlation between logit sensitivity and logit specificity. The number of participants is sampled for each study separately.

Andrea Riebler 8/ 29 Settings In a Bayesian context all parameters are treated as random and prior distributions are assigned (determined by a sensitivity analysis): For τ φ, τ ψ : Gamma(shape=0.25, rate=0.025). For Fisher s z-transformed correlation ρ: 1.0 Precision = 0.2 ρ N (0, 0.2 1 ) 0.8 0.6 0.4 0.2 0.0 1.0 0.5 0.0 0.5 1.0 ρ

Andrea Riebler 9/ 29 Results Comparison using bias, SD, MSE and coverage probabilities: Bias and MSE of inla and NLMIXED are almost the same. Bias and MSE depend on choice of sensitivity and specificity. The estimates are more precise for more studies. Precision of estimates and MSE are hardly influenced by the value of ρ. In general inla produces better coverage.

Andrea Riebler 10/ 29 Performance and running time Performance: Out of 72 000 analyses inla failed 2 times, NLMIXED failed 7 482 times (10.4%). Running time: For one scenario of 1000 meta-analyses inla took on average 6.0 minutes (min: 4.7, max: 7.8), NLMIXED took on average 38.1 minutes (min: 20.5, max: 89.3).

Andrea Riebler 11/ 29 Radiological evaluation of lymph node metastases Three types of diagnostic imaging are compared for detecting lymph node metastases in patients with cervical cancer (Scheidler et al., 1997). The meta-analysis consists of a total of 46 studies: 17 studies for lymphangiography (LAG) 19 studies for computed tomography (CT) 10 studies for magnetic resonance (MR) with each containing at least 20 patients.

Andrea Riebler 12/ 29 INLA call using the R-Interface > library(inla) > data(bivmetaanalysis) > head(bivmetaanalysis) N Y diid lag.tp lag.tn ct.tp ct.tn mr.tp mr.tn 1 29 19 1 1 0 0 0 0 0 2 82 81 2 0 1 0 0 0 0 3 10 8 3 1 0 0 0 0 0 4 22 13 4 0 1 0 0 0 0 5 53 41 5 1 0 0 0 0 0 6 50 49 6 0 1 0 0 0 0 > formula <- Y ~ f(diid, model = "2diid", + param = c(0.25, 0.025, 0.25, 0.025, 0, 0.2)) + + lag.tp + lag.tn + ct.tp + ct.tn + mr.tp + mr.tn - 1 > model <- inla(formula, family = "binomial", Ntrials = N, + data = BivMetaAnalysis, quantiles = c(0.025, 0.5, 0.975)) The analysis took about 0.6 seconds.

Andrea Riebler 13/ 29 Summary estimates Imaging Sensitivity Median 2.5%-quantile 97.5%-quantile LAG 0.69 0.57 0.80 CT 0.49 0.36 0.62 MR 0.55 0.37 0.71 Imaging Specificity Median 2.5%-quantile 97.5%-quantile LAG 0.83 0.76 0.89 CT 0.93 0.89 0.96 MR 0.95 0.91 0.98 The correlation ρ was estimated to 0.48 ( 0.76, 0.04).

Andrea Riebler 14/ 29 Discussion Similar performance of inla and NLMIXED regarding bias and MSE. Advantage of inla Better coverage More stable and faster Since sensitivity and specificity are jointly analysed, a joint confidence ellipse for these measures might be of interest. Comparison of NLMIXED and inla using an empirical Bayes approach?

Andrea Riebler 15/ 29 3. Age-period-cohort model Data on cancer often consist of yearly counts for different age groups and gender in pre-defined geographical areas. Our goal lies in: Detecting temporal patterns. Providing predictions for subsequent periods. Age-period-cohort (APC) model to describe incidence or mortality rates using three time scales. Age: age at diagnosis. Period: date of diagnosis. Cohort: date of birth.

Andrea Riebler 16/ 29 Univariate age-period-cohort model y ij : n ij : number of deaths or disease cases in age group i at period j number of persons at risk in age group i at period j y ij Poisson(n ij exp(ξ ij )) ξ ij = µ + α i + β j + γ k + z ij with age effect α i, period effect β j, cohort effect γ k and additional random effect z ij N (0, δ 1 ) to adjust for overdispersion. To assure identifiability of the intercept µ, we set I α i = i=1 J K β j = γ k = 0. j=1 k=1 Note: Because of the linear relationship k = I i + j, the age, period and cohort effects are still not identifiable

Andrea Riebler 17/ 29 Bayesian age-period-cohort model Non-parametric smoothing priors are used for the main effects with gamma hyperpriors for the associated smoothing parameters. Second-order random walk (RW2) α i N (2α i 1 α i 2, κ 1 ) i = 3,..., I RW2 penalises deviations from a linear trend α i = 2α i 1 α i 2. Note: Non-identifiability of the latent parameters remains, but does not require further constraints.

Case study: Lung cancer mortality in West Germany Period 1952 1962 1972 1982 1992 Males < 5 10 14 20 24 30 34 40 44 50 54 60 64 70 74 80 84 Age group Mortality rate per 100 000 403.4 148.4 54.6 20.09 7.39 2.72 1 0.37 0.14 0.05 0.02 0.01 18 age groups: < 5, 5-9, 10-14,..., 80-84, 85. 45 periods: 1952-1996. 130 cohorts: 1862-1867, 1863-1868,..., 1991-1996. (Knorr-Held and Rainer, 2001) Andrea Riebler 18/ 29

Andrea Riebler 19/ 29 INLA call using the R-Interface y n i j k z 3 250 1 1 2 1 20 260 2 1 1 2 9 230 1 2 3 3 12 270 2 2 2 4 7 260 1 3 4 5 10 290 2 3 3 6. > library(inla) For predictions, set y ij = NA. > lungm <- read.table("data/lungm4inla.txt", header=t) > formula <- y ~ f(i, model="rw2", param=c(1,0.00005)) + + f(j, model="rw2", param=c(1,0.00005)) + + f(k, model="rw2", param=c(1,0.00005)) + + f(z, model="iid", param=c(1,0.005)) > model <- inla(formula, family="poisson", data=lungm, E=lungm$n, + quantiles=c(0.1, 0.5, 0.9), control.compute=list(cpo=true), + control.predictor=list(compute=true)) > hyper <- inla.hyperpar(model)

Andrea Riebler 20/ 29 Results for complete dataset MCMC needed for 120 000 iterations about 10 minutes. INLA needed for the model estimation about 17 seconds and for the improved hyperparameter estimation about 2 minutes. Hyperparameter of age effects Hyperparameter of period effects Hyperparameter of cohort effects 0.00 0.02 0.04 Rao Blackwell INLA INLA improved 10 20 30 40 50 60 70 0.0e+00 1.0e 05 2.0e 05 20000 60000 100000 0.00000 0.00010 5000 10000 15000 20000 The inspection of identifiable measures gave similar results.

Andrea Riebler 21/ 29 Predictions for 1987-1996 Number of cases per 100 000 60 70 80 90 Observations INLA (with 80% CI) MCMC (with 80% CI) Number of cases per 100 000 100 110 120 130 140 150 160 170 1960 1970 1980 1990 Period 50 54 1960 1970 1980 1990 Period 55 59 Number of cases per 100 000 150 200 250 Number of cases per 100 000 150 200 250 300 350 400 1960 1970 1980 1990 Period 60 64 1960 1970 1980 1990 Period 65 69

Andrea Riebler 22/ 29 Inclusion of smoking data in the APC model The inclusion of appropriate covariate information in the APC model could improve the predictions. Model formulation: Assuming a time-constant effect β: ξ ij = µ + α i + β x j L + γ k + z ij. Assuming a time-varying effect β j : ξ ij = µ + α i + β j x j L + γ k + z ij, assigning a RW2 smoothing prior to β j. x j : number of cigarettes sold per 1/1000 capita in 1955-1994. L = 20 years: latency period.

Andrea Riebler 23/ 29 Inclusion of covariates in R-inla # cigarettes sold per capita (age >= 15) 0 500 1000 2000 3000 Goal: Prediction until 2010. Note: Because of L = 20 years, only data from 1975 onwards can enter. 1955 1960 1965 1970 1975 1980 1985 1990 Year Assuming a time-constant effect β: formula_const <- y ~ f(i, model="rw2", param=c(1,0.00005), constr=1) + f(k, model="rw2", param=c(1,0.00005), constr=1) + f(z, model="iid", param=c(1,0.005) ) + cig_cov Assuming a time-varying effect β j : formula_vary <- y ~ f(i, model="rw2", param=c(1,0.00005), constr=1) + f(j, model="rw2", param=c(1,0.00005), constr=0, weights=cig_cov) + f(k, model="rw2", param=c(1,0.00005), constr=1) + f(z, model="iid", param=c(1,0.005) )

Andrea Riebler 24/ 29 Covariate effects Time constant effect exp(β): 10%-quantile Median 90%-quantile 1.11 1.13 1.15 Time-varying effect exp(β j ): exp(β j) 0.0 0.5 1.0 1.5 2.0 1975 1980 1985 1990 1995 2000 2005 2010 Periods

Andrea Riebler 25/ 29 Model assessment PIT histogram for count data (Czado et al. 2009): APC (RW2) Time constant covariate effect Time varying covariate effect Relative frequency 0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.2 0.4 0.6 0.8 1.0 PIT 0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.2 0.4 0.6 0.8 1.0 PIT 0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.2 0.4 0.6 0.8 1.0 PIT Log-score: APC constant RW2 log(cpo) 3.895 3.887 3.905 Two CPO values were removed as they were classified as unreliable.

Andrea Riebler 26/ 29 Prediction until 2010 Number of cases per 100 000 30 40 50 60 70 80 Observations Median 80% region APC (RW2) time constant effect of covariate time varying effect of covariate Number of cases per 100 000 60 80 100 120 140 1975 1980 1985 1990 1995 2000 2005 2010 Period 50 54 1975 1980 1985 1990 1995 2000 2005 2010 Period 55 59

Andrea Riebler 27/ 29 Discussion INLA facilitates the analysis of Bayesian APC models. Prediction is straightforward. Covariate information can be easily incorporated. Model diagnostics available, but not completely robust.

Andrea Riebler 28/ 29 4. Summary For both applications presented, INLA is an alternative to the standard used inference approaches (ML, MCMC). It is: User-friendly and easy to apply Fast Flexible Issues for future work might be: Improved model diagnostics, Calculation of joint credibility intervals, Calculation of predictive distribution for response.

Andrea Riebler 29/ 29 Thank you for your attention Chu, H. and Cole, S.R. (2006). Bivariate meta-analysis of sensitivity and specificity with sparse data: a generalised linear mixed model approach. Journal of Clinical Epidemiology, 59, 1331 1333. Czado, C., Gneiting, T. and Held, L. (2009). Predictive Model Assessment for Count Data. Biometrics, to appear. Knorr-Held, L. and Rainer, E. (2001). Projections of lung cancer mortality in West Germany: a case study in Bayesian prediction. Biostatistics, 2, 109 129. Paul, M., Riebler, A., Bachmann, L., Rue, H. and Held, L. (2009). Bivariate meta-analysis with INLA: an approximate Bayesian inference. Statistics in Medicine, submitted. Rue, H. and Held, L. (2005). Gaussian Markov Random Fields: Theory and Applications. Volume 104 of Monographs on Statistics and Applied Probability, Chapmann & Hall/CRC. Rue, H., Martino, S. and Chopin, N. (2009). Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations (with discussion). Journal of the Royal Statistical Society: Series B, 71, 319 392. Scheidler, J., Hricak, H., Yu, K. K., Subak, L. Segal, M. R. (1997). Radiological evaluation of lymph node metastases in patients with cervical cancer. Journal of the American Medical Association,278, 1096 1101.