Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial
|
|
- Dorthy Powers
- 5 years ago
- Views:
Transcription
1 Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial Jin Kyung Park International Vaccine Institute Min Woo Chae Seoul National University R. Leon Ochiai International Vaccine Institute Yongdai Kim Seoul National University
2 Outlines 1. Motivational data 2. Frailty model, parameters, and priors 3. MCMC algorithm 4. Deviance Information Criterion 5. Simulation 6. Example 7. Summary 1
3 1. Motivational data To assess typhoid vaccine effectiveness, the International Vaccine Institute (IVI) coordinated a clinical trial such that Clustered randomization trial (CRT) Two-year surveillance at Kolkata, India from 2004 to 2006 Two-year surveillance at Karachi, Pakistan from 2002/2003 to 2004/2005 Time-to-first event of typhoid or censoring Risk factors: gender, age group etc. 2
4 Figure 1: Geographic clusters in Kolkata, India Source: The New England Journal of Medicine,
5 Table 1: Basic statistics Country Clusters Subjects Males Children Cases no. no. no.(%) no.(%) no.(ir /1000) Kolkata, India (51.49) 2192 (18.33) 109 (9.12) Karachi, Pakistan (51.74) 6481 (23.80) 79 (2.9) Vaccinees (2-16 years aged people) 5 yrs aged Incidence rate(no. of cases/no. of subjects) Clustering effect (random effects) Variance component of random effects by country Common effect of the treatment (vaccine effectiveness) We propose a way of selecting a model for the multi-country CRT based on the deviance information criterion (DIC) 4
6 2. Frailty model Observations: (T ijk, δ ijk, z ijk ), i = 1,, n s, j = 1, 2,, n ic, k = 1, 2,, n ijl where T ijk = min{x ijk, C ijk }, δ ijk = I(X ijk C ijk ),z ijk = (z ijk1,..., z ijkp ), X ijk and C ijk are survival and censoring times respectively n s is the number of sites (i.e., countries) n ic is the number of clusters of ith site n ijl the number of subjects in the jth cluster of ith site p is the number of covariates 5
7 Model a i (t z ijk, η ij ) = η ij exp(z ijkβ)a i (t) where β = (β 1,..., β p ) are the regression coefficients, a i (t) are baseline hazard functions of ith site and η ij (i = 1,, n s, j = 1, 2,, n ic ) are cluster specific frailty terms. Frailty η ij Γ(α i, α i ) It is assumed that η ij are independent random variables distributed by the gamma distribution with mean 1 and variance 1/α i. Here, α i can be considered as the degree of cluster heterogeneity of ith site (Clayton, 1978). 6
8 Parameters: β, a i ( ), α i Priors Normal prior for β m N(0, σm), 2 m = 1,..., p Gamma prior for α i, α i Γ(τ i1, τ i2 ) Bayesian bootstrap prior for a i ( ) 7
9 Bayesian Bootstrap (Kim and Lee, 2003) To approximate the full Bayesian posterior by Bayesian bootstrap posterior that is proportional to the product of the empirical likelihood and prior The conditional likelihood of θ = (β, a( )) given η = (η 1,, η nc ) L(θ η) = = n c n i i=1 j=1 n c n i i=1 j=1 ( ) ( η i exp(z ijβ)a(t δij ij ) exp ( ) ( η i exp(z ijβ)da(t δij ij ) exp tij 0 tij 0 η i exp(z ijβ)a(s)ds ) η i exp(z ijβ)da(s) ) where A(t) = t 0 a(s)ds is the cumulative baseline hazard function. 8
10 Let q be the number of distinct uncensored observations Let 0 < t 1 < < t q be the corresponding ordered uncensored observations Then, the empirical likelihood is obtained by assuming that A is a step function having jumps only at t 1,..., t q and replacing da(t) by A(t) = A(t) A(t ) L E (θ η) = n c n i i=1 j=1 ( ) η i exp(z ijβ) A(t δij ij ) exp Bayesian bootstrap prior for A (Kim and Lee, 2003) π( A) q k=1 1 A(t k ) Posterior Empirical likelihood BB prior η i exp(zijβ) A(t k ) k:t k t ij 9
11 Advantages of the Bayesian bootstrap Finitely many parameters: standard Bayes theorem can be used for posterior calculation Parameters: β = (β 1,, β p ), { A(t k ), k = 1,..., q}, α Computation of posterior is easy The resulting posterior is always proper, approximates the full Bayesian posterior well, and has desirable large sample properties. Kim and Lee (2003) Annals of Statistics 10
12 3. MCMC algorithm Step 1 : updating cumulative baseline hazards (A i ) Step 2 : updating regression coefficients (β 1,..., β p ) Step 3 : updating cluster specific random effects (η ij ) Step 4 : updating variance of cluster specific random effects (α i ) 11
13 A i (t l ) others Gamma(a il, b il ), where and b il = where a il = (i,j,k) D(t l ) (i,j,k) R(t l ) δ ijk η ij exp(z ijkβ) R(t) = {(i, j, k) : T ijk t, i = 1,..., n s, j = 1, 2,, n c, k = 1, 2,, n j } and D(t) = {(i, j, k) : T ijk = t, δ ijk = 1, i = 1,..., n c, j = 1, 2,, n i, k = 1, 2,, n j }. π p (β 1,..., β p others) q η ij exp(z ijkβ) exp Ai (t l ) l=1 (i,j,k) D(t l ) (i,j,k) R(t l ) Metropolis-Hastings algorithm is used for generating β. η ij exp(z ijkβ) π(β) 12
14 η ij others Gamma(c ij, d ij ), where c ij = n c j=1 δ ijk + α i and d ij = n c j=1 exp(z ijkβ)a(t ijk ) + α i. π p (α i others) [ nc j=1 ] α α i i Γ(α i ) ηα i 1 ij exp( α i η ij ) α τ i1 1 i exp( τ i2 α i ). Metropolis-Hastings algorithm is used for generating α i. 13
15 Exemplary data: Model selection using CRT data of two countries Table 2: Models: - mean that the corresponding parameters are all different Model Treatment effect Effect of other risk factors Cluster heterogeneity M M2 β 11 = = β nc M3 - β 1( 1) = = β nc ( 1) - M4 β 11 = = β nc 1 β 1( 1) = = β nc ( 1) - M5 - - α 1 = = α nc M6 β 11 = = β nc 1 - α 1 = = α nc M7 - β 1( 1) = = β nc ( 1) α 1 = = α nc M8 β 11 = = β nc 1 β 1( 1) = = β nc ( 1) α 1 = = α nc 14
16 4. Deviance Information Criterion (DIC) Bayesian Model selection criterion (Spiegelhalter et al., 2002) DIC(θ) = 4E θ [log f(y θ) y] + 2 log f(y θ(y)) where f(y θ) be the density of data y and parameter of interest, θ DIC is defined as the sum of the expected deviance and the effective number of parameters A simple and intuitively appealing extension of the Akaike Information Criterion (AIC, Akaike 1973) 15
17 Extensions of DIC (Celux et al., 2006) Observed DIC Use the marginal likelihood to define the DIC Often the observed DIC is difficult to calculate Alternatives Complete DIC Conditional DIC 16
18 Complete DIC Defined as the posterior expectation of the DIC obtained assuming the random effects are known DIC comp1 = 4E θ,w [log f(y, w θ) y] + 2E w [log f(y, w θ(y, w)) y] where f(y, w θ) be the density of y and w y and w represent observed data and random effects, respectively Computationally demanding when θ(y, w) does not have a closed form solution 17
19 Complete DIC type 2 Replace the term E w [log f(y, w θ(y, w)) y] by log f(y, w(y) θ(y)) DIC comp1 = 4E θ,w [log f(y, w θ) y] + 2E w [log f(y, w θ(y, w)) y] DIC comp2 = 4E θ,w [log f(y, w θ) y] + 2 log f(y, w(y) θ(y)) ( w(y), θ(y)) is an reasonable estimator of (w, θ) by treating w as an additional parameter (of interest) Tends to overestimate the effective number of parameters 18
20 Complete DIC type 3 Replace θ(y, w) in the DIC comp1 by θ(y) DIC comp1 = 4E θ,w [log f(y, w θ) y] + 2E w [log f(y, w θ(y, w)) y] DIC comp2 = 4E θ,w [log f(y, w θ) y] + 2 log f(y, w(y) θ(y)) DIC comp3 = 4E θ,w [log f(y, w θ) y] + 2E w [log f(y, w θ(y)) y] If we let θ = E θ [θ y], which is easily obtained using MCMC samples 19
21 Conditional DIC The conditional likelihood of y given w and θ is used DIC cond1 = 4E θ,w [log f(y w, θ) y] + 2E w [log f(y w, θ(y, w)) y] DIC cond2 = 4E θ,w [log f(y w, θ) y] + 2 log f(y w(y), θ(y)) DIC cond3 = 4E θ,w [log f(y w, θ) y] + 2E w [log f(y w, θ(y)) y] Note that the DIC cond1, DIC cond2 and DIC cond3 are conditional likelihood versions of the DIC comp1, DIC comp2 and DIC comp3, respectively. 20
22 5. Simulations Complete/Conditional DIC type 2 and 3 are used for model selection Generate 100 data sets with no. of sites, n s = 2 no. of clusters per site, n c = 20 no. of subjects per cluster, n k = 5 Generate Z 1 and Z 2 as covariates independently from the symmetric Bernoulli distribution with success probability 0.5. For the baseline hazard functions, we let a 1 (t) = 1 and a 2 (t) = 2 21
23 Consider the four models: 1. Equal parameter of interest and equal nuisance parameters SM1: β 11 = β 21 = 1.0, β 12 = β 22 = 0.5, α 1 = α 2 = Different parameter of interest and equal nuisance parameters SM2: (β 11, β 21 ) = ( 1.0, 0.0) and β 12 = β 22 = 0.5, α 1 = α 2 = Equal parameter of interest and different nuisance parameters SM3: β 11 = β 21 = 1.0, (β 12, β 22 ) = (1.0, 0.0), (α 1, α 2 ) = (0.3, 0.7) 4. Different parameter of interest and different nuisance parameters SM4: (β 11, β 21 ) = ( 1.0, 0.0), (β 12, β 22 ) = (1.0, 0.0), (α 1, α 2 ) = (0.3, 0.7) 22
24 Table 3: No frailty: the frequencies of the models selected by DICs among 100 data sets Censoring Simulation Partial likelihood Full likelihood prob. Model SM1 SM2 SM3 SM4 SM1 SM2 SM3 SM4 0.0 SM SM SM SM SM SM SM SM Selection by using AIC 23
25 Table 4: Frailty: the frequencies of the models selected by Complete DICs among 100 data sets True model COMPLETE DICs Censoring Simulation Type 2 Type 3 prob. Model SM1 SM2 SM3 SM4 SM1 SM2 SM3 SM4 0.0 SM SM SM SM SM SM SM SM
26 Table 5: Frailty: the frequencies of the models selected by Conditional DICs among 100 data sets True model CONDITIONAL DICs Censoring Simulation Type 2 Type 3 prob. Model SM1 SM2 SM3 SM4 SM1 SM2 SM3 SM4 0.0 SM SM SM SM SM SM SM SM
27 Summary of simulations DIC using full likelihood selects model well where the data has no frailties Complete DICs highly depend on frailty distribution Conditional DICs is proposed to be used for the model selection of the frailty models 26
28 DIC cond2 = 4E θ,w [log f(y w, θ) y] + 2 log f(y w(y), θ(y)) DIC cond3 = 4E θ,w [log f(y w, θ) y] + 2E w [log f(y w, θ(y)) y] DIC comp2 = 4E θ,w [log f(y w, θ) y] + 2 log f(y w(y), θ(y)) 4E θ,w [log π(w θ)] + 2 log π( w(y) θ(y)) DIC comp3 = 4E θ,w [log f(y w, θ) y] + 2E w [log f(y w, θ(y)) y] 4E θ,w [log π(w θ)] + 2E w [log π(w θ(y))] where w = η i and the distribution of η i follows η i Gamma(α, α) 27
29 6. Exemplary data: model selection using CRT data of two countries Table 6: Models: - mean that the corresponding parameters are all different Model Treatment effect Effect of other risk factors Cluster heterogeneity M M2 β 11 = = β nc M3 - β 1( 1) = = β nc ( 1) - M4 β 11 = = β nc 1 β 1( 1) = = β nc ( 1) - M5 - - α 1 = = α nc M6 β 11 = = β nc 1 - α 1 = = α nc M7 - β 1( 1) = = β nc ( 1) α 1 = = α nc M8 β 11 = = β nc 1 β 1( 1) = = β nc ( 1) α 1 = = α nc 28
30 MCMC sampling 500 times of burn-in no. of repetitions = no. of updates =
31 Table 7: Results of fitting data to frailty model: Type 2 DICs Conditional DIC Complete DIC Model DIC p D DIC p D M M M M M M M M
32 Table 8: Results of fitting data to frailty model: Type 3 DICs Conditional DIC Complete DIC Model DIC p D DIC p D M M M M M M M M
33 Table 9: Results of fitting data Model Country Effect coeff HR 95% CI of HR M4 I+II Vaccine (0.306, 0.707) Child (1.446, 2.670) Male (0.387, 1.454) Variance of frailties of Kolkata 1/ ˆα B 1 = (0.387, 1.236) Karachi 1/ ˆα B 2 = (0.768, 2.560) M2 I+II Vaccine (0.310, 0.728) I Child (1.287, 3.007) Male (0.811, 1.740) II Child (1.262, 3.076) Male (0.608, 1.402) Variance of frailties of Kolkata 1/ ˆα B 1 = (0.403, 1.233) Karachi 1/ ˆα B 2 = (0.772, 2.756) ˆα i B is the Bayes estimate of α i (i = 1, 2) 32
34 Summary of the example Model selection is done using the conditional DICs Data of CRT from two countries are combined The cluster effects are heterogeneous among countries (variance component of countries) 33
35 7. Summary Introduce frailty model for the cluster randomized trial Bayesian bootstrap prior for baseline hazard function Gamma prior frailty Introduce model selection for frailty model Complete DIC Conditional DIC Perform intensive simulation for the proportional hazards model and frailty model Conditional DIC is proposed for the frailty model selection 34
36 Acknowledgement This work was supported by the Bill and Melinda Gates Foundation through the Diseases of the Most Impoverished (DOMI) program, administered by the International Vaccine Institute (IVI), Seoul, Korea. Vaccine, Children, and a Better World 35
CTDL-Positive Stable Frailty Model
CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland
More informationRegularization in Cox Frailty Models
Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University
More informationFrailty Modeling for clustered survival data: a simulation study
Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University
More informationFrailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.
Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk
More informationModelling geoadditive survival data
Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model
More informationMultivariate Survival Analysis
Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in
More informationTopic 12 Overview of Estimation
Topic 12 Overview of Estimation Classical Statistics 1 / 9 Outline Introduction Parameter Estimation Classical Statistics Densities and Likelihoods 2 / 9 Introduction In the simplest possible terms, the
More informationModels for Multivariate Panel Count Data
Semiparametric Models for Multivariate Panel Count Data KyungMann Kim University of Wisconsin-Madison kmkim@biostat.wisc.edu 2 April 2015 Outline 1 Introduction 2 3 4 Panel Count Data Motivation Previous
More informationSAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000)
SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000) AMITA K. MANATUNGA THE ROLLINS SCHOOL OF PUBLIC HEALTH OF EMORY UNIVERSITY SHANDE
More informationResearch & Reviews: Journal of Statistics and Mathematical Sciences
Research & Reviews: Journal of Statistics and Mathematical Sciences Bayesian Estimation in Shared Compound Negative Binomial Frailty Models David D Hanagal * and Asmita T Kamble Department of Statistics,
More informationBayesian Multivariate Logistic Regression
Bayesian Multivariate Logistic Regression Sean M. O Brien and David B. Dunson Biostatistics Branch National Institute of Environmental Health Sciences Research Triangle Park, NC 1 Goals Brief review of
More informationCASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:
CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity Outline: 1. NIEHS Uterine Fibroid Study Design of Study Scientific Questions Difficulties 2.
More informationStep-Stress Models and Associated Inference
Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated
More informationStatistical Analysis of Spatio-temporal Point Process Data. Peter J Diggle
Statistical Analysis of Spatio-temporal Point Process Data Peter J Diggle Department of Medicine, Lancaster University and Department of Biostatistics, Johns Hopkins University School of Public Health
More informationMixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina
Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -
More informationBAYESIAN INFLUENCE DIAGNOSTIC METHODS FOR PARAMETRIC REGRESSION MODELS
BAYESIAN INFLUENCE DIAGNOSTIC METHODS FOR PARAMETRIC REGRESSION MODELS Hyunsoon Cho A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of
More informationPractical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK
Practical Bayesian Quantile Regression Keming Yu University of Plymouth, UK (kyu@plymouth.ac.uk) A brief summary of some recent work of us (Keming Yu, Rana Moyeed and Julian Stander). Summary We develops
More informationA general mixed model approach for spatio-temporal regression data
A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression
More informationOptimal rules for timing intercourse to achieve pregnancy
Optimal rules for timing intercourse to achieve pregnancy Bruno Scarpa and David Dunson Dipartimento di Statistica ed Economia Applicate Università di Pavia Biostatistics Branch, National Institute of
More informationEmpirical Likelihood Based Deviance Information Criterion
Empirical Likelihood Based Deviance Information Criterion Yin Teng Smart and Safe City Center of Excellence NCS Pte Ltd June 22, 2016 Outline Bayesian empirical likelihood Definition Problems Empirical
More informationAnalysing geoadditive regression data: a mixed model approach
Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression
More informationFrailty Probit model for multivariate and clustered interval-censor
Frailty Probit model for multivariate and clustered interval-censored failure time data University of South Carolina Department of Statistics June 4, 2013 Outline Introduction Proposed models Simulation
More informationBAYESIAN ESTIMATION IN SHARED COMPOUND POISSON FRAILTY MODELS
Journal of Reliability and Statistical Studies; ISSN (Print): 0974-8024, (Online):2229-5666 Vol. 8, Issue 1 (2015): 159-180 BAYESIAN ESTIMATION IN SHARED COMPOUND POISSON FRAILTY MODELS *David D. Hanagal
More informationMarkov Chain Monte Carlo in Practice
Markov Chain Monte Carlo in Practice Edited by W.R. Gilks Medical Research Council Biostatistics Unit Cambridge UK S. Richardson French National Institute for Health and Medical Research Vilejuif France
More informationSubmitted to the Brazilian Journal of Probability and Statistics
Submitted to the Brazilian Journal of Probability and Statistics A Bayesian sparse finite mixture model for clustering data from a heterogeneous population Erlandson F. Saraiva a, Adriano K. Suzuki b and
More informationBayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang
Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January
More informationMultistate Modeling and Applications
Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)
More information7. Estimation and hypothesis testing. Objective. Recommended reading
7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing
More informationOutline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models
Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Collaboration with Rudolf Winter-Ebmer, Department of Economics, Johannes Kepler University
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationMonte Carlo in Bayesian Statistics
Monte Carlo in Bayesian Statistics Matthew Thomas SAMBa - University of Bath m.l.thomas@bath.ac.uk December 4, 2014 Matthew Thomas (SAMBa) Monte Carlo in Bayesian Statistics December 4, 2014 1 / 16 Overview
More informationIntegrated likelihoods in survival models for highlystratified
Working Paper Series, N. 1, January 2014 Integrated likelihoods in survival models for highlystratified censored data Giuliana Cortese Department of Statistical Sciences University of Padua Italy Nicola
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More informationBayesian course - problem set 6 (lecture 7)
Bayesian course - problem set 6 (lecture 7) Ben Lambert December 7, 2016 1 A meta-analysis of beta blocker trials Table 1 shows the results of some of the 22 trials included in a meta-analysis of clinical
More informationMultinomial Data. f(y θ) θ y i. where θ i is the probability that a given trial results in category i, i = 1,..., k. The parameter space is
Multinomial Data The multinomial distribution is a generalization of the binomial for the situation in which each trial results in one and only one of several categories, as opposed to just two, as in
More informationOr How to select variables Using Bayesian LASSO
Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO On Bayesian Variable Selection
More informationPractical considerations for survival models
Including historical data in the analysis of clinical trials using the modified power prior Practical considerations for survival models David Dejardin 1 2, Joost van Rosmalen 3 and Emmanuel Lesaffre 1
More informationWavelet-Based Nonparametric Modeling of Hierarchical Functions in Colon Carcinogenesis
Wavelet-Based Nonparametric Modeling of Hierarchical Functions in Colon Carcinogenesis Jeffrey S. Morris University of Texas, MD Anderson Cancer Center Joint wor with Marina Vannucci, Philip J. Brown,
More informationA Bayesian multi-dimensional couple-based latent risk model for infertility
A Bayesian multi-dimensional couple-based latent risk model for infertility Zhen Chen, Ph.D. Eunice Kennedy Shriver National Institute of Child Health and Human Development National Institutes of Health
More informationCorrelated Gamma Frailty Models for Bivariate Survival Data
Correlated Gamma Frailty Models for Bivariate Survival Data David D. Hanagal, Arvind Pandey and Ayon Ganguly Department of Statistics, University of Pune, Pune-411007, India. Email: david hanagal@yahoo.co.in;
More informationNORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET
NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET Investigating posterior contour probabilities using INLA: A case study on recurrence of bladder tumours by Rupali Akerkar PREPRINT STATISTICS NO. 4/2012 NORWEGIAN
More informationA Regression Model For Recurrent Events With Distribution Free Correlation Structure
A Regression Model For Recurrent Events With Distribution Free Correlation Structure J. Pénichoux(1), A. Latouche(2), T. Moreau(1) (1) INSERM U780 (2) Université de Versailles, EA2506 ISCB - 2009 - Prague
More informationCURE FRACTION MODELS USING MIXTURE AND NON-MIXTURE MODELS. 1. Introduction
ØÑ Å Ø Ñ Ø Ð ÈÙ Ð Ø ÓÒ DOI: 10.2478/v10127-012-0001-4 Tatra Mt. Math. Publ. 51 (2012), 1 9 CURE FRACTION MODELS USING MIXTURE AND NON-MIXTURE MODELS Jorge A. Achcar Emílio A. Coelho-Barros Josmar Mazucheli
More informationAnalysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates
Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a
More informationNovember 2002 STA Random Effects Selection in Linear Mixed Models
November 2002 STA216 1 Random Effects Selection in Linear Mixed Models November 2002 STA216 2 Introduction It is common practice in many applications to collect multiple measurements on a subject. Linear
More informationStatistical Inference and Methods
Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and
More informationMATH c UNIVERSITY OF LEEDS Examination for the Module MATH2715 (January 2015) STATISTICAL METHODS. Time allowed: 2 hours
MATH2750 This question paper consists of 8 printed pages, each of which is identified by the reference MATH275. All calculators must carry an approval sticker issued by the School of Mathematics. c UNIVERSITY
More informationIntroduction to BGPhazard
Introduction to BGPhazard José A. García-Bueno and Luis E. Nieto-Barajas February 11, 2016 Abstract We present BGPhazard, an R package which computes hazard rates from a Bayesian nonparametric view. This
More informationA Bayesian Analysis in the Presence of Covariates for Multivariate Survival Data: An example of Application
Revista Colombiana de Estadística Junio 2011, volumen 34, no. 1, pp. 111 a 131 A Bayesian Analysis in the Presence of Covariates for Multivariate Survival Data: An example of Application Análisis bayesiano
More informationQuasi-likelihood Scan Statistics for Detection of
for Quasi-likelihood for Division of Biostatistics and Bioinformatics, National Health Research Institutes & Department of Mathematics, National Chung Cheng University 17 December 2011 1 / 25 Outline for
More informationNoninformative Priors for the Ratio of the Scale Parameters in the Inverted Exponential Distributions
Communications for Statistical Applications and Methods 03, Vol. 0, No. 5, 387 394 DOI: http://dx.doi.org/0.535/csam.03.0.5.387 Noninformative Priors for the Ratio of the Scale Parameters in the Inverted
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More informationeqr094: Hierarchical MCMC for Bayesian System Reliability
eqr094: Hierarchical MCMC for Bayesian System Reliability Alyson G. Wilson Statistical Sciences Group, Los Alamos National Laboratory P.O. Box 1663, MS F600 Los Alamos, NM 87545 USA Phone: 505-667-9167
More informationComparison of Bayesian Spatio-Temporal Models for Chronic Diseases
Journal of Data Science 8(2010), 189-211 Comparison of Bayesian Spatio-Temporal Models for Chronic Diseases Hoon Kim 1 and HeeJeong Lim 2 1 California State Polytechnic University and 2 California State
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationDownloaded from:
Camacho, A; Kucharski, AJ; Funk, S; Breman, J; Piot, P; Edmunds, WJ (2014) Potential for large outbreaks of Ebola virus disease. Epidemics, 9. pp. 70-8. ISSN 1755-4365 DOI: https://doi.org/10.1016/j.epidem.2014.09.003
More informationGeneralized linear mixed models (GLMMs) for dependent compound risk models
Generalized linear mixed models (GLMMs) for dependent compound risk models Emiliano A. Valdez, PhD, FSA joint work with H. Jeong, J. Ahn and S. Park University of Connecticut Seminar Talk at Yonsei University
More informationJoint Modeling of Longitudinal Item Response Data and Survival
Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,
More information7. Estimation and hypothesis testing. Objective. Recommended reading
7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing
More informationSpatial Survival Analysis via Copulas
1 / 38 Spatial Survival Analysis via Copulas Tim Hanson Department of Statistics University of South Carolina, U.S.A. International Conference on Survival Analysis in Memory of John P. Klein Medical College
More informationPower and Sample Size Calculations with the Additive Hazards Model
Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine
More informationBAYESIAN ANALYSIS OF PROPORTIONAL HAZARD MODELS. Ewha Womans University and Pennsylvania State University
The Annals of Statistics 23, Vol. 31, No. 2, 493 511 Institute of Mathematical Statistics, 23 BAYESIAN ANALYSIS OF PROPORTIONAL HAZARD MODELS BY YONGDAI KIM 1 AND JAEYONG LEE Ewha Womans University and
More informationGoodness-of-fit tests for randomly censored Weibull distributions with estimated parameters
Communications for Statistical Applications and Methods 2017, Vol. 24, No. 5, 519 531 https://doi.org/10.5351/csam.2017.24.5.519 Print ISSN 2287-7843 / Online ISSN 2383-4757 Goodness-of-fit tests for randomly
More informationNORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET
NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET Approximate Bayesian Inference for nonhomogeneous Poisson processes with application to survival analysis by Rupali Akerkar, Sara Martino and Håvard Rue PREPRINT
More informationModelling count data with overdispersion and spatial effects
myjournal manuscript No. (will be inserted by the editor) Modelling count data with overdispersion and spatial effects Susanne Gschlößl 1, Claudia Czado 2 1 Both at Center of Mathematical Sciences, Munich
More informationLocal Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina
Local Likelihood Bayesian Cluster Modeling for small area health data Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modelling for Small Area
More informationBAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA
BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA Intro: Course Outline and Brief Intro to Marina Vannucci Rice University, USA PASI-CIMAT 04/28-30/2010 Marina Vannucci
More informationStatistics & Data Sciences: First Year Prelim Exam May 2018
Statistics & Data Sciences: First Year Prelim Exam May 2018 Instructions: 1. Do not turn this page until instructed to do so. 2. Start each new question on a new sheet of paper. 3. This is a closed book
More informationObnoxious lateness humor
Obnoxious lateness humor 1 Using Bayesian Model Averaging For Addressing Model Uncertainty in Environmental Risk Assessment Louise Ryan and Melissa Whitney Department of Biostatistics Harvard School of
More informationMultivariate spatial modeling
Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced
More informationSurvival Analysis I (CHL5209H)
Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really
More informationBayesian Analysis for Partially Complete Time and Type of Failure Data
Bayesian Analysis for Partially Complete Time and Type of Failure Data Debasis Kundu Abstract In this paper we consider the Bayesian analysis of competing risks data, when the data are partially complete
More informationBayes Estimation and Prediction of the Two-Parameter Gamma Distribution
Bayes Estimation and Prediction of the Two-Parameter Gamma Distribution Biswabrata Pradhan & Debasis Kundu Abstract In this article the Bayes estimates of two-parameter gamma distribution is considered.
More informationDIC, AIC, BIC, PPL, MSPE Residuals Predictive residuals
DIC, AIC, BIC, PPL, MSPE Residuals Predictive residuals Overall Measures of GOF Deviance: this measures the overall likelihood of the model given a parameter vector D( θ) = 2 log L( θ) This can be averaged
More informationJoint longitudinal and survival-cure models in tumour xenograft experiments
Joint longitudinal and survival-cure models in tumour xenograft experiments Jianxin Pan, Yanchun Bao,, Hongsheng Dai,, Hong-Bin Fang, Abstract In tumour xenograft experiments, treatment regimens are administered
More informationModel comparison and selection
BS2 Statistical Inference, Lectures 9 and 10, Hilary Term 2008 March 2, 2008 Hypothesis testing Consider two alternative models M 1 = {f (x; θ), θ Θ 1 } and M 2 = {f (x; θ), θ Θ 2 } for a sample (X = x)
More informationAnalysis of competing risks data and simulation of data following predened subdistribution hazards
Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013
More informationAnalysis of Time-to-Event Data: Chapter 6 - Regression diagnostics
Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the
More informationSemiparametric Regression
Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under
More informationAssessing the Effect of Prior Distribution Assumption on the Variance Parameters in Evaluating Bioequivalence Trials
Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 8--006 Assessing the Effect of Prior Distribution Assumption on the Variance
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationThe Log-generalized inverse Weibull Regression Model
The Log-generalized inverse Weibull Regression Model Felipe R. S. de Gusmão Universidade Federal Rural de Pernambuco Cintia M. L. Ferreira Universidade Federal Rural de Pernambuco Sílvio F. A. X. Júnior
More informationPart III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data
1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions
More informationUNIVERSITY OF CALIFORNIA, SAN DIEGO
UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department
More informationCHAPTER 10. Bayesian methods. Geir Storvik
CHAPTER 10 Bayesian methods Geir Storvik 10.1 Introduction Statistical inference concerns about learning from data, either parameters (estimation) or some, typically future, variables (prediction). In
More informationMetropolis-Hastings Algorithm
Strength of the Gibbs sampler Metropolis-Hastings Algorithm Easy algorithm to think about. Exploits the factorization properties of the joint probability distribution. No difficult choices to be made to
More informationIntroduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016
Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An
More informationThe Jackknife-Like Method for Assessing Uncertainty of Point Estimates for Bayesian Estimation in a Finite Gaussian Mixture Model
Thai Journal of Mathematics : 45 58 Special Issue: Annual Meeting in Mathematics 207 http://thaijmath.in.cmu.ac.th ISSN 686-0209 The Jackknife-Like Method for Assessing Uncertainty of Point Estimates for
More informationPart 7: Hierarchical Modeling
Part 7: Hierarchical Modeling!1 Nested data It is common for data to be nested: i.e., observations on subjects are organized by a hierarchy Such data are often called hierarchical or multilevel For example,
More informationNow consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.
Weighting We have seen that if E(Y) = Xβ and V (Y) = σ 2 G, where G is known, the model can be rewritten as a linear model. This is known as generalized least squares or, if G is diagonal, with trace(g)
More informationSTAT 740: Testing & Model Selection
STAT 740: Testing & Model Selection Timothy Hanson Department of Statistics, University of South Carolina Stat 740: Statistical Computing 1 / 34 Testing & model choice, likelihood-based A common way to
More informationST440/540: Applied Bayesian Statistics. (9) Model selection and goodness-of-fit checks
(9) Model selection and goodness-of-fit checks Objectives In this module we will study methods for model comparisons and checking for model adequacy For model comparisons there are a finite number of candidate
More informationFRAILTY MODELS FOR MODELLING HETEROGENEITY
FRAILTY MODELS FOR MODELLING HETEROGENEITY By ULVIYA ABDULKARIMOVA, B.Sc. A Thesis Submitted to the School of Graduate Studies in Partial Fulfillment of the Requirements for the Degree Master of Science
More informationSeminar über Statistik FS2008: Model Selection
Seminar über Statistik FS2008: Model Selection Alessia Fenaroli, Ghazale Jazayeri Monday, April 2, 2008 Introduction Model Choice deals with the comparison of models and the selection of a model. It can
More informationOn Properties of QIC in Generalized. Estimating Equations. Shinpei Imori
On Properties of QIC in Generalized Estimating Equations Shinpei Imori Graduate School of Engineering Science, Osaka University 1-3 Machikaneyama-cho, Toyonaka, Osaka 560-8531, Japan E-mail: imori.stat@gmail.com
More informationLecture 4: Dynamic models
linear s Lecture 4: s Hedibert Freitas Lopes The University of Chicago Booth School of Business 5807 South Woodlawn Avenue, Chicago, IL 60637 http://faculty.chicagobooth.edu/hedibert.lopes hlopes@chicagobooth.edu
More informationOn Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data
On Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data Yehua Li Department of Statistics University of Georgia Yongtao
More informationCox s proportional hazards/regression model - model assessment
Cox s proportional hazards/regression model - model assessment Rasmus Waagepetersen September 27, 2017 Topics: Plots based on estimated cumulative hazards Cox-Snell residuals: overall check of fit Martingale
More informationThe Bayesian Choice. Christian P. Robert. From Decision-Theoretic Foundations to Computational Implementation. Second Edition.
Christian P. Robert The Bayesian Choice From Decision-Theoretic Foundations to Computational Implementation Second Edition With 23 Illustrations ^Springer" Contents Preface to the Second Edition Preface
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More information