Spatial Survival Analysis via Copulas
|
|
- Noel Doyle
- 5 years ago
- Views:
Transcription
1 1 / 38 Spatial Survival Analysis via Copulas Tim Hanson Department of Statistics University of South Carolina, U.S.A. International Conference on Survival Analysis in Memory of John P. Klein Medical College of Wisconsin June 2014
2 Outline 2 / 38 1 I: Prostate cancer, areally-referenced semiparametric survival 2
3 3 / 38 SCCCR data set on prostate cancer survival Large dataset on prostate cancer survival that does not follow proportional hazards. n = patients from South Carolina Central Cancer Registry (SCCCR) for the period ; each recorded with county, race, marital status, grade of tumor, and SEER summary stage; 72.3% are censored. Need to allow for non-proportional hazards and accommodate correlation of survival times within county. Joint work with Li Li and Jiajia Zhang.
4 4 / 38 Extended hazards model Etezadi-Amoli and Ciampi (1987) propose EH model Say x = (x 1, x 2 ), then EH is λ x (t) = λ 0 (te x β )e x γ. λ x (t) = λ 0 (te β 1x 1 +β 2 x 2 )e γ 1x 1 +γ 2 x 2. γ 1 = β 1 x 1 has AFT interpretation; β 1 = 0 x 1 has PH interpretation; γ 1 = 0 x 1 has AH interpretation. Likelihood for observing {(t i, x i, δ i )} n i=1 { n { δi } ti L(β, γ, λ 0 ( )) = e γ x i λ 0 (e β x i t i )} exp e γ x i λ 0 (te β x i )dt 0 i=1
5 5 / 38 Baseline hazard λ 0 (t) Want to shrink λ 0 (t) toward parametric target λ θ (t): λ 0 (t) = J b j B kj (t) j=1 where B k1 ( ),..., B k,j ( ) are kth order B-spline basis function over knots (s 1,..., s J+k ). Let s j = j+k j=k+1 s k/(k 1) and b j = λ θ ( s j ). Schoenberg s approximation theorem (Marsden 1972) says max 0 t sj+1 λ 0 (t) λ θ (t) ɛ(λ θ, k, J).
6 6 / 38 Posterior updating of model Step 1: Update the blocks {β, γ}, θ, c separately using adaptive Metropolis-Hastings algorithms (Haario, Saksman, and Tamminen 2005). Step 2: Sample g-prior parameters g 1 1 from Gamma(a g + 1, b g + β x xβ/2n + b g ) and g 1 2 from Gamma(a g + 1, b g + γ x xβ/2n + b g ). Step 3: Sample the latent random vectors u i from ( ) b 1 B 1 (e β x i t u i β, γ, b, θ, u i Mult i ) n j=1 b j B j (e β x i t i ),..., b J B J (e β x i t i ) n j=1 b j Bj (e β x i t i ) Step 4: B-spline coefficients are updated by b j β, γ, b j, θ, {u i } Gamma n ti u ij + cλ θ ( s j ), c + e γ x i B j (te β x i )dt i S i=1 0
7 Works great on simulated data Survival function Density function S 0(t) True Mean estimates 2.5%, 97.5% quantiles f 0(t) True Mean estimates 2.5%, 97.5% quantiles t t Hazard function Hazard function λ 0(t) True Mean estimates 2.5%, 97.5% quantiles λ 0(t) True Mean estimates Mean λ θ t t 7 / 38
8 8 / 38 Spatial dependence via frailties impractical PH with frailties: where g ci EH with frailties: λ(t i x) = λ 0 (t i )e γ x i +g ci, are county-level frailties, c i is county subject i in. λ(t i x) = λ 0 {t i e β x i +b ci }e γ x i +g ci, where, for our data, b 1,..., b 46 and g 1,..., g 46 are county-level frailties. Possible but impractical, and hard to interpret.
9 9 / 38 Spatial dependence via copula works great Define Y i = Φ 1 { 1 e Λ i (T i ) } where Φ( ) is the standard normal cumulative distribution function. Under Li and Lin (2006) Y N(0, Γ). Likelihood ffrom data {(t i, x i, δ i )} n i=1 is [ ] [ ] f i (t i ) f i (z i ) L s (β, γ, b, θ, Γ) = φ(y i ) φ(y i ) I(z i > t i ) φ(y; 0, Γ) dz i i S c i S c How to define Γ? i S We consider county-level lattice data; popular correlation model is intrinsic conditional autoregressive (ICAR) prior.
10 10 / 38 ICAR definition α = (α 1,, α m ) is vector of correlated effects on a lattice. ICAR prior on α is p(α) exp{ ϕα (D W)α/2} Implication of ICAR ( prior: n ) α j α j, ϕ N j=1 ω ijα j /ω j+, 1/(ϕω j+ ). Random effects approach Ỹ = (Ỹ1,..., Ỹm) = (Ỹ11,..., Ỹ1n 1,..., Ỹm1,..., Ỹmn m ). Ỹ ij = α i + ɛ ij, α N m (0, Ω), ɛ N n (0, Iσ 2 ). Resulting correlation matrix Γ = corr(ỹ) involves one unknown parameter ϕ.
11 11 / 38 Posterior updating-latent survival approach MCMC sampling steps 1 to 3 are the same as before. Step 4: Propose bj new from ( n ) ti Gamma u ij + cλ θ ( s j ), c + e γ x i B j (te β x i )dt i S c i=1 0 and accept it with probability { } n i=1 min 1, φ(y i)e ynew Γ 1 y new /2 n new i=1 φ(yi )e y Γ 1 y/2 where y new is new transformed failure time vector corresponding to bj new. Step 5: Sample Y i N(y i y i, Γ)I(y i > Φ 1 {F i (t i ))} then set z i = F 1 i {Φ(y i )}. Step 6: Update ϕ using adaptive Metropolis-Hastings.
12 12 / 38 Efficient evaluation of y Γ 1 y Γ is a n n matrix Elements of Γ 1 can be easily computed using SVD, Γ 1 = A 1 U 1 ( (K + σ 2 I m ) 1 σ 2 I m ) U 1 A 1 + σ 2 A 2 where A is a diagonal matrix, U 1 = (u 1,..., u m ), u i is a vector of length n with ones corresponding to county i and zero elsewhere. y Γ 1 y = x ( (K + σ 2 I m ) 1 σ 2 I m ) x + σ 2 y A 2 y
13 13 / 38 Savage-Dickey ratio for global and per-variable tests Example of global test of PH vs. EH BF 12 = π(β = 0 D, EH) π(β = 0 EH). Example of per-variable of PH for x j vs. EH BF 12 = π(β j = 0 D, EH) π(β j = 0 EH).
14 14 / 38 SCCCR data SCCCR prostate cancer data for the period Baseline covariates are county of residence, age, race, marital status, grade of tumor differentiation, and SEER summary stage. n = patients in the dataset after excluding subjects with missing information. 72.3% of the survival times are right-censored. Goal: assess racial disparity in prostate cancer survival, adjusting for the remaining risk factors and accounting for the county the subject lives in.
15 15 / 38 SCCCR data Table: Summary characteristics of prostate cancer patients in SC from Covariate n Sample percentage Race Black White Marital status Non-married Married Grade well or moderately differentiated poorly differentiated or undifferentiated SEER summary stage Localized or regional Distant
16 16 / 38 Non-spatial EH and reduced models Table: Summary of fitting the extended hazard model EH, the reduced model, AFT, and PH; indicates LPML and DIC Covar EH Reduced AFT PH PH+additive age β = γ β = 0 β = 0 Age β (0.48,0.52) 0.48(0.46,0.50) 0.48(0.45,0.51) γ (0.42,0.49) γ 1 = β (0.62,0.68) Race β (0.15,0.21) 0.20(0.16,0.21) 0.18(0.15,0.22) γ (0.12,0.24) γ 2 = β (0.21,0.32) 0.26(0.20,0.31) Marital β (-0.11,-0.02) -0.05(-0.09,-0.00) 0.26(0.21,0.30) status γ (0.29,0.40) 0.33(0.28,0.40) 0.33(0.27,0.39) 0.31(0.26,0.37) Grade β (-0.02,0.08) β 4 = (0.22,0.32) γ (0.29,0.41) 0.37(0.31,0.43) 0.38(0.32,0.44) 0.37(0.33,0.43) SEER β (2.80,3.53) 3.27(2.79,3.57) 1.50(1.41,1.59) stage γ (0.83,1.20) 1.00(0.82,1.19) 1.56(1.47,1.64) 1.57(1.19,1.65) LPML DIC
17 17 / 38 Non-spatial EH and reduced models Table: Bayes factors for comparing EH to PH, AFT, and AH with and without spatial correlation. EH Spatial+EH Covariate PH AFT AH PH AFT AH Age > > 1000 > > 1000 Race > > 1000 > 1000 < 0.01 > 1000 Marital status 1.79 > 1000 > > 1000 > 1000 Grade 0.14 > 1000 > > 1000 > 1000 SEER stage > 1000 > 1000 > 1000 > 1000 > 1000 > 1000
18 18 / 38 Spatial EH and reduced models Table: Summary of spatial models; indicates LPML and DIC Covariates Marginal EH Marginal reduced PH+ICAR+additive age β = 0 Age β (0.47,0.52) 0.47(0.46,0.49) γ (0.43,0.49) γ 1 = β 1 Race β (0.15,0.21) 0.20(0.17,0.22) γ (0.11,0.23) γ 2 = β (0.18,0.30) Marital status β (-0.10,-0.02) -0.02(-0.05,-0.00) γ (0.28,0.41) 0.33(0.27,0.39) 0.32(0.25,0.38) Grade β (-0.01,0.07) β 4 = 0 γ (0.30,0.42) 0.38(0.32,0.43) 0.37(0.32,0.44) SEER stage β (2.86,3.34) 2.77(2.72,2.82) γ (0.94,1.26) 1.21(1.01,1.33) 1.55(1.46,1.64) ϕ 50.1(19.9,113.7) 54.6(22.7,120.8) 33.08(9.2,100.1) LPML DIC
19 < to to to to 0.33 >0.33 < to to to to 0.06 > / 38 Spatial EH and reduced models Mortality rate PH ICAR frailty Marginal reduced model random effects < to to to to 0.04 >0.04 (a) (b) (c) Figure: Map of (a) Mortality rate, (b) ICAR frailties in the PH model and (c) random effects in the marginal reduced model for SC counties.
20 20 / 38 Spatial EH and reduced models Hazard estimates Survival estimates Hazard PH EH Reduced AFT Survival probability PH EH Reduced AFT Time in Years Time in Years Figure: Baseline hazard (left) and survival probabilities (right) estimates.
21 21 / 38 Spatial EH and reduced models Hazard estimates Survival estimates Hazard Posterior mean for blacks Posterior mean for whites Survival probability Posterior mean for blacks Posterior mean for whites 95% CI Time in Years Time in Years Figure: Hazard and survival for black patients (solid line) and white patients; baseline covariates.
22 22 / 38 Interpretation for race effect Based on the fitted results of the reduced models with and without spatial dependence, white South Carolina subjects diagnosed with prostrate cancer in live 22% longer (e ) than black patients (95% CI is 18% to 25%), fixing age, stage, and SEER stage. Cox said...the physical or substantive basis for...proportional hazards models...is one of its weaknesses... and goes on to suggest that...accelerated failure time models are in many ways more appealing because of their quite direct physical interpretation. The SCCCR analysis showed that the main covariate of interest, race, is best modeled as an AFT effect. Survival probabilities for black patients are significantly lower than those for white patients when other factors are fixed at the same levels.
23 23 / 38 More interpretation Decreasing age by one year increases survival time by 5.4%. Hazard of dying increases 46% for poorly or undifferentiated grades vs. well or moderately differentiated, holding age, race, and SEER stage constant. SEER stage has general EH effects, e (AH) and e (PH). Those with distant stage are at least three times worse in one-sixteenth of the time as those with localized or regional. Marital status essentially has PH interpretation; single (including widowed or separated) subjects are e times more likely to die at any instant than married.
24 24 / 38 About this work Analysis of frog population extinction Joint work with Haiming Zhou and Roland Knapp (Sierra Nevada Aquatic Research Laboratory). Frogs and other amphibians have been dying off in large numbers since the 1980s because of a deadly fungus called Batrachochytrium dendrobatidis, also known as Bd. Dr. Knapp has been studying the amphibian declines for the past decade at Sierra Nevada Aquatic Research Laboratory; he has hiked thousands of miles and surveyed hundreds of frog populations in Sequoia-Kings Canyon National Park collecting the data by hand. Again, proportional hazards is grossly violated for these data.
25 25 / 38 The Frog Data ( ) Analysis of frog population extinction Contains 309 frog populations. Each was followed up until infection or being censored (10% censoring). The response is the Bd infection time (i.e. Bd arrival year baseline year). Main covariates: bdwater: whether or not Bd has been found in the watershed. bddistance: straight-line distance to the nearest Bd location. Populations near each other tend to become infected at about the same time.
26 26 / 38 Objectives Analysis of frog population extinction T i = T (s i ) is time to local extinction for pop n located at s i. x i = x(s i ) : a p 1 vector of covariates. Goal 1: describe the association between x(s) and T (s) while allowing for spatial dependence. Goal 2: predict T (s 0 ) given x(s 0 ) at any new location s 0.
27 27 / 38 Analysis of frog population extinction LDDPM model and Spatial Extension LDDPM (De Iorio et al., 2009; Jara et al., 2010): Y i = log T i given x i independently follow mixture model ( y x ) F xi (y G) = Φ i β dg{β, σ 2 }, σ where G follows a Dirichlet Process (DP) prior, i.e. G DP(α, G 0 ). the joint distribution of Y = (Y 1,..., Y n ) by F(t 1,..., t n G) = C(F x1 (t 1 G),..., F xn (t n G); θ), where C(u 1,..., u n ; θ) is a spatial copula with parameter θ, which is used for capturing spatial dependence. We use point-referenced Gaussian copula of Li and Lin (2006).
28 28 / 38 Specification of F xi (y G) Analysis of frog population extinction Assume Y i = log T i given x i follows a mixture model ( y x ) F xi (y G) = Φ i β dg{β, σ 2 }, σ where G DP(α, G 0 ) with concentration parameter α. G in truncated stick-breaking form (Sethuraman, 1994) as N G = w k δ (βk,σk 2), k=1 w k = V k (1 V j ), j<k where V k α iid Beta(1, α), (β k, σ 2 k ) iid G 0.
29 29 / 38 The Likelihood Analysis of frog population extinction Observed data {(y o i, δ i, x i, s i ) : i = 1,..., n}. Denote by y i the latent true log event-time corresponding to y o i. Then δ i = I(y i = y o i ), where δ i = 0 implies y i > y o i. The augmented likelihood for parameters {y 1,..., y n, G, θ} is n { } L = f xi (y i G) δ i I(y i = yi o ) + (1 δ i )I(y i > yi o ) i=1 C 1/2 exp { 1 } 2 z (C 1 I n )z where f xi (y i G) is the density w.r.t. F xi (y i G), and z = (z 1,..., z n ) with z i = Φ 1 {F xi (y i G)}.
30 30 / 38 MCMC Overview Analysis of frog population extinction All parameters involved in G are updated based on a modification of the blocked Gibbs sampler (Ishwaran and James, 2001): M-H samplers with independent proposals. The latent y i s are updated via independent M-H sampler. Delayed rejection (Tierney and Mira, 1999) used for several parameters; helps sampler not get stuck. The correlation parameters θ are updated using adaptive M-H (Haario et al., 2001). For large n, the inversion of the n n matrix C substantially sped up using a full scale approximation (FSA) (Sang and Huang, 2012).
31 Analysis of frog population extinction Simulated Data Assume the log T given x follows a mixture of normals f (y x) = 0.4N( x, 1 2 ) + 0.6N(2.5 x, ). Specify covariance matrix C with θ 1 = 0.98 and θ 2 = 0.1. Around 10% of survival times are right-censored Figure: 330 randomly selected sites: the n = 300 subjects for estimation; the 30 subjects for prediction. 31 / 38
32 32 / 38 Analysis of frog population extinction Simulated Data: Inference on Spatial Correlation Table: Posterior summary statistics for the spatial correlation parameters Par. True Mean Median Std. dev. 95% HPD Interval θ (0.933, 1.000) θ (0.056, 0.138)
33 33 / 38 Simulated Data: Estimated Density Analysis of frog population extinction Truth 0.6 De Iorio Ours Figure: Estimated density of log(t ) when x = 1 (left) and x = 1 (right). Recall: f (y x) = 0.4N( x, 1 2 ) + 0.6N(2.5 x, ).
34 34 / 38 Simulated Data: Prediction Analysis of frog population extinction Table: The mean squared prediction error (MSPE) for the held-out 30 locations under two different methods Method MSPE De Iorio Ours 0.269
35 35 / 38 Analysis of frog population extinction Frog Data: Inference on Spatial Correlation FYI: the spatial Gaussian copula involves the n n correlation matrix C(s i, s j ; θ) = θ 1 ρ(s i, s j ) + (1 θ 1 )I(s i = s j ), where θ 1 [0, 1] and ρ(s i, s j ) = exp { θ 2 s i s j }. Posterior mean ˆθ 1 = Posterior mean ˆθ 2 = , indicating the correlation decays by 1 exp( ) = 8% for every 1-km increase in distance and 1 exp( ) = 58% for every 10-km increase in distance. Table: Posterior summary statistics for the spatial correlation parameters Par. Mean Median Std. dev. 95% HPD Interval θ (0.9879, ) θ (0.0493, )
36 36 / 38 Frog Data: Estimated Hazard Analysis of frog population extinction bdwater=0 0.9 bdwater= Figure: Estimated hazard function with 90% point-wise CI for bdwater = 1 vs bdwater = 0 when the bddistance is fixed at its population mean.
37 37 / 38 Frog Data: Spatial Prediction Spatial map for the transformed process z(s) = Φ 1 { F x(s) (log T (s) G) }. Analysis of frog population extinction Figure: Predictive spatial map based on new 2000 random locations in D.
38 38 / 38 Remarks Analysis of frog population extinction Proposed Bayesian spatial copula approaches to estimate survival curves semiparametrically (EH model) and nonparametrically (LDDPM) while allowing for spatial dependence, leading to high predictive accuracy. Adopted the FSA approach to compute the inversion of n-dimensional spatial covariance matrices for georeferenced data. Future research involves use of low-rank approaches to modeling data with lots of spatial locations, e.g. random projections (Banerjee et al., 2012) or predictive processes (Banerjee et al., 2008). Two different Banerjees! Extension to simpler semiparametric models such as proportional odds. Thanks to my colleagues Li Li, Haiming Zhou, Roland Knapp, and Jiajia Zhang. Thanks for the invitation!
Recent Advances in Bayesian Spatial Survival Modeling
1 / 47 Recent Advances in Bayesian Spatial Survival Modeling Tim Hanson Department of Statistics University of South Carolina, U.S.A. University of Alabama Department of Mathematics April 10, 2015 2 /
More informationMarginal Survival Modeling through Spatial Copulas
1 / 53 Marginal Survival Modeling through Spatial Copulas Tim Hanson Department of Statistics University of South Carolina, U.S.A. University of Michigan Department of Biostatistics March 31, 2016 2 /
More informationA Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness
A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model
More informationFrailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.
Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk
More informationCTDL-Positive Stable Frailty Model
CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland
More informationAnalysing geoadditive regression data: a mixed model approach
Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression
More informationBayesian non-parametric model to longitudinally predict churn
Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics
More informationModelling geoadditive survival data
Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model
More informationMultivariate spatial modeling
Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced
More informationStatistical Inference and Methods
Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and
More informationBayesian nonparametrics
Bayesian nonparametrics 1 Some preliminaries 1.1 de Finetti s theorem We will start our discussion with this foundational theorem. We will assume throughout all variables are defined on the probability
More informationBayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang
Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January
More informationBayesian Nonparametric Inference Methods for Mean Residual Life Functions
Bayesian Nonparametric Inference Methods for Mean Residual Life Functions Valerie Poynor Department of Applied Mathematics and Statistics, University of California, Santa Cruz April 28, 212 1/3 Outline
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationOutline. Binomial, Multinomial, Normal, Beta, Dirichlet. Posterior mean, MAP, credible interval, posterior distribution
Outline A short review on Bayesian analysis. Binomial, Multinomial, Normal, Beta, Dirichlet Posterior mean, MAP, credible interval, posterior distribution Gibbs sampling Revisit the Gaussian mixture model
More informationBayesian Semi- and Non-parametric Analysis for Spatially Correlated Survival Data
University of South Carolina Scholar Commons Theses and Dissertations 2015 Bayesian Semi- and Non-parametric Analysis for Spatially Correlated Survival Data Haiming Zhou University of South Carolina Follow
More informationMultivariate Survival Analysis
Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in
More informationOther Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model
Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);
More informationFrailty Probit model for multivariate and clustered interval-censor
Frailty Probit model for multivariate and clustered interval-censored failure time data University of South Carolina Department of Statistics June 4, 2013 Outline Introduction Proposed models Simulation
More informationBayesian Nonparametric Autoregressive Models via Latent Variable Representation
Bayesian Nonparametric Autoregressive Models via Latent Variable Representation Maria De Iorio Yale-NUS College Dept of Statistical Science, University College London Collaborators: Lifeng Ye (UCL, London,
More informationSTAT Advanced Bayesian Inference
1 / 32 STAT 625 - Advanced Bayesian Inference Meng Li Department of Statistics Jan 23, 218 The Dirichlet distribution 2 / 32 θ Dirichlet(a 1,...,a k ) with density p(θ 1,θ 2,...,θ k ) = k j=1 Γ(a j) Γ(
More informationA unified framework for fitting Bayesian semiparametric models to arbitrarily censored survival data, including spatially-referenced data
A unified framework for fitting Bayesian semiparametric models to arbitrarily censored survival data, including spatially-referenced data arxiv:1701.06976v2 [stat.ap] 3 Jul 2017 Haiming Zhou Division of
More informationModel Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial
Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial Jin Kyung Park International Vaccine Institute Min Woo Chae Seoul National University R. Leon Ochiai International
More informationBayesian Model Diagnostics and Checking
Earvin Balderama Quantitative Ecology Lab Department of Forestry and Environmental Resources North Carolina State University April 12, 2013 1 / 34 Introduction MCMCMC 2 / 34 Introduction MCMCMC Steps in
More information39th Annual ISMS Marketing Science Conference University of Southern California, June 8, 2017
Permuted and IROM Department, McCombs School of Business The University of Texas at Austin 39th Annual ISMS Marketing Science Conference University of Southern California, June 8, 2017 1 / 36 Joint work
More informationFlexible Regression Modeling using Bayesian Nonparametric Mixtures
Flexible Regression Modeling using Bayesian Nonparametric Mixtures Athanasios Kottas Department of Applied Mathematics and Statistics University of California, Santa Cruz Department of Statistics Brigham
More informationBayesian Statistics. Debdeep Pati Florida State University. April 3, 2017
Bayesian Statistics Debdeep Pati Florida State University April 3, 2017 Finite mixture model The finite mixture of normals can be equivalently expressed as y i N(µ Si ; τ 1 S i ), S i k π h δ h h=1 δ h
More informationBayes Model Selection with Path Sampling: Factor Models
with Path Sampling: Factor Models Ritabrata Dutta and Jayanta K Ghosh Purdue University 07/02/11 Factor Models in Applications Factor Models in Applications Factor Models Factor Models and Factor analysis
More informationPairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion
Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial
More informationLocal Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina
Local Likelihood Bayesian Cluster Modeling for small area health data Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modelling for Small Area
More informationRegularization in Cox Frailty Models
Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University
More informationModeling and Predicting Healthcare Claims
Bayesian Nonparametric Regression Models for Modeling and Predicting Healthcare Claims Robert Richardson Department of Statistics, Brigham Young University Brian Hartman Department of Statistics, Brigham
More informationComputational statistics
Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated
More informationLuke B Smith and Brian J Reich North Carolina State University May 21, 2013
BSquare: An R package for Bayesian simultaneous quantile regression Luke B Smith and Brian J Reich North Carolina State University May 21, 2013 BSquare in an R package to conduct Bayesian quantile regression
More informationA Dirichlet Process Mixture Model for Survival Outcome Data: Assessing Nationwide Kidney Transplant Centers
Research Article Received XXXX (www.interscience.wiley.com) DOI:.02/sim.0000 A Dirichlet Process Mixture Model for Survival Outcome Data: Assessing Nationwide Kidney Transplant s Lili Zhao a, Jingchunzi
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters
More information5. Parametric Regression Model
5. Parametric Regression Model The Accelerated Failure Time (AFT) Model Denote by S (t) and S 2 (t) the survival functions of two populations. The AFT model says that there is a constant c > 0 such that
More information19 : Bayesian Nonparametrics: The Indian Buffet Process. 1 Latent Variable Models and the Indian Buffet Process
10-708: Probabilistic Graphical Models, Spring 2015 19 : Bayesian Nonparametrics: The Indian Buffet Process Lecturer: Avinava Dubey Scribes: Rishav Das, Adam Brodie, and Hemank Lamba 1 Latent Variable
More informationKarhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes
TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University
More informationPart 6: Multivariate Normal and Linear Models
Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of
More informationNORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET
NORGES TEKNISK-NATURVITENSKAPELIGE UNIVERSITET Investigating posterior contour probabilities using INLA: A case study on recurrence of bladder tumours by Rupali Akerkar PREPRINT STATISTICS NO. 4/2012 NORWEGIAN
More informationComputer Practical: Metropolis-Hastings-based MCMC
Computer Practical: Metropolis-Hastings-based MCMC Andrea Arnold and Franz Hamilton North Carolina State University July 30, 2016 A. Arnold / F. Hamilton (NCSU) MH-based MCMC July 30, 2016 1 / 19 Markov
More informationBayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework
HT5: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Maximum Likelihood Principle A generative model for
More informationSurvival Analysis. Stat 526. April 13, 2018
Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is
More informationFoundations of Nonparametric Bayesian Methods
1 / 27 Foundations of Nonparametric Bayesian Methods Part II: Models on the Simplex Peter Orbanz http://mlg.eng.cam.ac.uk/porbanz/npb-tutorial.html 2 / 27 Tutorial Overview Part I: Basics Part II: Models
More informationAFT Models and Empirical Likelihood
AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t
More informationLongitudinal breast density as a marker of breast cancer risk
Longitudinal breast density as a marker of breast cancer risk C. Armero (1), M. Rué (2), A. Forte (1), C. Forné (2), H. Perpiñán (1), M. Baré (3), and G. Gómez (4) (1) BIOstatnet and Universitat de València,
More informationIncorporating unobserved heterogeneity in Weibull survival models: A Bayesian approach
Incorporating unobserved heterogeneity in Weibull survival models: A Bayesian approach Catalina A. Vallejos 1 Mark F.J. Steel 2 1 MRC Biostatistics Unit, EMBL-European Bioinformatics Institute. 2 Dept.
More informationMarkov Chain Monte Carlo in Practice
Markov Chain Monte Carlo in Practice Edited by W.R. Gilks Medical Research Council Biostatistics Unit Cambridge UK S. Richardson French National Institute for Health and Medical Research Vilejuif France
More informationNon-Parametric Bayes
Non-Parametric Bayes Mark Schmidt UBC Machine Learning Reading Group January 2016 Current Hot Topics in Machine Learning Bayesian learning includes: Gaussian processes. Approximate inference. Bayesian
More informationTests of independence for censored bivariate failure time data
Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ
More informationBayesian semiparametric analysis of short- and long- term hazard ratios with covariates
Bayesian semiparametric analysis of short- and long- term hazard ratios with covariates Department of Statistics, ITAM, Mexico 9th Conference on Bayesian nonparametrics Amsterdam, June 10-14, 2013 Contents
More informationIntroduction to BGPhazard
Introduction to BGPhazard José A. García-Bueno and Luis E. Nieto-Barajas February 11, 2016 Abstract We present BGPhazard, an R package which computes hazard rates from a Bayesian nonparametric view. This
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More informationLecture 3. Truncation, length-bias and prevalence sampling
Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in
More informationA general mixed model approach for spatio-temporal regression data
A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression
More informationMCMC 2: Lecture 2 Coding and output. Phil O Neill Theo Kypraios School of Mathematical Sciences University of Nottingham
MCMC 2: Lecture 2 Coding and output Phil O Neill Theo Kypraios School of Mathematical Sciences University of Nottingham Contents 1. General (Markov) epidemic model 2. Non-Markov epidemic model 3. Debugging
More informationBayesian Semiparametric Methods for Joint Modeling Longitudinal and Survival Data
Bayesian Semiparametric Methods for Joint Modeling Longitudinal and Survival Data Tim Hanson Division of Biostatistics University of Minnesota and Adam Branscum University of Kentucky Departments of Biostatistics,
More informationSTAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where
STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht
More informationMarkov Chain Monte Carlo
Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationMonte Carlo in Bayesian Statistics
Monte Carlo in Bayesian Statistics Matthew Thomas SAMBa - University of Bath m.l.thomas@bath.ac.uk December 4, 2014 Matthew Thomas (SAMBa) Monte Carlo in Bayesian Statistics December 4, 2014 1 / 16 Overview
More informationMarkov Chain Monte Carlo (MCMC)
Markov Chain Monte Carlo (MCMC Dependent Sampling Suppose we wish to sample from a density π, and we can evaluate π as a function but have no means to directly generate a sample. Rejection sampling can
More informationModule 22: Bayesian Methods Lecture 9 A: Default prior selection
Module 22: Bayesian Methods Lecture 9 A: Default prior selection Peter Hoff Departments of Statistics and Biostatistics University of Washington Outline Jeffreys prior Unit information priors Empirical
More informationChapter 2. Data Analysis
Chapter 2 Data Analysis 2.1. Density Estimation and Survival Analysis The most straightforward application of BNP priors for statistical inference is in density estimation problems. Consider the generic
More informationGibbs Sampling in Latent Variable Models #1
Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor
More informationReduced-rank hazard regression
Chapter 2 Reduced-rank hazard regression Abstract The Cox proportional hazards model is the most common method to analyze survival data. However, the proportional hazards assumption might not hold. The
More informationAnalysis of Cure Rate Survival Data Under Proportional Odds Model
Analysis of Cure Rate Survival Data Under Proportional Odds Model Yu Gu 1,, Debajyoti Sinha 1, and Sudipto Banerjee 2, 1 Department of Statistics, Florida State University, Tallahassee, Florida 32310 5608,
More informationLecture 8: The Metropolis-Hastings Algorithm
30.10.2008 What we have seen last time: Gibbs sampler Key idea: Generate a Markov chain by updating the component of (X 1,..., X p ) in turn by drawing from the full conditionals: X (t) j Two drawbacks:
More informationConstruction of Dependent Dirichlet Processes based on Poisson Processes
1 / 31 Construction of Dependent Dirichlet Processes based on Poisson Processes Dahua Lin Eric Grimson John Fisher CSAIL MIT NIPS 2010 Outstanding Student Paper Award Presented by Shouyuan Chen Outline
More informationNoninformative Priors for the Ratio of the Scale Parameters in the Inverted Exponential Distributions
Communications for Statistical Applications and Methods 03, Vol. 0, No. 5, 387 394 DOI: http://dx.doi.org/0.535/csam.03.0.5.387 Noninformative Priors for the Ratio of the Scale Parameters in the Inverted
More informationMAS3301 / MAS8311 Biostatistics Part II: Survival
MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the
More informationFrailty Probit Models for Clustered Interval- Censored Failure Time Data
University of South Carolina Scholar Commons Theses and Dissertations 2016 Frailty Probit Models for Clustered Interval- Censored Failure Time Data Haifeng Wu University of South Carolina Follow this and
More informationEfficient Bayesian Multivariate Surface Regression
Efficient Bayesian Multivariate Surface Regression Feng Li (joint with Mattias Villani) Department of Statistics, Stockholm University October, 211 Outline of the talk 1 Flexible regression models 2 The
More informationGibbs Sampling in Linear Models #2
Gibbs Sampling in Linear Models #2 Econ 690 Purdue University Outline 1 Linear Regression Model with a Changepoint Example with Temperature Data 2 The Seemingly Unrelated Regressions Model 3 Gibbs sampling
More informationMAS3301 Bayesian Statistics Problems 5 and Solutions
MAS3301 Bayesian Statistics Problems 5 and Solutions Semester 008-9 Problems 5 1. (Some of this question is also in Problems 4). I recorded the attendance of students at tutorials for a module. Suppose
More informationBayesian model selection for computer model validation via mixture model estimation
Bayesian model selection for computer model validation via mixture model estimation Kaniav Kamary ATER, CNAM Joint work with É. Parent, P. Barbillon, M. Keller and N. Bousquet Outline Computer model validation
More informationBayesian semiparametric modeling for stochastic precedence, with applications in epidemiology and survival analysis
Bayesian semiparametric modeling for stochastic precedence, with applications in epidemiology and survival analysis ATHANASIOS KOTTAS Department of Applied Mathematics and Statistics, University of California,
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationBayesian linear regression
Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding
More informationA Fully Nonparametric Modeling Approach to. BNP Binary Regression
A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012 Outline 1 2 3 Simulation
More informationNonparametric Bayesian Methods - Lecture I
Nonparametric Bayesian Methods - Lecture I Harry van Zanten Korteweg-de Vries Institute for Mathematics CRiSM Masterclass, April 4-6, 2016 Overview of the lectures I Intro to nonparametric Bayesian statistics
More informationGibbs Sampling in Endogenous Variables Models
Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationABC methods for phase-type distributions with applications in insurance risk problems
ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon
More informationStat 542: Item Response Theory Modeling Using The Extended Rank Likelihood
Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal
More informationNearest Neighbor Gaussian Processes for Large Spatial Data
Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns
More informationBuilding a Prognostic Biomarker
Building a Prognostic Biomarker Noah Simon and Richard Simon July 2016 1 / 44 Prognostic Biomarker for a Continuous Measure On each of n patients measure y i - single continuous outcome (eg. blood pressure,
More informationParameter Estimation
Parameter Estimation Chapters 13-15 Stat 477 - Loss Models Chapters 13-15 (Stat 477) Parameter Estimation Brian Hartman - BYU 1 / 23 Methods for parameter estimation Methods for parameter estimation Methods
More informationNonparametric Bayesian modeling for dynamic ordinal regression relationships
Nonparametric Bayesian modeling for dynamic ordinal regression relationships Athanasios Kottas Department of Applied Mathematics and Statistics, University of California, Santa Cruz Joint work with Maria
More informationFrailty Modeling for clustered survival data: a simulation study
Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationLog Gaussian Cox Processes. Chi Group Meeting February 23, 2016
Log Gaussian Cox Processes Chi Group Meeting February 23, 2016 Outline Typical motivating application Introduction to LGCP model Brief overview of inference Applications in my work just getting started
More informationBayesian Inference for DSGE Models. Lawrence J. Christiano
Bayesian Inference for DSGE Models Lawrence J. Christiano Outline State space-observer form. convenient for model estimation and many other things. Bayesian inference Bayes rule. Monte Carlo integation.
More informationBayesian semiparametric inference for the accelerated failure time model using hierarchical mixture modeling with N-IG priors
Bayesian semiparametric inference for the accelerated failure time model using hierarchical mixture modeling with N-IG priors Raffaele Argiento 1, Alessandra Guglielmi 2, Antonio Pievatolo 1, Fabrizio
More informationFundamental Issues in Bayesian Functional Data Analysis. Dennis D. Cox Rice University
Fundamental Issues in Bayesian Functional Data Analysis Dennis D. Cox Rice University 1 Introduction Question: What are functional data? Answer: Data that are functions of a continuous variable.... say
More informationBivariate Degradation Modeling Based on Gamma Process
Bivariate Degradation Modeling Based on Gamma Process Jinglun Zhou Zhengqiang Pan Member IAENG and Quan Sun Abstract Many highly reliable products have two or more performance characteristics (PCs). The
More informationCASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:
CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity Outline: 1. NIEHS Uterine Fibroid Study Design of Study Scientific Questions Difficulties 2.
More informationMotivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University
Econ 690 Purdue University In virtually all of the previous lectures, our models have made use of normality assumptions. From a computational point of view, the reason for this assumption is clear: combined
More information