Approaches for Multiple Disease Mapping: MCAR and SANOVA

Size: px
Start display at page:

Download "Approaches for Multiple Disease Mapping: MCAR and SANOVA"

Transcription

1 Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

2 Outline Outline Introduction MCAR: Multivariate Conditionally Auto-Regressive model Jin, X., Banerjee, S. and Carlin, B.P. (2007). Order-free co-regionalized areal data models with application to multiple-disease mapping. Journal of the Royal Statistical Society, Series B, 65, SANOVA: Smoothed ANalysis Of VAriance Zhang, Y., Hodges, J.S. and Banerjee, S. (2009). Smoothed ANOVA with spatial effects as a competitor to MCAR in multivariate spatial smoothing. Annals of Applied Statistics, 3(4), MCAR vs SANOVA: modelling correlations vs means Simulation experiments and analysis of Minnesota 3-cancer data Discussion and future work 2 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

3 Introduction Disease mapping: Describe the geographic variation of disease Generate hypotheses about the possible causes for differences in risk of disease. 3 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

4 Introduction Disease mapping: Describe the geographic variation of disease Generate hypotheses about the possible causes for differences in risk of disease. Standard mortality ratio: SMR = Y i /E i Y i is the observed number of deaths in region i. E i is the expected number of deaths in region i. What factors are accountable for discrepancies between Y i s and E i s? 3 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

5 Single Disease Mapping Map of raw standard mortality ratios (SMR) of oesophagus cancer between 1991 and 1998 in counties of Minnesota, USA. (c) Esophagus Cancer Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

6 Single Disease Mapping Background: Modelling of a single disease Let n be the number of counties For counts of (rare) disease, poisson regression model: Hierarchical spatial Poisson regression model Y i µ i ind P oisson {E i exp(µ i )} i = 1,..., n. µ i = x i β + φ i. The x i s are explanatory, region-level spatial covariates, having parameter coefficients β. How do we model φ i s? 5 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

7 Univariate CAR model Define: φ i : univariate spatial random variable, i = 1,, n; φ = (φ 1,, φ n ) ; i j = region i is a neighbor of region j; m i = number of neighbors of region i. Let D = diag{m i } W = adjacency matrix of the map. 6 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

8 Univariate CAR model Define: φ i : univariate spatial random variable, i = 1,, n; φ = (φ 1,, φ n ) ; i j = region i is a neighbor of region j; m i = number of neighbors of region i. Let D = diag{m i } W = adjacency matrix of the map. p(φ i φ j, j i, τ) = N α i j w ij w i+ φ j, 1 ; τm i Brook s Lemma: φ N ( 0, τ 1 (D αw ) 1). 6 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

9 Univariate CAR model Define: φ i : univariate spatial random variable, i = 1,, n; φ = (φ 1,, φ n ) ; i j = region i is a neighbor of region j; m i = number of neighbors of region i. Let D = diag{m i } W = adjacency matrix of the map. p(φ i φ j, j i, τ) = N α i j w ij w i+ φ j, 1 ; τm i Brook s Lemma: φ N ( 0, τ 1 (D αw ) 1). Aside: D αw is known as the Laplacian of a graph in combinatorics (Brualdi and Ryser, 1991). 6 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

10 Univariate CAR model λ min and λ max are the minimum and maximum eigenvalues of D 1/2 W D 1/2. In fact, λ min < 0 and λ max = 1. α (λ 1 min, λ 1 max) ensures proper distribution for φ (Cressie, 1993; Sun et al., 2000). α = 1 implies improper (singular) distribution (Besag et al. 1991). { p(φ 1, φ 2,..., φ n τ) (τ) (n G)/2 exp τ } 2 φ (D W )φ, where G is the number of islands in the map. In fact, n G is the rank of D W. 7 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

11 Univariate CAR model Hierarchical Bayesian modeling: First stage: likelihood for observation data Second stage: prior distributions for fixed effects β and random effects ψ = (ψ 1,..., ψ N ). Estimate model using Markov chain Monte Carlo (MCMC) methods 8 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

12 Multiple Disease Mapping Maps of raw standard mortality ratios (SMR) of lung and esophagus cancer between 1991 and 1998 in Minnesota counties (a) (c) Lung Cancer Esophagus Cancer Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

13 Multiple Disease Mapping Y ij is the disease count for disease j in county i. 10 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

14 Multiple Disease Mapping Y ij is the disease count for disease j in county i. x ij are explanatory, region-level spatial covariates for disease j having parameter coefficients β j. 10 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

15 Multiple Disease Mapping Y ij is the disease count for disease j in county i. x ij are explanatory, region-level spatial covariates for disease j having parameter coefficients β j. For counts of (rare) diseases, poisson regression model: Multiple disease model Y ij ind P oisson(e ij e x ijβ j + φ ij ), i = 1,..., n, j = 1,..., p. 10 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

16 Multiple Disease Mapping Y ij is the disease count for disease j in county i. x ij are explanatory, region-level spatial covariates for disease j having parameter coefficients β j. For counts of (rare) diseases, poisson regression model: Multiple disease model Y ij ind P oisson(e ij e x ijβ j + φ ij ), i = 1,..., n, j = 1,..., p. How do we model the φ ij s? 10 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

17 Multiple Disease Mapping Associations in multiple disease data: 11 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

18 Multiple Disease Mapping Associations in multiple disease data: Spatial association across regions now for each disease Dependence among multiple diseases (within the same region) Cross-spatial associations among multiple diseases in different regions 11 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

19 MCAR models M CAR(α, Λ) (Mardia, 1988) φ 2 Define φ j = (φ 1j,, φ nj ) ; φ = (φ 1., φ p). φ N ( 0, Λ 1 (D αw ) 1) Covariance matrix (p = 2): ( ) ( ( ) ( φ1 0 (D αw )Λ11 (D αw )Λ N, 12 0 (D αw )Λ 12 (D αw )Λ 22 Note: Common spatial smoothing parameter is used for all diseases. ) 1 ) 12 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

20 MCAR models MCAR(α 1,..., α p, Λ) (Gelfand and Vounatsou, 2003) Now every disease has its own smoothing parameter. Covariance matrix (p = 2). ( φ1 φ 2 ) N ( ( 0 0 ) ( (D α1 W )Λ, 11 (D α 2 W )Λ 22 ) 1 ), No additional parameter to control smoothness of cross-correlations. 13 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

21 MCAR models Gelfand and Vounatsou (2003) recognize the difficulties in specifying covariance matrices of the following form ( e.g. with (p = 2)): ( φ1 φ 2 ) N ( ( 0 0 ) ( (D α1 W )Λ, 11 (D α 3 W )Λ 12 (D α 3 W )Λ 12 (D α 2 W )Λ 22 ) 1 ) 14 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

22 MCAR models Gelfand and Vounatsou (2003) recognize the difficulties in specifying covariance matrices of the following form ( e.g. with (p = 2)): ( φ1 φ 2 ) N ( ( 0 0 ) ( (D α1 W )Λ, 11 (D α 3 W )Λ 12 (D α 3 W )Λ 12 (D α 2 W )Λ 22 ) 1 ) Kim, Sun and Tsutakawa (2001): Two-fold CAR model that allows smoothness parameters for cross-correlations for two diseases. Also work by Sain and Cressie (2002). Extremely difficult to generalize to p > Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

23 MCAR models Gelfand and Vounatsou (2003) recognize the difficulties in specifying covariance matrices of the following form ( e.g. with (p = 2)): ( φ1 φ 2 ) N ( ( 0 0 ) ( (D α1 W )Λ, 11 (D α 3 W )Λ 12 (D α 3 W )Λ 12 (D α 2 W )Λ 22 ) 1 ) Kim, Sun and Tsutakawa (2001): Two-fold CAR model that allows smoothness parameters for cross-correlations for two diseases. Also work by Sain and Cressie (2002). Extremely difficult to generalize to p > 2. Jin et al. (2005): Conditional approach to building 2-CAR models: [φ 1 ][φ 2 φ 1 ]. 14 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

24 MCAR models county 1 county 2 county 3 α 1 α 1 φ 11 φ 21 φ 31 A α 3 α 3 α 0 α 3 α 0 α 3 α 0 α 2 α 2 φ 12 φ 22 φ 32 B 15 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

25 MCAR models through latent spatial effects Basic idea Borrow ideas from factor analysis and multivariate geostatistics? 16 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

26 MCAR models through latent spatial effects Basic idea Borrow ideas from factor analysis and multivariate geostatistics? For i = 1,..., n and j = 1,..., p: φ ij = a j1 u i1 + a j2 u i2 + + a jp u ip 16 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

27 MCAR models through latent spatial effects Basic idea Borrow ideas from factor analysis and multivariate geostatistics? For i = 1,..., n and j = 1,..., p: φ ij = a j1 u i1 + a j2 u i2 + + a jp u ip Letting φ j = (φ 1j,..., φ nj ), φ j = a j1 u 1 + a j2 u a jp u p 16 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

28 MCAR models through latent spatial effects Basic idea Borrow ideas from factor analysis and multivariate geostatistics? For i = 1,..., n and j = 1,..., p: φ ij = a j1 u i1 + a j2 u i2 + + a jp u ip Letting φ j = (φ 1j,..., φ nj ), φ j = a j1 u 1 + a j2 u a jp u p u j s are latent variables (no dimension reduction) a jk s are unknown coefficients (weights) not varying over regions (but could) 16 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

29 MCAR models through latent spatial effects Basic idea Case 1: p independent and identical latent CAR effects Assume p latent independent CAR effects u j (one for each disease): u j N n (0, (D αw ) 1), j = 1,..., p. 17 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

30 MCAR models through latent spatial effects Basic idea Case 1: p independent and identical latent CAR effects Assume p latent independent CAR effects u j (one for each disease): u j N n (0, (D αw ) 1), j = 1,..., p. Let φ = (φ 1,..., φ p), A = {a jk } and Σ = AA. 17 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

31 MCAR models through latent spatial effects Basic idea Case 1: p independent and identical latent CAR effects Assume p latent independent CAR effects u j (one for each disease): u j N n (0, (D αw ) 1), j = 1,..., p. Let φ = (φ 1,..., φ p), A = {a jk } and Σ = AA. The joint distribution of φ is ( φ N 0, Σ (D αw ) 1). 17 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

32 MCAR models through latent spatial effects Basic idea Case 1: p independent and identical latent CAR effects Assume p latent independent CAR effects u j (one for each disease): u j N n (0, (D αw ) 1), j = 1,..., p. Let φ = (φ 1,..., φ p), A = {a jk } and Σ = AA. The joint distribution of φ is ( φ N 0, Σ (D αw ) 1). This is the separable MCAR distribution M CAR(α, Σ) distribution. 17 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

33 MCAR models through latent spatial effects Basic idea Case 2: p independent but not identical latent CAR effects Assume that the latent CAR effects u j are independent and not identical. ( u j N 0, (D α j W ) 1), j = 1,..., p. 18 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

34 MCAR models through latent spatial effects Basic idea Case 2: p independent but not identical latent CAR effects Assume that the latent CAR effects u j are independent and not identical. ( u j N 0, (D α j W ) 1), j = 1,..., p. The joint distribution of φ is φ N ( 0, (A I n n )Γ 1 (A I n n ) ), 18 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

35 MCAR models through latent spatial effects Basic idea Case 2: p independent but not identical latent CAR effects Assume that the latent CAR effects u j are independent and not identical. ( u j N 0, (D α j W ) 1), j = 1,..., p. The joint distribution of φ is φ N ( 0, (A I n n )Γ 1 (A I n n ) ), Γ is block diagonal with diagonal entries Γ j = D α j W, j = 1,..., p. 18 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

36 MCAR models through latent spatial effects Basic idea Case 2: p independent but not identical latent CAR effects Assume that the latent CAR effects u j are independent and not identical. ( u j N 0, (D α j W ) 1), j = 1,..., p. The joint distribution of φ is φ N ( 0, (A I n n )Γ 1 (A I n n ) ), Γ is block diagonal with diagonal entries Γ j = D α j W, j = 1,..., p. This is the non-separable MCAR, MCAR(α 1,..., α p, Σ) distribution. 18 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

37 MCAR models through latent spatial effects Basic idea Case 3: p dependent and not identical latent CAR effects Assume that the latent CAR effects u = (u 1,..., u p) are such that: u N ( 0, (I p p D B W ) 1), where B = {b jk } is a p p symmetric matrix. 19 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

38 MCAR models through latent spatial effects Basic idea Case 3: p dependent and not identical latent CAR effects Assume that the latent CAR effects u = (u 1,..., u p) are such that: u N ( 0, (I p p D B W ) 1), where B = {b jk } is a p p symmetric matrix. The joint distribution of φ is φ N ( 0, (A I n n ) (I p p D B W ) 1 (A I n n ) ). 19 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

39 MCAR models through latent spatial effects Basic idea Case 3: p dependent and not identical latent CAR effects Assume that the latent CAR effects u = (u 1,..., u p) are such that: u N ( 0, (I p p D B W ) 1), where B = {b jk } is a p p symmetric matrix. The joint distribution of φ is φ N ( 0, (A I n n ) (I p p D B W ) 1 (A I n n ) ). We call this the MCAR(B, Σ) distribution 19 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

40 MCAR models through latent spatial effects Basic idea Method behind this madness? The M CAR(B, Σ) distribution can be expressed (e.g. p = 2 diseases) as ( ( ) ) 1 (D α11 W )Λ φ N 0, 11 (D α 12 W )Λ 12. (D α 12 W )Λ 12 (D α 22 W )Λ 22 Result holds for general p, i.e. matrices of the form {(D α ij W )Λ ij } There exists a bijection: ({a jk }, {b jk }) ({Λ ij }, {α ij }) 20 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

41 MCAR models through latent spatial effects Basic idea Computation Notes Restriction on B: ξ i = eigenvalues of D 1/2 W D 1/2, i = 1,..., n; (ξ min < 0) ζ j = eigenvalues of B, j = 1,, p ξ i ζ j = eigenvalues for B (D 1/2 W D 1/2 ) I p D B W is p.d. iff ζ j (1/ξ min, 1). 21 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

42 MCAR models through latent spatial effects Basic idea Computation Notes Restriction on B: ξ i = eigenvalues of D 1/2 W D 1/2, i = 1,..., n; (ξ min < 0) ζ j = eigenvalues of B, j = 1,, p ξ i ζ j = eigenvalues for B (D 1/2 W D 1/2 ) I p D B W is p.d. iff ζ j (1/ξ min, 1). Update B: Spectral decomposition of B: B = P P ; P = p 1 i=1 p j=i+1 G ij(θ ij ) where G ij is I p with i-th and j-th diagonal elements replaced by cos(θ ij ) and (i, j)-th and (j, i)-th entry replaced by ± sin(θ ij ); priors: θ ij U( π/2, π/2), ζ j U(1/ξ min, 1) 21 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

43 Minnesota Cancer Example Minnesota Cancer Surveillance data used; Counties in Minnesota are treated as locations: j = 1,..., types of diseases considered: cancers of the lung, larynx and oesophagus: j = 1, 2, 3. E ij computed as the expected age adjusted number of deaths due to cancer j in county i. 22 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

44 Minnesota Cancer Example 23 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

45 Do we really need this madness? Do we really need these complex multivariate structures? Can the data really identify MCAR s elaborate dependence structures? What if we want to work with an improper MCAR? Can we have a simpler and more interpretable approach? 24 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

46 A new approach to multiple disease mapping Two-way ANOVA with one observation per cell County 1 County 2 County n Cancer 1 Cancer Cancer p 25 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

47 A new approach to multiple disease mapping Two-way ANOVA table Effects df Cancer p 1 County n 1 Error=Cancer County (n 1)(p 1) Total np 1 26 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

48 A new approach to multiple disease mapping Two-way ANOVA table Effects df Cancer p 1 County n 1 Error=Cancer County (n 1)(p 1) Total np 1 Saturated linear model: Grand Mean + Cancer Main Effects + County Main Effects + Cancer County Interaction Effects 26 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

49 A new approach to multiple disease mapping A new approach to multiple disease mapping? Smoothed ANOVA (Hodges et al, 2007): g(e[y]) = A 1 Θ 1 + A 2 Θ 2 ; M 1 and M 2 degrees of freedom for main effects and interactions respectively A 1 s columns correspond to the grand mean and main effects; A 1 A 1 = I M1. A 2 s columns correspond to interactions; A 2 A 2 = I M2. Rather than choosing interaction (and main effects) to include/exclude...include all these effects, but shrink toward zero. 27 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

50 A new approach to multiple disease mapping A 1 and A 2 are design matrices of contrasts for the unshrunk (overall & main) and shrunk (interaction) effects. Smoothing : Θ 2 N(0, diag( 1 η j )). Usually the interaction effects are shrunk towards zero. Θ 1 is assigned a flat prior; grand mean and main effects are not smoothed. 28 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

51 A new approach to multiple disease mapping A 1 and A 2 are design matrices of contrasts for the unshrunk (overall & main) and shrunk (interaction) effects. Smoothing : Θ 2 N(0, diag( 1 η j )). Usually the interaction effects are shrunk towards zero. Θ 1 is assigned a flat prior; grand mean and main effects are not smoothed. How does the improper CAR fit into SANOVA? 28 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

52 The intrinsic CAR Model Define Q = D W. The intrinsic CAR model (Besag et al 1991) with L 2 norm has the improper density ( p(φ τ) τ n G 2 exp τ ) 2 φ Qφ, 29 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

53 The intrinsic CAR Model Define Q = D W. The intrinsic CAR model (Besag et al 1991) with L 2 norm has the improper density ( p(φ τ) τ n G 2 exp τ ) 2 φ Qφ, τ is the spatial dispersion parameter. G is the number of islands" (disconnected parts). Assume that G = 1 (no major issues for general G) 29 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

54 Fit CAR into SANOVA Q1 = 0 and rank of Q is n Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

55 Fit CAR into SANOVA Q1 = 0 and rank of Q is n 1. Let Q have spectral decomposition Q = V ΛV, Λ is diagonal and V is orthogonal with columns as eigenvectors. 30 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

56 Fit CAR into SANOVA Q1 = 0 and rank of Q is n 1. Let Q have spectral decomposition Q = V ΛV, Λ is diagonal and V is orthogonal with columns as eigenvectors. Eigenvector 1 n 1 n has eigenvalue Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

57 Fit CAR into SANOVA Q1 = 0 and rank of Q is n 1. Let Q have spectral decomposition Q = V ΛV, Λ is diagonal and V is orthogonal with columns as eigenvectors. Eigenvector 1 n 1 n has eigenvalue 0. Let V s other columns are orthogonal to 1 n ; call them V 1 and their corresponding eigenvalue matrix as Λ Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

58 Fit CAR into SANOVA Q1 = 0 and rank of Q is n 1. Let Q have spectral decomposition Q = V ΛV, Λ is diagonal and V is orthogonal with columns as eigenvectors. Eigenvector 1 n 1 n has eigenvalue 0. Let V s other columns are orthogonal to 1 n ; call them V 1 and their corresponding eigenvalue matrix as Λ 1. V 1 s n 1 columns also define contrasts in φ s: 1 V 1 = Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

59 Fit CAR into SANOVA Define a new parameter Θ = V φ, so ( p(θ τ) τ n G 2 exp τ ) 2 Θ ΛΘ, 31 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

60 Fit CAR into SANOVA Define a new parameter Θ = V φ, so ( p(θ τ) τ n G 2 exp τ ) 2 Θ ΛΘ, Setting a CAR prior on φ setting Θ s prior to N(0, (τλ) 1 ). 31 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

61 Fit CAR into SANOVA Define a new parameter Θ = V φ, so ( p(θ τ) τ n G 2 exp τ ) 2 Θ ΛΘ, Setting a CAR prior on φ setting Θ s prior to N(0, (τλ) 1 ). Let φ = (φ 1, φ 2,..., φ n ), then the SANOVA model φ = V Θ [ ] [ 1 = V 1 n Θ 1 Reg n Θ GM where Θ REG N(0, (τλ 1 ) 1 ) and the precision Λ nn = 0 for the overall level, i.e., a flat prior on Θ GM. ]. 31 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

62 Fit MCAR into SANOVA Suppose φ, the spatial effects for p cancers in n counties, arises from a saturated linear regression model: φ = [ φ 1, φ 2,..., φ n ] = [A1 A 2 ] Θ = 1 np 1 np }{{} 1 1 n H CA n }{{} V 1 1 p 1 p }{{} V 1 H CA (1)... V1 H CA (p 1) }{{} Θ GM Θ CA Θ CO Θ CO CA Cancer County Cancer County Grand mean main eect main eect interaction np 1 np (p 1) np (n 1) np (n 1)(p 1) Let Θ CO N n 1 (0, (τ 0 Λ 1 ) 1 ); Θ CO CA (j) N n 1 (0, (τ j Λ 1 ) 1 ), where H CA is p (p 1) matrix whose columns are contrasts for cancer, Λ 1 corresponds to V 1 ; τ 0, τ j > 0 are unknown and τ 0 Λ 1, τ j Λ 1 are precision matrices. 32 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

63 Fit MCAR into SANOVA We can prove that a priori, A 2 Θ CO Θ CO CA (1). Θ CO CA (p 1) has precision Q (H A (+) (+) diag(τ j )H A ), 33 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

64 Fit MCAR into SANOVA We can prove that a priori, A 2 Θ CO Θ CO CA (1). Θ CO CA (p 1) has precision Q (H A (+) (+) diag(τ j )H A ), A 2 refers to the columns of the design matrix corresponding to the county main effects and the cancer-by-county interactions 33 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

65 Fit MCAR into SANOVA We can prove that a priori, A 2 Θ CO Θ CO CA (1). Θ CO CA (p 1) has precision Q (H A (+) (+) diag(τ j )H A ), A 2 refers to the columns of the design matrix corresponding to the county main effects and the cancer-by-county interactions H A (+) = ( 1 p 1 p H CA ) is an orthogonal matrix 33 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

66 MCAR vs SANOVA The marginal precision matrix of [ φ 1, φ 2,..., φ N ] is: MCAR SANOVA 34 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

67 Simulation Experiment We simulated both normal and Poisson distributed data for different settings of precision parameters. When generating data, we set H (+) A = HA 1 = We fit three SANOVA models: SANOVA with the correct H A (+), HA 1 ; SANOVA with a somewhat incorrect H A (+), HA 2 ; a variant SANOVA with a very incorrect H A (+), H AM, where HA 2 = H AM = , Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

68 Simulation Findings Six models were compared. Three flavours of SANOVA SANOVA Correct has the H (+) A used to generate the data. SANOVA Incorrect used a slightly wrong H (+) A matrix. SANOVA Variant data generated from an MCAR (very wrong H (+) A ). Compared to an MCAR model with varying values for Ω s hyper-prior matrix (i.e. the Wishart scale matrix) R. R = I 3 R = I 3 R = 200 I 3 36 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

69 Simulation Findings Metrics used: AMSE: Average (over L = 100 replications) MSE of the cell means plugging in the posterior medians MBIAS: 2.5, 50, & 97.5 percentiles of the average (over replications) bias of the cell means (plugging in the posterior median). PI Rate: average coverage of the 95% posterior intervals of the cell means. DIC scores 37 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

70 Simulation Findings Findings: SANOVA Correct did the best (smallest AMSE & MBIAS, nearly nominal PI Rate). For normal data: SANOVA mostly outperformed MCAR. SANOVA models were comparable but worse than MCAR for good choices of the Wishart s scale matrix; MCAR performed poorly for very bad choices on R Overall: SANOVA appears competitive than MCAR and is fairly robust to mis-specifying H (+) A, unlike MCAR which is relatively sensitive to the hyper-prior on Ω. 38 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

71 Maps of raw and fitted SMR s Lung Cancer 39 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

72 Maps of raw and fitted SMR s Larynx Cancer 40 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

73 Maps of raw and fitted SMR s Esophagus Cancer 41 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

74 Discussion and future work We extended SANOVA to smooth spatial random effects taking advantage of the spatial structure. For the cases considered, SANOVA does in fact, compete well with MCAR. To improve SANOVA in the future, Extend SANOVA to more general MCAR case (ours is intrinsic MCAR). SANOVA in spatiotemporal models. SANOVA in spatial survival models (e.g., Cox model with spatial effects). Further investigations on choice of priors and robustness issues. 42 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

75 References 1 Besag, J, York, JC, and Mollié, A (1991). Bayesian image restoration, with two applications in spatial statistics (with discussion). Annals of the Institute of Statistical Mathematics, 43: Gelfand AE, Vounatsou P (2003). Proper multivariate conditional autoregressive models for spatial data analysis. Biostatistics, 4: Hodges JS, Cui Y, Sargent DJ, Carlin BP (2007). Smoothing balanced single-error-term analysis of variance. Technometrics, 49: Jin X, Carlin BP, Banerjee S (2005) Generalized hierarchical multivariate CAR models for areal data. Biometrics, 61: Jin X, Banerjee S, Carlin BP (2007). Order-free coregionalized lattice models with application to multiple disease mapping. J. Roy. Stat. Soc., Ser. B, 69: Kim H, Sun D, Tsutakawa RK (2001). A bivariate Bayes method for improving the estimates of mortality rates with a twofold conditional autoregressive model. J. Amer. Stat. Assn., 96: Sain SR, Cressie N (2002). Multivariate lattice models for spatial environmental data. In Proc. A.S.A. Section on Statistics and the Environment. Alexandria, VA: American Statistical Association, pp D. Sun, R.K. Tsutakawa, H. Kim and Z. He (2000). Bayesian Analysis of Mortality Rates with Disease Maps. Statistics in Medicine, 19: Zhang, Y., Hodges, J.S. and Banerjee, S. (2009) Smoothed ANOVA with Spatial Effects as a Competitor to MCAR in Multivariate Spatial Smoothing. Annals of Applied Statistics, in press. 43 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

76 Questions Thank You! 44 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

SMOOTHED ANOVA WITH SPATIAL EFFECTS AS A COMPETITOR TO MCAR IN MULTIVARIATE SPATIAL SMOOTHING

SMOOTHED ANOVA WITH SPATIAL EFFECTS AS A COMPETITOR TO MCAR IN MULTIVARIATE SPATIAL SMOOTHING The nnals of pplied Statistics 2009, Vol. 3, No. 4, 1805 1830 DOI: 10.1214/09-OS267 Institute of Mathematical Statistics, 2009 SMOOTHED NOV WITH SPTIL EFFECTS S COMPETITOR TO MCR IN MULTIVRITE SPTIL SMOOTHING

More information

Generalized common spatial factor model

Generalized common spatial factor model Biostatistics (2003), 4, 4,pp. 569 582 Printed in Great Britain Generalized common spatial factor model FUJUN WANG Eli Lilly and Company, Indianapolis, IN 46285, USA MELANIE M. WALL Division of Biostatistics,

More information

Bayesian Areal Wombling for Geographic Boundary Analysis

Bayesian Areal Wombling for Geographic Boundary Analysis Bayesian Areal Wombling for Geographic Boundary Analysis Haolan Lu, Haijun Ma, and Bradley P. Carlin haolanl@biostat.umn.edu, haijunma@biostat.umn.edu, and brad@biostat.umn.edu Division of Biostatistics

More information

Multivariate spatial modeling

Multivariate spatial modeling Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced

More information

arxiv: v1 [stat.ap] 2 Nov 2010

arxiv: v1 [stat.ap] 2 Nov 2010 The nnals of pplied Statistics 2009, Vol. 3, No. 4, 1805 1830 DOI: 10.1214/09-OS267 c Institute of Mathematical Statistics, 2009 arxiv:1011.0646v1 [stat.p] 2 Nov 2010 SMOOTHED NOV WITH SPTIL EFFECTS S

More information

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P. Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis

Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis Haijun Ma, Bradley P. Carlin and Sudipto Banerjee December 8, 2008 Web Appendix A: Selecting

More information

Spatial Smoothing in Stan: Conditional Auto-Regressive Models

Spatial Smoothing in Stan: Conditional Auto-Regressive Models Spatial Smoothing in Stan: Conditional Auto-Regressive Models Charles DiMaggio, PhD, NYU School of Medicine Stephen J. Mooney, PhD, University of Washington Mitzi Morris, Columbia University Dan Simpson,

More information

Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models

Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models By BRIAN J. REICH a, JAMES S. HODGES a, and VESNA ZADNIK b 1 a Division of Biostatistics, School of Public

More information

Small Area Estimation via Multivariate Fay-Herriot Models with Latent Spatial Dependence

Small Area Estimation via Multivariate Fay-Herriot Models with Latent Spatial Dependence Small Area Estimation via Multivariate Fay-Herriot Models with Latent Spatial Dependence Aaron T. Porter 1, Christopher K. Wikle 2, Scott H. Holan 2 arxiv:1310.7211v1 [stat.me] 27 Oct 2013 Abstract The

More information

Default Bayesian Analysis for Hierarchical Spatial Multivariate Models

Default Bayesian Analysis for Hierarchical Spatial Multivariate Models Default Bayesian Analysis for Hierarchical Spatial Multivariate Models Sarat C. Dass, Chae Young Lim and Tapabrata Maiti Department of Statistics & Probability Michigan State University, East Lansing,

More information

Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models

Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models Panudet Saengseedam Nanthachai Kantanantha Abstract This paper proposes a linear mixed model (LMM) with spatial

More information

Fully Bayesian Spatial Analysis of Homicide Rates.

Fully Bayesian Spatial Analysis of Homicide Rates. Fully Bayesian Spatial Analysis of Homicide Rates. Silvio A. da Silva, Luiz L.M. Melo and Ricardo S. Ehlers Universidade Federal do Paraná, Brazil Abstract Spatial models have been used in many fields

More information

Aggregated cancer incidence data: spatial models

Aggregated cancer incidence data: spatial models Aggregated cancer incidence data: spatial models 5 ième Forum du Cancéropôle Grand-est - November 2, 2011 Erik A. Sauleau Department of Biostatistics - Faculty of Medicine University of Strasbourg ea.sauleau@unistra.fr

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee and Andrew O. Finley 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Jon Wakefield Departments of Statistics and Biostatistics University of Washington 1 / 37 Lecture Content Motivation

More information

Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information

Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information p. 1/27 Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information Shengde Liang, Bradley

More information

Spatial Analysis of Incidence Rates: A Bayesian Approach

Spatial Analysis of Incidence Rates: A Bayesian Approach Spatial Analysis of Incidence Rates: A Bayesian Approach Silvio A. da Silva, Luiz L.M. Melo and Ricardo Ehlers July 2004 Abstract Spatial models have been used in many fields of science where the data

More information

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case Areal data models Spatial smoothers Brook s Lemma and Gibbs distribution CAR models Gaussian case Non-Gaussian case SAR models Gaussian case Non-Gaussian case CAR vs. SAR STAR models Inference for areal

More information

Gaussian Process Regression Model in Spatial Logistic Regression

Gaussian Process Regression Model in Spatial Logistic Regression Journal of Physics: Conference Series PAPER OPEN ACCESS Gaussian Process Regression Model in Spatial Logistic Regression To cite this article: A Sofro and A Oktaviarina 018 J. Phys.: Conf. Ser. 947 01005

More information

Bayesian spatial hierarchical modeling for temperature extremes

Bayesian spatial hierarchical modeling for temperature extremes Bayesian spatial hierarchical modeling for temperature extremes Indriati Bisono Dr. Andrew Robinson Dr. Aloke Phatak Mathematics and Statistics Department The University of Melbourne Maths, Informatics

More information

Penalized Loss functions for Bayesian Model Choice

Penalized Loss functions for Bayesian Model Choice Penalized Loss functions for Bayesian Model Choice Martyn International Agency for Research on Cancer Lyon, France 13 November 2009 The pure approach For a Bayesian purist, all uncertainty is represented

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Bayesian Hierarchical Models

Bayesian Hierarchical Models Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology

More information

Disease mapping with Gaussian processes

Disease mapping with Gaussian processes EUROHEIS2 Kuopio, Finland 17-18 August 2010 Aki Vehtari (former Helsinki University of Technology) Department of Biomedical Engineering and Computational Science (BECS) Acknowledgments Researchers - Jarno

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

DEFAULT BAYESIAN ANALYSIS FOR MULTIVARIATE GENERALIZED CAR MODELS

DEFAULT BAYESIAN ANALYSIS FOR MULTIVARIATE GENERALIZED CAR MODELS Statistica Sinica 22 (2012), 231-248 doi:http://dx.doi.org/10.5705/ss.2010.106 DEFAULT BAYESIAN ANALYSIS FOR MULTIVARIATE GENERALIZED CAR MODELS Sarat C. Dass, Chae Young Lim and Tapabrata Maiti Michigan

More information

Comparison of Bayesian Spatio-Temporal Models for Chronic Diseases

Comparison of Bayesian Spatio-Temporal Models for Chronic Diseases Journal of Data Science 8(2010), 189-211 Comparison of Bayesian Spatio-Temporal Models for Chronic Diseases Hoon Kim 1 and HeeJeong Lim 2 1 California State Polytechnic University and 2 California State

More information

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models arxiv:1811.03735v1 [math.st] 9 Nov 2018 Lu Zhang UCLA Department of Biostatistics Lu.Zhang@ucla.edu Sudipto Banerjee UCLA

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters

More information

The Wishart distribution Scaled Wishart. Wishart Priors. Patrick Breheny. March 28. Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/11

The Wishart distribution Scaled Wishart. Wishart Priors. Patrick Breheny. March 28. Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/11 Wishart Priors Patrick Breheny March 28 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/11 Introduction When more than two coefficients vary, it becomes difficult to directly model each element

More information

Summary STK 4150/9150

Summary STK 4150/9150 STK4150 - Intro 1 Summary STK 4150/9150 Odd Kolbjørnsen May 22 2017 Scope You are expected to know and be able to use basic concepts introduced in the book. You knowledge is expected to be larger than

More information

Cross-sectional space-time modeling using ARNN(p, n) processes

Cross-sectional space-time modeling using ARNN(p, n) processes Cross-sectional space-time modeling using ARNN(p, n) processes W. Polasek K. Kakamu September, 006 Abstract We suggest a new class of cross-sectional space-time models based on local AR models and nearest

More information

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model UNIVERSITY OF TEXAS AT SAN ANTONIO Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model Liang Jing April 2010 1 1 ABSTRACT In this paper, common MCMC algorithms are introduced

More information

Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract

Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract Bayesian analysis of a vector autoregressive model with multiple structural breaks Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus Abstract This paper develops a Bayesian approach

More information

Spatio-temporal modeling of avalanche frequencies in the French Alps

Spatio-temporal modeling of avalanche frequencies in the French Alps Available online at www.sciencedirect.com Procedia Environmental Sciences 77 (011) 311 316 1 11 Spatial statistics 011 Spatio-temporal modeling of avalanche frequencies in the French Alps Aurore Lavigne

More information

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline. MFM Practitioner Module: Risk & Asset Allocation September 11, 2013 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 4 for Applied Multivariate Analysis Outline 1 Eigen values and eigen vectors Characteristic equation Some properties of eigendecompositions

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Hierarchical Modeling and Analysis for Spatial Data

Hierarchical Modeling and Analysis for Spatial Data Hierarchical Modeling and Analysis for Spatial Data Bradley P. Carlin, Sudipto Banerjee, and Alan E. Gelfand brad@biostat.umn.edu, sudiptob@biostat.umn.edu, and alan@stat.duke.edu University of Minnesota

More information

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May 5-7 2008 Peter Schlattmann Institut für Biometrie und Klinische Epidemiologie

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

Bayesian Regression Linear and Logistic Regression

Bayesian Regression Linear and Logistic Regression When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we

More information

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modeling for small area health data Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modelling for Small Area

More information

Bayes: All uncertainty is described using probability.

Bayes: All uncertainty is described using probability. Bayes: All uncertainty is described using probability. Let w be the data and θ be any unknown quantities. Likelihood. The probability model π(w θ) has θ fixed and w varying. The likelihood L(θ; w) is π(w

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Bayesian Inference. Chapter 9. Linear models and regression

Bayesian Inference. Chapter 9. Linear models and regression Bayesian Inference Chapter 9. Linear models and regression M. Concepcion Ausin Universidad Carlos III de Madrid Master in Business Administration and Quantitative Methods Master in Mathematical Engineering

More information

The Bayesian Approach to Multi-equation Econometric Model Estimation

The Bayesian Approach to Multi-equation Econometric Model Estimation Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

Riemann Manifold Methods in Bayesian Statistics

Riemann Manifold Methods in Bayesian Statistics Ricardo Ehlers ehlers@icmc.usp.br Applied Maths and Stats University of São Paulo, Brazil Working Group in Statistical Learning University College Dublin September 2015 Bayesian inference is based on Bayes

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,

More information

Hierarchical Modeling for Spatio-temporal Data

Hierarchical Modeling for Spatio-temporal Data Hierarchical Modeling for Spatio-temporal Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of

More information

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2 ARIC Manuscript Proposal # 1186 PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2 1.a. Full Title: Comparing Methods of Incorporating Spatial Correlation in

More information

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS Srinivasan R and Venkatesan P Dept. of Statistics, National Institute for Research Tuberculosis, (Indian Council of Medical Research),

More information

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

Bayesian Nonparametric Spatio-Temporal Models for Disease Incidence Data

Bayesian Nonparametric Spatio-Temporal Models for Disease Incidence Data Bayesian Nonparametric Spatio-Temporal Models for Disease Incidence Data Athanasios Kottas 1, Jason A. Duan 2, Alan E. Gelfand 3 University of California, Santa Cruz 1 Yale School of Management 2 Duke

More information

Bayesian Analysis of Latent Variable Models using Mplus

Bayesian Analysis of Latent Variable Models using Mplus Bayesian Analysis of Latent Variable Models using Mplus Tihomir Asparouhov and Bengt Muthén Version 2 June 29, 2010 1 1 Introduction In this paper we describe some of the modeling possibilities that are

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan

More information

7. Estimation and hypothesis testing. Objective. Recommended reading

7. Estimation and hypothesis testing. Objective. Recommended reading 7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing

More information

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline. Practitioner Course: Portfolio Optimization September 10, 2008 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y ) (x,

More information

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,

More information

Hierarchical Modelling for Univariate and Multivariate Spatial Data

Hierarchical Modelling for Univariate and Multivariate Spatial Data Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 1/4 Hierarchical Modelling for Univariate and Multivariate Spatial Data Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1) HW 1 due today Parameter Estimation Biometrics CSE 190 Lecture 7 Today s lecture was on the blackboard. These slides are an alternative presentation of the material. CSE190, Winter10 CSE190, Winter10 Chapter

More information

Gaussian predictive process models for large spatial data sets.

Gaussian predictive process models for large spatial data sets. Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang Presenters: Halley Brantley and Chris Krut September 28, 2015 Overview

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr

More information

Bayesian Model Diagnostics and Checking

Bayesian Model Diagnostics and Checking Earvin Balderama Quantitative Ecology Lab Department of Forestry and Environmental Resources North Carolina State University April 12, 2013 1 / 34 Introduction MCMCMC 2 / 34 Introduction MCMCMC Steps in

More information

INRIA Rh^one-Alpes. Abstract. Friedman (1989) has proposed a regularization technique (RDA) of discriminant analysis

INRIA Rh^one-Alpes. Abstract. Friedman (1989) has proposed a regularization technique (RDA) of discriminant analysis Regularized Gaussian Discriminant Analysis through Eigenvalue Decomposition Halima Bensmail Universite Paris 6 Gilles Celeux INRIA Rh^one-Alpes Abstract Friedman (1989) has proposed a regularization technique

More information

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Kneib, Fahrmeir: Supplement to Structured additive regression for categorical space-time data: A mixed model approach Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/

More information

Model Assessment and Comparisons

Model Assessment and Comparisons Model Assessment and Comparisons Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

Example using R: Heart Valves Study

Example using R: Heart Valves Study Example using R: Heart Valves Study Goal: Show that the thrombogenicity rate (TR) is less than two times the objective performance criterion R and WinBUGS Examples p. 1/27 Example using R: Heart Valves

More information

Spatial Stochastic Volatility for Lattice Data

Spatial Stochastic Volatility for Lattice Data Spatial Stochastic Volatility for Lattice Data Jun Yan Department of Statistics and Actuarial Science, University of Iowa, Iowa City, IA 2242, U.S.A. March 22, 06 Abstract Spatial heteroskedasticity may

More information

Bayesian (conditionally) conjugate inference for discrete data models. Jon Forster (University of Southampton)

Bayesian (conditionally) conjugate inference for discrete data models. Jon Forster (University of Southampton) Bayesian (conditionally) conjugate inference for discrete data models Jon Forster (University of Southampton) with Mark Grigsby (Procter and Gamble?) Emily Webb (Institute of Cancer Research) Table 1:

More information

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data

More information

Community Health Needs Assessment through Spatial Regression Modeling

Community Health Needs Assessment through Spatial Regression Modeling Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular

More information

Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease-Mapping Models

Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease-Mapping Models Biometrics 62, 1197 1206 December 2006 DOI: 10.1111/j.1541-0420.2006.00617.x Effects of Residual Smoothing on the Posterior of the Fixed Effects in Disease-Mapping Models Brian J. Reich, 1, James S. Hodges,

More information

Expression Data Exploration: Association, Patterns, Factors & Regression Modelling

Expression Data Exploration: Association, Patterns, Factors & Regression Modelling Expression Data Exploration: Association, Patterns, Factors & Regression Modelling Exploring gene expression data Scale factors, median chip correlation on gene subsets for crude data quality investigation

More information

Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models

Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models Kohta Aoki 1 and Hiroshi Nagahashi 2 1 Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology

More information

Simultaneous Estimation of Disease Risks and Spatial Clustering: A Hierarchical Bayes Approach

Simultaneous Estimation of Disease Risks and Spatial Clustering: A Hierarchical Bayes Approach Simultaneous Estimation of Disease Risks and Spatial Clustering: A Hierarchical Bayes Approach Wenning Feng, Chae Young Lim and Tapabrata Maiti Department of Statistics and Probability Michigan State University

More information

BAYESIAN HIERARCHICAL MODELS FOR MISALIGNED DATA: A SIMULATION STUDY

BAYESIAN HIERARCHICAL MODELS FOR MISALIGNED DATA: A SIMULATION STUDY STATISTICA, anno LXXV, n. 1, 2015 BAYESIAN HIERARCHICAL MODELS FOR MISALIGNED DATA: A SIMULATION STUDY Giulia Roli 1 Dipartimento di Scienze Statistiche, Università di Bologna, Bologna, Italia Meri Raggi

More information

arxiv: v1 [stat.ap] 27 Nov 2018

arxiv: v1 [stat.ap] 27 Nov 2018 A spatially varying change points model for monitoring glaucoma progression using visual field data Samuel I. Berchuck a, Jean-Claude Mwanza b, Joshua L. Warren c, arxiv:1811.138v1 [stat.ap] 27 Nov 18

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

Collinearity/Confounding in richly-parameterized models

Collinearity/Confounding in richly-parameterized models Collinearity/Confounding in richly-parameterized models For MLM, it often makes sense to consider X + Zu to be the mean structure, especially for new-style random e ects. This is like an ordinary linear

More information

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline: CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity Outline: 1. NIEHS Uterine Fibroid Study Design of Study Scientific Questions Difficulties 2.

More information

1 Data Arrays and Decompositions

1 Data Arrays and Decompositions 1 Data Arrays and Decompositions 1.1 Variance Matrices and Eigenstructure Consider a p p positive definite and symmetric matrix V - a model parameter or a sample variance matrix. The eigenstructure is

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing

More information

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

On Gaussian Process Models for High-Dimensional Geostatistical Datasets On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May

More information

Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States

Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Laura A. Hatfield and Bradley P. Carlin Division of Biostatistics School of Public Health University

More information

Bayesian data analysis in practice: Three simple examples

Bayesian data analysis in practice: Three simple examples Bayesian data analysis in practice: Three simple examples Martin P. Tingley Introduction These notes cover three examples I presented at Climatea on 5 October 0. Matlab code is available by request to

More information