Bayesian Areal Wombling for Geographic Boundary Analysis
|
|
- Rosaline Riley
- 6 years ago
- Views:
Transcription
1 Bayesian Areal Wombling for Geographic Boundary Analysis Haolan Lu, Haijun Ma, and Bradley P. Carlin and Division of Biostatistics School of Public Health University of Minnesota Bayesian Areal Wombling for Geographic Boundary Analysis p. 1/30
2 Background Spatial data typically classified two ways: point-referenced (geostatistical): spatial locations are points with known coordinates areal (lattice): locations are geographic regions (e.g., counties) with adjacency information Bayesian Areal Wombling for Geographic Boundary Analysis p. 2/30
3 Background Spatial data typically classified two ways: point-referenced (geostatistical): spatial locations are points with known coordinates areal (lattice): locations are geographic regions (e.g., counties) with adjacency information Important topic in spatial statistics: boundary analysis, wherein we seek to identify regions of abrupt change. Indicated by: steep gradients in a continuous surface regional boundaries separating regions with drastically different measurements in a lattice surface Bayesian Areal Wombling for Geographic Boundary Analysis p. 2/30
4 Background Spatial data typically classified two ways: point-referenced (geostatistical): spatial locations are points with known coordinates areal (lattice): locations are geographic regions (e.g., counties) with adjacency information Important topic in spatial statistics: boundary analysis, wherein we seek to identify regions of abrupt change. Indicated by: steep gradients in a continuous surface regional boundaries separating regions with drastically different measurements in a lattice surface Focus in this talk: Boundary analysis for areal data Bayesian Areal Wombling for Geographic Boundary Analysis p. 2/30
5 Background (cont d) Spatial boundary analysis techniques often called wombling, after a foundational paper by Womble (1951). Bayesian Areal Wombling for Geographic Boundary Analysis p. 3/30
6 Background (cont d) Spatial boundary analysis techniques often called wombling, after a foundational paper by Womble (1951). In areal (or polygonal) wombling, a dissimilarity metric measures the difference between adjacent regions. Methods for choosing boundary elements: absolute (dissimilarity metrics greater than C) relative (dissimilarity metrics in the top k%) Bayesian Areal Wombling for Geographic Boundary Analysis p. 3/30
7 Background (cont d) Spatial boundary analysis techniques often called wombling, after a foundational paper by Womble (1951). In areal (or polygonal) wombling, a dissimilarity metric measures the difference between adjacent regions. Methods for choosing boundary elements: absolute (dissimilarity metrics greater than C) relative (dissimilarity metrics in the top k%) Problems: Relative (top k%) thresholding method always finds a fixed number of boundary elements Approach is algorithmic, rather than stochastic: no model or likelihood, so statements about the significance of a detected boundary only relative to predetermined, often unrealistic null distributions. Bayesian Areal Wombling for Geographic Boundary Analysis p. 3/30
8 Crisp vs. Fuzzy Areal Wombling Suppose we have responses Z i for regions i = 1,...,n For neighboring regions i and j and some distance metric, assign the boundary likelihood value (BLV) D ij = Z i Z j. Bayesian Areal Wombling for Geographic Boundary Analysis p. 4/30
9 Crisp vs. Fuzzy Areal Wombling Suppose we have responses Z i for regions i = 1,...,n For neighboring regions i and j and some distance metric, assign the boundary likelihood value (BLV) D ij = Z i Z j. Crisp wombling: Boundary is those edges having BLV s above specified thresholds, i.e., for some c > 0, {(i,j) : D ij > c, i adjacent to j}. Bayesian Areal Wombling for Geographic Boundary Analysis p. 4/30
10 Crisp vs. Fuzzy Areal Wombling Suppose we have responses Z i for regions i = 1,...,n For neighboring regions i and j and some distance metric, assign the boundary likelihood value (BLV) D ij = Z i Z j. Crisp wombling: Boundary is those edges having BLV s above specified thresholds, i.e., for some c > 0, {(i,j) : D ij > c, i adjacent to j}. Fuzzy wombling: Partial membership in the boundary is permitted, say by defining the boundary membership values (BMVs) D ij / max{d ij } ij Bayesian Areal Wombling for Geographic Boundary Analysis p. 4/30
11 Example: MN colorectal cancer data Here n i is the total number of colorectal cancers occurring in county i, and Y i is the number of these that were detected late. Let Z i = SLDR i = Y i E i, i = 1,...,N, the standardized late detection ratio (SLDR), where the expected counts are computed via internal standardization as E i = n i r, and r = i Y i/ i n i, the statewide late detection rate. Bayesian Areal Wombling for Geographic Boundary Analysis p. 5/30
12 Example: MN colorectal cancer data Here n i is the total number of colorectal cancers occurring in county i, and Y i is the number of these that were detected late. Let Z i = SLDR i = Y i E i, i = 1,...,N, the standardized late detection ratio (SLDR), where the expected counts are computed via internal standardization as E i = n i r, and r = i Y i/ i n i, the statewide late detection rate. Left panel of next slide shows BoundarySEER plots of traditional wombled boundaries... Bayesian Areal Wombling for Geographic Boundary Analysis p. 5/30
13 Example: MN colorectal cancer data Left: BoundarySeer choropleth map of colorectal cancer late detection SLDRs, and crisp wombled boundaries arising from the top 20% (red) and 50% (yellow) of the D ij. Bayesian Areal Wombling for Geographic Boundary Analysis p. 6/30
14 Example: MN colorectal cancer data Left: BoundarySeer choropleth map of colorectal cancer late detection SLDRs, and crisp wombled boundaries arising from the top 20% (red) and 50% (yellow) of the D ij. Right: "Tricked" BoundarySeer map of fitted Bayesian colorectal cancer detection SLDRs, and Bayesian wombled boundaries based on the top 20% of the Ê( ij y) values using G = 1500 Gibbs samples. Bayesian Areal Wombling for Geographic Boundary Analysis p. 6/30
15 Hierarchical modeling approach We employ the Poisson log-linear form Y i Poisson(µ i ) where log µ i = log E i + x iβ + φ i. where x i are region-specific covariates, and Bayesian Areal Wombling for Geographic Boundary Analysis p. 7/30
16 Hierarchical modeling approach We employ the Poisson log-linear form Y i Poisson(µ i ) where log µ i = log E i + x iβ + φ i. where x i are region-specific covariates, and the random effects φ = (φ 1,...,φ N ) are given a conditionally autoregressive CAR(τ) specification, φ i φ j i N( φ i, 1/(τm i )), where N denotes the normal distribution, φ i is the average of the φ j i that are adjacent to φ i, and m i is the number of these adjacencies. Bayesian Areal Wombling for Geographic Boundary Analysis p. 7/30
17 Hierarchical modeling approach Define the BLV for boundary (i,j) as ij = η i η j for all i adjacent to j, with η i = µ i /E i. Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30
18 Hierarchical modeling approach Define the BLV for boundary (i,j) as ij = η i η j for all i adjacent to j, with η i = µ i /E i. BLVs posterior distribution = wombled boundaries! Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30
19 Hierarchical modeling approach Define the BLV for boundary (i,j) as with η i = µ i /E i. ij = η i η j for all i adjacent to j, BLVs posterior distribution = wombled boundaries! Crisp: Define ij to be part of the boundary if and only if E( ij y) > c for some constant c > 0, or if and only if P( ij c y) > c for some constant 0 < c < 1. Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30
20 Hierarchical modeling approach Define the BLV for boundary (i,j) as with η i = µ i /E i. ij = η i η j for all i adjacent to j, BLVs posterior distribution = wombled boundaries! Crisp: Define ij to be part of the boundary if and only if E( ij y) > c for some constant c > 0, or if and only if P( ij c y) > c for some constant 0 < c < 1. Fuzzy: P( ij c y) is itself the fuzzy BMV, with MC estimate and associated sd ˆp ij = # (g) ij > c G and ˆpij (1 ˆp ij ) G, retaining only every M th sample, for a total of G. Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30
21 MN colorectal cancer data revisited means ˆp ij : associated sd s: Posterior probability areal wombling maps for the Minnesota colorectal cancer detection data using three illustrative values of c (5, 15, and 30%), M = 5, and G = Bayesian Areal Wombling for Geographic Boundary Analysis p. 9/30
22 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30
23 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) The ˆp maps suggest little evidence of strong boundaries between counties; only a few county boundaries estimated to separate regions with true SLDRs that differ by more than 15%. Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30
24 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) The ˆp maps suggest little evidence of strong boundaries between counties; only a few county boundaries estimated to separate regions with true SLDRs that differ by more than 15%. The standard deviation plots reveal that the overall uncertainty associated with each segment tends to decrease for the more extreme c (5 and 30), as we become more certain that most segments either are or are not part of the boundary. Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30
25 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) The ˆp maps suggest little evidence of strong boundaries between counties; only a few county boundaries estimated to separate regions with true SLDRs that differ by more than 15%. The standard deviation plots reveal that the overall uncertainty associated with each segment tends to decrease for the more extreme c (5 and 30), as we become more certain that most segments either are or are not part of the boundary. Animated sequences of crisp boundaries (as in haolanl/movie.gif) may be more enlightening than maps of posterior standard deviations... Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30
26 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30
27 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30
28 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). If we womble the spatial residuals φ i (instead of the η i ), Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30
29 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). If we womble the spatial residuals φ i (instead of the η i ), Boundaries now separate regions with differing unmodeled heterogeneity = help identify missing covariates! Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30
30 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). If we womble the spatial residuals φ i (instead of the η i ), Boundaries now separate regions with differing unmodeled heterogeneity = help identify missing covariates! No significant boundaries = (covariate-adjusted) mapwide equity, a.k.a. environmental justice Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30
31 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30
32 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. The CAR remains a valid distributional specification provided 0 w ij 1 = more possibilities for spatial smoothing: Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30
33 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. The CAR remains a valid distributional specification provided 0 w ij 1 = more possibilities for spatial smoothing: Choose the w ij inversely proportional to the distance separating the centroids of regions i and j, or Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30
34 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. The CAR remains a valid distributional specification provided 0 w ij 1 = more possibilities for spatial smoothing: Choose the w ij inversely proportional to the distance separating the centroids of regions i and j, or Think of the w ij as additional unknown parameters to be estimated, allowing the data to help determine the degree and nature of spatial smoothing. Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30
35 Alternative: Modeling adjacency Mimicking an idea from statistical social network analysis (Wang and Wong, 1987; Hoff et al., 2002), model the w ij as w ij p ij Bernoulli(p ij ), where log ( pij 1 p ij ) = z ijγ. Bayesian Areal Wombling for Geographic Boundary Analysis p. 13/30
36 Alternative: Modeling adjacency Mimicking an idea from statistical social network analysis (Wang and Wong, 1987; Hoff et al., 2002), model the w ij as w ij p ij Bernoulli(p ij ), where log ( pij 1 p ij Example: Let z 1ij = 1 (so that γ 1 is an intercept parameter), ) = z ijγ. z 2ij = d ij, the distance between the centroids of regions i and j, z 3ij = (area i + area j )/2, the average area of the two regions, and z 4ij = x i x j, the absolute difference of some regional covariate (say, percent urban, or percent of residents who are smokers). Bayesian Areal Wombling for Geographic Boundary Analysis p. 13/30
37 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30
38 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30
39 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30
40 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. MCMC sampling of φ, τ, β, γ, and W can proceed by a tuned mixture of Gibbs and Metropolis steps. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30
41 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. MCMC sampling of φ, τ, β, γ, and W can proceed by a tuned mixture of Gibbs and Metropolis steps. Try it with simulated data, arising from γ 0 = 1,γ 1 = 2,β 0 = 1, and τ = 30. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30
42 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. MCMC sampling of φ, τ, β, γ, and W can proceed by a tuned mixture of Gibbs and Metropolis steps. Try it with simulated data, arising from γ 0 = 1,γ 1 = 2,β 0 = 1, and τ = 30. The simulation of φ is facilitated by WinBUGS, and log E i s are borrowed from a Minnesota breast cancer late detection data set. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30
43 Example: Simulated data Left: Map of simulated "true" boundaries (1 w ij ), where thick dark lines indicate boundaries and thin blue lines indicate adjacency between two regions. Bayesian Areal Wombling for Geographic Boundary Analysis p. 15/30
44 Example: Simulated data Left: Map of simulated "true" boundaries (1 w ij ), where thick dark lines indicate boundaries and thin blue lines indicate adjacency between two regions. Right: Map of simulated "true" φ i on a blue color scale. Bayesian Areal Wombling for Geographic Boundary Analysis p. 15/30
45 Example: Simulated data Left: Map of simulated "true" boundaries (1 w ij ), where thick dark lines indicate boundaries and thin blue lines indicate adjacency between two regions. Right: Map of simulated "true" φ i on a blue color scale. Note: Most of the boundaries in the left panel are preserved in the right panel! Bayesian Areal Wombling for Geographic Boundary Analysis p. 15/30
46 Example: Simulated data Left: Map of posterior mean of 1 p ij with corresponding boundaries shaded in color. Note: The majority of these lines match with those in L panel of previous figure! Bayesian Areal Wombling for Geographic Boundary Analysis p. 16/30
47 Example: Simulated data Left: Map of posterior mean of 1 p ij with corresponding boundaries shaded in color. Note: The majority of these lines match with those in L panel of previous figure! Right: Map of posterior means of φ i on blue color scale. Note: Overall spatial pattern remains, but with smoothing of the φ i among neighbors. Bayesian Areal Wombling for Geographic Boundary Analysis p. 16/30
48 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30
49 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. That is, given that a segment is part of the boundary, we d like our model to favor including neighboring segments in the boundary as well. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30
50 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. That is, given that a segment is part of the boundary, we d like our model to favor including neighboring segments in the boundary as well. But the CAR model is ideal for this type of modeling! We simply need to define a second CAR model on the boundary segment space, in addition to the one we already have on the areal unit space. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30
51 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. That is, given that a segment is part of the boundary, we d like our model to favor including neighboring segments in the boundary as well. But the CAR model is ideal for this type of modeling! We simply need to define a second CAR model on the boundary segment space, in addition to the one we already have on the areal unit space. That is, if W is the N N (random) adjacency matrix for the counties, now we also have W, an N adj N adj (fixed) adjacency matrix for the boundary segments, where N adj is the total number of unique county adjacencies. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30
52 New Approach: 2-Level CAR That is, we need to expand our logit model to ( ) pij log = z 1 p ijγ + ψ ij, ij where ψ CAR(λ,W ) and W is a 0-1 adjacency matrix on the boundary segment space, with adjacency defined by the map (fixed; no covariates this time). Bayesian Areal Wombling for Geographic Boundary Analysis p. 18/30
53 New Approach: 2-Level CAR That is, we need to expand our logit model to ( ) pij log = z 1 p ijγ + ψ ij, ij where ψ CAR(λ,W ) and W is a 0-1 adjacency matrix on the boundary segment space, with adjacency defined by the map (fixed; no covariates this time). Boundaries may again be obtained from posterior summarization of the ij, the p ij, or the w ij. Bayesian Areal Wombling for Geographic Boundary Analysis p. 18/30
54 New Approach: 2-Level CAR That is, we need to expand our logit model to ( ) pij log = z 1 p ijγ + ψ ij, ij where ψ CAR(λ,W ) and W is a 0-1 adjacency matrix on the boundary segment space, with adjacency defined by the map (fixed; no covariates this time). Boundaries may again be obtained from posterior summarization of the ij, the p ij, or the w ij. To check, we generate data from a 4 4 "template" having true boundary" separating 6 regions (the 4 in the bottom row and the middle two in the 3rd row) from the rest... Bayesian Areal Wombling for Geographic Boundary Analysis p. 18/30
55 New Approach: 2-Level CAR Raw Data L&C, Pr(delta_ij>c data) [ 2.17, 1.51) [ 1.51, 0.85) [ 0.85, 0.19) [ 0.19,0.47) [0.47,1.13) [1.13,1.79] [0.06,0.75] (0.75,0.94] Left: Map of a simulated data set from our model. Bayesian Areal Wombling for Geographic Boundary Analysis p. 19/30
56 New Approach: 2-Level CAR Raw Data L&C, Pr(delta_ij>c data) [ 2.17, 1.51) [ 1.51, 0.85) [ 0.85, 0.19) [ 0.19,0.47) [0.47,1.13) [1.13,1.79] [0.06,0.75] (0.75,0.94] Left: Map of a simulated data set from our model. Right: Posterior probability that ij exceeds the cutoff c, LC method (nonrandom adjacency matrix W ). Bayesian Areal Wombling for Geographic Boundary Analysis p. 19/30
57 New Approach: 2-Level CAR Raw Data L&C, Pr(delta_ij>c data) [ 2.17, 1.51) [ 1.51, 0.85) [ 0.85, 0.19) [ 0.19,0.47) [0.47,1.13) [1.13,1.79] [0.06,0.75] (0.75,0.94] Left: Map of a simulated data set from our model. Right: Posterior probability that ij exceeds the cutoff c, LC method (nonrandom adjacency matrix W ). Note that there are six true" boundary segments; LC finds only 3 of them. It is badly fooled by the surprisingly low values in the northeast corner and the surprisingly high value in the 3rd row, last column. Bayesian Areal Wombling for Geographic Boundary Analysis p. 19/30
58 Results: 2-Level CAR CAR2, Pr(delta_ij>c data) CAR2, post.mean of (1 pij) [0.07,0.82] (0.82,0.97] [0.12,0.17) [0.17,0.21) [0.21,0.25) [0.25,0.29) [0.29,0.34) [0.34,0.38] Left: Posterior probability that ij exceeds the cutoff c (fuzzy boundaries), 2-Level CAR (CAR2). Bayesian Areal Wombling for Geographic Boundary Analysis p. 20/30
59 Results: 2-Level CAR CAR2, Pr(delta_ij>c data) CAR2, post.mean of (1 pij) [0.07,0.82] (0.82,0.97] [0.12,0.17) [0.17,0.21) [0.21,0.25) [0.25,0.29) [0.29,0.34) [0.34,0.38] Left: Posterior probability that ij exceeds the cutoff c (fuzzy boundaries), 2-Level CAR (CAR2). Right: Posterior mean of 1 p ij, 2-Level CAR. Bayesian Areal Wombling for Geographic Boundary Analysis p. 20/30
60 Results: 2-Level CAR CAR2, Pr(delta_ij>c data) CAR2, post.mean of (1 pij) [0.07,0.82] (0.82,0.97] [0.12,0.17) [0.17,0.21) [0.21,0.25) [0.25,0.29) [0.29,0.34) [0.34,0.38] Left: Posterior probability that ij exceeds the cutoff c (fuzzy boundaries), 2-Level CAR (CAR2). Right: Posterior mean of 1 p ij, 2-Level CAR. When applied to the ij s, the CAR2 method performs no better than LC. But when applied to the p ij s, the method now correctly finds all six boundary segments. Bayesian Areal Wombling for Geographic Boundary Analysis p. 20/30
61 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30
62 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30
63 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30
64 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Try to increase the impact of response data Y i on the wombled boundaries (relative to the covariates Z ij ) while retaining boundary connectedness Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30
65 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Try to increase the impact of response data Y i on the wombled boundaries (relative to the covariates Z ij ) while retaining boundary connectedness by eliminating the CAR(τ,W) model and the φ i s altogether (but retaining the CAR(λ,W ) and the ψ ij ) Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30
66 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Try to increase the impact of response data Y i on the wombled boundaries (relative to the covariates Z ij ) while retaining boundary connectedness by eliminating the CAR(τ,W) model and the φ i s altogether (but retaining the CAR(λ,W ) and the ψ ij ) by retaining both CAR models, but fitting them simultaneously, rather than hierarchically Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30
67 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30
68 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Suppose D ij N(η ij, 1/τ e ), where η ij = γ 0 + z ij γ 1 + ψ ij. Let z ij = x i x j (again, difference not absolute difference here), and we again let ψ CAR(λ,W ). Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30
69 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Suppose D ij N(η ij, 1/τ e ), where η ij = γ 0 + z ij γ 1 + ψ ij. Let z ij = x i x j (again, difference not absolute difference here), and we again let ψ CAR(λ,W ). This model eliminates the hard-to-estimate W ij (Bernoulli) parameters, and retains only the second" CAR model, which captures the similarity of neighboring boundary segments on the map. Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30
70 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Suppose D ij N(η ij, 1/τ e ), where η ij = γ 0 + z ij γ 1 + ψ ij. Let z ij = x i x j (again, difference not absolute difference here), and we again let ψ CAR(λ,W ). This model eliminates the hard-to-estimate W ij (Bernoulli) parameters, and retains only the second" CAR model, which captures the similarity of neighboring boundary segments on the map. This model should deliver ψ ij -based wombled boundaries that are more connected than before Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30
71 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30
72 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30
73 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: ψ ij 0 when Y i and Y j are both larger or both smaller than expected Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30
74 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: ψ ij 0 when Y i and Y j are both larger or both smaller than expected ψ ij 0 when one of Y i or Y i is bigger than expected, and the other is smaller Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30
75 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: ψ ij 0 when Y i and Y j are both larger or both smaller than expected ψ ij 0 when one of Y i or Y i is bigger than expected, and the other is smaller Might even also contemplate interaction between the φ i and the ψ ij! Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30
76 Another issue: Multivariate data Suppose we have multivariate disease counts Y ki for disease k in county i. A multivariate hierarchical areal wombling approach might model Y ki ind Poisson(µ ki ), k = 1,...,K > 1, and φ i = (φ 1i,...,φ Ki ) MCAR(α, Λ). Wombling may now proceed much as before, but with a Multivariate CAR model that captures correlation both across space and among the K variables. Bayesian Areal Wombling for Geographic Boundary Analysis p. 24/30
77 Another issue: Multivariate data Suppose we have multivariate disease counts Y ki for disease k in county i. A multivariate hierarchical areal wombling approach might model Y ki ind Poisson(µ ki ), k = 1,...,K > 1, and φ i = (φ 1i,...,φ Ki ) MCAR(α, Λ). Wombling may now proceed much as before, but with a Multivariate CAR model that captures correlation both across space and among the K variables. Several conditional (e.g., φ 1 followed by φ 2 φ 1 ) and marginal (obtained jointly by coregionalization) MCAR models are available, many of which can be fit in WinBUGS! Bayesian Areal Wombling for Geographic Boundary Analysis p. 24/30
78 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30
79 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30
80 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Goal is to estimate which zips are covered" and which are not covered" by a particular hospice system. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30
81 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Goal is to estimate which zips are covered" and which are not covered" by a particular hospice system. In the Duluth area there are two hospice systems (St. Mary s and St. Luke s). Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30
82 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Goal is to estimate which zips are covered" and which are not covered" by a particular hospice system. In the Duluth area there are two hospice systems (St. Mary s and St. Luke s). We seek to draw both boundaries around the service regions of both hospices (both invidually and jointly), while accounting for any correlation between the two. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30
83 Raw Hospice Data MN: Zipcodes Served by Duluth Hospices Medicare Raw Data Map Not Served St. Mary Only St. Luke Only Served by both Bayesian Areal Wombling for Geographic Boundary Analysis p. 26/30
84 Results of straight Bayesian approach MN: Zipcodes Served by Duluth Hospices Model Based Map Not Served St. Mary Only St. Luke Only Served by both Bayesian Areal Wombling for Geographic Boundary Analysis p. 27/30
85 Bayes with Backfilling" results MN: Zipcodes Served by Duluth Hospices Model Based Map Not Served St. Mary Only St. Luke Only Served by both Bayesian Areal Wombling for Geographic Boundary Analysis p. 28/30
86 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30
87 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Positive, since if the St. Mary s deaths in a zip are high, the St. Luke s deaths are also likely to be high (consistency of cancer control" across regions). Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30
88 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Positive, since if the St. Mary s deaths in a zip are high, the St. Luke s deaths are also likely to be high (consistency of cancer control" across regions). Negative, since the two hospices are in competition for patients, and the resulting zero-sum game means negative correlation among hospices within zip (though there may still be positive spatial correlation). Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30
89 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Positive, since if the St. Mary s deaths in a zip are high, the St. Luke s deaths are also likely to be high (consistency of cancer control" across regions). Negative, since the two hospices are in competition for patients, and the resulting zero-sum game means negative correlation among hospices within zip (though there may still be positive spatial correlation). So we re not sure! But fortunately, the MCAR(α, Λ) distribution will automatically accommodate either possibility! Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30
90 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30
91 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Variance-based, univariate: Use means or quantiles of p ij or w ij as before, since usual MCAR has only one adjacency matrix W. This then gives boundaries for the larger region served by any hospice. Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30
92 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Variance-based, univariate: Use means or quantiles of p ij or w ij as before, since usual MCAR has only one adjacency matrix W. This then gives boundaries for the larger region served by any hospice. Variance-based, bivariate: Use an MCAR with two adjacency matrices, one for each hospice???... Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30
93 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Variance-based, univariate: Use means or quantiles of p ij or w ij as before, since usual MCAR has only one adjacency matrix W. This then gives boundaries for the larger region served by any hospice. Variance-based, bivariate: Use an MCAR with two adjacency matrices, one for each hospice???... We plan to investigate all 3! Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30
Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis
Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis Haijun Ma, Bradley P. Carlin and Sudipto Banerjee December 8, 2008 Web Appendix A: Selecting
More informationAnalysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information
Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information p. 1/27 Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information Shengde Liang, Bradley
More informationMultivariate spatial modeling
Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced
More informationApproaches for Multiple Disease Mapping: MCAR and SANOVA
Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, 2015 1 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR
More informationHierarchical Modeling and Analysis for Spatial Data
Hierarchical Modeling and Analysis for Spatial Data Bradley P. Carlin, Sudipto Banerjee, and Alan E. Gelfand brad@biostat.umn.edu, sudiptob@biostat.umn.edu, and alan@stat.duke.edu University of Minnesota
More informationGeneralized common spatial factor model
Biostatistics (2003), 4, 4,pp. 569 582 Printed in Great Britain Generalized common spatial factor model FUJUN WANG Eli Lilly and Company, Indianapolis, IN 46285, USA MELANIE M. WALL Division of Biostatistics,
More informationFrailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.
Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk
More informationAreal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case
Areal data models Spatial smoothers Brook s Lemma and Gibbs distribution CAR models Gaussian case Non-Gaussian case SAR models Gaussian case Non-Gaussian case CAR vs. SAR STAR models Inference for areal
More informationSPATIAL ANALYSIS & MORE
SPATIAL ANALYSIS & MORE Thomas A. Louis, PhD Department of Biostatistics Johns Hopins Bloomberg School of Public Health www.biostat.jhsph.edu/ tlouis/ tlouis@jhu.edu 1/28 Outline Prediction vs Inference
More informationBayesian Linear Regression
Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective
More informationSpatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States
Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Laura A. Hatfield and Bradley P. Carlin Division of Biostatistics School of Public Health University
More informationSTA 216, GLM, Lecture 16. October 29, 2007
STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters
More informationExample using R: Heart Valves Study
Example using R: Heart Valves Study Goal: Show that the thrombogenicity rate (TR) is less than two times the objective performance criterion R and WinBUGS Examples p. 1/27 Example using R: Heart Valves
More informationCluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May
Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May 5-7 2008 Peter Schlattmann Institut für Biometrie und Klinische Epidemiologie
More informationBayesian Hierarchical Models
Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee and Andrew O. Finley 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department
More informationSpatio-Temporal Modelling of Credit Default Data
1/20 Spatio-Temporal Modelling of Credit Default Data Sathyanarayan Anand Advisor: Prof. Robert Stine The Wharton School, University of Pennsylvania April 29, 2011 2/20 Outline 1 Background 2 Conditional
More informationFully Bayesian Spatial Analysis of Homicide Rates.
Fully Bayesian Spatial Analysis of Homicide Rates. Silvio A. da Silva, Luiz L.M. Melo and Ricardo S. Ehlers Universidade Federal do Paraná, Brazil Abstract Spatial models have been used in many fields
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationMarkov Chain Monte Carlo
Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).
More informationStat 542: Item Response Theory Modeling Using The Extended Rank Likelihood
Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal
More informationBayesian data analysis in practice: Three simple examples
Bayesian data analysis in practice: Three simple examples Martin P. Tingley Introduction These notes cover three examples I presented at Climatea on 5 October 0. Matlab code is available by request to
More informationModels for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data
Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise
More informationspbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models
spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments
More informationMcGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.
McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-675 Lawrence Joseph Bayesian Analysis for the Health Sciences EPIB-675 3 credits Instructor:
More informationDisease mapping with Gaussian processes
EUROHEIS2 Kuopio, Finland 17-18 August 2010 Aki Vehtari (former Helsinki University of Technology) Department of Biomedical Engineering and Computational Science (BECS) Acknowledgments Researchers - Jarno
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationSpatial Analysis of Incidence Rates: A Bayesian Approach
Spatial Analysis of Incidence Rates: A Bayesian Approach Silvio A. da Silva, Luiz L.M. Melo and Ricardo Ehlers July 2004 Abstract Spatial models have been used in many fields of science where the data
More informationBeyond MCMC in fitting complex Bayesian models: The INLA method
Beyond MCMC in fitting complex Bayesian models: The INLA method Valeska Andreozzi Centre of Statistics and Applications of Lisbon University (valeska.andreozzi at fc.ul.pt) European Congress of Epidemiology
More informationLatent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent
Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary
More informationIndex. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables.
Index Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables. Adaptive rejection metropolis sampling (ARMS), 98 Adaptive shrinkage, 132 Advanced Photo System (APS), 255 Aggregation
More informationBayesian Regression Linear and Logistic Regression
When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we
More informationCommunity Health Needs Assessment through Spatial Regression Modeling
Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department
More informationPhysician Performance Assessment / Spatial Inference of Pollutant Concentrations
Physician Performance Assessment / Spatial Inference of Pollutant Concentrations Dawn Woodard Operations Research & Information Engineering Cornell University Johns Hopkins Dept. of Biostatistics, April
More informationIntroduction to Spatial Analysis. Spatial Analysis. Session organization. Learning objectives. Module organization. GIS and spatial analysis
Introduction to Spatial Analysis I. Conceptualizing space Session organization Module : Conceptualizing space Module : Spatial analysis of lattice data Module : Spatial analysis of point patterns Module
More informationIntroduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016
Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An
More informationSpatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models
Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models Panudet Saengseedam Nanthachai Kantanantha Abstract This paper proposes a linear mixed model (LMM) with spatial
More informationMultivariate Survival Analysis
Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in
More informationPart 8: GLMs and Hierarchical LMs and GLMs
Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More informationReport and Opinion 2016;8(6) Analysis of bivariate correlated data under the Poisson-gamma model
Analysis of bivariate correlated data under the Poisson-gamma model Narges Ramooz, Farzad Eskandari 2. MSc of Statistics, Allameh Tabatabai University, Tehran, Iran 2. Associate professor of Statistics,
More informationChapter 4 Dynamic Bayesian Networks Fall Jin Gu, Michael Zhang
Chapter 4 Dynamic Bayesian Networks 2016 Fall Jin Gu, Michael Zhang Reviews: BN Representation Basic steps for BN representations Define variables Define the preliminary relations between variables Check
More informationNearest Neighbor Gaussian Processes for Large Spatial Data
Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns
More informationMcGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-682.
McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-682 Lawrence Joseph Intro to Bayesian Analysis for the Health Sciences EPIB-682 2 credits
More informationInclusion of Non-Street Addresses in Cancer Cluster Analysis
Inclusion of Non-Street Addresses in Cancer Cluster Analysis Sue-Min Lai, Zhimin Shen, Darin Banks Kansas Cancer Registry University of Kansas Medical Center KCR (Kansas Cancer Registry) KCR: population-based
More informationStatistícal Methods for Spatial Data Analysis
Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department
More informationNonparametric Bayesian Methods (Gaussian Processes)
[70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent
More informationA Fully Nonparametric Modeling Approach to. BNP Binary Regression
A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012 Outline 1 2 3 Simulation
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationAndrew B. Lawson 2019 BMTRY 763
BMTRY 763 FMD Foot and mouth disease (FMD) is a viral disease of clovenfooted animals and being extremely contagious it spreads rapidly. It reduces animal production, and can sometimes be fatal in young
More informationBayesian Linear Models
Bayesian Linear Models Sudipto Banerjee September 03 05, 2017 Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles Linear Regression Linear regression is,
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationAreal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets
Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Chapter 3: Basics of Areal Data Models p. 1/18 Areal Unit Data Regular or Irregular Grids
More informationMarkov Networks.
Markov Networks www.biostat.wisc.edu/~dpage/cs760/ Goals for the lecture you should understand the following concepts Markov network syntax Markov network semantics Potential functions Partition function
More informationUsing Estimating Equations for Spatially Correlated A
Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship
More informationMarkov chain Monte Carlo
1 / 26 Markov chain Monte Carlo Timothy Hanson 1 and Alejandro Jara 2 1 Division of Biostatistics, University of Minnesota, USA 2 Department of Statistics, Universidad de Concepción, Chile IAP-Workshop
More informationComputational statistics
Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated
More informationPart 6: Multivariate Normal and Linear Models
Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of
More informationEffects of residual smoothing on the posterior of the fixed effects in disease-mapping models
Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models By BRIAN J. REICH a, JAMES S. HODGES a, and VESNA ZADNIK b 1 a Division of Biostatistics, School of Public
More informationBayesian Inference. Chapter 4: Regression and Hierarchical Models
Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationBayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang
Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January
More informationSpatial Misalignment
Spatial Misalignment Let s use the terminology "points" (for point-level observations) and "blocks" (for areal-level summaries) Chapter 6: Spatial Misalignment p. 1/5 Spatial Misalignment Let s use the
More informationAdvanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller
Advanced Methods for Agricultural and Agroenvironmental Monitoring Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Outline 1. Introduction to the National Resources Inventory 2. Hierarchical
More informationAccounting for Complex Sample Designs via Mixture Models
Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3
More informationBayesian Inference. Chapter 4: Regression and Hierarchical Models
Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Advanced Statistics and Data Mining Summer School
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate
More informationTechnical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models
Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University
More informationDesign of Text Mining Experiments. Matt Taddy, University of Chicago Booth School of Business faculty.chicagobooth.edu/matt.
Design of Text Mining Experiments Matt Taddy, University of Chicago Booth School of Business faculty.chicagobooth.edu/matt.taddy/research Active Learning: a flavor of design of experiments Optimal : consider
More informationI don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph
Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and
More informationBayesian Mixture Modeling
University of California, Merced July 21, 2014 Mplus Users Meeting, Utrecht Organization of the Talk Organization s modeling estimation framework Motivating examples duce the basic LCA model Illustrated
More informationBayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models
Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models Kohta Aoki 1 and Hiroshi Nagahashi 2 1 Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North
More informationRiemann Manifold Methods in Bayesian Statistics
Ricardo Ehlers ehlers@icmc.usp.br Applied Maths and Stats University of São Paulo, Brazil Working Group in Statistical Learning University College Dublin September 2015 Bayesian inference is based on Bayes
More informationMultivariate Normal & Wishart
Multivariate Normal & Wishart Hoff Chapter 7 October 21, 2010 Reading Comprehesion Example Twenty-two children are given a reading comprehsion test before and after receiving a particular instruction method.
More informationHierarchical Modelling for Univariate and Multivariate Spatial Data
Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 1/4 Hierarchical Modelling for Univariate and Multivariate Spatial Data Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota
More informationMCMC algorithms for fitting Bayesian models
MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models
More informationEstimating marginal likelihoods from the posterior draws through a geometric identity
Estimating marginal likelihoods from the posterior draws through a geometric identity Johannes Reichl Energy Institute at the Johannes Kepler University Linz E-mail for correspondence: reichl@energieinstitut-linz.at
More informationHierarchical Modelling for Univariate Spatial Data
Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department
More informationST 740: Markov Chain Monte Carlo
ST 740: Markov Chain Monte Carlo Alyson Wilson Department of Statistics North Carolina State University October 14, 2012 A. Wilson (NCSU Stsatistics) MCMC October 14, 2012 1 / 20 Convergence Diagnostics:
More informationBayesian Inference for Regression Parameters
Bayesian Inference for Regression Parameters 1 Bayesian inference for simple linear regression parameters follows the usual pattern for all Bayesian analyses: 1. Form a prior distribution over all unknown
More informationSpatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions
Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear
More informationQuasi-likelihood Scan Statistics for Detection of
for Quasi-likelihood for Division of Biostatistics and Bioinformatics, National Health Research Institutes & Department of Mathematics, National Chung Cheng University 17 December 2011 1 / 25 Outline for
More informationA short introduction to INLA and R-INLA
A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk
More informationParameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1
Parameter Estimation William H. Jefferys University of Texas at Austin bill@bayesrules.net Parameter Estimation 7/26/05 1 Elements of Inference Inference problems contain two indispensable elements: Data
More informationHierarchical Linear Models
Hierarchical Linear Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin The linear regression model Hierarchical Linear Models y N(Xβ, Σ y ) β σ 2 p(β σ 2 ) σ 2 p(σ 2 ) can be extended
More informationKatsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract
Bayesian analysis of a vector autoregressive model with multiple structural breaks Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus Abstract This paper develops a Bayesian approach
More informationBayesian Linear Models
Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public
More information1 Inference for binomial proportion (Matlab/Python)
Bayesian data analysis exercises from 2015 1 Inference for binomial proportion (Matlab/Python) Algae status is monitored in 274 sites at Finnish lakes and rivers. The observations for the 2008 algae status
More informationLabor-Supply Shifts and Economic Fluctuations. Technical Appendix
Labor-Supply Shifts and Economic Fluctuations Technical Appendix Yongsung Chang Department of Economics University of Pennsylvania Frank Schorfheide Department of Economics University of Pennsylvania January
More informationBayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1
Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1 Division of Biostatistics, School of Public Health, University of Minnesota, Mayo Mail Code 303, Minneapolis, Minnesota 55455 0392, U.S.A.
More informationMixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina
Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -
More informationProbing the covariance matrix
Probing the covariance matrix Kenneth M. Hanson Los Alamos National Laboratory (ret.) BIE Users Group Meeting, September 24, 2013 This presentation available at http://kmh-lanl.hansonhub.com/ LA-UR-06-5241
More informationAreal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield
Areal data Reminder about types of data Geostatistical data: Z(s) exists everyhere, varies continuously Can accommodate sudden changes by a model for the mean E.g., soil ph, two soil types with different
More information