Bayesian Areal Wombling for Geographic Boundary Analysis

Size: px
Start display at page:

Download "Bayesian Areal Wombling for Geographic Boundary Analysis"

Transcription

1 Bayesian Areal Wombling for Geographic Boundary Analysis Haolan Lu, Haijun Ma, and Bradley P. Carlin and Division of Biostatistics School of Public Health University of Minnesota Bayesian Areal Wombling for Geographic Boundary Analysis p. 1/30

2 Background Spatial data typically classified two ways: point-referenced (geostatistical): spatial locations are points with known coordinates areal (lattice): locations are geographic regions (e.g., counties) with adjacency information Bayesian Areal Wombling for Geographic Boundary Analysis p. 2/30

3 Background Spatial data typically classified two ways: point-referenced (geostatistical): spatial locations are points with known coordinates areal (lattice): locations are geographic regions (e.g., counties) with adjacency information Important topic in spatial statistics: boundary analysis, wherein we seek to identify regions of abrupt change. Indicated by: steep gradients in a continuous surface regional boundaries separating regions with drastically different measurements in a lattice surface Bayesian Areal Wombling for Geographic Boundary Analysis p. 2/30

4 Background Spatial data typically classified two ways: point-referenced (geostatistical): spatial locations are points with known coordinates areal (lattice): locations are geographic regions (e.g., counties) with adjacency information Important topic in spatial statistics: boundary analysis, wherein we seek to identify regions of abrupt change. Indicated by: steep gradients in a continuous surface regional boundaries separating regions with drastically different measurements in a lattice surface Focus in this talk: Boundary analysis for areal data Bayesian Areal Wombling for Geographic Boundary Analysis p. 2/30

5 Background (cont d) Spatial boundary analysis techniques often called wombling, after a foundational paper by Womble (1951). Bayesian Areal Wombling for Geographic Boundary Analysis p. 3/30

6 Background (cont d) Spatial boundary analysis techniques often called wombling, after a foundational paper by Womble (1951). In areal (or polygonal) wombling, a dissimilarity metric measures the difference between adjacent regions. Methods for choosing boundary elements: absolute (dissimilarity metrics greater than C) relative (dissimilarity metrics in the top k%) Bayesian Areal Wombling for Geographic Boundary Analysis p. 3/30

7 Background (cont d) Spatial boundary analysis techniques often called wombling, after a foundational paper by Womble (1951). In areal (or polygonal) wombling, a dissimilarity metric measures the difference between adjacent regions. Methods for choosing boundary elements: absolute (dissimilarity metrics greater than C) relative (dissimilarity metrics in the top k%) Problems: Relative (top k%) thresholding method always finds a fixed number of boundary elements Approach is algorithmic, rather than stochastic: no model or likelihood, so statements about the significance of a detected boundary only relative to predetermined, often unrealistic null distributions. Bayesian Areal Wombling for Geographic Boundary Analysis p. 3/30

8 Crisp vs. Fuzzy Areal Wombling Suppose we have responses Z i for regions i = 1,...,n For neighboring regions i and j and some distance metric, assign the boundary likelihood value (BLV) D ij = Z i Z j. Bayesian Areal Wombling for Geographic Boundary Analysis p. 4/30

9 Crisp vs. Fuzzy Areal Wombling Suppose we have responses Z i for regions i = 1,...,n For neighboring regions i and j and some distance metric, assign the boundary likelihood value (BLV) D ij = Z i Z j. Crisp wombling: Boundary is those edges having BLV s above specified thresholds, i.e., for some c > 0, {(i,j) : D ij > c, i adjacent to j}. Bayesian Areal Wombling for Geographic Boundary Analysis p. 4/30

10 Crisp vs. Fuzzy Areal Wombling Suppose we have responses Z i for regions i = 1,...,n For neighboring regions i and j and some distance metric, assign the boundary likelihood value (BLV) D ij = Z i Z j. Crisp wombling: Boundary is those edges having BLV s above specified thresholds, i.e., for some c > 0, {(i,j) : D ij > c, i adjacent to j}. Fuzzy wombling: Partial membership in the boundary is permitted, say by defining the boundary membership values (BMVs) D ij / max{d ij } ij Bayesian Areal Wombling for Geographic Boundary Analysis p. 4/30

11 Example: MN colorectal cancer data Here n i is the total number of colorectal cancers occurring in county i, and Y i is the number of these that were detected late. Let Z i = SLDR i = Y i E i, i = 1,...,N, the standardized late detection ratio (SLDR), where the expected counts are computed via internal standardization as E i = n i r, and r = i Y i/ i n i, the statewide late detection rate. Bayesian Areal Wombling for Geographic Boundary Analysis p. 5/30

12 Example: MN colorectal cancer data Here n i is the total number of colorectal cancers occurring in county i, and Y i is the number of these that were detected late. Let Z i = SLDR i = Y i E i, i = 1,...,N, the standardized late detection ratio (SLDR), where the expected counts are computed via internal standardization as E i = n i r, and r = i Y i/ i n i, the statewide late detection rate. Left panel of next slide shows BoundarySEER plots of traditional wombled boundaries... Bayesian Areal Wombling for Geographic Boundary Analysis p. 5/30

13 Example: MN colorectal cancer data Left: BoundarySeer choropleth map of colorectal cancer late detection SLDRs, and crisp wombled boundaries arising from the top 20% (red) and 50% (yellow) of the D ij. Bayesian Areal Wombling for Geographic Boundary Analysis p. 6/30

14 Example: MN colorectal cancer data Left: BoundarySeer choropleth map of colorectal cancer late detection SLDRs, and crisp wombled boundaries arising from the top 20% (red) and 50% (yellow) of the D ij. Right: "Tricked" BoundarySeer map of fitted Bayesian colorectal cancer detection SLDRs, and Bayesian wombled boundaries based on the top 20% of the Ê( ij y) values using G = 1500 Gibbs samples. Bayesian Areal Wombling for Geographic Boundary Analysis p. 6/30

15 Hierarchical modeling approach We employ the Poisson log-linear form Y i Poisson(µ i ) where log µ i = log E i + x iβ + φ i. where x i are region-specific covariates, and Bayesian Areal Wombling for Geographic Boundary Analysis p. 7/30

16 Hierarchical modeling approach We employ the Poisson log-linear form Y i Poisson(µ i ) where log µ i = log E i + x iβ + φ i. where x i are region-specific covariates, and the random effects φ = (φ 1,...,φ N ) are given a conditionally autoregressive CAR(τ) specification, φ i φ j i N( φ i, 1/(τm i )), where N denotes the normal distribution, φ i is the average of the φ j i that are adjacent to φ i, and m i is the number of these adjacencies. Bayesian Areal Wombling for Geographic Boundary Analysis p. 7/30

17 Hierarchical modeling approach Define the BLV for boundary (i,j) as ij = η i η j for all i adjacent to j, with η i = µ i /E i. Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30

18 Hierarchical modeling approach Define the BLV for boundary (i,j) as ij = η i η j for all i adjacent to j, with η i = µ i /E i. BLVs posterior distribution = wombled boundaries! Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30

19 Hierarchical modeling approach Define the BLV for boundary (i,j) as with η i = µ i /E i. ij = η i η j for all i adjacent to j, BLVs posterior distribution = wombled boundaries! Crisp: Define ij to be part of the boundary if and only if E( ij y) > c for some constant c > 0, or if and only if P( ij c y) > c for some constant 0 < c < 1. Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30

20 Hierarchical modeling approach Define the BLV for boundary (i,j) as with η i = µ i /E i. ij = η i η j for all i adjacent to j, BLVs posterior distribution = wombled boundaries! Crisp: Define ij to be part of the boundary if and only if E( ij y) > c for some constant c > 0, or if and only if P( ij c y) > c for some constant 0 < c < 1. Fuzzy: P( ij c y) is itself the fuzzy BMV, with MC estimate and associated sd ˆp ij = # (g) ij > c G and ˆpij (1 ˆp ij ) G, retaining only every M th sample, for a total of G. Bayesian Areal Wombling for Geographic Boundary Analysis p. 8/30

21 MN colorectal cancer data revisited means ˆp ij : associated sd s: Posterior probability areal wombling maps for the Minnesota colorectal cancer detection data using three illustrative values of c (5, 15, and 30%), M = 5, and G = Bayesian Areal Wombling for Geographic Boundary Analysis p. 9/30

22 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30

23 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) The ˆp maps suggest little evidence of strong boundaries between counties; only a few county boundaries estimated to separate regions with true SLDRs that differ by more than 15%. Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30

24 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) The ˆp maps suggest little evidence of strong boundaries between counties; only a few county boundaries estimated to separate regions with true SLDRs that differ by more than 15%. The standard deviation plots reveal that the overall uncertainty associated with each segment tends to decrease for the more extreme c (5 and 30), as we become more certain that most segments either are or are not part of the boundary. Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30

25 MN colorectal cancer data revisited The probability of each segment being a member of the boundary decreases in c (the threshold for being a BE) The ˆp maps suggest little evidence of strong boundaries between counties; only a few county boundaries estimated to separate regions with true SLDRs that differ by more than 15%. The standard deviation plots reveal that the overall uncertainty associated with each segment tends to decrease for the more extreme c (5 and 30), as we become more certain that most segments either are or are not part of the boundary. Animated sequences of crisp boundaries (as in haolanl/movie.gif) may be more enlightening than maps of posterior standard deviations... Bayesian Areal Wombling for Geographic Boundary Analysis p. 10/30

26 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30

27 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30

28 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). If we womble the spatial residuals φ i (instead of the η i ), Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30

29 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). If we womble the spatial residuals φ i (instead of the η i ), Boundaries now separate regions with differing unmodeled heterogeneity = help identify missing covariates! Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30

30 MN colorectal cancer data revisited To assess whether County 63 (Red Lake, a T-shaped county in the NW) is truly isolated from its only two neighbors (Counties 57 and 60), by evaluating p 63 P( 63,57 > c 63,60 > c y). Since the { (g) ij } come from the joint posterior of { ij }, Monte Carlo estimates are available = simultaneous inference without a multiple comparisons problem (i.e. no need for Bonferroni correction). If we womble the spatial residuals φ i (instead of the η i ), Boundaries now separate regions with differing unmodeled heterogeneity = help identify missing covariates! No significant boundaries = (covariate-adjusted) mapwide equity, a.k.a. environmental justice Bayesian Areal Wombling for Geographic Boundary Analysis p. 11/30

31 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30

32 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. The CAR remains a valid distributional specification provided 0 w ij 1 = more possibilities for spatial smoothing: Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30

33 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. The CAR remains a valid distributional specification provided 0 w ij 1 = more possibilities for spatial smoothing: Choose the w ij inversely proportional to the distance separating the centroids of regions i and j, or Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30

34 Alternative: Modeling adjacency Note that the univariate CAR model can be written as ( j φ i φ ( i) N w ) ijφ j 1 j w, ij τ j w, ij where the weights w ij are equal to 1 if i j and regions i and j are adjacent, and 0 otherwise. The CAR remains a valid distributional specification provided 0 w ij 1 = more possibilities for spatial smoothing: Choose the w ij inversely proportional to the distance separating the centroids of regions i and j, or Think of the w ij as additional unknown parameters to be estimated, allowing the data to help determine the degree and nature of spatial smoothing. Bayesian Areal Wombling for Geographic Boundary Analysis p. 12/30

35 Alternative: Modeling adjacency Mimicking an idea from statistical social network analysis (Wang and Wong, 1987; Hoff et al., 2002), model the w ij as w ij p ij Bernoulli(p ij ), where log ( pij 1 p ij ) = z ijγ. Bayesian Areal Wombling for Geographic Boundary Analysis p. 13/30

36 Alternative: Modeling adjacency Mimicking an idea from statistical social network analysis (Wang and Wong, 1987; Hoff et al., 2002), model the w ij as w ij p ij Bernoulli(p ij ), where log ( pij 1 p ij Example: Let z 1ij = 1 (so that γ 1 is an intercept parameter), ) = z ijγ. z 2ij = d ij, the distance between the centroids of regions i and j, z 3ij = (area i + area j )/2, the average area of the two regions, and z 4ij = x i x j, the absolute difference of some regional covariate (say, percent urban, or percent of residents who are smokers). Bayesian Areal Wombling for Geographic Boundary Analysis p. 13/30

37 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30

38 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30

39 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30

40 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. MCMC sampling of φ, τ, β, γ, and W can proceed by a tuned mixture of Gibbs and Metropolis steps. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30

41 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. MCMC sampling of φ, τ, β, γ, and W can proceed by a tuned mixture of Gibbs and Metropolis steps. Try it with simulated data, arising from γ 0 = 1,γ 1 = 2,β 0 = 1, and τ = 30. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30

42 Alternative: Modeling adjacency Crisp boundaries based on w ij : those segments ij having P(w ij = 0 y) > c. Fuzzy boundaries based on w ij : use the P(w ij = 0 y) values themselves as the BMVs. Reilly (2001) showed γ is estimable even under a noninformative prior. MCMC sampling of φ, τ, β, γ, and W can proceed by a tuned mixture of Gibbs and Metropolis steps. Try it with simulated data, arising from γ 0 = 1,γ 1 = 2,β 0 = 1, and τ = 30. The simulation of φ is facilitated by WinBUGS, and log E i s are borrowed from a Minnesota breast cancer late detection data set. Bayesian Areal Wombling for Geographic Boundary Analysis p. 14/30

43 Example: Simulated data Left: Map of simulated "true" boundaries (1 w ij ), where thick dark lines indicate boundaries and thin blue lines indicate adjacency between two regions. Bayesian Areal Wombling for Geographic Boundary Analysis p. 15/30

44 Example: Simulated data Left: Map of simulated "true" boundaries (1 w ij ), where thick dark lines indicate boundaries and thin blue lines indicate adjacency between two regions. Right: Map of simulated "true" φ i on a blue color scale. Bayesian Areal Wombling for Geographic Boundary Analysis p. 15/30

45 Example: Simulated data Left: Map of simulated "true" boundaries (1 w ij ), where thick dark lines indicate boundaries and thin blue lines indicate adjacency between two regions. Right: Map of simulated "true" φ i on a blue color scale. Note: Most of the boundaries in the left panel are preserved in the right panel! Bayesian Areal Wombling for Geographic Boundary Analysis p. 15/30

46 Example: Simulated data Left: Map of posterior mean of 1 p ij with corresponding boundaries shaded in color. Note: The majority of these lines match with those in L panel of previous figure! Bayesian Areal Wombling for Geographic Boundary Analysis p. 16/30

47 Example: Simulated data Left: Map of posterior mean of 1 p ij with corresponding boundaries shaded in color. Note: The majority of these lines match with those in L panel of previous figure! Right: Map of posterior means of φ i on blue color scale. Note: Overall spatial pattern remains, but with smoothing of the φ i among neighbors. Bayesian Areal Wombling for Geographic Boundary Analysis p. 16/30

48 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30

49 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. That is, given that a segment is part of the boundary, we d like our model to favor including neighboring segments in the boundary as well. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30

50 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. That is, given that a segment is part of the boundary, we d like our model to favor including neighboring segments in the boundary as well. But the CAR model is ideal for this type of modeling! We simply need to define a second CAR model on the boundary segment space, in addition to the one we already have on the areal unit space. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30

51 Extension: Boundary Adjacency The previous approaches (especially those based on ij ) tend to create networks of disconnected boundary segments. We would prefer an approach that drew continuous boundaries across the map. That is, given that a segment is part of the boundary, we d like our model to favor including neighboring segments in the boundary as well. But the CAR model is ideal for this type of modeling! We simply need to define a second CAR model on the boundary segment space, in addition to the one we already have on the areal unit space. That is, if W is the N N (random) adjacency matrix for the counties, now we also have W, an N adj N adj (fixed) adjacency matrix for the boundary segments, where N adj is the total number of unique county adjacencies. Bayesian Areal Wombling for Geographic Boundary Analysis p. 17/30

52 New Approach: 2-Level CAR That is, we need to expand our logit model to ( ) pij log = z 1 p ijγ + ψ ij, ij where ψ CAR(λ,W ) and W is a 0-1 adjacency matrix on the boundary segment space, with adjacency defined by the map (fixed; no covariates this time). Bayesian Areal Wombling for Geographic Boundary Analysis p. 18/30

53 New Approach: 2-Level CAR That is, we need to expand our logit model to ( ) pij log = z 1 p ijγ + ψ ij, ij where ψ CAR(λ,W ) and W is a 0-1 adjacency matrix on the boundary segment space, with adjacency defined by the map (fixed; no covariates this time). Boundaries may again be obtained from posterior summarization of the ij, the p ij, or the w ij. Bayesian Areal Wombling for Geographic Boundary Analysis p. 18/30

54 New Approach: 2-Level CAR That is, we need to expand our logit model to ( ) pij log = z 1 p ijγ + ψ ij, ij where ψ CAR(λ,W ) and W is a 0-1 adjacency matrix on the boundary segment space, with adjacency defined by the map (fixed; no covariates this time). Boundaries may again be obtained from posterior summarization of the ij, the p ij, or the w ij. To check, we generate data from a 4 4 "template" having true boundary" separating 6 regions (the 4 in the bottom row and the middle two in the 3rd row) from the rest... Bayesian Areal Wombling for Geographic Boundary Analysis p. 18/30

55 New Approach: 2-Level CAR Raw Data L&C, Pr(delta_ij>c data) [ 2.17, 1.51) [ 1.51, 0.85) [ 0.85, 0.19) [ 0.19,0.47) [0.47,1.13) [1.13,1.79] [0.06,0.75] (0.75,0.94] Left: Map of a simulated data set from our model. Bayesian Areal Wombling for Geographic Boundary Analysis p. 19/30

56 New Approach: 2-Level CAR Raw Data L&C, Pr(delta_ij>c data) [ 2.17, 1.51) [ 1.51, 0.85) [ 0.85, 0.19) [ 0.19,0.47) [0.47,1.13) [1.13,1.79] [0.06,0.75] (0.75,0.94] Left: Map of a simulated data set from our model. Right: Posterior probability that ij exceeds the cutoff c, LC method (nonrandom adjacency matrix W ). Bayesian Areal Wombling for Geographic Boundary Analysis p. 19/30

57 New Approach: 2-Level CAR Raw Data L&C, Pr(delta_ij>c data) [ 2.17, 1.51) [ 1.51, 0.85) [ 0.85, 0.19) [ 0.19,0.47) [0.47,1.13) [1.13,1.79] [0.06,0.75] (0.75,0.94] Left: Map of a simulated data set from our model. Right: Posterior probability that ij exceeds the cutoff c, LC method (nonrandom adjacency matrix W ). Note that there are six true" boundary segments; LC finds only 3 of them. It is badly fooled by the surprisingly low values in the northeast corner and the surprisingly high value in the 3rd row, last column. Bayesian Areal Wombling for Geographic Boundary Analysis p. 19/30

58 Results: 2-Level CAR CAR2, Pr(delta_ij>c data) CAR2, post.mean of (1 pij) [0.07,0.82] (0.82,0.97] [0.12,0.17) [0.17,0.21) [0.21,0.25) [0.25,0.29) [0.29,0.34) [0.34,0.38] Left: Posterior probability that ij exceeds the cutoff c (fuzzy boundaries), 2-Level CAR (CAR2). Bayesian Areal Wombling for Geographic Boundary Analysis p. 20/30

59 Results: 2-Level CAR CAR2, Pr(delta_ij>c data) CAR2, post.mean of (1 pij) [0.07,0.82] (0.82,0.97] [0.12,0.17) [0.17,0.21) [0.21,0.25) [0.25,0.29) [0.29,0.34) [0.34,0.38] Left: Posterior probability that ij exceeds the cutoff c (fuzzy boundaries), 2-Level CAR (CAR2). Right: Posterior mean of 1 p ij, 2-Level CAR. Bayesian Areal Wombling for Geographic Boundary Analysis p. 20/30

60 Results: 2-Level CAR CAR2, Pr(delta_ij>c data) CAR2, post.mean of (1 pij) [0.07,0.82] (0.82,0.97] [0.12,0.17) [0.17,0.21) [0.21,0.25) [0.25,0.29) [0.29,0.34) [0.34,0.38] Left: Posterior probability that ij exceeds the cutoff c (fuzzy boundaries), 2-Level CAR (CAR2). Right: Posterior mean of 1 p ij, 2-Level CAR. When applied to the ij s, the CAR2 method performs no better than LC. But when applied to the p ij s, the method now correctly finds all six boundary segments. Bayesian Areal Wombling for Geographic Boundary Analysis p. 20/30

61 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30

62 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30

63 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30

64 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Try to increase the impact of response data Y i on the wombled boundaries (relative to the covariates Z ij ) while retaining boundary connectedness Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30

65 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Try to increase the impact of response data Y i on the wombled boundaries (relative to the covariates Z ij ) while retaining boundary connectedness by eliminating the CAR(τ,W) model and the φ i s altogether (but retaining the CAR(λ,W ) and the ψ ij ) Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30

66 Current Work Compare methods based on ij and w ij by simulating the average probability that they correctly classify each potential boundary segment Various remedies for handling "islands" (regions with no neighbors) when modeling adjacency Investigate L 1 (absolute difference) CARs, and sensitivity to the choice of priors p(γ i ) that control the effect of the covariate data Z ij Try to increase the impact of response data Y i on the wombled boundaries (relative to the covariates Z ij ) while retaining boundary connectedness by eliminating the CAR(τ,W) model and the φ i s altogether (but retaining the CAR(λ,W ) and the ψ ij ) by retaining both CAR models, but fitting them simultaneously, rather than hierarchically Bayesian Areal Wombling for Geographic Boundary Analysis p. 21/30

67 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30

68 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Suppose D ij N(η ij, 1/τ e ), where η ij = γ 0 + z ij γ 1 + ψ ij. Let z ij = x i x j (again, difference not absolute difference here), and we again let ψ CAR(λ,W ). Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30

69 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Suppose D ij N(η ij, 1/τ e ), where η ij = γ 0 + z ij γ 1 + ψ ij. Let z ij = x i x j (again, difference not absolute difference here), and we again let ψ CAR(λ,W ). This model eliminates the hard-to-estimate W ij (Bernoulli) parameters, and retains only the second" CAR model, which captures the similarity of neighboring boundary segments on the map. Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30

70 Connected, Y i -dominated boundaries Eliminating the CAR(τ,W) model and the φ i s: Suppose we think of the original data Y i as normal. Then the differences D ij = Y i Y j are also normal (note differences, not absolute differences here). Suppose D ij N(η ij, 1/τ e ), where η ij = γ 0 + z ij γ 1 + ψ ij. Let z ij = x i x j (again, difference not absolute difference here), and we again let ψ CAR(λ,W ). This model eliminates the hard-to-estimate W ij (Bernoulli) parameters, and retains only the second" CAR model, which captures the similarity of neighboring boundary segments on the map. This model should deliver ψ ij -based wombled boundaries that are more connected than before Bayesian Areal Wombling for Geographic Boundary Analysis p. 22/30

71 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30

72 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30

73 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: ψ ij 0 when Y i and Y j are both larger or both smaller than expected Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30

74 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: ψ ij 0 when Y i and Y j are both larger or both smaller than expected ψ ij 0 when one of Y i or Y i is bigger than expected, and the other is smaller Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30

75 Connected, Y i -dominated boundaries Simultaneous fitting of the two CAR models: Suppose we replace the Poisson mean structure with log µ i = log E i + x iβ + φ i + ψ i, where φ CAR(τ,W) and ψ CAR(λ,W ) as before. Now ψ i, the average of the edge effects for those edges that comprise region i, contributes directly to the mean structure (instead of to the variance structure via W ). The ψ ij capture signed agreement since: ψ ij 0 when Y i and Y j are both larger or both smaller than expected ψ ij 0 when one of Y i or Y i is bigger than expected, and the other is smaller Might even also contemplate interaction between the φ i and the ψ ij! Bayesian Areal Wombling for Geographic Boundary Analysis p. 23/30

76 Another issue: Multivariate data Suppose we have multivariate disease counts Y ki for disease k in county i. A multivariate hierarchical areal wombling approach might model Y ki ind Poisson(µ ki ), k = 1,...,K > 1, and φ i = (φ 1i,...,φ Ki ) MCAR(α, Λ). Wombling may now proceed much as before, but with a Multivariate CAR model that captures correlation both across space and among the K variables. Bayesian Areal Wombling for Geographic Boundary Analysis p. 24/30

77 Another issue: Multivariate data Suppose we have multivariate disease counts Y ki for disease k in county i. A multivariate hierarchical areal wombling approach might model Y ki ind Poisson(µ ki ), k = 1,...,K > 1, and φ i = (φ 1i,...,φ Ki ) MCAR(α, Λ). Wombling may now proceed much as before, but with a Multivariate CAR model that captures correlation both across space and among the K variables. Several conditional (e.g., φ 1 followed by φ 2 φ 1 ) and marginal (obtained jointly by coregionalization) MCAR models are available, many of which can be fit in WinBUGS! Bayesian Areal Wombling for Geographic Boundary Analysis p. 24/30

78 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30

79 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30

80 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Goal is to estimate which zips are covered" and which are not covered" by a particular hospice system. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30

81 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Goal is to estimate which zips are covered" and which are not covered" by a particular hospice system. In the Duluth area there are two hospice systems (St. Mary s and St. Luke s). Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30

82 Application: Hospices near Duluth, MN Data consist of hospice deaths and total Medicare population in each Minnesota zip code. Hospices have a home base", and will typically only drive so many miles to provide in-home hospice service. Goal is to estimate which zips are covered" and which are not covered" by a particular hospice system. In the Duluth area there are two hospice systems (St. Mary s and St. Luke s). We seek to draw both boundaries around the service regions of both hospices (both invidually and jointly), while accounting for any correlation between the two. Bayesian Areal Wombling for Geographic Boundary Analysis p. 25/30

83 Raw Hospice Data MN: Zipcodes Served by Duluth Hospices Medicare Raw Data Map Not Served St. Mary Only St. Luke Only Served by both Bayesian Areal Wombling for Geographic Boundary Analysis p. 26/30

84 Results of straight Bayesian approach MN: Zipcodes Served by Duluth Hospices Model Based Map Not Served St. Mary Only St. Luke Only Served by both Bayesian Areal Wombling for Geographic Boundary Analysis p. 27/30

85 Bayes with Backfilling" results MN: Zipcodes Served by Duluth Hospices Model Based Map Not Served St. Mary Only St. Luke Only Served by both Bayesian Areal Wombling for Geographic Boundary Analysis p. 28/30

86 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30

87 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Positive, since if the St. Mary s deaths in a zip are high, the St. Luke s deaths are also likely to be high (consistency of cancer control" across regions). Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30

88 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Positive, since if the St. Mary s deaths in a zip are high, the St. Luke s deaths are also likely to be high (consistency of cancer control" across regions). Negative, since the two hospices are in competition for patients, and the resulting zero-sum game means negative correlation among hospices within zip (though there may still be positive spatial correlation). Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30

89 Application: Hospices near Duluth, MN What sort of correlation do we expect between data from the two hospices? Positive, since if the St. Mary s deaths in a zip are high, the St. Luke s deaths are also likely to be high (consistency of cancer control" across regions). Negative, since the two hospices are in competition for patients, and the resulting zero-sum game means negative correlation among hospices within zip (though there may still be positive spatial correlation). So we re not sure! But fortunately, the MCAR(α, Λ) distribution will automatically accommodate either possibility! Bayesian Areal Wombling for Geographic Boundary Analysis p. 29/30

90 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30

91 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Variance-based, univariate: Use means or quantiles of p ij or w ij as before, since usual MCAR has only one adjacency matrix W. This then gives boundaries for the larger region served by any hospice. Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30

92 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Variance-based, univariate: Use means or quantiles of p ij or w ij as before, since usual MCAR has only one adjacency matrix W. This then gives boundaries for the larger region served by any hospice. Variance-based, bivariate: Use an MCAR with two adjacency matrices, one for each hospice???... Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30

93 Application: Hospices near Duluth, MN The wombled boundaries can then be: Mean-based, bivariate: Use means or quantiles of = φ ki φ kj, obtaining separate boundaries for each hospice system k. (k) ij Variance-based, univariate: Use means or quantiles of p ij or w ij as before, since usual MCAR has only one adjacency matrix W. This then gives boundaries for the larger region served by any hospice. Variance-based, bivariate: Use an MCAR with two adjacency matrices, one for each hospice???... We plan to investigate all 3! Bayesian Areal Wombling for Geographic Boundary Analysis p. 30/30

Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis

Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis Haijun Ma, Bradley P. Carlin and Sudipto Banerjee December 8, 2008 Web Appendix A: Selecting

More information

Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information

Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information p. 1/27 Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information Shengde Liang, Bradley

More information

Multivariate spatial modeling

Multivariate spatial modeling Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced

More information

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Approaches for Multiple Disease Mapping: MCAR and SANOVA Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, 2015 1 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

More information

Hierarchical Modeling and Analysis for Spatial Data

Hierarchical Modeling and Analysis for Spatial Data Hierarchical Modeling and Analysis for Spatial Data Bradley P. Carlin, Sudipto Banerjee, and Alan E. Gelfand brad@biostat.umn.edu, sudiptob@biostat.umn.edu, and alan@stat.duke.edu University of Minnesota

More information

Generalized common spatial factor model

Generalized common spatial factor model Biostatistics (2003), 4, 4,pp. 569 582 Printed in Great Britain Generalized common spatial factor model FUJUN WANG Eli Lilly and Company, Indianapolis, IN 46285, USA MELANIE M. WALL Division of Biostatistics,

More information

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P. Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk

More information

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case Areal data models Spatial smoothers Brook s Lemma and Gibbs distribution CAR models Gaussian case Non-Gaussian case SAR models Gaussian case Non-Gaussian case CAR vs. SAR STAR models Inference for areal

More information

SPATIAL ANALYSIS & MORE

SPATIAL ANALYSIS & MORE SPATIAL ANALYSIS & MORE Thomas A. Louis, PhD Department of Biostatistics Johns Hopins Bloomberg School of Public Health www.biostat.jhsph.edu/ tlouis/ tlouis@jhu.edu 1/28 Outline Prediction vs Inference

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States

Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Laura A. Hatfield and Bradley P. Carlin Division of Biostatistics School of Public Health University

More information

STA 216, GLM, Lecture 16. October 29, 2007

STA 216, GLM, Lecture 16. October 29, 2007 STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters

More information

Example using R: Heart Valves Study

Example using R: Heart Valves Study Example using R: Heart Valves Study Goal: Show that the thrombogenicity rate (TR) is less than two times the objective performance criterion R and WinBUGS Examples p. 1/27 Example using R: Heart Valves

More information

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May 5-7 2008 Peter Schlattmann Institut für Biometrie und Klinische Epidemiologie

More information

Bayesian Hierarchical Models

Bayesian Hierarchical Models Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee and Andrew O. Finley 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

Spatio-Temporal Modelling of Credit Default Data

Spatio-Temporal Modelling of Credit Default Data 1/20 Spatio-Temporal Modelling of Credit Default Data Sathyanarayan Anand Advisor: Prof. Robert Stine The Wharton School, University of Pennsylvania April 29, 2011 2/20 Outline 1 Background 2 Conditional

More information

Fully Bayesian Spatial Analysis of Homicide Rates.

Fully Bayesian Spatial Analysis of Homicide Rates. Fully Bayesian Spatial Analysis of Homicide Rates. Silvio A. da Silva, Luiz L.M. Melo and Ricardo S. Ehlers Universidade Federal do Paraná, Brazil Abstract Spatial models have been used in many fields

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

Markov Chain Monte Carlo

Markov Chain Monte Carlo Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

Bayesian data analysis in practice: Three simple examples

Bayesian data analysis in practice: Three simple examples Bayesian data analysis in practice: Three simple examples Martin P. Tingley Introduction These notes cover three examples I presented at Climatea on 5 October 0. Matlab code is available by request to

More information

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise

More information

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments

More information

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675. McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-675 Lawrence Joseph Bayesian Analysis for the Health Sciences EPIB-675 3 credits Instructor:

More information

Disease mapping with Gaussian processes

Disease mapping with Gaussian processes EUROHEIS2 Kuopio, Finland 17-18 August 2010 Aki Vehtari (former Helsinki University of Technology) Department of Biomedical Engineering and Computational Science (BECS) Acknowledgments Researchers - Jarno

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Spatial Analysis of Incidence Rates: A Bayesian Approach

Spatial Analysis of Incidence Rates: A Bayesian Approach Spatial Analysis of Incidence Rates: A Bayesian Approach Silvio A. da Silva, Luiz L.M. Melo and Ricardo Ehlers July 2004 Abstract Spatial models have been used in many fields of science where the data

More information

Beyond MCMC in fitting complex Bayesian models: The INLA method

Beyond MCMC in fitting complex Bayesian models: The INLA method Beyond MCMC in fitting complex Bayesian models: The INLA method Valeska Andreozzi Centre of Statistics and Applications of Lisbon University (valeska.andreozzi at fc.ul.pt) European Congress of Epidemiology

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Index. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables.

Index. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables. Index Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables. Adaptive rejection metropolis sampling (ARMS), 98 Adaptive shrinkage, 132 Advanced Photo System (APS), 255 Aggregation

More information

Bayesian Regression Linear and Logistic Regression

Bayesian Regression Linear and Logistic Regression When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we

More information

Community Health Needs Assessment through Spatial Regression Modeling

Community Health Needs Assessment through Spatial Regression Modeling Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

Physician Performance Assessment / Spatial Inference of Pollutant Concentrations

Physician Performance Assessment / Spatial Inference of Pollutant Concentrations Physician Performance Assessment / Spatial Inference of Pollutant Concentrations Dawn Woodard Operations Research & Information Engineering Cornell University Johns Hopkins Dept. of Biostatistics, April

More information

Introduction to Spatial Analysis. Spatial Analysis. Session organization. Learning objectives. Module organization. GIS and spatial analysis

Introduction to Spatial Analysis. Spatial Analysis. Session organization. Learning objectives. Module organization. GIS and spatial analysis Introduction to Spatial Analysis I. Conceptualizing space Session organization Module : Conceptualizing space Module : Spatial analysis of lattice data Module : Spatial analysis of point patterns Module

More information

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An

More information

Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models

Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models Spatial Time Series Models for Rice and Cassava Yields Based On Bayesian Linear Mixed Models Panudet Saengseedam Nanthachai Kantanantha Abstract This paper proposes a linear mixed model (LMM) with spatial

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

Report and Opinion 2016;8(6) Analysis of bivariate correlated data under the Poisson-gamma model

Report and Opinion 2016;8(6)   Analysis of bivariate correlated data under the Poisson-gamma model Analysis of bivariate correlated data under the Poisson-gamma model Narges Ramooz, Farzad Eskandari 2. MSc of Statistics, Allameh Tabatabai University, Tehran, Iran 2. Associate professor of Statistics,

More information

Chapter 4 Dynamic Bayesian Networks Fall Jin Gu, Michael Zhang

Chapter 4 Dynamic Bayesian Networks Fall Jin Gu, Michael Zhang Chapter 4 Dynamic Bayesian Networks 2016 Fall Jin Gu, Michael Zhang Reviews: BN Representation Basic steps for BN representations Define variables Define the preliminary relations between variables Check

More information

Nearest Neighbor Gaussian Processes for Large Spatial Data

Nearest Neighbor Gaussian Processes for Large Spatial Data Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns

More information

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-682.

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-682. McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-682 Lawrence Joseph Intro to Bayesian Analysis for the Health Sciences EPIB-682 2 credits

More information

Inclusion of Non-Street Addresses in Cancer Cluster Analysis

Inclusion of Non-Street Addresses in Cancer Cluster Analysis Inclusion of Non-Street Addresses in Cancer Cluster Analysis Sue-Min Lai, Zhimin Shen, Darin Banks Kansas Cancer Registry University of Kansas Medical Center KCR (Kansas Cancer Registry) KCR: population-based

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department

More information

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparametric Bayesian Methods (Gaussian Processes) [70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent

More information

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

A Fully Nonparametric Modeling Approach to. BNP Binary Regression A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012 Outline 1 2 3 Simulation

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

Andrew B. Lawson 2019 BMTRY 763

Andrew B. Lawson 2019 BMTRY 763 BMTRY 763 FMD Foot and mouth disease (FMD) is a viral disease of clovenfooted animals and being extremely contagious it spreads rapidly. It reduces animal production, and can sometimes be fatal in young

More information

Bayesian Linear Models

Bayesian Linear Models Bayesian Linear Models Sudipto Banerjee September 03 05, 2017 Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles Linear Regression Linear regression is,

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Chapter 3: Basics of Areal Data Models p. 1/18 Areal Unit Data Regular or Irregular Grids

More information

Markov Networks.

Markov Networks. Markov Networks www.biostat.wisc.edu/~dpage/cs760/ Goals for the lecture you should understand the following concepts Markov network syntax Markov network semantics Potential functions Partition function

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Markov chain Monte Carlo

Markov chain Monte Carlo 1 / 26 Markov chain Monte Carlo Timothy Hanson 1 and Alejandro Jara 2 1 Division of Biostatistics, University of Minnesota, USA 2 Department of Statistics, Universidad de Concepción, Chile IAP-Workshop

More information

Computational statistics

Computational statistics Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated

More information

Part 6: Multivariate Normal and Linear Models

Part 6: Multivariate Normal and Linear Models Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of

More information

Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models

Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models Effects of residual smoothing on the posterior of the fixed effects in disease-mapping models By BRIAN J. REICH a, JAMES S. HODGES a, and VESNA ZADNIK b 1 a Division of Biostatistics, School of Public

More information

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Bayesian Inference. Chapter 4: Regression and Hierarchical Models Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative

More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

Spatial Misalignment

Spatial Misalignment Spatial Misalignment Let s use the terminology "points" (for point-level observations) and "blocks" (for areal-level summaries) Chapter 6: Spatial Misalignment p. 1/5 Spatial Misalignment Let s use the

More information

Advanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller

Advanced Methods for Agricultural and Agroenvironmental. Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Advanced Methods for Agricultural and Agroenvironmental Monitoring Emily Berg, Zhengyuan Zhu, Sarah Nusser, and Wayne Fuller Outline 1. Introduction to the National Resources Inventory 2. Hierarchical

More information

Accounting for Complex Sample Designs via Mixture Models

Accounting for Complex Sample Designs via Mixture Models Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3

More information

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Bayesian Inference. Chapter 4: Regression and Hierarchical Models Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Advanced Statistics and Data Mining Summer School

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

Design of Text Mining Experiments. Matt Taddy, University of Chicago Booth School of Business faculty.chicagobooth.edu/matt.

Design of Text Mining Experiments. Matt Taddy, University of Chicago Booth School of Business faculty.chicagobooth.edu/matt. Design of Text Mining Experiments Matt Taddy, University of Chicago Booth School of Business faculty.chicagobooth.edu/matt.taddy/research Active Learning: a flavor of design of experiments Optimal : consider

More information

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and

More information

Bayesian Mixture Modeling

Bayesian Mixture Modeling University of California, Merced July 21, 2014 Mplus Users Meeting, Utrecht Organization of the Talk Organization s modeling estimation framework Motivating examples duce the basic LCA model Illustrated

More information

Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models

Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models Kohta Aoki 1 and Hiroshi Nagahashi 2 1 Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology

More information

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North

More information

Riemann Manifold Methods in Bayesian Statistics

Riemann Manifold Methods in Bayesian Statistics Ricardo Ehlers ehlers@icmc.usp.br Applied Maths and Stats University of São Paulo, Brazil Working Group in Statistical Learning University College Dublin September 2015 Bayesian inference is based on Bayes

More information

Multivariate Normal & Wishart

Multivariate Normal & Wishart Multivariate Normal & Wishart Hoff Chapter 7 October 21, 2010 Reading Comprehesion Example Twenty-two children are given a reading comprehsion test before and after receiving a particular instruction method.

More information

Hierarchical Modelling for Univariate and Multivariate Spatial Data

Hierarchical Modelling for Univariate and Multivariate Spatial Data Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 1/4 Hierarchical Modelling for Univariate and Multivariate Spatial Data Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

Estimating marginal likelihoods from the posterior draws through a geometric identity

Estimating marginal likelihoods from the posterior draws through a geometric identity Estimating marginal likelihoods from the posterior draws through a geometric identity Johannes Reichl Energy Institute at the Johannes Kepler University Linz E-mail for correspondence: reichl@energieinstitut-linz.at

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

ST 740: Markov Chain Monte Carlo

ST 740: Markov Chain Monte Carlo ST 740: Markov Chain Monte Carlo Alyson Wilson Department of Statistics North Carolina State University October 14, 2012 A. Wilson (NCSU Stsatistics) MCMC October 14, 2012 1 / 20 Convergence Diagnostics:

More information

Bayesian Inference for Regression Parameters

Bayesian Inference for Regression Parameters Bayesian Inference for Regression Parameters 1 Bayesian inference for simple linear regression parameters follows the usual pattern for all Bayesian analyses: 1. Form a prior distribution over all unknown

More information

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear

More information

Quasi-likelihood Scan Statistics for Detection of

Quasi-likelihood Scan Statistics for Detection of for Quasi-likelihood for Division of Biostatistics and Bioinformatics, National Health Research Institutes & Department of Mathematics, National Chung Cheng University 17 December 2011 1 / 25 Outline for

More information

A short introduction to INLA and R-INLA

A short introduction to INLA and R-INLA A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk

More information

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1 Parameter Estimation William H. Jefferys University of Texas at Austin bill@bayesrules.net Parameter Estimation 7/26/05 1 Elements of Inference Inference problems contain two indispensable elements: Data

More information

Hierarchical Linear Models

Hierarchical Linear Models Hierarchical Linear Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin The linear regression model Hierarchical Linear Models y N(Xβ, Σ y ) β σ 2 p(β σ 2 ) σ 2 p(σ 2 ) can be extended

More information

Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract

Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract Bayesian analysis of a vector autoregressive model with multiple structural breaks Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus Abstract This paper develops a Bayesian approach

More information

Bayesian Linear Models

Bayesian Linear Models Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public

More information

1 Inference for binomial proportion (Matlab/Python)

1 Inference for binomial proportion (Matlab/Python) Bayesian data analysis exercises from 2015 1 Inference for binomial proportion (Matlab/Python) Algae status is monitored in 274 sites at Finnish lakes and rivers. The observations for the 2008 algae status

More information

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix Labor-Supply Shifts and Economic Fluctuations Technical Appendix Yongsung Chang Department of Economics University of Pennsylvania Frank Schorfheide Department of Economics University of Pennsylvania January

More information

Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1

Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1 Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1 Division of Biostatistics, School of Public Health, University of Minnesota, Mayo Mail Code 303, Minneapolis, Minnesota 55455 0392, U.S.A.

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

Probing the covariance matrix

Probing the covariance matrix Probing the covariance matrix Kenneth M. Hanson Los Alamos National Laboratory (ret.) BIE Users Group Meeting, September 24, 2013 This presentation available at http://kmh-lanl.hansonhub.com/ LA-UR-06-5241

More information

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield Areal data Reminder about types of data Geostatistical data: Z(s) exists everyhere, varies continuously Can accommodate sudden changes by a model for the mean E.g., soil ph, two soil types with different

More information