An Introduction to SaTScan

Similar documents
Cluster Analysis using SaTScan

SaTScan TM. User Guide. for version 7.0. By Martin Kulldorff. August

A spatial scan statistic for multinomial data

Cluster Analysis using SaTScan. Patrick DeLuca, M.A. APHEO 2007 Conference, Ottawa October 16 th, 2007

FleXScan User Guide. for version 3.1. Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango. National Institute of Public Health

A nonparametric spatial scan statistic for continuous data

Detecting Emerging Space-Time Crime Patterns by Prospective STSS

USING CLUSTERING SOFTWARE FOR EXPLORING SPATIAL AND TEMPORAL PATTERNS IN NON-COMMUNICABLE DISEASES

Long Island Breast Cancer Study and the GIS-H (Health)

Success scans in a sequence of trials

Spatio-Temporal Cluster Detection of Point Events by Hierarchical Search of Adjacent Area Unit Combinations

Interactive GIS in Veterinary Epidemiology Technology & Application in a Veterinary Diagnostic Lab

Bayesian Hierarchical Models

Inclusion of Non-Street Addresses in Cancer Cluster Analysis

Outline. Practical Point Pattern Analysis. David Harvey s Critiques. Peter Gould s Critiques. Global vs. Local. Problems of PPA in Real World

Community Health Needs Assessment through Spatial Regression Modeling

Geographical Visualization Approach to Perceive Spatial Scan Statistics: An Analysis of Dengue Fever Outbreaks in Delhi

Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States

Identifying West Nile Virus Risk Areas: The Dynamic Continuous-Area Space-Time System

A SPACE-TIME SCAN STATISTIC FOR DETECTION OF TB OUTBREAKS IN THE SAN FRANCISCO HOMELESS POPULATION

Determine the flooded area based on elevation information and the current water level

GIS for Integrated Pest Management. Christina Hailey. Abstract:

The Church Demographic Specialists

Stat 587: Key points and formulae Week 15

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Rapid detection of spatiotemporal clusters

The CrimeStat Program: Characteristics, Use, and Audience

Scalable Bayesian Event Detection and Visualization

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

Applied Spatial Analysis in Epidemiology

Hotspot detection using space-time scan statistics on children under five years of age in Depok

Spatio-temporal modeling of weekly malaria incidence in children under 5 for early epidemic detection in Mozambique

Applied Spatial Analysis in Epidemiology

ACCELERATING THE DETECTION VECTOR BORNE DISEASES

School on Modelling Tools and Capacity Building in Climate and Public Health April Point Event Analysis

Peninsular Florida p Modeled Water Table Depth Arboviral Epidemic Risk Assessment. Current Assessment: 06/08/2008 Week 23 Initial Wetting Phase

Role of GIS in Tracking and Controlling Spread of Disease

Disease mapping with Gaussian processes

Sampling. General introduction to sampling methods in epidemiology and some applications to food microbiology study October Hanoi

THE EFFECT OF GEOGRAPHY ON SEXUALLY TRANSMITTED

GIS & Spatial Analysis in MCH

Bayesian Spatial Health Surveillance

Integrating GIS into West Nile Virus Planning and Surveillance

College Of Science Engineering Health: GIS at UWF

Are You Maximizing The Value Of All Your Data?

Swarm and Evolutionary Computation. A new PSO-optimized geometry of spatial and spatio-temporal scan statistics for disease outbreak detection

Predicting Long-term Exposures for Health Effect Studies

CitySense TM : multiscale space time clustering of GPS points and trajectories

Stat 642, Lecture notes for 04/12/05 96

ARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2

Using Estimating Equations for Spatially Correlated A

TOTAL RAINFALL June 1 July 24. Mundelein Lake % O Hare Cook % Midway Cook % Romeoville Will

Railway suicide clusters: how common are they and what predicts them? Lay San Too Jane Pirkis Allison Milner Lyndal Bugeja Matthew J.

Urbanization, Land Cover, Weather, and Incidence Rates of Neuroinvasive West Nile Virus Infections In Illinois

Appraoaches to Landscape-Scale Modeling

Sampling Considerations for Disease Surveillance in Wildlife Populations

OUTBREAK INVESTIGATION AND SPATIAL ANALYSIS OF SURVEILLANCE DATA: CLUSTER DATA ANALYSIS. Preliminary Programme / Draft_2. Serbia, Mai 2013

GIS to Support West Nile Virus Program

Expectation-based scan statistics for monitoring spatial time series data

Applications of GIS in Health Research. West Nile virus

Medical GIS: New Uses of Mapping Technology in Public Health. Peter Hayward, PhD Department of Geography SUNY College at Oneonta

Malaria Infection Has Spatial, Temporal, and Spatiotemporal Heterogeneity in Unstable Malaria Transmission Areas in Northwest Ethiopia

Zoonotic Disease Report September 2 September 8, 2018 (Week 36)

Hierarchical Modeling and Analysis for Spatial Data

Learning Outbreak Regions in Bayesian Spatial Scan Statistics

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Computational Statistics and Data Analysis

Spatial Analysis 1. Introduction

Spatial Variation in Local Road Pedestrian and Bicycle Crashes

INTRODUCTION. In March 1998, the tender for project CT.98.EP.04 was awarded to the Department of Medicines Management, Keele University, UK.

Effective Use of Geographic Maps

Describing Contingency tables

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Geostatistical Modelling, Analysis and Mapping of Epidemiology of Dengue Fever in Johor State, Malaysia

A Geostatistical Approach to Linking Geographically-Aggregated Data From Different Sources

Perinatal Mental Health Profile User Guide. 1. Using Fingertips Software

Shape and scale in detecting disease clusters

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Chapter 4 Part 3. Sections Poisson Distribution October 2, 2008

GIS use in Public Health 1

Environmental Justice and the Environmental Protection Agency s Superfund Program

Making Our Cities Safer: A Study In Neighbhorhood Crime Patterns

Combining Incompatible Spatial Data

Everything is related to everything else, but near things are more related than distant things.

Map your way to deeper insights

Lattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)

Prospective Detection of Outbreaks

In matrix algebra notation, a linear model is written as

Objectives Define spatial statistics Introduce you to some of the core spatial statistics tools available in ArcGIS 9.3 Present a variety of example a

An Introduction to Pattern Statistics

Introducing GIS analysis

Geog183: Cartographic Design and Geovisualization Winter Quarter 2017 Lecture 6: Map types and Data types

Spatial Clusters of Rates

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters

Tracey Farrigan Research Geographer USDA-Economic Research Service

Exploratory Spatial Data Analysis Using GeoDA: : An Introduction

Detection of Clustering in Spatial Data

Announcing a Course Sequence: March 9 th - 10 th, March 11 th, and March 12 th - 13 th 2015 Historic Charleston, South Carolina

Lectures of STA 231: Biostatistics

Transcription:

An Introduction to SaTScan Software to measure spatial, temporal or space-time clusters using a spatial scan approach Marilyn O Hara University of Illinois moruiz@illinois.edu Lecture for the Pre-conference workshop ASCHP 11 Dec 2012

SaTScan Overview Developed by Martin Kulldorff with funding from National Cancer Institute, CDC and other Agencies It is free and can be downloaded at http://www.satscan.org New version (v. 9.0) came out Mar 2011 Used widely in public health and other fields to identify high or low clusters of illness or other events across space and time.

What is a spatial scan? A circle (or ellipse) of varying sizes (from 0 upper distance set by user) systematically goes across (scans) the study area as it defines a dynamic geographic unit, or moving window. In a typical situation, each original geographic unit (county, census tract, etc) is a potential cluster center. Clusters are reported for those circles where observed values are greater than expected values. They can be at any location and be of varying sizes.

What about time? The condition of the event at varying temporal periods is incorporated into the space-time scan. Think of it as a set of cylinders, of varying height (for different time periods) that also scan spatially Each time period can be the temporal start point of a space/time cluster. The number of different time periods (length of time) in clusters varies. Space-time cluster is reported as spatial location, spatial size and the time period. Temporal scan without space is also available.

Location 1 Day 1 Day 2 Day 3 Day 4 Day 5 Location 2 Day 1 Day 2 Day 3 Day 4 Day 5 Clusters defined by radius, time period, and center point For space-time cluster measure, each location is evaluated at a set of circles (or ellipses) of varying sizes. Each time period is evaluated alone as well as with the neighboring periods for each of the spatial sizes.

2004 Mosquito Infection space-time clusters using a space-time permutation Poisson model in SaTScan.

Probability models for discrete (fixed location) scan statistics Count data discrete Poisson (fastest to run; unlimited covariates; can include population) Bernoulli (case/no case) space-time permutation (unlimited covariates; requires only case data) Nominal/Ordinal data ordinal (slowest to run; cases in ordered groups) multinomial (cases are in groups, with order e.g. severity of illness, or nominal, e.g. serotype) Continuous normal (can take positive or negative values) Exponential (designed for survival time analysis)

A few other things You can also use a continuous Poisson model to determine if the events of interest have a spatial pattern, themselves, rather than if values (case counts, etc) have patterns Some of the different model types give the same results in some situations. Some have more options so might be better choices. To identify clusters, a likelihood function is maximized across all locations/times. The maximum likelihood indicates the cluster least likely to have occurred by chance = Cluster 1. Clusters can be statistically significant or not. The p- value is obtained through Monte Carlo hypothesis testing. Secondary clusters are those that are in rank order after Cluster 1, by their likelihood ratio test statistic.

Example 1: Mostashari et al. 2003. Dead bird clusters as an early warning system for West Nile virus activity WNV in NYC, dead bird data for prospective analysis to predict human illness. Bernoulli model, with census tracts As control, used record of prior, unclustered, dead bird counts compared to dead birds in current time period. Identified bird clusters as warning of increased risk for human illness.

Results- Example 1 Each census tract is either part or not part of a cluster. Shading is based on number of clusters in tract up to date of map.

Example 2: Joly et al. 2006. Spatial epidemiology of Chronic Wasting Disease in Wisconsin white-tailed deer Tested if prevalence of CWD in Wisconsin deer was uniform across study area Used Poisson model Cases of diseased deer per square mile grid area The core affected area was identified from the scan and prevalence measured inside that area compared to other areas

Results- Example 2 Circle indicates the single cluster in area (spatial only scan) Shading is prevalence of CWD by square mile.

Example 3: Cech et al. 2007. Orofacial cleft defect births and low-level radioactivity in tap water. Measured spatial clustering of cleft palate/ lip Used geocoded birth records to count cases at several spatial scales (0.01 deg square grids, zipcode) Used Bernoulli clustering with cases and controls per square grid. Most of analysis was of incidence at zipcode level, not SaTScan.

Results - Example 3 High case area was identified. Link between the high cases area and radioactivity in tap water was in analysis discussion.

Example 4: Coleman et al. 2009. Using the SaTScan method to detect local malaria clusters for guiding malaria control programmes. Identify risk areas for malaria within villages for targeted control Created grid of each village and counted households with cases for which treatment was sought Used Bernoulli method for space-only scan of cases at household level. Used retrospective space-time permutation on cases

Results- Example 4 Spatial clustersclusters in 4 of 7 villages. Space time clusters in 2 of 7 (maps on right) One secondary cluster identified

Data for input vary depending on the analysis and model chosen Coordinate file: X,Y location Number of cases / controls Population size Time precision: Year, month, day Study Period

The output by SatScan Results file summary of results Cluster information each row has data for each cluster Location information - each row lists a location and its cluster membership Risk estimates for each location Simulated log likelihood ratios

What SaTScan can and can t do CAN Identify spatial, temporal, spatio-temporal patterns to find clusters in many types of data Provide flexible geographic units/neighborhoods Create output that can be read into GIS or other programs CANNOT Process data for input need SAS, SPSS, Excel, etc Display maps of events and cluster locations need GIS or mapping system Create other statistical models, e.g regression models

Mostashari F, Kulldorff M, Hartman J J, Miller J R, Kulasekera V. Dead bird clusters as an early warning system for West Nile virus activity. Emerg Infect Dis 2003; (9): 641-646. Joly D O, Samuel M D, Langenberg J A, Blanchong J A, Batha C A, Rolley R E, Keane D P, Ribic C A. Spatial epidemiology of chronic wasting disease in Wisconsin white-tailed deer. J Wildl Dis 2006; (42): 578-588. Cech I, Burau K D, Walston J. Spatial distribution of orofacial cleft defect births in Harris County, Texas, 1990 to 1994, and historical evidence for the presence of low-level radioactivity in tap water. South Med J 2007; (100): 560-569. Coleman M, Coleman M, Mabuza A M, Kok G, Coetzee M, Durrheim D N. Using the SaTScan method to detect local malaria clusters for guiding malaria control programmes. Malar J 2009; (8): 68.