ACCELERATING THE DETECTION VECTOR BORNE DISEASES

Similar documents
Space-time visualization of dengue fever outbreaks

USING CLUSTERING SOFTWARE FOR EXPLORING SPATIAL AND TEMPORAL PATTERNS IN NON-COMMUNICABLE DISEASES

Detecting Emerging Space-Time Crime Patterns by Prospective STSS

Spatio-Temporal Cluster Detection of Point Events by Hierarchical Search of Adjacent Area Unit Combinations

Temporal and spatial mapping of hand, foot and mouth disease in Sarawak, Malaysia

Cluster Analysis using SaTScan

Alexander Severin Hohl

Cluster Analysis using SaTScan. Patrick DeLuca, M.A. APHEO 2007 Conference, Ottawa October 16 th, 2007

Spatial Analysis 1. Introduction

An Introduction to SaTScan

Peninsular Florida p Modeled Water Table Depth Arboviral Epidemic Risk Assessment. Current Assessment: 06/08/2008 Week 23 Initial Wetting Phase

WHO lunchtime seminar Mapping child growth failure in Africa between 2000 and Professor Simon I. Hay March 12, 2018

Animating Maps: Visual Analytics meets Geoweb 2.0

Everything is related to everything else, but near things are more related than distant things.

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

Interactive GIS in Veterinary Epidemiology Technology & Application in a Veterinary Diagnostic Lab

Role of GIS in Tracking and Controlling Spread of Disease

Spatial and Temporal Geovisualisation and Data Mining of Road Traffic Accidents in Christchurch, New Zealand

Urban Geo-Informatics John W Z Shi

Spatial Mapping of Dengue Incidence: A Case Study in Hulu Langat District, Selangor, Malaysia

Lecture 4. Spatial Statistics

Cloud-Based Computational Bio-surveillance Framework for Discovering Emergent Patterns From Big Data

GIS for Integrated Pest Management. Christina Hailey. Abstract:

Space-time modelling of sexually transmitted infections in London with focus on Chlamydia trachomatis

David Rogers Health and Climate Foundation

Supplementary Information

GIS CONFERENCE MAKING PLACE MATTER Decoding Health Data with Spatial Statistics

Title: Space and space-time distributions of dengue in a hyper-endemic urban space: The case of Girardot, Colombia

Geographical Information System (GIS)-based maps for monitoring of entomological risk factors affecting transmission of chikungunya in Sri Lanka

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Urban GIS for Health Metrics

Spatio Temporal Density Mapping of a Dynamic Phenomenon

Spatial Risk Modeling for Infectious Disease Surveillance Using Population Movement Data

Scalable and Power-Efficient Data Mining Kernels

GEO Geohazards Community of Practice

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Geostatistical Modelling, Analysis and Mapping of Epidemiology of Dengue Fever in Johor State, Malaysia

FleXScan User Guide. for version 3.1. Kunihiko Takahashi Tetsuji Yokoyama Toshiro Tango. National Institute of Public Health

INTRODUCTION. In March 1998, the tender for project CT.98.EP.04 was awarded to the Department of Medicines Management, Keele University, UK.

University of Lusaka

GIS = Geographic Information Systems;

This report details analyses and methodologies used to examine and visualize the spatial and nonspatial

Lab #3 Background Material Quantifying Point and Gradient Patterns

School on Modelling Tools and Capacity Building in Climate and Public Health April Point Event Analysis

Temporal and Spatial Autocorrelation Statistics of Dengue Fever

STATISTICAL DATA ANALYSIS AND MODELING OF SKIN DISEASES

Summary Visualizations for Coastal Spatial Temporal Dynamics

Question 1. Feedback Lesson 4 Quiz. You submitted this quiz on Tue 27 May :04 PM MYT. You got a score of out of

Applied Geography 60 (2015) 29e36. Contents lists available at ScienceDirect. Applied Geography. journal homepage:

WEB application for the analysis of spatio-temporal data

Eric Delmelle. I. Research Activity. Assistant Professor. Education. Professional Experience. Edited book. Refereed Publications

An Introduction to Pattern Statistics

Exploratory Spatial Data Analysis (ESDA)

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

Comparison of spatial methods for measuring road accident hotspots : a case study of London

Historical Spatio-temporal data in current GIS

GRADUATE CERTIFICATE PROGRAM

Site-specific Prediction of Mosquito Abundance using Spatio-Temporal Geostatistics

A Hypothesis-Free Multiple Scan Statistic with Variable Window

Improving SLEUTH Calibration with a Genetic Algorithm

Dr Arulsivanathan Naidoo Statistics South Africa 18 October 2017

A COMPREHENSIVE PROCESS FOR SPATIOTEMPORAL ANALYSIS OF NETWORK-BASED PHENOMENA

Bayesian Spatial Health Surveillance

GIS to Support West Nile Virus Program

Monsuru Adepeju 1 and Andy Evans 2. School of Geography, University of Leeds, LS21 1HB 1

ArcGIS for Desktop. ArcGIS for Desktop is the primary authoring tool for the ArcGIS platform.

Finding Common Ground Through GIS

FUNDAMENTALS OF GEOINFORMATICS PART-II (CLASS: FYBSc SEM- II)

Downloaded from:

Scalable Bayesian Event Detection and Visualization

Results and Findings. I. Theory of linear time subset scanning

GIS Spatial Statistics for Public Opinion Survey Response Rates

METHODS FOR STATISTICS

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Analysis of traffic incidents in METU campus

A spatial literacy initiative for undergraduate education at UCSB

Environment and Health in Urban Area: Analysis of Mosquito Density in Kuala Lumpur, Malaysia

Spatio-temporal modeling of weekly malaria incidence in children under 5 for early epidemic detection in Mozambique

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis:

Expectation-based scan statistics for monitoring spatial time series data

The Case for Space in the Social Sciences

Why Spatial Data Mining?

Ministry of Health and Long-Term Care Geographic Information System (GIS) Strategy An Overview of the Strategy Implementation Plan November 2009

Spatial Data Science. Soumya K Ghosh

Kernel Density Estimation (KDE) vs. Hot-Spot Analysis - Detecting Criminal Hot Spots in the City of San Francisco

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Fighting Cholera With Maps

The CrimeStat Program: Characteristics, Use, and Audience

Spatiotemporal Analysis of Dengue Hemorrhagic Fever and Dengue Shock Syndrome Incidence within Trinidad, West Indies

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

OUTBREAK INVESTIGATION AND SPATIAL ANALYSIS OF SURVEILLANCE DATA: CLUSTER DATA ANALYSIS. Preliminary Programme / Draft_2. Serbia, Mai 2013

Cyberinfrastructure and CyberGIS: Recent Advances and Key Themes

CLUSTER ANALYSIS OF HEALTH DATA INVOLVING GENERAL DISEASE CLUSTER SHAPES AND MULTIPLE VARIABLES

A SPACE-TIME SCAN STATISTIC FOR DETECTION OF TB OUTBREAKS IN THE SAN FRANCISCO HOMELESS POPULATION

Disaster Management & Recovery Framework: The Surveyors Response

In matrix algebra notation, a linear model is written as

From Viral product spreading to Poverty prediction Data Science from a telecom perspective

FMEL Arboviral Epidemic Risk Assessment: Second Update for 2012 Week 23 (June 12, 2012)

GEOGRAPHIC INFORMATION SYSTEMS FOR SPATIAL DISEASE CLUSTER DETECTION, SPATIO-TEMPORAL DISEASE MAPPING, AND HEALTH SERVICE PLANNING PING YIN

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Transcription:

NC State s Geospatial Analytics Forum October 22 2015 ACCELERATING THE DETECTION of SPACE-TIME CLUSTERS for VECTOR BORNE DISEASES Dr. Eric Delmelle Geography & Earth Sciences, University of North Carolina at Charlotte, N.C., U.S.A.

Infectious diseases Disease Malaria Dengue Fever HIV Cholera Cases/Deaths 3.2 billion at risk, 214 million cases 438,000 deaths ½ of the world population at risk 390 million cases/year 10-25K deaths 35 millions; 1-1.5 million deaths 3-5 million cases, 100K deaths Ebola 25K cases, 10K deaths (2014) West Nile Virus 5,674 (286 deaths), 2012 Increasing research in the spread of infectious diseases Changing climate patterns Movement of individuals (air) and increasing urbanization World Health Organization (WHO); Centers for Disease Control and Prevention (CDC)

Context Context Vector-borne diseases (malaria, dengue fever) spread very quickly under suitable conditions Prompt and accurate space-time analyses are necessary to detect outbreaks in a timely manner and take appropriate steps to curb expansion of the disease Accurate information should be disseminated among the public to limit the risk of further contagion. Duncombe, J., et al., 2012. Review: geographical information systems for dengue surveillance. American Journal of Tropical Medicine and Hygiene, 86 (5), 753. Eisen, L. and Eisen, R., 2011. Using geographic information systems and decision support systems for the prediction, prevention, and control of vector-borne diseases. Annual Review of Entomology, 56 (1), 41 61 Gubler, D.J. and Clark, G.G., 1995. Dengue/dengue hemorrhagic fever: the emergence of a global health problem. Emerging Infectious Diseases, 1 (2), 55..

My recent research to the field of vector-borne diseases Dengue fever, Cali, Colombia - 2010 Dengue fever data from SIVIGILA (Public Health Surveillance System). Implicit geographic information. Data for the year 2010 [x, y, time]; n = 9,555 Delmelle, E., Casas, I., Rojas, J. H., & Varela, A. (2013). Spatio-temporal patterns of Dengue Fever in Cali, Colombia. International Journal of Applied Geospatial Research, 4(4), 58-75.

My recent research to the field of vector-borne diseases

My recent research to the field of vector-borne diseases

My recent research to the field of vector-borne diseases

My recent research to the field of vector-borne diseases

My recent research to the field of vector-borne diseases

Objectives Objectives Present a framework to accelerate the computation of the space-time kernel density estimation Octree-based adaptive decomposition of the space-time domain Parallel computing approach to reduce time Visualize results in a 3D framework Delmelle, E., Dony, C., Casas, I., Jia, M., & Tang, W. (2014). Visualizing the impact of space-time uncertainties on dengue fever patterns. International Journal of Geographical Information Science, (ahead-of-print), 1-21.

Literature Existing work on space-time modeling Space-time clustering and geosurveillance are critical research areas when spatial epidemiological data have a temporal signature. Significant research dedicated to the mapping of disease intensities across an area and associated clusters. Exploratory (KDE) and confirmatory approaches Space-time clustering is a more recent research thread Computational and visualization issues Aldstadt, J. and Getis, A., 2006. Using AMOEBA to create a spatial weights matrix and identify spatial clusters. Geographical Analysis, 38 (4), 327 343. Bailey, T. and Gatrell, Q., 1995. Interactive spatial data analysis. Edinburgh Gate, England: Pearson Education Limited. Delmelle, E., et al., 2011. H.E.L.P: a GIS-based health exploratory analysis tool for practitioners. Applied Spatial Analysis and Policy, 4 (2), 113 137. Knox, G.E., 1964. The detection of space-time iterations. Journal of the Royal Statistical Society,13, 25 29. Kulldorff, M., et al., 2005. A space time permutation scan statistic for disease outbreak detection. PLoS Medicine, 2 (3), e59. Rogerson, P. and Yamada, I., 2008. Statistical detection and surveillance of geographic clusters. Boca Raton, FL: CRC Press.

Literature Existing work on space-time modeling Authors Year Statistic Kulldorff et al. 1997 Spatial Scan Kulldorff et al. 2005 Space-time scan Goovaerts and Jacquez 2005 Local Moran s I Takahashi et al. 2008 Flexibly shaped space-time scan Wang and Armstrong 2009 G* i (d) Tang, Feng and Jia 2014 Ripley s K Delmelle et al. 2014 STKDE Kulldorff, M. (1997). A spatial scan statistic. Communications in Statistics-Theory and methods, 26(6), 1481-1496. Kulldorff, M., Heffernan, R., Hartman, J., Assunçao, R., & Mostashari, F. (2005). A space-time permutation scan statistic for disease outbreak detection.plos medicine, 2(3), 216. Takahashi, K., Kulldorff, M., Tango, T., & Yih, K. (2008). A flexibly shaped space-time scan statistic for disease outbreak detection and monitoring.international Journal of Health Geographics, 7(1), 14. Goovaerts, P., & Jacquez, G. M. (2005). Detection of temporal changes in the spatial distribution of cancer rates using local Moran s I and geostatistically simulated spatial neutral models. Journal of Geographical Systems, 7(1), 137-159.

Literature Existing work on space-time modeling Authors Year Statistic Kulldorff et al. 1997 Spatial Scan Kulldorff et al. 2005 Space-time scan Goovaerts and Jacquez 2005 Local Moran s I Takahashi et al. 2008 Flexibly shaped space-time scan Wang and Armstrong 2009 G* i (d) Tang, Feng and Jia 2014 Ripley s K Delmelle et al. 2014 STKDE Tang, W., Feng, W., & Jia, M. (2014). Massively parallel spatial point pattern analysis: Ripley s K function accelerated using graphics processing units.international Journal of Geographical Information Science, (aheadof-print), 1-28. Delmelle, E., Dony, C., Casas, I., Jia, M., & Tang, W. (2014). Visualizing the impact of space-time uncertainties on dengue fever patterns. International Journal of Geographical Information Science, 28(5), 1107-1127. Wang, S., & Armstrong, M. P. (2009). A theoretical approach to the use of cyberinfrastructure in geographical analysis. International Journal of Geographical Information Science, 23(2), 169-193.

STKDE Space-time kernel density estimation Delmelle, E., Dony, C., Casas, I., Jia, M., & Tang, W. (2014). Visualizing the impact of space-time uncertainties on dengue fever patterns. International Journal of Geographical Information Science, (ahead-of-print), 1-21.

STKDE Space-time kernel density estimation Delmelle, E., Dony, C., Casas, I., Jia, M., & Tang, W. (2014). Visualizing the impact of space-time uncertainties on dengue fever patterns. International Journal of Geographical Information Science, (ahead-of-print), 1-21. Demšar, U. and Virrantaus, K., 2010. Space time density of trajectories: exploring spatio-temporal patterns in movement data. International Journal of Geographical Information Science, 24 (10), 1527 1542. Nakaya, T., and Yano, K., 2010. Visualising crime clusters in a space-time cube: an exploratory data-analysis approach using space-time kernel density estimation and scan statistics. Transactions in GIS, 14 (3), 223 239.

Visualization Visualization The kernel density volume is rendered by estimating a density value for each of the voxels Color-coding each voxel based on its density value (rainbow) Voxels with lower kernel density values are assigned a higher level of transparency whereas higher densities are kept opaque e1 v i t c pe Pers Perspec tive 2

Visualization Temporal cross section Cross-sectional cuts Visualization

Sensitivity Impact of parameters Impact of STKDE parameters on computational effort

Rationale Rationale Performance and computational complexity hinder scientific investigation Increase in volume, variety, and velocity of spatiotemporal data Geographic search strategies explode with increasing problem size Need for ability to process large datasets in acceptable amount of time Parallel computing: Divide task into subproblems and distribute to CPUs

Rationale Rationale

Approach Parallel computation approach The derivation of STKDE of a point pattern consumes considerable computing resources Conduct STKDE for each voxel Conduct STKDE on observed and simulated datasets Armstrong, M. P., and P. J. Densham. 1992. Domain decomposition for parallel processing of spatial problems. Computers, environment and urban systems 16 (6):497-513. Ding, Y., and P. J. Densham. 1996. Spatial strategies for parallel spatial modelling. IJGIS 10 (6):669-698.

Recursive octree decomposition of spatiotemporal domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

Recursive octree decomposition of spatiotemporal domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

Recursive octree decomposition of spatiotemporal domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

Recursive octree decomposition of spatiotemporal domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

Recursive octree decomposition of spatiotemporal domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

Recursive octree decomposition of spatiotemporal domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

Recursive octree decomposition of 3D domain creates subdomains (SDs) of similar computational intensity. Computational intensity = f(#voxels * #data points) Space-filling curve Distribute SDs to processors, equalizing computational intensity, Compute kernel density for each subdomain separately Collect subdomain outputs and reassemble. http://www.beyond3d.com/images/articles/ingenu-part-2/octree.png, 3/3/14 www.forceflow.be, 3/3/14

algorithm Initialize 1st level of decomposition assign data points to SDs. Iterate through SDs, check If nvoxels > 0 If npoints > 0 skip If npoints < npoints_max OR npoints(parent) == max(npoints(children)) Write Points to file Else: decompose ID, X, Y, T 1, 4,2,1 2,6,7,1 3,4,5,4.

Space-time buffer implementation Buffer distances equal spatial- and temporal bandwidths Creates overlap and therefore, data redundancy Points assigned to 1 8 SDs SD 3 SD 1 SD4 SD2

Space-time buffer implementation Buffer distances equal spatial- and temporal bandwidths Creates overlap and therefore, data redundancy Points assigned to 1 8 SDs SD 3 SD 1 SD4 SD2

Space-time buffer implementation Buffer distances equal spatial- and temporal bandwidths Creates overlap and therefore, data redundancy Points assigned to 1 8 SDs SD 3 SD 1 SD4 SD2

Space-time buffer implementation Buffer distances equal spatial- and temporal bandwidths Creates overlap and therefore, data redundancy Points assigned to 1 8 SDs SD 3 SD 1 SD4 SD2

Space-time buffer implementation Buffer distances equal spatial- and temporal bandwidths Creates overlap and therefore, data redundancy Points assigned to 1 8 SDs SD 3 SD 1 SD4 SD2

Space-time buffer implementation Buffer distances equal spatial- and temporal bandwidths Creates overlap and therefore, data redundancy Points assigned to 1 8 SDs SD 3 SD 1 SD4 SD2

Results Parameters and Performance metric Dataset Maximum number of data points per SD Voxel resolution Bandwidths Average execution times of 20 runs Speedup: S(n) = T s /T p (n) R 2 for CI = f(#vox * #dp) Dengue fever 100 100m * 100m * 1d Spatial (m) 500 750 1000 Temporal (d) 3 7 14

Results Result: Execution times & speedup h s 3 500 750 1000 h t 7 14

Results Computational intensity estimation h s 3 500 750 1000 7 h t 14

Discussion Discussion 3D domain decomposition for parallel STKDE of massive spatiotemporal geographic data Achieve speedup ( ) Achieve workload balance ( ) Sensitivity analysis ( ) Space-time uncertainty ( )

Questions Department of Geography and Earth Sciences University of North Carolina at Charlotte Charlotte, NC 28223 Tel: (704) 687-5991 Email: Eric.Delmelle@uncc.edu