GIS CONFERENCE MAKING PLACE MATTER Decoding Health Data with Spatial Statistics

Similar documents
Spatial Pattern Analysis: Mapping Trends and Clusters

Spatial Pattern Analysis: Mapping Trends and Clusters. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Objectives Define spatial statistics Introduce you to some of the core spatial statistics tools available in ArcGIS 9.3 Present a variety of example a

Application of the Getis-Ord Gi* statistic (Hot Spot Analysis) to seafloor organisms

Using GIS: to analyze vector borne disease. David Attaway Esri

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Lecture 4. Spatial Statistics

Local Spatial Autocorrelation Clusters

Exploratory Spatial Data Analysis (ESDA)

The Nature of Geographic Data

Math Sec 4 CST Topic 7. Statistics. i.e: Add up all values and divide by the total number of values.

An Introduction to Pattern Statistics

Exploratory Spatial Data Analysis Using GeoDA: : An Introduction

Geoprocessing Tools at ArcGIS 9.2 Desktop

Output: -Observed Mean Distance -Expected Mean Distance - Nearest Neighbor Index -Graphic report - Test variables:

Using Spatial Statistics and Geostatistical Analyst as Educational Tools

This lab exercise will try to answer these questions using spatial statistics in a geographic information system (GIS) context.

Everything is related to everything else, but near things are more related than distant things.

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis:

Spatial Analysis 2. Spatial Autocorrelation

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Cluster and outlier analysis applications in minerals exploration

GIS Spatial Statistics for Public Opinion Survey Response Rates

Texas A&M University

Lab #3 Background Material Quantifying Point and Gradient Patterns

Approaches to Spatial Analysis. Flora Vale, Linda Beale, Mark Harrower, Clint Brown Esri Redlands

Introduction to Spatial Statistics and Modeling for Regional Analysis

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

GIS Test Drive What a Geographic Information System Is and What it Can Do. Alison Davis-Holland

Spatial Analysis with ArcGIS Pro STUDENT EDITION

Introduction GeoXp : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées.

Using GIS to Evaluate Rural Emergency Medical Services (EMS)

Tracey Farrigan Research Geographer USDA-Economic Research Service

Exploratory Spatial Data Analysis (And Navigating GeoDa)

Spatial analysis of electricity demand patterns in Greece: Application of a GIS-based methodological framework

Outline. 15. Descriptive Summary, Design, and Inference. Descriptive summaries. Data mining. The centroid

Modeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Geog 469 GIS Workshop. Data Analysis

A.1 Spatial Statistics in ArcGIS

Identification of Economic Clusters Using ArcGIS Spatial Statistics. Joseph Frizado Bruce Smith Michael Carroll

GIS CONCEPTS ARCGIS METHODS AND. 3 rd Edition, July David M. Theobald, Ph.D. Warner College of Natural Resources Colorado State University

BROOKINGS May

How GIS Can Help With Tribal Safety Planning

Math 2311 Sections 4.1, 4.2 and 4.3

USING HYPERSPECTRAL IMAGERY

Chapter 6. Fundamentals of GIS-Based Data Analysis for Decision Support. Table 6.1. Spatial Data Transformations by Geospatial Data Types

LOCATION OF PREHOSPITAL CARE BASIS THROUGH COMBINED FUZZY AHP AND GIS METHOD

GEO 463-Geographic Information Systems Applications. Lecture 1

The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale

CS 350 A Computing Perspective on GIS

Statistics for Managers using Microsoft Excel 6 th Edition

SPATIAL ANALYSIS. Transformation. Cartogram Central. 14 & 15. Query, Measurement, Transformation, Descriptive Summary, Design, and Inference

A GIS Spatial Model for Defining Management Zones in Orchards Based on Spatial Clustering

In matrix algebra notation, a linear model is written as

THIRD EDITION. An Introduction to. J. Chapman McGrew, Jr. Salisbury University. Arthur J. Lembo, Jr. Salisbury University. Charles B.

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Spatial Analysis 1. Introduction

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time

Universitat Autònoma de Barcelona Facultat de Filosofia i Lletres Departament de Prehistòria Doctorat en arqueologia prehistòrica

Semi Supervised Distance Metric Learning

ESSENTIALS OF LEARNING. Math 7. Math A MATH B. Pre-Calculus. Math 12X. Visual Basic

GIST 4302/5302: Spatial Analysis and Modeling

Why Is It There? Attribute Data Describe with statistics Analyze with hypothesis testing Spatial Data Describe with maps Analyze with spatial analysis

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

A Modified DBSCAN Clustering Method to Estimate Retail Centre Extent

Data Visualization (CSE 578) About this Course. Learning Outcomes. Projects

Data Exploration and Unsupervised Learning with Clustering

SAIT & FortisAlberta Forge a Partnership to put New Ideas on the Map

Nearest Neighbor Search with Keywords: Compression

Analyzing the Geospatial Rates of the Primary Care Physician Labor Supply in the Contiguous United States

Human-Caused Fires: An Exploratory Pattern Analysis

GCSE 185/09 MATHEMATICS HIGHER TIER PAPER 1. P.M. WEDNESDAY, 9 November hours. Centre Number. Candidate Number. Surname.

Classification methods

Urban Erosion Potential Risk Mapping with GIS

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor

Spatial-Temporal Analytics with Students Data to recommend optimum regions to stay

Dr Arulsivanathan Naidoo Statistics South Africa 18 October 2017

The Use of Local Moran s I in Determining the Effectiveness of Location for Gas Extraction

Snowfall Patterns Indicated by SNOTEL Measurements NOAA

Basics of Geographic Analysis in R

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV)

Chapter 2 Spatial and Spatiotemporal Big Data Science

SASI Spatial Analysis SSC Meeting Aug 2010 Habitat Document 5

IV Course Spring 14. Graduate Course. May 4th, Big Spatiotemporal Data Analytics & Visualization

Reminders. Homework due tomorrow Quiz tomorrow

Temporal and spatial mapping of hand, foot and mouth disease in Sarawak, Malaysia

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

GIS CONCEPTS ARCGIS METHODS AND. 2 nd Edition, July David M. Theobald, Ph.D. Natural Resource Ecology Laboratory Colorado State University

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

Lesson 8: Graphs of Simple Non Linear Functions

ArcGIS Pro: Analysis and Geoprocessing. Nicholas M. Giner Esri Christopher Gabris Blue Raster

INTRODUCTION TO GIS. Dr. Ori Gudes

Types of Spatial Data

Models to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Using GIS to Identify Pedestrian- Vehicle Crash Hot Spots and Unsafe Bus Stops

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System

Studying Topography, Orographic Rainfall, and Ecosystems (STORE)

Transcription:

esri HEALTH AND HUMAN SERVICES GIS CONFERENCE MAKING PLACE MATTER Decoding Health Data with Spatial Statistics Flora Vale Jenora D Acosta

Wait a minute

Wait a minute Where is Lauren??

Wait a minute Where is Lauren??

DAY 1 What are Spatial Statistics? Measuring Geographic Distributions Analyzing Patterns Multivariate Analysis DAY 2 Subjectivity of Maps Mapping Clusters Space Time Pattern Mining Modeling Spatial Relationships

Are you using Data or Information?

Spreadsheets Data or Information?

Maps Data or Information?

When you look at a spreadsheet

You ask for more Mean Standard Deviations Min and Max

Same goes for maps!

We can do more

What are Spatial Statistics?

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

Spatial Statistics are a set of exploratory techniques for describing and modeling spatial distributions, patterns, processes, and relationships.

coincidence area connectivity proximity orientation direction length

python script!

Measuring Geographic Distributions

Descriptive Statistics

Mean Median Standard Deviation

Central Feature identifies the most centrally located feature in a point, line, or polygon feature class

Y X

Y X

Y X

Y X

Y X

Y central feature X

Mean Center identifies the geographic center (or the center of concentration) for a set of features

Y (9,18) (13,12) (22,23) (14,14) (25,24) (24,16) (12,12) (14,8) (18,12) X

(14,14) (13,12) Y (25,24) (24,16) (22,23) (18,12) (17,15) mean center X (12,12) (14,8) (9,18) mean = (17,15)

Median Center identifies the location that minimizes overall Euclidean distance to the features in a dataset

X (9,18) (13,12) (22,23) (14,14) (25,24) (24,16) (12,12) (14,8) (18,12) Y

Y X: 25. 24. 22. 18. 14. 14. 13. 12. 9 Y: 24. 23. 18. 16. 14. 12. 12. 12. 8 median = (14,14) median center X

Mean vs Median?

Y (176,138) (9,18) (13,12) (22,23) (14,14) (25,24) (24,16) (12,12) (14,8) (18,12) X

Y (14,14) (13,12) (25,24) (24,16) (22,23) (18,12) (12,12) (14,8) (9,18) (74,38) X mean = (33,28)

Y X: 74. 25. 24. 22. 18. 14. 14. 13. 12. 9 Y: 38. 24. 23. 18. 16. 14. 12. 12. 12. 8 median = X (16,15) (14,14) (13,12) (25,24) (24,16) (22,23) (18,12) (12,12) (14,8) (9,18) (74,38) mean = (33,28)

Y X: 74. 25. 24. 22. 18. 14. 14. 13. 12. 9 Y: 38. 24. 23. 18. 16. 14. 12. 12. 12. 8 median = X (16,15) (14,14) (13,12) (25,24) (24,16) (22,23) (18,12) (12,12) (14,8) (9,18) (74,38) mean = (33,28)

Y (33,28) (16,15) (17,15) (14,14) X

Linear Directional Mean identifies the mean direction, length, and geographic center for a set of lines

Y X

Y X

Standard Distance measures the degree to which features are concentrated or dispersed around the geometric mean center

Y mean center X

Y X

Directional Distribution (Standard Deviational Ellipse) creates standard deviational ellipses to summarize the spatial characteristics of geographic features: central tendency, dispersion, and directional trends

Y mean center X

Y X

Analyzing Patterns

Global Inferential Statistics

Is there a PATTERN?

Clustered

Clustered Dispersed

Complete Spatial RANDOMNESS

z-scores and p-values z-scores -2.58-1.96-1.65 0 1.65 1.96 2.58 p-value -0.01-0.05-0.10 0 0.10 0.05 0.01

z-scores and p-values 90% random 90% 95% 95% 99% 99% z-scores -2.58-1.96-1.65 0 1.65 1.96 2.58 p-values 0.01 0.05 0.10 0 0.10 0.05 0.01

How intense is the clustering?

How intense is the clustering? Compared to what?

How intense is the clustering? Compared to where?

How intense is the clustering? Compared to when?

An Example 1969 1985 2002 Is the spatial segregation of the rich and the poor increasing or decreasing in New York?

An Example 1969 1985 2002 5.21 4.26 2.40 6 4 2 1969 1979 1989 1999

Average Nearest Neighbor calculates a nearest neighbor index based on the average distance from each feature to its nearest neighboring feature

average distance = expected average = distance

ANN ratio = observed expected

ANN ratio = observed expected Clustered < 1

ANN ratio = observed expected Clustered < 1 < Dispersed

Spatial Autocorrelation (Moran s I) measures spatial autocorrelation based on feature locations and attribute values using the Global Moran's I statistic

Are distances and values correlated? Clustered Dispersed

16 14 17 16 32

16 2 14 17 16 18 32

Incremental Spatial Autocorrelation measures spatial autocorrelation for a series of distances and optionally creates a line graph of those distances and their corresponding z-scores

5 Spatial Autocorrelation by Distance 4 z-score 3 2 1 60 80 100 120 140 160 180 Distance (meters)

5 Spatial Autocorrelation by Distance 4 z-score 3 2 1 60 80 100 120 140 160 180 Distance (meters)

5 Spatial Autocorrelation by Distance 4 z-score 3 2 1 60 80 100 120 140 160 180 Distance (meters)

5 Spatial Autocorrelation by Distance 4 Fixed Distance Band = 70.3 meters z-score 3 2 1 60 80 100 120 140 160 180 Distance (meters)

High/Low Clustering (Getis-Ord General G) measures the concentration of high or low values for a given study area

What type of clustering is present in the data? High value clusters Low value clusters

Multi-Distance Spatial Cluster Analysis (Ripleys K Function) determines whether features, or the values associated with features, exhibit statistically significant clustering or dispersion over a range of distances

5 Spatial Clustering by Distance 4 3 L(d) 2 1 60 80 100 120 140 160 180 Distance (meters)

5 Spatial Clustering by Distance 4 statistically significant clustering 3 L(d) 2 statistically significant dispersion 1 60 80 100 120 140 160 180 Distance (meters)

dispersed

clustered

Grouping Analysis groups features based on feature attributes and optional spatial/temporal constraints

K Means 3 groups 4 groups

K Means 2 groups 3 groups 4 groups

K Means 2 groups 3 groups 4 groups

K Means 2 groups 3 groups 4 groups

Minimum Spanning Tree

Minimum Spanning Tree

Minimum Spanning Tree

Minimum Spanning Tree

MATH!

interpret results through box plots

Similarity Search identifies which candidate features are most similar or most dissimilar to one or more input features based on feature attributes

potential store locations

potential store locations high performing store

LocID PopDensity AvIncome DistToCompetition

LocID PopDensity AvIncome DistToCompetition LocID PopDensity AvIncome DistToCompetition

LocID PopDensity AvIncome DistToCompetition Rank by how similar they are to based on: PopDensity AvIncome DistToCompetition LocID PopDensity AvIncome DistToCompetition

5 2 4 1 3

Input Feature(s) to Match Candidate Features Attributes of Interest PopDensity AvIncome DistToCompetition

Input Feature(s) to Match average Candidate Features Attributes of Interest PopDensity AvIncome DistToCompetition

3 Match Methods

3 Match Methods Attribute Values

3 Match Methods Attribute Values Ranked Attribute Values

3 Match Methods Attribute Values Ranked Attribute Values Attribute Profiles

Attribute Values

Attribute Values standardize attributes Z-transform: ( x - x ) / SD

Attribute Values Population = 14,159 standardize attributes % Uninsured =.26 Distance (km) = 535.89

Attribute Values Population = -.7932 standardize attributes % Uninsured = 3.8462 Distance (km) =.6433

Attribute Values standardize attributes subtract candidate from target square differences sum squares

Ranked Attribute Values

Ranked Attribute Values rank attributes

Ranked Attribute Values 9.5 rank attributes 8.8 8.3 4.1 2.7 0.2

Ranked Attribute Values rank attributes 9.5 8.8 8.3 4.1 6 5 4 3 2.7 0.2 2 1

Ranked Attribute Values rank attributes subtract candidate from target square differences sum squares

Attribute Profiles

Attribute Profiles standardize attributes cosine similarity index cosine similarity index * Must have at least 2 attributes of interest

Dengue Fever Risk in Kenya

questions?

fvale@esri.com jdacosta@esri.com Want to learn more??? esriurl.com/spatialstats