Introduction GeoXp : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées.

Similar documents
Exploratory spatial data analysis with

GeoXp: An R Package for Exploratory Spatial Data Analysis

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System

Geometric Algorithms in GIS

Mapping and Analysis for Spatial Social Science

Exploratory Spatial Data Analysis (And Navigating GeoDa)

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Exploratory Spatial Data Analysis Using GeoDA: : An Introduction

Exploratory Spatial Data Analysis and GeoDa

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

This lab exercise will try to answer these questions using spatial statistics in a geographic information system (GIS) context.

Outline ESDA. Exploratory Spatial Data Analysis ESDA. Luc Anselin

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University. GeoDa: Spatial Autocorrelation

Creating and Managing a W Matrix

Exploratory Spatial Data Analysis (ESDA)

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK

GeoXp: an R package for exploratory spatial data analysis

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

Report on Kriging in Interpolation

EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China

Contents. 13. Graphs of Trigonometric Functions 2 Example Example

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Task 1: Open ArcMap and activate the Spatial Analyst extension.

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Module 2A Turning Multivariable Models into Interactive Animated Simulations

CSISS Tools and Spatial Analysis Software

Geography 281 Map Making with GIS Project Four: Comparing Classification Methods

GIST 4302/5302: Spatial Analysis and Modeling

Spatial Analysis 1. Introduction

Talking feet: Scatterplots and lines of best fit

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University. GeoDa: Exploratory Spatial Data Analysis

GIS CONFERENCE MAKING PLACE MATTER Decoding Health Data with Spatial Statistics

Why Is It There? Attribute Data Describe with statistics Analyze with hypothesis testing Spatial Data Describe with maps Analyze with spatial analysis

USING DOWNSCALED POPULATION IN LOCAL DATA GENERATION

Where to Invest Affordable Housing Dollars in Polk County?: A Spatial Analysis of Opportunity Areas

Application of mathematical, statistical, graphical or symbolic methods to maximize chemical information.

Experimental Design and Data Analysis for Biologists

Development of Integrated Spatial Analysis System Using Open Sources. Hisaji Ono. Yuji Murayama

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Bayesian Hierarchical Models

Dr.Weerakaset Suanpaga (D.Eng RS&GIS)

Investigating Weather and Climate with Google Earth Teacher Guide

Comparing whole genomes

Chart types and when to use them

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap

Attribute Data. ArcGIS reads DBF extensions. Data in any statistical software format can be

Looking at Communities: Comparing Urban and Rural Neighborhoods

Google Earth. Overview: Targeted Alaska Grade Level Expectations: Objectives: Materials: Grades 5-8

Statistics Toolbox 6. Apply statistical algorithms and probability models

Visualizing Data: Basic Plot Types

Catchment Delineation Workflow

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

The Implementation of Autocorrelation-Based Regioclassification in ArcMap Using ArcObjects

Appendix A Summary of Tasks. Appendix Table of Contents

Lecture 8. Spatial Estimation

6. Let C and D be matrices conformable to multiplication. Then (CD) =

Two problems to be solved. Example Use of SITATION. Here is the main menu. First step. Now. To load the data.

Geovisualization. Luc Anselin. Copyright 2016 by Luc Anselin, All Rights Reserved

Contents. 9. Fractional and Quadratic Equations 2 Example Example Example

ENGRG Introduction to GIS

Analyzing the Earth Using Remote Sensing

Geographical Information Systems

Lecture 2. A Review: Geographic Information Systems & ArcGIS Basics

Lecture 5 Geostatistics

Design of Experiments

STATISTICS 479 Exam II (100 points)

Introduction to Coastal GIS

Tracey Farrigan Research Geographer USDA-Economic Research Service

Esri EADA10. ArcGIS Desktop Associate. Download Full Version :

YEAR 8 GEOGRAPHY. Landscapes & Landforms

Tutorial 8 Raster Data Analysis

Applying MapCalc Map Analysis Software

Soc/Anth 597 Spatial Demography March 14, GeoDa 0.95i Exercise A. Stephen A. Matthews. Outline. 1. Background

Classification Exercise UCSB July 2006

Appendix F. Computational Statistics Toolbox. The Computational Statistics Toolbox can be downloaded from:

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

8. Classification and Regression Trees (CART, MRT & RF)

The GeoDa Book. Exploring Spatial Data. Luc Anselin

MATH 1150 Chapter 2 Notation and Terminology

Describing Bivariate Relationships

Studying Topography, Orographic Rainfall, and Ecosystems (STORE)

Lab 1 Uniform Motion - Graphing and Analyzing Motion

Studying Topography, Orographic Rainfall, and Ecosystems (STORE)

An Introduction to Applied Multivariate Analysis with R

Spatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS

BNG 495 Capstone Design. Descriptive Statistics

Statistics: A review. Why statistics?

Subject CS1 Actuarial Statistics 1 Core Principles

Math/Stat Classification of Spatial Analysis and Spatial Statistics Operations

Where Do Overweight Women In Ghana Live? Answers From Exploratory Spatial Data Analysis

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

Using Tables and Graphing Calculators in Math 11

Introduction. Part I: Quick run through of ESDA checklist on our data

Short Answer Questions: Answer on your separate blank paper. Points are given in parentheses.

Units. Exploratory Data Analysis. Variables. Student Data

Data Exploration and Unsupervised Learning with Clustering

Hit the brakes, Charles!

GED 554 IT & GIS. Lecture 6 Exercise 5. May 10, 2013

Transcription:

Presentation of Presentation of Use of Introduction : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées. Authors of : Christine Thomas-Agnan, Anne Ruiz-Gazen and Yves Aragon (Gremaq, Toulouse). Version S-plus (2001), Matlab (2003) and R (2005)., Anne Ruiz-Gazen and Christine Thomas-Agnan Gremaq (Groupe de Recherche en Economie Mathématique et Quantitative), UT1 and LSP (Laboratoire de Statistique et Probabilité), UT3 15 June 2006 Presentation of Use of Presentation of Use of Introduction Introduction Authors of : Christine Thomas-Agnan, Anne Ruiz-Gazen and Yves Aragon (Gremaq, Toulouse). Version S-plus (2001), Matlab (2003) and R (2005). Tool for researchers in spatial statistics, geography, ecology,... for anyone who possesses a data set of variables measured at geographical sites or on geographical zones (cities, counties, countries,...) Authors of : Christine Thomas-Agnan, Anne Ruiz-Gazen and Yves Aragon (Gremaq, Toulouse). Version S-plus (2001), Matlab (2003) and R (2005). Tool for researchers in spatial statistics, geography, ecology,... for anyone who possesses a data set of variables measured at geographical sites or on geographical zones (cities, counties, countries,...) Main objective : exploratory spatial data analysis and coupling between a map and a statistical graph.

Presentation of Use of Presentation of Use of links dynamically statistical plots like boxplot, histogram, scatterplot,... with a map. Example : sites selected by mouse clicking on bars of histogram are represented in red on the map. Selection of a zone on the map results in the automatic highlighting of the corresponding points on the statistical graph. Selection of a portion of the graph results in the automatic highlighting of the corresponding sites on the map. Presentation of Use of Presentation of Use of Example : sites selected by mouse clicking on bars of histogram are represented in red on the map. Example : sites selected by points or polygon on the map are represented in red on the histogram.

Presentation of Use of Presentation of Use of Functionality Example : sites selected by points or polygon on the map are represented in red on the histogram. Descriptive functions : univariate or bivariate graphs such as histogram, barplot, scatterplot,... Presentation of Use of Presentation of Use of Functionality Functionality Descriptive functions : univariate or bivariate graphs such as histogram, barplot, scatterplot,... : angle plot, drift plot,... Descriptive functions : univariate or bivariate graphs such as histogram, barplot, scatterplot,... : angle plot, drift plot,... Econometric functions : Moran plot, neighbour plot,...

Presentation of Use of Presentation of Use of Functionality Use of Descriptive functions : univariate or bivariate graphs such as histogram, barplot, scatterplot,... Point pattern analysis : a site is represented on the map by a point. Coordinates are included into two vectors of numeric values. : angle plot, drift plot,... Econometric functions : Moran plot, neighbour plot,... Multivariate functions : principal component analysis, cluster analysis... Presentation of Use of Presentation of Use of Use of Use of Point pattern analysis : a site is represented on the map by a point. Coordinates are included into two vectors of numeric values. Data set is a vector or a matrix of numerical or categorical variables, associated to each spatial unit. Point pattern analysis : a site is represented on the map by a point. Coordinates are included into two vectors of numeric values. Data set is a vector or a matrix of numerical or categorical variables, associated to each spatial unit. Some options are common to all functions (spatial contours, bubbles,...) and others depend on the function (number of bars for histogram, use of colors for barplot,...).

Presentation of Use of Presentation of Use of Use of Examples Point pattern analysis : a site is represented on the map by a point. Coordinates are included into two vectors of numeric values. Example 1 (univariate analysis) : histomap(latitude, longitude, var, opt1, opt2a) Data set is a vector or a matrix of numerical or categorical variables, associated to each spatial unit. Some options are common to all functions (spatial contours, bubbles,...) and others depend on the function (number of bars for histogram, use of colors for barplot,...). Remark : no lattice/area data analysis and no use of Spatial Classes such as in spdep library (Roger Bivand). Presentation of Use of Presentation of Use of Examples Examples Example 1 (univariate analysis) : histomap(latitude, longitude, var, opt1, opt2a) Example 2 (bivariate analysis) : scattermap(latitude, longitude, var 1, var 2, opt1, opt2b) Example 1 (univariate analysis) : histomap(latitude, longitude, var, opt1, opt2a) Example 2 (bivariate analysis) : scattermap(latitude, longitude, var 1, var 2, opt1, opt2b) Example 3 (multivariate analysis) : pcamap(latitude, longitude, dataset, opt1, opt2c)

Presentation of Use of Presentation of Use of Examples Common options Example 1 (univariate analysis) : histomap(latitude, longitude, var, opt1, opt2a) Example 2 (bivariate analysis) : scattermap(latitude, longitude, var 1, var 2, opt1, opt2b) Example 3 (multivariate analysis) : pcamap(latitude, longitude, dataset, opt1, opt2c) On the map : Example 4 (spatial econometric analysis) : moranplotmap(latitude, longitude, var, W, opt1, opt2d) Presentation of Use of Presentation of Use of Common options Common options On the map : On the map : Possibility to draw spatial contours. Possibility to draw spatial contours. Possibility to cross out specific sites.

Presentation of Use of Presentation of Use of Common options Common options Possibility to improve the map : Possibility to draw spatial contours. Possibility to cross out specific sites. Possibility to improve the map : Possibility to draw spatial contours. Possibility to cross out specific sites. Possibility to print labels. Possibility to print labels. Possibility to draw bubbles. Presentation of Use of Presentation of Use of Call of a function drawing of a graph, a map and creation of a tcltk Window (library tcltk), tool for selecting a zone on the map (or on the graph) and using options. Use of : data set considered Spatial units s j (j = 1,...226) : public schools in French Midi-Pyrénées region. Spatial position of a school is represented by the centroid of the commune where the school is located.

Presentation of Use of Presentation of Use of Use of : data set considered Use of : data set considered Spatial units s j (j = 1,...226) : public schools in French Midi-Pyrénées region. Spatial position of a school is represented by the centroid of the commune where the school is located. Observed variables : number of students, age of staff, different fields of study, status of teacher,... during the 2003-2004 school year. Spatial units s j (j = 1,...226) : public schools in French Midi-Pyrénées region. Spatial position of a school is represented by the centroid of the commune where the school is located. Observed variables : number of students, age of staff, different fields of study, status of teacher,... during the 2003-2004 school year. Aim : determine characteristic of schools according to their localization in rural, periurban and urban area (Insee classification). Presentation of Presentation of Scatter plot Scatter plot scattermap(x,y,var1,var2,carte=coord,listvar=dataset) Possibility to draw conditionnal quantile regression for a given list of quantiles orders (ex : quantiles=c(0.05,0.5,0.95)). Number of classroom number of students

Presentation of Presentation of Scatter plot Possibility to add a graph among histogram, scatterplot and barplot, by selecting variable(s) (given in listvar) on tcltk window. ginimap(lat,long,var,carte=coord,listvar=dataset) Computes a Lorentz curve from var and calculates the Gini Index associate. The schools selected are mainly included in urban area. Presentation of Gini Index = 0.28 ; the 50% schools with lowest number of students contain only 29% of the student population. Presentation of Possibility to draw bubbles by selecting on tcltk window a numerical variable among listvar. The schools selected are mainly included in rural area.

Presentation of Presentation of Kernel Density function estimate densitymap(lat,long,var,carte=coord,listvar=dataset) Use of function bkde.r (library Kernsmooth) with option kernel=triweight. Possibility to choose an interval by mouse clicking on the graph on the extremities of interval or by directly specifying values. Kernel Density function estimate Possibility to change smoothing parameter α with a cursor on tcltk window. Urban schools with a high coefficient are close to Toulouse. Presentation of Angle plot Drift map Presentation of Angle plot Drift map Angle plot angleplotmap(lat,long,var,carte=coord,listvar=dataset) Represents the absolute difference between the value of var at two sites as a function of the angle between vector s i s j and the x-axis. Angle plot Possibility to draw only couple of sites whose absolute difference is larger than the 95% regression quantile smooth spline. Variable average cost per student. 2 schools in the North and in the west with high average cost per student.

Presentation of Angle plot Drift map Presentation of Spatial weight matrix Neighbour plot Moran plot Drift map function Spatial weight matrix Creates a grid on the map and calculates the mean and median for each cell. The right plot (resp. left plot) represents row (resp. column) means and median. Average number of student per class reaches a maximum in the center of the region which corresponds to the surroundings of Toulouse. Neighbour plot Presentation of Spatial weight matrix Neighbour plot Moran plot of the values of a variable at neighbouring sites for a neighbourhood structure given by a spatial weight matrix. Possibility to create spatial weight matrix based on a threshold distance or based on a given number of nearest neighbors. ex : Moran plot W = Presentation of 0 1 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 1 0 0 0 1 0 Spatial weight matrix Neighbour plot Moran plot On x-axis, is represented (Var Var) and on y-axis, is represented W (Var Var). It also calcultes Moran s I statistic (see nonnormoran.r) and gives a p-value associated to the gaussian test or to the permutation test. 2 schools with a high difference with its neighbours for variable average distance between school and home. W built with 3 nearest neighbors.

Presentation of Spatial weight matrix Neighbour plot Moran plot Presentation of Moran plot High spatial autocorrelation for average number of student by class : Moran I statistic = 0.19 with p value < 0.0001. PCA pcamap(lat,long,dataset) Draws the plots summarizing a generalized Principal Component Analysis (PCA). Schools with high values and whose neighbours have also high values are mainly included in urbain area.) Presentation of Presentation of Cluster Analysis clustermap(lat,long,dataset,number.of.class) Performs a classification of the sites from the variables included in dataset and computes a bar plot of the clusters calculated. Two methods : Hierarchical Cluster Analysis (see hclust.r) or k-means clustering (see kmeans.r)

Presentation of Presentation of Perspective Writing of a user manual. There seem to exist no link between calculated clusters and geographical area. Presentation of Presentation of Perspective Perspective Writing of a user manual. To include others functions as scatterplot 3D, sliced inverse regression, projection pursuit, spatial autoregressive models (SAR, SEM,...). Writing of a user manual. To include others functions as scatterplot 3D, sliced inverse regression, projection pursuit, spatial autoregressive models (SAR, SEM,...). Integration of Micromaps software (Carr and Symanzik).

Presentation of Presentation of Conclusion Conclusion You can download Geoχp on http ://w3.univ-tlse1.fr/gremaq/statistique/ You can download Geoχp on http ://w3.univ-tlse1.fr/gremaq/statistique/ Don t try to ask me a demonstration Presentation of Conclusion You can download Geoχp on http ://w3.univ-tlse1.fr/gremaq/statistique/ Don t try to ask me a demonstration Merci de votre attention!!