Using R for Epidemiological Analysis

Size: px
Start display at page:

Download "Using R for Epidemiological Analysis"

Transcription

1 Using R for Epidemiological Analysis Practical 3: Disease Mapping in R Introduction In this session we will use R to do some disease mapping. We will work through an example of how to create basic maps in R by creating a map of Estonia. We will then move on to an example of creating smoothed SMRs and plot them on a map by working with data on hospital admissions for chronic obstructive pulmonary disease (COPD) for England between All data required for this practical can be found in the folder Data. You will need the following files shapefiles and information for Estonia split by regions (Estonia.shp, Estonia.dbf, Estonia.prj, Estonia.shx) shapefiles and information for England split by local authorities (englandlocalauthority.shp, englandlocalauthority.dbf) observed numbers of hospital admissions by local authority (copdmortalityobserved.csv) expected numbers of hospital admissions by local authority (copdmortalityobserved.csv). Preliminaries We need the following packages spdep - Package to create spatial objects (such as neighbourhood matrix). shapefiles - Package to read and write shapefiles. CARBayes - Package to fit spatial GLMMs. rgdal - Package to handle spatial objects. rgeos - Package to handle spatial objects. As before we use the install.packages() function to download and install the packages that we need and the require() function to load them into the R library. install.packages("spdep") install.packages("shapefiles") install.packages("carbayes") install.packages("rgdal") install.packages("rgeos") require(spdep) require(shapefiles) require(carbayes) require(rgdal) require(rgeos) Before reading in any data for this practical you will need to ensure that you are in the correct folder. As explained in Practical, you can use the setwd() function

2 setwd("chosen_directory_path") If you cannot get the setwd() to work, go to Session > Set Working Directory > Choose Directory in the toolbar on the top. Creating an Estonia map To create maps, we need to read in the relevant shapefiles. The files Estonia.shp, Estonia.dbf, Estonia.prj and Estonia.shx contain the location, shape, and attributes of Estonia by region. The boundaries (shapefiles) were obtained from A map of Estonia split by more local authories can also be downloaded from here. The function readogr() will read these shapefiles into R. Estonia <- readogr(dsn=.,layer= Estonia ) OGR data source with driver: ESRI Shapefile Source: ".", layer: "Estonia" with 6 features It has 2 fields To check that the shapefiles have been read into R correctly, we can plot the boundaries using the plot() function. plot(estonia) We use simulated random numbers to create the map here. If you have your own data, we can use that. To plot this simulated data we use the ssplot() function. # Adding random numbers to each Estonia region Estonia@data$rnum <- rnorm(nrow(estonia@data),0,) # Cut off points for legend of plot range <- seq(min(estonia@data$rnum)-0.0, max(estonia@data$rnum)+0.0, length.out=5) # Creating Map spplot(obj = Estonia, # Spatial object to be plotted 2

3 zcol = c("rnum"), # Choice of the column the object you are plotting. xlab = "Easting", # Label of the x column ylab = "Northing", # Label of the y column at = range, # Break points for colours col.regions = hsv(0.6, seq(0.2,, length.out=0), )) # Create a set of colours.5.0 Northing Easting Reading in England Data and Shapefiles The observed and expected COPD counts in England by local authority need to be read into R. These are in csv format, so we use the read.csv() function. observed <- read.csv(file="copdmortalityobserved.csv", row.names=) expected <- read.csv(file="copdmortalityexpected.csv", row.names=) To check that the data has been read into R correctly, we can use the head() function, which prints the first six rows of a dataset. head(observed) name Y200 Y2002 Y2003 Y2004 Y2005 Y2006 Y AA City of London LB AB Barking and Dagenham LB AC Barnet LB AD Bexley LB AE Brent LB AF Bromley LB Y2008 Y2009 Y200 00AA 0 00AB AC AD

4 00AE AF head(expected) E200 E2002 E2003 E2004 E2005 E AA AB AC AD AE AF E2007 E2008 E2009 E200 00AA AB AC AD AE AF To familiarise ourselves with the data, we can summarise it using the summary() function. This will allow us to check for anomalies in our data. summary(observed) name Y200 Y2002 Y2003 Adur CD : Min. : 2.00 Min. : 0.00 Min. : 3.00 Allerdale CD : st Qu.: st Qu.: st Qu.: Amber Valley CD: Median : Median : Median : Arun CD : Mean : 68.0 Mean : Mean : Ashfield CD : 3rd Qu.: rd Qu.: rd Qu.: Ashford CD : Max. : Max. : Max. : (Other) :38 Y2004 Y2005 Y2006 Y2007 Min. :.00 Min. :.00 Min. :.00 Min. : 5.00 st Qu.: st Qu.: st Qu.: st Qu.: Median : Median : 5.00 Median : Median : Mean : Mean : Mean : Mean : rd Qu.: rd Qu.: rd Qu.: rd Qu.: Max. : Max. : Max. : Max. : Y2008 Y2009 Y200 Min. :.00 Min. : 0.00 Min. :.00 st Qu.: st Qu.: st Qu.: Median : 5.00 Median : Median : 5.00 Mean : 7.40 Mean : Mean : rd Qu.: rd Qu.: rd Qu.: 8.25 Max. : Max. : Max. :44.00 summary(expected) E200 E2002 E2003 E2004 Min. : Min. : 2.68 Min. : Min. : 2.75 st Qu.: st Qu.: st Qu.: st Qu.: Median : Median : Median : Median : Mean : Mean : Mean : Mean :

5 3rd Qu.: rd Qu.: rd Qu.: rd Qu.: Max. : Max. :37.27 Max. : Max. : E2005 E2006 E2007 E2008 Min. : Min. : 2.96 Min. : Min. : 3.5 st Qu.: st Qu.: st Qu.: st Qu.: Median : Median : Median : Median : Mean : Mean : Mean : Mean : rd Qu.: rd Qu.: rd Qu.: rd Qu.: Max. : Max. : Max. : Max. : E2009 E200 Min. : Min. : st Qu.: st Qu.: Median : Median : Mean : Mean : rd Qu.: rd Qu.: Max. : Max. : Does it look like R has read in the data correctly? 2. Are there any strange values in our dataset? To create SMR maps, we need to read in the relevant shapefiles. As we are going to attach the SMRs to the shapefiles to create maps, we use a di erent method to creating a spatial object than above. Instead we build the spatial object from scratch. The files englandlocalauthority.shp and englandlocalauthority.dbf contain the location, shape, and attributes of English local authorities. The functions read.shp() and read.dbf() will read shapefiles into R. shp <- read.shp(shp.name="englandlocalauthority.shp") shp$shp[[23]]$points <- shp$shp[[23]]$points[-c(460, 46, 462, 463, 464, 465), ] dbf <- read.dbf(dbf.name="englandlocalauthority.dbf") dbf$dbf <- dbf$dbf[,c(2,,3:7)] Modelling the Raw SMRs Now that we have read in the data, we can calculate raw SMRs. Remember that SMR_raw <- observed[,-]/expected SMR = observed expected To change attributes of a dataset such as the column names, we use the names() function. names(smr_raw) <- c("smr200", "SMR2002", "SMR2003", "SMR2004", "SMR2005", "SMR2006", "SMR2007", "SMR2008", "SMR2009", "SMR200") It is important that we check that no errors have occurred at any stages, so we check by summarising the results using the head() and summary() functions. 5

6 head(smr_raw) SMR200 SMR2002 SMR2003 SMR2004 SMR2005 SMR2006 SMR AA AB AC AD AE AF SMR2008 SMR2009 SMR200 00AA AB AC AD AE AF summary(smr_raw) SMR200 SMR2002 SMR2003 SMR2004 Min. : Min. : Min. :0.366 Min. : st Qu.: st Qu.: st Qu.:0.859 st Qu.: Median : Median :.068 Median :.0209 Median : Mean :.0349 Mean :.0508 Mean :.0895 Mean : rd Qu.: rd Qu.: rd Qu.:.307 3rd Qu.:.858 Max. :.986 Max. :2.28 Max. : Max. :.98 SMR2005 SMR2006 SMR2007 SMR2008 Min. : Min. : Min. : Min. :0.32 st Qu.: st Qu.:0.745 st Qu.: st Qu.: Median : Median :0.90 Median : Median : Mean :.026 Mean : Mean : Mean : rd Qu.: rd Qu.:.586 3rd Qu.:.679 3rd Qu.:.979 Max. :2.244 Max. : Max. :.8528 Max. : SMR2009 SMR200 Min. : Min. : st Qu.: st Qu.: Median : Median : Mean : Mean : rd Qu.: rd Qu.:.335 Max. :.8507 Max. : Does it look like the SMRs have been estimated correctly? 2. Are there any strange values? Mapping the Raw SMRs To plot these SMRs to a map, we need to attach them to the shapefiles. The function combine.data.shapefile() allows us to combine shapefiles to plot later. SMRspatial_raw <- combine.data.shapefile(data = SMR_raw, # Dataset to attach shp = shp, # Shapefile dbf = dbf) # Database file 6

7 Now that the estimates are attached to the shapefile, the function spplot() allows us to create a map which colours the local authorities by the SMR estimate. range <- seq(min(smr_raw$smr200)-0.0, max(smr_raw$smr200)+0.0, length.out=) spplot(obj = SMRspatial_raw, # Spatial object to be plotted zcol = c("smr200"), # Choice of the column the object you are plotting. xlab = "Easting", # Label of the x column ylab = "Northing", # Label of the y column at = range, # Break points for colours col = "transparent", # Choice of colour col.regions = hsv(0.6, seq(0.2,, length.out=0), )) # Create a set of colours 2.0 Northing Easting. What do you notice about this new plot? 2. Are there any strange values? 3. If so, do you believe that these are the truth or perhaps sampling error? Modelling the smoothed SMRs To calculate the smoothed SMRs, we first need to create a neighbourhood structure. poly2nb() and nb2mat() can be used to create this. The functions # Creates the neighbourhood W.nb <- poly2nb(smrspatial_raw, row.names = rownames(smrspatial_raw)) # Creates a matrix for following function call W.mat <- nb2mat(w.nb, style="b") Here, we use first neighbours to define our structure, so any local authority that shares a border with another are considered neighbours. 7

8 The function S.CARleroux() allows us to use this neighbourhood structure and performs a Bayesian analysis, to create a smoothed set of observed values as discussed in the lecture. model <- S.CARleroux(formula=observed$Y200~offset(log(expected$E200)), # Model Formula family="poisson", # Choosing Poisson Regression W=W.mat, # Neighbourhood matrix burnin=20000, # Number of burn in samples n.sample=00000, # Number of MCMC samples thin=0, fix.rho=true, rho=) The new smoothed values can be extracted from the model output and divided by the expected values to allow comparison between the two methods. SMR200 <- model$fitted.values / expected$e200 SMR_smooth <- as.data.frame(smr200, row.names = rownames(observed)) Again, we check that no errors have occurred, by summarising the results using the head() and summary() functions. head(smr_smooth) SMR200 00AA AB AC AD AE AF summary(smr_smooth) SMR200 Min. : st Qu.:0.793 Median :0.924 Mean : rd Qu.:.0802 Max. : Does it look like the SMRs have been estimated correctly? 2. Are there any strange values? Mapping the Smoothed SMRs Similarly to before, we attach the values of the smoothed SMRs to the shapefile using the combine.data.shapefile() function and create a map using the spplot() function. SMRspatial_smooth <- combine.data.shapefile(data = SMR_smooth, # Dataset to attach shp = shp, # Shapefile dbf = dbf) # Database file 8

9 spplot(obj = SMRspatial_smooth, # Spatial object to be plotted zcol = c("smr200"), # Choice of the column the object you are plotting. xlab = "Easting", # Label of the x column ylab = "Northing", # Label of the y column at = range, # Break points for colours col = "transparent", # Choice of colour col.regions = hsv(0.6, seq(0.2,, length.out=0), )) # Create a set of colours 2.0 Northing Easting. What do you notice about this new plot? 2. Are there any di erences between the smoothed and raw estimates? You may want to repeat this analysis for another year to see if the results are similar. Carefully go through the previous sections and change any references from 200 to any year that you wish between

Package CARBayes. March 5, 2013

Package CARBayes. March 5, 2013 Package CARBayes March 5, 2013 Type Package Title Spatial areal unit modelling Version 1.3 Date 2013-02-19 Author Duncan Lee Maintainer Duncan Lee Description This package implements

More information

CARBayes version 4.4: An R Package for Spatial Areal Unit Modelling with Conditional Autoregressive Priors

CARBayes version 4.4: An R Package for Spatial Areal Unit Modelling with Conditional Autoregressive Priors CARBayes version 4.4: An R Package for Spatial Areal Unit Modelling with Conditional Autoregressive Priors Duncan Lee University of Glasgow Abstract This is a vignette for the R package CARBayes version

More information

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio Applied Spatial Data Analysis with R 4:1 Springer Contents Preface VII 1 Hello World: Introducing Spatial Data 1 1.1 Applied Spatial Data Analysis

More information

Introduction to ggplot2. ggplot2 is (in my opinion) one of the best documented packages in R. The full documentation for it can be found here:

Introduction to ggplot2. ggplot2 is (in my opinion) one of the best documented packages in R. The full documentation for it can be found here: Introduction to ggplot2 This practical introduces a slightly different method of creating plots in R using the ggplot2 package. The package is an implementation of Leland Wilkinson's Grammar of Graphics-

More information

Package CARBayes. June 1, 2017

Package CARBayes. June 1, 2017 Type Package Package CARBayes June 1, 2017 Title Spatial Generalised Linear Mixed Models for Areal Unit Data Version 5.0 Date 2017-06-01 Author Maintainer Implements a class

More information

Bayesian Hierarchical Models

Bayesian Hierarchical Models Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology

More information

Using the Stock Hydrology Tools in ArcGIS

Using the Stock Hydrology Tools in ArcGIS Using the Stock Hydrology Tools in ArcGIS This lab exercise contains a homework assignment, detailed at the bottom, which is due Wednesday, October 6th. Several hydrology tools are part of the basic ArcGIS

More information

Analysing Spatial Data in R: Why spatial data in R?

Analysing Spatial Data in R: Why spatial data in R? Analysing Spatial Data in R: Why spatial data in R? Roger Bivand Department of Economics Norwegian School of Economics and Business Administration Bergen, Norway 31 August 2007 Why spatial data in R? What

More information

CARBayes version 5.0: An R Package for Spatial Areal Unit Modelling with Conditional Autoregressive Priors

CARBayes version 5.0: An R Package for Spatial Areal Unit Modelling with Conditional Autoregressive Priors CARBayes version 5.0: An R Package for Spatial Areal Unit Modelling with Conditional Autoregressive Priors Duncan Lee University of Glasgow Abstract This is a vignette for the R package CARBayes version

More information

Spatial plotting in R

Spatial plotting in R Spatial plotting in R Contents Introduction................................................. 1 Plotting various spatial objects...................................... 1 2 2 Using plot().............................................

More information

Naive Bayes classification

Naive Bayes classification Naive Bayes classification Christos Dimitrakakis December 4, 2015 1 Introduction One of the most important methods in machine learning and statistics is that of Bayesian inference. This is the most fundamental

More information

Instructions for Mapping 2011 Census Data

Instructions for Mapping 2011 Census Data Instructions for Mapping 2011 Census Data To map 2011 census data, you must download the census boundary files and the census data separately, then join the two files in ArcMap. In this guide, we will

More information

R Demonstration ANCOVA

R Demonstration ANCOVA R Demonstration ANCOVA Objective: The purpose of this week s session is to demonstrate how to perform an analysis of covariance (ANCOVA) in R, and how to plot the regression lines for each level of the

More information

GEOG 3830 Geographic Information Systems

GEOG 3830 Geographic Information Systems 1 GEOG 3830 Geographic Information Systems Lab 08: Spatial Relationships The objective of this lab exercise is to introduce students to a technique commonly used to evaluate the most basic types of spatial

More information

Introduction to the North Carolina SIDS data set

Introduction to the North Carolina SIDS data set Introduction to the North Carolina SIDS data set Roger Bivand August 14, 2006 1 Introduction This data set was presented first in Symons et al. (1983), analysed with reference to the spatial nature of

More information

Data Creation and Editing

Data Creation and Editing 11.520:A Workshop on Geographical Information Systems 1.188: Urban Planning and Social Science Laboratory Data Creation and Editing Based in part on notes by Prof. Joseph Ferreira and Michael Flaxman Lulu

More information

Findings from a Search for R Spatial Analysis Support. Donald L. Schrupp Wildlife Ecologist Colorado Division of Wildlife (Retired)

Findings from a Search for R Spatial Analysis Support. Donald L. Schrupp Wildlife Ecologist Colorado Division of Wildlife (Retired) Findings from a Search for R Spatial Analysis Support Donald L. Schrupp Wildlife Ecologist Colorado Division of Wildlife (Retired) Findings from a Search for R Spatial Analysis Support === Approach Steps

More information

Metric Predicted Variable on One Group

Metric Predicted Variable on One Group Metric Predicted Variable on One Group Tim Frasier Copyright Tim Frasier This work is licensed under the Creative Commons Attribution 4.0 International license. Click here for more information. Prior Homework

More information

ENV101 EARTH SYSTEMS

ENV101 EARTH SYSTEMS ENV101 EARTH SYSTEMS Practical Exercise 2 Introduction to ArcMap and Map Projections 1. OVERVIEW This practical is designed to familiarise students with the use of ArcMap for visualising spatial data and

More information

Machine Learning - TP

Machine Learning - TP Machine Learning - TP Nathalie Villa-Vialaneix - nathalie.villa@univ-paris1.fr http://www.nathalievilla.org IUT STID (Carcassonne) & SAMM (Université Paris 1) Formation INRA, Niveau 3 Formation INRA (Niveau

More information

February Getting started. A guide to unpacking Administrative Boundaries

February Getting started. A guide to unpacking Administrative Boundaries Product: Prepared: February 2016 Getting started A guide to unpacking Administrative Boundaries Disclaimer PSMA Australia believes this publication to be correct at the time of printing and does not accept

More information

Andrew B. Lawson 2019 BMTRY 763

Andrew B. Lawson 2019 BMTRY 763 BMTRY 763 FMD Foot and mouth disease (FMD) is a viral disease of clovenfooted animals and being extremely contagious it spreads rapidly. It reduces animal production, and can sometimes be fatal in young

More information

Univariate Descriptive Statistics for One Sample

Univariate Descriptive Statistics for One Sample Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 4 5 6 7 8 Introduction Our first step in descriptive statistics is to characterize the data in a single group of

More information

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK WHAT IS GEODA? Software program that serves as an introduction to spatial data analysis Free Open Source Source code is available under GNU license

More information

georglm : a package for generalised linear spatial models introductory session

georglm : a package for generalised linear spatial models introductory session georglm : a package for generalised linear spatial models introductory session Ole F. Christensen & Paulo J. Ribeiro Jr. Last update: October 18, 2017 The objective of this page is to introduce the reader

More information

Exercise 2: Working with Vector Data in ArcGIS 9.3

Exercise 2: Working with Vector Data in ArcGIS 9.3 Exercise 2: Working with Vector Data in ArcGIS 9.3 There are several tools in ArcGIS 9.3 used for GIS operations on vector data. In this exercise we will use: Analysis Tools in ArcToolbox Overlay Analysis

More information

Package CARBayesdata

Package CARBayesdata Type Package Package CARBayesdata June 8, 2016 Title Data Used in the Vignettes Accompanying the CARBayes and CARBayesST Packages Version 2.0 Date 2016-06-10 Author Duncan Lee Maintainer Duncan Lee

More information

ArcGIS 9 ArcGIS StreetMap Tutorial

ArcGIS 9 ArcGIS StreetMap Tutorial ArcGIS 9 ArcGIS StreetMap Tutorial Copyright 2001 2008 ESRI All Rights Reserved. Printed in the United States of America. The information contained in this document is the exclusive property of ESRI. This

More information

Review on Spatial Data

Review on Spatial Data Week 12 Lecture: Spatial Autocorrelation and Spatial Regression Introduction to Programming and Geoprocessing Using R GEO6938 4172 GEO4938 4166 Point data Review on Spatial Data Area/lattice data May be

More information

Generalized Linear Models in R

Generalized Linear Models in R Generalized Linear Models in R NO ORDER Kenneth K. Lopiano, Garvesh Raskutti, Dan Yang last modified 28 4 2013 1 Outline 1. Background and preliminaries 2. Data manipulation and exercises 3. Data structures

More information

Getting started with spatstat

Getting started with spatstat Getting started with spatstat Adrian Baddeley, Rolf Turner and Ege Rubak For spatstat version 1.54-0 Welcome to spatstat, a package in the R language for analysing spatial point patterns. This document

More information

The coxvc_1-1-1 package

The coxvc_1-1-1 package Appendix A The coxvc_1-1-1 package A.1 Introduction The coxvc_1-1-1 package is a set of functions for survival analysis that run under R2.1.1 [81]. This package contains a set of routines to fit Cox models

More information

Urban Canopy Tool User Guide `bo`

Urban Canopy Tool User Guide `bo` Urban Canopy Tool User Guide `bo` ADMS Urban Canopy Tool User Guide Version 2.0 June 2014 Cambridge Environmental Research Consultants Ltd. 3, King s Parade Cambridge CB2 1SJ UK Telephone: +44 (0)1223

More information

The evdbayes Package

The evdbayes Package The evdbayes Package April 19, 2006 Version 1.0-5 Date 2006-18-04 Title Bayesian Analysis in Extreme Theory Author Alec Stephenson and Mathieu Ribatet. Maintainer Mathieu Ribatet

More information

Introduction to Simple Linear Regression

Introduction to Simple Linear Regression Introduction to Simple Linear Regression 1. Regression Equation A simple linear regression (also known as a bivariate regression) is a linear equation describing the relationship between an explanatory

More information

Working with Map Projections

Working with Map Projections Working with Map Projections Rachel Applebaum and Margaret M. Maher Author of Lining Up Data in ArcGIS: a guide to map projections Second edition Some common problems that prevent data from aligning correctly

More information

Holiday Assignment PS 531

Holiday Assignment PS 531 Holiday Assignment PS 531 Prof: Jake Bowers TA: Paul Testa January 27, 2014 Overview Below is a brief assignment for you to complete over the break. It should serve as refresher, covering some of the basic

More information

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests

Two Correlated Proportions Non- Inferiority, Superiority, and Equivalence Tests Chapter 59 Two Correlated Proportions on- Inferiority, Superiority, and Equivalence Tests Introduction This chapter documents three closely related procedures: non-inferiority tests, superiority (by a

More information

Downloaded from:

Downloaded from: Camacho, A; Kucharski, AJ; Funk, S; Breman, J; Piot, P; Edmunds, WJ (2014) Potential for large outbreaks of Ebola virus disease. Epidemics, 9. pp. 70-8. ISSN 1755-4365 DOI: https://doi.org/10.1016/j.epidem.2014.09.003

More information

ST-Links. SpatialKit. Version 3.0.x. For ArcMap. ArcMap Extension for Directly Connecting to Spatial Databases. ST-Links Corporation.

ST-Links. SpatialKit. Version 3.0.x. For ArcMap. ArcMap Extension for Directly Connecting to Spatial Databases. ST-Links Corporation. ST-Links SpatialKit For ArcMap Version 3.0.x ArcMap Extension for Directly Connecting to Spatial Databases ST-Links Corporation www.st-links.com 2012 Contents Introduction... 3 Installation... 3 Database

More information

Inferences on Linear Combinations of Coefficients

Inferences on Linear Combinations of Coefficients Inferences on Linear Combinations of Coefficients Note on required packages: The following code required the package multcomp to test hypotheses on linear combinations of regression coefficients. If you

More information

Understanding p Values

Understanding p Values Understanding p Values James H. Steiger Vanderbilt University James H. Steiger Vanderbilt University Understanding p Values 1 / 29 Introduction Introduction In this module, we introduce the notion of a

More information

Estimating the long-term health impact of air pollution using spatial ecological studies. Duncan Lee

Estimating the long-term health impact of air pollution using spatial ecological studies. Duncan Lee Estimating the long-term health impact of air pollution using spatial ecological studies Duncan Lee EPSRC and RSS workshop 12th September 2014 Acknowledgements This is joint work with Alastair Rushworth

More information

Computer Vision, Laboratory session 2

Computer Vision, Laboratory session 2 Centre for Mathematical Sciences, February 2011 Computer Vision, Laboratory session 2 In the first laboratory you downloaded datorsee.zip from the homepage: http://www.maths.lth.se/matematiklth/vision/datorsee

More information

Lecture 5 : The Poisson Distribution

Lecture 5 : The Poisson Distribution Lecture 5 : The Poisson Distribution Jonathan Marchini November 5, 2004 1 Introduction Many experimental situations occur in which we observe the counts of events within a set unit of time, area, volume,

More information

RNA-seq. Differential analysis

RNA-seq. Differential analysis RNA-seq Differential analysis DESeq2 DESeq2 http://bioconductor.org/packages/release/bioc/vignettes/deseq 2/inst/doc/DESeq2.html Input data Why un-normalized counts? As input, the DESeq2 package expects

More information

GIS Lecture 4: Data. GIS Tutorial, Third Edition GIS 1

GIS Lecture 4: Data. GIS Tutorial, Third Edition GIS 1 GIS Lecture 4: Data GIS 1 Outline Data Types, Tables, and Formats Geodatabase Tabular Joins Spatial Joins Field Calculator ArcCatalog Functions GIS 2 Data Types, Tables, Formats GIS 3 Directly Loadable

More information

Working with Digital Elevation Models in ArcGIS 8.3

Working with Digital Elevation Models in ArcGIS 8.3 Working with Digital Elevation Models in ArcGIS 8.3 The homework that you need to turn in is found at the end of this document. This lab continues your introduction to using the Spatial Analyst Extension

More information

Land Cover Data Processing Land cover data source Description and documentation Download Use Use

Land Cover Data Processing Land cover data source Description and documentation Download Use Use Land Cover Data Processing This document provides a step by step procedure on how to build the land cover data required by EnSim. The steps provided here my be long and there may be short cuts (like using

More information

In order to save time, the following data files have already been preloaded to the computer (most likely under c:\odmc2012\data\)

In order to save time, the following data files have already been preloaded to the computer (most likely under c:\odmc2012\data\) ODMC2012 QGIS Ex1 Schools and Public Transport In this exercise, users will learn how to a) map the location of secondary schools in and around the Southampton area; b) overlay the school location map

More information

Watershed Delineation

Watershed Delineation Watershed Delineation Jessica L. Watkins, University of Georgia 2 April 2009 Updated by KC Love February 25, 2011 PURPOSE For this project, I delineated watersheds for the Coweeta synoptic sampling area

More information

Community Health Needs Assessment through Spatial Regression Modeling

Community Health Needs Assessment through Spatial Regression Modeling Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular

More information

2013 Eric Pitman Summer Workshop in Computational Science....an introduction to R, statistics, programming, and getting to know datasets

2013 Eric Pitman Summer Workshop in Computational Science....an introduction to R, statistics, programming, and getting to know datasets 2013 Eric Pitman Summer Workshop in Computational Science...an introduction to R, statistics, programming, and getting to know datasets Introducing the Workshop Project Here's what we'll cover: The story

More information

LAB B. The Local Stellar Population

LAB B. The Local Stellar Population KEELE UNIVERSITY SCHOOL OF PHYSICAL AND GEOGRAPHICAL SCIENCES Year 1 ASTROPHYSICS LAB LAB B. The Local Stellar Population R. D. Jeffries version January 2010 D. E. McLaughlin Throughout this experiment

More information

Collated responses from R-help on confidence intervals for risk ratios

Collated responses from R-help on confidence intervals for risk ratios Collated responses from R-help on confidence intervals for risk ratios Michael E Dewey November, 2006 Introduction This document arose out of a problem assessing a confidence interval for the risk ratio

More information

Grid to LandGrid Volumetrics Workflow

Grid to LandGrid Volumetrics Workflow Grid to LandGrid Volumetrics Workflow Petra User Group Session Series Armin Schafer Technical Advisor Petra/Kingdom/GeoSyn/LogArc Encana Amphitheatre Nov 22, 2011 noon 1 pm PURPOSE: to demonstrate the

More information

UNST 232 Mentor Section Assignment 5 Historical Climate Data

UNST 232 Mentor Section Assignment 5 Historical Climate Data UNST 232 Mentor Section Assignment 5 Historical Climate Data 1 introduction Informally, we can define climate as the typical weather experienced in a particular region. More rigorously, it is the statistical

More information

GEOGRAPHICAL INFORMATION SYSTEMS. GIS Foundation Capacity Building Course. Introduction

GEOGRAPHICAL INFORMATION SYSTEMS. GIS Foundation Capacity Building Course. Introduction GEOGRAPHICAL INFORMATION SYSTEMS. GIS Foundation Capacity Building Course. Introduction In recent times digital mapping has become part and parcel of our daily lives with experience from Google Maps on

More information

Environmental Systems Research Institute

Environmental Systems Research Institute Introduction to ArcGIS ESRI Environmental Systems Research Institute Redlands, California 2 ESRI GIS Development Arc/Info (coverage model) Versions 1-7 from 1980 1999 Arc Macro Language (AML) ArcView (shapefile

More information

Introduction to ArcGIS 10.2

Introduction to ArcGIS 10.2 Introduction to ArcGIS 10.2 Francisco Olivera, Ph.D., P.E. Srikanth Koka Lauren Walker Aishwarya Vijaykumar Keri Clary Department of Civil Engineering April 21, 2014 Contents Brief Overview of ArcGIS 10.2...

More information

Diversity and composition of termites in Amazonia CSDambros 09 January, 2015

Diversity and composition of termites in Amazonia CSDambros 09 January, 2015 Diversity and composition of termites in Amazonia CSDambros 09 January, 2015 Put the abstract here Missing code is being cleaned. Abstract Contents 1 Intro 3 2 Load required packages 3 3 Import data 3

More information

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1 Models for Count and Binary Data Poisson and Logistic GWR Models 24/07/2008 GWR Workshop 1 Outline I: Modelling counts Poisson regression II: Modelling binary events Logistic Regression III: Poisson Regression

More information

Metric Predicted Variable on Two Groups

Metric Predicted Variable on Two Groups Metric Predicted Variable on Two Groups Tim Frasier Copyright Tim Frasier This work is licensed under the Creative Commons Attribution 4.0 International license. Click here for more information. Goals

More information

Overlay Analysis II: Using Zonal and Extract Tools to Transfer Raster Values in ArcMap

Overlay Analysis II: Using Zonal and Extract Tools to Transfer Raster Values in ArcMap Overlay Analysis II: Using Zonal and Extract Tools to Transfer Raster Values in ArcMap Created by Patrick Florance and Jonathan Gale, Edited by Catherine Ressijac on March 26, 2018 If you have raster data

More information

Scottish Atlas of Variation

Scottish Atlas of Variation Scottish Atlas of Variation User Guide March 2019 Contents Introduction... 3 Accessing the Atlas of Variation... 3 Dashboard... 4 Introduction... 4 Drop-down menus... 5 Explore icon... 5 Information icons...

More information

GeoDa-GWR Results: GeoDa-GWR Output (portion only): Program began at 4/8/2016 4:40:38 PM

GeoDa-GWR Results: GeoDa-GWR Output (portion only): Program began at 4/8/2016 4:40:38 PM New Mexico Health Insurance Coverage, 2009-2013 Exploratory, Ordinary Least Squares, and Geographically Weighted Regression Using GeoDa-GWR, R, and QGIS Larry Spear 4/13/2016 (Draft) A dataset consisting

More information

Introduction GeoXp : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées.

Introduction GeoXp : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées. Presentation of Presentation of Use of Introduction : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées. Authors of : Christine Thomas-Agnan,

More information

Task 1: Open ArcMap and activate the Spatial Analyst extension.

Task 1: Open ArcMap and activate the Spatial Analyst extension. Exercise 10 Spatial Analyst The following steps describe the general process that you will follow to complete the exercise. Specific steps will be provided later in the step-by-step instructions component

More information

The GeoDa Book. Exploring Spatial Data. Luc Anselin

The GeoDa Book. Exploring Spatial Data. Luc Anselin The GeoDa Book Exploring Spatial Data Luc Anselin Copyright 2017 by GeoDa Press LLC All rights reserved. ISBN: 0-9863421-2-2 ISBN-13: 978-0-9863421-2-7 GeoDa Press LLC, Chicago, IL GeoDa TM is a trade

More information

Utilizing Data from American FactFinder with TIGER/Line Shapefiles in ArcGIS

Utilizing Data from American FactFinder with TIGER/Line Shapefiles in ArcGIS Utilizing Data from American FactFinder with TIGER/Line Shapefiles in ArcGIS Web Adams, GISP Data Dissemination Specialist U.S. Census Bureau New York Regional Office 1 What We Do Decennial Census Every

More information

1. Data Overview: North Direction. Total observations for each intersection: 96,432 Missing Percentages:

1. Data Overview: North Direction. Total observations for each intersection: 96,432 Missing Percentages: Preliminary Report LOOP IMPUTATION OF HOURLY DATA ON A SECTION OF THE I-5 (BETWEEN ROUTES 14 AND 99) VIA FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS JAN DE LEEUW, IRINA KUKUYEVA Abstract. We present the results

More information

A Street Named for a King

A Street Named for a King A Street Named for a King Dr. Jerry Mitchell University of South Carolina OVERVIEW This lesson adapts the work of Dr. Derek Alderman, a geographer who has used the Martin Luther King, Jr. street-naming

More information

Introduction to Python Practical 2

Introduction to Python Practical 2 Introduction to Python Practical 2 Daniel Carrera & Brian Thorsbro November 2017 1 Searching through data One of the most important skills in scientific computing is sorting through large datasets and

More information

Map your way to deeper insights

Map your way to deeper insights Map your way to deeper insights Target, forecast and plan by geographic region Highlights Apply your data to pre-installed map templates and customize to meet your needs. Select from included map files

More information

Export Basemap Imagery from GIS to CAD

Export Basemap Imagery from GIS to CAD Export Basemap Imagery from GIS to CAD This tutorial illustrates how to add high resolution imagery as a basemap into an existing CAD drawing using ArcGIS and AutoCAD. Through this method, the imagery

More information

Using R in 200D Luke Sonnet

Using R in 200D Luke Sonnet Using R in 200D Luke Sonnet Contents Working with data frames 1 Working with variables........................................... 1 Analyzing data............................................... 3 Random

More information

Interpreting Regression

Interpreting Regression Interpreting Regression January 10, 2018 Contents Standard error of the estimate Homoscedasticity Using R to make interpretations about regresssion Questions In the tutorial on prediction we used the regression

More information

Spatial Econometrics

Spatial Econometrics Spatial Econometrics Lecture 1: The notion of spatial modelling. Visualisation of spatial data in R (1) Spatial Econometrics 1 / 30 Contents 1 Why spatial modelling? Space vs ties between units Applications

More information

ALDEx: ANOVA-Like Differential Gene Expression Analysis of Single-Organism and Meta-RNA-Seq

ALDEx: ANOVA-Like Differential Gene Expression Analysis of Single-Organism and Meta-RNA-Seq ALDEx: ANOVA-Like Differential Gene Expression Analysis of Single-Organism and Meta-RNA-Seq Andrew Fernandes, Gregory Gloor, Jean Macklaim July 18, 212 1 Introduction This guide provides an overview of

More information

Generalised linear models. Response variable can take a number of different formats

Generalised linear models. Response variable can take a number of different formats Generalised linear models Response variable can take a number of different formats Structure Limitations of linear models and GLM theory GLM for count data GLM for presence \ absence data GLM for proportion

More information

Working with Digital Elevation Models and Spot Heights in ArcMap

Working with Digital Elevation Models and Spot Heights in ArcMap Working with Digital Elevation Models and Spot Heights in ArcMap 10.3.1 1 TABLE OF CONTENTS INTRODUCTION... 3 WORKING WITH SPOT HEIGHTS FROM NRVIS, CITY OF KITCHENER, AND CITY OF TORONTO...4 WORKING WITH

More information

Visualization of Origin- Destination Commuter Flow Using CTPP Data and ArcGIS

Visualization of Origin- Destination Commuter Flow Using CTPP Data and ArcGIS Visualization of Origin- Destination Commuter Flow Using CTPP Data and ArcGIS Research & Analysis Department Southern California Association of Governments 2015 ESRI User Conference l July 23, 2015 l San

More information

User Manual for the New Zealand Marine Environment Classification

User Manual for the New Zealand Marine Environment Classification User Manual for the New Zealand Marine Environment Classification User Manual for the New Zealand Marine Environment Classification Katie Dey Mark Weatherhead Prepared for Ministry for the Environment

More information

GIS Functions and Integration. Tyler Pauley Associate Consultant

GIS Functions and Integration. Tyler Pauley Associate Consultant GIS Functions and Integration Tyler Pauley Associate Consultant Contents GIS in AgileAssets products Displaying data within AMS Symbolizing the map display Display on Bing Maps Demo- Displaying a map in

More information

Within this document, the term NHDPlus is used when referring to NHDPlus Version 2.1 (unless otherwise noted).

Within this document, the term NHDPlus is used when referring to NHDPlus Version 2.1 (unless otherwise noted). Exercise 7 Watershed Delineation Using ArcGIS Spatial Analyst Last Updated 4/6/2017 Within this document, the term NHDPlus is used when referring to NHDPlus Version 2.1 (unless otherwise noted). There

More information

NAVIGATION. 2. Marginal Information

NAVIGATION. 2. Marginal Information NAVIGATION 1. The Map Definition and Handling A map is a graphic representation, usually on a plane surface and at an established scale, of natural or artificial features on the surface of a part or a

More information

Outline. ArcGIS? ArcMap? I Understanding ArcMap. ArcMap GIS & GWR GEOGRAPHICALLY WEIGHTED REGRESSION. (Brief) Overview of ArcMap

Outline. ArcGIS? ArcMap? I Understanding ArcMap. ArcMap GIS & GWR GEOGRAPHICALLY WEIGHTED REGRESSION. (Brief) Overview of ArcMap GEOGRAPHICALLY WEIGHTED REGRESSION Outline GWR 3.0 Software for GWR (Brief) Overview of ArcMap Displaying GWR results in ArcMap stewart.fotheringham@nuim.ie http://ncg.nuim.ie ncg.nuim.ie/gwr/ ArcGIS?

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

CentropeSTATISTICS Working Interactively with Cross-Border Statistic Data Clemens Beyer, Walter Pozarek, Manfred Schrenk

CentropeSTATISTICS Working Interactively with Cross-Border Statistic Data Clemens Beyer, Walter Pozarek, Manfred Schrenk Clemens Beyer, Walter Pozarek, Manfred Schrenk (Dipl.-Ing. Clemens Beyer, CEIT ALANOVA, Concorde Business Park 2/F, 2320 Schwechat, Austria, c.beyer@ceit.at) (Dipl.-Ing. Walter Pozarek, PGO Planungsgemeinschaft

More information

Exercise 3: GIS data on the World Wide Web

Exercise 3: GIS data on the World Wide Web Exercise 3: GIS data on the World Wide Web These web sites are a few examples of sites that are serving free GIS data. Many other sites exist. Search in Google or other search engine to find GIS data for

More information

GPR 2014_003_ReadMe.PDF

GPR 2014_003_ReadMe.PDF CGG Project #: 12084 Archive Date: 3-10-2015 This archive consists of 1 DVD-ROM GPR 2014_003_ReadMe.PDF SOUTHERN DISHNA RIVER, FOX HILLS, AND BEAVER CREEK SURVEY AREAS: Project report, interpretation maps,

More information

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2

Acknowledgments xiii Preface xv. GIS Tutorial 1 Introducing GIS and health applications 1. What is GIS? 2 Acknowledgments xiii Preface xv GIS Tutorial 1 Introducing GIS and health applications 1 What is GIS? 2 Spatial data 2 Digital map infrastructure 4 Unique capabilities of GIS 5 Installing ArcView and the

More information

7th FIG Regional Conference Spatial Data Serving People: Land Governance and the Environment - Building the Capacity

7th FIG Regional Conference Spatial Data Serving People: Land Governance and the Environment - Building the Capacity 7th FIG Regional Conference Spatial Data Serving People: Land Governance and the Environment - Building the Capacity Hanoi, Vietnam, 19-22 October 2009 ------------------ TS 3A - SDI in Support of Urban

More information

How to make R, PostGIS and QGis cooperate for statistical modelling duties: a case study on hedonic regressions

How to make R, PostGIS and QGis cooperate for statistical modelling duties: a case study on hedonic regressions 202 RESEARCH CONFERENCES How to make R, PostGIS and QGis cooperate for statistical modelling duties: a case study on hedonic regressions Author Olivier Bonin, Université Paris Est - IFSTTAR - LVMT, France

More information

Tutorial of some statistical methods using R language

Tutorial of some statistical methods using R language Tutorial of some statistical methods using R language Nobuhiko ENDO Japan Agency for Marine-Earth Science and Technology 1. Introduction Dr. Tomoshige Inoue (JAMSTEC) found that a linear relationship between

More information

Creating Watersheds from a DEM

Creating Watersheds from a DEM Creating Watersheds from a DEM These instructions enable you to create watersheds of specified area using a good quality Digital Elevation Model (DEM) in ArcGIS 8.1. The modeling is performed in ArcMap

More information

Session Objectives. Learn how to: Bring georeferenced aerial imagery into Civil 3D. Connect to and import GIS data using various tools and techniques.

Session Objectives. Learn how to: Bring georeferenced aerial imagery into Civil 3D. Connect to and import GIS data using various tools and techniques. GIS into Civil 3D Data 70th Annual Wisconsin Society of Land Surveyors' Institute January 24, 2019 1 Learn how to: Session Objectives Bring georeferenced aerial imagery into Civil 3D. Connect to and import

More information

Introduction to Statistics and R

Introduction to Statistics and R Introduction to Statistics and R Mayo-Illinois Computational Genomics Workshop (2018) Ruoqing Zhu, Ph.D. Department of Statistics, UIUC rqzhu@illinois.edu June 18, 2018 Abstract This document is a supplimentary

More information

Geographically Weighted Regression LECTURE 2 : Introduction to GWR II

Geographically Weighted Regression LECTURE 2 : Introduction to GWR II Geographically Weighted Regression LECTURE 2 : Introduction to GWR II Stewart.Fotheringham@nuim.ie http://ncg.nuim.ie/gwr A Simulation Experiment Y i = α i + β 1i X 1i + β 2i X 2i Data on X 1 and X 2 drawn

More information

Geodatabase An Overview

Geodatabase An Overview Federal GIS Conference February 9 10, 2015 Washington, DC Geodatabase An Overview Ralph Denkenberger - esri Session Path The Geodatabase - What is it? - Why use it? - What types are there? Inside the Geodatabase

More information