Integrating ARCGIS with Datamining Software to Predict Habitat for Red Sea Urchins on the Coast of British Columbia.

Similar documents
Karsten Vennemann, Seattle. QGIS Workshop CUGOS Spring Fling 2015

A Geographic Information System Based Model John L Marshall Geography 562 GIS Coastal Resources - Final Project: Fall 2017 University of Washington

Welcome to NR502 GIS Applications in Natural Resources. You can take this course for 1 or 2 credits. There is also an option for 3 credits.

Among various open-source GIS programs, QGIS can be the best suitable option which can be used across partners for reasons outlined below.

NR402 GIS Applications in Natural Resources

INTRODUCTION TO GEOGRAPHIC INFORMATION SYSTEM By Reshma H. Patil

USING HYPERSPECTRAL IMAGERY

3/29/11. Why bother with tools? Incorporating Decision Support Tools into Climate Adaptation Planning. A Simplified Planning Process

4. GIS Implementation of the TxDOT Hydrology Extensions

A GIS-Based Approach to Quantifying Pesticide Use Site Proximity to Salmonid Habitat

Course overview. Grading and Evaluation. Final project. Where and When? Welcome to REM402 Applied Spatial Analysis in Natural Resources.

Combing Open-Source Programming Languages with GIS for Spatial Data Science. Maja Kalinic Master s Thesis

D2D SALES WITH SURVEY123, OP DASHBOARD, AND MICROSOFT SSAS

Orbital Insight Energy: Oil Storage v5.1 Methodologies & Data Documentation

Modeling habitat suitability for the European green crab using the ShoreZone mapping system

GEOGRAPHIC INFORMATION SYSTEMS (GIS) AT GERS LAB

Introduction to the 176A labs and ArcGIS Purpose of the labs

Data Structures & Database Queries in GIS

Mapping Undersea Feature Names in S-100. UFNPT at SCUFN 31 Wellington, New Zealand October, 2018

Recommendations for installation and general overview of available tools

The Problem. Which ecosystems are the most sensitive? Where will ecosystems be migrating? Where will investments be overwhelmed?

Fusion of Geodesy and GIS at NOAA s National Geodetic Survey

Introduction. --- Chapter One --- What Is GIS?

GIS-BASED DISASTER WARNING SYSTEM OF LOW TEMPERATURE AND SPARE SUNLIGHT IN GREENHOUSE

European Commission STUDY ON INTERIM EVALUATION OF EUROPEAN MARINE OBSERVATION AND DATA NETWORK. Executive Summary

Introduction to the 176A labs and ArcGIS

Administering your Enterprise Geodatabase using Python. Jill Penney

A BASE SYSTEM FOR MICRO TRAFFIC SIMULATION USING THE GEOGRAPHICAL INFORMATION DATABASE

Use of Elevation Data in NOAA Coastal Mapping Shoreline Products. Coastal GeoTools April 1, 2015

DATA SCIENCE SIMPLIFIED USING ARCGIS API FOR PYTHON

Intelligence and statistics for rapid and robust earthquake detection, association and location

GIS Boot Camp for Education June th, 2011 Day 1. Instructor: Sabah Jabbouri Phone: (253) x 4854 Office: TC 136

GIS Software. Evolution of GIS Software

Cartographic and Geospatial Futures

Environmental Systems Research Institute

Agile GIS : Building applicationspecific spatial analytic software from freely available software tools

THE DIGITAL TERRAIN MAP LIBRARY: AN EXPLORATIONIST S RESOURCE

The integration of land change modeling framework FUTURES into GRASS GIS 7

Aim and objectives Components of vulnerability National Coastal Vulnerability Assessment 2

Empowering Local Health through GIS

Chapter 1. GIS Fundamentals

ISSUES AND APPROACHES TO COUPLING GIS TO IRRIGATION DISTRIBUTION NETWORK AND SEEPAGE LOSS MODELS

Land-Use Land-Cover Change Detector

GENERALIZATION IN THE NEW GENERATION OF GIS. Dan Lee ESRI, Inc. 380 New York Street Redlands, CA USA Fax:

NEW TOOLS TO IMPROVE DESKTOP SURVEYS

Introduction to Portal for ArcGIS. Hao LEE November 12, 2015

Machine Learning. Boris

Lab 7: Cell, Neighborhood, and Zonal Statistics

Problems and Challenges

SLR Calculator: Sea Level Rise (SLR) Inundation Surface Calculator Add-in for ArcGIS Desktop & 10.4

Gridded Ambient Air Pollutant Concentrations for Southern California, User Notes authored by Beau MacDonald, 11/28/2017

Arcmap Manual Esri 10 READ ONLINE

Arcmap Manual Esri 10 READ ONLINE

Introduction to Portal for ArcGIS

Forecast Predictions of Winds, Waves and Storm Surge during Hurricane Arthur (2014)

Annual Update Student Learning Outcomes Packet

The Changing Landscape of Land Administration

MUDMAP TM. Software Description

Generating Scheduled Rasters using Python

Measuring earthquake-generated surface offsets from high-resolution digital topography

Hydraulic Processes Analysis System (HyPAS)

BACHELOR OF TECHNOLOGY DEGREE PROGRAM IN COMPUTER SCIENCE AND ENGINEERING B.TECH (COMPUTER SCIENCE AND ENGINEERING) Program,

ARMY ITAM GIS: Automating Standard Army Training Map Production

GIS Level 2. MIT GIS Services

Digital Tax Maps Westport Island Project Summary

Esri UC Talking Points. Harmful Algae Blooms (HABs) Rapid growth, blooming of toxin producing algae

Geodatabase Programming with Python

ESRI* Object Models; Data Capture

Robert D. Borchert GIS Technician

Introduction to ArcGIS 10.2

IndiFrag v2.1: An Object-based Fragmentation Analysis Software Tool

ESRI Object Models and Data Capture 2/1/2018

CE 365K Exercise 1: GIS Basemap for Design Project Spring 2014 Hydraulic Engineering Design

New version of the Spatial Data Modeler tool: ArcSDM 5. ArcSDM 5 Final Seminar May 4 th 2018, Rovaniemi

Online visualization of multi-dimensional spatiotemporal

Put your story on the map (literally) John Nerge, GISP GIS Coordinator

Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval. Sargur Srihari University at Buffalo The State University of New York

These modules are covered with a brief information and practical in ArcGIS Software and open source software also like QGIS, ILWIS.

Using web-based Java pplane applet to graph solutions of systems of differential equations

HASSET A probability event tree tool to evaluate future eruptive scenarios using Bayesian Inference. Presented as a plugin for QGIS.

HIRES 2017 Syllabus. Instructors:

GEOGRAPHY (GE) Courses of Instruction

Proposal to Include a Grid Referencing System in S-100

GIS Visualization: A Library s Pursuit Towards Creative and Innovative Research

Practical reliability approach to urban slope stability

Tips and Tricks for Using ArcGIS for Fire Pre-Incident Planning Version II By: Chris Rogers Firefighter Kirkland Fire Department Kirkland Washington

The Danish enterprise approach to create and publish nautical products.covering the waters along the west coast of Greenland

ESRI* Object Models; Data Capture

Automated Seabed Mapping and Data Delivery in the Cloud

Python Raster Analysis. Kevin M. Johnston Nawajish Noman

The Changing Face of Geospatial Technology

CREATING CUSTOMIZED DATE RANGE COLLECTIONS IN PRESENTATION STUDIO

Visualization of Commuter Flow Using CTPP Data and GIS

Transactions on Information and Communications Technologies vol 18, 1998 WIT Press, ISSN

Introduction to Google Mapping Tools

Manual for a computer class in ML

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

Create your own map for tidepooling, beach field trips, boating, camping, kayaking, fishing, and exploring the beaches in Southcentral Alaska.

OBEUS. (Object-Based Environment for Urban Simulation) Shareware Version. Itzhak Benenson 1,2, Slava Birfur 1, Vlad Kharbash 1

GIS PORTFOLIO MOHAMED MAGDY MOHAMED HUSSAIN GIS ENGINEER. UWF GIS ONLINE CERTIFICATE GIS Internship (GIS4944)

Transcription:

Integrating ARCGIS with Datamining Software to Predict Habitat for Red Sea Urchins on the Coast of British Columbia. Wayne Hajas Pacific Biological Station Nanaimo, BC 1

Allison Smeaton GIS-student intern Dan Leus Field Biologist Acknowledgements Pacific Underwater Harvesters Association Industry association for the red-urchin harvesters 2

Not a GIS person Myself I may let some jargon slip out I might need more explanation on GIS matters Might have different expectations about how things should work 3

Statement of Problem Which parts of the British Columbia coast are habitat for red sea urchins? 4

Problem in General Terms Can Geographic Datasets be used as predictive tools? On an opportunistic basis? Can ARCGIS be extended through the arcpy/python interface? 5

Outline Describe the problem Describe the data Mathematical method (very brief) Software GIS Operations Results and why I think they are valid Conclusions Questions 6

The Problem (Red Sea Urchins) 7

The Problem (Red Sea Urchins) 8

Data 9

Training Data Prediction Required Data Structure for Datamining Test Data (Predictive Variables) Objective Variable 8 FALSE blue habitat 10 FALSE red habitat 11 FALSE blue nonhabitat 6 FALSE red habitat 14 FALSE red? 14 FALSE blue? 8 TRUE red? 6 FALSE blue? 14 FALSE blue? 12 FALSE blue? 7 FALSE blue? 11 TRUE blue? 6 FALSE red 10?

Test (Predictive) Data Shorezone Its objective is to produce an integrated, searchable inventory of geomorphic and biological features which can be used as a tool for science, education, management, and environmental hazard mitigation. (Coastal and Ocean Resources Inc.) Winter 2010/2011 of ArcNews 11

GIS database Test (Predictive) Data Shorezone BC Coast is divided into ~90,000 polylines Vegetation, geology, wave exposure, ~50 fields Data collected from aerial surveys in the 1980 s. Probably marine charts also Access controlled by BC government. Parallel projects in Alaska and Washington State. 12

Known Habitat Harvest Events Scientific Surveys Expert Opinion Known Nonhabitat Expert Opinion Training Data Points, lines and polygons Collected and managed independently of test data (Shorezone) 13

Mathematical Methods 14

Datamining ( machine learning, artificial intelligence ) The actual data mining task is the automatic or semi-automatic analysis of large quantities of data to extract previously unknown interesting patterns such as groups of data records, unusual records and dependencies. (Wikipedia) 15

Datamining Used in marketing, text processing, scientific,. Many methods, many implementations Blackbox Large datasets, computationally expensive Assumption that training data represents test data 16

How Datamining is Applied Training Data Datamining Software Prediction Model Test Data

Software 18

Goals Scripts and not GUI Repeatability Record of what was done Creates need to integrate disparate software Work from managed datasets Don t duplicate effort Minimal number of intermediate datasets Less to manage 19

Software - Components Python 2.6 Custom Interface arcpy/ ARCGIS 10.0 Datamining software 20

arcpy ARCGIS becomes a callable software library Alternative to point-and-click Can also be used for automation arcpy.union_analysis(["well_buff50","stream_buff200"],"water_buffers") 21

Custom Interface to arcpy Found myself writing my own interface around arcpy(in python) Examples Garbage Collection Extracting data Creating and populating new fields Just to make things more pythonic. 22

Python Computer language like C, ruby, java or BASIC Many applications outside of GIS Rapid and Structured Development Open source Two roles: 1. Controller (which databases to use, etc) 2. Integration of arcpy and datamining-software 23

Datamining Software Some (most?) is open source For integration: compatibility with Python 2.6 on Windows Installation can be an issue. 24

Datamining Software Scripting integrated-software approach: Scikit-learn Easy-to-use and general-purpose machine learning in Python Point and click Weka Collection of machine learning algorithms for solving data mining problems 25

GIS Operations 26

Defining Training Data Which shorezone segments known to be habitat or nonhabitat? Known habitat/nonhabitat not expressed as shorezone segment. Need a set of rules work most of the time Can be automated 27

28

Defining Training Data (the rules) Must be within 150m of shore Closest shorezone segment. Ties broken by random selection Each known habitat/nonhabitat instance contributes at most one record to training data (Spatial join with closest option) 29

GIS Operations (defining training data) ~8 hours to assemble training data Did not delete the final set 30

GIS Operations (train and apply the model) SearchCursor and UpdateCursor to retrieve data and record results Predictions put into GIS database 31

Results 32

Presenting the Results 33

Checking the Results Training data is a useful benchmark Will model work beyond the training data? Overparameterizaton? Want to impose an independence between training data and validation process. 34

Checking the Results Cross validations Use 9/10 of training data at a time. Make predictions for other 1/10 Compare prediction to actual value Repeat ten times 35

Checking the Results (success) 36

Conclusions ARCGIS is extensible through the arcpy/python interface What else would be useful? Large amounts of GIS data can be used as predictive tools. Can be opportunistic! Other applications? 37

Conclusions (continued) The ARCGIS-python interface could be further developed. Might be some common need. Open source? 38

Questions and Comments 39

Checking the Results (failure) 40