Introduction to Spatial Big Data Analytics. Zhe Jiang Office: SEC 3435

Similar documents
Spatial Decision Tree: A Novel Approach to Land-Cover Classification

ENVIRONMENT AND NATURAL RESOURCES 3700 Introduction to Spatial Information for Environment and Natural Resources. (2 Credit Hours) Semester Syllabus

Spatial Data Science. Soumya K Ghosh

geographic patterns and processes are captured and represented using computer technologies

CAS GE 365 Introduction to Geographical Information Systems. The Applications of GIS are endless

Introduction to GIS (GEOG 401) Spring 2014, 3 credit hours

The Changing Landscape of Land Administration

SYLLABUS SEFS 540 / ESRM 490 B Optimization Techniques for Natural Resources Spring 2017

GIST 4302/5302: Spatial Analysis and Modeling

Esri and GIS Education

IV Course Spring 14. Graduate Course. May 4th, Big Spatiotemporal Data Analytics & Visualization

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Geography 1103: Spatial Thinking

Techniques for Science Teachers: Using GIS in Science Classrooms.

CE 200 Surveying Fall 2017

Texts 1. Required Worboys, M. and Duckham, M. (2004) GIS: A Computing Perspective. Other readings see course schedule.

GIST 4302/5302: Spatial Analysis and Modeling

Welcome to Physics 161 Elements of Physics Fall 2018, Sept 4. Wim Kloet

Geography (GEOG) Introduction to Geography Global Change and Natural Disasters and Environmental Change

CS 6375 Machine Learning

GIS FOR PLANNING. Course Overview. Schedule. Instructor. Prerequisites. Urban Planning 792 Thursday s 5:30-8:10pm SARUP 158

PHYS 480/580 Introduction to Plasma Physics Fall 2017

MATH 251 Ordinary and Partial Differential Equations Summer Semester 2017 Syllabus

(Spatial) Big Data Debates: Infrastructure, Analytics & Science

Machine Learning CSE546

Department of Civil and Environmental Engineering. CE Surveying

GIST 4302/5302: Spatial Analysis and Modeling


Geo Business Gis In The Digital Organization

Midterm: CS 6375 Spring 2015 Solutions

Your web browser (Safari 7) is out of date. For more security, comfort and. the best experience on this site: Update your browser Ignore

Welcome. C o n n e c t i n g

School of Public Service and Health EDMG240 Chemistry of Hazardous Materials 3 Credit Hours 8 weeks Prerequisite(s): None

Introduction. Elevation Data Strategy. Status and Next Steps

DANIEL WILSON AND BEN CONKLIN. Integrating AI with Foundation Intelligence for Actionable Intelligence

Statewide Topographic Mapping Program

Advancing Machine Learning and AI with Geography and GIS. Robert Kircher

XXIII CONGRESS OF ISPRS RESOLUTIONS

Machine Learning (CS 567) Lecture 2

Geo-Enabling Mountain Bike Trail Maintenance:

Health and Medical Geography (GEOG 222)

Introduction to Geographic Information Systems

Key Points Sharing fosters participation and collaboration Metadata has a big role in sharing Sharing is not always easy

Syllabus Reminders. Geographic Information Systems. Components of GIS. Lecture 1 Outline. Lecture 1 Introduction to Geographic Information Systems

1.1 Administrative Stuff

Office Hours: Dr. Kruse: Tue, 14:30-15:30 & Fri, 10:30-11:30 in ABB 142 (Chemistry Help Centre) TA's: tutorial time

AE 200 Engineering Analysis and Control of Aerospace Systems

MOLECULAR MODELING IN BIOLOGY (BIO 3356) SYLLABUS

8/28/2011. Contents. Lecture 1: Introduction to GIS. Dr. Bo Wu Learning Outcomes. Map A Geographic Language.

Part : General Situation of Surveying and Mapping. The Development of Surveying and Mapping in China. The contents

Spatial Data, Spatial Analysis and Spatial Data Science

CHEM 333 Spring 2016 Organic Chemistry I California State University Northridge

EOS-310 Severe & Unusual Weather Spring, 2009 Associate Prof. Zafer Boybeyi 1/20/09

Geography for the 2020 Round of Census

Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya

GEOG 410: SPATIAL ANALYSIS SPRING 2016

GEOMATICS. Shaping our world. A company of

No. of Days. Building 3D cities Using Esri City Engine ,859. Creating & Analyzing Surfaces Using ArcGIS Spatial Analyst 1 7 3,139

Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya

COURSE INTRODUCTION & COURSE OVERVIEW

University of Houston-Clear Lake PHYS Modern Physics (Summer 2015) Syllabus 3:00-5:50pm Bayou 3324

My Map Activity MINNESOTA SOCIAL STUDIES STANDARDS & BENCHMARKS

Dr. Stephen J. Walsh Department of Geography, UNC-CH Fall, 2007 Monday 3:30-6:00 pm Saunders Hall Room 220. Introduction

PHYS100 General Physics - Mechanics and Thermodynamics Fall

Integrating Globus into a Science Gateway for Cryo-EM

Lesson Plan 3 Google Earth Tutorial on Land Use for Middle and High School

Don t Trust Atoms, they Make Up Everything High School Chemistry

Compact guides GISCO. Geographic information system of the Commission

UPDATING THE MINNESOTA NATIONAL WETLAND INVENTORY

An Introduction to Using Geographic Information Systems (GIS) in the Classroom

Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya. Introduction GIS ( 2 weeks: 10 days)

PHYSICS 206, Spring 2019

Science : Introduction to Astronomy Course Syllabus

GIS certificate; 24th 28th September and 3rd 7th December University of Venda

CCHEMISTRY 366. Inorganic Chemistry with Emphasis on Bioinorganic, Medicinal & Materials Chemistry

Today s Menu. Administrativia Two Problems Cutting a Pizza Lighting Rooms

HGP 470 GIS and Advanced Cartography in Social Science

CURRICULUM DESIGN FOR A UNIVERSITY GRADUATE COURSE ON ADVANCED TOPICS IN GIS FOR NATURAL RESOURCES MANAGEMENT

Anoka Hennepin K 12 Curriculum Unit Plan

MATH-3150H-A: Partial Differential Equation 2018WI - Peterborough Campus

Course Staff. Textbook

Petrology Spring Please grab a syllabus. Introductions

COURSE OUTLINE GEOL204 MINING COMPUTING 3 CREDITS

Regional Centre for Mapping of Resources for Development (RCMRD), Nairobi, Kenya. Introduction GIS (2 weeks: 10 days)

SYLLABUS FORM WESTCHESTER COMMUNITY COLLEGE Valhalla, NY lo CURRENT DATE: Please indicate whether this is a NEW COURSE or a REVISION:

Physics 112 Spring 2014

MIDTERM: CS 6375 INSTRUCTOR: VIBHAV GOGATE October,

Innovation and modernization of national statistical systems through trusted data collaboratives

REVIEW MAPWORK EXAM QUESTIONS 31 JULY 2014

STEREO ANALYST FOR ERDAS IMAGINE Stereo Feature Collection for the GIS Professional

Integrating Official Statistics and Geospatial Information NBS Experience

GEOGRAPHIC INFORMATION SYSTEMS Session 8

Cartographic and Geospatial Futures

Disaster Management in Republic of Korea

Commercialisation. Lessons learned from Dutch weather market

GEOG 3340: Introduction to Human Geography Research

GEOG 508 GEOGRAPHIC INFORMATION SYSTEMS I KANSAS STATE UNIVERSITY DEPARTMENT OF GEOGRAPHY FALL SEMESTER, 2002

USING GIS CARTOGRAPHIC MODELING TO ANALYSIS SPATIAL DISTRIBUTION OF LANDSLIDE SENSITIVE AREAS IN YANGMINGSHAN NATIONAL PARK, TAIWAN

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

Transcription:

Introduction to Spatial Big Data Analytics Zhe Jiang zjiang@cs.ua.edu Office: SEC 3435 1

What is Big Data? Examples Internet data (images from the web) Earth observation data (nasa.gov) wikimedia.org www.me.mtu.edu www.slideshare.net vision.cloudera.com Manufacture sensor data Copyright of logos belong to the companies. ibmbigdatahub.com dataconomy.com Healthcare data 2

What is Big Data? Subjective, but some popular def. Volume, velocity, variety, value High dimensionality Data characteristics exceed capability of existing computational techniques! Illusion or reality? Business: mobile ads, personalization Manufacture: driverless car Health: diagnosis, personalized medicine Science: bioinformatics, GIScience Shortage of skilled people in big data! 3

Spatial Big Data (SBD) Geo-referenced big data Examples: GPS trajectories Check-in records Earth observation imagery Spatial events, e.g., crimes, accidents Climate model simulations Why spatial matters? Impact everyday life Computational challenges Beijing Taxi GPS trajectories Fbi.gov Microsoft.com Spatial events: highway serial killing Image from the web Mobile app data Nasa.gov Earth imagery Nooa.gov Climate model simulation 4

Spatial Big Data Analytics SBD analytics Process of identifying useful patterns or making predictions from SBD Types of patterns: Spatial prediction: earth image classification Colocation: crime and bars Descriptive v.s. Predictive? Hotspot: disease outbreak Spatial summarization, change, outlier, etc. Wetland mapping Spatial.cs.umn.edu Colocation patterns irregulartimes.com Crime hotspot 5 Trajectory summarization

Spatial Big Data Analytics Applications sustainableamerica.org Location based services gps4us.com arcgis.com Disaster response (before and after flood) Precision agriculture cdc.gov innovationtoronto.com Image from the web Disease outbreak detection 6

Spatial Big Data Analytics: Challenges Volume Velocity Variety Value Huge dataset sizes High update rate Heterogeneous data Special challenges for spatial big data: source and types Autocorrelation: interdependency across nearby locations Anisotropy: dependency varying across directions Heterogeneity: similar feature different classes Scales: multiple resolutions; local, regional, global scales Addressing societal issues 7

Spatial Big Data Analytics: Challenges (a) Aerial photo (NIR,G,B) in spring (b) Aerial photo (R,G,B) in summer (c) Ground truth wetland map (d) Decision tree prediction Wetland Dry land Need novel spatial models, e.g., spatial decision trees o Salt-and-pepper noise in decision tree prediction o Require labor intensive pre/post-processing o General issue for many models: random forest, SVM, neural network. 8

Spatial Data Scientist sparsity distribution high dimension patient symptoms doctor appropriate treatment A specialized doctor, e.g., Cardiologist Images in this slide are from the web noise data characteristics data scientist data mining methods Spatial data scientist? 9

Process of Spatial Big Data Analytics domain expert interpretation Input data Preprocessing, Exploratory data analysis Spatial Data Mining Output patterns Spatial statistics Computational infrastructure 10

Technical View of Spatial Big Data Spatial statistics Machine Learning Spatial Data Mining Pattern Recognition SBD Spatial database Geographic Information System Hadoop, Spark, GPU Data analytics (effectiveness) Question: what are the differences between statistics, machine learning and data mining? Domain Knowledge Understand the problem Interpret results E.g., biology, agriculture Data models and management, computational efficiency The main focus of this course is spatial data analytics! 11

Prerequisite Database Algorithm Basic statistic and probability Programming skills Additional data mining or machine learning background is preferred 12

Topics Covered in This Course Spatial data models Spatial statistics Spatial big data analytics tasks Spatial big data platforms Recent trends 13

Topics Covered in This Course Spatial data models Field model Google map Object model Mississippi River http://desktop.arcgis.com/ Spatial reference system Des Moines River Spatial operations 14

Topics Covered in This Course Spatial data models Spatial statistics ( source Visualeconomics ) Areal data model Geostatistics, e.g., Kriging (arcgis.com) dnainfo.com Spatial point process (shooting in Chicago) 15

Topics Covered in This Course Spatial data models Spatial statistics Spatial big data analytics tasks Spatial prediction Colocation Spatial outliers Spatial hotspot Spatial summarization Spatial change Spatial outliers Wetland mapping Colocation patterns Crime hotspot 16 Trajectory summarization

Topics Covered in This Course Spatial data models Spatial statistics Spatial big data analytics tasks Spatial big data platforms Recent trends 17

Topics Covered in This Course Spatial data models Spatial statistics Spatial big data analytics tasks Spatial big data platforms Recent trends 18

Course Website and Email Course website: http://zhejiang.cs.ua.edu/teaching/spring17/spatialbig Data/ Class schedule, notes, homework, announcement, etc. Course email: cs491591@gmail.com Homework submission, questions Please put CS491/591: Spatial Big Data in email title 19

Assignments Four question-and-answer homework One course project (see next slide) One news presentation 5 minutes at the beginning of each lecture Topic related to geospatial (big) data analytic Notes: All assignments are done in teams of two. Find a partner Due by the beginning of the first lecture in each week 20

Course Project Topic Real world spatial (big) data analytic problem Students are welcome to bring their own topics We provide some candidate topics Discuss with the instructor in office hours to decide topics Progress management Project assignments (deliverables) Decide project topic and datasets Literature survey (data preprocessing) Proposed approach (implementation) Experimental evaluation Final report (w/ source codes, dataset) Midterm project presentation Final project presentation Come to my office hours often for feedback and guidance! 21

Exams One close-book midterm exam (Possibly one close-book final exam) Some questions will be different between CS 491 and CS 591 22

Grading Policy News presentation: 5% Question and answer assignment: 30% Project assignments: 40% (or 35% if two exams) Exams: 25% (or 30% if two exams) 23

Homework 0 Due Next Week A background survey Not graded 24

Questions? Introduce yourself to other people, and team up! 25