A Cloud Computing Workflow for Scalable Integration of Remote Sensing and Social Media Data in Urban Studies

Similar documents
Detecting Origin-Destination Mobility Flows From Geotagged Tweets in Greater Los Angeles Area

DM-Group Meeting. Subhodip Biswas 10/16/2014

Cyberinfrastructure and CyberGIS: Recent Advances and Key Themes

Spatial Data Science. Soumya K Ghosh

The Challenge of Geospatial Big Data Analysis

Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data

* Abstract. Keywords: Smart Card Data, Public Transportation, Land Use, Non-negative Matrix Factorization.

Oak Ridge Urban Dynamics Institute

Changes in the Spatial Distribution of Mobile Source Emissions due to the Interactions between Land-use and Regional Transportation Systems

GeoWorlds: Integrated Digital Libraries and Geographic Information Systems

Enriching the GIScience Research Agenda: Fusing Augmented Reality (AR) and Location Based Social Networks (LBSN)

Climate Risk Visualization for Adaptation Planning and Emergency Response

Basics of GIS reviewed

Exploring Human Mobility with Multi-Source Data at Extremely Large Metropolitan Scales. ACM MobiCom 2014, Maui, HI

Rainfall data analysis and storm prediction system

Harvard Center for Geographic Analysis Geospatial on the MOC

Demystifying ArcGIS Online. Karen Lizcano Esri

Exploring Urban Areas of Interest. Yingjie Hu and Sathya Prasad

MANAGING TRANSPORTATION & LAND USE INTERACTIONS (PL-58)

Urban Geo-Informatics John W Z Shi

Mobility Analytics through Social and Personal Data. Pierre Senellart

Exploring Urban Mobility from Taxi Trajectories: A Case Study of Nanjing, China

ArcGIS is Advancing. Both Contributing and Integrating many new Innovations. IoT. Smart Mapping. Smart Devices Advanced Analytics

Geographic Data Science - Lecture II

Open Data meets Big Data

SCAUG Community Maps Building a Living Atlas of the World

Spatial Information Retrieval

Clustering Analysis of London Police Foot Patrol Behaviour from Raw Trajectories

Exploring Urban Mobility from Taxi Trajectories: A Case Study of Nanjing, China

Mining Users Behaviors and Environments for Semantic Place Prediction

Assessing pervasive user-generated content to describe tourist dynamics

Discovering Urban Spatial-Temporal Structure from Human Activity Patterns

Visualizing Big Data on Maps: Emerging Tools and Techniques. Ilir Bejleri, Sanjay Ranka

Manual of Digital Earth

1. INTRODUCTION 2. BIG DATA IN URBAN STUDIES 3. LESSONS FROM BIG DATA

Modelling Spatial Behaviour in Music Festivals Using Mobile Generated Data and Machine Learning

Extracting Personal Behavioral Patterns from Geo-Referenced Tweets

Applying Hazard Maps to Urban Planning

The output from my MapReduce job is a text file with each line containing a set of coordinates and a floating point value in the following format:

Geographical Bias on Social Media and Geo-Local Contents System with Mobile Devices

Grand Challenges in GIScience: UCGIS experiences

The Livehoods Project: Utilizing social media to understand the dynamics of a city. Trung Phan

LUIZ FERNANDO F. G. DE ASSIS, TÉSSIO NOVACK, KARINE R. FERREIRA, LUBIA VINHAS AND ALEXANDER ZIPF

A Vision for ArcGIS Applying Geography Everywhere

Geographic Information Systems (GIS) in Environmental Studies ENVS Winter 2003 Session III

The History Behind Census Geography

STEREO ANALYST FOR ERDAS IMAGINE Stereo Feature Collection for the GIS Professional

IV Course Spring 14. Graduate Course. May 4th, Big Spatiotemporal Data Analytics & Visualization

Exploring the Geography of Communities in Social Networks

GIS-based Smart Campus System using 3D Modeling

Modelling exploration and preferential attachment properties in individual human trajectories

June 19 Huntsville, Alabama 1

Bus Landscapes: Analyzing Commuting Pattern using Bus Smart Card Data in Beijing

Outline. Geographic Information Analysis & Spatial Data. Spatial Analysis is a Key Term. Lecture #1

Spatial Data, Spatial Analysis and Spatial Data Science

Discovery and Access of Geospatial Resources using the Geoportal Extension. Marten Hogeweg Geoportal Extension Product Manager

Esri Maps for Office

Tools to Assess Local Health Needs. Richard Leadbeater, Esri NACo 2011 Healthy Counties Forum December 1, 2011

Geo-Enabling Digital India. 15 th Esri India User Conference GIS and Smart Cities

Esri s Living Atlas of the World Community Maps

TRAITS to put you on the map

Thesis: Citizen Science Land Cover Classification Based on Ground and Aerial Imagery B.Sc. in Geography The Pennsylvania State University

MACHINE LEARNING FOR MOBILITY STUDIES IN URBAN AND NATURAL AREAS TUULI TOIVONEN / DIGITAL GEOGRAPHY LAB

Intelligent GIS: Automatic generation of qualitative spatial information

Data Mining II Mobility Data Mining

REPORT ON INVESTMENTS

From Where Do Tweets Originate? - A GIS Approach for User Location Inference Qunying Huang

USING SOCIAL MEDIA INFORMATION IN TRANSPORT- AND URBAN PLANNING IN SOUTH AFRICA

Tutorial: Urban Trajectory Visualization. Case Studies. Ye Zhao

Models to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15

DANIEL WILSON AND BEN CONKLIN. Integrating AI with Foundation Intelligence for Actionable Intelligence

An Ontology-based Framework for Modeling Movement on a Smart Campus

Using Social Media for Geodemographic Applications

Editorial Introduction: Special Issue on Grids and Geospatial Information Systems

INDIANA ACADEMIC STANDARDS FOR SOCIAL STUDIES, WORLD GEOGRAPHY. PAGE(S) WHERE TAUGHT (If submission is not a book, cite appropriate location(s))

1. Introduction. Chaithanya, V.V. 1, Binoy, B.V. 2, Vinod, T.R. 2. Publication Date: 8 April DOI: /cloud.ijarsg.

Advances in Geographic Data Science and Urban Analytics

Exploring the Impact of Ambient Population Measures on Crime Hotspots

a system for input, storage, manipulation, and output of geographic information. GIS combines software with hardware,

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends

Profiling Burglary in London using Geodemographics. Chris Gale 1, Alex Singleton 2, Paul Longley 1

Spatial analysis in XML/GML/SVG based WebGIS

CS 350 A Computing Perspective on GIS

Texas A&M University

The Road to Data in Baltimore

Building a Vibrant and Enduring Spatial Science John P. Wilson IWGIS2014 Beijing, China

- Towards new concepts for handling geoinformation

LUU_CHECKER: A Tool for Dynamically Incorporating New Land Uses in SWAT

US National Spatial Data Infrastructure A Spatial Framework for Governance and Policy Development to Enable a Location-Based Digital Ecosystem

May 2011 Oracle Spatial User Conference

Welcome. C o n n e c t i n g

Mobility Patterns and User Dynamics in Racially Segregated Geographies of US Cities

PALS: Neighborhood Identification, City of Frederick, Maryland. David Boston Razia Choudhry Chris Davis Under the supervision of Chao Liu

Generalisation and Multiple Representation of Location-Based Social Media Data

GIS and the Built Environment

Lesson 16: Technology Trends and Research

Geospatial SDI Portal for effective Governance of Pune METROPOLIS region

An Internet-based Agricultural Land Use Trends Visualization System (AgLuT)

Modeling and Predicting of Future Urban Growth in the Charleston, South Carolina Area

AP Human Geography Free-response Questions

Transcription:

A Cloud Computing Workflow for Scalable Integration of Remote Sensing and Social Media Data in Urban Studies Aiman Soliman1, Kiumars Soltani1, Junjun Yin1, Balaji Subramaniam2, Pierre Riteau2, Kate Keahey2, Yan Liu1, Anand Padmanabhan1, Shaowen Wang1 1 National Center for Supercomputing Applications (NCSA) University of Illinois at Urbana-Champaign 2 Mathematics and Computer Science Division, Argonne National Laboratory University of Chicago December 18, 2015 CyberGIS@ncsa.illinois.edu

Outline Fusing Social and Sensor Data Fusion Conceptual Framework Scalable Spatial Synthesis Capabilities Demo : Urban Flow 2

Fusing Social and Sensor Data 3

Fusing Social and Sensor Data Location Based Social Networks (LBSN) is defined as ambient geographic information provided through different social channels (Twitter, Flicker, etc. ) Recent studies suggested the potential of fusing LBSN with physical sensor data LBSN provides direct observations about human presence over urban landscapes (social sensing). LBSN could complement sparse sensor data (e.g., disaster management). Challenges associated with LBSN data Data reliability Spatiotemporal properties of LBSN Lack of conceptual integration models Big data management and synthesis 4

Spatiotemporal Characteristics of LBSN LBSN data has peculiar spatiotemporal properties that need to be considered when fused with sensor data Spatial characteristics Geographically sparse Distinct sub-populations (e.g., clusters, sporadic, linear) Temporal characteristics Geo-located Twitter data spatial distribution in Detroit suburbs Users' engagement follows a bursty behavior Observations exhibit a fat-tail distribution. Fewer users engage the most (frequent user bias) Distribution of total number of tweets per user 5

Conceptual Framework 6

Urban Dynamics and Connectivity Mapping Urban Connectivity is important in urban planning, transportation and design of smart cities. Remote sensing has been used successfully to delineate urban cores (e.g., landcover, landuse new urban development and sprawl) Revealing the connectivity network between urban units can not be completed using sensor data and usually obtained from low latency surveys A cybergis based workflow was developed to study urban connectivity based on fusion of remotely sensed landuse maps and mobility patterns extracted from geo-located Twitter data 7

Conceptual Framework (I) Each tweet contains information about the user, the geographic location, time stamp and 140 characters text. Individual twitter user trajectories are dominated by frequently visited locations These clusters are explained by the high predictability of human movement (preferential return) Example of frequently visited location We examined the semantic composition of preferential return locations using remotely-sensed landuse maps CMAP landuse map CMAP 2014. Chicago Metropolitan Agency for Planning s 2010 Land Use Inventory for Northeastern Illinois. 8

Conceptual Framework (II) Semantics of frequent visited locations are dominated by one land use type Clusters' semantics are partially dependent on visitation rank LBSN has distinct temporal signatures Volume of tweets per hour Landuse purity of top visited locations Dominant land use composition for top 5 visited locations Soliman, A., Yin, J., Soltani, K., Padmanabhan, A., and Wang, S. 2015. Where Chicagoans tweet the most: Semantic analysis of preferential return locations of Twitter users. Proceedings of the First ACM SIGSPATIAL International Workshop on Smart Cities 9 and Urban Analytics (UrbanGIS'15)

Conceptual Framework (III) Our connectivity model is based on identifying shared frequent visitors between two polygons in terms of Strength : number of shared frequent visitors Purpose : semantic at origin and destination Demographic : dominant user language 10

Scalable Spatial Synthesis Capabilities 11

Architecture (Backend) 12

Distributed Point in Polygon Time-consuming to load the shapefile for each mapper (existing approach). Splits the large shapefile into smaller shapefiles. Recursive bi-section method. Mapper decides which small shapefile the point lie into based on the geographical bounds. Using R-tree. Reducer loads the small shapefile and finds the polygon the point lies into. Using quad-tree. To find the closest polygon to a point, we split the original shapefile into overlapping shapefiles. 13

Database Cluster Multiple instances of MongoDB. Shared (partitioned). Directly connected to Hadoop to avoid storing data in HDFS first. Connected to NodeJS query service. Gateway application contacts the database through a NodeJS query service. 14

Urban Flow Demo 15

Acknowledgements This research is based in part upon work supported by the U.S. National Science Foundation under grant numbers: 1047916, 1443080, and 1354329. Insightful comments were received from members of the CyberGIS Center for Advanced Digital and Spatial Studies. 16

Thank You 17