Exploring Urban Areas of Interest. Yingjie Hu and Sathya Prasad

Similar documents
Museumpark Revisit: A Data Mining Approach in the Context of Hong Kong. Keywords: Museumpark; Museum Demand; Spill-over Effects; Data Mining

Generalisation and Multiple Representation of Location-Based Social Media Data

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

Crowd-sourced Cartography: Measuring Socio-cognitive Distance for Urban Areas based on Crowd s Movement

Extracting Touristic Information from Online Image Collections

Geographical Bias on Social Media and Geo-Local Contents System with Mobile Devices

DM-Group Meeting. Subhodip Biswas 10/16/2014

How a Media Organization Tackles the. Challenge Opportunity. Digital Gazetteer Workshop December 8, 2006

Comparing Flickr tags to a geomorphometric classification. Christian Gschwend and Ross S. Purves

GeoFaceExplorer: Exploring the Geo-Dependence of Facial Attributes

A Cloud Computing Workflow for Scalable Integration of Remote Sensing and Social Media Data in Urban Studies

Towards data driven decision-making using crowd-sourced geographic information. Linna Li University of Redlands December 2, 2015

arxiv: v2 [cs.si] 6 Feb 2015

arxiv: v1 [cs.cy] 12 Dec 2017

Reimaging GIS: Geographic Information Society. Clint Brown Linda Beale Mark Harrower Esri

Collections of Points of Interest: How to Name Them and Why it Matters

Detecting Origin-Destination Mobility Flows From Geotagged Tweets in Greater Los Angeles Area

Introduction to ArcGIS Maps for Office. Greg Ponto Scott Ball

Journal of e-media Studies Volume 3, Issue 1, 2013 Dartmouth College

Data Aggregation with InfraWorks and ArcGIS for Visualization, Analysis, and Planning

Challenges in Geocoding Socially-Generated Data

Place this Photo on a Map: A Study of Explicit Disclosure of Location Information

Crowdsourcing, Citizen Science & INSPIRE

Exploring Human Mobility with Multi-Source Data at Extremely Large Metropolitan Scales. ACM MobiCom 2014, Maui, HI

USDA CropScape Data Resources

Data Creation and Editing

Clustering Analysis of London Police Foot Patrol Behaviour from Raw Trajectories

Introduction to Google Mapping Tools

Clustering. CSL465/603 - Fall 2016 Narayanan C Krishnan

Open Data meets Big Data

ArcGIS Earth for Enterprises DARRON PUSTAM ARCGIS EARTH CHRIS ANDREWS 3D

Metropolitan Wi-Fi Research Network at the Los Angeles State Historic Park

Discovery and Access of Geospatial Resources using the Geoportal Extension. Marten Hogeweg Geoportal Extension Product Manager

Information Sharing and Taxonomies Practical Classification of Threat Indicators using MISP

The ESPON Programme and the use of spatial data on the European level

Exploring the Impact of Ambient Population Measures on Crime Hotspots

Using Social Media for Geodemographic Applications

Mapping Historical Information Using GIS

Mobility Patterns and User Dynamics in Racially Segregated Geographies of US Cities

Modeling face-to-face social interaction networks

Mobility Analytics through Social and Personal Data. Pierre Senellart

Modelling exploration and preferential attachment properties in individual human trajectories

Leveraging ArcGIS Online Elevation and Hydrology Services. Steve Kopp, Jian Lange

Introduction to ArcGIS Server Development

Visualizing The Semantic Similarity of Geographic Features

ArcGIS Enterprise: Out-of-the-Box Spatial Analysis. Vicki Cove Hilary Curtis

Quality of Information collected with the help of Map-Based Questionnaires

ArcGIS Data Reviewer: Assessing Positional Accuracy. Roslyn Dunn

Figure Figure

A Hierarchical, Multi-modal Approach for Placing Videos on the Map using Millions of Flickr Photographs

December 3, Dipartimento di Informatica, Università di Torino. Felicittà. Visualizing and Estimating Happiness in

Administering your Enterprise Geodatabase using Python. Jill Penney

A method of Area of Interest and Shooting Spot Detection using Geo-tagged Photographs

Sampling. Module II Chapter 3

SocViz: Visualization of Facebook Data

PALS: Neighborhood Identification, City of Frederick, Maryland. David Boston Razia Choudhry Chris Davis Under the supervision of Chao Liu

Assessing pervasive user-generated content to describe tourist dynamics

A Map Through Time Virtual Historic Cities

Steve Pietersen Office Telephone No

Non-parametric bootstrap and small area estimation to mitigate bias in crowdsourced data Simulation study and application to perceived safety

VISUAL EXPLORATION OF SPATIAL-TEMPORAL TRAFFIC CONGESTION PATTERNS USING FLOATING CAR DATA. Candra Kartika 2015

The Role of the Louisiana Geographic Information Center in the Response to Hurricane Katrina

Identification of disaster-affected areas using exploratory visual analysis of georeferenced Tweets: application to a flood event

ArcGIS 10.1 An Overview of the System

arxiv: v2 [cs.si] 13 Apr 2016

USING SOCIAL MEDIA INFORMATION IN TRANSPORT- AND URBAN PLANNING IN SOUTH AFRICA

Lecture 9: Geocoding & Network Analysis

Esri Production Mapping: An Introduction

The role of topological outliers in the spatial analysis of georeferenced social media data

Integrating Authoritative and Volunteered Geographic Information for spatial planning

Multiscale Spatio-Temporal Data Aggregation and Mapping for Urban Data Exploration

Urban land use information retrieval based on scene classification of Google Street View images

Infrastructure to Explore Geographic Systems through Models and Maps

Administering Your Enterprise Geodatabase using Python. Gerhard Trichtl

Slide 1 of 31 OPENGEOFICTION. Drawing a collaborative fictional world using the OSM software

A Novel Popular Tourist Attraction Discovering Approach Based on Geo-Tagged Social Media Big Data

Geographic Knowledge Discovery Using Ground-Level Images and Videos

The Livehoods Project: Utilizing social media to understand the dynamics of a city. Trung Phan

RESEARCH ARTICLE. A Data-Synthesis-Driven Method for Detecting and Extracting Vague Cognitive Regions

Citizen Science at the. U.S. Geological Survey

Application Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova

Prediction of Citations for Academic Papers

KEYWORDS: census maps, map scale, inset maps, feature density analysis, batch mapping. Introduction

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

The integration of land change modeling framework FUTURES into GRASS GIS 7

Geodatabase Programming with Python

Datahoods or Data-Driven Neighborhoods. Using GIS to better understand neighborhoods

GIS for ChEs Introduction to Geographic Information Systems

Smart Data Collection and Real-time Digital Cartography

City, University of London Institutional Repository

Using Open Data to Analyze Urban Mobility from Social Networks

Latent Geographic Feature Extraction from Social Media

GREAT BRITAIN: INDUSTRIAL REVOLUTION TO 1851 Student Worksheet

Modeling Controversy Within Populations

Georgia Kayser, PhD. Module 4 Approaches to Sampling. Hello and Welcome to Monitoring Evaluation and Learning: Approaches to Sampling.

The GapVis project and automated textual geoparsing

3D Urban Information Models in making a smart city the i-scope project case study

Multimedia analysis and retrieval

DISTRIBUTIONAL SEMANTICS

Arboretum Explorer: Using GIS to map the Arnold Arboretum

Transcription:

Exploring Urban Areas of Interest Yingjie Hu and Sathya Prasad

What is Urban Areas of Interest (AOIs)?

What is Urban Areas of Interest (AOIs)? Urban AOIs exist in people s minds and defined by people s behaviors Essentially fuzzy Different people may have different opinions Can we identify urban AOIs generally agreed by most people? How?

One possible data source: remote sensing data Unfortunately, remote sensing data don t record people s interests. A RS image of Shanghai A photo of Shanghai

Another possible data source: questionnaire survey Labor intensive and time consuming Please tell us the areas you consider interesting in the city, and draw them on the map.

Social media data Social media data provide records for people s interactions with the urban environment. They can be efficiently retrieved from public APIs. Many social media data contain location information. - Geotagged Tweets - Geotagged Flickr photos - Foursquare checkins -

Project Develop an automation workflow to extract AOIs from social media Show the evolution of AOIs in different cities in different years Understand AOIs Explore potential applications of AOIs

Why Flickr data? Reflect locations people consider interesting Cover a timespan of the past 10 years Publicly available through APIs Large number of users (around 100 million users)

Which parts of Flickr data are used in this project General metadata: locations, time, photo id, owner id, server id, Text tags: what are people talking about here? Photos: what are people looking at here?

Project Stage 1: data retrieval Cities: New York, London, Paris, Shanghai, Mumbai, Dubai Timespan: 2004-2014 Method: Flickr public API City # User # photo New York 2,751 2,761,542 London 2,357 2,876,013 Paris 3,019 1,456,298 Shanghai 1,775 254,123 Mumbai 1,901 55,532 Dubai 2,176 89,457

Project Stage 2: extracting AOIs from Flickr Data Goal: identifying AOIs (clusters) from photo locations (points) Data: Flickr metadata, including locations, time, user id, Challenges of Flickr data: - Biased: not representative of entire population; active users vs inactive users. - Noisy: errors exist in the user-specified locations. - Varied: different years and cities may have very different numbers of photos.

Project Stage 2: extracting AOIs from Flickr Data How do we handle these challenges: - Bias issue: reducing bias by removing additional photos taken by the same user within a radius of 200 meters. - Noise issue: choosing a clustering algorithm that is robust to data noise. - Variation issue: detecting AOIs based on the percentage of people.

Project Stage 2: extracting AOIs from Flickr Data Method: - DBSCAN (Density Based Spatial Clustering of Applications with Noise) - Advantages of DBSCAN to this problem - Doesn t require a pre-determined k - Clusters can be any arbitrary shape - Robust to noise K-means DBSCAN K-means DBSCAN

Project Stage 2: extracting AOIs from Flickr Data Two parameters of DBSCAN: - search radius: - Larger radius will produce larger clusters - We choose 200 meters for extracting neighborhood level AOIs - minimum number of points within the radius: - The minimum requirement for a cluster to be formed - Larger number will produce fewer clusters - We choose 2% of all Flickr users based on experiments - Two parameters together determine the meaning of AOI - In this project, AOIs are city regions that have been visited by at least 2% of all people who have taken photos in that city and in that year.

Project Stage 2: extracting AOIs from Flickr Data Applying DBSCAN to extract clusters Using Chi-shape algorithm to form concave hull User Percentage DBSCAN Chi-shape

Project Stage 2: extracting AOIs from Flickr Data Convex hull vs. Concave hull Convex hull Concave hull

Project Stage 3: understanding AOIs What are people talking about in these AOIs? Data: text-based tags attached to photos Challenge: some tags are common to many AOIs - E.g., Paris and France are very common to AOIs in the city of Paris Goal: highlight the tags that can characterize the local AOI, while reducing the common text descriptions. Method: term-frequency and inverse document frequency (TF-IDF)

Project Stage 3: understanding AOIs What are people talking about in these AOIs? Two examples produced by the algorithm: Eiffel tower area, Paris, 2014 Time Square area, New York, 2005

Project Stage 3: understanding AOIs What are people looking at in these AOIs? Data: Flickr photos Challenge: - Photos taken in an area have random qualities - A huge number of photos to process - 1,456,298 in Paris - 2,761,542 in New York -

Project Stage 3: understanding AOIs What are people looking at in these AOIs? Goal: automatically select the photos that can represent the preferable views from most people, while removing more personal photos. Method: - Human face detection - Image similarity comparison - Image clustering

Project Stage 3: understanding AOIs What are people looking at in these AOIs? An example of how the algorithms work Face detection Representative image Noisy image

Demo http://maps.esri.com/sp_demos/urbanaois

Project Summary Developed a reusable and automated software program that can be applied to point datasets in different domains The derived AOIs are objective and based on crowdsourcing data (for different cities in different years) Revealed the historical evolution patterns of AOIs over space and time The derived AOIs can be used in geodesign, location analytics, and other applications. The developed application is online, and can be accessed.