The role of topological outliers in the spatial analysis of georeferenced social media data

06 April 2017
The role of topological outliers in the spatial analysis of georeferenced social media data
René Westerholt, Heidelberg University
Seminar on Spatial urban analytics: big data, methodologies, and behavioural implications
Geography Colloquium, Harvard University

Companion paper
GIScience Research Group / Institute of Geography / Heidelberg University / René Westerholt

What is a topological outlier?
A spatial unit that interacts in an unusual way and causes topologically-induced variance (Tiefelsdorf 1999).
[Figure panels: Local areas; Counties]

Why topological outliers in social media data?
Users perceive space in different ways
  Environmental acoustics (Iosa et al. 2012)
  Age (Sugovic & Witt 2013)
  Emotional and bodily states (Zadra & Clore 2011)
  ...
Individual linguistic skills
Technical restrictions (e.g., 140 characters)
...
Leads to inevitable heterogeneity
Leads to erroneous connections within spatial analyses

Research questions
1. What effects do (strong) topological outliers have on the spatial analysis of social media feeds?
2. What is the role of scale?
Methods:
  Semivariogram
  Covariance
  Moran's I / Moran scatterplot
  Eigenvalue analysis of spatial weights
  ...

Data
Synthetic dataset
  2 different scales
  2 different Gaussians
  Partial overlap
Twitter dataset
  23 million tweets
  London, one year
  NLP (LDA)
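The synthetic dataset described on this slide could be approximated as follows. This is a minimal sketch, not the dataset used in the talk: the cluster centres, standard deviations, sample size, and the function name `make_synthetic_points` are all assumptions chosen only so that the two Gaussians differ in scale and partially overlap.

```python
import random

def make_synthetic_points(n_per_cluster=200, seed=42):
    """Draw 2-D points from two Gaussians with different spreads (scales)."""
    rng = random.Random(seed)
    # Hypothetical parameters: (centre_x, centre_y, standard deviation, label).
    # The centres are close relative to the larger spread, so the clouds overlap.
    clusters = [
        (0.0, 0.0, 1.0, "small-scale"),
        (2.0, 0.0, 5.0, "large-scale"),
    ]
    points = []
    for cx, cy, sd, label in clusters:
        for _ in range(n_per_cluster):
            points.append((rng.gauss(cx, sd), rng.gauss(cy, sd), label))
    return points

points = make_synthetic_points()
```

Where the two clouds overlap, points from different scales end up as close neighbours, which is the situation the later slides examine.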

Semivariogram from tweets
Unusual shape
Lack of spatial structure appears quickly
Semivariogram levels off to nonspatial variance
Clustering and repulsion at small scales
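For readers unfamiliar with the semivariogram, the sketch below computes the classical (Matheron) empirical estimator: half the mean squared attribute difference over all point pairs in each distance bin. It illustrates the general technique and is not the code behind the slide; the function name, the binning scheme, and the toy data are assumptions.

```python
import math

def empirical_semivariogram(coords, values, bin_width, n_bins):
    """Matheron estimator: half the mean squared difference per distance bin."""
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    n = len(coords)
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(coords[i], coords[j])
            b = int(d // bin_width)  # index of the distance bin for this pair
            if b < n_bins:
                sums[b] += (values[i] - values[j]) ** 2
                counts[b] += 1
    # None marks bins that contain no point pairs
    return [s / (2 * c) if c else None for s, c in zip(sums, counts)]

# Toy example: four points on a line with a linear trend in the attribute
coords = [(0, 0), (1, 0), (2, 0), (3, 0)]
values = [0, 1, 2, 3]
gamma = empirical_semivariogram(coords, values, bin_width=1, n_bins=4)
# gamma == [None, 0.5, 2.0, 4.5]
```

In the toy example the semivariance keeps rising with distance because of the trend. A curve that flattens within the first few bins, as reported for the tweets, suggests that spatial structure is lost at very short distances.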

Eigenvalue analysis

Moran scatterplot
Positive slope = positive spatial autocorrelation
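The link between slope and autocorrelation can be made concrete. With row-standardised weights, and assuming every unit has at least one neighbour, the least-squares slope of the spatial lag regressed on the mean-centred attribute equals Moran's I. The sketch below illustrates that relationship; the function name and the neighbour-list input format are assumptions for illustration.

```python
def moran_scatterplot_slope(values, neighbours):
    """Slope of the spatial lag regressed on the centred attribute.

    neighbours[i] lists the indices of the units adjacent to unit i
    (each unit needs at least one neighbour). Weights are
    row-standardised, so the lag is the mean of the neighbours' values.
    """
    n = len(values)
    mean = sum(values) / n
    z = [v - mean for v in values]
    lag = [sum(z[j] for j in neighbours[i]) / len(neighbours[i]) for i in range(n)]
    # Because z sums to zero, the OLS slope reduces to sum(z*lag) / sum(z*z)
    return sum(zi * li for zi, li in zip(z, lag)) / sum(zi * zi for zi in z)

# Four units in a chain: 0 - 1 - 2 - 3
chain = [[1], [0, 2], [1, 3], [2]]
smooth = moran_scatterplot_slope([1, 2, 3, 4], chain)         # positive slope
alternating = moran_scatterplot_slope([1, -1, 1, -1], chain)  # negative slope
```

Similar values next to each other give a positive slope, while a checkerboard-like alternation gives a negative one.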

Spatial artefact patterns
Small-scale interacting with large-scale
Positive slope = positive spatial autocorrelation
Large-scale interacting with small-scale

Spatial artefact patterns
Artefacts are a function of topological variability
Pattern is a function of attribute and spatial lag

Effect of differing scales

Conclusions
Topological outliers lead to unexpected analysis outcomes
Type of spatial weights is very important: contradictions are possible
Erroneous spatial interaction causes fake spatial processes
Scale differences have an impact on how fake processes operate
Spatial analytical approaches must account for topological outliers
Heterogeneity is also a chance: it allows a better understanding of how neighbourhoods are composed

Thank you! Questions?
René Westerholt
westerholt@uni-heidelberg.de

Further information:
Westerholt, R., Resch, B., & Zipf, A. (2015). A local scale-sensitive indicator of spatial autocorrelation for assessing high- and low-value clusters in multiscale datasets. International Journal of Geographical Information Science, 29(5), 868-887.
Westerholt, R., Steiger, E., Resch, B., & Zipf, A. (2016). Abundant Topological Outliers in Social Media Data and Their Effect on Spatial Analysis. PLOS ONE, 11(9), e0162360.
Steiger, E., Westerholt, R., & Zipf, A. (2016). Research on social media feeds: A GIScience perspective. European Handbook of Crowdsourced Geographic Information, 237-254.