Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data

Similar documents
Exploring Human Mobility with Multi-Source Data at Extremely Large Metropolitan Scales. ACM MobiCom 2014, Maui, HI

* Abstract. Keywords: Smart Card Data, Public Transportation, Land Use, Non-negative Matrix Factorization.

Detecting Origin-Destination Mobility Flows From Geotagged Tweets in Greater Los Angeles Area

VISUAL EXPLORATION OF SPATIAL-TEMPORAL TRAFFIC CONGESTION PATTERNS USING FLOATING CAR DATA. Candra Kartika 2015

A route map to calibrate spatial interaction models from GPS movement data

Where to Find My Next Passenger?

A Probabilistic Embedding Clustering Method for Urban Structure Detection

Urban characteristics attributable to density-driven tie formation

Spatial Data Science. Soumya K Ghosh

Urban Computing Using Big Data to Solve Urban Challenges

Exploiting Geographic Dependencies for Real Estate Appraisal

City monitoring with travel demand momentum vector fields: theoretical and empirical findings

A framework for spatio-temporal clustering from mobile phone data

Urban Population Migration Pattern Mining Based on Taxi Trajectories

Mobility Analytics through Social and Personal Data. Pierre Senellart

IV Course Spring 14. Graduate Course. May 4th, Big Spatiotemporal Data Analytics & Visualization

Exploring spatial decay effect in mass media and social media: a case study of China

Discovering Urban Spatial-Temporal Structure from Human Activity Patterns

Modelling exploration and preferential attachment properties in individual human trajectories

GIScience & Mobility. Prof. Dr. Martin Raubal. Institute of Cartography and Geoinformation SAGEO 2013 Brest, France

IMPROVING INFRASTRUCTURE FOR TRANSPORTATION SYSTEMS USING CLUSTERING

Exploring Urban Mobility from Taxi Trajectories: A Case Study of Nanjing, China

Mobility Pattern of Taxi Passengers at Intra-Urban Scale: Empirical Study of Three Cities

Encapsulating Urban Traffic Rhythms into Road Networks

Exploring Urban Mobility from Taxi Trajectories: A Case Study of Nanjing, China

Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness

LABELING RESIDENTIAL COMMUNITY CHARACTERISTICS FROM COLLECTIVE ACTIVITY PATTERNS USING TAXI TRIP DATA

SPATIO-TEMPORAL PATTERN MINING ON TRAJECTORY DATA USING ARM

Construction and Analysis of Climate Networks

Validating general human mobility patterns on massive GPS data

Bus Landscapes: Analyzing Commuting Pattern using Bus Smart Card Data in Beijing

Minimum message length estimation of mixtures of multivariate Gaussian and von Mises-Fisher distributions

Inferring Friendship from Check-in Data of Location-Based Social Networks

Latent spatio-temporal activity structures: a new approach to inferring intra-urban functional regions via social media check-in data

A Cloud Computing Workflow for Scalable Integration of Remote Sensing and Social Media Data in Urban Studies

Analysis of the urban travel structure using smartcard and GPS data from Santiago, Chile

MobiSeg: Interactive Region Segmentation Using Heterogeneous Mobility Data

Sensing Urban Density using Mobile Phone GPS Locations: A case study of Odaiba area, Japan

Incorporating Spatio-Temporal Smoothness for Air Quality Inference

arxiv: v3 [physics.soc-ph] 2 Feb 2015

A Geo-Statistical Approach for Crime hot spot Prediction

Measurement of human activity using velocity GPS data obtained from mobile phones

Travel Pattern Recognition using Smart Card Data in Public Transit

A Location Predictor based on Dependencies Between Multiple Lifelog Data

Data Mining II Mobility Data Mining

Classification in Mobility Data Mining

Clustering Analysis of London Police Foot Patrol Behaviour from Raw Trajectories

Understanding taxi travel demand patterns through Floating Car Data Nuzzolo, A., Comi, A., Papa, E. and Polimeni, A.

Social and Technological Network Analysis: Spatial Networks, Mobility and Applications

Spatial Data, Spatial Analysis and Spatial Data Science

Diagnosing New York City s Noises with Ubiquitous Data

Research Article Urban Mobility Dynamics Based on Flexible Discrete Region Partition

Social and Technological Network Analysis. Lecture 11: Spa;al and Social Network Analysis. Dr. Cecilia Mascolo

A Computational Movement Analysis Framework For Exploring Anonymity In Human Mobility Trajectories

Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information

A Machine Learning Approach to Trip Purpose Imputation in GPS-Based Travel Surveys

Caesar s Taxi Prediction Services

Driving Restriction, Traffic Congestion, and Air Pollution: Evidence from Beijing

Friendship and Mobility: User Movement In Location-Based Social Networks. Eunjoon Cho* Seth A. Myers* Jure Leskovec

Urban Attractors: Discovering Patterns in Regions of Attraction in Cities. Abstract. Introduction

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data

Supplementary Figures

Time Series Data Cleaning

Modeling Crowd Flows Network to Infer Origins and Destinations of Crowds from Cellular Data

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida

Not All Apps Are Created Equal:

Perspectives on Stability and Mobility of Passenger s Travel Behavior through Smart Card Data

Extracting Patterns of Individual Movement Behaviour from a Massive Collection of Tracked Positions

Estimating Large Scale Population Movement ML Dublin Meetup

Spatial Extension of the Reality Mining Dataset

A Survey on Trajectory Data Mining: Techniques and Applications

Measuring Social Functions of City Regions from Large-scale Taxi Behaviors

Models to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15

Visualisation of Spatial Data

NEIGHBOURHOOD DETECTION WITH ANALYTICAL TOOLS

Texas A&M University

ECE521 Tutorial 11. Topic Review. ECE521 Winter Credits to Alireza Makhzani, Alex Schwing, Rich Zemel and TAs for slides. ECE521 Tutorial 11 / 4

Research Seminar on Urban Information Systems

Supporting Statistical Hypothesis Testing Over Graphs

Spatial-Temporal Analytics with Students Data to recommend optimum regions to stay

Probabilistic Graphical Models

Big Data Analysis to Measure Delays of Canadian Domestic and Cross-Border Truck Trips

Discovering Geographical Topics in Twitter

Brief Introduction of Machine Learning Techniques for Content Analysis

Using Open Data to Analyze Urban Mobility from Social Networks

Trip Distribution Modeling Milos N. Mladenovic Assistant Professor Department of Built Environment

Bayesian Hierarchical Models

1. INTRODUCTION 2. BIG DATA IN URBAN STUDIES 3. LESSONS FROM BIG DATA

Research on Patterns Mining Method for Moving Objects

Quantifying Demand Dynamics for Supporting Optimal Taxi Services Strategies

Urban Computing Using Big Data to Solve Urban Challenges

Smart Route Recommendations based on Historical GPS Trajectories and Weather Information

Probabilistic Graphical Models

Extracting mobility behavior from cell phone data DATA SIM Summer School 2013

An Introduction to Reversible Jump MCMC for Bayesian Networks, with Application

Believe it Today or Tomorrow? Detecting Untrustworthy Information from Dynamic Multi-Source Data

UnSAID: Uncertainty and Structure in the Access to Intensional Data

Correlations and Scaling Laws in Human Mobility

Exploring Human Mobility with Multi-Source Data at Extremely Large Metropolitan Scales

LAND USE CLASSIFICATION FROM COMBINED USE OF REMOTE SENSING AND SOCIAL SENSING DATA

Transcription:

Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data Jinzhong Wang April 13, 2016 The UBD Group Mobile and Social Computing Laboratory School of Software, Dalian University of Technology

Outline Part 1 Part 2 Part 3 Part 4 Part 5 Introduction Related Work Methodologies Experimental Results Conclusion 1/24

Part 1 Introduction In recent years, multi-source traffic data are more easily collected than before. Analyzing and mining the laws of human mobility hidden in traffic big data have been a hot research field [1], [2]. There exist a variety of traffic trajectory data, which contributes to exploring the patterns of human mobility more deeply [3]. [1] J. Yuan, Y. Zheng, and X. Xie, Discovering regions of different functions in a city using human mobility and POIs, SIGKDD on Knowledge discovery and data mining. ACM, 2012, pp. 186 194. [2] Y. Zheng, L. Liu, L. Wang, and X. Xie, Learning transportation mode from raw GPS data for geographic applications on the web, WWW. ACM, 2008, pp. 247 256. [3] A. Noulas, S. Scellato, R. Lambiotte, M. Pontil, and C. Mascolo, A tale of many cities: universal patterns in human urban mobility, PloS one, vol. 7, no. 5, p. 37027, 2012. 2/24

Part 1 Introduction To our knowledge, domestic and foreign researchers have proposed quite a few spatio-temporal patterns of human mobility. Power Law Exponential Law Log-Normal Law 3/24

Part 1 Introduction They only focus on a single dataset which seems not to analyze the laws of human mobility very well. The evaluation standards are different based on different studying data. Subway smart transit data with 14 subway lines and Qiangsheng taxi data in Shanghai with 13 thousand taxis. 3 metrics (trip displacement, trip duration and trip interval). Maximum likelihood estimation (MLE) and Bayesian information criterion (BIC). 4/24

Part 1 Introduction We are the first to propose the distribution of human travel taking the subway. We discover that the laws of trip displacement, trip duration and trip interval by subway and taxi. Trip Displacement Subway = Taxi Log-normal Trip Duration Subway!= Taxi Weibull Log-normal Trip Interval Subway!= Taxi Two-regime Weibull We observe quite a few interesting phenomena, which contribute to inferring human mobility patterns more deeply by subway and taxi. 5/24

Part 2 Related Work Processing Technology of Trajectory Data. Substring tree to detect frequent movement sequences Clustering algorithm to partition and group traces Hidden Markov model to provide personalized recommendation Principal component analysis to gain insight into the relation A multivariate regression model to predict human mobility Patterns of Human Mobility. Levy flight model Continuous-time random walk model Exponential law distribution Power law with exponential cut-off Power law distribution Radiation model Log-normal distribution Gamma distribution 6/24

Part 2 Related Work The results are mainly based on single source trajectory data. The patterns of human by subway are not investigated more deeply and thoroughly. The statistical results lack a universal scaling law and are not independent of their own characteristics. Therefore, our research is extremely indispensable and gains a valuable insight into human mobility patterns. 7/24

Part 3 Methodologies (MLE BIC) We firstly collect the traffic data from Shanghai SODA competition website. Then we leverage data manipulation language to clean the acquired datasets. According to graph theory, we construct Human Mobility Network (HMN). Especially, we introduce 3 evaluation metrics such as trip displacement, trip duration, and trip interval, and analyze two datasets quantitatively and comparatively. Through using MLE and BIC, we finally explore the patterns of human mobility on weekend and weekday. Fig. 1. Overview of exploring the patterns of human mobility. 8/24

Part 3 Methodologies D s1 Subway weekend D s2 Subway weekday D t1 Taxi weekend D t2 Taxi weekday In this paper, we use two datasets generated by subway smart card transactions (D s ) and Qiangsheng taxi GPS trajectory (D t ) from Shanghai in China. Both (D s ) and (D t ) contain a total of more than 451 million trading records by 14 subway lines and 34 billion GPS records by about 13695 taxis respectively. 9/24

Part 3 Methodologies Definition 1. A Human Mobility Network (HMN) G = (V,E) is a directed graph. V denotes the set of the origins or the destinations. For subway, V denotes the latitude and longitude coordinate of a subway station or picking-up and dropping-off location for taxi. E represents the set of edges which connect an origin location to a destination. Fig. 2. Human mobility network(hmn). 10/24

Part 3 Methodologies Definition 2. A Trip Displacement (d) is the spherical distance with a pair (v i,v i+1 ) in HMN. the spherical distance is the inferior arc length by using great-circle between v i and v i+1 on the surface of a sphere. r denotes the Earth radius with the value of 6370 km ϕ1 and ϕ2 represent the origin s longitude and latitude respectively ψ1 and ψ2 represent the destination s longitude and latitude separately. 11/24

Part 3 Methodologies Definition 3. A Trip Duration (t d ) is the elapsed time between an origin v i destination v j in HMN. It is defined as follows: and a where t j and t i denote the drop-off time and the pick-up time of a trip. Definition 4. A Trip Interval (t v ) is the elapsed time between two consecutive trips by the same person taking the subway or the same taxi. t j denotes the drop-off time of the last trip. In a word, the travel time consists of t d and t v alternatively on human mobility. 12/24

Part 3 Methodologies We introduce 4 widely used models to evaluate our 2 datasets, and unveil the best fitting distribution. The 4 distributions are showed in the following Equation4-7. Power Exponential Log-Normal Weibull 13/24

Part 3 Methodologies We use MLE to estimate the parameters in every distribution. Then we continue to calculate the value of BIC as follows: According to the BIC value of every distribution, we go a step further to obtain the Bayesian weights W i to determine which is the best law. 14/24

Part 4 Experimental Results (a) 7:00-9:00 on weekend (b) 11:00-13:00 on weekend (c) 17:00-19:00 on weekend (d) 7:00-9:00 on weekday (e) 11:00-13:00 on weekday (f) 17:00-19:00 on weekday 15/24

Part 4 Experimental Results (a) 7:00-9:00 on weekend (b) 11:00-13:00 on weekend (c) 17:00-19:00 on weekend (d) 7:00-9:00 on weekday (e) 11:00-13:00 on weekday (f) 17:00-19:00 on weekday 16/24

Part 4 Experimental Results Trip Displacement. 17/24

Part 4 Experimental Results Trip Displacement. 18/24

Part 4 Experimental Results Trip Duration. 26 min Subway weekend 28 min Subway weekday 8min Taxi weekend 8min Taxi weekday 19/24

Part 4 Experimental Results Trip Coefficient. 20/24

Part 4 Experimental Results Trip Interval. 21/24

Part 4 Experimental Results Trip Duration. 90% of trip interval by subway is within 580 min on weekend and 643 min on weekday respectively. In other words, folks go to another consecutive trip earlier about 1 hour on weekend than weekday, which implies that most individuals travel for social activity rather than working on weekend. 22/24

Part 5 Conclusion In summary, we find that human mobility patterns in Shanghai have its own characteristics differing from existing statistical results. Trip displacement is better fitted to the log-normal distribution rather than the exponential model. The Weibull distribution fits the elapsed time by subway, whereas the log-normal distribution fits the trip time by taxi. The patterns of trip interval by taxi follow the Weibull distribution, whereas there exist two-regime laws by subway which is the first part following the Weibull distribution and the tail part following the log-normal distribution. 23/24

THANK YOU FOR YOUR ATTENTION 24/24