Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data

Similar documents
Collaborative Nowcasting for Contextual Recommendation

Where to Find My Next Passenger?

Exploiting Geographic Dependencies for Real Estate Appraisal

Learning to Recommend Point-of-Interest with the Weighted Bayesian Personalized Ranking Method in LBSNs

Friendship and Mobility: User Movement In Location-Based Social Networks. Eunjoon Cho* Seth A. Myers* Jure Leskovec

Human Mobility Pattern Prediction Algorithm using Mobile Device Location and Time Data

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends

Aggregated Temporal Tensor Factorization Model for Point-of-interest Recommendation

Location Regularization-Based POI Recommendation in Location-Based Social Networks

Collaborative Location Recommendation by Integrating Multi-dimensional Contextual Information

arxiv: v1 [cs.si] 23 Jan 2017

Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data

Mixed Membership Matrix Factorization

Exploring Human Mobility with Multi-Source Data at Extremely Large Metropolitan Scales. ACM MobiCom 2014, Maui, HI

Mixed Membership Matrix Factorization

Discovering Geographical Topics in Twitter

Understanding People Lifestyles: Construction of Urban Movement Knowledge Graph from GPS Trajectory

Large-Scale Social Network Data Mining with Multi-View Information. Hao Wang

Incorporating Spatio-Temporal Smoothness for Air Quality Inference

Collaborative topic models: motivations cont

Click Prediction and Preference Ranking of RSS Feeds

Mining Personal Context-Aware Preferences for Mobile Users

Service Recommendation for Mashup Composition with Implicit Correlation Regularization

A Novel Click Model and Its Applications to Online Advertising

Little Is Much: Bridging Cross-Platform Behaviors through Overlapped Crowds

Encapsulating Urban Traffic Rhythms into Road Networks

RaRE: Social Rank Regulated Large-scale Network Embedding

* Abstract. Keywords: Smart Card Data, Public Transportation, Land Use, Non-negative Matrix Factorization.

Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness

A Modified PMF Model Incorporating Implicit Item Associations

UAPD: Predicting Urban Anomalies from Spatial-Temporal Data

Revisitation in Urban Space vs. Online: A Comparison across POIs, Websites, and Smartphone Apps.

Circle-based Recommendation in Online Social Networks

Collaborative Recommendation with Multiclass Preference Context

Mining Triadic Closure Patterns in Social Networks

GeoMF: Joint Geographical Modeling and Matrix Factorization for Point-of-Interest Recommendation

Andriy Mnih and Ruslan Salakhutdinov

Collaborative Topic Modeling for Recommending Scientific Articles

Restricted Boltzmann Machines for Collaborative Filtering

Detecting Origin-Destination Mobility Flows From Geotagged Tweets in Greater Los Angeles Area

Building a Timeline Action Network for Evacuation in Disaster

Time-aware Point-of-interest Recommendation

Scaling Neighbourhood Methods

Clustering based tensor decomposition

Linear Dynamical Systems (Kalman filter)

Joint user knowledge and matrix factorization for recommender systems

Data Mining Techniques

Unsupervised Regressive Learning in High-dimensional Space

VISUAL EXPLORATION OF SPATIAL-TEMPORAL TRAFFIC CONGESTION PATTERNS USING FLOATING CAR DATA. Candra Kartika 2015

Mining Users Behaviors and Environments for Semantic Place Prediction

A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction

Urban Computing Using Big Data to Solve Urban Challenges

Recommendation Systems

GraphRNN: A Deep Generative Model for Graphs (24 Feb 2018)

A Spatial-Temporal Probabilistic Matrix Factorization Model for Point-of-Interest Recommendation

Inferring Friendship from Check-in Data of Location-Based Social Networks

Smart Route Recommendations based on Historical GPS Trajectories and Weather Information

K-Nearest Neighbor Temporal Aggregate Queries

Yahoo! Labs Nov. 1 st, Liangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University

Believe it Today or Tomorrow? Detecting Untrustworthy Information from Dynamic Multi-Source Data

Predictive Discrete Latent Factor Models for large incomplete dyadic data

Understanding Social Characteristic from Spatial Proximity in Mobile Social Network

Discovering Urban Spatial-Temporal Structure from Human Activity Patterns

Understanding Individual Daily Activity Space Based on Large Scale Mobile Phone Location Data

A Who, Where, When and What: a Non-parametric Bayesian Approach to Context-aware Recommendation and Search for Twitter Users

Urban Computing Using Big Data to Solve Urban Challenges

Modeling Temporal-Spatial Correlations for Crime Prediction

GLAD: Group Anomaly Detection in Social Media Analysis

A Contextual Collaborative Approach for App Usage Forecasting

Sparse and Robust Optimization and Applications

Exploring the Impact of Ambient Population Measures on Crime Hotspots

Detecting Anomaly in Traffic Flow from Road Similarity Analysis

Distributed ML for DOSNs: giving power back to users

Sensing Urban Density using Mobile Phone GPS Locations: A case study of Odaiba area, Japan

Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach

T-Finder: A Recommender System for Finding Passengers and Vacant Taxis

Nonnegative Matrix Factorization

Factor Modeling for Advertisement Targeting

NetBox: A Probabilistic Method for Analyzing Market Basket Data

Recommender Systems EE448, Big Data Mining, Lecture 10. Weinan Zhang Shanghai Jiao Tong University

Collaborative Filtering. Radek Pelánek

Crowd Sensing of Traffic Anomalies based on Human Mobility and Social Media

NOWADAYS, Collaborative Filtering (CF) [14] plays an

LEARNING SPARSE STRUCTURED ENSEMBLES WITH STOCASTIC GTADIENT MCMC SAMPLING AND NETWORK PRUNING

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

Sparse Gaussian Markov Random Field Mixtures for Anomaly Detection

A Dimensionality Reduction Framework for Detection of Multiscale Structure in Heterogeneous Networks

Relational Stacked Denoising Autoencoder for Tag Recommendation. Hao Wang

Binary Principal Component Analysis in the Netflix Collaborative Filtering Task

Modelling exploration and preferential attachment properties in individual human trajectories

Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation

Measuring Social Functions of City Regions from Large-scale Taxi Behaviors

Validating general human mobility patterns on massive GPS data

Supplementary Figures

Research Article Urban Mobility Dynamics Based on Flexible Discrete Region Partition

Statistical Pattern Recognition

Assessing spatial distribution and variability of destinations in inner-city Sydney from travel diary and smartphone location data

A Joint Modeling of User Check-in Behaviors for Real-time Point-of-Interest Recommendation

Liangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University Bethlehem, PA

Time-aware Collaborative Topic Regression: Towards Higher Relevance in Textual Item Recommendation

Transcription:

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data Yingzi Wang 12, Nicholas Jing Yuan 2, Defu Lian 3, Linli Xu 1 Xing Xie 2, Enhong Chen 1, Yong Rui 2 1 University of Science and Technology of China 2 Microsoft Research 3 University of Electronic Science and Technology of China

Regularity Conformity

Regularity 93%: the limit of predictability of human mobility(song et al. [1]) Major hubs: homes, workplaces Minor hubs: shopping malls, gyms, and restaurants [1]C. Song, Z. Qu, N. Blumm, and A.-L. Barabási. Limits of predictability in human mobility. Science, 327(5968):1018 1021, 2010.

Conformity

Related work Individual models methods Collaborative models target feature CI GPS Wifi SMP TC IT SR CF NextPlace(Scellato et al.) WhereNext(Monreale et al.) W 4 (Yuan et al.) methods target feature CI GPS Wifi SMP TC IT SR CF PSMM(Cho et al.) SHM(Gao et al.) gscorr(gao et al.) CI: check-in, SMP: spatial mobility pattern, TC: text content, IT: individual temporal patterns, SR: social relationship, CF: collaborative filtering, HT: heterogeneous mobility datasets Limits: 1. Failed to Incorporate both regularity and conformity of human mobility 2. Static, not time-aware 3. Homogeneous data CEPR(Lian et al.)

Problem definition A B C A B C

Problem definition B A C A C B A C B Regularity Conformity

Main idea overview Split days into T time slots T = t 1, t 2,, t T M users and N venues U = u 1, u 2,, u M V = v 1, v 2,, v N Preference matrix of U to V at time t : R(t) R M N R ij t = R ij r (t) + R ij c (t) t v j u i Regularity term conformity term

Conformity term (check-in data) Traditional collaborative model: Matrix Factorization R ij = U i V j T v j = u i K: length of latent factor Time-aware Matrix Factorization t R ij c t = U i + U i t V j T v j t = ( u i + ) Unchangeable interests Changeable preferences

Regularity term(heterogeneous data) Split the city into I grid cells: C = {d 1, d 2,, d I } v j belongs to a grid d kj u i travels from a grid d k to v j Pr(v j u i ) σi k=1 Pr(d k u i ) Pr(v j d k ) = σi k=1 Pr(d k u i ) Pr(d kj d k ) Pr(v j d kj ) (r) H ik Q jk R ij = T Hi Q j u i d k d k d kj u i v j d kj Pr(v j u i ) Pr(d k u i ) Pr(d kj d k ) Pr(v j d kj )

Gravity model O i dis i,j D j T i,j =c (O i ) a (D j ) b exp(r dis i,j ), B, A, C m 1 (O i ) a, O i : number of individuals leaving grid d i in data* m 2 (D j ) b, D j : number of people going toward d j in data* R 2 exp(r dis i,j ), dis i,j : distance between d i and d j B: bus data A: taxi data C: check-in data c,a,b,r: constants

Achieve the two-level sparsity Cluster grids into G group (r) R ij = T Hi Q j (r) R ij = g H i (g) Qj (g)t t number of grids t v j Q 1 T u i u i v j = Q 2 T H (1) H (2) H (G) Q G T

RCH model Sparse group lasso Objective function: Ρ(U, U t, V, H t, θ B, θ A ) = σ t T R t σ g G H g t σ P θ Q g T (U + U(T))V T F 2 + σ t T ( 1 α σ σ j=1 M σ g G H g j t +γ U 2 F + V 2 F + β σ t T U t F 2, where P = {A, B, C} and θ C = 1. + ασ σ j=1 M H j (t) ) 2 1 Offers groupwise sparsity Offers withingroup sparsity Optimization: alternative minimization

Heterogeneous mobility data Data Set Check-in(Sina Weibo) Bus Data Taxi Data City Beijing Beijing Beijing Scale of Data Period Content 12,133,504 checkins Mar. 2011 to Sep. 2013 user ID, check-in time, venue Id, venue s geocoordinates 3,000,000 bustrips Aug. 2012 to May 2013 card Id, alighting time, boarding and alighting stops 19,400,000 taxi transitions Mar. 2011 to Aug. 2011 times, geocoordinates of boarding and alighting

Experiments Baselines MF(Most Frequent Model) Calculate the frequencies of users check-ins PMM (Periodic Mobility Model) 2-dimentional (home, work) Time-independent spatial Gaussian Mixture W 3 (Who, When, Where) Probabilistic model CEPR Human mobility: regular and novel ones Divide the check-in data into two parts by time order: training part and testing part (7:3) Metrics: Acc@topP: prediction accuracy for prediction list with length P APR(average percentile rank): for the actually visited venues Parameters: Default values of {θ A, θ B, γ, β, α, σ} are {1, 1, 0.005, 0.005, 0.95, 10 5 }

Results RCH C shows improvements over all the 4 baselines; RCH BAC has better performance than RCH C ; Acc@topP Workdays have better prediction accuracies than holidays; The time slot 2 of workdays has the highest accuracy; Acc@topP for different type of days and time slots

Results Acc@top60 of visited and unvisited venues APR of our models in different time slots CEPR and RCH BAC : outperform PMM and W 3 apparently for unvisited venues benefit from collaborative filtering; CEPR and RCH BAC : accuracy of unvisited venues on holidays is higher than workdays; RCH ABC has highest APR in our models in different time slots;

Summary Integrate both the regularity and conformity of human mobility Provide a time-aware collaborative model Incorporate heterogeneous mobility data into prediction model Learn spatial influence and group structure based on gravity model and sparse group Lasso RCH model: significantly outperforms existing approaches Yingzi Wang