A Spatial-Temporal Probabilistic Matrix Factorization Model for Point-of-Interest Recommendation
|
|
- Oliver Thomas
- 5 years ago
- Views:
Transcription
1 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see Abstract A Spatial-Temporal Probabilistic Matrix Factorization Model for Point-of-Interest Recommendation Huayu Li Richang Hong + Zhiang Wu Yong Ge With the rapid development of Location-based Social Network (LBSN) services, a large number of Point-of-Interests (POIs) have been available, which consequently raises a great demand of building personalized POI recommender systems. A personalized POI recommender system can significantly help users to find their preferred POIs and assist POI owners to attract more customers. However, due to the complexity of users checkin decision making process that is influenced by many different factors such as POI distance and region s prosperity, and the dynamics of user s preference, POI recommender systems usually suffer from many challenges. Although different latent factor based methods (e.g., probabilistic matrix factorization) have been proposed, most of them do not successfully incorporate both geographical influence and temporal effect together into latent factor models. To this end, in this paper, we propose a new Spatial-Temporal Probabilistic Matrix Factorization () model that models a user s preference for POI as the combination of his geographical preference and other general interest in POI. Furthermore, in addition to static general interest of user, we capture the temporal dynamics of user s interest as well by modeling checkin data in a unique way. To evaluate the proposed model, we conduct extensive experiments with many state-of-the-art baseline methods and evaluation metrics on two real-world data sets. The experimental results clearly demonstrate the effectiveness of our proposed model. Keywords: POI recommendation, Spatial-Temporal Probabilistic Matrix Factorization 1 Introduction Recent years have witnessed the rapid prevalence of location-based social network (LBSN) services such as Foursquare, Jiepang, and Facebook Places that can significantly facilitate users outdoor activities by providing a large number of nearby Point-of-Interests (POIs) in a real-time fashion. A variety of user interaction da- Computer Science Department, UNC Charlotte, {hli38,yong.ge}@uncc.edu. + Hefei University of Technology, hongrc@hfut.edu.cn. Nanjing University of Finance and Economics, zawu@seu.edu.cn. ta with these LBSN services such as searching locations, providing checkin information and posting tips after visiting a POI have been accumulated that provide a good opportunity for developing personalized POI recommender systems. Indeed, the accurate and personalized POI recommendation is a crucial demand in LBSN services. First, given the massive locations, it is very difficult for users to find their preferred ones in an efficient way. A personalized POI recommender system would help users easily find relevant POIs without spending much time on searching, particularly when a user is in a new region. Also, it is very challenging for POI owners to deliver right POIs to various users. A personalized POI recommender system is able to not only ease the burden, but also attract more customers with the recommended POIs. While developing personalized POI recommender systems could greatly benefit both users and POI owners, it is a very challenging problem due to two main factors. First, a user s checkin decision making process is very complex and could be influenced by many different kinds of factors. For example, the distance of a POI might have influence on user s preference for it. A user would like to prefer a nearby POI rather than a far away one. Meanwhile, a user may choose a POI which is located at a relatively prosperous region for the sake of high quality services and crowd human activities. In addition, whether a user would check in a POI or not may depend on his specific purpose. For instance, when people want to have lunch, they would like to choose those POIs relevant to food rather than sights. Some people may be used to going a certain region for the same purpose, such as food. These factors play an important role in user s preference for a POI. Second, a user s preference may change over time. For example, a user likely chooses a restaurant next to his work place for convenience in week days, but he possibly goes to a bar for entertainment at weekends. These factors make building POI recommender systems become difficult. In the literature, some related works on POI recommendations have been proposed [11, 6, 12, 24, 1, 27]. For example, [23] leveraged the power law property between the checkin probability and the distance of pair- 117
2 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see wise visited locations for POI recommendation. [15] proposed to model a user s preference for a location as a multiplication of his interest in POI, the popularity of POI, and the distance between him and POI based on Bayesian non-negative matrix factorization. Moreover, [6] assumed each individual user had different latent preference vector in each temporal state in order to capture the temporal effect. However, these models have not incorporated both geographical information and temporal effects together into latent factor models such as probabilistic matrix factorization. In addition, the geographical information used in these models is limited. In other words, only the distance of POI is leveraged to predict a user s preference for a POI. In fact, as mentioned earlier, there are many other geographical factors implicitly affecting a user s checkin decision making process. To address these limitations, in this paper, we propose a novel model that incorporates multiple geographical features and temporal characteristics into matrix factorization for POI recommendation. To this end, we first propose a new framework that incorporates multiple geographical features into matrix factorization. The preference of a user for a POI is modeled as the combination of his geographical preference for the POI and his general interest in the POI. User s geographical preference for a POI is modeled based on three types of geographical features including distance, prosperity and category. Unlike traditional matrix factorization methods that consider user s general interest as static, in this paper we also model the temporal dynamics of user s general interest. Specifically, we propose to represent each checkin event with a tuple including POI category and checkin time. We analogize a checkin event as a term and one user s historical checkins as a document, and then adapt topic modeling to model all observed checkins. Each user s historical checkins will be characterized with a set of topics, each of which is essentially a distribution over all tuples and represents a particular kind of checkin pattern. As checkin time is included in each tuple, the learned topic distribution for each user naturally captures the temporal dynamics of user s general interest. Consequently, we propose our Spatial-Temporal Probabilistic Matrix Factorization () model that takes into account all these factors. To evaluate the proposed model, we conduct extensive experiments with two real-world data sets and compare our method with many state-of-the-art baseline methods based on different validation metrics. The experimental results clearly demonstrate the effectiveness of our model. 2 Method Let us first introduce the formal definition of Geographical Feature and Checkin Event, and then present a general framework to incorporate geographical features into matrix factorization and learn user preference in a more robust way. Finally, we present the specific Spatial- Temporal Probabilistic Matrix Factorization () model and our estimation method. 2.1 Definitions. Recent works on POI recommendation have revealed that the distance of POI has great influence on user s preference for POIs [23, 2, 5, 26, 14, 15, 13]. In fact, there are other more geographical factors that also significantly affect user s checkin decision making process. For example, the business prosperity of a region where a POI is located might affect user s preference. Users are likely to choose a POI where its surrounding environment is relatively prosperous as it often indicates high-credibility and high-quality services and crowd human activities. In addition, users are often used to visiting a certain region with the same purposes such as shopping or eating. In fact, this has been reflected in many real-world checkin data, where each POI belongs to one or more categories such as food, sights and etc. For example, we pick up a user s historical checkins from our data and show them in Figure 1(b), where red marker represents category nightlife and black marker indicates category sights. Two regions are observed: one top red box area and one bottom red box area where there are about 2689 and 125 services in their neighbors respectively. We can see that: (1) He would like to go to the top region for visiting sights and prefer the bottom region for nightlife; (2) He likes those POIs in rather prosperous regions. Thus, a group of geographical features are defined as follows. Definition 1. (Geographical Features) Geographical features of a POI include the business prosperity of the region where the POI is located, the distance of the POI, and the number of user s historical POIs in this region associated with each category. Then geographical features of POI j for user i could be represented with a multi-dimensional vector: (2.1) f ij = (prosperity(j), d(i, j), n i1,..., n ic ) T, where prosperity(j) is the prosperity of region where POI j is located and could be measured by the number of business services in this region. d(i, j) represents the distance between the home of user i and POI j. n ic is the number of POIs within category c visited by user i in this region. Note that more other geographical information could be included into f ij if it is available for a particular data set. User s temporal preference for checkins follows a cyclic pattern [6, 7, 5]. Therefore, we divide the time into two parts: time slot in a day and day in a week. Specifically, time slot in a day includes morning, noon, 118
3 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see afternoon, evening and midnight. While traditional methods simply regard each checkin as a location, we represent a checkin event with category and temporal information as follows. Definition 2. (Checkin Event) A checkin event is a tuple consisting of the category of POI, the checkin day in a week, and the checkin time slot in a day. For example, an checkin event can be represented as Restaurant, Saturday, Evening, where Restaurant is the category. As can be seen, a checkin event essentially describes what a user is doing at a particular time. In above example, it indicates that the user is eating at a restaurant on Saturday evening. λv Vj j = 1...M fij Rij λ φt T lin Ui Wi gi λg β n = 1...Ni zin θi U s i i = 1...N (a) (b) Figure 1: (a)the graphical model of. (b) An example of a user s historical check-ins. Red box represents a region, and each color indicates a category. α λu λs λw The General Framework. We propose a general framework to generate accurate POI recommendation for N users over M locations. In reality, there are many latent factors affecting users preference for POIs. Particularly, different from consuming movies, when users visit or check in POIs, most of them would take into account the geographical factors [23] such as distance and region prosperity, in addition to other general factors such as intrinsic property of POIs. Thus, based on matrix factorization method, we propose to model the rating (i.e. checkin frequency) denoted as ˆR ij of user i for POI j as a combination of the general preference and the geographical preference of user i for POI j as: (2.2) ˆR ij = Ui T V j + g i Wi T f ij, where f ij represents the observed geographical features defined in section 2.1. W i is the i-th user s latent preference for geographical features and is assumed to be drawn from a Gaussian distribution with zeromean and covariance matrix as λ 1 w I G ) (where G is the dimension of geographical feature). g i is a weight indicating how much the i-th user s checkin decision is affected by geographical factors. The item Wi T f ij captures the geographical effect on check-in decision. U i and V j are the K-dimensional user and location specific latent vectors respectively, both of which are drawn from Gaussian distribution. Ui T V j are assumed to capture the i-th user s preference for the j-th POI s other intrinsic property, such as environment, price and service, that also affect the checkin decision. Furthermore, user s general preference may change over time. Thus we argue that user s preference characteristics may consist of two parts: stationary preference which has nothing to do with time, and temporal preference that changes over time. Specifically, the i-th user s general preference could be decomposed into stationary preference Ui s and temporal preference Ui b as follows: U i = Ui s + Ui b + K, where K is the K-dimensional Guassian noise. In other words, U i can be regarded to be drawn from the Gaussian distribution with Ui s +Ub i as mean and λ 1 u I K as covariance matrix as the following: (2.3) U i N (Ui s + Ui b, λ 1 u I K ). In this paper, we will adapt topic modeling technique to discover the temporal preference Ui b based on the defined checkin events, which will be discussed in details in the following section. In addition, as mentioned in [22, 6], the location characteristics captured with V j are inherent properties and do not change much as time goes. Therefore, we only model the dynamics for user preference. Ui s is assumed to be drawn from a Gaussian distribution while Ui b is a latent vector which could have various modeling methods according to application circumstance. We want to emphasize that the proposed model for incorporating geographical information is different from [13, 15, 23]. [15, 23] only model POI distance and [13] only considers an area s influence, all of which are not able to take other more geographical factors into consideration for users checkin decision making process. However, our proposed framework is much more general, and could incorporate different kinds of geographical features into matrix factorization. 2.3 The Model. Two recent works have been proposed to incorporate temporal effect into matrix factorization for improving the accuracy of recommendation [1, 6]. However, there are some shortcomings for these two methods. First, both of them utilize multiple user-specific latent vectors to model each individual s preference at different temporal state. Each temporal state usually corresponds to a time window. For example, [1] considers each day as a different temporal state. And as cyclic pattern is taken into account, [6] regards a pair of hour and day as a temporal state. Both of them have many different temporal states for each user. As a result, the number of observed ratings for fitting each user latent vector at each temporal state would significantly decrease, and learning so many userspecific latent vectors for each individual makes recom- 119
4 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see mender systems very inefficient. Second, both methods could not predict the rating of one user well when the corresponding temporal state does not appear in training set. But in fact, for the location checkin application, only a small set of ratings per user are observed that belong to a very limited number of temporal states. Thus these methods could not make reasonable predictions for users at new temporal states. All these limitations motivate us to find a more feasible and effective way to capture user s temporal preference. If we take a close look at user s checkin records, we can find that there are some underlying topics that could explain the reason why users check in POIs at particular times. For example, users are very likely to frequently check in some similar locations such as bars or restaurants around evening at weekend for a similar purpose such as party. In other words, there will be many checkin events that have similar categories and close times and often appear together in user s checkin records. And a set of such checkin events essentially describe a purpose of checkin behavior. As each checkin event has both category and time information, such a checkin purpose (e.g., party) provides a possible interpretation for the reason why users would like to check in these POIs at a particular time. And each user will have different preference for checkin purposes. Based on this assumption, for the user i, we analogize a checkin event l in as a term and his historical checkins {l in n = 1,..., N i } as a document (N i is the number of checkin events for user i), and then employ topic modeling to model all observed checkins. The learned topic will be a distribution over all checkin events. Each individual s checkins are a mixture over a set of interpretable topics. As a checkin event includes checkin time, each individual s topic distribution naturally captures his temporal preference. Consequently, we develop a specific method namely Spatial-Temporal Probabilistic Matrix Factorization () model. Its generative process and graphical model are shown in Table 1 and Figure 1(a) respectively. In the model, ϕ t is the distribution over checkin events for the t-th topic, and drawn from Dirichlet(β) prior. Each checkin event l in is drawn from ϕ corresponding to its hidden topic z in being drawn from the individual-specific mixture weights θ i over T topics. Thus, based on Eq.(2.3), we assume that the user s general preference consists of a static one and temporal one, and is assumed to be drawn from the Gaussian distribution with Ui s + Xθ i as the mean vector: (2.4) U i N (Ui s + Xθ i, λ u I K ), where X is a matrix that is used to transfer user topical space into the user latent space, and Xθ i indicate user s temporal preference. Also, we place the zero-mean Table 1: The generative process for model. 1. For global context, draw each entry in matrix X ij N (, λ 1 x ). 2. For each user i, a. Draw topic proportion θ i Dir(α). b. Draw user static preference U s i N (, λ 1 s I K). c. Draw user general preference U i N (U s i + Xθ i, λ 1 u I K). d. Draw user graphical preference W i N (, λ 1 w I G). e. Draw user graphical influence g i N (, λ 1 g ) f. For each check-in event l in, i. Draw topic z in Mult(θ i ). ii. Draw check-in event l in Mult(ϕ zin ). 3. For each topic t, draw ϕ t Dir(β). 4. For each location j, a. Draw location latent vector V j N (, λ 1 v I K). 5. For each user-location pair (i, j), a. Draw rating R ij N (U T i V j + g i W T i f ij, λ 1 ). spherical Gaussian prior on each entry of X. There are three advantages to adopt this method to capture users temporal preference: (1) The extracted topic provides a novel interpretation for a user s checkin event; (2) The process of extracting topics from large numbers of checkin events is a kind of dimensionality deduction. Thus, we do not need to model multiple user-specific preferences at different temporal states for each individual; (3) It is able to address the prediction problem mentioned before, because each user has a different topic distribution learned in training data and not explicitly related to each specific temporal state. Thus unlike [1, 6], even though one users some temporal states may be not observed in the training set, their learned topic distributions could be still used as their dynamic preference for the prediction. 2.4 Parameter Estimation. We employ the Maximum-A-Posteriori (MAP) for the parameter estimation of model. Given the observed rating R, geographical feature f and hyperparameters, maximizing the posterior is equivalent to maximizing the log-likelihood of latent parameters. Thus, the log-likelihood is obtained as follows: (2.5) N M L = λ I ij(r ij U T i V j g iw T i fij)2 i j N N i T N T + log ϕ t,lin θ it + (α t 1) log θ it i n i t T L + (β l 1) log ϕ tl. 2 F, t l s.t. k, < ϕ il, θ it 1, L l=1 ϕ il = 1, T θit = 1, where I ij is the indicator function that is equal to 1 if the i-th user checks in the j-th location and equal to otherwise.. 2 F is denoted as:. 2 F = λ u U U s Xθ 2 F + λ s U s 2 F + λ v V 2 F + λw W 2 F + λx X 2 F + λg g 2 F. As in Equation (2.5) the summation over t for item ϕ t,lin θ it is within the log function, it is intractable for us to estimate θ by maximizing the log-likelihood. To overcome this problem, We adopt the similar method proposed in [21] to obtain the relaxed lower-bound. Specifically, we first define q(z in = t) = ψ int, and then 12
5 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see apply Jensen s inequality. Finally, the object function with respect to θ has the following lower-bound: N N i T N N i T log ϕ t,lin θ it >= ψ int [log θ it ϕ t,lin log ψ int ], i n i n s.t. t, < ψ int 1, and T ψint = 1. Therefore, we can relax the object function by using Variational Inference method for latent variable Z. The lower-bound for the object function is obtained by integrating above equation back into Eq(2.5): (2.6) N M L = λ I ij (R ij U T i V j g i W T i f ij) 2 i j N N i T + { ψ int [log θ it ϕ t,lin log ψ int ] i n t T T L + (α t 1) log θ it } + (β l 1) log ϕ tl Θ 2 F, t t l s.t. k, < ϕ il, θ it, ψ int 1, L ϕ il = 1, T {θ it, ψ int } = 1. l=1 Alternating Least Squares (ALS) is a popular optimization method leading to more accurate parameter estimation and faster convergence. Thus, we utilize ALS method to compute each latent variable by fixing the other variables when maximizing the relaxed loglikelihood in Eq(2.6). The updating equation for each variable of interest shown in Table 2 is obtained by setting its gradient of the relaxed log-likelihood to zero. However, it is difficult to get the close-form of variable θ. Therefore, we utilize the gradient-based searching algorithm to assist with searching its optimal solutions. By fixing other variables, we could obtain the object function with respect to θ as follows: ˆθ it = argmin θ it {λ u (U i U s i Xθ i) T (U i U s i Xθ i) (2.7) N i ψint log θit (αt 1)logθit}, n=1 s.t. t, < θ it 1, and T θit = 1. To optimize θ, the gradient descent is employed to find the optimal solution with the following gradient: L(θ it ) (2.8) = 2λ u(xθ i + U s i θ Ui)(X)t it t Ni n=1 ψ int + α t 1 θ it, where (X) t represents the t-th column in matrix X. More details of the algorithm is shown in Table 2. In addition, C i is a diagonal matrix with λi ij (j = 1,..., M) as its diagonal elements, denoted as C i = (λi ij ) M j=1. Similarly, we denote C j = (λi ij ) N i=1, F i = (g i f ij ) G j=1, R i,\uv = (R ij U T i V j) M j=1, and R i,\w f = (R ij g i W T i V j) M j=1. 3 Experimental Results 3.1 Datasets. In this paper, we use two Foursquare data sets to evaluate our proposed model: (1) Data 1 [9] contains the checkin history of users who live in the California, ranging from December 29 to January 213; (2) Data 2 [4] contains the checkin data from September 21 to January 211. Each checkin record in both two data sets includes user ID, location ID and timestamp, where each location has latitude, longitude and category information. After Table 2: The Algorithm for Model. Step 1: Randomly initialize variables. Step 2: Execute E-Step and M-Step in each iteration repeatedly until the log-likelihood in Eq(2.6) converges: E-Step: V j = ( ju T + λ vi K ) 1 jr j,\w f U i = (V C i V T + λ u I K ) 1 [V C i R i,\w f + λ u (U s i + Xθ i)] W i = (F i C i F T i + λ w I G ) 1 F i C i R i,\uv U s i = λ u (Ui Xθi) λu+λs X = (λ uθθ T + λ xi T ) 1 λ u(u U s )θ T ψ int = θ it ϕ t,lin Normalize ψ ijt. ϕ tl = N,Ni i,n ψ int I(l in = l) + β l 1 Normalize ϕ tl. Update θ it: Use gradient descent with constraints to find the optimal solution of θ it with its object function shown in Eq(2.7) and its gradient as Eq(2.8), where the constraint is s.t. t, < θ it 1, and T θit = 1. M-Step: g i = λ M j=1 I ij (R ij U T i V j )W T i f ij λg +λ M j=1 I ij (W T i f ij )2 Step 3: After obtaining optimal Û, ˆV, Ŵ and ĝ, the predicted rating on i-th user for j-th location is calculated by: R ij = Û T ˆV i j + ĝ i Ŵ T i f ij. merging the overlapped categories, we totally obtain 6 (or 7) different kinds of categories in data 1 (or data 2). Meanwhile, we remove users who have visited less than 5 locations and locations which are visited by less than 2 users. The data statistics for two data sets are reported in Table 3. We can observe that Data 2 is nearly 1 times sparser than Data 1. Table 3: Statistics of Data Sets Data Set #User #Location #Checkin Sparsity Data 1 3,375 36, ,568.25% Data 2 74, ,161 3,51,68.24% In addition, we divide the space into a set of grids based on latitude and longitude. Each grid is regarded as a region. In our experiment, the size of region is about 5km 5km (i.e..45 by.45 square degrees). We then crawl all venues for each individual region with Foursquare API. As Foursquare API returns at most 5 venues (locations) with a specified center point and searching radius, we propose a simple method to crawl all venues in each region as follows. First, we get all venues in one specified region if Foursquare API returns less than 5 venues, and we end up searching. Otherwise we further divide this region into four small areas and repeat the above steps in each small area until each area contains less than 5 venues or its area size is less than 1m 1m. Second, we merge all crawled venues in the same region according to the venue ID. We use the number of venues in each region to indicate its prosperity degree.moreover, in the experiments we adopt the method [4] to locate user s home from his all historical checkins for calculating the POI distance. In recommendation system, we aim to recommend 121
6 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see those unvisited locations for users. Thus, we split the training and testing data as follows: for each user, (1) aggregating the checkins for each individual location; (2) sorting the location according to the first time that user checks in; (3) selecting the earliest 8% to train the model, and using the next 2% as testing. 3.2 Experimental Settings. Parameters λ and α are set as.1 and 1.1 respectively. Meanwhile, other parameters are specified as. 3.3 Evaluation Metrics. The model is quantitatively evaluated in terms of prediction accuracy, ranking accuracy and Top-K recommendation performance. Prediction Accuracy. RMSE [19] is exploited to directly measure the prediction accuracy for the unobserved ratings in the testing. Ranking Accuracy. Both Kendall Tau Coefficients (Tau) and Normalized Discounted Cumulative Gain (ndcg) are used to measure the overall ranking accuracy. Suppose each estate i is associated with a groundtruth rating x i and a predicted rating y i. There are n pairs. Each pair < i, j > is said to be concordant, if both x i > x j and y i > y j or if both x i < x j and y i < y j. Also, it is said to be discordant, if both x i > x j and y i < y j or if both x i < x j and y i > y j. Tau is then defined as (#conc #disc)/( 1 2n(n 1)), where #conc is the number of concordant pairs and #disc is the number of discordant pairs. DCG is defined as N i=1 2rel i 1 log 2 (1+i), where rel i is the testing rating. Given the ideal DCG (IDCG), i.e. DCG of the groundtruth ratings, ndcg is defined as DCG/IDCG. Top-K Recommendation. Precision@K and Recall@K would be used to evaluate the Top-K recommendation performance. Meanwhile we also adopt MAP metric, mean of average precision (AP). AP is N j=1 p(j) rel(j) computed as AP i = #relavant locations. Precision@K and Recall@K are calculated as P recision@k = L i L T K(L i ) L L i L R K(L i ) and Recall@K = i L T K(L i ) #relavant locations, respectively. Specifically, R K (L i ) are the top-k locations recommended to user i, T K (L i ) denotes all truly relevant locations among R K (L i ), L represents the set of locations in the testing, j is the position in the rank list, N is the number of returned items in the list, p(j) is the precision of a cut-off rank list from 1 to j, and rel(j) is an indicator function. The term relevant locations is defined as the locations in the testing. 3.4 Baseline Methods. To demonstrate the effectiveness of our model, we compare it with five baseline methods: (1) [6] that assumes users have different preference in each temporal state and use multiple user-specific latent vectors for different temporal states; (2) [23], that considers both geographical effect based on power law characteristics, and user interest based on user-based collaborative filtering for POI recommendation; (3) [19], that assumes the user and location latent vectors to be drawn from Gaussian distribution and estimates a user s preference on a location as the dot product of user-specific and location-specific latent vector; (4) [2], a Bayesian non-negative matrix factorization algorithm that places exponential priors on latent vectors and utilizes the Gibbs sampler to approximate the posteriors of latent variables; (5) F, user-based collaborative filtering with Pearson correlation as the similarity measurement. RMSE Kendall Tau ndcg (a) RMSE on Data 1 F (c) Kendall Tau on Data F RMSE Kendall Tau (b) RMSE on Data 2 F (d) Kendall Tau on Data (e) ndcg on Data 1 (f) ndcg on Data 2 Figure 2: Performance comparison in terms of prediction accuracy and ranking performance. 3.5 Performance Comparison. We will evaluate our proposed model with the baseline methods. Specifically, the topic number is set to be the same value as the latent feature number, and we will provide more discussions about it in Section Prediction Accuracy Performance. We examine the prediction accuracy performance for different models. Figure 2(a) and 2(b) show the comparison of RMSE for our proposed model verse baseline methods on two data sets. Since only outputs the proba- ndcg F 122
7 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see (a) Precison@5 on Data (b) Precison@1 on Data (c) Recall@5 on Data (d) Recall@1 on Data 1. (e) Precison@5 on Data 2. (f) Precison@1 on Data 2. (g) Recall@5 on Data 2. (h) Recall@1 on Data 2. Figure 3: Performance comparisons in term of precision and recall with top 5 and 1 on two data sets. bility for a unvisited location, we do not compare it in this part. The RMSE of F on Data 1 and Data 2 is.6443 and.6591, and the RMSE of on Data 2 is ranging from.5929 to.611. As they are far larger than other models, we do not plot them in the figure. The lower RMSE indicates the better prediction to groundtruth ratings. From the result, we can see that achieves the best performance on both of two data sets, while F performs the worst. The superior performance of is due to the usage of geographical information and the dynamic characteristics. As the observed data is very sparse, it makes the possibility of finding users who have similar POI preferences become much smaller, which as a result leads to a poor prediction performance on F. also could model the temporal effects for user preference, but when the data set becomes much sparser, the number of the observed ratings distributed to model each individual s different user-specific latent vector in each temporal s- tate would decrease significantly. Thus it becomes hard to fit the model very well. It exactly explains why performs better than on data 1 but slightly worse than on data 2, where data 2 is much sparser than data 1. Based on the result, we also can observe that performs not very well on RMSE metric probably. The reason is that is difficult to optimize the model s latent vectors on checkin data Ranking Accuracy Performance. Figure 2(c) (Figure 2(d)) and Figure 2(e) (Figure 2(f)) show the overall ranking performance of and baseline methods in terms of Tau and ndcg on data 1 (data 2). From the results, we can find that is much superior than the baseline methods on both two data sets in terms of two metrics. Since could accurately model one user s checkin decision making process, it obviously leads to a better result. By exploiting temporal effect to model user-specific preference, is better than, and F. The performance of and is similar, which shows the importance of geographical and temperate effects in location prediction. However, we also can see that and perform much worse than, because (1) is hard to fit each user-specific latent vector with such less observed ratings; (2) only utilizes the distance characteristic. The sparseness of data makes F achieve the poor performance, especially in data 2 where the s- parseness is more evident, F has a much worse result. fails to work well due to that more other explicit factors would affect user s checkin decision making process. Also, the sparseness causes both and to perform very worse Top-K Recommendation Performance. The performance of Precison@K, Recall@K and MAP for different models on two data sets are reported in Figure 3, 4(a) and 4(b). Totally, performs the best among all methods on these three metrics; while, and F are nearly the worst. It occurs likely because they do not model the geographical and temporal effects. is superior than specifically on Data 2 due to (1) Most of users would visit a nearby location, which could be modeled with distance property. When recommending top K locations among all locations, would have more accurate results. (2) When data becomes much sparser, each user s user-specific la- 123
8 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see (a) (b) (c) (d) Figure 4: (a) (b) Performance comparison in terms of MAP on Data 1 and Data 2. (c) Performance comparison with different g i value. (d) Relationship between user latent feature and topic number. tent vector for each different state in would be fitted by much less observed ratings. It also illustrates the geographical effects play an more important role in POI recommendation. appropriately leverages multiple geographical features to learn user s interests, as a result it improves the recommendation accuracy. It can be observed that the performance on data 1 is worse than the one on data 2 (which is much sparser) possibly due to that a much smaller set of the observed data is employed to train the model in Data Geographical Influence. The numeric value g i indicates how much the check-in decision of user i at a POI is affected by the geographical influence. In our model, g i could be any manually specified value or automatically learned by optimization algorithm (denoted as Geo-Learned). Therefore we construct the following experiments to discuss the geographical influence on location prediction. We fix g i with value,.5 and 1, denoted as Geo-, Geo-.5 and Geo- 1, respectively. The result based on Tau in Data 1 is shown in Figure 4(c). Based on the results, we could obtain the following observations. First, our proposed method always outperforms. In particular, as the value of g i increases, the improvement becomes more obvious. When only models the dynamic of user s general preference (i.e. g i is ), it outperforms. It directly indicates that modeling one user s general temporal preference by leveraging his topic distribution is superior than by using multiple user-specific latent vectors on different temporal states. Second, with the increase of the value of g i, performs much better. It illustrates that user pays more attention on geographical information when he chooses a POI. Third, the performance of the learned g i is slightly worse than that of g i = 1, but better than other values. It happens due to that it is difficult to find a very accurate optimal solution for g i with the limited observed ratings. However, automatically learning geographical influence is meaningful as the geographical influence on the checkin decision for each Kendall Tau Geo Learned Geo 1 Geo.5 Geo Kendall Tau individual user is very different User Latent- and Topical- Space Relationship. In the model, we utilize the learned topic distribution to capture the dynamics of user s general preference. It meaningful to examine the relationship between user latent space and user topical space, which could be reflected from the relationship between user latent feature number and topic number. Therefore, we show the ranking accuracy performance in terms of Tau on Data 1 for with different topic numbers in Figure 4(d) and the latent feature dimension is set to 3. The Tau performance of is also reported as comparison. Based on the results, we can see that no matter what the topic number is, always performs better than. It indicates that incorporating geographical influence and modeling the dynamic of user s general preference would benefit the POI recommendations. Moreover, we could observe that when the topic number is 3, which has the same value as the specified feature number, the performance is nearly superior than others. It illustrates that the topical space dimension is supposed to be consistent with the user latent feature dimension. If the topical dimension is larger than the user latent feature dimension, it becomes more difficult to align the topical space with user latent feature space. As a result, the performance is not good as the one with matches between two spaces. 4 Related Work Related works can be grouped into two categories. The first category is about Matrix Factorization [18, 2]. For example, [19] assumed that users and items were modeled by latent factors with Gaussian prior, and the observed ratings were drawn from a Gaussian distribution with the mean as the dot product of latent user and location factors. [2] regarded both user and item latent vectors are non-negative with exponential prior and proposed to exploit Gibbs sampler to approximate the posteriors of latent factors. The second category is about POI Recommendation. The related works about POI recommenda- 124
9 Downloaded 9/13/17 to Redistribution subject to SIAM license or copyright; see tion mainly consist of two sub-categories. The first subcategory throws light on modeling geographical influence. For example, [23] proposed to leverage a power law distribution to estimate the check-in probability with the distance of any pair of visited POIs, due to the spatial clustering phenomenon exhibited in LBSNs. However, to model the geographical distance, [5, 2, 26] further adopted the Gaussian based model and [26, 14] utilized the Kernel density estimation method. In addition, [15] proposed to exploit the framework of. Based on the weighted matrix matrix factorization, [13] modeled the POI s geographical influence with user s activity area and POI s influence area, and [16] considered two types of geographical characteristics: geographical neighborhood and region effect. The second sub-category focuses on modeling temporal effect [3, 17, 22, 8]. For example, [6] distinguished user s latent factors in different temporal state and then took several strategies to aggregate user check-in preference in each temporal state. [7, 5] employed multicenter Gaussian mixture model to capture a user s temporal preference. Moreover, [25] adopted the summation to fuse user s interest for POI and graphical influence, where the user s interest for POI could be learned by a User-based Collaborative Filtering extension with temporal state. However, our work is different from these existing works: (1) The geographical features taken into account for POI recommendation could be extended to include multiple different factors. (2) It is novel to use topic modeling similar method to capture the temporal effect in Point-of-Interest recommendation. 5 Conclusions In this paper, we proposed a novel model for POI recommendation that incorporates both geographical influence and temporal effect into matrix factorization. We modeled user s preference on a POI as the summation over his geographical preference for the POI and his general interest in the POI. User s geographical preference is captured by leveraging multiple geographical features. Meanwhile user s general preference is decomposed into a static one and a dynamic one. Specifically, we analogized a checkin event as a term and one user s checkins as a document, and employed topic modeling to model all observed checkins. As the checkin event includes the checkin time and each topic is characterized by a distribution over checkin events, the learned topic distribution naturally captures the dynamics of user s general interest. To this end, a specific model namely was proposed. Finally, the experimental results on real-world data sets demonstrate the improvements of our proposed model. 6 Acknowledgements This research was supported in part by NIH (1R21AA ), NSFC ( , , ), and National Center for International Joint Research on E-Business Information Processing (213B135). References [1] J. Bao, Y. Zheng, and M. F. Mokbel. Location-based and preference-aware recommendation using sparse geo-social networking data. In SIGSPATIAL, pages , 212. [2] C. Cheng, H. Yang, I. King, and M. R. Lyu. Fused matrix factorization with geographical and social influence in locationbased social networks. In AAAI, 212. [3] C. Cheng, H. Yang, M. R. Lyu, and I. King. Where you like to go next: Successive point-of-interest recommendation. In IJCAI, pages , 213. [4] Z. Cheng, J. Caverlee, K. Lee, and D. Z. Sui. Exploring millions of footprints in location sharing services. In ICWSM, 211. [5] E. Cho, S. A. Myers, and J. Leskovec. Friendship and mobility: User movement in location-based social networks. In SIGKDD, pages , 211. [6] H. Gao, J. Tang, X. Hu, and H. Liu. Exploring temporal effects for location recommendation on location-based social networks. In RecSys, 213. [7] H. Gao, J. Tang, X. Hu, and H. Liu. Modeling temporal effects of human mobile behavior on location-based social networks. In CIKM, 213. [8] S. Hanhuai and B. Arindam. Generalized probabilistic matrix factorizations for collaborative filtering. In ICDM, 21. [9] G. Huiji and L. Huan. Location-based social network data repository, 214. [1] Y. Koren. Collaborative filtering with temporal dynamics. In KDD, pages , 29. [11] H. Li, R. Hong, S. Zhu, and Y. Ge. Point-of-interest recommender systems: A separate-space perspective. In ICDM, pages , 215. [12] X. Li, G. Cong, X.-L. Li, T.-A. N. Pham, and S. Krishnaswamy. Rank-geofm: A ranking based geographical factorization method for point of interest recommendation. In SIGIR, 215. [13] D. Lian, C. Zhao, X. Xie, G. Sun, E. Chen, and Y. Rui. Geomf: Joint geographical modeling and matrix factorization for pointof-interest recommendation. In KDD, pages , 214. [14] M. Lichman and P. Smyth. Modeling human location data with mixture of kernel densities. In KDD, 214. [15] B. Liu, Y. Fu, Z. Yao, and H. Xiong. Learning geographical preferences for point-of-interest recommendation. In KDD, 213. [16] Y. Liu, W. Wei, A. Sun, and C. Miao. Exploiting geographical neighborhood characteristics for location recommendation. In CIKM, pages , 214. [17] QuanYuan, G. Cong, and A. Sun. Graph-based point-of-interest recommendation with geographical and temporal influences. In CIKM, 214. [18] S. Ruslan and M. Andriy. Bayesian probabilistic matrix factorization using markov chain monte carlo. In ICML, 28. [19] R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In NIPS, pages , 27. [2] M. N. Schmidt, O. Winther, and L. K. Hansen. Bayesian nonnegative matrix factorization. In ICASS, 29. [21] C. Wang and D. M. Blei. Collaborative topic modeling for recommending scientific articles. In KDD, 211. [22] M. Ye, K. Janowicz, C. Mlligann, and W.-C. Lee. What you are is when you are: The temporal dimension of feature types in location-based social networks. In GIS, pages , 211. [23] M. Ye, P. Yin, W.-C. Lee, and D.-L. Lee. Exploiting geographical influence for collaborative point-of-interest recommendation. In SIGIR, 211. [24] H. Yin, Y. Sun, B. Cui, Z. Hu, and L. Chen. Lcars: A locationcontent-aware recommender system. In KDD, 213. [25] Q. Yuan, G. Cong, Z. Ma, A. Sun, and N. Magnenat-Thalmann. Time-aware point-of-interest recommendation. In SIGIR, 213. [26] J. Zhang and C. Chow. igslr: Personalized geo-social location recommendation - a kernel density estimation approach. In SIGSPATIAL, pages , 213. [27] Y. Zheng and X. Xie. Learning travel recommendations from user-generated gps traces. In TIST,
Aggregated Temporal Tensor Factorization Model for Point-of-interest Recommendation
Aggregated Temporal Tensor Factorization Model for Point-of-interest Recommendation Shenglin Zhao 1,2B, Michael R. Lyu 1,2, and Irwin King 1,2 1 Shenzhen Key Laboratory of Rich Media Big Data Analytics
More informationPoint-of-Interest Recommendations: Learning Potential Check-ins from Friends
Point-of-Interest Recommendations: Learning Potential Check-ins from Friends Huayu Li, Yong Ge +, Richang Hong, Hengshu Zhu University of North Carolina at Charlotte + University of Arizona Hefei University
More informationExploiting Geographic Dependencies for Real Estate Appraisal
Exploiting Geographic Dependencies for Real Estate Appraisal Yanjie Fu Joint work with Hui Xiong, Yu Zheng, Yong Ge, Zhihua Zhou, Zijun Yao Rutgers, the State University of New Jersey Microsoft Research
More informationScaling Neighbourhood Methods
Quick Recap Scaling Neighbourhood Methods Collaborative Filtering m = #items n = #users Complexity : m * m * n Comparative Scale of Signals ~50 M users ~25 M items Explicit Ratings ~ O(1M) (1 per billion)
More informationLearning to Recommend Point-of-Interest with the Weighted Bayesian Personalized Ranking Method in LBSNs
information Article Learning to Recommend Point-of-Interest with the Weighted Bayesian Personalized Ranking Method in LBSNs Lei Guo 1, *, Haoran Jiang 2, Xinhua Wang 3 and Fangai Liu 3 1 School of Management
More informationLocation Regularization-Based POI Recommendation in Location-Based Social Networks
information Article Location Regularization-Based POI Recommendation in Location-Based Social Networks Lei Guo 1,2, * ID, Haoran Jiang 3 and Xinhua Wang 4 1 Postdoctoral Research Station of Management
More informationRegularity and Conformity: Location Prediction Using Heterogeneous Mobility Data
Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data Yingzi Wang 12, Nicholas Jing Yuan 2, Defu Lian 3, Linli Xu 1 Xing Xie 2, Enhong Chen 1, Yong Rui 2 1 University of Science
More informationCollaborative Location Recommendation by Integrating Multi-dimensional Contextual Information
1 Collaborative Location Recommendation by Integrating Multi-dimensional Contextual Information LINA YAO, University of New South Wales QUAN Z. SHENG, Macquarie University XIANZHI WANG, Singapore Management
More informationTime-aware Point-of-interest Recommendation
Time-aware Point-of-interest Recommendation Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, and Nadia Magnenat-Thalmann School of Computer Engineering Nanyang Technological University Presented by ShenglinZHAO
More informationClick Prediction and Preference Ranking of RSS Feeds
Click Prediction and Preference Ranking of RSS Feeds 1 Introduction December 11, 2009 Steven Wu RSS (Really Simple Syndication) is a family of data formats used to publish frequently updated works. RSS
More informationAsymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie, Shenglin Zhao, Zibin Zheng, Jieming Zhu and Michael R. Lyu School of Computer Science and Technology, Southwest
More informationDownloaded 09/30/17 to Redistribution subject to SIAM license or copyright; see
Latent Factor Transition for Dynamic Collaborative Filtering Downloaded 9/3/17 to 4.94.6.67. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php Chenyi Zhang
More informationA Survey of Point-of-Interest Recommendation in Location-Based Social Networks
Trajectory-Based Behavior Analytics: Papers from the 2015 AAAI Workshop A Survey of Point-of-Interest Recommendation in Location-Based Social Networks Yonghong Yu Xingguo Chen Tongda College School of
More informationAndriy Mnih and Ruslan Salakhutdinov
MATRIX FACTORIZATION METHODS FOR COLLABORATIVE FILTERING Andriy Mnih and Ruslan Salakhutdinov University of Toronto, Machine Learning Group 1 What is collaborative filtering? The goal of collaborative
More informationLiangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University Bethlehem, PA
Rutgers, The State University of New Jersey Nov. 12, 2012 Liangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University Bethlehem, PA Motivation Modeling Social Streams Future
More informationRestricted Boltzmann Machines for Collaborative Filtering
Restricted Boltzmann Machines for Collaborative Filtering Authors: Ruslan Salakhutdinov Andriy Mnih Geoffrey Hinton Benjamin Schwehn Presentation by: Ioan Stanculescu 1 Overview The Netflix prize problem
More informationFriendship and Mobility: User Movement In Location-Based Social Networks. Eunjoon Cho* Seth A. Myers* Jure Leskovec
Friendship and Mobility: User Movement In Location-Based Social Networks Eunjoon Cho* Seth A. Myers* Jure Leskovec Outline Introduction Related Work Data Observations from Data Model of Human Mobility
More informationPredicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence AAAI-16 Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts Qiang Liu, Shu Wu, Liang Wang, Tieniu
More informationGraphical Models for Collaborative Filtering
Graphical Models for Collaborative Filtering Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Sequence modeling HMM, Kalman Filter, etc.: Similarity: the same graphical model topology,
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate
More informationMixed Membership Matrix Factorization
Mixed Membership Matrix Factorization Lester Mackey University of California, Berkeley Collaborators: David Weiss, University of Pennsylvania Michael I. Jordan, University of California, Berkeley 2011
More information* Matrix Factorization and Recommendation Systems
Matrix Factorization and Recommendation Systems Originally presented at HLF Workshop on Matrix Factorization with Loren Anderson (University of Minnesota Twin Cities) on 25 th September, 2017 15 th March,
More informationMixed Membership Matrix Factorization
Mixed Membership Matrix Factorization Lester Mackey 1 David Weiss 2 Michael I. Jordan 1 1 University of California, Berkeley 2 University of Pennsylvania International Conference on Machine Learning, 2010
More informationInferring Friendship from Check-in Data of Location-Based Social Networks
Inferring Friendship from Check-in Data of Location-Based Social Networks Ran Cheng, Jun Pang, Yang Zhang Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg
More informationCircle-based Recommendation in Online Social Networks
Circle-based Recommendation in Online Social Networks Xiwang Yang, Harald Steck*, and Yong Liu Polytechnic Institute of NYU * Bell Labs/Netflix 1 Outline q Background & Motivation q Circle-based RS Trust
More informationLarge-scale Ordinal Collaborative Filtering
Large-scale Ordinal Collaborative Filtering Ulrich Paquet, Blaise Thomson, and Ole Winther Microsoft Research Cambridge, University of Cambridge, Technical University of Denmark ulripa@microsoft.com,brmt2@cam.ac.uk,owi@imm.dtu.dk
More informationRecommendation Systems
Recommendation Systems Pawan Goyal CSE, IITKGP October 21, 2014 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 21, 2014 1 / 52 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation
More informationCollaborative topic models: motivations cont
Collaborative topic models: motivations cont Two topics: machine learning social network analysis Two people: " boy Two articles: article A! girl article B Preferences: The boy likes A and B --- no problem.
More informationYahoo! Labs Nov. 1 st, Liangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University
Yahoo! Labs Nov. 1 st, 2012 Liangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University Motivation Modeling Social Streams Future work Motivation Modeling Social Streams
More informationDiscovering Geographical Topics in Twitter
Discovering Geographical Topics in Twitter Liangjie Hong, Lehigh University Amr Ahmed, Yahoo! Research Alexander J. Smola, Yahoo! Research Siva Gurumurthy, Twitter Kostas Tsioutsiouliklis, Twitter Overview
More informationCollaborative Recommendation with Multiclass Preference Context
Collaborative Recommendation with Multiclass Preference Context Weike Pan and Zhong Ming {panweike,mingz}@szu.edu.cn College of Computer Science and Software Engineering Shenzhen University Pan and Ming
More informationMining Personal Context-Aware Preferences for Mobile Users
2012 IEEE 12th International Conference on Data Mining Mining Personal Context-Aware Preferences for Mobile Users Hengshu Zhu Enhong Chen Kuifei Yu Huanhuan Cao Hui Xiong Jilei Tian University of Science
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationIntroduction to Machine Learning
Introduction to Machine Learning Brown University CSCI 1950-F, Spring 2012 Prof. Erik Sudderth Lecture 25: Markov Chain Monte Carlo (MCMC) Course Review and Advanced Topics Many figures courtesy Kevin
More informationScalable Bayesian Matrix Factorization
Scalable Bayesian Matrix Factorization Avijit Saha,1, Rishabh Misra,2, and Balaraman Ravindran 1 1 Department of CSE, Indian Institute of Technology Madras, India {avijit, ravi}@cse.iitm.ac.in 2 Department
More informationHBGG: a Hierarchical Bayesian Geographical Model for Group Recommendation
: a Hierarchical Bayesian Geographical Model for Group Recommendation Ziyu Lu Hui Li ikos Mamoulis David W. Cheung Abstract Location-based social networks such as Foursquare and have gained increasing
More informationLarge-scale Collaborative Ranking in Near-Linear Time
Large-scale Collaborative Ranking in Near-Linear Time Liwei Wu Depts of Statistics and Computer Science UC Davis KDD 17, Halifax, Canada August 13-17, 2017 Joint work with Cho-Jui Hsieh and James Sharpnack
More informationDecoupled Collaborative Ranking
Decoupled Collaborative Ranking Jun Hu, Ping Li April 24, 2017 Jun Hu, Ping Li WWW2017 April 24, 2017 1 / 36 Recommender Systems Recommendation system is an information filtering technique, which provides
More informationExploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data
Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data Jinzhong Wang April 13, 2016 The UBD Group Mobile and Social Computing Laboratory School of Software, Dalian University
More informationImpact of Data Characteristics on Recommender Systems Performance
Impact of Data Characteristics on Recommender Systems Performance Gediminas Adomavicius YoungOk Kwon Jingjing Zhang Department of Information and Decision Sciences Carlson School of Management, University
More informationUser Preference Learning with Multiple Information Fusion. restaurant recommendations.
Downloaded 02/8/8 to 46.3.93.252. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php User Preference Learning with Multiple Information Fusion for Restaurant
More informationA Modified PMF Model Incorporating Implicit Item Associations
A Modified PMF Model Incorporating Implicit Item Associations Qiang Liu Institute of Artificial Intelligence College of Computer Science Zhejiang University Hangzhou 31007, China Email: 01dtd@gmail.com
More informationRecurrent Latent Variable Networks for Session-Based Recommendation
Recurrent Latent Variable Networks for Session-Based Recommendation Panayiotis Christodoulou Cyprus University of Technology paa.christodoulou@edu.cut.ac.cy 27/8/2017 Panayiotis Christodoulou (C.U.T.)
More informationManotumruksa, J., Macdonald, C. and Ounis, I. (2018) Contextual Attention Recurrent Architecture for Context-aware Venue Recommendation. In: 41st International ACM SIGIR Conference on Research and Development
More informationContent-based Recommendation
Content-based Recommendation Suthee Chaidaroon June 13, 2016 Contents 1 Introduction 1 1.1 Matrix Factorization......................... 2 2 slda 2 2.1 Model................................. 3 3 flda 3
More informationTime-aware Collaborative Topic Regression: Towards Higher Relevance in Textual Item Recommendation
Time-aware Collaborative Topic Regression: Towards Higher Relevance in Textual Item Recommendation Anas Alzogbi Department of Computer Science, University of Freiburg 79110 Freiburg, Germany alzoghba@informatik.uni-freiburg.de
More informationGeoMF: Joint Geographical Modeling and Matrix Factorization for Point-of-Interest Recommendation
GeoMF: Joint Geographical Modeling and Matrix Factorization for Point-of-Interest Recommendation Defu Lian, Cong Zhao, Xing Xie, Guangzhong Sun, Enhong Chen, Yong Rui University of Science and Technology
More informationData Mining Techniques
Data Mining Techniques CS 622 - Section 2 - Spring 27 Pre-final Review Jan-Willem van de Meent Feedback Feedback https://goo.gl/er7eo8 (also posted on Piazza) Also, please fill out your TRACE evaluations!
More informationService Recommendation for Mashup Composition with Implicit Correlation Regularization
205 IEEE International Conference on Web Services Service Recommendation for Mashup Composition with Implicit Correlation Regularization Lina Yao, Xianzhi Wang, Quan Z. Sheng, Wenjie Ruan, and Wei Zhang
More informationCoSoLoRec: Joint Factor Model with Content, Social, Location for Heterogeneous Point-of-Interest Recommendation
CoSoLoRec: Joint Factor Model with Content, Social, Location for Heterogeneous Point-of-Interest Recommendation Hao Guo 1,XinLi 3,MingHe 1, Xiangyu Zhao 1, Guiquan Liu 1(B), and Guandong Xu 2 1 University
More informationBayesian Contextual Multi-armed Bandits
Bayesian Contextual Multi-armed Bandits Xiaoting Zhao Joint Work with Peter I. Frazier School of Operations Research and Information Engineering Cornell University October 22, 2012 1 / 33 Outline 1 Motivating
More informationHuman Mobility Pattern Prediction Algorithm using Mobile Device Location and Time Data
Human Mobility Pattern Prediction Algorithm using Mobile Device Location and Time Data 0. Notations Myungjun Choi, Yonghyun Ro, Han Lee N = number of states in the model T = length of observation sequence
More informationLecture 16 Deep Neural Generative Models
Lecture 16 Deep Neural Generative Models CMSC 35246: Deep Learning Shubhendu Trivedi & Risi Kondor University of Chicago May 22, 2017 Approach so far: We have considered simple models and then constructed
More informationTransfer Learning for Collective Link Prediction in Multiple Heterogenous Domains
Transfer Learning for Collective Link Prediction in Multiple Heterogenous Domains Bin Cao caobin@cse.ust.hk Nathan Nan Liu nliu@cse.ust.hk Qiang Yang qyang@cse.ust.hk Hong Kong University of Science and
More informationRecent Advances in Bayesian Inference Techniques
Recent Advances in Bayesian Inference Techniques Christopher M. Bishop Microsoft Research, Cambridge, U.K. research.microsoft.com/~cmbishop SIAM Conference on Data Mining, April 2004 Abstract Bayesian
More informationSQL-Rank: A Listwise Approach to Collaborative Ranking
SQL-Rank: A Listwise Approach to Collaborative Ranking Liwei Wu Depts of Statistics and Computer Science UC Davis ICML 18, Stockholm, Sweden July 10-15, 2017 Joint work with Cho-Jui Hsieh and James Sharpnack
More informationUAPD: Predicting Urban Anomalies from Spatial-Temporal Data
UAPD: Predicting Urban Anomalies from Spatial-Temporal Data Xian Wu, Yuxiao Dong, Chao Huang, Jian Xu, Dong Wang and Nitesh V. Chawla* Department of Computer Science and Engineering University of Notre
More informationSCMF: Sparse Covariance Matrix Factorization for Collaborative Filtering
SCMF: Sparse Covariance Matrix Factorization for Collaborative Filtering Jianping Shi Naiyan Wang Yang Xia Dit-Yan Yeung Irwin King Jiaya Jia Department of Computer Science and Engineering, The Chinese
More informationProbabilistic Local Matrix Factorization based on User Reviews
Probabilistic Local Matrix Factorization based on User Reviews Xu Chen 1, Yongfeng Zhang 2, Wayne Xin Zhao 3, Wenwen Ye 1 and Zheng Qin 1 1 School of Software, Tsinghua University 2 College of Information
More informationMatrix Factorization Techniques for Recommender Systems
Matrix Factorization Techniques for Recommender Systems By Yehuda Koren Robert Bell Chris Volinsky Presented by Peng Xu Supervised by Prof. Michel Desmarais 1 Contents 1. Introduction 4. A Basic Matrix
More informationMixture-Rank Matrix Approximation for Collaborative Filtering
Mixture-Rank Matrix Approximation for Collaborative Filtering Dongsheng Li 1 Chao Chen 1 Wei Liu 2 Tun Lu 3,4 Ning Gu 3,4 Stephen M. Chu 1 1 IBM Research - China 2 Tencent AI Lab, China 3 School of Computer
More informationExploiting Local and Global Social Context for Recommendation
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Exploiting Local and Global Social Context for Recommendation Jiliang Tang, Xia Hu, Huiji Gao, Huan Liu Computer
More informationCollaborative Filtering. Radek Pelánek
Collaborative Filtering Radek Pelánek 2017 Notes on Lecture the most technical lecture of the course includes some scary looking math, but typically with intuitive interpretation use of standard machine
More informationSTA 4273H: Sta-s-cal Machine Learning
STA 4273H: Sta-s-cal Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 2 In our
More informationRecommender System for Yelp Dataset CS6220 Data Mining Northeastern University
Recommender System for Yelp Dataset CS6220 Data Mining Northeastern University Clara De Paolis Kaluza Fall 2016 1 Problem Statement and Motivation The goal of this work is to construct a personalized recommender
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate
More informationECS289: Scalable Machine Learning
ECS289: Scalable Machine Learning Cho-Jui Hsieh UC Davis Oct 11, 2016 Paper presentations and final project proposal Send me the names of your group member (2 or 3 students) before October 15 (this Friday)
More informationRelational Stacked Denoising Autoencoder for Tag Recommendation. Hao Wang
Relational Stacked Denoising Autoencoder for Tag Recommendation Hao Wang Dept. of Computer Science and Engineering Hong Kong University of Science and Technology Joint work with Xingjian Shi and Dit-Yan
More informationRecommendation Systems
Recommendation Systems Pawan Goyal CSE, IITKGP October 29-30, 2015 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 29-30, 2015 1 / 61 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation
More informationRating Prediction with Topic Gradient Descent Method for Matrix Factorization in Recommendation
Rating Prediction with Topic Gradient Descent Method for Matrix Factorization in Recommendation Guan-Shen Fang, Sayaka Kamei, Satoshi Fujita Department of Information Engineering Hiroshima University Hiroshima,
More informationarxiv: v2 [cs.ir] 14 May 2018
A Probabilistic Model for the Cold-Start Problem in Rating Prediction using Click Data ThaiBinh Nguyen 1 and Atsuhiro Takasu 1, 1 Department of Informatics, SOKENDAI (The Graduate University for Advanced
More informationJoint user knowledge and matrix factorization for recommender systems
World Wide Web (2018) 21:1141 1163 DOI 10.1007/s11280-017-0476-7 Joint user knowledge and matrix factorization for recommender systems Yonghong Yu 1,2 Yang Gao 2 Hao Wang 2 Ruili Wang 3 Received: 13 February
More informationDemand and Trip Prediction in Bike Share Systems
Demand and Trip Prediction in Bike Share Systems Team members: Zhaonan Qu SUNet ID: zhaonanq December 16, 2017 1 Abstract 2 Introduction Bike Share systems are becoming increasingly popular in urban areas.
More informationAlgorithms for Collaborative Filtering
Algorithms for Collaborative Filtering or How to Get Half Way to Winning $1million from Netflix Todd Lipcon Advisor: Prof. Philip Klein The Real-World Problem E-commerce sites would like to make personalized
More informationGentle Introduction to Infinite Gaussian Mixture Modeling
Gentle Introduction to Infinite Gaussian Mixture Modeling with an application in neuroscience By Frank Wood Rasmussen, NIPS 1999 Neuroscience Application: Spike Sorting Important in neuroscience and for
More informationStudy Notes on the Latent Dirichlet Allocation
Study Notes on the Latent Dirichlet Allocation Xugang Ye 1. Model Framework A word is an element of dictionary {1,,}. A document is represented by a sequence of words: =(,, ), {1,,}. A corpus is a collection
More information13 : Variational Inference: Loopy Belief Propagation and Mean Field
10-708: Probabilistic Graphical Models 10-708, Spring 2012 13 : Variational Inference: Loopy Belief Propagation and Mean Field Lecturer: Eric P. Xing Scribes: Peter Schulam and William Wang 1 Introduction
More informationReview: Probabilistic Matrix Factorization. Probabilistic Matrix Factorization (PMF)
Case Study 4: Collaborative Filtering Review: Probabilistic Matrix Factorization Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox February 2 th, 214 Emily Fox 214 1 Probabilistic
More informationCollaborative Filtering Applied to Educational Data Mining
Journal of Machine Learning Research (200) Submitted ; Published Collaborative Filtering Applied to Educational Data Mining Andreas Töscher commendo research 8580 Köflach, Austria andreas.toescher@commendo.at
More information27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling
10-708: Probabilistic Graphical Models 10-708, Spring 2014 27 : Distributed Monte Carlo Markov Chain Lecturer: Eric P. Xing Scribes: Pengtao Xie, Khoa Luu In this scribe, we are going to review the Parallel
More informationLatent Dirichlet Allocation
Outlines Advanced Artificial Intelligence October 1, 2009 Outlines Part I: Theoretical Background Part II: Application and Results 1 Motive Previous Research Exchangeability 2 Notation and Terminology
More informationPersonalized POI Recommendation on Location-Based Social Networks. Huiji Gao
Personalized POI Recommendation on Location-Based Social Networks by Huiji Gao A Dissertation Presented in Partial Fulfillment of the Requirement for the Degree Doctor of Philosophy Approved November 2014
More informationNOWADAYS, Collaborative Filtering (CF) [14] plays an
JOURNAL OF L A T E X CLASS FILES, VOL. 4, NO. 8, AUGUST 205 Multi-behavioral Sequential Prediction with Recurrent Log-bilinear Model Qiang Liu, Shu Wu, Member, IEEE, and Liang Wang, Senior Member, IEEE
More informationEM-algorithm for motif discovery
EM-algorithm for motif discovery Xiaohui Xie University of California, Irvine EM-algorithm for motif discovery p.1/19 Position weight matrix Position weight matrix representation of a motif with width
More informationFaster Stochastic Variational Inference using Proximal-Gradient Methods with General Divergence Functions
Faster Stochastic Variational Inference using Proximal-Gradient Methods with General Divergence Functions Mohammad Emtiyaz Khan, Reza Babanezhad, Wu Lin, Mark Schmidt, Masashi Sugiyama Conference on Uncertainty
More informationCS 781 Lecture 9 March 10, 2011 Topics: Local Search and Optimization Metropolis Algorithm Greedy Optimization Hopfield Networks Max Cut Problem Nash
CS 781 Lecture 9 March 10, 2011 Topics: Local Search and Optimization Metropolis Algorithm Greedy Optimization Hopfield Networks Max Cut Problem Nash Equilibrium Price of Stability Coping With NP-Hardness
More informationLecture 13 : Variational Inference: Mean Field Approximation
10-708: Probabilistic Graphical Models 10-708, Spring 2017 Lecture 13 : Variational Inference: Mean Field Approximation Lecturer: Willie Neiswanger Scribes: Xupeng Tong, Minxing Liu 1 Problem Setup 1.1
More informationCollaborative Topic Modeling for Recommending Scientific Articles
Collaborative Topic Modeling for Recommending Scientific Articles Chong Wang and David M. Blei Best student paper award at KDD 2011 Computer Science Department, Princeton University Presented by Tian Cao
More informationA Novel Click Model and Its Applications to Online Advertising
A Novel Click Model and Its Applications to Online Advertising Zeyuan Zhu Weizhu Chen Tom Minka Chenguang Zhu Zheng Chen February 5, 2010 1 Introduction Click Model - To model the user behavior Application
More informationModeling User Rating Profiles For Collaborative Filtering
Modeling User Rating Profiles For Collaborative Filtering Benjamin Marlin Department of Computer Science University of Toronto Toronto, ON, M5S 3H5, CANADA marlin@cs.toronto.edu Abstract In this paper
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is
More informationA graph contains a set of nodes (vertices) connected by links (edges or arcs)
BOLTZMANN MACHINES Generative Models Graphical Models A graph contains a set of nodes (vertices) connected by links (edges or arcs) In a probabilistic graphical model, each node represents a random variable,
More informationDeep Poisson Factorization Machines: a factor analysis model for mapping behaviors in journalist ecosystem
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationRaRE: Social Rank Regulated Large-scale Network Embedding
RaRE: Social Rank Regulated Large-scale Network Embedding Authors: Yupeng Gu 1, Yizhou Sun 1, Yanen Li 2, Yang Yang 3 04/26/2018 The Web Conference, 2018 1 University of California, Los Angeles 2 Snapchat
More informationActivity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness
Activity Identification from GPS Trajectories Using Spatial Temporal POIs Attractiveness Lian Huang, Qingquan Li, Yang Yue State Key Laboratory of Information Engineering in Survey, Mapping and Remote
More informationMulti-Attribute Bayesian Optimization under Utility Uncertainty
Multi-Attribute Bayesian Optimization under Utility Uncertainty Raul Astudillo Cornell University Ithaca, NY 14853 ra598@cornell.edu Peter I. Frazier Cornell University Ithaca, NY 14853 pf98@cornell.edu
More informationSTA414/2104 Statistical Methods for Machine Learning II
STA414/2104 Statistical Methods for Machine Learning II Murat A. Erdogdu & David Duvenaud Department of Computer Science Department of Statistical Sciences Lecture 3 Slide credits: Russ Salakhutdinov Announcements
More informationProbabilistic Matrix Factorization
Probabilistic Matrix Factorization David M. Blei Columbia University November 25, 2015 1 Dyadic data One important type of modern data is dyadic data. Dyadic data are measurements on pairs. The idea is
More informationPattern Recognition and Machine Learning
Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability
More informationVCMC: Variational Consensus Monte Carlo
VCMC: Variational Consensus Monte Carlo Maxim Rabinovich, Elaine Angelino, Michael I. Jordan Berkeley Vision and Learning Center September 22, 2015 probabilistic models! sky fog bridge water grass object
More information