Collaborative Filtering

Similar documents
Collaborative Filtering. Radek Pelánek

Recommendation Systems

Recommendation Systems

Recommendation Systems

Recommender Systems. Dipanjan Das Language Technologies Institute Carnegie Mellon University. 20 November, 2007

Matrix Factorization Techniques for Recommender Systems

Matrix Factorization Techniques For Recommender Systems. Collaborative Filtering

Matrix Factorization and Collaborative Filtering

Matrix Factorization Techniques for Recommender Systems

Using SVD to Recommend Movies

Binary Principal Component Analysis in the Netflix Collaborative Filtering Task

Collaborative Filtering Matrix Completion Alternating Least Squares

Preliminaries. Data Mining. The art of extracting knowledge from large bodies of structured data. Let s put it to use!

Collabora've Filtering

Andriy Mnih and Ruslan Salakhutdinov

Introduction to Computational Advertising

Recommender Systems EE448, Big Data Mining, Lecture 10. Weinan Zhang Shanghai Jiao Tong University

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties

Collaborative topic models: motivations cont

Jeffrey D. Ullman Stanford University

Data Mining Techniques

Binary Classification / Perceptron

Collaborative Filtering

* Matrix Factorization and Recommendation Systems

CS425: Algorithms for Web Scale Data

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas

A Modified PMF Model Incorporating Implicit Item Associations

Data Science Mastery Program

Matrix Factorization In Recommender Systems. Yong Zheng, PhDc Center for Web Intelligence, DePaul University, USA March 4, 2015

CS 6375 Machine Learning

Restricted Boltzmann Machines for Collaborative Filtering

Algorithms for Collaborative Filtering

Nonnegative Matrix Factorization

Recommender Systems: Overview and. Package rectools. Norm Matloff. Dept. of Computer Science. University of California at Davis.

Collaborative Filtering Applied to Educational Data Mining

Collaborative Topic Modeling for Recommending Scientific Articles

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Recommendation. Tobias Scheffer

Scaling Neighbourhood Methods

Decoupled Collaborative Ranking

a Short Introduction

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends

Generative Models for Discrete Data

CS425: Algorithms for Web Scale Data

CS 175: Project in Artificial Intelligence. Slides 4: Collaborative Filtering

Principal Component Analysis and Singular Value Decomposition. Volker Tresp, Clemens Otte Summer 2014

Collaborative Filtering: A Machine Learning Perspective

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Joint user knowledge and matrix factorization for recommender systems

Collaborative Recommendation with Multiclass Preference Context

Ranking and Filtering

Department of Computer Science, Guiyang University, Guiyang , GuiZhou, China

Lecture 9: September 28

A Simple Algorithm for Nuclear Norm Regularized Problems

SQL-Rank: A Listwise Approach to Collaborative Ranking

MIDTERM: CS 6375 INSTRUCTOR: VIBHAV GOGATE October,

Probabilistic Neighborhood Selection in Collaborative Filtering Systems

Recommendation Systems

Data Mining and Matrices

Stat 315c: Introduction

Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent

Principal Component Analysis (PCA) for Sparse High-Dimensional Data

Collaborative Filtering with Temporal Dynamics with Using Singular Value Decomposition

Data Mining Techniques

Techniques for Dimensionality Reduction. PCA and Other Matrix Factorization Methods

SVAN 2016 Mini Course: Stochastic Convex Optimization Methods in Machine Learning

Variance Reduction and Ensemble Methods

Learning to Recommend Point-of-Interest with the Weighted Bayesian Personalized Ranking Method in LBSNs

Introduction to Logistic Regression

Domokos Miklós Kelen. Online Recommendation Systems. Eötvös Loránd University. Faculty of Natural Sciences. Advisor:

Matrix and Tensor Factorization from a Machine Learning Perspective

Jointly Clustering Rows and Columns of Binary Matrices: Algorithms and Trade-offs

Probabilistic Low-Rank Matrix Completion with Adaptive Spectral Regularization Algorithms

DATA MINING LECTURE 8. Dimensionality Reduction PCA -- SVD

Similarity and recommender systems

INFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from

6.034 Introduction to Artificial Intelligence

Neural Networks. Nicholas Ruozzi University of Texas at Dallas

Latent Semantic Analysis. Hongning Wang

Little Is Much: Bridging Cross-Platform Behaviors through Overlapped Crowds

TAPER: A Contextual Tensor- Based Approach for Personalized Expert Recommendation

CS249: ADVANCED DATA MINING

SoCo: A Social Network Aided Context-Aware Recommender System

EE 381V: Large Scale Learning Spring Lecture 16 March 7

Graphical Models for Collaborative Filtering

Item Recommendation for Emerging Online Businesses

Circle-based Recommendation in Online Social Networks

Stat 406: Algorithms for classification and prediction. Lecture 1: Introduction. Kevin Murphy. Mon 7 January,

ELE 538B: Mathematics of High-Dimensional Data. Spectral methods. Yuxin Chen Princeton University, Fall 2018

Machine Learning Basics

Recommender Systems. From Content to Latent Factor Analysis. Michael Hahsler

Online Social Networks and Media. Link Analysis and Web Search

Collaborative Filtering for Implicit Feedback

Algorithms for Data Science: Lecture on Finding Similar Items

Clustering based tensor decomposition

Mixed Membership Matrix Factorization

Collaborative Filtering on Ordinal User Feedback

Lecture Notes 10: Matrix Factorization

Recommender systems, matrix factorization, variable selection and social graph data

Learning Theory Continued

Transcription:

Collaborative Filtering Nicholas Ruozzi University of Texas at Dallas based on the slides of Alex Smola & Narges Razavian

Collaborative Filtering Combining information among collaborating entities to make recommendations and predictions Can be viewed as a supervised learning problem (with some caveats) Because of its many, many applications, it gets a special name

Examples Movie/TV recommendation (Netflix, Hulu, itunes) Product recommendation (Amazon) Social recommendation (Facebook) News content recommendation (Yahoo) Priority inbox & spam filtering (Google) Online dating (OK Cupid) 3

Netflix Movie Recommendation Training Data user movie rating 1 14 3 1 00 4 1 315 1 15 5 136 1 3 35 3 4 79 3 Test Data user movie rating 1 50? 1 8? 94? 3? 3 11? 4 99? 4 54? 4

Recommender Systems Content-based recommendations Recommendations based on a user profile (specific interests) or previously consumed content Collaborative filtering Recommendations based on the content preferences of similar users Hybrid approaches 5

Collaborative Filtering Most widely-used recommendation approach k-nearest neighbor methods Matrix factorization based methods Predict the utility of items for a user based on the items previously rated by other like-minded users 6

Collaborative Filtering Make recommendations based on user/item similarities User similarity Works well if number of items is much smaller than the number of users Works well if the items change frequently Item similarity (recommend new items that were also liked by the same users) Works well if the number of users is small 7

k-nearest Neighbor Similar to the spectral clustering based approach from the homework Create a similarity matrix for pairs of users Use k-nn to find the k closest users to a target user Use the ratings of the k nearest neighbors to make predictions 8

User-User Similarity Let r u,i be the rating of the i th item under user u, r u be the average rating of user u, and com(u, v) be the set of items rated by both user u and user v The similarity between user u and user v is then given by Pearson s correlation coefficient sim u, v = i com u,v r u,i r u r v,i r v i com u,v r u,i r u i com u,v r v,i r v 9

User-User Similarity Let nn(u) denote the set of k-nn to u p u,i, the predicted rating for the i th item of user u, is given by p u,i = r u + v nn(u) sim u, v r v,i r v sim(u, v) v nn(u) This is the average rating of user u plus the weighted average of the ratings of u s k nearest neighbors 10

User-User Similarity Issue: could be expensive to find the k-nn if the number of users is very large Possible solutions? 11

Item-Item Similarity Use Pearson s correlation coefficient to compute the similarity between pairs of items Let com(i, j) be the set of users common to items i and j The similarity between items i and j is given by sim i, j = u com i,j r u,i r u r u,j r u u com i,j r u,i r u u com i,j r u,j r u 1

Item-Item Similarity Let nn(i) denote the set of k-nn to i p u,i, the predicted rating for the i th item of user u, is given by p u,i = j nn(i) sim i, j r u,j sim(i, j) j nn(i) This is the weighted average of the ratings of i s k nearest neighbors 13

k-nearest Neighbor Easy to train Easily adapts to new users/items Can be difficult to scale (finding closest pairs requires forming the similarity matrix) Less of a problem for item-item assuming number of items is much smaller than the number of users Not sure how to choose k Can lead to poor accuracy 14

k-nearest Neighbor Tough to use without any ratings information to start with Cold Start New users should rate some initial items to have personalized recommendations Could also have new users describe tastes, etc. New Item/Movie may require content analysis or a non-cf based approach 15

Matrix Factorization There could be a number of latent factors that affect the recommendation Style of movie: serious vs. funny vs. escapist Demographic: is it preferred more by men or women Alternative approach: view CF as a matrix factorization problem 16

Matrix Factorization Express a matrix M R m n approximately as a product of factors A R m p and B R p n M~A B Approximate the user items matrix as a product of matrices in this way Similar to SVD decompositions that we saw earlier (SVD can t be used for a matrix with missing entries) Think of the entries of M as corresponding to an inner product of latent factors 17

Matrix Factorization 18 [from slides of Alex Smola]

Matrix Factorization 19 [from slides of Alex Smola]

Matrix Factorization 0 [from slides of Alex Smola]

Matrix Factorization We can express finding the closest matrix as an optimization problem min A,B u,i observed M u,i A u,:, B :,i + λ A F + B F 1

Matrix Factorization We can express finding the closest matrix as an optimization problem min A,B u,i observed M u,i A u,:, B :,i + λ A F + B F Computes the error in the approximation of the observed matrix entries

Matrix Factorization We can express finding the closest matrix as an optimization problem min A,B u,i observed M u,i A u,:, B :,i + λ A F + B F Regularization preferences matrices with small Frobenius norm 3

Matrix Factorization We can express finding the closest matrix as an optimization problem min A,B u,i observed M u,i A u,:, B :,i + λ A F + B F How to optimize this objective? 4

Matrix Factorization We can express finding the closest matrix as an optimization problem min A,B u,i observed M u,i A u,:, B :,i + λ A F + B F How to optimize this objective? (Stochastic) gradient descent! 5

Extensions The basic matrix factorization approach doesn t take into account the observation that some people are tougher reviewers than others and that some movies are over-hyped Can correct for this by introducing a bias term for each user and a global bias min M u,i μ b i b u A u,:, B :,i A,B,μ,b u,i observed +λ A F + B F + ν i b i + u b u 6