Introduction to Computational Advertising
|
|
- Edwina Hampton
- 6 years ago
- Views:
Transcription
1 Introduction to Computational Advertising MS&E 9 Stanford University Autumn Instructors: Dr. Andrei Broder and Dr. Vanja Josifovski Yahoo! Research
2 General course info Course Website: Instructors Dr. Andrei Broder, Yahoo! Research, Dr. Vanja Josifovski, Yahoo! Research, TA: Krishnamurthy Iyer Office hours: Tuesdays 6:pm-7:pm, Huang Course lists Staff:msande9-aut-staff All: msande9-aut-students Please use the staff list to communicate with the staff Lectures: am ~ :pm Fridays in HP Office Hours: After class and by appointment Andrei and Vanja will be on campus for times each to meet and discuss with students. Feel free to come and chat about even issues that go beyond the class.
3 Course Overview (subject to change). 9/ Overview and Introduction. /7 Marketplace and Economics. / Textual Advertising : Sponsored Search. / Textual Advertising : Contextual Advertising. /8 Display Advertising 6. / Display Advertising 7. / Targeting 8. /8 Recommender Systems 9. / Mobile, Video and other Emerging Formats. /9 Project Presentations
4 Lecture 8: Recommender Systems for Display Advertising Targeting
5 Disclaimers This talk presents the opinions of the authors. It does not necessarily reflect the views of Yahoo! inc or any other entity. Algorithms, techniques, features, etc mentioned here might or might not be in use by Yahoo! or any other company. This lecture is largely based on slides by Yehuda Koren. His help very much appreciated! Other contributors: Amr Ahmed and Alex Smola
6 Lecture 8 plan Problem definition Neighborhood based approaches Matrix factorization approaches Generative models for factorization: Latent Dirichlet Allocation External information in campaign optimization (time permitting) 6
7 Checkpoint - targeting Targeting is a key step in differentiation of impressions and extracting value! Traditional targeting: demo, geo, BT How to get the data from the user? Infer the data from historical activity One of the key step in targeting is user profile generation Generative models to assign probability of a sequence of events Weighting based on time, event type and content Predict the counts of events in certain categories Clustering and other unsupervised techniques useful more to come in the next lecture 7
8 Recommender systems What is actually recommender system technology? A set of techniques to recommend items based on explicit (rating) or implicit (page visits, ad clicks) It collects the user responses and assumes the items are opaque Usually does not take in account the content of the item Opposed to matching of ads we have discussed so far Recent techniques combine the two Shown to be effective in many domains, including advertising Slides mostly on movie ratings, we will discuss the similarities/differences wit the ad domain 8
9 Collaborative filtering Recommend items based on past transactions of many users Analyze relations between users and/or items Specific data characteristics are irrelevant Domain-free: user/item attributes are not necessary Can identify elusive aspects
10
11 score date movie user /7/ 8// /6/ // 7// 768 // 76 8// 9// 68 // /8/ 8// // 6 6 score date movie user? /6/ 6? 9// 96? 8/8/ 7? //? 6// 7? 8//? 9//? 8/7/ 8? // 9? 7/6/ 7? // 69 6? // 8 6 Training data Test data Movie rating data
12 Alternate view of the data: matrix ? 6 users items
13 Major Challenges. Countless factors may affect preferences Genre, movie/tv series/other Style of action, dialogue, plot, music et al. Director, actors. Large imbalances. Scalability Most user-item preferences are unknown Number of ratings per user or item may vary by several orders of magnitude Information to estimate individual parameters varies widely Some datasets contain millions of users/items
14 Conventions r ui r ˆui - rating by user u to item i - predicted rating by user u to item i Error function: rmse( S) = ( ui, ) S rˆ ( ) ui S r ui
15 How does this map to the ad world Ad matrix a lot sparser As with movies, no info does not mean negative response We could determine negative responses by analysis of user history Ranking metrics might be better option AUC of ROC curve Need to limit to the top-k items We cannot show every ad to every user In practice combine rec sys methods with predictive modeling for best performance
16 Neighborhood methods # # # Joe #
17 Neighborhood-based CF Earliest and most popular collaborative filtering method Derive unknown ratings from those of similar items (item-item variant) A parallel user-user flavor: rely on ratings of like-minded users
18 Neighborhood-based CF users items - unknown rating - rating between to
19 Neighborhood-based CF ? 6 users items - estimate rating of item by user
20 Neighborhood-based CF ? 6 users Neighbor selection: Identify items similar to, rated by user items
21 Neighborhood-based CF ? 6 users Compute similarity weights: s =., s 6 =. items
22 Neighborhood-based CF users Predict by taking a weighted average: (.*+.*)/(.+.)=.6 items
23 Properties of neighborhood-based CF Intuitive Easy to explain reasoning behind a recommendation Handles new ratings/users seamlessly No substantial preprocessing is required (?) Accurate (enough?)
24 Data normalization Need to identify relations and mix ratings across items/ users However: User and item-specific variability masks fundamental relationships Examples: Some items are systematically rated higher Some items were rated by users that tend to rate low Ratings change along time Normalization is critical to the success of a k-nn approach
25 Data normalization Remove data characteristics that are unlikely to be explained by k- NN Common practice is to use centering: Remove user- and item-means A more comprehensive approach eliminates additional interfering variability such as time effects See global Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights, ICDM 7 Here, we normalize by removing the baseline predictors: b ui = µ + b u + b i Global mean User bias Item bias
26 Baseline predictors (biases) Mean rating:.7 stars ( ) The Sixth Sense is. stars above avg (b i ) Joe rates. stars below avg (b u ) è Baseline estimation: Joe will rate The Sixth Sense stars ( + b i + b u )
27 Estimation of biases Try to explain each r ui in the train set as Solve the regularized least squares problem: An alternative: µ + b + b min ( r µ b b) + λ ( b + b ) b* ui u i u i ( ui, ) K u i Error for a training case u Regularization First, estimate item biases by averaging over users that rated the item: b i = u R( i) ( r µ ) ui λ + R() i Then, estimate user biases by averaging residuals over items rated by the user: b u = i R( u) ( r µ b) ui λ + R( u) i i
28 Residual: ratings of similar items by the same user. Define a similarity measure between items: s ij. Use s ij to select neighbors s k (i;u): k items most similar to i, that were rated by u. Estimate unknown rating, r ui, as the weighted average rating that u gave to the neighbors: rˆ ui = b + ui k j S ( i; u) s ( r b ) k ij uj uj j S ( i; u) How to compute item-item similarity? s ij
29 Estimating item-item similarities Common practice rely on Pearson correlation coeff Challenge non-uniform user support of item ratings, each item rated by a distinct set of users User ratings for item i:??????????? User ratings for item j:??????????? Compute correlation over shared support
30 Estimating item-item similarities Empirical Pearson correlation coefficient on shared support of items i and j: ˆ ρ = ij u U( i, j) ( r b )( r b ) ui ui uj uj ( r b ) ( r b ) ui ui uj uj u U( i, j) u U( i, j) U(i,j) contains the users who rated both items i and j Estimates with smaller supports are less reliable Use shrunk correlation coeff as a similarity measure: s ij = λ penalizes small supports: Uij (, ) ˆ ρij Uij (, ) + λ U(i,j) << λ è s ij U(i,j) >>λ è s ij ˆij ρ
31 Improvements to common practice Use transformed similarities as interpolation coeff s E.g., by squaring we emphasize stronger relations: rˆ ui = b + ui k j S ( i; u) s ( r b ) k ij uj uj j S ( i; u) See A. Toscher, M. Jahrer and R. Legenstein, Improved Neighborhood-Based Algorithms for Large-Scale Recommender Systems for sigmodial transformations Shrink towards baseline when not enough neighborhood info is available: rˆ ui = b + ui % ' & k j S ( i; u) λ + s ij s ( r b ) ij uj uj k j S ( i; u) s ij " s ij!! # ˆr ui $ b ui j!s k (i;u) ( * )
32 Similarities for binary data Often data is not ratings but binary. Example: ad clicks and conversions This requires other natural similarity measures Notation: m i - #users acting on i m ij - #users acting on both i and j m - overall #users () Jaccard similarity: s Shrink estimates: s ij ij = = m ij m + m m i j ij m ij α + m + m m i j ij
33 Similarities for binary data # () Observed/Expected ratio: Under random sampling, expected i-j intersection : m m (expectation of a hypergeometric distribution) i j m observed mij = expected ( m m / m) i j As usual, need to shrink: s ij = α + m ij ( m m / m) i j
34 A user-user approach Dual to the so-far described item-item approach (with similar derivation) Predict a rating from ratings of similar users on the same item Building stones are user-user similarities s uv : rˆ ui = b + ui k v S ( u; i) λ + s ( r b ) uv vi vi k v S ( u; i) Item-item is commonly considered advantageous over user-user when #items < #users: less item-item relations to store, more stable relations, more reliable estimation Item-item meshes better with new users and explaining rec s In some cases user-user becomes more sensible: When users are the more stable anchor of the system (e.g. items are web articles that quickly expire) When #users < #items s uv
35 Part II: Matrix factorization techniques ~
36 Latent factor models serious The Color Purple Amadeus Braveheart Geared towards females Sense and Sensibility Ocean s Lethal Weapon Geared towards males Dave The Princess Diaries The Lion King escapist Independence Day Gus Dumb and Dumber
37 Basic matrix factorization model items ~ ~ items users users A rank- SVD approximation
38 Estimate unknown ratings as inner-products of factors: items ~ ~ items users A rank- SVD approximation users?
39 Estimate unknown ratings as inner-products of factors: items ~ ~ items users A rank- SVD approximation users?
40 Estimate unknown ratings as inner-products of factors: items ~ ~ items users. A rank- SVD approximation users
41 Matrix factorization model ~ Why can t use standard SVD R = M V? SVD isn t defined when entries are missing Regularization is necessary: Estimate as much signal as possible where there are sufficient data, without over fitting where data are scarce
42 A regularized model Limit the values the factors can take Unlimited values will produce an overfit Reduce the optimization space User factors: Model a user u as a vector p u ~ N k (µ, Σ) Item factors: Model an item i as a vector q i ~ N k (, ) Ratings: Measure agreement between u and i: r ui ~ N(p ut q i, ε ) Simplifying assumptions: µ = =, Σ= = λi
43 Matrix factorization as a cost function Min ( ) ( ) T r p q + λ p + q p*, q* ui u i u i known r ui p u - user-factor of u regularization q i - item-factor of i r ui - rating by u for i Optimize by either stochastic gradient-descent or alternating least squares
44 Stochastic gradient descent optimization Perform till convergence: For each training example r ui : Compute prediction error: e ui = r ui p ut q i Update item factor: q i ß q i +γ(p u e ui -λq i ) Update user factor: p u ß p u +γ(q i e ui -λp u ) Two constants to tune: γ (step size) and λ (regulariza7on) Find values that minimize error on valida&on set See, e.g., Simon Funk, Netflix Update: Try This at Home,
45 R P Q
46 R P Q
47 Matrix factorization with biases T rˆ = µ + b + b + p q ui u i u i Baseline predictors: µ global average b u bias of u b i bias of i è Minimization problem: ( ) ( ) T r µ + b + b + p q + λ p + q + b + b Min ( ) p*, p*, b* ui u i u i u i u i known r ui regularization See, e.g., A. Paterek, Improving regularized singular value decomposition for collaborative filtering, Proc. KDD Cup and Workshop 7
48 Ratings values vs rating occurrences There is information in the fact that user has rated a movie The user chose to see the movie The user chose to rate the movie The choice depends on many factors Can we use this information to improve the factorization?
49 Ratings are not given at random! Distribution of ratings Netflix ratings Yahoo! music ratings Yahoo! survey answers B. Marlin et al., Collaborative Filtering and the Missing at Random Assumption UAI 7
50 A powerful source of information: Characterize users by which items they rated, rather than how they rated è A dense binary representation of the data: users items users items Which items users rate? { }, R r ui ui = { }, B b ui ui =
51 Factoring the binary view Describe both R and B using a factor model: r = µ + b + b + p T q ui u i u i b = p T x ui u i User factors are shared across models Each item i is associated with two factor vectors: q i and x i
52 Factoring the binary view T ui u i i: b = p x X = ( x, x,!, x ) n B = ( b,b,!,b ) u u u un p u = (XX T )! XB u = p XB b x u u uj j j User factor is indirectly defined by item factors: sum of item factors for items rated by u
53 Integrating ratings and binary views So far: Each user u is associated with a factor vector p u Each item i is associated with two factor vectors: q i and x i The pure rating model: r = µ + b + b + q p T ui u i i u The binary view of the user factors: pu bujxj = xj j j rated by u
54 Integrating ratings and binary views So far: Each user u is associated with a factor vector p u Each item i is associated with two factor vectors: q i and x i The pure rating model: r = µ + b + b + q p T ui u i i u The binary view of the user factors: pu bujxj = xj j j rated by u r = µ + b + b + q p + b x T ui u i i u uj j j Factorization Meets the Neighborhood, KDD 8 R. Salakhutdinov and A. Mnih, Probabilistic Matrix Factorization, NIPS 7
55 RMSE Factor models: Error vs. #parameters (Ne=lix) who rated what NMF BiasSVD SVD++ SVD v. SVD v. SVD v..87 Millions of Parameters
56 Temporal dynamics Panta rhei 9
57 Something Happened in Early Netflix ratings by date
58 Are movies getting better with time?
59 Multiple sources of temporal dynamics Item-side effects: Product perception and popularity are constantly changing Seasonal patterns influence items popularity User-side effects: Customers redefine their taste Transient, short-term bias; anchoring Drifting rating scale Change of rater within household
60 Temporal dynamics - challenges Multiple effects: Both items and users are changing over time è Scarce data per target Inter-related targets: Signal needs to be shared among users foundation of collaborative filtering è cannot isolate multiple problems è Common concept drift methodologies won t hold. E.g., underweighting older instances is unappealing
61 Addressing temporal dynamics Factor model conveniently allows separately treating different aspects We observe changes in:. Rating scale of individual users Baseline predictors. Popularity of individual items. User preferences User factors T r ( t) = µ + b () t + b( t) + q p ( t) ui u i i u
62 Parameterizing the model T r ( t) = µ + b () t + b( t) + q p ( t) ui u i i u Use functional forms: b u (t)=f(u,t), b i (t)=g(i,t), p u (t)=h(u,t) Need to find adequate f(), g(), h() General guidelines: Items show slower temporal changes Users exhibit frequent and sudden changes Factors p u (t) are expensive to model Gain flexibility by heavily parameterizing the functions Collaborative Filtering with Temporal Dynamics, KDD 9
63 Factor models: Error vs. #parameters NMF BiasSVD SVD++ RMSE temporal effects SVD v. SVD v. SVD v..87 Millions of Parameters
CS425: Algorithms for Web Scale Data
CS: Algorithms for Web Scale Data Most of the slides are from the Mining of Massive Datasets book. These slides have been modified for CS. The original slides can be accessed at: www.mmds.org J. Leskovec,
More informationMining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University
Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them to fit
More informationCollaborative Filtering. Radek Pelánek
Collaborative Filtering Radek Pelánek 2017 Notes on Lecture the most technical lecture of the course includes some scary looking math, but typically with intuitive interpretation use of standard machine
More informationCollaborative Filtering
Collaborative Filtering Nicholas Ruozzi University of Texas at Dallas based on the slides of Alex Smola & Narges Razavian Collaborative Filtering Combining information among collaborating entities to make
More informationRecommender Systems EE448, Big Data Mining, Lecture 10. Weinan Zhang Shanghai Jiao Tong University
2018 EE448, Big Data Mining, Lecture 10 Recommender Systems Weinan Zhang Shanghai Jiao Tong University http://wnzhang.net http://wnzhang.net/teaching/ee448/index.html Content of This Course Overview of
More informationRecommender Systems. Dipanjan Das Language Technologies Institute Carnegie Mellon University. 20 November, 2007
Recommender Systems Dipanjan Das Language Technologies Institute Carnegie Mellon University 20 November, 2007 Today s Outline What are Recommender Systems? Two approaches Content Based Methods Collaborative
More informationRecommendation Systems
Recommendation Systems Pawan Goyal CSE, IITKGP October 21, 2014 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 21, 2014 1 / 52 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation
More informationRecommendation Systems
Recommendation Systems Popularity Recommendation Systems Predicting user responses to options Offering news articles based on users interests Offering suggestions on what the user might like to buy/consume
More informationRecommendation Systems
Recommendation Systems Pawan Goyal CSE, IITKGP October 29-30, 2015 Pawan Goyal (IIT Kharagpur) Recommendation Systems October 29-30, 2015 1 / 61 Recommendation System? Pawan Goyal (IIT Kharagpur) Recommendation
More informationMatrix Factorization Techniques for Recommender Systems
Matrix Factorization Techniques for Recommender Systems By Yehuda Koren Robert Bell Chris Volinsky Presented by Peng Xu Supervised by Prof. Michel Desmarais 1 Contents 1. Introduction 4. A Basic Matrix
More informationMatrix Factorization Techniques For Recommender Systems. Collaborative Filtering
Matrix Factorization Techniques For Recommender Systems Collaborative Filtering Markus Freitag, Jan-Felix Schwarz 28 April 2011 Agenda 2 1. Paper Backgrounds 2. Latent Factor Models 3. Overfitting & Regularization
More informationCollaborative Filtering Applied to Educational Data Mining
Journal of Machine Learning Research (200) Submitted ; Published Collaborative Filtering Applied to Educational Data Mining Andreas Töscher commendo research 8580 Köflach, Austria andreas.toescher@commendo.at
More informationData Science Mastery Program
Data Science Mastery Program Copyright Policy All content included on the Site or third-party platforms as part of the class, such as text, graphics, logos, button icons, images, audio clips, video clips,
More informationAndriy Mnih and Ruslan Salakhutdinov
MATRIX FACTORIZATION METHODS FOR COLLABORATIVE FILTERING Andriy Mnih and Ruslan Salakhutdinov University of Toronto, Machine Learning Group 1 What is collaborative filtering? The goal of collaborative
More informationCollaborative Filtering on Ordinal User Feedback
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Collaborative Filtering on Ordinal User Feedback Yehuda Koren Google yehudako@gmail.com Joseph Sill Analytics Consultant
More informationScaling Neighbourhood Methods
Quick Recap Scaling Neighbourhood Methods Collaborative Filtering m = #items n = #users Complexity : m * m * n Comparative Scale of Signals ~50 M users ~25 M items Explicit Ratings ~ O(1M) (1 per billion)
More informationData Mining Techniques
Data Mining Techniques CS 622 - Section 2 - Spring 27 Pre-final Review Jan-Willem van de Meent Feedback Feedback https://goo.gl/er7eo8 (also posted on Piazza) Also, please fill out your TRACE evaluations!
More information* Matrix Factorization and Recommendation Systems
Matrix Factorization and Recommendation Systems Originally presented at HLF Workshop on Matrix Factorization with Loren Anderson (University of Minnesota Twin Cities) on 25 th September, 2017 15 th March,
More informationA Modified PMF Model Incorporating Implicit Item Associations
A Modified PMF Model Incorporating Implicit Item Associations Qiang Liu Institute of Artificial Intelligence College of Computer Science Zhejiang University Hangzhou 31007, China Email: 01dtd@gmail.com
More informationAlgorithms for Collaborative Filtering
Algorithms for Collaborative Filtering or How to Get Half Way to Winning $1million from Netflix Todd Lipcon Advisor: Prof. Philip Klein The Real-World Problem E-commerce sites would like to make personalized
More informationCS425: Algorithms for Web Scale Data
CS: Algorithms for Web Scale Data Most of the slides are from the Mining of Massive Datasets book. These slides have been modified for CS. The original slides can be accessed at: www.mmds.org Customer
More informationCS 175: Project in Artificial Intelligence. Slides 4: Collaborative Filtering
CS 175: Project in Artificial Intelligence Slides 4: Collaborative Filtering 1 Topic 6: Collaborative Filtering Some slides taken from Prof. Smyth (with slight modifications) 2 Outline General aspects
More informationMatrix Factorization Techniques for Recommender Systems
Matrix Factorization Techniques for Recommender Systems Patrick Seemann, December 16 th, 2014 16.12.2014 Fachbereich Informatik Recommender Systems Seminar Patrick Seemann Topics Intro New-User / New-Item
More informationData Mining and Matrices
Data Mining and Matrices 04 Matrix Completion Rainer Gemulla, Pauli Miettinen May 02, 2013 Recommender systems Problem Set of users Set of items (movies, books, jokes, products, stories,...) Feedback (ratings,
More informationMatrix Factorization and Collaborative Filtering
10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Matrix Factorization and Collaborative Filtering MF Readings: (Koren et al., 2009)
More informationImpact of Data Characteristics on Recommender Systems Performance
Impact of Data Characteristics on Recommender Systems Performance Gediminas Adomavicius YoungOk Kwon Jingjing Zhang Department of Information and Decision Sciences Carlson School of Management, University
More informationCollaborative topic models: motivations cont
Collaborative topic models: motivations cont Two topics: machine learning social network analysis Two people: " boy Two articles: article A! girl article B Preferences: The boy likes A and B --- no problem.
More informationPreliminaries. Data Mining. The art of extracting knowledge from large bodies of structured data. Let s put it to use!
Data Mining The art of extracting knowledge from large bodies of structured data. Let s put it to use! 1 Recommendations 2 Basic Recommendations with Collaborative Filtering Making Recommendations 4 The
More informationThe BigChaos Solution to the Netflix Prize 2008
The BigChaos Solution to the Netflix Prize 2008 Andreas Töscher and Michael Jahrer commendo research & consulting Neuer Weg 23, A-8580 Köflach, Austria {andreas.toescher,michael.jahrer}@commendo.at November
More informationCollaborative Filtering with Temporal Dynamics with Using Singular Value Decomposition
ISSN 1330-3651 (Print), ISSN 1848-6339 (Online) https://doi.org/10.17559/tv-20160708140839 Original scientific paper Collaborative Filtering with Temporal Dynamics with Using Singular Value Decomposition
More informationRestricted Boltzmann Machines for Collaborative Filtering
Restricted Boltzmann Machines for Collaborative Filtering Authors: Ruslan Salakhutdinov Andriy Mnih Geoffrey Hinton Benjamin Schwehn Presentation by: Ioan Stanculescu 1 Overview The Netflix prize problem
More informationDeep Learning Basics Lecture 7: Factor Analysis. Princeton University COS 495 Instructor: Yingyu Liang
Deep Learning Basics Lecture 7: Factor Analysis Princeton University COS 495 Instructor: Yingyu Liang Supervised v.s. Unsupervised Math formulation for supervised learning Given training data x i, y i
More informationUsing SVD to Recommend Movies
Michael Percy University of California, Santa Cruz Last update: December 12, 2009 Last update: December 12, 2009 1 / Outline 1 Introduction 2 Singular Value Decomposition 3 Experiments 4 Conclusion Last
More informationINFO 4300 / CS4300 Information Retrieval. slides adapted from Hinrich Schütze s, linked from
INFO 4300 / CS4300 Information Retrieval slides adapted from Hinrich Schütze s, linked from http://informationretrieval.org/ IR 8: Evaluation & SVD Paul Ginsparg Cornell University, Ithaca, NY 20 Sep 2011
More informationDecoupled Collaborative Ranking
Decoupled Collaborative Ranking Jun Hu, Ping Li April 24, 2017 Jun Hu, Ping Li WWW2017 April 24, 2017 1 / 36 Recommender Systems Recommendation system is an information filtering technique, which provides
More informationCollaborative Recommendation with Multiclass Preference Context
Collaborative Recommendation with Multiclass Preference Context Weike Pan and Zhong Ming {panweike,mingz}@szu.edu.cn College of Computer Science and Software Engineering Shenzhen University Pan and Ming
More informationCS249: ADVANCED DATA MINING
CS249: ADVANCED DATA MINING Recommender Systems Instructor: Yizhou Sun yzsun@cs.ucla.edu May 17, 2017 Methods Learnt: Last Lecture Classification Clustering Vector Data Text Data Recommender System Decision
More information6.034 Introduction to Artificial Intelligence
6.34 Introduction to Artificial Intelligence Tommi Jaakkola MIT CSAIL The world is drowning in data... The world is drowning in data...... access to information is based on recommendations Recommending
More informationRanking and Filtering
2018 CS420, Machine Learning, Lecture 7 Ranking and Filtering Weinan Zhang Shanghai Jiao Tong University http://wnzhang.net http://wnzhang.net/teaching/cs420/index.html Content of This Course Another ML
More informationUniversität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Recommendation. Tobias Scheffer
Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Recommendation Tobias Scheffer Recommendation Engines Recommendation of products, music, contacts,.. Based on user features, item
More informationFactor Modeling for Advertisement Targeting
Ye Chen 1, Michael Kapralov 2, Dmitry Pavlov 3, John F. Canny 4 1 ebay Inc, 2 Stanford University, 3 Yandex Labs, 4 UC Berkeley NIPS-2009 Presented by Miao Liu May 27, 2010 Introduction GaP model Sponsored
More informationPoint-of-Interest Recommendations: Learning Potential Check-ins from Friends
Point-of-Interest Recommendations: Learning Potential Check-ins from Friends Huayu Li, Yong Ge +, Richang Hong, Hengshu Zhu University of North Carolina at Charlotte + University of Arizona Hefei University
More informationarxiv: v2 [cs.ir] 14 May 2018
A Probabilistic Model for the Cold-Start Problem in Rating Prediction using Click Data ThaiBinh Nguyen 1 and Atsuhiro Takasu 1, 1 Department of Informatics, SOKENDAI (The Graduate University for Advanced
More informationDomokos Miklós Kelen. Online Recommendation Systems. Eötvös Loránd University. Faculty of Natural Sciences. Advisor:
Eötvös Loránd University Faculty of Natural Sciences Online Recommendation Systems MSc Thesis Domokos Miklós Kelen Applied Mathematics MSc Advisor: András Benczúr Ph.D. Department of Operations Research
More informationMixed Membership Matrix Factorization
Mixed Membership Matrix Factorization Lester Mackey University of California, Berkeley Collaborators: David Weiss, University of Pennsylvania Michael I. Jordan, University of California, Berkeley 2011
More informationService Recommendation for Mashup Composition with Implicit Correlation Regularization
205 IEEE International Conference on Web Services Service Recommendation for Mashup Composition with Implicit Correlation Regularization Lina Yao, Xianzhi Wang, Quan Z. Sheng, Wenjie Ruan, and Wei Zhang
More informationContext-aware Ensemble of Multifaceted Factorization Models for Recommendation Prediction in Social Networks
Context-aware Ensemble of Multifaceted Factorization Models for Recommendation Prediction in Social Networks Yunwen Chen kddchen@gmail.com Yingwei Xin xinyingwei@gmail.com Lu Yao luyao.2013@gmail.com Zuotao
More informationLarge-scale Ordinal Collaborative Filtering
Large-scale Ordinal Collaborative Filtering Ulrich Paquet, Blaise Thomson, and Ole Winther Microsoft Research Cambridge, University of Cambridge, Technical University of Denmark ulripa@microsoft.com,brmt2@cam.ac.uk,owi@imm.dtu.dk
More informationMatrix Factorization In Recommender Systems. Yong Zheng, PhDc Center for Web Intelligence, DePaul University, USA March 4, 2015
Matrix Factorization In Recommender Systems Yong Zheng, PhDc Center for Web Intelligence, DePaul University, USA March 4, 2015 Table of Contents Background: Recommender Systems (RS) Evolution of Matrix
More informationMixed Membership Matrix Factorization
Mixed Membership Matrix Factorization Lester Mackey 1 David Weiss 2 Michael I. Jordan 1 1 University of California, Berkeley 2 University of Pennsylvania International Conference on Machine Learning, 2010
More informationFrom Non-Negative Matrix Factorization to Deep Learning
The Math!! From Non-Negative Matrix Factorization to Deep Learning Intuitions and some Math too! luissarmento@gmailcom https://wwwlinkedincom/in/luissarmento/ October 18, 2017 The Math!! Introduction Disclaimer
More informationGenerative Models for Discrete Data
Generative Models for Discrete Data ddebarr@uw.edu 2016-04-21 Agenda Bayesian Concept Learning Beta-Binomial Model Dirichlet-Multinomial Model Naïve Bayes Classifiers Bayesian Concept Learning Numbers
More informationRegression. Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning)
Linear Regression Regression Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning) Example: Height, Gender, Weight Shoe Size Audio features
More informationSCMF: Sparse Covariance Matrix Factorization for Collaborative Filtering
SCMF: Sparse Covariance Matrix Factorization for Collaborative Filtering Jianping Shi Naiyan Wang Yang Xia Dit-Yan Yeung Irwin King Jiaya Jia Department of Computer Science and Engineering, The Chinese
More informationRegression. Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning)
Linear Regression Regression Goal: Learn a mapping from observations (features) to continuous labels given a training set (supervised learning) Example: Height, Gender, Weight Shoe Size Audio features
More informationMatrix and Tensor Factorization from a Machine Learning Perspective
Matrix and Tensor Factorization from a Machine Learning Perspective Christoph Freudenthaler Information Systems and Machine Learning Lab, University of Hildesheim Research Seminar, Vienna University of
More informationPersonalized Ranking for Non-Uniformly Sampled Items
JMLR: Workshop and Conference Proceedings 18:231 247, 2012 Proceedings of KDD-Cup 2011 competition Personalized Ranking for Non-Uniformly Sampled Items Zeno Gantner Lucas Drumond Christoph Freudenthaler
More informationCircle-based Recommendation in Online Social Networks
Circle-based Recommendation in Online Social Networks Xiwang Yang, Harald Steck*, and Yong Liu Polytechnic Institute of NYU * Bell Labs/Netflix 1 Outline q Background & Motivation q Circle-based RS Trust
More informationSQL-Rank: A Listwise Approach to Collaborative Ranking
SQL-Rank: A Listwise Approach to Collaborative Ranking Liwei Wu Depts of Statistics and Computer Science UC Davis ICML 18, Stockholm, Sweden July 10-15, 2017 Joint work with Cho-Jui Hsieh and James Sharpnack
More informationCollaborative Topic Modeling for Recommending Scientific Articles
Collaborative Topic Modeling for Recommending Scientific Articles Chong Wang and David M. Blei Best student paper award at KDD 2011 Computer Science Department, Princeton University Presented by Tian Cao
More informationRating Prediction with Topic Gradient Descent Method for Matrix Factorization in Recommendation
Rating Prediction with Topic Gradient Descent Method for Matrix Factorization in Recommendation Guan-Shen Fang, Sayaka Kamei, Satoshi Fujita Department of Information Engineering Hiroshima University Hiroshima,
More informationTopicMF: Simultaneously Exploiting Ratings and Reviews for Recommendation
Proceedings of the wenty-eighth AAAI Conference on Artificial Intelligence opicmf: Simultaneously Exploiting Ratings and Reviews for Recommendation Yang Bao Hui Fang Jie Zhang Nanyang Business School,
More informationStat 406: Algorithms for classification and prediction. Lecture 1: Introduction. Kevin Murphy. Mon 7 January,
1 Stat 406: Algorithms for classification and prediction Lecture 1: Introduction Kevin Murphy Mon 7 January, 2008 1 1 Slides last updated on January 7, 2008 Outline 2 Administrivia Some basic definitions.
More informationBinary Principal Component Analysis in the Netflix Collaborative Filtering Task
Binary Principal Component Analysis in the Netflix Collaborative Filtering Task László Kozma, Alexander Ilin, Tapani Raiko first.last@tkk.fi Helsinki University of Technology Adaptive Informatics Research
More informationContent-based Recommendation
Content-based Recommendation Suthee Chaidaroon June 13, 2016 Contents 1 Introduction 1 1.1 Matrix Factorization......................... 2 2 slda 2 2.1 Model................................. 3 3 flda 3
More informationPredictive Discrete Latent Factor Models for large incomplete dyadic data
Predictive Discrete Latent Factor Models for large incomplete dyadic data Deepak Agarwal, Srujana Merugu, Abhishek Agarwal Y! Research MMDS Workshop, Stanford University 6/25/2008 Agenda Motivating applications
More informationData Mining Techniques
Data Mining Techniques CS 6220 - Section 3 - Fall 2016 Lecture 21: Review Jan-Willem van de Meent Schedule Topics for Exam Pre-Midterm Probability Information Theory Linear Regression Classification Clustering
More informationa Short Introduction
Collaborative Filtering in Recommender Systems: a Short Introduction Norm Matloff Dept. of Computer Science University of California, Davis matloff@cs.ucdavis.edu December 3, 2016 Abstract There is a strong
More informationAsymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation
Asymmetric Correlation Regularized Matrix Factorization for Web Service Recommendation Qi Xie, Shenglin Zhao, Zibin Zheng, Jieming Zhu and Michael R. Lyu School of Computer Science and Technology, Southwest
More informationJeffrey D. Ullman Stanford University
Jeffrey D. Ullman Stanford University 2 Often, our data can be represented by an m-by-n matrix. And this matrix can be closely approximated by the product of two matrices that share a small common dimension
More informationarxiv: v2 [cs.ir] 4 Jun 2018
Metric Factorization: Recommendation beyond Matrix Factorization arxiv:1802.04606v2 [cs.ir] 4 Jun 2018 Shuai Zhang, Lina Yao, Yi Tay, Xiwei Xu, Xiang Zhang and Liming Zhu School of Computer Science and
More informationThe BellKor Solution to the Netflix Grand Prize
1 The BellKor Solution to the Netflix Grand Prize Yehuda Koren August 2009 I. INTRODUCTION This article describes part of our contribution to the Bell- Kor s Pragmatic Chaos final solution, which won the
More informationBayesian Matrix Factorization with Side Information and Dirichlet Process Mixtures
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10) Bayesian Matrix Factorization with Side Information and Dirichlet Process Mixtures Ian Porteous and Arthur Asuncion
More informationLarge-Scale Matrix Factorization with Distributed Stochastic Gradient Descent
Large-Scale Matrix Factorization with Distributed Stochastic Gradient Descent KDD 2011 Rainer Gemulla, Peter J. Haas, Erik Nijkamp and Yannis Sismanis Presenter: Jiawen Yao Dept. CSE, UT Arlington 1 1
More informationELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties
ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties Prof. James She james.she@ust.hk 1 Last lecture 2 Selected works from Tutorial
More informationECS289: Scalable Machine Learning
ECS289: Scalable Machine Learning Cho-Jui Hsieh UC Davis Oct 11, 2016 Paper presentations and final project proposal Send me the names of your group member (2 or 3 students) before October 15 (this Friday)
More informationStat 315c: Introduction
Stat 315c: Introduction Art B. Owen Stanford Statistics Art B. Owen (Stanford Statistics) Stat 315c: Introduction 1 / 14 Stat 315c Analysis of Transposable Data Usual Statistics Setup there s Y (we ll
More informationA Regularized Recommendation Algorithm with Probabilistic Sentiment-Ratings
A Regularized Recommendation Algorithm with Probabilistic Sentiment-Ratings Filipa Peleja, Pedro Dias and João Magalhães Department of Computer Science Faculdade de Ciências e Tecnologia Universidade Nova
More informationMatrix Factorization and Factorization Machines for Recommender Systems
Talk at SDM workshop on Machine Learning Methods on Recommender Systems, May 2, 215 Chih-Jen Lin (National Taiwan Univ.) 1 / 54 Matrix Factorization and Factorization Machines for Recommender Systems Chih-Jen
More informationRecommender Systems with Social Regularization
Recommender Systems with Social Regularization Hao Ma The Chinese University of Hong Kong Shatin, N.T., Hong Kong hma@cse.cuhk.edu.hk Michael R. Lyu The Chinese University of Hong Kong Shatin, N.T., Hong
More informationReview: Probabilistic Matrix Factorization. Probabilistic Matrix Factorization (PMF)
Case Study 4: Collaborative Filtering Review: Probabilistic Matrix Factorization Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox February 2 th, 214 Emily Fox 214 1 Probabilistic
More informationIntroduction to Logistic Regression
Introduction to Logistic Regression Guy Lebanon Binary Classification Binary classification is the most basic task in machine learning, and yet the most frequent. Binary classifiers often serve as the
More informationThe Pragmatic Theory solution to the Netflix Grand Prize
The Pragmatic Theory solution to the Netflix Grand Prize Martin Piotte Martin Chabbert August 2009 Pragmatic Theory Inc., Canada nfpragmatictheory@gmail.com Table of Contents 1 Introduction... 3 2 Common
More informationA Matrix Factorization Technique with Trust Propagation for Recommendation in Social Networks
A Matrix Factorization Technique with Trust Propagation for Recommendation in Social Networks ABSTRACT Mohsen Jamali School of Computing Science Simon Fraser University Burnaby, BC, Canada mohsen_jamali@cs.sfu.ca
More informationCollaborative Location Recommendation by Integrating Multi-dimensional Contextual Information
1 Collaborative Location Recommendation by Integrating Multi-dimensional Contextual Information LINA YAO, University of New South Wales QUAN Z. SHENG, Macquarie University XIANZHI WANG, Singapore Management
More informationLarge-scale Collaborative Ranking in Near-Linear Time
Large-scale Collaborative Ranking in Near-Linear Time Liwei Wu Depts of Statistics and Computer Science UC Davis KDD 17, Halifax, Canada August 13-17, 2017 Joint work with Cho-Jui Hsieh and James Sharpnack
More informationNonnegative Matrix Factorization
Nonnegative Matrix Factorization Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr
More informationMatrix Factorization & Latent Semantic Analysis Review. Yize Li, Lanbo Zhang
Matrix Factorization & Latent Semantic Analysis Review Yize Li, Lanbo Zhang Overview SVD in Latent Semantic Indexing Non-negative Matrix Factorization Probabilistic Latent Semantic Indexing Vector Space
More informationRaRE: Social Rank Regulated Large-scale Network Embedding
RaRE: Social Rank Regulated Large-scale Network Embedding Authors: Yupeng Gu 1, Yizhou Sun 1, Yanen Li 2, Yang Yang 3 04/26/2018 The Web Conference, 2018 1 University of California, Los Angeles 2 Snapchat
More informationAlternating Least Squares for Personalized Ranking
Alternating Least Squares for Personalized Ranking ABSTRACT Gábor Takács Széchenyi István University, Dept. of Mathematics and Computer Science Egyetem tér 1. Győr, Hungary gtakacs@sze.hu Two flavors of
More informationRanking-Oriented Evaluation Metrics
Ranking-Oriented Evaluation Metrics Weike Pan College of Computer Science and Software Engineering Shenzhen University W.K. Pan (CSSE, SZU) Ranking-Oriented Evaluation Metrics 1 / 21 Outline 1 Introduction
More informationJoint user knowledge and matrix factorization for recommender systems
World Wide Web (2018) 21:1141 1163 DOI 10.1007/s11280-017-0476-7 Joint user knowledge and matrix factorization for recommender systems Yonghong Yu 1,2 Yang Gao 2 Hao Wang 2 Ruili Wang 3 Received: 13 February
More informationDepartment of Computer Science, Guiyang University, Guiyang , GuiZhou, China
doi:10.21311/002.31.12.01 A Hybrid Recommendation Algorithm with LDA and SVD++ Considering the News Timeliness Junsong Luo 1*, Can Jiang 2, Peng Tian 2 and Wei Huang 2, 3 1 College of Information Science
More informationImproving Response Prediction for Dyadic Data
Improving Response Prediction for Dyadic Data Stat 598T Statistical Network Analysis Project Report Nik Tuzov* April 2008 Keywords: Dyadic data, Co-clustering, PDLF model, Neural networks, Unsupervised
More informationA Gradient-based Adaptive Learning Framework for Efficient Personal Recommendation
A Gradient-based Adaptive Learning Framework for Efficient Personal Recommendation Yue Ning 1 Yue Shi 2 Liangjie Hong 2 Huzefa Rangwala 3 Naren Ramakrishnan 1 1 Virginia Tech 2 Yahoo Research. Yue Shi
More informationKernelized Matrix Factorization for Collaborative Filtering
Kernelized Matrix Factorization for Collaborative Filtering Xinyue Liu Charu Aggarwal Yu-Feng Li Xiangnan Kong Xinyuan Sun Saket Sathe Abstract Matrix factorization (MF) methods have shown great promise
More informationFacing the information flood in our daily lives, search engines mainly respond
Interaction-Rich ransfer Learning for Collaborative Filtering with Heterogeneous User Feedback Weike Pan and Zhong Ming, Shenzhen University A novel and efficient transfer learning algorithm called interaction-rich
More informationItem Recommendation for Emerging Online Businesses
Item Recommendation for Emerging Online Businesses Chun-Ta Lu Sihong Xie Weixiang Shao Lifang He Philip S. Yu University of Illinois at Chicago Presenter: Chun-Ta Lu New Online Businesses Emerge Rapidly
More informationMatrix Factorization and Neighbor Based Algorithms for the Netflix Prize Problem
Matrix Factorization and Neighbor Based Algorithms for the Netflix Prize Problem Gábor Takács Dept. of Mathematics and Computer Science Széchenyi István University Egyetem tér 1. Győr, Hungary gtakacs@sze.hu
More informationA Simple Algorithm for Nuclear Norm Regularized Problems
A Simple Algorithm for Nuclear Norm Regularized Problems ICML 00 Martin Jaggi, Marek Sulovský ETH Zurich Matrix Factorizations for recommender systems Y = Customer Movie UV T = u () The Netflix challenge:
More information