Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization

Similar documents
Localized user-driven topic discovery via boosted ensemble of nonnegative matrix factorization

Dimension Reduction and Iterative Consensus Clustering

Fast Coordinate Descent methods for Non-Negative Matrix Factorization

Orthogonal Nonnegative Matrix Factorization: Multiplicative Updates on Stiefel Manifolds

Introduction to Boosting and Joint Boosting

MULTIPLICATIVE ALGORITHM FOR CORRENTROPY-BASED NONNEGATIVE MATRIX FACTORIZATION

Nonnegative matrix factorization for interactive topic modeling and document clustering

A Randomized Approach for Crowdsourcing in the Presence of Multiple Views

Nonnegative Matrix Factorization Clustering on Multiple Manifolds

Variable Latent Semantic Indexing

arxiv: v3 [cs.lg] 18 Mar 2013

Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data

Heterogeneous Learning. Jingrui He Computer Science Department Stevens Institute of Technology

Data Mining Techniques. Lecture 2: Regression

CPSC 340: Machine Learning and Data Mining. Sparse Matrix Factorization Fall 2018

Introduction to Course

Robust Asymmetric Nonnegative Matrix Factorization

CPSC 340: Machine Learning and Data Mining. More PCA Fall 2016

Machine Learning for Signal Processing Sparse and Overcomplete Representations

COMS 4721: Machine Learning for Data Science Lecture 18, 4/4/2017

RaRE: Social Rank Regulated Large-scale Network Embedding

c 4, < y 2, 1 0, otherwise,

Chapter 4 The Simplex Algorithm Part II

A CURRICULUM RESOURCE NEWSLETTER

Clustering Lecture 1: Basics. Jing Gao SUNY Buffalo

P leiades: Subspace Clustering and Evaluation

ECS289: Scalable Machine Learning

Behavioral Data Mining. Lecture 2

An Optimized Interestingness Hotspot Discovery Framework for Large Gridded Spatio-temporal Datasets

Believe it Today or Tomorrow? Detecting Untrustworthy Information from Dynamic Multi-Source Data

CPSC 340: Machine Learning and Data Mining. More PCA Fall 2017

Multi-Task Clustering using Constrained Symmetric Non-Negative Matrix Factorization

Differential Modeling for Cancer Microarray Data

Non-Negative Matrix Factorization

Using co-clustering to analyze spatio-temporal patterns: a case study based on spring phenology

Solutions to Section 2.9 Homework Problems Problems 1 5, 7, 9, 10 15, (odd), and 38. S. F. Ellermeyer June 21, 2002

Fast Nonnegative Matrix Factorization with Rank-one ADMM

Online Nonnegative Matrix Factorization with General Divergences

Machine Learning Linear Regression. Prof. Matteo Matteucci

Distance Metric Learning in Data Mining (Part II) Fei Wang and Jimeng Sun IBM TJ Watson Research Center

Alternative Clustering, Multiview Clustering: What Can We Learn From Each Other?

Feature Engineering, Model Evaluations

arxiv: v1 [stat.ml] 23 Dec 2015

Fast Nonnegative Tensor Factorization with an Active-Set-Like Method

EUSIPCO

Preferences in college applications

Interpretable Nonnegative Matrix Decompositions

Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time

Matrix Decomposition in Privacy-Preserving Data Mining JUN ZHANG DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF KENTUCKY

Uncovering the Latent Structures of Crowd Labeling

ECE521 lecture 4: 19 January Optimization, MLE, regularization

Meelis Kull Autumn Meelis Kull - Autumn MTAT Data Mining - Lecture 05

Performance Evaluation of the Matlab PCT for Parallel Implementations of Nonnegative Tensor Factorization

Dimension Reduction (PCA, ICA, CCA, FLD,

CPSC 340: Machine Learning and Data Mining. Sparse Matrix Factorization Fall 2017

How GIS can be used for improvement of literacy and CE programmes

Matrix Factorization Techniques for Recommender Systems

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

Learning Task Grouping and Overlap in Multi-Task Learning

Non-negative Matrix Factorization: Algorithms, Extensions and Applications

Multi-Task Co-clustering via Nonnegative Matrix Factorization

Introduction to Data Mining

Large-scale Collaborative Ranking in Near-Linear Time

Latent Variable Models and Expectation Maximization

Unsupervised learning: beyond simple clustering and PCA

Convolutional Neural Networks

Star-Structured High-Order Heterogeneous Data Co-clustering based on Consistent Information Theory

Gender Systematics in Telescope Time Allocation at ESO. F. Patat, The Messenger, September 2016, No. 165, p. 2

NONNEGATIVE matrix factorization (NMF) is a

A Spatial Analysis Approach for Johor s Demographic Profile : Soft Warning of the Decline Birth Number Associated with Gender Gap in Johor

Simplex tableau CE 377K. April 2, 2015

Basic Test. To show three vectors are independent, form the system of equations

Graduate Education in Institute of Chemistry, Chinese Academy of Sciences

Sparse and Shift-Invariant Feature Extraction From Non-Negative Data

MATH 680 Fall November 27, Homework 3

Design and Optimize Sample Plans at The Dow Chemical Company

CHAPTER 3 PROBABILITY: EVENTS AND PROBABILITIES

Latent Variable Models and Expectation Maximization

Application of a GA/Bayesian Filter-Wrapper Feature Selection Method to Classification of Clinical Depression from Speech Data

Adaptive Crowdsourcing via EM with Prior

ONP-MF: An Orthogonal Nonnegative Matrix Factorization Algorithm with Application to Clustering

Geospatial Science and Technology Model Courses and Certificate:

Case Study 1: Estimating Click Probabilities. Kakade Announcements: Project Proposals: due this Friday!

he Applications of Tensor Factorization in Inference, Clustering, Graph Theory, Coding and Visual Representation

L 2,1 Norm and its Applications

Probabilistic Latent Semantic Analysis

CPSC 340: Machine Learning and Data Mining. Stochastic Gradient Fall 2017

Filling in the Blanks: Network Structure and Systemic Risk

Decision Trees & Random Forests

CHE Chemical Engineering Operations

Machine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012

ENGG5781 Matrix Analysis and Computations Lecture 10: Non-Negative Matrix Factorization and Tensor Decomposition

An Approach to Classification Based on Fuzzy Association Rules

Lecture 9: Policy Gradient II (Post lecture) 2

Processing Big Data Matrix Sketching

Algorithms for Collaborative Filtering

Non-negative matrix factorization and term structure of interest rates

CSE 494/598 Lecture-6: Latent Semantic Indexing. **Content adapted from last year s slides

Edge-Weighted Personalized PageRank: Breaking a Decade-Old Performance Barrier

In order to compare the proteins of the phylogenomic matrix, we needed a similarity

Transcription:

Simultaneous Discovery of Common and Discriminative Topics via Joint Nonnegative Matrix Factorization Hannah Kim 1, Jaegul Choo 2, Jingu Kim 3, Chandan K. Reddy 4, Haesun Park 1 Georgia Tech 1, Korea University 2, Netflix Inc. 3, Wayne State University 4 hannahkim@gatech.edu August 11 th 2015 SIGKDD

Outline Motivation Topic Modeling via NMF Experiments Quantitative Evaluation Case Study Conclusion 2

Motivation Understanding large-scale document collections is important In many real world applications, we often need to compare and contrast document sets We may want to analyze w.r.t. additional information author information (e.g., gender, age, and location) network information (e.g., co-authorship and citation) publishing information (e.g., year, publisher, and venue) 3

Example (1) E.g., Male- vs. female-authored documents Distinct topics Common topics Distinct topics 4

Example (2) E.g., Old documents vs. new documents Distinct topics Common topics Distinct topics 5

Motivation However, standard topic modeling cannot fully satisfy the needs to compare and contrast document sets Independently running standard NMF algorithms on different document sets does not clearly reveal their common and discriminative topics 6

Data mining papers published in 2000-2005 vs. 2006-2008 Running topic modeling separately? Are these common topics really common? 7

Data mining papers published in 2000-2005 vs. 2006-2008 Our joint topic modeling Our joint NMF model identifies common topics and discriminative topics 8

Nonnegative Matrix Factorization (NMF) for Topic Modeling X WHT term-document matrix(x) term-topic matrix(w), topic-document matrix(h T ) Each topic, a nonnegative vector of words (value: word's rank in the topic) Each document, a linear combination of topic vectors Algorithm Initialize W, H apple fruit orange terms (words) Update W, H to optimize min W,H 0 WHT F 2 9 2 3 2 docs terms topics 2 0 1 1 0 1 1 2 topics docs

Our Joint NMF-based Model GOAL: Given two datasets, find common topics and discriminative topics from each dataset Formula X 1 W 1 H 1 T X 2 W 2 H 2 T, where W 1,c W 2,c and W 1,d W 2,d 10 Common topics Discriminative topics

Our Batch Processing Approach Optimize min W 1,H 1,W 2,H 2 0 1 n1 X 1 W 1 H 1 T +α W 1,c W 2,c 2 + 1 F n2 X 2 W 2 H 2 T 2 F 2 F + β T W1,d W 2,d 1,1 Commonality penalty term Distinctiveness penalty term Block-coordinate descent framework: Solve the objective function for a column while fixing the other column vectors of W 1, W 2, H 1, H 2 11

Our Pseudo-deflation Approach In practice, to understand topics, people check only a small number of the most representative, thus meaningful keywords. Our pseudo-deflation approach considers only the top keywords in each topic. However, considering only the top keywords presents a challenge the objective function could change every iteration. To solve this, our pseudo-deflation approach discovers discriminative topics one by one, in a manner similar to a rankdeflation procedure. Please see our paper for detailed algorithm (Section 3.4) 12

Quantitative Evaluation Clustering Assumption: by jointly performing clustering on multiple data sets and allowing both common and discriminative topics, our method would show better clustering performance Compared methods: Standard NMF Our batch processing method (BS) Our pseudo-deflation method (PD) Multiview NMF (MV) by Liu et al. SDM 13 Regularized shared subspace NMF (RS) by Gupta et al. DMKD 13 Performance measures: accuracy, normalized mutual information, average cluster entropy, and cluster purity 13

Quantitative Evaluation 14

Case Study (1) VAST vs. InfoVis Conferences Visual Analytics Science and Technology (VAST) Information Visualization (InfoVis) 15

Case Study (2) Loan Description in Micro-finance.org is a nonprofit crowd-funding website where people in developing countries post loan requests Lenders can make a loan individually or as a team By analyzing loan description data, our method can help to characterize and promote lending activities 16

Teams Etsy.com Handmade vs. Guys holding fish homophily People tend to fund loans similar to what they like. 17

Teams Thailand vs. Greece 40% of Thai laborers work in agriculture (13% in Greece). Construction and Industrial Manufacturing are big in Greece. 18

Lender Occupation Distinct topics of loans funded by a subset of lenders with the same occupation against the rest Art vs. non-art Driver vs. non-driver Teacher vs. non-teacher People tend to fund loans related to their jobs. 19

Conclusion We presented a joint NMF-based topic model that identifies common and distinct topics between document sets We performed a detailed quantitative analysis as well as indepth case studies We plan to Build a real-time visual analytics system Extend to compare multiple subsets Thank you! Hannah Kim hannahkim@gatech.edu Apply block principal pivoting method 20