GLAD: Group Anomaly Detection in Social Media Analysis

Similar documents
Mixed Membership Stochastic Blockmodels

Non-Parametric Bayes

Unified Modeling of User Activities on Social Networking Sites

Mixed Membership Stochastic Blockmodels

Latent Dirichlet Allocation (LDA)

Mixed Membership Matrix Factorization

Applying hlda to Practical Topic Modeling

Mixed Membership Stochastic Blockmodels

Mixed Membership Matrix Factorization

Latent Dirichlet Allocation (LDA)

Latent Dirichlet Allocation Introduction/Overview

Applying Latent Dirichlet Allocation to Group Discovery in Large Graphs

Pachinko Allocation: DAG-Structured Mixture Models of Topic Correlations

arxiv: v1 [cs.si] 7 Dec 2013

Information retrieval LSI, plsi and LDA. Jian-Yun Nie

Lecture 13 : Variational Inference: Mean Field Approximation

COMS 4721: Machine Learning for Data Science Lecture 18, 4/4/2017

PROBABILISTIC LATENT SEMANTIC ANALYSIS

CS Lecture 18. Topic Models and LDA

Dimension Reduction (PCA, ICA, CCA, FLD,

Incorporating Social Context and Domain Knowledge for Entity Recognition

Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach

One-Class Support Measure Machines for Group Anomaly Detection

MEI: Mutual Enhanced Infinite Community-Topic Model for Analyzing Text-augmented Social Networks

Smoothed Gradients for Stochastic Variational Inference

Understanding Comments Submitted to FCC on Net Neutrality. Kevin (Junhui) Mao, Jing Xia, Dennis (Woncheol) Jeong December 12, 2014

Hierarchical Models, Nested Models and Completely Random Measures

Variational Inference with Copula Augmentation

arxiv: v1 [cs.cv] 13 Apr 2018

Kernel Density Topic Models: Visual Topics Without Visual Words

Introduction To Machine Learning

Topic Modeling Using Latent Dirichlet Allocation (LDA)

Conquering the Complexity of Time: Machine Learning for Big Time Series Data

A Unified Posterior Regularized Topic Model with Maximum Margin for Learning-to-Rank

Sparse Stochastic Inference for Latent Dirichlet Allocation

Learning latent structure in complex networks

CS145: INTRODUCTION TO DATA MINING

Content-based Recommendation

Recent Advances in Bayesian Inference Techniques

Topic Models. Brandon Malone. February 20, Latent Dirichlet Allocation Success Stories Wrap-up

RECSM Summer School: Facebook + Topic Models. github.com/pablobarbera/big-data-upf

Fast Inference and Learning for Modeling Documents with a Deep Boltzmann Machine

Collaborative topic models: motivations cont

Lecture 21: Spectral Learning for Graphical Models

Latent Dirichlet Allocation Based Multi-Document Summarization

Online Bayesian Passive-Agressive Learning

Network Event Data over Time: Prediction and Latent Variable Modeling

Nonparametric Bayes Pachinko Allocation

Distributed ML for DOSNs: giving power back to users

Readings: K&F: 16.3, 16.4, Graphical Models Carlos Guestrin Carnegie Mellon University October 6 th, 2008

Bayesian Nonparametrics for Speech and Signal Processing

Topic Models and Applications to Short Documents

Evaluating Electricity Theft Detectors in Smart Grid Networks. Group 5:

Topic Modeling: Beyond Bag-of-Words

Streaming Variational Bayes

Advanced Machine Learning

Mixed-membership Models (and an introduction to variational inference)

CS570 Data Mining. Anomaly Detection. Li Xiong. Slide credits: Tan, Steinbach, Kumar Jiawei Han and Micheline Kamber.

Support Measure Data Description for Group Anomaly Detection

You Lu, Jeff Lund, and Jordan Boyd-Graber. Why ADAGRAD Fails for Online Topic Modeling. Empirical Methods in Natural Language Processing, 2017.

Graphical Models for Query-driven Analysis of Multimodal Data

20: Gaussian Processes

An Empirical-Bayes Score for Discrete Bayesian Networks

RETRIEVAL MODELS. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

Probabilistic Graphical Models. Guest Lecture by Narges Razavian Machine Learning Class April

Bayesian nonparametric models for bipartite graphs

Relative Performance Guarantees for Approximate Inference in Latent Dirichlet Allocation

Topic Models. Advanced Machine Learning for NLP Jordan Boyd-Graber OVERVIEW. Advanced Machine Learning for NLP Boyd-Graber Topic Models 1 of 1

Scaling Neighbourhood Methods

Variational Bayesian Dirichlet-Multinomial Allocation for Exponential Family Mixtures

Structure Learning: the good, the bad, the ugly

Hybrid Models for Text and Graphs. 10/23/2012 Analysis of Social Media

Data Mining Techniques

Topic Modelling and Latent Dirichlet Allocation

Welcome to CAMCOS Reports Day Fall 2011

CSCI-567: Machine Learning (Spring 2019)

Probabilistic Latent Semantic Analysis

Nonparametric Bayesian Matrix Factorization for Assortative Networks

Generative Clustering, Topic Modeling, & Bayesian Inference

Text Mining for Economics and Finance Latent Dirichlet Allocation

Machine Learning Summer School, Austin, TX January 08, 2015

Bayesian non-parametric model to longitudinally predict churn

The Infinite PCFG using Hierarchical Dirichlet Processes

Lecture 3a: Dirichlet processes

Classification, Linear Models, Naïve Bayes

PROBABILISTIC PROGRAMMING: BAYESIAN MODELLING MADE EASY

Spatial Bayesian Nonparametrics for Natural Image Segmentation

Discovering Latent Patterns with Hierarchical Bayesian Mixed-Membership Models

Latent Topic Models for Hypertext

Applying LDA topic model to a corpus of Italian Supreme Court decisions

arxiv: v1 [stat.me] 30 May 2007

Fast Likelihood-Free Inference via Bayesian Optimization

Gaussian Processes for Big Data. James Hensman

Click Prediction and Preference Ranking of RSS Feeds

Recommendation Systems

Collaborative User Clustering for Short Text Streams

Bayesian Machine Learning

Latent Dirichlet Allocation and Singular Value Decomposition based Multi-Document Summarization

13: Variational inference II

Study Notes on the Latent Dirichlet Allocation

Transcription:

GLAD: Group Anomaly Detection in Social Media Analysis Poster #: 1150 Rose Yu, Xinran He and Yan Liu University of Southern California

Group Anomaly Detection Anomalous phenomenon in social media data may not only appear as individual points, but also as groups. Group Review Spamming Organized Viral Campaign Massive Cyber Attack We develop a hierarchical Bayes model to automatically discover social media groups and detect group anomalies simultaneously.

Challenges Detecting group anomalies require us to exploit the structure of the social media groups as well as the attributes of the individual points. Point-wise features data and pairwise relational data coexist and are mutually dependent. Collective anomalous activities might appear normal at the individual level. [Chandola2007] People constantly switch groups and we can hardly know groups beforehand. [An Example of Collective Anomaly]

Group Anomaly Definition Input: Pairwise communication network; Pointwise node features; Output: List of groups ranked by anomaly score Model a group as a mixture of roles Same roles,different role mixture rate Role? Definition: Group anomaly has a role mixture rate pattern that does not conform to the majority of other groups. Existing Approaches: Mixture Genre Model (MGM) [xiong et al 2011a] Flexible Genre Model (FGM) [xiong et al 2011b] Support Measure Machine (SMM) [muandet et al 2013] Two- Stage Approaches!

Group Latent Anomaly Detection (GLAD) θ M α π G R B Y X, β π p ~ Dirichlet(α) G p ~ Categorical(π p ) R p ~ Categorical(θ Gp ) Y p ~ Bernouli(B Gp,G q ) X p ~ Multinomial(β Rp ) K

Dynamic extension of GLAD (d-glad) p G (1) p Y p,q (1) B G (2) p Y p,q (2) B G (t) p Y (t) p,q B R p (1) X p (1) K R p (2) X p (2) K R (t) p X (t) p K 0 (1) M (2) M (t) M θ t m ~ Gaussian(θ m t 1,σ )

Procedure Use BIC to decide # of groups and # of roles Learn GLAD/ d-glad to infer role mixture rates Rank groups with respect to the anomaly score Perform significant test and raise alarms GLAD : the expected likelihood of role distribution p G AnomalyScore GLAD ~ E q [ p(r p θ)] d-glad : the change of role mixture rate over time AnomalyScore d GLAD ~ θ m t 1 θ m t 2

Synthetic Experiments 500 nodes network, 2 roles, different number of groups ormal group mixture rate [0.9, 0.1], anomalous group mixture rate [0.1,0.9], 20% injected group anomaly

Detect ``anomalous `` research communities Pointwise: Bag of words of paper abstracts Pairwise: Common authorship 28,702 authors, 104,962 links, 11,771 vocabulary size, 4 topics. Injected 20% group anomalies from other venues.

Detect Party Affiliation Changes Pointwise: Senator attributes Pairwise: Common votes 24 months voting records of 100 senators

Conclusion Formulate the problem of group anomaly detection social media analysis for both static and dynamic settings Develop a unified hierarchical Bayes model GLAD to infer the groups and detect group anomalies simultaneously Experiments on both synthetic, academic publications and senator voting datasets show benefits over two-stage approaches. Future Work on-parametric Bayes model to automatically decide # of groups and # of roles. Better detection evaluation procedures and metrics other than anomaly injection.

References [1] V. Chandola, A. Banerjee, and V. Kumar. Outlier detection: A survey. 2007. [2] Liang Xiong, Barnab ás P oczos, Jeff G Schneider, Andrew Connolly, and Jake Vanderplas. Hierarchical probabilistic models for group anomaly detection. In International Conference on Artificial Intelligence and Statistics, pages 789 797, 2011. [3] Liang Xiong, Barnab ás P oczos, and Jeff G Schneider. Group anomaly detection using flexible genre models. In Advances in eural Information Processing Systems, pages 1071 1079, 2011. [4] Krikamol Muandet and Bernhard Sch ölkopf. One-class support measure machines for group anomaly detection. arxiv preprint arxiv:1303.0309, 2013. [5] David M Blei, Andrew Y g, and Michael I Jordan. Latent dirichlet allocation. the Journal of machine Learning research, 3:993 1022, 2003. [6] Edoardo M Airoldi, David M Blei, Stephen E Fienberg, and Eric P Xing. Mixed membership stochastic blockmodels. Journal of Machine Learning Research, 9(1981-2014):3, 2008. Thank You!