Mixed Membership Stochastic Blockmodels

Similar documents
Mixed Membership Stochastic Blockmodels

Mixed Membership Stochastic Blockmodels

Two-sample hypothesis testing for random dot product graphs

arxiv: v1 [cs.si] 7 Dec 2013

Applying Latent Dirichlet Allocation to Group Discovery in Large Graphs

Nonparametric Bayesian Matrix Factorization for Assortative Networks

GLAD: Group Anomaly Detection in Social Media Analysis

Network Event Data over Time: Prediction and Latent Variable Modeling

Hierarchical Models for Social Networks

IV. Analyse de réseaux biologiques

arxiv: v2 [stat.ap] 24 Jul 2010

Theory and Methods for the Analysis of Social Networks

Statistical Mechanics of Semi Supervised Clustering in Sparse Graphs

Mixed Membership Matrix Factorization

GraphRNN: A Deep Generative Model for Graphs (24 Feb 2018)

Random function priors for exchangeable arrays with applications to graphs and relational data

arxiv: v1 [stat.ml] 29 Jul 2012

A Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities

Modeling heterogeneity in random graphs

arxiv: v1 [stat.me] 12 May 2017

Mixed Membership Matrix Factorization

Nonparametric Latent Feature Models for Link Prediction

arxiv: v1 [stat.me] 30 May 2007

Modeling homophily and stochastic equivalence in symmetric relational data

Modeling Organizational Positions Chapter 2

Confidence Sets for Network Structure

Collaborative topic models: motivations cont

Hierarchical Models, Nested Models and Completely Random Measures

An introduction to clustering techniques

Nonparametric Mixed Membership Models

Characterization and inference of weighted graph topologies from observations of diffused signals

A Latent Mixed Membership Model for Relational Data

Outline Inverse of a Relation Properties of Relations. Relations. Alice E. Fischer. April, 2018

Likelihood Inference for Large Scale Stochastic Blockmodels with Covariates based on a Divide-and-Conquer Parallelizable Algorithm with Communication

Hierarchical Mixed Membership Stochastic Blockmodels for Multiple Networks and Experimental Interventions

A Tensor Spectral Approach to Learning Mixed Membership Community Models

Intelligent Systems (AI-2)

Dynamic Probabilistic Models for Latent Feature Propagation in Social Networks

Lecture 6: Gaussian Mixture Models (GMM)

13: Variational inference II

Learning latent structure in complex networks

Bayesian Learning (II)

Haupthseminar: Machine Learning. Chinese Restaurant Process, Indian Buffet Process

Topic Models. Brandon Malone. February 20, Latent Dirichlet Allocation Success Stories Wrap-up

Statistical Model for Soical Network

Period: Date: Lesson 3B: Properties of Dilations and Equations of lines

Topic Modelling and Latent Dirichlet Allocation

Lecture 11: Unsupervised Machine Learning

Machine Learning for Data Science (CS4786) Lecture 12

Lecture: Local Spectral Methods (1 of 4)

Review and Motivation

Stochastic blockmodeling of relational event dynamics

Lecture 6 (supplemental): Stochastic Block Models

Dynamic Infinite Relational Model for Time-varying Relational Data Analysis

Bayesian decision making

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

Stochastic calculus for summable processes 1

Comparing Means from Two-Sample

Lecture 3a: Dirichlet processes

Non-Parametric Bayes

Unified Modeling of User Activities on Social Networking Sites

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria

Advanced Machine Learning

Probabilistic Graphical Models

STAT 518 Intro Student Presentation

arxiv: v1 [stat.me] 6 Nov 2014

Quilting Stochastic Kronecker Graphs to Generate Multiplicative Attribute Graphs

Introduction to statistical analysis of Social Networks

Latent Dirichlet Allocation (LDA)

Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach

Functions. Copyright Cengage Learning. All rights reserved.

Lecture 12: Link Analysis for Web Retrieval

Journal of the American Statistical Association, Vol. 96, No (Sep., 2001), pp

Machine Learning. Gaussian Mixture Models. Zhiyao Duan & Bryan Pardo, Machine Learning: EECS 349 Fall

Variational Bayesian Dirichlet-Multinomial Allocation for Exponential Family Mixtures

STATS 306B: Unsupervised Learning Spring Lecture 2 April 2

Mixtures of Gaussians. Sargur Srihari

An Infinite Product of Sparse Chinese Restaurant Processes

Machine Learning for Data Science (CS4786) Lecture 24

The social sciences have investigated the structure of small

Applications of Convex Optimization on the Graph Isomorphism problem

Mixed-membership Models (and an introduction to variational inference)

arxiv: v1 [math.st] 26 Jan 2018

Variational inference for the Stochastic Block-Model

Bayesian Hidden Markov Models and Extensions

STA 4273H: Statistical Machine Learning

MAD-Bayes: MAP-based Asymptotic Derivations from Bayes

Computer Vision Group Prof. Daniel Cremers. 6. Mixture Models and Expectation-Maximization

RaRE: Social Rank Regulated Large-scale Network Embedding

Matrix estimation by Universal Singular Value Thresholding

arxiv: v2 [stat.ml] 30 Dec 2015

Bayesian non parametric inference of discrete valued networks

Bayesian Nonparametrics: Dirichlet Process

Efficient Online Inference for Bayesian Nonparametric Relational Models

A Random Dot Product Model for Weighted Networks arxiv: v1 [stat.ap] 8 Nov 2016

Spectral thresholds in the bipartite stochastic block model

Machine Learning Summer School, Austin, TX January 08, 2015

Bayesian Networks. Machine Learning, Fall Slides based on material from the Russell and Norvig AI Book, Ch. 14

arxiv: v2 [cs.lg] 18 Jun 2018

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

Transcription:

Mixed Membership Stochastic Blockmodels Journal of Machine Learning Research, 2008 by E.M. Airoldi, D.M. Blei, S.E. Fienberg, E.P. Xing as interpreted by Ted Westling STAT 572 Intro Talk April 22, 2014 Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 1

Overview 1. Notation and motivation 2. Previous blockmodels 3. The Mixed Membership Stochastic Blockmodel 4. Conclusion and next steps Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 2

Overview 1. Notation and motivation 2. Previous blockmodels 3. The Mixed Membership Stochastic Blockmodel 4. Conclusion and next steps Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 3

Network theory: notation Individuals p, q {1,..., N}. We observe relations/interactions R(p, q) on pairs of individuals. Here we assume R(p, q) {0, 1}, R(p, p) = 0, but do not assume R(p, q) = R(q, p) (we deal with directed networks). Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 4

Network theory: data representations p q R(p,q) 1 2 1 2 3 1 3 2 1 1 2 0 1 0 0 0 1 0 1 0 1 2 3 1 2 3 Table 3 Graph Adjacency matrix i, j element is R(i, j) Adjacency matrix, black=1, white=0 Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 5

The Problem: Scientific Motivation We believe the relations are a function of unobserved groupings among the individuals. We want to recover the groups so we can a) predict new relations or b) interpret the existing network structure. Example: Monk network. Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 6

The Problem: Pictures Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 7

The Problem: Pictures Figure: Two visualizations of the same binary adjacency matrix. Each filled-in square represents a directed edge. Left: ordered randomly. Right: ordered by group membership. Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 7

Overview 1. Notation and motivation 2. Previous blockmodels 3. The Mixed Membership Stochastic Blockmodel 4. Conclusion and next steps Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 8

Brief blockmodel history 1975: CONCOR developed by Harrison White and colleagues 1983: Holland, Laskey & Leinhardt introduce stochastic blockmodel for blocks known a priori. 1987: Wasserman & Anderson extend to a posteriori estimation. 2004: Kemp, Griffiths & Tenenbaum allow unknown and unlimited number of blocks in the Infinite Relational Model. Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 9

Infinite Relational Model Observe binary relations R(p, q) between nodes p, q {1,..., N}. Each node p is a member of exactly one block of K total blocks, K N unknown. Let z p be an indicator vector of block membership for node p, i.e. z p = (0, 1, 0). B is a K K matrix of block relationships. If p is in block g and q is in block h then the probability of observing an interaction from node p to node q is B gh. R(p, q) Bernoulli(z T p Bz q ). For example, if p is in block 3 and q is in block 2 then P(R(p, q) = 1) = B 32. Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 10

Block structure.8.3 0 0.1.2.9 0 0.5.6 0.1 0.4.8 0.8 0.2 0.1 1 2 0.3 0.9 0.5 0.1 3 4 0.4 0.6 0.8 Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 11

Overview 1. Notation and motivation 2. Previous blockmodels 3. The Mixed Membership Stochastic Blockmodel 4. Conclusion and next steps Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 12

The Mixed Membership Stochastic Blockmodel Previous models assume each node is assumed to belong to exactly one latent block - e.g. z p = (0, 1, 0, 0). Instead, in the MMB we assume each node has a distribution π p over the latent blocks. For each interaction from p to q, both p and q draw a particular block to be a part of for the interaction: z p q Discrete(π p ), z p q Discrete(π q ). Then R(p, q) Bernoulli(z T p qbz p q ). K chosen by BIC. Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 13

Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 14

Overview 1. Notation and motivation 2. Previous blockmodels 3. The Mixed Membership Stochastic Blockmodel 4. Conclusion and next steps Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 15

Conclusion and next steps Blockmodels allow clustering nodes from observed network data. MMB extends blockmodels to let nodes be in different groups to different extents, but commit to one group during any given interaction. Next steps: estimation... Ted Westling Mixed Membership Stochastic Blockmodels STAT 572 Intro Talk 16