Mixed Membership Stochastic Blockmodels

Similar documents
Mixed Membership Stochastic Blockmodels

Mixed Membership Stochastic Blockmodels

Nonparametric Bayesian Matrix Factorization for Assortative Networks

Lecture 6 (supplemental): Stochastic Block Models

GLAD: Group Anomaly Detection in Social Media Analysis

Modeling heterogeneity in random graphs

GraphRNN: A Deep Generative Model for Graphs (24 Feb 2018)

Two-sample hypothesis testing for random dot product graphs

Learning latent structure in complex networks

IV. Analyse de réseaux biologiques

Collaborative topic models: motivations cont

Hierarchical Models for Social Networks

Hierarchical Mixed Membership Stochastic Blockmodels for Multiple Networks and Experimental Interventions

Lecture 13 : Variational Inference: Mean Field Approximation

Unified Modeling of User Activities on Social Networking Sites

A Random Dot Product Model for Weighted Networks arxiv: v1 [stat.ap] 8 Nov 2016

Study Notes on the Latent Dirichlet Allocation

Quilting Stochastic Kronecker Graphs to Generate Multiplicative Attribute Graphs

Sparse Stochastic Inference for Latent Dirichlet Allocation

Lifted and Constrained Sampling of Attributed Graphs with Generative Network Models

arxiv: v1 [stat.me] 30 May 2007

Probabilistic Graphical Models

Chaos, Complexity, and Inference (36-462)

Probabilistic Graphical Models

Information Recovery from Pairwise Measurements

STA414/2104. Lecture 11: Gaussian Processes. Department of Statistics

Latent Dirichlet Allocation (LDA)

Network Event Data over Time: Prediction and Latent Variable Modeling

Bayesian Nonparametrics for Speech and Signal Processing

13: Variational inference II

Chapter 16. Structured Probabilistic Models for Deep Learning

Chaos, Complexity, and Inference (36-462)

STA 4273H: Statistical Machine Learning

Variational Inference with Copula Augmentation

Mixed Membership Matrix Factorization

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Clustering bi-partite networks using collapsed latent block models

Theory and Methods for the Analysis of Social Networks

arxiv: v1 [cs.si] 7 Dec 2013

ECE521 Tutorial 11. Topic Review. ECE521 Winter Credits to Alireza Makhzani, Alex Schwing, Rich Zemel and TAs for slides. ECE521 Tutorial 11 / 4

Latent Dirichlet Allocation (LDA)

Math Review Sheet, Fall 2008

arxiv: v2 [math.st] 8 Dec 2010

CPSC 540: Machine Learning

Discrete Temporal Models of Social Networks

Machine Learning

Probabilistic Matrix Factorization

Bayesian model selection for computer model validation via mixture model estimation

19 : Bayesian Nonparametrics: The Indian Buffet Process. 1 Latent Variable Models and the Indian Buffet Process

Lecture 21: Spectral Learning for Graphical Models

Latent Dirichlet Bayesian Co-Clustering

Learning Gaussian Graphical Models with Unknown Group Sparsity

Comment on Article by Scutari

Large-Scale Feature Learning with Spike-and-Slab Sparse Coding

Variational Inference (11/04/13)

Topic 12 Overview of Estimation

GWAS V: Gaussian processes

Stochastic blockmodeling of relational event dynamics

Bayesian Nonparametric Models

arxiv: v2 [stat.ap] 24 Jul 2010

Text Mining for Economics and Finance Latent Dirichlet Allocation

Introduction to Bayesian inference

PMR Learning as Inference

Collaborative Topic Modeling for Recommending Scientific Articles

Properties of Latent Variable Network Models

Undirected Graphical Models

Introduction to Probabilistic Machine Learning

Linear Dynamical Systems

Lecture 16 Deep Neural Generative Models

Fitting Narrow Emission Lines in X-ray Spectra

Mixed Membership Matrix Factorization

Introduction to Bayesian Statistics with WinBUGS Part 4 Priors and Hierarchical Models

Recent Advances in Bayesian Inference Techniques

Robust Principal Component Analysis

Learning in Bayesian Networks

Probabilistic Graphical Models

Deep Poisson Factorization Machines: a factor analysis model for mapping behaviors in journalist ecosystem

Readings: K&F: 16.3, 16.4, Graphical Models Carlos Guestrin Carnegie Mellon University October 6 th, 2008

Tree-Based Inference for Dirichlet Process Mixtures

Document and Topic Models: plsa and LDA

Smoothed Gradients for Stochastic Variational Inference

Statistical Debugging with Latent Topic Models

Graph Detection and Estimation Theory

Topic Modelling and Latent Dirichlet Allocation

Evaluating the Variance of

Based on slides by Richard Zemel

Efficient Online Inference for Bayesian Nonparametric Relational Models

13 : Variational Inference: Loopy Belief Propagation and Mean Field

arxiv: v2 [stat.ml] 30 Dec 2015

DAG models and Markov Chain Monte Carlo methods a short overview

Introduction to Applied Bayesian Modeling. ICPSR Day 4

Bayesian nonparametric models of sparse and exchangeable random graphs

Generative Clustering, Topic Modeling, & Bayesian Inference

Deep Learning Srihari. Deep Belief Nets. Sargur N. Srihari

Latent Dirichlet Alloca/on

AN INTRODUCTION TO TOPIC MODELS

MCMC and Gibbs Sampling. Kayhan Batmanghelich

ICML Scalable Bayesian Inference on Point processes. with Gaussian Processes. Yves-Laurent Kom Samo & Stephen Roberts

Variational Bayesian Dirichlet-Multinomial Allocation for Exponential Family Mixtures

An Empirical-Bayes Score for Discrete Bayesian Networks

Transcription:

Mixed Membership Stochastic Blockmodels (2008) Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg and Eric P. Xing Herrissa Lamothe Princeton University Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 1 / 28

Outline 1 Overview 2 The MMSB Model Mixed Membership Model Estimation 3 Application of Mixed Membership Model Empirical and Synthetic Data Drawbacks to the MMSB Model Flexibility Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 2 / 28

Motivating question Are there certain rules dictating how individual vertices (or nodes) make decisions to connect/not to connect to other vertices? Let s suppose we can draw a network graph, G from a generative model, so that G comes from a probability distribution Pr(G θ), governed by parameter θ so that if θ is partly known, it can act as a constraint in generating synthetic graphs (similar to G) if G is partly known, we can use it to infer a θ that make G likely Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 3 / 28

How do we obtain θ? Pr(G θ) = ij Pr(A ij θ) Vertex-level attributes: too chaotic (over-fit) Global-network attributes: too general (over-generalize) Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 4 / 28

Introducing structure Differentiates data from noise Captures relevant patterns Describing patterns and predicting them Vertices Communities Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 5 / 28

Let s Rephrase the Motivating Question How do groups of vertices make decisions to connect/not to connect to other groups? Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 6 / 28

The stochastic block model Class of models of which Mixed Membership Stochastic Blockmodel is a variant. Aaron Clauset (UColorado at Boulder, Computer Science) has amazing slides... Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 7 / 28

the stochastic block model each vertex i has type z i 2 {1,...,k} ( k vertex types or groups) stochastic block matrix M of group-level connection probabilities probability that i, j are connected = M zi,z j community = vertices with same pattern of inter-community connections inferred M 9 D

the stochastic block model assortative edges within groups disassortative edges between groups ordered linear group hierarchy core-periphery dense core, sparse periphery

the stochastic block model the most general SBM Pr(A z, ) = Y i,j f A ij R(zi,z j ) A ij : value of adjacency R : partition of adjacencies f : probability function a, : pattern for a-type adjacencies Binomial = simple graphs Poisson = multi-graphs Normal = weighted graphs etc. 11 12 21 22 13 23 14 24 31 32 33 34 41 42 43 44

Interactions across blocks Classes of SBMs are looking at interactions across blocks They are not directly looking for patches of connectedness among nodes. Assume that individual nodes behavior can be explained entirely by group membership. Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 8 / 28

Sociological contributions If this way of operationalizing the problem (in terms of group membership) seems familiar, that is because sociologists have made many contributions to this line of research. Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 9 / 28

Implications for interactions within blocks Given this framework, what are we implying about connections between vertices i and j if they belong to the same group k? Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 10 / 28

Outline 1 Overview 2 The MMSB Model Mixed Membership Model Estimation 3 Application of Mixed Membership Model Empirical and Synthetic Data Drawbacks to the MMSB Model Flexibility Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 11 / 28

Mixed Membership Mixed membership stochastic block model (MMSB) (f = Bernoulli) Similar to SBM, but with an extra layer of parameters to estimate. Key assumptions remain: Pr(i j) = M zi,zj M = Stochastic Block Matrix But, z i and z j must be estimated for each dyadic interaction between all i and j vertices, based on a latent mixed membership vector for each i. Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 12 / 28

Mixed Membership each vertex i has a mixed membership vector θ i Dirichlet(α) vertex i takes on a single group membership with probability θ i in the context of a directed dyadic interaction with vertex j For each pair of vertices (i, j) [1, N]x[1, N] (adjacency matrix), Sample group membership of i and j independently Sample group z i j Multinomial(θ i, 1) Sample group z i j Multinomial(θ i, 1) Note: This model specification can be adapted to undirected interactions easily, z i j Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 13 / 28

Mixed Membership Key Innovations: nodes belong to more than one group nodes belong to groups with different strengths of membership nodes take on a specific group membership for the duration of an interaction include sparsity parameter, control model s sensitivity to zeros in adjacency matrix due to noise. Final sampling of A ij Bernoulli (ρ z i j M z i j + (1 ρ) δ 0 ) Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 14 / 28

Mixed Membership joint probability distribution: p(a, θ, z i, z j α, M) = Where, N N p(θ i α) p(z i j θ i )p(z i j θ j )p(a ij z i j, z i j, M) i=1 j=1 A is the observed adjacency matrix M is the block matrix θ i and θ j are the mixed membership vectors for i and j z i and z j are the group membership indicators for i and j during their interaction The only input to this model is the number of groups. Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 15 / 28

Outline 1 Overview 2 The MMSB Model Mixed Membership Model Estimation 3 Application of Mixed Membership Model Empirical and Synthetic Data Drawbacks to the MMSB Model Flexibility Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 16 / 28

MMSB Estimation We won t focus on this too much... Only to note that the actual marginal probability (likelihood) of p(a α, M) is not tractable to compute (i.e. we cannot integrate out z and α). Airoldi et al. carry out an approximate inference and parameter estimation. In order to compute the posterior degrees of membership for all i given hyperparameters (θ and α): p(θ A, α, M) = p(θ, A α, M) p(a α, M) They use variational methods: find a lower bound of the likelihood and approximate posterior distributions for each objects membership vector. (Airoldi et al, 2015 p. 7) Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 17 / 28

MMSB Estimation What are the implications of this estimation strategy for the model? Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 18 / 28

Outline 1 Overview 2 The MMSB Model Mixed Membership Model Estimation 3 Application of Mixed Membership Model Empirical and Synthetic Data Drawbacks to the MMSB Model Flexibility Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 19 / 28

Sampson monk factions (a) MMSB with LDA (b) Airoldi et al.: Variational Methods Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 20 / 28

Synthetic data How to create good synthetic data? various levels of difficulty of detection? (c in c out ) specific block patterns? always the problem of linking recovered partitions to actual theorized groups Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 21 / 28

Outline 1 Overview 2 The MMSB Model Mixed Membership Model Estimation 3 Application of Mixed Membership Model Empirical and Synthetic Data Drawbacks to the MMSB Model Flexibility Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 22 / 28

Potential Issues 1 Estimation procedure 2 Selecting K number of groups Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 23 / 28

Selecting K Two Goals: Prediction and Interpretation 1 Prediction: We want to identify the number of group affiliations, K, most predictive of observed patterns of interaction, G. 2 Interpretation (Hypothesis-Driven): We want to determine how predictive a specific set of hypothesized group affiliations, K, is of observed patterns of interaction, G. Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 24 / 28

Selecting K Our goal affects how we deal with two key issues in community detection: 1 How do we determine the right number of groups in analyzing the interactions in G? prediction: Find the most predictive K (i.e. BIC, or cross-validation). interpretation:? 2 How do we know that the way our algorithm ( heuristic ) partitioned the data is always the same for K groups? prediction: permutation of components of θ i to interpret [E θ i (k) A], if we know components of each functional group. interpretation:? Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 25 / 28

Outline 1 Overview 2 The MMSB Model Mixed Membership Model Estimation 3 Application of Mixed Membership Model Empirical and Synthetic Data Drawbacks to the MMSB Model Flexibility Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 26 / 28

There are also times when we, as researchers, have prior (if partial) information that we want to test and inject in our model... partial information on K partial information on M (block, or mixing matrix) partial information on θ (mixed membership vectors partial information on decision rule for selecting z i j or z i j. (i.e. we might want to break the independence assumption). or any combination of the above Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 27 / 28

Questions to the room Are there tools available to allow researchers to take advantage of this potential flexibility in the MMSB and SBMs in general? What types of research question can we address using the MMSB model? What types of research question can we address if we can take full advantage of the model s potential flexibility? and as always, there are questions about the assumptions made by this model. Under what circumstances (for what research questions) are we willing to make them? Under what circumstances would these assumptions not hold? Herrissa Lamothe (Princeton University) Mixed Membership Stochastic Blockmodels 28 / 28