Reconstruction in the Sparse Labeled Stochastic Block Model

Similar documents
Reconstruction in the Generalized Stochastic Block Model

Statistical and Computational Phase Transitions in Planted Models

Community detection in stochastic block models via spectral methods

Spectral thresholds in the bipartite stochastic block model

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria

Spectral Partitiong in a Stochastic Block Model

Belief Propagation, Robust Reconstruction and Optimal Recovery of Block Models

How Robust are Thresholds for Community Detection?

Jointly Clustering Rows and Columns of Binary Matrices: Algorithms and Trade-offs

Clustering from Sparse Pairwise Measurements

Spectral thresholds in the bipartite stochastic block model

Phase transitions in discrete structures

Random Graph Coloring

Computational Lower Bounds for Community Detection on Random Graphs

The non-backtracking operator

COMMUNITY DETECTION IN SPARSE NETWORKS VIA GROTHENDIECK S INEQUALITY

U.C. Berkeley CS294: Beyond Worst-Case Analysis Handout 3 Luca Trevisan August 31, 2017

Information Recovery from Pairwise Measurements

Robust Principal Component Analysis

Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices

The condensation phase transition in random graph coloring

PU Learning for Matrix Completion

Community Detection and Stochastic Block Models: Recent Developments

Compressed Sensing and Linear Codes over Real Numbers

Ramanujan Graphs of Every Size

arxiv: v15 [math.oc] 19 Oct 2016

A Spectral Approach to Analysing Belief Propagation for 3-Colouring

U.C. Berkeley CS294: Beyond Worst-Case Analysis Handout 8 Luca Trevisan September 19, 2017

8.1 Concentration inequality for Gaussian random matrix (cont d)

Chasing the k-sat Threshold

SPARSE RANDOM GRAPHS: REGULARIZATION AND CONCENTRATION OF THE LAPLACIAN

Distributed Optimization over Networks Gossip-Based Algorithms

Matrix completion: Fundamental limits and efficient algorithms. Sewoong Oh Stanford University

How robust are reconstruction thresholds for community detection?

Online Social Networks and Media. Link Analysis and Web Search

Lecture 6: September 22

ELE 538B: Mathematics of High-Dimensional Data. Spectral methods. Yuxin Chen Princeton University, Fall 2018

Workshop on Discrete Harmonic Analysis Newton Institute, March 2011

Bandit View on Continuous Stochastic Optimization

Link Analysis Ranking

On Bit Error Rate Performance of Polar Codes in Finite Regime

Spectral Redemption: Clustering Sparse Networks

Hard-Core Model on Random Graphs

High-dimensional graphical model selection: Practical and information-theoretic limits

Part 1: Les rendez-vous symétriques

Independence and chromatic number (and random k-sat): Sparse Case. Dimitris Achlioptas Microsoft

CS Communication Complexity: Applications and New Directions

Community Detection on Euclidean Random Graphs

Predicting Phase Transitions in Hypergraph q-coloring with the Cavity Method

On the Spectra of General Random Graphs

Models of collective inference

New algorithms for Disjoint Paths and Routing Problems

Graph-based codes for flash memory

Online Social Networks and Media. Link Analysis and Web Search

A Note on Interference in Random Networks

Belief Propagation for Traffic forecasting

LIMITATION OF LEARNING RANKINGS FROM PARTIAL INFORMATION. By Srikanth Jagabathula Devavrat Shah

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

CS249: ADVANCED DATA MINING

Binary Compressive Sensing via Analog. Fountain Coding

Consistency Thresholds for the Planted Bisection Model

Lecture 10: Dimension Reduction Techniques

Ranking from Pairwise Comparisons

High-dimensional graphical model selection: Practical and information-theoretic limits

Privacy and Fault-Tolerance in Distributed Optimization. Nitin Vaidya University of Illinois at Urbana-Champaign

Dissertation Defense

Random Sampling of Bandlimited Signals on Graphs

Local Differential Privacy

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

FEL3330 Networked and Multi-Agent Control Systems. Lecture 11: Distributed Estimation

Lecture 8: February 8

Spectra of Large Random Stochastic Matrices & Relaxation in Complex Systems

ISIT Tutorial Information theory and machine learning Part II

Budget-Optimal Task Allocation for Reliable Crowdsourcing Systems

Approximate Message Passing

Approximation & Complexity

Manifold Coarse Graining for Online Semi-supervised Learning

Diffuse interface methods on graphs: Data clustering and Gamma-limits

Additive distances. w(e), where P ij is the path in T from i to j. Then the matrix [D ij ] is said to be additive.

Faster quantum algorithm for evaluating game trees

Practical Key Recovery for Discrete-Logarithm Based Authentication Schemes from Random Nonce Bits

Message passing and approximate message passing

arxiv: v1 [math.st] 26 Jan 2018

Capacitated Metric Labeling

Upper Bounds on Expected Hitting Times in Mostly-Covered Delay-Tolerant Networks

Parameterized Expectations Algorithm

5. Density evolution. Density evolution 5-1

Algorithms Reading Group Notes: Provable Bounds for Learning Deep Representations

ACO Comprehensive Exam October 14 and 15, 2013

Phylogenetic Reconstruction with Insertions and Deletions

PCPs and Inapproximability Gap-producing and Gap-Preserving Reductions. My T. Thai

The Minesweeper game: Percolation and Complexity

Introduction to graphical models: Lecture III

Graph Sparsifiers. Smaller graph that (approximately) preserves the values of some set of graph parameters. Graph Sparsification

Applications of the Lopsided Lovász Local Lemma Regarding Hypergraphs

Lecture 5: The Principle of Deferred Decisions. Chernoff Bounds

CS 450 Numerical Analysis. Chapter 5: Nonlinear Equations

From trees to seeds:

How to Play Unique Games against a Semi-Random Adversary

A class of non-convex polytopes that admit no orthonormal basis of exponentials

Transcription:

Reconstruction in the Sparse Labeled Stochastic Block Model Marc Lelarge 1 Laurent Massoulié 2 Jiaming Xu 3 1 INRIA-ENS 2 INRIA-Microsoft Research Joint Centre 3 University of Illinois, Urbana-Champaign September 10, 2013

Motivation: community detection Identify underlying communities based on the pairwise interactions represented by graph 4 5 5 3 3 5 4 5 Network of political webblogs [Adamic-Glance 05] Recommendation system

A popular model: stochastic block model Graph generated from stochastic block model with n = 5000, r = 10, p = 0.999, q = 0.001. Ref. https://projects.skewed.de/graph-tool.

Model we use: sparse labeled stochastic block model Two important aspects received little attention: 1 Sparse graph: Limited amount of interaction 2 Interaction can be of multiple types: ratings can be 1 to 5 We use sparse labeled stochastic block model: 1 Nodes have bounded average degree: edge probabilities p, q = O(1/n). 2 Edges carry label: label represents interaction type

Model we use: sparse labeled stochastic block model Two important aspects received little attention: 1 Sparse graph: Limited amount of interaction 2 Interaction can be of multiple types: ratings can be 1 to 5 We use sparse labeled stochastic block model: 1 Nodes have bounded average degree: edge probabilities p, q = O(1/n). 2 Edges carry label: label represents interaction type

Sparse labeled stochastic block model: two symmetric communities A random graph model on n nodes with two constants, a, b 0 and two discrete prob. distributions, µ, ν. n = 10

Sparse labeled stochastic block model: two symmetric communities A random graph model on n nodes with two constants, a, b 0 and two discrete prob. distributions, µ, ν. +1 1 Assign each node to community +1 or 1 uniformly at random. n = 10

Sparse labeled stochastic block model: two symmetric communities A random graph model on n nodes with two constants, a, b 0 and two discrete prob. distributions, µ, ν. +1 1 Independently for each pair of nodes: Draw an edge w.p. a/n if they are in the same community; w.p. b/n otherwise. a/n b/n a/n a = 4,b = 2

Sparse labeled stochastic block model: two symmetric communities A random graph model on n nodes with two constants, a, b 0 and two discrete prob. distributions, µ, ν. Independently for each edge: Label the edge w.d. µ if the two endpoints are in the same community; w.d. ν otherwise. +1 1 a/n b/n a/n µ(r) = 0.6, µ(b) = 0.4 ν(r) = 0.4, ν(b) = 0.6

Correlated reconstruction of community assignment Isolated nodes render the exact reconstruction impossible. Focus on correlated reconstruction, i.e., agrees with the true community assignment in more than half of all nodes.

Correlated reconstruction of community assignment Isolated nodes render the exact reconstruction impossible. Focus on correlated reconstruction, i.e., agrees with the true community assignment in more than half of all nodes.

Phase transition for correlated reconstruction [Heimlicher et al. 12] conjectures: Impossible 0 1 BP τ τ = a + b 2 l L aµ(l) + bν(l) a + b ( ) aµ(l) bν(l) 2 aµ(l) + bν(l) It generalizes the conjecture in stochastic block model [Decelle et al. 11] [Mossel et al. 12].

Phase transition for correlated reconstruction [Heimlicher et al. 12] conjectures: Impossible 0 1 BP τ τ = a + b 2 l L aµ(l) + bν(l) a + b ( ) aµ(l) bν(l) 2 aµ(l) + bν(l) It generalizes the conjecture in stochastic block model [Decelle et al. 11] [Mossel et al. 12].

Our contribution 0 1 32 log 2 64 C a + b Impossible, MinBisection, Spectral*, Spectral τ (A) Prove the impossibility result: correlated reconstruction is impossible if τ < 1. (B) Prove the achievability result: correlated reconstruction is possible if τ > 32 log 2 by MinBisection algorithm. (C) Propose a polynomial-time Spectral algorithm and show it achieves correlated reconstruction if τ is large enough.

Our contribution 0 1 32 log 2 64 C a + b Impossible, MinBisection, Spectral*, Spectral τ (A) Prove the impossibility result: correlated reconstruction is impossible if τ < 1. (B) Prove the achievability result: correlated reconstruction is possible if τ > 32 log 2 by MinBisection algorithm. (C) Propose a polynomial-time Spectral algorithm and show it achieves correlated reconstruction if τ is large enough.

Our contribution 0 1 32 log 2 64 C a + b Impossible, MinBisection, Spectral*, Spectral τ (A) Prove the impossibility result: correlated reconstruction is impossible if τ < 1. (B) Prove the achievability result: correlated reconstruction is possible if τ > 32 log 2 by MinBisection algorithm. (C) Propose a polynomial-time Spectral algorithm and show it achieves correlated reconstruction if τ is large enough.

(A) Impossibility for correlated reconstruction if τ < 1 First step of proof: correlated reconstruction = long range correlation: The community memberships of any two nodes randomly chosen are asymptotically correlated conditional on the labeled graph [Mossel et al. 12].

(A) Impossibility for correlated reconstruction if τ < 1 Second step of proof: The small local neighborhood of a node looks like a labeled Poisson tree [Mossel et al. 12]. +1 +1 +1 +1-1 +1 +1-1 +1-1 -1 +1 +1 +1-1

(A) Impossibility for correlated reconstruction if τ < 1 Third step: The community memberships of leaf nodes is uninformative on the community membership of root asymptotically [Heimlicher et al. 12].? -1 +1-1 -1 +1 +1 +1-1

(B) Achievability of correlated reconstruction if τ > 32 log 2 Consider minimum bisection algorithm: Find two equal-sized parts with the minimum sum of weights of edges between parts with weight function w(l) = aµ(l) bν(l) aµ(l)+bν(l). Theorem solutions of minimum bisection are correlated with the true community assignment if τ > 32 log 2 and a l µ(l)w 2 (l), b l ν(l)w 2 (l) > 8 ln 2. Proof uses the Chernoff bound and the weight function w(l) is chosen optimally.

(B) Achievability of correlated reconstruction if τ > 32 log 2 Consider minimum bisection algorithm: Find two equal-sized parts with the minimum sum of weights of edges between parts with weight function w(l) = aµ(l) bν(l) aµ(l)+bν(l). Theorem solutions of minimum bisection are correlated with the true community assignment if τ > 32 log 2 and a l µ(l)w 2 (l), b l ν(l)w 2 (l) > 8 ln 2. Proof uses the Chernoff bound and the weight function w(l) is chosen optimally.

(B) Achievability of correlated reconstruction if τ > 32 log 2 Consider minimum bisection algorithm: Find two equal-sized parts with the minimum sum of weights of edges between parts with weight function w(l) = aµ(l) bν(l) aµ(l)+bν(l). Theorem solutions of minimum bisection are correlated with the true community assignment if τ > 32 log 2 and a l µ(l)w 2 (l), b l ν(l)w 2 (l) > 8 ln 2. Proof uses the Chernoff bound and the weight function w(l) is chosen optimally.

(C) Polynomial-time reconstruction for τ large enough A polynomial-time algorithm exploiting the spectrum of weighted adjacency matrix W with weight function w(l) = aµ(l) bν(l) aµ(l)+bν(l) The expectation of W conditional on the true community assignment σ satisfies: E[W σ] = a b 2n 11 + τ n σσ. If W E[W σ], then the top left singular vector of W a b 2n 11 approximates σ. Due to nodes with high degrees Ω( log n log log n ), W is not concentrated around its conditional mean.

(C) Polynomial-time reconstruction for τ large enough A polynomial-time algorithm exploiting the spectrum of weighted adjacency matrix W with weight function w(l) = aµ(l) bν(l) aµ(l)+bν(l) The expectation of W conditional on the true community assignment σ satisfies: E[W σ] = a b 2n 11 + τ n σσ. If W E[W σ], then the top left singular vector of W a b 2n 11 approximates σ. Due to nodes with high degrees Ω( log n log log n ), W is not concentrated around its conditional mean.

(C) Polynomial-time reconstruction for τ large enough A polynomial-time algorithm exploiting the spectrum of weighted adjacency matrix W with weight function w(l) = aµ(l) bν(l) aµ(l)+bν(l) The expectation of W conditional on the true community assignment σ satisfies: E[W σ] = a b 2n 11 + τ n σσ. If W E[W σ], then the top left singular vector of W a b 2n 11 approximates σ. Due to nodes with high degrees Ω( log n log log n ), W is not concentrated around its conditional mean.

(C) Polynomial-time reconstruction for τ large enough Spectral-Reconstruction algorithm: Input a, b, µ, ν, W ; Output community assignment. 1 Remove nodes with degree greater than 3 2 (a + b) and assign random community to these nodes. 2 Define W by setting to zero the rows and columns of W corresponding to the nodes removed. 3 Let ˆx denote the left singular vector associated with the largest singular values of W a b n 11. Output sign(ˆx) as the community assignment for remaining nodes.

(C) Polynomial-time reconstruction for τ large enough Spectral-Reconstruction algorithm: Input a, b, µ, ν, W ; Output community assignment. 1 Remove nodes with degree greater than 3 2 (a + b) and assign random community to these nodes. 2 Define W by setting to zero the rows and columns of W corresponding to the nodes removed. 3 Let ˆx denote the left singular vector associated with the largest singular values of W a b n 11. Output sign(ˆx) as the community assignment for remaining nodes.

(C) Polynomial-time reconstruction for τ large enough Spectral-Reconstruction algorithm: Input a, b, µ, ν, W ; Output community assignment. 1 Remove nodes with degree greater than 3 2 (a + b) and assign random community to these nodes. 2 Define W by setting to zero the rows and columns of W corresponding to the nodes removed. 3 Let ˆx denote the left singular vector associated with the largest singular values of W a b n 11. Output sign(ˆx) as the community assignment for remaining nodes.

(C) Polynomial-time reconstruction for τ large enough Theorem Spectral-Reconstruction algorithm outputs a community assignment correlated with the true one if τ > C a + b. Proof: The spectral norm of W E[W σ] is O( a + b). Ref. [Feige-Ofek 05], [Coja-Oghlan 10]. The L 2 distance between σ/ n and ˆx is bounded by O( W E[W σ] /τ).

(C) Polynomial-time reconstruction for τ large enough Theorem Spectral-Reconstruction algorithm outputs a community assignment correlated with the true one if τ > C a + b. Proof: The spectral norm of W E[W σ] is O( a + b). Ref. [Feige-Ofek 05], [Coja-Oghlan 10]. The L 2 distance between σ/ n and ˆx is bounded by O( W E[W σ] /τ).

(C) Polynomial-time reconstruction for τ large enough Theorem Spectral-Reconstruction algorithm outputs a community assignment correlated with the true one if τ > C a + b. Proof: The spectral norm of W E[W σ] is O( a + b). Ref. [Feige-Ofek 05], [Coja-Oghlan 10]. The L 2 distance between σ/ n and ˆx is bounded by O( W E[W σ] /τ).

(C) Polynomial-time reconstruction for τ > 64 Better performance guarantee if max{a, b} = Ω(log 6 n). Theorem Spectral-Reconstruction algorithm outputs a community assignment correlated with the true one if τ > 64 and max{a, b} = Ω(log 6 n). Proof: W E[W σ] 2 2τ. Ref. [Vu 07].

(C) Polynomial-time reconstruction for τ > 64 Better performance guarantee if max{a, b} = Ω(log 6 n). Theorem Spectral-Reconstruction algorithm outputs a community assignment correlated with the true one if τ > 64 and max{a, b} = Ω(log 6 n). Proof: W E[W σ] 2 2τ. Ref. [Vu 07].

Simulation of Spectral-Reconstruction algorithm 1 0.9 0.8 0.7 a=12,b=8 a=6,b=4 a=3,b=1 n=1000 0.6 Overlap 0.5 0.4 0.3 0.2 0.1 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 ε Two labels: r and b with µ(r) = 0.5 + ɛ and ν(r) = 0.5 ɛ. The threshold τ = 1 is depicted as a vertical dash line.

Summary of our results 0 1 32 log 2 64 C a + b τ Impossible, Open,MinBisection, Spectral*, Spectral

Future work Apply sparse labeled stochastic block model into real data Convergence of belief propagation Reconstruction algorithms approaching the reconstruction threshold

Future work Apply sparse labeled stochastic block model into real data Convergence of belief propagation Reconstruction algorithms approaching the reconstruction threshold

Future work Apply sparse labeled stochastic block model into real data Convergence of belief propagation Reconstruction algorithms approaching the reconstruction threshold

Future work Apply sparse labeled stochastic block model into real data Convergence of belief propagation Reconstruction algorithms approaching the reconstruction threshold