Spectral thresholds in the bipartite stochastic block model

Size: px
Start display at page:

Download "Spectral thresholds in the bipartite stochastic block model"

Transcription

1 Spectral thresholds in the bipartite stochastic block model Laura Florescu and Will Perkins NYU and U of Birmingham September 27, 2016 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 1 / 30

2 Stochastic Block Model Figure: Red edges added with P = p and blue edges with P = q. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 2 / 30

3 Community detection Goal: Detect communities in networks. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 3 / 30

4 Stochastic Block Model Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 4 / 30

5 Entries are not colored Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 5 / 30

6 Nor ordered Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 6 / 30

7 Stochastic Block Model First introduced by Holland, Laskey, Leinhardt in Motivation: discover communities in large networks. Theorem (Boppana, Dyer/Frieze, Snijders/Nowicki, Condon/Karp, McSherry, Bickel/Chen, etc) There are efficient algorithms for exactly recovering the true colors, provided that p q is large enough as n. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 7 / 30

8 Bipartite Stochastic Block Model Figure: Bipartite stochastic model on V 1 and V 2. Red edges added with P = δp(n 1, n 2 ) and blue edges with P = (2 δ)p(n 1, n 2 ). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 8 / 30

9 SBM Goal: get the planted assignment σ (on V 1 for bipartite stochastic model) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 9 / 30

10 SBM Goal: get the planted assignment σ (on V 1 for bipartite stochastic model) Detection: compute v that agrees with σ on 1/2 + ɛ fraction of vertices Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 9 / 30

11 SBM Goal: get the planted assignment σ (on V 1 for bipartite stochastic model) Detection: compute v that agrees with σ on 1/2 + ɛ fraction of vertices Recovery: compute v that agrees with σ on 1 o(1) fraction of vertices Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 9 / 30

12 Background Intermediate step in recovering solutions in planted problems [Feldman, Perkins, Vempala 14]. planted constraint satisfaction problems (CSP) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 10 / 30

13 Background Intermediate step in recovering solutions in planted problems [Feldman, Perkins, Vempala 14]. planted constraint satisfaction problems (CSP) Reducing planted problems on n variables will give vertex sets of size n 1 = n, n 2 = n k 1. (n 2 n 2 ) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 10 / 30

14 Unified Planted k-csp model Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30

15 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30

16 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30

17 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, define the random constraint satisfaction problem F Q,σ (n, m) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30

18 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, define the random constraint satisfaction problem F Q,σ (n, m) by drawing m k-clauses from C k (the set of all k-tuples) independently according to Q(σ(C)) Q σ (C) = C C k Q(σ(C )) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30

19 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, define the random constraint satisfaction problem F Q,σ (n, m) by drawing m k-clauses from C k (the set of all k-tuples) independently according to Q(σ(C)) Q σ (C) = C C k Q(σ(C )) where σ(c) is the vector of values that σ assigns to the k-tuple of literals comprising C. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30

20 Planted random k-sat and Goldreich PRG Planted random k-sat: Form a truth assignment φ of literals, then select each clause independently from the k-tuples of literals where at least one literal is set to 1 by φ. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 12 / 30

21 Planted random k-sat and Goldreich PRG Planted random k-sat: Form a truth assignment φ of literals, then select each clause independently from the k-tuples of literals where at least one literal is set to 1 by φ. Goldreich PRG: also add a 0/1, depending on a predicate evaluated on literals. (cryptography) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 12 / 30

22 Planted random k-sat and Goldreich PRG Planted random k-sat: Form a truth assignment φ of literals, then select each clause independently from the k-tuples of literals where at least one literal is set to 1 by φ. Goldreich PRG: also add a 0/1, depending on a predicate evaluated on literals. (cryptography) Feldman, Perkins, Vempala 14 gave a reduction of above and others to the BSBM. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 12 / 30

23 Information theory threshold When p = a/n and q = b/n Theorem (Mossel, Neeman, Sly, 2012) There is a test to distinguish the partition that succeeds with high probability if and only if a + b > 2 and (a b) 2 > 2(a + b). Proves conjecture of [Decelle, Krzakala, Moore, Zdeborova 13]. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 13 / 30

24 Computational threshold Dyer, Frieze 1989 p = na > q = nb fixed Condon, Karp 2001 a b n 1/2 McSherry 2001 a b b log n Coja-Oghlan 2010 a b b Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 14 / 30

25 Computational threshold Dyer, Frieze 1989 p = na > q = nb fixed Condon, Karp 2001 a b n 1/2 McSherry 2001 a b b log n Coja-Oghlan 2010 a b b Massoulié 2013 and Mossel, Neeman, Sly detection possible and efficient (a b) 2 > 2(a + b). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 14 / 30

26 Computational threshold Dyer, Frieze 1989 p = na > q = nb fixed Condon, Karp 2001 a b n 1/2 McSherry 2001 a b b log n Coja-Oghlan 2010 a b b Massoulié 2013 and Mossel, Neeman, Sly detection possible and efficient (a b) 2 > 2(a + b). Ingenious spectral methods Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 14 / 30

27 Previous work MNS Idea: nbhd of vertex in G(n, a/n, b/n) looks like a random labelled tree, where each child gives birth to Pois(a) vertices of same type, Pois(b) vertices of different type Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 15 / 30

28 Previous work MNS Idea: nbhd of vertex in G(n, a/n, b/n) looks like a random labelled tree, where each child gives birth to Pois(a) vertices of same type, Pois(b) vertices of different type show that conditioned on the labels of the bdry of the tree, the label of root is asymp indep of the rest of graph Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 15 / 30

29 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

30 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

31 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

32 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

33 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Can think of p c (T ) as the edge density at which the tree is connected. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

34 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Can think of p c (T ) as the edge density at which the tree is connected. trees with offspring distribution Pois( a+b a 2 ) and take 1 η = a+b. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

35 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Can think of p c (T ) as the edge density at which the tree is connected. trees with offspring distribution Pois( a+b a 2 ) and take 1 η = a+b. Then threshold reduces to (a b) 2 2(a + b). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30

36 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30

37 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition typical analysis of spectral algos: 2nd singular value > spectral norm of noise matrix M EM; Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30

38 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition typical analysis of spectral algos: 2nd singular value > spectral norm of noise matrix M EM; here λ 2 (EM) = Θ(p n 1 n 2 ), norm of noise M EM = Θ( pn 2 ). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30

39 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition typical analysis of spectral algos: 2nd singular value > spectral norm of noise matrix M EM; here λ 2 (EM) = Θ(p n 1 n 2 ), norm of noise M EM = Θ( pn 2 ). Feldman, Perkins, Vempala 14: subsampled power iteration recovers partition whp with p = Õ((n 1n 2 ) 1/2 ) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30

40 Questions 1 Here λ 2 < M EM. Is SVD doomed for p 1/n 1? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 18 / 30

41 Questions 1 Here λ 2 < M EM. Is SVD doomed for p 1/n 1? 2 What is the optimal threshold for detection in BSBM? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 18 / 30

42 Our results - sharp reconstruction/impossibility Theorem On the other hand, if n 2 n 1 and p 1 (δ 1) 2 n 1 n 2, then no algorithm can detect the partition. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 19 / 30

43 Our results - sharp reconstruction/impossibility Theorem On the other hand, if n 2 n 1 and p 1 (δ 1) 2 n 1 n 2, then no algorithm can detect the partition. Idea: Couple to a broadcast model on a multi-type Galton Watson tree. Show that conditioned on the labels of a log n bdry of the tree, the label of root is asymp indep of the rest of graph. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 19 / 30

44 Our results - sharp reconstruction/impossibility Theorem Let n 2 n 1. Then there is a polynomial-time algorithm that detects the partition V 1 = A 1 B 1 if p > 1 + ɛ (δ 1) 2 n 1 n 2 for any fixed ɛ > 0. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 20 / 30

45 Our results - sharp reconstruction/impossibility Theorem Let n 2 n 1. Then there is a polynomial-time algorithm that detects the partition V 1 = A 1 B 1 if p > 1 + ɛ (δ 1) 2 n 1 n 2 for any fixed ɛ > 0. Idea: reduce to SBM on graph on V 1 induced by paths of length 2 in bipartite graph. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 20 / 30

46 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

47 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

48 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

49 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

50 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

51 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

52 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Now compute a and b accordingly: a = (1 + ɛ)(2 2δ + δ2 ) (δ 1) 4 (1 + o(1)) b = (1 + ɛ)(2δ δ2 ) (δ 1) 4 (1 + o(1)) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

53 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Now compute a and b accordingly: a = (1 + ɛ)(2 2δ + δ2 ) (δ 1) 4 (1 + o(1)) b = (1 + ɛ)(2δ δ2 ) (δ 1) 4 (1 + o(1)) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

54 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Now compute a and b accordingly: a = (1 + ɛ)(2 2δ + δ2 ) (δ 1) 4 (1 + o(1)) b = (1 + ɛ)(2δ δ2 ) (δ 1) 4 (1 + o(1)) Apply criterion (a b) 2 (1 + ɛ)2(a + b). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30

55 Implications for planted k-sat - detection in the block model exhibits a sharp threshold at m = Θ(n r/2 ) hyperedges/clauses Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 22 / 30

56 Implications for planted k-sat - detection in the block model exhibits a sharp threshold at m = Θ(n r/2 ) hyperedges/clauses Definition The distribution complexity r of the planting distribution Q is the smallest r > 0 so that Q is an (r 1)-wise independent distribution on {±} k but not r-wise independent. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 22 / 30

57 Spectral algorithms Standard SVD: Compute left singular vector of M (adjacency matrix) corresponding to 2nd singular value, round signs to get v; compare σ and v Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 23 / 30

58 Spectral algorithms Standard SVD: Compute left singular vector of M (adjacency matrix) corresponding to 2nd singular value, round signs to get v; compare σ and v Diagonal deletion SVD: Set diagonal entries of MM T to 0, compute second eigenvector, round signs to get v; compare σ and v Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 23 / 30

59 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30

60 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. 2 If p V > n 2/3 the partition. 1 n 1/3 2, then whp the standard SVD algorithm recovers Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30

61 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. 2 If p V > n 2/3 the partition. 1 n 1/3 2, then whp the standard SVD algorithm recovers Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30

62 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. 2 If p V > n 2/3 the partition. 1 n 1/3 2, then whp the standard SVD algorithm recovers When n 2 = n 2, p D n 3/2, p V n 4/3. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30

63 Timeline Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 25 / 30

64 Our results Plot of correlation as a function of p(n1,n2), n1= 1000, n2=100000, delta=0.2 v dd ess Plot of the top eigenvalues of MM T for delta=0.5 and delta=1 for the various regimes p(n1,n2) delta=0.5 delta= correlation top 10 normalized eigenvalues p(n1,n2) x p1=(n1*n2)^( 1/2)*log(n1), p2=n1^( 2/3)*n2^( 2/3), p3=n1^( 2/3)*n2^( 1/3)*log(n1) Figure: Correlations of computed vectors with planted vector Figure: Eigenvalue separation Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 26 / 30

65 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

66 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

67 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

68 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

69 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) ( ( ) = O 1 log n 1 ); = O 1 log n 1 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

70 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) ( ( ) = O 1 log n 1 ); = O 1 log n 1 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 e 2 (B) σ/ n 1 = O(log 1 n 1 ) (by special case of Sin Theta Theorem). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

71 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) ( ( ) = O 1 log n 1 ); = O 1 log n 1 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 e 2 (B) σ/ n 1 = O(log 1 n 1 ) (by special case of Sin Theta Theorem). Conclude by rounding signs of e 2 (B). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30

72 Conclusions Theorem Can efficiently detect partition in BSBM if p > 1+ɛ (δ 1) 2 n 1 n 2 Cannot detect if p 1 (δ 1) 2 n 1 n 2 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 28 / 30

73 Conclusions Theorem Can efficiently detect partition in BSBM if p > 1+ɛ (δ 1) 2 n 1 n 2 Cannot detect if p 1 (δ 1) 2 n 1 n 2 spectral method still works if λ 2 norm of noise matrix Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 28 / 30

74 Conclusions Theorem Can efficiently detect partition in BSBM if p > 1+ɛ (δ 1) 2 n 1 n 2 Cannot detect if p 1 (δ 1) 2 n 1 n 2 spectral method still works if λ 2 norm of noise matrix modifying adjacency matrix improves recovery significantly Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 28 / 30

75 Open problems apply Diagonal Deletion type of algorithm for improvement over SVD in other problems? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 29 / 30

76 Open problems apply Diagonal Deletion type of algorithm for improvement over SVD in other problems? sharper detection thresholds for planted k-sat? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 29 / 30

77 Thank you! Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 30 / 30

Spectral thresholds in the bipartite stochastic block model

Spectral thresholds in the bipartite stochastic block model JMLR: Workshop and Conference Proceedings vol 49:1 17, 2016 Spectral thresholds in the bipartite stochastic block model Laura Florescu New York University Will Perkins University of Birmingham FLORESCU@CIMS.NYU.EDU

More information

Reconstruction in the Generalized Stochastic Block Model

Reconstruction in the Generalized Stochastic Block Model Reconstruction in the Generalized Stochastic Block Model Marc Lelarge 1 Laurent Massoulié 2 Jiaming Xu 3 1 INRIA-ENS 2 INRIA-Microsoft Research Joint Centre 3 University of Illinois, Urbana-Champaign GDR

More information

Belief Propagation, Robust Reconstruction and Optimal Recovery of Block Models

Belief Propagation, Robust Reconstruction and Optimal Recovery of Block Models JMLR: Workshop and Conference Proceedings vol 35:1 15, 2014 Belief Propagation, Robust Reconstruction and Optimal Recovery of Block Models Elchanan Mossel mossel@stat.berkeley.edu Department of Statistics

More information

Statistical and Computational Phase Transitions in Planted Models

Statistical and Computational Phase Transitions in Planted Models Statistical and Computational Phase Transitions in Planted Models Jiaming Xu Joint work with Yudong Chen (UC Berkeley) Acknowledgement: Prof. Bruce Hajek November 4, 203 Cluster/Community structure in

More information

How Robust are Thresholds for Community Detection?

How Robust are Thresholds for Community Detection? How Robust are Thresholds for Community Detection? Ankur Moitra (MIT) joint work with Amelia Perry (MIT) and Alex Wein (MIT) Let me tell you a story about the success of belief propagation and statistical

More information

Community detection in stochastic block models via spectral methods

Community detection in stochastic block models via spectral methods Community detection in stochastic block models via spectral methods Laurent Massoulié (MSR-Inria Joint Centre, Inria) based on joint works with: Dan Tomozei (EPFL), Marc Lelarge (Inria), Jiaming Xu (UIUC),

More information

Reconstruction in the Sparse Labeled Stochastic Block Model

Reconstruction in the Sparse Labeled Stochastic Block Model Reconstruction in the Sparse Labeled Stochastic Block Model Marc Lelarge 1 Laurent Massoulié 2 Jiaming Xu 3 1 INRIA-ENS 2 INRIA-Microsoft Research Joint Centre 3 University of Illinois, Urbana-Champaign

More information

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria Community Detection fundamental limits & efficient algorithms Laurent Massoulié, Inria Community Detection From graph of node-to-node interactions, identify groups of similar nodes Example: Graph of US

More information

Spectral Partitiong in a Stochastic Block Model

Spectral Partitiong in a Stochastic Block Model Spectral Graph Theory Lecture 21 Spectral Partitiong in a Stochastic Block Model Daniel A. Spielman November 16, 2015 Disclaimer These notes are not necessarily an accurate representation of what happened

More information

ISIT Tutorial Information theory and machine learning Part II

ISIT Tutorial Information theory and machine learning Part II ISIT Tutorial Information theory and machine learning Part II Emmanuel Abbe Princeton University Martin Wainwright UC Berkeley 1 Inverse problems on graphs Examples: A large variety of machine learning

More information

Predicting Phase Transitions in Hypergraph q-coloring with the Cavity Method

Predicting Phase Transitions in Hypergraph q-coloring with the Cavity Method Predicting Phase Transitions in Hypergraph q-coloring with the Cavity Method Marylou Gabrié (LPS, ENS) Varsha Dani (University of New Mexico), Guilhem Semerjian (LPT, ENS), Lenka Zdeborová (CEA, Saclay)

More information

On the Complexity of Random Satisfiability Problems with Planted Solutions

On the Complexity of Random Satisfiability Problems with Planted Solutions On the Complexity of Random Satisfiability Problems with Planted Solutions Vitaly Feldman Will Perkins Santosh Vempala Abstract The problem of identifying a planted assignment given a random k-sat formula

More information

COMMUNITY DETECTION IN SPARSE NETWORKS VIA GROTHENDIECK S INEQUALITY

COMMUNITY DETECTION IN SPARSE NETWORKS VIA GROTHENDIECK S INEQUALITY COMMUNITY DETECTION IN SPARSE NETWORKS VIA GROTHENDIECK S INEQUALITY OLIVIER GUÉDON AND ROMAN VERSHYNIN Abstract. We present a simple and flexible method to prove consistency of semidefinite optimization

More information

On the Complexity of Random Satisfiability Problems with Planted Solutions

On the Complexity of Random Satisfiability Problems with Planted Solutions On the Complexity of Random Satisfiability Problems with Planted Solutions Vitaly Feldman Will Perkins Santosh Vempala arxiv:1311.4821v8 [cs.cc] 6 Mar 2018 Abstract The problem of identifying a planted

More information

Independence and chromatic number (and random k-sat): Sparse Case. Dimitris Achlioptas Microsoft

Independence and chromatic number (and random k-sat): Sparse Case. Dimitris Achlioptas Microsoft Independence and chromatic number (and random k-sat): Sparse Case Dimitris Achlioptas Microsoft Random graphs W.h.p.: with probability that tends to 1 as n. Hamiltonian cycle Let τ 2 be the moment all

More information

Consistency Thresholds for the Planted Bisection Model

Consistency Thresholds for the Planted Bisection Model Consistency Thresholds for the Planted Bisection Model [Extended Abstract] Elchanan Mossel Joe Neeman University of Pennsylvania and University of Texas, Austin University of California, Mathematics Dept,

More information

How robust are reconstruction thresholds for community detection?

How robust are reconstruction thresholds for community detection? How robust are reconstruction thresholds for community detection? The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published

More information

The non-backtracking operator

The non-backtracking operator The non-backtracking operator Florent Krzakala LPS, Ecole Normale Supérieure in collaboration with Paris: L. Zdeborova, A. Saade Rome: A. Decelle Würzburg: J. Reichardt Santa Fe: C. Moore, P. Zhang Berkeley:

More information

Statistical-Computational Phase Transitions in Planted Models: The High-Dimensional Setting

Statistical-Computational Phase Transitions in Planted Models: The High-Dimensional Setting : The High-Dimensional Setting Yudong Chen YUDONGCHEN@EECSBERELEYEDU Department of EECS, University of California, Berkeley, Berkeley, CA 94704, USA Jiaming Xu JXU18@ILLINOISEDU Department of ECE, University

More information

Matrix estimation by Universal Singular Value Thresholding

Matrix estimation by Universal Singular Value Thresholding Matrix estimation by Universal Singular Value Thresholding Courant Institute, NYU Let us begin with an example: Suppose that we have an undirected random graph G on n vertices. Model: There is a real symmetric

More information

SPIN SYSTEMS: HARDNESS OF APPROXIMATE COUNTING VIA PHASE TRANSITIONS

SPIN SYSTEMS: HARDNESS OF APPROXIMATE COUNTING VIA PHASE TRANSITIONS SPIN SYSTEMS: HARDNESS OF APPROXIMATE COUNTING VIA PHASE TRANSITIONS Andreas Galanis Joint work with: Jin-Yi Cai Leslie Ann Goldberg Heng Guo Mark Jerrum Daniel Štefankovič Eric Vigoda The Power of Randomness

More information

The Tightness of the Kesten-Stigum Reconstruction Bound for a Symmetric Model With Multiple Mutations

The Tightness of the Kesten-Stigum Reconstruction Bound for a Symmetric Model With Multiple Mutations The Tightness of the Kesten-Stigum Reconstruction Bound for a Symmetric Model With Multiple Mutations City University of New York Frontier Probability Days 2018 Joint work with Dr. Sreenivasa Rao Jammalamadaka

More information

Algorithmic Barriers from Phase Transitions

Algorithmic Barriers from Phase Transitions Algorithmic Barriers from Phase Transitions Dimitris Achlioptas Amin Coja-Oghlan Abstract For many random Constraint Satisfaction Problems, by now there exist asymptotically tight estimates of the largest

More information

Lecture 8: February 8

Lecture 8: February 8 CS71 Randomness & Computation Spring 018 Instructor: Alistair Sinclair Lecture 8: February 8 Disclaimer: These notes have not been subjected to the usual scrutiny accorded to formal publications. They

More information

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices)

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Chapter 14 SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Today we continue the topic of low-dimensional approximation to datasets and matrices. Last time we saw the singular

More information

Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices

Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices Journal of Machine Learning Research 17 (2016) 1-57 Submitted 8/14; Revised 5/15; Published 4/16 Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number

More information

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora Scribe: Today we continue the

More information

Advanced Algorithms 南京大学 尹一通

Advanced Algorithms 南京大学 尹一通 Advanced Algorithms 南京大学 尹一通 Constraint Satisfaction Problem variables: (CSP) X = {x 1, x2,..., xn} each variable ranges over a finite domain Ω an assignment σ ΩX assigns each variable a value in Ω constraints:

More information

Solving Random Satisfiable 3CNF Formulas in Expected Polynomial Time

Solving Random Satisfiable 3CNF Formulas in Expected Polynomial Time Solving Random Satisfiable 3CNF Formulas in Expected Polynomial Time Michael Krivelevich and Dan Vilenchik Tel-Aviv University Solving Random Satisfiable 3CNF Formulas in Expected Polynomial Time p. 1/2

More information

Quantum walk algorithms

Quantum walk algorithms Quantum walk algorithms Andrew Childs Institute for Quantum Computing University of Waterloo 28 September 2011 Randomized algorithms Randomness is an important tool in computer science Black-box problems

More information

Chasing the k-sat Threshold

Chasing the k-sat Threshold Chasing the k-sat Threshold Amin Coja-Oghlan Goethe University Frankfurt Random Discrete Structures In physics, phase transitions are studied by non-rigorous methods. New mathematical ideas are necessary

More information

Approximation Algorithms

Approximation Algorithms Approximation Algorithms Chapter 26 Semidefinite Programming Zacharias Pitouras 1 Introduction LP place a good lower bound on OPT for NP-hard problems Are there other ways of doing this? Vector programs

More information

Spectral Redemption: Clustering Sparse Networks

Spectral Redemption: Clustering Sparse Networks Spectral Redemption: Clustering Sparse Networks Florent Krzakala Cristopher Moore Elchanan Mossel Joe Neeman Allan Sly, et al. SFI WORKING PAPER: 03-07-05 SFI Working Papers contain accounts of scienti5ic

More information

Algorithms Reading Group Notes: Provable Bounds for Learning Deep Representations

Algorithms Reading Group Notes: Provable Bounds for Learning Deep Representations Algorithms Reading Group Notes: Provable Bounds for Learning Deep Representations Joshua R. Wang November 1, 2016 1 Model and Results Continuing from last week, we again examine provable algorithms for

More information

PCP Theorem and Hardness of Approximation

PCP Theorem and Hardness of Approximation PCP Theorem and Hardness of Approximation An Introduction Lee Carraher and Ryan McGovern Department of Computer Science University of Cincinnati October 27, 2003 Introduction Assuming NP P, there are many

More information

Community Detection and Stochastic Block Models: Recent Developments

Community Detection and Stochastic Block Models: Recent Developments Journal of Machine Learning Research 18 (2018) 1-86 Submitted 9/16; Revised 3/17; Published 4/18 Community Detection and Stochastic Block Models: Recent Developments Emmanuel Abbe Program in Applied and

More information

Local MAX-CUT in Smoothed Polynomial Time

Local MAX-CUT in Smoothed Polynomial Time Local MAX-CUT in Smoothed Polynomial Time Omer Angel 1 Sebastien Bubeck 2 Yuval Peres 2 Fan Wei 3 1 University of British Columbia 2 Microsoft Research Redmond 3 Stanford University November 9, 2016 Introduction

More information

SPARSE RANDOM GRAPHS: REGULARIZATION AND CONCENTRATION OF THE LAPLACIAN

SPARSE RANDOM GRAPHS: REGULARIZATION AND CONCENTRATION OF THE LAPLACIAN SPARSE RANDOM GRAPHS: REGULARIZATION AND CONCENTRATION OF THE LAPLACIAN CAN M. LE, ELIZAVETA LEVINA, AND ROMAN VERSHYNIN Abstract. We study random graphs with possibly different edge probabilities in the

More information

Phase Transitions in Random Discrete Structures

Phase Transitions in Random Discrete Structures Institut für Optimierung und Diskrete Mathematik Phase Transition in Thermodynamics The phase transition deals with a sudden change in the properties of an asymptotically large structure by altering critical

More information

Limitations in Approximating RIP

Limitations in Approximating RIP Alok Puranik Mentor: Adrian Vladu Fifth Annual PRIMES Conference, 2015 Outline 1 Background The Problem Motivation Construction Certification 2 Planted model Planting eigenvalues Analysis Distinguishing

More information

Benchmarking recovery theorems for the DC-SBM

Benchmarking recovery theorems for the DC-SBM Benchmarking recovery theorems for the DC-SBM Yali Wan Department of Statistics University of Washington Seattle, WA 98195-4322, USA Marina Meila Department of Statistics University of Washington Seattle,

More information

Benchmarking recovery theorems for the DC-SBM

Benchmarking recovery theorems for the DC-SBM Benchmarking recovery theorems for the DC-SBM Yali Wan Department of Statistics University of Washington Seattle, WA 98195-4322, USA yaliwan@washington.edu Marina Meilă Department of Statistics University

More information

Lecture 21 (Oct. 24): Max Cut SDP Gap and Max 2-SAT

Lecture 21 (Oct. 24): Max Cut SDP Gap and Max 2-SAT CMPUT 67: Approximation Algorithms Fall 014 Lecture 1 Oct. 4): Max Cut SDP Gap and Max -SAT Lecturer: Zachary Friggstad Scribe: Chris Martin 1.1 Near-Tight Analysis of the Max Cut SDP Recall the Max Cut

More information

arxiv: v1 [math.st] 26 Jan 2018

arxiv: v1 [math.st] 26 Jan 2018 CONCENTRATION OF RANDOM GRAPHS AND APPLICATION TO COMMUNITY DETECTION arxiv:1801.08724v1 [math.st] 26 Jan 2018 CAN M. LE, ELIZAVETA LEVINA AND ROMAN VERSHYNIN Abstract. Random matrix theory has played

More information

A Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities

A Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities Journal of Advanced Statistics, Vol. 3, No. 2, June 2018 https://dx.doi.org/10.22606/jas.2018.32001 15 A Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities Laala Zeyneb

More information

Information-theoretic bounds and phase transitions in clustering, sparse PCA, and submatrix localization

Information-theoretic bounds and phase transitions in clustering, sparse PCA, and submatrix localization Information-theoretic bounds and phase transitions in clustering, sparse PCA, and submatrix localization Jess Banks Cristopher Moore Roman Vershynin Nicolas Verzelen Jiaming Xu Abstract We study the problem

More information

MIXING TIMES OF RANDOM WALKS ON DYNAMIC CONFIGURATION MODELS

MIXING TIMES OF RANDOM WALKS ON DYNAMIC CONFIGURATION MODELS MIXING TIMES OF RANDOM WALKS ON DYNAMIC CONFIGURATION MODELS Frank den Hollander Leiden University, The Netherlands New Results and Challenges with RWDRE, 11 13 January 2017, Heilbronn Institute for Mathematical

More information

The giant component in a random subgraph of a given graph

The giant component in a random subgraph of a given graph The giant component in a random subgraph of a given graph Fan Chung, Paul Horn, and Linyuan Lu 2 University of California, San Diego 2 University of South Carolina Abstract We consider a random subgraph

More information

Clustering from Sparse Pairwise Measurements

Clustering from Sparse Pairwise Measurements Clustering from Sparse Pairwise Measurements arxiv:6006683v2 [cssi] 9 May 206 Alaa Saade Laboratoire de Physique Statistique École Normale Supérieure, 24 Rue Lhomond Paris 75005 Marc Lelarge INRIA and

More information

ELE 538B: Mathematics of High-Dimensional Data. Spectral methods. Yuxin Chen Princeton University, Fall 2018

ELE 538B: Mathematics of High-Dimensional Data. Spectral methods. Yuxin Chen Princeton University, Fall 2018 ELE 538B: Mathematics of High-Dimensional Data Spectral methods Yuxin Chen Princeton University, Fall 2018 Outline A motivating application: graph clustering Distance and angles between two subspaces Eigen-space

More information

Random Graph Coloring

Random Graph Coloring Random Graph Coloring Amin Coja-Oghlan Goethe University Frankfurt A simple question... What is the chromatic number of G(n,m)? [ER 60] n vertices m = dn/2 random edges ... that lacks a simple answer Early

More information

Math 391: Midterm 1.0 Spring 2016

Math 391: Midterm 1.0 Spring 2016 Math 391: Midterm 1.0 Spring 2016 James Skon skonjp@kenyon.edu Test date: October 5 Submission Instructions: Give all your assumptions. Show all your work. Be as complete as possible. Grading Problem 1

More information

CS 301: Complexity of Algorithms (Term I 2008) Alex Tiskin Harald Räcke. Hamiltonian Cycle. 8.5 Sequencing Problems. Directed Hamiltonian Cycle

CS 301: Complexity of Algorithms (Term I 2008) Alex Tiskin Harald Räcke. Hamiltonian Cycle. 8.5 Sequencing Problems. Directed Hamiltonian Cycle 8.5 Sequencing Problems Basic genres. Packing problems: SET-PACKING, INDEPENDENT SET. Covering problems: SET-COVER, VERTEX-COVER. Constraint satisfaction problems: SAT, 3-SAT. Sequencing problems: HAMILTONIAN-CYCLE,

More information

Lecture 20: Course summary and open problems. 1 Tools, theorems and related subjects

Lecture 20: Course summary and open problems. 1 Tools, theorems and related subjects CSE 533: The PCP Theorem and Hardness of Approximation (Autumn 2005) Lecture 20: Course summary and open problems Dec. 7, 2005 Lecturer: Ryan O Donnell Scribe: Ioannis Giotis Topics we discuss in this

More information

Extreme eigenvalues of Erdős-Rényi random graphs

Extreme eigenvalues of Erdős-Rényi random graphs Extreme eigenvalues of Erdős-Rényi random graphs Florent Benaych-Georges j.w.w. Charles Bordenave and Antti Knowles MAP5, Université Paris Descartes May 18, 2018 IPAM UCLA Inhomogeneous Erdős-Rényi random

More information

Linear inverse problems on Erdős-Rényi graphs: Information-theoretic limits and efficient recovery

Linear inverse problems on Erdős-Rényi graphs: Information-theoretic limits and efficient recovery Linear inverse problems on Erdős-Rényi graphs: Information-theoretic limits and efficient recovery Emmanuel Abbe, Afonso S. Bandeira, Annina Bracher, Amit Singer Princeton University Abstract This paper

More information

COMP Analysis of Algorithms & Data Structures

COMP Analysis of Algorithms & Data Structures COMP 3170 - Analysis of Algorithms & Data Structures Shahin Kamali Computational Complexity CLRS 34.1-34.4 University of Manitoba COMP 3170 - Analysis of Algorithms & Data Structures 1 / 50 Polynomial

More information

NP-complete Problems

NP-complete Problems NP-complete Problems HP, TSP, 3COL, 0/1IP Dimitris Diamantis µπλ November 6, 2014 Dimitris Diamantis (µπλ ) NP-complete Problems November 6, 2014 1 / 34 HAMILTON PATH is NP-Complete Definition Given an

More information

The condensation phase transition in random graph coloring

The condensation phase transition in random graph coloring The condensation phase transition in random graph coloring Victor Bapst Goethe University, Frankfurt Joint work with Amin Coja-Oghlan, Samuel Hetterich, Felicia Rassmann and Dan Vilenchik arxiv:1404.5513

More information

1 Adjacency matrix and eigenvalues

1 Adjacency matrix and eigenvalues CSC 5170: Theory of Computational Complexity Lecture 7 The Chinese University of Hong Kong 1 March 2010 Our objective of study today is the random walk algorithm for deciding if two vertices in an undirected

More information

ON THE QUALITY OF SPECTRAL SEPARATORS

ON THE QUALITY OF SPECTRAL SEPARATORS ON THE QUALITY OF SPECTRAL SEPARATORS STEPHEN GUATTERY AND GARY L. MILLER Abstract. Computing graph separators is an important step in many graph algorithms. A popular technique for finding separators

More information

Spectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods

Spectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods Spectral Clustering Seungjin Choi Department of Computer Science POSTECH, Korea seungjin@postech.ac.kr 1 Spectral methods Spectral Clustering? Methods using eigenvectors of some matrices Involve eigen-decomposition

More information

Random matrices: A Survey. Van H. Vu. Department of Mathematics Rutgers University

Random matrices: A Survey. Van H. Vu. Department of Mathematics Rutgers University Random matrices: A Survey Van H. Vu Department of Mathematics Rutgers University Basic models of random matrices Let ξ be a real or complex-valued random variable with mean 0 and variance 1. Examples.

More information

Approximation & Complexity

Approximation & Complexity Summer school on semidefinite optimization Approximation & Complexity David Steurer Cornell University Part 1 September 6, 2012 Overview Part 1 Unique Games Conjecture & Basic SDP Part 2 SDP Hierarchies:

More information

Lecture 06 01/31/ Proofs for emergence of giant component

Lecture 06 01/31/ Proofs for emergence of giant component M375T/M396C: Topics in Complex Networks Spring 2013 Lecture 06 01/31/13 Lecturer: Ravi Srinivasan Scribe: Tianran Geng 6.1 Proofs for emergence of giant component We now sketch the main ideas underlying

More information

XVI International Congress on Mathematical Physics

XVI International Congress on Mathematical Physics Aug 2009 XVI International Congress on Mathematical Physics Underlying geometry: finite graph G=(V,E ). Set of possible configurations: V (assignments of +/- spins to the vertices). Probability of a configuration

More information

Label Cover Algorithms via the Log-Density Threshold

Label Cover Algorithms via the Log-Density Threshold Label Cover Algorithms via the Log-Density Threshold Jimmy Wu jimmyjwu@stanford.edu June 13, 2017 1 Introduction Since the discovery of the PCP Theorem and its implications for approximation algorithms,

More information

Algorithms and Theory of Computation. Lecture 22: NP-Completeness (2)

Algorithms and Theory of Computation. Lecture 22: NP-Completeness (2) Algorithms and Theory of Computation Lecture 22: NP-Completeness (2) Xiaohui Bei MAS 714 November 8, 2018 Nanyang Technological University MAS 714 November 8, 2018 1 / 20 Set Cover Set Cover Input: a set

More information

CMSC Discrete Mathematics FINAL EXAM Tuesday, December 5, 2017, 10:30-12:30

CMSC Discrete Mathematics FINAL EXAM Tuesday, December 5, 2017, 10:30-12:30 CMSC-37110 Discrete Mathematics FINAL EXAM Tuesday, December 5, 2017, 10:30-12:30 Name (print): Email: This exam contributes 40% to your course grade. Do not use book, notes, scrap paper. NO ELECTRONIC

More information

Unique Games Conjecture & Polynomial Optimization. David Steurer

Unique Games Conjecture & Polynomial Optimization. David Steurer Unique Games Conjecture & Polynomial Optimization David Steurer Newton Institute, Cambridge, July 2013 overview Proof Complexity Approximability Polynomial Optimization Quantum Information computational

More information

U.C. Berkeley Better-than-Worst-Case Analysis Handout 3 Luca Trevisan May 24, 2018

U.C. Berkeley Better-than-Worst-Case Analysis Handout 3 Luca Trevisan May 24, 2018 U.C. Berkeley Better-than-Worst-Case Analysis Handout 3 Luca Trevisan May 24, 2018 Lecture 3 In which we show how to find a planted clique in a random graph. 1 Finding a Planted Clique We will analyze

More information

Eigenvalues, random walks and Ramanujan graphs

Eigenvalues, random walks and Ramanujan graphs Eigenvalues, random walks and Ramanujan graphs David Ellis 1 The Expander Mixing lemma We have seen that a bounded-degree graph is a good edge-expander if and only if if has large spectral gap If G = (V,

More information

CS264: Beyond Worst-Case Analysis Lectures #9 and 10: Spectral Algorithms for Planted Bisection and Planted Clique

CS264: Beyond Worst-Case Analysis Lectures #9 and 10: Spectral Algorithms for Planted Bisection and Planted Clique CS64: Beyond Worst-Case Analysis Lectures #9 and 10: Spectral Algorithms for Planted Bisection and Planted Clique Tim Roughgarden February 7 and 9, 017 1 Preamble Lectures #6 8 studied deterministic conditions

More information

What can be sampled locally?

What can be sampled locally? What can be sampled locally? Yitong Yin Nanjing University Joint work with: Weiming Feng, Yuxin Sun Local Computation Locality in distributed graph algorithms. [Linial, FOCS 87, SICOMP 92] the LOCAL model:

More information

A spectral technique for random satisfiable 3CNF formulas

A spectral technique for random satisfiable 3CNF formulas A spectral technique for random satisfiable 3CNF formulas Abraham Flaxman Carnegie Mellon University Mathematical Sciences Department abie@cmu.edu Abstract Let I be a random 3CNF formula generated by choosing

More information

arxiv: v2 [cs.ds] 3 Oct 2017

arxiv: v2 [cs.ds] 3 Oct 2017 Orthogonal Vectors Indexing Isaac Goldstein 1, Moshe Lewenstein 1, and Ely Porat 1 1 Bar-Ilan University, Ramat Gan, Israel {goldshi,moshe,porately}@cs.biu.ac.il arxiv:1710.00586v2 [cs.ds] 3 Oct 2017 Abstract

More information

Community Detection. Data Analytics - Community Detection Module

Community Detection. Data Analytics - Community Detection Module Community Detection Data Analytics - Community Detection Module Zachary s karate club Members of a karate club (observed for 3 years). Edges represent interactions outside the activities of the club. Community

More information

PCPs and Inapproximability Gap-producing and Gap-Preserving Reductions. My T. Thai

PCPs and Inapproximability Gap-producing and Gap-Preserving Reductions. My T. Thai PCPs and Inapproximability Gap-producing and Gap-Preserving Reductions My T. Thai 1 1 Hardness of Approximation Consider a maximization problem Π such as MAX-E3SAT. To show that it is NP-hard to approximation

More information

Non-Approximability Results (2 nd part) 1/19

Non-Approximability Results (2 nd part) 1/19 Non-Approximability Results (2 nd part) 1/19 Summary - The PCP theorem - Application: Non-approximability of MAXIMUM 3-SAT 2/19 Non deterministic TM - A TM where it is possible to associate more than one

More information

Reconstruction for Models on Random Graphs

Reconstruction for Models on Random Graphs 1 Stanford University, 2 ENS Paris October 21, 2007 Outline 1 The Reconstruction Problem 2 Related work and applications 3 Results The Reconstruction Problem: A Story Alice and Bob Alice, Bob and G root

More information

Clustering Algorithms for Random and Pseudo-random Structures

Clustering Algorithms for Random and Pseudo-random Structures Abstract Clustering Algorithms for Random and Pseudo-random Structures Pradipta Mitra 2008 Partitioning of a set objects into a number of clusters according to a suitable distance metric is one of the

More information

Mixing time and diameter in random graphs

Mixing time and diameter in random graphs October 5, 2009 Based on joint works with: Asaf Nachmias, and Jian Ding, Eyal Lubetzky and Jeong-Han Kim. Background The mixing time of the lazy random walk on a graph G is T mix (G) = T mix (G, 1/4) =

More information

Random Lifts of Graphs

Random Lifts of Graphs 27th Brazilian Math Colloquium, July 09 Plan of this talk A brief introduction to the probabilistic method. A quick review of expander graphs and their spectrum. Lifts, random lifts and their properties.

More information

(a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? Solution: dim N(A) 1, since rank(a) 3. Ax =

(a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? Solution: dim N(A) 1, since rank(a) 3. Ax = . (5 points) (a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? dim N(A), since rank(a) 3. (b) If we also know that Ax = has no solution, what do we know about the rank of A? C(A)

More information

Graph Partitioning. Under the Spectral Lens

Graph Partitioning. Under the Spectral Lens Graph Partitioning Under the Spectral Lens Konstantinos Koiliaris A Dissertation Presented to the Faculty of MPLA in Candidacy for the Degree of Master of Science Recommended for Acceptance by the Department

More information

CS369N: Beyond Worst-Case Analysis Lecture #4: Probabilistic and Semirandom Models for Clustering and Graph Partitioning

CS369N: Beyond Worst-Case Analysis Lecture #4: Probabilistic and Semirandom Models for Clustering and Graph Partitioning CS369N: Beyond Worst-Case Analysis Lecture #4: Probabilistic and Semirandom Models for Clustering and Graph Partitioning Tim Roughgarden April 25, 2010 1 Learning Mixtures of Gaussians This lecture continues

More information

P, NP, NP-Complete, and NPhard

P, NP, NP-Complete, and NPhard P, NP, NP-Complete, and NPhard Problems Zhenjiang Li 21/09/2011 Outline Algorithm time complicity P and NP problems NP-Complete and NP-Hard problems Algorithm time complicity Outline What is this course

More information

Information Recovery from Pairwise Measurements

Information Recovery from Pairwise Measurements Information Recovery from Pairwise Measurements A Shannon-Theoretic Approach Yuxin Chen, Changho Suh, Andrea Goldsmith Stanford University KAIST Page 1 Recovering data from correlation measurements A large

More information

CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory

CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory Tim Roughgarden & Gregory Valiant May 2, 2016 Spectral graph theory is the powerful and beautiful theory that arises from

More information

Tight Bounds For Random MAX 2-SAT

Tight Bounds For Random MAX 2-SAT Tight Bounds For Random MAX 2-SAT MohammadTaghi Hajiaghayi Jeong Han Kim Abstract For a conjunctive normal form formula F with n variables and m = cn 2-variable clauses (c is called the density), denote

More information

Spectral gap in bipartite biregular graphs and applications

Spectral gap in bipartite biregular graphs and applications Spectral gap in bipartite biregular graphs and applications Ioana Dumitriu Department of Mathematics University of Washington (Seattle) Joint work with Gerandy Brito and Kameron Harris ICERM workshop on

More information

Lecture 18: More NP-Complete Problems

Lecture 18: More NP-Complete Problems 6.045 Lecture 18: More NP-Complete Problems 1 The Clique Problem a d f c b e g Given a graph G and positive k, does G contain a complete subgraph on k nodes? CLIQUE = { (G,k) G is an undirected graph with

More information

Lecture 8: Statistical vs Computational Complexity Tradeoffs via SoS

Lecture 8: Statistical vs Computational Complexity Tradeoffs via SoS CS369H: Hierarchies of Integer Programming Relaxations Spring 2016-2017 Lecture 8: Statistical vs Computational Complexity Tradeoffs via SoS Pravesh Kothari Scribes: Carrie and Vaggos Overview We show

More information

Notes for Lecture 2. Statement of the PCP Theorem and Constraint Satisfaction

Notes for Lecture 2. Statement of the PCP Theorem and Constraint Satisfaction U.C. Berkeley Handout N2 CS294: PCP and Hardness of Approximation January 23, 2006 Professor Luca Trevisan Scribe: Luca Trevisan Notes for Lecture 2 These notes are based on my survey paper [5]. L.T. Statement

More information

Goldreich s PRG: Evidence for near-optimal polynomial stretch

Goldreich s PRG: Evidence for near-optimal polynomial stretch Goldreich s PRG: Evidence for near-optimal polynomial stretch Ryan O Donnell Computer Science Department Carnegie Mellon University Pittsburgh, PA odonnell@cs.cmu.edu David Witmer Computer Science Department

More information

the configuration space of Branched polymers R. Kenyon, P. Winkler

the configuration space of Branched polymers R. Kenyon, P. Winkler the configuration space of Branched polymers R. Kenyon, P. Winkler artificial blood catalyst recovery Branched polymers (dendrimers) in modern science artificial photosynthesis A branched polymer is a

More information

A reverse Sidorenko inequality Independent sets, colorings, and graph homomorphisms

A reverse Sidorenko inequality Independent sets, colorings, and graph homomorphisms A reverse Sidorenko inequality Independent sets, colorings, and graph homomorphisms Yufei Zhao (MIT) Joint work with Ashwin Sah (MIT) Mehtaab Sawhney (MIT) David Stoner (Harvard) Question Fix d. Which

More information

Approximating MAX-E3LIN is NP-Hard

Approximating MAX-E3LIN is NP-Hard Approximating MAX-E3LIN is NP-Hard Evan Chen May 4, 2016 This lecture focuses on the MAX-E3LIN problem. We prove that approximating it is NP-hard by a reduction from LABEL-COVER. 1 Introducing MAX-E3LIN

More information

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Spectral Graph Theory and its Applications Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Outline Adjacency matrix and Laplacian Intuition, spectral graph drawing

More information