Spectral thresholds in the bipartite stochastic block model
|
|
- Barrie Garrett
- 5 years ago
- Views:
Transcription
1 Spectral thresholds in the bipartite stochastic block model Laura Florescu and Will Perkins NYU and U of Birmingham September 27, 2016 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 1 / 30
2 Stochastic Block Model Figure: Red edges added with P = p and blue edges with P = q. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 2 / 30
3 Community detection Goal: Detect communities in networks. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 3 / 30
4 Stochastic Block Model Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 4 / 30
5 Entries are not colored Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 5 / 30
6 Nor ordered Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 6 / 30
7 Stochastic Block Model First introduced by Holland, Laskey, Leinhardt in Motivation: discover communities in large networks. Theorem (Boppana, Dyer/Frieze, Snijders/Nowicki, Condon/Karp, McSherry, Bickel/Chen, etc) There are efficient algorithms for exactly recovering the true colors, provided that p q is large enough as n. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 7 / 30
8 Bipartite Stochastic Block Model Figure: Bipartite stochastic model on V 1 and V 2. Red edges added with P = δp(n 1, n 2 ) and blue edges with P = (2 δ)p(n 1, n 2 ). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 8 / 30
9 SBM Goal: get the planted assignment σ (on V 1 for bipartite stochastic model) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 9 / 30
10 SBM Goal: get the planted assignment σ (on V 1 for bipartite stochastic model) Detection: compute v that agrees with σ on 1/2 + ɛ fraction of vertices Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 9 / 30
11 SBM Goal: get the planted assignment σ (on V 1 for bipartite stochastic model) Detection: compute v that agrees with σ on 1/2 + ɛ fraction of vertices Recovery: compute v that agrees with σ on 1 o(1) fraction of vertices Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 9 / 30
12 Background Intermediate step in recovering solutions in planted problems [Feldman, Perkins, Vempala 14]. planted constraint satisfaction problems (CSP) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 10 / 30
13 Background Intermediate step in recovering solutions in planted problems [Feldman, Perkins, Vempala 14]. planted constraint satisfaction problems (CSP) Reducing planted problems on n variables will give vertex sets of size n 1 = n, n 2 = n k 1. (n 2 n 2 ) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 10 / 30
14 Unified Planted k-csp model Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30
15 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30
16 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30
17 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, define the random constraint satisfaction problem F Q,σ (n, m) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30
18 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, define the random constraint satisfaction problem F Q,σ (n, m) by drawing m k-clauses from C k (the set of all k-tuples) independently according to Q(σ(C)) Q σ (C) = C C k Q(σ(C )) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30
19 Unified Planted k-csp model Definition (Feldman-Perkins-Vempala 14) Given a planting distribution Q : {±1} k [0, 1], and an assignment σ {±1} n, define the random constraint satisfaction problem F Q,σ (n, m) by drawing m k-clauses from C k (the set of all k-tuples) independently according to Q(σ(C)) Q σ (C) = C C k Q(σ(C )) where σ(c) is the vector of values that σ assigns to the k-tuple of literals comprising C. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 11 / 30
20 Planted random k-sat and Goldreich PRG Planted random k-sat: Form a truth assignment φ of literals, then select each clause independently from the k-tuples of literals where at least one literal is set to 1 by φ. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 12 / 30
21 Planted random k-sat and Goldreich PRG Planted random k-sat: Form a truth assignment φ of literals, then select each clause independently from the k-tuples of literals where at least one literal is set to 1 by φ. Goldreich PRG: also add a 0/1, depending on a predicate evaluated on literals. (cryptography) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 12 / 30
22 Planted random k-sat and Goldreich PRG Planted random k-sat: Form a truth assignment φ of literals, then select each clause independently from the k-tuples of literals where at least one literal is set to 1 by φ. Goldreich PRG: also add a 0/1, depending on a predicate evaluated on literals. (cryptography) Feldman, Perkins, Vempala 14 gave a reduction of above and others to the BSBM. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 12 / 30
23 Information theory threshold When p = a/n and q = b/n Theorem (Mossel, Neeman, Sly, 2012) There is a test to distinguish the partition that succeeds with high probability if and only if a + b > 2 and (a b) 2 > 2(a + b). Proves conjecture of [Decelle, Krzakala, Moore, Zdeborova 13]. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 13 / 30
24 Computational threshold Dyer, Frieze 1989 p = na > q = nb fixed Condon, Karp 2001 a b n 1/2 McSherry 2001 a b b log n Coja-Oghlan 2010 a b b Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 14 / 30
25 Computational threshold Dyer, Frieze 1989 p = na > q = nb fixed Condon, Karp 2001 a b n 1/2 McSherry 2001 a b b log n Coja-Oghlan 2010 a b b Massoulié 2013 and Mossel, Neeman, Sly detection possible and efficient (a b) 2 > 2(a + b). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 14 / 30
26 Computational threshold Dyer, Frieze 1989 p = na > q = nb fixed Condon, Karp 2001 a b n 1/2 McSherry 2001 a b b log n Coja-Oghlan 2010 a b b Massoulié 2013 and Mossel, Neeman, Sly detection possible and efficient (a b) 2 > 2(a + b). Ingenious spectral methods Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 14 / 30
27 Previous work MNS Idea: nbhd of vertex in G(n, a/n, b/n) looks like a random labelled tree, where each child gives birth to Pois(a) vertices of same type, Pois(b) vertices of different type Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 15 / 30
28 Previous work MNS Idea: nbhd of vertex in G(n, a/n, b/n) looks like a random labelled tree, where each child gives birth to Pois(a) vertices of same type, Pois(b) vertices of different type show that conditioned on the labels of the bdry of the tree, the label of root is asymp indep of the rest of graph Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 15 / 30
29 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
30 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
31 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
32 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
33 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Can think of p c (T ) as the edge density at which the tree is connected. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
34 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Can think of p c (T ) as the edge density at which the tree is connected. trees with offspring distribution Pois( a+b a 2 ) and take 1 η = a+b. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
35 Binary symmetric broadcast model T : Galton-Watson tree with mean offspring distribution mean b. Root R labeled uniformly +1/ 1, each child takes parent s label with P = 1 η and opposite label with P = η. Goal: reconstruct value of R from labels at level n. Theorem (Evans, Kenyon, Peres, Schulman 00) Probability of correct reconstruction of value of R tends to 1 2 as n if (1 2η) 2 p c (T ), where p c (T ) is the critical probability for percolation on T. Can think of p c (T ) as the edge density at which the tree is connected. trees with offspring distribution Pois( a+b a 2 ) and take 1 η = a+b. Then threshold reduces to (a b) 2 2(a + b). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 16 / 30
36 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30
37 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition typical analysis of spectral algos: 2nd singular value > spectral norm of noise matrix M EM; Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30
38 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition typical analysis of spectral algos: 2nd singular value > spectral norm of noise matrix M EM; here λ 2 (EM) = Θ(p n 1 n 2 ), norm of noise M EM = Θ( pn 2 ). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30
39 Previous work - Spectral methods Applying some classical results to bipartite model using spectrum with p = O(1/n 1 ) recovers partition typical analysis of spectral algos: 2nd singular value > spectral norm of noise matrix M EM; here λ 2 (EM) = Θ(p n 1 n 2 ), norm of noise M EM = Θ( pn 2 ). Feldman, Perkins, Vempala 14: subsampled power iteration recovers partition whp with p = Õ((n 1n 2 ) 1/2 ) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 17 / 30
40 Questions 1 Here λ 2 < M EM. Is SVD doomed for p 1/n 1? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 18 / 30
41 Questions 1 Here λ 2 < M EM. Is SVD doomed for p 1/n 1? 2 What is the optimal threshold for detection in BSBM? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 18 / 30
42 Our results - sharp reconstruction/impossibility Theorem On the other hand, if n 2 n 1 and p 1 (δ 1) 2 n 1 n 2, then no algorithm can detect the partition. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 19 / 30
43 Our results - sharp reconstruction/impossibility Theorem On the other hand, if n 2 n 1 and p 1 (δ 1) 2 n 1 n 2, then no algorithm can detect the partition. Idea: Couple to a broadcast model on a multi-type Galton Watson tree. Show that conditioned on the labels of a log n bdry of the tree, the label of root is asymp indep of the rest of graph. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 19 / 30
44 Our results - sharp reconstruction/impossibility Theorem Let n 2 n 1. Then there is a polynomial-time algorithm that detects the partition V 1 = A 1 B 1 if p > 1 + ɛ (δ 1) 2 n 1 n 2 for any fixed ɛ > 0. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 20 / 30
45 Our results - sharp reconstruction/impossibility Theorem Let n 2 n 1. Then there is a polynomial-time algorithm that detects the partition V 1 = A 1 B 1 if p > 1 + ɛ (δ 1) 2 n 1 n 2 for any fixed ɛ > 0. Idea: reduce to SBM on graph on V 1 induced by paths of length 2 in bipartite graph. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 20 / 30
46 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
47 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
48 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
49 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
50 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
51 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
52 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Now compute a and b accordingly: a = (1 + ɛ)(2 2δ + δ2 ) (δ 1) 4 (1 + o(1)) b = (1 + ɛ)(2δ δ2 ) (δ 1) 4 (1 + o(1)) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
53 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Now compute a and b accordingly: a = (1 + ɛ)(2 2δ + δ2 ) (δ 1) 4 (1 + o(1)) b = (1 + ɛ)(2δ δ2 ) (δ 1) 4 (1 + o(1)) Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
54 Proof sketch Reduce to a graph G by replacing each path of length 2 from V 1 to V 2 back to V 1 with a single edge between the endpoints in V 1. E = (1+ɛ)2 n 1 (1 + o(1)) (δ 1) 4 Now we can compute p a = P[e = (u, v) σ(u) = σ(v)] and p b = P[e = (u, v) σ(u) σ(v)] Now compute a and b accordingly: a = (1 + ɛ)(2 2δ + δ2 ) (δ 1) 4 (1 + o(1)) b = (1 + ɛ)(2δ δ2 ) (δ 1) 4 (1 + o(1)) Apply criterion (a b) 2 (1 + ɛ)2(a + b). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 21 / 30
55 Implications for planted k-sat - detection in the block model exhibits a sharp threshold at m = Θ(n r/2 ) hyperedges/clauses Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 22 / 30
56 Implications for planted k-sat - detection in the block model exhibits a sharp threshold at m = Θ(n r/2 ) hyperedges/clauses Definition The distribution complexity r of the planting distribution Q is the smallest r > 0 so that Q is an (r 1)-wise independent distribution on {±} k but not r-wise independent. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 22 / 30
57 Spectral algorithms Standard SVD: Compute left singular vector of M (adjacency matrix) corresponding to 2nd singular value, round signs to get v; compare σ and v Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 23 / 30
58 Spectral algorithms Standard SVD: Compute left singular vector of M (adjacency matrix) corresponding to 2nd singular value, round signs to get v; compare σ and v Diagonal deletion SVD: Set diagonal entries of MM T to 0, compute second eigenvector, round signs to get v; compare σ and v Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 23 / 30
59 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30
60 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. 2 If p V > n 2/3 the partition. 1 n 1/3 2, then whp the standard SVD algorithm recovers Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30
61 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. 2 If p V > n 2/3 the partition. 1 n 1/3 2, then whp the standard SVD algorithm recovers Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30
62 Our results - spectral Theorem Let n 2 n 1, with n 1. Then 1 If p D > (n 1 n 2 ) 1/2, then whp the diagonal deletion SVD algorithm recovers the partition V 1 = A 1 B 1. 2 If p V > n 2/3 the partition. 1 n 1/3 2, then whp the standard SVD algorithm recovers When n 2 = n 2, p D n 3/2, p V n 4/3. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 24 / 30
63 Timeline Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 25 / 30
64 Our results Plot of correlation as a function of p(n1,n2), n1= 1000, n2=100000, delta=0.2 v dd ess Plot of the top eigenvalues of MM T for delta=0.5 and delta=1 for the various regimes p(n1,n2) delta=0.5 delta= correlation top 10 normalized eigenvalues p(n1,n2) x p1=(n1*n2)^( 1/2)*log(n1), p2=n1^( 2/3)*n2^( 2/3), p3=n1^( 2/3)*n2^( 1/3)*log(n1) Figure: Correlations of computed vectors with planted vector Figure: Eigenvalue separation Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 26 / 30
65 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
66 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
67 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
68 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
69 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) ( ( ) = O 1 log n 1 ); = O 1 log n 1 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
70 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) ( ( ) = O 1 log n 1 ); = O 1 log n 1 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 e 2 (B) σ/ n 1 = O(log 1 n 1 ) (by special case of Sin Theta Theorem). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
71 Thresholds origins DiagD: B = MM T D V, SVD: B = B + D V σ: partition, e 2 (B): second largest eigenvector of B, D V : degrees. DiagD: sin(b, EB) C B EB SVD: sin(b, EB ) C B EB + D V ED V λ 2 λ 2 by Sin Theta Theorem - sin of angle between eigenvector spaces norm/eigenvalue gap C n1/2 (δ 1) 2 n 1 n 2 ; C n1/2 p 2 1 n 1/2 2 p (2nd λ asymptotics) ( ( ) = O 1 log n 1 ); = O 1 log n 1 1 n 1/2 2 p+(c n 2 p log n 1 ) (δ 1) 2 n 1 n 2 p 2 e 2 (B) σ/ n 1 = O(log 1 n 1 ) (by special case of Sin Theta Theorem). Conclude by rounding signs of e 2 (B). Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 27 / 30
72 Conclusions Theorem Can efficiently detect partition in BSBM if p > 1+ɛ (δ 1) 2 n 1 n 2 Cannot detect if p 1 (δ 1) 2 n 1 n 2 Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 28 / 30
73 Conclusions Theorem Can efficiently detect partition in BSBM if p > 1+ɛ (δ 1) 2 n 1 n 2 Cannot detect if p 1 (δ 1) 2 n 1 n 2 spectral method still works if λ 2 norm of noise matrix Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 28 / 30
74 Conclusions Theorem Can efficiently detect partition in BSBM if p > 1+ɛ (δ 1) 2 n 1 n 2 Cannot detect if p 1 (δ 1) 2 n 1 n 2 spectral method still works if λ 2 norm of noise matrix modifying adjacency matrix improves recovery significantly Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 28 / 30
75 Open problems apply Diagonal Deletion type of algorithm for improvement over SVD in other problems? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 29 / 30
76 Open problems apply Diagonal Deletion type of algorithm for improvement over SVD in other problems? sharper detection thresholds for planted k-sat? Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 29 / 30
77 Thank you! Laura Florescu and Will Perkins Spectral thresholds in the bipartite stochastic block model 30 / 30
Spectral thresholds in the bipartite stochastic block model
JMLR: Workshop and Conference Proceedings vol 49:1 17, 2016 Spectral thresholds in the bipartite stochastic block model Laura Florescu New York University Will Perkins University of Birmingham FLORESCU@CIMS.NYU.EDU
More informationReconstruction in the Generalized Stochastic Block Model
Reconstruction in the Generalized Stochastic Block Model Marc Lelarge 1 Laurent Massoulié 2 Jiaming Xu 3 1 INRIA-ENS 2 INRIA-Microsoft Research Joint Centre 3 University of Illinois, Urbana-Champaign GDR
More informationBelief Propagation, Robust Reconstruction and Optimal Recovery of Block Models
JMLR: Workshop and Conference Proceedings vol 35:1 15, 2014 Belief Propagation, Robust Reconstruction and Optimal Recovery of Block Models Elchanan Mossel mossel@stat.berkeley.edu Department of Statistics
More informationStatistical and Computational Phase Transitions in Planted Models
Statistical and Computational Phase Transitions in Planted Models Jiaming Xu Joint work with Yudong Chen (UC Berkeley) Acknowledgement: Prof. Bruce Hajek November 4, 203 Cluster/Community structure in
More informationHow Robust are Thresholds for Community Detection?
How Robust are Thresholds for Community Detection? Ankur Moitra (MIT) joint work with Amelia Perry (MIT) and Alex Wein (MIT) Let me tell you a story about the success of belief propagation and statistical
More informationCommunity detection in stochastic block models via spectral methods
Community detection in stochastic block models via spectral methods Laurent Massoulié (MSR-Inria Joint Centre, Inria) based on joint works with: Dan Tomozei (EPFL), Marc Lelarge (Inria), Jiaming Xu (UIUC),
More informationReconstruction in the Sparse Labeled Stochastic Block Model
Reconstruction in the Sparse Labeled Stochastic Block Model Marc Lelarge 1 Laurent Massoulié 2 Jiaming Xu 3 1 INRIA-ENS 2 INRIA-Microsoft Research Joint Centre 3 University of Illinois, Urbana-Champaign
More informationCommunity Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria
Community Detection fundamental limits & efficient algorithms Laurent Massoulié, Inria Community Detection From graph of node-to-node interactions, identify groups of similar nodes Example: Graph of US
More informationSpectral Partitiong in a Stochastic Block Model
Spectral Graph Theory Lecture 21 Spectral Partitiong in a Stochastic Block Model Daniel A. Spielman November 16, 2015 Disclaimer These notes are not necessarily an accurate representation of what happened
More informationISIT Tutorial Information theory and machine learning Part II
ISIT Tutorial Information theory and machine learning Part II Emmanuel Abbe Princeton University Martin Wainwright UC Berkeley 1 Inverse problems on graphs Examples: A large variety of machine learning
More informationPredicting Phase Transitions in Hypergraph q-coloring with the Cavity Method
Predicting Phase Transitions in Hypergraph q-coloring with the Cavity Method Marylou Gabrié (LPS, ENS) Varsha Dani (University of New Mexico), Guilhem Semerjian (LPT, ENS), Lenka Zdeborová (CEA, Saclay)
More informationOn the Complexity of Random Satisfiability Problems with Planted Solutions
On the Complexity of Random Satisfiability Problems with Planted Solutions Vitaly Feldman Will Perkins Santosh Vempala Abstract The problem of identifying a planted assignment given a random k-sat formula
More informationCOMMUNITY DETECTION IN SPARSE NETWORKS VIA GROTHENDIECK S INEQUALITY
COMMUNITY DETECTION IN SPARSE NETWORKS VIA GROTHENDIECK S INEQUALITY OLIVIER GUÉDON AND ROMAN VERSHYNIN Abstract. We present a simple and flexible method to prove consistency of semidefinite optimization
More informationOn the Complexity of Random Satisfiability Problems with Planted Solutions
On the Complexity of Random Satisfiability Problems with Planted Solutions Vitaly Feldman Will Perkins Santosh Vempala arxiv:1311.4821v8 [cs.cc] 6 Mar 2018 Abstract The problem of identifying a planted
More informationIndependence and chromatic number (and random k-sat): Sparse Case. Dimitris Achlioptas Microsoft
Independence and chromatic number (and random k-sat): Sparse Case Dimitris Achlioptas Microsoft Random graphs W.h.p.: with probability that tends to 1 as n. Hamiltonian cycle Let τ 2 be the moment all
More informationConsistency Thresholds for the Planted Bisection Model
Consistency Thresholds for the Planted Bisection Model [Extended Abstract] Elchanan Mossel Joe Neeman University of Pennsylvania and University of Texas, Austin University of California, Mathematics Dept,
More informationHow robust are reconstruction thresholds for community detection?
How robust are reconstruction thresholds for community detection? The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published
More informationThe non-backtracking operator
The non-backtracking operator Florent Krzakala LPS, Ecole Normale Supérieure in collaboration with Paris: L. Zdeborova, A. Saade Rome: A. Decelle Würzburg: J. Reichardt Santa Fe: C. Moore, P. Zhang Berkeley:
More informationStatistical-Computational Phase Transitions in Planted Models: The High-Dimensional Setting
: The High-Dimensional Setting Yudong Chen YUDONGCHEN@EECSBERELEYEDU Department of EECS, University of California, Berkeley, Berkeley, CA 94704, USA Jiaming Xu JXU18@ILLINOISEDU Department of ECE, University
More informationMatrix estimation by Universal Singular Value Thresholding
Matrix estimation by Universal Singular Value Thresholding Courant Institute, NYU Let us begin with an example: Suppose that we have an undirected random graph G on n vertices. Model: There is a real symmetric
More informationSPIN SYSTEMS: HARDNESS OF APPROXIMATE COUNTING VIA PHASE TRANSITIONS
SPIN SYSTEMS: HARDNESS OF APPROXIMATE COUNTING VIA PHASE TRANSITIONS Andreas Galanis Joint work with: Jin-Yi Cai Leslie Ann Goldberg Heng Guo Mark Jerrum Daniel Štefankovič Eric Vigoda The Power of Randomness
More informationThe Tightness of the Kesten-Stigum Reconstruction Bound for a Symmetric Model With Multiple Mutations
The Tightness of the Kesten-Stigum Reconstruction Bound for a Symmetric Model With Multiple Mutations City University of New York Frontier Probability Days 2018 Joint work with Dr. Sreenivasa Rao Jammalamadaka
More informationAlgorithmic Barriers from Phase Transitions
Algorithmic Barriers from Phase Transitions Dimitris Achlioptas Amin Coja-Oghlan Abstract For many random Constraint Satisfaction Problems, by now there exist asymptotically tight estimates of the largest
More informationLecture 8: February 8
CS71 Randomness & Computation Spring 018 Instructor: Alistair Sinclair Lecture 8: February 8 Disclaimer: These notes have not been subjected to the usual scrutiny accorded to formal publications. They
More informationSVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices)
Chapter 14 SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Today we continue the topic of low-dimensional approximation to datasets and matrices. Last time we saw the singular
More informationStatistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices
Journal of Machine Learning Research 17 (2016) 1-57 Submitted 8/14; Revised 5/15; Published 4/16 Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number
More informationLecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora
princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora Scribe: Today we continue the
More informationAdvanced Algorithms 南京大学 尹一通
Advanced Algorithms 南京大学 尹一通 Constraint Satisfaction Problem variables: (CSP) X = {x 1, x2,..., xn} each variable ranges over a finite domain Ω an assignment σ ΩX assigns each variable a value in Ω constraints:
More informationSolving Random Satisfiable 3CNF Formulas in Expected Polynomial Time
Solving Random Satisfiable 3CNF Formulas in Expected Polynomial Time Michael Krivelevich and Dan Vilenchik Tel-Aviv University Solving Random Satisfiable 3CNF Formulas in Expected Polynomial Time p. 1/2
More informationQuantum walk algorithms
Quantum walk algorithms Andrew Childs Institute for Quantum Computing University of Waterloo 28 September 2011 Randomized algorithms Randomness is an important tool in computer science Black-box problems
More informationChasing the k-sat Threshold
Chasing the k-sat Threshold Amin Coja-Oghlan Goethe University Frankfurt Random Discrete Structures In physics, phase transitions are studied by non-rigorous methods. New mathematical ideas are necessary
More informationApproximation Algorithms
Approximation Algorithms Chapter 26 Semidefinite Programming Zacharias Pitouras 1 Introduction LP place a good lower bound on OPT for NP-hard problems Are there other ways of doing this? Vector programs
More informationSpectral Redemption: Clustering Sparse Networks
Spectral Redemption: Clustering Sparse Networks Florent Krzakala Cristopher Moore Elchanan Mossel Joe Neeman Allan Sly, et al. SFI WORKING PAPER: 03-07-05 SFI Working Papers contain accounts of scienti5ic
More informationAlgorithms Reading Group Notes: Provable Bounds for Learning Deep Representations
Algorithms Reading Group Notes: Provable Bounds for Learning Deep Representations Joshua R. Wang November 1, 2016 1 Model and Results Continuing from last week, we again examine provable algorithms for
More informationPCP Theorem and Hardness of Approximation
PCP Theorem and Hardness of Approximation An Introduction Lee Carraher and Ryan McGovern Department of Computer Science University of Cincinnati October 27, 2003 Introduction Assuming NP P, there are many
More informationCommunity Detection and Stochastic Block Models: Recent Developments
Journal of Machine Learning Research 18 (2018) 1-86 Submitted 9/16; Revised 3/17; Published 4/18 Community Detection and Stochastic Block Models: Recent Developments Emmanuel Abbe Program in Applied and
More informationLocal MAX-CUT in Smoothed Polynomial Time
Local MAX-CUT in Smoothed Polynomial Time Omer Angel 1 Sebastien Bubeck 2 Yuval Peres 2 Fan Wei 3 1 University of British Columbia 2 Microsoft Research Redmond 3 Stanford University November 9, 2016 Introduction
More informationSPARSE RANDOM GRAPHS: REGULARIZATION AND CONCENTRATION OF THE LAPLACIAN
SPARSE RANDOM GRAPHS: REGULARIZATION AND CONCENTRATION OF THE LAPLACIAN CAN M. LE, ELIZAVETA LEVINA, AND ROMAN VERSHYNIN Abstract. We study random graphs with possibly different edge probabilities in the
More informationPhase Transitions in Random Discrete Structures
Institut für Optimierung und Diskrete Mathematik Phase Transition in Thermodynamics The phase transition deals with a sudden change in the properties of an asymptotically large structure by altering critical
More informationLimitations in Approximating RIP
Alok Puranik Mentor: Adrian Vladu Fifth Annual PRIMES Conference, 2015 Outline 1 Background The Problem Motivation Construction Certification 2 Planted model Planting eigenvalues Analysis Distinguishing
More informationBenchmarking recovery theorems for the DC-SBM
Benchmarking recovery theorems for the DC-SBM Yali Wan Department of Statistics University of Washington Seattle, WA 98195-4322, USA Marina Meila Department of Statistics University of Washington Seattle,
More informationBenchmarking recovery theorems for the DC-SBM
Benchmarking recovery theorems for the DC-SBM Yali Wan Department of Statistics University of Washington Seattle, WA 98195-4322, USA yaliwan@washington.edu Marina Meilă Department of Statistics University
More informationLecture 21 (Oct. 24): Max Cut SDP Gap and Max 2-SAT
CMPUT 67: Approximation Algorithms Fall 014 Lecture 1 Oct. 4): Max Cut SDP Gap and Max -SAT Lecturer: Zachary Friggstad Scribe: Chris Martin 1.1 Near-Tight Analysis of the Max Cut SDP Recall the Max Cut
More informationarxiv: v1 [math.st] 26 Jan 2018
CONCENTRATION OF RANDOM GRAPHS AND APPLICATION TO COMMUNITY DETECTION arxiv:1801.08724v1 [math.st] 26 Jan 2018 CAN M. LE, ELIZAVETA LEVINA AND ROMAN VERSHYNIN Abstract. Random matrix theory has played
More informationA Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities
Journal of Advanced Statistics, Vol. 3, No. 2, June 2018 https://dx.doi.org/10.22606/jas.2018.32001 15 A Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities Laala Zeyneb
More informationInformation-theoretic bounds and phase transitions in clustering, sparse PCA, and submatrix localization
Information-theoretic bounds and phase transitions in clustering, sparse PCA, and submatrix localization Jess Banks Cristopher Moore Roman Vershynin Nicolas Verzelen Jiaming Xu Abstract We study the problem
More informationMIXING TIMES OF RANDOM WALKS ON DYNAMIC CONFIGURATION MODELS
MIXING TIMES OF RANDOM WALKS ON DYNAMIC CONFIGURATION MODELS Frank den Hollander Leiden University, The Netherlands New Results and Challenges with RWDRE, 11 13 January 2017, Heilbronn Institute for Mathematical
More informationThe giant component in a random subgraph of a given graph
The giant component in a random subgraph of a given graph Fan Chung, Paul Horn, and Linyuan Lu 2 University of California, San Diego 2 University of South Carolina Abstract We consider a random subgraph
More informationClustering from Sparse Pairwise Measurements
Clustering from Sparse Pairwise Measurements arxiv:6006683v2 [cssi] 9 May 206 Alaa Saade Laboratoire de Physique Statistique École Normale Supérieure, 24 Rue Lhomond Paris 75005 Marc Lelarge INRIA and
More informationELE 538B: Mathematics of High-Dimensional Data. Spectral methods. Yuxin Chen Princeton University, Fall 2018
ELE 538B: Mathematics of High-Dimensional Data Spectral methods Yuxin Chen Princeton University, Fall 2018 Outline A motivating application: graph clustering Distance and angles between two subspaces Eigen-space
More informationRandom Graph Coloring
Random Graph Coloring Amin Coja-Oghlan Goethe University Frankfurt A simple question... What is the chromatic number of G(n,m)? [ER 60] n vertices m = dn/2 random edges ... that lacks a simple answer Early
More informationMath 391: Midterm 1.0 Spring 2016
Math 391: Midterm 1.0 Spring 2016 James Skon skonjp@kenyon.edu Test date: October 5 Submission Instructions: Give all your assumptions. Show all your work. Be as complete as possible. Grading Problem 1
More informationCS 301: Complexity of Algorithms (Term I 2008) Alex Tiskin Harald Räcke. Hamiltonian Cycle. 8.5 Sequencing Problems. Directed Hamiltonian Cycle
8.5 Sequencing Problems Basic genres. Packing problems: SET-PACKING, INDEPENDENT SET. Covering problems: SET-COVER, VERTEX-COVER. Constraint satisfaction problems: SAT, 3-SAT. Sequencing problems: HAMILTONIAN-CYCLE,
More informationLecture 20: Course summary and open problems. 1 Tools, theorems and related subjects
CSE 533: The PCP Theorem and Hardness of Approximation (Autumn 2005) Lecture 20: Course summary and open problems Dec. 7, 2005 Lecturer: Ryan O Donnell Scribe: Ioannis Giotis Topics we discuss in this
More informationExtreme eigenvalues of Erdős-Rényi random graphs
Extreme eigenvalues of Erdős-Rényi random graphs Florent Benaych-Georges j.w.w. Charles Bordenave and Antti Knowles MAP5, Université Paris Descartes May 18, 2018 IPAM UCLA Inhomogeneous Erdős-Rényi random
More informationLinear inverse problems on Erdős-Rényi graphs: Information-theoretic limits and efficient recovery
Linear inverse problems on Erdős-Rényi graphs: Information-theoretic limits and efficient recovery Emmanuel Abbe, Afonso S. Bandeira, Annina Bracher, Amit Singer Princeton University Abstract This paper
More informationCOMP Analysis of Algorithms & Data Structures
COMP 3170 - Analysis of Algorithms & Data Structures Shahin Kamali Computational Complexity CLRS 34.1-34.4 University of Manitoba COMP 3170 - Analysis of Algorithms & Data Structures 1 / 50 Polynomial
More informationNP-complete Problems
NP-complete Problems HP, TSP, 3COL, 0/1IP Dimitris Diamantis µπλ November 6, 2014 Dimitris Diamantis (µπλ ) NP-complete Problems November 6, 2014 1 / 34 HAMILTON PATH is NP-Complete Definition Given an
More informationThe condensation phase transition in random graph coloring
The condensation phase transition in random graph coloring Victor Bapst Goethe University, Frankfurt Joint work with Amin Coja-Oghlan, Samuel Hetterich, Felicia Rassmann and Dan Vilenchik arxiv:1404.5513
More information1 Adjacency matrix and eigenvalues
CSC 5170: Theory of Computational Complexity Lecture 7 The Chinese University of Hong Kong 1 March 2010 Our objective of study today is the random walk algorithm for deciding if two vertices in an undirected
More informationON THE QUALITY OF SPECTRAL SEPARATORS
ON THE QUALITY OF SPECTRAL SEPARATORS STEPHEN GUATTERY AND GARY L. MILLER Abstract. Computing graph separators is an important step in many graph algorithms. A popular technique for finding separators
More informationSpectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods
Spectral Clustering Seungjin Choi Department of Computer Science POSTECH, Korea seungjin@postech.ac.kr 1 Spectral methods Spectral Clustering? Methods using eigenvectors of some matrices Involve eigen-decomposition
More informationRandom matrices: A Survey. Van H. Vu. Department of Mathematics Rutgers University
Random matrices: A Survey Van H. Vu Department of Mathematics Rutgers University Basic models of random matrices Let ξ be a real or complex-valued random variable with mean 0 and variance 1. Examples.
More informationApproximation & Complexity
Summer school on semidefinite optimization Approximation & Complexity David Steurer Cornell University Part 1 September 6, 2012 Overview Part 1 Unique Games Conjecture & Basic SDP Part 2 SDP Hierarchies:
More informationLecture 06 01/31/ Proofs for emergence of giant component
M375T/M396C: Topics in Complex Networks Spring 2013 Lecture 06 01/31/13 Lecturer: Ravi Srinivasan Scribe: Tianran Geng 6.1 Proofs for emergence of giant component We now sketch the main ideas underlying
More informationXVI International Congress on Mathematical Physics
Aug 2009 XVI International Congress on Mathematical Physics Underlying geometry: finite graph G=(V,E ). Set of possible configurations: V (assignments of +/- spins to the vertices). Probability of a configuration
More informationLabel Cover Algorithms via the Log-Density Threshold
Label Cover Algorithms via the Log-Density Threshold Jimmy Wu jimmyjwu@stanford.edu June 13, 2017 1 Introduction Since the discovery of the PCP Theorem and its implications for approximation algorithms,
More informationAlgorithms and Theory of Computation. Lecture 22: NP-Completeness (2)
Algorithms and Theory of Computation Lecture 22: NP-Completeness (2) Xiaohui Bei MAS 714 November 8, 2018 Nanyang Technological University MAS 714 November 8, 2018 1 / 20 Set Cover Set Cover Input: a set
More informationCMSC Discrete Mathematics FINAL EXAM Tuesday, December 5, 2017, 10:30-12:30
CMSC-37110 Discrete Mathematics FINAL EXAM Tuesday, December 5, 2017, 10:30-12:30 Name (print): Email: This exam contributes 40% to your course grade. Do not use book, notes, scrap paper. NO ELECTRONIC
More informationUnique Games Conjecture & Polynomial Optimization. David Steurer
Unique Games Conjecture & Polynomial Optimization David Steurer Newton Institute, Cambridge, July 2013 overview Proof Complexity Approximability Polynomial Optimization Quantum Information computational
More informationU.C. Berkeley Better-than-Worst-Case Analysis Handout 3 Luca Trevisan May 24, 2018
U.C. Berkeley Better-than-Worst-Case Analysis Handout 3 Luca Trevisan May 24, 2018 Lecture 3 In which we show how to find a planted clique in a random graph. 1 Finding a Planted Clique We will analyze
More informationEigenvalues, random walks and Ramanujan graphs
Eigenvalues, random walks and Ramanujan graphs David Ellis 1 The Expander Mixing lemma We have seen that a bounded-degree graph is a good edge-expander if and only if if has large spectral gap If G = (V,
More informationCS264: Beyond Worst-Case Analysis Lectures #9 and 10: Spectral Algorithms for Planted Bisection and Planted Clique
CS64: Beyond Worst-Case Analysis Lectures #9 and 10: Spectral Algorithms for Planted Bisection and Planted Clique Tim Roughgarden February 7 and 9, 017 1 Preamble Lectures #6 8 studied deterministic conditions
More informationWhat can be sampled locally?
What can be sampled locally? Yitong Yin Nanjing University Joint work with: Weiming Feng, Yuxin Sun Local Computation Locality in distributed graph algorithms. [Linial, FOCS 87, SICOMP 92] the LOCAL model:
More informationA spectral technique for random satisfiable 3CNF formulas
A spectral technique for random satisfiable 3CNF formulas Abraham Flaxman Carnegie Mellon University Mathematical Sciences Department abie@cmu.edu Abstract Let I be a random 3CNF formula generated by choosing
More informationarxiv: v2 [cs.ds] 3 Oct 2017
Orthogonal Vectors Indexing Isaac Goldstein 1, Moshe Lewenstein 1, and Ely Porat 1 1 Bar-Ilan University, Ramat Gan, Israel {goldshi,moshe,porately}@cs.biu.ac.il arxiv:1710.00586v2 [cs.ds] 3 Oct 2017 Abstract
More informationCommunity Detection. Data Analytics - Community Detection Module
Community Detection Data Analytics - Community Detection Module Zachary s karate club Members of a karate club (observed for 3 years). Edges represent interactions outside the activities of the club. Community
More informationPCPs and Inapproximability Gap-producing and Gap-Preserving Reductions. My T. Thai
PCPs and Inapproximability Gap-producing and Gap-Preserving Reductions My T. Thai 1 1 Hardness of Approximation Consider a maximization problem Π such as MAX-E3SAT. To show that it is NP-hard to approximation
More informationNon-Approximability Results (2 nd part) 1/19
Non-Approximability Results (2 nd part) 1/19 Summary - The PCP theorem - Application: Non-approximability of MAXIMUM 3-SAT 2/19 Non deterministic TM - A TM where it is possible to associate more than one
More informationReconstruction for Models on Random Graphs
1 Stanford University, 2 ENS Paris October 21, 2007 Outline 1 The Reconstruction Problem 2 Related work and applications 3 Results The Reconstruction Problem: A Story Alice and Bob Alice, Bob and G root
More informationClustering Algorithms for Random and Pseudo-random Structures
Abstract Clustering Algorithms for Random and Pseudo-random Structures Pradipta Mitra 2008 Partitioning of a set objects into a number of clusters according to a suitable distance metric is one of the
More informationMixing time and diameter in random graphs
October 5, 2009 Based on joint works with: Asaf Nachmias, and Jian Ding, Eyal Lubetzky and Jeong-Han Kim. Background The mixing time of the lazy random walk on a graph G is T mix (G) = T mix (G, 1/4) =
More informationRandom Lifts of Graphs
27th Brazilian Math Colloquium, July 09 Plan of this talk A brief introduction to the probabilistic method. A quick review of expander graphs and their spectrum. Lifts, random lifts and their properties.
More information(a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? Solution: dim N(A) 1, since rank(a) 3. Ax =
. (5 points) (a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? dim N(A), since rank(a) 3. (b) If we also know that Ax = has no solution, what do we know about the rank of A? C(A)
More informationGraph Partitioning. Under the Spectral Lens
Graph Partitioning Under the Spectral Lens Konstantinos Koiliaris A Dissertation Presented to the Faculty of MPLA in Candidacy for the Degree of Master of Science Recommended for Acceptance by the Department
More informationCS369N: Beyond Worst-Case Analysis Lecture #4: Probabilistic and Semirandom Models for Clustering and Graph Partitioning
CS369N: Beyond Worst-Case Analysis Lecture #4: Probabilistic and Semirandom Models for Clustering and Graph Partitioning Tim Roughgarden April 25, 2010 1 Learning Mixtures of Gaussians This lecture continues
More informationP, NP, NP-Complete, and NPhard
P, NP, NP-Complete, and NPhard Problems Zhenjiang Li 21/09/2011 Outline Algorithm time complicity P and NP problems NP-Complete and NP-Hard problems Algorithm time complicity Outline What is this course
More informationInformation Recovery from Pairwise Measurements
Information Recovery from Pairwise Measurements A Shannon-Theoretic Approach Yuxin Chen, Changho Suh, Andrea Goldsmith Stanford University KAIST Page 1 Recovering data from correlation measurements A large
More informationCS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory
CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory Tim Roughgarden & Gregory Valiant May 2, 2016 Spectral graph theory is the powerful and beautiful theory that arises from
More informationTight Bounds For Random MAX 2-SAT
Tight Bounds For Random MAX 2-SAT MohammadTaghi Hajiaghayi Jeong Han Kim Abstract For a conjunctive normal form formula F with n variables and m = cn 2-variable clauses (c is called the density), denote
More informationSpectral gap in bipartite biregular graphs and applications
Spectral gap in bipartite biregular graphs and applications Ioana Dumitriu Department of Mathematics University of Washington (Seattle) Joint work with Gerandy Brito and Kameron Harris ICERM workshop on
More informationLecture 18: More NP-Complete Problems
6.045 Lecture 18: More NP-Complete Problems 1 The Clique Problem a d f c b e g Given a graph G and positive k, does G contain a complete subgraph on k nodes? CLIQUE = { (G,k) G is an undirected graph with
More informationLecture 8: Statistical vs Computational Complexity Tradeoffs via SoS
CS369H: Hierarchies of Integer Programming Relaxations Spring 2016-2017 Lecture 8: Statistical vs Computational Complexity Tradeoffs via SoS Pravesh Kothari Scribes: Carrie and Vaggos Overview We show
More informationNotes for Lecture 2. Statement of the PCP Theorem and Constraint Satisfaction
U.C. Berkeley Handout N2 CS294: PCP and Hardness of Approximation January 23, 2006 Professor Luca Trevisan Scribe: Luca Trevisan Notes for Lecture 2 These notes are based on my survey paper [5]. L.T. Statement
More informationGoldreich s PRG: Evidence for near-optimal polynomial stretch
Goldreich s PRG: Evidence for near-optimal polynomial stretch Ryan O Donnell Computer Science Department Carnegie Mellon University Pittsburgh, PA odonnell@cs.cmu.edu David Witmer Computer Science Department
More informationthe configuration space of Branched polymers R. Kenyon, P. Winkler
the configuration space of Branched polymers R. Kenyon, P. Winkler artificial blood catalyst recovery Branched polymers (dendrimers) in modern science artificial photosynthesis A branched polymer is a
More informationA reverse Sidorenko inequality Independent sets, colorings, and graph homomorphisms
A reverse Sidorenko inequality Independent sets, colorings, and graph homomorphisms Yufei Zhao (MIT) Joint work with Ashwin Sah (MIT) Mehtaab Sawhney (MIT) David Stoner (Harvard) Question Fix d. Which
More informationApproximating MAX-E3LIN is NP-Hard
Approximating MAX-E3LIN is NP-Hard Evan Chen May 4, 2016 This lecture focuses on the MAX-E3LIN problem. We prove that approximating it is NP-hard by a reduction from LABEL-COVER. 1 Introducing MAX-E3LIN
More informationSpectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity
Spectral Graph Theory and its Applications Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Outline Adjacency matrix and Laplacian Intuition, spectral graph drawing
More information