Lecture 7: The Subgraph Isomorphism Problem

Similar documents
ON THE AC 0 COMPLEXITY OF SUBGRAPH ISOMORPHISM

Lecture 7: February 6

RANDOM GRAPHS. Joel Spencer ICTP 12

Small subgraphs of random regular graphs

COMP 382: Reasoning about algorithms

Combinatorial Optimization

Notes 6 : First and second moment methods

FIRST ORDER SENTENCES ON G(n, p), ZERO-ONE LAWS, ALMOST SURE AND COMPLETE THEORIES ON SPARSE RANDOM GRAPHS

Lecture 24: Approximate Counting

Algorithms and Theory of Computation. Lecture 2: Big-O Notation Graph Algorithms

Subhypergraph counts in extremal and random hypergraphs and the fractional q-independence

18.5 Crossings and incidences

Asymptotic Analysis. Slides by Carl Kingsford. Jan. 27, AD Chapter 2

Decomposing oriented graphs into transitive tournaments

RANDOM GRAPHS BY JOEL SPENCER. as n.

Stein s Method for concentration inequalities

BIRTHDAY PROBLEM, MONOCHROMATIC SUBGRAPHS & THE SECOND MOMENT PHENOMENON / 23

A = A U. U [n] P(A U ). n 1. 2 k(n k). k. k=1

3.1 Asymptotic notation

Avoider-Enforcer games played on edge disjoint hypergraphs

Vertex colorings of graphs without short odd cycles

Applications of the Lopsided Lovász Local Lemma Regarding Hypergraphs

Hard-Core Model on Random Graphs

(K2) For sum of the potential differences around any cycle is equal to zero. r uv resistance of {u, v}. [In other words, V ir.]

Katarzyna Mieczkowska

Taking Stock. IE170: Algorithms in Systems Engineering: Lecture 3. Θ Notation. Comparing Algorithms

Lecture 22: Counting

Induced Turán numbers

The concentration of the chromatic number of random graphs

Online Balanced Graph Avoidance Games

Cycle lengths in sparse graphs

Lecture 5: January 30

2.2 Asymptotic Order of Growth. definitions and notation (2.2) examples (2.4) properties (2.2)

Random Graphs. 7.1 Introduction

Clique is hard on average for regular resolution

ON SENSITIVITY OF k-uniform HYPERGRAPH PROPERTIES

On the Logarithmic Calculus and Sidorenko s Conjecture

Short cycles in random regular graphs

Lecture 4: Phase Transition in Random Graphs

Graph coloring, perfect graphs

Growth of Functions (CLRS 2.3,3)

Finding and Counting Given Length Cycles 1. multiplication. Even cycles in undirected graphs can be found even faster. A C 4k 2 in an undirected

Edge-disjoint induced subgraphs with given minimum degree

Zero-Sum Flows in Regular Graphs

Packing and decomposition of graphs with trees

Jacques Verstraëte

Off-diagonal hypergraph Ramsey numbers

The Rainbow Turán Problem for Even Cycles

Limitations of Algorithm Power

RECAP: Extremal problems Examples

On the Sensitivity of Cyclically-Invariant Boolean Functions

NP-Completeness Review

< k 2n. 2 1 (n 2). + (1 p) s) N (n < 1

Packing triangles in regular tournaments

Asymptotically optimal induced universal graphs

1 Perfect Matching and Matching Polytopes

SUBGRAPHS OF WEAKLY QUASI-RANDOM ORIENTED GRAPHS. Omid Amini, Simon Griffiths & Florian Huc

Some hard families of parameterised counting problems

Asymptotically optimal induced universal graphs

P, NP, NP-Complete, and NPhard

Copyright 2000, Kevin Wayne 1

An alternative proof of the linearity of the size-ramsey number of paths. A. Dudek and P. Pralat

Sensitivity, Block Sensitivity and Certificate Complexity of Boolean Functions (Master s Thesis)

Balanced online Ramsey games in random graphs

Asymptotic Analysis 1

Induced Saturation of Graphs

The regularity method and blow-up lemmas for sparse graphs

Packing and Covering Dense Graphs

An algorithmic framework for obtaining lower bounds for random Ramsey problems

The neighborhood complex of a random graph

Lecture 3: Nondeterminism, NP, and NP-completeness

arxiv: v1 [math.co] 22 Aug 2014

Lecture 14 - P v.s. NP 1

New Results on Graph Packing

Induced subgraphs of Ramsey graphs with many distinct degrees

Induced subgraphs with many repeated degrees

Finding a Heaviest Triangle is not Harder than Matrix Multiplication

Induced subgraphs of graphs with large chromatic number. IX. Rainbow paths

The 3-rainbow index of graph operations

4 Packing T-joins and T-cuts

Lecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality

Acyclic subgraphs with high chromatic number

MA008/MIIZ01 Design and Analysis of Algorithms Lecture Notes 2

Graph Theory. Thomas Bloom. February 6, 2015

Random Geometric Graphs

Testing Graph Isomorphism

CSC Design and Analysis of Algorithms. Lecture 1

Packing k-partite k-uniform hypergraphs

Quasi-randomness of graph balanced cut properties

arxiv: v1 [math.co] 24 Apr 2014

Ring Sums, Bridges and Fundamental Sets

CSE 421: Intro Algorithms. 2: Analysis. Winter 2012 Larry Ruzzo

The Frankl-Wilson theorem and some consequences in Ramsey theory and combinatorial geometry

Random Lifts of Graphs

COS597D: Information Theory in Computer Science October 5, Lecture 6

High-dimensional permutations

Acyclic and Oriented Chromatic Numbers of Graphs

Rational exponents in extremal graph theory

A New Random Graph Model with Self-Optimizing Nodes: Connectivity and Diameter

The Probabilistic Method

Transcription:

CSC2429, MAT1304: Circuit Complexity October 25, 2016 Lecture 7: The Subgraph Isomorphism Problem Instructor: Benjamin Rossman 1 The Problem SUB(G) Convention 1 (Graphs). Graphs are finite simple graphs with no isolated vertices. Formally, a graph G consists of a finite vertex set V (G) and an edge set E(G) ( ) V (G) 2 such that e E(G) e = V (G). There are 2 E(G) subgraphs of G. (We may thus identify subgraphs G with elements of the Boolean hypercube {0, 1} E(G).) For k 2: K k denotes the complete graph of order k, and P k denotes the path of order k. Definition 2 (The Blow-Up G n of a Graph G). For a graph G and n N, let G n denote the graph V (G n ) = {v (i) : v V (G), i [n]}, E(G n ) = { {v (i), w (j) } : {v, w} E(G), i, j [n] }. For α [n] V (G), let G (α) denote the graph V (G (α) ) = { v (αv) : v V (G) }, E(G (α) ) = { {v (αv), w (αw) } : {v, w} E(G) }. Each G (α) is an isomorphic copy of G that sits inside G n. We have a similar notation for subgraphs of G and their copies inside G n. For H G and β [n] V (H), let H (β) denote the graph For X G n and H G, let V (H (β) ) = { v (βv) : v V (H) }, E(H (β) ) = { {v (βv), w (βw) } : {v, w} E(H) }. Sub H (X) = { H (β) : β [n] V (H) such that H (β) X }, sub H (X) = Sub H (X). We refer to elements of Sub H (X) as H-subgraphs of X. Definition 3 (The Colored G-Subgraph Isomorphism Problem). For a graph G and n N, let SUB(G, n) be the problem, given a subgraph X G n, of determining whether or not there exists α [n] V (G) such that G (α) X. For complexity purposes, we regard SUB(G, n) as a Boolean function {0, 1} E(G) n2 {0, 1} with variables {X e } e E(G n ). We refer to SUB(G) = {SUB(G, n)} n N as the colorful G-subgraph isomorphism problem. 1

Note the brute-force upper bound: SUB(G) is solvable by monotone depth-2 (OR AND) formulas of size O( E(G) n V (G) ): α [n] V (G) {v,w} E(G) X {v (αv),w (αw) }. As we will see later on, there is a better upper bound (for monotone circuits of O( V (G) ) depth) in terms of the tree-width of G. In the next few lectures, we will show nearly matching lower bounds for bounded-depth circuits, as well as monotone circuits. Two important special cases of the problem SUB(G) are when G = K k (the k-clique problem ) and G = P k (the distance-k connectivity problem ). 2 Relationship to SUB uncolored (G) We remark on the relationship between SUB(G) and its uncolored version, denoted SUB uncolored (G). This is the problem: given a graph X K n (i.e. a graph with V (X) {1,..., n}), determine whether or not X contains a subgraph isomorphic to G. (Note that k-clique is precisely the problem SUB uncolored (K k ).) There is a well-known reduction from SUB uncolored (G) to SUB(G). Theorem 4 ( Color-Coding Technique, Alon-Yuster-Zwick 1995). There is a quasi-linear size AC 0 reduction from SUB uncolored (G) to SUB(G). Proof Sketch. We are given an uncolored graph X K n and wish to determine whether or not it contains a subgraph isomorphic to G. For a function ϕ : [n] V (G), let X ϕ G n be the graph with E(X ϕ ) = { {v (i), w (j) } : {v, w} E(G) and {i, j} E(X) and ϕ(i) ϕ(j) }. Suppose we have a family Φ of functions ϕ : [n] V (G) with the property that, for every U [n] with U = V (G), there exists ϕ Φ with ϕ(u) = V (G). Such a family Φ is called a k-perfect family of hash functions where k = V (G) and explicit constructions are known of size O k (log n) (the constant here is exponential in k). The reduction from SUB uncolored (G) to SUB(G) works as follows: For each ϕ Φ, test whether sub G (X ϕ ) 1. If sub G (X ϕ ) 1 for any ϕ Φ, then accept (i.e. conclude that X contain a subgraph isomorphic to G); otherwise, reject. To see that this reduction is correct, note that if X does not contain a subgraph isomorphic to G, then clearly sub G (X ϕ ) = 0 for every function ϕ : [n] V (G). On the other hand, if X contains a subgraph G isomorphic to G, then for any ϕ Φ with ϕ(v (G )) = V (G), we have sub G (X ϕ ) 1. The fact that this reduction can be implemented by AC 0 circuits of size O(n log n) was noted by Amano (2010). In many cases, there is a trivial reduction in the opposite direction from SUB(G) to SUB uncolored (G). Definition 5. A graph G is a core if every homomorphism G G is one-to-one (and hence an automorphism). For example, the complete graph K k is a core. 2

Lemma 6. If G is a core and X G n, then X contains a subgraph isomorphic to G if and only if sub G (X) 1. Proof. In one direction, if sub G (X) 1 then clearly X contains a subgraph isomorphic to G. In the other direction, assume X contains a subgraph G isomorphic to G. Let ϕ : V (G ) = V (G) be an isomorphism. Let π : V (G n ) V (G) be the homomorphism v (i) v. Then π ϕ 1 is a homomorphism G G and, therefore, one-to-one (since G is a core). It follows that there exists an automorphism σ of G such that π ϕ 1 σ : V (G) V (G) is the identity function. Then G (α) X where α [n] V (G) is given by α v = ϕ 1 (σ(v)). Corollary 7. If G is a core, then there is a linear-size AC 0 reduction (in fact, monotone projection) from SUB uncolored (G) to SUB(G). Proof. This reduction simply maps X regarded as an instance of SUB(G, n) (i.e. a subgraph G n ) to X regarded as an instance of SUB uncolored (G, V (G) n) (i.e. a subgraph of K V (G) n ). Theorem 4 and Corollary 7 show that SUB(G) and SUB uncolored (G) are equivalent problems (up to quasi-linear AC 0 reductions) for cores G such as K k. Henceforth, we shall focus almost entirely on the colored problem SUB(G), which appears to be more well-structured and better behaved than the uncolored version. 3 Threshold Weightings For the uncolored subgraph isomorphism problem SUB uncolored (G), there is a canonical choice of input distribution (i.e. random graph) with respect to which the corresponding Boolean function {0, 1} (n 2) {0, 1} is balanced (i.e. has expectation bounded away from 0 and 1). This is the Erdos-Renyi random graph G(n, p) for an appropriate threshold value of p (see Example 15). In the colored setting, we can identify a family of product distributions (what might be called G-colored Erdos-Renyi random graphs ) with respect to SUB(G) is balanced. This family is nicely indexed by a convex polytope of edge-weightings E(G) [0, 2]. Definition 8. A threshold weighting on G is a function θ : E(G) R 0 such that 1. e E(H) θ(e) V (H) for all H G, 2. e E(G) θ(e) = V (G). For H G, we define θ (H) := V (H) θ(e). e E(H) Thus, condition (1) states that θ (H) 0 for all H G and condition (2) states that θ (G) = 0. We say that θ is strict if θ (H) > 0 for all H G. (Note that θ is strict G is connected.) Example 9. 1. Here is a threshold weighting on a particular 4-vertex graph: 3

2. The set of threshold weightings on G is a polytope in R E(G). In particular, if θ 1, θ 2 are threshold weightings, then so is every convex combination λθ 1 + (1 λ)θ 2 for 0 < λ < 1. 3. If G is r-regular, then the constant function θ 2/r is a threshold weighting. Exercise: If G is an r-regular expander, then 2/r (H) Ω(min{ V (H), V (G) V (H) }). 4. More generally, the function θ({v, w}) := 1 deg(v) + 1 deg(w) 4 The Random Graph X θ G n is a threshold weighting. Definition 10. For every threshold weighting θ on G, we define a random subgraph X θ,n G n (we write X θ when n is understood from context) where, independently for all {v (i), w (j) } E(G n ), P [ {v (i), w (j) } E(X θ ) ] = n θ({v,w}). We will be interested in the average-case complexity of SUB(G) on X θ. Definition 11. If f = (f n ) is a sequence of Boolean functions f n : {subgraphs of G n } {0, 1}, we say that f solves SUB(G) in the average-case on X θ if lim P [ f n (X θ,n ) = 1 sub G (X θ,n ) 1 ] = 1. n Clearly, the worst-case complexity of SUB(G) is lower-bounded by the maximum over threshold weightings θ of its average-case complexity with respect to X θ. Our method for proving lower bounds on SUB(G) will consist of two parts: (i) For each threshold weighting θ, we characterize the average-case AC 0 circuit size of SUB(G) on X θ in terms of a combinatorial parameter κ θ (G) (defined in the next lecture): we show a lower bound of n κ θ(g) o(1), as well as an upper bound of n 2κ θ(g)+o(1). (ii) For every graph G, we show that there exists a threshold weighting θ such that κ θ (G) = Ω(tw(G)/ log tw(g)) where tw(g) is the tree-width of G. In special cases, such as G = K k or in general when G is an expander, we achieve optimal bound κ θ (G) = Ω( V (G) ). Together (i) and (ii) imply nearly tight lower bounds on the worst-case (really: worst average-case) AC 0 circuit size of of SUB(G). The upper and lower bounds in (i) rely on properties of the random graph X θ. In particular, we require a good understanding of the subgraph counts sub H (X θ ) for H G. Lemma 12. For all H G, we have E [ sub H (X θ ) ] = n θ(h). In particular, E [ sub G (X θ ) ] = 1. Proof. Direct from definitions. In the case where θ is strict, we can say a lot more. Theorem 13. If θ is strict, then the following hold: 4

1. sub G (X θ ) is asymptotically Poisson(1). That is, for every k N, lim P [ sub G (X θ ) = k ] = 1 n ek!. In particular, lim n P[ sub G(X θ ) 1 ] = 1 1 e (i.e. SUB(G) is balanced with respect to X θ, justifying terminology threshold weighting ). 2. For all H G, for a constant c > 0 depending on θ and H. P [ sub H (X θ ) < 1 2 n θ(h) ] exp ( Ω(n c ) ), P [ sub H (X θ ) > 3 2 n θ(h) ] exp ( Ω(n c ) ). Time permitting, we will see elements of the proof of Theorem 13 in a future lecture. 5 More Examples of Threshold Weightings Example 14 (Threshold weightings from Markov chains). If M V (G) V (G) [0, 1] is the transition matrix for any Markov chain on G, that is, 1. w M(v, w) = 1 for every v V (G) and Then 2. M(v, w) > 0 = {v, w} E(G). θ({v, w}) := M(v, w) + M(w, v) is a threshold weighting for G. In this case, we have θ (H) = (v,w) : v V (H) and {v,w} E(G)\E(H) M(v, w), that is, θ (H) is equal to the amount of M-flow that leaves V (H) via edges in E(G) \ E(H). This is derived as follows: ( ) M (H) = V (H) M(v, w) + M(w, v) = v V (H) = = v V (H) 1 {v,w} E(H) w : {v,w} E(H) w : {v,w}/ E(H) M(v, w) M(v, w) (v,w) : v V (H) and {v,w} E(G)\E(H) M(v, w). 5

Example 15. The threshold exponent of G is defined by V (H) τ(g) := min H G E(H). This constant characterizes the appearance of subgraphs isomorphic to G in the Erdos-Renyi random graph G(n, p): for p(n) = n τ(g), the probability that G(n, p) contains a subgraph isomorphic to G is bounded away from 0 and 1 (while this probability tends to 0 or 1 for p n τ(g) and p n τ(g) respectively). The Erdos-Renyi random graph G(n, n τ(g) ) is a canonical input distribution for studying the average-case complexity of SUB uncolored (G). The average-case analysis of SUB uncolored (G) on G(n, n θ(g) ) is essentially equivalent to the average-case analysis of SUB(G) on X θ for the threshold weighting θ : E(G) [0, 2] defined by τ(g) if {v, w} E(H) for any H G with τ(g) = θ({v, w}) := 0 otherwise. V (H) E(H), All of the lower/upper bounds we show for average-case SUB(G) on X θ carry over to SUB uncolored (G) on G(n, n θ(g) ) for this particular setting of θ. (This is not by a formal reduction; I am only claiming that the calculations work out similarly.) We get more powerful results by working with the colored version SUB(G), thanks to the freedom to choose optimal θ. For this reason, going forward we will focus entirely on the average-case complexity of SUB(G) on X θ and leave the uncolored setting as a special case. Remark 16. G is balanced if τ(g) = V (G) / E(G), and G is strictly balanced if τ(g) < V (H) / E(H) for all H G. If G is balanced, then the above-defined θ is identically τ(g). If G is strictly balanced, then this θ is strict. 6