Algorithms and Complexity Results for Exact Bayesian Structure Learning

Size: px
Start display at page:

Download "Algorithms and Complexity Results for Exact Bayesian Structure Learning"

Transcription

1 Algorithms and Complexity Results or Exact Bayesian Structure Learning Sebastian Ordyniak and Stean Szeider Institute o Inormation Systems Vienna University o Technology, Austria Abstract Bayesian structure learning is the NP-hard problem o discovering a Bayesian network that optimally represents a given set o training data. In this paper we study the computational worst-case complexity o exact Bayesian structure learning under graph theoretic restrictions on the super-structure. The super-structure (a concept introduced by Perrier, Imoto, and Miyano, JMLR 2008) is an undirected graph that contains as subgraphs the skeletons o solution networks. Our results apply to several variants o score-based Bayesian structure learning where the score o a network decomposes into local scores o its nodes. Results: We show that exact Bayesian structure learning can be carried out in non-uniorm polynomial time i the super-structure has bounded treewidth and in linear time i in addition the super-structure has bounded maximum degree. We complement this with a number o hardness results. We show that both restrictions (treewidth and degree) are essential and cannot be dropped without loosing uniorm polynomial time tractability (subject to a complexity-theoretic assumption). Furthermore, we show that the restrictions remain essential i we do not search or a globally optimal network but we aim to improve a given network by means o at most k arc additions, arc deletions, or arc reversals (k-neighborhood local search). Keywords: Bayesian structure learning, super-structure, treewidth, ixed-parameter tractability, parameterized complexity 1 Introduction Bayesian structure learning is the important task o discovering a Bayesian network that represents a given set o training data. Unortunately the problem is NPhard (Chickering 1996). This predicament has motivated Research supported by the European Research Council, grant reerence a wide range o approaches, including heuristicsbased algorithms that compute near-optimal solutions (see, e.g., Heckerman, Geiger, & Chickering 1995; Chickering 2003). In recent years several exponential-time algorithms or exact Bayesian structure learning have been proposed (see, e.g., Parviainen & Koivisto 2009; Perrier, Imoto, & Miyano 2008; Silander & Myllymäki 2006). Recent progress has been made to limit the space requirement by advanced dynamic programming techniques (Parviainen & Koivisto 2009) and to limit the exponential time requirement by restricting the search to networks whose skeletons are subgraphs o a given undirected graph that speciies a super-structure (Perrier, Imoto, & Miyano 2008). Recent research indicates that the super-structure can be practically computed and eectually used to guide the search or near-optimal Bayesian networks (Mukund & Je 2004; Anton & Carlos 2007; Pieter, Daphne, & Andrew 2006). In this paper we study the worst-case time complexity o exact Bayesian structure learning under graph-theoretic restrictions on the super-structure. In particular, we consider bounds on the treewidth and on the maximum degree o super-structures. Our results are as ollows: (1) Exact Bayesian structure learning is easible in non-uniorm polynomial time i the treewidth o the super-structure is bounded by an arbitrary constant. (2) Exact Bayesian structure learning is easible in linear time i both treewidth and maximum degree o the super-structure are bounded by arbitrary constants. By non-uniorm we mean that the order o the polynomial depends on the treewidth. We obtain results (1) and (2) by means o a dynamic programming algorithm along a decomposition tree o the super-structure. We show that in a certain sense both results are optimal: (3) Exact Bayesian structure learning or instances with super-structures o maximum degree 4 (but unbounded treewidth) is not easible in polynomial time unless P = NP. Thus, in (1) and (2) we cannot drop the bound on the treewidth.

2 (4) Exact Bayesian structure learning or instances with super-structures o bounded treewidth (but unbounded maximum degree) is not easible in uniorm polynomial time unless FPT = W[1]. Thus, in (2) we cannot drop the bound on the degree. FPT W[1] is a widely accepted complexity theoretic assumption (Downey & Fellows 1999). For example, FPT = W[1] implies the (unlikely) existence o a 2 o(n) algorithm or n-variable 3SAT (Impagliazzo, Paturi, & Zane 2001; Flum & Grohe 2006). Result (3) easily ollows rom Chickering s reduction (Chickering 1996). We establish result (4) by means o a parameterized reduction rom a variant o the Maximum Clique problem. We will provide necessary background on parameterized complexity and parameterized reductions in Section 2.2. We urther extend the hardness results (3) and (4) rom the search or an optimal network to the presumably easier problem o improving a given network by changing at most k o its arcs (with the operations o arc addition, arc deletion, and arc reversal). We reer to this restricted problem as k-neighborhood local search or k-local search or short. By trivial reasons k-local search is easible in non-uniorm polynomial time n O(k). We show, however, that uniorm polynomial-time tractability is again unlikely: (5) k-local search or instances with super-structures o bounded maximum degree is not possible in uniorm polynomial time unless FPT = W[1]. (6) k-local search or instances with super-structures o bounded treewidth is not possible in uniorm polynomial time unless FPT = W[1]. We obtain result (5) by a reduction rom the Red/Blue Non-Blocker problem (Downey & Fellows 1999). I both the maximum degree and the treewidth are bounded, then k-local search is easible in linear time, however this result is subsumed by (2). Both hardness results (5) and (6) even hold or several cases where not all o the three operations (addition, deletion, reversal) are available, or example i arc reversal is the only operation. 2 Preliminaries In this section we will introduce the basic concepts and notions that we will use throughout the paper. 2.1 Basic Graph Theory We will assume that the reader is amiliar with basic graph theory. We consider undirected graphs and directed graphs (digraphs). A DAG is a directed acyclic graph. We write V (G) = V and E(G) = E or the sets o vertices and edges o a (directed or undirected) graph G = (V, E). We denote an undirected edge between vertices u and v as {u, v} and a directed edge (or arc), directed rom u to v as (u, v). For a subset V V we write G[V ] to denote the induced subgraph G = (V, E ) where E = { e V : e E } i G is undirected and E = { e V V : e E } i G is directed. I G is a digraph we deine P G (v) = { u V (G) : (u, v) E(G) } as the set o parents o v in G. An undirected graph G = (V, E ) is the skeleton o G i E = { {u, v} : (u, v) E(G) }. 2.2 Parameterized Complexity Parameterized complexity provides a theoretical ramework to distinguish between uniorm and non-uniorm polynomial-time tractability with respect to a parameter. An instance o a parameterized problem is a pair (I, k) where I is the main part and k is the parameter; the latter is usually a non-negative integer. A parameterized problem is ixed-parameter tractable i there exist a computable unction and a constant c such that instances (I, k) o size n can be solved in time O((k)n c ). FPT is the class o all ixed-parameter tractable decision problems. Fixed-parameter tractable problems are also called uniorm polynomial-time tractable because i k is considered constant, then instances with parameter k can be solved in polynomial time where the order o the polynomial is independent o k (in contrast to non-uniorm polynomial-time running times such as n k ). Parameterized complexity oers a completeness theory similar to the theory o NP-completeness. One uses parameterized reductions which are many-one reductions where the parameter or one problem maps into the parameter or the other. More speciically, problem L reduces to problem L i there is a mapping R rom instances o L to instances o L such that (i) (I, k) is a yes-instance o L i and only i (I, k ) = R(I, k) is a yes-instance o L, (ii) k = g(k) or a computable unction g, and (iii) R can be computed in time O((k)n c ) where is a computable unction, c is a constant, and n denotes the size o (I, k). The parameterized complexity class W[1] is considered as the parameterized analog to NP. For example, the parameterized Maximum Clique problem (given a graph G and a parameter k 0, does G contain a complete subgraph on k vertices?) is W[1]-complete under parameterized reductions. Note that there exists a trivial non-uniorm polynomial-time n k algorithm or the Maximum Clique problems that checks all sets o k vertices. 2.3 Tree Decompositions Treewidth is an important graph parameter that indicates in a certain sense the tree-likeness o a graph. The treewidth o a graph G = (V, E) is deined via the ollowing notion o decomposition: a tree decomposition o G is a pair (T, χ) where T is a tree and χ is a labeling unction with χ(t) V or every tree node t, such that the ollowing conditions hold:

3 1. Every vertex o G occurs in χ(t) or some tree node t. 2. For every edge {u, v} o G there is a tree node t such that u, v χ(t). 3. For every vertex v o G, the tree nodes t with v χ(t) induce a connected subtree o T. The width o a tree decomposition (T, χ) is the size o a largest set χ(t) minus 1 among all nodes t o T. A tree decomposition o smallest width is optimal. The treewidth o a graph G, denoted tw(g), is the width o an optimal tree decomposition o G. Given G with n vertices and a constant w, it is possible to decide whether G has treewidth at most w, and i so, to compute an optimal tree decomposition o G in time O(n) (Bodlaender 1996). Furthermore there exist powerul heuristics to compute tree decomposition o small width in a practically easible way (Gogate & Dechter 2004). 3 Bayesian Structure Learning In this section we deine the theoretical ramework or Bayesian structure learning that we shall use or our considerations. We closely ollow the abstract ramework used by Parviainen and Koivisto (2009) which encloses a wide range o score-based approaches to structure learning. We assume that the input data speciies a set V o nodes (or variables) and a local score unction that assigns to each v V and each subset A V \ {v} a non-negative real number (v, A). Given the local score unction, the problem is to ind a DAG D = (V, E) such that the score o D under (D) := v V (v, P D (v)) is as large as possible (the DAG D together with certain local probability distributions orms a Bayesian network). This setting accommodates several popular scores like BDe, BIC and AIC (Parviainen & Koivisto 2009; Chickering 1995). We consider the ollowing decision problem: EXACT BAYESIAN STRUCTURE LEARNING Instance: A local score unction deined on a set V o nodes, a real number s > 0. Question: Is there a DAG D such that (D) s? For our complexity theoretic considerations we will assume that the local score unction is given as the list o all tuples (v, A, (v, A)) or v V and A V \ {v} where (v, A) > 0. We deine P (v) := { P V : (v, P ) > 0} { } to be the set o all potential parent sets o v. We also deine δ := max v V P (v) ; which will be an important measurement or our worst-case analysis o running times. Let be a local score unction deined on a set V o nodes. The super-structure o is the undirected graph S = (V, E ) where E contains an edge {u, v} i and only i u is a potential parent o v, i.e., i u P or some P P (v). We say that a DAG D is admissible or i the skeleton o D is a spanning subgraph o the super-structure S. Furthermore, we say that a DAG D is strictly admissible or i or every vertex v V (D) we have P D (v) P (v). Note that every strictly admissible DAG is also admissible. Furthermore, there always exists a (strictly) admissible DAG D with the highest score: I D is not (strictly) admissible, i.e., i there exists v V (D) such that (v, P D (v)) = 0, we can delete all arcs (w, v) such that w P D (v). This does not decrease the score since (v, ) (v, P D (v)) = 0 or every such v. 4 An Algorithm or Exact Bayesian Structure Learning In this section we present the dynamic programming algorithm and establish our tractability results. For the remainder o this section w denotes an arbitrary but ixed constant. Theorem 1. Given a local score unction with a super-structure S = (V, E ) o treewidth bounded by a constant w. Then we can ind in time O(δ w+1 V ) a DAG D with maximal score (D). Corollary 1. EXACT BAYESIAN STRUCTURE LEARNING can be decided in polynomial time or instances where the super-structure has bounded treewidth. The problem can be decided in linear time i additionally the super-structure has bounded maximum degree. Proo. The irst statement ollows immediately rom the theorem since δ is bounded by the total input size o the instance. The second statement ollows since δ is bounded whenever the maximum degree d o the super-structure is bounded as clearly δ 2 d. We are going to establish Theorem 1 by means o a dynamic programming algorithm along a tree decomposition or S, computing local inormation at the nodes o the tree decomposition that can then be put together to orm an optimal DAG. For this approach, it is convenient to consider tree decompositions in the ollowing normal orm (Kloks 1994): A triple (T, χ, r) is a nice tree decomposition o a graph G i (T, χ) is a tree decomposition o G, the tree T is rooted at node r, and each node o T is o one o the ollowing our types: 1. a lea node: a node having no children;

4 2. a join node: a node t having exactly two children t 1, t 2, and χ(t) = χ(t 1 ) = χ(t 2 ); 3. an introduce node: a node t having exactly one child t, and χ(t) = χ(t ) {v} or a vertex v o G; 4. a orget node: a node t having exactly one child t, and χ(t) = χ(t ) \ {v} or a vertex v o G. For convenience we will also assume that χ(r) = or the root r o T. For a nice tree decomposition (T, χ, r) we deine χ (t) to be the union o all the sets χ(t ) where t is contained in the subtree o T rooted at t. Given a tree decomposition o a graph G o width w, one can eectively obtain in time O( V (G) ) a nice tree decomposition o G with O( V (G) ) nodes and o width at most w (Kloks 1994). In the ollowing we will assume that we are given an instance I = (V, ) o EXACT BAYESIAN STRUCTURE LEARNING together with a nice tree decomposition (T, χ, r) or S o width at most w. A partial solution or a tree node t V (T ) is a digraph that can be obtained as the induced subdigraph D[χ (t)] o a strictly admissible DAG D or. For a tree node t let D(t) denote the set o all partial solutions or t. For a partial solution D D(t) we set t (D) = (v, P D (v)), v (V (D)\χ(t)) i.e., t (D) is the sum o the scores o all nodes o D except or the nodes in χ(t). A record o a tree node t V (T ) is a triple R = (a, p, s) such that: 1. a is a mapping χ(t) P (v); 2. p is a transitive binary relation on χ(t); 3. s is a non-negative real number. We say that a record represents a partial solution D D(t) i it satisies the ollowing conditions: 1. a(v) V (D) = P D (v) or every v χ(t). 2. For every pair o vertices v 1, v 2 χ(t) it holds that (v 1, v 2 ) p i and only i D contains a directed path rom v 1 to v 2. We say that a record R = (a, p, s) o a tree node t V (T ) is valid i it represents some DAG D D(t) and s is the maximum score t (D) over all DAGs in D(t) represented by R. With each tree node t V (T ) we associate the set R(t) o all valid records representing partial solutions in D(t). In a certain sense, R(t) is a succinct representation o the optimal elements o D(t), using space that only depends on w and δ, but not on V. The next three lemmas will allow us to compute the valid records o a tree node rom the valid records o its children. Lemma 1 (join nodes). Let t 1, t 2 be the children o t in T. Then R(t) can be computed rom R(t 1 ) and R(t 2 ) in time O(δ w+1 ). Proo. It ollows rom the above deinitions that a record R = (a, p, s) o t is valid i and only i there are valid records R 1 = (a 1, p 1, s 1 ) R(t 1 ) and R 2 = (a 2, p 2, s 2 ) R(t 2 ) such that: 1. a = a 1 = a p is the transitive closure o p 1 p p is irrelexive, i.e., there is no v χ(t) such that (v, v) p. 4. s = s 1 + s 2. It ollows that R(t) can be computed by considering all pairs o records R 1 R(t 1 ) and R 2 R(t 2 ) and checking conditions 1 4. Since, there are at most O(δ w+1 ) valid records or every t V (T ) and or every such pair the time required to check the conditions only depend on w, the result ollows. Lemma 2 (introduce node). Let t be an introduce node with child t, such that χ(t) = χ(t ) {v 0 }. Then R(t) can be computed rom R(t ) in time O(δ w+1 ). Proo. A record R = (a, p, s) o t is valid i and only i there is a set P P (v 0 ) and a valid record R = (a, p, s ) R(t ) such that: 1. a(v 0 ) = P. 2. For every v χ(t ) it holds that a(v) = a (v). 3. p is the transitive closure o the relation p { (u, v 0 ) : u P } { (v 0, u) : v 0 a (u), u χ(t ) }. 4. p is irrelexive. 5. s = s. It ollows that R(t) can be computed by checking or every pair (P,R ) as deined above, whether it satisies conditions 1 5. Since there are at most δ possible sets P and at most O(δ w ) possible valid records or t (observe that χ(t ) w) the result ollows rom the act that or every pair (P,R ) the conditions can be checked in time that only depends on w. Lemma 3 (orget node). Let t be a orget node with child t such that χ(t) = χ(t ) \ {v 0 }. Then R(t) can be computed rom R(t ) in time O(δ w+1 ). Proo. A record R = (a, p, s) o t is valid i and only i there is a valid record R = (a, p, s ) R(t ) such that: 1. a and p are the restrictions o a and p to χ(t), respectively. That is, a(u) = a (u) or all u χ(t), and p = { (u, v) p : u, v χ(t) }. 2. s = s + (v 0, a (v 0 )). Evidently R(t) can be computed rom R(t ) in time O(δ w+1 ).

5 We are now ready to prove Theorem 1. Proo. Let I = (V, ) be an instance o EXACT BAYESIAN STRUCTURE LEARNING where the super-structure S has treewidth w (a constant) and V = n. We compute a nice tree decomposition (T, χ, r) o S o width w and with O(n) nodes. This can be accomplished in time O(n) (see the discussion in Section 2.3). Next we compute the sets R(t) via a bottom-up traversal o T. For a lea node t we can compute R(t) just by considering all valid records or every possible strictly admissible DAG on the at most w + 1 vertices in χ(t). We can now use Lemmas 1, 2 and 3 to compute the sets R(t) or all other O(n) tree nodes in time O(δ w+1 n). Since χ(r) =, the partial solutions or the root r o T are exactly the strictly admissible DAGs or, and we have r (D) = (D) or each such DAG D. Ater the computation o the sets R(t) or all tree nodes t, the set R(r) contains exactly one record R = (,, s). By the above considerations, it ollows that s is the largest score o all strictly admissible DAGs or, and, as noted in Section 3, this is also the largest score o any DAG whose vertices belong to V. It is now easy to compute a DAG D with score (D) = s via a top-down traversal o T starting rom r and using the inormation previously stored at each node in T. This can also be accomplished in time O(δ w+1 n). 5 Hardness Results Theorem 2 (Chickering 1996). EXACT BAYESIAN STRUCTURE LEARNING is NP-hard or instances with super-structures o maximum degree 4. Proo. This theorem ollows rom Chickering s proo, we only sketch the argument. The reduction is rom FEED- BACK ARC SET (FAS). The problem asks whether a digraph D = (V, E) can be made acyclic by deleting at most k arcs (the deleted arcs orm a eedback arc set o D). The problem is NP-hard or digraphs with skeletons o maximum degree 4 (Karp 1972). Given an instance (D, k) o FAS, where the skeleton o D has maximum degree 4, we construct a set V = V (D) E(D) o nodes and a local score unction on V by setting ((u, v), {u}) = 1 or all (u, v) E(D), (v, { (u, v) : u P D (v) }) = P D (v) or all v V (D), and (v, P ) = 0 in all other cases. Clearly, the super-structure S is the undirected graph obtained rom the skeleton o D ater subdividing every edge once, hence the maximum degree o S is at most 4. It is easy to see that D has a eedback arc set o size k i and only i there exists a DAG D whose skeleton is a spanning subgraph o S with (D ) 2 E k. Theorem 3. EXACT BAYESIAN STRUCTURE LEARNING parameterized by the treewidth o the super-structure is W[1]-hard. Proo. We devise a parameterized reduction rom the ollowing problem, which is well-known to be W[1]- complete (Pietrzak 2003). PARTITIONED CLIQUE Instance: A k-partite graph G = (V, E) with partition V 1,..., V k such that V i = V j = n or 1 i < j k. Parameter: The integer k. Question: Are there vertices v 1,..., v k such that v i V i or 1 i k and {v i, v j } E or 1 i < j k? (The graph K = ({v 1,..., v k }, { {v i, v j } : 1 i < j k }) is a k-clique o G.) Let G = (V, E) be an instance o this problem with partition V 1,..., V k, V 1 = = V k = n. Let α = k 2 1 and ɛ = 2k. We construct a set N o nodes and a local score unction on N such that (i) tw(s ) k(k 1)/2 and (ii) G has a k-clique i and only i there exists a DAG D such that (D) k(n 1)α + (k(k 1)/2)ɛ. See Figure 1 or an illustration. v 12 v 11 v 3 v 2 v 12 a 13 v 11 a 12 v 3 v 2 a 23 Figure 1: Illustration or the reduction in the proo o Theorem 3, k = 3. We set A = { a ij : 1 i < j k }, N = V (G) A, and A i = { a lk : l = i or k = i } or every 1 i k. We are now ready to deine. We set (v, A i ) = α or every v V i, and (a ij, {u, w}) = ɛ or every 1 i < j k, u V i, w V j, and {u, w} E(G). Furthermore we set (v, P ) = 0 or all the remaining combinations o v and P. It is easy to see claim (i) as deleting the k(k 1)/2 vertices a ij rom S yields a collection o isolated vertices, i.e., a graph o treewidth 0. Hence, it remains to show claim (ii). So suppose that G has a k-clique K = ({v 1,..., v k }, E K ), such that v i V i or every 1 i k. It ollows that or the DAG D with arc set E(D) = { (v i, a) : 1 i k, a A i } { (a, v) : 1 i k, a A i, v V i \ {v i } } the ollowing holds: 1. (v, P D (v)) = 0, or every v V (K); 2. (v, P D (v)) = α, or every v V (G) \ V (K); 3. (a, P D (a)) = ɛ, or every a A. Hence, (D) = k(n 1)α + (k(k 1)/2)ɛ and the only-i direction o claim (ii) ollows. To show the i direction o claim (ii) suppose that there exists a DAG D such that (D) k(n 1)α + (k(k 1)/2)ɛ. It can be shown that such a score can only be obtained i every vertex in A attains its maximum score and

6 exactly one vertex v i rom every V i does not. It is then easy to see that the vertices {v 1,..., v k } orm a k-clique in G and the claim ollows. Note that in contrast to Theorem 2, it is essential or Theorem 3 that the super-structure has unbounded degree: i both degree and treewidth are bounded then the problem is ixed-parameter tractable by Corollary 1 and so unlikely to be W[1]-hard. 6 k-neighborhood Local Search Important and widely used algorithms or Bayesian structure learning are based on local search methods (Heckerman, Geiger, & Chickering 1995). Usually the local search algorithm tries to improve the score o a given DAG by transorming it into a new DAG by adding, deleting, or reversing an arc (in symbols ADD, DEL, and REV, respectively). The main obstacle or local search methods is the danger o getting stuck at a poor local optimum. A possibility or decreasing this danger is to perorm k > 1 elementary changes in one step, known as k-neighborhood local search or k-local search or short. For Bayesian structure learning, when we try to improve the score o a DAG on n nodes, the k-local search space is o order n O(k). Thereore, i carried out by brute-orth, k-local search is too costly even or small values o k. It is thereore not surprising that most practical local search algorithms or Bayesian structure learning consider 1-neighborhoods only. In this section we investigate whether under restrictions on the super-structure where EXACT BAYESIAN STRUCTURE LEARNING remains hard (as considered in Theorems 2 and 3) at least k-local search becomes easier compared to the general unrestricted case. Our results are mostly negative. In act, somewhat surprisingly, k-local search remains hard even i edge reversal is the only allowed operation. Beore we give the hardness proos we deine k-local search more ormally. Let k 0 and O {ADD, DEL, REV}. Consider a DAG D = (V, E). A directed graph D = (V, E ) is a k-o-neighbor o D i 1. D is a DAG, 2. V = V, 3. E can be obtained rom E by perorming at most k operations rom the set O. For O {ADD, DEL, REV} we consider the ollowing parameterized decision problem. k-o-local SEARCH BAYESIAN STRUCTURE LEARN- ING Instance: A local score unction, a DAG D that is admissible or, and an integer k. Question: Is there a k-o-neighbor D o D with a higher score than D? Note that the problem does not change i we require D to be admissible, as we can always avoid the addition o an inadmissible arc. Theorem 4. I O = {ADD} or O = {DEL}, then k-o- LOCAL SEARCH BAYESIAN STRUCTURE LEARNING is solvable in polynomial time. Proo. We only consider O = {ADD} as the proo or O = {DEL} is analogous. Let I = (D,, k) be the given instance o k-{add}-local SEARCH BAYESIAN STRUC- TURE LEARNING. Note that there exists a k-{add}- neighbor D o D with (D ) > (D) i and only i there exists a vertex v V (D) such that the addition o at most k incoming arcs increases the score o v and the resulting digraph remains acyclic. Now, or every entry (v, P ) such that P V (D) \ {v} one can easily check whether (v, P ) > (v, P D (v)) and whether P can be obtained rom P D (v) via the addition o at most k incoming arcs such that the resulting digraph is acyclic. In view o Theorem 4 let us deine a set O {ADD, DEL, REV} to be non-trivial i O / {, {ADD}, {DEL}}. Theorem 5. Let O {ADD, DEL, REV} be non-trivial. Then k-o-local SEARCH BAYESIAN STRUCTURE LEARNING is W[1]-hard or parameter tw(s ) + k. Proo. We slightly modiy the reduction given in the proo o Theorem 3. Let D be the directed acyclic graph with vertex set N, arc set { (a, v) : a A i, v V i }. We set k = k(k 1)/2. Then, or every O that contains the operation REV, it is easy to see using the same arguments as in the proo o Theorem 3 that G has a k-clique i and only i D has a k -O-neighbor D with (D ) > (D). Similarly, or the remaining case O = {ADD, DEL}, one can show that G has a k-clique i and only i D has a 2k - O-neighbor D with (D ) > (D). Theorem 6. Let O {ADD, DEL, REV} be non-trivial. Then k-o-local SEARCH BAYESIAN STRUCTURE LEARNING is W[1]-hard or parameter k, hardness even holds i the super-structure S has bounded maximum degree. Proo. We devise a parameterized reduction rom the ollowing problem which is known to be W[1]-complete or every constant d 3 (Downey & Fellows 1999; Flum & Grohe 2006). BOUNDED DEGREE RED/BLUE NON-BLOCKER Instance: An undirected graph G = (V, E) with maximum degree d, where V is the disjoint union o sets Red and Blue, and an integer k. Parameter: The integer k.

7 Question: Is there a set S Red o size k such that every v Blue has a neighbor outside o S? (S is a k-red/blue non-blocker o G). Let (G, Red, Blue, k) be an instance o this problem with Red = {vr, 1..., vr n } and Blue = {vb 1,..., vm b }. We may assume that G is bipartite with partition {Red, Blue}. To see this, observe that without aecting the answer we can remove every edge in G between two vertices in Red and similarly we can remove every vertex in Blue that has a neighbor in Blue. Let k = (d + 1)k + 1. We construct a DAG D and a local score unction such that G has a k-red/blue non-blocker i and only i D has a k -O-neighbor D with a higher score than D. The construction given below applies to all cases with REV O. For the only remaining nontrivial set O = {DEL, ADD} it is easy to adapt the construction by setting k to 2((d + 1)k + 1). To make the ollowing arguments easier, it is convenient that all vertices in Red are o degree exactly d. Hence we introduce an intermediate graph G that is obtained rom G by adding d d(v) vertices or every v Red and connecting each o these vertices by an edge to the corresponding v. The DAG D is obtained rom G by applying the ollowing steps (see Figure 2 or an illustration): v 1 r v 2 r v 3 r v 4 r v 1 b v 2 b v 3 b l1 1 l2 1 l1 2 l2 2 r 1 r 2 l1 3 l2 3 v 1 r v 2 r v 3 r v 1 b v 2 b v 3 b Figure 2: Illustration or the reduction in the proo o Theorem 6, k = 3. To improve readability, vertices in V (G ) \ V (G) and most o the arcs between the leaves o T 1 and T 2 and the vertices in Red are omitted. 1. We replace every edge {v, w} o G with v Red by an arc (v, w). 2. We add the complete binary tree T 1 o lowest height with at least n leaves, with edges directed away rom the root. Let r 1 denote the root and l 1 1,..., l n 1 leaves o T We add the complete binary tree T 2 o lowest height with at least n leaves, with edges directed towards the root. Let r 2 denote the root and l 1 2,..., l n 2 leaves o T For every 1 i n we add the arcs (v i r, l i 1) and (v i r, l i 2), running between G and the trees T 1 and T 2. l1 4 l2 4 v 4 r 5. We add the arc (r 2, r 1 ). This completes the construction o D. Next we deine the local score unction or V = V (D). Let α = k 1, β = n and ɛ = 1. Furthermore, or v V (G ) we write N G (v) = { u V (G ) : {u, v} E(G )}. (i) For every vr i Red we set (vr, i N G (vr) i l1) i = ɛ. (ii) For every vb i Blue and = P N G (vi b ) we set (vb i, P ) = β. (iii) For every v V (D) \ (V (G ) {r 2, l1, 1..., l1 n }) we set (v, P D (v)) = α. (iv) For the root o T 2 we set (r 2, P D (r 2 )) = (r 2, P D (r 2 ) {r 1 }) = α. (v) For every l1 i we set (l1, i P D (l1)) i = (l1, i P D (l1) i \ {vr}) i = α. (vi) For all the remaining combinations o v V (D) and P V (D) we set (v, P ) = 0. Evidently D is acyclic and both D and can be constructed rom G in polynomial time. Observe that the super-structure S is exactly the skeleton o D. Hence, by construction, the degree o every vertex o S is bounded by d + 2. It remains to show that G has a k-red/blue non-blocker i and only i D has a k -neighbor D with a higher score than D. To see this, we irst assume that G contains a k-red/blue non-blocker S Red and S = k. We obtain D rom D by reversing the k arcs in { (vr, i w) : vr i S, w N G (vr) i {l1} i } {(r 2, r 1 )}. Note that the reversal o the arc (r 2, r 1 ) ensures that D is acyclic, and since S is a k-red/blue non-blocker in G it ollows that the score or every vertex in Blue does not change. Hence (D ) = (D) α + kɛ = (D) + 1 > (D). To see the reverse direction, note that the vertices in Red are the only vertices o D whose score is not yet maximum. Increasing the score o any o these vertices v Red introduces a cycle that uses only vertices in V (T 1 ) V (T 2 ) {v}. It is easy too see that in order to break this cycle the score or at least one vertex in V (T 1 ) V (T 2 ) has to be decreased by α and that all cycles produced in this way can be destroyed by reversing the arc (r 2, r 1 ). Since α = (k 1)ɛ it ollows that in order to increase the score or D the score or at least k vertices in Red must be increased to ɛ. Let S be the set o these vertices in Red whose score has been increased in this manner. Since or every vertex in S exactly k + 1 arcs need to be reversed and k < (d + 1)(k + 1) it ollows that S k and hence S = k. Because, β = nɛ it ollows that all vertices in Blue must have kept their score and hence S is a k-red/blue non-blocker or G. Theorem 6 provides a surprising contrast to a similar study o k-local search or MAX-SAT where the problem is ixed-parameter tractable or instances o bounded degree (Szeider 2009). A possible explanation or the surprising hardness o k-o-local SEARCH BAYESIAN STRUCTURE LEARNING could be that, in contrast to MAX-SAT, a global property o the entire instance (acyclicity) must be checked.

8 network n m w d link alarm carpo barley hailinder diabetes insurance win95pts mildew munin munin munin munin pigs water Table 1: Bayesian networks rom Repository/. n = number o nodes, m = number o edges, w = upper bound on the treewidth, d = maximum degree. All parameters reer to the skeleton o the network. 7 Conclusion We have studied the computational complexity o exact Bayesian Structure Learning under graph-theoretic restrictions on the super-structure. Our results show that exact learning is linear-time tractable i the super-structure has bounded treewidth and bounded maximum degree, but none o the two restrictions can be dropped without loosing linear time tractability (or uniorm polynomialtime tractability). Our algorithm is based on dynamic programming along a tree decomposition o the superstructure. We have ocused on theoretical worst-case complexity results, leaving an empirical evaluation o the algorithm on real-world data or uture research. As a irst step in that direction we have computed treewidth and maximum degree o the skeletons o some well-known benchmark networks and ound relatively small numbers, see Table 1. We take this as an encouraging indication that rom a practical point o view it makes sense to consider super-structures o small treewidth and small maximum degree. In act, it is desirable to learn networks o small treewidth and small maximum degree as such networks allow eicient reasoning. On the theoretical side we oer as an objective or uture research the identiication o other graph-theoretic parameters that allow eicient exact structure learning. In particular, it would be interesting to identiy parameters that, in contrast to treewidth and maximum degree, separate the (parameterized) complexities o inding globally optimal networks rom improving networks locally by k-neighborhood local search. Reerences Anton, C., and Carlos, G Eicient principled learning o thin junction trees. In Advances in Neural Inormation Processing Systems (NIPS 2007). Bodlaender, H. L A linear-time algorithm or inding tree-decompositions o small treewidth. SIAM J. Comput. 25(6): Chickering, D. M A transormational characterization o equivalent Bayesian network structures. In UAI Chickering, D. M Learning Bayesian networks is NP-complete. In Learning rom Data: Artiicial Intelligence and Statistics, Springer Verlag Chickering, D. M Optimal structure identiication with greedy search. J. Mach. Learn. Res. 3(3): Downey, R. G., and Fellows, M. R Parameterized Complexity. Springer Verlag. Flum, J., and Grohe, M Parameterized Complexity Theory. Springer Verlag. Gogate, V., and Dechter, R A complete anytime algorithm or treewidth. In UAI Heckerman, D.; Geiger, D.; and Chickering, D. M Learning Bayesian networks: The combination o knowledge and statistical data. Machine Learning 20(3): Impagliazzo, R.; Paturi, R.; and Zane, F Which problems have strongly exponential complexity? J. Comput. System Sci. 63(4): Karp, R. M Reducibility among combinatorial problems. In Complexity o Computer Computations Kloks, T Treewidth: Computations and Approximations. Springer Verlag. Mukund, N., and Je, B PAC-learning bounded tree-width graphical models. In UAI Parviainen, P., and Koivisto, M Exact structure discovery in Bayesian networks with less space. In UAI Perrier, E.; Imoto, S.; and Miyano, S Finding optimal Bayesian network given a super-structure. J. Mach. Learn. Res. 9: Pieter, A.; Daphne, K.; and Andrew, Y. N Learning actor graphs in polynomial time and sample complexity. J. Mach. Learn. Res. 7: Pietrzak, K On the parameterized complexity o the ixed alphabet shortest common supersequence and longest common subsequence problems. J. Comput. System Sci. 67(4): Silander, T., and Myllymäki, P A simple approach or inding the globally optimal Bayesian network structure. In UAI Szeider, S The Parameterized Complexity o k-flip Local Search or SAT and MAX SAT. In SAT Lecture Notes in Computer Science 5584, Springer Verlag,

On Finding Optimal Polytrees

On Finding Optimal Polytrees Serge Gaspers The University of New South Wales and Vienna University of Technology gaspers@kr.tuwien.ac.at Mathieu Liedloff Université d Orléans mathieu.liedloff@univ-orleans.fr On Finding Optimal Polytrees

More information

The Parameterized Complexity of k-flip Local Search for SAT and MAX SAT

The Parameterized Complexity of k-flip Local Search for SAT and MAX SAT The Parameterized Complexity of k-flip Local Search for SAT and MAX SAT Stefan Szeider Vienna University of Technology, Institute of Information Systems Favoritenstraße 9-11, A-1040 Vienna, Austria, stefan@szeider.net

More information

The Complexity of Maximum. Matroid-Greedoid Intersection and. Weighted Greedoid Maximization

The Complexity of Maximum. Matroid-Greedoid Intersection and. Weighted Greedoid Maximization Department of Computer Science Series of Publications C Report C-2004-2 The Complexity of Maximum Matroid-Greedoid Intersection and Weighted Greedoid Maximization Taneli Mielikäinen Esko Ukkonen University

More information

Monadic Second Order Logic on Graphs with Local Cardinality Constraints

Monadic Second Order Logic on Graphs with Local Cardinality Constraints Monadic Second Order Logic on Graphs with Local Cardinality Constraints STEFAN SZEIDER Vienna University of Technology, Austria We introduce the class of MSO-LCC problems which are problems of the following

More information

The Mixed Chinese Postman Problem Parameterized by Pathwidth and Treedepth

The Mixed Chinese Postman Problem Parameterized by Pathwidth and Treedepth The Mixed Chinese Postman Problem Parameterized by Pathwidth and Treedepth Gregory Gutin, Mark Jones, and Magnus Wahlström Royal Holloway, University of London Egham, Surrey TW20 0EX, UK Abstract In the

More information

arxiv: v2 [cs.dm] 12 Jul 2014

arxiv: v2 [cs.dm] 12 Jul 2014 Interval Scheduling and Colorful Independent Sets arxiv:1402.0851v2 [cs.dm] 12 Jul 2014 René van Bevern 1, Matthias Mnich 2, Rolf Niedermeier 1, and Mathias Weller 3 1 Institut für Softwaretechnik und

More information

The Necessity of Bounded Treewidth for Efficient Inference in Bayesian Networks

The Necessity of Bounded Treewidth for Efficient Inference in Bayesian Networks The Necessity of Bounded Treewidth for Efficient Inference in Bayesian Networks Johan H.P. Kwisthout and Hans L. Bodlaender and L.C. van der Gaag 1 Abstract. Algorithms for probabilistic inference in Bayesian

More information

Maximum Flow. Reading: CLRS Chapter 26. CSE 6331 Algorithms Steve Lai

Maximum Flow. Reading: CLRS Chapter 26. CSE 6331 Algorithms Steve Lai Maximum Flow Reading: CLRS Chapter 26. CSE 6331 Algorithms Steve Lai Flow Network A low network G ( V, E) is a directed graph with a source node sv, a sink node tv, a capacity unction c. Each edge ( u,

More information

On Graph Contractions and Induced Minors

On Graph Contractions and Induced Minors On Graph Contractions and Induced Minors Pim van t Hof, 1, Marcin Kamiński 2, Daniël Paulusma 1,, Stefan Szeider, 3, and Dimitrios M. Thilikos 4, 1 School of Engineering and Computing Sciences, Durham

More information

Parameterized Complexity

Parameterized Complexity Parameterized Complexity Stefan Szeider Vienna University of Technology, Austria SAT-SMT Summer School 2013 Espoo, Finland 1 / 97 Outline Foundations Backdoors Kernelization Decompositions Local Search

More information

More on NP and Reductions

More on NP and Reductions Indian Institute of Information Technology Design and Manufacturing, Kancheepuram Chennai 600 127, India An Autonomous Institute under MHRD, Govt of India http://www.iiitdm.ac.in COM 501 Advanced Data

More information

A Fixed-Parameter Algorithm for Max Edge Domination

A Fixed-Parameter Algorithm for Max Edge Domination A Fixed-Parameter Algorithm for Max Edge Domination Tesshu Hanaka and Hirotaka Ono Department of Economic Engineering, Kyushu University, Fukuoka 812-8581, Japan ono@cscekyushu-uacjp Abstract In a graph,

More information

Some hard families of parameterised counting problems

Some hard families of parameterised counting problems Some hard families of parameterised counting problems Mark Jerrum and Kitty Meeks School of Mathematical Sciences, Queen Mary University of London {m.jerrum,k.meeks}@qmul.ac.uk September 2014 Abstract

More information

arxiv: v1 [cs.ds] 2 Oct 2018

arxiv: v1 [cs.ds] 2 Oct 2018 Contracting to a Longest Path in H-Free Graphs Walter Kern 1 and Daniël Paulusma 2 1 Department of Applied Mathematics, University of Twente, The Netherlands w.kern@twente.nl 2 Department of Computer Science,

More information

On improving matchings in trees, via bounded-length augmentations 1

On improving matchings in trees, via bounded-length augmentations 1 On improving matchings in trees, via bounded-length augmentations 1 Julien Bensmail a, Valentin Garnero a, Nicolas Nisse a a Université Côte d Azur, CNRS, Inria, I3S, France Abstract Due to a classical

More information

Dynamic Programming on Trees. Example: Independent Set on T = (V, E) rooted at r V.

Dynamic Programming on Trees. Example: Independent Set on T = (V, E) rooted at r V. Dynamic Programming on Trees Example: Independent Set on T = (V, E) rooted at r V. For v V let T v denote the subtree rooted at v. Let f + (v) be the size of a maximum independent set for T v that contains

More information

Circuit Complexity / Counting Problems

Circuit Complexity / Counting Problems Lecture 5 Circuit Complexity / Counting Problems April 2, 24 Lecturer: Paul eame Notes: William Pentney 5. Circuit Complexity and Uniorm Complexity We will conclude our look at the basic relationship between

More information

Approximability and Parameterized Complexity of Consecutive Ones Submatrix Problems

Approximability and Parameterized Complexity of Consecutive Ones Submatrix Problems Proc. 4th TAMC, 27 Approximability and Parameterized Complexity of Consecutive Ones Submatrix Problems Michael Dom, Jiong Guo, and Rolf Niedermeier Institut für Informatik, Friedrich-Schiller-Universität

More information

Lecture 4: NP and computational intractability

Lecture 4: NP and computational intractability Chapter 4 Lecture 4: NP and computational intractability Listen to: Find the longest path, Daniel Barret What do we do today: polynomial time reduction NP, co-np and NP complete problems some examples

More information

Decomposing oriented graphs into transitive tournaments

Decomposing oriented graphs into transitive tournaments Decomposing oriented graphs into transitive tournaments Raphael Yuster Department of Mathematics University of Haifa Haifa 39105, Israel Abstract For an oriented graph G with n vertices, let f(g) denote

More information

Parameterized Algorithms and Kernels for 3-Hitting Set with Parity Constraints

Parameterized Algorithms and Kernels for 3-Hitting Set with Parity Constraints Parameterized Algorithms and Kernels for 3-Hitting Set with Parity Constraints Vikram Kamat 1 and Neeldhara Misra 2 1 University of Warsaw vkamat@mimuw.edu.pl 2 Indian Institute of Science, Bangalore neeldhara@csa.iisc.ernet.in

More information

Paths and cycles in extended and decomposable digraphs

Paths and cycles in extended and decomposable digraphs Paths and cycles in extended and decomposable digraphs Jørgen Bang-Jensen Gregory Gutin Department of Mathematics and Computer Science Odense University, Denmark Abstract We consider digraphs called extended

More information

A Single-Exponential Fixed-Parameter Algorithm for Distance-Hereditary Vertex Deletion

A Single-Exponential Fixed-Parameter Algorithm for Distance-Hereditary Vertex Deletion A Single-Exponential Fixed-Parameter Algorithm for Distance-Hereditary Vertex Deletion Eduard Eiben a, Robert Ganian a, O-joung Kwon b a Algorithms and Complexity Group, TU Wien, Vienna, Austria b Logic

More information

SEPARATED AND PROPER MORPHISMS

SEPARATED AND PROPER MORPHISMS SEPARATED AND PROPER MORPHISMS BRIAN OSSERMAN Last quarter, we introduced the closed diagonal condition or a prevariety to be a prevariety, and the universally closed condition or a variety to be complete.

More information

Lecture 14 - P v.s. NP 1

Lecture 14 - P v.s. NP 1 CME 305: Discrete Mathematics and Algorithms Instructor: Professor Aaron Sidford (sidford@stanford.edu) February 27, 2018 Lecture 14 - P v.s. NP 1 In this lecture we start Unit 3 on NP-hardness and approximation

More information

Figure 1: Bayesian network for problem 1. P (A = t) = 0.3 (1) P (C = t) = 0.6 (2) Table 1: CPTs for problem 1. (c) P (E B) C P (D = t) f 0.9 t 0.

Figure 1: Bayesian network for problem 1. P (A = t) = 0.3 (1) P (C = t) = 0.6 (2) Table 1: CPTs for problem 1. (c) P (E B) C P (D = t) f 0.9 t 0. Probabilistic Artiicial Intelligence Problem Set 3 Oct 27, 2017 1. Variable elimination In this exercise you will use variable elimination to perorm inerence on a bayesian network. Consider the network

More information

On High-Rate Cryptographic Compression Functions

On High-Rate Cryptographic Compression Functions On High-Rate Cryptographic Compression Functions Richard Ostertág and Martin Stanek Department o Computer Science Faculty o Mathematics, Physics and Inormatics Comenius University Mlynská dolina, 842 48

More information

Fixed Parameter Algorithms for Interval Vertex Deletion and Interval Completion Problems

Fixed Parameter Algorithms for Interval Vertex Deletion and Interval Completion Problems Fixed Parameter Algorithms for Interval Vertex Deletion and Interval Completion Problems Arash Rafiey Department of Informatics, University of Bergen, Norway arash.rafiey@ii.uib.no Abstract We consider

More information

Limitations of Algorithm Power

Limitations of Algorithm Power Limitations of Algorithm Power Objectives We now move into the third and final major theme for this course. 1. Tools for analyzing algorithms. 2. Design strategies for designing algorithms. 3. Identifying

More information

Classical Complexity and Fixed-Parameter Tractability of Simultaneous Consecutive Ones Submatrix & Editing Problems

Classical Complexity and Fixed-Parameter Tractability of Simultaneous Consecutive Ones Submatrix & Editing Problems Classical Complexity and Fixed-Parameter Tractability of Simultaneous Consecutive Ones Submatrix & Editing Problems Rani M. R, Mohith Jagalmohanan, R. Subashini Binary matrices having simultaneous consecutive

More information

Exact Inference: Variable Elimination

Exact Inference: Variable Elimination Readings: K&F 9.2 9. 9.4 9.5 Exact nerence: Variable Elimination ecture 6-7 Apr 1/18 2011 E 515 tatistical Methods pring 2011 nstructor: u-n ee University o Washington eattle et s revisit the tudent Network

More information

Roberto s Notes on Differential Calculus Chapter 8: Graphical analysis Section 1. Extreme points

Roberto s Notes on Differential Calculus Chapter 8: Graphical analysis Section 1. Extreme points Roberto s Notes on Dierential Calculus Chapter 8: Graphical analysis Section 1 Extreme points What you need to know already: How to solve basic algebraic and trigonometric equations. All basic techniques

More information

Kernelization Lower Bounds: A Brief History

Kernelization Lower Bounds: A Brief History Kernelization Lower Bounds: A Brief History G Philip Max Planck Institute for Informatics, Saarbrücken, Germany New Developments in Exact Algorithms and Lower Bounds. Pre-FSTTCS 2014 Workshop, IIT Delhi

More information

On the Space Complexity of Parameterized Problems

On the Space Complexity of Parameterized Problems On the Space Complexity of Parameterized Problems Michael Elberfeld Christoph Stockhusen Till Tantau Institute for Theoretical Computer Science Universität zu Lübeck D-23538 Lübeck, Germany {elberfeld,stockhus,tantau}@tcs.uni-luebeck.de

More information

W -Hardness under Linear FPT-Reductions: Structural Properties and Further Applications

W -Hardness under Linear FPT-Reductions: Structural Properties and Further Applications W -Hardness under Linear FPT-Reductions: Structural Properties and Further Applications Jianer Chen 1 Xiuzhen Huang 2 Iyad A. Kanj 3 Ge Xia 4 1 Dept. of Computer Science, Texas A&M University, College

More information

Tree-width and algorithms

Tree-width and algorithms Tree-width and algorithms Zdeněk Dvořák September 14, 2015 1 Algorithmic applications of tree-width Many problems that are hard in general become easy on trees. For example, consider the problem of finding

More information

On the hardness of losing width

On the hardness of losing width On the hardness of losing width Marek Cygan 1, Daniel Lokshtanov 2, Marcin Pilipczuk 1, Micha l Pilipczuk 1, and Saket Saurabh 3 1 Institute of Informatics, University of Warsaw, Poland {cygan@,malcin@,mp248287@students}mimuwedupl

More information

Parameterised Subgraph Counting Problems

Parameterised Subgraph Counting Problems Parameterised Subgraph Counting Problems Kitty Meeks University of Glasgow University of Strathclyde, 27th May 2015 Joint work with Mark Jerrum (QMUL) What is a counting problem? Decision problems Given

More information

Acyclic and Oriented Chromatic Numbers of Graphs

Acyclic and Oriented Chromatic Numbers of Graphs Acyclic and Oriented Chromatic Numbers of Graphs A. V. Kostochka Novosibirsk State University 630090, Novosibirsk, Russia X. Zhu Dept. of Applied Mathematics National Sun Yat-Sen University Kaohsiung,

More information

On the hardness of losing weight

On the hardness of losing weight On the hardness of losing weight Andrei Krokhin 1 and Dániel Marx 2 1 Department of Computer Science, Durham University, Durham, DH1 3LE, UK andrei.krokhin@durham.ac.uk 2 Department of Computer Science

More information

On the mean connected induced subgraph order of cographs

On the mean connected induced subgraph order of cographs AUSTRALASIAN JOURNAL OF COMBINATORICS Volume 71(1) (018), Pages 161 183 On the mean connected induced subgraph order of cographs Matthew E Kroeker Lucas Mol Ortrud R Oellermann University of Winnipeg Winnipeg,

More information

34.1 Polynomial time. Abstract problems

34.1 Polynomial time. Abstract problems < Day Day Up > 34.1 Polynomial time We begin our study of NP-completeness by formalizing our notion of polynomial-time solvable problems. These problems are generally regarded as tractable, but for philosophical,

More information

Efficient Approximation for Restricted Biclique Cover Problems

Efficient Approximation for Restricted Biclique Cover Problems algorithms Article Efficient Approximation for Restricted Biclique Cover Problems Alessandro Epasto 1, *, and Eli Upfal 2 ID 1 Google Research, New York, NY 10011, USA 2 Department of Computer Science,

More information

Approximating fractional hypertree width

Approximating fractional hypertree width Approximating fractional hypertree width Dániel Marx Abstract Fractional hypertree width is a hypergraph measure similar to tree width and hypertree width. Its algorithmic importance comes from the fact

More information

Tree-Width and the Computational Complexity of MAP Approximations in Bayesian Networks

Tree-Width and the Computational Complexity of MAP Approximations in Bayesian Networks Journal of Artificial Intelligence Research 53 (2015) 699-720 Submitted 03/15; published 08/15 Tree-Width and the Computational Complexity of MAP Approximations in Bayesian Networks Johan Kwisthout Radboud

More information

Graph coloring, perfect graphs

Graph coloring, perfect graphs Lecture 5 (05.04.2013) Graph coloring, perfect graphs Scribe: Tomasz Kociumaka Lecturer: Marcin Pilipczuk 1 Introduction to graph coloring Definition 1. Let G be a simple undirected graph and k a positive

More information

The Maximum Flow Problem with Disjunctive Constraints

The Maximum Flow Problem with Disjunctive Constraints The Maximum Flow Problem with Disjunctive Constraints Ulrich Pferschy Joachim Schauer Abstract We study the maximum flow problem subject to binary disjunctive constraints in a directed graph: A negative

More information

Alternating paths revisited II: restricted b-matchings in bipartite graphs

Alternating paths revisited II: restricted b-matchings in bipartite graphs Egerváry Research Group on Combinatorial Optimization Technical reports TR-2005-13. Published by the Egerváry Research Group, Pázmány P. sétány 1/C, H 1117, Budapest, Hungary. Web site: www.cs.elte.hu/egres.

More information

1a. Introduction COMP6741: Parameterized and Exact Computation

1a. Introduction COMP6741: Parameterized and Exact Computation 1a. Introduction COMP6741: Parameterized and Exact Computation Serge Gaspers 12 1 School of Computer Science and Engineering, UNSW Sydney, Asutralia 2 Decision Sciences Group, Data61, CSIRO, Australia

More information

COMP538: Introduction to Bayesian Networks

COMP538: Introduction to Bayesian Networks COMP538: Introduction to Bayesian Networks Lecture 9: Optimal Structure Learning Nevin L. Zhang lzhang@cse.ust.hk Department of Computer Science and Engineering Hong Kong University of Science and Technology

More information

A Cubic-Vertex Kernel for Flip Consensus Tree

A Cubic-Vertex Kernel for Flip Consensus Tree To appear in Algorithmica A Cubic-Vertex Kernel for Flip Consensus Tree Christian Komusiewicz Johannes Uhlmann Received: date / Accepted: date Abstract Given a bipartite graph G = (V c, V t, E) and a nonnegative

More information

Structure Theorem and Isomorphism Test for Graphs with Excluded Topological Subgraphs

Structure Theorem and Isomorphism Test for Graphs with Excluded Topological Subgraphs Structure Theorem and Isomorphism Test for Graphs with Excluded Topological Subgraphs Martin Grohe and Dániel Marx November 13, 2014 Abstract We generalize the structure theorem of Robertson and Seymour

More information

List H-Coloring a Graph by Removing Few Vertices

List H-Coloring a Graph by Removing Few Vertices List H-Coloring a Graph by Removing Few Vertices Rajesh Chitnis 1, László Egri 2, and Dániel Marx 2 1 Department of Computer Science, University of Maryland at College Park, USA, rchitnis@cs.umd.edu 2

More information

Essential facts about NP-completeness:

Essential facts about NP-completeness: CMPSCI611: NP Completeness Lecture 17 Essential facts about NP-completeness: Any NP-complete problem can be solved by a simple, but exponentially slow algorithm. We don t have polynomial-time solutions

More information

SEPARATED AND PROPER MORPHISMS

SEPARATED AND PROPER MORPHISMS SEPARATED AND PROPER MORPHISMS BRIAN OSSERMAN The notions o separatedness and properness are the algebraic geometry analogues o the Hausdor condition and compactness in topology. For varieties over the

More information

Chapter 34: NP-Completeness

Chapter 34: NP-Completeness Graph Algorithms - Spring 2011 Set 17. Lecturer: Huilan Chang Reference: Cormen, Leiserson, Rivest, and Stein, Introduction to Algorithms, 2nd Edition, The MIT Press. Chapter 34: NP-Completeness 2. Polynomial-time

More information

2. ETA EVALUATIONS USING WEBER FUNCTIONS. Introduction

2. ETA EVALUATIONS USING WEBER FUNCTIONS. Introduction . ETA EVALUATIONS USING WEBER FUNCTIONS Introduction So ar we have seen some o the methods or providing eta evaluations that appear in the literature and we have seen some o the interesting properties

More information

Structure Learning: the good, the bad, the ugly

Structure Learning: the good, the bad, the ugly Readings: K&F: 15.1, 15.2, 15.3, 15.4, 15.5 Structure Learning: the good, the bad, the ugly Graphical Models 10708 Carlos Guestrin Carnegie Mellon University September 29 th, 2006 1 Understanding the uniform

More information

Packing and decomposition of graphs with trees

Packing and decomposition of graphs with trees Packing and decomposition of graphs with trees Raphael Yuster Department of Mathematics University of Haifa-ORANIM Tivon 36006, Israel. e-mail: raphy@math.tau.ac.il Abstract Let H be a tree on h 2 vertices.

More information

Constraint Satisfaction with Bounded Treewidth Revisited

Constraint Satisfaction with Bounded Treewidth Revisited Constraint Satisfaction with Bounded Treewidth Revisited Marko Samer and Stefan Szeider Department of Computer Science Durham University, UK Abstract The constraint satisfaction problem can be solved in

More information

P is the class of problems for which there are algorithms that solve the problem in time O(n k ) for some constant k.

P is the class of problems for which there are algorithms that solve the problem in time O(n k ) for some constant k. Complexity Theory Problems are divided into complexity classes. Informally: So far in this course, almost all algorithms had polynomial running time, i.e., on inputs of size n, worst-case running time

More information

10.3 Matroids and approximation

10.3 Matroids and approximation 10.3 Matroids and approximation 137 10.3 Matroids and approximation Given a family F of subsets of some finite set X, called the ground-set, and a weight function assigning each element x X a non-negative

More information

On disconnected cuts and separators

On disconnected cuts and separators On disconnected cuts and separators Takehiro Ito 1, Marcin Kamiński 2, Daniël Paulusma 3 and Dimitrios M. Thilikos 4 1 Graduate School of Information Sciences, Tohoku University, Aoba-yama 6-6-05, Sendai,

More information

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY. FLAC (15-453) Spring l. Blum TIME COMPLEXITY AND POLYNOMIAL TIME;

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY. FLAC (15-453) Spring l. Blum TIME COMPLEXITY AND POLYNOMIAL TIME; 15-453 TIME COMPLEXITY AND POLYNOMIAL TIME; FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY NON DETERMINISTIC TURING MACHINES AND NP THURSDAY Mar 20 COMPLEXITY THEORY Studies what can and can t be computed

More information

Preliminaries and Complexity Theory

Preliminaries and Complexity Theory Preliminaries and Complexity Theory Oleksandr Romanko CAS 746 - Advanced Topics in Combinatorial Optimization McMaster University, January 16, 2006 Introduction Book structure: 2 Part I Linear Algebra

More information

Dept. of Computer Science, University of British Columbia, Vancouver, BC, Canada.

Dept. of Computer Science, University of British Columbia, Vancouver, BC, Canada. EuroComb 2005 DMTCS proc. AE, 2005, 67 72 Directed One-Trees William Evans and Mohammad Ali Safari Dept. of Computer Science, University of British Columbia, Vancouver, BC, Canada. {will,safari}@cs.ubc.ca

More information

A Note on the Complexity of Network Reliability Problems. Hans L. Bodlaender Thomas Wolle

A Note on the Complexity of Network Reliability Problems. Hans L. Bodlaender Thomas Wolle A Note on the Complexity of Network Reliability Problems Hans L. Bodlaender Thomas Wolle institute of information and computing sciences, utrecht university technical report UU-CS-2004-001 www.cs.uu.nl

More information

STAT 801: Mathematical Statistics. Hypothesis Testing

STAT 801: Mathematical Statistics. Hypothesis Testing STAT 801: Mathematical Statistics Hypothesis Testing Hypothesis testing: a statistical problem where you must choose, on the basis o data X, between two alternatives. We ormalize this as the problem o

More information

Learning Large-Alphabet and Analog Circuits with Value Injection Queries

Learning Large-Alphabet and Analog Circuits with Value Injection Queries Learning Large-Alphabet and Analog Circuits with Value Injection Queries Dana Angluin 1 James Aspnes 1, Jiang Chen 2, Lev Reyzin 1,, 1 Computer Science Department, Yale University {angluin,aspnes}@cs.yale.edu,

More information

Tree Decompositions and Tree-Width

Tree Decompositions and Tree-Width Tree Decompositions and Tree-Width CS 511 Iowa State University December 6, 2010 CS 511 (Iowa State University) Tree Decompositions and Tree-Width December 6, 2010 1 / 15 Tree Decompositions Definition

More information

NP-Hardness and Fixed-Parameter Tractability of Realizing Degree Sequences with Directed Acyclic Graphs

NP-Hardness and Fixed-Parameter Tractability of Realizing Degree Sequences with Directed Acyclic Graphs NP-Hardness and Fixed-Parameter Tractability of Realizing Degree Sequences with Directed Acyclic Graphs Sepp Hartung and André Nichterlein Institut für Softwaretechnik und Theoretische Informatik TU Berlin

More information

CS 361 Meeting 28 11/14/18

CS 361 Meeting 28 11/14/18 CS 361 Meeting 28 11/14/18 Announcements 1. Homework 9 due Friday Computation Histories 1. Some very interesting proos o undecidability rely on the technique o constructing a language that describes the

More information

1 Relative degree and local normal forms

1 Relative degree and local normal forms THE ZERO DYNAMICS OF A NONLINEAR SYSTEM 1 Relative degree and local normal orms The purpose o this Section is to show how single-input single-output nonlinear systems can be locally given, by means o a

More information

P, NP, NP-Complete, and NPhard

P, NP, NP-Complete, and NPhard P, NP, NP-Complete, and NPhard Problems Zhenjiang Li 21/09/2011 Outline Algorithm time complicity P and NP problems NP-Complete and NP-Hard problems Algorithm time complicity Outline What is this course

More information

Finding Consensus Strings With Small Length Difference Between Input and Solution Strings

Finding Consensus Strings With Small Length Difference Between Input and Solution Strings Finding Consensus Strings With Small Length Difference Between Input and Solution Strings Markus L. Schmid Trier University, Fachbereich IV Abteilung Informatikwissenschaften, D-54286 Trier, Germany, MSchmid@uni-trier.de

More information

Parameterized Domination in Circle Graphs

Parameterized Domination in Circle Graphs Parameterized Domination in Circle Graphs Nicolas Bousquet 1, Daniel Gonçalves 1, George B. Mertzios 2, Christophe Paul 1, Ignasi Sau 1, and Stéphan Thomassé 3 1 AlGCo project-team, CNRS, LIRMM, Montpellier,

More information

1 Primals and Duals: Zero Sum Games

1 Primals and Duals: Zero Sum Games CS 124 Section #11 Zero Sum Games; NP Completeness 4/15/17 1 Primals and Duals: Zero Sum Games We can represent various situations of conflict in life in terms of matrix games. For example, the game shown

More information

arxiv: v1 [cs.ds] 20 Feb 2017

arxiv: v1 [cs.ds] 20 Feb 2017 AN OPTIMAL XP ALGORITHM FOR HAMILTONIAN CYCLE ON GRAPHS OF BOUNDED CLIQUE-WIDTH BENJAMIN BERGOUGNOUX, MAMADOU MOUSTAPHA KANTÉ, AND O-JOUNG KWON arxiv:1702.06095v1 [cs.ds] 20 Feb 2017 Abstract. For MSO

More information

On the hardness of losing width

On the hardness of losing width On the hardness of losing width Marek Cygan 1, Daniel Lokshtanov 2, Marcin Pilipczuk 1, Micha l Pilipczuk 1, and Saket Saurabh 3 1 Institute of Informatics, University of Warsaw, Poland {cygan@,malcin@,mp248287@students}mimuwedupl

More information

Approximating the Partition Function by Deleting and then Correcting for Model Edges (Extended Abstract)

Approximating the Partition Function by Deleting and then Correcting for Model Edges (Extended Abstract) Approximating the Partition Function by Deleting and then Correcting for Model Edges (Extended Abstract) Arthur Choi and Adnan Darwiche Computer Science Department University of California, Los Angeles

More information

Reductions. Reduction. Linear Time Reduction: Examples. Linear Time Reductions

Reductions. Reduction. Linear Time Reduction: Examples. Linear Time Reductions Reduction Reductions Problem X reduces to problem Y if given a subroutine for Y, can solve X. Cost of solving X = cost of solving Y + cost of reduction. May call subroutine for Y more than once. Ex: X

More information

MINIMALLY NON-PFAFFIAN GRAPHS

MINIMALLY NON-PFAFFIAN GRAPHS MINIMALLY NON-PFAFFIAN GRAPHS SERGUEI NORINE AND ROBIN THOMAS Abstract. We consider the question of characterizing Pfaffian graphs. We exhibit an infinite family of non-pfaffian graphs minimal with respect

More information

An Improved Algorithm for Parameterized Edge Dominating Set Problem

An Improved Algorithm for Parameterized Edge Dominating Set Problem An Improved Algorithm for Parameterized Edge Dominating Set Problem Ken Iwaide and Hiroshi Nagamochi Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Japan,

More information

Theory of Computation Chapter 9

Theory of Computation Chapter 9 0-0 Theory of Computation Chapter 9 Guan-Shieng Huang May 12, 2003 NP-completeness Problems NP: the class of languages decided by nondeterministic Turing machine in polynomial time NP-completeness: Cook

More information

On the Complexity of Budgeted Maximum Path Coverage on Trees

On the Complexity of Budgeted Maximum Path Coverage on Trees On the Complexity of Budgeted Maximum Path Coverage on Trees H.-C. Wirth An instance of the budgeted maximum coverage problem is given by a set of weighted ground elements and a cost weighted family of

More information

1 Review of Vertex Cover

1 Review of Vertex Cover CS266: Parameterized Algorithms and Complexity Stanford University Lecture 3 Tuesday, April 9 Scribe: Huacheng Yu Spring 2013 1 Review of Vertex Cover In the last lecture, we discussed FPT algorithms for

More information

Parameterized Algorithms for the H-Packing with t-overlap Problem

Parameterized Algorithms for the H-Packing with t-overlap Problem Journal of Graph Algorithms and Applications http://jgaa.info/ vol. 18, no. 4, pp. 515 538 (2014) DOI: 10.7155/jgaa.00335 Parameterized Algorithms for the H-Packing with t-overlap Problem Alejandro López-Ortiz

More information

CS 350 Algorithms and Complexity

CS 350 Algorithms and Complexity 1 CS 350 Algorithms and Complexity Fall 2015 Lecture 15: Limitations of Algorithmic Power Introduction to complexity theory Andrew P. Black Department of Computer Science Portland State University Lower

More information

Discrete Mathematics. On the number of graphs with a given endomorphism monoid

Discrete Mathematics. On the number of graphs with a given endomorphism monoid Discrete Mathematics 30 00 376 384 Contents lists available at ScienceDirect Discrete Mathematics journal homepage: www.elsevier.com/locate/disc On the number o graphs with a given endomorphism monoid

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear

More information

NP-completeness. Chapter 34. Sergey Bereg

NP-completeness. Chapter 34. Sergey Bereg NP-completeness Chapter 34 Sergey Bereg Oct 2017 Examples Some problems admit polynomial time algorithms, i.e. O(n k ) running time where n is the input size. We will study a class of NP-complete problems

More information

3 Undirected Graphical Models

3 Undirected Graphical Models Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science 6.438 Algorithms For Inference Fall 2014 3 Undirected Graphical Models In this lecture, we discuss undirected

More information

Cographs; chordal graphs and tree decompositions

Cographs; chordal graphs and tree decompositions Cographs; chordal graphs and tree decompositions Zdeněk Dvořák September 14, 2015 Let us now proceed with some more interesting graph classes closed on induced subgraphs. 1 Cographs The class of cographs

More information

Data Structures in Java

Data Structures in Java Data Structures in Java Lecture 21: Introduction to NP-Completeness 12/9/2015 Daniel Bauer Algorithms and Problem Solving Purpose of algorithms: find solutions to problems. Data Structures provide ways

More information

The Parameterized Complexity of Approximate Inference in Bayesian Networks

The Parameterized Complexity of Approximate Inference in Bayesian Networks JMLR: Workshop and Conference Proceedings vol 52, 264-274, 2016 PGM 2016 The Parameterized Complexity of Approximate Inference in Bayesian Networks Johan Kwisthout Radboud University Donders Institute

More information

On the Complexity of the Minimum Independent Set Partition Problem

On the Complexity of the Minimum Independent Set Partition Problem On the Complexity of the Minimum Independent Set Partition Problem T-H. Hubert Chan 1, Charalampos Papamanthou 2, and Zhichao Zhao 1 1 Department of Computer Science the University of Hong Kong {hubert,zczhao}@cs.hku.hk

More information

Graphical Model Inference with Perfect Graphs

Graphical Model Inference with Perfect Graphs Graphical Model Inference with Perfect Graphs Tony Jebara Columbia University July 25, 2013 joint work with Adrian Weller Graphical models and Markov random fields We depict a graphical model G as a bipartite

More information

NP-Completeness. Until now we have been designing algorithms for specific problems

NP-Completeness. Until now we have been designing algorithms for specific problems NP-Completeness 1 Introduction Until now we have been designing algorithms for specific problems We have seen running times O(log n), O(n), O(n log n), O(n 2 ), O(n 3 )... We have also discussed lower

More information

On the rank of Directed Hamiltonicity and beyond

On the rank of Directed Hamiltonicity and beyond Utrecht University Faculty of Science Department of Information and Computing Sciences On the rank of Directed Hamiltonicity and beyond Author: Ioannis Katsikarelis Supervisors: Dr. Hans L. Bodlaender

More information

k-distinct In- and Out-Branchings in Digraphs

k-distinct In- and Out-Branchings in Digraphs k-distinct In- and Out-Branchings in Digraphs Gregory Gutin 1, Felix Reidl 2, and Magnus Wahlström 1 arxiv:1612.03607v2 [cs.ds] 21 Apr 2017 1 Royal Holloway, University of London, UK 2 North Carolina State

More information