Reconstructing Phylogenetic Networks

Size: px
Start display at page:

Download "Reconstructing Phylogenetic Networks"

Transcription

1 Reconstructing Phylogenetic Networks Mareike Fischer, Leo van Iersel, Steven Kelk, Nela Lekić, Simone Linz, Celine Scornavacca, Leen Stougie Centrum Wiskunde & Informatica (CWI) Amsterdam MCW Prague, 3 July 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

2 Definition Let X be a finite set. A (rooted) phylogenetic tree on X is a rooted tree with no indegree- outdegree- vertices whose leaves are bijectively labelled by the elements of X. 8 Mya 76 Mya 68 Mya 35 Mya Kiwi (New Zealand) Cassowary (New Guinea + Australia) Leo van Iersel (CWI) Emu (Australia) Ostrich Moa (Africa) (New Zealand) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 3 / 44

4 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 4 / 44

5 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 5 / 44

6 W.F. Doolittle et al. () Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 6 / 44

7 Definition Let X be a finite set. A (rooted) phylogenetic network on X is a rooted directed acyclic graph with no indegree- outdegree- vertices whose leaves are bijectively labelled by the elements of X. Parasitic Jaeger Pomarine Skua Great Skua Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 7 / 44

8 The first phylogenetic network (Buffon, 755) Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 8 / 44

9 Phylogenetic network for humans (Reich et al., ) Definition A reticulation is a vertex with indegree at least. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 9 / 44

10 Tree-based methods Compute trees from DNA sequences. Different parts of DNA might give different trees. Try to induce a phylogenetic network from the trees. Definition A phylogenetic tree T is displayed by a phylogenetic network N if T can be obtained from a subgraph of N by contracting edges. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

11 Example: tree T is displayed by network N a h a h g g b c d e f N b c d e f T Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

12 The other binary tree T displayed by network N a h a h g g b c d e f b c d e f N T Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

13 Challenge: try to reconstruct the network from the trees a h g a h b c d e f g a b c d e f h g b c d e f Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 3 / 44

14 Definition The reticulation number of a phylogenetic network N is d (v). Problem Minimum Reticulation v V \{root} Instance: phylogenetic trees T, T Solution: phylogenetic network that displays T and T Minimize: reticulation number of the network. Theorem There exists a constant factor approximation algorithm for Minimum Reticulation if and only if there exists a constant factor approximation algorithm for Directed Feedback Vertex Set. Open question: how to handle more than two trees (efficiently)? Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 4 / 44

15 Reconstructing phylogenetic networks Tree-based methods Construct trees from DNA sequences. Find a network that displays the trees and has minimum reticulation number. Sequence-based methods Find a network directly from the DNA sequences. Optimize Parsimony or Likelihood score of network. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 5 / 44

16 Maximum Parsimony for trees Small parsimony problem: given a tree and a sequence for each leaf, assign sequences to the internal vertices in order to minimize the total number of mutations. ACCTG ATCTG ATCTC GTAAA TTACT Example input Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 6 / 44

17 Maximum Parsimony for trees Small parsimony problem: given a tree and a sequence for each leaf, assign sequences to the internal vertices in order to minimize the total number of mutations. TTCTA ATCTA ATCTG TTAAA ACCTG ATCTG ATCTC GTAAA TTACT Example labelling of internal vertices Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 7 / 44

18 Maximum Parsimony for trees Small parsimony problem: given a tree and a sequence for each leaf, assign sequences to the internal vertices in order to minimize the total number of mutations. ATCTA TTCTA ATCTG TTAAA ACCTG ATCTG ATCTC GTAAA TTACT Example of one mutation Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 8 / 44

19 Maximum Parsimony for trees Small parsimony problem: given a tree and a sequence for each leaf, assign sequences to the internal vertices in order to minimize the total number of mutations. TTCTA ATCTA ATCTG TTAAA ACCTG ATCTG ATCTC GTAAA TTACT All 9 mutations. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 9 / 44

20 Maximum Parsimony for trees Small parsimony problem: given a tree and a sequence for each leaf, assign sequences to the interior vertices in order to minimize the total number of mutations. Polynomial-time solvable: Consider each character separately. Use dynamic programming (Fitch, 97). Two possible extensions to networks: hardwired softwired Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

21 Hardwired Maximum Parsimony on Networks A p-state character on X is a function α : X {,..., p}. The change c τ (e) on edge e = (u, v) w.r.t. a p-state character τ on V (N) is defined as: { if τ(u) = τ(v) c τ (e) = if τ(u) τ(v). The hardwired parsimony score of a phylogenetic network N and p-state character α is given by PS hw (N, α) = min c τ (e), τ e E(N) where the minimum is taken over all p-state characters τ on V (N) that extend α. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

22 Example input: (N, α) 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 / 44

23 A 3-state character τ on V (N) that extends α. 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 3 / 44

24 PS hw (N, α) = 4 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 4 / 44

25 Softwired Maximum Parsimony on Networks The softwired parsimony score of a phylogenetic network N and p-state character α is given by PS sw (N, α) = min PS(T, α), T T (N) where T (N) is the set of trees on X displayed by N. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 5 / 44

26 One of the two trees on X displayed by the network 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 6 / 44

27 A 3-state character τ on V (T ) that extends α. 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 7 / 44

28 There are 3 changes 3 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 8 / 44

29 The other tree needs 4 changes 3 The minimum over the two trees is 3, so PS sw (N, α) = 3. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 9 / 44

30 Proposition PS hw (N, α) is not an o(n)-approximation of PS sw (N, α). Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 3 / 44

31 Softwired Parsimony Score PS sw (N, α) = Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 3 / 44

32 Hardwired Parsimony Score with r the number of reticulations. PS hw (N, α) = 4 = r + Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 3 / 44

33 Proposition Let G be the graph obtained from network N by merging all leaves x with α(x) = i into a single node γ i, for i =,..., p. Then, PS hw (N, α) equals the size of a minimum multiterminal cut in G with terminals γ,..., γ p. Corollary Computing the hardwired parsimony score of a phylogenetic network and a binary character is polynomial-time solvable. Corollary Computing the hardwired parsimony score of a phylogenetic network and a p-state character, for p 3, is NP-hard and APX-hard but fixed-parameter tractable (FPT) in the parsimony score, and there exists a polynomial-time.3438-approximation for all p and a -approximation for p = 3. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 33 / 44

34 Example Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 34 / 44

35 Merge -leaves and -leaves Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 35 / 44

36 Hardwired Parsimony Score is 4 Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 36 / 44

37 Observation There exists a (trivial) X -approximation for computing the softwired parsimony score of a phylogenetic network. Theorem For every constant ɛ > there is no polynomial-time approximation algorithm that approximates PS sw (N, α) to a factor X ɛ, for a phylogenetic network N and a binary character α, unless P = NP. Definition A phylogenetic network is binary if the root has outdegree and all other vertices have total degree or 3. Theorem For every constant ɛ > there is no polynomial-time approximation algorithm that approximates PS sw (N, α) to a factor X 3 ɛ, for a binary phylogenetic network N and a binary character α, unless P = NP. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 37 / 44

38 Proof: reduction from 3SAT Ú s s variable gadget x :x y :y z :z f(n,ï) copies f(n,ï) copies clause gadget... f(n,ï)... copies x _ :y :x _ y _ z f(n,ï) copies Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 38 / 44

39 Binary case x :x y :y z :z dx Îx dy Îy dz Îz dx Îz dx 3 x _ y x _ :y _ z x _ :z :x _ :z Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 39 / 44

40 Theorem There is no FPT algorithm for computing the softwired parsimony score, with the score as parameter, unless P = NP. Definition A phylogenetic network is level-k if each biconnected component has reticulation number at most k. Theorem There is an FPT algorithm for computing the softwired parsimony score, with the level of the network as parameter. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 4 / 44

41 ILP for softwired parsimony score min c e e E s.t. x v,s = s P c e x u,s x v,s ( y e ) c e x v,s x u,s ( y e ) y (v,r) = v:(v,r) E y e = x v,α(v) = c e, y e {, } x v,s {, } for all v V for all e = (u, v) E, s P for all e = (u, v) E, s P for each reticulation r for each non-reticulate edge e for each leaf v for all e E for all v V, s P with P = {,..., p} and α(v) the given character state of a leaf v. Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 4 / 44

42 Both parsimony scores can be computed quickly using ILP X Avg. Average computation time (s) num. of Hardwired PS Softwired PS retic. -state 3-state 4-state -state 3-state 4-state Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 4 / 44

43 Future Work Are there approximation or FPT algorithms for computing the softwired parsimony score of restricted classes of networks? How to search for an optimal network? What if the different characters are not independent? Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 43 / 44

44 Thanks Mareike Fischer (Greifswald) Steven Kelk (Maastricht) Celine Scornavacca (Montpellier) Simone Linz (Christchurch) Leen Stougie (Amsterdam) Nela Lekić (Maastricht) Leo van Iersel (CWI) Reconstructing Phylogenetic Networks MCW Prague, 3 July 3 44 / 44

arxiv: v3 [q-bio.pe] 1 May 2014

arxiv: v3 [q-bio.pe] 1 May 2014 ON COMPUTING THE MAXIMUM PARSIMONY SCORE OF A PHYLOGENETIC NETWORK MAREIKE FISCHER, LEO VAN IERSEL, STEVEN KELK, AND CELINE SCORNAVACCA arxiv:32.243v3 [q-bio.pe] May 24 Abstract. Phylogenetic networks

More information

Downloaded 10/29/15 to Redistribution subject to SIAM license or copyright; see

Downloaded 10/29/15 to Redistribution subject to SIAM license or copyright; see SIAM J. DISCRETE MATH. Vol. 29, No., pp. 559 585 c 25 Society for Industrial and Applied Mathematics Downloaded /29/5 to 3.8.29.24. Redistribution subject to SIAM license or copyright; see http://www.siam.org/journals/ojsa.php

More information

Improved maximum parsimony models for phylogenetic networks

Improved maximum parsimony models for phylogenetic networks Improved maximum parsimony models for phylogenetic networks Leo van Iersel Mark Jones Celine Scornavacca December 20, 207 Abstract Phylogenetic networks are well suited to represent evolutionary histories

More information

arxiv: v5 [q-bio.pe] 24 Oct 2016

arxiv: v5 [q-bio.pe] 24 Oct 2016 On the Quirks of Maximum Parsimony and Likelihood on Phylogenetic Networks Christopher Bryant a, Mareike Fischer b, Simone Linz c, Charles Semple d arxiv:1505.06898v5 [q-bio.pe] 24 Oct 2016 a Statistics

More information

arxiv: v1 [q-bio.pe] 12 Jul 2017

arxiv: v1 [q-bio.pe] 12 Jul 2017 oname manuscript o. (will be inserted by the editor) Finding the most parsimonious or likely tree in a network with respect to an alignment arxiv:1707.03648v1 [q-bio.pe] 12 Jul 27 Steven Kelk Fabio Pardi

More information

arxiv: v1 [cs.cc] 9 Oct 2014

arxiv: v1 [cs.cc] 9 Oct 2014 Satisfying ternary permutation constraints by multiple linear orders or phylogenetic trees Leo van Iersel, Steven Kelk, Nela Lekić, Simone Linz May 7, 08 arxiv:40.7v [cs.cc] 9 Oct 04 Abstract A ternary

More information

A new algorithm to construct phylogenetic networks from trees

A new algorithm to construct phylogenetic networks from trees A new algorithm to construct phylogenetic networks from trees J. Wang College of Computer Science, Inner Mongolia University, Hohhot, Inner Mongolia, China Corresponding author: J. Wang E-mail: wangjuanangle@hit.edu.cn

More information

An introduction to phylogenetic networks

An introduction to phylogenetic networks An introduction to phylogenetic networks Steven Kelk Department of Knowledge Engineering (DKE) Maastricht University Email: steven.kelk@maastrichtuniversity.nl Web: http://skelk.sdf-eu.org Genome sequence,

More information

Reconstruction of certain phylogenetic networks from their tree-average distances

Reconstruction of certain phylogenetic networks from their tree-average distances Reconstruction of certain phylogenetic networks from their tree-average distances Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA swillson@iastate.edu October 10,

More information

Phylogenetic Networks with Recombination

Phylogenetic Networks with Recombination Phylogenetic Networks with Recombination October 17 2012 Recombination All DNA is recombinant DNA... [The] natural process of recombination and mutation have acted throughout evolution... Genetic exchange

More information

Cherry picking: a characterization of the temporal hybridization number for a set of phylogenies

Cherry picking: a characterization of the temporal hybridization number for a set of phylogenies Bulletin of Mathematical Biology manuscript No. (will be inserted by the editor) Cherry picking: a characterization of the temporal hybridization number for a set of phylogenies Peter J. Humphries Simone

More information

arxiv: v1 [cs.ds] 2 Sep 2016

arxiv: v1 [cs.ds] 2 Sep 2016 On unrooted and root-uncertain variants of several well-known phylogenetic network problems arxiv:1609.00544v1 [cs.ds] 2 Sep 2016 Leo van Iersel 1, Steven Kelk 2, Georgios Stamoulis 2, Leen Stougie 3,4,

More information

Parsimonious Migration History Problem: Complexity and Algorithms. Mohammed El-Kebir, University of Illinois at Urbana-Champaign WABI 2018

Parsimonious Migration History Problem: Complexity and Algorithms. Mohammed El-Kebir, University of Illinois at Urbana-Champaign WABI 2018 Parsimonious Migration History Problem: Complexity and Algorithms Mohammed El-Kebir, University of Illinois at Urbana-Champaign WABI 2018 Tumorigenesis: (i) Cell Division, (ii) Mutation & (iii) Migration

More information

arxiv: v1 [q-bio.pe] 1 Jun 2014

arxiv: v1 [q-bio.pe] 1 Jun 2014 THE MOST PARSIMONIOUS TREE FOR RANDOM DATA MAREIKE FISCHER, MICHELLE GALLA, LINA HERBST AND MIKE STEEL arxiv:46.27v [q-bio.pe] Jun 24 Abstract. Applying a method to reconstruct a phylogenetic tree from

More information

Phylogenetic Networks, Trees, and Clusters

Phylogenetic Networks, Trees, and Clusters Phylogenetic Networks, Trees, and Clusters Luay Nakhleh 1 and Li-San Wang 2 1 Department of Computer Science Rice University Houston, TX 77005, USA nakhleh@cs.rice.edu 2 Department of Biology University

More information

THE THREE-STATE PERFECT PHYLOGENY PROBLEM REDUCES TO 2-SAT

THE THREE-STATE PERFECT PHYLOGENY PROBLEM REDUCES TO 2-SAT COMMUNICATIONS IN INFORMATION AND SYSTEMS c 2009 International Press Vol. 9, No. 4, pp. 295-302, 2009 001 THE THREE-STATE PERFECT PHYLOGENY PROBLEM REDUCES TO 2-SAT DAN GUSFIELD AND YUFENG WU Abstract.

More information

Properties of normal phylogenetic networks

Properties of normal phylogenetic networks Properties of normal phylogenetic networks Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA swillson@iastate.edu August 13, 2009 Abstract. A phylogenetic network is

More information

Tree-average distances on certain phylogenetic networks have their weights uniquely determined

Tree-average distances on certain phylogenetic networks have their weights uniquely determined Tree-average distances on certain phylogenetic networks have their weights uniquely determined Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA swillson@iastate.edu

More information

LEO VAN IERSEL TU DELFT

LEO VAN IERSEL TU DELFT LEO VAN IERSEL TU DELFT UT LEO VAN IERSEL TU DELFT UT LEO VAN IERSEL TU DELFT TU/ CWI UT LEO VAN IERSEL TU DELFT TU/ CWI UT TUD LEO VAN IERSEL TU DELFT TU/ Tnzni & Kny yr LEO VAN IERSEL TU DELFT Nw Zln.5

More information

Phylogenetics: Parsimony

Phylogenetics: Parsimony 1 Phylogenetics: Parsimony COMP 571 Luay Nakhleh, Rice University he Problem 2 Input: Multiple alignment of a set S of sequences Output: ree leaf-labeled with S Assumptions Characters are mutually independent

More information

Branching. Teppo Niinimäki. Helsinki October 14, 2011 Seminar: Exact Exponential Algorithms UNIVERSITY OF HELSINKI Department of Computer Science

Branching. Teppo Niinimäki. Helsinki October 14, 2011 Seminar: Exact Exponential Algorithms UNIVERSITY OF HELSINKI Department of Computer Science Branching Teppo Niinimäki Helsinki October 14, 2011 Seminar: Exact Exponential Algorithms UNIVERSITY OF HELSINKI Department of Computer Science 1 For a large number of important computational problems

More information

A Collection of Problems in Propositional Logic

A Collection of Problems in Propositional Logic A Collection of Problems in Propositional Logic Hans Kleine Büning SS 2016 Problem 1: SAT (respectively SAT) Instance: A propositional formula α (for SAT in CNF). Question: Is α satisfiable? The problems

More information

arxiv: v1 [cs.dm] 29 Oct 2012

arxiv: v1 [cs.dm] 29 Oct 2012 arxiv:1210.7684v1 [cs.dm] 29 Oct 2012 Square-Root Finding Problem In Graphs, A Complete Dichotomy Theorem. Babak Farzad 1 and Majid Karimi 2 Department of Mathematics Brock University, St. Catharines,

More information

Evolutionary Tree Analysis. Overview

Evolutionary Tree Analysis. Overview CSI/BINF 5330 Evolutionary Tree Analysis Young-Rae Cho Associate Professor Department of Computer Science Baylor University Overview Backgrounds Distance-Based Evolutionary Tree Reconstruction Character-Based

More information

Trees. A tree is a graph which is. (a) Connected and. (b) has no cycles (acyclic).

Trees. A tree is a graph which is. (a) Connected and. (b) has no cycles (acyclic). Trees A tree is a graph which is (a) Connected and (b) has no cycles (acyclic). 1 Lemma 1 Let the components of G be C 1, C 2,..., C r, Suppose e = (u, v) / E, u C i, v C j. (a) i = j ω(g + e) = ω(g).

More information

Boolean decision diagrams and SAT-based representations

Boolean decision diagrams and SAT-based representations Boolean decision diagrams and SAT-based representations 4th July 200 So far we have seen Kripke Structures 2 Temporal logics (and their semantics over Kripke structures) 3 Model checking of these structures

More information

On the Space Complexity of Parameterized Problems

On the Space Complexity of Parameterized Problems On the Space Complexity of Parameterized Problems Michael Elberfeld Christoph Stockhusen Till Tantau Institute for Theoretical Computer Science Universität zu Lübeck D-23538 Lübeck, Germany {elberfeld,stockhus,tantau}@tcs.uni-luebeck.de

More information

Restricted trees: simplifying networks with bottlenecks

Restricted trees: simplifying networks with bottlenecks Restricted trees: simplifying networks with bottlenecks Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA swillson@iastate.edu February 17, 2011 Abstract. Suppose N

More information

From graph classes to phylogenetic networks Philippe Gambette

From graph classes to phylogenetic networks Philippe Gambette 40 années d'algorithmique de graphes 40 Years of Graphs and Algorithms 11/10/2018 - Paris From graph classes to phylogenetic networks Philippe Gambette Outline Discovering graph classes with Michel An

More information

Integer Programming for Phylogenetic Network Problems

Integer Programming for Phylogenetic Network Problems Integer Programming for Phylogenetic Network Problems D. Gusfield University of California, Davis Presented at the National University of Singapore, July 27, 2015.! There are many important phylogeny problems

More information

Limitations of Algorithm Power

Limitations of Algorithm Power Limitations of Algorithm Power Objectives We now move into the third and final major theme for this course. 1. Tools for analyzing algorithms. 2. Design strategies for designing algorithms. 3. Identifying

More information

The Spanning Galaxy Problem

The Spanning Galaxy Problem The Spanning Galay Problem Daniel Gonçalves Frédéric Havet Aleandre Pinlou Stéphan Thomassé August 23, 2009 Abstract In a directed graph, a star is an arborescence with at least one arc, in which the root

More information

Solving the Maximum Agreement Subtree and Maximum Comp. Tree problems on bounded degree trees. Sylvain Guillemot, François Nicolas.

Solving the Maximum Agreement Subtree and Maximum Comp. Tree problems on bounded degree trees. Sylvain Guillemot, François Nicolas. Solving the Maximum Agreement Subtree and Maximum Compatible Tree problems on bounded degree trees LIRMM, Montpellier France 4th July 2006 Introduction The Mast and Mct problems: given a set of evolutionary

More information

Acyclic Digraphs arising from Complete Intersections

Acyclic Digraphs arising from Complete Intersections Acyclic Digraphs arising from Complete Intersections Walter D. Morris, Jr. George Mason University wmorris@gmu.edu July 8, 2016 Abstract We call a directed acyclic graph a CI-digraph if a certain affine

More information

NOTE ON THE HYBRIDIZATION NUMBER AND SUBTREE DISTANCE IN PHYLOGENETICS

NOTE ON THE HYBRIDIZATION NUMBER AND SUBTREE DISTANCE IN PHYLOGENETICS NOTE ON THE HYBRIDIZATION NUMBER AND SUBTREE DISTANCE IN PHYLOGENETICS PETER J. HUMPHRIES AND CHARLES SEMPLE Abstract. For two rooted phylogenetic trees T and T, the rooted subtree prune and regraft distance

More information

Monadic Second Order Logic on Graphs with Local Cardinality Constraints

Monadic Second Order Logic on Graphs with Local Cardinality Constraints Monadic Second Order Logic on Graphs with Local Cardinality Constraints STEFAN SZEIDER Vienna University of Technology, Austria We introduce the class of MSO-LCC problems which are problems of the following

More information

arxiv: v2 [cs.dm] 12 Jul 2014

arxiv: v2 [cs.dm] 12 Jul 2014 Interval Scheduling and Colorful Independent Sets arxiv:1402.0851v2 [cs.dm] 12 Jul 2014 René van Bevern 1, Matthias Mnich 2, Rolf Niedermeier 1, and Mathias Weller 3 1 Institut für Softwaretechnik und

More information

X X (2) X Pr(X = x θ) (3)

X X (2) X Pr(X = x θ) (3) Notes for 848 lecture 6: A ML basis for compatibility and parsimony Notation θ Θ (1) Θ is the space of all possible trees (and model parameters) θ is a point in the parameter space = a particular tree

More information

Finding a gene tree in a phylogenetic network Philippe Gambette

Finding a gene tree in a phylogenetic network Philippe Gambette LRI-LIX BioInfo Seminar 19/01/2017 - Palaiseau Finding a gene tree in a phylogenetic network Philippe Gambette Outline Phylogenetic networks Classes of phylogenetic networks The Tree Containment Problem

More information

Lower bounds for polynomial kernelization

Lower bounds for polynomial kernelization Lower bounds for polynomial kernelization Part 2 Micha l Pilipczuk Institutt for Informatikk, Universitetet i Bergen August 20 th, 2014 Micha l Pilipczuk Kernelization lower bounds, part 2 1/33 Outline

More information

RECOVERING NORMAL NETWORKS FROM SHORTEST INTER-TAXA DISTANCE INFORMATION

RECOVERING NORMAL NETWORKS FROM SHORTEST INTER-TAXA DISTANCE INFORMATION RECOVERING NORMAL NETWORKS FROM SHORTEST INTER-TAXA DISTANCE INFORMATION MAGNUS BORDEWICH, KATHARINA T. HUBER, VINCENT MOULTON, AND CHARLES SEMPLE Abstract. Phylogenetic networks are a type of leaf-labelled,

More information

Lecture 24 : Even more reductions

Lecture 24 : Even more reductions COMPSCI 330: Design and Analysis of Algorithms December 5, 2017 Lecture 24 : Even more reductions Lecturer: Yu Cheng Scribe: Will Wang 1 Overview Last two lectures, we showed the technique of reduction

More information

Problem Set 3 Due: Wednesday, October 22nd, 2014

Problem Set 3 Due: Wednesday, October 22nd, 2014 6.89: Algorithmic Lower Bounds Fall 24 Prof. Erik Demaine TAs: Sarah Eisenstat, Jayson Lynch Problem Set 3 Due: Wednesday, October 22nd, 24 Problem. A Tour of Hamiltonicity Variants For each of the following

More information

Solving the Tree Containment Problem for Genetically Stable Networks in Quadratic Time

Solving the Tree Containment Problem for Genetically Stable Networks in Quadratic Time Solving the Tree Containment Problem for Genetically Stable Networks in Quadratic Time Philippe Gambette, Andreas D.M. Gunawan, Anthony Labarre, Stéphane Vialette, Louxin Zhang To cite this version: Philippe

More information

Graph. Supply Vertices and Demand Vertices. Supply Vertices. Demand Vertices

Graph. Supply Vertices and Demand Vertices. Supply Vertices. Demand Vertices Partitioning Graphs of Supply and Demand Generalization of Knapsack Problem Takao Nishizeki Tohoku University Graph Supply Vertices and Demand Vertices Supply Vertices Demand Vertices Graph Each Supply

More information

Theory of Computation Time Complexity

Theory of Computation Time Complexity Theory of Computation Time Complexity Bow-Yaw Wang Academia Sinica Spring 2012 Bow-Yaw Wang (Academia Sinica) Time Complexity Spring 2012 1 / 59 Time for Deciding a Language Let us consider A = {0 n 1

More information

Detecting Backdoor Sets with Respect to Horn and Binary Clauses

Detecting Backdoor Sets with Respect to Horn and Binary Clauses Detecting Backdoor Sets with Respect to Horn and Binary Clauses Naomi Nishimura 1,, Prabhakar Ragde 1,, and Stefan Szeider 2, 1 School of Computer Science, University of Waterloo, Waterloo, Ontario, N2L

More information

Comp487/587 - Boolean Formulas

Comp487/587 - Boolean Formulas Comp487/587 - Boolean Formulas 1 Logic and SAT 1.1 What is a Boolean Formula Logic is a way through which we can analyze and reason about simple or complicated events. In particular, we are interested

More information

CS21 Decidability and Tractability

CS21 Decidability and Tractability CS21 Decidability and Tractability Lecture 20 February 23, 2018 February 23, 2018 CS21 Lecture 20 1 Outline the complexity class NP NP-complete probelems: Subset Sum NP-complete problems: NAE-3-SAT, max

More information

arxiv: v1 [cs.ds] 1 Nov 2018

arxiv: v1 [cs.ds] 1 Nov 2018 An O(nlogn) time Algorithm for computing the Path-length Distance between Trees arxiv:1811.00619v1 [cs.ds] 1 Nov 2018 David Bryant Celine Scornavacca November 5, 2018 Abstract Tree comparison metrics have

More information

A CLUSTER REDUCTION FOR COMPUTING THE SUBTREE DISTANCE BETWEEN PHYLOGENIES

A CLUSTER REDUCTION FOR COMPUTING THE SUBTREE DISTANCE BETWEEN PHYLOGENIES A CLUSTER REDUCTION FOR COMPUTING THE SUBTREE DISTANCE BETWEEN PHYLOGENIES SIMONE LINZ AND CHARLES SEMPLE Abstract. Calculating the rooted subtree prune and regraft (rspr) distance between two rooted binary

More information

On the Subnet Prune and Regraft distance

On the Subnet Prune and Regraft distance On the Subnet Prune and Regraft distance Jonathan Klawitter and Simone Linz Department of Computer Science, University of Auckland, New Zealand jo. klawitter@ gmail. com, s. linz@ auckland. ac. nz arxiv:805.07839v

More information

A 3-APPROXIMATION ALGORITHM FOR THE SUBTREE DISTANCE BETWEEN PHYLOGENIES. 1. Introduction

A 3-APPROXIMATION ALGORITHM FOR THE SUBTREE DISTANCE BETWEEN PHYLOGENIES. 1. Introduction A 3-APPROXIMATION ALGORITHM FOR THE SUBTREE DISTANCE BETWEEN PHYLOGENIES MAGNUS BORDEWICH 1, CATHERINE MCCARTIN 2, AND CHARLES SEMPLE 3 Abstract. In this paper, we give a (polynomial-time) 3-approximation

More information

The Complexity of Maximum. Matroid-Greedoid Intersection and. Weighted Greedoid Maximization

The Complexity of Maximum. Matroid-Greedoid Intersection and. Weighted Greedoid Maximization Department of Computer Science Series of Publications C Report C-2004-2 The Complexity of Maximum Matroid-Greedoid Intersection and Weighted Greedoid Maximization Taneli Mielikäinen Esko Ukkonen University

More information

More on NP and Reductions

More on NP and Reductions Indian Institute of Information Technology Design and Manufacturing, Kancheepuram Chennai 600 127, India An Autonomous Institute under MHRD, Govt of India http://www.iiitdm.ac.in COM 501 Advanced Data

More information

Phylogenetics: Parsimony and Likelihood. COMP Spring 2016 Luay Nakhleh, Rice University

Phylogenetics: Parsimony and Likelihood. COMP Spring 2016 Luay Nakhleh, Rice University Phylogenetics: Parsimony and Likelihood COMP 571 - Spring 2016 Luay Nakhleh, Rice University The Problem Input: Multiple alignment of a set S of sequences Output: Tree T leaf-labeled with S Assumptions

More information

NP Completeness. CS 374: Algorithms & Models of Computation, Spring Lecture 23. November 19, 2015

NP Completeness. CS 374: Algorithms & Models of Computation, Spring Lecture 23. November 19, 2015 CS 374: Algorithms & Models of Computation, Spring 2015 NP Completeness Lecture 23 November 19, 2015 Chandra & Lenny (UIUC) CS374 1 Spring 2015 1 / 37 Part I NP-Completeness Chandra & Lenny (UIUC) CS374

More information

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming Lecturer: Matt Weinberg Scribe: Sanjeev Arora One of the running themes in this course is

More information

CS 301: Complexity of Algorithms (Term I 2008) Alex Tiskin Harald Räcke. Hamiltonian Cycle. 8.5 Sequencing Problems. Directed Hamiltonian Cycle

CS 301: Complexity of Algorithms (Term I 2008) Alex Tiskin Harald Räcke. Hamiltonian Cycle. 8.5 Sequencing Problems. Directed Hamiltonian Cycle 8.5 Sequencing Problems Basic genres. Packing problems: SET-PACKING, INDEPENDENT SET. Covering problems: SET-COVER, VERTEX-COVER. Constraint satisfaction problems: SAT, 3-SAT. Sequencing problems: HAMILTONIAN-CYCLE,

More information

Regular networks are determined by their trees

Regular networks are determined by their trees Regular networks are determined by their trees Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA swillson@iastate.edu February 17, 2009 Abstract. A rooted acyclic digraph

More information

Lecture 15 - NP Completeness 1

Lecture 15 - NP Completeness 1 CME 305: Discrete Mathematics and Algorithms Instructor: Professor Aaron Sidford (sidford@stanford.edu) February 29, 2018 Lecture 15 - NP Completeness 1 In the last lecture we discussed how to provide

More information

The complexity of enumeration and counting for acyclic conjunctive queries

The complexity of enumeration and counting for acyclic conjunctive queries The complexity of enumeration and counting for acyclic conjunctive queries Cambridge, march 2012 Counting and enumeration Counting Output the number of solutions Example : sat, Perfect Matching, Permanent,

More information

Approximating Longest Directed Path

Approximating Longest Directed Path Electronic Colloquium on Computational Complexity, Report No. 3 (003) Approximating Longest Directed Path Andreas Björklund Thore Husfeldt Sanjeev Khanna April 6, 003 Abstract We investigate the hardness

More information

Limits to Approximability: When Algorithms Won't Help You. Note: Contents of today s lecture won t be on the exam

Limits to Approximability: When Algorithms Won't Help You. Note: Contents of today s lecture won t be on the exam Limits to Approximability: When Algorithms Won't Help You Note: Contents of today s lecture won t be on the exam Outline Limits to Approximability: basic results Detour: Provers, verifiers, and NP Graph

More information

Inferring a level-1 phylogenetic network from a dense set of rooted triplets

Inferring a level-1 phylogenetic network from a dense set of rooted triplets Theoretical Computer Science 363 (2006) 60 68 www.elsevier.com/locate/tcs Inferring a level-1 phylogenetic network from a dense set of rooted triplets Jesper Jansson a,, Wing-Kin Sung a,b, a School of

More information

Reconstructing Phylogenetic Networks Using Maximum Parsimony

Reconstructing Phylogenetic Networks Using Maximum Parsimony Reconstructing Phylogenetic Networks Using Maximum Parsimony Luay Nakhleh Guohua Jin Fengmei Zhao John Mellor-Crummey Department of Computer Science, Rice University Houston, TX 77005 {nakhleh,jin,fzhao,johnmc}@cs.rice.edu

More information

The maximum edge-disjoint paths problem in complete graphs

The maximum edge-disjoint paths problem in complete graphs Theoretical Computer Science 399 (2008) 128 140 www.elsevier.com/locate/tcs The maximum edge-disjoint paths problem in complete graphs Adrian Kosowski Department of Algorithms and System Modeling, Gdańsk

More information

lamsade 331 Décembre 2012 Laboratoire d Analyses et Modélisation de Systèmes pour l Aide à la Décision UMR 7243

lamsade 331 Décembre 2012 Laboratoire d Analyses et Modélisation de Systèmes pour l Aide à la Décision UMR 7243 lamsade Laboratoire d Analyses et Modélisation de Systèmes pour l Aide à la Décision UMR 7243 CAHIER DU LAMSADE 331 Décembre 2012 Moderate exponential time approximation and branching algorithms B. Escoffier,

More information

On the Computational Hardness of Graph Coloring

On the Computational Hardness of Graph Coloring On the Computational Hardness of Graph Coloring Steven Rutherford June 3, 2011 Contents 1 Introduction 2 2 Turing Machine 2 3 Complexity Classes 3 4 Polynomial Time (P) 4 4.1 COLORED-GRAPH...........................

More information

ACYCLIC DIGRAPHS GIVING RISE TO COMPLETE INTERSECTIONS

ACYCLIC DIGRAPHS GIVING RISE TO COMPLETE INTERSECTIONS ACYCLIC DIGRAPHS GIVING RISE TO COMPLETE INTERSECTIONS WALTER D. MORRIS, JR. ABSTRACT. We call a directed acyclic graph a CIdigraph if a certain affine semigroup ring defined by it is a complete intersection.

More information

The complexity of acyclic subhypergraph problems

The complexity of acyclic subhypergraph problems The complexity of acyclic subhypergraph problems David Duris and Yann Strozecki Équipe de Logique Mathématique (FRE 3233) - Université Paris Diderot-Paris 7 {duris,strozecki}@logique.jussieu.fr Abstract.

More information

1 Review of Vertex Cover

1 Review of Vertex Cover CS266: Parameterized Algorithms and Complexity Stanford University Lecture 3 Tuesday, April 9 Scribe: Huacheng Yu Spring 2013 1 Review of Vertex Cover In the last lecture, we discussed FPT algorithms for

More information

REPORT MAS-R0404 DECEMBER

REPORT MAS-R0404 DECEMBER C e n t r u m v o o r W i s k u n d e e n I n f o r m a t i c a MAS Modelling, Analysis and Simulation Modelling, Analysis and Simulation Approximating the minimal-cost sensor-selection for discrete-event

More information

Shorelines of islands of tractability: Algorithms for parsimony and minimum perfect phylogeny haplotyping problems

Shorelines of islands of tractability: Algorithms for parsimony and minimum perfect phylogeny haplotyping problems 1 arxiv:q-bio/0605024v3 [q-bio.ot] 12 Jan 2007 Shorelines of islands of tractability: Algorithms for parsimony and minimum perfect phylogeny haplotyping problems Leo van Iersel, Judith Keijsper, Steven

More information

Lecture 22: Counting

Lecture 22: Counting CS 710: Complexity Theory 4/8/2010 Lecture 22: Counting Instructor: Dieter van Melkebeek Scribe: Phil Rydzewski & Chi Man Liu Last time we introduced extractors and discussed two methods to construct them.

More information

HW Graph Theory SOLUTIONS (hbovik) - Q

HW Graph Theory SOLUTIONS (hbovik) - Q 1, Diestel 3.5: Deduce the k = 2 case of Menger s theorem (3.3.1) from Proposition 3.1.1. Let G be 2-connected, and let A and B be 2-sets. We handle some special cases (thus later in the induction if these

More information

5. Complexity Theory

5. Complexity Theory Course: Combinatorial Optimisation DOCUMENT LAST MODIFIED: MARCH 2, 2011 VU University Amsterdam 5. Complexity Theory Guido Schäfer March 2011 These lecture notes cover the material of the lectures given

More information

Efficient Parsimony-based Methods for Phylogenetic Network Reconstruction

Efficient Parsimony-based Methods for Phylogenetic Network Reconstruction BIOINFORMATICS Vol. 00 no. 00 2006 Pages 1 7 Efficient Parsimony-based Methods for Phylogenetic Network Reconstruction Guohua Jin a, Luay Nakhleh a, Sagi Snir b, Tamir Tuller c a Dept. of Computer Science,

More information

The Complexity of Mining Maximal Frequent Subgraphs

The Complexity of Mining Maximal Frequent Subgraphs The Complexity of Mining Maximal Frequent Subgraphs Benny Kimelfeld IBM Research Almaden kimelfeld@us.ibm.com Phokion G. Kolaitis UC Santa Cruz & IBM Research Almaden kolaitis@cs.ucsc.edu ABSTRACT A frequent

More information

On vertex coloring without monochromatic triangles

On vertex coloring without monochromatic triangles On vertex coloring without monochromatic triangles arxiv:1710.07132v1 [cs.ds] 19 Oct 2017 Micha l Karpiński, Krzysztof Piecuch Institute of Computer Science University of Wroc law Joliot-Curie 15, 50-383

More information

CS 161: Design and Analysis of Algorithms

CS 161: Design and Analysis of Algorithms CS 161: Design and Analysis of Algorithms NP- Complete I P, NP Polynomial >me reduc>ons NP- Hard, NP- Complete Sat/ 3- Sat Decision Problem Suppose there is a func>on A that outputs True or False A decision

More information

CS154, Lecture 15: Cook-Levin Theorem SAT, 3SAT

CS154, Lecture 15: Cook-Levin Theorem SAT, 3SAT CS154, Lecture 15: Cook-Levin Theorem SAT, 3SAT Definition: A language B is NP-complete if: 1. B NP 2. Every A in NP is poly-time reducible to B That is, A P B When this is true, we say B is NP-hard On

More information

Complexity and Approximation of the Minimum Recombination Haplotype Configuration Problem

Complexity and Approximation of the Minimum Recombination Haplotype Configuration Problem Complexity and Approximation of the Minimum Recombination Haplotype Configuration Problem Lan Liu 1, Xi Chen 3, Jing Xiao 3, and Tao Jiang 1,2 1 Department of Computer Science and Engineering, University

More information

HARDNESS AND ALGORITHMS FOR RAINBOW CONNECTIVITY

HARDNESS AND ALGORITHMS FOR RAINBOW CONNECTIVITY HARDNESS AND ALGORITHMS FOR RAINBOW CONNECTIVITY SOURAV CHAKRABORTY 1 AND ELDAR FISCHER 1 AND ARIE MATSLIAH 2 AND RAPHAEL YUSTER 3 1 Department of Computer Science, Technion, Haifa 32000, Israel. E-mail

More information

Computational Complexity

Computational Complexity Computational Complexity Algorithm performance and difficulty of problems So far we have seen problems admitting fast algorithms flow problems, shortest path, spanning tree... and other problems for which

More information

Partitioning Metric Spaces

Partitioning Metric Spaces Partitioning Metric Spaces Computational and Metric Geometry Instructor: Yury Makarychev 1 Multiway Cut Problem 1.1 Preliminaries Definition 1.1. We are given a graph G = (V, E) and a set of terminals

More information

Bayesian Networks Factor Graphs the Case-Factor Algorithm and the Junction Tree Algorithm

Bayesian Networks Factor Graphs the Case-Factor Algorithm and the Junction Tree Algorithm Bayesian Networks Factor Graphs the Case-Factor Algorithm and the Junction Tree Algorithm 1 Bayesian Networks We will use capital letters for random variables and lower case letters for values of those

More information

k-distinct In- and Out-Branchings in Digraphs

k-distinct In- and Out-Branchings in Digraphs k-distinct In- and Out-Branchings in Digraphs Gregory Gutin 1, Felix Reidl 2, and Magnus Wahlström 1 arxiv:1612.03607v2 [cs.ds] 21 Apr 2017 1 Royal Holloway, University of London, UK 2 North Carolina State

More information

DRAFT. Algebraic computation models. Chapter 14

DRAFT. Algebraic computation models. Chapter 14 Chapter 14 Algebraic computation models Somewhat rough We think of numerical algorithms root-finding, gaussian elimination etc. as operating over R or C, even though the underlying representation of the

More information

Institute of Operating Systems and Computer Networks Algorithms Group. Network Algorithms. Tutorial 3: Shortest paths and other stuff

Institute of Operating Systems and Computer Networks Algorithms Group. Network Algorithms. Tutorial 3: Shortest paths and other stuff Institute of Operating Systems and Computer Networks Algorithms Group Network Algorithms Tutorial 3: Shortest paths and other stuff Christian Rieck Shortest paths: Dijkstra s algorithm 2 Dijkstra s algorithm

More information

NP-Completeness I. Lecture Overview Introduction: Reduction and Expressiveness

NP-Completeness I. Lecture Overview Introduction: Reduction and Expressiveness Lecture 19 NP-Completeness I 19.1 Overview In the past few lectures we have looked at increasingly more expressive problems that we were able to solve using efficient algorithms. In this lecture we introduce

More information

Enumerating the edge-colourings and total colourings of a regular graph

Enumerating the edge-colourings and total colourings of a regular graph Enumerating the edge-colourings and total colourings of a regular graph S. Bessy and F. Havet November 5, 2011 Abstract In this paper, we are interested in computing the number of edge colourings and total

More information

Branchwidth of graphic matroids.

Branchwidth of graphic matroids. Branchwidth of graphic matroids. Frédéric Mazoit and Stéphan Thomassé Abstract Answering a question of Geelen, Gerards, Robertson and Whittle [2], we prove that the branchwidth of a bridgeless graph is

More information

Phylogenetic Tree Reconstruction

Phylogenetic Tree Reconstruction I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven

More information

I. Introduction to NP Functions and Local Search

I. Introduction to NP Functions and Local Search I. Introduction to NP Functions and Local Search (UCSD) sbuss@math.ucsd.edu Prague, September 2009 NP Functions TFNP [JPY 88, Papadimitriou 94]. Definition TFNP, the class of Total NP Functions is the

More information

Classes of Problems. CS 461, Lecture 23. NP-Hard. Today s Outline. We can characterize many problems into three classes:

Classes of Problems. CS 461, Lecture 23. NP-Hard. Today s Outline. We can characterize many problems into three classes: Classes of Problems We can characterize many problems into three classes: CS 461, Lecture 23 Jared Saia University of New Mexico P is the set of yes/no problems that can be solved in polynomial time. Intuitively

More information

Beyond Galled Trees Decomposition and Computation of Galled Networks

Beyond Galled Trees Decomposition and Computation of Galled Networks Beyond Galled Trees Decomposition and Computation of Galled Networks Daniel H. Huson & Tobias H.Kloepper RECOMB 2007 1 Copyright (c) 2008 Daniel Huson. Permission is granted to copy, distribute and/or

More information

Learning Large-Alphabet and Analog Circuits with Value Injection Queries

Learning Large-Alphabet and Analog Circuits with Value Injection Queries Learning Large-Alphabet and Analog Circuits with Value Injection Queries Dana Angluin 1 James Aspnes 1, Jiang Chen 2, Lev Reyzin 1,, 1 Computer Science Department, Yale University {angluin,aspnes}@cs.yale.edu,

More information

Digital search trees JASS

Digital search trees JASS Digital search trees Analysis of different digital trees with Rice s integrals. JASS Nicolai v. Hoyningen-Huene 28.3.2004 28.3.2004 JASS 04 - Digital search trees 1 content Tree Digital search tree: Definition

More information

Acyclic orientations of graphs

Acyclic orientations of graphs Discrete Mathematics 306 2006) 905 909 www.elsevier.com/locate/disc Acyclic orientations of graphs Richard P. Stanley Department of Mathematics, University of California, Berkeley, Calif. 94720, USA Abstract

More information