Evolutionary Trees. Evolutionary tree. To describe the evolutionary relationship among species A 3 A 2 A 4. R.C.T. Lee and Chin Lung Lu


 Brice Griffin
 2 years ago
 Views:
Transcription
1 Evolutionary Trees R.C.T. Lee and Chin Lung Lu CS 5313 Algorithms for Molecular Biology Evolutionary Trees p.1 Evolutionary tree To describe the evolutionary relationship among species Root A 3 Bifurcating Speciation Extinct Ancestor A 4 Internal Node A4 A Extinct Ancestor Leaf Extant Species Evolutionary Trees p.
2 Construction of Evolutionary Tree Man Monkey Dog Horse Donkey Pig Rabbit Kangaroo 3.5 Man Monkey Dog Horse Donkey Pig Rabbit Kangaroo Distance Matrix M 0 Kangaroo , Man Monkey Horse Donkey Dog Pig Rabbit Evolutionary Tree T Evolutionary Trees p.3 Rooted Evolutionary Tree The degree of each internal node i, except the root node. A 1 A A 3 A4 Evolutionary Trees p.4
3 Unrooted Evolutionary Tree Thedegreeofeachinternalnodeis3. A A 4 A 3 Evolutionary Trees p.5 Number of unrooted trees n = n =3 No of Trees Structure of Trees No of Edges ➊ n =4 3 5 Evolutionary Trees p.6
4 Number of unrooted trees How to add a new species into the tree? ➋ s 3 If a new species is added to an unrooted tree, the number of edges is increased by. NE(n): number of edges of an unrooted tree with n species By induction, wehavene(n) =n 3. Evolutionary Trees p.7 Number of unrooted trees ➌ TU(n): number of unrooted trees for n species Since NE(n) =n 3, wehave TU(n +1)=(n 3)TU(n) or TU(n) =(n 5)TU(n 1) That is, TU(n) =(n 5)(n 7) 1 Evolutionary Trees p.8
5 Number of unrooted trees ➍ No of Species No of Unrooted Trees ,190,83,353,69, ,898,783,96,510, ,33,659,870,76,850,65 0 1,643,095,476,699,771,875 Evolutionary Trees p.9 Change unrooted into rooted ➊ ➋ root root ➌ root Evolutionary Trees p.10
6 Number of rooted trees TR(n): number of rooted trees for n species Since there are n 3 edges in every unrooted tree for n species, we have ➊ TR(n) = (n 3)TU(n) =(n 3)(n 5)(n 7) 1 = TU(n +1) Evolutionary Trees p.11 Number of rooted trees ➋ No of Species No of Rooted Trees ,898,783,96,510, ,33,659,870,76,850, ,643,095,476,699,771, ,00,794,53,637,891,559,375 Evolutionary Trees p.1
7 Evolution Tree Problem Input: A distance matrix M of n species Output: Find an evolution tree T under some criterion Basic criterion: d T (s i,s j ) d M (s i,s j ) d M (s i,s j ): the distance between species s i and s j in M d T (s i,s j ): the distance between s i and s j in T Further criterion: when two species are close to each other in the distance matrix, they should be close in the evolution tree Evolutionary Trees p.13 Criteria of Evolution Tree MINIMAX evolution tree: an evolutionary tree with d T (s i,s j ) d M (s i,s j ) such that max [d T (s i,s j ) d M (s i,s j )] is minimized 1 i<j n MINISUM evolution tree: an evolutionary tree with d T (s i,s j ) d M (s i,s j ) such that 1 i<j n d T (s i,s j ) is minimized MINISIZE evolution tree: an evolutionary tree with d T (s i,s j ) d M (s i,s j ) such that the total length of the tree is minimized Evolutionary Trees p.14
8 Complexities of Evolutionary Tree MINIMAX MINISUM MINISIZE Rooted O(n ) NPC NPc Unrooted NPC NPC? It is still unknown whether the unrooted MINISIZE evolutionary tree problem is NPcomplete. Evolutionary Trees p.15 Minimax rooted tree algorithm Input: a distance matrix of a set S of n species 1. If S contains only one species, return it as the tree.. Find a minimal spanning tree T of S. 3. Find the longest d(s i,s j ) in the distance matrix. Find the longest edge e in the path linking s i and s j in T. Let S i and S j be the two sets of species obtained by breaking edge e. 4. Use this algorithm recursively to find subtrees T i and T j for S i and S j respectively. 5. Construct a rooted tree with T i and T j as subtrees such that dt(s i,s j )=d(s i,s j ). Evolutionary Trees p.16
9 Basic priciple of Minimax Let s i and s j be the two species which have the longest distance in the distance matrix. 1 d(s i,s j ) T i T j s i s j The longest distance is exactly preserved. Evolutionary Trees p.17 Example of Minimax algorithm 1. Consider the distance matrix as follows: ➊ Construct a minimal spanning tree T as follows: 3 1 Evolutionary Trees p.18
10 Example of Minimax algorithm 3. The distance between and is the longest. ➋ The path linking and in T in which (, ) is the longest edge. 3 1 Evolutionary Trees p.19 Example of Minimax algorithm 5. Breaking (, ) obtains two subsets of species 1 ➌ S 1 S 6. Construct two subtrees T 1 and T for S 1 and S respectively T 1 T Evolutionary Trees p.0
11 Example of Minimax algorithm 7. Combine T 1 and T by making sure that dt(, )=d(, )=5 ➍ Evolutionary Trees p.1 Determination of edge weights How to determine the edge weights if the evolutionary tree structure is given? Example: Givena distance matrix and an unrooted evolutionary tree, determne the edge weights such that the tree size is minimized x 1 x 3 x 4 x x 5 Evolutionary Trees p.
12 Determination of edge weights x 4 x 5 Determine x i by linear programming Minimize x 1 + x + x 3 + x 4 + x 5 x 1 + x d 1 x 3 x 1 x Subject to x 1 + x 3 + x 4 d 13 x 1 + x 3 + x 5 d 14 x + x 3 + x 4 d 3 Unrooted Tree x + x 3 + x 5 d 4 x 4 + x 5 d 34 Evolutionary Trees p.3 Determination of edge weights Minimize x 1 + x + x 3 + x 4 + x 5 + x 6 Subject to x 1 + x d 1 x 5 x 6 x 1 x x 3 x 4 Rooted Tree x 1 + x 5 + x 6 + x 3 d 13 x 1 + x 5 + x 6 + x 4 d 14 x + x 5 + x 6 + x 3 d 3 x + x 5 + x 6 + x 4 d 4 x 3 + x 4 d 34 x 5 + x 1 = x 5 + x = x 6 + x 3 = x 6 + x 4 Evolutionary Trees p.4
13 UPGMA: create a rooted tree UPGMA: Unweighted Pair Group Method with Arithmetic Mean Example: consider the distance matrix as follows Output: a rooted evolutionary tree ➊ Evolutionary Trees p.5 UPGMA: create a rooted tree ➋ 1. Select the pair of species (, ) with the smallest distance andconstructarootedtreewith and as leaf nodes (, ) 1 d(, )=1 Evolutionary Trees p.6
14 UPGMA: create a rooted tree. Consider (, ) as a new species and the distances are updated as follows: d(, (, )) = 1 (d(, )+d(, )) = 1 (4 + 3) = 3.5 d(, (, )) = 1 (d(, )+d(, )) = 1 (6 + 5) = 5.5 (, ) (, ) 0 ➌ Evolutionary Trees p.7 UPGMA: create a rooted tree ➍ 3. Select the pair of species (, (, )) with the smallest distance and construct a rooted as follows: (, (, )) (, ) d(, (, )) = Evolutionary Trees p.8
15 UPGMA: create a rooted tree 4. Since istheonlyspecieleft,thefinaltreewill look like as follows: ➎ d(, (, (, ))) = =.5 1 Evolutionary Trees p.9 Neighbor joining: unrooted tree ➊ Construct a starlike tree as follows: L 1X L X X L 5X s 5 L 3X L 4X Tree length n S 0 = L ix = 1 n 1 i=1 1 i<j n d(s i,s j ) Evolutionary Trees p.30
16 Neighbor joining: unrooted tree ➋ Consider the tree T 1 as follows: L 1Y L 3X Y L XY X L 4X L Y L 5X s 5 [ 1 n L XY = (d(,s k )+d(,s k )) (n ) k=3 ] n (n )(L 1Y + L Y ) L ix i=3 Evolutionary Trees p.31 Neighbor joining: unrooted tree ➌ ThetreelengthS 1 of T 1 is: S 1 = L XY + (L 1Y + L Y ) + = 1 n i=3 L ix n (n ) k=3 (d(,s k )+d(,s k )) + 1 d(, )+ 1 n 3 i<j n d(s i,s j ) L 1Y L 3X Y L XY X L 4X L Y L 5X s 5 Evolutionary Trees p.3
17 Neighbor joining: unrooted tree ➍ Consider (, ) as a new speices s (1) and the distances are updated as follows: for 3 i n d(s (1),s i )= d(,s i )+d(,s i ) L 3X s (1) L (1)X X L 4X L 5X s 5 Evolutionary Trees p.33 Neighbor joining: algorithm ➎ 1. For all pairs of s i and s j, we compute the tree size S ij of tree T ij.( n(n 1) such trees). Choose the pair with the smallest tree size. Suppose S 1 is such a pair. Then consider (, ) as a new speices s (1) and the distances are updated as follows: d(s (1),s i )= d(,s i )+d(,s i ) for 3 i n 3. The number of species is reduced by one and for the new distance matrix, the above procedure is again applied to find the next pair of neighbors until the number of species become. Evolutionary Trees p.34
18 Neighbor joining: example Example: consider the distance matrix as follows ➏ d(s i,s j ) L X X L 3X L 1X L 4X Evolutionary Trees p.35 Neighbor joining: example ➐ S ij L 1Y L 3X Y L XX X L Y L 4X Evolutionary Trees p.36
19 Neighbor joining: example ➑ d(s i,s j ) s (1) 4+6 s (1) 0 =5 3+5 =4 0 0 L 3X s (1) L (1)X X L 4X Evolutionary Trees p.37 Estimate edge lengths Fitch and Margoliash s (1967) mehtod: x z s y d(, )=x + y, d(, )=x + z, d(, )=y + z x = d(, )+d(, ) d(, ) y = d(, )+d(, ) d(, ) z = d(, )+d(, ) d(, ) Evolutionary Trees p.38
20 Estimate edge lengths L 1Y1 = d(, )+d(,s (345) ) d(,s (345) ) d(,s (345) )= d(, )+d(, )+d(,s 5 ) 3 d(,s (345) )= d(, )+d(, )+d(,s 5 ) 3 L Y1 = d(, )+d(,s (345) ) d(,s (345) ) L 1Y1 L 3X Y1 L XY1 X L 4X s (345) L Y1 L 5X s 5 Evolutionary Trees p.39 Estimate edge lengths Compute L (1)Y and L 3Y using the same way. s (1) L 1Y1 L Y1 Y 1 L (1)Y Y L XY X L 4X s (45) L 3Y L 5X s 5 L (1)Y = L 1Y 1 +L Y1 Y +L Y1 +L Y1 Y = L Y1 Y + L 1Y 1 +L Y1 = L Y1 Y + d(, ) L Y1 Y = L (1)Y d(, ) Evolutionary Trees p.40
21 Unrooted minisize: approximation How to design a approximation algorithm for the unrooted minisize evolution tree problem? d(, ) x 1 x 0 0 Evolutionary Trees p.41 Unrooted minisize: approximation Construct a minimal spanning tree out of the distance matrix d(, ) Evolutionary Trees p.4
22 Unrooted minisize: approximation Conduct a bradth first search on the minimal spaning tree by choosing an arbitrary node as root Start from the root and visit all of the first level descendants first, then the second level descendants, and so on ➋ ➊ ➌ ➍ Evolutionary Trees p.43 Unrooted minisize: approximation Transform the minimal spanning tree into an evolutionary tree by adding nodes one by one ➊ ➋ ➌ 3 x 1 x 1 x Evolutionary Trees p.44
23 Unrooted minisize: approximation The created tree above is indeed an evolution tree: Each species is at leaf. Thedegreeofeachinternalnodeisthree. For any two species s i and s j,wehave d T (s i,s j ) d(s i,s j ) because the distance d T (s i,s j ) between any two species s i and s j on the evolution tree is exactly the same as that on the minimal spanning tree, which is then d(s i,s j ) by triangular inequality Evolutionary Trees p.45 Unrooted minisize: approximation How to prove AP P < OP T? AP P : the length of our approximate solution OP T : an optimal solution 1. AP P = MST < TSP MST: the mininal spanning tree of distance matrix TSP: the optimal solution of the travelling salesperson problem for distance matrix. TSP OP T Evolutionary Trees p.46
24 Unrooted minisize: approximation AP P = MST < TSP By construction, AP P = MST TSP: findahamiltonian cycle (visiting all of the nodes exactly once) with minimum length Deleting any edge from TSP obtains a spanning tree Evolutionary Trees p.47 Unrooted minisize: approximation TSP OP T ET: aneuler tour (visiting all of edges exactly once) ofopt( ET = OP T ) CET: the cycle of spcies corresponding to ET ( TSP CET CET is a Hamilt. cycle) CET ET since d T (s i,s j ) d(s i,s j ) Evolutionary Trees p.48
Phylogenetic trees 07/10/13
Phylogenetic trees 07/10/13 A tree is the only figure to occur in On the Origin of Species by Charles Darwin. It is a graphical representation of the evolutionary relationships among entities that share
More informationEvolutionary Tree Analysis. Overview
CSI/BINF 5330 Evolutionary Tree Analysis YoungRae Cho Associate Professor Department of Computer Science Baylor University Overview Backgrounds DistanceBased Evolutionary Tree Reconstruction CharacterBased
More informationDNA Phylogeny. Signals and Systems in Biology Kushal EE, IIT Delhi
DNA Phylogeny Signals and Systems in Biology Kushal Shah @ EE, IIT Delhi Phylogenetics Grouping and Division of organisms Keeps changing with time Splitting, hybridization and termination Cladistics :
More informationPhylogenetic Tree Reconstruction
I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven
More informationCS5238 Combinatorial methods in bioinformatics 2003/2004 Semester 1. Lecture 8: Phylogenetic Tree Reconstruction: Distance Based  October 10, 2003
CS5238 Combinatorial methods in bioinformatics 2003/2004 Semester 1 Lecture 8: Phylogenetic Tree Reconstruction: Distance Based  October 10, 2003 Lecturer: WingKin Sung Scribe: Ning K., Shan T., Xiang
More informationPhylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center
Phylogenetic Analysis Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center Outline Basic Concepts Tree Construction Methods Distancebased methods
More informationJuly 18, Approximation Algorithms (Travelling Salesman Problem)
Approximation Algorithms (Travelling Salesman Problem) July 18, 2014 The travellingsalesman problem Problem: given complete, undirected graph G = (V, E) with nonnegative integer cost c(u, v) for each
More informationBINF6201/8201. Molecular phylogenetic methods
BINF60/80 Molecular phylogenetic methods 0706 Phylogenetics Ø According to the evolutionary theory, all life forms on this planet are related to one another by descent. Ø Traditionally, phylogenetics
More informationVIII. NPcompleteness
VIII. NPcompleteness 1 / 15 NPCompleteness Overview 1. Introduction 2. P and NP 3. NPcomplete (NPC): formal definition 4. How to prove a problem is NPC 5. How to solve a NPC problem: approximate algorithms
More informationTree of Life iological Sequence nalysis Chapter http://tolweb.org/tree/ Phylogenetic Prediction ll organisms on Earth have a common ancestor. ll species are related. The relationship is called a phylogeny
More informationWhat is Phylogenetics
What is Phylogenetics Phylogenetics is the area of research concerned with finding the genetic connections and relationships between species. The basic idea is to compare specific characters (features)
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods
More informationfrom notes written mostly by Dr. Matt Stallmann: All Rights Reserved
CSC 505, Fall 000: Week 0 Objectives: understand problem complexity and classes defined by it understand relationships among decision, witness, evaluation, and optimization problems understand what it
More informationDecision Problems TSP. Instance: A complete graph G with nonnegative edge costs, and an integer
Decision Problems The theory of NPcompleteness deals only with decision problems. Why? Because if a decision problem is hard, then the corresponding optimization problem must be hard too. For example,
More informationPhylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Distance Methods COMP 571  Spring 2015 Luay Nakhleh, Rice University Outline Evolutionary models and distance corrections Distancebased methods Evolutionary Models and Distance Correction
More informationTheory of Evolution Charles Darwin
Theory of Evolution Charles arwin 85859: Origin of Species 5 year voyage of H.M.S. eagle (8336) Populations have variations. Natural Selection & Survival of the fittest: nature selects best adapted varieties
More information9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)
I9 Introduction to Bioinformatics, 0 Phylogenetic ree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & omputing, IUB Evolution theory Speciation Evolution of new organisms is driven by
More information"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky
MOLECULAR PHYLOGENY "Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky EVOLUTION  theory that groups of organisms change over time so that descendeants differ structurally
More informationBioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics
Bioinformatics 1 Biology, Sequences, Phylogenetics Part 4 Sepp Hochreiter Klausur Mo. 30.01.2011 Zeit: 15:30 17:00 Raum: HS14 Anmeldung Kusss Contents Methods and Bootstrapping of Maximum Methods Methods
More informationNP and Computational Intractability
NP and Computational Intractability 1 PolynomialTime Reduction Desiderata'. Suppose we could solve X in polynomialtime. What else could we solve in polynomial time? don't confuse with reduces from Reduction.
More informationAlgorithm Design Strategies V
Algorithm Design Strategies V Joaquim Madeira Version 0.0 October 2016 U. Aveiro, October 2016 1 Overview The 01 Knapsack Problem Revisited The Fractional Knapsack Problem Greedy Algorithms Example Coin
More informationABHELSINKI UNIVERSITY OF TECHNOLOGY
Approximation Algorithms Seminar 1 Set Cover, Steiner Tree and TSP Siert Wieringa siert.wieringa@tkk.fi Approximation Algorithms Seminar 1 1/27 Contents Approximation algorithms for: Set Cover Steiner
More informationRECOVERING NORMAL NETWORKS FROM SHORTEST INTERTAXA DISTANCE INFORMATION
RECOVERING NORMAL NETWORKS FROM SHORTEST INTERTAXA DISTANCE INFORMATION MAGNUS BORDEWICH, KATHARINA T. HUBER, VINCENT MOULTON, AND CHARLES SEMPLE Abstract. Phylogenetic networks are a type of leaflabelled,
More informationECS122A Handout on NPCompleteness March 12, 2018
ECS122A Handout on NPCompleteness March 12, 2018 Contents: I. Introduction II. P and NP III. NPcomplete IV. How to prove a problem is NPcomplete V. How to solve a NPcomplete problem: approximate algorithms
More informationPhylogeny: traditional and Bayesian approaches
Phylogeny: traditional and Bayesian approaches 5Feb2014 DEKM book Notes from Dr. B. John Holder and Lewis, Nature Reviews Genetics 4, 275284, 2003 1 Phylogeny A graph depicting the ancestordescendent
More informationCS/COE
CS/COE 1501 www.cs.pitt.edu/~nlf4/cs1501/ P vs NP But first, something completely different... Some computational problems are unsolvable No algorithm can be written that will always produce the correct
More informationWeek 5: Distance methods, DNA and protein models
Week 5: Distance methods, DNA and protein models Genome 570 February, 2016 Week 5: Distance methods, DNA and protein models p.1/69 A tree and the expected distances it predicts E A 0.08 0.05 0.06 0.03
More informationUNIVERSITY OF YORK. MSc Examinations 2004 MATHEMATICS Networks. Time Allowed: 3 hours.
UNIVERSITY OF YORK MSc Examinations 2004 MATHEMATICS Networks Time Allowed: 3 hours. Answer 4 questions. Standard calculators will be provided but should be unnecessary. 1 Turn over 2 continued on next
More informationBioinformatics 1  lecture 9. Phylogenetic trees Distancebased tree building Parsimony
ioinformatics  lecture 9 Phylogenetic trees istancebased tree building Parsimony (,(,(,))) rees can be represented in "parenthesis notation". Each set of parentheses represents a branchpoint (bifurcation),
More informationPhylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science
Phylogeny and Evolution Gina Cannarozzi ETH Zurich Institute of Computational Science History Aristotle (384322 BC) classified animals. He found that dolphins do not belong to the fish but to the mammals.
More informationPhylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz
Phylogenetic Trees What They Are Why We Do It & How To Do It Presented by Amy Harris Dr Brad Morantz Overview What is a phylogenetic tree Why do we do it How do we do it Methods and programs Parallels
More informationOrganisatorische Details
Organisatorische Details Vorlesung: Di 1314, Do 1012 in DI 205 Übungen: Do 16:1518:00 Laborraum Schanzenstrasse Vorwiegend Programmieren in Matlab/Octave Teilnahme freiwillig. Übungsblätter jeweils
More informationPhylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.
Five Sami Khuri Department of Computer Science San José State University San José, California, USA sami.khuri@sjsu.edu v Distance Methods v Character Methods v Molecular Clock v UPGMA v Maximum Parsimony
More informationEVOLUTIONARY DISTANCES
EVOLUTIONARY DISTANCES FROM STRINGS TO TREES Luca Bortolussi 1 1 Dipartimento di Matematica ed Informatica Università degli studi di Trieste luca@dmi.units.it Trieste, 14 th November 2007 OUTLINE 1 STRINGS:
More informationTheory of Evolution. Charles Darwin
Theory of Evolution harles arwin 85859: Origin of Species 5 year voyage of H.M.S. eagle (86) Populations have variations. Natural Selection & Survival of the fittest: nature selects best adapted varieties
More informationPlan: Evolutionary trees, characters. Perfect phylogeny Methods: NJ, parsimony, max likelihood, Quartet method
Phylogeny 1 Plan: Phylogeny is an important subject. We have 2.5 hours. So I will teach all the concepts via one example of a chain letter evolution. The concepts we will discuss include: Evolutionary
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istancebased methods Ultrametric Additive: UPGMA Transformed istance NeighborJoining Characterbased Maximum Parsimony Maximum Likelihood
More informationExploring Treespace. Katherine St. John. Lehman College & the Graduate Center. City University of New York. 20 June 2011
Exploring Treespace Katherine St. John Lehman College & the Graduate Center City University of New York 20 June 2011 (Joint work with the Treespace Working Group, CUNY: Ann Marie Alcocer, Kadian Brown,
More informationEvolutionary trees. Describe the relationship between objects, e.g. species or genes
Evolutionary trees Bonobo Chimpanzee Human Neanderthal Gorilla Orangutan Describe the relationship between objects, e.g. species or genes Early evolutionary studies Anatomical features were the dominant
More informationSAT, Coloring, Hamiltonian Cycle, TSP
1 SAT, Coloring, Hamiltonian Cycle, TSP Slides by Carl Kingsford Apr. 28, 2014 Sects. 8.2, 8.7, 8.5 2 Boolean Formulas Boolean Formulas: Variables: x 1, x 2, x 3 (can be either true or false) Terms: t
More informationPhylogenetic inference: from sequences to trees
W ESTFÄLISCHE W ESTFÄLISCHE W ILHELMS U NIVERSITÄT NIVERSITÄT WILHELMSU ÜNSTER MM ÜNSTER VOLUTIONARY FUNCTIONAL UNCTIONAL GENOMICS ENOMICS EVOLUTIONARY Bioinformatics 1 Phylogenetic inference: from sequences
More informationCS 220: Discrete Structures and their Applications. Mathematical Induction in zybooks
CS 220: Discrete Structures and their Applications Mathematical Induction 6.4 6.6 in zybooks Why induction? Prove algorithm correctness (CS320 is full of it) The inductive proof will sometimes point out
More informationCS483 Design and Analysis of Algorithms
CS483 Design and Analysis of Algorithms Lectures 1516 Dynamic Programming Instructor: Fei Li lifei@cs.gmu.edu with subject: CS483 Office hours: STII, Room 443, Friday 4:00pm  6:00pm or by appointments
More informationAlgorithms Design & Analysis. Approximation Algorithm
Algorithms Design & Analysis Approximation Algorithm Recap External memory model Merge sort Distribution sort 2 Today s Topics Hard problem Approximation algorithms Metric traveling salesman problem A
More informationMultiple Sequence Alignment. Sequences
Multiple Sequence Alignment Sequences > YOR020c mstllksaksivplmdrvlvqrikaqaktasglylpe knveklnqaevvavgpgftdangnkvvpqvkvgdqvl ipqfggstiklgnddevilfrdaeilakiakd > crassa mattvrsvksliplldrvlvqrvkaeaktasgiflpe
More informationMolecular Evolution and Phylogenetic Tree Reconstruction
1 4 Molecular Evolution and Phylogenetic Tree Reconstruction 3 2 5 1 4 2 3 5 Orthology, Paralogy, Inparalogs, Outparalogs Phylogenetic Trees Nodes: species Edges: time of independent evolution Edge length
More informationCSCI1950 Z Computa4onal Methods for Biology Lecture 5
CSCI1950 Z Computa4onal Methods for Biology Lecture 5 Ben Raphael February 6, 2009 hip://cs.brown.edu/courses/csci1950 z/ Alignment vs. Distance Matrix Mouse: ACAGTGACGCCACACACGT Gorilla: CCTGCGACGTAACAAACGC
More information1 Heuristics for the Traveling Salesman Problem
Praktikum AlgorithmenEntwurf (Teil 9) 09.12.2013 1 1 Heuristics for the Traveling Salesman Problem We consider the following problem. We want to visit all the nodes of a graph as fast as possible, visiting
More informationCS 581 Paper Presentation
CS 581 Paper Presentation Muhammad Samir Khan Recovering the Treelike Trend of Evolution Despite Extensive Lateral Genetic Transfer: A Probabilistic Analysis by Sebastien Roch and Sagi Snir Overview Introduction
More information3.4 Relaxations and bounds
3.4 Relaxations and bounds Consider a generic Discrete Optimization problem z = min{c(x) : x X} with an optimal solution x X. In general, the algorithms generate not only a decreasing sequence of upper
More informationUniversity of New Mexico Department of Computer Science. Final Examination. CS 362 Data Structures and Algorithms Spring, 2007
University of New Mexico Department of Computer Science Final Examination CS 362 Data Structures and Algorithms Spring, 2007 Name: Email: Print your name and email, neatly in the space provided above;
More informationPhylogeny. November 7, 2017
Phylogeny November 7, 2017 Phylogenetics Phylon = tribe/race, genetikos = relative to birth Phylogenetics: study of evolutionary relationships among organisms, sequences, or anything in between Related
More informationCS60007 Algorithm Design and Analysis 2018 Assignment 1
CS60007 Algorithm Design and Analysis 2018 Assignment 1 Palash Dey and Swagato Sanyal Indian Institute of Technology, Kharagpur Please submit the solutions of the problems 6, 11, 12 and 13 (written in
More information15.081J/6.251J Introduction to Mathematical Programming. Lecture 24: Discrete Optimization
15.081J/6.251J Introduction to Mathematical Programming Lecture 24: Discrete Optimization 1 Outline Modeling with integer variables Slide 1 What is a good formulation? Theme: The Power of Formulations
More informationHamiltonian Graphs Graphs
COMP2121 Discrete Mathematics Hamiltonian Graphs Graphs Hubert Chan (Chapter 9.5) [O1 Abstract Concepts] [O2 Proof Techniques] [O3 Basic Analysis Techniques] 1 Hamiltonian Paths and Circuits [O1] A Hamiltonian
More informationPhylogeny: building the tree of life
Phylogeny: building the tree of life Dr. Fayyaz ul Amir Afsar Minhas Department of Computer and Information Sciences Pakistan Institute of Engineering & Applied Sciences PO Nilore, Islamabad, Pakistan
More informationDiscrete Wiskunde II. Lecture 5: Shortest Paths & Spanning Trees
, 2009 Lecture 5: Shortest Paths & Spanning Trees University of Twente m.uetz@utwente.nl wwwhome.math.utwente.nl/~uetzm/dw/ Shortest Path Problem "#$%&'%()*%"()$#+,& Given directed "#$%&'()*+,%+('*.#/'01234564'.*,'7+"%/8',&'5"4'84%#3
More informationD1 Discrete Mathematics The Travelling Salesperson problem. The Nearest Neighbour Algorithm The Lower Bound Algorithm The Tour Improvement Algorithm
1 iscrete Mathematics The Travelling Salesperson problem The Nearest Neighbour lgorithm The Lower ound lgorithm The Tour Improvement lgorithm The Travelling Salesperson: Typically a travelling salesperson
More informationDynamic Programming: Shortest Paths and DFA to Reg Exps
CS 374: Algorithms & Models of Computation, Fall 205 Dynamic Programming: Shortest Paths and DFA to Reg Exps Lecture 7 October 22, 205 Chandra & Manoj (UIUC) CS374 Fall 205 / 54 Part I Shortest Paths with
More informationLecture 4: NP and computational intractability
Chapter 4 Lecture 4: NP and computational intractability Listen to: Find the longest path, Daniel Barret What do we do today: polynomial time reduction NP, conp and NP complete problems some examples
More informationFall 2003 BMI 226 / CS 426 LIMITED VALIDITY STRUCTURES
Notes III1 LIMITED VALIDITY STRUCTURES Notes III2 TRAVELING SALESPERSON PROBLEM (TSP) Given a (symmetric) matrix of distances between N cities Salesperson is to visit each city once and only once Goal
More informationUniversity of Washington March 21, 2013 Department of Computer Science and Engineering CSEP 521, Winter Exam Solution, Monday, March 18, 2013
University of Washington March 21, 2013 Department of Computer Science and Engineering CSEP 521, Winter 2013 Exam Solution, Monday, March 18, 2013 Instructions: NAME: Closed book, closed notes, no calculators
More informationDynamic Programming: Shortest Paths and DFA to Reg Exps
CS 374: Algorithms & Models of Computation, Spring 207 Dynamic Programming: Shortest Paths and DFA to Reg Exps Lecture 8 March 28, 207 Chandra Chekuri (UIUC) CS374 Spring 207 / 56 Part I Shortest Paths
More informationAlgorithms and Theory of Computation. Lecture 22: NPCompleteness (2)
Algorithms and Theory of Computation Lecture 22: NPCompleteness (2) Xiaohui Bei MAS 714 November 8, 2018 Nanyang Technological University MAS 714 November 8, 2018 1 / 20 Set Cover Set Cover Input: a set
More informationGraph Transformations T1 and T2
Graph Transformations T1 and T2 We now introduce two graph transformations T1 and T2. Reducibility by successive application of these two transformations is equivalent to reducibility by intervals. The
More informationEvolutionary trees. Describe the relationship between objects, e.g. species or genes
Evolutionary trees Bonobo Chimpanzee Human Neanderthal Gorilla Orangutan Describe the relationship between objects, e.g. species or genes Early evolutionary studies The evolutionary relationships between
More informationBuilding Phylogenetic Trees UPGMA & NJ
uilding Phylogenetic Trees UPGM & NJ UPGM UPGM Unweighted PairGroup Method with rithmetic mean Unweighted = all pairwise distances contribute equally. PairGroup = groups are combined in pairs. rithmetic
More information8. INTRACTABILITY I. Lecture slides by Kevin Wayne Copyright 2005 PearsonAddison Wesley. Last updated on 2/6/18 2:16 AM
8. INTRACTABILITY I polytime reductions packing and covering problems constraint satisfaction problems sequencing problems partitioning problems graph coloring numerical problems Lecture slides by Kevin
More informationX. Hu, R. Shonkwiler, and M.C. Spruill. School of Mathematics. Georgia Institute of Technology. Atlanta, GA 30332
Approximate Speedup by Independent Identical Processing. Hu, R. Shonkwiler, and M.C. Spruill School of Mathematics Georgia Institute of Technology Atlanta, GA 30332 Running head: Parallel iip Methods Mail
More informationGraduate Algorithms CS F21 NP & Approximation Algorithms
Graduate Algorithms CS6732016F21 NP & Approximation Algorithms David Galles Department of Computer Science University of San Francisco 210: Classes of Problems Consider three problem classes: Polynomial
More information1 Review of Vertex Cover
CS266: Parameterized Algorithms and Complexity Stanford University Lecture 3 Tuesday, April 9 Scribe: Huacheng Yu Spring 2013 1 Review of Vertex Cover In the last lecture, we discussed FPT algorithms for
More informationProof of Theorem 1. Tao Lei CSAIL,MIT. Here we give the proofs of Theorem 1 and other necessary lemmas or corollaries.
Proof of Theorem 1 Tao Lei CSAIL,MIT Here we give the proofs of Theorem 1 and other necessary lemmas or corollaries. Lemma 1 (Reachability) Any two trees y, y are reachable to each other. Specifically,
More informationTraveling Salesman Problem
Traveling Salesman Problem Zdeněk Hanzálek hanzalek@fel.cvut.cz CTU in Prague April 17, 2017 Z. Hanzálek (CTU) Traveling Salesman Problem April 17, 2017 1 / 33 1 Content 2 Solved TSP instances in pictures
More informationWalks in Phylogenetic Treespace
Walks in Phylogenetic Treespace lan Joseph aceres Samantha aley John ejesus Michael Hintze iquan Moore Katherine St. John bstract We prove that the spaces of unrooted phylogenetic trees are Hamiltonian
More informationMechanisms of Evolution Darwinian Evolution
Mechanisms of Evolution Darwinian Evolution Descent with modification by means of natural selection All life has descended from a common ancestor The mechanism of modification is natural selection Concept
More informationPhylogenetics: Parsimony
1 Phylogenetics: Parsimony COMP 571 Luay Nakhleh, Rice University he Problem 2 Input: Multiple alignment of a set S of sequences Output: ree leaflabeled with S Assumptions Characters are mutually independent
More informationEfficient parallel query processing by graph ranking
Fundamenta Informaticae 69 (006) 7 85 7 IOS Press Efficient parallel query processing by graph ranking Dariusz Dereniowski and Marek Kubale Department of Algorithms and System Modeling Faculty of Electronics,
More informationI wonder about Trees Robert Frost
I wonder about Trees Robert Frost We wonder about Robert Frost  Trees Hey! How come no turtles in this tree? Phylogenetic Trees From Darwin s notebooks, 1837 Really? The only figure in On the Origin
More informationWho Has Heard of This Problem? Courtesy: Jeremy Kun
P vs. NP 02201 Who Has Heard of This Problem? Courtesy: Jeremy Kun Runtime Analysis Last time, we saw that there is no solution to the Halting Problem. Halting Problem: Determine if a program will halt.
More information2 Notation and Preliminaries
On Asymmetric TSP: Transformation to Symmetric TSP and Performance Bound Ratnesh Kumar Haomin Li epartment of Electrical Engineering University of Kentucky Lexington, KY 405060046 Abstract We show that
More informationHierarchical Clustering
Hierarchical Clustering Some slides by Serafim Batzoglou 1 From expression profiles to distances From the Raw Data matrix we compute the similarity matrix S. S ij reflects the similarity of the expression
More informationReconstructibility of trees from subtree size frequencies
Stud. Univ. BabeşBolyai Math. 59(2014), No. 4, 435 442 Reconstructibility of trees from subtree size frequencies Dénes Bartha and Péter Burcsi Abstract. Let T be a tree on n vertices. The subtree frequency
More informationImproved maximum parsimony models for phylogenetic networks
Improved maximum parsimony models for phylogenetic networks Leo van Iersel Mark Jones Celine Scornavacca December 20, 207 Abstract Phylogenetic networks are well suited to represent evolutionary histories
More informationToday s Outline. CS 362, Lecture 24. The Reduction. Hamiltonian Cycle. Reduction Wrapup Approximation algorithms for NPHard Problems
Today s Outline CS 362, Lecture 24 Jared Saia University of New Mexico Reduction Wrapup Approximation algorithms for NPHard Problems 1 Hamiltonian Cycle The Reduction A Hamiltonian Cycle in a graph is
More informationNATIONAL UNIVERSITY OF SINGAPORE CS3230 DESIGN AND ANALYSIS OF ALGORITHMS SEMESTER II: Time Allowed 2 Hours
NATIONAL UNIVERSITY OF SINGAPORE CS3230 DESIGN AND ANALYSIS OF ALGORITHMS SEMESTER II: 2017 2018 Time Allowed 2 Hours INSTRUCTIONS TO STUDENTS 1. This assessment consists of Eight (8) questions and comprises
More informationThe Complexity of Constructing Evolutionary Trees Using Experiments
The Complexity of Constructing Evolutionary Trees Using Experiments Gerth Stlting Brodal 1,, Rolf Fagerberg 1,, Christian N. S. Pedersen 1,, and Anna Östlin2, 1 BRICS, Department of Computer Science, University
More informationCSCI1950 Z Computa4onal Methods for Biology Lecture 4. Ben Raphael February 2, hhp://cs.brown.edu/courses/csci1950 z/ Algorithm Summary
CSCI1950 Z Computa4onal Methods for Biology Lecture 4 Ben Raphael February 2, 2009 hhp://cs.brown.edu/courses/csci1950 z/ Algorithm Summary Parsimony Probabilis4c Method Input Output Sankoff s & Fitch
More informationThe minimum G c cut problem
The minimum G c cut problem Abstract In this paper we define and study the G c cut problem. Given a complete undirected graph G = (V ; E) with V = n, edge weighted by w(v i, v j ) 0 and an undirected
More informationWorst Case Performance of Approximation Algorithm for Asymmetric TSP
Worst Case Performance of Approximation Algorithm for Asymmetric TSP Anna Palbom Department of Numerical Analysis and Computer Science Royal Institute of Technology S100 44 Stockholm, Sweden Email: annap@nada.kth.se
More information10.4 The Kruskal Katona theorem
104 The Krusal Katona theorem 141 Example 1013 (Maximum weight traveling salesman problem We are given a complete directed graph with nonnegative weights on edges, and we must find a maximum weight Hamiltonian
More informationAlgorithm Design and Analysis
Algorithm Design and Analysis LECTURE 7 Greedy Graph Algorithms Shortest paths Minimum Spanning Tree Sofya Raskhodnikova 9/14/016 S. Raskhodnikova; based on slides by E. Demaine, C. Leiserson, A. Smith,
More informationDistribution of the Number of Encryptions in Revocation Schemes for Stateless Receivers
Discrete Mathematics and Theoretical Computer Science DMTCS vol. subm., by the authors, 1 1 Distribution of the Number of Encryptions in Revocation Schemes for Stateless Receivers Christopher Eagle 1 and
More informationPart III: Traveling salesman problems
Transportation Logistics Part III: Traveling salesman problems c R.F. Hartl, S.N. Parragh 1/282 Motivation Motivation Why do we study the TSP? c R.F. Hartl, S.N. Parragh 2/282 Motivation Motivation Why
More informationImproved methods for the Travelling Salesperson with Hotel Selection
Improved methods for the Travelling Salesperson with Hotel Selection Marco Castro marco.castro@ua.ac.be ANT/OR November 23rd, 2011 Contents Problem description Motivation Mathematical formulation Solution
More informationDiscrete Optimization 2010 Lecture 10 P, N P, and N PCompleteness
Discrete Optimization 2010 Lecture 10 P, N P, and N PCompleteness Marc Uetz University of Twente m.uetz@utwente.nl Lecture 9: sheet 1 / 31 Marc Uetz Discrete Optimization Outline 1 N P and con P 2 N Pcompleteness
More informationPreliminaries. Graphs. E : set of edges (arcs) (Undirected) Graph : (i, j) = (j, i) (edges) V = {1, 2, 3, 4, 5}, E = {(1, 3), (3, 2), (2, 4)}
Preliminaries Graphs G = (V, E), V : set of vertices E : set of edges (arcs) (Undirected) Graph : (i, j) = (j, i) (edges) 1 2 3 5 4 V = {1, 2, 3, 4, 5}, E = {(1, 3), (3, 2), (2, 4)} 1 Directed Graph (Digraph)
More informationPhylogenetics: Building Phylogenetic Trees
1 Phylogenetics: Building Phylogenetic Trees COMP 571 Luay Nakhleh, Rice University 2 Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary model should
More informationCombinatorial Optimization
Combinatorial Optimization Problem set 8: solutions 1. Fix constants a R and b > 1. For n N, let f(n) = n a and g(n) = b n. Prove that f(n) = o ( g(n) ). Solution. First we observe that g(n) 0 for all
More informationThe Mixed Chinese Postman Problem Parameterized by Pathwidth and Treedepth
The Mixed Chinese Postman Problem Parameterized by Pathwidth and Treedepth Gregory Gutin, Mark Jones, and Magnus Wahlström Royal Holloway, University of London Egham, Surrey TW20 0EX, UK Abstract In the
More informationACO Comprehensive Exam October 18 and 19, Analysis of Algorithms
Consider the following two graph problems: 1. Analysis of Algorithms Graph coloring: Given a graph G = (V,E) and an integer c 0, a ccoloring is a function f : V {1,,...,c} such that f(u) f(v) for all
More information