Phylogenetic inference: from sequences to trees

Similar documents
Constructing Evolutionary/Phylogenetic Trees

Algorithms in Bioinformatics

Constructing Evolutionary/Phylogenetic Trees

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

A (short) introduction to phylogenetics

Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

Theory of Evolution Charles Darwin

BINF6201/8201. Molecular phylogenetic methods

How to read and make phylogenetic trees Zuzana Starostová

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Phylogenetic inference

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Phylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

What is Phylogenetics

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

C.DARWIN ( )

Phylogenetics: Building Phylogenetic Trees

Dr. Amira A. AL-Hosary

DNA Phylogeny. Signals and Systems in Biology Kushal EE, IIT Delhi

EVOLUTIONARY DISTANCES

Phylogenetic Tree Reconstruction

Evolutionary Tree Analysis. Overview

Multiple Sequence Alignment. Sequences

Theory of Evolution. Charles Darwin

Phylogenetic analyses. Kirsi Kostamo

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Michael Yaffe Lecture #5 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D

Phylogeny: building the tree of life

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Introduction to characters and parsimony analysis

Plan: Evolutionary trees, characters. Perfect phylogeny Methods: NJ, parsimony, max likelihood, Quartet method

Phylogeny: traditional and Bayesian approaches


Phylogenetic Analysis

Phylogenetic Analysis

Phylogenetic Analysis

Phylogenetic trees 07/10/13

Consistency Index (CI)

Lecture 6 Phylogenetic Inference

Phylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University

Estimating Evolutionary Trees. Phylogenetic Methods

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Algorithmic Methods Well-defined methodology Tree reconstruction those that are well-defined enough to be carried out by a computer. Felsenstein 2004,

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Jan 27 & 29):

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Bioinformatics 1 -- lecture 9. Phylogenetic trees Distance-based tree building Parsimony

Inferring phylogeny. Today s topics. Milestones of molecular evolution studies Contributions to molecular evolution

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Gel Electrophoresis. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 1

Inferring Molecular Phylogeny

How should we organize the diversity of animal life?

Introduction to Bioinformatics Introduction to Bioinformatics

Estimating Phylogenies (Evolutionary Trees) II. Biol4230 Thurs, March 2, 2017 Bill Pearson Jordan 6-057

A Phylogenetic Network Construction due to Constrained Recombination

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

Phylogenetics. BIOL 7711 Computational Bioscience

Phylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science

Inferring Molecular Phylogeny

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Molecular evidence for multiple origins of Insectivora and for a new order of endemic African insectivore mammals

molecular evolution and phylogenetics

Evolutionary trees. Describe the relationship between objects, e.g. species or genes

Phylogenetics: Parsimony

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

BIOL 1010 Introduction to Biology: The Evolution and Diversity of Life. Spring 2011 Sections A & B

Systematics - Bio 615

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

CS5263 Bioinformatics. Guest Lecture Part II Phylogenetics

Additive distances. w(e), where P ij is the path in T from i to j. Then the matrix [D ij ] is said to be additive.

Phylogenetic methods in molecular systematics

Phylogeny. November 7, 2017

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

Supplementary Information

Chapter 26 Phylogeny and the Tree of Life

Assessing an Unknown Evolutionary Process: Effect of Increasing Site- Specific Knowledge Through Taxon Addition

8/23/2014. Phylogeny and the Tree of Life

Phylogeny Tree Algorithms

Today's project. Test input data Six alignments (from six independent markers) of Curcuma species

1 ATGGGTCTC 2 ATGAGTCTC

I. Short Answer Questions DO ALL QUESTIONS

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

Bayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies

Intraspecific gene genealogies: trees grafting into networks

Evolutionary Theory and Principles of Phylogenetics. Lucy Skrabanek ICB, WMC March 19, 2008

Taming the Beast Workshop

Organizing Life s Diversity

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Inferring Phylogenetic Trees. Distance Approaches. Representing distances. in rooted and unrooted trees. The distance approach to phylogenies

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2009 University of California, Berkeley

Phylogenetic Analysis and Intraspeci c Variation : Performance of Parsimony, Likelihood, and Distance Methods

Anatomy of a species tree

Phylogene)cs. IMBB 2016 BecA- ILRI Hub, Nairobi May 9 20, Joyce Nzioki

Molecular Evolution and Phylogenetic Tree Reconstruction

Concepts and Methods in Molecular Divergence Time Estimation

Reading for Lecture 13 Release v10

Transcription:

W ESTFÄLISCHE W ESTFÄLISCHE W ILHELMS -U NIVERSITÄT NIVERSITÄT WILHELMS-U ÜNSTER MM ÜNSTER VOLUTIONARY FUNCTIONAL UNCTIONAL GENOMICS ENOMICS EVOLUTIONARY Bioinformatics 1 Phylogenetic inference: from sequences to trees Claudia Acquisti Evolutionary Functional Genomics Institute for Evolution and Biodiversity, WWU Münster claudia.acquisti@uni-muenster.de

Where Do We Come From? What Are We? Where Are We Going? Paul Gauguin, 1897 Long-standing questions... Where do species come from? How do they change over time? How are their evolutionary histories linked?

Tree of life: an ancient metaphor Urartu Helmet 1300 BC Gustav Klimt Tree of life 1911

First attempts of phylogenetic inference C. Darwin 1837 E. Haeckel 1866

Tackling the problem the classical way Fragmentary Incomplete

Success stories: transitional fossils Archaeopteryx 1880, Berlin specimen

Tackling the problem the classical way Comparative Morphology and Physiology High complexity Big controversies

The molecular approach Comparison of the blueprint of life between species 1. (Almost) all species use DNA as genetic material 2. Mathematical modeling of sequence change 3. Large amount of phylogenetic information

DNA damage Copying errors MUTATIONS: heritable changes to the genome, essential for evolution. LARGE SCALE Chromosomal rearrangements Point mutations SMALL SCALE? POLYMORPHISMS SUBSTITUTIONS

Ancestral phylogenetic trees C. Darwin 1837 E. Haeckel 1866

Tree topologies Unrooted Rooted

How many trees can we build? It depends on the number (m) of taxa ROOTED TREES N(m) = [(2m-3)!] / [2m-2(m-2)!] UNROOTED TREES N(m) = [(2m-5)!] / [2m-3(m-3)!] N(4)= 15 N(10) = 34,459, 425 N(4)= 3 N(10) = 2,027, 025 and the number of possible trees grows very fast with the number of taxa! FINDING THE TRUE TREE IS A DIFFICULT PROBLEM ALREADY FOR FEW SPECIES Co le mp x ch r a se ce a p s

H ow do w e bu ild on e gi ve n th e da ta? Tree topologies

Alignment: a key step to recover evolutionary history from DNA sequences

Alignment: a key step to recover evolutionary history from DNA sequences

Alignment: a key step to recover evolutionary history from DNA sequences

Alignment: a key step to recover evolutionary history from DNA sequences

Alignment: a key step to recover evolutionary history from DNA sequences

Alignment: a key step to recover evolutionary history from DNA sequences 5 a b c d e f

Reading trees: rooting a tree

Reading trees: changes that do not make a difference

Statistical methods for phylogenetic inference 1. Distance based methods 2. Parsimony methods 3. Maximum likelihood methods

Distance-based methods for phylogenetic inference 1. Compute evolutionary distances for all pairs of taxa Human Human Horse Cow Kangaroo Newt Carp 0.129 0.129 0.205 0.572 0.665 0.129 0.232 0.638 0.651 0.197 0.598 0.624 0.638 0.708 Horse 0.134 Cow 0.134 0.134 Kangaroo 0.216 0.246 0.207 Newt 0.662 0.751 0.697 0.751 Carp 0.789 0.770 0.733 0.849 0.752 0.913 PC correction, and gamma distance (alpha=2)

Distance-based methods for phylogenetic inference 1. Compute evolutionary distances for all pairs of taxa 2. Construct trees from distance data Examine different possible topologies and chose the best as the true topology

Distance-based methods for phylogenetic inference 1. Compute evolutionary distances for all pairs of taxa 2. Construct trees from distance data UPGMA: Unweighted pair-group method (using arithmetic averages) Hp) Constant rate of evolution between lineages Ts) Wrong topology when Hp does not hold

Distance-based methods for phylogenetic inference 1. Compute evolutionary distances for all pairs of taxa 2. Construct trees from distance data UPGMA: Unweighted pair-group method (using arithmetic averages) ME: Minimum evolution method Idea: Minimization of the sum of all branch length estimates Very intense and slow computations!

Distance-based methods for phylogenetic inference 1. Compute evolutionary distances for all pairs of taxa 2. Construct trees from distance data UPGMA: Unweighted pair-group method (using arithmetic averages) ME: Minimum evolution method NJ: Neighbor joining method Hp) ME method + Applied neighbor joining strategy

Statistical methods for phylogenetic inference 1. Distance based methods 2. Parsimony methods 3. Maximum likelihood methods

Maximum Parsimony methods for phylogenetic inference 1. Alignment of the sequences 2. Select the tree that requires the smallest number of substitutions to explain the entire evolutionary process Complex and slow computations! Specific methods have been designed to reduce the search space A typical issue: IGNORING MULTIPLE HITS! Long branch attraction Parallel vs. Convergent evolution

I have a tree now. How do I know that I can trust it? BOOTSTRAP http://akira.ruc.dk/~olesk/sekvens/

I have a tree now. How do I know that I can trust it? BOOTSTRAP The percentage of bootstrap trees that support a given node

Revisiting the evolutionary history of dinosaurs with a molecular approach

Molecular Phylogenetics of Mastodon and Tyrannosaurus rex

YOUR PRACTICAL: Using molecular phylogenetics to reveal a back to the sea tale Ambulocetus