Phylogeny. Properties of Trees. Properties of Trees. Trees represent the order of branching only. Phylogeny: Taxon: a unit of classification

Similar documents
9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)


Phylogenetic Tree Reconstruction

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Phylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science

(Stevens 1991) 1. morphological characters should be assumed to be quantitative unless demonstrated otherwise

BINF6201/8201. Molecular phylogenetic methods

Lecture V Phylogeny and Systematics Dr. Kopeny

Phylogenetics. BIOL 7711 Computational Bioscience

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

8/23/2014. Phylogeny and the Tree of Life

Evolutionary Tree Analysis. Overview

Phylogenetic inference

Theory of Evolution. Charles Darwin

Chapter 16: Reconstructing and Using Phylogenies

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

Constructing Evolutionary/Phylogenetic Trees

What is Phylogenetics

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Dr. Amira A. AL-Hosary

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

Algorithms in Bioinformatics

Phylogeny: building the tree of life

Chapter 26 Phylogeny and the Tree of Life

Chapter 26 Phylogeny and the Tree of Life

Theory of Evolution Charles Darwin

How to read and make phylogenetic trees Zuzana Starostová

C3020 Molecular Evolution. Exercises #3: Phylogenetics

1 ATGGGTCTC 2 ATGAGTCTC

Phylogenetic Analysis

Phylogenetic Analysis

Phylogenetic Analysis

Chapter 26: Phylogeny and the Tree of Life

Phylogeny and the Tree of Life

Chapter 19: Taxonomy, Systematics, and Phylogeny

CLASSIFICATION OF LIVING THINGS. Chapter 18

PHYLOGENY & THE TREE OF LIFE

Introduction to characters and parsimony analysis

Phylogeny & Systematics: The Tree of Life

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

ELE4120 Bioinformatics Tutorial 8

Macroevolution Part I: Phylogenies

Lecture 11 Friday, October 21, 2011

Reconstructing the history of lineages

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

Phylogeny and Systematics

How should we organize the diversity of animal life?

Classification, Phylogeny yand Evolutionary History

Michael Yaffe Lecture #5 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D

Global biodiversity: how many species of arthropods are there? George Weiblen Plant Biology

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Phylogeny. November 7, 2017

Phylogeny and the Tree of Life

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics

The practice of naming and classifying organisms is called taxonomy.

Phylogeny Tree Algorithms

Phylogeny is the evolutionary history of a group of organisms. Based on the idea that organisms are related by evolution

Phylogeny and the Tree of Life

Principles of Phylogeny Reconstruction How do we reconstruct the tree of life? Basic Terminology. Looking at Trees. Basic Terminology.

Lab 06 Phylogenetics, part 1

Phylogenetics: Parsimony

Molecular evolution. Joe Felsenstein. GENOME 453, Autumn Molecular evolution p.1/49

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

Biology 211 (2) Week 1 KEY!

Bioinformatics 1 -- lecture 9. Phylogenetic trees Distance-based tree building Parsimony

PHYLOGENY AND SYSTEMATICS

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

PHYLOGENY WHAT IS EVOLUTION? 1/22/2018. Change must occur in a population via allele

AP Biology. Cladistics

Classification and Phylogeny

Phylogeny and the Tree of Life

Biologists have used many approaches to estimating the evolutionary history of organisms and using that history to construct classifications.

Classification and Phylogeny

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Phylogeny. Information. ARB-Workshop 14/ CEH Oxford. Molecular Markers. Phylogeny The Backbone of Biology. Why? Zuckerkandl and Pauling 1965

Notung: A Program for Dating Gene Duplications and Optimizing Gene Family Trees

Phylogeny and the Tree of Life

Constructing Evolutionary/Phylogenetic Trees

Plan: Evolutionary trees, characters. Perfect phylogeny Methods: NJ, parsimony, max likelihood, Quartet method

Name. Ecology & Evolutionary Biology 2245/2245W Exam 2 1 March 2014

Name: Class: Date: ID: A

Molecular Evolution & Phylogenetics

Lecture 6 Phylogenetic Inference

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Cladistics and Bioinformatics Questions 2013

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)

Is the equal branch length model a parsimony model?

Bio94 Discussion Activity week 3: Chapter 27 Phylogenies and the History of Life

Mechanisms of Evolution Darwinian Evolution

Inferring phylogeny. Today s topics. Milestones of molecular evolution studies Contributions to molecular evolution

Phylogenetics in the Age of Genomics: Prospects and Challenges

Modern Evolutionary Classification. Section 18-2 pgs

Transcription:

Multiple sequence alignment global local Evolutionary tree reconstruction Pairwise sequence alignment (global and local) Substitution matrices Gene Finding Protein structure prediction N structure prediction Database searching BLS Sequence statistics omputational genomics Phylogeny: Phylogeny n evolutionary tree. hypothesis concerning the evolutionary history of a group of taxa and their ancestors. axon: a unit of classification strain, species, individual, gene contemporary taxa = leaves of tree ancestral taxa = internal nodes of tree taxa are also called OUs (Operational taxonomic units) Properties of rees Leaf nodes contemporary taxa Internal nodes - ancestral taxa opology relationships between species Branch lengths degree of change Ernst Haeckel (1834-1919) Properties of rees If the mutation rate is constant in all lineages (molecular clock hypothesis), the branch lengths are proportional to time. B D rees represent the order of branching only F E G D F E G B 1

Which of the following is different? ooted vs unrooted trees: B D E B D E root: common ancestor arp rout Zebrafish Salmon E D B D E B Human Mouse hicken Salmon Unroooted trees give no information about the order of speciation events Various types of trees you will see Which is different? B B D E F D F E D E F B D E B F Gene rees vs Species rees lose-up view of divergence Gene sequences can be used to infer the history of speciation infer the history of gene families aveat emptor: the history of the gene, may not be the same as the history of the organism gene duplications horizontal gene transfer Modified from Hennig, W. (1966) Phylogenetic Systematics 2

Species relationships Why phylogeny reconstruction? Some applications Fly mphioxus Hagfish Lamprey Shark Bony fish Mammals Narayanan., MU Dating events Insulin growth factor receptors ime of duplication Gene family history Insulin receptors mphioxus Hagfish Lamprey Bony fish Mammals Narayanan., MU E Worm, DM-Fly, -mosquito, MP- mphioxus, OM-rout, PO-Flounder, D- Zebrafish, XL - Frog, N rat, MM Mouse, HS - Human E Worm, DM-Fly, -mosquito, MP- mphioxus, OM-rout, PO-Flounder, D- Zebrafish, XL - Frog, N rat, MM Mouse, HS - Human haracter evolution Biogeography 3

omparing ecology and evolution: Which insects are eating which tropical shrubs? Forensics ubiaceae Psychotria Moraceae Ficus Euphorbiaceae Macaranga George Weiblen, U. Minnesota Which moths are eating which tropical shrubs? Plant phylogeny vs. faunal similiarity Is there phylogenetic structure in faunal similarity? patterns of herbivore association = feeding = not feeding alanga sexipunctalis (rambidae) Macaranga Oiketicus sp. (Psychidae) Psychotria host plant phylogeny George Weiblen, U. Minnesota host plant phylogeny faunal similarity phenogram lustering of host plants based on faunal similarity corresponds poorly with host plant phylogeny Nnone the less, some host plant clades have very similar caterpillar faunas (e.g. Macaranga and Psychotria) George Weiblen, U. Minnesota How to root an unrooted tree Outgroup If the mutation rate is constant in all lineages, the distance from root to leaf leaf is the same for all leaves. We can obtain a rooted tree algorithmically. Otherwise, use an outgroup, a taxon that is distantly related to all other leaf taxa. merican Scientist 4

Phylogeny reconstruction Given data observations of contemporary taxa, reconstruct the evolutionary history. Data for phylogeny reconstruction Morphology Behavior Biochemistry Molecular and sequence data haracter data: shared characteristics Distance data: difference between species haracter data Distance data Bees Moths nts entipedes Primitive character:wingless Bees Moths nts entipedes Bees Moths nts entipedes Bees Moths nts entipedes Bees Moths nts entipedes merican Scientist merican Scientist 5

Other examples: Niche (e.g., what finches eat) Biochemistry: serum:anti-serum reactions. Behavior: Firefly flashing patterns merican Scientist Multiple Sequence lignment Questions trees can address: Glb2;Sgl; ~~~~LEKQELLKQSWEVLKQNIPHSLLFLIIEPESKYVFSFLKDS Glb2;Sgl; ~~~MLEKQELLKQSWEVLKQNIPHSLLFLILEPESKYVFSFLKDS Glb2;Sgl; ~~~MLEQELLKQSWEVLKQNIPGHSLLFLIIEPESKYVFSFLKDS Glb2;Sgl; ~~~~~~~~~~ELLKQSWEVLKQNIPGHSLLFLIIEPESKYVFSFLKDS HUMN BI PIG HIK Four class 2 globins from asuarina glauca MKWVFISLL FLFSSYSG V..FD.H KSEVHFKD LGEENFKLV MKWVFISLL FLFSSYSG V..FE.H KSEIHFND VGEEHFIGLV ~~WVFISLL FLFSSYSG V..FD.Y KSEIHFKD LGEQYFKGLV MKWVLISFI FLFSSSN LQFDEH KSEIHYND LKEEFKV Which taxa are most closely related? What is the ancestral state? Where is rapid change occuring When did lineages diverge? lbumin in four species Evolutionary ree econstruction Given observations similarities and differences between k species, find the best hypothesis (tree) of their evolutionary history Maximum Parsimony: nature is thrifty he best tree requires the fewest mutations. e.g., jaws were only invented once backbone skulls riteria for evaluating which tree best fits the data: Maximum parsimony (character data) Minimum evolution (distance data) Maximum Likelihood (character data) tetrapody terrestrial animals bony skeletons fish sharks jaws hagfish lampreys 6

Maximum Parsimony Problem: Not all characters are parsimonious Parsimony score: minimum number of mutations needed to explain data ssumptions Selection dominates -> Few changes No multiple substitutions -> Sites are independent Duck Platapus Bat Platapus Duck Bat wings wings Platypus Platypus Duck Platapus Duck wings Platypus If the mutation rate is high, sequence data is not parsimonious rue tree: -> -> -> Bat Platapus Placenta Placenta Platypus placenta Bat Most parsimonious, but false, tree: -> -> Given a tree topology ssociate characters with leaves of tree Find the optimal labeling of internal nodes ount mutations (1) (2) (3) G 7

(1) (2) (3) (1) (2) (3) G G (1) (2) (3) (1) (2) (3) 000 010 100 010 001 G Parsimony score: 4 G Note: there can be more than one most parsimonious tree (1) (2) (3) 100 010 010 101 _ 100 010 001 _ Given a tree topology ssociate characters with leaves of tree Find the optimal labeling of internal nodes ount mutations o find the optimal tree, we need to consider all topologies. How many are there? 8

How many unrooted trees with k leaves? Number of unrooted trees for k taxa hree taxa Four axa k E(k) (k) 3 3 1 4 5 3 E( k) = E(( k 1) + 2 ( k) = k 1 i= 3 (2i 3) Five taxa 5 7 15 (2k 5)! ( k) = k 3 2 ( k 3)! he number of trees gets big fast How do you find the optimal tree? Number of leaves 3 4 5 6 10 20 50 500 Number of unrooted binary trees 1 3 15 105 2,027,025 2.2 x 10 20 2.8 x 10 74 1 x 10 1074 1. Exhaustive search (<12 taxa) (Phylogeny reconstruction is NP-complete.) How do you find the optimal tree? How do you find the optimal tree? Method Exhaustive search esult Optimal ime (k) ypical k 12 2. Branch-and-bound (<18 taxa) Note that the parsimony score is non-decreasing as you add edges = infinity, L = {3} BB(L) For each tree, t, in L If t has k leaves If Score(t) <, = Score(t). Else if Score(t) >, return //Bound Else NewL = empty set. //Branch For every edge in t» t = t plus a new edge» NewL = L U {t } BB(NewL) 9

How do you find the optimal tree? How do you find a pretty good tree? Method Exhaustive search Branch and bound esult Optimal Optimal ime (k) (k) ypical k 12 18 3. Heuristic search Search for optimal trees by finding good trees and then rearranging them in the hopes of finding an even better tree Heuristic search Global optimum Suboptimal island of trees Branch swapping Nearest-neighbour interchange (NNI) Starting trees reespace Branch swapping Branch swapping Subtree pruning and regrafting (SP) ree-bisection reconnection (B) 10

How do you find the optimal tree? Pairwise sequence alignment (global and local) Method Exhaustive search Branch and bound Heuristic search esult Optimal Optimal Suboptimal ime (k) (k) You choose ypical k 12 18 Multiple sequence alignment global local Substitution matrices Gene Finding Database searching BLS Sequence statistics Evolutionary tree reconstruction Protein structure prediction N structure prediction omputational genomics 11