Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Similar documents
Phylogenetics Todd Vision Spring Some applications. Uncultured microbial diversity


C3020 Molecular Evolution. Exercises #3: Phylogenetics

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Phylogenetic inference

Dr. Amira A. AL-Hosary

Theory of Evolution. Charles Darwin

Constructing Evolutionary/Phylogenetic Trees

8/23/2014. Phylogeny and the Tree of Life

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Theory of Evolution Charles Darwin

What is Phylogenetics

A (short) introduction to phylogenetics

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Bioinformatics 1 -- lecture 9. Phylogenetic trees Distance-based tree building Parsimony

Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Seuqence Analysis '17--lecture 10. Trees types of trees Newick notation UPGMA Fitch Margoliash Distance vs Parsimony

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Phylogeny and the Tree of Life

Principles of Phylogeny Reconstruction How do we reconstruct the tree of life? Basic Terminology. Looking at Trees. Basic Terminology.

Phylogenetic Tree Reconstruction

Constructing Evolutionary/Phylogenetic Trees

Estimating Evolutionary Trees. Phylogenetic Methods

Comparative Genomics II

Multiple Sequence Alignment. Sequences

Reconstructing the history of lineages

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Phylogenetics. BIOL 7711 Computational Bioscience

Consensus Methods. * You are only responsible for the first two

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis

Phylogenetic trees 07/10/13

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Introduction to characters and parsimony analysis

Lecture 6 Phylogenetic Inference

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Phylogenetic Analysis

BINF6201/8201. Molecular phylogenetic methods

Phylogenetic Analysis

Phylogenetic Analysis

ELE4120 Bioinformatics Tutorial 8

MOLECULAR SYSTEMATICS: A SYNTHESIS OF THE COMMON METHODS AND THE STATE OF KNOWLEDGE

Evolutionary Tree Analysis. Overview

STEM-hy: Species Tree Estimation using Maximum likelihood (with hybridization)

Phylogenetic analysis. Characters

BIOLOGY 432 Midterm I - 30 April PART I. Multiple choice questions (3 points each, 42 points total). Single best answer.

Phylogeny. Properties of Trees. Properties of Trees. Trees represent the order of branching only. Phylogeny: Taxon: a unit of classification

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

Chapter 26: Phylogeny and the Tree of Life

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2009 University of California, Berkeley

C.DARWIN ( )

Intraspecific gene genealogies: trees grafting into networks

Phylogeny and the Tree of Life

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

How should we organize the diversity of animal life?

Session 5: Phylogenomics

Classification and Phylogeny

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Jan 27 & 29):

Algorithms in Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics

Chapter 16: Reconstructing and Using Phylogenies

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)

Classification and Phylogeny

Chapter 26 Phylogeny and the Tree of Life

1 ATGGGTCTC 2 ATGAGTCTC

Phylogeny. November 7, 2017

Phylogene)cs. IMBB 2016 BecA- ILRI Hub, Nairobi May 9 20, Joyce Nzioki

Chapter 19: Taxonomy, Systematics, and Phylogeny

Molecular Phylogenetics (part 1 of 2) Computational Biology Course João André Carriço

Biology 211 (2) Week 1 KEY!

Macroevolution Part I: Phylogenies

Copyright notice. Molecular Phylogeny and Evolution. Goals of the lecture. Introduction. Introduction. December 15, 2008

BIOL 1010 Introduction to Biology: The Evolution and Diversity of Life. Spring 2011 Sections A & B

Name. Ecology & Evolutionary Biology 2245/2245W Exam 2 1 March 2014

Biologists have used many approaches to estimating the evolutionary history of organisms and using that history to construct classifications.

Inferring Molecular Phylogeny

Computational methods for predicting protein-protein interactions

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2011 University of California, Berkeley

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

1/27/2010. Systematics and Phylogenetics of the. An Introduction. Taxonomy and Systematics

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

CS5263 Bioinformatics. Guest Lecture Part II Phylogenetics

LECTURE 08. Today: 3/3/2014

Week 7: Bayesian inference, Testing trees, Bootstraps

Systematics - Bio 615

PHYLOGENY & THE TREE OF LIFE

Phylogeny Tree Algorithms

--Therefore, congruence among all postulated homologies provides a test of any single character in question [the central epistemological advance].

molecular evolution and phylogenetics

Phylogeny and the Tree of Life

Phylogenetics: Building Phylogenetic Trees

Transcription:

Phylogenetics Todd Vision iology 522 March 26, 2007 pplications of phylogenetics Studying organismal or biogeographic history Systematics ating events in the fossil record onservation biology Studying gene and protein families (molecular evolution) Studying functional specificity and divergence Identifying selection at the molecular level Understanding host-parasite coevolution Identifying lateral transfer events Outline Unrooted networks vs. rooted trees How to read a tree How to infer a tree Including how to measure confidence in your answer! pplications Testing for lateral gene transfer Studying ancient endosymbiosis ssessing microbial diversity Inferring gene function Taxa Unrooted Rooted 3 1 3 5 15 105 10 2,027,025 34,459,425 time 1

Root Unscaled vs. scaled branch lengths Without a root, we do not know what direction time is running on a branch! Two ways to root a phylogeny Outgroup Midpoint Only valid in the presence of a clock O Unscaled Scaled outgroup midpoint ifurcating vs multifurcating nodes 2 descendants per node 2 or more descendants per node lades and monophyly Monophyly ontaining all descendents of a common ancestor efines a clade Paraphyly Some descendents not included (e.g. reptiles) Polyphyly ommon ancestor not included (e.g. flying animals) 2

Evolutionary relationships among genes Gene trees and species trees: lineage sorting Homologs: descended from a common ancestor Orthologs: homologs that diverged through speciation Paralogs: homologs that diverged through duplication time Species 1 Species 2 Gene trees and species trees: lateral transfer Example: genetic exchange among thermophiles Phylogenetic methods istance matrix methods Neighbor Joining haracter-based methods Non-statistical: Maximum Parsimony Statistical: Maximum Likelihood & ayesian Inference They will agree unless there is onvergence, reversion or parallelism (homoplasy) Rate heterogeneity Long branches 3

The starting data: a sequence alignment Homoplasy Maximum parsimony Neighbor joining acgttgccga acgttactgg cgtaagatcg cgtaaaaccg 111112131- lusters taxa based on a matrix of pairwise distance The simplest distance is percent divergence N or amino acid Only works for very similar sequences etter to correct for multiple substitutions iming for a distance measure that is linear with time ifferent methods used for N, codons and amino acids Special methods need to be used where G content is not constant 4

Neighbor joining Statistical methods Raw distances Tree distances - 5 4 6-5 5-2 - 4 4.5 5.5-4.5 5.5-3 2 2 1.5 1 2 Two flavors Maximum likelihood ayesian inference Lend themselves to testing hypotheses about trees Getting the correct tree can be less important than testing the hypothesis Statistical methods and ayes Rule Posterior P(M ) = where M = model = data Likelihood Prior P( M)P(M) P( M)P(M) " for all M oin toss example coin turns up heads with probability p What is the likelihood of the data if we get k heads out of n tosses? " n% L = P(x = k n, p) = $ ' # k pk (1( p) n(k & L p 5

Maximum likelihood The phylogeny with the highest likelihood is taken to be correct alculating the likelihood of a phylogeny is computationally intensive For each possible topology, the probability of all possible ancestral states are calculated for each column The topology, branch lengths and substitution model are equivalent to the probability of heads vs. tails g c g c g a a g c g ootstrapping How much should you trust any given branch, or clade? With NJ, parsimony and ML, clades do not come with marginal probabilities ootstrap by resampling the original alignment hoose an alignment having the same number of columns with replacement ompute a new tree Repeat this many times ount the proportion of resampled trees in which each original branch appears Label the branches on the original tree with these proportions ayesian inference We may have an a priori hypothesis that a coin will be fair p=0.5 Is our belief shaken if we observe 9/10 heads? we observe 900/1000 heads? ayes' Rule gives us the posterior probability of the hypothesis the product of the likelihood and the prior probability Pros and cons We get a more intuitive interpretation of probability We get a natural measure of clade support It turns out that larger datasets can be analyzed with ayesian algorithms ut we must specify the (unknown) prior probability of each tree Some applications of phylogenetics to microbial genetics Testing for lateral gene transfer events Studying ancient endosymbiosis events haracterizing microbial diversity metagenomics Functional annotation of microbial genomes 6

Lateral gene transfer Two views of the tree of life Non-vertical transmission of genetic material Mechanisms Viral transfer Symbiosis/phagocytosis Transformation etecting lateral gene transfer losest database match ltered base/codon usage Phylogenetic incongruence W. Ford oolittle Stuying ancient endosymbiosis One gene s view of the tree of life 7

Uncultured microbial diversity ontinued discovery of protein families Hugenholtz P (2000) Genome iology 3, 1 Kunin et al. (2003) Genome iol 4, 401 Points to remember Getting a good tree n accurate phylogeny depend on an accurate sequence alignment ll methods will give similar results in the absence of homoplasy and rate heterogeneity Use NJ for a quick and dirty tree, ML or ayesian methods for greater accuracy and to test hypotheses To interpret the tree You need to root it (don t be fooled by the graphic!) Obtain some measure of clade support (e.g. bootstrap) ranches with low support should be collapsed to polytomies Think of the phylogeny as a series of nested clades pproach the tree with a hypothesis, or at least have a clear goal in mind e.g. what would the tree look like in the absence of horizontal transfer? 8