Evolutionary trees. Describe the relationship between objects, e.g. species or genes

Similar documents
Evolutionary trees. Describe the relationship between objects, e.g. species or genes

Phylogeny and Molecular Evolution. Introduction

Phylogenetic Trees. How do the changes in gene sequences allow us to reconstruct the evolutionary relationships between related species?

Phylogeny and Molecular Evolution. Introduction

Phylogeny: building the tree of life

Multiple Sequence Alignment. Sequences

Evolutionary Tree Analysis. Overview

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Lecture 11 Friday, October 21, 2011

Algorithms in Bioinformatics

Phylogenetic inference

Phylogenetic trees 07/10/13

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics


Theory of Evolution Charles Darwin

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

A Phylogenetic Network Construction due to Constrained Recombination

Phylogenetic Tree Reconstruction

Dr. Amira A. AL-Hosary

What is Phylogenetics

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

BINF6201/8201. Molecular phylogenetic methods

C.DARWIN ( )

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Constructing Evolutionary/Phylogenetic Trees

Phylogeny: traditional and Bayesian approaches

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

CSCI1950 Z Computa4onal Methods for Biology Lecture 5

C3020 Molecular Evolution. Exercises #3: Phylogenetics

EVOLUTIONARY DISTANCES

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

Session 5: Phylogenomics

Plan: Evolutionary trees, characters. Perfect phylogeny Methods: NJ, parsimony, max likelihood, Quartet method

9/19/2012. Chapter 17 Organizing Life s Diversity. Early Systems of Classification

Inferring Phylogenetic Trees. Distance Approaches. Representing distances. in rooted and unrooted trees. The distance approach to phylogenies

Evolution Problem Drill 10: Human Evolution

CSCI1950 Z Computa3onal Methods for Biology* (*Working Title) Lecture 1. Ben Raphael January 21, Course Par3culars

Theory of Evolution. Charles Darwin

Constructing Evolutionary Trees

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Organizing Life s Diversity

Skulls & Evolution. Procedure In this lab, groups at the same table will work together.

Supplementary Information

List of Code Challenges. About the Textbook Meet the Authors... xix Meet the Development Team... xx Acknowledgments... xxi

Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab

Phylogenetics: Parsimony

Reconstruire le passé biologique modèles, méthodes, performances, limites

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Evolutionary Analysis, 5e (Herron/Freeman) Chapter 2 The Pattern of Evolution

How to read and make phylogenetic trees Zuzana Starostová

Microbial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Algorithmic Methods Well-defined methodology Tree reconstruction those that are well-defined enough to be carried out by a computer. Felsenstein 2004,

Phylogeny. November 7, 2017

Phylogenetics: Building Phylogenetic Trees

A (short) introduction to phylogenetics

Phylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University

Cladistics and Bioinformatics Questions 2013

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

Mul$ple Sequence Alignment Methods. Tandy Warnow Departments of Bioengineering and Computer Science h?p://tandy.cs.illinois.edu

Phylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

HUMAN EVOLUTION. Where did we come from?

DNA Phylogeny. Signals and Systems in Biology Kushal EE, IIT Delhi

Evolutionary Trees. Evolutionary tree. To describe the evolutionary relationship among species A 3 A 2 A 4. R.C.T. Lee and Chin Lung Lu

Additive distances. w(e), where P ij is the path in T from i to j. Then the matrix [D ij ] is said to be additive.

I. Short Answer Questions DO ALL QUESTIONS

Phylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science

Molecular evolution. Joe Felsenstein. GENOME 453, Autumn Molecular evolution p.1/49

Chapter 16: Reconstructing and Using Phylogenies

Identify the 6 kingdoms into which all life is classified.

Evolution & Natural Selection

6 HOW DID OUR ANCESTORS EVOLVE?

Adaptation. Adaptation A trait that allows a species to survive more easily and reproduce.

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Classification and Phylogeny

Bayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies

Molecular Phylogenetics (part 1 of 2) Computational Biology Course João André Carriço

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogenetics. BIOL 7711 Computational Bioscience

PHYLOGENY AND SYSTEMATICS

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

Phylogenetic inference: from sequences to trees

Guided Notes: Evolution. is the change in traits through generations over! Occurs in, NOT individual organisms

Chapter 27: Evolutionary Genetics

Phylogenetic analyses. Kirsi Kostamo

Gel Electrophoresis. 10/28/0310/21/2003 CAP/CGS 5991 Lecture 10Lecture 9 1

Classification and Phylogeny

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Welcome to Evolution 101 Reading Guide

CS 394C Algorithms for Computational Biology. Tandy Warnow Spring 2012

Seuqence Analysis '17--lecture 10. Trees types of trees Newick notation UPGMA Fitch Margoliash Distance vs Parsimony

66 Bioinformatics I, WS 09-10, D. Huson, December 1, Evolutionary tree of organisms, Ernst Haeckel, 1866

Week 5: Distance methods, DNA and protein models

Phylogene)cs. IMBB 2016 BecA- ILRI Hub, Nairobi May 9 20, Joyce Nzioki

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Transcription:

Evolutionary trees Bonobo Chimpanzee Human Neanderthal Gorilla Orangutan Describe the relationship between objects, e.g. species or genes

Early evolutionary studies Anatomical features were the dominant criteria used to derive evolutionary relationships between species since Darwin till early 1960s The evolutionary relationships derived from these relatively subjective observations were often inconclusive, and some of them were later proved incorrect

Early evolutionary studies Anatomical features were the dominant criteria used to derive evolutionary relationships between species since Darwin till early 1960s The Giant Panda riddle The evolutionary relationships derived from these relatively subjective observations were often inconclusive, and some of them were later proved incorrect For roughly 100 years scientists were unable to figure out which family the giant panda belongs to Giant pandas look like bears but have features that are unusual for bears and typical for raccoons, e.g., they do not hibernate In 1985, Steven O Brien and colleagues solved the giant panda classification problem using DNA sequences and algorithms

Evolutionary trees: DNA-based approach 50 years ago: Emile Zuckerkandl and Linus Pauling brought reconstructing evolutionary relationships with DNA into the spotlight In the first few years after Zuckerkandl and Pauling proposed using DNA for evolutionary studies, the possibility of reconstructing evolutionary trees by DNA analysis was hotly debated Now it is a dominant approach to study evolution

Evolutionary trees: DNA-based approach 40 years ago: Emile Zuckerkandl and Linus Pauling brought reconstructing evolutionary relationships From with the DNA point of hemoglobin structure, it appears that into the spotlight gorilla is just an abnormal human, or man an abnormal gorilla, and the two species form actually one continuous In the first few years after population. Zuckerkandl and Pauling proposed using DNA for evolutionary studies, the Emile possibility Zuckerkandl, of reconstructing evolutionary Classification trees by and Human Evolution, 1963 DNA analysis was hotly debated From any point of view other than that properly Now it is a dominant approach specified, to that study is of course nonsense. What the evolution comparison really indicate is that hemoglobin is a bad choice and has nothing to tell us about attributes, or indeed tells us a lie. Gaylord Simpson, Science, 1964

Molecular evolution 100 million years ago AGTAGCAGTACGATACG! AGTTGCAGTAGGATATG! Today

The molecular clock

Genetic distances Human Neanderthal Human Human Neanderthal Neanderthal Chimpanzee Bonobo Chimpanzee Bonobo Chimpanzee Bonobo

A phylogenetic tree Bonobo Chimpanzee Human Neanderthal

Species trees and Gene trees The species tree can differs from a tree built from homologous genes

Species trees and Gene trees The species tree can differs from a tree built from homologous genes

Species trees and Gene trees Human speciation

Species trees and Gene trees Human speciation

Species trees and Gene trees Human speciation

Species trees and Gene trees Human speciation

A speculatively rooted tree for rrna genes

Terminology Labeling / annotation leaves (and nodes) are labelled with species/objects (1, 2,..., n) edge lengths reflect amount of evolution/time along the edge Tree structure / topology rooted vs. unrooted binary (bifurcating) vs. non-binary (multifurcating)

Distances in trees j i d 1,4 = 12 + 13 + 14 + 17 + 13 = 69

Layout don t matter... these two rooted binary trees are considered equal...

Newick format http://en.wikipedia.org/wiki/newick_format http://www.birc.au.dk/~mailund/newick.html

Reconstructing trees Reconstruct the unknown true tree from info about the species Character-based information Each species is characterized by a number of characters which can be in different states, e.g. a multiple alignment: species 1: acgcta--cacacagt species 2: acgcca---acg-agt species 3: acg--attt-agc--t Distance-based information The distance D(i,j) is given for each pair of species!

Distance based reconstruction The ideal case The tree reflects the distance matrix, i.e. the distances induced by the tree are equal to the distances in the matrix If this is possible, then the distance matrix is called additive Reconstruction problem Find both the tree topology and the edge lengths...

Distance based reconstruction Goal: Reconstruct an evolutionary tree from a distance matrix Input: n x n distance matrix D ij Output: weighted tree T with n leaves fitting D If D is additive, this problem has an efficient solution (which we will come back to), otherwise we should aim at finding a tree which a good as possible

A reconstruction method Least square methods Given a tree topology T, select edge lengths such that Q(T) = n n wij(dij dij) 2 i=1 j=1 is minimized, where D ij is given by the matrix, d ij is the distance induced by the tree and the selected edge lengths, and w ij is the confidence we have in D ij, e.g. w ij =1. Optimal edge lengths can be found by standard least square methods in time O(n 3 ). The large version of the problem when the tree is unknown, e.g. we must also minimize over T, is NP-complete...

Two practical algorithms UPGMA constructs good trees (rooted binary trees) if a molecular clock is thought to be a reasonable assumption, if the distance matrix comes from a clocklike tree, then UPGMA finds the tree that is the exact reprensentation of the matrix Neighbor-Joining constructs trees (unrooted binary trees) that can be seen as a rough approximations to least-square methods, if the distance matrix is additive, then Neighbor-Joining finds the tree that is the exact representation of the matrix Check out JSplits and Dendroscope for implementations of various tree reconstruction methods and visualizations: http://www.splitstree.org and http://www.dendroscope.org

Counting rooted binary trees

Counting rooted binary trees A rooted binary tree with n leaves has 2n-2 edges. A new species can be added to any edge or above the root, i.e: R(n) = R(n-1) (2(n-1)-2 + 1) = R(n-1) (2n - 3) What about unrooted trees? What about non-binary trees?