Constructing Evolutionary Trees

Similar documents
UNITY AND DIVERSITY. Why do we classify things? Organizing the world of organsims. The Tree of Life

Basic Tree Thinking Assessment David A. Baum, Stacey DeWitt Smith, Samuel S. Donovan

Beaming in your answers

BIOLOGY I, PRE-AP. Section Description State Standard Addressed

AP Biology. Cladistics

Classification. Grouping & Identifying Living Things

GRADE 6: Life science 3. UNIT 6L.3 6 hours. Classification. Resources. About this unit. Previous learning. Expectations

Biology B. There are no objectives for this lesson.

Basic Tree Thinking Assessment David A. Baum, Stacey DeWitt Smith, Samuel S. Donovan

Chetek-Weyerhaeuser High School

Classification Flow Chart

Multiple Sequence Alignment. Sequences

Classification Chapter 18

California Biology Handbook... CA1

Algorithms in Bioinformatics

Tree thinking pretest

Cladistics and Bioinformatics Questions 2013

WHAT AM I? Classification. Game

Protists. Simple Eukaryotes. Regents Biology Common ancestor. Domain Archaebacteria. Domain Eukaryotes. Domain Bacteria

5.5 Organisms (Living Systems)

Resources. Visual Concepts. Chapter Presentation. Copyright by Holt, Rinehart and Winston. All rights reserved.

WHAT AM I? Classification. Game

Life Complexity and Diversity

Organizing Life on Earth

Diversity of Organisms and Classification

Multiple Choice Write the letter on the line provided that best answers the question or completes the statement.

Written by Pamela Jennett

Six Kingdoms By Cindy Grigg. 1 The first scientist to try to classify organisms was the

Biology IA & IB Syllabus Mr. Johns/Room 2012/August,

Section 18-1 Finding Order in Diversity

CLASSIFICATION NOTES

Chapter 18 Systematics: Seeking Order Amidst Diversity

PSI Biology Classification Classification

A Brief Survey of Life s Diversity 1

BINF6201/8201. Molecular phylogenetic methods

Chapter 19: Taxonomy, Systematics, and Phylogeny

Today: Animal Body Plans. Animal Body Plans: The Gut. The Animal Kingdom- General Characteristics: Animal Body Plans: Symmetry

Six Kingdoms By Cindy Grigg

Biol/Env St 204 Quiz 2 Spring 2008

SCIENCE REVISION BOOKLET MID SEMESTER

CSCI1950 Z Computa4onal Methods for Biology Lecture 5

Chapter 16: Reconstructing and Using Phylogenies

Classification. Living. Things. Amy Brown Science Stuff

FROM BACTERIA TO PLANTS

Evolution & Biodiversity: Origins, Niches, & Adaptation

Biology Keystone (PA Core) Quiz Theory of Evolution - (BIO.B ) Theory Of Evolution, (BIO.B ) Scientific Terms

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

BIOLOGY. Classification & Phylogeny. Slide 1 / 92. Slide 2 / 92. Slide 3 / 92. Vocabulary Click on each word below to go to the definition.

Unit B: Diversity of Living Things

AP: CHAPTER 18: the Genetics of VIRUSES p What makes microbes good models to study molecular mechanisms? 4. What is a bacteriophage?

RCPS Curriculum Pacing Guide Subject: Biology. Remembering, Understanding, Applying, Analyzing, Evaluating, Creating

thebiotutor.com AS Biology Unit 2 Classification, Adaptation & Biodiversity

McDougal Littell Science, Cells and Heredity MAZER PDF. IL Essential Lesson. IL Extend Lesson. Program Planning Guide LP page.

Kentucky Core Content for Science Assessment Correlations

Back to the life forms!

AUSTRALIAN HOMESCHOOLING SERIES SAMPLE. Biology. Secondary Science 7B. Years 7 9. Written by Valerie Marett. CORONEOS PUBLICATIONS Item No 542

The City School North Nazimabad Boys Campus

What is Phylogenetics

Phylogeny is the evolutionary history of a group of organisms. Based on the idea that organisms are related by evolution

Phylogeny and the Tree of Life

Basic Tree Thinking Assessment David A. Baum, Stacey DeWitt Smith, Samuel S. Donovan

8/23/2014. Phylogeny and the Tree of Life

b. In Table 1 (question #2 on the Answer Sheet describe the function of each set of bones and answer the question.)

Phylogenetic trees 07/10/13

Sponges, Cnidarians and Worms

Evolutionary Tree Analysis. Overview

Lecture 11 Friday, October 21, 2011

Links to help understand the immensity of the Geologic Time Scale

In the past unit, you looked at the life cycles of plants and

Phylogeny: building the tree of life

EVOLUTIONARY DISTANCES

Lab 06 Phylogenetics, part 1

T.1. LIVING ORGANISMS.

CLASSIFICATION. Why Classify? 2/18/2013. History of Taxonomy Biodiversity: variety of organisms at all levels from populations to ecosystems.

The Diversity of Living Things

Kingdoms and Domains. Lisa Michalek

Biodiversity. The Road to the Six Kingdoms of Life

PHYLOGENY & THE TREE OF LIFE

LEARNING OBJECTIVES FOR BY 124 EXAM II. 1. List characteristics that distinguish fungi from organisms in other kingdoms.

Molecular evolution. Joe Felsenstein. GENOME 453, Autumn Molecular evolution p.1/49

Classification. Essential Question Why is it important to place living things into categories?

Domains and Kingdoms

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Student Instruction Book

Phylogenetic Trees. How do the changes in gene sequences allow us to reconstruct the evolutionary relationships between related species?

Molecular Evolution and Phylogenetic Tree Reconstruction

Characteristics and Classification of Living Organism (IGCSE Biology Syllabus )

BOOK 3 OUR PLANET SECTION 2 WORLD OF LIFE

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Phylogenetics. BIOL 7711 Computational Bioscience

Apply scientific methods to problemsolving. Demonstrate how to measure using scientific instruments and units.

1. Construct and use dichotomous keys to identify organisms.

Classification of organisms. The grouping of objects or information based on similarities Taxonomy: branch of biology that classifies organisms


BIOLOGY Unit 2 Biodiversity and Physiology of Body Systems

Station A: #3. If two organisms belong to the same order, they must also belong to the same

Unit 4 Lesson 5 How Do Animals Grow and Reproduce? Copyright Houghton Mifflin Harcourt Publishing Company

SG 9.2 notes Ideas about targets and terms: 9.2 In the past, all living things were classified in either the kingdom of animals or plants

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

Science Notes. P3 Diversity. Living Things

Transcription:

Constructing Evolutionary Trees 0-0

HIV Evolutionary Tree SIVs (monkeys)! HIV (human)! human infection! human HIV/M human HIV/M chimpanzee SIV chimpanzee SIV human HIV/N human HIV/N chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV human HIV/O human HIV/O chimpanzee SIV chimpanzee SIV red-capped manabey SIV drill SIV vervet monkey SIV tantalus monkey SIV sooty mangabey SIV human HIV/A human HIV/B sooty mangabey SIV Sykes s monkey SIV greater spot-nosed monkey SIV De Brazzas monkey SIV

HIV Evolutionary Tree SIVs (monkeys)! HIV (human)! human infection! But how did biologists generate this? human HIV/M human HIV/M chimpanzee SIV chimpanzee SIV human HIV/N human HIV/N chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV chimpanzee SIV human HIV/O human HIV/O chimpanzee SIV chimpanzee SIV red-capped manabey SIV drill SIV vervet monkey SIV tantalus monkey SIV sooty mangabey SIV human HIV/A human HIV/B sooty mangabey SIV Sykes s monkey SIV greater spot-nosed monkey SIV De Brazzas monkey SIV

Constructing a Distance Matrix SPECIES ALIGNMENT Chimp Human Seal Whale ACGTAGGCCT ATGTAAGACT TCGAGAGCAC TCGAAAGCAT

Constructing a Distance Matrix D i,j = number of differing symbols between i-th and j-th species. SPECIES ALIGNMENT DISTANCE MATRIX Chimp Human Seal Whale Chimp ACGTAGGCCT 0 3 6 4 Human ATGTAAGACT 3 0 7 5 Seal TCGAGAGCAC 6 7 0 Whale TCGAAAGCAT 4 5 0

Constructing a Distance Matrix D i,j = number of differing symbols between i-th and j-th species. SPECIES ALIGNMENT DISTANCE MATRIX Chimp Human Seal Whale Chimp ACGTAGGCCT 0 3 6 4 Human ATGTAAGACT 3 0 7 5 Seal TCGAGAGCAC 6 7 0 Whale TCGAAAGCAT 4 5 0

bacteria LIFE Trees archaebacteria EUKARYOTES protoctists PLANTS green algae fungi ANIMALS Tree: Connected network containing no cycles. mosses sponges ferns cnidarians flowering! seed plants non-flowering! seed plants flatworms VERTEBRATES lophophorates echinoderms rotifers roundworms ARTHROPODS TETRAPODS AMNIOTES cartilaginous! fish bony fish amphibians segmented! worms mollusks crustaceans insects chelicerates mammals turtles snakes! crocodiles! & lizards & birds

bacteria LIFE Trees archaebacteria EUKARYOTES protoctists PLANTS green algae fungi ANIMALS Tree: Connected network containing no cycles. mosses sponges flowering! seed plants non-flowering! seed plants ferns cnidarians flatworms Leaves (degree = ): present-day species lophophorates rotifers roundworms VERTEBRATES echinoderms ARTHROPODS TETRAPODS AMNIOTES cartilaginous! fish bony fish amphibians segmented! worms mollusks crustaceans insects chelicerates mammals turtles snakes! crocodiles! & lizards & birds

bacteria LIFE Trees archaebacteria EUKARYOTES protoctists PLANTS green algae fungi ANIMALS Tree: Connected network containing no cycles. mosses sponges flowering! seed plants non-flowering! seed plants ferns cnidarians flatworms Leaves (degree = ): present-day species lophophorates rotifers roundworms TETRAPODS AMNIOTES VERTEBRATES cartilaginous! fish bony fish amphibians echinoderms segmented! worms mollusks crustaceans insects ARTHROPODS chelicerates Internal nodes (degree ): ancestral species mammals turtles snakes! crocodiles! & lizards & birds

bacteria LIFE Trees archaebacteria EUKARYOTES protoctists PLANTS green algae fungi ANIMALS Tree: Connected network containing no cycles. mosses sponges flowering! seed plants non-flowering! seed plants ferns cnidarians flatworms Leaves (degree = ): present-day species lophophorates rotifers roundworms TETRAPODS AMNIOTES VERTEBRATES cartilaginous! fish bony fish amphibians echinoderms segmented! worms mollusks crustaceans insects ARTHROPODS chelicerates Internal nodes (degree ): ancestral species turtles snakes! crocodiles! & lizards & birds mammals Exercise: Design a Tree struct with a few methods.

Trees Most Recent Ancestor! TIME! Present Day! Rooted tree: one node is designated as the root (most recent common ancestor)

Trees mathoverflow.net! Unrooted tree: no node is designated as the root (we haven t inferred the location of most recent ancestor).

Distance-Based Phylogeny Distance-Based Phylogeny Problem: Construct an evolutionary tree from a distance matrix. Input: A distance matrix. Output: The unrooted tree corresponding to this distance matrix. Think: Is this problem clearly stated?

Distance-Based Phylogeny Distance-Based Phylogeny Problem: Construct an evolutionary tree from a distance matrix. Input: A distance matrix. Output: The unrooted tree corresponding to this distance matrix. Think: Is this problem clearly stated? We haven t stated what corresponding to means, so this isn t a computational problem!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Exercise: Find a tree fitting this distance matrix.

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Fitting a Tree to a Matrix Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Return to Distance-Based Phylogeny Distance-Based Phylogeny Problem: Construct an evolutionary tree from a distance matrix. Input: A distance matrix. Output: The unrooted tree fitting this distance matrix. Think: Is this problem clearly formulated?

Return to Distance-Based Phylogeny Exercise: Design an algorithm (pseudocode) that will construct a tree fitting any input distance matrix (mammal matrix reproduced below for reference). Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0

Return to Distance-Based Phylogeny Exercise: What tree does your method construct for the following matrix? i j k l i 0 3 j 3 0 3 k 0 3 l 3 3 0

Return to Distance-Based Phylogeny Exercise: Find the tree fitting this distance matrix. i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0

Sometimes, No Tree Fits Exercise: Find the tree fitting this distance matrix. i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 NO Solution! Additive matrix: distance matrix such that there exists an unrooted tree fitting it.

Sometimes, More Than One Tree Fits Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 Chimp! 3 Seal! Human! 0 Whale!

Sometimes, More Than One Tree Fits Chimp Human Seal Whale Chimp 0 3 6 4 Human 3 0 7 5 Seal 6 7 0 Whale 4 5 0 0.5 Seal! Chimp! 3.5 0 Whale! Human!

Which Tree is Better? Chimp! 3 Seal! Human! 0 Whale! 0.5 Seal! Chimp! 3.5 0 Whale! Human!

Which Tree is Better? Chimp! 3 Seal! Human! 0 Whale! 0.5 Seal! Chimp! 3.5 # incoming edges =! # incoming edges =! 0 Whale! Human!

Which Tree is Better? Chimp! 3 Seal! Human! 0 Whale! Degree: number of edges touching a node.

Which Tree is Better? Chimp! 3 Seal! Human! 0 Whale! Degree: number of edges touching a node. Simple tree: tree with no nodes of degree.

Which Tree is Better? Chimp! 3 Seal! Human! 0 Whale! Degree: number of edges touching a node. Simple tree: tree with no nodes of degree. Theorem: There is a unique simple tree fitting an additive matrix.

Reformulating Distance-Based Phylogeny Distance-Based Phylogeny Problem: Construct an evolutionary tree from a distance matrix. Input: A distance matrix. Output: The simple tree fitting this distance matrix (if this matrix is additive).

Reformulating Distance-Based Phylogeny Distance-Based Phylogeny Problem: Construct an evolutionary tree from a distance matrix. Input: A distance matrix. Output: The simple tree fitting this distance matrix (if this matrix is additive). But what should we do about non-additive matrices?

Reformulating Distance-Based Phylogeny Distance-Based Phylogeny Problem: Construct an evolutionary tree from a distance matrix. Input: A distance matrix. Output: The simple tree fitting this distance matrix (if this matrix is additive). But what should we do about non-additive matrices? Heuristic: A quick, practical method that does not necessarily solve a given problem.

Modeling Speciations Researchers often assume that all internal nodes correspond to speciations, where one species splits into two.

Modeling Speciations Squirrel Monkey! Orangutan! Unrooted binary tree: every node has degree or 3. Chimpanzee! Bonobo! Baboon! Human! Gorilla!

Modeling Speciations Rooted binary tree: an unrooted binary tree with a root (of degree ) on one of its edges. Squirrel Monkey! Baboon! Orangutan! Gorilla! Chimpanzee! Bonobo! Human!

Ultrametric Trees 33 3 Molecular clock: assigns ages to each node in the tree (age of leaves = 0). 3 7 6 Squirrel Monkey! Baboon! Orangutan! Gorilla! Chimpanzee! Bonobo! Human!

Ultrametric Trees 0 33 3 edge weights: correspond to difference in ages on the nodes the edge connects. 33 3 0 3 6 7 3 7 6 6 Squirrel Monkey! Baboon! Orangutan! Gorilla! Chimpanzee! Bonobo! Human!

Ultrametric Trees 0 33 3 Ultrametric tree: distance from root to any leaf is the same (i.e., age of root). 33 3 0 3 6 7 3 7 6 6 Squirrel Monkey! Baboon! Orangutan! Gorilla! Chimpanzee! Bonobo! Human!

UPGMA: A Clustering Heuristic. Form a cluster for each present-day species, each containing a single leaf. i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic. Find the two closest clusters C and C according to the average distance D avg (C, C ) = Σ i in C, j in C D i,j / C C where C denotes the number of elements in C. i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 3. Merge C and C into a single cluster C. i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 { k, l } i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 4. Form a new node for C and connect to C and C by an edge. Set age of C as D avg (C, C )/. i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 { k, l } i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 5. Update the distance matrix by computing the average distance between each pair of clusters. i j { k, l } i 0 3 3.5 j 3 0 4.5 { k, l } 3.5 4.5 0 { k, l } i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 6. Iterate until a single cluster contains all species. i j { k, l } i 0 3 3.5 j 3 0 4.5 { k, l } 3.5 4.5 0 { i, j }.5.5.5 i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 6. Iterate until a single cluster contains all species. {i, j} { k, l } {i, j} 0 4 { k, l } 4 0.5 { i, j }.5.5 i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 6. Iterate until a single cluster contains all species. {i, j} { k, l } {i, j} 0 4 { k, l } 4 0.5 0.5.5.5 i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic 6. Iterate until a single cluster contains all species. 0.5.5.5.5 i 0 j 0 k 0 l 0

UPGMA: A Clustering Heuristic UPGMA(D):. Form a cluster for each present-day species, each containing a single leaf.. Find the two closest clusters C and C according to the average distance D avg (C, C ) = Σ i in C, j in C D i,j / C C where C denotes the number of elements in C 3. Merge C and C into a single cluster C. 4. Form a new node for C and connect to C and C by an edge. Set age of C as D avg (C, C )/. 5. Update the distance matrix by computing the average distance between each pair of clusters. 6. Iterate steps -5 until a single cluster contains all species.

UPGMA Doesn t Fit a Tree to a Matrix i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 0.5.5.5.5 i 0 j 0 k 0 l 0

UPGMA Doesn t Fit a Tree to a Matrix i j k l i 0 3 4 3 j 3 0 4 5 k 4 4 0 l 3 5 0 0.5.5.5.5 i 0 j 0 k 0 l 0

Quick Quiz Exercise: Apply UPGMA to the following matrix. i j k l i 0 0 9 j 0 0 7 k 9 7 0 8 l 8 0