Molecular evolution. Joe Felsenstein. GENOME 453, Autumn Molecular evolution p.1/49

Similar documents
Inference of phylogenies, with some thoughts on statistics and geometry p.1/31

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Lecture 11 Friday, October 21, 2011

8/23/2014. Phylogeny and the Tree of Life


Multiple Sequence Alignment. Sequences

Phylogenetic Trees. How do the changes in gene sequences allow us to reconstruct the evolutionary relationships between related species?

Algorithms in Bioinformatics

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

PHYLOGENY AND SYSTEMATICS

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Constructing Evolutionary Trees

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Theory of Evolution Charles Darwin

Evolutionary trees. Describe the relationship between objects, e.g. species or genes

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Phylogeny. Properties of Trees. Properties of Trees. Trees represent the order of branching only. Phylogeny: Taxon: a unit of classification

BINF6201/8201. Molecular phylogenetic methods

Cladistics and Bioinformatics Questions 2013

Phylogenetic inference

Phylogeny and the Tree of Life

AN ALTERNATING LEAST SQUARES APPROACH TO INFERRING PHYLOGENIES FROM PAIRWISE DISTANCES

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family

Chapter 26 Phylogeny and the Tree of Life

Chapter 27: Evolutionary Genetics

Biology Keystone (PA Core) Quiz Theory of Evolution - (BIO.B ) Theory Of Evolution, (BIO.B ) Scientific Terms

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

Lecture 7 Mutation and genetic variation

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

What can sequences tell us?

Biodiversity. The Road to the Six Kingdoms of Life

Microbial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Phylogenetics in the Age of Genomics: Prospects and Challenges

Phylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

Processes of Evolution

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

FUNDAMENTALS OF MOLECULAR EVOLUTION

Early History up to Schedule. Proteins DNA & RNA Schwann and Schleiden Cell Theory Charles Darwin publishes Origin of Species

Chapter 26: Phylogeny and the Tree of Life

Principles of Phylogeny Reconstruction How do we reconstruct the tree of life? Basic Terminology. Looking at Trees. Basic Terminology.

Biodiversity. The Road to the Six Kingdoms of Life

Phylogeny and the Tree of Life

Organizing Life on Earth

Michael Yaffe Lecture #5 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D

Chapter 18 Systematics: Seeking Order Amidst Diversity

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Name: Class: Date: ID: A

Title slide (1) Tree of life 1891 Ernst Haeckel, Title on left

Tree thinking pretest

Mechanisms of Evolution Darwinian Evolution

Phylogenetic Analysis

Phylogenetic Tree Reconstruction

Molecular Evolution and Comparative Genomics

Theory of Evolution. Charles Darwin

7. Tests for selection

Phylogenetics. BIOL 7711 Computational Bioscience

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Chapter 16: Reconstructing and Using Phylogenies

Chapter 19 Organizing Information About Species: Taxonomy and Cladistics

How should we organize the diversity of animal life?

Evolution. Changes over Time

Michael Yaffe Lecture #4 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D

Bioinformatics Exercises

Name. Ecology & Evolutionary Biology 2245/2245W Exam 2 1 March 2014

Phylogeny Tree Algorithms

Emily Blanton Phylogeny Lab Report May 2009

Primate Diversity & Human Evolution (Outline)

Evolutionary trees. Describe the relationship between objects, e.g. species or genes

Phylogenetic Analysis

Phylogenetic Analysis

The neutral theory of molecular evolution

Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

Lecture Notes: BIOL2007 Molecular Evolution

Constructing Evolutionary/Phylogenetic Trees

Phylogeny and Molecular Evolution. Introduction

Lecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) p.1/26

Bio Microbiology - Spring 2013 Study Guide 14.

Phylogeny and the Tree of Life

Primate phylogeny: molecular evidence for a pongid clade excluding humans and a prosimian clade containing tarsiers

Many of the slides that I ll use have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Sequence Analysis 17: lecture 5. Substitution matrices Multiple sequence alignment

Biology 211 (2) Week 1 KEY!

Understanding relationship between homologous sequences

Dr. Amira A. AL-Hosary

9/19/2012. Chapter 17 Organizing Life s Diversity. Early Systems of Classification

Phylogeny & Systematics: The Tree of Life

Evidence of EVOLUTION

Transcription:

Molecular evolution Joe Felsenstein GENOME 453, utumn 2009 Molecular evolution p.1/49

data example for phylogeny inference Five DN sequences, for some gene in an imaginary group of species whose names are lpha, Beta, Gamma, Delta, and Epsilon: site species 1 2 3 4 5 6 lpha T G G C Beta C T C T C Gamma G G T C Delta G G G T Epsilon C T C G C Molecular evolution p.2/49

The tree we are evaluating lpha Delta Gamma Beta Epsilon Molecular evolution p.3/49

Steps in character 1 C C lpha Delta Gamma Beta Epsilon lpha Beta Gamma Delta Epsilon 1 2 3 4 5 6 T G G C C T C T C G G T C G G G T C T C G C or C C lpha Delta Gamma Beta Epsilon Molecular evolution p.4/49

Steps in character 2 T G G T T lpha Delta Gamma Beta Epsilon lpha Beta Gamma Delta Epsilon 1 2 3 4 5 6 T G G C C T C T C G G T C G G G T C T C G C or T G G T T lpha Delta Gamma Beta Epsilon or T G G T T lpha Delta Gamma Beta Epsilon Molecular evolution p.5/49

Steps in character 3 G G G C C lpha Delta Gamma Beta Epsilon lpha Beta Gamma Delta Epsilon 1 2 3 4 5 6 T G G C C T C T C G G T C G G G T C T C G C G G G C C lpha Delta Gamma Beta Epsilon Molecular evolution p.6/49

Steps in character 4 T T lpha Delta Gamma Beta Epsilon lpha Beta Gamma Delta Epsilon 1 2 3 4 5 6 T G G C C T C T C G G T C G G G T C T C G C or T T lpha Delta Gamma Beta Epsilon Molecular evolution p.7/49

Steps in character 5 G G G lpha Delta Gamma Beta Epsilon lpha Beta Gamma Delta Epsilon 1 2 3 4 5 6 T G G C C T C T C G G T C G G G T C T C G C or G G G lpha Delta Gamma Beta Epsilon Molecular evolution p.8/49

Steps in character 6 lpha Beta Gamma Delta Epsilon 1 2 3 4 5 6 T G G C C T C T C G G T C G G G T C T C G C C T C C C lpha Delta Gamma Beta Epsilon Molecular evolution p.9/49

Steps in all characters showing one of their possible placements lpha Delta Gamma Beta Epsilon 6 2 5 5 4 4 2 3 1 Molecular evolution p.10/49

The most parsimonious tree with one possible placement of the changes Gamma Delta lpha Beta Epsilon 5 4 6 5 4 2 1 3 Molecular evolution p.11/49

The same tree as an unrooted tree shown as an unrooted tree root can be anywhere changes can occur in either direction lpha Gamma Beta 5 4 2 1 3 4 5 6 Delta Epsilon Molecular evolution p.12/49

Direction of change depends on where root is Placement of the root affects which way bases change but not how many changes there are lpha Gamma Delta G if root is here C T T 5 6 4 2 G T C G C G T 1 3 4 Beta 5 Epsilon Molecular evolution p.13/49

Changing the root changes the direction Placement of the root affects which way bases change but not how many changes there are lpha Gamma Beta G T 5 4 2 G T 1 3 4 5 C T 6 G T C G C Delta if root is here Epsilon Molecular evolution p.14/49

ll possible trees (15 in all) C D B E D B E C B C E D B D C E B C E D E B C D D B C E B C D E C B D E D B E C B D C E B C D E C B D E E B D C E C B D Molecular evolution p.15/49

Their best numbers of nucleotide substitutions C D D B B E E C 11 9 B C E B D C D E 11 B C 11 D E E B 11 B C C D D E B 11 11 D E B D B C C E B D C B 9 E C 11 C E D E C 9 9 E D B D 10 C B 10 D E 8 D B E C 9 Molecular evolution p.16/49

The most parsimonious tree C D D B B E E C 11 9 B C E B D C D E 11 B C 11 D E E B 11 B C C D D E B 11 11 D E B D B C C E B D C B 9 E C 11 C E D E C 9 9 E D B D 10 C B 10 D E 8 D B E C 9 Molecular evolution p.17/49

Distance matrix methods C 5 0.5 2.5 1.5 Each possible tree (with branch lengths) predict pairwise distances F D B 4 2 3 2 3 E B C D E F B C D E F 0 11 8 14 15 9 11 0 9 7 810 8 14 15 9 9 7 8 10 0 13 14 13 14 2 2 0 513 5 014 13 14 0 Find the tree which comes closest to predicting the observed pairwise distances compare observed distances calculated from the data B C D E F B C D E F 0 10 9 12 16 9 10 0 10 6 9 9 9 10 0 10 15 2 12 6 10 0 613 16 9 15 6 015 9 9 2 13 15 0 Molecular evolution p.18/49

n example Turbeville. J. McC., Schulz, J.R. and R.. Raff. 1994. Deuterostome phylogeny and the sister group of the chordates: evidence from molecules and morphology. Molecular Biology and Evolution 11: 648-655. Xenopus?TCCTGGTTGTCCTGCCGTG-CTTGCTTGTCTCGTTGCCTGCCG Sebastol??????????????????????G-CTTGCTTGTCTCGTTGCCTGCG Latimeri?TCCTGGTTGTCCTGCCGTG-CTTGCTTGTCTCGTTGCCTGCTG Squalus??????????????????????G-CTTGCTTGTCTCGTTGCCTGCTG Myxine??CCCTGGTTGTCCTGCCGCCG-CTTGCTTGTCTCGCTGCCTGCTG Petromyz???CCTGGTTGTCCTGCCGTG-CTTGCTTGTCTCGTTGCCTGCTG Branch???CCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCCG Styela??TCTGGTTGTCCTGCCGTGTGTTGCTTGTCTCGTTGCCTGCGG Herdman?TTCTGGTTGTCCTGCCGTGTGTTGCTTGTCTC-GTTGCCTGCGG Saccogl??CCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG Ophiophol??CCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG Strongyl??CCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG Placopec CCCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG Limicol?TTCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG Eurypelm?TCCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG Tenebrio?TCCCTGGTTGTCCTGCCGTGTCTTGCTTGTCTCGTTGCCTGCTG (and so on for 33 more pages) Molecular evolution p.19/49

Tree using parsimony (acorn worms) Saccogl Ophiophol (echinoderms) Strongyl Herdman (sea squirts) Styela Brachiostoma (amphioxus) Placopec (molluscs) Limicol Petromyz(lamprey) Eurypelm (insects) Tenebrio Latimeri (coelacanth) Squalus(dogfish) Sebastol(fish) Xenopus (frog) Myxine (hagfish) Molecular evolution p.20/49

Saccogl Myxine Tree using a distance method Herdman Styela Ophiophol Branch (echinoderms) (acorn worms) (sea squirts) (amphioxus) Strongyl Placopec Squalus Sebastol (molluscs) Latimeri Xenopus Petromyz Limicol Eurypelm Tenebrio (insects) (lamprey) (dogfish) (fish) (coelacanth) (frog) (hagfish) Molecular evolution p.21/49

Tree using a maximum likelihood method Herdman (sea squirts) (acorn worm) Styela Saccogl Branch (amphioxus) Ophiophol (lamprey) (echinoderms) Petromyz Strongyl Myxine (hagfish) Placopec (molluscs) Limicol Squalus (dogfish) Sebastol (teleost) Xenopus Latimeri (frog) (coelacanth) Eurypelm (insects) Tenebrio Molecular evolution p.22/49

The tree with images of the animals Placopecten (scallop) Strongylocentrotus (sea urchin) Saccoglossus (acorn worm) Molecular evolution p.23/49

The tree with images of the animals Branchiostoma (amphioxus) tunicate Myxine (hagfish) Molecular evolution p.24/49

The tree with images of the animals Petromyzon dogfish dogfish (lamprey) Molecular evolution p.25/49

The tree with images of the animals Sebastoles (lrockfish) Latimeria (coelacanth) Xenopus (clawed frog) Molecular evolution p.26/49

Blair and Hedges alternative tree (in Molecular Biology and Evolution, 2005.) Molecular evolution p.27/49

Molecular evolution (1963 on) Linus Pauling in 1963 Emile Zuckerkandl, more recently Molecular evolution p.28/49

The late Margaret Dayhoff Responsible for the first computer-produced molecular phylogeny (1966), the start of protein sequence databases (1965), the recognition of gene families (1960s-1970s) Molecular evolution p.29/49

Morris Goodman Using immunological methods with proteins, defined the human-chimp-gorilla clade (1962), later pioneered work on evolution of gene families, especially the globin family. Molecular evolution p.30/49

The late llan Wilson With Vincent Sarich, supported the human-chimp-gorilla clade. dvocated use of the molecular clock by which they dated the divergence of these species to 5 million years ago. Later (as we shall see) found the tree of human mitochondria, whose ancestor was mitochondrial Eve. Molecular evolution p.31/49

n example: who is most closely related to whales? from mrine-madsen, H. et al., 2003, Molecular Phylogenetics and Evolution Molecular evolution p.32/49

The tree of human ancestry Old world monkeys Orangutan Chimp Bonobo Humans Gorilla Gibbon New world monkeys Molecular evolution p.33/49

Just who are you calling an ape? Old world monkeys Orangutan Chimp Bonobo Humans Gorilla Gibbon New world monkeys Molecular evolution p.34/49

Just who are you calling an ape? Old world monkeys Orangutan Chimp Bonobo Humans Gorilla Gibbon New world monkeys Molecular evolution p.35/49

Molecular phylogenies Some examples of other important conclusions from molecular phylogenies: Using immunological distances, Morris Goodman (1962 on) and later Wilson and Sarich (1966) show that humans, gorilla, and chimps were a clade. Wilson and Sarich (in that work, 1967) date the divergence of humans to 5 million years. Charles Sibley and Jon hlquist (1984) use DN hybridization to argue for the clade humans-chimps. Carl Woese (1978) uses rrn trees to introduce evolution into microbiology, argue for the domain rchaea. Much progress on early radiation of angiosperms Protostome-deuterostome tree of metazoans (more or less) replaced by deuterostome-lophotrichozoa-ecdysozoa tree. Fungi closer to animals than either is to plants. Symbiotic origin of mitochondria and of chloroplasts verified. mphioxus diverged before split of tunicates from craniate chordates. Lots of horizontal gene transfer in prokaryotes, almost not a tree. Molecular evolution p.36/49

Different parts of hemoglobin genes lignment of hemoglobin ε loci of Human, Tarsier Exon 1 Exon 2 Exon 3 Intron 1 Intron 2 region bases differences % different upstream 100 12 12.0 exon 1 92 9 9.8 intron 1 126 26 20.6 exon2 223 26 11.7 intron 2 820 239 29.1 exon 3 129 13 10.1 downstream 100 13 13.0 Differences in exons position 1 10 position 2 5 position 3 33 Molecular evolution p.37/49

Higher Rates of Substitution for... 1.... some proteins than others 2.... some sites within proteins than others (less in active sites, interior sites) 3.... some amino acid replacements than others (less changes to chemically more similar amino acids) 4.... silent changes than nonsilent ones 5.... in-between" DN than introns, introns than coding sequences 6.... transitions than transversions Molecular evolution p.38/49

Morris Goodman tabulation for β hemoglobin where sites change/100myr Heme contacts 21 0.02 Nonheme contacts 10 0.02 Salt bridges β-β 4 0.00 2,3-DPG binding 4 0.10 Nonsalt bridge α,β contacts 16 0.16 Remaining interior 21 0.09 Remaining exterior 70 0.20 ll 146 0.13 Molecular evolution p.39/49

PhyloHMM analysis of multiple genomes From a paper by Siepel and Haussler (Journal of Computational Biology, 2004) describing the machinery for finding conserved regions of multiple genomes. Molecular evolution p.40/49

Rates of change from neutral and selective mechanisms Neutral mutations fraction µ of all copies of a gene mutate. Of these 1 2N (equal to the initial frequency of the mutant) succeed in drifting to fixation for the mutant. There are in all 2N copies of the gene available to mutate. The resulting rate of substitution is µ 1 2N 2N = µ So the rate of substitution of neutral mutations is equal to the mutation rate (the mutation rate of neutral mutants, not the total mutation rate). Molecular evolution p.41/49

Change by neutral mutation end of a lineage which is the ancestry of a single copy of the gene: change is expected to be µ per generation gene copy ancestry Interestingly, the expected rate of change is the same no matter what the population size is (small populations make all copies more likely to become the same, but all are still expected to differ from their ancestors by this amount) time species boundary Molecular evolution p.42/49

Rates from neutral and selective mechanisms Selectively advantageous mutations fraction µ of copies of the gene mutate. There are in all 2N copies available. fraction 2s succeed in fixing. The resulting rate of substitution is µ 2N 2s = 4Nsµ Note that this is 4Ns times as high as for neutral mutants, if the mutation rate in both categories were equal (which it isn t). Molecular evolution p.43/49

Substitution of neutral and advantageous mutations Suppose that the population size is N = 1, 000, 000, and mutation rates are: dvantageous mutations Neutral mutations u a = 10 7 u n = 10 6 If the selection coefficient in favor of advantageous mutations is s = 0.0001, the rates of substitution expected are: dvantageous mutations Neutral mutations (4Ns)u a = 4 10 5 u n = 10 6 Molecular evolution p.44/49

The molecular clock (from Wilson, 1976) Molecular evolution p.45/49

not-quite-clocklike tree of Plethodon salamanders (Palmer et al., 2005, in Molecular Biology and Evolution) Molecular evolution p.46/49

Phylogeny and gene tree with a gene duplication phylogeny with a gene duplication event: Frog Human Monkey Squirrel a b a b a b species boundary gene duplication tree of genes Molecular evolution p.47/49

Phylogenies, gene trees, and gene duplication So when genes are all aligned with each other, their "gene tree" is: Frog Human Monkey Squirrel Human Monkey Squirrel a a a b b b These two subtrees should be identical Molecular evolution p.48/49

gene family Molecular evolution p.49/49