Time-dependent Molecular Rates: A Very Rough Guide

Similar documents
Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

PHYLOGENY AND SYSTEMATICS

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Dr. Amira A. AL-Hosary

Lecture 11 Friday, October 21, 2011

8/23/2014. Phylogeny and the Tree of Life

Processes of Evolution

Cladistics and Bioinformatics Questions 2013

Chapter 16: Reconstructing and Using Phylogenies

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Understanding relationship between homologous sequences

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

Phylogeny and the Tree of Life

Evolution Problem Drill 09: The Tree of Life

Big Idea #1: The process of evolution drives the diversity and unity of life

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

How should we organize the diversity of animal life?

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Modern Evolutionary Classification. Section 18-2 pgs

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

Chapter 27: Evolutionary Genetics

Chapter 19: Taxonomy, Systematics, and Phylogeny

Phylogenetic Trees. How do the changes in gene sequences allow us to reconstruct the evolutionary relationships between related species?

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Examples of Phylogenetic Reconstruction

Estimating Evolutionary Trees. Phylogenetic Methods

Chapter 26 Phylogeny and the Tree of Life

Name: Class: Date: ID: A

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Rate of Evolution Juliana Senawi

Welcome to Evolution 101 Reading Guide

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

Evolution Problem Drill 10: Human Evolution

Organizing Life s Diversity

Biology 211 (2) Week 1 KEY!

AP Biology Review Packet 5- Natural Selection and Evolution & Speciation and Phylogeny

Lecture 7 Mutation and genetic variation

Multiple Sequence Alignment. Sequences

Intraspecific gene genealogies: trees grafting into networks

Macroevolution Part I: Phylogenies

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Evolutionary Patterns, Rates, and Trends

Theory a well supported testable explanation of phenomenon occurring in the natural world.

Selection against A 2 (upper row, s = 0.004) and for A 2 (lower row, s = ) N = 25 N = 250 N = 2500

Chapter 16: Evolutionary Theory

Temporal Trails of Natural Selection in Human Mitogenomes. Author. Published. Journal Title DOI. Copyright Statement.

Name: Period Study Guide 17-1 and 17-2

Chapter 7: Covalent Structure of Proteins. Voet & Voet: Pages ,

Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab

5/31/17. Week 10; Monday MEMORIAL DAY NO CLASS. Page 88

Phylogeny and the Tree of Life

Lesson 1 Syllabus Reference

Evidence for Evolution

The Theory of Evolution

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility

BIOLOGY 432 Midterm I - 30 April PART I. Multiple choice questions (3 points each, 42 points total). Single best answer.

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.

Examining the Fossil Record

The abundance of deleterious polymorphisms in humans

Evaluate evidence provided by data from many scientific disciplines to support biological evolution. [LO 1.9, SP 5.3]

The neutral theory of molecular evolution

BINF6201/8201. Molecular phylogenetic methods

In a way, organisms determine who belongs to their species by choosing with whom they will! MODERN EVOLUTIONARY CLASSIFICATION 18-2 MATE

Chapter 26 Phylogeny and the Tree of Life

AP Biology Notes Outline Enduring Understanding 1.B. Big Idea 1: The process of evolution drives the diversity and unity of life.

Reconstructing the history of lineages

The practice of naming and classifying organisms is called taxonomy.

Molecular Markers, Natural History, and Evolution

UON, CAS, DBSC, General Biology II (BIOL102) Dr. Mustafa. A. Mansi. The Origin of Species

Concepts and Methods in Molecular Divergence Time Estimation

Classification and Phylogeny

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

The Tempo of Macroevolution: Patterns of Diversification and Extinction

How Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building

Lecture Notes: BIOL2007 Molecular Evolution

C.DARWIN ( )

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Chapter 7. Evolution and the Fossil Record

9/19/2012. Chapter 17 Organizing Life s Diversity. Early Systems of Classification

Human Evolution. Darwinius masillae. Ida Primate fossil from. in Germany Ca.47 M years old. Cantius, ca 55 mya

Classification and Phylogeny

Phylogeny and the Tree of Life

Biology. Slide 1 of 24. End Show. Copyright Pearson Prentice Hall

Outline. Classification of Living Things

10 Biodiversity Support. AQA Biology. Biodiversity. Specification reference. Learning objectives. Introduction. Background

Origin and Evolution of Life

Thursday, January 14. Teaching Point: SWBAT. assess their knowledge to prepare for the Evolution Summative Assessment. (TOMORROW) Agenda:

Genomes and Their Evolution

Theory of Evolution Charles Darwin

Transcription:

Time-dependent Molecular Rates: A Very Rough Guide

Background / Introduction

- Time dependency of morphological evolution: inverse correlation with time interval over which the rate was measured rate of change between successive generations exceeded macroevolutionary rates by several orders of magnitude. (Kurtén 1959) o confirmed in a number of subsequent studies - The molecular clock: proteins experience amino acid replacements at a surprisingly consistent rate across very different species presumed single, uniform rate of genetic evolution. (Zuckerkandl & Pauling 1965) o Versatile tool for dating genetic events - The updated molecular clock: Rates can vary across various dimensions (Ho & Lee 2016) o Site effects: different parts of the genome o Lineage effects: taxa o And, seemingly, across time: Inverse correlation between measured rate and timescale of measurement time-dependent molecular rate (TDMR)

A B Site effect Lineage effect Time (My ago) 0 100 Clade X Clad Clade X 200 C D Epoch effect Site & Lineage effects 0 Time (My ago) and ver, he ary ntially fossil ock n birds h radiated period e oldest h groups of the ular rigin of ms) more the first ar (Figure ncies il ned. As enerally ous fossils, mmal or y to be ation or t largely molecular more adiations, ences would be models to es. an also traits and evolution. nalyses ains ecular 100 Clade X Clade X 200 Key: Gene 1 Gene 2 Fast evolutionary rate Slow evolutionary rate Current Biology Figure 2. Modern molecular clocks can accommodate complex variation in rates of genetic change across the tree of life. (A) Rate variation across sites: gene 1 evolves rapidly but gene 2 evolves slowly, across all lineages. (B) Rate variation across lineages: genes 1 and 2 both evolve rapidly in clade X. (C) Rate variation across time periods or epochs : genes 1 and 2 both evolved rapidly between 140 and 80

What (is TDMR)

- Disparity between spontaneous mutation rates measured over short timescales and substitution rates over geological timescales - In other words: mutation rate (µ) vs. substitution rate (σ) - All mutations strictly neutral => σ = µ (Kimura 1968) - => Expect µ and σ to differ when many mutations aren t neutral - Rates on different timescales reflect different biological processes: o Very short (e.g. between successive generations): Can include all but the most detrimental mutations Approaches µ (discounting lethal mutations) o Very long (e.g. between distantly related species): Usually dominated by substitutions (fixed mutations) Approximates σ: < µ due to purifying (negative) selection

Present AB C D t 1 coalescent history of extant individuals t 2 between intraspecific genealogies and speciation event Past t 3 coalescent history of ancestral population prior to speciation Fig. 3 A simplified representation of genealogical history at a single locus in a pair of species. Each circle represents a randomly mating individual and each row represents one generation. The ancestral species (spanning the time period t 3 ) splits into two descendent species (spanning the periods t 2 and t 1 ). Four contemporary individuals labelled A, B, C and D are referred to in the text.

- A and B closely related individuals: Observed differences dominated by mutations - A and D from different species: Differences likely dominated by substitutions (fixed during t2, sometimes t3), though some may be polymorphisms (generally arising in t1) - Proportions of t1, t2, t3 -> relative influence of µ and σ on rate estimate from A-D comparison, e.g. o t2 long relative to t1, t3 => estimate approximates σ - Another factor: effective population size o E.g. if large and most mutations deleterious: estimate from A-C < estimate from A-B

How

The degree of divergence between 2 sequences is determined by: a. The rate of molecular change b. TMRCA (time to most recent common ancestor) => To estimate the rate (a), independent information about the evolutionary timescale (b) is needed.

Typical calibration techniques for the shortest timescales: - Laboratory mutation-accumulation lines (Keightley et al. 2009) - Studies of pedigrees (Haag-Liautard et al. 2008) - Heterochronous sampling: sampled sequences of different ages (Duchêne et al. 2015) o Also used for longer timescales, e.g. with adna (ancient DNA) o Calibration technique: temporal structure eliminates need for calibration by fixing an internal node at a point in time

Calibration techniques for longer timescales: - Dated geological events (Burridge et al. 2008) - Archaeological and anthropological evidence (Henn et al. 2009) Fossil record usually for calibrations at least several million years in age - Evidence for earliest appearance of members of separate lineages o Minimum age constraint for divergence event - Usually insufficient morphological variation on shorter timescales for reliable diagnostic usage

ANIMALS AND PLANTS Fossil record Geology/Biogeography Ancient DNA Pedigree 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 Lab line Serial sampling Years before present Archaeology Ancient DNA BACTERIA AND VIRUSES Pop. co-divergence Species co-divergence Fig. 2 Typical age ranges of different forms of calibrating information.

Where

- mtdna in: o Humans: pedigree rate of mtdna sequence divergence higher than phylogenetic rate (meta-analysis by Howell et al. 2003) o Fish (Burridge et al. 2008) o Birds (Subramanian et al. 2009) o Insects (Papadopoulou et al. 2010) - Bacteria (Comas et al. 2013) - Viruses (Aiewsakun & Katzourakis 2015)

It s Complicated

- The role of purifying selection in removing some mutations, and the existence of some degree of TDMR is uncontroversial - But: Magnitude of TDMR is debated - Support for order-of-magnitude differences - But that is disputed (e.g. on methodological grounds, e.g. Emerson & Hickerson 2015) - 2 illustrations of order-of-magnitude differences, 1 of sub-order-ofmagnitude difference:

Evolutionary rate (substitutions/site/year) 10 5 10 6 10 7 10 8 10 9 10 10 10 5 10 6 10 7 10 8 10 0 (a) (c) 10 1 10 2 10 4 10 6 10 8 10 2 10 3 10 4 (b) (d) 10 3 10 4 10 5 10 6 10 7 10 6 10 7 10 8 10 2 10 4 10 6 10 10 2 10 0 10 2 10 4 10 6 10 8 8 Fig. 1 Time-dependent patterns in rate estimates have been observed in a variety of taxonomic groups, including: (a) noncoding mitochondrial DNA from amniotes (data from Fig. 1b in Molak & Ho 2015), (b) mitochondrial DNA from insects (data from Fig. 2 in Ho & Lo 2013), (c) genomic DNA from bacteria (data from Fig. 4 in Comas et al. 2013) and (d) genomic RNA and DNA from viruses (data from Fig. 1a in Duch^ene et al. 2014). Trend lines are based on those estimated in the original analyses of these data sets. In panel d, separate trend lines are given for RNA viruses (solid) and DNA viruses (dashed). Calibration age (years)

FIG. 4. Difference between mtdna mutation rate estimates and rates expected under the K2P þ c model. Phylogeny-based mutation rate estimates are drawn from tables 1 and 2. Solid black line: effective mutation rate under our general model of K2P þ c assuming published pedigreebased mutation rate (0.95/bp/My) taken from Howell et al. (2003), ts/tv 5 20/1, a 5 0.4. We explored the sensitivity of the model parameters, examples for each parameter are displayed. Solid gray lines: effective mutation rate given alternative instantaneous pedigree mutation rates 1.20 and 0.75/bp/My. Dotted line: effective mutation rate given ts/tv 5 100/1. Dashed lines: effective mutation rate given alternative a 5 0.8 and 0.2.

Why

Roughly classifiable into biological and methodological factors - Note: Only one purely biological factor, i.e. purifying selection - Most methodological factors involve inadequate modelling of biological processes - Relative importance of factors likely vary across cases Natural selection: purifying (negative) selection - Substantial proportion of slightly deleterious mutations lost continuously from mtdna gene pool over a prolonged period (Kivisild et al. 2006) - Time-dependent decline in the ratio of non-synonymous to synonymous changes in coding sequences (Peterson & Masel 2009)

Methodological factors Calibration error - For calibrations based on the presumed timing of population- or speciesdivergence event, genetic divergence typically assumed to coincide with population divergence. - Genetic divergence often precedes population/species divergence => underestimate of time since genetic divergence => rate overestimation - Can result in rate underestimation, when genetic divergence postdates calibration event - Magnitude of bias greatest on short timescales, where the difference between genetic and population divergence times is an appreciable proportion of the total time separating the two populations or species

T T split T T split Lineage 1 Lineage 2 T T split Species A Species B

Mutational saturation - E.g. when one nucleotide undergoes multiple substitutions from its initial state, and only the first and last states are observed - Unobserved substitutions bias rate estimates downward - Likely to be less of a problem over very short time frames, but important over longer time frames - Under-correction for saturation can contribute to time-dependent rates

Rate heterogeneity across sites (RHAS) - Genomic sites can evolve at significantly different rates o E.g. mtdna exhibits up to 1,000-fold difference between the fastest and slowest evolving sites - Decreases rate estimates through increasing saturation (more unobserved substitutions) at fast-evolving sites over time - Time-dependent underestimate more pronounced both when the proportion of mutational hotspots increases, and when the ratio between high and low rates increases in the data (Soubrier et al. 2012)

A B FIG. 3.Mathematical exploration of the relationship between divergence time, actual substitution rate, and inferred rate from the mathematical model. (A) Rates for various proportions of fastsites, when the ratio of fast to slow rates is 1,000. (B) Ratesforvariousratiosoffastto slow rates, when the proportion of fast sites is fixed at 10%. * represents the parameter values chosen in subsequent simulations.

Demographic factors - E.g. population expansions o For humans, purifying selection further weakened by population expansions associated with the out-of Africa migration and the end of the Last Ice Age. (Henn et al. 2009) o => Higher rate estimates for timescales since then compared to before

References Burridge CP, Craw D, Fletcher D, Waters JM (2008). Geological dates and molecular rates: fish DNA sheds light on time dependency. Molecular Biology and Evolution 25:624 633. Comas I, Coscolla M, Luo T et al. (2013). Out-of-Africa migration and Neolithic coexpansion of Mycobacterium tuberculosis with modern humans. Nature Genetics 45:1176 1182. Duchêne S, Duchêne D, Holmes EC, Ho SYW (2015). The performance of the date-randomisation test in phylogenetic analyses of time-structured virus data. Molecular Biology and Evolution 32:1895 1906. Haag-Liautard C, Coffey N, Houle D et al. (2008). Direct estimation of the mitochondrial DNA mutation rate in Drosophila melanogaster. PLoS Biology 6:e204. Henn BM, Gignoux CR, Feldman MW, Mountain JL (2009). Characterizing the time dependency of human mitochondrial DNA mutation rate estimates. Molecular Biology and Evolution 26:217 230. Howell N, Smejkal CB, Mackey DA, Chinnery PF, Turnbull DM, Herrnstadt C (2003). The pedigree rate of sequence divergence in the human mitochondrial genome. There is a difference between phylogenetic and pedigree rates. Am J Hum Genet 72:659 670. Keightley PD, Trivedi U, Thomson M et al. (2009). Analysis of the genome sequences of three Drosophila melanogaster spontaneous mutation accumulation lines. Genome Research 19:1195 1201. Kimura M (1968). Evolutionary rate at the molecular level. Nature 217:624 626.

Kivisild T, Shen P, Wall DP, Do B, Sung R, Davis K et al. (2006). The role of selection in the evolution of human mitochondrial genomes. Genetics 172:373 387. Lee MSY, Ho SYW (2016). Molecular clocks. Current Biology 26:R387-R407 Loogväli EL, Kivisild T, Margus T, Villems R (2009). Explaining the imperfection of the molecular clock of hominid mitochondria. PLoS ONE 4:e8260. Papadopoulou A, Anastasiou I, Vogler AP (2010). Revisiting the insect mitochondrial molecular clock: the mid- Aegean trench calibration. Molecular Biology and Evolution 27:1659 1672. Peterson GI, Masel J (2009). Quantitative prediction of molecular clock and Ka Ks at short timescales. Molecular Biology and Evolution 26:2595 2603. Soubrier J, Steel M, Lee MSY et al. (2012). The influence of rate heterogeneity among sites on the time dependence of molecular rates. Molecular Biology and Evolution 29:3345-3358. Subramanian S, Denver DR, Millar CD et al. (2009). High mitogenomic evolutionary rates and time dependency. Trends in Genetics 25:482 486. Zuckerkandl E, and Pauling L (1962). Molecular disease, evolution, and genic heterogeneity. In Horizons in Biochemistry, M. Kasha, and B. Pullman, eds. (New York: Academic Press): 189 225.