Molecular Clocks. The Holy Grail. Rate Constancy? Protein Variability. Evidence for Rate Constancy in Hemoglobin. Given
|
|
- Reginald Blankenship
- 5 years ago
- Views:
Transcription
1 Molecular Clocks Rose Hoberman The Holy Grail Fossil evidence is sparse and imprecise (or nonexistent) Predict divergence times by comparing molecular data Given a phylogenetic tree branch lengths (rt) a time estimate for one (or more) node C D R M 110 MYA H Rate Constancy? Can we date other nodes in the tree? Yes... if the rate of molecular change is constant across all branches Page & Holmes p240 Protein Variability Protein structures & functions differ Proportion of neutral sites differ Rate constancy does not hold across different protein types However... Each protein does appear to have a characteristic rate of evolution Evidence for Rate Constancy in Hemoglobin Large carniverous marsupial Page and Holmes p229 1
2 The Molecular Clock Hypothesis Amount of genetic difference between sequences is a function of time since separation. Rate of molecular change is constant (enough) to predict times of divergence Outline Methods for estimating time under a molecular clock Estimating genetic distance Determining and using calibration points Sources of error Rate heterogeneity reasons for variation how its taken into account when estimating times Reliability of time estimates Estimating gene duplication times Measuring Evolutionary time with a molecular clock 1. Estimate genetic distance d = number amino acid replacements 2. Use paleontological data to determine date of common ancestor T = time since divergence 3. Estimate calibration rate (number of genetic changes expected per unit time) r = d / 2T 4. Calculate time of divergence for novel sequences T_ij = d_ij / 2r Estimating Genetic Differences If all nt equally likely, observed difference would plateau at 0.75 Simply counting differences underestimates distances Fails to count for multiple hits (Page & Holmes p148) Estimating Genetic Distance with a Substitution Model accounts for relative frequency of different types of substitutions allows variation in substitution rates between sites given learned parameter values nucleotide frequencies transition/transversion bias alpha parameter of gamma distribution can infer branch length from differences Distances from Gamma-Distributed Rates rate variation among sites fast/variable sites 3 rd codon positions codons on surface of globular protein slow/invariant sites Trytophan (1 codon) structurally required 1 st or 2 nd codon position when di-sulfide bond needed alpha parameter of gamma distribution describes degree of variation of rates across positions modeling rate variation changes branch length/ sequence differences curve 2
3 Gamma Corrected Distances high rate sites saturate quickly sequence difference rises much more slowly as the low-rate sites gradually accumulate differences Felsenstein Inferring Phylogenies p219 The Sloppy Clock Ticks are stochastic, not deterministic Mutations happen randomly according to a Poisson distribution. Many divergence times can result in the same number of mutations Actually over-dispersed Poisson Correlations due to structural constraints Poisson Variance (Assuming A Pefect Molecular Clock) If mutation every MY Poisson variance 95% lineages 15 MYA old have 8-22 substitutions 8 substitutions also could be 5 MYA Molecular Systematics p532 Need for Calibrations Changes = rate*time Can explain any observed branch length Fast rate, short time Slow rate, long time Suppose 16 changes along a branch Could be 2 * 8 or 8 * 2 No way to distinguish If told time = 8, then rate = 2 Assume rate=2 along all branches Can infer all times Estimating Calibration Rate Calculate separate rate for each data set (species/genes) using known date of divergence (from fossil, biogeography) One calibration point Rate = d/2t More than one calibration point use regression use generative model that constrains time estimates (more later) Calibration Complexities Cannot date fossils perfectly Fossils usually not direct ancestors branched off tree before (after?) splitting event. Impossible to pinpoint the age of last common ancestor of a group of living species 3
4 Linear Regression Fix intercept at (0,0) Fit line between divergence estimates and calibration times Calculate regression and prediction confidence limits Molecular Systematics p536 Molecular Dating Sources of Error Both X and Y values only estimates substitution model could be incorrect tree could be incorrect errors in orthology assignment Poisson variance is large Pairwise divergences correlated (Systematics p534?) inflates correlation between divergence & time Sometimes calibrations correlated if using derived calibration points Error in inferring slope Confidence interval for predictions much larger than confidence interval for slope Rate Heterogeneity Rate of molecular evolution can differ between nucleotide positions genes genomic regions genomes (nuclear vs organelle), species species over time If not considered, introduces bias into time estimates Rate Heterogeneity among Lineages Cause Repair equipment Metabolic rate Generation time Population size Reason e.g. RNA viruses have error-prone polymerases More free radicals Copies DNA more frequently Effects mutation fixation rate Local Clocks? Closely related species often share similar properties, likely to have similar rates For example murid rodents on average 2-6 times faster than apes and humans (Graur & Li p150) mouse and rat rates are nearly equal (Graur & Li p146) Rate Changes within a Lineage Cause Population size changes Strength of selection changes over time Reason Genetic drift more likely to fix neutral alleles in small population 1. new role/environment 2. gene duplication 3. change in another gene 4
5 Working Around Rate Heterogeneity 1. Identify lineages that deviate and remove them 2. Quantify degree of rate variation to put limits on possible divergence dates requires several calibration dates, not always available gives very conservative estimates of molecular dates 3. Explicity model rate variation Search for Genes with Uniform Rate across Taxa Many clock tests: Relative rates tests compares rates of sister nodes using an outgroup Tajima test Number of sites in which character shared by outgroup and only one of two ingroups should be equal for both ingroups Branch length test deviation of distance from root to leaf compared to average distance Likelihood ratio test identifies deviance from clock but not the deviant sequences Likelihood Ratio Test estimate a phylogeny under molecular clock and without it e.g. root-to-tip distances must be equal difference in likelihood ~ 2*Chi^2 with n-2 degrees of freedom asymptotically when models are nested when nested parameters aren t set to boundary Relative Rates Tests Tests whether distance between two taxa and an outgroup are equal (or average rate of two clades vs an outgroup) need to compute expected variance many triples to consider, and not independent Lacks power, esp short sequences low rates of change Given length and number of variable sites in typical sequences used for dating, (Bronham et al 2000) says: unlikely to detect moderate variation between lineages (1.5-4x) likely to result in substantial error in date estimates Modeling Rate Variation Relaxing the Molecular Clock D E F Learn rates and times, not just M branch lengths A B C Assume root-to-tip times equal Allow different rates on different branches Rates of descendants correlate with that of common acnestor Restricts choice of rates, but still too much flexibility to choose rates well R N Relaxing the Molecular Clock Likelihood analysis Assign each branch a rate parameter explosion of parameters, not realistic User can partition branches based on domain knowledge Rates of partitions are independent Nonparametric methods smooth rates along tree Bayesian approach stochastic model of evolutionary change prior distribution of rates Bayes theorem MCMC 5
6 Parsimonious Approaches Sanderson 1997, 2002 infer branch lengths via parsimony fit divergence times to minimize difference between rates in successive branches (unique solution?) Cutler 2000 infer branch lengths via parsimony rates drawn from a normal distribution (negative rates set to zero) Bayesian Approaches Learn rates, times, and substitution parameters simultaneously Devise model of relationship between rates Thorne/Kishino et al Assigns new rates to descendant lineages from a lognormal distribution with mean equal to ancestral rate and variance increasing with branch length Huelsenbeck et al Poisson process generates random rate changes along tree new rate is current rate * gamma-distributed random variable Comparison of Likelihood & Bayesan Approaches for Estimating Divergence Times (Yang & Yoder 2003) Analyzed two mitochondrial genes each codon position treated separately tested different model assumptions used 7 calibration points Neither model reliable when using only one codon position using a single model for all positions Results similar for both methods using the most complex model use separate parameters for each codon position (could use codon model?) Sources of Error/Variance Lack of rate constancy (due to lineage, population size or selection effects) Wrong assumptions in evolutionary model Errors in orthology assignment Incorrect tree Stochastic variability Imprecision of calibration points Imprecision of regression Human sloppiness in analysis self-fulfilling prophecies Reading the entrails of chickens (Graur and Martin 2004) single calibration point error bars removed from calibration points standard error bars instead of 95% confidence intervals secondary/tertiary calibration points treated as reliable and precise based on incorrect initial estimates variance increases with distance from original estimate few proteins used Multiple Gene Loci Trying to estimate time of divergence from one protein is like trying to estimate the average height of humans by measuring one human --Molecular Systematics p539 Use multiple genes! (and multiple calibration points) 6
7 Even so... Be Very Wary Of Molecular Times Point estimates are absurd Sample errors often based only on the difference between estimates in the same study Even estimates with confidence intervals unlikely to really capture all sources of variance McLysaght, Hokamp, Wolfe 2002 Dating Human Gene Duplications [758] Trees generated (ML method using PAM matrix) [602] Alpha parameter for gamma distribution learned (Gu and Zhang 1997) faster than ML, more accurate than parsimony Thrown out if variance > mean. Why would this happen? May be problematic to apply this model for gene family evolution because of the possible functional divergence among paralogous genes [481] NJ trees built from Gamma-corrected distances Family kept only if worm/fly group together [191] Two-cluster test of rate constancy (Takezaki et al 1995) Blanc, Hokamp, Wolfe Dating Arabadopsis Duplications Create nucleotide alignments Estimate Level of Synonymous substitutions (Yang s ML method) per site? per synonymous site? Ks values > 10 ignored (Yang; Anisimova) Why used different method than for human? How reliable is ranking of Ks values? How much variance expected? Ks > 10 unreliable? Yang (abstract) calculates effect of evolutionary rate on accuracy of phylogenic reconstruction Anisimova calculates accuracy and power of LRT in detecting adaptive molecular evolution Neither seems to give any cutoff regarding ds > 10. Future Improvements Calculate accurate confidence intervals taking into account multiple sources of variance Novel models that account for variation in rates between taxa Build explicit models that predict rates based on an understanding of the underlying processes that generate differences in substitutions rates General References Reviews/Critiques 1. Bronham and Penny. The modern molecular clock, Nature review in genetics?, Graur and Martin. Reading the entrails of chickens...the illusion of precision. Trends in Genetics, Textbooks: 1. Molecular Systematics. 2 nd edition. Edited by Hillis, Moritz, and Mable. 2. Inferring Phylogenies. Felsenstein. 3. Molecular Evolution, a phylogenetic approach. Page and Holmes. 7
8 Rate Heterogeneity References Dealing with Rate Heterogeneity 1. Yang and Yoder. Comparison of likelihood and bayesian methods for estimating divergence times... Syst. Biol, Kishino, Thorne, and Bruno. Performance of a divergence time estimation method under a probabilistic model of rate evolution. Mol. Biol. Evol, Huelsenbeck, Larget, and Swofford. A compound poisson process for relaxing the molecular clock. Genetics, Testing for Rate heterogeneity 1. Takezaki, Rzhetsky and Nei. Phylogenetic test of the molecular clock and linearized trees. Mol. Bio. Evol., Bronham, Penny, Rambaut, and Hendy. The power of relative rates test depends on the data. J Mol Evol, Dating Duplications References Dating duplications: McLysaght, Hokamp, and Wolfe. Extensive genomic duplication during early chordate evolution. Nature Genetics?, Blanc, Hokamp, and Wolfe. Recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Research, Reference used for dating duplications in above papers Gu and Zhang. A simple method for estimating the parameter of substitution rate variation among sites. Mol. Biol. Evol., Yang Z. On the best evolutionary rate for phylogenetic analysis. Syst. Biol, Anisimova, Bielawski, Yang. Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution. Mol. Biol. Evol., Relative vs Absolute Rates M. Systematics p540 Differences in rates of divergence among lineages detract only from methods of analysis that require clocklike behavior of molecules, and alternative methods of analysis exist for all applications of molecular systematics except for the absolute estimation of time. t1 = 2 * t2 still requires clocklike behavior? Synonymous vs Nonsynonymous Distance Syn sites are sites where a nt change does not cause an AA change only ~25% of sites, so become saturated more quickly Between proteins more variation in non-synonymous rates Within same protein more variation in synonymous rates Which are used? What is effect? Two-cluster Test Takezaki, Rzhetsky and Nei (1995?) estimate tree for each nonroot interior node: calculate average rate for both descendant clades test equality of rates (using variance & covariance of branch lengths) [doesn t appear to correct for multiple testing] move up from leaves, eliminating a cluster if not equal finally, linear tree created reestimate branch lengths under clock constraint Neutral Hypothesis Most mutations have no influence on fitness of the organism Advantageous mutations rare Deleterious mutations rapidly removed Greatest proportion of mutations have no effect on protein function Rate of change is thus affected only by mutation rate, and so should be relatively constant within a species Variation in rate among genes b/c differences in selective constraints 8
9 Mutation Rate in Nuclear Genes of Mammals (Yang & Nielsen 1997) Acid phosphotase Myelin Proteolipid Interleukin 6 IGF binding 1 Thrombomodulin Average ds (P) ds (R) dn (P) dn(r) Perfect Molecular Clock Change linear function time (substitutions ~ Poisson) Rates constant (positions/lineages) Tree perfect Molecular distance estimated perfectly Calibration dates without error Regression (time vs substitutions) without error Yang, effect of evol. rate abstract Yang calculates effect of evolutionary rate on accuracy of phylogenic reconstruction simulation study branch length = expected total number nt substitutions per site (not synonymous?) estimates proportion of correctly recovered branch partitions optimum levels of sequence divergence were even higher than previously suggested for saturation of substitutions, indicating that the problem of saturation may have been exaggerated Bayesian parametric estimation Density function for x, given the training data set ( n) 1 N X = x x {,..., } ( ) ( ) ( n n p x X ) p( x, θ X ) d From the definition of conditional probability densities p X p X p X ( x, θ ) = ( x θ, ) ( θ ). ( ) ( ) ( ) The first factor is independent of X (n) since it just our ( n) assumed form p( x θ, X ) p( x θ ) for parameterized density. Therefore ( ) ( ) ( n n p x X ) p( x ) p( X ) d = = θ θ θ θ Bayesian parametric estimation Instead of choosing a specific value θ, the Bayesian approach performs a weighted average over all values of θ. ( n) If the weighting factor p( θ X ), which is a posterior of θ peaks very sharply about some value we obtain ( n) $θ p( x X ) p( x $ θ ). Thus the optimal estimator is the most likely value of θ given the data and the prior of θ. The Holy Grail Fossil evidence is sparse and imprecise (or nonexistent) Predict divergence times by comparing molecular data 9
C3020 Molecular Evolution. Exercises #3: Phylogenetics
C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from
More informationConcepts and Methods in Molecular Divergence Time Estimation
Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks
More informationIntegrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley
Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler Feb. 14, 2018. Phylogenetic trees VI: Dating in the 21st century: clocks, & calibrations;
More informationDr. Amira A. AL-Hosary
Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological
More informationAmira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut
Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological
More informationCHAPTERS 24-25: Evidence for Evolution and Phylogeny
CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology
More informationUsing phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)
Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures
More informationPOPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics
POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the
More informationInferring Speciation Times under an Episodic Molecular Clock
Syst. Biol. 56(3):453 466, 2007 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150701420643 Inferring Speciation Times under an Episodic Molecular
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istance-based methods Ultrametric Additive: UPGMA Transformed istance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood
More information8/23/2014. Phylogeny and the Tree of Life
Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major
More informationPhylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.
Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class
More information7. Tests for selection
Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info
More informationUnderstanding relationship between homologous sequences
Molecular Evolution Molecular Evolution How and when were genes and proteins created? How old is a gene? How can we calculate the age of a gene? How did the gene evolve to the present form? What selective
More informationDATING LINEAGES: MOLECULAR AND PALEONTOLOGICAL APPROACHES TO THE TEMPORAL FRAMEWORK OF CLADES
Int. J. Plant Sci. 165(4 Suppl.):S7 S21. 2004. Ó 2004 by The University of Chicago. All rights reserved. 1058-5893/2004/1650S4-0002$15.00 DATING LINEAGES: MOLECULAR AND PALEONTOLOGICAL APPROACHES TO THE
More informationBio 1B Lecture Outline (please print and bring along) Fall, 2007
Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution
More informationEstimating Divergence Dates from Molecular Sequences
Estimating Divergence Dates from Molecular Sequences Andrew Rambaut and Lindell Bromham Department of Zoology, University of Oxford The ability to date the time of divergence between lineages using molecular
More informationChapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships
Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic
More informationLikelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution
Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions
More informationPhylogenetics. BIOL 7711 Computational Bioscience
Consortium for Comparative Genomics! University of Colorado School of Medicine Phylogenetics BIOL 7711 Computational Bioscience Biochemistry and Molecular Genetics Computational Bioscience Program Consortium
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: Distance-based methods Ultrametric Additive: UPGMA Transformed Distance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood
More informationAssessing an Unknown Evolutionary Process: Effect of Increasing Site- Specific Knowledge Through Taxon Addition
Assessing an Unknown Evolutionary Process: Effect of Increasing Site- Specific Knowledge Through Taxon Addition David D. Pollock* and William J. Bruno* *Theoretical Biology and Biophysics, Los Alamos National
More informationLecture Notes: BIOL2007 Molecular Evolution
Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits
More informationLetter to the Editor. Department of Biology, Arizona State University
Letter to the Editor Traditional Phylogenetic Reconstruction Methods Reconstruct Shallow and Deep Evolutionary Relationships Equally Well Michael S. Rosenberg and Sudhir Kumar Department of Biology, Arizona
More informationUoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)
- Phylogeny? - Systematics? The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogenetic systematics? Connection between phylogeny and classification. - Phylogenetic systematics informs the
More informationLetter to the Editor. Temperature Hypotheses. David P. Mindell, Alec Knight,? Christine Baer,$ and Christopher J. Huddlestons
Letter to the Editor Slow Rates of Molecular Evolution Temperature Hypotheses in Birds and the Metabolic Rate and Body David P. Mindell, Alec Knight,? Christine Baer,$ and Christopher J. Huddlestons *Department
More informationInferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT
Inferring phylogeny Constructing phylogenetic trees Tõnu Margus Contents What is phylogeny? How/why it is possible to infer it? Representing evolutionary relationships on trees What type questions questions
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More information"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky
MOLECULAR PHYLOGENY "Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky EVOLUTION - theory that groups of organisms change over time so that descendeants differ structurally
More informationPhylogenetic inference
Phylogenetic inference Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, March 7 th 016 After this lecture, you can discuss (dis-) advantages of different information types
More informationLecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30
Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30 A non-phylogeny
More informationPhylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?
Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species
More informationAccuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution
Accuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution Maria Anisimova, Joseph P. Bielawski, and Ziheng Yang Department of Biology, Galton Laboratory, University College
More informationGENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.
!! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory
More informationEVOLUTIONARY DISTANCES
EVOLUTIONARY DISTANCES FROM STRINGS TO TREES Luca Bortolussi 1 1 Dipartimento di Matematica ed Informatica Università degli studi di Trieste luca@dmi.units.it Trieste, 14 th November 2007 OUTLINE 1 STRINGS:
More informationPhylogenetic Tree Reconstruction
I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven
More informationBioinformatics tools for phylogeny and visualization. Yanbin Yin
Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and
More informationConsensus Methods. * You are only responsible for the first two
Consensus Trees * consensus trees reconcile clades from different trees * consensus is a conservative estimate of phylogeny that emphasizes points of agreement * philosophy: agreement among data sets is
More informationFUNDAMENTALS OF MOLECULAR EVOLUTION
FUNDAMENTALS OF MOLECULAR EVOLUTION Second Edition Dan Graur TELAVIV UNIVERSITY Wen-Hsiung Li UNIVERSITY OF CHICAGO SINAUER ASSOCIATES, INC., Publishers Sunderland, Massachusetts Contents Preface xiii
More informationLecture 11 Friday, October 21, 2011
Lecture 11 Friday, October 21, 2011 Phylogenetic tree (phylogeny) Darwin and classification: In the Origin, Darwin said that descent from a common ancestral species could explain why the Linnaean system
More informationT R K V CCU CG A AAA GUC T R K V CCU CGG AAA GUC. T Q K V CCU C AG AAA GUC (Amino-acid
Lecture 11 Increasing Model Complexity I. Introduction. At this point, we ve increased the complexity of models of substitution considerably, but we re still left with the assumption that rates are uniform
More informationSEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION
Annu. Rev. Genomics Hum. Genet. 2003. 4:213 35 doi: 10.1146/annurev.genom.4.020303.162528 Copyright c 2003 by Annual Reviews. All rights reserved First published online as a Review in Advance on June 4,
More informationEfficiencies of maximum likelihood methods of phylogenetic inferences when different substitution models are used
Molecular Phylogenetics and Evolution 31 (2004) 865 873 MOLECULAR PHYLOGENETICS AND EVOLUTION www.elsevier.com/locate/ympev Efficiencies of maximum likelihood methods of phylogenetic inferences when different
More informationA (short) introduction to phylogenetics
A (short) introduction to phylogenetics Thibaut Jombart, Marie-Pauline Beugin MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis with PR Statistics, Millport Field
More informationQ1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.
OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall
More informationReconstructing the history of lineages
Reconstructing the history of lineages Class outline Systematics Phylogenetic systematics Phylogenetic trees and maps Class outline Definitions Systematics Phylogenetic systematics/cladistics Systematics
More informationProcesses of Evolution
15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection
More informationMaximum Likelihood Estimation on Large Phylogenies and Analysis of Adaptive Evolution in Human Influenza Virus A
J Mol Evol (2000) 51:423 432 DOI: 10.1007/s002390010105 Springer-Verlag New York Inc. 2000 Maximum Likelihood Estimation on Large Phylogenies and Analysis of Adaptive Evolution in Human Influenza Virus
More informationPhylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline
Phylogenetics Todd Vision iology 522 March 26, 2007 pplications of phylogenetics Studying organismal or biogeographic history Systematics ating events in the fossil record onservation biology Studying
More informationIntraspecific gene genealogies: trees grafting into networks
Intraspecific gene genealogies: trees grafting into networks by David Posada & Keith A. Crandall Kessy Abarenkov Tartu, 2004 Article describes: Population genetics principles Intraspecific genetic variation
More informationNatural selection on the molecular level
Natural selection on the molecular level Fundamentals of molecular evolution How DNA and protein sequences evolve? Genetic variability in evolution } Mutations } forming novel alleles } Inversions } change
More informationMaximum Likelihood Until recently the newest method. Popularized by Joseph Felsenstein, Seattle, Washington.
Maximum Likelihood This presentation is based almost entirely on Peter G. Fosters - "The Idiot s Guide to the Zen of Likelihood in a Nutshell in Seven Days for Dummies, Unleashed. http://www.bioinf.org/molsys/data/idiots.pdf
More informationConsistency Index (CI)
Consistency Index (CI) minimum number of changes divided by the number required on the tree. CI=1 if there is no homoplasy negatively correlated with the number of species sampled Retention Index (RI)
More informationBootstrapping and Tree reliability. Biol4230 Tues, March 13, 2018 Bill Pearson Pinn 6-057
Bootstrapping and Tree reliability Biol4230 Tues, March 13, 2018 Bill Pearson wrp@virginia.edu 4-2818 Pinn 6-057 Rooting trees (outgroups) Bootstrapping given a set of sequences sample positions randomly,
More informationDating r8s, multidistribute
Phylomethods Fall 2006 Dating r8s, multidistribute Jun G. Inoue Software of Dating Molecular Clock Relaxed Molecular Clock r8s multidistribute r8s Michael J. Sanderson UC Davis Estimation of rates and
More informationAUTHOR COPY ONLY. Hetero: a program to simulate the evolution of DNA on a four-taxon tree
APPLICATION NOTE Hetero: a program to simulate the evolution of DNA on a four-taxon tree Lars S Jermiin, 1,2 Simon YW Ho, 1 Faisal Ababneh, 3 John Robinson, 3 Anthony WD Larkum 1,2 1 School of Biological
More information9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)
I9 Introduction to Bioinformatics, 0 Phylogenetic ree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & omputing, IUB Evolution theory Speciation Evolution of new organisms is driven by
More informationTree of Life iological Sequence nalysis Chapter http://tolweb.org/tree/ Phylogenetic Prediction ll organisms on Earth have a common ancestor. ll species are related. The relationship is called a phylogeny
More informationAnatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses
Anatomy of a tree outgroup: an early branching relative of the interest groups sister taxa: taxa derived from the same recent ancestor polytomy: >2 taxa emerge from a node Anatomy of a tree clade is group
More informationMOLECULAR SYSTEMATICS: A SYNTHESIS OF THE COMMON METHODS AND THE STATE OF KNOWLEDGE
CELLULAR & MOLECULAR BIOLOGY LETTERS http://www.cmbl.org.pl Received: 16 August 2009 Volume 15 (2010) pp 311-341 Final form accepted: 01 March 2010 DOI: 10.2478/s11658-010-0010-8 Published online: 19 March
More informationBINF6201/8201. Molecular phylogenetic methods
BINF60/80 Molecular phylogenetic methods 0-7-06 Phylogenetics Ø According to the evolutionary theory, all life forms on this planet are related to one another by descent. Ø Traditionally, phylogenetics
More informationMETHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.
Chapter 12 (Strikberger) Molecular Phylogenies and Evolution METHODS FOR DETERMINING PHYLOGENY In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern
More informationBioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics
Bioinformatics 1 Biology, Sequences, Phylogenetics Part 4 Sepp Hochreiter Klausur Mo. 30.01.2011 Zeit: 15:30 17:00 Raum: HS14 Anmeldung Kusss Contents Methods and Bootstrapping of Maximum Methods Methods
More informationHow Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building
How Molecules Evolve Guest Lecture: Principles and Methods of Systematic Biology 11 November 2013 Chris Simon Approaching phylogenetics from the point of view of the data Understanding how sequences evolve
More informationPAML 4: Phylogenetic Analysis by Maximum Likelihood
PAML 4: Phylogenetic Analysis by Maximum Likelihood Ziheng Yang* *Department of Biology, Galton Laboratory, University College London, London, United Kingdom PAML, currently in version 4, is a package
More informationMOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei"
MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS Masatoshi Nei" Abstract: Phylogenetic trees: Recent advances in statistical methods for phylogenetic reconstruction and genetic diversity analysis were
More informationBootstrap confidence levels for phylogenetic trees B. Efron, E. Halloran, and S. Holmes, 1996
Bootstrap confidence levels for phylogenetic trees B. Efron, E. Halloran, and S. Holmes, 1996 Following Confidence limits on phylogenies: an approach using the bootstrap, J. Felsenstein, 1985 1 I. Short
More informationPhylogenetics: Bayesian Phylogenetic Analysis. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Bayesian Phylogenetic Analysis COMP 571 - Spring 2015 Luay Nakhleh, Rice University Bayes Rule P(X = x Y = y) = P(X = x, Y = y) P(Y = y) = P(X = x)p(y = y X = x) P x P(X = x 0 )P(Y = y X
More informationPhylogenomics. Jeffrey P. Townsend Department of Ecology and Evolutionary Biology Yale University. Tuesday, January 29, 13
Phylogenomics Jeffrey P. Townsend Department of Ecology and Evolutionary Biology Yale University How may we improve our inferences? How may we improve our inferences? Inferences Data How may we improve
More informationPhylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Distance Methods COMP 571 - Spring 2015 Luay Nakhleh, Rice University Outline Evolutionary models and distance corrections Distance-based methods Evolutionary Models and Distance Correction
More informationEstimating the Rate of Evolution of the Rate of Molecular Evolution
Estimating the Rate of Evolution of the Rate of Molecular Evolution Jeffrey L. Thorne,* Hirohisa Kishino, and Ian S. Painter* *Program in Statistical Genetics, Statistics Department, North Carolina State
More informationMultiple Sequence Alignment. Sequences
Multiple Sequence Alignment Sequences > YOR020c mstllksaksivplmdrvlvqrikaqaktasglylpe knveklnqaevvavgpgftdangnkvvpqvkvgdqvl ipqfggstiklgnddevilfrdaeilakiakd > crassa mattvrsvksliplldrvlvqrvkaeaktasgiflpe
More informationSystematics - Bio 615
Bayesian Phylogenetic Inference 1. Introduction, history 2. Advantages over ML 3. Bayes Rule 4. The Priors 5. Marginal vs Joint estimation 6. MCMC Derek S. Sikes University of Alaska 7. Posteriors vs Bootstrap
More informationBayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies
Bayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies 1 What is phylogeny? Essay written for the course in Markov Chains 2004 Torbjörn Karfunkel Phylogeny is the evolutionary development
More informationRELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG
RELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG Department of Biology (Galton Laboratory), University College London, 4 Stephenson
More informationInference of Viral Evolutionary Rates from Molecular Sequences
Inference of Viral Evolutionary Rates from Molecular Sequences Alexei Drummond 1,2, Oliver G. Pybus 1 and Andrew Rambaut 1 * 1 Department of Zoology, University of Oxford, South Parks Road, Oxford, OX1
More information1 ATGGGTCTC 2 ATGAGTCTC
We need an optimality criterion to choose a best estimate (tree) Other optimality criteria used to choose a best estimate (tree) Parsimony: begins with the assumption that the simplest hypothesis that
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 2009 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods
More informationmolecular evolution and phylogenetics
molecular evolution and phylogenetics Charlotte Darby Computational Genomics: Applied Comparative Genomics 2.13.18 https://www.thinglink.com/scene/762084640000311296 Internal node Root TIME Branch Leaves
More informationStatistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution?
Statistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution? 30 July 2011. Joe Felsenstein Workshop on Molecular Evolution, MBL, Woods Hole Statistical nonmolecular
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 200 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationIntroduction to Bioinformatics Introduction to Bioinformatics
Dr. rer. nat. Gong Jing Cancer Research Center Medicine School of Shandong University 2012.11.09 1 Chapter 4 Phylogenetic Tree 2 Phylogeny Evidence from morphological ( 形态学的 ), biochemical, and gene sequence
More informationSequence Divergence & The Molecular Clock. Sequence Divergence
Sequence Divergence & The Molecular Clock Sequence Divergence v simple genetic distance, d = the proportion of sites that differ between two aligned, homologous sequences v given a constant mutation/substitution
More informationWhat Is Conservation?
What Is Conservation? Lee A. Newberg February 22, 2005 A Central Dogma Junk DNA mutates at a background rate, but functional DNA exhibits conservation. Today s Question What is this conservation? Lee A.
More informationLetter to the Editor. The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1
Letter to the Editor The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1 Department of Zoology and Texas Memorial Museum, University of Texas
More informationEstimating Absolute Rates of Molecular Evolution and Divergence Times: A Penalized Likelihood Approach
Estimating Absolute Rates of Molecular Evolution and Divergence Times: A Penalized Likelihood Approach Michael J. Sanderson Section of Evolution and Ecology, University of California, Davis Rates of molecular
More informationUnit 7: Evolution Guided Reading Questions (80 pts total)
AP Biology Biology, Campbell and Reece, 10th Edition Adapted from chapter reading guides originally created by Lynn Miriello Name: Unit 7: Evolution Guided Reading Questions (80 pts total) Chapter 22 Descent
More informationLecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) p.1/26
Lecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 27. Phylogeny methods, part 4 (Models of DNA and
More informationElements of Bioinformatics 14F01 TP5 -Phylogenetic analysis
Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis 10 December 2012 - Corrections - Exercise 1 Non-vertebrate chordates generally possess 2 homologs, vertebrates 3 or more gene copies; a Drosophila
More informationPhylogeny and Systematics
Chapter 25 Phylogeny and Systematics PowerPoint Lectures for Biology, Seventh Edition Neil Campbell and Jane Reece Lectures by Chris Romero Modified by Maria Morlin racing phylogeny Phylogeny: he evolutionary
More informationTheory of Evolution Charles Darwin
Theory of Evolution Charles arwin 858-59: Origin of Species 5 year voyage of H.M.S. eagle (83-36) Populations have variations. Natural Selection & Survival of the fittest: nature selects best adapted varieties
More informationPhylogene)cs. IMBB 2016 BecA- ILRI Hub, Nairobi May 9 20, Joyce Nzioki
Phylogene)cs IMBB 2016 BecA- ILRI Hub, Nairobi May 9 20, 2016 Joyce Nzioki Phylogenetics The study of evolutionary relatedness of organisms. Derived from two Greek words:» Phle/Phylon: Tribe/Race» Genetikos:
More informationProceedings of the SMBE Tri-National Young Investigators Workshop 2005
Proceedings of the SMBE Tri-National Young Investigators Workshop 25 Control of the False Discovery Rate Applied to the Detection of Positively Selected Amino Acid Sites Stéphane Guindon,* Mik Black,*à
More informationTaming the Beast Workshop
Workshop and Chi Zhang June 28, 2016 1 / 19 Species tree Species tree the phylogeny representing the relationships among a group of species Figure adapted from [Rogers and Gibbs, 2014] Gene tree the phylogeny
More informationPhylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University
Phylogenetics: Building Phylogenetic Trees COMP 571 - Fall 2010 Luay Nakhleh, Rice University Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary
More informationPhylogeny and the Tree of Life
LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert B. Jackson Chapter 26 Phylogeny and the Tree of Life
More informationChapter 19: Taxonomy, Systematics, and Phylogeny
Chapter 19: Taxonomy, Systematics, and Phylogeny AP Curriculum Alignment Chapter 19 expands on the topics of phylogenies and cladograms, which are important to Big Idea 1. In order for students to understand
More informationPOPULATION GENETICS Biology 107/207L Winter 2005 Lab 5. Testing for positive Darwinian selection
POPULATION GENETICS Biology 107/207L Winter 2005 Lab 5. Testing for positive Darwinian selection A growing number of statistical approaches have been developed to detect natural selection at the DNA sequence
More informationPhylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)
Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction Lesser Tenrec (Echinops telfairi) Goals: 1. Use phylogenetic experimental design theory to select optimal taxa to
More information