Homoplasy. Selection of models of molecular evolution. Evolutionary correction. Saturation
|
|
- Griffin Haynes
- 6 years ago
- Views:
Transcription
1 Homoplasy Selection of models of molecular evolution David Posada Homoplasy indicates identity not produced by descent from a common ancestor. Graduate class in Phylogenetics, Campus Agrário de Vairão, Vila do Conde, Portugal, 6 April 2006 David Posada University of Vigo, Spain, Saturation Evolutionary correction Models of substitution describe how sequences evolve through time and help correct for the saturation effect. Multiple hits lead to saturation at some point in time. David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
2 Models of nucleotide substitution Proportion of invariable sites If some sites do not change it is possible that fast evolving sequences show less divergence than slower sequences. b a c f d e Q A: rate = 0.005, p-inv = 0.2 B: rate = 0.02, p-inv = 0.5 P(t = e-qt David Posada University of Vigo, Spain, Rate variation among sites David Posada University of Vigo, Spain, Models of evolution A discrete gamma distribution with four categories is commonly used David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
3 Models are abstractions Models are important Models have a clear influence in Models asume independence of sites, stationarity, equilibrium frecuencies... All models are wrong, but indeed,...some are useful. - Box, 1976 Parameter estimation (ti/tv,!,p i,.. Topology estimation Phylogenetic confidence Hypothesis testing David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Effect of model in topology estimation: HIV pol Statistical tests of models Model adequacy is an absolute measure of the fit of a model to a particular data set. Model choice is a relative procedure to select the models that results in the best fit to a particular data set. We measure fit using the likelihood. A NJ K80 B NJ - GTR +! There are differences in the position of the subtype A. David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
4 Likelihood In phylogenetics see 18,19 we define the likelihood (L as (proportional to the probability of the data (D given a model of evolution (M, a vector of K model parameters! = (! 1,! 2,...,! K, a tree topology (", and a vector of S branch lengths,! = (! 1,! 2,...,! S : Statistical methods for model selection Hierarchical LRTs BIC Bayes factors AIC L = P(D M,!,",# David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Parsimony principle Likelihood ratio tests (LRT All model selection methods are somehow based on the principle of parsimony. LRT = 2 (! 1!! 0! 1 is the log of the maximized likelihood under the alternative hypothesis (complex model! 0 is the log of the maximized likelihood under the null hypothesis (simple model If models are nested, the LRT is distributed as a " 2 (or a mixed " 2, with q degrees of freedom, where q is the difference in number of free parameters between the two models 22 David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
5 Hierarchical LRTs David Posada University of Vigo, Spain,
6 Problems with hlrts Bayes factors 1 Maybe there is not an optimal model 2 The optimal model depends on the significance level 3 Local optima Bayes factors 23 are the Bayesian analogue of the LRT 24. B ij = P(D M i P(D M j Evidence for M 2 is considered very strong if B ij >150, strong if 12<B ij <150, positive if 3<B ij <12, barely worth mentioning if 1<B ij <3, and negative (supports M j if B ij <1 25. Used to infer the occurrence of recombination 26, to compare different phylogenetic hypothesis and for model selection 30 David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Posterior probabilities For multiple models P(M i D =! P(D M i P(M i R r =1 P(D M r P(M r Bayesian Information Criterion (BIC Simple approximation to the log marginal likelihood of a model BIC =!2! + K logn We generally use MCMC to approximate model likelihoods: harmonic mean of the likelihood. Given equal priors for all competing models, choosing the model with the smallest BIC is equivalent to selecting the model with the maximum posterior probability David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
7 Model selection uncertainty Model averaging We can establish a "95% credible set of models by summing the posterior probabilities from largest to smallest until the sum is just # 0.95 (Occam s window. We can make inferences based on the entire set of candidate models. the overall posterior mean of the shape of the gamma distribution (! would be: G E(! D = "! ˆ P(M D i i, i =1 David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Parameter importance The relative importance for the shape of the gamma distribution across all candidate models is: Kullback-Leibler distance ( = f (x log I f, g " f (x % ( $ ' dx # g(x! & R w + (! = " P(D M i P(M i I! (M i, i =1 where " I! (M i = 1 if! is in model M i # $ 0 otherwise I(f,g is the information lost when using g to approximate f David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
8 Akaike s information criterion (AIC The AIC estimates the expected K-L information. The AIC penalized the likelihood by the number of parameters (K AIC =!2! + 2K We prefer the model with smallest AIC. For small samples ($ n/k < 40: AIC c = AIC + 2K(K +1 n! K!1 Akaike differences AIC (% AIC difference (% i are rescaled AICs reescalated with the minimum AIC (% i =0 As a rule of thumb 31 :! i = AIC i " min AIC % i & 1-2 receive substantial support and are considered when making inferences 4 & % i & 7 have considerably less support % i > 10 receive no support David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Akaike weights (w i Akaike 32 suggested that the exp(!1/2" i approximate the relative likelihood of the models given the data L(M i D. Akaike weights (w i for R models: w i = # exp(!1/2" i R r =1 exp(!1/2" r Model selection uncertainty wity the AIC The Akaike weights are very useful for assessing model-selection uncertainty. We can establish a 95% confidence set of models for the best K-L model by summing the Akaike weights from largest to smallest until the sum is just # We can interpret them as the probability of a model is the best approximation of the true model, given the data. David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
9 Model averaging We can also make inferences based on the entire set of candidate models using the AIC Estimating phylogenies with 24 models! ˆ = " R i=1 w i I(M i! i w + (!, where R w + (! = " w i I! (M i, i=1 " I! (M i = 1 if! is in model M i # $ 0 otherwise. Also, w + (! estimates the relative importance of any parameter. David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Akaike weights Model AIC ' ( Cumulative ( GTR GTR+* SYM SYM+* HKY HKY+* K K80+* GTR+* SYM+* HKY+* F F81+* K80+* JC JC+* F81+* JC+* JC F K HKY SYM GTR Consensus tree for 24 models using Akaike weights David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
10 Parameter importance and model-averaged estimates Parameter Importance Model-averaged estimates fa fc fg ft TiTv rac rag rat rcg rct pinv(i alpha(g pinv(i+ig alpha(g+ig AIC and Bayes AIC approximates reality Bayesian approaches try to identify the most probable model If, as is usually the case, none of the models is correct, what sense can we make of prior or posterior odds? Bayesian priors What is exactly the sample size n? *for 56 models David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, Model selection strategies Modeltest Good properties for model selection methods hlrt Bayesian AIC Applies easily to nonnested models No Yes Yes Allows for the simultaneous comparison of multiple models It does not depend on a subjective significance level No Yes Yes No Yes Yes Incorporates topology uncertainty No Yes* No Easy to compute Yes No* Yes Implements hlrt, AIC for 56 models. It will implement the BIC and tools for model selection uncertainty and model averaging. Google Any OS Assesses model selection uncertainty No Yes Yes So far, it needs PAUP* Allows model averaging No Yes Yes Provides the possibility of specifying prior information for models No Yes* Yes Web-server Provides the possibility of specifying prior information for model parameters Designed to approximate, rather than to identify, truth No Yes* No No No Yes *Not the BIC David Posada University of Vigo, Spain, David Posada University of Vigo, Spain,
11 Take home 1. Models are abstractions to learn about processes. 2. We can statistically select the best-fit model for our data. I recommend AIC or Bayes. 3. It is easy to implement using PAUP* and Modeltest. Model selection is a useful tool for research, but it is not a substitute for careful thinking and common sense reasoning - Browne, David Posada University of Vigo, Spain, David Posada University of Vigo, Spain, References 1. Felsenstein, J. Cases in which parsimony or compatibility methods will be positively misleading. Systematic Zoology 27, ( Huelsenbeck, J.P. & Hillis, D.M. Success of phylogenetic methods in the four-taxon case. Systematic Biology 42, ( Penny, D., Lockhart, P.J., Steel, M.A. & Hendy, M.D. The role of models in reconstructing evolutionary trees. in Models in Phylogenetic Reconstruction, Vol. 52 (eds. Scotland, R.W., Siebert, D.J. & Williams, D.M (Clarendon Press, Oxford, Bruno, W.J. & Halpern, A.L. Topological bias and inconsistency of maximum likelihood using wrong models. Molecular Biology and Evolution 16, ( Sullivan, J. & Swofford, D.L. Are guinea pigs rodents? The importance of adequate models in molecular phylogenies. Journal of Mammalian Evolution 4, ( Kelsey, C.R., Crandall, K.A. & Voevodin, A.F. Different models, different trees: The geographic origin of PTLV-I. Molecular Phylogenetics and Evolution 13, ( Buckley, T.R. & Cunningham, C.W. The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support. Molecular Biology and Evolution 19, ( Buckley, T.R., Simon, C. & Chambers, G.K. Exploring among-site rate variation models in a maximum likelihood framework using empirical data: The effects of model assumptions on estimates of topology, edge lengths, and bootstrap support. Systematic Biology 50, ( Buckley, T.R. Model misspecification and probabilistic tests of topology: evidence from empirical data sets. Systematic Biology 51, ( Zhang, J. Performance of likelihood ratio tests of evolutionary hypotheses under inadequate substitution models. Molecular Biology and Evolution 16, ( Sullivan, J. & Swofford, D.L. Should we use model-based methods for phylogenetic inference when we know that assumptions about among-site rate variation and nucleotide substitution pattern are violated? Systematic Biology 50, ( Pupko, T., Huchon, D., Cao, Y., Okada, N. & Hasegawa, M. Combining Multiple Data Sets in a Likelihood Analysis: Which Models are the Best? Molecular Biology and Evolution 19, ( Tamura, K. Model selection in the estimation of the number of nucleotide substitutions. Molecular Biology and Evolution 11, ( Yang, Z., Goldman, N. & Friday, A. Maximum likelihood trees from DNA sequences: a peculiar statistical estimation problem. Systematic Biology 44, ( Yang, Z. How often do wrong models produce better phylogenies? Molecular Biology and Evolution 14, ( Xia, X. Phylogenetic relationships among horseshoe crab species: Effect of substitution models in phylogenetic analysis. Systematic Biology 49, ( Gaut, B.S. & Lewis, P.O. Success of maximum likelihood phylogeny inference in the four-taxon case. Molecular Biology and Evolution 12, (1995. David Posada University of Vigo, Spain, Felsenstein, J. Evolutionary trees from DNA sequences: A maximum likelihood approach. Journal of Molecular Evolution 17, ( Goldman, N. Maximum likelihood inference of phylogenetic trees, with special reference to a Poisson process model of DNA substitution and to parsimony analyses. Systematic Zoology 39, ( Goldman, N. Simple diagnostic statistical test of models of DNA substitution. Journal of Molecular Evolution 37, ( Swofford, D.L., Olsen, G.J., Waddell, P.J. & Hillis, D.M. Phylogenetic Inference. in Molecular Systematics (eds. Hillis, D.M., Moritz, C. & Mable, B.K (Sinauer Associates, Sunderland, MA, Kendall, M. & Stuart, A. The Advanced Theory of Statistics, (Charles Griffin, London, Kass, R.E. & Raftery, A.E. Bayes factors. Journal of the American Statistical Association 90, ( Suchard, M.A., Kitchen, C.M., Sinsheimer, J.S. & Weiss, R.E. Hierarchical phylogenetic models for analyzing multipartite sequence data. Systematic Biology 52, ( Raftery, A.E. Hypothesis testing and model selection. in Markov Chain Monte Carlo in practice (eds. Gilks, W.R., Richardson, S. & Spiegelhalter, D.J (Chapman & Hall, London ; New York, Suchard, M.A., Weiss, R.E., Dorman, K.S. & Sinsheimer, J.S. Oh brother, where art thou? A Bayes factor test for recombination with uncertain heritage. Systematic Biology 51, ( Huelsenbeck, J.P. & Imennov, N.S. Geographic origin of human mitochondrial DNA: accommodating phylogenetic uncertainty and model comparison. Systematic Biology 51, ( Huelsenbeck, J.P., Rannala, B. & Larget, B. A Bayesian framework for the analysis of cospeciation. Evolution Int J Org Evolution 54, ( Suchard, M.A., Weiss, R.E. & Sinsheimer, J.S. Testing a Molecular Clock without an Outgroup: Derivations of Induced Priors on Branch-Length Restrictions in a Bayesian Framework. Systematic Biology 52, ( Suchard, M.A., Weiss, R.E. & Sinsheimer, J.S. Bayesian selection of continuous-time Markov chain evolutionary models. Molecular Biology and Evolution 18, ( Burnham, K.P. & Anderson, D.R. Model selection and multimodel inference: a practical information-theoretic approach, (Springer-Verlag, New York, NY, Akaike, H. Information measures and model selection. International Statistical Institute 22, (1983. David Posada University of Vigo, Spain,
Preliminaries. Download PAUP* from: Tuesday, July 19, 16
Preliminaries Download PAUP* from: http://people.sc.fsu.edu/~dswofford/paup_test 1 A model of the Boston T System 1 Idea from Paul Lewis A simpler model? 2 Why do models matter? Model-based methods including
More informationPhylogenetic Inference using RevBayes
Phylogenetic Inference using RevBayes Model section using Bayes factors Sebastian Höhna 1 Overview This tutorial demonstrates some general principles of Bayesian model comparison, which is based on estimating
More informationPhylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University
Phylogenetics: Building Phylogenetic Trees COMP 571 - Fall 2010 Luay Nakhleh, Rice University Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary
More informationPhylogenetics: Building Phylogenetic Trees
1 Phylogenetics: Building Phylogenetic Trees COMP 571 Luay Nakhleh, Rice University 2 Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary model should
More informationAssessing an Unknown Evolutionary Process: Effect of Increasing Site- Specific Knowledge Through Taxon Addition
Assessing an Unknown Evolutionary Process: Effect of Increasing Site- Specific Knowledge Through Taxon Addition David D. Pollock* and William J. Bruno* *Theoretical Biology and Biophysics, Los Alamos National
More informationIntegrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2016 University of California, Berkeley. Parsimony & Likelihood [draft]
Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2016 University of California, Berkeley K.W. Will Parsimony & Likelihood [draft] 1. Hennig and Parsimony: Hennig was not concerned with parsimony
More informationThe Importance of Proper Model Assumption in Bayesian Phylogenetics
Syst. Biol. 53(2):265 277, 2004 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150490423520 The Importance of Proper Model Assumption in Bayesian
More informationAkaike Information Criterion
Akaike Information Criterion Shuhua Hu Center for Research in Scientific Computation North Carolina State University Raleigh, NC February 7, 2012-1- background Background Model statistical model: Y j =
More informationEfficiencies of maximum likelihood methods of phylogenetic inferences when different substitution models are used
Molecular Phylogenetics and Evolution 31 (2004) 865 873 MOLECULAR PHYLOGENETICS AND EVOLUTION www.elsevier.com/locate/ympev Efficiencies of maximum likelihood methods of phylogenetic inferences when different
More informationHow should we go about modeling this? Model parameters? Time Substitution rate Can we observe time or subst. rate? What can we observe?
How should we go about modeling this? gorilla GAAGTCCTTGAGAAATAAACTGCACACACTGG orangutan GGACTCCTTGAGAAATAAACTGCACACACTGG Model parameters? Time Substitution rate Can we observe time or subst. rate? What
More informationKaKs Calculator: Calculating Ka and Ks Through Model Selection and Model Averaging
Method KaKs Calculator: Calculating Ka and Ks Through Model Selection and Model Averaging Zhang Zhang 1,2,3#, Jun Li 2#, Xiao-Qian Zhao 2,3, Jun Wang 1,2,4, Gane Ka-Shu Wong 2,4,5, and Jun Yu 1,2,4 * 1
More informationInferring Molecular Phylogeny
Dr. Walter Salzburger he tree of life, ustav Klimt (1907) Inferring Molecular Phylogeny Inferring Molecular Phylogeny 55 Maximum Parsimony (MP): objections long branches I!! B D long branch attraction
More informationDoes Choice in Model Selection Affect Maximum Likelihood Analysis?
Syst. Biol. 57(1):76 85, 2008 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150801898920 Does Choice in Model Selection Affect Maximum Likelihood
More informationConsensus Methods. * You are only responsible for the first two
Consensus Trees * consensus trees reconcile clades from different trees * consensus is a conservative estimate of phylogeny that emphasizes points of agreement * philosophy: agreement among data sets is
More informationLecture 4. Models of DNA and protein change. Likelihood methods
Lecture 4. Models of DNA and protein change. Likelihood methods Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 4. Models of DNA and protein change. Likelihood methods p.1/36
More informationPerformance-Based Selection of Likelihood Models for Phylogeny Estimation
Syst. Biol. 52(5):674 683, 2003 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150390235494 Performance-Based Selection of Likelihood Models for
More informationMOLECULAR SYSTEMATICS: A SYNTHESIS OF THE COMMON METHODS AND THE STATE OF KNOWLEDGE
CELLULAR & MOLECULAR BIOLOGY LETTERS http://www.cmbl.org.pl Received: 16 August 2009 Volume 15 (2010) pp 311-341 Final form accepted: 01 March 2010 DOI: 10.2478/s11658-010-0010-8 Published online: 19 March
More informationLab 9: Maximum Likelihood and Modeltest
Integrative Biology 200A University of California, Berkeley "PRINCIPLES OF PHYLOGENETICS" Spring 2010 Updated by Nick Matzke Lab 9: Maximum Likelihood and Modeltest In this lab we re going to use PAUP*
More informationBayesian support is larger than bootstrap support in phylogenetic inference: a mathematical argument
Bayesian support is larger than bootstrap support in phylogenetic inference: a mathematical argument Tom Britton Bodil Svennblad Per Erixon Bengt Oxelman June 20, 2007 Abstract In phylogenetic inference
More informationEstimating Divergence Dates from Molecular Sequences
Estimating Divergence Dates from Molecular Sequences Andrew Rambaut and Lindell Bromham Department of Zoology, University of Oxford The ability to date the time of divergence between lineages using molecular
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationA Bayesian Approach to Phylogenetics
A Bayesian Approach to Phylogenetics Niklas Wahlberg Based largely on slides by Paul Lewis (www.eeb.uconn.edu) An Introduction to Bayesian Phylogenetics Bayesian inference in general Markov chain Monte
More informationEstimating Phylogenies (Evolutionary Trees) II. Biol4230 Thurs, March 2, 2017 Bill Pearson Jordan 6-057
Estimating Phylogenies (Evolutionary Trees) II Biol4230 Thurs, March 2, 2017 Bill Pearson wrp@virginia.edu 4-2818 Jordan 6-057 Tree estimation strategies: Parsimony?no model, simply count minimum number
More informationUsing phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)
Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures
More informationUsing models of nucleotide evolution to build phylogenetic trees
Developmental and Comparative Immunology 29 (2005) 211 227 www.elsevier.com/locate/devcompimm Using models of nucleotide evolution to build phylogenetic trees David H. Bos a, *, David Posada b a School
More informationConsistency Index (CI)
Consistency Index (CI) minimum number of changes divided by the number required on the tree. CI=1 if there is no homoplasy negatively correlated with the number of species sampled Retention Index (RI)
More informationLetter to the Editor. Department of Biology, Arizona State University
Letter to the Editor Traditional Phylogenetic Reconstruction Methods Reconstruct Shallow and Deep Evolutionary Relationships Equally Well Michael S. Rosenberg and Sudhir Kumar Department of Biology, Arizona
More informationPoints of View JACK SULLIVAN 1 AND DAVID L. SWOFFORD 2
Points of View Syst. Biol. 50(5):723 729, 2001 Should We Use Model-Based Methods for Phylogenetic Inference When We Know That Assumptions About Among-Site Rate Variation and Nucleotide Substitution Pattern
More informationAre Guinea Pigs Rodents? The Importance of Adequate Models in Molecular Phylogenetics
Journal of Mammalian Evolution, Vol. 4, No. 2, 1997 Are Guinea Pigs Rodents? The Importance of Adequate Models in Molecular Phylogenetics Jack Sullivan1'2 and David L. Swofford1 The monophyly of Rodentia
More informationSystematics - Bio 615
Bayesian Phylogenetic Inference 1. Introduction, history 2. Advantages over ML 3. Bayes Rule 4. The Priors 5. Marginal vs Joint estimation 6. MCMC Derek S. Sikes University of Alaska 7. Posteriors vs Bootstrap
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More information"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2009 University of California, Berkeley
"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2009 University of California, Berkeley B.D. Mishler Jan. 22, 2009. Trees I. Summary of previous lecture: Hennigian
More informationLikelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution
Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions
More informationOEB 181: Systematics. Catalog Number: 5459
OEB 181: Systematics Catalog Number: 5459 Tu & Th, 10-11:30 am, MCZ 202 Wednesdays, 2-4 pm, Science Center 418D Gonzalo Giribet (Biolabs 1119, ggiribet@oeb.harvard.edu) Charles Marshall (MCZ 111A, cmarshall@fas.harvard.edu
More informationMaximum Likelihood Tree Estimation. Carrie Tribble IB Feb 2018
Maximum Likelihood Tree Estimation Carrie Tribble IB 200 9 Feb 2018 Outline 1. Tree building process under maximum likelihood 2. Key differences between maximum likelihood and parsimony 3. Some fancy extras
More informationReconstruire le passé biologique modèles, méthodes, performances, limites
Reconstruire le passé biologique modèles, méthodes, performances, limites Olivier Gascuel Centre de Bioinformatique, Biostatistique et Biologie Intégrative C3BI USR 3756 Institut Pasteur & CNRS Reconstruire
More informationBayesian inference & Markov chain Monte Carlo. Note 1: Many slides for this lecture were kindly provided by Paul Lewis and Mark Holder
Bayesian inference & Markov chain Monte Carlo Note 1: Many slides for this lecture were kindly provided by Paul Lewis and Mark Holder Note 2: Paul Lewis has written nice software for demonstrating Markov
More informationLecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30
Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30 A non-phylogeny
More informationGeographic Origin of Human Mitochondrial DNA: Accommodating Phylogenetic Uncertainty and Model Comparison
2002 POINTS OF VIEW 155 constructive comments and Olaf Bininda-Emonds and Mike Sanderson for providing a preprint of their simulation work. REFERENCES BARRETT, M., M. J. DONOGHUE, AND E. SOBER. 1991. Against
More informationPenalized Loss functions for Bayesian Model Choice
Penalized Loss functions for Bayesian Model Choice Martyn International Agency for Research on Cancer Lyon, France 13 November 2009 The pure approach For a Bayesian purist, all uncertainty is represented
More informationBootstrap confidence levels for phylogenetic trees B. Efron, E. Halloran, and S. Holmes, 1996
Bootstrap confidence levels for phylogenetic trees B. Efron, E. Halloran, and S. Holmes, 1996 Following Confidence limits on phylogenies: an approach using the bootstrap, J. Felsenstein, 1985 1 I. Short
More informationInfer relationships among three species: Outgroup:
Infer relationships among three species: Outgroup: Three possible trees (topologies): A C B A B C Model probability 1.0 Prior distribution Data (observations) probability 1.0 Posterior distribution Bayes
More informationPhylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Distance Methods COMP 571 - Spring 2015 Luay Nakhleh, Rice University Outline Evolutionary models and distance corrections Distance-based methods Evolutionary Models and Distance Correction
More informationMinimum evolution using ordinary least-squares is less robust than neighbor-joining
Minimum evolution using ordinary least-squares is less robust than neighbor-joining Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA email: swillson@iastate.edu November
More informationMolecular Clocks. The Holy Grail. Rate Constancy? Protein Variability. Evidence for Rate Constancy in Hemoglobin. Given
Molecular Clocks Rose Hoberman The Holy Grail Fossil evidence is sparse and imprecise (or nonexistent) Predict divergence times by comparing molecular data Given a phylogenetic tree branch lengths (rt)
More informationPOPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics
POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the
More informationBayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies
Bayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies 1 What is phylogeny? Essay written for the course in Markov Chains 2004 Torbjörn Karfunkel Phylogeny is the evolutionary development
More informationThe Phylogenetic Handbook
The Phylogenetic Handbook A Practical Approach to DNA and Protein Phylogeny Edited by Marco Salemi University of California, Irvine and Katholieke Universiteit Leuven, Belgium and Anne-Mieke Vandamme Rega
More informationBayesian phylogenetics. the one true tree? Bayesian phylogenetics
Bayesian phylogenetics the one true tree? the methods we ve learned so far try to get a single tree that best describes the data however, they admit that they don t search everywhere, and that it is difficult
More informationThanks to Paul Lewis and Joe Felsenstein for the use of slides
Thanks to Paul Lewis and Joe Felsenstein for the use of slides Review Hennigian logic reconstructs the tree if we know polarity of characters and there is no homoplasy UPGMA infers a tree from a distance
More informationFrequentist Properties of Bayesian Posterior Probabilities of Phylogenetic Trees Under Simple and Complex Substitution Models
Syst. Biol. 53(6):904 913, 2004 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150490522629 Frequentist Properties of Bayesian Posterior Probabilities
More informationAn Introduction to Bayesian Inference of Phylogeny
n Introduction to Bayesian Inference of Phylogeny John P. Huelsenbeck, Bruce Rannala 2, and John P. Masly Department of Biology, University of Rochester, Rochester, NY 427, U.S.. 2 Department of Medical
More informationMolecular Evolution, course # Final Exam, May 3, 2006
Molecular Evolution, course #27615 Final Exam, May 3, 2006 This exam includes a total of 12 problems on 7 pages (including this cover page). The maximum number of points obtainable is 150, and at least
More informationBuilding trees of algae: some advances in phylogenetic and evolutionary analysis
Eur. J. Phycol., (2008), 43(3): 229 252 Building trees of algae: some advances in phylogenetic and evolutionary analysis HEROEN VERBRUGGEN 1 AND EDWARD C. THERIOT 2 1 Phycology Research Group and Centre
More informationPhylogenetic Assumptions
Substitution Models and the Phylogenetic Assumptions Vivek Jayaswal Lars S. Jermiin COMMONWEALTH OF AUSTRALIA Copyright htregulation WARNING This material has been reproduced and communicated to you by
More informationThe Importance of Data Partitioning and the Utility of Bayes Factors in Bayesian Phylogenetics
Syst. Biol. 56(4):643 655, 2007 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150701546249 The Importance of Data Partitioning and the Utility
More informationBayesian Phylogenetics
Bayesian Phylogenetics Paul O. Lewis Department of Ecology & Evolutionary Biology University of Connecticut Woods Hole Molecular Evolution Workshop, July 27, 2006 2006 Paul O. Lewis Bayesian Phylogenetics
More informationThe Causes and Consequences of Variation in. Evolutionary Processes Acting on DNA Sequences
The Causes and Consequences of Variation in Evolutionary Processes Acting on DNA Sequences This dissertation is submitted for the degree of Doctor of Philosophy at the University of Cambridge Lee Nathan
More informationLetter to the Editor. The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1
Letter to the Editor The Effect of Taxonomic Sampling on Accuracy of Phylogeny Estimation: Test Case of a Known Phylogeny Steven Poe 1 Department of Zoology and Texas Memorial Museum, University of Texas
More informationAccuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution
Accuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution Maria Anisimova, Joseph P. Bielawski, and Ziheng Yang Department of Biology, Galton Laboratory, University College
More informationParameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1
Parameter Estimation William H. Jefferys University of Texas at Austin bill@bayesrules.net Parameter Estimation 7/26/05 1 Elements of Inference Inference problems contain two indispensable elements: Data
More informationAmira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut
Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological
More informationInferring Speciation Times under an Episodic Molecular Clock
Syst. Biol. 56(3):453 466, 2007 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150701420643 Inferring Speciation Times under an Episodic Molecular
More informationBayesian Selection of Continuous-Time Markov Chain Evolutionary Models
Bayesian Selection of Continuous-Time Markov Chain Evolutionary Models Marc A. Suchard,* Robert E. Weiss, and Janet S. Sinsheimer* *Department of Biomathematics, UCLA School of Medicine; Department of
More informationChapter 7: Models of discrete character evolution
Chapter 7: Models of discrete character evolution pdf version R markdown to recreate analyses Biological motivation: Limblessness as a discrete trait Squamates, the clade that includes all living species
More informationST440/540: Applied Bayesian Statistics. (9) Model selection and goodness-of-fit checks
(9) Model selection and goodness-of-fit checks Objectives In this module we will study methods for model comparisons and checking for model adequacy For model comparisons there are a finite number of candidate
More informationPhylogenetic Networks, Trees, and Clusters
Phylogenetic Networks, Trees, and Clusters Luay Nakhleh 1 and Li-San Wang 2 1 Department of Computer Science Rice University Houston, TX 77005, USA nakhleh@cs.rice.edu 2 Department of Biology University
More informationDr. Amira A. AL-Hosary
Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological
More informationMarkov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree
Markov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree Nicolas Salamin Department of Ecology and Evolution University of Lausanne
More informationInferring phylogeny. Today s topics. Milestones of molecular evolution studies Contributions to molecular evolution
Today s topics Inferring phylogeny Introduction! Distance methods! Parsimony method!"#$%&'(!)* +,-.'/01!23454(6!7!2845*0&4'9#6!:&454(6 ;?@AB=C?DEF Overview of phylogenetic inferences Methodology Methods
More informationBranch-Length Prior Influences Bayesian Posterior Probability of Phylogeny
Syst. Biol. 54(3):455 470, 2005 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150590945313 Branch-Length Prior Influences Bayesian Posterior Probability
More informationLecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) p.1/26
Lecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 27. Phylogeny methods, part 4 (Models of DNA and
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationLie Markov models. Jeremy Sumner. School of Physical Sciences University of Tasmania, Australia
Lie Markov models Jeremy Sumner School of Physical Sciences University of Tasmania, Australia Stochastic Modelling Meets Phylogenetics, UTAS, November 2015 Jeremy Sumner Lie Markov models 1 / 23 The theory
More informationC3020 Molecular Evolution. Exercises #3: Phylogenetics
C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from
More informationPHYLOGENY ESTIMATION AND HYPOTHESIS TESTING USING MAXIMUM LIKELIHOOD
Annu. Rev. Ecol. Syst. 1997. 28:437 66 Copyright c 1997 by Annual Reviews Inc. All rights reserved PHYLOGENY ESTIMATION AND HYPOTHESIS TESTING USING MAXIMUM LIKELIHOOD John P. Huelsenbeck Department of
More informationQuestions we can ask. Recall. Accuracy and Precision. Systematics - Bio 615. Outline
Outline 1. Mechanistic comparison with Parsimony - branch lengths & parameters 2. Performance comparison with Parsimony - Desirable attributes of a method - The Felsenstein and Farris zones - Heterotachous
More informationLecture 24. Phylogeny methods, part 4 (Models of DNA and protein change) p.1/22
Lecture 24. Phylogeny methods, part 4 (Models of DNA and protein change) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 24. Phylogeny methods, part 4 (Models of DNA and
More informationCopyright. Jeremy Matthew Brown
Copyright by Jeremy Matthew Brown 2009 The Dissertation Committee for Jeremy Matthew Brown certifies that this is the approved version of the following dissertation: Improving the Accuracy and Realism
More informationOn the Uniqueness of the Selection Criterion in Neighbor-Joining
Journal of Classification 22:3-15 (2005) DOI: 10.1007/s00357-005-0003-x On the Uniqueness of the Selection Criterion in Neighbor-Joining David Bryant McGill University, Montreal Abstract: The Neighbor-Joining
More informationCromwell's principle idealized under the theory of large deviations
Cromwell's principle idealized under the theory of large deviations Seminar, Statistics and Probability Research Group, University of Ottawa Ottawa, Ontario April 27, 2018 David Bickel University of Ottawa
More informationBiology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Jan 27 & 29):
Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Jan 27 & 29): Statistical estimation of models of sequence evolution Phylogenetic inference using maximum likelihood:
More informationJML: testing hybridization from species trees
Molecular Ecology Resources (2012) 12, 179 184 doi: 10.1111/j.1755-0998.2011.03065.x JML: testing hybridization from species trees SIMON JOLY Institut de recherche en biologie végétale, Université de Montréal
More informationPhylogenetics: Bayesian Phylogenetic Analysis. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Bayesian Phylogenetic Analysis COMP 571 - Spring 2015 Luay Nakhleh, Rice University Bayes Rule P(X = x Y = y) = P(X = x, Y = y) P(Y = y) = P(X = x)p(y = y X = x) P x P(X = x 0 )P(Y = y X
More informationLecture 6: Model Checking and Selection
Lecture 6: Model Checking and Selection Melih Kandemir melih.kandemir@iwr.uni-heidelberg.de May 27, 2014 Model selection We often have multiple modeling choices that are equally sensible: M 1,, M T. Which
More informationBayesian Phylogenetics:
Bayesian Phylogenetics: an introduction Marc A. Suchard msuchard@ucla.edu UCLA Who is this man? How sure are you? The one true tree? Methods we ve learned so far try to find a single tree that best describes
More informationMaximum Likelihood Until recently the newest method. Popularized by Joseph Felsenstein, Seattle, Washington.
Maximum Likelihood This presentation is based almost entirely on Peter G. Fosters - "The Idiot s Guide to the Zen of Likelihood in a Nutshell in Seven Days for Dummies, Unleashed. http://www.bioinf.org/molsys/data/idiots.pdf
More informationESTIMATING DIVERGENCE TIMES FROM MOLECULAR DATA ON PHYLOGENETIC
Annu. Rev. Ecol. Syst. 2002. 33:707 40 doi: 10.1146/annurev.ecolsys.33.010802.150500 Copyright c 2002 by Annual Reviews. All rights reserved ESTIMATING DIVERGENCE TIMES FROM MOLECULAR DATA ON PHYLOGENETIC
More informationSubstitution = Mutation followed. by Fixation. Common Ancestor ACGATC 1:A G 2:C A GAGATC 3:G A 6:C T 5:T C 4:A C GAAATT 1:G A
GAGATC 3:G A 6:C T Common Ancestor ACGATC 1:A G 2:C A Substitution = Mutation followed 5:T C by Fixation GAAATT 4:A C 1:G A AAAATT GAAATT GAGCTC ACGACC Chimp Human Gorilla Gibbon AAAATT GAAATT GAGCTC ACGACC
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istance-based methods Ultrametric Additive: UPGMA Transformed istance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood
More informationPhylogeny. November 7, 2017
Phylogeny November 7, 2017 Phylogenetics Phylon = tribe/race, genetikos = relative to birth Phylogenetics: study of evolutionary relationships among organisms, sequences, or anything in between Related
More informationModel Comparison. Course on Bayesian Inference, WTCN, UCL, February Model Comparison. Bayes rule for models. Linear Models. AIC and BIC.
Course on Bayesian Inference, WTCN, UCL, February 2013 A prior distribution over model space p(m) (or hypothesis space ) can be updated to a posterior distribution after observing data y. This is implemented
More informationA note on Reversible Jump Markov Chain Monte Carlo
A note on Reversible Jump Markov Chain Monte Carlo Hedibert Freitas Lopes Graduate School of Business The University of Chicago 5807 South Woodlawn Avenue Chicago, Illinois 60637 February, 1st 2006 1 Introduction
More informationPHYLOGENY ESTIMATION: TRADITIONAL AND BAYESIAN APPROACHES
PHYLOGENY ESTIMATION: TRADITIONAL AND BAYESIAN APPROACHES Mark Holder and Paul O. Lewis The construction of evolutionary trees is now a standard part of exploratory sequence analysis. Bayesian methods
More informationBiol 206/306 Advanced Biostatistics Lab 12 Bayesian Inference
Biol 206/306 Advanced Biostatistics Lab 12 Bayesian Inference By Philip J. Bergmann 0. Laboratory Objectives 1. Learn what Bayes Theorem and Bayesian Inference are 2. Reinforce the properties of Bayesian
More informationBayesian Inference. Anders Gorm Pedersen. Molecular Evolution Group Center for Biological Sequence Analysis Technical University of Denmark (DTU)
Bayesian Inference Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical University of Denmark (DTU) Background: Conditional probability A P (B A) = A,B P (A,
More informationPhylogenetic Tree Reconstruction
I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven
More informationIntegrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley
Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler Feb. 14, 2018. Phylogenetic trees VI: Dating in the 21st century: clocks, & calibrations;
More informationPAML 4: Phylogenetic Analysis by Maximum Likelihood
PAML 4: Phylogenetic Analysis by Maximum Likelihood Ziheng Yang* *Department of Biology, Galton Laboratory, University College London, London, United Kingdom PAML, currently in version 4, is a package
More informationKei Takahashi and Masatoshi Nei
Efficiencies of Fast Algorithms of Phylogenetic Inference Under the Criteria of Maximum Parsimony, Minimum Evolution, and Maximum Likelihood When a Large Number of Sequences Are Used Kei Takahashi and
More information