Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Similar documents
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2011 University of California, Berkeley

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Phenotypic Evolution. and phylogenetic comparative methods. G562 Geometric Morphometrics. Department of Geological Sciences Indiana University

Discrete & continuous characters: The threshold model

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200 Spring 2018 University of California, Berkeley

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Phylogenetic analyses. Kirsi Kostamo

Optimum selection and OU processes

Algorithms in Bioinformatics

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Statistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution?

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

How can molecular phylogenies illuminate morphological evolution?

Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley

Bayesian Inference for Normal Mean

Pinvar approach. Remarks: invariable sites (evolve at relative rate 0) variable sites (evolves at relative rate r)

Appendix from L. J. Revell, On the Analysis of Evolutionary Change along Single Branches in a Phylogeny

Algorithmic Methods Well-defined methodology Tree reconstruction those that are well-defined enough to be carried out by a computer. Felsenstein 2004,

Testing quantitative genetic hypotheses about the evolutionary rate matrix for continuous characters

A Bayesian Approach to Phylogenetics

Quantitative evolution of morphology

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Many of the slides that I ll use have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Contrasts for a within-species comparative method

Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution

Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles

Concepts and Methods in Molecular Divergence Time Estimation

One-Way Analysis of Variance (ANOVA) Paul K. Strode, Ph.D.

Base Range A 0.00 to 0.25 C 0.25 to 0.50 G 0.50 to 0.75 T 0.75 to 1.00

Thanks to Paul Lewis and Joe Felsenstein for the use of slides

User s Manual for. Continuous. (copyright M. Pagel) Mark Pagel School of Animal and Microbial Sciences University of Reading Reading RG6 6AJ UK

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

Chapter 12 - Lecture 2 Inferences about regression coefficient

Biol 206/306 Advanced Biostatistics Lab 11 Models of Trait Evolution Fall 2016

MA : Introductory Probability

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Fig. 26.7a. Biodiversity. 1. Course Outline Outcomes Instructors Text Grading. 2. Course Syllabus. Fig. 26.7b Table

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

1. Can we use the CFN model for morphological traits?

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements.

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

Studying the effect of species dominance on diversity patterns using Hill numbers-based indices

Phylogeny, trees and morphospace

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

How should we organize the diversity of animal life?

7. Tests for selection

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

Bootstrap confidence levels for phylogenetic trees B. Efron, E. Halloran, and S. Holmes, 1996

Quantitative trait evolution with mutations of large effect

Week 8: Testing trees, Bootstraps, jackknifes, gene frequencies

Package OUwie. August 29, 2013

Comparative Methods for Studying Primate Adaptation and Allometry

Statistics Introduction to Probability

AP Biology. Cladistics

Constructing Evolutionary/Phylogenetic Trees

Power of the Concentrated Changes Test for Correlated Evolution

Inference and Regression

Reconstructing the history of lineages

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Improving divergence time estimation in phylogenetics: more taxa vs. longer sequences

Quantitative Genetics

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

Evolutionary Tree Analysis. Overview

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

Chapter 26 Phylogeny and the Tree of Life

Data files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav

Calculation of IBD probabilities

Evolutionary Physiology

DNA-based species delimitation

8/23/2014. Phylogeny and the Tree of Life

Power Analysis. Introduction to Power

Modeling Phylogenetic Comparative Methods with Hybridization

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Lecture 6 Phylogenetic Inference

Topic 22 Analysis of Variance

STT 843 Key to Homework 1 Spring 2018

Preliminary Statistics. Lecture 3: Probability Models and Distributions

The required region is shown in the diagram above. The

Bivariate Paired Numerical Data

Chapter 19: Logistic regression

Macroevolution Part I: Phylogenies

One-minute responses. Nice class{no complaints. Your explanations of ML were very clear. The phylogenetics portion made more sense to me today.

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Rapid evolution of the cerebellum in humans and other great apes

Chapter 7: Models of discrete character evolution

Session 5: Phylogenomics

Statistical Distribution Assumptions of General Linear Models

Evaluating phylogenetic hypotheses

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Statistics for the LHC Lecture 1: Introduction

Rejection regions for the bivariate case

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Two-Variable Regression Model: The Problem of Estimation

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Biology 211 (2) Week 1 KEY!

Transcription:

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis

Correlation of states in a discrete-state model character 1: character 2: species states change in character 1 branch changes #1 #2 change in character 2 #1 #2 Y N 0 6 Y 1 0 4 0 N 0 18 Week 8: Paired sites tests, gene frequencies, continuous characters p.35/44

Example from pp. 429-430 of the MacClade 4 manual. Character 1 Q: Does character 2 tend to change from yellow blue only when character 1 is in the blue state? Character 2 2007 by Paul O. Lewis 2

Concentrated Changes Test More precisely: how often would three yellow blue changes occur in the blue areas of the cladogram on the left if these 3 changes were thrown down at random on branches of the tree? Answer: 12.0671% of the time. Thus, we cannot reject the null hypothesis that the observed coincident changes in the two characters were simply the result of chance. 2007 by Paul O. Lewis 3

Data for Two Characters, X and Y A B X 27 33 Y 122 124 Var(X) Var(Y) Cov(XY) Correlation = = = = 42.000 6.667-10.000-0.5976 C D 18 22 126 128 The negative correlation is fairly strong, but would it weaken if it were recognized that there are not really 4 independent data points here... 2007 by Paul O. Lewis 5

Y 122 123 124 125 126 127 128 20 25 30 X

Fig. 6 from Felsenstein (1985) 2007 by Paul O. Lewis 3

Fig. 7 from Felsenstein (1985) 2007 by Paul O. Lewis 4

Fig. 5 from Felsenstein (1985) Felsenstein, J. 1985. Phylogenies and the comparative method. American Naturalist 125:1-15. 2007 by Paul O. Lewis 2

An outcome of Brownian motion on a 5-species tree Week 8: Paired sites tests, gene frequencies, continuous characters p.11/44

An outcome of Brownian motion on a 5-species tree Week 8: Paired sites tests, gene frequencies, continuous characters p.12/44

An outcome of Brownian motion on a 5-species tree Week 8: Paired sites tests, gene frequencies, continuous characters p.13/44

An outcome of Brownian motion on a 5-species tree Week 8: Paired sites tests, gene frequencies, continuous characters p.14/44

Likelihood under Brownian motion with two species f ( x; µ, σ 2) = ) 1 ( σ 2π exp (x µ)2 2σ 2 L = p i=1 ( 1 (2π)σi 2 exp 1 v1 v 2 2σi 2 [ (x 1i x 0i ) 2 + (x 2i x 0i ) 2 v 1 v 2 ]) Week 8: Paired sites tests, gene frequencies, continuous characters p.15/44

Brownian motion along a tree x 1 x x x 1 8 2 v x x 1 2 8 v 2 x 8 x x 8 9 v 8 x 3 x x 1 3 9 8 v 3 x 9 x x 9 0 v 9 x 6 x x 5 7 x x 6 10 x x v 7 10 x x x 6 v7 4 5 11 x 10 v x x 5 4 12 v x 4 11 x x 11 12 v x 11 12 x x 12 0 v 12 x x 10 11 v 10 x 0 Week 8: Paired sites tests, gene frequencies, continuous characters p.8/44

Covariances of species on the tree v 1 + v 8 + v 9 v 8 + v 9 v 9 0 0 0 0 v 8 + v 9 v 2 + v 8 + v 9 v 9 0 0 0 0 v 9 v 9 v 3 + v 9 0 0 0 0 0 0 0 v 4 + v 12 v 12 v 12 v 12 0 0 0 v 12 v 5 + v 11 + v 12 v 11 + v 12 v 11 + v 12 0 0 0 v 12 v 11 + v 12 v 6 + v 10 + v 11 + v 12 v 10 + v 11 + v 12 0 0 0 v 12 v 11 + v 12 v 10 + v 11 + v 12 v 7 + v 10 + v 11 + v 12 Week 8: Paired sites tests, gene frequencies, continuous characters p.9/44

Covariances are of form a b c 0 0 0 0 b d c 0 0 0 0 c c e 0 0 0 0 0 0 0 f g g g 0 0 0 g h i i 0 0 0 g i j k 0 0 0 g i k l Week 8: Paired sites tests, gene frequencies, continuous characters p.10/44

5.0 4.0 5.2 6.0 4.0 5.0 3.0 4.0 6.0 5.0 3.0

Brownian Motion Model X i Model predicts that the uncertainty associated with the difference in trait values (i.e. D ij = X i -X j ) grows linearly with time. That is, Var(D ij ) = (t i +t j ) σ 2 So the longer you wait, the less you can t i predict about t j D ij X j X k 2007 by Paul O. Lewis 6

Brownian Motion Model X i X j t i t j Ancestral trait value estimated to be weighted average of descendants' values. Weights are inverses of branch lengths (i.e. times) X k = X i /t i + X j /t j 1/t i + 1/t j 2007 by Paul O. Lewis 7

Brownian Motion Model X i t i X k t j X j X k is not observed, and thus has some extra uncertainty associated with its value. This extra uncertainty can be modeled by adding an extra bit of length (t k ') to the branch subtending X k t k + t k ' The amount of extra uncertainty that should be added is: t k ' = t i t j / (t i + t j ) 2007 by Paul O. Lewis 8

Data for Two Characters on Tree X Y 27 33 18 22 122 124 126 128 A B C D 2 2 2 2 E F 3.5 3.5 2007 by Paul O. Lewis 9

Contrasts (left minus right) X Y 27-6 33 18-4 22 122-2 124 126-2 128 2 2 2 2 30 123 3.5 10-4 3.5 20 127 2007 by Paul O. Lewis 10

Scaling Contrasts A-B and C-D contrasts are expected to be on the same scale because the path length associated with both of these is 4 E-F has a path length of 7, which means this contrast is expected to be larger Adding in the extra uncertainty associated with estimating E and F, this path length expands from 7 to 9 Can put all 3 contrasts on same scale by dividing by standard deviation (square root of variance) 2007 by Paul O. Lewis 11

Rescaled Contrasts Asterisks indicate that these original trait values have been scaled by dividing by the square root of the path length. X Y Variance proportional to X* Y* A-B -6-2 4-6/2 = -3-2/2 = -1 C-D -4-2 4-4/2 = -2-2/2 = -1 E-F 10-4 9 10/3 = 3.33-4/3 = -1.33 These path lengths are proportional to the variance, and thus they are all that are needed to place the 3 contrasts for a given trait on the same scale. 2007 by Paul O. Lewis 12

Y contrast 2 1 0 1 2 0 1 2 3 4 X contrast

Correlation of Contrasts A-B C-D X -3-2 Y -1-1 Var(X) Var(Y) Cov(XY) Correlation = = = = 8.0370 1.2593 0.1852 0.05821 E-F 3.33-1.33 The CONTRAST program in Joe Felsenstein's PHYLIP package performs independent contrasts Correlation of the raw X and Y trait values was -0.5976, which is both stronger and of opposite sign. Note that the sample size is now 3 rather than 4. 2007 by Paul O. Lewis 13