Intraspecific gene genealogies: trees grafting into networks

Similar documents
Dr. Amira A. AL-Hosary

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Constructing Evolutionary/Phylogenetic Trees

C3020 Molecular Evolution. Exercises #3: Phylogenetics

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

Constructing Evolutionary/Phylogenetic Trees

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Big Idea #1: The process of evolution drives the diversity and unity of life

"PRINCIPLES OF PHYLOGENETICS: ECOLOGY AND EVOLUTION" Integrative Biology 200B Spring 2011 University of California, Berkeley

Molecular Phylogenetics (part 1 of 2) Computational Biology Course João André Carriço

What is Phylogenetics

D. Incorrect! That is what a phylogenetic tree intends to depict.

Estimating Evolutionary Trees. Phylogenetic Methods

Integrative Biology 200A "PRINCIPLES OF PHYLOGENETICS" Spring 2012 University of California, Berkeley

Processes of Evolution

Theory of Evolution Charles Darwin

Lecture 11 Friday, October 21, 2011

Introduction to characters and parsimony analysis

BINF6201/8201. Molecular phylogenetic methods

Many of the slides that I ll use have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Anatomy of a tree. clade is group of organisms with a shared ancestor. a monophyletic group shares a single common ancestor = tapirs-rhinos-horses

A (short) introduction to phylogenetics

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Cladistics and Bioinformatics Questions 2013

Reconstructing the history of lineages

8/23/2014. Phylogeny and the Tree of Life

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

ESS 345 Ichthyology. Systematic Ichthyology Part II Not in Book

A Phylogenetic Network Construction due to Constrained Recombination

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Classification and Phylogeny

How should we organize the diversity of animal life?

Evolution. Species Changing over time

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Biology 211 (2) Week 1 KEY!

Unit 9: Evolution Guided Reading Questions (80 pts total)

Classification and Phylogeny

Phylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships


Phylogenetic inference

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Chapter 22: Descent with Modification 1. BRIEFLY summarize the main points that Darwin made in The Origin of Species.

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Beyond Galled Trees Decomposition and Computation of Galled Networks

Reproduction- passing genetic information to the next generation

6 Introduction to Population Genetics

Chapter 17: Population Genetics and Speciation

Copyright (c) 2008 Daniel Huson. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation

Theory of Evolution. Charles Darwin

Phylogenetic Networks, Trees, and Clusters

Phylogenetic Analysis

Phylogenetic analysis. Characters

C.DARWIN ( )

Lecture 6 Phylogenetic Inference

Unit 7: Evolution Guided Reading Questions (80 pts total)

6 Introduction to Population Genetics

Properties of normal phylogenetic networks

Phylogenetics. BIOL 7711 Computational Bioscience

I. Short Answer Questions DO ALL QUESTIONS

Outline. Classification of Living Things

Michael Yaffe Lecture #5 (((A,B)C)D) Database Searching & Molecular Phylogenetics A B C D B C D

PHYLOGENY & THE TREE OF LIFE

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Phylogenetics: Parsimony

Consensus Methods. * You are only responsible for the first two

Chapter 27: Evolutionary Genetics

Phylogeny: building the tree of life

Evolution. Species Changing over time

STEM-hy: Species Tree Estimation using Maximum likelihood (with hybridization)

Bioinformatics 1 -- lecture 9. Phylogenetic trees Distance-based tree building Parsimony


Algorithms in Bioinformatics

Macroevolution Part I: Phylogenies

Chapter 16. Table of Contents. Section 1 Genetic Equilibrium. Section 2 Disruption of Genetic Equilibrium. Section 3 Formation of Species

Phylogenetic analyses. Kirsi Kostamo

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

JML: testing hybridization from species trees

Phylogenetic Analysis

Phylogenetic Analysis

Neutral behavior of shared polymorphism

How to read and make phylogenetic trees Zuzana Starostová

1/27/2010. Systematics and Phylogenetics of the. An Introduction. Taxonomy and Systematics

The Origin of Species

Phylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.

Curriculum Map. Biology, Quarter 1 Big Ideas: From Molecules to Organisms: Structures and Processes (BIO1.LS1)

Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

(Stevens 1991) 1. morphological characters should be assumed to be quantitative unless demonstrated otherwise

Consensus methods. Strict consensus methods

Evolutionary Tree Analysis. Overview

Formative/Summative Assessments (Tests, Quizzes, reflective writing, Journals, Presentations)

Is the equal branch length model a parsimony model?

--Therefore, congruence among all postulated homologies provides a test of any single character in question [the central epistemological advance].

Anatomy of a species tree

BIOLOGY 432 Midterm I - 30 April PART I. Multiple choice questions (3 points each, 42 points total). Single best answer.

Phylogenetic methods in molecular systematics

Transcription:

Intraspecific gene genealogies: trees grafting into networks by David Posada & Keith A. Crandall Kessy Abarenkov Tartu, 2004

Article describes: Population genetics principles Intraspecific genetic variation why networks? Available methods and software

Population genetics principles Coalescent event the time inverse of a DNA replication event; that is, the event leading to the common ancestor of two sequences looking back in time Haplotype space the collection of points representing the possible different haplotypes. The dimension of this space is the number of characters (L) Homoplasy a similarity that is not a result of common history. It is caused by parallel, convergent or reverse mutations Tokogeny nonhierarchical genetic relationships among individuals. Arising by sexual reproduction

Problems with interspecific methods at the intraspecific level Interspecific relationships are hierarchical reproductive isolation, population fission over longer timescales mutation + population divergence = fixation of different alleles nonoverlapping gene pools intraspecific are not result of sexual reproduction, smaller numbers of relatively recent mutations, frequently recombination

Problems with interspecific methods at the intraspecific level More traditional methods (ML, MP, ME) cannot take account of the fact that, at the populational level, several phenomena violate some of them assumptions low divergence -> fewer characters for phylogenetic analysis extant ancestral nodes (ancestral haplotypes are expected to persist in the population and to be sampled together with their descendants) multifurcations (single ancestral haplotype will often give rise to multiple descendant haplotypes, yielding a haplotype tree with true multifications) reticulation caused by recombination between genes, hybridization between lineages, and homoplasy large sample sizes

Solution: network methods Networks can account effectively for processes acting at the species level allow for persistent ancestral nodes, multifurcations and reticulations provide a way of representing more of the phylogenetic information present in a data set (loops -> recombination, reverse or parallel mutations) Can and have been used for: detecting recombination delimiting species inferring models of speciation partitioning population history and structure studying genotype and phenotype associations

Network algorithms (a) UPGAM (b) Maximum parsimony (c) Pyramid (d) Statistical geometry (e) Split decomposition (f) Minimum spanning network (g) Median-joining network (h) Statistical parsimony (i) Reticulogram

Pyramids An extension of the hierarchical clustering framework Represent a set of clades that can overlap Can be used to represent reticulate events (among terminal nodes that are sister taxa) http://bioweb.pasteur.fr/seqanal/interfaces/pyramids.html outgroup TAA1815aus TAA1597est KHL8459swe RS08097fin TAA1815est HK16835fin

Statistical geometry haplotypes are considered as geometric configurations in the haplotype space does not offer an estimate of the sequence genealogy incorporates a model of nucleotide substitution through the estimation of haplotype distances and allows a reliable assessment of the derived conclusions Geometry Statgeom

Split decomposition data set is partitioned into set of sequences or 'splits' network is built by taking these splits and combining them successively when splits are incompatible, a loop is introduced to indicate that there are alternative splits fast, nucleotide or protein data, allows for the inclusion of models of nucleotide substitution or amino acid replacement, bootstrap evaluation SplitsTree web interface - http://bibiserv.techfak.unibielefeld.de/splits/

Statistical parsimony begins by estimating the maximum number of differences among haplotypes as a result of single substitutions with a 95% statistical confidence after that, haplotypes differing by one change are connected, then those that differing by two, by three and so on, until all the haplotypes are included in a single network or the parsimony connection limit is reached the statistical parsimony method emphasizes what is shared among haplotypes that differ minimally rather than differences among the haplotypes, and provides an empirical assessment of deviations from parsimony TCS - http://darwin.uvigo.es/software/tcs.html

Reticulogram addition of reticulations to a bifurcating tree the minimum number of reticulations required to maximize the fit of the network to the data is calculated T-Rex - http://www.info.uqam.ca/~makarenv/trex.html

Other methods Median networks (no program) Median-joining networks (not suitable at the population level, requires the absence of recombination) Molecular-variance parsimony (uses sampled haplotype frequencies and geographic subdivisions to present the solution in the form of a set of (near) optimal networks, http://lgb.unige.ch/arlequin/) Netting (no program) Likelihood network (open-source Java library, > 120 modules, http://www.pal-project.org/)

Instraspecific gene genealogies: trees grafting into networks, David Posada and Keith A. Crandall, TRENDS in Ecology & Evolution Vol. 16 No. 1 January 2001