Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis

Similar documents
Session 5: Phylogenomics

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)

Phylogeny and the Tree of Life

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family

1 ATGGGTCTC 2 ATGAGTCTC

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

Phylogeny and the Tree of Life

Chapter 26: Phylogeny and the Tree of Life

Molecular phylogeny How to infer phylogenetic trees using molecular sequences

8/23/2014. Phylogeny and the Tree of Life

Phylogenetic inference

Phylogeny and the Tree of Life

A bioinformatics approach to the structural and functional analysis of the glycogen phosphorylase protein family

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

Phylogenetics. Applications of phylogenetics. Unrooted networks vs. rooted trees. Outline

Phylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University

Bootstrapping and Tree reliability. Biol4230 Tues, March 13, 2018 Bill Pearson Pinn 6-057

Phylogenetic Tree Reconstruction

Comparative Bioinformatics Midterm II Fall 2004

Dr. Amira A. AL-Hosary

Phylogenetics: Building Phylogenetic Trees

9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)

Phylogeny and the Tree of Life

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Chapter 27: Evolutionary Genetics

Symmetric Tree, ClustalW. Divergence x 0.5 Divergence x 1 Divergence x 2. Alignment length

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Evolution by duplication

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

RELATIONSHIPS BETWEEN GENES/PROTEINS HOMOLOGUES

Constructing Evolutionary/Phylogenetic Trees

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

Introduction to Bioinformatics Introduction to Bioinformatics

AP Biology Notes Outline Enduring Understanding 1.B. Big Idea 1: The process of evolution drives the diversity and unity of life.

What is Phylogenetics

BIOLOGY. Phylogeny and the Tree of Life CAMPBELL. Reece Urry Cain Wasserman Minorsky Jackson

Phylogeny and the Tree of Life

Cladistics and Bioinformatics Questions 2013

Chapter 26 Phylogeny and the Tree of Life

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Phylogeny Fig Overview: Inves8ga8ng the Tree of Life Phylogeny Systema8cs

The History of Life on Earth

BINF6201/8201. Molecular phylogenetic methods

molecular evolution and phylogenetics

Macroevolution Part I: Phylogenies

Biology 211 (2) Week 1 KEY!

Chapter 26 Phylogeny and the Tree of Life

Interpreting the Molecular Tree of Life: What Happened in Early Evolution? Norm Pace MCD Biology University of Colorado-Boulder

Multiple Sequence Alignment. Sequences

Reconstructing the history of lineages

Microbial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Consensus Methods. * You are only responsible for the first two

PHYLOGENY AND SYSTEMATICS

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

C.DARWIN ( )


Phylogenetic Analysis

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

Constructing Evolutionary/Phylogenetic Trees

Chapter 19: Taxonomy, Systematics, and Phylogeny

a-dB. Code assigned:

Phylogeny & Systematics: The Tree of Life

Ch. 26 Phylogeny BIOL 221

PhyQuart-A new algorithm to avoid systematic bias & phylogenetic incongruence

Computational approaches for functional genomics

Phylogeny and Systematics

PGA: A Program for Genome Annotation by Comparative Analysis of. Maximum Likelihood Phylogenies of Genes and Species

Phylogenetic analysis. Characters

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Biological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor

a-fB. Code assigned:

Phylogenetic Analysis

Phylogenetic Analysis

Algorithms in Bioinformatics

BLAST Database Searching. BME 110: CompBio Tools Todd Lowe April 8, 2010

Computational methods for predicting protein-protein interactions

Nature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species.

20 Phylogeny CAMPBELL BIOLOGY IN FOCUS. Urry Cain Wasserman Minorsky Jackson Reece. Lecture Presentations by Kathleen Fitzpatrick and Nicole Tunbridge

Classification and Phylogeny

Origins of Life. Fundamental Properties of Life. Conditions on Early Earth. Evolution of Cells. The Tree of Life

林仲彥. Dec 4,

Using Bioinformatics to Study Evolutionary Relationships Instructions

Name: Class: Date: ID: A

AP Biology Notes Outline Enduring Understanding 1.B. Big Idea 1: The process of evolution drives the diversity and unity of life.

1/27/2010. Systematics and Phylogenetics of the. An Introduction. Taxonomy and Systematics

Classification and Phylogeny

AP Biology. Cladistics

CS5263 Bioinformatics. Guest Lecture Part II Phylogenetics

Phylogeny and Evolution. Gina Cannarozzi ETH Zurich Institute of Computational Science

PHYLOGENY & THE TREE OF LIFE

Many of the slides that I ll use have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Basic Local Alignment Search Tool

Unsupervised Learning in Spectral Genome Analysis

Transcription:

Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis 10 December 2012 - Corrections -

Exercise 1 Non-vertebrate chordates generally possess 2 homologs, vertebrates 3 or more gene copies; a Drosophila homolog serves as outgroup. A first duplication event (POPD1, POPD2/POPD3) occured before the speciation of Ciona and the vertebrate lineage, and probably after the divergence of arthropods The second duplication event happened in vertebrates and gave rise to the subfamilies POPD2 and POPD3. POPD2 and POPD3 are thus more closely related to each other than to subfamily POPD1 Ortholog to POPD1_HUMAN (Q8NE79): chordate POPD1 clade, and Drosophila bves; all other genes are paralog Ortholog to Ciona POPD1: same as for human POPD1 Ortholog to Ciona POPD23: vertebrate families POPD2 and POPD3, and most probably Drosophila bves PhyML. Tree search : NNIs. Initial tree : BIONJ. Model of amino acids substitution : JTT. Number of taxa : 41. Log-likelihood : -7592.66297. Discrete gamma model : Yes - Number of categories : 8 - Gamma shape parameter : 1.764. Proportion of invariant : 0.061 Further phylogenetic trees are available at http://wiki.isb-sib.ch/swisstree/st001_evidence

Exercise 2 Branch support

Exercise 3

Exercise 3 eggnog Was not available when preparing the slides

From last year, might have changed slightly

Exercise 3 HOVERGEN

Exercise 3

Exercise 3

Exercise 3

Exercise 3

Exercise 3

Exercise 3

Exercise 4 Blast result: vertebrate sequences closest to gem-1 seem to be MIRO2.

Further below (same Blast than previous page), there is a 2 nd vertebrate protein group annotated as MIRO1 again with similarity over the full length of the protein. We copy all the data from the species of interest to infer the phylogenies. Please note that there is no MIRO1 gene for X. tropicalis.

Phylogenetic tree reconstructed using the 1-click mode with default parameter settings. The topology of the result tree corresponds to the species tree with one exception: chicken and frog seem to be monophyletic - this is probably not the case. The result tree further suggests that the vertebrate genes MIRO1 and MIRO2 are paralogs and both are co-orthologs of the fungi gene gem-1 and invertebrate gene MIRO.

Tree robustness Bootstrap replicates are created randomly therefore results for the same analysis can slightly differ. It is thus more meaningful to perform an analysis with 1000 replicates.

Exercise 5

Dataset 2 Phylogenetic analysis parameters: Alignment: muscle (default); Data model: Gblocks stringent; Tree-building: PhyML (JTT, gamma(4), shape parameter estimated, invariable sites fixed (0.0)); Test for topology robustness: alrt (SH). The 1st tree includes species from Archaea and Bacteria, which form the 2 major branches of the tree. The 2 nd dataset consists of sequences derived from Eukaryotes. A phylogenetic tree reconstructed with both datasets suggests, that eukaryotic sequences are more closely related to Bacteria than to Archaea. This result is not in agreement with the species tree. An exaplanation could be that Eukaryotes derived this gene from bacteria by horizontal gene transfer (HGT).

Phylogenetic tree of datasets 1, 2 and 3. The sequences of the 3rd dataset consist of eukaryotic homologs. The result suggests that eukaryotic sequences are possibly derived from a single gene transfer event. Sequence annotation indicates that one group of the eukaryotic proteins ( SYAM ) are mitochondrial. A possible scenario: Eukaryotes could have derived a gene copy by endosymbiontic gene transfer (EGT) from the mitochondrion to the nucleus, followed by a gene duplication event. One of the copies subsequently evolved into the cytoplasmic form. Analysis protokol: Alignment: muscle (default); Data model: Gblocks stringent; Tree-building: PhyML (JTT, gamma(4), shape parameter estimated, invariable sites fixed (0.0)); Test for topology robustness: alrt (SH).

Can you root this tree? Exercise 6

Eukaryota Archaea Eukaryota Bacteria Eukaryota

Eukaryota, nuclear Archaea Eukaryota, mitochondrial, nuclear encoded Bacteria Eukaryota, chloroplast,arath: nuclear encoded CYCME: chloropl. encoded

To be improved: Add more sequence data to the dataset Use alignments methods such as probcons or T-coffee Analyze individual branches, e.g. cyanobacteria and chloroplastic data, proteobacteria and mitochondrial-derived sequences (& meaningful outgroups) ML methods or MrBayes Calculate support values for internal nodes Run other tests, which we have not discussed here Can this tree be rooted by an outgroup? No, as no species is known that diverged prior to the divergence of the 2 major branches of life!