The African coelacanth genome provides insights into tetrapod evolution

Similar documents
Classification and Phylogeny

Biologists have used many approaches to estimating the evolutionary history of organisms and using that history to construct classifications.

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Classification and Phylogeny

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

From DNA to Diversity

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

18.4 Embryonic development involves cell division, cell differentiation, and morphogenesis

Evidences of Evolution. Read Section 8.2 on pp of your textbook

Where Do Bat Wings Come From?

12-5 Gene Regulation

3/8/ Complex adaptations. 2. often a novel trait

5/4/05 Biol 473 lecture

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Homeotic Genes and Body Patterns

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Chapter 18 Lecture. Concepts of Genetics. Tenth Edition. Developmental Genetics

Modern Evolutionary Classification. Section 18-2 pgs

1. What are the three general areas of the developing vertebrate limb? 2. What embryonic regions contribute to the developing limb bud?

Genomes and Their Evolution

Gene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

AP Biology Notes Outline Enduring Understanding 1.B. Big Idea 1: The process of evolution drives the diversity and unity of life.

Analysis of the African coelacanth genome sheds light on tetrapod evolution

Classification, Phylogeny yand Evolutionary History

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

150 years after the Origin. Does Evolution Make Big Changes? Progressive Creationism. Idea goes back at least to And was discussed in 1866

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E

Ensembl focuses on metazoan (animal) genomes. The genomes currently available at the Ensembl site are:

Chapter 19: Taxonomy, Systematics, and Phylogeny

The gain, loss, and modification of gene

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

Evidence for Evolution

How should we organize the diversity of animal life?

Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab

Exam 1 ID#: October 4, 2007

Cell-Cell Communication in Development

Patterns of Evolution

Supplemental Figure 1.

Evidence for Evolution by Natural Selection Regents Biology

Classifications can be based on groupings g within a phylogeny

Chapter 16: Reconstructing and Using Phylogenies

Bi 117 Final (60 pts) DUE by 11:00 am on March 15, 2012 Box by Beckman Institute B9 or to a TA

The Evidence for Evolution Chapter 21. Evidence of Natural Selection. Evidence of Natural Selection 4/4/14

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Phylogenetic analysis. Characters

Adaptation, Evolution & development

Biology 20 Chapter 5 Lesson 2 Evidence for Evolution. Today s species that exist have evolved from ancestral ones.

AP Biology. Evolution is "so overwhelmingly established that it has become irrational to call it a theory." Evidence of Evolution by Natural Selection

Evidence of Evolution

Macroevolution Part I: Phylogenies

Developmental Biology Lecture Outlines

Big Idea 1: The process of evolution drives the diversity and unity of life.

Genetic transcription and regulation

Classification. copyright cmassengale

AP Curriculum Framework with Learning Objectives

TE content correlates positively with genome size

Grade 11 Biology SBI3U 12

A bioinformatics approach to the structural and functional analysis of the glycogen phosphorylase protein family

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Supplementary Information

Organizing Life s Diversity

In a way, organisms determine who belongs to their species by choosing with whom they will! MODERN EVOLUTIONARY CLASSIFICATION 18-2 MATE

Stnd: 8 e C-Notes: Evidence of Evolution

Developmental processes Differential gene expression Introduction to determination The model organisms used to study developmental processes

Bio94 Discussion Activity week 3: Chapter 27 Phylogenies and the History of Life

Evolution of the Skeleton

The Evidence for Evolution. Chapter 21

Bio 1B Lecture Outline (please print and bring along) Fall, 2008

Bio1B Evolution 12 Last lecture: Fossil record

Inferring phylogeny. Constructing phylogenetic trees. Tõnu Margus. Bioinformatics MTAT

Chapter 7- Changes Over Time. Section 2 Evidence of Evolution

BIS &003 Answers to Assigned Problems May 23, Week /18.6 How would you distinguish between an enhancer and a promoter?

Evidences of Evolution (Clues)

Map of AP-Aligned Bio-Rad Kits with Learning Objectives

Evolutionary Developmental Biology

Chapter 26 Phylogeny and the Tree of Life

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution.

8/23/2014. Phylogeny and the Tree of Life

Evidence of Evolution by Natural Selection. Evidence supporting evolution. Fossil record. Fossil record. Anatomical record.

Evolution Common Assessment 1

Reconstructing the history of lineages

Lecture 6 Phylogenetic Inference

A A A A B B1

Why Flies? stages of embryogenesis. The Fly in History

THE EVIDENCE FOR EVOLUTION

The Eukaryotic Genome and Its Expression. The Eukaryotic Genome and Its Expression. A. The Eukaryotic Genome. Lecture Series 11

Phylogeny and the Tree of Life

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

PHYLOGENY WHAT IS EVOLUTION? 1/22/2018. Change must occur in a population via allele

Name: Period: Evidence for Evolution Part I. Introduction

Supplementary Materials for

Supplemental table S7.

Evidence of Evolution by Natural Selection. Dodo bird

Role of Organizer Chages in Late Frog Embryos

6 Evidence from the Fossil Record Th e f o s s i l r e c o r d includes all of the fossils that have existed in the

Transcription:

The African coelacanth genome provides insights into tetrapod evolution bioinformaatika ajakirjaklubi 27.05.2013

Ülesehitus Täisgenoomi sekveneerimisest vankrid mille ette neid andmeid on rakendatud evolutsiooni kiirus "elus fossiil" - kas tõesti? elu siirdumine maismaale jäsemete areng lämmastikust lahtisaamine loote areng Ig-M/Ig-W

Latimeria chalumnae Leiti 1938a India Ookeanist Coelacanth /ˈsiːləkæn /; Order Coelacanthiformes Peeti väljasurnuks 70 millionit aastat tagasi millionit aastat

Genome assembly and annotation karyotype: 48 Chromosomes It was sequenced by Illumina sequencing technology and assembled using the short read genome assembler ALLPATHS-LG The draft assembly is 2.86 gigabases (Gb) in size and is composed of 2.18 Gb of sequence plus gaps between contigs. Contig N50 size 12.7 kb Scaffold N50 size of 924 kb

Genome annotation by Ensemble pipeline Ensembl gene annotation pipeline ( release 66, Feb. 2012) + RNA-seq data generated from muscle 18 Gb of paired-end reads were assembled using Trinity software orthology with other vertebrates 19,033 protein-coding genes containing 21,817 transcripts

Genome annotation by MAKER pipeline used the L. chalumnae Ensembl gene set, Uniprot protein alignments, L. chalumnae (muscle) and L. menadoensis (liver and testis) 15 RNA-seq data to create gene models 29,237 protein-coding gene annotations. 2,894 short non-coding RNAs, 1,214 long non-coding RNAs, > 24,000 conserved RNA secondary structures Supplementary Note 2, Supplementary Tables 3 and 4, Supplementary Data 1 3 and Supplementary Fig. 3

The closest living fish relative of tetrapods The question of which living fish is the closest relative to the fish that first crawled on to land has long captured our imagination: among scientists the odds have been placed on either the lungfish or the coelacanth. Analyses of small to moderate amounts of sequence data (ranging from 1 to 43 genes) has tended to favour the lungfishes. The alternative hypothesis that the lungfish and the coelacanth are equally closely related to the tetrapods could not be rejected. To perform a reliable analysis we selected 251 genes in which a 1:1 orthology ratio was clear The resulting phylogeny, based on 100,583 concatenated amino acid positions

PP 1.0 Jack-knife 100

The slowly evolving coelacanth Earlier work showed that Hox and protocadherins have comparatively slower protein-coding evolution in coelacanth than in other vertebrate lineages Protein-coding gene evolution was examined using the 251 concatenated proteins. Pair-wise distances between taxa were calculated from the branch lengths of the tree using the two-cluster test proposed previously to test for equality of average substitution rates The coelacanth proteins that were tested were significantly more slowly evolving (0.890 subst.per site) than the lungfish (1.05 subst. per site), chicken (1.09 subst. per site) and mammalian (1.21 subst. per site) orthologues. (P < 10-6 in all cases) (Supplementary Data 5)

Transposable elements Theoretically, transposable elements may make their greatest contribution to the evolution of a species by generating templates for exaptation to form novel regulatory elements and exons, and by acting as substrates for genomic rearrangement The coelacanth genome contains a wide variety of transposable-element superfamilies and has a relatively high transposable-element content (25%). Analysis of RNA-seq data and of the divergence of individual transposable-element copies from consensus sequences show that 14 coelacanth transposable-element superfamilies are currently active. Supplementary Note 5 and Supplementary Tables 7 10 Supplementary Note 6, Supplementary Table 10 and Supplementary Fig. 5

The adaptation of vertebrates to land To understand the aspect of the water-to-land transition and how it has been reflected on genome level when vertebrates have lived on land over the 400 Myr. Genes that were present in the last common ancestor of all bony fish (including the coelacanth) but that are missing from tetrapod genomes.

Genes missing from tetrapod genomes More than 50 such genes, including components of fibroblast growth factor (FGF) signalling, TGF-b and bone morphogenic protein (BMP) signalling, and WNT signalling pathways, etc... Functional annotation: fin development (13genes); otolith and ear-development(8 genes); kidney development(7 genes); trunk, somite,and tail development(11 genes); eye(13 genes); brain development (23 genes). (Supplementary Data 7 and Supplementary Fig. 9)

The water-to-land transition This implies that critical characters in the morphological transition from water to land (for example, fin-to-limb transition and remodelling of the ear) are reflected in the loss of specific genes along the phylogenetic branch leading to tetrapods. Homeobox genes, which are responsible for the development of an organism s basic body plan, show only slight differences between Latimeria, ray-finned fish and tetrapods; Supplementary Note 9, Supplementary Tables 12 16 and Sup- plementary Fig. 10

Conserved non-coding elements (CNEs) Conserved non-coding elements (CNEs) are strong candidates for gene regulatory elements. They can act as promoters, enhancers, repressors and insulators, and have been implicated as major facilitators of evolutionary change. To identify CNEs that originated in the most recent common ancestor of tetrapods, they predicted CNEs that evolved in various bony vertebrate (that is, ray-finned fish, coela- canth and tetrapod) lineages and assigned them to their likely branch points of origin.

Conserved non-coding elements (CNEs) 44,200 ancestral tetrapod CNEs that originated after the divergence of the coelacanth lineage was identified. They represent 6% of the 739,597 CNEs that are under constraint in the bony vertebrate lineage. They compared the ancestral tetrapod CNEs to mouse embryo ChIP-seq (chromatin immunoprecipitation followed by sequencing). This resulted in a sevenfold enrichment in the p300 binding sites for our candidate CNEs and confirmed that these CNEs are indeed enriched for gene regulatory elements.

Ancestral tetrapod CNEs and GO annotation of closest genes also, Each tetrapod CNE was assigned to the gene whose transcription start site was closest, and gene-ontology (GO) category enrichment was calculated for those genes. The most enriched categories were involved with smell perception (for example, sensory perception of smell, detection of chemical stimulus and olfactory receptor activity). morphogenesis (radial pattern formation, hind limb morphogenesis, kidney morphogenesis) cell differentiation (endothelial cell fate commitment, epithelial cell fate commitment) immunoglobulin VDJ recombination Supplementary Note 10 and Supplementary Tables 17 24

Limbs development A major innovation of tetrapods is the evolution of limbs characterized by digits. The limb skeleton consists of: a stylopod the zeugopod an autopod There are two major hypotheses about the origins of the autopod; that it was a novel feature of tetrapods, that it has antecedents in the fins of fish

The HOX-D cluster They examined the Hox regulation of limb development in ray-finned fish, coelacanth and tetrapods to address these hypotheses. Among the six cis-regulatory sequences previously identified in this gene desert, three sequences show sequence conservation restricted to tetrapods. One regulatory sequence (island 1) is shared by tetrapods and coelacanth, but not by ray-finned fish. Fig. 2b and Supplementary Fig. 14

Experiment When tested in a transient transgenic assay in mouse, the coelacanth sequence of island 1 was able to drive reporter expression in a limb-specific pattern Figure 2C. An upstream gene desert identifies conserved limb enhancers. Amemiya C.T. et al. Nature 2013

This suggests that island 1 was a lobe-fin developmental enhancer in the fish ancestor of tetrapods that was then coopted into the autopod enhancer of modern tetrapods. In this case, the autopod developmental regulation was derived from an ancestral lobe-finned fish regulatory element.

Changes in the urea cycle Liigsest lämmastikust vabanemine: In aquatic environments >> ammonia on land >> urea or uric acid The widespread and almost exclusive occurrence of urea excretion in amphibians, some turtles and mammals has led to the hypothesis that the use of urea as the main nitrogenous waste product was a key innovation in the vertebrate transition from water to land.

Changes in the urea cycle HYPHY package, which estimates the ratio of synonymous (ds) to non-synonymous (dn) substitutions ( values) among different branches and among different sites (codons) across a multiple-species sequence alignment. The rate-limiting enzyme of the hepatic urea cycle, carbamoyl phosphate synthase I (CPS1), only one branch of the tree shows a strong signature of selection (P = 0.02), namely the branch leading to tetrapods and the branch leading to amniotes.

Conclusions Genome is assembled with a reasonable quality and annotated well for whole genome comparisons with other genomes Slow evolution speed is observable in the case of selected (251) coding genes. However, analyzing other regions do not support the "living fossile" at genome level. The adaptation of vertebrates to land 50 genes were lost in tetrapode lineages Many genes were classified in important developmental categories Changes in the urea cycle Changes in the physiological environment of the developing embryo and fetus The coelacanth lacks immunoglobulin-m (thy have Ig-W)

The adaptation of vertebrates to land More than 50 genes were lost in tetrapode lineages fin development (13genes); otolith and ear development (8 genes); kidney development (7 genes); trunk, somite and tail development (11 genes); eye (13 genes); and brain development (23 genes). Conserved non-coding elements (CNEs) identified 44,200 ancestral tetrapod CNEs that originated after the divergence of the coelacanth lineage. Changes in the urea cycle changes in the physiological environment of the developing embryo and fetus The coelacanth lacks immunoglobulin-m