An Optimal System for Evolutionary Cell Biology: the genus Paramecium

Similar documents
THE genus Paramecium has been used as a model unicellular

Insights into three whole-genome duplications gleaned from the. *Department of Biology, Indiana University, Bloomington, IN, USA 47405

Genomes and Their Evolution

Molecular evolution - Part 1. Pawan Dhar BII

Chapter 27: Evolutionary Genetics

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

The Origin of Species

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

Mole_Oce Lecture # 24: Introduction to genomics

8/23/2014. Phylogeny and the Tree of Life

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Processes of Evolution

Unit 7: Evolution Guided Reading Questions (80 pts total)

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

Eukaryotic vs. Prokaryotic genes

Impact of recurrent gene duplication on adaptation of plant genomes

UON, CAS, DBSC, General Biology II (BIOL102) Dr. Mustafa. A. Mansi. The Origin of Species

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Large Global Effective Population Sizes in Paramecium

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Processes of Evolution

How much non-coding DNA do eukaryotes require?

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Lecture 11 Friday, October 21, 2011

WHERE DOES THE VARIATION COME FROM IN THE FIRST PLACE?

Intro Gene regulation Synteny The End. Today. Gene regulation Synteny Good bye!

Speciation. Mechanisms of Speciation. Title goes here. Some Key Tenets of the Modern Synthesis

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

Computational approaches for functional genomics

Unit 9: Evolution Guided Reading Questions (80 pts total)

Novel Population Genetics in Ciliates due to Life Cycle and Nuclear Dimorphism

Roadmap. Sexual Selection. Evolution of Multi-Gene Families Gene Duplication Divergence Concerted Evolution Survey of Gene Families

Comparative genomics: Overview & Tools + MUMmer algorithm

Evolution by duplication

Evolutionary Genomics and Proteomics

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION

HORIZONTAL TRANSFER IN EUKARYOTES KIMBERLEY MC GRAIL FERNÁNDEZ GENOMICS

EVOLUTION ALGEBRA Hartl-Clark and Ayala-Kiger

Evaluate evidence provided by data from many scientific disciplines to support biological evolution. [LO 1.9, SP 5.3]

Multiple Choice Review- Eukaryotic Gene Expression

Principles of Genetics

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites

A diploid somatic cell from a rat has a total of 42 chromosomes (2n = 42). As in humans, sex chromosomes determine sex: XX in females and XY in males.

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Origins of Life. Fundamental Properties of Life. Conditions on Early Earth. Evolution of Cells. The Tree of Life

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Biology II. Evolution

Evolutionary Mythology versus Evolutionary Science

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

Speciation. Today s OUTLINE: Mechanisms of Speciation. Mechanisms of Speciation. Geographic Models of speciation. (1) Mechanisms of Speciation

Lecture Notes: BIOL2007 Molecular Evolution

Conceptually, we define species as evolutionary units :

GACE Biology Assessment Test I (026) Curriculum Crosswalk

Name: Class: Date: ID: A

Outline. Classification of Living Things

Multiple routes to subfunctionalization and gene duplicate. specialization

Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.

4/4/2017. Extrinsic Isolating Barriers. 1. Biological species concept: 2. Phylogenetic species concept:

How to detect paleoploidy?

REVIEWS. The evolution of gene duplications: classifying and distinguishing between models

Lecture 20 DNA Repair and Genetic Recombination (Chapter 16 and Chapter 15 Genes X)

TE content correlates positively with genome size

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

Classification and Phylogeny

Heterogeneous Rates of Molecular Evolution Among Cryptic Species of the Ciliate Morphospecies Chilodonella uncinata

(Write your name on every page. One point will be deducted for every page without your name!)

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

"Cell Action in Evolution"

Processes of Evolution

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Chapter 13 Meiosis and Sexual Reproduction

Biology 211 (2) Week 1 KEY!

Chapter 14 The Origin of Species

Classification and Phylogeny

7. Tests for selection

Binary fission occurs in prokaryotes. parent cell. DNA duplicates. cell begins to divide. daughter cells

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

Sperm & Eggs & Variation..OH MY!

Genome-wide analysis of the MYB transcription factor superfamily in soybean

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Computational analyses of ancient polyploidy

Understanding relationship between homologous sequences

Evolutionary change. Evolution and Diversity. Two British naturalists, one revolutionary idea. Darwin observed organisms in many environments

Big Questions. Is polyploidy an evolutionary dead-end? If so, why are all plants the products of multiple polyploidization events?

18.4 Embryonic development involves cell division, cell differentiation, and morphogenesis

Chapter Chemical Uniqueness 1/23/2009. The Uses of Principles. Zoology: the Study of Animal Life. Fig. 1.1

D. Incorrect! That is what a phylogenetic tree intends to depict.

Cell Division: the process of copying and dividing entire cells The cell grows, prepares for division, and then divides to form new daughter cells.

CHAPTER 12 THE CELL CYCLE. Section A: The Key Roles of Cell Division

J. MITCHELL MCGRATH, LESLIE G. HICKOK, and ERAN PICHERSKY

Saturday, August 24, Speciation

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

AP Biology Exam #7 (PRACTICE) Subunit #7: Diversity of Life

Transcription:

An Optimal System for Evolutionary Cell Biology: the genus Paramecium Presence of a transcriptionally silent germline (micronucleus) and an expression-active somatic macronucleus. Geographically ubiquitous, large, free-living cells. Can be maintained as pure lines by autogamy. Direct effects of gene variants can be evaluated via macronuclear transformation. A product of two ancient whole-genome duplication events. Availability of pre-duplication outgroup species. Complete genome sequences for all species. One of the simplest genomic architectures in all eukaryotes.

Paramecium tetraurelia: a Descendant of Three Whole-Genome Duplications ~40,000 protein-coding genes in ~100 linkage groups, all grouping into sets of four syntenic groups (Aury et al., 2006, Nature).

Paramecium aurelia lineage: a cryptic species complex with at least 14 members. Tracy Sonneborn (1905 1981) From Aury et al. (2006)

Key Unresolved Issues: When did the whole-genome duplications in the Paramecium lineage occur? Are all Paramecium species octoploids? Are just the aurelia species or just P. tetraurelia polyploid? Are there phylogenetically independent genome-duplication events? How old are the Paramecium lineages and the underlying duplication events? What is the fundamental gene number in Paramecium? What are the fates of the duplicate genes? Is whole-genome duplication a causal factor in the emergence of the cryptic aurelia species complex?

Ohno s 2R Hypothesis and the Origin of Vertebrate Innovations Susumu Ohno (1928-2000)

Analysis of Genome Stability with a Mutation-accumulation Experiment Starting with a single stem mother, sublines are maintained by single-progeny descent, preventing selection from removing spontaneous mutations. Mutations in the transcriptionally silent germline are invisible to selection. This protocol is continued for thousands of cell divisions.

Although P. tetraurelia Has the Lowest Known Mutation Rate Per Cell Division, Its Rate Per Sexual Episode is Compatible With Other Species Base-substitution Mutation Rate (/10 9 bp/generation) 10 2 Mammals Invertebrates Plant Unicellular euks. Eubacteria 10 1 Archaea 10 0 10-1 Col 31 vs Col 32 Further work underway: Paramecium biaurelia Paramecium sexaurelia Tetrahymena thermophila 10-2 10 1 10 2 10 3 Genome Size (Mb) Sung et al. (2012, PNAS)

Nucleotide Diversity at Silent Sites in Protein-coding Genes Estimates 4N e u u = base-substitution mutation rate N e = genetic effective population size 2u Expected nucleotide heterozygosity at neutral sites: 4N e u 1/(2N e ) ACA ACC ACG ACT Silent site The Four Threonine Codons

4N e u for Paramecium Nuclear Genes is Exceptionally High Implies N e 10 8, the highest estimated value for a eukaryote. Prokaryotes (0.1044, n = 48) Invertebrates (0.0219, n = 12) Uni/oligocellular sps. (0.0571, n = 27) Paramecium (0.005 to 0.159) Land plants (0.0159, n = 21) Vertebrates (0.0044, n = 19) 10-3 10-2 10-1 10 0 Silent-site Nucleotide Heterozygosity Possibly downwardly biased: Probably some selection on silent sites.

Population-genomic Sequencing Parul Johri, Tom Doak, Sascha Krenek Thanks to: Thomas Berendonk, Eike Dusi, Sergei Fokin, Alexey Potekhin, Eva Pryzbos

Cohesive Species Phylogenies Determined from Whole-genome Sequencing caudatum tetraurelia biaurelia sexaurelia

The two most recent whole-genome (WGD) duplication events preceded the emergence of the Aurelia species. These WGD events are unique to the Aurelia lineage and not shared by P. caudatum or P. multimicronucleatum. Very little post-wgd duplication: only ~300 tandem duplicates in each species, and only 8% of these are lineage specific. P. caudatum has ~1050 genes not found in any Aurelia species, and the latter have ~350 genes not shared by P. caudatum.

Evidence of On-going Gene Conversion Between Paralogs: Implications for the Age of WGD Events Paralog 1 Pb Pt Intraspecific paralogs Paralog 2 Pb Pt Frequency Interspecific paralogs Silent-site Divergence

Ages of the Two Most Recent WGD Events With 1000 cell divisions per year, these estimates should be divided by 10. ds = 1.7 between interspecific paralogs 2.5 x 10 9 years? ds = 8.0 1.5 x 10 9 years ago Mutation rate = 2.6 x 10-11 / site / division 100 cell divisions / year ~32 billion cell divisions ~320 x 10 6 years ago

A Powerful Resource for Studying the Evolutionary Demography of Gene Duplicates Almost no post-wgd chromosomal rearrangements. Almost no post-wgd single-gene duplications. Almost no transposable elements. caudatum Patterns of gene loss for 6086 unambiguous co-orthologous genes present prior to the aurelia-specific whole-genome duplication events.

Inferring the Fates of Gene Duplicates Uniform retention Divergent resolutions One paralog lost prior to the second WGD event caudatum

Stochastic, Divergent Losses of Duplicate Genes Leads to the Passive Origin of Reproductive Isolating Barriers by Inducing Microchromosomal Rearrangements Ancestral Species Acquisition of Duplicate Gene Geographic Isolation and Divergent Gene Silencing Hybridization Nonfunctional Gametes Nonfunctional Progeny

Dramatic Repatterning of Chromosomes by Divergent Resolution Drives Ongoing Emergence of Post-mating Isolating Barriers Among the Aurelia Lineages 2312 observed cases of divergent resolution between P. sexaurelia and P. biaurelia / tetraurelia. 2741 cases of ancestral / parallel loss. Paralog 1 X Pb/t Ps This balance is expected if the split between sexaurelia and biaurelia / tetraurelia occurred shortly after the last whole-genome duplication event. Paralog 2 Pb/t Ps 113 divergent resolutions between P. biaurelia and P. tetraurelia. 5421 cases of ancestral / parallel loss. Paralog 1 X Paralog 2 Pb Pt Pb Pt

The DDC Model coding region Duplication Degeneration Complementation subfunctionalization neofunctionalization nonfunctionalization Force et al. (Genetics, 1999)

Expression Level is the Strongest Predictor of Duplicate-gene Retention Probability P. biaurelia P. tetraurelia P. sexaurelia Frequency Retention Log (Expression Level)

The Expression Level of P. caudatum Genes Predicts the Retention Probability of Duplicate Genes in the Aurelia Species Recent WGD Intermediate WGD P. biaurelia P. sexaurelia P. tetraurelia

Despite saturation at silent sites, expression levels of orthologous proteins remain correlated. Ortholog 2 Expression Level P. biaurelia P. tetraurelia P. sexaurelia Ortholog 1 Expression Level

Quantitative Subfunctionalization: Joint Meandering of Paralog Expression Along the Drift Barrier Ortholog 2 Expression Ortholog 1 Expression

Extraordinary Constraints on Introns tetraurelia biaurelia sexaurelia caudatum Nucleotide Heterozygosity Length (bp) Silent-site heterozygosity in protein-coding genes = 0.13

Paramecium sps. have the smallest known intron sizes in eukaryotes. P. caudatum has one of the shortest average intergenic distances known in eukaryotes.

Strong Purifying Selection in Intergenic Regions Nucleotide heterozygosities: Silent sites Intergenic regions P. biaurelia 0.0079 0.0047 P. sexaurelia 0.0245 0.0155 P. tetraurelia 0.0051 0.0031 P. caudatum 0.1329 0.0311 P. multimicronucleatum 0.1589 0.0337

Regulatory Motifs Can be Detected Bioinformatically Putative activators have positive effects that decrease with distance from the gene. Putative repressors have negative effects that increase with distance from the gene. Associated Average Motif Gene Number Distance (bp) Reg. Coef. Significance

Using Injected Mini-chromosomes to Study Gene Expression Subcellular localization of fluorescent protein fusions indicates that functional change has occurred between paralogous Rab genes in P. tetraurelia (Lydia Bright).

A Strategy for Identifying Transcription-factor Binding Sites All 4096 possible 6-mers are contained in 184 unique 15-mers. The full set of constructs can be co-injected into the macronucleus, and interrogated for activity with high-throughput sequencing of the bar-coded regions. telomere unique 15-mer minimal promoter GFP unique 8-mer telomere Sfi1 1) Sph1 2) Sal1 AAAAA Sfi1

Host TFBS Cell Source Context Linear model: A A native use z AA = μ + h + f + x B B native use z BB = μ - h - f + x A B cross-species z AB = μ + h - f - x B A cross-species z BA = μ - h + f - x Host Species A B Full-parameter interpretation: Grand mean: μ = (z AA + z BB + z AB + z BA )/4 Host-cell (trans) effect: h = (z AA + z AB - z BB - z BA )/4 Binding-site (cis) effect: f = (z AA + z BA - z BB - z AB )/4 cis x trans interaction: x = (z AA + z BB - z AB - z BA )/4 Partial-parameter interpretation: Net host-cell effect: (h + x) = (z AA - z BA )/2 Net binding-site effect: (f + x) = (z AA - z AB )/2 A B Injected TFBSs

Casey McGrath Tom Doak Jean-Francois Gout Lydia Bright CC Chen Hongan Long Parul Johri Georgi Marinov Francesco Catania John Preer