Gene regulation: From biophysics to evolutionary genetics

Similar documents
Fitness landscapes and seascapes

Graph Alignment and Biological Networks

Supporting Information

Evolution of molecular phenotypes under stabilizing selection

Introduction to Bioinformatics

Population Genetics and Evolution III

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Intensive Course on Population Genetics. Ville Mustonen

Lecture Notes: BIOL2007 Molecular Evolution

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility

Neutral Theory of Molecular Evolution

Drosophila melanogaster and D. simulans, two fruit fly species that are nearly

Why EvoSysBio? Combine the rigor from two powerful quantitative modeling traditions: Molecular Systems Biology. Evolutionary Biology

Frequent Gain and Loss of Functional Transcription Factor Binding Sites

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law

Why transcription factor binding sites are ten nucleotides long

Genetically Engineering Yeast to Understand Molecular Modes of Speciation

Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution. Classical vs. balanced views of genome structure

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION

Predicting Protein Functions and Domain Interactions from Protein Interactions

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

Pervasive adaptation of gene expression in Drosophila

D. Incorrect! That is what a phylogenetic tree intends to depict.

Phylogenetics in the Age of Genomics: Prospects and Challenges

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

Computational approaches for functional genomics

Population Genetics: a tutorial

Computational Biology: Basics & Interesting Problems

From Biophysics to Evolutionary Genetics: Statistical Aspects of Gene Regulation

Binding of transcription factors adapts to resolve information-energy tradeoff

Lecture 1 Modeling in Biology: an introduction

Genetic assimilation can occur in the absence of selection for the assimilating phenotype, suggesting a role for the canalization heuristic

Computational methods for predicting protein-protein interactions

Different gene regulation strategies revealed by analysis of binding motifs

Classical Selection, Balancing Selection, and Neutral Mutations

Lecture 2: Read about the yeast MAT locus in Molecular Biology of the Gene. Watson et al. Chapter 10. Plus section on yeast as a model system Read

Bypass and interaction suppressors; pathway analysis

How much non-coding DNA do eukaryotes require?

TE content correlates positively with genome size

ACTA PHYSICA DEBRECINA XLVI, 47 (2012) MODELLING GENE REGULATION WITH BOOLEAN NETWORKS. Abstract

7. Tests for selection

Weighing the evidence for adaptation at the molecular level

Understanding relationship between homologous sequences

Basic modeling approaches for biological systems. Mahesh Bule

56:198:582 Biological Networks Lecture 8

Population Genetics I. Bio

Emergence of gene regulatory networks under functional constraints

Biophysical Fitness Landscapes for Transcription Factor Binding Sites

Casting Polymer Nets To Optimize Molecular Codes

Natural Selection results in increase in one (or more) genotypes relative to other genotypes.

Genome-wide evolutionary rates in laboratory and wild yeast

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Fitness constraints on horizontal gene transfer

Surfing genes. On the fate of neutral mutations in a spreading population

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION

Survival of the simplest: the cost of complexity in microbial evolution. Abstract. Introduction

Recombina*on and Linkage Disequilibrium (LD)

2. Der Dissertation zugrunde liegende Publikationen und Manuskripte. 2.1 Fine scale mapping in the sex locus region of the honey bee (Apis mellifera)

Rise and Fall of Mutator Bacteria

EVOLUTIONARY DYNAMICS AND THE EVOLUTION OF MULTIPLAYER COOPERATION IN A SUBDIVIDED POPULATION

Plant Molecular and Cellular Biology Lecture 8: Mechanisms of Cell Cycle Control and DNA Synthesis Gary Peter

Different gene regulation strategies revealed by analysis of binding motifs

Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

The Research Plan. Functional Genomics Research Stream. Transcription Factors. Tuning In Is A Good Idea

Multiple routes to subfunctionalization and gene duplicate. specialization

The neutral theory of molecular evolution

Principles of Genetics

Computational Analysis of Transcriptional Programs: Function and Evolution

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

Processes of Evolution

THE EVOLUTION OF DUPLICATED GENES CONSIDERING PROTEIN STABILITY CONSTRAINTS

Polymorphism due to multiple amino acid substitutions at a codon site within

Supplemental Information for Pramila et al. Periodic Normal Mixture Model (PNM)

What is Systems Biology

The Emergence of Modularity in Biological Systems

Contributed by Michael Lynch, November 20, 2014 (sent for review September 16, 2014; reviewed by Jeremy Gunawardena and Michael Lassig) Significance

Content Standards Learning and Performance Expectations Assessment of Learning

Introduction to population genetics & evolution

FOGA II. WHAT DOES A GENOME HAVE TO DO? - GENOME FUNCTION AND ORGANIZATION

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.

Sex accelerates adaptation

Unifying theories of molecular, community and network evolution 1

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Mole_Oce Lecture # 24: Introduction to genomics

- point mutations in most non-coding DNA sites likely are likely neutral in their phenotypic effects.

Genetic transcription and regulation

Identifying targets of positive selection in non-equilibrium populations: The population genetics of adaptation

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Measuring TF-DNA interactions

GENE duplication has long been viewed as a mechanism

Exploring the network of genetic interactions and evolutionary innovations with metabolic modeling

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

Bi 8 Lecture 11. Quantitative aspects of transcription factor binding and gene regulatory circuit design. Ellen Rothenberg 9 February 2016

Curriculum Map. Biology, Quarter 1 Big Ideas: From Molecules to Organisms: Structures and Processes (BIO1.LS1)

Transcription:

Gene regulation: From biophysics to evolutionary genetics Michael Lässig Institute for Theoretical Physics University of Cologne

Thanks Ville Mustonen Johannes Berg Stana Willmann Curt Callan (Princeton) Justin Kinney (Princeton) Luca Peliti (Naples) SFB TR 12 SFB 680 European Research Training Network STIPCO

Structure and dynamics of molecular networks Structure: Random parts? Functional design? Evolution: Pathways? Tempo?

1. Evolution of regulatory DNA

Genomic encoding of network interactions Multiple binding sites allow for complex regulation of individual genes in higher organisms: [Bolouri and Davidson, 2002] Input-output relation? Evolutionary dynamics?

Biophysics of transcriptional regulation Transcription factor proteins bind to specific DNA sites catalyzing transcription. a 1 a k E(a) Binding energy E(a) can be obtained from - low-throughput measurements [Fields et al. 97] - position weight matrix of functional sites [Berg and v.hippel 86] - ChIP-chip data [Float et al. 05, Kinney et al. 06] - high-throughput measurements [Maerkl and Quake 07]. E(a) depends on the site sequence a = (a 1,,a k ): + nonlinear terms? E(a) is the molecular phenotype of a site, which quantifies its functionality.

Cis-regulatory elements: from sequence to phenotype The binding energy E(a) is the molecular phenotype of a site, which quantifies its functionality. This phenotype predicts binding intensity to promoters in yeast: [Mustonen, Kinney, Callen, M.L., PNAS 2008] ChIP-chip data for Abf1 binding in yeast [Lee et al., Science 2002]

Cis-regulatory elements: from phenotype to fitness For broad-acting transcription factors, high-affinity sites (E < E b ) are statistically overrepresented. P 0 (E) Q Q(E) At stationarity, the ensembles of functional and background sites determine the average fitness landscape F(E) of a site: E b E [kb T] This predicts a moderate fitness effect per functional site: Abf1 binding sites in S. cerevisiae [Berg, Willmann, M.L., BMC Evol. Biol. 2004, Mustonen, M.L., PNAS 2005, Mustonen, Kinney, Callen, M.L., PNAS 2008 ]

Population genetics Selection: sequence state a has fitness Point mutations:

Population genetics Genetic drift: Kimura-Ohta substitution rates Ratio of forward and backward rates: b a

Population genetics Evolutionary equilibria in sequence space: Given two families of loci, - background loci with stationary sequence distribution P 0 (a) under neutral evolution - functional loci with stationary sequence distribution Q(a) under selection the fitness landscape F(a) for the functional loci is given by N: effective population size. [J.Berg, S. Willmann, M.L., BMC Evol. Biol. (2004)] [V. Mustonen, M.L., Proc. Natl. Acad. Sci. (2005)]

Phenotypic evolution of binding sites The inferred fitness landscape quantitatively predicts the evolution of the phenotype E: E 1 Abf1 binding energy differences of sites in S. cerevisiae, S.paradoxus, S. mikatae, S. bayanus [Mustonen, Kinney, Callen, M.L., PNAS 2008]

Pathways of promoter evolution Conservation of site and function fitness E Site turnover fitness fitness E E + conservation of function + turnover of function Time-dependent selection + adaptation fitness fitness

Conservation of binding sites Sequences of conserved sites evolve by compensatory mutations: but E Hence, the energy phenotype is more constrained than the site sequence: S. cerevisiae S. paradoxus S. mikatae S. bayanus [cf. Kellis et al., Nature 2003] [Mustonen, Kinney, Callen, M.L., PNAS 2008] divergence time from cer

Loss and gain of function Turnover of promoter function determines loss and gain of regulatory interactions: E Species-specific loss of sites: S. cerevisiae S. paradoxus S. mikatae S. bayanus E bay Functional turnover rate γ f ~ 0.1 µ [J. Kinney, V. Mustonen, C. Callan, M.L., PNAS 2008] cer par mik bay µt

Evolutionary systems biology Natural selection acts on complex systems in a scale-dependent way: system level function turnover substitutions in binding sites synonymous substitutions 1 2 3 4 selection strength 2N ΔF

Evolutionary systems biology Laboratory experiments, modeling, and evolutionary genomics address complementary aspects of biological systems: N -1 10-5 10-4 10-3 10-2 10-1 1 fitness neutral evolution network turnover, differentiation, innovation network kernels lab experiments network modeling evolutionary genomics

2. Evolution of the Drosophila genome

From fitness landscapes to seascapes Phenotypic concept of Darwinian selection: newly arising selection and response by adaptation. Can we trace the time-dependence of selection in genomic data?

Genome evolution under constant and fluctuating selection x(t) Fig. 1a x(t) time time Allele frequency x(t) evolves under selection, mutations, stochastic fluctuations (genetic drift). Constant selection leads to evolutionary equilibrium, p eq (x). Fluctuating selection ΔF (t) = f χ(t) with switching rate γ, leads to adaptation: excess number of uphill mutations w/r to equilibrium.

Adaptation and fitness flux Substitutions and polymorphism spectra [Glinka et al 2003, Ometto et al 2005] are used to infer a surplus of beneficial over deleterious substitutions. Adaptation is quantified by a positive fitness flux = (substititution rate) x (average selection coefficient of substitutions). γ f [Mustonen and M.L, PNAS 2007]

Fitness seascapes What drives the waves? systems component: correlations (epistasis) cause compensatory mutations. external component: time-dependent genomic component: environment linkage to other loci Nonequilibrium + correlations: one external change can trigger an avalanche of responses.

Conclusions Conclusions Adaptive evolution should be viewed as a nonequilibrium phenomenon. Adaptation can be quantified by the fitness flux in a population over a given time interval. The biophysical binding energy is a quantitative molecular phenotype for regulatory sequences in yeast. Genomic sequence analysis can be used to infer fitness landscapes for this phenotype. In Drosophila, fitness seascapes drive adaptive evolution. Review articles: From Biophysics to evolutionary genetics, M.L., BMC Bioinformatics 2007 From fitness landscapes to seascapes: The dynamics of selection and adaptation, V. Mustonen and M.L., Trends in Genetics 2009