Error thresholds on realistic fitness landscapes

Similar documents
Evolution on simple and realistic landscapes

Is the Concept of Error Catastrophy Relevant for Viruses? Peter Schuster

Evolution of Biomolecular Structure 2006 and RNA Secondary Structures in the Years to Come. Peter Schuster

Complexity in Evolutionary Processes

Tracing the Sources of Complexity in Evolution. Peter Schuster

How Nature Circumvents Low Probabilities: The Molecular Basis of Information and Complexity. Peter Schuster

RNA Bioinformatics Beyond the One Sequence-One Structure Paradigm. Peter Schuster

Evolution on Realistic Landscapes

Neutral Networks of RNA Genotypes and RNA Evolution in silico

How computation has changed research in chemistry and biology

RNA From Mathematical Models to Real Molecules

Evolution and Molecules

Mathematical Modeling of Evolution

Chemistry on the Early Earth

Designing RNA Structures

Mathematische Probleme aus den Life-Sciences

The Advantage of Using Mathematics in Biology

RNA From Mathematical Models to Real Molecules

Problem solving by inverse methods in systems biology

Mechanisms of molecular cooperation

Origin of life and early evolution in the light of present day molecular biology. Peter Schuster

Chemistry and Evolution at the Origin of Life. Visions and Reality

Different kinds of robustness in genetic and metabolic networks

The Mathematics of Darwinian Systems

Von der Thermodynamik zu Selbstorganisation, Evolution und Information

Self-Organization and Evolution

Linear Algebra of Eigen s Quasispecies Model

Linear Algebra of the Quasispecies Model

Population Genetics and Evolution III

A Method for Estimating Mean First-passage Time in Genetic Algorithms

Systems biology and complexity research

arxiv:physics/ v1 [physics.bio-ph] 27 Jun 2001

Error Propagation in the Hypercycle

Systems Biology: A Personal View IX. Landscapes. Sitabhra Sinha IMSc Chennai

Critical Mutation Rate Has an Exponential Dependence on Population Size

Evolutionary Dynamics and Optimization. Neutral Networks as Model-Landscapes. for. RNA Secondary-Structure Folding-Landscapes

Relaxation Property and Stability Analysis of the Quasispecies Models

Quasispecies distribution of Eigen model

Convergence of a Moran model to Eigen s quasispecies model

Bridging from Chemistry to the Life Sciences Evolution seen with the Glasses of a Physicist a

Replication and Mutation on Neutral Networks: Updated Version 2000

Evolutionary Dynamics & its Tendencies. David Krakauer, Santa Fe Institute.

COMP598: Advanced Computational Biology Methods and Research

Exploration of population fixed-points versus mutation rates for functions of unitation

The tree of life: Darwinian chemistry as the evolutionary force from cyanic acid to living molecules and cells

Applications of Diagonalization

Evolution and Design

Molecular models of biological evolution have attracted

Chemical Organization Theory

Neutral Evolution of Mutational Robustness

Discrete Nodal Domain Theorems

Lecture Notes in Mathematics Editors: J.-M. Morel, Cachan F. Takens, Groningen B. Teissier, Paris

arxiv:physics/ v1 [physics.bio-ph] 19 Apr 2000

Vom Modell zur Steuerung

Institut für Mathematik, Universität Wien, Strudlhofgasse 4, A-1090 Wien, Austria.

The Role of Topology in the Study of Evolution

Statistics of RNA Melting Kinetics

RNA folding at elementary step resolution

Critical Properties of Complex Fitness Landscapes

arxiv: v1 [q-bio.pe] 3 May 2017

Problems on Evolutionary dynamics

A coarse-grained Schrödinger cat

Field Theory of a Reaction- Diffusion Model of Quasispecies Dynamics

Fitness landscapes and seascapes

Provided for non-commercial research and educational use. Not for reproduction, distribution or commercial use.

Evolution of Model Proteins on a Foldability Landscape

Networks in Molecular Evolution

Effects of Neutral Selection on the Evolution of Molecular Species

The phenomenon of biological evolution: 19th century misconception

Optimal Mutation Rate Control under Selection in Hamming Spaces

Computer Vision Group Prof. Daniel Cremers. 11. Sampling Methods: Markov Chain Monte Carlo

Error threshold in simple landscapes

Baryon spectroscopy with spatially improved quark sources

Molecular evolution - Part 1. Pawan Dhar BII

Prediction of Locally Stable RNA Secondary Structures for Genome-Wide Surveys

Endowed with an Extra Sense : Mathematics and Evolution

Section 1.7: Properties of the Leslie Matrix

arxiv: v1 [q-bio.pe] 11 Dec 2017

Chance and necessity in evolution: lessons from RNA

MATH6142 Complex Networks Exam 2016

When to use bit-wise neutrality

Why EvoSysBio? Combine the rigor from two powerful quantitative modeling traditions: Molecular Systems Biology. Evolutionary Biology

Applications of The Perron-Frobenius Theorem

EVOLUTIONARY DYNAMICS AND THE EVOLUTION OF MULTIPLAYER COOPERATION IN A SUBDIVIDED POPULATION

W.Fontana et al.: RNA Landscapes Page 1 Summary. RNA secondary structures are computed from primary sequences by means of a folding algorithm which us

On the dynamics of prebiotic evolution

2 Discrete-Time Markov Chains

Biology Scope & Sequence

Unifying theories of molecular, community and network evolution 1

Dynamical community structure of populations evolving on genotype networks

Complex Systems Theory and Evolution

The Particle Filter. PD Dr. Rudolph Triebel Computer Vision Group. Machine Learning for Computer Vision

A Note On Minimum Path Bases

6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities

On the Halting Problem of Finite-State Programs

Starting with a brief historical introduction, we review recent results in the theory of evolutionary dynamics, emphasizing new mathematical and

The diversity of evolutionary dynamics on epistatic versus non-epistatic fitness landscapes

Unifying Evolutionary Dynamics

Introduction. Genetic Algorithm Theory. Overview of tutorial. The Simple Genetic Algorithm. Jonathan E. Rowe

Stochastic Demography, Coalescents, and Effective Population Size

Transcription:

Error thresholds on realistic fitness landscapes Peter Schuster Institut für Theoretische Chemie, Universität Wien, Austria and The Santa Fe Institute, Santa Fe, New Mexico, USA Evolutionary Dynamics: Mutation, Selection, and the Origin of Information University of Utrecht, 07.04.2010

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

Prologue The work on a molecular theory of evolution started 42 years ago... 1971 1977 1988 Chemical kinetics of molecular evolution

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

Stock solution: [A] = a = a 0 Flow rate: r = R -1 The population size N, the number of polynucleotide molecules, is controlled by the flow r N ( t) N ± N The flowreactor is a device for studying evolution in vitro and in silico

Replication and mutation in the flowreactor

Derivation of the replication-mutation equation from the flowreactor

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

Manfred Eigen 1927 - = = = = = = n i i n i i i j i n i ji j x x f Φ n j Φ x x W x 1 1 1, 1,2, ; dt d K Mutation and (correct) replication as parallel chemical reactions M. Eigen. 1971. Naturwissenschaften 58:465, M. Eigen & P. Schuster.1977. Naturwissenschaften 64:541, 65:7 und 65:341

Taq = thermus aquaticus Accuracy of replication: Q = q 1 q 2 q 3 q n The logics of DNA replication

RNA replication by Q -replicase C. Weissmann, The making of a phage. FEBS Letters 40 (1974), S10-S18

dx dt dx dt 1 2 = f2 x2 and = f1 x1 x, f = f f 1 = f2 ξ1 x2 = f1 ξ2, ζ = ξ1 + ξ2, η = ξ1 ξ2, 1 2 η( t) = η(0) e ft ζ ( t) = ζ (0) e ft Complementary replication as the simplest molecular mechanism of reproduction

Christof K. Biebricher, 1941-2009 Kinetics of RNA replication C.K. Biebricher, M. Eigen, W.C. Gardiner, Jr. Biochemistry 22:2544-2559, 1983

= = = = = = = = n i i n i i i j i i n i ji j i n i ji j x x f Φ n j Φ x x f Q Φ x x W x 1 1 1 1, 1,2, ; dt d K Factorization of the value matrix W separates mutation and fitness effects.

Mutation-selection equation: [I i ] = x i 0, f i 0, Q ij 0 dx i n n n = Q f x x i n x f x j ij j j i φ, = 1,2, L, ; i i = 1; φ = j j j dt = = 1 = 1 = 1 f solutions are obtained after integrating factor transformation by means of an eigenvalue problem x i () t = n 1 k n j= 1 ( 0) exp( λkt) c ( 0) exp( λ t) l = 0 ik ck ; i = 1,2, L, n; c (0) = n 1 k l k= 0 jk k k n i= 1 h ki x i (0) W 1 { f Q ; i, j= 1,2, L, n} ; L = { l ; i, j= 1,2, L, n} ; L = H = { h ; i, j= 1,2, L, n} i ij ij ij { λ ; k = 0,1,, n 1} 1 L W L = Λ = k L

Perron-Frobenius theorem applied to the value matrix W W is primitive: (i) is real and strictly positive (ii) λ 0 λ0 > λ k for all k 0 λ 0 (iii) is associated with strictly positive eigenvectors λ 0 (iv) is a simple root of the characteristic equation of W (v-vi) etc. W is irreducible: (i), (iii), (iv), etc. as above (ii) λ0 λ k for all k 0

constant level sets of Selection of quasispecies with f 1 = 1.9, f 2 = 2.0, f 3 = 2.1, and p = 0.01, parametric plot on S 3

Chain length and error threshold n p n p n p p n p Q n σ σ σ σ σ ln : constant ln : constant ln ) ln(1 1 ) (1 max max = K K sequence master superiority of length chain rate error accuracy replication ) (1 K K K K = = m j j m n f f σ n p p Q

Quasispecies Uniform distribution Stationary population or quasispecies as a function of the mutation or error rate p 0.00 0.05 0.10 Error rate p = 1-q

Eigenvalues of the matrix W as a function of the error rate p

Quasispecies Driving virus populations through threshold The error threshold in replication: No mutational backflow approximation

Molecular evolution of viruses

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

0, 0 largest eigenvalue and eigenvector diagonalization of matrix W complicated but not complex W = G F mutation matrix fitness landscape ( complex ) complex sequence structure complex mutation selection Complexity in molecular evolution

Sewall Wright. 1931. Evolution in Mendelian populations. Genetics 16:97-159. -- --. 1932. The roles of mutation, inbreeding, crossbreeding, and selection in evolution. In: D.F.Jones, ed. Proceedings of the Sixth International Congress on Genetics, Vol.I. Brooklyn Botanical Garden. Ithaca, NY, pp. 356-366. -- --. 1988. Surfaces of selective value revisited. The American Naturalist 131:115-131.

Build-up principle of binary sequence spaces

Build-up principle of four letter (AUGC) sequence spaces

linear and multiplicative hyperbolic Smooth fitness landscapes

The linear fitness landscape shows no error threshold

Error threshold on the hyperbolic landscape

single peak landscape step linear landscape Rugged fitness landscapes

Error threshold on the single peak landscape

Error threshold on a single peak fitness landscape with n = 50 and = 10

Error threshold on the step linear landscape

The error threshold can be separated into three phenomena: 1. Decrease in the concentration of the master sequence to very small values. 2. Sharp change in the stationary concentration of the quasispecies distribuiton. 3. Transition to the uniform distribution at small mutation rates. All three phenomena coincide for the quasispecies on the single peak fitness lanscape.

The error threshold can be separated into three phenomena: 1. Decrease in the concentration of the master sequence to very small values. 2. Sharp change in the stationary concentration of the quasispecies distribuiton. 3. Transition to the uniform distribution at small mutation rates. All three phenomena coincide for the quasispecies on the single peak fitness lanscape.

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

single peak landscape realistic landscape Rugged fitness landscapes over individual binary sequences with n = 10

Error threshold: Individual sequences n = 10, = 2, s = 491 and d = 0, 1.0, 1.875

Shift of the error threshold with increasing ruggedness of the fitness landscape

d = 0.100 d = 0.200 Case I: Strong Quasispecies n = 10, f 0 = 1.1, f n = 1.0, s = 919

d = 0.190 d = 0.190 Case II: Dominant single transition n = 10, f 0 = 1.1, f n = 1.0, s = 541

d = 0.190 d = 0.195 Case II: Dominant single transition n = 10, f 0 = 1.1, f n = 1.0, s = 541

d = 0.199 d = 0.199 Case II: Dominant single transition n = 10, f 0 = 1.1, f n = 1.0, s = 541

d = 0.100 d = 0.195 Case III: Multiple transitions n = 10, f 0 = 1.1, f n = 1.0, s = 637

d = 0.199 d = 0.200 Case III: Multiple transitions n = 10, f 0 = 1.1, f n = 1.0, s = 637

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

Motoo Kimuras population genetics of neutral evolution. Evolutionary rate at the molecular level. Nature 217: 624-626, 1955. The Neutral Theory of Molecular Evolution. Cambridge University Press. Cambridge, UK, 1983.

Motoo Kimura Is the Kimura scenario correct for frequent mutations?

0.5 ) ( ) ( lim 2 1 0 = = p x p x p d H = 1 a p x a p x p p = = 1 ) ( lim ) ( lim 2 0 1 0 d H = 2 d H 3 1 ) ( 0,lim ) ( lim or 0 ) ( 1,lim ) ( lim 2 0 1 0 2 0 1 0 = = = = p x p x p x p x p p p p Random fixation in the sense of Motoo Kimura Pairs of neutral sequences in replication networks P. Schuster, J. Swetina. 1988. Bull. Math. Biol. 50:635-650

A fitness landscape including neutrality

Neutral network: Individual sequences n = 10, = 1.1, d = 1.0

Consensus sequence of a quasispecies of two strongly coupled sequences of Hamming distance d H (X i,,x j ) = 1.

Neutral network: Individual sequences n = 10, = 1.1, d = 1.0

Consensus sequence of a quasispecies of two strongly coupled sequences of Hamming distance d H (X i,,x j ) = 2.

N = 7 Adjacency matrix Neutral networks with increasing : = 0.10, s = 229

1. Open system and constant population size 2. Chemical kinetics of replication and mutation 3. Complexity of fitness landscapes 4. Quasispecies on realistic landscapes 5. Neutrality and replication 6. Lethal variants and mutagenesis

Coworkers Peter Stadler, Bärbel M. Stadler, Universität Leipzig, GE Walter Fontana, Harvard Medical School, MA Universität Wien Martin Nowak, Harvard University, MA Christian Reidys, Nankai University, Tien Tsin, China Thomas Wiehe, Ulrike Göbel, Walter Grüner, Stefan Kopp, Jaqueline Weber, Institut für Molekulare Biotechnologie, Jena, GE Ivo L.Hofacker, Christoph Flamm, Universität Wien, AT Kurt Grünberger, Michael Kospach, Andreas Wernitznig, Stefanie Widder, Stefan Wuchty, Jan Cupal, Stefan Bernhart, Lukas Endler, Ulrike Langhammer, Rainer Machne, Ulrike Mückstein, Erich Bornberg-Bauer, Universität Wien, AT

Acknowledgement of support Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Universität Wien Wiener Wissenschafts-, Forschungs- und Technologiefonds (WWTF) Project No. Mat05 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Contracts No. 98-0189, 12835 (NEST) Austrian Genome Research Program GEN-AU: Bioinformatics Network (BIN) Österreichische Akademie der Wissenschaften Siemens AG, Austria Universität Wien and the Santa Fe Institute

Thank you for your attention!

Web-Page for further information: http://www.tbi.univie.ac.at/~pks