Convergence of a Moran model to Eigen s quasispecies model

Similar documents
On Lotka-Volterra Evolution Law

3.8 Three Types of Convergence

Solutions of some selected problems of Homework 4

ON REGULARITY, TRANSITIVITY, AND ERGODIC PRINCIPLE FOR QUADRATIC STOCHASTIC VOLTERRA OPERATORS MANSOOR SABUROV

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013).

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

CSE525: Randomized Algorithms and Probabilistic Analysis May 16, Lecture 13

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

Interactive Markov Models of Evolutionary Algorithms

A Fast Algorithm for the Spread of HIV in a System of Prisons

A Markov Framework for the Simple Genetic Algorithm

A note on the multiplication of sparse matrices

STRONG LAW OF LARGE NUMBERS FOR SCALAR-NORMED SUMS OF ELEMENTS OF REGRESSIVE SEQUENCES OF RANDOM VARIABLES

Poornima University, For any query, contact us at: , 18

Block designs and statistics

Note on generating all subsets of a finite set with disjoint unions

Quantum algorithms (CO 781, Winter 2008) Prof. Andrew Childs, University of Waterloo LECTURE 15: Unstructured search and spatial search

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

Modeling Chemical Reactions with Single Reactant Specie

A note on the realignment criterion

Polygonal Designs: Existence and Construction

AN APPLICATION OF CUBIC B-SPLINE FINITE ELEMENT METHOD FOR THE BURGERS EQUATION

Math 262A Lecture Notes - Nechiporuk s Theorem

The linear sampling method and the MUSIC algorithm

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t.

Research Article On the Isolated Vertices and Connectivity in Random Intersection Graphs

Computational and Statistical Learning Theory

Intelligent Systems: Reasoning and Recognition. Artificial Neural Networks

The degree of a typical vertex in generalized random intersection graph models

Asymptotics of weighted random sums

The Hilbert Schmidt version of the commutator theorem for zero trace matrices

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

Supplementary Material for Fast and Provable Algorithms for Spectrally Sparse Signal Reconstruction via Low-Rank Hankel Matrix Completion

}, (n 0) be a finite irreducible, discrete time MC. Let S = {1, 2,, m} be its state space. Let P = [p ij. ] be the transition matrix of the MC.

On the characterization of non-linear diffusion equations. An application in soil mechanics

arxiv: v3 [cs.sy] 29 Dec 2014

Pattern Recognition and Machine Learning. Artificial Neural networks

Chaotic Coupled Map Lattices

Uniform Approximation and Bernstein Polynomials with Coefficients in the Unit Interval

A Note on the Application of Incidence Matrices of Simplicial Complexes

On the Communication Complexity of Lipschitzian Optimization for the Coordinated Model of Computation

The Weierstrass Approximation Theorem

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

Characterization of the Line Complexity of Cellular Automata Generated by Polynomial Transition Rules. Bertrand Stone

The Fundamental Basis Theorem of Geometry from an algebraic point of view

Completeness of Bethe s states for generalized XXZ model, II.

Metric Entropy of Convex Hulls

Algorithms for parallel processor scheduling with distinct due windows and unit-time jobs

Linguistic majorities with difference in support

Lecture 20 November 7, 2013

arxiv:physics/ v1 [physics.bio-ph] 27 Jun 2001

A Simple Regression Problem

arxiv: v2 [math.co] 8 Mar 2018

Lecture 9 November 23, 2015

About the definition of parameters and regimes of active two-port networks with variable loads on the basis of projective geometry

arxiv: v1 [math.co] 19 Apr 2017

Applied Mathematics Letters

Inclusions Between the Spaces of Strongly Almost Convergent Sequences Defined by An Orlicz Function in A Seminormed Space

Recursive Algebraic Frisch Scheme: a Particle-Based Approach

Randomized Recovery for Boolean Compressed Sensing

Slanted coupling of one-dimensional arrays of small tunnel junctions

Testing Properties of Collections of Distributions

Graphical Models in Local, Asymmetric Multi-Agent Markov Decision Processes

arxiv: v1 [math.nt] 14 Sep 2014

On Conditions for Linearity of Optimal Estimation

Handout 7. and Pr [M(x) = χ L (x) M(x) =? ] = 1.

A1. Find all ordered pairs (a, b) of positive integers for which 1 a + 1 b = 3

Linear Algebra of Eigen s Quasispecies Model

Multi-Dimensional Hegselmann-Krause Dynamics

Comparison of Stability of Selected Numerical Methods for Solving Stiff Semi- Linear Differential Equations

Pattern Recognition and Machine Learning. Artificial Neural networks

arxiv: v2 [math.co] 3 Dec 2008

Physics 215 Winter The Density Matrix

1 Proof of learning bounds

List Scheduling and LPT Oliver Braun (09/05/2017)

4 = (0.02) 3 13, = 0.25 because = 25. Simi-

Lean Walsh Transform

Hybrid System Identification: An SDP Approach

When Short Runs Beat Long Runs

Chapter 6 1-D Continuous Groups

Study on Markov Alternative Renewal Reward. Process for VLSI Cell Partitioning

ON THE TWO-LEVEL PRECONDITIONING IN LEAST SQUARES METHOD

Lecture 21 Principle of Inclusion and Exclusion

Page 1 Lab 1 Elementary Matrix and Linear Algebra Spring 2011

Beyond Mere Convergence

A Self-Organizing Model for Logical Regression Jerry Farlow 1 University of Maine. (1900 words)

Testing equality of variances for multiple univariate normal populations

Computable Shell Decomposition Bounds

A new type of lower bound for the largest eigenvalue of a symmetric matrix

Support recovery in compressed sensing: An estimation theoretic approach

Rademacher Complexity Margin Bounds for Learning with a Large Number of Classes

The Simplex Method is Strongly Polynomial for the Markov Decision Problem with a Fixed Discount Rate

Deflation of the I-O Series Some Technical Aspects. Giorgio Rampa University of Genoa April 2007

PREPRINT 2006:17. Inequalities of the Brunn-Minkowski Type for Gaussian Measures CHRISTER BORELL

An Algorithm for Quantization of Discrete Probability Distributions

Chapter 1: Basics of Vibrations for Simple Mechanical Systems

THE AVERAGE NORM OF POLYNOMIALS OF FIXED HEIGHT

Supplement to: Subsampling Methods for Persistent Homology

Donald Fussell. October 28, Computer Science Department The University of Texas at Austin. Point Masses and Force Fields.

Transcription:

Convergence of a Moran odel to Eigen s quasispecies odel Joseba Dalau Université Paris Sud and ENS Paris June 6, 2018 arxiv:1404.2133v1 [q-bio.pe] 8 Apr 2014 Abstract We prove that a Moran odel converges in probability to Eigen s quasispecies odel in the infinite population liit. 1 Introduction The concept of quasispecies was proposed by Manfred Eigen in order to explain how a population of acroolecules behaves when subject to an evolutionary process with selection and utation. In his celebrated paper [8], Eigen odels the evolution of a population of acroolecules via a syste of differential equations, which arises fro the laws of cheical kinetics. Selection is perfored according to a fitness landscape, and utations occur in the course of reproductions, independently at each locus with rate q. On the sharp peak landscape all but one sequence, the aster sequence, have the sae fitness and the aster sequence has higher fitness than the rest Eigen discovered that an error threshold phenoenon takes place: there exists a critical utation rate q such that if q > q then at equilibriu the population is totally rando, while if q < q then at equilibriu the population fors a quasispecies, i.e., it contains a positive fraction of the aster sequence along with a cloud of utants that closely reseble the aster sequence. The concepts of error threshold and quasispecies ight not only be relevant in olecular genetics, but also in several other areas of biology, naely population genetics or virology [7]. Nevertheless, in Eigen s odel the dynaics of the concentrations of the different genotypes is driven by a syste of differential equations, which is a ajor drawback for the viability of the odel in settings ore coplex than the olecular level [18]. A finite and stochastic version of Eigen s quasispecies odel would be uch ore suitable to expand the quasispecies theory to other areas [9, 16, 18]. 1

The issue of designing a finite population version of the quasispecies odel has been tackled by several authors. Different approaches have been considered in the literature: Alves and Fontanari [1] propose a finite population odel and they study the dependence of the error threshold on the population size, a siilar approach is taken by McCaskill [11], Park, Muñoz, Dee [14] and Saakian, Dee, Hu [15], who all suggest different kinds of finite population odels. In [13], Nowak and Schuster derive the error threshold for finite populations using a birth and death chain. More recently, in [2, 3], Cerf shows that the error threshold and quasispecies concepts arise for both the Moran odel and the classical Wright Fisher odel in the appropriate asyptotic regies. Soe other authors propose stochastic odels that converge to Eigen s odel in the infinite population liit, this is the approach taken by Deetrius, Schuster, Sigund [5], who use branching processes, Dixit, Srivastava, Vishnoi [6] or Musso [12]. Showing convergence of a finite population odel to Eigen s odel is in general a delicate atter; to our knowledge, all the works that have been done in this direction prove that soe stochastic process converges to Eigen s odel in expectation. As pointed out in [5], convergence in expectation can be isleading soeties, ainly due to the fact that variation ight increase as expectation converges, leading to a poor understanding of the asyptotic behaviour of the stochastic process. In the current work we consider the Moran odel studied in [2, 4] driving the evolution of a finite population subject to selection and utation effects. This Moran odel is shown to converge to Eigen s quasispecies odel in the infinite population liit, independently of the fitness landscape: on any finite tie interval, we prove convergence in probability for the supreu nor. The interest of our result not only lies on the type of convergence, but also on the choice of the odel: the Moran odel is possibly one of the siplest odels for which such a result can be expected. The result is proven by eans of a theore due to Kurtz [10], which gives sufficient conditions for the convergence of a sequence of Markov processes to a deterinistic trajectory, characterised by a syste of differential equations. The article is organised as follows: first we briefly introduce Eigen s quasispecies odel and the Moran odel. We state the ain result in section 3. In section 4 we adapt Kurtz s theore, which originally deals with continuous state space Markov chains, to the discrete state space setting. Finally, in section 5, we apply Kurtz s theore in order to prove the ain result. 2

2 The Eigen and Moran odels We present here Eigen s quasispecies odel and a discrete Moran odel. Consider a set of N different genotypes, labelled fro 1 to N. Both Eigen s odel and the Moran odel describe the evolution of a population of individuals having genotypes 1,...,N. In both odels the evolution of the population is driven by two ain forces: selection and utation. The selection and utation echaniss depend only on the genotypes, and are coon to both odels. Selection is perfored with a fitness landscape (f i ) 1 i N, f i being the reproduction rate of an individual having genotype i. The utation schee is encoded in a utation atrix(q ij ) 1 i,j N, Q ij being the probability that an individual having genotype i utates into an individual having genotype j. The utation atrix is assued to be stochastic, i.e., its entries are non negative and the rows add up to 1. Eigen s odel. Eigen originally forulated the quasispecies odel to explain the evolution of a population of acroolecules. The evolution of the concentration of the different genotypes is driven by a syste of differential equations, obtained fro the theory of cheical kinetics. Let us denote by S N the unit siplex, i.e., S N = {x R N : x i 0, 1 i N and x 1 + +x N = 1}. An eleent x S N represents a population in which the concentration of the individuals having the i th genotype is x i, for 1 i N. Let x 0 S N be the starting population and let us denote by x(t) the population at tie t > 0. Eigen s odel describes the dynaics of x(t) thorough the following syste of differential equations: ( ) x i(t) = N N f k Q ki x k (t) x i (t) f k x k (t), 1 i N, k=1 k=1 with initial condition x(0) = x 0. The first ter in the differential equation accounts for the replication rate and utations towards the i th genotype, while the second ter helps to keep the total concentration constant. A recent review on Eigen s quasispecies odel can be found in [17]. The Moran odel. Moran odels ai at describing the evolution of a finite population. The dynaics of the population is stochastic, the evolution is described by a Markov chain. Loosely speaking, the Moran odel evolves as follows: at each step of tie, an individual is selected fro the current 3

population according to its fitness, this individual then produces an offspring, which is subject to utations. Finally, an individual chosen uniforly at rando fro the population is replaced by the offspring. The state space of the Moran process will be the set PN of the ordered partitions of the integer in at ost N parts: P N = {z N N : z 1 + +z N = }. An eleent z P N represents a population in which z i individuals have the genotype i, for 1 i N. The only allowed changes at each tie step consist in replacing an individual fro the current population by a new one. If we denote by (e i ) 1 i N the canonical basis of R N, the only allowed changes in a population are of the for z z e i +e j 1 i,j N. Let λ be a constant such that λ ax{f i : 1 i N }. The Moran process is the Markov chain (Z n ) n 0 having state space PN and transition atrix p given by: for all z Pl+1 and i,j {1,...,N } such that i j, p(z,z e i +e j ) = z i 1 N f k Q kj z k. λ The other non diagonal coefficients of the transition atrix are null, the diagonal coefficients are arranged so that the atrix is stochastic, i.e., the entries are non negative and the rows add up to 1. k=1 3 Main result Our ai is to show that Eigen s quasispecies odel arises as the infinite population liit of the Moran odel. More precisely, we will prove the following result: Theore 3.1. Let (Z n ) n 0 be the Moran process described above. Suppose that we have the convergence of the initial conditions towards x 0 : 1 li Z 0 = x 0, and let x(t) be the solution of the syste of differential equations ( ) with initial condition x(0) = x 0. Then, for every δ,t > 0, we have ( li P sup 0 t T ) 1 Z λt x(t) > δ = 0. 4

This result is an iediate consequence of theore 4.7 in [10]. In order to prove the result, we proceed in two steps. We state first theore 4.7 in [10], and we show next that all the hypotheses needed to apply the theore are fulfilled in our particular setting. 4 Convergence of a faily of Markov chains Let d 1 and let E be a subset of R d. Let ( (X n ) n 0, 1 ) be a sequence of discrete tie Markov chains with state spaces E E and transition atrices (p (x,y)) x,y E. Let F : R d R d and consider the syste of differential equations x i (t) = F i(x(t)), 1 i N. Theore 4.7 in [10] gives a series of sufficient conditions under which the sequence of Markov chains (X ) 1 converges to a solution of the above syste of differential equations. The original stateent of theore 4.7 in [10] is written for the ore general setting of continuous state space Markov chains. We odify just the notation in [10] in order to state the result in a way which is ore suited to our particular setting. Theore 4.7 in [10] can be applied if the following set of conditions is satisfied. There exist sequences of positive nubers (α ) 1 and (ε ) 1 such that 1. li α = and li ε = 0. 2. sup sup α y x p (x,y) <. 1 x E y E 3. li sup α y x p (x,y) = 0. x E y E : y x >ε Define, for 1, F (x) = α (y x)p (x,y). y E 4. li sup x E F (x) F(x) = 0. 5. There exists a constant M such that x,y E, F(x) F(y) M x y. 5

Theore 4.1 (Kurtz). Suppose that conditions 1 5 are satisfied. Suppose further that we have the convergence of the initial conditions Then, for every δ,t > 0, we have ( li P sup 0 t T li X 0 = x0. X αt x(t) ) > δ = 0. 5 Proof of theore 3.1 Let (Z n ) n 0 be the Moran process defined in section 2. Our ai is to apply theore 4.1 to the sequence of Markov chains ( (Z n /) n 0, 1 ). We only need to find the appropriate sequences (α ) 1 and (ε ) 1 and verify that conditions 1 5 are satisfied in our setting. For 1, let α = λ and ε = 2/. The sequences (α ) 1 and(ε ) 1 obviously verify condition 1. As for condition 2, we have, for z P N z z p(z,z ) = z P N N e i + e j p(z,z e i +e j ) i,j=1 2. Thus, sup 1 sup z P N α z P N z z p(z,z ) λ 2 <, as required for condition 2. Since p(z,z ) > 0 if and only if z z 2, for all 1 and z PN, we have z P N : z z >ε z z p(z,z ) = 0, and condition 3 is also satisfied. Let F : R N R N be the function defined by i {1,...,N }, x R N, F i (x) = N N f j Q ji x j x i f j x j. j=1 j=1 6

Since all the partial derivatives of F are bounded on the siplex S N, F is a Lipschitz function on S N, i.e., condition 5 holds. Finally, let us copute, for 1 and x P N /, the value of F (x). By definition, F (x) = λ z P N ( z x ) p(x,z) = λ Thus, for i {1,...,N } we have N ( e i +e j )p(x,x e i +e j ). i,j=1 Fi (x) = λ p(x,x e k +e i ) λ i +e k ) k:k i k:k ip(x,x e = N x k f j Q ji x j N x i f j Q jk x j. k:k i j=1 k:k i j=1 Since x 1 + +x N = 1 and for all i {1,...,N }, Q i1 + +Q in = 1, F i (x) = (1 x i ) N N f j Q ji x j x i f j (1 Q ji )x j = j=1 j=1 N N f j Q ji x j x i f j x j. j=1 j=1 Thus, the function F coincides with the function F on the set P N /, which readily iplies condition 4. Since all five conditions are satisfied, we can apply theore 4.1 to the sequence of Markov chains ( (Z n /) n 0, 1 ) and we obtain the desired result. References [1] Doingos Alves and Jose Fernando Fontanari. Error threshold in finite populations. Phys. Rev. E, 57:7008 7013, 1998. [2] Raphaël Cerf. Critical population and error threshold on the sharp peak landscape for a Moran odel. preprint, 2012. [3] Raphaël Cerf. Critical population and error threshold on the sharp peak landscape for the Wright Fisher odel. preprint, 2012. [4] Raphaël Cerf and Joseba Dalau. The distribution of the quasispecies for a Moran odel on the sharp peak landscape. preprint, 2013. [5] Lloyd Deetrius, Peter Schuster, and Karl Sigund. Polynucleotide evolution and branching processes. Bulletin of Matheatical Biology, 47(2):239 262, 1985. 7

[6] Narendra M. Dixit, Piyush Srivastava, and Nisheeth K. Vishnoi. A finite population odel of olecular evolution: theory and coputation. J. Coput. Biol., 19(10):1176 1202, 2012. [7] Esteban Doingo. Quasispecies theory in virology. Journal of Virology, 76(1):463 465, 2002. [8] Manfred Eigen. Self-organization of atter and the evolution of biological acroolecules. Naturwissenschaften, 58(10):465 523, 1971. [9] Manfred Eigen, John McCaskill, and Peter Schuster. The olecular quasi-species. Advances in Cheical Physics, 75:149 263, 1989. [10] Thoas G. Kurtz. Solutions of ordinary differential equations as liits of pure jup Markov processes. J. Appl. Probability, 7:49 58, 1970. [11] John McCaskill. A stochastic theory of acroolecular evolution. Biological Cybernetics, 50:63 73, 1984. [12] Fabio Musso. A stochastic version of the Eigen odel. Bull. Math. Biol., 73(1):151 180, 2011. [13] Martin A. Nowak and Peter Schuster. Error thresholds of replication in finite populations. Mutation frequencies and the onset of Muller s ratchet. Journal of theoretical Biology, 137 (4):375 395, 1989. [14] Jeong-Man Park, Enrique Muñoz, and Michael W. Dee. Quasispecies theory for finite populations. Phys. Rev. E, 81:011902, 2010. [15] David B. Saakian, Michael W. Dee, and Chin-Kun Hu. Finite population size effects in quasispecies odels with single-peak fitness landscape. Europhysics Letters, 98(1):18001, 2012. [16] Peter Schuster. Matheatical odeling of evolution. Solved and open probles. Theory Biosci., 130:71 89, 2011. [17] Peter Schuster. The atheatics of Darwin s theory of evolution: 1859 and 150 years later. In The atheatics of Darwin s legacy, Math. Biosci. Interact., pages 27 66. Birkhäuser/Springer Basel AG, Basel, 2011. [18] Claus Wilke. Quasispecies theory in the context of population genetics. BMC Evolutionary Biology, 5:1 8, 2005. 8