Surfing genes. On the fate of neutral mutations in a spreading population

Similar documents
Gene Surfing in Expanding Populations

The fate of beneficial mutations in a range expansion

Wave-like phenomena are ubiquitous in nature and have. Fluctuations uncover a distinct class of traveling waves

Genetic drift at expanding frontiers promotes gene segregation (Preprint; look up PNAS website for published version)

Genetic Drift in Human Evolution

Statistical Models in Evolutionary Biology An Introductory Discussion

How robust are the predictions of the W-F Model?

Population Genetics I. Bio

RANGE expansion through a series of colonization events. Serial Founder Effects During Range Expansion: A Spatial Analog of Genetic Drift

6 Introduction to Population Genetics

Dynamics of the evolving Bolthausen-Sznitman coalescent. by Jason Schweinsberg University of California at San Diego.

Mathematical models in population genetics II

arxiv: v2 [q-bio.pe] 13 Apr 2011

6 Introduction to Population Genetics

Demography April 10, 2015

Evolution in range expansions with competition at rough boundaries

How should we organize the diversity of animal life?

The Wright-Fisher Model and Genetic Drift

GENETICS. On the Fixation Process of a Beneficial Mutation in a Variable Environment

Population Genetics: a tutorial

Lecture 18 : Ewens sampling formula

Frequency Spectra and Inference in Population Genetics

Endowed with an Extra Sense : Mathematics and Evolution

The genealogy of branching Brownian motion with absorption. by Jason Schweinsberg University of California at San Diego

Theory a well supported testable explanation of phenomenon occurring in the natural world.

Rare Alleles and Selection

Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles

Computational Systems Biology: Biology X

Genealogies in Growing Solid Tumors

Yaglom-type limit theorems for branching Brownian motion with absorption. by Jason Schweinsberg University of California San Diego

URN MODELS: the Ewens Sampling Lemma

arxiv: v1 [q-bio.pe] 27 Sep 2016

Genetic Variation in Finite Populations

Understanding Natural Selection

Chapter 5 Evolution of Biodiversity

Expected coalescence times and segregating sites in a model of glacial cycles

Expansion Under Climate Change: The Genetic Consequences

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009

1.3 Forward Kolmogorov equation

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT

Evolution in a spatial continuum

Analysis of the Seattle SNP, Perlegen, and HapMap data sets

Casey Leonard. Multiregional model vs. Out of Africa theory SLCC

Evolution & Natural Selection

Shift in the velocity of a front due to a cutoff

UNIT V. Chapter 11 Evolution of Populations. Pre-AP Biology

Estimating Evolutionary Trees. Phylogenetic Methods

ANCESTRAL PROCESSES WITH SELECTION: BRANCHING AND MORAN MODELS

The nested Kingman coalescent: speed of coming down from infinity. by Jason Schweinsberg (University of California at San Diego)

Lecture 14 Chapter 11 Biology 5865 Conservation Biology. Problems of Small Populations Population Viability Analysis

Eindhoven University of Technology MASTER. Population dynamics and genetics in inhomogeneous media. Alards, K.M.J.

Evolution of Populations. Populations evolve. Changes in populations. Natural selection acts on individuals differential survival. Populations evolve

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

Chapter 16: Evolutionary Theory

Darwin s Observations & Conclusions The Struggle for Existence

THE EVOLUTION OF POPULATIONS THE EVOLUTION OF POPULATIONS

Biology 11 UNIT 1: EVOLUTION LESSON 2: HOW EVOLUTION?? (MICRO-EVOLUTION AND POPULATIONS)

Citation PHYSICAL REVIEW E (2005), 72(1) RightCopyright 2005 American Physical So

Mathematical Population Genetics II

Gene regulation: From biophysics to evolutionary genetics

The Moran Process as a Markov Chain on Leaf-labeled Trees

Section 15 3 Darwin Presents His Case

Evolution Test Review

Concepts and Methods in Molecular Divergence Time Estimation

Supplementary Figures.

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Mathematical Population Genetics II

Perplexing Observations. Today: Thinking About Darwinian Evolution. We owe much of our understanding of EVOLUTION to CHARLES DARWIN.

The dynamics of small particles whose size is roughly 1 µmt or. smaller, in a fluid at room temperature, is extremely erratic, and is

Selection against A 2 (upper row, s = 0.004) and for A 2 (lower row, s = ) N = 25 N = 250 N = 2500

Processes of Evolution

Reproduction- passing genetic information to the next generation

Coalescent based demographic inference. Daniel Wegmann University of Fribourg

Chapter 16. Table of Contents. Section 1 Genetic Equilibrium. Section 2 Disruption of Genetic Equilibrium. Section 3 Formation of Species

The mathematical challenge. Evolution in a spatial continuum. The mathematical challenge. Other recruits... The mathematical challenge

- point mutations in most non-coding DNA sites likely are likely neutral in their phenotypic effects.

List the five conditions that can disturb genetic equilibrium in a population.(10)

arxiv: v2 [q-bio.pe] 20 Mar 2015

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences

Challenges when applying stochastic models to reconstruct the demographic history of populations.

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

Introduction to Advanced Population Genetics

MATH The Derivative as a Function - Section 3.2. The derivative of f is the function. f x h f x. f x lim

the role of standing genetic variation

EQ: How are genetic variations caused and how do they lead to natural selection?

the rate of change of velocity with time a graphical representation of the distribution of ages within a population

Stochastic Demography, Coalescents, and Effective Population Size

Anatoly B. Kolomeisky. Department of Chemistry CAN WE UNDERSTAND THE COMPLEX DYNAMICS OF MOTOR PROTEINS USING SIMPLE STOCHASTIC MODELS?

Big Idea #1: The process of evolution drives the diversity and unity of life

Fitness landscapes and seascapes

The implications of neutral evolution for neutral ecology. Daniel Lawson Bioinformatics and Statistics Scotland Macaulay Institute, Aberdeen

arxiv:nlin/ v1 [nlin.ao] 25 Sep 2000

Evolutionary Forces. What changes populations (Ch. 17)

The coalescent process

OVERVIEW. L5. Quantitative population genetics

Evolution by Natural Selection

Vertical transmission of culture and the distribution of family names

Extremely rare interbreeding events can explain Neanderthal DNA in modern humans

Mutation surfing and the evolution of dispersal during range expansions

Transcription:

Surfing genes On the fate of neutral mutations in a spreading population Oskar Hallatschek David Nelson Harvard University ohallats@physics.harvard.edu

Genetic impact of range expansions Population expansions have a profound effect on the genealogy of a species, because the gene pool for the new habitat is provided only by a small number of founder individuals. The genetic footprints of these pioneers are often recognizable today and provide information about the migrational history of the species. https://www3.nationalgeographic.com/genographic/ E.g.: In agreement with the out-of- Africa hypothesis, it is found that the neutral* genetic diversity of humans decreases roughly linearly with the distance from Africa. See, e.g., Ramachandran S, et al. (005) PNAS. (*Neutral mutations have no effect on the phenotype; act like inheritable

Abstract To assist the inference of migrations from spatially resolved genetic data, we study the dynamics of neutral gene frequencies in a population undergoing a continual range expansion in one dimension. During such a colonization period, lineages can fix at the wave front by means of a surfing mechanism*. We quantify this phenomenon in terms of (i) the spatial distribution of lineages that reach fixation and, closely related, (ii) the continual loss of genetic diversity at the wave front, characterizing the approach to fixation. Our simulations show that an effective population size can be assigned to the wave that controls the (observable) gradient in genetic diversity left behind the colonization process. This effective population size is markedly higher in pushed waves than in pulled waves, and increases only sub-linearly with deme size. We develop an analytical approach, based on the physics of reaction-diffusion systems, that yields simple predictions for any deterministic population dynamics. O. H. and David Nelson (007) http://arxiv.org/abs/q-bio/0703040 *Edmonds, C. A., Lillie, A.S., & Cavalli-Sforza LL. (004) PNAS. S. Klopfstein, M. Currat, and L. Excoffier, Mol. Biol. Evol. 3, 48 490 (006)

Surfing neutral mutations t 0 A neutral mutant (label red) arises in the wave front at lattice site i=9. Due to random number fluctuations, the genetic diversity is strongly reduced. By chance, red mutants dominate. t t f Fixation in the co-moving frame of descendents of red. ( successful surfing event) Numbers in these sketches represent inheritable labels that are used in our simulations to trace back the spatial origin of individuals in the wave front. For instance, descendents of red are associated with position 9 in the co-moving frame. The dashed blue frame indicates the comoving simulation box.

Statistical mechanics of gene surfing x Where do successful surfers originate? How fast does the genetic diversity decrease at the wave front?

Ancestral distribution (Simulation) P i Fig. 1. Distributions of the spatial origin of mutations that reach fixation (bell-curves) together with averaged wave profiles n i (sigmoidal curves) for two discrete lattice models of propagating fronts. In the continuum limit, (a) describes a stochastic Fisher wave with linear growth rate of s 0.1, diffusion constant of 1 and carrying capacity N. (b) Identical to (a) except that the effective growth rate is set to zero for ni N c.

Theory vs. simulation Deterministic Theory: P n expv i i (See Eq. (1), below) i Fig.. The data from Fig. b (thin lines) superimposed with predictions (thick lines) for the ancestral distribution function P i based on the theoretical expression (red box) that depends on the wave profile n i and the velocity v. The apparent systematic deviations in the pulled case ( N c 0 ) are caused by fluctuations at the tip of the wave.

Decrease in genetic diversity Fig. 3. H t Pr t c t is the probability that one has to go back at least a time t c (coalescence time) to find the most recent common ancestor of two individuals randomly chosen from the simulation box. Both models show that an exponential decay is reached asymptotically. By analogy with well-mixed populations, in which H (t) decays exponentially with rate / N (Moran model), it is convenient to express the decay rate as / Ne, i.e., in terms of an effective populations size N e.

Effective population size Theory 1 N e Pi / i n (See Eq. (), below) i Fig. 5. The measured effective population size as a function of carrying capacity N on a log-log scale for pulled waves ( N c 0, asterisks) and pushed waves ( N c 10, crosses). The dashed and dotted lines have slope.30 and.4, respectively, i.e., significantly smaller than 1. Triangles represent effective populations sizes as inferred from the strong migration approximation (red box). The inset shows the behavior of N e for varying cutoffvalue and fixed carrying capacity N=1000, again, on a log-log scale. N c N e

(Theory) Ancestral distribution Backward in time: Drift & diffusion, no reaction time t x,t G, x, t, space G, x, t : probability that a x,t particle at had an ancestor at, Fokker-Planck description: G, x, t D G VG D, Diffusion constant V U P v U, Drift coefficient Dlog lim G, x, t exp c, Deterministic approx.: c, c P : q-bio/0703040 U / D c expv D s / Ancestral distribution function s (1)

(Theory) Coalescence time t 0 distribution of two lineages Pr Pr Coalescence rate of two lineages. t Prt G Probability that the lineages have not coalesced earlier. c c s c Probability of meeting at the same place. (Strong migration approximation*) s d / Ne 1 t c e with N P / c O. H. and David Nelson (007) http://arxiv.org/abs/q-bio/0703040 e Rate of coalescence given the lineages are at the same place. d Effective population size *Nagylaki, T. (1980) J. Math. Biol. 9, 101-114 s t c x ()

(Appendix) No surfing on Fisher waves In the case of the deterministic Fisher wave (Fisher 1937), the wave profile obeys in the foot of the wave c exp vx / D const., for x. As a consequence, Eq. (1) is not normalizable and 0. P I.e., gene surfing is not possible on a deterministic Fisher wave, in contrast to our simulations of a stochastic Fisher wave, cf. Fig.1a. For an approximate description of the stochastic case, a heuristic cutoff* (function) in the growth rates may be introduced to ensure that the survival probability u of a mutant does not exceed the theoretical limit (O. Hallatschek, unpublished). P / c *Brunet, E. and Derrida, B. (1997) Phys. Rev. E 56, 597-604.