STAT 536: Migration. Karin S. Dorman. October 3, Department of Statistics Iowa State University

Similar documents
Microevolution Changing Allele Frequencies

Problems for 3505 (2011)

Microevolution 2 mutation & migration

Population Structure

Population genetics snippets for genepop

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency

Selection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION

8. Genetic Diversity

Chapter 16. Table of Contents. Section 1 Genetic Equilibrium. Section 2 Disruption of Genetic Equilibrium. Section 3 Formation of Species

Population Genetics & Evolution

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?

Processes of Evolution

Outline of lectures 3-6

Natural Selection results in increase in one (or more) genotypes relative to other genotypes.

Outline of lectures 3-6

The Wright-Fisher Model and Genetic Drift

Mathematical Population Genetics II

Mathematical Population Genetics II

The theory of evolution continues to be refined as scientists learn new information.

List the five conditions that can disturb genetic equilibrium in a population.(10)

STAT 536: Genetic Statistics

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Population Genetics I. Bio

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

Mechanisms of Evolution

URN MODELS: the Ewens Sampling Lemma

NOTES CH 17 Evolution of. Populations

Population Genetics: a tutorial

D. Incorrect! That is what a phylogenetic tree intends to depict.

overproduction variation adaptation Natural Selection speciation adaptation Natural Selection speciation

Outline of lectures 3-6

Evolutionary Genetics Midterm 2008

Strong Selection is Necessary for Repeated Evolution of Blindness in Cavefish

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

(Write your name on every page. One point will be deducted for every page without your name!)

A. Correct! Genetically a female is XX, and has 22 pairs of autosomes.

How to Use This Presentation

Mechanisms of Evolution Microevolution. Key Concepts. Population Genetics

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014

Evolution Module. 6.2 Selection (Revised) Bob Gardner and Lev Yampolski

Lecture 2. Basic Population and Quantitative Genetics

Lecture 2: Introduction to Quantitative Genetics

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Introduction to Natural Selection. Ryan Hernandez Tim O Connor

Mathematical modelling of Population Genetics: Daniel Bichener

Tutorial on Theoretical Population Genetics

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

The Genetics of Natural Selection

Inbreeding depression due to stabilizing selection on a quantitative character. Emmanuelle Porcher & Russell Lande

Biology 213 Summer 2004 Midterm III Choose the most correct answer and mark it on the scantron sheet. (2 pts each)

Microevolution (Ch 16) Test Bank

POPULATIONS. p t+1 = p t (1-u) + q t (v) p t+1 = p t (1-u) + (1-p t ) (v) Phenotypic Evolution: Process HOW DOES MUTATION CHANGE ALLELE FREQUENCIES?

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Evolutionary quantitative genetics and one-locus population genetics

Evolution Test Review

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

Evolution of Populations. Chapter 17

BIOLOGY FINAL EXAM REVIEW

Mathematical models in population genetics II

Educational Items Section

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

Introduction to population genetics & evolution

Case Studies in Ecology and Evolution

I of a gene sampled from a randomly mating popdation,

Introductory seminar on mathematical population genetics

Evolution of Populations

Long-Term Response and Selection limits

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Quiz Section 4 Molecular analysis of inheritance: An amphibian puzzle

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity,

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation

Genetics word list. the molecule which contains genes. This will be looked at in more detail. it is shaped in a double helix (spiral)

STUDY GUIDE SECTION 16-1 Genetic Equilibrium

How robust are the predictions of the W-F Model?

Evolution. Changes over Time

Study of similarities and differences in body plans of major groups Puzzling patterns:

Evolutionary Forces. What changes populations (Ch. 17)

Lecture 4: Allelic Effects and Genetic Variances. Bruce Walsh lecture notes Tucson Winter Institute 7-9 Jan 2013

Q Expected Coverage Achievement Merit Excellence. Punnett square completed with correct gametes and F2.

UNIT V. Chapter 11 Evolution of Populations. Pre-AP Biology

STAT 536: Genetic Statistics

Mechanisms of Evolution. Adaptations. Old Ideas about Evolution. Behavioral. Structural. Biochemical. Physiological

Reinforcement Unit 3 Resource Book. Meiosis and Mendel KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Chapter 13 Meiosis and Sexual Reproduction

Formalizing the gene centered view of evolution

Theory of Natural Selection

Enduring Understanding: Change in the genetic makeup of a population over time is evolution Pearson Education, Inc.

Darwin s Observations & Conclusions The Struggle for Existence

Genetic hitch-hiking in a subdivided population

Biology 20 Evolution

Genetics and Natural Selection

Evolution. Before You Read. Read to Learn

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50

Big Idea #1: The process of evolution drives the diversity and unity of life

How did it all begin?

Since we re not going to have review this week either

Population Genetics of Selection

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

Transcription:

STAT 536: Migration Karin S. Dorman Department of Statistics Iowa State University October 3, 2006

Migration Introduction Migration is the movement of individuals between populations. Until now we have considered a single, panmictic population. (Panmictic means the population is completely mixed; there are no subpopulations or substructures within the population.) Now, we consider a population with immigration and emmigration.

Allele Migration Model Assume that who migrates is determined independent of genotype. Assume two populations: the one we study and another one from which migrants arrive. Assume both of those populations are panmictic. Then, let m be the fraction of the population we study in each generation that are new immigrants. We assume m is not dependent on time, so each generation immigrants make up the same proportion of the population. We need to track allele frequencies in two populations. Suppose the allele frequency of a gene of interest in our study population is p 1 and in the foreign population is p 2. The recursion relation for the allele frequency in our study population is p 1 (t + 1) = (1 m)p 1 (t) + mp 2 (t).

Extension to Multiple Alleles or Populations The equation p 1 (t + 1) = (1 m)p 1 (t) + mp 2 (t). applies to any number of alleles. All alleles satisfy the same equation. We need only determine the allele s frequency in both populations. If we allow for n distinct populations, with population 1 our study population, then we need to define n m i migration proportions, where m i is the proportion of the study population that came from population i. Then ( ) n n p 1 (t + 1) = 1 m i p 1 (t) + m i p i (t). i=2 i=2

Migration at the Genotype Level In diploid populations, the immigrants are diploid and carry two genes in with them. Let P (k) ij (t) be the frequency of genotype ij in population k at generation t. Then, for two populations P (1) ij (t + 1) = (1 m)p (1) ij (t) + mp (2) ij (t). What does migration do to Hardy Weinberg equilibrium? It can disrupt it. To see this, consider the example where P (1) AA (t) = 1 and (t) = 1. Then, after one generation of migration, P (2) BB P (1) AA (t + 1) = (1 m) P (1) BB (t + 1) = m. Is this population in Hardy Weinberg equilibrium?

Whalund Effect In general, for multiple populations each with allele A frequency p i, we have P (1) AA (t + 1) = m i P (i) AA (t), i where we write m 1 = 1 n i=2 m i. If the populations are each at HWE, then P (1) AA (t + 1) = i m i p 2 i (t). Now, examine the above equation. The right side is just the definition of the expectation of a discrete random variable p, where P(p = p i ) = m i. So, we can write P (1) AA (t + 1) = E ( p 2), where the expectation is taken over populations.

Whalund Effect II Now, recall the study population has allele frequency after migration of ( ) n n n p 1 (t + 1) = 1 m i p 1 (t) + m i p i (t) = m i p i (t), i=2 but this is recognizable as i=2 p 1 (t + 1) = E(p). If the study population were in HWE, then P (1) AA (t + 1) = [E(p)]2, but it isn t and the deviation from HWE is i=1 P (1) AA (t + 1) [E(p)]2 = E ( p 2) [E(p)] 2 = Var[p(t)].

Whalund Effect III We conclude then, that P (1) AA (t + 1) = [E(p)]2 + Var[p(t)]. In words, the frequency of homozygotes in a population receiving immigrants from multiple population is increased over Hardy-Weinberg expected frequencies by a factor equalling the variance in allele frequencies across different populations and weighted by their relative contributions. This is yet another reason why we may not observe HWE in a sample from a population. If instead of sampling from one population, we are actually sampling from multiple populations in proportions m i, then the presence of homozygotes will be increased above expectation. In the two allele case, heterozygotes will be deficient. In the multi-allele case, some heterozygotes may be more common than expected, but overall there will be fewer heterozygotes than expected under HWE.

Life of Whalund Effect How persistent is the Whalund effect? It is true immediately after migration, but what happens if migration stops and this population is allowed to randomly mate? What happens if sperm and eggs are the objects that are migrating, not adults?

Linkage Disequilibrium and Migration We have already seen an example of mixing two populations in linkage equilibrium that would create huge linkage disequilibrium. Consider a population of AACC individuals and another with BBDD individuals. Then, after equal mixing the population produces only AC and BD gametes and now AD or BC gametes though they should be equally likely. To consider the case in greater detail, let p i be the frequency of allele A at locus one in population i and q i be the frequency of allele C at locus two in population i. Consider an individual producing gametes. If the individual immigrated from population i, and population i is in linkage equilibrium, then this individual will produce gamete AC with probability p i q i. Overall, n P (1) AC (t + 1) = i=1 m i p i q i.

Linkage Disequilibrium and Migration The allele frequencies in the study population are as usual p (1) A (t + 1) = E (p) p (1) C (t + 1) = E (q). The amount of linkage disequilibrium is n AC (t + 1) = m i p i q i E (p) E (q) D (1) i=1 = E (pq) E (p) E (q) = Cov (p, q). So the amount of linkage disequilibrium in the study population is given by the covariance in the allele frequencies over populations, weighted by the relative contributions of those populations to the study population.

Gene Flow Migration can introduce a very temporary Hardy Weinberg Disequilibrium that disappears after one generation of random mating. Migration can introduce linkage disequilibrium that can disappear slowly with random mating. Eventually, a new Hardy Weinberg and linkage equilibrium will be achieved. The only long-term impact of migration is to change the allele frequencies in the study population. The new allele frequencies are the average of allele frequencies in the populations. Since migration in the long-run only changes allele frequencies, migration is often referred to as gene flow and the populations that are receiving or contributing alleles are called gene pools.

Estimating Migration If the three populations are identified, the two sources of immigration and the mixture population that results, and the allele frequencies in each population are known, then the rate of migration is simple to estimate using the recurrence relation Solve for m to obtain p 1 (t + 1) = (1 m)p 1 (t) + mp 2 (t). m = p 1(t + 1) p 1 (t) p 2 (t) p 1 (t) Exercise. How would you get a variance on this one? Application: Estimating flow of a biogenic gene used in a crop plant to surrounding wild plant populations.

One-Island Model p i ( t ) m p c m is the migration rate from the continent, p c is the fixed allele frequency on the continent, and p i (t) is the variable allele frequency on the island. In the one-island model, there is a small island neighboring a large continent with different allele frequencies. Migration from island to continent is too small to affect the continent, but migration from continent to island changes allele frequency on the island. The recurrence relation for allele frequency on the island is p i (t + 1) = (1 m)p i (t) + mp c,

One-Island Model Equilibrium We would like to determine the equilibrium gene frequency on the island p i (t) p i. Solve the equation by dropping dependence on t, p i = (1 m)p i + mp c yields p i = p c. So, eventually the island allele frequency will match the continent allele frequency. The rate of approach to equilibrium is determined by m. To see, consider p i (t + 1) p c = (1 m)p i (t) p c + mp c = (1 m)(p i (t) pc). So, each generation the amount of disequilibrium p i (t) p c declines by fraction m. The more migration, the faster the island matches the continent.

Island Model p ( t ) 2 m n 1 p ( t ) 1 p ( t ) 3 p ( t ) 4 m n 1 is the migration rate between islands and p i (t) is the variable allele frequency on the ith island. Assume there are n islands. In each generation, every island provides a fraction m n 1 to every other island, i.e. a fraction m of the population is in flux every generation, and there is completely balanced exchange of individuals between islands. Population i allele frequency is is updated as p i (t + 1) = (1 m)p i (t) + j i m n 1 p j(t).

Island Model Equilibrium At equilibrium, the allele frequency in population i is p i and the following equation is satisified. p i = (1 m)p i + m n 1 p j j i ( = 1 m m ) p i + m p j n 1 n 1 j ( = 1 mn ) p i + mn n 1 n 1 p p i = p. In addition, one can show p(t) = 1 n i p i(t) is unchanging with respect to time. So, the equilibrium allele frequency that will settle on every population is the average allele frequency across all islands. Migration does not change the overall allele frequency, just its distribution around the islands.

Island Model Rate of Approach to Equil. The rate of approach is p i (t + 1) p = = ( 1 mn ) p i (t) + n 1 ( 1 mn n 1 ) (p i (t) p), mn p(t) p n 1 because p(t) = p. So the rate of approach is like that of the one-island model with m = mn n 1. Notice, that for n large, mn n 1 m, so the models are essentially equivalent when there are lots of islands.

Stepping-Stone Model In this model migration only occurs between neighbors. For the m linear model 2 of the population in the next generation comes from each neighbor. For the 2- dim model m 4 of the population in the next generation comes from each neighbor. More complex models of migration exist, for example, where the migration rate m(d) depends on the distance d between populations.

General Migration Matrix Assume n populations and let m ij be the fraction of individuals in population i that are new immigrants from population j at each generation. Then, we can write p(t + 1) = Mp(t), where p(t) = (p 1, p 2,..., p n ) t is the vector of allele frequencies in the n populations and superscript t indicates transpose. Equivalently, we can let m ij be the fraction of population i individuals that emigrate to population j each generation. Then, p(t + 1) t = p(t) t M.

Summary of Migration All populations achieve the same equilibrium allele frequency if at every generation each population retains some of its population (not everyone emigrates) and all populations can receive immigrants from all other populations, even if it takes multiple generations. (Mathematically: If the Markov chain M is irreducible and ergodic, then there exists a limiting distribution independent of the starting state, and since all alleles achieve the same equilibrium distribution, the allele frequencies will be the same across islands.) The rate of approach to equilibrium is controlled by m. In general, migration eliminates geographic differences and the speed with which it does this is determined by the amount of migration m.

Migration and Selection As before, we are now interested in how migration interacts with other forces. In particular, selection. Migration, like mutation acts to homogenize populations. Selection is the only force that can act in the opposite direction. For spatial migration, selection can lead to more spatial heterogeneity when the most fit variant depends on geographic location.

One-Island Haploid Suppose in the one-island model allele B is fixed on the contintent (allele A has been eliminated by selection against it). On the island, allele A is favored by selection. Let the relative fitnesses on the island be 1 for A and 1 s for B. If there were no immigration, allele A would dominate on the island and would achieve mutation/selection balance. Immigration disrupts equilibrium. How much immigration is allowed while still maintaining allele A on the island? Let the allele frequency of A start at p on the island. After selection, the allele frequency of allele A is p = p 1 (1 p)s. After migration, the allele frequency becomes p = (1 m)p + m 0 = (1 m)p = p(1 m) 1 (1 p)s.

One-Island Haploid We recognize the equation as the one for mutation/selection balance in a haploid population. We know that at equilibrium either p e = 0 or p e = 1 m s, where migration rate m has replaced mutation rate u. When m s, then the second equilibrium predicts p e 0, so p e = 0 is the only valid equilibrium. In other words, migration will overwhelm the efforts of selection and will eliminate allele A. When m < s, p e = 1 m s will be a stable equilibrium (not shown). In this case, selection dominates migration and is able to maintain some beneficial A allele on the island. Note, that the island population is experiencing fitness degradation because of immigration from the island. It would be to their benefit to not intermix with incoming immigrants and, assuming the immigrants compete for the same resources, even prevent them from arriving!

One-Island Diploid Dominant Now consider a diploid population. We need to carefully define when immigration happens. Suppose selection occurs first, then migrants arrive. We consider the diploid dominant where AA and AB individuals have fitness 1 and BB individuals have fitness 1 s. Then the update equations for the allele frequency of allele A are p = and the change in one generation is The change is positive if p(1 m) 1 s(1 p) 2. p p = p ( m + s(1 p) 2) 1 s(1 p) 2. m < s(1 p) 2.

One-Island Diploid Dominant m < s(1 p) 2 When m > s, the above condition cannot be met and allele A will decline in the population until it is eliminated. When m < s, selection is able to control migration and there is a stable equilibrium m p e = 1 s.

One-Island Diploid Recessive Now consider a recessive allele such that AA individuals have relative fitness 1 and AB and BB individuals have relative fitness 1 s. The update equations for allele frequency of A with migration after selection is p p(1 m) [1 (1 p)s] = 1 [2p(1 p) + (1 p) 2 ] s, leading to per-generation change of p p = p { (1 m) [1 (1 p)s] 1 + [ 2p(1 p) + (1 p) 2] s } 1 [2p(1 p) + (1 p) 2 ] s = p { (1 m) [1 (1 p)s] [ 1 s(1 p 2 ) ]} 1 [2p(1 p) + (1 p) 2. ] s Whether the change is negative, positive, or there is no change depends on (1 m) [1 (1 p)s] <=> [ 1 s(1 p 2 ) ].

One-Island Diploid Recessive p = 0 is always a stable equilibrium. There are two additional equilibria when s 2 (1 m) 2 4ms(1 s) > 0. When s 1 and m 1, then there are two equilibria if s > 4m. Only the large of the two is a stable equilibrium. Equilibrium p = 0 will attract if allele frequency of A is too low. Equilibrium p > 0 will attract if allele frequency of A is high enough.

Locally Adaptive Recessive Alleles Locally advantageous alleles (alleles that are favored in limited geographic areas) are not expected to be recessive alleles. If there are locally favored recessive alleles, they should not appear at frequencies less than 1 2. To see this, consider the maximum of the parabola (p = 1 m 2 1 2 ).

Migration and Selection When migration dominates selection, a locally adaptive allele cannot be maintained in the local population. When selection dominants migration, a haploid or diploid dominant allele can be maintained in the local population. When selection dominants migration, a diploid recessive allele can only be persist in the local population if it is initially present at high frequency (above about 1 2 ).