MOLECULAR EVOLUTION AND POLYMORPHISM IN A RANDOM ENVIRONMENT

Size: px
Start display at page:

Download "MOLECULAR EVOLUTION AND POLYMORPHISM IN A RANDOM ENVIRONMENT"

Transcription

1 MOLECULAR EVOLUTION AND POLYMORPHISM IN A RANDOM ENVIRONMENT JOHN H. GILLESPIE Department of Biology. Uniuersity of Pennsylvania, Philadelphia, Pennsylvania Manuscript received June 8, 1978 Revised copy received February 20,1979 ABSTRACT A model of multi-allelic selection in a random environment, the SAS-CFF model, is examined for compatability with allele frequency and genetic distance data acquired by electrophoresis. The symmetric version of the model tends to predict higher than observed levels of polymorphism unless substantial positive autocorrelations in the environment are postulated. The actual allele frequency configurations observed in nature are in rough agreement with those predicted by the SAS-CFF model. An approximate analysis of the transient properties of the SAS-CFF model shows that, in broad outline, the behavior is quite similar to that of the neutral model. ONE of the major accomplishments of the neutral allele theory has been to provide a model that can simultaneously account for enzyme polymorphism and molecular evolution. In fact, under this model, enzyme polymorphism is viewed as a phase of molecular evolution (KIMURA and OHTA 1971). To date, models of natural selection have not achieved this broad level of applicability. Most work has focused on genetic polymorphism and has ignored the question of molecular evolution. This contributes, no doubt, to the increasing frequency with which evolutionary patterns are explained in the context of the neutral allele theory. The major technical obstacle to developing theories of evolution and polymorphism in selectionist models is the high dimensionality of the state space. Both asymptotic and transient states of multidimensional diffusions are difficult to describe. In the neutral model, this is not a serious problem since the frequency of any particular allele is a Markovian variable whose properties can give quite a bit of information about the entire process. In selectionist models, individual allele frequencies are not Markovian variables, making the techniques used to investigate the neutral model inapplicable. There are, however, certain ways of examining selectionist models that provide insight into their evolutionary dynamics. These ways will be exploited in this paper to describe the dynamics of the so-called SAS-CW model of selection in a random environment (GILLESPIE 1978). The SAS-CFF model was developed to account simultaneously for observations on the quantitative genetics of fitness traits and on enzyme polymorphisms. The model incorporates a stochastic additive scale (SAS), which is a stochastic process Genetics 93 : November, 1979.

2 738 JOHN H. GILLESPIE reflecting spatial and temporal fluctuations in the environment, as well as the additive nature of enzyme activity in heterozygotes; and a concave fitness function (CFF), which maps the stochastic additive scale to fitness. More formally, we could describe the one-locus model by a series of postulates: (1) Each homozygous genotype is characterized by a random variable, Yi (n,t), which will be called the enzyme activity of the genotype. The enzyme activity will generally fluctuate at random in both time and space, the arguments referring to the nth patch in the Tth generation. The i refers to the ith allele. (2) The enzyme activity of a heterozygote is exactly intermediate between the activities of the two associated homozygotes. If we refer to the activity of the A,A, homozygote as Yi(n,T), then the activity of the A,Aj heterozygote will be [Yi(n,T) + Yj(n,T')]/2. (3) There exists a function, (~(y), which maps the activity of the enzyme onto the fitness of the genotype. It is assumed that this fitness function is strictly monotonically increasing, is concave throughout its domain, and approaches a finite limit as y+w. That is, +'(y)>o,+"(y)<o, and +(y)<k<w. The compatability of this model with experiments on the quantitative genetics of fitness traits is covered in GILLESPIE (1978). The dynamic properties of the SAS-CFF model, like those of the neutral model, can be approximated by a multidimensional diffusion process with a single parameter, B. In order to achieve this approximating diffusion model, various assumptions about the moments of the Yi(n,T) and about the structure of the environment must be made. In this paper, we will deal only with the symmetric SAS-CFF model, so that we will assume that Thus, all homozygotes have the same mean and variance of their enzyme activities. The parameter T is introduced as a scaling factor that will be allowed to approach zero in order to arrive at the limiting diffusion model. The form of the diffusion is insensitive to the assumptions made about the structure of the environment. The state space of this process is the vector X(t) = [X,(t), X2(t),...,XK(t) 1, where Xi (t) represents the frequency of the ith allele. Since there are K segregating alleles, K z X,(t)=l. j=1 The approximating diffusion model is defined by its drift and diffusion coefficients which are given by

3 MOLECULAR EVOLUTION AND POLYMORPHISM 739 is the homozygosity of the population (GILLESPIE 1977). The asymptotic density of this process is the Dirichlet distribution r(2b-2) IC (2B-2)/K-l ri xi r ( (2B-2) /K) i=i The expected homozygosity at equilibrium is given by (5) EF=-f B-2 2B--1 K 2B-1 As K+ m, that is, as we go from a finite to an infinite-allele model, the Dirichlet distribution approaches the Poisson-Dirichlet limit, and the expected homozygosity approaches 1 EF=-. 2B- 1 Thus, the asymptotic densities of the finite and infinite allele versions of the SAS-CFF model are of the same form as the symmetric neutral allele model densities. This paper will explore three aspects of the SAS-CFF model. First, data on enzyme variation will be used to place bounds on the probable numerical value of B. Second, the initial rate of change of the genetic distance between two isolated populations will be examined in order to compare the evolutionary rate of the SAS-CFF model with that of the neutral model. Third, the transient distribution of the genetic identity of two isolated populations will be examined. The general conclusions are that evolutionary dynamics of the SAS-CFF model are similar to those of the neutral model and could not be distinguished from the neutral model at the level of resolution of the currently available data on molecular evolution or on the distribution of genetic distances between taxa. In the symmetric SAS-CFF model, the parameter B plays a role analogous to 8 in the neutral model. In fact, when 2(B-1) equals 8, the asymptotic densities of both models are identical. The unfortunate aspect of the parameter B is that it does not represent a simple biological quantity as 8=4Nu does. In general, the values of B will depend on the amount of dominance, on the amount of temporal autocorrelation in the environment, and on the amount and kind of environmental subdivision of the environment. (For the details of this dependence, see GILLESPIE 1978). These details need not concern us, however, when using certain kinds of data to estimate the numerical value of B. One approach to the estimation of B would be to apply the estimation procedure for 6' developed by EWENS (1972) for the neutral model. Rather than pursuing this approach, we will put bounds on B by the same method that LEWONTIN (1974) used to place bounds on 8. This method uses the observation that average heterozygosities from natural populations fall within the range to LEWONTIN (1974, Figure 18) showed that this range corresponds to B values between and Similarly, we can argue that this (6) (7)

4 740 JOHN H. GILLESPIE 0 a0 - alleles - exact 10 - alleles - approx (B -1) FIGURE 1.-The relationship between B-1 and heterozygosity. The dots represent the relationship for the approximate process (35) with K = 10 alleles. corresponds to B values of to (Figure 1). To arrive at this use, (7) to derive the expected heterozygosity I EH is the function plotted in Figure 1. How can this range of B values be reconciled with the underlying biology of the SAS-CFF model? To explore this question, we will first examine the B value for an infinite-allele model where there are nonautocorrelated temporal fluctuations in an environment with no spatial heterogeneity. In this case B = 2 ( 1 -+"/2+") (9) (GILLESPIE 1978), where + is the fitness function with the derivatives evaluated at one. It was argued in GILLESPIE (1976) that, for the fitness function the data on the quantitative genetics of viability suggest that.a =: Substituting this into (9), we get B = 2(1+cr-l) (11)

5 MOLECULAR EVOLUTION AND POLYMORPHISM 74 1 This would correspond to an average heterozygosity of about 0.9, which is far above the range observed in nature. In a sense, we are in the same dilemma that AYALA, POWELL and TRACY (1972) put the neutral-allele hypothesis into by suggesting that there were fewer alleles segregating in natural populations than the theory predicts. In our case, however, there are other models of environmental fluctuations that can lower the B value to give better agreement with the B value estimated from enzyme heterozygosities. Among these possibilities are asymmetries in the enzyme activities or nonequilibrium. These possibilities will be investigated in another paper. Here we will consider only possibilities that are possible within the context of the symmetric, equilibrium SAS-CFF model. For symmetric SAS-CFP models, environmental subdivision has the effect of raising the B values, so that the addition of a patchy structure can only make matters worse. On the other hand, positive autocorrelations in the environment have the effect of lowering the B value. For a single patch in a weakly autocorrelated environment (terminology of GILLESPIE and GUESS 1978), the B value is given by B=~(~-~/cY)/(~-cz), (12) where a is a measure of the strength of the autocorrelation. The values of a range between zero for a completely autocorrelated environment (i.e., nonfluctuating) through one for independent environments. Values larger than one correspond to negatively autocorrelated environments. Given a 0.06, we require a =: in order to have a B value of 1.1, which is typical of the enzyme polymorphism data. This indicates that one viable explanation for the low heterozygosity observed in nature would be that the environmental fluctuations are positively autocorrelated. Of course, if further work on hidden alleles reveals generally higher heterozygosities, then we would lower the a value appropriately. If the data are so extreme as to yield B values in excess of 35, then we could invoke environmental subdivision to raise the value. Finally, if we incorporate restricted migration, this could also result in either higher or lower heterozygosities, depending on the structure of the fluctuations. Until the data on polymorphism are more complete, a final decision on the nature of the environmental fluctuations must be postponed. However, it is clear from the above that the SAS-CFF models can, in principle, make use of two sorts of data-those on the quantitative genetics of viability and on enzyme polymorphism-to make critical inferences on the process of selection in natural populations. In the foregoing, we used enzyme data to estimate B, but this, in itself, does not provide a test of the agreement of the data with the Poisson-Dirichlet density of the infinite-allele SAS-CFF model. However, since the neutral-allele model also has an asymptotic Poisson-Dirichlet limit, the recent work of FUERST, CHAKRABORTY and NEI (1977), which argues for the agreement of enzyme data with the neutral-allele model, may also be used to argue for agreement of these data with the SAS-CFF model. In particular, the mean and variance of heterozygosity for the SAS-CFF model: 2B-2 4(B-I) EH=--- VAR H = 2B- 1 (2B-1) '2B(2B+1)

6 742 JOHN H. GILLESPIE show the same functional relationship to each other as they do in the neutralallele model. It is this relationship that FUERST, CHAKRABORTY and NEI (1977) exploited in their test of the agreement of data with the Poisson-Dirichlet distribution of allele frequencies. Thus, two claims about the variable-environment selection model by FUERST, CHAKRABORTY and NEI (1977) are clearly incorrect: (1) In this hypothesis, the agreement of the observed variance and the expected variance of the mutation-drift hypothesis must be accidental. (2) According to this hypthesis, the heterozygosity for a given locus should not vary extensively among related species. These two statements appear on page 475 of FUERST, CHAKRABORTY and NEI (1977). NEI, CHAKRABORTY and FUERST (1976) made an intriguing modification of the neutral-allele model that allows for locus-by-locus variation in the mutation rate. In particular, they assumed that the mutation rate was gamma-distributed and estimated the mean and variance in the rate using data on the rates of evolution of proteins. In a similar way, we could allow locus-by-locus variation in B values, although at the present time we are unable to find an independent method of estimating the distribution of B. This variation in B would reflect the very plausible notion that certain loci experience types of environmental fluctuation different from that of other loci. If B were gamma-distributed, then the agreement of data with the variable-mutation rate neutral model would argue equally for a variable-b SAS-CFF model. This point will be taken up again when the distribution of genetic distances is examined in the next section. As a final point, it is often argued that there should be a simple relationship between the variance in the environment and the level of heterozygosity in variable environment models. For example, OHTA (1976) uses the fact that attempts to find correlations between heterozygosity and ecological conditions have not been successful to argue against the relevance of positive Darwinian selection to the evolution of enzymes. More recently, POWELL and WISTRAND (1978) claimed that there should be a positive correlation between the heterogeneity of environments and the amount of genetic variation of populations living in them. However, the value of B, and consequently the average heterozygosity, are independent of the variance U*. Thus, in the symmetric SAS-CFF model, we would not expect a relationship between U* and H. This somewhat surprising observation stems from the fact that the variance enters both the drift coefficient and the diffusion coefficient of the diffusion model as a constant multiple. Increasing the variance increases both the centripetal and centrifugal aspects of the process in direct proportion, so that the two influences exactly cancel. This property does not hold, however, for asymmetric models. GENETIC IDENTITY There is presently available a large body of data on the genetic identity and/or distance between semi-isolated populations and between various taxons. CHAKRA- BORTY, FUERST and NEI (1978) have recently examined these data and have

7 MOLECULAR EVOLUTION AND POLYMORPHISM 743 shown that the major features of the data are in essential agreement with the statistical properties of the neutral-allele model. It would be interesting to see if the dynamic properties of the genetic distance between two isolated SAS-CFF populations behaved in a similar fashion. Unfortunately, the transient solution to the diffusion (3) is not available, so that we must resort to an approximate treatment of the dynamic behavior of genetic distance. Before undertaking this approximate treatment, we will first make an effort to set a time scale for change under the SAS-CFF model. We begin by establishing the definition of genetic identity that will be utilized. In this, we follow NEI (1972) exactly. Consider a pair of populations with the ith allele in one of the populations being Xi and the frequency of the same allele in the other population being Yi. For this locus, the primitive statistics are the random variables jz and jy are the homozygosities of the two populations, while jzzr is the probability of identity of two alleles, one chosen at random from each of the two populations. These random variables were used by NEI (1972) to define two different measures of genetic identity. In the first instance, we have the normalized genetic identity of the jth locus, which is the random variable In the second instance, we have what NEI ( 1972) called the normalized identity of genes with respect to all loci, which is the ratio of expectations Of course, EIjZI. To set the time scale of the change of genetic distance under the SAS-CFF model, we will examine the instantaneous rate of change of I at the first moment of isolation of two populations that are derived from a single equilibrium population. Since the process starts at equilibrium, we have I We cannot calculate Ejxy, but we can calculate which is the mean rate of change of jsy averaged over the asymptotic Dirichlet distribution (5). To calculate the mean rate, we note that

8 744 JOHN H. GILLESPIE In this calculation, we use the fact that the second-order derivatives with respect to xi or yi are zero. We now take the expectation of (20) with respect to (5) and let K+ 00, giving -2(B-1) - -2 (B-I ) - * EF = (2B-1) (2Bfl) 2Bfl To obtain the rate of change of I, we divide by EF giving -2(B-I) 2Bfl This rate, which lies between zero and one, is analagous to the rate -0 = -4Na, which is obtained from the normalized neutral model. In the neutral model, time is measured in units of 2N generations, so that the rate in real time is given by -e 2N- Y (23) as given by NEI (1972). In deriving the diffusion equations for the SAS-CFF model, the time scale was also normalized. For a model with a single patch plus autocorrelations, the normalization was written as (GILLESPIE 1978), were T is a variable that is introduced for technical reasons. If we suppress T and let u2 be the actual variance rather than U ~ being T the actual variance, we can write the real time rate as v = -2 (B-I ) u2. (1-p) 4 2 (2/a- 1 ) 2Bfl 2. In this expression we know, approximately, that B =: 1.1, + 2 =: and a =: 0.067, giving IvI u2(1-p). (2.6) We are left with an expression for the rate of change of I at the moment of isolation that depends on the parameters u2 (1-p). If we knew the average rate of change of genetic identity at the moment of isolation, we could estimate these parameters. Since this information is not available, we can at least ask the following question: What value would m2 (l-p) have to have in order for the rate of change of genetic identity under the SAS-CFF model to equal the rate of change under the neutral model? The neutral rate of change is obtained from gene substitution data and is generally estimated on a per cistron level to be Iv/ = lo- to IO+. For the desired agreement, we require = 10-4 to (27)

9 MOLECULAR EVOLUTION AND POLYMORPHISM 745 If p = 0, this would correspond to a standard deviation of enzyme activity of roughly 0.01 to These calculations are necessarily crude, but indicate that the SAS-CFF model, with the same asymptotic density as the neutral model, initially evolves at about the same rate as the neutral model if U' (1-p) = 1 e4 to Thus, the SAS-CFF model can share the property of polymorphism, plus slow evolutionary change, which was thought to be uniquely characteristic of the neutral model. We now turn to the distribution of Zj between two isolated populations that are drawn from an equilibrium SAS-CFF population. We will use an approximate analysis that has tremendous potential in the present context, as well, as for other problems associated with transient states of the SAS-CFF model. The basic idea is to find a process similar to (3), but whose complete transient solution can be obtained. If we write the drift coefficient of (3) in the form Edxi = [zi(f-zi)+(b-l)zi(f-zi)]dt, (28) it is tempting, given the smallness of (B-1), to seek an approximate process of the form Edzi = [zi(f-~i) + (B-l)g(~)]dt, (29) where g(z) is a convenient function. In this way as (B-l)-tO, the original and the approximate processes approach the same process and, presumably, each other. To arrive at a convenient approximating process whose transient densities can be explicitly determined, we first transform (3) to the new process ~i =ln(~i/~k), i=l, 2,... K-1. (30) Then it is easy to show that the new process has drift and diffusion coefficients Edui = [(B-I) (l-eui)/(lf~eu~)]dt Eduiduj = (l+sij)dt. If we now linearize the drift coefficient around ui = 0, we get a new process with the same diffusion coefficient, but with the new drift coefficient -(B-1) Edui = ui. K This process is a (K- 1 )-dimensional Orstein-Uhlenbeck process of a particularly simple form. Its transient densities are Gaussian with mean and second order moments -(B-l) -t Eui(t) = ui(o>e K VAR ui(t) =- B-I COV[Ui(t>, U j(t)] = -2(B-1) t [I-e ] K 2(B-1) -2 IB-1),, t [I-e 1. (33) ( 34)

10 746 JOHN H. GILLESPIE This process can now be transformed back to the allele frequency process. When this is accomplished, we get the approximating process 1 " Edzi {zi(f-zi) + (B-1)- [ X Zj lnzj - lnzi]}dt K ~=1 EdZidzj =zi~j(sij 4- F- ~ i zj)dt -. (35) This approximating process has the same diffusion coefficients as the original process, but differs in the form of the drift coefficient. For B 1, we expect this process to behave very much like the SAS-CFF process. Since we know the solution to the process ui, we can easily obtain the transient solution to hte process zi. Let where A= 2(B-I) K -A t mi(t) = In[zi(O)/q(O)]e2. We can now write the transient density of (35) as where yt = dk [27r~-1( I -e-it )] (K--1)/2 The asymptotic density of (35) is given by (38) with (37) (38) (39) These densities are the multivatriate extensions of the Se distributions given by JOHNSON 1949). Unfortunately, the moments of these distributions are quite difficult to calculate even in the univariate case. However, because these distributions are simple transformations of Gaussian distributions, it is quite easy to obtain any desired information from a Monte-Carlo approach, using a computer. Before using this approximate process to obtain information on the distributions of Zj, it will be useful to examine the accuracy of the approximation. No analytic results on this question are presently available, but some indirect evidence is encouraging. In the two-allele case, Figure 2 shows how the drift coefficients of the two processes compare for B = 1.5 and B = 1.1. It is imposible to distinguish the two processes on a graph of this resolution for B = 1.l. However, it is possible to show that the approximating process errs in the direction of providing too great a centripetal push to the allele frequencies. Thus, we would expect to see slightly

11 +I' a, 0.I.I yl o s E 0' -0.1 MOLECULAR EVOLUTION AND POLYMORPHISM B=l.5 approx. - B=1.5 exact _.-._ B=l.l approx. and exact o Allele Frequency FIGURE 2.-A comparison of the drift coefficients for the two-allele version of the exact and approximating process. higher heterozygosities than predicted by the exact model, as well as a lower variance in heterozygosity. This is illustrated in Figures 1 and 3. For B close to one, the agreement is quite good, although it deviates in the expected direction of excessive heterozygosity. For large B, the heterozygosity is too low. This is due to the fact that there are a finite, rather than an infinite, number of alleles in the approximating process. For B close to one, the finiteness of the number of alleles becomes irrelevant. The value of the approximating process lies, in part, in the ease with which it can be used to generate allele-frequency configurations on a computer. This property will now be exploited to examine the distribution of Zj. The simplest way to generate this distribution is first to generate equilibrium configurations, and then to generate two further configurations with the equilibrium configuration as the initial point and with the desired value of t. The random variable Zj is then calculated for these two populations. This experiment is repeated many times and the distribution of Zj may be displayed as a histogram. In carrying out this approach, all of the computations are routine except for the desirability of a fast normal random variate generator. In these Monte-Carlo calculations, the decomposition method given in NEWMAN and ODELL (1971) was used to generate the normal deviates. The results for B = 1.1 are shown in Figure 4. The results are presented for

12 748 JOHN H. GILLESPIE 0.08 LL rc Q1 0 c.- Q o F FIGURE 3.-The line is the relationship between the mean and variance of F for the Poisson- Dirichlet limit; the dots for the results of the approximating process (35). all trials, as well as those trials for polymorphic populations that are trials in which at least one of the two derived populations the most frequent allele has a frequency of less than This should be compared to the analagous histograms in AYALA and GILPIN (1974) and NEI and TATANO (1975). Th is comparison reveals that the SAS-CFF model behaves in a manner that is very similar to the neutral model when histograms of Zj are visually compared in the cases of monomorphic plus polymorphic populations. This is not surprising, since both models are constrained to having relatively low heterozygosities and, therefore, most If values will be near zerg or one with relatively few in the middle of the unit interval. The temporal sequence of J-shaped to U-shaped to reflected J-shaped distributions will be displayed by essentially any evolutionary model with constant rates of evolution and low heterozygosities. From this we must conclude that histograms of Zi may not be useful in distinguishing the neutral allele model from the SAS-CFF model, or, presumably, from any other Ftationary model of evolution with low heterozygosities. In the case of polymorphic populations, the SAS-CFF model exhibits a bimodal shape, which AYALA (1977) showed is characteristic of the Drosophila data, but is not exhibited by the neutral-allele model. This property suggests that the SAS- CFF model may be preferable to the neutral model as an explanation for enzyme polymorphism in Drosophila. CHAKRABORTY, FUERST and NEI (1978) have carried the analysis of genetic

13 MOLECULAR EVOLUTION AND POLYMORPHISM Polymorphic and Monomorphic Loci Polymorphic Loci o o Genetic I dent ity FIGURE 4.-Histogram for genetic identities based on 200 replicates with B = 1.1. distance data quite a bit further than the simple visual inspection of histograms. They have looked at the functional relationships between genetic distance and the correlation of heterozygosity, and between the mean and variance of genetic distance under the neutral-allele model. These relationships are compared to enzyme data. The agreement is variable, but there is no compelling reason to reject the neutral-allele theory on the bases of these comparisons except, perhaps, for Drosophila. VVe can examine the two relationships used by CHAKRABORTY, FUERST and NEI (1978) using the approximate process (35) for the SAS-CFF model. In

14 750 JOHN H. GILLESPIE both cases, we will be interested in both the quantitative and the qualitative aspects of the comparison between these relationships for the neutral-allele and the SAS-CFF models. Genetic identity and correlation of heterozygosity OHTA (1976) showed that the genetic identity given by (16) and the correlation between heterozygosities, r, are related by TZ 12(1+1/8). (41) The correlation, as a function of I, depends only on the single paramater 8, which may be estimated independently. This should, in principle, provide a test for the neutral-allele model. The relationship between r and I for the approximate SAS-CFF model is given in Figure 5 for the heterozygosities H = 0.05 and H = The curves are the relationship (41) for the neutral model for these same heterozygosities. We see from this figure that the drop off of the correlation of heterozygosity with genetic identity is slower under the SAS-CFF model than under the neutral model. It is also apparent from this figure that the functional dependence of r and I depends on the heterozygosity of the population. This is contrary to the claim of CHAKRABORTY, FUERST and NEI (1978) that, under a random fluctuation model, the correlation should occur irrespective of the level H = H = I Im I 0:, o Genetic identity FIGURE 5.-The relationship of genetic identity with the correlation of heterozygosity. The lines are for the neutral allele model; the dots for the approximate SAS-CFF model. I J

15 MOLECULAR EVOLUTION AND POLYMORPHISM 75 1 of heterozygosity. Comparison of Figure 4 with Figure 3 of CHAKRABORTY, FUERST and NEI (1978) shows that the SAS-CFF model fits the data better than does the neutral-allele model with constant mutation rate, but does not fit as well, in the case of Drosophila, as the neutral model with variable mutation rates. We can, however, modify the SAS-CFF model to include locus-by-locus variation in B, as discussed earlier. This will have the same effect as does varialion in the mutation rate and will make the agreement of the SAS-CFF model with the Drosophila data accept2ble. Mean and variance of genetic distance The comparison of the mean and variance of genetic distance, d,, for the neutral allele and the SAS-CFF models is given in Figure 6. For the definition of d, and for its moments under neutrality, see CHAKRABORTY, FUERST and NEI (1978). The form of the relationship is similar to that under neutrality although the variance under the SAS-CFF model is less than that of the neutral model for a given d*$. To some extent this will be caused by the approximation (35) since, as demonstrated in Figure 3, the approximate process will generally show reduced variances in measures like genetic distance and heterozygosity. There is also an effect due to the fact that the Monte-Carlo program utilized only ten alleles. Nonetheless, if we accept the comparison at face value and examine Figure 5 of CHAKRABORTY, FUERST and NEI (1978), which looks at the enzyme data, we see that the SAS-CFF model will fit the data adequately for all groups except, perhaps, Drosophila, where the curve under the SAS-CFF model will 0.3 E c1 0.2 rc 0 Q) 0 c.- (TJ > H= H= /e--- \ 0 0 *, dml \ \ P \ \ \,\ FIGURE 6.-The relationship between d, and the variance in ans. The lines are for the neutral model; the dots are from the approximate SAS-CFF model.

16 752 JOHN H. GILLESPIE fall below the data points. The curve for the constant-mutation neutral model falls above the data points, while the variable mutation neutral curve fits the data quite well. In summary, the behavior of genetic distance under the SAS-CFF model is qualitatively like the behavior under neutrality, although quantitatively there are differences. At the present time both behaviors seem to fit the enzyme data adequately. A definitive test will be unlikely until the problem of the nonidentification OF alleles is resolved. DISCUSSION The SAS-CFF model is a model of a particular form of natural selection in an infinitely large population experiencing no mutation. Since all natural populations are finite and all loci are mutable. it is natural to query the domain of applicability of the SAS-CFF model. The asymptotic and transient properties explored in this paper should be virtually unaffected by genetic drift and mutation if the coefficient of xixj [S,j + F - xi - ~ j ] (42) in the moment EAX,AX~ is much larger than the reciprocal of the population size. For example, in the two-allele case. the combined mutation + drift + SAS-CFF model for a single patch and no autocorrelation has the drift and diffusion coefficients Edx = { ~Bx( 1-S) ( S-S) +e8 ( S-X) } Ed2 = 22' ( 1-X) '+ex (1-s), (43) where =2Na2. It is obvious that, as E-O, the combined model approaches the simple SAS-CFF model. This is also reflected in the asymptotic density, which may be written as c [z ( 1 -x) ] B-2 [ 1 +,/ex ( 1 -x)]b-1-8. (44) where [z(l-x)]b-2 will be recognized as the asymptotic density of the SAS- CFF model without the effects of drift and mutation. Thus, when 2Na2 is very large, we would not expect the infinite and the finite N models to behave very differently. There have been other attempts to examine the evolutionary dynamics of random environment models. The two most conspicuous of these are by NEI and YOKOYAMA (1976) and by TAKAHATA, KAZUSHIGE and MATSUDA (1975). In both instances, the models included random fluctuations, genetic drift and mutation. However, in both cases the analyses make the error of assuming that the multiple-allelic aspects of selection in a fluctuating environment may be inferred from the single allelic behavior. In particular, NEI and YOKOYAMA (1976) incorrectly assume that the diffusion coefficient for the ith allele is of the form Edxiz Vs~iz (l-xi) * + xi (1-~i)/2N, (451

17 MOLECULAR EVOLUTION AND POLYMORPHISM 753 where the correct form for the term reflecting the effects of a variable environment is given by (3). The entire discussion of the incompatibility of random environment models with enzyme data in NEI and YOKOYAMA (1976) must be discounted. Similarly TAKAHATA, KAZUSHIGE and MATSUDA (1975) have assumed that one can infer multi-allelic properties from single-allele models. They have made the additional assumption that a Strotonovich-type approach io the random fluctuation equation is appropriate. For a discussion of the problems of this approach, see TURELLI (1977) or GILLESPIE and GUESS (1978). In conclusion, this paper shows that the SAS-CFF model is in reasonable agreement with the presently available enzyme data. There is no way at present of convincingly rejecting any of a number of models that are also in reasonable agreement with the data. The main virtue of the SAS-CFF model as it is presently developed is that it addresses a wider range of phenomena than do any of the competing models. LITERATURE CITED AYALA, F. J., 1977 Protein evolution in different species: Is it a random process? pp In: Malecular Evolution and Polymorphism. Edited by M. KIMARA, Mitshima, Japan. AYALA, F. J. and M. E. GILPIN, 1974 Gene frequency comparisons between taxa: support for the natural selection of protein polymorphisms. Proc. Natl. Acad. Sci. US. 71 : AYALA, F. J., J. R. POWELL and M. L. TRACY, 1972 Enzyme variability in the Drosophila willistoni group. V. genetic variation in natural populations of Drosophila equinoxialis. Genet. Res. 20: CHAKRABORTY, R., P. A. FUERST and M. NEI, 1978 Statistical studies on protein polymorphism in natural populations. 11. Gene differentiation between populations. Genetic 88: EWENS, W. J., The sampling theory of selectivity neutral alleles. Theoret. Pop. Biol. 3: FUERST, P. A., R. CHAKRABORTY and M. NEI, 1977 Statistical studies on protein polymorphism in natural populations. I. Distribution of single locus heterozygosity. Genetics 86 : GILLESPIE, J. H., 1976 A general model to account for enzyme variation in natural populations. 11. Characterization of the fitness function. Am. Naturalist 110: , 1977 Sampling theory for alleles in a random environment. Nature 226: , 1978 A general model to account for enzyme variation in natural populations. V. The SAS-CFF model. Theoret. Pop. Bid. 14: 1-46, GILLESPIE, J. H. and H. A4. GUESS, 1978 The effects of environmental autocorrelations on the progress of selection in a random environment. Am. Naturalist 112: JOHNSON, N. L., 1949 Systems of frequency curves generated by methods of translation. Biometrika 36: KIMURA, M. and T. OHTA, 1971 Protein polymorphism as a phase of molecular evolution. Nature 228: LEWONTIN, R. C., 1974 New York. NEI, M., 1972 The Genetic Basis of Evolutionary Change. Columbia Univ. Press, Genetic distance between populations. Am. Naturalist 106: NEI, M., R. CHAKRABORTY and P. A. FUERST, 1976 Infinite allele model with varying mutation rate. Proc. Natl. Acad. Sci. US. 73: NEI, M. and Y. TATENO, 1975 Interlocus variation of genetic distance and the neutral mutation theory. Proc. Natl. Acad. Sci. US. 72:

18 754 JOHN H. GILLESPIE NEI, M. and S. YOKOYAMA, 1976 Effects of random fluctuations of selection intensity on genetic variability in a fi'iite population. Japan I. Genet. 51 : NEWMAN, T. G. and P. L. ODELL, 1971 York. OHTA, T., 1976 Role of very slightly deleterious mutations in molecular evolution and polymorphism. Theoret. Pop. Biol. 10: POWELL, J. R. and H. WISTRAND, 1978 The effects of heterogeneous environments and a competitor on genetic variation in Drosophila. Am Naturalist 112 : TAKAHATA, J., I. KAZUSHIGE and H. MATSUDA, 1975 Effect of tempera1 fluctuation of selection coeficient on gene frequency in a population. Proc. Natl. Acad. Sci. U.S. 72 : TURELLI, M., The Generation of Random Variates. Hafner, New Random environments and stochastic calculus. Theoret. Pop. Biol. 12: Corresponding editor: W. J. EWENS

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE Department of Biology, University of Penmyluania, Philadelphia, Pennsyluania 19174 Manuscript received September 17,

More information

Lecture Notes: BIOL2007 Molecular Evolution

Lecture Notes: BIOL2007 Molecular Evolution Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits

More information

The neutral theory of molecular evolution

The neutral theory of molecular evolution The neutral theory of molecular evolution Introduction I didn t make a big deal of it in what we just went over, but in deriving the Jukes-Cantor equation I used the phrase substitution rate instead of

More information

DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS 1N A F'INITE POPULATION'

DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS 1N A F'INITE POPULATION' DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS 1N A F'INITE POPULATION' WEN-HSIUNG LI Center for Demographic and Population Genetics, University of Texas Health Science Center,

More information

GENETIC VARIABILITY AND RATE OF GENE SUBSTITUTION IN A FINITE POPULATION UNDER MUTATION AND FLUCTUATING SELECTION* NAOYUKI TAKAHATA ABSTRACT

GENETIC VARIABILITY AND RATE OF GENE SUBSTITUTION IN A FINITE POPULATION UNDER MUTATION AND FLUCTUATING SELECTION* NAOYUKI TAKAHATA ABSTRACT GENETC VARABLTY AND RATE OF GENE SUBSTTUTON N A FNTE POPULATON UNDER MUTATON AND FLUCTUATNG SELECTON* NAOYUK TAKAHATA National nstitute of Genetics, Mishima, 411 Japan ABSTRACT By using a numerical method

More information

A simple genetic model with non-equilibrium dynamics

A simple genetic model with non-equilibrium dynamics J. Math. Biol. (1998) 36: 550 556 A simple genetic model with non-equilibrium dynamics Michael Doebeli, Gerdien de Jong Zoology Institute, University of Basel, Rheinsprung 9, CH-4051 Basel, Switzerland

More information

Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution. Classical vs. balanced views of genome structure

Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution. Classical vs. balanced views of genome structure Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution Classical vs. balanced views of genome structure - the proposal of the neutral theory by Kimura in 1968 led to the so-called neutralist-selectionist

More information

I of a gene sampled from a randomly mating popdation,

I of a gene sampled from a randomly mating popdation, Copyright 0 1987 by the Genetics Society of America Average Number of Nucleotide Differences in a From a Single Subpopulation: A Test for Population Subdivision Curtis Strobeck Department of Zoology, University

More information

11. TEMPORAL HETEROGENEITY AND

11. TEMPORAL HETEROGENEITY AND GENETIC VARIATION IN A HETEROGENEOUS ENVIRONMENT. 11. TEMPORAL HETEROGENEITY AND DIRECTIONAL SELECTION1 PHILIP W. HEDRICK Division of Biological Sciences, University of Kansas, Lawrence, Kansas 66045 Manuscript

More information

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation Ann. Hum. Genet., Lond. (1975), 39, 141 Printed in Great Britain 141 A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation BY CHARLES F. SING AND EDWARD D.

More information

LINKAGE DISEQUILIBRIUM, SELECTION AND RECOMBINATION AT THREE LOCI

LINKAGE DISEQUILIBRIUM, SELECTION AND RECOMBINATION AT THREE LOCI Copyright 0 1984 by the Genetics Society of America LINKAGE DISEQUILIBRIUM, SELECTION AND RECOMBINATION AT THREE LOCI ALAN HASTINGS Defartinent of Matheinntics, University of California, Davis, Calijornia

More information

Natural Selection results in increase in one (or more) genotypes relative to other genotypes.

Natural Selection results in increase in one (or more) genotypes relative to other genotypes. Natural Selection results in increase in one (or more) genotypes relative to other genotypes. Fitness - The fitness of a genotype is the average per capita lifetime contribution of individuals of that

More information

The Wright-Fisher Model and Genetic Drift

The Wright-Fisher Model and Genetic Drift The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population

More information

Neutral Theory of Molecular Evolution

Neutral Theory of Molecular Evolution Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation

More information

VARIANCE AND COVARIANCE OF HOMOZYGOSITY IN A STRUCTURED POPULATION

VARIANCE AND COVARIANCE OF HOMOZYGOSITY IN A STRUCTURED POPULATION Copyright 0 1983 by the Genetics Society of America VARIANCE AND COVARIANCE OF HOMOZYGOSITY IN A STRUCTURED POPULATION G. B. GOLDING' AND C. STROBECK Deportment of Genetics, University of Alberta, Edmonton,

More information

Wright-Fisher Models, Approximations, and Minimum Increments of Evolution

Wright-Fisher Models, Approximations, and Minimum Increments of Evolution Wright-Fisher Models, Approximations, and Minimum Increments of Evolution William H. Press The University of Texas at Austin January 10, 2011 1 Introduction Wright-Fisher models [1] are idealized models

More information

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information # Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either

More information

There s plenty of time for evolution

There s plenty of time for evolution arxiv:1010.5178v1 [math.pr] 25 Oct 2010 There s plenty of time for evolution Herbert S. Wilf Department of Mathematics, University of Pennsylvania Philadelphia, PA 19104-6395 Warren

More information

Mathematical Population Genetics. Introduction to the Stochastic Theory. Lecture Notes. Guanajuato, March Warren J Ewens

Mathematical Population Genetics. Introduction to the Stochastic Theory. Lecture Notes. Guanajuato, March Warren J Ewens 1 Mathematical Population Genetics Introduction to the Stochastic Theory Lecture Notes Guanajuato, March 2009 Warren J Ewens 2 Preface These notes should, ideally, be read before the Guanajuato meeting

More information

Introduction to population genetics & evolution

Introduction to population genetics & evolution Introduction to population genetics & evolution Course Organization Exam dates: Feb 19 March 1st Has everybody registered? Did you get the email with the exam schedule Summer seminar: Hot topics in Bioinformatics

More information

Population Structure

Population Structure Ch 4: Population Subdivision Population Structure v most natural populations exist across a landscape (or seascape) that is more or less divided into areas of suitable habitat v to the extent that populations

More information

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Evolutionary Theory Mathematical and Conceptual Foundations Sean H. Rice Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Contents Preface ix Introduction 1 CHAPTER 1 Selection on One

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

Fitness landscapes and seascapes

Fitness landscapes and seascapes Fitness landscapes and seascapes Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen: Cross-species analysis of bacterial promoters, Nonequilibrium evolution of

More information

Gene regulation: From biophysics to evolutionary genetics

Gene regulation: From biophysics to evolutionary genetics Gene regulation: From biophysics to evolutionary genetics Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen Johannes Berg Stana Willmann Curt Callan (Princeton)

More information

Genetic Variation in Finite Populations

Genetic Variation in Finite Populations Genetic Variation in Finite Populations The amount of genetic variation found in a population is influenced by two opposing forces: mutation and genetic drift. 1 Mutation tends to increase variation. 2

More information

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift: 9. Genetic Drift Genetic drift is the alteration of gene frequencies due to sampling variation from one generation to the next. It operates to some degree in all finite populations, but can be significant

More information

(Write your name on every page. One point will be deducted for every page without your name!)

(Write your name on every page. One point will be deducted for every page without your name!) POPULATION GENETICS AND MICROEVOLUTIONARY THEORY FINAL EXAMINATION (Write your name on every page. One point will be deducted for every page without your name!) 1. Briefly define (5 points each): a) Average

More information

Space Time Population Genetics

Space Time Population Genetics CHAPTER 1 Space Time Population Genetics I invoke the first law of geography: everything is related to everything else, but near things are more related than distant things. Waldo Tobler (1970) Spatial

More information

I genetic systems in an effort to understand the structure of natural populations.

I genetic systems in an effort to understand the structure of natural populations. ON TREATING THE CHROMOSOME AS THE UNIT OF SELECTION* MONTGOMERY SLATKIN Department of Theoretical Biology, Uniuersity of Chicago, Chicago, Illinois 60637 Manuscript received July 19, 1971 ABSTRACT A simple

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Long-term adaptive diversity in Levene-type models

Long-term adaptive diversity in Levene-type models Evolutionary Ecology Research, 2001, 3: 721 727 Long-term adaptive diversity in Levene-type models Éva Kisdi Department of Mathematics, University of Turku, FIN-20014 Turku, Finland and Department of Genetics,

More information

NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS

NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS æ 2 NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS 19 May 2014 Variations neither useful nor injurious would not be affected by natural selection, and would be left either a fluctuating element, as perhaps

More information

Study of similarities and differences in body plans of major groups Puzzling patterns:

Study of similarities and differences in body plans of major groups Puzzling patterns: Processes of Evolution Evolutionary Theories Widely used to interpret the past and present, and even to predict the future Reveal connections between the geological record, fossil record, and organismal

More information

The Mechanisms of Evolution

The Mechanisms of Evolution The Mechanisms of Evolution Figure.1 Darwin and the Voyage of the Beagle (Part 1) 2/8/2006 Dr. Michod Intro Biology 182 (PP 3) 4 The Mechanisms of Evolution Charles Darwin s Theory of Evolution Genetic

More information

Estimating Evolutionary Trees. Phylogenetic Methods

Estimating Evolutionary Trees. Phylogenetic Methods Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent

More information

Classical Selection, Balancing Selection, and Neutral Mutations

Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection Perspective of the Fate of Mutations All mutations are EITHER beneficial or deleterious o Beneficial mutations are selected

More information

Darwinian Selection. Chapter 7 Selection I 12/5/14. v evolution vs. natural selection? v evolution. v natural selection

Darwinian Selection. Chapter 7 Selection I 12/5/14. v evolution vs. natural selection? v evolution. v natural selection Chapter 7 Selection I Selection in Haploids Selection in Diploids Mutation-Selection Balance Darwinian Selection v evolution vs. natural selection? v evolution ² descent with modification ² change in allele

More information

Massachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution

Massachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution Massachusetts Institute of Technology 6.877 Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution 1. Rates of amino acid replacement The initial motivation for the neutral

More information

Long-Term Response and Selection limits

Long-Term Response and Selection limits Long-Term Response and Selection limits Bruce Walsh lecture notes Uppsala EQG 2012 course version 5 Feb 2012 Detailed reading: online chapters 23, 24 Idealized Long-term Response in a Large Population

More information

Linear Regression (1/1/17)

Linear Regression (1/1/17) STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression

More information

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page. EVOLUTIONARY BIOLOGY EXAM #1 Fall 2017 There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page. Part I. True (T) or False (F) (2 points each). Circle

More information

Contrasts for a within-species comparative method

Contrasts for a within-species comparative method Contrasts for a within-species comparative method Joseph Felsenstein, Department of Genetics, University of Washington, Box 357360, Seattle, Washington 98195-7360, USA email address: joe@genetics.washington.edu

More information

PROBABILITY OF FIXATION OF A MUTANT GENE IN A FINITE POPULATION WHEN SELECTIVE ADVANTAGE DECREASES WITH TIME1

PROBABILITY OF FIXATION OF A MUTANT GENE IN A FINITE POPULATION WHEN SELECTIVE ADVANTAGE DECREASES WITH TIME1 PROBABILITY OF FIXATION OF A MUTANT GENE IN A FINITE POPULATION WHEN SELECTIVE ADVANTAGE DECREASES WITH TIME1 MOT00 KIMURA AND TOMOKO OHTA National Institute of Genetics, Mishima, Japan Received December

More information

The Role of Causal Processes in the Neutral and Nearly Neutral Theories

The Role of Causal Processes in the Neutral and Nearly Neutral Theories The Role of Causal Processes in the Neutral and Nearly Neutral Theories Michael R. Dietrich and Roberta L. Millstein The neutral and nearly neutral theories of molecular evolution are sometimes characterized

More information

Selection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION

Selection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION Selection Page 1 sur 11 Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION * I- Introduction II- Modeling and selective values III- Basic model IV- Equation of the recurrence of allele

More information

8. Genetic Diversity

8. Genetic Diversity 8. Genetic Diversity Many ways to measure the diversity of a population: For any measure of diversity, we expect an estimate to be: when only one kind of object is present; low when >1 kind of objects

More information

Sequence evolution within populations under multiple types of mutation

Sequence evolution within populations under multiple types of mutation Proc. Natl. Acad. Sci. USA Vol. 83, pp. 427-431, January 1986 Genetics Sequence evolution within populations under multiple types of mutation (transposable elements/deleterious selection/phylogenies) G.

More information

WITH WEAK INTERACTION

WITH WEAK INTERACTION EQUILIBRIA FOR GENETIC SYSTEMS WITH WEAK INTERACTION SAMUEL KARLIN WEIZMANN INSTITUTE OF SCIENCE, ISRAEL and STANFORD UNIVERSITY and JAMES McGREGOR STANFORD UNIVERSITY 1. Introduction The following principle

More information

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity,

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity, AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity, Today: Review Probability in Populatin Genetics Review basic statistics Population Definition

More information

BIOL Evolution. Lecture 9

BIOL Evolution. Lecture 9 BIOL 432 - Evolution Lecture 9 J Krause et al. Nature 000, 1-4 (2010) doi:10.1038/nature08976 Selection http://www.youtube.com/watch?v=a38k mj0amhc&feature=playlist&p=61e033 F110013706&index=0&playnext=1

More information

Bayesian analysis of the Hardy-Weinberg equilibrium model

Bayesian analysis of the Hardy-Weinberg equilibrium model Bayesian analysis of the Hardy-Weinberg equilibrium model Eduardo Gutiérrez Peña Department of Probability and Statistics IIMAS, UNAM 6 April, 2010 Outline Statistical Inference 1 Statistical Inference

More information

Heterozygosity is variance. How Drift Affects Heterozygosity. Decay of heterozygosity in Buri s two experiments

Heterozygosity is variance. How Drift Affects Heterozygosity. Decay of heterozygosity in Buri s two experiments eterozygosity is variance ow Drift Affects eterozygosity Alan R Rogers September 17, 2014 Assumptions Random mating Allele A has frequency p N diploid individuals Let X 0,1, or 2) be the number of copies

More information

STRONG balancing selection can result from over- approximation of Gillespie (1984, 1991). This approximation

STRONG balancing selection can result from over- approximation of Gillespie (1984, 1991). This approximation Copyright 1999 by the Genetics Society of America Overdominant Alleles in a Population of Variable Size Montgomery Slatkin and Christina A. Muirhead Department of Integrative Biology, University of California,

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

1. they are influenced by many genetic loci. 2. they exhibit variation due to both genetic and environmental effects.

1. they are influenced by many genetic loci. 2. they exhibit variation due to both genetic and environmental effects. October 23, 2009 Bioe 109 Fall 2009 Lecture 13 Selection on quantitative traits Selection on quantitative traits - From Darwin's time onward, it has been widely recognized that natural populations harbor

More information

A SIMPLE METHOD TO ACCOUNT FOR NATURAL SELECTION WHEN

A SIMPLE METHOD TO ACCOUNT FOR NATURAL SELECTION WHEN Genetics: Published Articles Ahead of Print, published on September 14, 2008 as 10.1534/genetics.108.090597 Title: A SIMPLE METHOD TO ACCOUNT FOR NATURAL SELECTION WHEN PREDICTING INBREEDING DEPRESSION

More information

Genetical theory of natural selection

Genetical theory of natural selection Reminders Genetical theory of natural selection Chapter 12 Natural selection evolution Natural selection evolution by natural selection Natural selection can have no effect unless phenotypes differ in

More information

Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles

Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles John Novembre and Montgomery Slatkin Supplementary Methods To

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

The phenomenon of biological evolution: 19th century misconception

The phenomenon of biological evolution: 19th century misconception The phenomenon of biological evolution: 19th century misconception Dalius Balciunas Abstract. Scientists still thin that biological evolution is driven by the process named natural selection. Perhaps this

More information

Problems for 3505 (2011)

Problems for 3505 (2011) Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.

More information

Genetic hitch-hiking in a subdivided population

Genetic hitch-hiking in a subdivided population Genet. Res., Camb. (1998), 71, pp. 155 160. With 3 figures. Printed in the United Kingdom 1998 Cambridge University Press 155 Genetic hitch-hiking in a subdivided population MONTGOMERY SLATKIN* AND THOMAS

More information

Lecture 24: Multivariate Response: Changes in G. Bruce Walsh lecture notes Synbreed course version 10 July 2013

Lecture 24: Multivariate Response: Changes in G. Bruce Walsh lecture notes Synbreed course version 10 July 2013 Lecture 24: Multivariate Response: Changes in G Bruce Walsh lecture notes Synbreed course version 10 July 2013 1 Overview Changes in G from disequilibrium (generalized Bulmer Equation) Fragility of covariances

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Non-independence in Statistical Tests for Discrete Cross-species Data

Non-independence in Statistical Tests for Discrete Cross-species Data J. theor. Biol. (1997) 188, 507514 Non-independence in Statistical Tests for Discrete Cross-species Data ALAN GRAFEN* AND MARK RIDLEY * St. John s College, Oxford OX1 3JP, and the Department of Zoology,

More information

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 Homework 4 (version 3) - posted October 3 Assigned October 2; Due 11:59PM October 9 Problem 1 (Easy) a. For the genetic regression model: Y

More information

NOTES CH 17 Evolution of. Populations

NOTES CH 17 Evolution of. Populations NOTES CH 17 Evolution of Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Populations 17.1 Genes & Variation Darwin

More information

Evolution Module. 6.2 Selection (Revised) Bob Gardner and Lev Yampolski

Evolution Module. 6.2 Selection (Revised) Bob Gardner and Lev Yampolski Evolution Module 6.2 Selection (Revised) Bob Gardner and Lev Yampolski Integrative Biology and Statistics (BIOL 1810) Fall 2007 1 FITNESS VALUES Note. We start our quantitative exploration of selection

More information

The Wright Fisher Controversy. Charles Goodnight Department of Biology University of Vermont

The Wright Fisher Controversy. Charles Goodnight Department of Biology University of Vermont The Wright Fisher Controversy Charles Goodnight Department of Biology University of Vermont Outline Evolution and the Reductionist Approach Adding complexity to Evolution Implications Williams Principle

More information

Formalizing the gene centered view of evolution

Formalizing the gene centered view of evolution Chapter 1 Formalizing the gene centered view of evolution Yaneer Bar-Yam and Hiroki Sayama New England Complex Systems Institute 24 Mt. Auburn St., Cambridge, MA 02138, USA yaneer@necsi.org / sayama@necsi.org

More information

The Genetics of Natural Selection

The Genetics of Natural Selection The Genetics of Natural Selection Introduction So far in this course, we ve focused on describing the pattern of variation within and among populations. We ve talked about inbreeding, which causes genotype

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 009 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:

More information

Bias-Variance Error Bounds for Temporal Difference Updates

Bias-Variance Error Bounds for Temporal Difference Updates Bias-Variance Bounds for Temporal Difference Updates Michael Kearns AT&T Labs mkearns@research.att.com Satinder Singh AT&T Labs baveja@research.att.com Abstract We give the first rigorous upper bounds

More information

Population Genetics: a tutorial

Population Genetics: a tutorial : a tutorial Institute for Science and Technology Austria ThRaSh 2014 provides the basic mathematical foundation of evolutionary theory allows a better understanding of experiments allows the development

More information

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm *

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm * Evolutionary Dynamics and Extensive Form Games by Ross Cressman Reviewed by William H. Sandholm * Noncooperative game theory is one of a handful of fundamental frameworks used for economic modeling. It

More information

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition Lecture 3 Introduction on Quantitative Genetics: I Fisher s Variance Decomposition Bruce Walsh. Aug 004. Royal Veterinary and Agricultural University, Denmark Contribution of a Locus to the Phenotypic

More information

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION

SEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION Annu. Rev. Genomics Hum. Genet. 2003. 4:213 35 doi: 10.1146/annurev.genom.4.020303.162528 Copyright c 2003 by Annual Reviews. All rights reserved First published online as a Review in Advance on June 4,

More information

There are 3 parts to this exam. Take your time and be sure to put your name on the top of each page.

There are 3 parts to this exam. Take your time and be sure to put your name on the top of each page. EVOLUTIONARY BIOLOGY BIOS 30305 EXAM #2 FALL 2011 There are 3 parts to this exam. Take your time and be sure to put your name on the top of each page. Part I. True (T) or False (F) (2 points each). 1)

More information

THE SAMPLING DISTRIBUTION OF LINKAGE DISEQUILIBRIUM UNDER AN INFINITE ALLELE MODEL WITHOUT SELECTION ABSTRACT

THE SAMPLING DISTRIBUTION OF LINKAGE DISEQUILIBRIUM UNDER AN INFINITE ALLELE MODEL WITHOUT SELECTION ABSTRACT Copyright 1985 by the Genetics Society of America THE SAMPLING DISTRIBUTION OF LINKAGE DISEQUILIBRIUM UNDER AN INFINITE ALLELE MODEL WITHOUT SELECTION RICHARD R. HUDSON Biometry and Risk Assessment Program,

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 007 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes. February 8, 2005 Bio 107/207 Winter 2005 Lecture 11 Mutation and transposable elements - the term mutation has an interesting history. - as far back as the 17th century, it was used to describe any drastic

More information

Lecture 2. Basic Population and Quantitative Genetics

Lecture 2. Basic Population and Quantitative Genetics Lecture Basic Population and Quantitative Genetics Bruce Walsh. Aug 003. Nordic Summer Course Allele and Genotype Frequencies The frequency p i for allele A i is just the frequency of A i A i homozygotes

More information

Supplementary Figures.

Supplementary Figures. Supplementary Figures. Supplementary Figure 1 The extended compartment model. Sub-compartment C (blue) and 1-C (yellow) represent the fractions of allele carriers and non-carriers in the focal patch, respectively,

More information

Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution

Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions

More information

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species

More information

7. Tests for selection

7. Tests for selection Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info

More information

Big Idea 1: The process of evolution drives the diversity and unity of life.

Big Idea 1: The process of evolution drives the diversity and unity of life. Big Idea 1: The process of evolution drives the diversity and unity of life. understanding 1.A: Change in the genetic makeup of a population over time is evolution. 1.A.1: Natural selection is a major

More information

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution.

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution. The AP Biology course is designed to enable you to develop advanced inquiry and reasoning skills, such as designing a plan for collecting data, analyzing data, applying mathematical routines, and connecting

More information

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall

More information

AP Curriculum Framework with Learning Objectives

AP Curriculum Framework with Learning Objectives Big Ideas Big Idea 1: The process of evolution drives the diversity and unity of life. AP Curriculum Framework with Learning Objectives Understanding 1.A: Change in the genetic makeup of a population over

More information

Genetic assimilation can occur in the absence of selection for the assimilating phenotype, suggesting a role for the canalization heuristic

Genetic assimilation can occur in the absence of selection for the assimilating phenotype, suggesting a role for the canalization heuristic doi: 10.1111/j.1420-9101.2004.00739.x Genetic assimilation can occur in the absence of selection for the assimilating phenotype, suggesting a role for the canalization heuristic J. MASEL Department of

More information

Genetic proof of chromatin diminution under mitotic agamospermy

Genetic proof of chromatin diminution under mitotic agamospermy Genetic proof of chromatin diminution under mitotic agamospermy Evgenii V. Levites Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Email: levites@bionet.nsc.ru

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 013 Population genetics Outline of lectures 3-6 1. We ant to kno hat theory says about the reproduction of genotypes in a population. This results

More information

the experiment; and the distribution of residuals (or non-heritable

the experiment; and the distribution of residuals (or non-heritable SOME GENOTYPIC FREQUENCIES AND VARIANCE COMPONENTS OCCURRING IN BIOMETRICAL GENETICS J. A. NELDER National Vegetable Research Station, Wellesbourne, Warwicks. Received 30.xi.5 i 1. INTRODUCTION MATHER

More information

Case Studies in Ecology and Evolution

Case Studies in Ecology and Evolution 3 Non-random mating, Inbreeding and Population Structure. Jewelweed, Impatiens capensis, is a common woodland flower in the Eastern US. You may have seen the swollen seed pods that explosively pop when

More information

Predicting Fitness Effects of Beneficial Mutations in Digital Organisms

Predicting Fitness Effects of Beneficial Mutations in Digital Organisms Predicting Fitness Effects of Beneficial Mutations in Digital Organisms Haitao Zhang 1 and Michael Travisano 1 1 Department of Biology and Biochemistry, University of Houston, Houston TX 77204-5001 Abstract-Evolutionary

More information