A MODEL ALLOWING CONTINUOUS VARIATION IN ELECTROPHORETIC MOBILITY OF NEUTRAL ALLELES

Size: px
Start display at page:

Download "A MODEL ALLOWING CONTINUOUS VARIATION IN ELECTROPHORETIC MOBILITY OF NEUTRAL ALLELES"

Transcription

1 A MODEL ALLOWING CONTINUOUS VARIATION IN ELECTROPHORETIC MOBILITY OF NEUTRAL ALLELES GARY COBBS Department of Biology, University of Louisville, Louisville, Keniucky Manuscript received April 25, 1978 Revised copy received January 22, 1979 ABSTRACT The infinite-sites model with no recombination is extended to include mutations that affect electrophoretic mobility. The model allows the effect of a single-site mutation to have a continuous effect on mobility. Formulae are obtained for the variance of electrophoretic mobility of alleles after an arbitrary length of time. A special case of the general model is the case of stepwise production of neutral alleles with an arbitrary number of steps. MODELS accounting for the existence of genetically determined variation in electrophoretic mobility of proteins have developed extensively during the past decade. A persistent problem has been that of allowing for the limited resolving power of gel electrophoresis. This problem was dealt with, at least in part, by the model of stepwise production of neutral mobility classes introduced by OHTA and KIMURA (1973). The essential assumption of this step moldel is that only amino acid changes that involve charged residues will be detectable by gel electrophoresis. The model assumes that electrophoretic mobility is changed by discrete jumps, all of equal magnitude, in either the positive or negative direction. Some justification for this assumption is given in their original paper (OHTA and KIMURA 1973). Since this initial paper, there have been numerous extensions and refinements of the mathematical treatment of this and closely related models (KIMURA and OHTA 1975; MORAN 1975; WEHRHAHN 1975). Results for various kinds of step models have been derived from assumptions about the relationship between mutations and electrophoretic mobility and the effect of genetic drift due to finite population size (OHTA and KIMURA 1973; WEHRHAHN 1975; MORAN 1975). We will show here how some of the same results can be obtained by starting irom results for the infinite-sites model with no recombination (WATTERSON 1975; LI 1977). We will also generalize the model to include the case where the effect of a site mutation on electrophoretic mobility is a continuous variable. GENERAL THEORY We start with the results of Lr (1977) for the infinite-sites model with no recombination. Let Pk(t) = Prob(dt = k) denote the probability that at genera- Genetics 92: June. 1979

2 6 70 G. COBBS tion time t the number (d) of site differences between two randomly chosen cistrons is k. The probability-generating function for the &(t) values is then LI (1 977) has shown, for the infinite-sites model with no recombination in which it is assumed that no two mutations ever occur at the same site, that -1 G(Z,t) = 2Na(Z) + 2Na(Z) ea'z) t + ea(z)tg(z,o), where a(2) = vZ and A = 1/2N + 2v. Here, N is the effective population size and v is the mutation rate. For t infinitely large, we have 1 G(Z,m) = 1+6'-02 ' where 6' = 4Nv. In order to obtain results pertaining to protein polymorphism, we will henceforth let k be the number of site differences in cistrons that cause amino acid differences in the proteins coded for by the cistrons. We also let v be restricted to mutations causing amino acid changes. Now let the random variable x1 be the difference in electrophoretic mobility of the protein products of two randomly selected cistrons that have a single-site difference. This random variable has range (- 00, f a) and may be discrete or continuous. Denote the probability density function (or probability function for the discrete case) of x1 by f(xl) and its characteristic function by +(s). The moments of x1 can be found from +(s). We will make most use of the fact that E(x12) = -+"(O). We will assume that the effect of a single-site mutation on mobility has a symmetrical distribution about zero so that its distribution is identical to that of xl. Should this not be true in nature, then. our model may still be used and we simply take f(xl) to be the symmetricized distribution of the effect of a single random-site mutation on electrophoretic mobility. This can be done since z1 is defined using two randomly selected cistrons. Now let the random variable xh be the effect of k site differences on electrophoretic mobility. The variable xk is the sum of k independent and identically distributed values of xl. This is an important assumption underlying this model, as well as all the step models. Essentially, we are assuming that an amino acid substitution in one part of a protein does not alter the effect on mobility of another substitution in another part of the same protein. Some data on the electrophoretic and mobilities of hemoglobin alleles does support this assumption (see COBBS PRAKASH ). Now regard k as a random variable with probability-generating function given by (1) and let the random variable x be the mobility difference between the protein products of two randomly selected cistrons. Then by a lemma

3 ELECTROPHORETIC MOBILITY 671 given by FELLER (1966, p. 477) and using (1), we may write the characteristic function of the distribution of x as From the manner of definition of zk and x, we know that they must have symmetrical distributions and expectation of zero. This results from the fact that we define ztgand x using two randomly selected cistrons. The two sequences of selection of any two cistrons are equally likely. Since E(z) = 0, the central moments of x are the same as the moments about the origin. We may obtain the moments of the distribution of z from the derivatives of (3), using well known properties of derivatives of characteristic functions. Denote the nta moment of x at time t by u,(t), which may be found from (3) as where 2 = -1 and H(")(O,t) denotes the nth derivative of H(s,t) with respect to s evaluated at s = 0 (see for example EISEN 1969, p. 334). In theory, the probability-density function of z could be obtained by inversion of (3), though we shall not do so here. Substitution of (3) into (4) yields Var (qt) = 4NvVar (z,) (1 - et/sn) + Var(x,O), (5) where Var (XJ) denotes the variance of x at time t. In principle, we could obtain results analogous to (5) for higher moments of x. Now let mi and pi be the mobility and frequency in a population at time t of allelic state i. Here, we will suppress the variable t for convenience. The values u,~ may be written in terms of mi and pi as Some algebra shows that (6) is equivalent to U,, =Z Z pipj(mi - mj)". (6) a? where U, = z pjmir and is the rth moment about the origin of the electrophoretic mobilities, Equation (7) reveals that u,(t) = 0 for k = 1, 3, 5,... and also we have I Var(z,t) = 2Var(m,t), (8) where Var(m,t) denotes the variance of electrophoretic mobilities at time t. Substitution of (8) into (5) gives Var (m,t) = 2NvVar (x,) (I - &IZN) + Var (m,o) e-t/zx, (9) from which we obtain the steady state value Var(m,m) = 2NvVar(z,). (10)

4 6 72 G. COBBS Equation (9) reveals that the rate of approach to the equilibrium variance is slow. If a population is initially monomorphic for electrophoretic mobility, then Var(m,O) = 0, and it takes 4.6N generations to attain 90% of the equilibrium variance in electrophoretic mobility. SPECIAL CASES Step models: Here we assume that changes in mobility occur in discrete jumps of unit value and that a single-site difference may cause any number, up to a maximum of r, of such jumps. The probability that a single-site difference causes a mobility change of j units in the positive and negative directions will be denoted by c+? and c-j, respectively, for j = 1, 2,..,r. The probability that the site difference causes no change in mobility is co. In terms of our general theory, we write Prob(x, = i) = cj. This set of 2r + I values constitutes an empirical probability distribution of xl. Here, we shall assume that c+j = cq for all i in order to obtain symmetry of the distribution of xl. Some information on the actual values of cj for the case I = 2 may be found in genetic literature (MARSHALL and BROWN 1975; LI 1977). The variance of the distribution of x1 is easily found from the values of cj and use of (9) yields Var (m,t) = 2Nv ( ] =-I Z Pcj) ( 1 -&IzN) + Var (m,o) ctien. (11) For a population at equilibrium, the variance of mobility classes is then Var(m,w) = 2Nv I: jzcj. (12) If we allow only jumps of a single unit in either direction (r = 1). then (12) reduces to Var (m, 00) = 2Nv ( 1 -c,). This is very close to the result given by MORAN (1975) of 2(N-l)~. which he obtains from a slightly different model in which he assumes c, = 0. If r = 2 then (12) becomes Var(m, w) = ~NV[C-~+C+,+~(C-,+C+?)], which is the same as a result given by WEHR- HAHN (1975) for a model allowing one or two jumps. I we assume that the steps are of length w rather than one, we then have v J=-r Var(m,w) = 2Nv I =-r I: (jo)zcj. (13) Here, o may be regarded as the amount of change in the relative mobility of proteins. This is the most common method of expressing mobility differences of alloz ymes. Continuous models: Here we assume that changes in mobility are continuous in nature. The model follows closely that of the step model except that f(xl) is now considered to be a density function of a continuous random variable. We assume that f(xl) has the form

5 ELECTROPHORETIC MOBILITY 6 73 where fj (x,) is a density function and cj is the probability that z1 follows the density fj (xl). As in the step models, we assume that c-j = c+j for all j. The cj values here have essentially the same interpretation as they did in the step models. We further assume the density fj (zl) has mean pi and variance uj2 and that fj(xl) =f-j(-xl) for j = 1,2,...,I. (15) Thus. pj = -,p-j and ai2 = uq2. The characteristic function of fj (xl) and f(x1) are denoted by +] (s) and (~(s), respectively, and from (14) we have from which we obtain E(x,) = 0 and Var(x,) = 7. 1 =-r cj(pj2 +U,'). (17) Further simplification results from the reasonable assumption that pi = io. Here, is the expected value of the absolute value of the change in relative mobility caused by a random amino acid substitution that changes the number of ionizable groups per monomeric subunit by exactly one. This random variable will be denoted by w. The parameter pj represents the average effect on relative mobility of an amino acid substitution that changes i ionization groups per monomeric subunit of the protein. We may arbitrarily regard values of j as the number of acidic groups added minus the number of basic groups added. For example, consider the substitution lys + glu. This is a change from a monobasic amino acid to a monoacidic one; thus, j = 2. Using pli = io and (1 7) in (9) gives Var (m,t) = 2Nv, cj [ (io) + uj2] (1 + Var (m,o), ( 18) 3 =-r and at equilibrium 1- Var(m,m) =2Nv,E =-r cj [(jo)2+uj2]. (19) If = 1 and uj2 = 0 for all j, then the continuous model becomes the step model with steps of unit value, as (18) and (19) are then identical to (11) and (12). DISCUSSION Our results give a relation between mutations in the infinite-sites model with no recombination and mutations affecting electrophoretic mobility. These results may be used to gain some idea as to whether or not, for a given locus, the observed variation in mobility is in agreement with what is expected for neutral alleles. Here we will consider only the predicted variance for an equilibrium population. In order to calculate a predicted variance with (19), we need estimates of 0.0, and ci for i = 0, i.1, *2,.. fr. Values of cli for an average protein inay be estimated from knowledge of the genetic code, structure of amino acids and the amino acid composition of an average protein. MARSHALL and BROWN (1975) estimate (G. C-~, co, c+~, c+~) to be

6 6 74 G. COBBS (0.0059, , , , ) for an average protein. They also suggest that cj = 0 for lii > 2. Our niodel assumes that c+j = c-j, so that we average the estimates of c+j and c-j to obtain our estimates of c+j and c-j to obtain our estimates of c+j and c+ to obtain our estimates of c+, and c-j. This procedure gives the estimates (0.0075, , , , ), which we will use in our calculations. The value of 0 = 4Nu may be estimated from the number olf alleles occurring in a sample. The estimating procedure is described by EWENS (1972) for the infinite-alleles model, which. for our purposes, is equivalent to the infinite-sites model with no recombination (see WATTERSON 1975). Here, it is necessary to assume that all alleles present in a sample are detected even if they have negligible effect on electrophoretic mobility. Methods are available that detect at least some of the alleles that are not detected by routine methods of gel electro- phoresis (see MCDOWELL and PRAKASH 1976; COBBS and PRAKASH 1977a). In any case, we may obtain a minimum estimate of 6' from the number of alleles in a sample. According to the theory given by EWENS (1972), if more alleles are detected in a sample by some additional procedure, then the estimate of 6' will be increased; it cannot be decreased. We shall estimate 6' for the esterase-5 locus of Drosophila pseudoolbscura from the data o MCDOWELL and PRAKASH (1976) and for D. miranda from the data of COBBS and PRAKASH (1977a). The values of w and ai2 are unknown for nearly all loci. Preliminary estimates of these quantities for the esterase-5 locus of the Drosophila obscura species group are given by COBBS and PRAKASH (1977b). Their data suggests that the relation is approximately true, and we shall use it here. We shall also assume ao2 ui2. COBBS and PRAKASH (1977b) report data that give the estimates o = and ai2 = 0.000,841 for dimer mobilities at the esterase-5 locus. Using procedures discussed in the APPENDIX, an unbiased estimate of is found, using the data of COBBS and PRAKASH (1977b), to be With these values in (20) and using our values of cj equation (17) gives the estimate Var(xl) = 0.001,465 for the esterase-5 locus. The estimates of 0 and the predicted variance for D. pseudo- TABLE 1 Comparison of predicied and observed variance of mobility for the esterase-5 locus of Drosophila pseudoobscura and D. miranda D. pseudoobscura D. miranda ,006, , , ,998 0,002, ,

7 ELECTROPHORETIC MOBILITY 675 obscurg and D. miranda are given in Table 1. 'The observed variance of electrophorectic mobility is calculated from the data of PRAKASH (1977). It is desirable to have some idea ob the accuracy with which we can predict the mobility variance for an equilibrium population. The accuracy will depend on the error in estimation of all the underlying parameters. Here, we will give only an approximate method that may be used to analyze the agreement between the observed and predicted values for the mobility variance. The predicted variance and its sampling variance will be denoted by V, and Var (Vp), respectively. The observed mobility variance will be denoted by V,. Using (20) in (19) and substituting estimates for the population quantities yields A V, = BY (21) Var (v,) + ~zvar(6) +var(e^) Var (Y). Here 6 is the estimate of 8 obtained using methods given by EWENS (1972) and where U(Ul2) and U(d) are unbiased estimates of the population variance and the square of the population mean, respectively, of the random variable w. The quantities U(u12) and U(d) may be obtained from sample values of w. A formula for U ( ui2) is well known and a formula for U (Uz) is given in the APPENDIX. A sample of 13 observations of w for the esterase-5 locus may be obtained by multiplying each of the 13 observations in the rightmost column of Table 3 of COBBS and PRAKASH (1977b) by 2. These numbers may then be used to e-t' imate u12 and d. From (22) we obtain where Cov (.,.) denotes the covariance. Formulas for unbiased estimates of Var [U( u12) 1, Var [ U(d) 1 and Cov [U(U{), U (0')] are given in the APPENDIX. The sampling variance of the estimate 8 is found numerically using formula of EWENS (1972). The value of e is used in EWEN'S eq. (23) to generate the distribution ni, which is the probability that the sample has i alleles conditioned

8 676 G. COBBS on 0. For each value of i = 1,2,.., n - 1, we then calculate the value 0i, which is the estimate of 0 if i alleles occurred in a sample of n genes. The values ii are then considered a random variable with probabilities ~ i and, the variance is calculated and taken to be the sample variance of h. We now have the ability to calculate V, and Var (V,) using (21), (22) and (23). As a measure of disagreement between the predictions of the model and the observed mobility variation, we will use the difference V, - Vp. In order to roughly assess the significance of the difference V, - V,, we will examine the variable If the value of L falls within the interval (-2, +2), we shall conclude there is no significant difference between V, and V,. This is equivalent to the condition that V, fall withir? two standard deviation units of either side of V,. The expected value of L is probably not zero. but it should be close to zero if the model is being followed in nature. The procedure outlined here is approximate in that, among other things, it does not include the sample variance of V, nor error in estimates of the ci values. However, if a nonsignificant deviation is found with the above procedure, it would almost certainly bc nonsignificant in any procedure that did take into account the variance of Vo. The results of this analysis for the esterase-5 locus of D. pseudoobscura and D. miranda are given in Table 1. Here, we see that both species have a lower rariance than the model predicts, though in neither case is this deviation statistically significant by our test. This analysis is not intended to be a rigorous test of the model, and more precise statistical procedures are clearly needed. Further improvements of the analysis would also result from more accurate estimates of Q and ui2, The estimates used here are derived from the data of COBBS and PRAKASH (1977b). Their procedure used chemically modified proteins as an attempt to simulate the effect of amino acid substitutions in proteins. The accuracy of this procedure should be checked by estimating 0 and u12 from naturally occurring amino acid substitutions. We are currently using this procedure in our laboratory to obtain improved estimates of 0 and ut2. The models presented here should be useful in determining whether or not the processes of neutral mutation and genetic drift are important causes of genetic variation for electrophoretic mobility at protein-producing loci. LITERATURE CITED COBBS, 6. and S. PRAKASH, 1977a A comparatiie study of the esterase-5 locus in Drosophila pseudoobscura, D. persmilis and D. miranda. Genetics 85: , 1977b An experimental investigation of the unit charge model of protein polymorphism and its relation to the esterase-5 locus of Drosophila pseudoobscura, D. persimilis and D. miranda. Genetics 87: CRAMER, H., 1946 New Jersey. Mathematical Methods of Statistics. Princeton University Press, Princeton,

9 ELECTROPHORETIC MOBILITY 677 EISEN, W. J., 1972 Introduction to Mathematical Probability Theory. Prentice-Hall, Englewood Cliffs, New Jersey. EWENS, W. J., 1972 The sampling theory of selectively neutral alleles. Theoret. Pop. Biol. 3: FELLER, W., 1966 Introduction to Mathemrrtical Probability Theory and its Applications, vol. 11. Wiley, New York. KIMURA, M. and T. OHTA, 1975 Distribution of allelic frequencies in a finite population under stepwise production of neutral alleles. Proc. Natl. Acad. Sci. U.S. 72: LI, W.-H., 1977 Distribution of nucleotide differences between two randomly chosen cistrons in a finite population. Genetics 85: MARSHALL, D. R. and A. H. D. BROWN, 1975 The charge-state model of protein polymorphism in natural populations. J. Mol. Evol. 6: MCDOWELL, R. and S. PKAKASH, 1976 Allelic heterogeneity within allozymes separated by electrophoresis in Drosophila pseudoobscura. Proc. Natl. Acad. Sci. US. 73: MORAN, P. A. P., 1975 Wandering distributions and the electrophoretic profile. Theoret. Pop. Biol. 8: OHTA, T. and M. KIMURA, 1973 A model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a finite population. Genet. Res. 22 : PRAK~SH, S., 1977 Genetic divergence in closely related sibling species Drosophila pseudoobscura, D. persimilis, and D. miranda. Evolution 31: WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theoret. Pop. Biol. 7 : WEHRHAHN, C. F, 1975 The evolution of selectively similar electrophoretically detectable alleles in finite natural populations. Genetics 80: Corresponding editor: W. W. ANDERSON APPENDIX Consider a sample of n independent observations zl, z2, ----, I, from a population whose kth moment about zero is ak and whose kt central moment is Pk. It is possible to estimate the quantities ak and Pk, using the sample values. Unbiased estimates of a6 and PI, will be denoted by U(a,) and U(&), respectively. In most statistical literature U(al) and U(p,) are denoted by Z and s2, respectively. In general we shall denote an unbiased estimate of any population quantity, say z, by U(z). The sample moment about the origin will be denoted by 1 k a7( = - Z ~,k. It is well known that U(a,) = ak. Formulae for U(P,) far k= 1, 2, 3, 4 in n 2=1 terms of sample quantities are given by CRAMER (1946, p. 352). The purpose of this APPENDIX is to provide formulae for U(a12) and unbiased estimates of Var[U(a12)], Var[U(p,)] and Cov[U(a,Z), U(P,)]. Here Var(.) and Cov(.,.) denote the 1-ariance and covariance of the quantities within the brackets. The formula U(p,) = (L) n-1 (a2-alz) is well established and needs no further mention here. Given below are the expected values of some sample quantities that we shall find useful. These results may be obtained using procedures discussed in CRAMER (1946, p. 347). (n-i) a12 E (a12) = - + (AI 1 n (n-1) (n-2) (n-3) a14f6 (n-1 ) (n-2) a12 a,f3 (n-l)a22+4 (n-1) al a3+a4 E(a,*) =- (A21 n3

10 G 78 G. COBBS E(a,2 a,) = - (n-i) (n-2) a12 a2 + 2(n-1) a1 as + (n-i) a 2 + a4 n2 From equations (AI) through (A5), we obtain the results given below. U(a12) = a12-3- n n3 a14 - U(a4) 6U(alZa2) 3U(a22) + 4U(a, a,) U(a14) = (n-i) (n-2) (n-3) n - 3 (n-2) (n-3) n a2, - a4 U(a2*) = n-i ('46) (A71 (-48) na cz -a4 U((Y1a3) =-l n,- 1 These unbiased estimates will be useful in estimating the quantities Var ( U(al2)), Var(U (p2)) and cov (U(a,2), U(P2)). Beginning with the relation Var [U(al*)] = E[U(a,Z) - al2i2, substituting in (A6) and expanding and taking expectation, using (AI) through (A5), yields (n-i ) (n-2) ( n-3)a14+6 Var(U(a,z)) = (n-i ) (n-2) a12a2+3 (n-1 ) 01,~+4 (n-1 )a1a3+a4 ~ _. (All) n3 We may now obtain an unbiased estimate of Var(U(n,z)) by substituting unbiased estimates for the terms appearing on the right side of (All). Beginning with the relation Var(V(P,)) = E[U(P,) - P21z, substituting formulae for U@,) in terms of ak and P2 in terms of ak, expanding and then taking expectstions by using (AI) through (A5) gives n 2 ~ar[~(p,)l = (--) [~(a,2) - 2~(a,za,) + ~(a,4)1- ((Y2-a12)2. n-i In order to estimate Var[U(P2)], we use (A2), (A3) and (A4) to obtain an expression for Var[U(P2)] in terms of population quantities. We then substitute the unbiased estimates of the terms into the expression to obtain an unbiased estimate of Var[U(P,)]. Beginning with the relation covru(cu,~), U(P,)l = E[U(C12) U(P2)I P? and inserting the formulae for U(,,?) and U(P2) in terms of ak and for P2 in terms of afg and then expanding and taking expectations yields Cov(U(a12), U(P2)) = (E-)[? E(a,za,) -E(a14) --] E(a2') -in12a2+a14. (A121 n-i n We obtain an unbiased estimate of Cov[U(a12), U(P,)] from (Ale) using a procedure similar to that used for estimating Var[U(aIz)] and Var[U(p,)].

DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS 1N A F'INITE POPULATION'

DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS 1N A F'INITE POPULATION' DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS 1N A F'INITE POPULATION' WEN-HSIUNG LI Center for Demographic and Population Genetics, University of Texas Health Science Center,

More information

Genetic Variation in Finite Populations

Genetic Variation in Finite Populations Genetic Variation in Finite Populations The amount of genetic variation found in a population is influenced by two opposing forces: mutation and genetic drift. 1 Mutation tends to increase variation. 2

More information

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE Department of Biology, University of Penmyluania, Philadelphia, Pennsyluania 19174 Manuscript received September 17,

More information

The neutral theory of molecular evolution

The neutral theory of molecular evolution The neutral theory of molecular evolution Introduction I didn t make a big deal of it in what we just went over, but in deriving the Jukes-Cantor equation I used the phrase substitution rate instead of

More information

I of a gene sampled from a randomly mating popdation,

I of a gene sampled from a randomly mating popdation, Copyright 0 1987 by the Genetics Society of America Average Number of Nucleotide Differences in a From a Single Subpopulation: A Test for Population Subdivision Curtis Strobeck Department of Zoology, University

More information

Variances of the Average Numbers of Nucleotide Substitutions Within and Between Populations

Variances of the Average Numbers of Nucleotide Substitutions Within and Between Populations Variances of the Average Numbers of Nucleotide Substitutions Within and Between Populations Masatoshi Nei and Li Jin Center for Demographic and Population Genetics, Graduate School of Biomedical Sciences,

More information

VARIANCE AND COVARIANCE OF HOMOZYGOSITY IN A STRUCTURED POPULATION

VARIANCE AND COVARIANCE OF HOMOZYGOSITY IN A STRUCTURED POPULATION Copyright 0 1983 by the Genetics Society of America VARIANCE AND COVARIANCE OF HOMOZYGOSITY IN A STRUCTURED POPULATION G. B. GOLDING' AND C. STROBECK Deportment of Genetics, University of Alberta, Edmonton,

More information

LINKAGE DISEQUILIBRIUM, SELECTION AND RECOMBINATION AT THREE LOCI

LINKAGE DISEQUILIBRIUM, SELECTION AND RECOMBINATION AT THREE LOCI Copyright 0 1984 by the Genetics Society of America LINKAGE DISEQUILIBRIUM, SELECTION AND RECOMBINATION AT THREE LOCI ALAN HASTINGS Defartinent of Matheinntics, University of California, Davis, Calijornia

More information

GENETIC VARIABILITY AND RATE OF GENE SUBSTITUTION IN A FINITE POPULATION UNDER MUTATION AND FLUCTUATING SELECTION* NAOYUKI TAKAHATA ABSTRACT

GENETIC VARIABILITY AND RATE OF GENE SUBSTITUTION IN A FINITE POPULATION UNDER MUTATION AND FLUCTUATING SELECTION* NAOYUKI TAKAHATA ABSTRACT GENETC VARABLTY AND RATE OF GENE SUBSTTUTON N A FNTE POPULATON UNDER MUTATON AND FLUCTUATNG SELECTON* NAOYUK TAKAHATA National nstitute of Genetics, Mishima, 411 Japan ABSTRACT By using a numerical method

More information

Gene regulation: From biophysics to evolutionary genetics

Gene regulation: From biophysics to evolutionary genetics Gene regulation: From biophysics to evolutionary genetics Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen Johannes Berg Stana Willmann Curt Callan (Princeton)

More information

A TAXONOMIC APPROACH TO EVALUATION OF THE CHARGE STATE MODEL USING TWELVE SPECIES OF SEA ANEMONE ABSTRACT

A TAXONOMIC APPROACH TO EVALUATION OF THE CHARGE STATE MODEL USING TWELVE SPECIES OF SEA ANEMONE ABSTRACT Copyright 0 1983 by the Genetics Society of America A TAXONOMIC APPROACH TO EVALUATION OF THE CHARGE STATE MODEL USING TWELVE SPECIES OF SEA ANEMONE STEVEN A. McCOMMAS University of Houston Morine Science

More information

Australian bird data set comparison between Arlequin and other programs

Australian bird data set comparison between Arlequin and other programs Australian bird data set comparison between Arlequin and other programs Peter Beerli, Kevin Rowe March 7, 2006 1 Data set We used a data set of Australian birds in 5 populations. Kevin ran the program

More information

Sequence evolution within populations under multiple types of mutation

Sequence evolution within populations under multiple types of mutation Proc. Natl. Acad. Sci. USA Vol. 83, pp. 427-431, January 1986 Genetics Sequence evolution within populations under multiple types of mutation (transposable elements/deleterious selection/phylogenies) G.

More information

Neutral behavior of shared polymorphism

Neutral behavior of shared polymorphism Proc. Natl. Acad. Sci. USA Vol. 94, pp. 7730 7734, July 1997 Colloquium Paper This paper was presented at a colloquium entitled Genetics and the Origin of Species, organized by Francisco J. Ayala (Co-chair)

More information

The Wright-Fisher Model and Genetic Drift

The Wright-Fisher Model and Genetic Drift The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population

More information

MOLECULAR EVOLUTION AND POLYMORPHISM IN A RANDOM ENVIRONMENT

MOLECULAR EVOLUTION AND POLYMORPHISM IN A RANDOM ENVIRONMENT MOLECULAR EVOLUTION AND POLYMORPHISM IN A RANDOM ENVIRONMENT JOHN H. GILLESPIE Department of Biology. Uniuersity of Pennsylvania, Philadelphia, Pennsylvania 19104 Manuscript received June 8, 1978 Revised

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research education use, including for instruction at the authors institution

More information

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity,

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity, AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity, Today: Review Probability in Populatin Genetics Review basic statistics Population Definition

More information

BIOL Evolution. Lecture 9

BIOL Evolution. Lecture 9 BIOL 432 - Evolution Lecture 9 J Krause et al. Nature 000, 1-4 (2010) doi:10.1038/nature08976 Selection http://www.youtube.com/watch?v=a38k mj0amhc&feature=playlist&p=61e033 F110013706&index=0&playnext=1

More information

- per ordinary gene locus (GELBART and

- per ordinary gene locus (GELBART and Copyright 0 1989 by the Genetics Society of America Time for Spreading of Compensatory Mutations Under Gene Duplication Tomoko Ohta National Institute ofgenetics, Mishima 41 1, Japan Manuscript received

More information

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) 12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²

More information

Long-Term Response and Selection limits

Long-Term Response and Selection limits Long-Term Response and Selection limits Bruce Walsh lecture notes Uppsala EQG 2012 course version 5 Feb 2012 Detailed reading: online chapters 23, 24 Idealized Long-term Response in a Large Population

More information

Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data

Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data Proc. Natl. Acad. Sci. USA Vol. 95, pp. 5456 546, May 998 Statistics Application of a time-dependent coalescence process for inferring the history of population size changes from DNA sequence data ANDRZEJ

More information

Statistical population genetics

Statistical population genetics Statistical population genetics Lecture 7: Infinite alleles model Xavier Didelot Dept of Statistics, Univ of Oxford didelot@stats.ox.ac.uk Slide 111 of 161 Infinite alleles model We now discuss the effect

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 007 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

Variance and Covariances of the Numbers of Synonymous and Nonsynonymous Substitutions per Site

Variance and Covariances of the Numbers of Synonymous and Nonsynonymous Substitutions per Site Variance and Covariances of the Numbers of Synonymous and Nonsynonymous Substitutions per Site Tatsuya Ota and Masatoshi Nei Institute of Molecular Evolutionary Genetics and Department of Biology, The

More information

Hubby and Lewontin on Protein Variation in Natural Populations: When Molecular Genetics Came to the Rescue of Population Genetics

Hubby and Lewontin on Protein Variation in Natural Populations: When Molecular Genetics Came to the Rescue of Population Genetics HIGHLIGHTED ARTICLE PERSPECTIVES Hubby and Lewontin on Protein Variation in Natural Populations: When Molecular Genetics Came to the Rescue of Population Genetics Brian Charlesworth,*,1 Deborah Charlesworth,*

More information

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:

More information

Massachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution

Massachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution Massachusetts Institute of Technology 6.877 Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution 1. Rates of amino acid replacement The initial motivation for the neutral

More information

NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS

NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS æ 2 NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS 19 May 2014 Variations neither useful nor injurious would not be affected by natural selection, and would be left either a fluctuating element, as perhaps

More information

MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei"

MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS. Masatoshi Nei MOLECULAR PHYLOGENY AND GENETIC DIVERSITY ANALYSIS Masatoshi Nei" Abstract: Phylogenetic trees: Recent advances in statistical methods for phylogenetic reconstruction and genetic diversity analysis were

More information

Lecture Notes: BIOL2007 Molecular Evolution

Lecture Notes: BIOL2007 Molecular Evolution Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits

More information

A simple genetic model with non-equilibrium dynamics

A simple genetic model with non-equilibrium dynamics J. Math. Biol. (1998) 36: 550 556 A simple genetic model with non-equilibrium dynamics Michael Doebeli, Gerdien de Jong Zoology Institute, University of Basel, Rheinsprung 9, CH-4051 Basel, Switzerland

More information

How robust are the predictions of the W-F Model?

How robust are the predictions of the W-F Model? How robust are the predictions of the W-F Model? As simplistic as the Wright-Fisher model may be, it accurately describes the behavior of many other models incorporating additional complexity. Many population

More information

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information # Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either

More information

ILLUSTRATIVE EXAMPLES OF PRINCIPAL COMPONENTS ANALYSIS

ILLUSTRATIVE EXAMPLES OF PRINCIPAL COMPONENTS ANALYSIS ILLUSTRATIVE EXAMPLES OF PRINCIPAL COMPONENTS ANALYSIS W. T. Federer, C. E. McCulloch and N. J. Miles-McDermott Biometrics Unit, Cornell University, Ithaca, New York 14853-7801 BU-901-MA December 1986

More information

to be tested with great accuracy. The contrast between this state

to be tested with great accuracy. The contrast between this state STATISTICAL MODELS IN BIOMETRICAL GENETICS J. A. NELDER National Vegetable Research Station, Wellesbourne, Warwick Received I.X.52 I. INTRODUCTION THE statistical models belonging to the analysis of discontinuous

More information

A Likelihood Approach to Populations Samples of Microsatellite Alleles

A Likelihood Approach to Populations Samples of Microsatellite Alleles Copyright 0 1997 by the Genetics Society of America A Likelihood Approach to Populations Samples of Microsatellite Alleles Rasmus Nielsen Department of Integrative Biology, University of California, Berkelqr,

More information

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009 Gene Genealogies Coalescence Theory Annabelle Haudry Glasgow, July 2009 What could tell a gene genealogy? How much diversity in the population? Has the demographic size of the population changed? How?

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 009 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Frequency Estimation Karin S. Dorman Department of Statistics Iowa State University August 28, 2006 Fundamental rules of genetics Law of Segregation a diploid parent is equally

More information

Neutral Theory of Molecular Evolution

Neutral Theory of Molecular Evolution Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation

More information

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility

SWEEPFINDER2: Increased sensitivity, robustness, and flexibility SWEEPFINDER2: Increased sensitivity, robustness, and flexibility Michael DeGiorgio 1,*, Christian D. Huber 2, Melissa J. Hubisz 3, Ines Hellmann 4, and Rasmus Nielsen 5 1 Department of Biology, Pennsylvania

More information

Population Structure

Population Structure Ch 4: Population Subdivision Population Structure v most natural populations exist across a landscape (or seascape) that is more or less divided into areas of suitable habitat v to the extent that populations

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

PROBABILITY OF FIXATION OF A MUTANT GENE IN A FINITE POPULATION WHEN SELECTIVE ADVANTAGE DECREASES WITH TIME1

PROBABILITY OF FIXATION OF A MUTANT GENE IN A FINITE POPULATION WHEN SELECTIVE ADVANTAGE DECREASES WITH TIME1 PROBABILITY OF FIXATION OF A MUTANT GENE IN A FINITE POPULATION WHEN SELECTIVE ADVANTAGE DECREASES WITH TIME1 MOT00 KIMURA AND TOMOKO OHTA National Institute of Genetics, Mishima, Japan Received December

More information

Theoretical Population Biology

Theoretical Population Biology Theoretical Population Biology 87 (013) 6 74 Contents lists available at SciVerse ScienceDirect Theoretical Population Biology journal homepage: www.elsevier.com/locate/tpb Genotype imputation in a coalescent

More information

A NONINFORMATIVE BAYESIAN APPROACH FOR TWO-STAGE CLUSTER SAMPLING

A NONINFORMATIVE BAYESIAN APPROACH FOR TWO-STAGE CLUSTER SAMPLING Sankhyā : The Indian Journal of Statistics Special Issue on Sample Surveys 1999, Volume 61, Series B, Pt. 1, pp. 133-144 A OIFORMATIVE BAYESIA APPROACH FOR TWO-STAGE CLUSTER SAMPLIG By GLE MEEDE University

More information

Natural Selection results in increase in one (or more) genotypes relative to other genotypes.

Natural Selection results in increase in one (or more) genotypes relative to other genotypes. Natural Selection results in increase in one (or more) genotypes relative to other genotypes. Fitness - The fitness of a genotype is the average per capita lifetime contribution of individuals of that

More information

Fitness landscapes and seascapes

Fitness landscapes and seascapes Fitness landscapes and seascapes Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen: Cross-species analysis of bacterial promoters, Nonequilibrium evolution of

More information

(Write your name on every page. One point will be deducted for every page without your name!)

(Write your name on every page. One point will be deducted for every page without your name!) POPULATION GENETICS AND MICROEVOLUTIONARY THEORY FINAL EXAMINATION (Write your name on every page. One point will be deducted for every page without your name!) 1. Briefly define (5 points each): a) Average

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

A MARKOV PROCESS OF GENE FREQUENCY CHANGE IN A

A MARKOV PROCESS OF GENE FREQUENCY CHANGE IN A A MARKOV PROCESS OF GENE FREQUENCY CHANGE IN A GEOGRAPHICALLY STRUCTURED POPULATION TAKE0 MARUYAMA National Instituie of Genetics, Mishima, Japan and Cenier for Demographic and Population Genetics, University

More information

Problems for 3505 (2011)

Problems for 3505 (2011) Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.

More information

122 9 NEUTRALITY TESTS

122 9 NEUTRALITY TESTS 122 9 NEUTRALITY TESTS 9 Neutrality Tests Up to now, we calculated different things from various models and compared our findings with data. But to be able to state, with some quantifiable certainty, that

More information

EVOLUTIONARY DISTANCE MODEL BASED ON DIFFERENTIAL EQUATION AND MARKOV PROCESS

EVOLUTIONARY DISTANCE MODEL BASED ON DIFFERENTIAL EQUATION AND MARKOV PROCESS August 0 Vol 4 No 005-0 JATIT & LLS All rights reserved ISSN: 99-8645 wwwjatitorg E-ISSN: 87-95 EVOLUTIONAY DISTANCE MODEL BASED ON DIFFEENTIAL EUATION AND MAKOV OCESS XIAOFENG WANG College of Mathematical

More information

Series Solutions. 8.1 Taylor Polynomials

Series Solutions. 8.1 Taylor Polynomials 8 Series Solutions 8.1 Taylor Polynomials Polynomial functions, as we have seen, are well behaved. They are continuous everywhere, and have continuous derivatives of all orders everywhere. It also turns

More information

Introduction to population genetics & evolution

Introduction to population genetics & evolution Introduction to population genetics & evolution Course Organization Exam dates: Feb 19 March 1st Has everybody registered? Did you get the email with the exam schedule Summer seminar: Hot topics in Bioinformatics

More information

Statistical Tests for Detecting Positive Selection by Utilizing High. Frequency SNPs

Statistical Tests for Detecting Positive Selection by Utilizing High. Frequency SNPs Statistical Tests for Detecting Positive Selection by Utilizing High Frequency SNPs Kai Zeng *, Suhua Shi Yunxin Fu, Chung-I Wu * * Department of Ecology and Evolution, University of Chicago, Chicago,

More information

Evolution and maintenance of quantitative genetic variation by mutations

Evolution and maintenance of quantitative genetic variation by mutations Proc. Nad. Acad. Sci. USA Vol. 84, pp. 6205-6209, September 1987 Evolution Evolution and maintenance of quantitative genetic variation by mutations (founder populations/drift/equilibria/multiple alleles/additive

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA Human Population Genomics Outline 1 2 Damn the Human Genomes. Small initial populations; genes too distant; pestered with transposons;

More information

NORMAL CHARACTERIZATION BY ZERO CORRELATIONS

NORMAL CHARACTERIZATION BY ZERO CORRELATIONS J. Aust. Math. Soc. 81 (2006), 351-361 NORMAL CHARACTERIZATION BY ZERO CORRELATIONS EUGENE SENETA B and GABOR J. SZEKELY (Received 7 October 2004; revised 15 June 2005) Communicated by V. Stefanov Abstract

More information

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis Introduction to Time Series Analysis 1 Contents: I. Basics of Time Series Analysis... 4 I.1 Stationarity... 5 I.2 Autocorrelation Function... 9 I.3 Partial Autocorrelation Function (PACF)... 14 I.4 Transformation

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 Homework 4 (version 3) - posted October 3 Assigned October 2; Due 11:59PM October 9 Problem 1 (Easy) a. For the genetic regression model: Y

More information

THE SAMPLING DISTRIBUTION OF LINKAGE DISEQUILIBRIUM UNDER AN INFINITE ALLELE MODEL WITHOUT SELECTION ABSTRACT

THE SAMPLING DISTRIBUTION OF LINKAGE DISEQUILIBRIUM UNDER AN INFINITE ALLELE MODEL WITHOUT SELECTION ABSTRACT Copyright 1985 by the Genetics Society of America THE SAMPLING DISTRIBUTION OF LINKAGE DISEQUILIBRIUM UNDER AN INFINITE ALLELE MODEL WITHOUT SELECTION RICHARD R. HUDSON Biometry and Risk Assessment Program,

More information

DANIEL GIANOLA. Manuscript received May 29, 1979 Revised copy received October 29, 1979 ABSTRACT

DANIEL GIANOLA. Manuscript received May 29, 1979 Revised copy received October 29, 1979 ABSTRACT HERITABILITY OF POLYCHOTOMOUS CHARACTERS DANIEL GIANOLA Department of Animal Science, University of Illinois, Urbana 61801 Manuscript received May 29, 1979 Revised copy received October 29, 1979 ABSTRACT

More information

Convergence of Square Root Ensemble Kalman Filters in the Large Ensemble Limit

Convergence of Square Root Ensemble Kalman Filters in the Large Ensemble Limit Convergence of Square Root Ensemble Kalman Filters in the Large Ensemble Limit Evan Kwiatkowski, Jan Mandel University of Colorado Denver December 11, 2014 OUTLINE 2 Data Assimilation Bayesian Estimation

More information

THE THEORY OF GENETIC DISTANCE AND EVOLUTION OF HUMAN RACES 1. (The Japan Society of Human Genetics Award Lecture) MasatoshiNEI

THE THEORY OF GENETIC DISTANCE AND EVOLUTION OF HUMAN RACES 1. (The Japan Society of Human Genetics Award Lecture) MasatoshiNEI Jap. o r. Human Genet. 23, 341-369, 1978 THE THEORY OF GENETIC DISTANCE AND EVOLUTION OF HUMAN RACES 1 (The Japan Society of Human Genetics Award Lecture) MasatoshiNEI Center for Demographic and Population

More information

I genetic systems in an effort to understand the structure of natural populations.

I genetic systems in an effort to understand the structure of natural populations. ON TREATING THE CHROMOSOME AS THE UNIT OF SELECTION* MONTGOMERY SLATKIN Department of Theoretical Biology, Uniuersity of Chicago, Chicago, Illinois 60637 Manuscript received July 19, 1971 ABSTRACT A simple

More information

Population Genetics and Evolution III

Population Genetics and Evolution III Population Genetics and Evolution III The Mechanisms of Evolution: Drift São Paulo / January 2019 SMRI (Italy) luca@peliti.org 1 Drift 2 The Population Genetics Triad Sewall Wright Ronald A. Fisher Motoo

More information

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8 The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as

More information

Closed-form sampling formulas for the coalescent with recombination

Closed-form sampling formulas for the coalescent with recombination 0 / 21 Closed-form sampling formulas for the coalescent with recombination Yun S. Song CS Division and Department of Statistics University of California, Berkeley September 7, 2009 Joint work with Paul

More information

Introduction to Linkage Disequilibrium

Introduction to Linkage Disequilibrium Introduction to September 10, 2014 Suppose we have two genes on a single chromosome gene A and gene B such that each gene has only two alleles Aalleles : A 1 and A 2 Balleles : B 1 and B 2 Suppose we have

More information

Lecture 3 - Expectation, inequalities and laws of large numbers

Lecture 3 - Expectation, inequalities and laws of large numbers Lecture 3 - Expectation, inequalities and laws of large numbers Jan Bouda FI MU April 19, 2009 Jan Bouda (FI MU) Lecture 3 - Expectation, inequalities and laws of large numbersapril 19, 2009 1 / 67 Part

More information

Biochemistry. Biochemical Techniques. 01 Electrophoresis : Basic Concepts

Biochemistry. Biochemical Techniques. 01 Electrophoresis : Basic Concepts Description of Module Subject Name Paper Name 12 Module Name/Title 01 Electrophoresis: Basic Concept 1. Objectives 1.1 To understand basic concept of electrophoresis 1.2 To explain what determines charge

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Evolutionary Theory Mathematical and Conceptual Foundations Sean H. Rice Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Contents Preface ix Introduction 1 CHAPTER 1 Selection on One

More information

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to 1 1 1 1 1 1 1 1 0 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution

More information

Estimating selection on non-synonymous mutations. Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh,

Estimating selection on non-synonymous mutations. Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Genetics: Published Articles Ahead of Print, published on November 19, 2005 as 10.1534/genetics.105.047217 Estimating selection on non-synonymous mutations Laurence Loewe 1, Brian Charlesworth, Carolina

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

Hooke s law and its consequences 1

Hooke s law and its consequences 1 AOE 354 Hooke s law and its consequences Historically, the notion of elasticity was first announced in 676 by Robert Hooke (635 73) in the form of an anagram, ceiinosssttuv. He explained it in 678 as Ut

More information

Population Genetics: a tutorial

Population Genetics: a tutorial : a tutorial Institute for Science and Technology Austria ThRaSh 2014 provides the basic mathematical foundation of evolutionary theory allows a better understanding of experiments allows the development

More information

Observation: we continue to observe large amounts of genetic variation in natural populations

Observation: we continue to observe large amounts of genetic variation in natural populations MUTATION AND GENETIC VARIATION Observation: we continue to observe large amounts of genetic variation in natural populations Problem: How does this variation arise and how is it maintained. Here, we look

More information

Genetic hitch-hiking in a subdivided population

Genetic hitch-hiking in a subdivided population Genet. Res., Camb. (1998), 71, pp. 155 160. With 3 figures. Printed in the United Kingdom 1998 Cambridge University Press 155 Genetic hitch-hiking in a subdivided population MONTGOMERY SLATKIN* AND THOMAS

More information

Quantitative trait evolution with mutations of large effect

Quantitative trait evolution with mutations of large effect Quantitative trait evolution with mutations of large effect May 1, 2014 Quantitative traits Traits that vary continuously in populations - Mass - Height - Bristle number (approx) Adaption - Low oxygen

More information

1 of 7 7/16/2009 6:12 AM Virtual Laboratories > 7. Point Estimation > 1 2 3 4 5 6 1. Estimators The Basic Statistical Model As usual, our starting point is a random experiment with an underlying sample

More information

(Genome-wide) association analysis

(Genome-wide) association analysis (Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 013 Population genetics Outline of lectures 3-6 1. We ant to kno hat theory says about the reproduction of genotypes in a population. This results

More information

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important? Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population

More information

Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates

Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates JOSEPH FELSENSTEIN Department of Genetics SK-50, University

More information

The concept of breeding value. Gene251/351 Lecture 5

The concept of breeding value. Gene251/351 Lecture 5 The concept of breeding value Gene251/351 Lecture 5 Key terms Estimated breeding value (EB) Heritability Contemporary groups Reading: No prescribed reading from Simm s book. Revision: Quantitative traits

More information

Chapter 2. Some Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables

Chapter 2. Some Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables Chapter 2 Some Basic Probability Concepts 2.1 Experiments, Outcomes and Random Variables A random variable is a variable whose value is unknown until it is observed. The value of a random variable results

More information

Understanding relationship between homologous sequences

Understanding relationship between homologous sequences Molecular Evolution Molecular Evolution How and when were genes and proteins created? How old is a gene? How can we calculate the age of a gene? How did the gene evolve to the present form? What selective

More information

8. Genetic Diversity

8. Genetic Diversity 8. Genetic Diversity Many ways to measure the diversity of a population: For any measure of diversity, we expect an estimate to be: when only one kind of object is present; low when >1 kind of objects

More information

A Few Notes on Fisher Information (WIP)

A Few Notes on Fisher Information (WIP) A Few Notes on Fisher Information (WIP) David Meyer dmm@{-4-5.net,uoregon.edu} Last update: April 30, 208 Definitions There are so many interesting things about Fisher Information and its theoretical properties

More information

WITH WEAK INTERACTION

WITH WEAK INTERACTION EQUILIBRIA FOR GENETIC SYSTEMS WITH WEAK INTERACTION SAMUEL KARLIN WEIZMANN INSTITUTE OF SCIENCE, ISRAEL and STANFORD UNIVERSITY and JAMES McGREGOR STANFORD UNIVERSITY 1. Introduction The following principle

More information

Lecture Notes: Markov chains

Lecture Notes: Markov chains Computational Genomics and Molecular Biology, Fall 5 Lecture Notes: Markov chains Dannie Durand At the beginning of the semester, we introduced two simple scoring functions for pairwise alignments: a similarity

More information

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection CHAPTER 23 THE EVOLUTIONS OF POPULATIONS Section C: Genetic Variation, the Substrate for Natural Selection 1. Genetic variation occurs within and between populations 2. Mutation and sexual recombination

More information