DISTRIBUTION OF NUCLEOTIDE DIFFERENCES BETWEEN TWO RANDOMLY CHOSEN CISTRONS IN A FINITE POPULATION
WEN-HSIUNG LI

Center for Demographic and Population Genetics, University of Texas Health Science Center, Houston, Texas

Manuscript received October 2, 1975
Revised copy received August 16, 1975

ABSTRACT

WATTERSON'S (1975) formula for the steady-state distribution of the number of nucleotide differences between two randomly chosen cistrons in a finite population has been extended to transient states. The mean of this distribution approaches its equilibrium value at rate $1/2N$, independent of the mutation rate, whereas the rate for the variance depends on the mutation rate; here $N$ denotes the effective population size. Numerical computations show that if the heterozygosity (i.e., the probability that two cistrons are different) is low, say of the order of 0.1 or less, the probability that two cistrons differ at two or more nucleotide sites is less than 10 percent of the heterozygosity, whereas this probability may be as high as 50 percent of the heterozygosity if the heterozygosity is 0.5. A simple estimate for the mean number ($\bar{d}$) of site differences between cistrons is $\bar{d} = h/(1 - h)$, where $h$ is the heterozygosity. At equilibrium, the probability that two cistrons differ by more than one site is equal to $h^2$, the square of the heterozygosity.

In a pioneering work, KIMURA (1969) studied the number of heterozygous nucleotide sites per individual in a randomly mating population, assuming that sites are independent. Note that, under random mating, the number of heterozygous sites at a locus in a randomly chosen individual is equivalent to the number of nucleotide differences between two randomly chosen cistrons. Recently, EWENS (1974) found that, for independent sites with Poisson mutations, this number would be exactly Poisson distributed.
On the other hand, WATTERSON (1975) has shown that this number follows approximately a geometric distribution if there is no recombination between sites. All these authors were concerned only with the steady state, and no one seems to have studied this number in transient states. The main purpose of this communication is to study the distribution of this number in transient states under the assumption of no recombination between sites. This assumption is more reasonable than that of independent sites, since my main interest is the nucleotide differences between cistrons or amino acid differences between proteins. I shall follow the method used by WEHRHAHN (1975) and LI (1976). One alternative is the method used by WATTERSON (1975). No selection will be considered in this study.

This study was supported by Public Health Service Grant GM

Genetics 85: February, 1977
BASIC THEORY

Consider a randomly mating population of effective size $N$. I assume that, in each generation, a cistron either mutates with probability $v$ at one of the nonsegregating sites or remains unchanged with probability $1 - v$. I use the model of infinite sites (KIMURA 1971), in which I assume that no two mutations ever occur at the same site (even in different cistrons).

Suppose that two cistrons $A$ and $A'$ at present were derived from the replication of a cistron $s$ generations ago, and let $P_k^*(s) = \Pr\{j_s = k\}$ be the probability that the number ($j$) of nucleotide differences between $A$ and $A'$ at present is $k$. To compute $P_k^*(s)$ we note that in passing from the previous generation to the present generation three possibilities can occur: (1) $A$ and $A'$ differed at $k$ sites in the previous generation and no mutation occurred to them in coming to the present generation, (2) they differed at $k - 1$ sites and either of them gained one mutation, and (3) they differed at $k - 2$ sites and each of them gained one mutation. Since the first event occurs with probability $(1 - v)^2$, the second with probability $2v(1 - v)$, and the third with probability $v^2$,

$$P_k^*(s) = (1 - v)^2 P_k^*(s-1) + 2v(1 - v) P_{k-1}^*(s-1) + v^2 P_{k-2}^*(s-1) \approx (1 - 2v) P_k^*(s-1) + 2v P_{k-1}^*(s-1),$$

neglecting terms of order $v^2$. It follows that the probability generating function (pgf) for the $P_k^*$ values is

$$H(Z,s) = \sum_{k=0}^{\infty} P_k^*(s) Z^k \approx \sum_{k=0}^{\infty} \left[(1 - 2v) P_k^*(s-1) + 2v P_{k-1}^*(s-1)\right] Z^k = (1 - 2v + 2vZ) H(Z,s-1) \approx e^{2vs(Z-1)}. \tag{1}$$

Equation (1) holds exactly if the number of mutations per cistron per generation follows the Poisson law with mean $v$. Note also that equation (1) is equivalent to approximating the discrete time model by a continuous time model. For mathematical ease, I shall consider a continuous time model instead of the discrete time model.
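The approximation in equation (1) can be checked numerically. The following is a minimal sketch, not part of the original paper: it iterates the one-generation recursion with terms of order $v^2$ dropped and compares the result with a Poisson distribution of mean $2vs$. The function names, the truncation point `kmax`, and the parameter values are illustrative assumptions.

```python
import math

def iterate_recursion(v, s, kmax=30):
    """Iterate P*_k(s) = (1 - 2v) P*_k(s-1) + 2v P*_{k-1}(s-1) for s generations."""
    # Two fresh copies of one cistron differ at 0 sites.
    p = [1.0] + [0.0] * kmax
    for _ in range(s):
        new = [0.0] * (kmax + 1)
        for k in range(kmax + 1):
            new[k] = (1 - 2 * v) * p[k]
            if k > 0:
                new[k] += 2 * v * p[k - 1]
        p = new
    return p

def poisson_pmf(mean, kmax=30):
    """Poisson probabilities for k = 0..kmax, the coefficients of exp(mean * (Z - 1))."""
    return [math.exp(-mean) * mean ** k / math.factorial(k) for k in range(kmax + 1)]

# Hypothetical values chosen only for illustration; here 2vs = 1.
v, s = 1e-4, 5000
exact = iterate_recursion(v, s)
approx = poisson_pmf(2 * v * s)
max_gap = max(abs(a - b) for a, b in zip(exact, approx))
print(f"largest pointwise gap between recursion and Poisson: {max_gap:.2e}")
```

For small $v$ the gap is small compared with the probabilities themselves, which is the sense in which the discrete model may be replaced by the continuous one.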
Let $P_k(t) = \Pr\{d_t = k\}$ denote the probability that in generation $t$ the number ($d$) of site differences between two randomly chosen cistrons is $k$, and let the pgf for the $P_k$ values be

$$G(Z,t) = \sum_{k=0}^{\infty} P_k(t) Z^k.$$

The probability that two cistrons chosen randomly at time $t$ were not derived from the replication of a common cistron at any time since generation 0 is given by $f(t) = (1 - 1/2N)^t$ for a discrete time model, or $f(t) = e^{-t/2N}$ for a continuous time model. In this case, the pgf of the distribution of $d$ is $H(Z,t)G(Z,0)$. This follows from the fact that the number of site differences between the two cistrons is the sum of their initial differences and new mutations and from the fact that the pgf of the sum of two
independent random variables is the product of their pgf's (cf. FELLER 1968). On the other hand, the probability that two cistrons chosen randomly at present were derived from the replication of a cistron $s$ generations ago is given by $F'(s) = dF(s)/ds = e^{-s/2N}/2N$, since $f(t) + \int_0^t F'(s)\,ds = 1$. (Note that $F(t) = 1 - f(t)$ is Wright's inbreeding coefficient in the absence of mutation.) In this case the pgf of the distribution of $d$ is given by equation (1). Thus,

$$G(Z,t) = \int_0^t F'(s) H(Z,s)\,ds + f(t) H(Z,t) G(Z,0) = \frac{e^{a(Z)t} - 1}{2N a(Z)} + e^{a(Z)t} G(Z,0), \tag{2}$$

where $a(Z) = -\lambda + 2vZ$ and $\lambda = 1/2N + 2v$. Note that

$$G(Z,\infty) = \frac{-1}{2N a(Z)} = \frac{1}{1 + \theta - \theta Z}, \tag{3}$$

where $\theta = 4Nv$. That is, at steady state $d$ follows a geometric distribution and

$$P_k(\infty) = \frac{1}{1 + \theta} \left(\frac{\theta}{1 + \theta}\right)^k. \tag{4}$$

Formulas (3) and (4) are identical with formulas (2.14) and (1.8) of WATTERSON (1975). It is also easy to see that

$$G(Z,t) = G(Z,\infty) + e^{a(Z)t} \left[G(Z,0) - G(Z,\infty)\right].$$

Therefore,

$$P_k(t) = P_k(\infty) + e^{-\lambda t} \sum_{j=0}^{k} \frac{(2vt)^{k-j}}{(k-j)!} \left[P_j(0) - P_j(\infty)\right].$$

In particular, the homozygosity is given by

$$P_0(t) = P_0(\infty) + e^{-\lambda t} \left[P_0(0) - P_0(\infty)\right],$$
which is standard (MALECOT 1948). The mean and variance of $d_t$ are given by

$$\bar{d}_t = \left.\frac{\partial G(Z,t)}{\partial Z}\right|_{Z=1} = \theta + (\bar{d}_0 - \theta)\, e^{-t/2N},$$

$$\mathrm{Var}(d_t) = \left.\frac{\partial}{\partial Z}\left[Z \frac{\partial G(Z,t)}{\partial Z}\right]\right|_{Z=1} - \bar{d}_t^{\,2}.$$

It is interesting to note that the rate for the mean of $d$ to approach its equilibrium value is $1/2N$ and independent of mutation rate, while that for the variance is retarded by mutation. At steady state

$$\bar{d}_\infty = \theta, \tag{6a}$$

$$\mathrm{Var}(d_\infty) = \theta + \theta^2, \tag{7a}$$

which agree with WATTERSON'S (1975) formula (1.10). For a comparison of (6a) and (7a) with the results of KIMURA (1969) and EWENS (1974), readers may refer to WATTERSON (1975).

DISCUSSION

In the derivation of equation (2), I reasoned from time 0 to $t$. A simpler derivation is to reason from time $t - 1$ to $t$. The argument is briefly as follows. At generation $t$, the pgf for the number of site differences between two randomly chosen cistrons is given by $g(Z) = H(Z,1)$ if they came from a cistron in generation $t - 1$, but it is $G(Z,t-1)\,g(Z)$ if they came from two cistrons in generation $t - 1$. Thus,

$$G(Z,t) = \left[\frac{1}{2N} + c\,G(Z,t-1)\right] g(Z),$$

where $c = 1 - 1/2N$ (see (2.3) of WATTERSON 1975). The solution of the above equation is

$$G(Z,t) = \frac{g(Z)\left\{1 - [c\,g(Z)]^t\right\}}{2N\left[1 - c\,g(Z)\right]} + [c\,g(Z)]^t\, G(Z,0). \tag{8}$$

Using the following two approximations, $g(Z) \approx 1$ and $c\,g(Z) - 1 \approx a(Z)$, equation (8) reduces to equation (2). Therefore, these two approaches lead to the same result.

Table 1 shows the probability that two randomly chosen cistrons are different, i.e., the heterozygosity ($h$), and its decomposition into the probabilities that they
differ by one nucleotide and by more than one, respectively, assuming that the initial population is completely homozygous.

TABLE 1. Distribution of site differences. Columns: $t$ = [first label illegible], $0.4N$, $4N$, $40N$ or $\infty$. Rows, for $4Nv = 0.1$ and for $4Nv = 1$: $k \geq 1$, $k = 1$, $k > 1$. [Numerical entries not recoverable.] The probability for $k \geq 1$ is equivalent to heterozygosity. $N$ denotes effective population size.

It is seen that if $4Nv$ is 0.1, the probability that two cistrons differ at two or more nucleotide sites is small, being less than 10 percent of the heterozygosity even at equilibrium. On the other hand, if $4Nv$ is 1, this probability is larger than 40 percent of the heterozygosity even as early as $t = 0.04N$ and constitutes 50 percent of the heterozygosity at equilibrium. Therefore, for a population with $4Nv$ of the order of 1 or larger, the actual genic variation may be considerably larger than that revealed by heterozygosity.

However, the mean number ($\bar{d}$) of site differences between cistrons may be estimated from heterozygosity ($h$). NEI (1975) used $\bar{d} = -\log_e(1 - h)$. Theoretically, $\bar{d} = 4Nv$ at equilibrium [see formula (6a)] for all models studied (KIMURA 1969, EWENS 1974, WATTERSON 1975), while $-\log_e(1 - h) = \log_e(1 + 4Nv)$ since $h = 4Nv/(1 + 4Nv)$ (KIMURA 1968). If $4Nv = 0.1$, $-\log_e(1 - h) = 0.095$, which gives only a 5 percent underestimate, but if $4Nv = 1$, $-\log_e(1 - h) = 0.693$, which gives a 30 percent underestimate. Although in practice NEI'S formula holds approximately, since $4Nv$ is usually of the order of 0.15 or less (NEI 1975), theoretically one should use $\bar{d} = h/(1 - h)$, since $h/(1 - h) = 4Nv$. Note that $h/(1 - h)$ is the ratio of heterozygosity to homozygosity. Note also that at equilibrium the probability that two cistrons differ by more than one site is equal to $\theta^2/(1 + \theta)^2$, which equals $h^2$ (see formula (4)). It also follows that the expected number of site differences between two cistrons is $\theta/h = 1 + \theta$, given the condition that they differ at least by one site.
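The quantities discussed above can be computed directly. The sketch below is an illustration under stated assumptions, not the author's program: it evaluates the transient distribution obtained by expanding equation (2) in powers of $Z$, namely $P_k(t) = P_k(\infty) + e^{-\lambda t} \sum_{j \le k} [(2vt)^{k-j}/(k-j)!]\,[P_j(0) - P_j(\infty)]$, with time measured in units of $N$ generations so that $\lambda t = (1 + \theta)\,t/2N$ and $2vt = \theta\, t/2N$. The function names, the truncation at `kmax`, and the default of a completely homozygous initial population are choices made here for illustration.

```python
import math

def equilibrium_pmf(theta, kmax):
    """Geometric steady state: P_k = (1/(1+theta)) * (theta/(1+theta))**k."""
    q = theta / (1.0 + theta)
    return [(1.0 - q) * q ** k for k in range(kmax + 1)]

def transient_pmf(theta, t_over_N, kmax=60, p0=None):
    """P_k(t) for k = 0..kmax; theta = 4Nv, t given in units of N generations."""
    tau = t_over_N / 2.0            # t / 2N
    decay = math.exp(-(1.0 + theta) * tau)   # exp(-lambda * t)
    m = theta * tau                 # 2vt, mean number of new mutations on the pair
    if p0 is None:
        p0 = [1.0] + [0.0] * kmax   # completely homozygous initial population
    peq = equilibrium_pmf(theta, kmax)
    diff = [a - b for a, b in zip(p0, peq)]
    pmf = []
    for k in range(kmax + 1):
        conv = sum(m ** (k - j) / math.factorial(k - j) * diff[j] for j in range(k + 1))
        pmf.append(peq[k] + decay * conv)
    return pmf

def summarize(theta, t_over_N):
    """Return (heterozygosity, Pr{k = 1}, Pr{k > 1}) at time t."""
    p = transient_pmf(theta, t_over_N)
    h = 1.0 - p[0]
    return h, p[1], h - p[1]

def nei_estimate(h):
    """Estimate of mean site differences used by NEI (1975): -log_e(1 - h)."""
    return -math.log(1.0 - h)

def ratio_estimate(h):
    """Estimate suggested in the text: h / (1 - h)."""
    return h / (1.0 - h)

for theta in (0.1, 1.0):
    h_eq = theta / (1.0 + theta)
    print(theta, summarize(theta, 4.0), nei_estimate(h_eq), ratio_estimate(h_eq))
```

At equilibrium the last component returned by `summarize` tends to $h^2$, and `ratio_estimate` recovers $4Nv$ while `nei_estimate` falls below it, in line with the comparison made in the text.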
The above argument is based on the assumption that allelic variants are identified at nucleotide or codon level (in the latter case, site refers to codon instead of nucleotide). In practice, however, genetic variation is mostly studied by electrophoresis. The model of stepwise change of electrophoretic mobility of protein has recently been studied fairly extensively (e.g., OHTA and KIMURA 1973, KING 1973, NEI and CHAKRABORTY 1973, OHTA and KIMURA 1974, WEHRHAHN 1975, KIMURA and OHTA 1975, LI 1976). It should be of interest to compare the results of the infinite site model with those of stepwise mutation models. For simplicity I shall consider only the steady-state values. I consider two electrophoretic models: (1) one-step model in which only one-step mutations can occur (cf. OHTA and
KIMURA 1973) and (2) two-step model in which two-step mutations as well as one-step mutations can occur (cf. WEHRHAHN 1975, LI 1976). In the electrophoretic models the mutation rate is assumed to be one-third of that of the infinite site model. In the two-step model, the proportion of two-step mutations is assumed to be 10 percent of all the mutations involving electrophoretic charge changes, though NEI and CHAKRABORTY'S (1973) results indicate that it is somewhat less than 10 percent.

TABLE 2. Equilibrium distributions of state differences for various models. Columns: $h$, $k = 1$, $k = 2$, $k \geq 3$. Rows, for $4Nv = 0.1$ and for $4Nv = 1$: infinite site model, one-step model, two-step model. [Numerical entries not recoverable, except $h = 0.5$ for the infinite site model at $4Nv = 1$.] The mutation rate (and therefore $4Nv$) in the one-step and two-step models is assumed to be one-third of that in the infinite site model. For details, see text.

The state differences in Table 2 refer to amino acid differences between proteins in the case of the infinite site model, while they refer to charge differences between proteins in the case of the one-step and two-step models. Note that if $4Nv = 0.1$, the underestimate of the heterozygosity at amino acid level by using electrophoretic data is mainly due to the underestimate occurring at the first class, i.e., $k = 1$. On the other hand, if $4Nv = 1$, the underestimate occurs mainly at the second class, i.e., $k = 2$, and the higher classes, i.e., $k \geq 3$. The two-step model gives only a slight improvement over the one-step model. Although the assumption of stepwise change of electrophoretic mobility may not be very realistic (cf. JOHNSON 1974), Table 2 gives us some rough estimates of the detectability of electrophoresis.

The present results may be extended to study gene differentiation between populations. A simple situation is as follows: A population splits into two populations at $t = 0$ and thereafter there is no migration between them.
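Let $D_k(t)$ be the probability that at generation $t$ two randomly chosen cistrons, one from each population, differ at $k$ sites. Then the pgf $D(Z,t) = \sum_{k=0}^{\infty} D_k(t) Z^k$ is given by

$$D(Z,t) = D(Z,0)\, H(Z,t), \tag{9}$$

where $D(Z,0) = \sum_k D_k(0) Z^k$ is the pgf for the ancestral population at the moment of splitting. It follows that

$$D_k(t) = \sum_{j=0}^{k} D_j(0)\, e^{-2vt}\, \frac{(2vt)^{k-j}}{(k-j)!}, \tag{10}$$

$$\bar{d}_t = \bar{d}_0 + 2vt, \tag{11}$$

$$\mathrm{Var}(d_t) = \mathrm{Var}(d_0) + 2vt. \tag{12}$$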
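Equation (9) says that the between-population distribution is the ancestral distribution convolved with a Poisson distribution of mean $2vt$. The sketch below illustrates this numerically under assumptions made here and not taken from the paper: the ancestral population is taken to be at the geometric steady state, and the values of `theta`, `v`, and `t`, the truncation `kmax`, and the function names are hypothetical.

```python
import math

def poisson_pmf(mean, kmax):
    """Coefficients of H(Z,t) = exp(2vt(Z - 1)) when mean = 2vt."""
    return [math.exp(-mean) * mean ** k / math.factorial(k) for k in range(kmax + 1)]

def between_population_pmf(d0, v, t, kmax=80):
    """Coefficients of D(Z,t) = D(Z,0) H(Z,t), truncated at kmax."""
    new_mutations = poisson_pmf(2 * v * t, kmax)
    out = [0.0] * (kmax + 1)
    for j, pj in enumerate(d0[: kmax + 1]):
        for i in range(kmax + 1 - j):
            out[j + i] += pj * new_mutations[i]
    return out

def mean_and_variance(pmf):
    """Mean and variance of a distribution on 0, 1, 2, ..."""
    mean = sum(k * p for k, p in enumerate(pmf))
    second = sum(k * k * p for k, p in enumerate(pmf))
    return mean, second - mean * mean

# Hypothetical ancestral population at steady state with theta = 4Nv = 0.5.
theta = 0.5
q = theta / (1.0 + theta)
d0 = [(1.0 - q) * q ** k for k in range(81)]

v, t = 1e-6, 250_000   # hypothetical values; 2vt = 0.5
m0, var0 = mean_and_variance(d0)
mt, vart = mean_and_variance(between_population_pmf(d0, v, t))
print(m0, var0, mt, vart)
```

Both the mean and the variance increase by the same amount, $2vt$, because the mutations accumulated after the split are Poisson distributed and independent of the ancestral differences.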
Note that the mean number of site differences between two cistrons, one from each population, is equal to the mean number ($\bar{d}_0$) of site differences in the ancestral population plus $2vt$, the amount of differentiation after separation. The latter component agrees with the results of NEI (1972).

I am greatly indebted to DR. M. NEI for valuable suggestions and discussions. Thanks also are due to DR. R. CHAKRABORTY for discussions. I thank a reviewer for valuable suggestions.

LITERATURE CITED

EWENS, W. J., 1974 A note on the sampling theory for infinite alleles and infinite sites models. Theor. Pop. Biol. 6:
FELLER, W., 1968 An Introduction to Probability Theory and Its Applications, 3rd ed. John Wiley and Sons, New York.
JOHNSON, G. B., 1974 On the estimation of effective number of alleles from electrophoretic data. Genetics 78:
KIMURA, M., 1968 Genetic variability maintained in a finite population due to mutational production of neutral and nearly neutral isoalleles. Genet. Res. 11:
KIMURA, M., 1969 The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutation. Genetics 61:
KIMURA, M., 1971 Theoretical foundation of population genetics at the molecular level. Theor. Pop. Biol. 2:
KIMURA, M. and T. OHTA, 1975 Distribution of allelic frequencies in a finite population under stepwise production of neutral alleles. Proc. Nat. Acad. Sci. U.S. 72:
KING, J. L., 1973 The probability of electrophoretic identity of proteins as a function of amino acid divergence. J. Molec. Evol. 2:
LI, W-H., 1976 Electrophoretic identity of proteins in a finite population and genetic distance between taxa. Genet. Res. 28:
MALECOT, G., 1948 Les Mathématiques de l'Hérédité. Masson et Cie, Paris.
NEI, M., 1972 Genetic distance between populations. Am. Nat. 106:
NEI, M., 1975 Molecular Population Genetics and Evolution. North-Holland Publishing Company, Amsterdam.
NEI, M. and R.
CHAKRABORTY, 1973 Genetic distance and electrophoretic identity of proteins between taxa. J. Molec. Evol. 2:
OHTA, T. and M. KIMURA, 1973 A model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a finite population. Genet. Res. 22:
OHTA, T. and M. KIMURA, 1974 Simulation studies on electrophoretically detectable genetic variability in a finite population. Genetics 76:
WATTERSON, G. A., 1975 On the number of segregating sites in genetical models without recombination. Theor. Pop. Biol. 7:
WEHRHAHN, C. F., 1975 The evolution of selectively similar electrophoretically detectable alleles in finite natural populations. Genetics 80:

Corresponding editor: J. F. CROW
August 0 Vol 4 No 005-0 JATIT & LLS All rights reserved ISSN: 99-8645 wwwjatitorg E-ISSN: 87-95 EVOLUTIONAY DISTANCE MODEL BASED ON DIFFEENTIAL EUATION AND MAKOV OCESS XIAOFENG WANG College of Mathematical
More information6 Introduction to Population Genetics
Grundlagen der Bioinformatik, SoSe 14, D. Huson, May 18, 2014 67 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,
More informationFebuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution. Classical vs. balanced views of genome structure
Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution Classical vs. balanced views of genome structure - the proposal of the neutral theory by Kimura in 1968 led to the so-called neutralist-selectionist
More informationreciprocal altruism by kin or group selection can be analyzed by using the same approach (6).
Proc. Nati. Acad. Sci. USA Vol. 81, pp. 6073-6077, October 1984 Evolution Group selection for a polygenic behavioral trait: Estimating the degree of population subdivision (altruism/kin selection/population
More informationp(d g A,g B )p(g B ), g B
Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)
More informationLikelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution
Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions
More information11. TEMPORAL HETEROGENEITY AND
GENETIC VARIATION IN A HETEROGENEOUS ENVIRONMENT. 11. TEMPORAL HETEROGENEITY AND DIRECTIONAL SELECTION1 PHILIP W. HEDRICK Division of Biological Sciences, University of Kansas, Lawrence, Kansas 66045 Manuscript
More informationShane s Simple Guide to F-statistics
Pop. g. stats2.doc - 9/3/05 Shane s Simple Guide to F-statistics Intro The aim here is simple & very focussed: im : a very brief introduction to the most common statistical methods of analysis of population
More informationEVOLUTIONARY DYNAMICS AND THE EVOLUTION OF MULTIPLAYER COOPERATION IN A SUBDIVIDED POPULATION
Friday, July 27th, 11:00 EVOLUTIONARY DYNAMICS AND THE EVOLUTION OF MULTIPLAYER COOPERATION IN A SUBDIVIDED POPULATION Karan Pattni karanp@liverpool.ac.uk University of Liverpool Joint work with Prof.
More informationMajor questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.
Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary
More informationExpected coalescence times and segregating sites in a model of glacial cycles
F.F. Jesus et al. 466 Expected coalescence times and segregating sites in a model of glacial cycles F.F. Jesus 1, J.F. Wilkins 2, V.N. Solferini 1 and J. Wakeley 3 1 Departamento de Genética e Evolução,
More informationEXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?
Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population
More informationThere are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.
EVOLUTIONARY BIOLOGY EXAM #1 Fall 2017 There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page. Part I. True (T) or False (F) (2 points each). Circle
More informationSupporting Information
Supporting Information Hammer et al. 10.1073/pnas.1109300108 SI Materials and Methods Two-Population Model. Estimating demographic parameters. For each pair of sub-saharan African populations we consider
More informationLecture 18 : Ewens sampling formula
Lecture 8 : Ewens sampling formula MATH85K - Spring 00 Lecturer: Sebastien Roch References: [Dur08, Chapter.3]. Previous class In the previous lecture, we introduced Kingman s coalescent as a limit of
More informationInbreeding depression due to stabilizing selection on a quantitative character. Emmanuelle Porcher & Russell Lande
Inbreeding depression due to stabilizing selection on a quantitative character Emmanuelle Porcher & Russell Lande Inbreeding depression Reduction in fitness of inbred vs. outbred individuals Outcrossed
More informationNeutral Theory of Molecular Evolution
Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation
More informationSEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION
Annu. Rev. Genomics Hum. Genet. 2003. 4:213 35 doi: 10.1146/annurev.genom.4.020303.162528 Copyright c 2003 by Annual Reviews. All rights reserved First published online as a Review in Advance on June 4,
More informationGene Pool Recombination in Genetic Algorithms
Gene Pool Recombination in Genetic Algorithms Heinz Mühlenbein GMD 53754 St. Augustin Germany muehlenbein@gmd.de Hans-Michael Voigt T.U. Berlin 13355 Berlin Germany voigt@fb10.tu-berlin.de Abstract: A
More informationA simple genetic model with non-equilibrium dynamics
J. Math. Biol. (1998) 36: 550 556 A simple genetic model with non-equilibrium dynamics Michael Doebeli, Gerdien de Jong Zoology Institute, University of Basel, Rheinsprung 9, CH-4051 Basel, Switzerland
More informationFrequency Spectra and Inference in Population Genetics
Frequency Spectra and Inference in Population Genetics Although coalescent models have come to play a central role in population genetics, there are some situations where genealogies may not lead to efficient
More information2. Map genetic distance between markers
Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,
More informationFitness landscapes and seascapes
Fitness landscapes and seascapes Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen: Cross-species analysis of bacterial promoters, Nonequilibrium evolution of
More information- point mutations in most non-coding DNA sites likely are likely neutral in their phenotypic effects.
January 29 th, 2010 Bioe 109 Winter 2010 Lecture 10 Microevolution 3 - random genetic drift - one of the most important shifts in evolutionary thinking over the past 30 years has been an appreciation of
More informationPopulation Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda
1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;
More informationRare Alleles and Selection
Theoretical Population Biology 59, 8796 (001) doi:10.1006tpbi.001.153, available online at http:www.idealibrary.com on Rare Alleles and Selection Carsten Wiuf Department of Statistics, University of Oxford,
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 013 Population genetics Outline of lectures 3-6 1. We ant to kno hat theory says about the reproduction of genotypes in a population. This results
More informationSpace Time Population Genetics
CHAPTER 1 Space Time Population Genetics I invoke the first law of geography: everything is related to everything else, but near things are more related than distant things. Waldo Tobler (1970) Spatial
More informationSTRONG balancing selection can result from over- approximation of Gillespie (1984, 1991). This approximation
Copyright 1999 by the Genetics Society of America Overdominant Alleles in a Population of Variable Size Montgomery Slatkin and Christina A. Muirhead Department of Integrative Biology, University of California,
More informationLecture Notes: BIOL2007 Molecular Evolution
Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits
More informationSelection and Population Genetics
Selection and Population Genetics Evolution by natural selection can occur when three conditions are satisfied: Variation within populations - individuals have different traits (phenotypes). height and
More informationON THE DIFFUSION OPERATOR IN POPULATION GENETICS
J. Appl. Math. & Informatics Vol. 30(2012), No. 3-4, pp. 677-683 Website: http://www.kcam.biz ON THE DIFFUSION OPERATOR IN POPULATION GENETICS WON CHOI Abstract. W.Choi([1]) obtains a complete description
More informationSequence Analysis 17: lecture 5. Substitution matrices Multiple sequence alignment
Sequence Analysis 17: lecture 5 Substitution matrices Multiple sequence alignment Substitution matrices Used to score aligned positions, usually of amino acids. Expressed as the log-likelihood ratio of
More informationIntroductory seminar on mathematical population genetics
Exercises Sheets Introductory seminar on mathematical population genetics WS 20/202 Kristan Schneider, Ada Akerman Ex Assume a single locus with alleles A and A 2 Denote the frequencies of the three (unordered
More informationStatistical population genetics
Statistical population genetics Lecture 7: Infinite alleles model Xavier Didelot Dept of Statistics, Univ of Oxford didelot@stats.ox.ac.uk Slide 111 of 161 Infinite alleles model We now discuss the effect
More informationChapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)
12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²
More informationThe Quantitative TDT
The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus
More information