On the informativeness of dominant and co-dominant genetic markers for Bayesian supervised clustering

Size: px
Start display at page:

Download "On the informativeness of dominant and co-dominant genetic markers for Bayesian supervised clustering"

Transcription

1 On the inormativeness o dominant and co-dominant genetic marers or Bayesian supervised clustering Gilles Guillot and Alexandra Carpentier-Sandalis September 24, 2010 Abstract We study the accuracy o Bayesian supervised method used to cluster individuals into genetically homogeneous groups on the basis o dominant or codominant molecular marers. We provide a ormula relating an error criterion the number o loci used and the number o clusters. This ormula is exact and holds or arbitrary number o clusters and marers. Our wor suggests that dominant marers studies can achieve an accuracy similar to that o codominant marers studies i the number o marers used in the ormer is about 1.7 times larger than in the latter. 1 1 Bacground A common problem in population genetics consists in assigning an individual to one o K populations on the basis o its genotype and inormation about the distribution o the various alleles in the K populations. This question has received a considerable attention in the population genetics and molecular ecology literature [1, 2, 3, 4] as it can provide important insight about gene low patterns and migration rates. It is or example widely used in epidemiology to detect the origin o a pathogens or o their hosts (see e.g. [5, 6, 7] or examples) or in conservation biology and population management to detect illegal trans-location or poaching [8]. See [9] or a review o related methods. In a statistical phrasing, assigning an individual to some nown clusters is a supervised clustering problem. This requires to observe the genotype o the individual to be assigned and those o some individuals in the various clusters. For diploid organisms (i.e. organisms harbouring Department o Inormatics and Mathematical Modelling, Technical University o Denmar, 2800, Lyngby, Copenhagen, Denmar Centre or Ecological and Evolutionary Synthesis, Department o Biology, University o Oslo, P.O. Box 1066 Blindern, 0316 Oslo, Norway 1

2 two copies o each chromosome), certain lab techniques allow one to retrieve the exact genotype o each individual. In contrast, or some marers it is only possible to say whether a certain allele A (reerred hereater as to dominant allele) is present or not at a locus. In this case, one can not distinguish the heterozygous genotype Aa rom the homozygous genotype AA or the dominant allele. The ormer type o marers are said to be codominant while the latter are said to be dominant. It is clear that the the second genotyping method incurs a loss o inormation. The consequence o this loss o inormation has been studied rom an empirical point o view [10] but it has never been studied on a theoretical basis. The choice to use one type o marers or empirical studies is thereore oten motivated mostly by practical considerations rather than by an objective rationale [11, 12]. The objective o the present article is to compare the accuracy achieved with dominant and codominant marers when they are used to perorm supervised clustering and to derive some recommendations about the number o marers required to achieve a certain accuracy. Dominant marers are essentially bi-allelic in the sense that they record the presence o the absence o a certain allele. We are not concerned here by the relation between inormativeness and the level o polymorphisms (c [13, 14] or reerences on this aspect). We thereore ocus on 28 bi-allelic dominant and co-dominant marers. Hence our study is representative o Ampliied Fragment Length Polymorphism (AFLP) and Single Nucleotide Polymorphism (SNP) marers, which are some o the most employed marers in genetics Inormativeness o dominant and co-dominant marers 2.1 Cluster model We will consider here the case o diploid organisms at L bi-allelic loci. We denote by z = (z l ) l=1,...,l the genotype o an individual. We denote by l the requency o allele A in cluster at locus l We assume that each cluster is at Hardy-Weinberg equilibrium (HWE) at each locus. HWE is deined as the conditions under which the allele carried at a locus on one chromosome 37 is independent o the allele carried at the same locus on the homologous chromosome. This situation is observed at neutral loci when individuals mate at random in a cluster. Denoting by z l the number o copies o allele A carried by an individual, we have: For co-dominant marers, 2

3 40 this can be expressed as p(z l = 2 ) = 2 l (1) p(z l = 1 ) = 2 l (1 l ) (2) p(z l = 0 ) = (1 l ) 2 (3) For dominant marers, z l is equal to 0 or 1 depending on whether a copy o allele A is present in the genotype o the individual. Under HWE we have: p(z l = 1 ) = 2 l + 2 l (1 l ) (4) p(z l = 0 ) = (1 l ) 2 (5) In addition to HWE, we also assume that the various loci are at linage equilibrium (henceorth HWLE), i.e. that the probability o a multilocus genotype is equal to the product o probabilities o single-locus genotypes: p(z 1,..., z L ) = l p(z l). We assume that the individual to be classiied has origin in one o the K clusters (no admixture) Sampling model We will measure the accuracy o a classiying rule or a given type o marers by the probability to assign correctly an individual with unnown origin. We are interested in deriving results that are independent (i) on the particular origin c o the individual to be classiied (ii) on the genotype z o this individual and (iii) on the allele requencies in the various clusters. We will thereore derive results that are conditional on c, z and and then compute Bayesian averages under suitable prior distributions. The mechanism assumed in the sequel is as ollows 1. The individual has origin in one o the K clusters. This origin is unnown and all origins are equally liely. We thereore assume a uniorm prior or c on {1,..., K}. 2. In each cluster, or each locus the allele requencies ollow a Dirichlet(1,1) distribution with independent across clusters and loci. 3. Conditionally on c and, the probability o the genotype o the individual is given by 3

4 59 60 equations (1-3) or (4-5), i.e we assume that the individual has been sampled at random among all individuals in his cluster o origin Accuracy o assignments under a maximum lielihood principle We consider an individual o unnown origin c with nown genotype z with potential origin in K clusters with nown allele requencies. Following a maximum lielihood principle, it is natural to estimate c as the cluster label or which the probability o observing this particular genotype is maximal. Formally: c = Argmax p(z c =, ). This assignment rule is deterministic, but whether the individual is correctly assigned will depend on its genotype and on cluster allele requencies. Randomising these quantities and averaging over all possible values, we can derive a generic ormula or the probability o correct assignment p MLA as p MLA = ϕ ζ max p(c =, z = ζ = ϕ)dp(ϕ) (6) See section A in appendix or details. This ormula is o little practical use and deriving some more explicit expression or arbitrary value o K and L seems to be out o reach. However, or K = 2 and L = 1, under the assumptions that the individual has a priori equally liely ancestry in each cluster and that each has a Dirichlet distribution with parameter (1,..., 1) (lat). we get p MLA c (K = 2, L = 1) = 17/24 or codominant marers (7) 74 and p MLA d (K = 2, L = 1) = 16/24 or dominant marers. (8) 75 Because o the lac o practical useulness o eq. (6), we now deine an alternative rule or assignment that is similar in spirit to maximum lielihood but also leads to more tractable equations. 4

5 Accuracy o assignments under a stochastic rule Considering the collection o lielihood values p(z c =, ) or = 1,..., K, ollowing [15], we deine a stochastic assignment (SA) rule by assigning the individual to a group at random with probabilities proportional to p(z c =, ). In words, an individual with genotype z is randomly assigned to cluster with a probability proportional to the probability to observe this genotype in cluster. The rationale behind this rule is that high values o p(z c =, ) indicate strong evidence o ancestry in group but do not guarantee against miss-assignments. To derive the probability o correct assignment, we irst consider that the allele requencies are nown, and the account or the uncertainty about these requencies by Bayesian in integration. The use o a Bayesian ramewor is motivated by the act that (i) there is genuine uncertainty on allele requencies which can not be overlooed, and (ii) under some airly mild assumptions, allele requencies are nown to be Dirichlet distributed (possibly with a degree o approximation see e.g. [16, 17]). Reer to [18] or urther discussion o the Bayesian paradigm in population genetics. We now give our main results regarding this clustering rule. For bi-allelic loci and denoting by p SA c marers we have: p SA c (K, L) = For bi-allelic loci and denoting by p SA d marers is the probability o correct assignment using codominant (K 1)(5/8) L (9) the probability o correct assignment using dominant p SA d (K, L) = (K 1)(25/33) L (10) 96 3 Implications Our investigations considered bi-allelic loci and are thereore representative o AFLP and SNP marers which are some o the most employed marers in genetics. In this context, or supervised clustering, our main conclusions are that (i) codominant marers are more accurate than dominant marers, (ii) the dierence o accuracy decreases toward 0 as the number o marers L increases, (iii) L d dominant marers can achieve an accuracy even higher than that o L c codominant marers as long as the numbers o loci used satisy L d λl c where λ = ln(5/8)/ ln(25/33)

6 The igures reported have to be taen with a grain o salt as they may depend on some speciic aspects o the models considered. For example, the model considered here assumes independence o allele requencies across clusters. This assumption is relevant in case o populations displaying low migration rates and low amount o shared ancestry. When one o these assumptions is violated, an alternative parametric model based on the Dirichlet distribution that accounts or correlation o allele requencies across population is oten used (see [16] and reerences therein). It is expected that the accuracy obtained with both marers would be lower under this model. Besides, the present study does not account or ascertainment bias [19, 20, 21, 22], an aspect that might aect the results but is notoriously diicult to deal with. However, it is important to note that the conditions considered in the present study were the same or dominant and codominant marers so that results should not be biased toward one type o marer. Our global result about the relative inormativeness o dominant and co-dominant marers contrasts with the common belie that dominant marers are expedient one would resort to when co-dominant marers are not available (see [12] or discussions). A comparison o dominant and codominant marers or unsupervised clustering has been carried out [23]. This study based on simulations suggests that the loss o accuracy incurred by dominant marers in unsupervised clustering is much larger than or supervised clustering. This is presumably explained by the act that in case o HWLE clusters, supervised clustering sees to optimise a criterion based on allele requencies only. This contrasts with unsupervised clustering which sees to optimise a criterion based on allele requencies and HWLE. A similar theoretical analysis o unsupervised clustering algorithm similar to the present study would be valuable but we anticipate that it would present more diiculties. 125 Acnowlegement This wor has been supported by Agence Nationale de la Recherche grant ANR-09-BLAN

7 128 A Supervised clustering with a maximum lielihood principle We consider the setting where the unnown ancestry c o an individual with genotype z is estimated by c = Argmax c p(z c, ). As this estimator is a deterministic unction o z we denote it by c z or clarity in the sequel. Consider or now that the allele requencies are nown to be equal to some ϕ. Under this setting, randomness comes rom the sampling o c and then rom the sampling o z (c, ). We are concerned with the event E deined as E = {the individual is correctly assigned} 129. Applying the total probability ormula, we can write p(e = ϕ) = p(e, c =, z = ζ = ϕ) (11) ζ 130 In the sum over, only one term is not equal to 0, this is the term or = c, hence p(e = ϕ) = ζ = ζ = ζ = ζ p(e, c = c ζ, z = ζ = ϕ) (12) p(c = c ζ, z = ζ = ϕ) (13) p(c = c ζ = ϕ)p(z = ζ c = c ζ, = ϕ) (14) p(c = c ζ )p(z = ζ c = c ζ, = ϕ) (15) Assuming that the individual has a priori equally liely ancestry in each cluster, i.e. assuming a uniorm distribution or the class variable c, we get p(e = ϕ) = K 1 ζ p(z = ζ c = c ζ, = ϕ) (16) 7

8 133 By deinition, c z satisies p(z c z, ) = max p(z c =, ), hence p(e = ϕ) = K 1 ζ max p(z = ζ c =, = ϕ) (17) = ζ = ζ max p(c = )p(z = ζ c =, = ϕ) (18) max p(c =, z = ζ = ϕ) (19) We see an expression o the probability o correct assignment that does not depend on particular values o allele requencies. This can be obtained by integrating over allele requencies, namely p(e) = = ϕ ϕ p(e = ϕ)dp(ϕ) (20) ζ max p(c =, z = ζ = ϕ)dp(ϕ) (21) Note that identity (21) holds or any number o cluster K, any number o loci L and any type o marers (dominant vs. codominant) We now consider a two cluster problem in the case where the genotype o an individual has been recorded at a single bi-allelic locus. We denote by 1 (resp. 2 ) the requency o allele A in cluster 1 (resp. cluster 2) A.1 Codominant marers: There are only three genotypes: AA, Aa and aa. Denoting by the requency o allele A in cluster and conditionally on, these three genotypes occur in cluster with probabilities 2, 2 (1 ) and (1 ) 2, and equation (21) can be simpliied as p(e) = ϕ [ p(c) max ] max (1 ) + max (1 ) 2 dp(ϕ) (22) We need to derive the distribution o max 2 and o max (1 ). Assuming a lat Dirichlet distribution or, elementary computations give: 8

9 149 i.e max 2 p(max 2 ollows a uniorm distribution on [0, 1] so that < x) = x (23) E(max 2 ) = 1/2 (24) 150 Besides, we also get p(max (1 ) < x) = (1 1 4x) 2 (25) 151 and deriving dp dx (max (1 ) < x) = x (26) 1 4x 152 Integrating by part, we get E(max (1 )) = 1/4 0 4x 1 1 4x 1 4x dx = 5/24 (27) 153 Eventually p(e) = 17/24 (28) 154 which proves equation (7) A.2 Dominant marers: For a single locus, there are two genotypes A and a. Conditionally on, these two genotypes are observed in cluster with probabilities 1 2 and 2. Equation (21) can be simpliied here as p(e) = ϕ [ ] p(c) max 2 + max (1 2 ) dp(ϕ) (29) 9

10 159 We now need the density o 1 2 p(max (1 2 ) < x) = (1 1 x) 2 (30) and Eventually we get dp dx (max(1 2 ) < x) = 1 1 (31) 1 x E(max (1 2 )) = 1 0 ( ) 1 x 1 dx = 5/6 (32) 1 x p(e) = 16/24 (33) 162 which proves equation (8). 163 B Stochastic assignment rule The maximum lielihood assignment rule considered above is not tractable or arbitrary values o K and L (c. eq. (21)). In particular, a diiculty arises rom the maximisation involved. We consider here an assignment rule that does not involve maximisation. The unnown ancestry c o an individual with genotype z is predicted by a random variable c with values in {1,..., K} and such that p(c = z, ) p(z c =, ). As in the previous sections, we irst consider that the allele requencies are nown, however we sip this dependence in the notation at the beginning 170 or clarity. We will account or the uncertainty about these requencies later by Bayesian in integration. In this setting, the structure o conditional probability dependence can be represented by a directed acyclic graph as in the on let-hand side o igure 1. We are concerned with evaluating the probability o event E deined as E = {the individual is correctly assigned} 173. i.e. E = {c = c }. We denote by p a (resp. p b ) probabilities under the two conditional dependence 10

11 c c* c c z (a) z z (b) Figure 1: Directed acyclic graph or our stochastic assignment rule (let) and or an alternative scheme (right). All downward arrows represent the same conditional dependence given by our lielihood model. Upward arrow represents the reverse probability dependence structure o igure 1. Some elementary computations show that p(e) can be expressed in terms o a probability in the model o the right-hand-side o the DAG in igure 1, namely: p a (c = c ) = p b (c = c z = z ) (34) 176 The let-hand-side o this expression can be written as p b (c = c z = z ) = p b (c = c, z = z )/p b (z = z ) (35) It is more convenient to manipulate this expression than p b (c = c ). We will to use it to evaluate p a (E) B.1 Codominant marers: We assume that the individual has a priori equally liely ancestry in each cluster. We slightly change the notation denoting by z l the count o allele A at locus l or the individual to be assigned. Then maing the dependence on explicit in the notation, we have p b (c = c, z = z ) = z = z p 2 b (c, z ) (36) [ 2 1 z l K (1 ) 2 z l (2 δz 1 l )] (37) l 183 where δ 1 z l denotes the Kronecer symbol that equals 1 i z l = 1 and 0 otherwise. 11

12 184 Accounting or uncertainty about by integration, we get p b (c = c, z = z ) = = p b (c = c, z = z )d (38) [ 2 1 z l K (1 ) 2 z l (2 δz 1 l )] d (39) z l Among the terms enumerated in the sum over z above, let us consider a generic term z or which the number o loci having exactly h heterozygous genotypes. The term corresponding to such a genotype z in the sum above can be written [ ] 1 h [ L h K 2 22h 2 (1 ) 2 d d] 4 (40) 188 Denoting by C h L the binomial coeicient, there are Ch L2 L h such terms. Equation (39) becomes p b (c = c, z = z ) = h CL2 h L h 1 [ ] h [ L h K 2 22h 2 (1 ) 2 d d] 4 (41) 189 Assuming a lat Dirichlet distribution or the allele requencies, we get p b (c = c, z = z ) = 1 K ( ) 8 L (42) We now need to evaluate p b (z = z ), but since p b (z = z ) = z p 2 b (z ) = z ( 2 p b (, z )), (43) p b (z = z ) = = p 2 b (z ) = ( 2 p b (, z )) z (44) p b (, z ) 2 + p b (, z )p b (, z ) z (45) z z z = p b (c = c, z = z ) + 12

13 CL2 h L h ( [ 1 2 K 2 22h h = 1 K ] 2h [ (1 )d d] ) 4(L h) (46) ( ) 8 L + K K 3 L (47) 191 Eventually, p(e) = (K 1) ( ) 5 L (48) which proves equation (9). 193 B.2 Dominant marers: 194 We still have p b (c = c, z = z ) = [ 2 1 z l K (1 ) 2 z l (2 δz 1 l )] d (49) z l For a generic genotype z in the sum above, let us denote by r the number o loci carrying exactly one copy o the recessive allele, then p b (c = c, z = z ) = [ ] Cr l 1 r [ L r K 2 4 d (1 d] 2 )2 (50) r = ( ) Cr l 1 1 L ( ) 8 L r K 2 (51) 5 3 r = 1 ( ) 11 L (52) K Moreover, by arguments similar to those used or codominant marers, we get p b (z = z ) = 1 K ( ) 11 L + K 1 15 K ( ) 5 L (53) 9 13

14 198 And we get p b (z = z ) = 1 K ( ) 11 L + K 1 15 K ( ) 5 L (54) Eventually, p(e) = (K 1) ( ) 25 L (55) which proves equation (10). 201 Reerences [1] B. Rannala and J. Mountain, Detecting immigration by using multilocus genotypes, Pro- ceedings o the National Academy o Sciences USA, vol. 94, pp , [2] J. Cornuet, S. Piry, G. Luiart, A. Estoup, and M. Solignac, New methods employing multilocus genotypes to select or exclude populations as origins o individuals, Genetics, vol. 153, pp , [3] D. Paetau, R. Slade, M. Burdens, and A. Estoup, Genetic assignment methods or the direct, real-time estimation o migration rate: a simulation-based exploration o accuracy and power, Molecular Ecology, vol. 15, pp , [4] A. Piry, S. Alapetite, J. Cornuet, D. Paetau, L. Baudoin, and A. Estoup, Geneclass2: A sotware or genetic assignment and irst-generation migrant detection, Journal o Heredity, vol. 95, no. 6, pp , [5] P. Gladieux, X. Zhang, D. Aoua-Bastien, R. V. Sanhueza, M. Sbaghi,, and B. L. Cam, On the origin and spread o the scab disease o apple: Out o central Asia, PLoS One, vol. 3, no. 1, p. e1455,

15 [6] A. P. de Rosas, E. Segura, L. Fichera, and B. García, Macrogeographic and microgeographic genetic structure o the chagas disease vector triatoma inestans (hemiptera: reduviidae) rom Catamarca, Argentina, Genetica, vol. 133, no. 3, pp , [7] A. Bataille, A. A. Cunningham, V. Cedeño, L. Patiño, A. Constantinou, L. Kramer, and S. J. Goodman, Natural colonization and adaptation o a mosquito species in Galápagos and its implications or disease threats to endemic wildlie, Proceedings o the National Academy o Sciences, vol. 106, no. 25, pp , [8] S. Manel, P. Berthier, and G. Luiart, Detecting wildle poaching: identiying the origin o individuals with Bayesian assignment test and multilocus genotypes, Conservation biology, vol. 13, no. 3, pp , [9] S. Manel, O. Gaggiotti, and R. Waples, Assignment methods: matching biological questions with appropriate techniques, Trends in Ecology and Evolution, vol. 20, pp , [10] D. Campbell, P. Duchesne, and L. Bernatchez, AFLP utility or population assignment studies: analytical investigation and empirical comparison with microsatellites, Molecular Ecology, vol. 12, pp , [11] C. Schlötterer, The evolution o molecular marers - just a matter o ashion? Nature Review Genetics, vol. 5, pp , [12] A. Bonin, D. Ehrich, and S. Manel, Statistical analysis o ampliied ragment length poly- morphism data: a toolbox or molecular ecologists and evolutionists, Molecular Ecology, vol. 16, no. 18, pp , [13] N. Rosenberg, T. Bure, K. Elo, M. Feldman, P. Friedlin, M. Groenen, J. Hillel, A. Mäi- Tanila, M. Tixier-Boichard, A. Vignal, K. Wimmers, and S. Weigend, Empirical evaluation o genetic clustering methods using multilocus genotypes rom 20 chicen breeds, Genetics, vol. 159, pp , [14] S. T. Kalinowsi, Do polymorphic loci require larger sample sizes to estimate genetic dis- tances? Heredity, vol. 94, pp ,

16 [15] N. Rosenberg, L. Li, R. Ward, and J. K. Pritchard, Inormativeness o genetic marers or inerence o ancestry, American Journal o Human Genetics, vol. 73, pp , [16] G. Guillot, Inerence o structure in subdivided populations at low levels o genetic di- erentiation.the correlated allele requencies model revisited, Bionormatics, vol. 24, pp , [17] O. Gaggiotti and M. Foll, Quantiying population structure using the -model, Molecular Ecology Resources, vol. 10, no. 5, p , [18] M. A. Beaumont and B. Rannala, The Bayesian revolution in genetics, Nature Review Genetics, vol. 5, pp , [19] R. Nielsen and J. Signorovitch, Correcting or ascertainment biases when analyzing SNP data: applications to the estimation o linage quilibrium, Theoretical Population Biology, vol. 63, pp , [20] R. Nielsen, M. Hubisz, and A. Clar, Reconstituting the requency spectrum o ascertained single-nucleotide polymorphism data, Genetics, vol. 168, pp , [21] M. Foll, M. Beaumont, and O. Gaggiotti, An approximate Bayesian computation approach to overcome biases that arise when using AFLP marers to study population structure, Genetics, vol. 179, pp , [22] G. Guillot and M. Foll, Accounting or the ascertainment bias in Marov chain Monte Carlo inerences o population structure, Bioinormatics, vol. 25, no. 4, pp , [23] G. Guillot and F. Santos, Using AFLP marers and the Geneland program or the inerence o population genetic structure, Molecular Ecology Resources, 2010, to appear. 16

Microsatellites as genetic tools for monitoring escapes and introgression

Microsatellites as genetic tools for monitoring escapes and introgression Microsatellites as genetic tools for monitoring escapes and introgression Alexander TRIANTAFYLLIDIS & Paulo A. PRODÖHL What are microsatellites? Microsatellites (SSR Simple Sequence Repeats) The repeat

More information

Estimating the location and shape of hybrid zones.

Estimating the location and shape of hybrid zones. Estimating the location and shape of hybrid zones. Benjamin Guedj and Gilles Guillot May 24, 2011 Abstract We propose a new model to make use of geo-referenced genetic data for inferring the location and

More information

Visualizing Population Genetics

Visualizing Population Genetics Visualizing Population Genetics Supervisors: Rachel Fewster, James Russell, Paul Murrell Louise McMillan 1 December 2015 Louise McMillan Visualizing Population Genetics 1 December 2015 1 / 29 Outline 1

More information

Roberto s Notes on Differential Calculus Chapter 8: Graphical analysis Section 1. Extreme points

Roberto s Notes on Differential Calculus Chapter 8: Graphical analysis Section 1. Extreme points Roberto s Notes on Dierential Calculus Chapter 8: Graphical analysis Section 1 Extreme points What you need to know already: How to solve basic algebraic and trigonometric equations. All basic techniques

More information

Bayesian analysis of the Hardy-Weinberg equilibrium model

Bayesian analysis of the Hardy-Weinberg equilibrium model Bayesian analysis of the Hardy-Weinberg equilibrium model Eduardo Gutiérrez Peña Department of Probability and Statistics IIMAS, UNAM 6 April, 2010 Outline Statistical Inference 1 Statistical Inference

More information

The Wright-Fisher Model and Genetic Drift

The Wright-Fisher Model and Genetic Drift The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population

More information

BIOINFORMATICS. Gilles Guillot

BIOINFORMATICS. Gilles Guillot BIOINFORMATICS Vol. 00 no. 00 2008 Pages 1 19 Supplementary material for: Inference of structure in subdivided populations at low levels of genetic differentiation. The correlated allele frequencies model

More information

Microsatellite data analysis. Tomáš Fér & Filip Kolář

Microsatellite data analysis. Tomáš Fér & Filip Kolář Microsatellite data analysis Tomáš Fér & Filip Kolář Multilocus data dominant heterozygotes and homozygotes cannot be distinguished binary biallelic data (fragments) presence (dominant allele/heterozygote)

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

Example: When describing where a function is increasing, decreasing or constant we use the x- axis values.

Example: When describing where a function is increasing, decreasing or constant we use the x- axis values. Business Calculus Lecture Notes (also Calculus With Applications or Business Math II) Chapter 3 Applications o Derivatives 31 Increasing and Decreasing Functions Inormal Deinition: A unction is increasing

More information

Problems for 3505 (2011)

Problems for 3505 (2011) Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.

More information

On a Closed Formula for the Derivatives of e f(x) and Related Financial Applications

On a Closed Formula for the Derivatives of e f(x) and Related Financial Applications International Mathematical Forum, 4, 9, no. 9, 41-47 On a Closed Formula or the Derivatives o e x) and Related Financial Applications Konstantinos Draais 1 UCD CASL, University College Dublin, Ireland

More information

Extreme Values of Functions

Extreme Values of Functions Extreme Values o Functions When we are using mathematics to model the physical world in which we live, we oten express observed physical quantities in terms o variables. Then, unctions are used to describe

More information

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8 The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as

More information

Population Structure

Population Structure Ch 4: Population Subdivision Population Structure v most natural populations exist across a landscape (or seascape) that is more or less divided into areas of suitable habitat v to the extent that populations

More information

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd.

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd. .6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics Spline interpolation was originally developed or image processing. In GIS, it is mainly used in visualization o spatial

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics 1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

More information

Ex x xf xdx. Ex+ a = x+ a f x dx= xf x dx+ a f xdx= xˆ. E H x H x H x f x dx ˆ ( ) ( ) ( ) μ is actually the first moment of the random ( )

Ex x xf xdx. Ex+ a = x+ a f x dx= xf x dx+ a f xdx= xˆ. E H x H x H x f x dx ˆ ( ) ( ) ( ) μ is actually the first moment of the random ( ) Fall 03 Analysis o Eperimental Measurements B Eisenstein/rev S Errede The Epectation Value o a Random Variable: The epectation value E[ ] o a random variable is the mean value o, ie ˆ (aa μ ) For discrete

More information

The achievable limits of operational modal analysis. * Siu-Kui Au 1)

The achievable limits of operational modal analysis. * Siu-Kui Au 1) The achievable limits o operational modal analysis * Siu-Kui Au 1) 1) Center or Engineering Dynamics and Institute or Risk and Uncertainty, University o Liverpool, Liverpool L69 3GH, United Kingdom 1)

More information

Microevolution Changing Allele Frequencies

Microevolution Changing Allele Frequencies Microevolution Changing Allele Frequencies Evolution Evolution is defined as a change in the inherited characteristics of biological populations over successive generations. Microevolution involves the

More information

Objectives. By the time the student is finished with this section of the workbook, he/she should be able

Objectives. By the time the student is finished with this section of the workbook, he/she should be able FUNCTIONS Quadratic Functions......8 Absolute Value Functions.....48 Translations o Functions..57 Radical Functions...61 Eponential Functions...7 Logarithmic Functions......8 Cubic Functions......91 Piece-Wise

More information

CINQA Workshop Probability Math 105 Silvia Heubach Department of Mathematics, CSULA Thursday, September 6, 2012

CINQA Workshop Probability Math 105 Silvia Heubach Department of Mathematics, CSULA Thursday, September 6, 2012 CINQA Workshop Probability Math 105 Silvia Heubach Department of Mathematics, CSULA Thursday, September 6, 2012 Silvia Heubach/CINQA 2012 Workshop Objectives To familiarize biology faculty with one of

More information

Evaluating the performance of a multilocus Bayesian method for the estimation of migration rates

Evaluating the performance of a multilocus Bayesian method for the estimation of migration rates University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Publications, Agencies and Staff of the U.S. Department of Commerce U.S. Department of Commerce 2007 Evaluating the performance

More information

8.4 Inverse Functions

8.4 Inverse Functions Section 8. Inverse Functions 803 8. Inverse Functions As we saw in the last section, in order to solve application problems involving eponential unctions, we will need to be able to solve eponential equations

More information

On the Efficiency of Maximum-Likelihood Estimators of Misspecified Models

On the Efficiency of Maximum-Likelihood Estimators of Misspecified Models 217 25th European Signal Processing Conerence EUSIPCO On the Eiciency o Maximum-ikelihood Estimators o Misspeciied Models Mahamadou amine Diong Eric Chaumette and François Vincent University o Toulouse

More information

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular

More information

10. Joint Moments and Joint Characteristic Functions

10. Joint Moments and Joint Characteristic Functions 10. Joint Moments and Joint Characteristic Functions Following section 6, in this section we shall introduce various parameters to compactly represent the inormation contained in the joint p.d. o two r.vs.

More information

Picture from "Mendel's experiments: Figure 3," by Robert Bear et al

Picture from Mendel's experiments: Figure 3, by Robert Bear et al Plant Genetics Learning Objectives: Understand the basics o genetics Understand how genetics is used in Plant Science today Learn the basics o extracting DNA Learn how to make crosses between plants and

More information

Scattered Data Approximation of Noisy Data via Iterated Moving Least Squares

Scattered Data Approximation of Noisy Data via Iterated Moving Least Squares Scattered Data Approximation o Noisy Data via Iterated Moving Least Squares Gregory E. Fasshauer and Jack G. Zhang Abstract. In this paper we ocus on two methods or multivariate approximation problems

More information

The effect of strong purifying selection on genetic diversity

The effect of strong purifying selection on genetic diversity Genetics: Early Online, published on May 9, 08 as 0.534/genetics.8.30058 The eect o strong puriying selection on genetic diversity Ivana Cvijović,4, Benjamin H. Good,4, and Michael M. Desai,3,4 Department

More information

Breeding Values and Inbreeding. Breeding Values and Inbreeding

Breeding Values and Inbreeding. Breeding Values and Inbreeding Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A

More information

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift: 9. Genetic Drift Genetic drift is the alteration of gene frequencies due to sampling variation from one generation to the next. It operates to some degree in all finite populations, but can be significant

More information

Reliability assessment using probabilistic support vector machines. Anirban Basudhar and Samy Missoum*

Reliability assessment using probabilistic support vector machines. Anirban Basudhar and Samy Missoum* 156 Int. J. Reliability and Saety, Vol. 7, o. 2, 2013 Reliability assessment using probabilistic support vector machines Anirban Basudhar and Samy Missoum* Aerospace and Mechanical Engineering Department,

More information

NOTES CH 17 Evolution of. Populations

NOTES CH 17 Evolution of. Populations NOTES CH 17 Evolution of Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Populations 17.1 Genes & Variation Darwin

More information

GETTING READY TO LEARN Preview Key Concepts 6.1 Chromosomes and Meiosis Gametes have half the number of chromosomes that body cells have.

GETTING READY TO LEARN Preview Key Concepts 6.1 Chromosomes and Meiosis Gametes have half the number of chromosomes that body cells have. CHAPTER 6 Meiosis and Mendel GETTING READY TO LEARN Preview Key Concepts 6.1 Chromosomes and Meiosis Gametes have hal the number o chromosomes that body cells have. 6.2 Process o Meiosis During meiosis,

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) 12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²

More information

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES: .5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the

More information

Big Idea #1: The process of evolution drives the diversity and unity of life

Big Idea #1: The process of evolution drives the diversity and unity of life BIG IDEA! Big Idea #1: The process of evolution drives the diversity and unity of life Key Terms for this section: emigration phenotype adaptation evolution phylogenetic tree adaptive radiation fertility

More information

Estimation of Sample Reactivity Worth with Differential Operator Sampling Method

Estimation of Sample Reactivity Worth with Differential Operator Sampling Method Progress in NUCLEAR SCIENCE and TECHNOLOGY, Vol. 2, pp.842-850 (2011) ARTICLE Estimation o Sample Reactivity Worth with Dierential Operator Sampling Method Yasunobu NAGAYA and Takamasa MORI Japan Atomic

More information

Lecture 13: Population Structure. October 8, 2012

Lecture 13: Population Structure. October 8, 2012 Lecture 13: Population Structure October 8, 2012 Last Time Effective population size calculations Historical importance of drift: shifting balance or noise? Population structure Today Course feedback The

More information

Mathematical models in population genetics II

Mathematical models in population genetics II Mathematical models in population genetics II Anand Bhaskar Evolutionary Biology and Theory of Computing Bootcamp January 1, 014 Quick recap Large discrete-time randomly mating Wright-Fisher population

More information

GF(4) Based Synthesis of Quaternary Reversible/Quantum Logic Circuits

GF(4) Based Synthesis of Quaternary Reversible/Quantum Logic Circuits GF(4) ased Synthesis o Quaternary Reversible/Quantum Logic Circuits MOZAMMEL H. A. KHAN AND MAREK A. PERKOWSKI Department o Computer Science and Engineering, East West University 4 Mohakhali, Dhaka, ANGLADESH,

More information

Symbolic-Numeric Methods for Improving Structural Analysis of DAEs

Symbolic-Numeric Methods for Improving Structural Analysis of DAEs Symbolic-Numeric Methods or Improving Structural Analysis o DAEs Guangning Tan, Nedialko S. Nedialkov, and John D. Pryce Abstract Systems o dierential-algebraic equations (DAEs) are generated routinely

More information

The concept of limit

The concept of limit Roberto s Notes on Dierential Calculus Chapter 1: Limits and continuity Section 1 The concept o limit What you need to know already: All basic concepts about unctions. What you can learn here: What limits

More information

VALUATIVE CRITERIA FOR SEPARATED AND PROPER MORPHISMS

VALUATIVE CRITERIA FOR SEPARATED AND PROPER MORPHISMS VALUATIVE CRITERIA FOR SEPARATED AND PROPER MORPHISMS BRIAN OSSERMAN Recall that or prevarieties, we had criteria or being a variety or or being complete in terms o existence and uniqueness o limits, where

More information

Case-Control Association Testing. Case-Control Association Testing

Case-Control Association Testing. Case-Control Association Testing Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies

More information

Mechanisms of Evolution

Mechanisms of Evolution Mechanisms of Evolution 36-149 The Tree of Life Christopher R. Genovese Department of Statistics 132H Baker Hall x8-7836 http://www.stat.cmu.edu/ ~ genovese/. Plan 1. Two More Generations 2. The Hardy-Weinberg

More information

RATIONAL FUNCTIONS. Finding Asymptotes..347 The Domain Finding Intercepts Graphing Rational Functions

RATIONAL FUNCTIONS. Finding Asymptotes..347 The Domain Finding Intercepts Graphing Rational Functions RATIONAL FUNCTIONS Finding Asymptotes..347 The Domain....350 Finding Intercepts.....35 Graphing Rational Functions... 35 345 Objectives The ollowing is a list o objectives or this section o the workbook.

More information

Categories and Natural Transformations

Categories and Natural Transformations Categories and Natural Transormations Ethan Jerzak 17 August 2007 1 Introduction The motivation or studying Category Theory is to ormalise the underlying similarities between a broad range o mathematical

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 007 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

Neutral Theory of Molecular Evolution

Neutral Theory of Molecular Evolution Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation

More information

Chapter 6 Reliability-based design and code developments

Chapter 6 Reliability-based design and code developments Chapter 6 Reliability-based design and code developments 6. General Reliability technology has become a powerul tool or the design engineer and is widely employed in practice. Structural reliability analysis

More information

Finite Dimensional Hilbert Spaces are Complete for Dagger Compact Closed Categories (Extended Abstract)

Finite Dimensional Hilbert Spaces are Complete for Dagger Compact Closed Categories (Extended Abstract) Electronic Notes in Theoretical Computer Science 270 (1) (2011) 113 119 www.elsevier.com/locate/entcs Finite Dimensional Hilbert Spaces are Complete or Dagger Compact Closed Categories (Extended bstract)

More information

Heredity and Genetics WKSH

Heredity and Genetics WKSH Chapter 6, Section 3 Heredity and Genetics WKSH KEY CONCEPT Mendel s research showed that traits are inherited as discrete units. Vocabulary trait purebred law of segregation genetics cross MAIN IDEA:

More information

Basic mathematics of economic models. 3. Maximization

Basic mathematics of economic models. 3. Maximization John Riley 1 January 16 Basic mathematics o economic models 3 Maimization 31 Single variable maimization 1 3 Multi variable maimization 6 33 Concave unctions 9 34 Maimization with non-negativity constraints

More information

On High-Rate Cryptographic Compression Functions

On High-Rate Cryptographic Compression Functions On High-Rate Cryptographic Compression Functions Richard Ostertág and Martin Stanek Department o Computer Science Faculty o Mathematics, Physics and Inormatics Comenius University Mlynská dolina, 842 48

More information

Microevolution 2 mutation & migration

Microevolution 2 mutation & migration Microevolution 2 mutation & migration Assumptions of Hardy-Weinberg equilibrium 1. Mating is random 2. Population size is infinite (i.e., no genetic drift) 3. No migration 4. No mutation 5. No selection

More information

General Bayes Filtering of Quantized Measurements

General Bayes Filtering of Quantized Measurements 4th International Conerence on Inormation Fusion Chicago, Illinois, UA, July 5-8, 20 General Bayes Filtering o Quantized Measurements Ronald Mahler Uniied Data Fusion ciences, Inc. Eagan, MN, 5522, UA.

More information

Population Genetics: a tutorial

Population Genetics: a tutorial : a tutorial Institute for Science and Technology Austria ThRaSh 2014 provides the basic mathematical foundation of evolutionary theory allows a better understanding of experiments allows the development

More information

EM algorithm and applications Lecture #9

EM algorithm and applications Lecture #9 EM algorithm and applications Lecture #9 Bacground Readings: Chapters 11.2, 11.6 in the text boo, Biological Sequence Analysis, Durbin et al., 2001.. The EM algorithm This lecture plan: 1. Presentation

More information

A Brief Survey on Semi-supervised Learning with Graph Regularization

A Brief Survey on Semi-supervised Learning with Graph Regularization 000 00 002 003 004 005 006 007 008 009 00 0 02 03 04 05 06 07 08 09 020 02 022 023 024 025 026 027 028 029 030 03 032 033 034 035 036 037 038 039 040 04 042 043 044 045 046 047 048 049 050 05 052 053 A

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

AH 2700A. Attenuator Pair Ratio for C vs Frequency. Option-E 50 Hz-20 khz Ultra-precision Capacitance/Loss Bridge

AH 2700A. Attenuator Pair Ratio for C vs Frequency. Option-E 50 Hz-20 khz Ultra-precision Capacitance/Loss Bridge 0 E ttenuator Pair Ratio or vs requency NEEN-ERLN 700 Option-E 0-0 k Ultra-precision apacitance/loss ridge ttenuator Ratio Pair Uncertainty o in ppm or ll Usable Pairs o Taps 0 0 0. 0. 0. 07/08/0 E E E

More information

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50 LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if

More information

What the No Free Lunch Theorems Really Mean; How to Improve Search Algorithms

What the No Free Lunch Theorems Really Mean; How to Improve Search Algorithms What the No Free Lunch Theorems Really Mean; How to Improve Search Algorithms David H. Wolpert SFI WORKING PAPER: 2012-10-017 SFI Working Papers contain accounts o scientiic work o the author(s) and do

More information

A Rigorous Analysis of Population Stratification with Limited Data.

A Rigorous Analysis of Population Stratification with Limited Data. A Rigorous Analysis o Population Stratiication with Limited Data. Kamalika Chaudhuri, Eran Halperin, Satish Rao, Shuheng Zhou ovember, 006 Abstract Finding the genetic actors o complex diseases such as

More information

Q Expected Coverage Achievement Merit Excellence. Punnett square completed with correct gametes and F2.

Q Expected Coverage Achievement Merit Excellence. Punnett square completed with correct gametes and F2. NCEA Level 2 Biology (91157) 2018 page 1 of 6 Assessment Schedule 2018 Biology: Demonstrate understanding of genetic variation and change (91157) Evidence Q Expected Coverage Achievement Merit Excellence

More information

Special types of Riemann sums

Special types of Riemann sums Roberto s Notes on Subject Chapter 4: Deinite integrals and the FTC Section 3 Special types o Riemann sums What you need to know already: What a Riemann sum is. What you can learn here: The key types o

More information

Segregation versus mitotic recombination APPENDIX

Segregation versus mitotic recombination APPENDIX APPENDIX Waiting time until the first successful mutation The first time lag, T 1, is the waiting time until the first successful mutant appears, creating an Aa individual within a population composed

More information

Bayesian Technique for Reducing Uncertainty in Fatigue Failure Model

Bayesian Technique for Reducing Uncertainty in Fatigue Failure Model 9IDM- Bayesian Technique or Reducing Uncertainty in Fatigue Failure Model Sriram Pattabhiraman and Nam H. Kim University o Florida, Gainesville, FL, 36 Copyright 8 SAE International ABSTRACT In this paper,

More information

Classification of effective GKM graphs with combinatorial type K 4

Classification of effective GKM graphs with combinatorial type K 4 Classiication o eective GKM graphs with combinatorial type K 4 Shintarô Kuroki Department o Applied Mathematics, Faculty o Science, Okayama Uniervsity o Science, 1-1 Ridai-cho Kita-ku, Okayama 700-0005,

More information

Review of Prerequisite Skills for Unit # 2 (Derivatives) U2L2: Sec.2.1 The Derivative Function

Review of Prerequisite Skills for Unit # 2 (Derivatives) U2L2: Sec.2.1 The Derivative Function UL1: Review o Prerequisite Skills or Unit # (Derivatives) Working with the properties o exponents Simpliying radical expressions Finding the slopes o parallel and perpendicular lines Simpliying rational

More information

ROBUST STABILITY AND PERFORMANCE ANALYSIS OF UNSTABLE PROCESS WITH DEAD TIME USING Mu SYNTHESIS

ROBUST STABILITY AND PERFORMANCE ANALYSIS OF UNSTABLE PROCESS WITH DEAD TIME USING Mu SYNTHESIS ROBUST STABILITY AND PERFORMANCE ANALYSIS OF UNSTABLE PROCESS WITH DEAD TIME USING Mu SYNTHESIS I. Thirunavukkarasu 1, V. I. George 1, G. Saravana Kumar 1 and A. Ramakalyan 2 1 Department o Instrumentation

More information

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM Life Cycles, Meiosis and Genetic Variability iclicker: 1. A chromosome just before mitosis contains two double stranded DNA molecules. 2. This replicated chromosome contains DNA from only one of your parents

More information

Scattering of Solitons of Modified KdV Equation with Self-consistent Sources

Scattering of Solitons of Modified KdV Equation with Self-consistent Sources Commun. Theor. Phys. Beijing, China 49 8 pp. 89 84 c Chinese Physical Society Vol. 49, No. 4, April 5, 8 Scattering o Solitons o Modiied KdV Equation with Sel-consistent Sources ZHANG Da-Jun and WU Hua

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Throughput-Lifetime Tradeoffs in Multihop Wireless Networks under an SINR-based Interference Model

Throughput-Lifetime Tradeoffs in Multihop Wireless Networks under an SINR-based Interference Model 1 Throughput-Lietime Tradeos in Multihop Wireless Networks under an SINR-based Intererence Model Jun Luo, Aravind Iyer and Catherine Rosenberg Abstract High throughput and lietime are both crucial design

More information

Numerical Solution of Ordinary Differential Equations in Fluctuationlessness Theorem Perspective

Numerical Solution of Ordinary Differential Equations in Fluctuationlessness Theorem Perspective Numerical Solution o Ordinary Dierential Equations in Fluctuationlessness Theorem Perspective NEJLA ALTAY Bahçeşehir University Faculty o Arts and Sciences Beşiktaş, İstanbul TÜRKİYE TURKEY METİN DEMİRALP

More information

A Systematic Approach to Frequency Compensation of the Voltage Loop in Boost PFC Pre- regulators.

A Systematic Approach to Frequency Compensation of the Voltage Loop in Boost PFC Pre- regulators. A Systematic Approach to Frequency Compensation o the Voltage Loop in oost PFC Pre- regulators. Claudio Adragna, STMicroelectronics, Italy Abstract Venable s -actor method is a systematic procedure that

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

The neutral theory of molecular evolution

The neutral theory of molecular evolution The neutral theory of molecular evolution Introduction I didn t make a big deal of it in what we just went over, but in deriving the Jukes-Cantor equation I used the phrase substitution rate instead of

More information

arxiv: v2 [math.co] 29 Mar 2017

arxiv: v2 [math.co] 29 Mar 2017 COMBINATORIAL IDENTITIES FOR GENERALIZED STIRLING NUMBERS EXPANDING -FACTORIAL FUNCTIONS AND THE -HARMONIC NUMBERS MAXIE D. SCHMIDT arxiv:6.04708v2 [math.co 29 Mar 207 SCHOOL OF MATHEMATICS GEORGIA INSTITUTE

More information

Fluctuationlessness Theorem and its Application to Boundary Value Problems of ODEs

Fluctuationlessness Theorem and its Application to Boundary Value Problems of ODEs Fluctuationlessness Theorem and its Application to Boundary Value Problems o ODEs NEJLA ALTAY İstanbul Technical University Inormatics Institute Maslak, 34469, İstanbul TÜRKİYE TURKEY) nejla@be.itu.edu.tr

More information

Solving the Vehicle Routing Problem with Stochastic Demands via Hybrid Genetic Algorithm-Tabu Search

Solving the Vehicle Routing Problem with Stochastic Demands via Hybrid Genetic Algorithm-Tabu Search Journal o Mathematics and Statistics: 4(3): 161-167, 2008 ISSN: 1549-3644 2008 Science Publications Solving the Vehicle Routing Problem with Stochastic Demands via Hybrid Genetic Algorithm-Tabu Search

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 009 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

MODELLING PUBLIC TRANSPORT CORRIDORS WITH AGGREGATE AND DISAGGREGATE DEMAND

MODELLING PUBLIC TRANSPORT CORRIDORS WITH AGGREGATE AND DISAGGREGATE DEMAND MODELLIG PUBLIC TRASPORT CORRIDORS WITH AGGREGATE AD DISAGGREGATE DEMAD Sergio Jara-Díaz Alejandro Tirachini and Cristián Cortés Universidad de Chile ITRODUCTIO Traditional microeconomic modeling o public

More information

APPENDIX 1 ERROR ESTIMATION

APPENDIX 1 ERROR ESTIMATION 1 APPENDIX 1 ERROR ESTIMATION Measurements are always subject to some uncertainties no matter how modern and expensive equipment is used or how careully the measurements are perormed These uncertainties

More information

CHAPTER 1: INTRODUCTION. 1.1 Inverse Theory: What It Is and What It Does

CHAPTER 1: INTRODUCTION. 1.1 Inverse Theory: What It Is and What It Does Geosciences 567: CHAPTER (RR/GZ) CHAPTER : INTRODUCTION Inverse Theory: What It Is and What It Does Inverse theory, at least as I choose to deine it, is the ine art o estimating model parameters rom data

More information

Learning ancestral genetic processes using nonparametric Bayesian models

Learning ancestral genetic processes using nonparametric Bayesian models Learning ancestral genetic processes using nonparametric Bayesian models Kyung-Ah Sohn October 31, 2011 Committee Members: Eric P. Xing, Chair Zoubin Ghahramani Russell Schwartz Kathryn Roeder Matthew

More information

Supplementary Figures.

Supplementary Figures. Supplementary Figures. Supplementary Figure 1 The extended compartment model. Sub-compartment C (blue) and 1-C (yellow) represent the fractions of allele carriers and non-carriers in the focal patch, respectively,

More information

Section 3.4: Concavity and the second Derivative Test. Find any points of inflection of the graph of a function.

Section 3.4: Concavity and the second Derivative Test. Find any points of inflection of the graph of a function. Unit 3: Applications o Dierentiation Section 3.4: Concavity and the second Derivative Test Determine intervals on which a unction is concave upward or concave downward. Find any points o inlection o the

More information

Introduction to Natural Selection. Ryan Hernandez Tim O Connor

Introduction to Natural Selection. Ryan Hernandez Tim O Connor Introduction to Natural Selection Ryan Hernandez Tim O Connor 1 Goals Learn about the population genetics of natural selection How to write a simple simulation with natural selection 2 Basic Biology genome

More information

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have. Section 1: Chromosomes and Meiosis KEY CONCEPT Gametes have half the number of chromosomes that body cells have. VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous

More information

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation? October 12, 2009 Bioe 109 Fall 2009 Lecture 8 Microevolution 1 - selection The Hardy-Weinberg-Castle Equilibrium - consider a single locus with two alleles A 1 and A 2. - three genotypes are thus possible:

More information

A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes

A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes Genetics: Published Articles Ahead of Print, published on July 30, 2012 as 10.1534/genetics.112.139519 A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate

More information

OPTIMALITY AND STABILITY OF SYMMETRIC EVOLUTIONARY GAMES WITH APPLICATIONS IN GENETIC SELECTION. (Communicated by Yang Kuang)

OPTIMALITY AND STABILITY OF SYMMETRIC EVOLUTIONARY GAMES WITH APPLICATIONS IN GENETIC SELECTION. (Communicated by Yang Kuang) MATHEMATICAL BIOSCIENCES doi:10.3934/mbe.2015.12.503 AND ENGINEERING Volume 12, Number 3, June 2015 pp. 503 523 OPTIMALITY AND STABILITY OF SYMMETRIC EVOLUTIONARY GAMES WITH APPLICATIONS IN GENETIC SELECTION

More information

1. Definition: Order Statistics of a sample.

1. Definition: Order Statistics of a sample. AMS570 Order Statistics 1. Deinition: Order Statistics o a sample. Let X1, X2,, be a random sample rom a population with p.d.. (x). Then, 2. p.d.. s or W.L.O.G.(W thout Loss o Ge er l ty), let s ssu e

More information

CHAPTER 5 Reactor Dynamics. Table of Contents

CHAPTER 5 Reactor Dynamics. Table of Contents 1 CHAPTER 5 Reactor Dynamics prepared by Eleodor Nichita, UOIT and Benjamin Rouben, 1 & 1 Consulting, Adjunct Proessor, McMaster & UOIT Summary: This chapter addresses the time-dependent behaviour o nuclear

More information