Stationary Distribution of the Linkage Disequilibrium Coefficient r 2

Size: px
Start display at page:

Download "Stationary Distribution of the Linkage Disequilibrium Coefficient r 2"

Transcription

1 Stationary Distribution of the Linkage Disequilibrium Coefficient r 2 Wei Zhang, Jing Liu, Rachel Fewster and Jesse Goodman Department of Statistics, The University of Auckland December 1, 2015

2 Overview 1 Introduction 2 Background 3 Methodology 4 Results 5 Acknowledgements 6 References

3 Introduction 1 Linkage disequilibrium (LD) indicates the statistical dependence of alleles at different loci in population genetics. Introduction December 1, / 35

4 Introduction 1 Linkage disequilibrium (LD) indicates the statistical dependence of alleles at different loci in population genetics. LD can be used to find genetic markers that might be linked to genes for diseases and understand the evolutionary history. Introduction December 1, / 35

5 Introduction 1 Linkage disequilibrium (LD) indicates the statistical dependence of alleles at different loci in population genetics. 2 r 2 is a quantitative measure of LD and we are aiming to find its stationary distribution under models for genetic drift. Introduction December 1, / 35

6 Introduction 1 Linkage disequilibrium (LD) indicates the statistical dependence of alleles at different loci in population genetics. 2 r 2 is a quantitative measure of LD and we are aiming to find its stationary distribution under models for genetic drift. 3 Given some moments of the unknown distribution, the maximum entropy (Maxent) principle can be used to approximate the density function of r 2. Introduction December 1, / 35

7 Introduction 1 Linkage disequilibrium (LD) indicates the statistical dependence of alleles at different loci in population genetics. 2 r 2 is a quantitative measure of LD and we are aiming to find its stationary distribution under models for genetic drift. 3 Given some moments of the unknown distribution, the maximum entropy (Maxent) principle can be used to approximate the density function of r 2. 4 The diffusion approximation is a powerful tool to compute certain expectations at stationarity. Introduction December 1, / 35

8 Background The TLD model (Liu, 2012) Diffusion approximation Maximum entropy principle Background December 1, / 35

9 Some terminologies in genetics 1 One locus is a position of a gene or significant DNA sequence on a chromosome. 2 An allele is a variant form of a gene. 3 Diploid describes a cell or an organism that has paired chromosomes, one from each parent. 4 A mutation is a permanent change in the DNA sequence. 5 Recombination is the production of offspring with combinations of traits that differ from those found in either parent. Background The TLD model December 1, / 35

10 Some terminologies in genetics Background The TLD model December 1, / 35

11 Some terminologies in genetics Background The TLD model December 1, / 35

12 The TLD model TLD is short for two-locus diallelic model with mutation and recombination. Background The TLD model December 1, / 35

13 The TLD model TLD is short for two-locus diallelic model with mutation and recombination. Model assumptions and notations: The two possible alleles on each of the two loci are assumed to be A 1, A 2 and B 1, B 2, thus the four possible types of gamete are A 1 B 1, A 1 B 2, A 2 B 1 and A 2 B 2. Recombination rate C: A i B j + A m B n A i B n /A m B j. Equal mutation rate µ for both loci: A 1 A 2 and B 1 B 2. Background The TLD model December 1, / 35

14 The TLD model Background The TLD model December 1, / 35

15 The TLD model Table 1: The proportions of gametes in generation T and the expected proportions in generation T + 1 in the population. N is the population size. Generation Gamete A 1 B 1 A 1 B 2 A 2 B 1 A 2 B 2 T x 1 2N x 2 2N x 3 2N x 4 2N T + 1 φ 1 φ 2 φ 3 φ 4 Background The TLD model December 1, / 35

16 The TLD model Suppose We have φ 1 (x) = x 1 2N (1 µ)2 + φ 2 (x) = x 2 2N (1 µ)2 + φ 3 (x) = x 3 2N (1 µ)2 + φ 4 (x) = x 4 2N (1 µ)2 + D(x) = x 1 2N ( x2 2N + x 3 2N ( x1 2N + x 4 2N ( x1 2N + x 4 2N ( x2 2N + x 3 2N x 4 2N x 2 2N x 3 2N. ) µ(1 µ) + x 4 ) µ(1 µ) + x 3 ) µ(1 µ) + x 2 2N µ2 CD(x)(1 2µ) 2 2N µ2 + CD(x)(1 2µ) 2 2N µ2 + CD(x)(1 2µ) 2 ) µ(1 µ) + x 1 2N µ2 CD(x)(1 2µ) 2. Background The TLD model December 1, / 35

17 The TLD model The transition probability of going from x = (x 1, x 2, x 3, x 4 ) to y = (y 1, y 2, y 3, y 4 ) is: p xy = P (y x) = (2N)! y 1!y 2!y 3!y 4! (φ 1 (x)) y 1 (φ 2 (x)) y 2 (φ 3 (x)) y 3 (φ 4 (x)) y 4. Background The TLD model December 1, / 35

18 The TLD model The transition probability of going from x = (x 1, x 2, x 3, x 4 ) to y = (y 1, y 2, y 3, y 4 ) is: p xy = P (y x) = (2N)! y 1!y 2!y 3!y 4! (φ 1 (x)) y 1 (φ 2 (x)) y 2 (φ 3 (x)) y 3 (φ 4 (x)) y 4. The TLD model is an irreducible aperiodic Markov chain, thus there exists a unique stationary distribution. Background The TLD model December 1, / 35

19 Diffusion approximation The main idea is to rescale the discrete state space and time space by a factor related to the population size N, so that the gap between two successive states in the new space is infinitesimal when N is large enough. Background Diffusion approximation December 1, / 35

20 Diffusion approximation The main idea is to rescale the discrete state space and time space by a factor related to the population size N, so that the gap between two successive states in the new space is infinitesimal when N is large enough. Example State space {0, 1, 2,, 2N} (2N) 1 {0, 1 2N, 2 2N,, 1}. Time space {0, 1, 2, } (2N) 1 {0δt, 1δt, 2δt, }, where δt = 1 2N. Background Diffusion approximation December 1, / 35

21 Diffusion approximation When N is large enough, the new chain is approximately continuous. In the derivation process, Taylor series expansion and the definition of derivative are used to get two important results for the TLD model. f f(x+ x) f(x) (x) = lim x 0 x Background Diffusion approximation December 1, / 35

22 The diffusion operator and master equation The diffusion operator for the TLD model is L = 1 2 p (1 p) 2 p q (1 q) 2 q { p (1 p) q (1 q) + D (1 2p) (1 2q) D 2 } 2 2 D 2 + D 2 p q + D (1 2p) 2 p D + D (1 2q) 2 q D + θ 4 (1 2p) p + θ 4 (1 2q) q ( D 1 + ρ ) 2 + θ D and the master equation at stationarity is E {Lf (p, q, D)} = 0. The master equation means the expected evolution over time of any nice function of p, q and D is zero at stationarity. Background Diffusion approximation December 1, / 35

23 The diffusion operator and master equation The diffusion operator for the TLD model is L = 1 2 p (1 p) 2 p q (1 q) 2 q { p (1 p) q (1 q) + D (1 2p) (1 2q) D 2 } 2 2 D 2 + D 2 p q + D (1 2p) 2 p D + D (1 2q) 2 q D + θ 4 (1 2p) p + θ 4 (1 2q) q ( D 1 + ρ ) 2 + θ D and the master equation at stationarity is E {Lf (p, q, D)} = 0. Here p and q are the frequencies of A 1 and B 1, D = f 11 pq, f 11 is the frequency of gamete A 1 B 1, ρ = 2NC, θ = 2Nµ and f is any twice continuously differentiable function with compact support. Background Diffusion approximation December 1, / 35

24 A simple example If letting f (p, q, D) = D, we can get that Lf (p, q, D) = D (1 + ρ ) 2 + θ and so E {Lf (p, q, D)} = (1 + ρ ) 2 + θ E (D) = 0 E (D) = 0. Background Diffusion approximation December 1, / 35

25 Maximum entropy principle Definition Entropy is the quantitative measure of disorder in a system. Suppose a random variable Z has K possible outcomes with probabilities p 1, p 2, p 3,, p K, the entropy is: I (Z) = K p i log K (p i ). i=1 Background Maximum entropy principle December 1, / 35

26 Maximum entropy principle Definition Entropy is the quantitative measure of disorder in a system. Suppose a random variable Z has K possible outcomes with probabilities p 1, p 2, p 3,, p K, the entropy is: I (Z) = K p i log K (p i ). i=1 The maximum entropy principle states that the solution that maximises the entropy is the most honest one. Background Maximum entropy principle December 1, / 35

27 An example In a coin toss experiment, suppose Pr(head)=p and Pr(tail)=1 p, then the entropy is: I = p log 2 (p) (1 p) log 2 (1 p). Background Maximum entropy principle December 1, / 35

28 An example Relationship of the entropy and p Entropy (0.5, 1.0) p Background Maximum entropy principle December 1, / 35

29 Maximum entropy principle The Maxent solution of an unknown probability density function π (p) given knowledge of n moments m i = E ( p i), i = 1,, n is the solution π n (p) that maximizes: I = π n (p) log { π n (p)} dp subject to m i = where Ω is the support of π. Ω Ω p i π n (p) dp for i = 0, 1,, n Background Maximum entropy principle December 1, / 35

30 Maximum entropy principle Considering the Lagrange function and Euler-Lagrange equation, then the Maxent solution is: π n (p) = exp ( λ 0 + λ 1 p + λ 2 p λ n p n) where λ i, i = 0,, n are the solutions of { arg min λ exp ( λ 0 + λ 1 p + λ 2 p λ n p n) dp Ω } n λ i m i. i=0 Background Maximum entropy principle December 1, / 35

31 Linkage disequilibrium coefficient r 2 The definition of r 2 is: r 2 = D 2 p (1 p) q (1 q) where p and q are the frequencies of A 1 and B 1, D = f 11 pq and f 11 is the frequency of gamete A 1 B 1. Methodology Analytic computation of the moments December 1, / 35

32 Reformulation of the problem Let u = 1 2p and v = 1 2q. Then the diffusion generator can be rewritten as L = 1 2 ( 1 u 2 ) 2 u ( 1 v 2 ) 2 v D 2 u v 2Du 2 D u 2Dv 2 D v 1 2 θu { 1 ( 1 u 2 ) ( 1 v 2) } + Duv D D 2 u 1 2 θv ( v D ρ + θ ) D. This reparameterization yields r 2 = D 2 p (1 p) q (1 q) = 16D 2 (1 u 2 ) (1 v 2 ). Methodology Analytic computation of the moments December 1, / 35

33 Analytic computation of the moments Note that when 0 u 2, v 2 < 1: 1 1 u 2 = k=0 u 2k and 1 1 v 2 = v 2l. l=0 Considering the new form of r 2, it follows that when M = 1, 2 E (r ) 2M = 16 M k 1 =0 k M =0 l 1 =0 which can be simplified to E ( r 2M) = 16 M K=0 L=0 l M =0 { } E D 2M u 2(k 1+k 2 + +k M ) v 2(l 1+l 2 + +l M ), ( )( K + M 1 L + M 1 M 1 M 1 ) E ( D 2M u 2K v 2L). Methodology Analytic computation of the moments December 1, / 35

34 Analytic computation of the moments Our problem now is how to compute E ( D 2M u 2K v 2L) for all possible M, K and L. Step 1: Let f in the master equation be some specific forms of u n, uv, u 2 v and Du n, we can get the results of E (u n ), E (uv), E ( u 2 v ) etc. Step 2: Given a value of (m, n), apply the function f = D k u m+2 k v n+2 k into the master equation with k {0, 1, 2,, n + 2}. A system of n + 3 linear equations is generated, whose solutions are: E ( u m+2 v n+2), E ( Du m+1 v n+1), E ( D 2 u m v n) Methodology Analytic computation of the moments December 1, / 35

35 Analytic computation of the moments Given a M N and a truncation level l max N, we use E ( r 2M ) l max = 16 M 2K+2L=l max K,L 0 ( )( ) K + M 1 L + M 1 E ( D 2M u 2K v 2L) M 1 M 1 to approximate the M-th moment of r 2. Methodology Analytic computation of the moments December 1, / 35

36 Variance of r 2 Table 2: V ( r 2) computed by our method (l max = 700) ρ θ Results Variance of r 2 December 1, / 35

37 Variance of r 2 V(r 2 ) Variance of r 2 with ρ=0.25 Liu (2012) Analytic method θ V(r 2 ) Variance of r 2 with ρ=2.50 Liu (2012) Analytic method θ Figure 1: Comparison of V ( r 2) between our analytic method and the method in Liu(2012) Results Variance of r 2 December 1, / 35

38 Probability density function of r 2 We can compute 50 moments in 2.5 hours with l max = 2000 on a laptop. Use n = 18 moments to calculate Maxent π ( r 2). Compare moments 19, 20,..., 50 using π ( r 2) vs analytic method. 2 π~ ) n(r Maxent density of r 2 using moments up to order n ( θ = ; ρ = 5 ) ( No.Nodes = 1000 ) n = 18 Moment 5.0e e e 06 Analytic moments compared with the results of Maxent ( θ = ; ρ = 5 ) ( No.Nodes = 1000 ; No.Moments = 18 ) Analytic Maxent r Order of moment Results Probability distribution of r 2 December 1, / 35

39 Probability density function of r 2 2 π~ ) n(r Maxent density of r 2 using moments up to order n ( θ = ; ρ = 0 ) ( No.Nodes = 1000 ) n = 16 Moment Analytic moments compared with the results of Maxent ( θ = ; ρ = 0 ) ( No.Nodes = 1000 ; No.Moments = 16 ) Analytic Maxent 0e+00 2e 04 4e 04 6e 04 8e 04 1e 03 r Order of moment Figure 2: Stationary density functions of r 2 for two pairs of θ and ρ approximated by the numerical univariate Maxent method. Results Probability distribution of r 2 December 1, / 35

40 Acknowledgements I would like to sincerely and gratefully thank my supervisor Rachel Fewster for her guidance, Dr. Jesse Goodman and Dr. Jing Liu for their useful ideas. Acknowledgements December 1, / 35

41 References Y. S. Song and J. S. Song, Analytic computation of the expectation of the linkage disequilibrium coefficient r 2, Theoretical population biology, vol. 71, no. 1, pp , J. Liu, Reconstruction of probability distributions in population genetics. PhD thesis, The University of Auckland, X. Wu, Calculation of maximum entropy densities with application to income distribution, Journal of Econometrics, vol. 115, no. 2, pp , M. Slatkin, Linkage disequilibrium understanding the evolutionary past and mapping the medical future, Nature Reviews Genetics, vol. 9, no. 6, pp , W. G. Hill and A. Robertson, Linkage disequilibrium in finite populations, Theoretical and Applied Genetics, vol. 38, no. 6, pp , L. R. Mead and N. Papanicolaou, Maximum entropy in the problem of moments, Journal of Mathematical Physics, vol. 25, no. 8, pp , References December 1, / 35

42 Thanks!

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Evolutionary Theory Mathematical and Conceptual Foundations Sean H. Rice Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Contents Preface ix Introduction 1 CHAPTER 1 Selection on One

More information

Population Genetics: a tutorial

Population Genetics: a tutorial : a tutorial Institute for Science and Technology Austria ThRaSh 2014 provides the basic mathematical foundation of evolutionary theory allows a better understanding of experiments allows the development

More information

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) 12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²

More information

Analytic Computation of the Expectation of the Linkage Disequilibrium Coefficient r 2

Analytic Computation of the Expectation of the Linkage Disequilibrium Coefficient r 2 Analytic Computation of the Expectation of the Linkage Disequilibrium Coefficient r 2 Yun S. Song a and Jun S. Song b a Department of Computer Science, University of California, Davis, CA 95616, USA b

More information

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M. STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).

More information

(Genome-wide) association analysis

(Genome-wide) association analysis (Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Problems for 3505 (2011)

Problems for 3505 (2011) Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.

More information

Lecture 24: Multivariate Response: Changes in G. Bruce Walsh lecture notes Synbreed course version 10 July 2013

Lecture 24: Multivariate Response: Changes in G. Bruce Walsh lecture notes Synbreed course version 10 July 2013 Lecture 24: Multivariate Response: Changes in G Bruce Walsh lecture notes Synbreed course version 10 July 2013 1 Overview Changes in G from disequilibrium (generalized Bulmer Equation) Fragility of covariances

More information

Runaway. demogenetic model for sexual selection. Louise Chevalier. Jacques Labonne

Runaway. demogenetic model for sexual selection. Louise Chevalier. Jacques Labonne Runaway demogenetic model for sexual selection Louise Chevalier Master 2 thesis UPMC, Specialization Oceanography and Marine Environments Jacques Labonne UMR Ecobiop INRA - National Institute for Agronomic

More information

Exercises with solutions (Set D)

Exercises with solutions (Set D) Exercises with solutions Set D. A fair die is rolled at the same time as a fair coin is tossed. Let A be the number on the upper surface of the die and let B describe the outcome of the coin toss, where

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES: .5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the

More information

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Outline Genetics of complex

More information

The Quantitative TDT

The Quantitative TDT The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus

More information

Mathematical models in population genetics II

Mathematical models in population genetics II Mathematical models in population genetics II Anand Bhaskar Evolutionary Biology and Theory of Computing Bootcamp January 1, 014 Quick recap Large discrete-time randomly mating Wright-Fisher population

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics 1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

More information

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.

More information

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017 Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping

More information

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular

More information

When one gene is wild type and the other mutant:

When one gene is wild type and the other mutant: Series 2: Cross Diagrams Linkage Analysis There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome:

More information

Introduc)on to Gene)cs How to Analyze Your Own Genome Fall 2013

Introduc)on to Gene)cs How to Analyze Your Own Genome Fall 2013 Introduc)on to Gene)cs 02-223 How to Analyze Your Own Genome Fall 2013 Overview Primer on gene

More information

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype. Series 2: Cross Diagrams - Complementation There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome:

More information

Inbreeding depression due to stabilizing selection on a quantitative character. Emmanuelle Porcher & Russell Lande

Inbreeding depression due to stabilizing selection on a quantitative character. Emmanuelle Porcher & Russell Lande Inbreeding depression due to stabilizing selection on a quantitative character Emmanuelle Porcher & Russell Lande Inbreeding depression Reduction in fitness of inbred vs. outbred individuals Outcrossed

More information

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important? Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population

More information

Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012

Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012 Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012 Last Time Sequence data and quantification of variation Infinite sites model Nucleotide diversity (π) Sequence-based

More information

Linkage and Linkage Disequilibrium

Linkage and Linkage Disequilibrium Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies

More information

CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation

CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation Instructor: Arindam Banerjee November 26, 2007 Genetic Polymorphism Single nucleotide polymorphism (SNP) Genetic Polymorphism

More information

A. Correct! Genetically a female is XX, and has 22 pairs of autosomes.

A. Correct! Genetically a female is XX, and has 22 pairs of autosomes. MCAT Biology - Problem Drill 08: Meiosis and Genetic Variability Question No. 1 of 10 1. A human female has pairs of autosomes and her sex chromosomes are. Question #01 (A) 22, XX. (B) 23, X. (C) 23, XX.

More information

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/

More information

Bayesian Methods with Monte Carlo Markov Chains II

Bayesian Methods with Monte Carlo Markov Chains II Bayesian Methods with Monte Carlo Markov Chains II Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University hslu@stat.nctu.edu.tw http://tigpbp.iis.sinica.edu.tw/courses.htm 1 Part 3

More information

Evolutionary quantitative genetics and one-locus population genetics

Evolutionary quantitative genetics and one-locus population genetics Evolutionary quantitative genetics and one-locus population genetics READING: Hedrick pp. 57 63, 587 596 Most evolutionary problems involve questions about phenotypic means Goal: determine how selection

More information

Q2 (4.6) Put the following in order from biggest to smallest: Gene DNA Cell Chromosome Nucleus. Q8 (Biology) (4.6)

Q2 (4.6) Put the following in order from biggest to smallest: Gene DNA Cell Chromosome Nucleus. Q8 (Biology) (4.6) Q1 (4.6) What is variation? Q2 (4.6) Put the following in order from biggest to smallest: Gene DNA Cell Chromosome Nucleus Q3 (4.6) What are genes? Q4 (4.6) What sort of reproduction produces genetically

More information

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM Life Cycles, Meiosis and Genetic Variability iclicker: 1. A chromosome just before mitosis contains two double stranded DNA molecules. 2. This replicated chromosome contains DNA from only one of your parents

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

Lecture WS Evolutionary Genetics Part I 1

Lecture WS Evolutionary Genetics Part I 1 Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in

More information

1.3 Forward Kolmogorov equation

1.3 Forward Kolmogorov equation 1.3 Forward Kolmogorov equation Let us again start with the Master equation, for a system where the states can be ordered along a line, such as the previous examples with population size n = 0, 1, 2,.

More information

NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS

NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS æ 2 NEUTRAL EVOLUTION IN ONE- AND TWO-LOCUS SYSTEMS 19 May 2014 Variations neither useful nor injurious would not be affected by natural selection, and would be left either a fluctuating element, as perhaps

More information

BIRS workshop Sept 6 11, 2009

BIRS workshop Sept 6 11, 2009 Diploid biparental Moran model with large offspring numbers and recombination Bjarki Eldon New mathematical challenges from molecular biology and genetics BIRS workshop Sept 6, 2009 Mendel s Laws The First

More information

MEIOSIS C H A P T E R 1 3

MEIOSIS C H A P T E R 1 3 MEIOSIS CHAPTER 13 CENTRAL DOGMA OF BIOLOGY DNA RNA Protein OFFSPRING ACQUIRE GENES FROM PARENTS Genes are segments of DNA that program specific traits. Genetic info is transmitted as specific sequences

More information

p(d g A,g B )p(g B ), g B

p(d g A,g B )p(g B ), g B Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)

More information

URN MODELS: the Ewens Sampling Lemma

URN MODELS: the Ewens Sampling Lemma Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 3, 2014 1 2 3 4 Mutation Mutation: typical values for parameters Equilibrium Probability of fixation 5 6 Ewens Sampling

More information

Introduction to Natural Selection. Ryan Hernandez Tim O Connor

Introduction to Natural Selection. Ryan Hernandez Tim O Connor Introduction to Natural Selection Ryan Hernandez Tim O Connor 1 Goals Learn about the population genetics of natural selection How to write a simple simulation with natural selection 2 Basic Biology genome

More information

Frequency Spectra and Inference in Population Genetics

Frequency Spectra and Inference in Population Genetics Frequency Spectra and Inference in Population Genetics Although coalescent models have come to play a central role in population genetics, there are some situations where genealogies may not lead to efficient

More information

Bayesian Inference of Interactions and Associations

Bayesian Inference of Interactions and Associations Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,

More information

Selection and Population Genetics

Selection and Population Genetics Selection and Population Genetics Evolution by natural selection can occur when three conditions are satisfied: Variation within populations - individuals have different traits (phenotypes). height and

More information

Patterns of inheritance

Patterns of inheritance Patterns of inheritance Learning goals By the end of this material you would have learnt about: How traits and characteristics are passed on from one generation to another The different patterns of inheritance

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

Linking levels of selection with genetic modifiers

Linking levels of selection with genetic modifiers Linking levels of selection with genetic modifiers Sally Otto Department of Zoology & Biodiversity Research Centre University of British Columbia @sarperotto @sse_evolution @sse.evolution Sally Otto Department

More information

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013 Lecture 9 Short-Term Selection Response: Breeder s equation Bruce Walsh lecture notes Synbreed course version 3 July 2013 1 Response to Selection Selection can change the distribution of phenotypes, and

More information

Computational Approaches to Statistical Genetics

Computational Approaches to Statistical Genetics Computational Approaches to Statistical Genetics GWAS I: Concepts and Probability Theory Christoph Lippert Dr. Oliver Stegle Prof. Dr. Karsten Borgwardt Max-Planck-Institutes Tübingen, Germany Tübingen

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

Linkage and Chromosome Mapping

Linkage and Chromosome Mapping Linkage and Chromosome Mapping I. 1 st year, 2 nd semester, week 11 2008 Aleš Panczak, ÚBLG 1. LF a VFN Terminology, definitions The term recombination ratio (fraction), Θ (Greek letter theta), is used

More information

Darwinian Selection. Chapter 6 Natural Selection Basics 3/25/13. v evolution vs. natural selection? v evolution. v natural selection

Darwinian Selection. Chapter 6 Natural Selection Basics 3/25/13. v evolution vs. natural selection? v evolution. v natural selection Chapter 6 Natural Selection Basics Natural Selection Haploid Diploid, Sexual Results for a Diallelic Locus Fisher s Fundamental Theorem Darwinian Selection v evolution vs. natural selection? v evolution

More information

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype. Series 1: Cross Diagrams There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome: When both

More information

A New Metric for Parental Selection in Plant Breeding

A New Metric for Parental Selection in Plant Breeding Graduate Theses and Dissertations Graduate College 2014 A New Metric for Parental Selection in Plant Breeding Ye Han Iowa State University Follow this and additional works at: http://libdriastateedu/etd

More information

Genetics (patterns of inheritance)

Genetics (patterns of inheritance) MENDELIAN GENETICS branch of biology that studies how genetic characteristics are inherited MENDELIAN GENETICS Gregory Mendel, an Augustinian monk (1822-1884), was the first who systematically studied

More information

Tutorial on Theoretical Population Genetics

Tutorial on Theoretical Population Genetics Tutorial on Theoretical Population Genetics Joe Felsenstein Department of Genome Sciences and Department of Biology University of Washington, Seattle Tutorial on Theoretical Population Genetics p.1/40

More information

Segregation versus mitotic recombination APPENDIX

Segregation versus mitotic recombination APPENDIX APPENDIX Waiting time until the first successful mutation The first time lag, T 1, is the waiting time until the first successful mutant appears, creating an Aa individual within a population composed

More information

Chapter 13: Meiosis and Sexual Life Cycles Overview: Hereditary Similarity and Variation

Chapter 13: Meiosis and Sexual Life Cycles Overview: Hereditary Similarity and Variation Chapter 13: Meiosis and Sexual Life Cycles Overview: Hereditary Similarity and Variation Living organisms Are distinguished by their ability to reproduce their own kind Biology, 7 th Edition Neil Campbell

More information

2. Map genetic distance between markers

2. Map genetic distance between markers Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,

More information

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees:

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees: MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

How robust are the predictions of the W-F Model?

How robust are the predictions of the W-F Model? How robust are the predictions of the W-F Model? As simplistic as the Wright-Fisher model may be, it accurately describes the behavior of many other models incorporating additional complexity. Many population

More information

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory

More information

Outline for today s lecture (Ch. 14, Part I)

Outline for today s lecture (Ch. 14, Part I) Outline for today s lecture (Ch. 14, Part I) Ploidy vs. DNA content The basis of heredity ca. 1850s Mendel s Experiments and Theory Law of Segregation Law of Independent Assortment Introduction to Probability

More information

Affected Sibling Pairs. Biostatistics 666

Affected Sibling Pairs. Biostatistics 666 Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD

More information

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation? October 12, 2009 Bioe 109 Fall 2009 Lecture 8 Microevolution 1 - selection The Hardy-Weinberg-Castle Equilibrium - consider a single locus with two alleles A 1 and A 2. - three genotypes are thus possible:

More information

1. Natural selection can only occur if there is variation among members of the same species. WHY?

1. Natural selection can only occur if there is variation among members of the same species. WHY? 1. Natural selection can only occur if there is variation among members of the same species. WHY? Variation in a population results from mutation and the recombination of alleles during meiosis and fertilization.

More information

Linear Regression (1/1/17)

Linear Regression (1/1/17) STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression

More information

Sexual Reproduction and Genetics

Sexual Reproduction and Genetics Sexual Reproduction and Genetics Mitosis is a form of asexual reproduction This means that it only requires 1 organism (ex. Skin cells dividing) For growth and repair in somatic (body) cells! Results

More information

Association studies and regression

Association studies and regression Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities

More information

Agenda. 1. Lesson Learning Goals 2. Meiosis 3. Meiosis Bingo

Agenda. 1. Lesson Learning Goals 2. Meiosis 3. Meiosis Bingo Meiosis SBI 3U Agenda 1. Lesson Learning Goals 2. Meiosis 3. Meiosis Bingo Learning Goals By the end of today s lesson, you will be able: To use proper vocabulary related to this unit, including meiosis,

More information

8. Genetic Diversity

8. Genetic Diversity 8. Genetic Diversity Many ways to measure the diversity of a population: For any measure of diversity, we expect an estimate to be: when only one kind of object is present; low when >1 kind of objects

More information

arxiv: v2 [q-bio.pe] 14 Mar 2016

arxiv: v2 [q-bio.pe] 14 Mar 2016 A general approximation for the dynamics of quantitative traits Katarína Bod ová, Gašper Tkačik, Nicholas H. Barton Institute of Science and Technology Austria (IST Austria), Am Campus, Klosterneuburg

More information

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate Natural Selection Population Dynamics Humans, Sickle-cell Disease, and Malaria How does a population of humans become resistant to malaria? Overproduction Environmental pressure/competition Pre-existing

More information

I genetic systems in an effort to understand the structure of natural populations.

I genetic systems in an effort to understand the structure of natural populations. ON TREATING THE CHROMOSOME AS THE UNIT OF SELECTION* MONTGOMERY SLATKIN Department of Theoretical Biology, Uniuersity of Chicago, Chicago, Illinois 60637 Manuscript received July 19, 1971 ABSTRACT A simple

More information

Random Times and Their Properties

Random Times and Their Properties Chapter 6 Random Times and Their Properties Section 6.1 recalls the definition of a filtration (a growing collection of σ-fields) and of stopping times (basically, measurable random times). Section 6.2

More information

Unit 6 : Meiosis & Sexual Reproduction

Unit 6 : Meiosis & Sexual Reproduction Unit 6 : Meiosis & Sexual Reproduction 2006-2007 Cell division / Asexual reproduction Mitosis produce cells with same information identical daughter cells exact copies clones same number of chromosomes

More information

Solutions to Problem Set 4

Solutions to Problem Set 4 Question 1 Solutions to 7.014 Problem Set 4 Because you have not read much scientific literature, you decide to study the genetics of garden peas. You have two pure breeding pea strains. One that is tall

More information

Econometrics I, Estimation

Econometrics I, Estimation Econometrics I, Estimation Department of Economics Stanford University September, 2008 Part I Parameter, Estimator, Estimate A parametric is a feature of the population. An estimator is a function of the

More information

Closed-form sampling formulas for the coalescent with recombination

Closed-form sampling formulas for the coalescent with recombination 0 / 21 Closed-form sampling formulas for the coalescent with recombination Yun S. Song CS Division and Department of Statistics University of California, Berkeley September 7, 2009 Joint work with Paul

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

Microevolution Changing Allele Frequencies

Microevolution Changing Allele Frequencies Microevolution Changing Allele Frequencies Evolution Evolution is defined as a change in the inherited characteristics of biological populations over successive generations. Microevolution involves the

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm

More information

Ch 11.Introduction to Genetics.Biology.Landis

Ch 11.Introduction to Genetics.Biology.Landis Nom Section 11 1 The Work of Gregor Mendel (pages 263 266) This section describes how Gregor Mendel studied the inheritance of traits in garden peas and what his conclusions were. Introduction (page 263)

More information

Objectives. Announcements. Comparison of mitosis and meiosis

Objectives. Announcements. Comparison of mitosis and meiosis Announcements Colloquium sessions for which you can get credit posted on web site: Feb 20, 27 Mar 6, 13, 20 Apr 17, 24 May 15. Review study CD that came with text for lab this week (especially mitosis

More information

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals.

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals. 1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals. Which statement best explains how these processes often produce offspring that have traits not

More information

Meiosis. Two distinct divisions, called meiosis I and meiosis II

Meiosis. Two distinct divisions, called meiosis I and meiosis II Meiosis A process in which the number of chromosomes per cell is cut in half through the separation of homologous chromosomes to form gametes, or sex cells Two distinct divisions, called meiosis I and

More information

Introduction to Linkage Disequilibrium

Introduction to Linkage Disequilibrium Introduction to September 10, 2014 Suppose we have two genes on a single chromosome gene A and gene B such that each gene has only two alleles Aalleles : A 1 and A 2 Balleles : B 1 and B 2 Suppose we have

More information

Research Question How are gametes produced?

Research Question How are gametes produced? Name Date AP Biology Unit 3 Research Question How are gametes produced? Background Meiosis resembles mitosis but serves a very different purpose. Meiosis is a cell division resulting in the halving, or

More information

1 Errors in mitosis and meiosis can result in chromosomal abnormalities.

1 Errors in mitosis and meiosis can result in chromosomal abnormalities. Slide 1 / 21 1 Errors in mitosis and meiosis can result in chromosomal abnormalities. a. Identify and describe a common chromosomal mutation. Slide 2 / 21 Errors in mitosis and meiosis can result in chromosomal

More information

Genetic Variation in Finite Populations

Genetic Variation in Finite Populations Genetic Variation in Finite Populations The amount of genetic variation found in a population is influenced by two opposing forces: mutation and genetic drift. 1 Mutation tends to increase variation. 2

More information

Linkage and Chromosome Mapping

Linkage and Chromosome Mapping Linkage and Chromosome Mapping I. 1 st year, 2 nd semester, week 11 2007 Aleš Panczak, ÚBLG 1. LF a VFN Terminology, definitions The term recombination ratio (fraction), Θ (Greek letter theta), is used

More information

Math 151. Rumbos Fall Solutions to Review Problems for Exam 2. Pr(X = 1) = ) = Pr(X = 2) = Pr(X = 3) = p X. (k) =

Math 151. Rumbos Fall Solutions to Review Problems for Exam 2. Pr(X = 1) = ) = Pr(X = 2) = Pr(X = 3) = p X. (k) = Math 5. Rumbos Fall 07 Solutions to Review Problems for Exam. A bowl contains 5 chips of the same size and shape. Two chips are red and the other three are blue. Draw three chips from the bowl at random,

More information

Overview. Background

Overview. Background Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems

More information

Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th

Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th Name: Date: Block: Chapter 6 Meiosis and Mendel Section 6.1 Chromosomes and Meiosis 1. How do gametes differ from somatic cells? Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February

More information

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift: 9. Genetic Drift Genetic drift is the alteration of gene frequencies due to sampling variation from one generation to the next. It operates to some degree in all finite populations, but can be significant

More information