Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

Similar documents
Lecture 4: Allelic Effects and Genetic Variances. Bruce Walsh lecture notes Tucson Winter Institute 7-9 Jan 2013

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition

Lecture 2. Fisher s Variance Decomposition

Lecture 2. Basic Population and Quantitative Genetics

Lecture 2: Introduction to Quantitative Genetics

Partitioning the Genetic Variance

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 9. QTL Mapping 2: Outbred Populations

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE

Quantitative Genetics I: Traits controlled my many loci. Quantitative Genetics: Traits controlled my many loci

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Partitioning Genetic Variance

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Lecture 3 Basic Concepts in Mendelian, Population and Quantitative Genetics

Variance Components: Phenotypic, Environmental and Genetic

Partitioning the Genetic Variance. Partitioning the Genetic Variance

Quantitative Genetics

Linear Regression (1/1/17)

Introduction to Genetic Analysis: Basic Concepts in Mendelian, Population and Quantitative Genetics

Quantitative characters - exercises

1. they are influenced by many genetic loci. 2. they exhibit variation due to both genetic and environmental effects.

Lecture WS Evolutionary Genetics Part I 1

Evolution of phenotypic traits

Introduction to Quantitative Genetics. Introduction to Quantitative Genetics

Lecture 6. QTL Mapping

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 2. Genetics of quantitative (multifactorial) traits What is known about such traits How they are modeled

The concept of breeding value. Gene251/351 Lecture 5

Statistical power for detecting epistasis QTL effects under the F-2 design

The Quantitative TDT

Lecture 4. Basic Designs for Estimation of Genetic Parameters

Observing Patterns in Inherited Traits

Evolution of quantitative traits

Quantitative characters II: heritability

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013

Lecture 13 Family Selection. Bruce Walsh lecture notes Synbreed course version 4 July 2013

3. Properties of the relationship matrix

SOLUTIONS TO EXERCISES FOR CHAPTER 9

Quantitative characters

- interactions between alleles. - multiple phenotypic effects of one gene. - phenotypic variability in single genes. - interactions between genes

Introduction to Genetics

H = σ 2 G / σ 2 P heredity determined by genotype. degree of genetic determination. Nature vs. Nurture.

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency

Genetics (patterns of inheritance)

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

Major Genes, Polygenes, and

Quantitative characters

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

Quantitative Trait Variation

Lecture 11: Multiple trait models for QTL analysis

MIXED MODELS THE GENERAL MIXED MODEL

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Lecture 6: Selection on Multiple Traits

Natural Selection results in increase in one (or more) genotypes relative to other genotypes.

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

Introduction to Genetics

Long-Term Response and Selection limits

Variation and its response to selection

Chapter 7 The Genetic Model for Quantitative Traits

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype.

I Have the Power in QTL linkage: single and multilocus analysis

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012

Resemblance among relatives

Lecture 3: Basic Statistical Tools. Bruce Walsh lecture notes Tucson Winter Institute 7-9 Jan 2013

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 3. The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs

Genetical theory of natural selection

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

Overview. Background

Guided Notes Unit 6: Classical Genetics

Mapping QTL to a phylogenetic tree

Legend: S spotted Genotypes: P1 SS & ss F1 Ss ss plain F2 (with ratio) 1SS :2 WSs: 1ss. Legend W white White bull 1 Ww red cows ww ww red

Problems for 3505 (2011)

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50

Breeding Values and Inbreeding. Breeding Values and Inbreeding

Lecture 7 Correlated Characters

DNA Structure and Function

Estimating Breeding Values

Chapter 13 Meiosis and Sexual Reproduction

Affected Sibling Pairs. Biostatistics 666

Population Genetics: a tutorial

Genetics and Natural Selection

Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Uppsala EQG course version 31 Jan 2012

Evolutionary quantitative genetics and one-locus population genetics

Calculation of IBD probabilities

6.6 Meiosis and Genetic Variation. KEY CONCEPT Independent assortment and crossing over during meiosis result in genetic diversity.

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate

Calculation of IBD probabilities

Computational Systems Biology: Biology X

Lecture 2: Linear and Mixed Models

heritable diversity feb ! gene 8840 biol 8990

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

Evolutionary Genetics Midterm 2008

Multiple random effects. Often there are several vectors of random effects. Covariance structure

On coding genotypes for genetic markers with multiple alleles in genetic association study of quantitative traits

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals.

Transcription:

Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

Quantitative Genetics The analysis of traits whose variation is determined by both a number of genes and environmental factors Phenotype is highly uninformative as to underlying genotype

Complex (or Quantitative) trait No (apparent) simple Mendelian basis for variation in the trait May be a single gene strongly influenced by environmental factors May be the result of a number of genes of equal (or differing) effect Most likely, a combination of both multiple genes and environmental factors Example: Blood pressure, cholesterol levels Known genetic and environmental risk factors Molecular traits can also be quantitative traits mrna level on a microarray analysis Protein spot volume on a 2-D gel

Phenotypic distribution of a trait

Consider a specific locus influencing the trait For this locus, mean phenotype = 0.15, while overall mean phenotype = 0

Goals of Quantitative Genetics Partition total trait variation into genetic (nature) vs. environmental (nurture) components Predict resemblance between relatives If a sib has a disease/trait, what are your odds? Find the underlying loci contributing to genetic variation QTL -- quantitative trait loci Deduce molecular basis for genetic trait variation eqtls -- expression QTLs, loci with a quantitative influence on gene expression e.g., QTLs influencing mrna abundance on a microarray

Dichotomous (binary) traits Presence/absence traits (such as a disease) can (and usually do) have a complex genetic basis Consider a disease susceptibility (DS) locus underlying a disease, with alleles D and d, where allele D significantly increases your disease risk In particular, Pr(disease DD) = 0.5, so that the penetrance of genotype DD is 50% Suppose Pr(disease Dd ) = 0.2, Pr(disease dd) = 0.05 dd individuals can rarely display the disease, largely because of exposure to adverse environmental conditions

dd individuals can give rise to phenocopies 5% of the time, showing the disease but not as a result of carrying the risk allele If freq(d) = 0.9, what is Prob (DD show disease)? freq(disease) = 0.1 2 *0.5 + 2*0.1*0.9*0.2 + 0.9 2 *0.05 = 0.0815 From Bayes theorem, Pr(DD disease) = Pr(disease DD)*Pr(DD)/Prob(disease) = 0.1 2 *0.5 / 0.0815 = 0.06 (6 %) Pr(Dd disease) = 0.442, Pr(dd disease) = 0.497 Thus about 50% of the diseased individuals are phenocopies

Basic model of Quantitative Genetics Phenotypic value -- we will occasionally also use z for this value Basic model: P = G + E Environmental value Genotypic value G = average phenotypic value for that genotype if we are able to replicate it over the universe of environmental values, G = E[P]

Basic model of Quantitative Genetics Basic model: P = G + E G = average phenotypic value for that genotype if we are able to replicate it over the universe of environmental values, G = E[P] G x E interaction --- G values are different across environments. Basic model now becomes P = G + E + GE

Contribution of a locus to a trait Q 1 Q 1 Q 2 Q 1 Q 2 Q 2 C C + a(1+k) C + 2a C C + a + d C + 2a C -a C + d C + a 2a = G(Q 2 Q 2 ) - G(Q 1 Q 1 ) d measures dominance, with d = 0 if the heterozygote is exactly intermediate to the two homozygotes d = ak =G(Q 1 Q 2 ) - [G(Q 2 Q 2 ) + G(Q 1 Q 1 ) ]/2 k = d/a is a scaled measure of the dominance

Example: Apolipoprotein E & Alzheimer s Genotype ee Ee EE Average age of onset 68.4 75.5 84.3 2a = G(EE) - G(ee) = 84.3-68.4 --> a = 7.95 ak =d = G(Ee) - [ G(EE)+G(ee)]/2 = -0.85 k = d/a = 0.10 Only small amount of dominance

Example: Booroola (B) gene Genotype bb Bb BB Average Litter size 1.48 2.17 2.66 2a = G(BB) - G(bb) = 2.66-1.46 --> a = 0.59 ak =d = G(Bb) - [ G(BB)+G(bb)]/2 = 0.10 k = d/a = 0.17

Fisher s (1918) Decomposition of G One of Fisher s key insights was that the genotypic value consists of a fraction that can be passed from parent to offspring and a fraction that cannot. In particular, under sexual reproduction, parents only pass along SINGLE ALLELES to their offspring Consider the genotypic value G ij resulting from an A i A j individual G ij = µ G + α i + α j + δ ij Average contribution to genotypic value for allele i Mean value, with µg = Gij freq(qiqj )

G ij = µ G + α i + α j + δ ij Since parents pass along single alleles to their offspring, the! i (the average effect of allele i) represent these contributions The average effect for an allele is POPULATION- SPECIFIC, as it depends on the types and frequencies of alleles that it pairs with The genotypic value predicted from the individual allelic effects is thus Ĝ ij = µ G + α i + α j

G ij = µ G + α i + α j + δ ij The genotypic value predicted from the individual allelic effects is thus Ĝ ij = µ G + α i + α j Dominance deviations --- the difference (for genotype A i A j ) between the genotypic value predicted from the two single alleles and the actual genotypic value, G ij G ij = δ ij

Fisher s decomposition is a Regression G ij = µ G + α i + α j + δ ij Predicted value Residual error A notational change clearly shows this is a regression, Gij = µg + 2α 1 + (α 2 α 1 )N + δij Independent (predictor) variable N = # of Q 2 alleles

Gij = µg + 2α 1 + (α 2 α 1 )N + δij Intercept Regression slope 2α 1 + (α 2 α 1 )N = 2α 1 forn = 0, e.g, Q 1 Q 1 α 1 + α 2 forn = 1, e.g, Q 1 Q 2 2α 2 forn = 2, e.g, Q 2 Q 2

Allele Q 2 common,! 1 >! 2 G 21 G G 22 G 11 0 1 2 N

Allele Q 1 common,! 2 >! 1 G 21 Slope =!2 -!1 G G 22 G 11 0 1 2 N

Both Q 1 and Q 2 frequent,! 1 =! 2 = 0 G 21 G G 22 G 11 0 1 2 N

Consider a diallelic locus, where p 1 = freq(q 1 ) Genotype Q 1 Q 1 Q 2 Q 1 Q 2 Q 2 Genotypic value 0 a(1+k) 2a Mean µ G = 2p 2 a(1 + p 1 k) Allelic effects α 2 = p 1 a [ 1 + k ( p 1 p 2 ) ] α 1 = p 2 a [1 + k ( p 1 p 2 ) ] Dominance deviations δij = Gij µg αi αj

Average effects and Additive Genetic Values The! values are the average effects of an allele A key concept is the Additive Genetic Value (A) of an individual A (G ij ) = α i + α j A = n k=1 ( ) α ( k) i + α ( k) k A is called the Breeding value or the Additive genetic value

A = n k=1 ( ) α ( k) i + α ( k) k Why all the fuss over A? Suppose father has A = 10 and mother has A = -2 for (say) blood pressure Expected blood pressure in their offspring is (10-2)/2 = 4 units above the population mean. Offspring A = average of parental A s KEY: parents only pass single alleles to their offspring. Hence, they only pass along the A part of their genotypic value G

Genetic Variances G ij = µ g + (α i + α j ) + δ ij σ 2 (G) = n k=1 σ 2 (α ( k) i + α ( k) j ) + n k=1 σ 2 (δ ( k) ij ) σ 2 (G) = σ 2 (µ g + (α i + α j ) + δ ij ) = σ 2 (α i + α j ) + σ 2 (δ ij ) As Cov(!,") = 0

Genetic Variances σ 2 (G) = n k=1 σ 2 (α ( k) i + α ( k) j ) + n k=1 σ 2 (δ ( k) ij ) Additive Genetic Variance (or simply Additive Variance) Dominance Genetic Variance (or simply dominance variance) σ 2 G = σ 2 A + σ 2 D

Key concepts (so far)! i = average effect of allele i Property of a single allele in a particular population (depends on genetic background) A = Additive Genetic Value (A) A = sum (over all loci) of average effects Fraction of G that parents pass along to their offspring Property of an Individual in a particular population Var(A) = additive genetic variance Variance in additive genetic values Property of a population Can estimate A or Var(A) without knowing any of the underlying genetical detail (forthcoming)

σ 2 A = 2E[α 2 ] = 2 m i=1 α 2 i pi Q 1 Q 1 Q 1 Q 2 Q 2 Q 2 0 a(1+k) 2a Since E[!] = 0, Var(!) = E[(! -µ a ) 2 ] = E[! 2 ] One locus, 2 alleles: σ 2 A = 2p 1 p 2 a 2 [ 1 + k ( p 1 p 2 ) ] 2 Dominance alters additive variance When dominance present, Additive variance is an asymmetric function of allele frequencies

Dominance variance Q 1 Q 1 Q 1 Q 2 Q 2 Q 2 0 a(1+k) 2a σ 2 D = E[δ 2 ] = m i=1 m δ 2 ij p i p j j=1 Equals zero if k = 0 One locus, 2 alleles: σ 2 D = (2p 1 p 2 ak) 2 This is a symmetric function of allele frequencies Can also be expressed in terms of d = ak

Additive variance, V A, with no dominance (k = 0) V A Allele frequency, p

Complete dominance (k = 1) V A V D Allele frequency, p

Epistasis G ijkl = µ G + (α i + α j + α k + α l ) + (δ ij + δ kj ) + (αα ik + αα il + αα jk + αα jl ) + (αδ ikl + αδ jkl + αδ kij + αδ lij ) + (δδ ijkl ) = µ G + A + D + AA + AD + DD These components are defined to be uncorrelated, (or orthogonal), so that σ 2 G = σ 2 A + σ 2 D + σ 2 AA + σ 2 AD + σ 2 DD

G ijkl = µ G + (α i + α j + α k + α l ) + (δ ij + δ kj ) + (αα ik + αα il + αα jk + αα jl ) + (αδ ikl + αδ jkl + αδ kij + αδ lij ) + (δδ ijkl ) = µ G + A + D + AA + AD + DD Additive x Additive interactions --!!, AA interactions between a single allele at one locus with a single allele at another Additive x Dominance interactions --!", AD interactions between an allele at one locus with the genotype at another, e.g. allele A i and genotype B kj Dominance x dominance interaction --- "", DD the interaction between the dominance deviation at one locus with the dominance deviation at another.