Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

Similar documents
Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Partitioning the Genetic Variance

Partitioning the Genetic Variance. Partitioning the Genetic Variance

(Genome-wide) association analysis

Lecture WS Evolutionary Genetics Part I 1

Linear Regression (1/1/17)

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

Case-Control Association Testing. Case-Control Association Testing

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

Nature Genetics: doi: /ng Supplementary Figure 1. Number of cases and proxy cases required to detect association at designs.

Association studies and regression

Breeding Values and Inbreeding. Breeding Values and Inbreeding

HERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA)

Partitioning Genetic Variance

Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo

Genotype Imputation. Biostatistics 666

Lecture 2: Introduction to Quantitative Genetics

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

The Mystery of Missing Heritability: Genetic interactions create phantom heritability - Supplementary Information

Evolution of phenotypic traits

Lecture 9. QTL Mapping 2: Outbred Populations

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Linkage and Linkage Disequilibrium

Overview. Background

EM algorithm. Rather than jumping into the details of the particular EM algorithm, we ll look at a simpler example to get the idea of how it works

Quantitative characters

Variance Component Models for Quantitative Traits. Biostatistics 666

Multiple regression. CM226: Machine Learning for Bioinformatics. Fall Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar

Evolution of quantitative traits

Quantitative characters

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015

GWAS for Compound Heterozygous Traits: Phenotypic Distance and Integer Linear Programming Dan Gusfield, Rasmus Nielsen.

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

The Quantitative TDT

Resemblance among relatives

I Have the Power in QTL linkage: single and multilocus analysis

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

Lecture 4: Allelic Effects and Genetic Variances. Bruce Walsh lecture notes Tucson Winter Institute 7-9 Jan 2013

Calculation of IBD probabilities

Quantitative Genetics I: Traits controlled my many loci. Quantitative Genetics: Traits controlled my many loci

Lecture 28: BLUP and Genomic Selection. Bruce Walsh lecture notes Synbreed course version 11 July 2013

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition

Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics

Bayesian Inference of Interactions and Associations

Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014

Evolutionary quantitative genetics and one-locus population genetics

Régression en grande dimension et épistasie par blocs pour les études d association

Lecture 2. Basic Population and Quantitative Genetics

Variance Components: Phenotypic, Environmental and Genetic

Resemblance between relatives

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X

Quantitative Trait Variation

Mapping QTL to a phylogenetic tree

Supplementary Information for: Detection and interpretation of shared genetic influences on 42 human traits

Evolutionary Genetics Midterm 2008

Introduction to Linkage Disequilibrium

Quantitative Traits Modes of Selection

SNP Association Studies with Case-Parent Trios

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Population Genetics I. Bio

Variation and its response to selection

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

The concept of breeding value. Gene251/351 Lecture 5

Integer Programming in Computational Biology. D. Gusfield University of California, Davis Presented December 12, 2016.!

Inferring Genetic Architecture of Complex Biological Processes

Population Genetics: a tutorial

Theoretical and computational aspects of association tests: application in case-control genome-wide association studies.

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 2. Genetics of quantitative (multifactorial) traits What is known about such traits How they are modeled

Power and sample size calculations for designing rare variant sequencing association studies.

Lecture 11: Multiple trait models for QTL analysis

Calculation of IBD probabilities

2. Map genetic distance between markers

p(d g A,g B )p(g B ), g B

Multiple QTL mapping

BTRY 4830/6830: Quantitative Genomics and Genetics

Quantitative Genetics

On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease

Prediction of the Confidence Interval of Quantitative Trait Loci Location

SNP-SNP Interactions in Case-Parent Trios

Lecture 2. Fisher s Variance Decomposition

Supplementary Figures

Quantitative characters II: heritability

Quantitative Genetics & Evolutionary Genetics

Methods for Cryptic Structure. Methods for Cryptic Structure

BTRY 7210: Topics in Quantitative Genomics and Genetics

Statistical Methods in Mapping Complex Diseases

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Introduction to Quantitative Genetics. Introduction to Quantitative Genetics

Principles of QTL Mapping. M.Imtiaz

Lecture 24: Multivariate Response: Changes in G. Bruce Walsh lecture notes Synbreed course version 10 July 2013

Introduction to Statistical Genetics (BST227) Lecture 6: Population Substructure in Association Studies

Genetic Association Studies in the Presence of Population Structure and Admixture

Transcription:

Proportional Variance Explained by QTL and Statistical Power

Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion of the variance of the trait that can be attributed to both genetic (additive and dominance) and environment (shared and unique) factors We demonstrated that resemblance of trait values among relatives we can be used to obtain estimates of the variance components of a quantitative trait without using genotype data. For quantitative traits, there generally is no (apparent) simple Mendelian basis for variation in the trait

Partitioning the Genetic Variance May be a single gene strongly influenced by environmental factors May be the result of a number of genes of equal (or differing) effect Most likely, a combination of both multiple genes and environmental factors Examples: Blood pressure, cholesterol levels, IQ, height, etc.

GWAS and Linear Regression Genome-wide association studies (GWAS) are commonly used for the identification of QTL Single SNP association testing with linear regression models are often used

Partition of Phenotypic Values Today we will focus on Contribution of a QTL to the variance of a quantitative trait Statistical power for detecting QTL in GWAS Consider once again the classical quantitative genetics model of Y = G + E where Y is the phenotype value, G is the genotypic value, and E is the environmental deviation that is assumed to have a mean of 0 such that E(Y ) = E(G)

Representation of Genotypic Values For a single locus with alleles A 1 and A 2, the genotypic values for the three genotypes can be represented as follows a if genotype is A 2 A 2 Genotype Value = d if genotype is A 1 A 2 a if genotype is A 1 A 1 If p and q are the allele frequencies of the A 1 and A 2 alleles, respectively in the population, we previously showed that µ G = a(p q) + d(2pq) and that the genotypic value at a locus can be decomposed into additive effects and dominance deviations: G ij = G A ij + δ ij = µ G + α i + α j + δ ij

Linear Regression Figure for Genetic Values Falconer model for single biallelic QTL a d m -a bb Bb BB Var (X) = Regression Variance + Residual Variance = Additive Variance + Dominance Variance 15

Decomposition of Genotypic Values The model can be given in terms of a linear regression of genotypic values on the number of copies of the A 1 allele such that: G ij = β 0 + β 1 X ij 1 + δ ij where X ij 1 is the number of copies of the type A 1 allele in genotype G ij, and with β 0 = µ G + 2α 2 and β 1 = α 1 α 2 = α, the average effect of allele substitution. Recall that α = a + d(q p) and that α 1 = qα and α 2 = pα

Additive Genetic Model The following additive model is commonly used association studies with quantitative traits Y = β 0 + β 1 X + ɛ where X is the number of copies of the reference allele (A 1 ) and individual has For this a single locus trait, how would you interpret ɛ for this particular model?

Statistical Power for Detectng QTL Y = β 0 + β 1 X + ɛ Assume, without loss of generality, that Y is a standardized trait with σ 2 Y = 1 Test statistics for H 0 : β 1 = 0 versus H a : β 1 0 And we have that T = ˆβ 1 /σ( ˆβ) t N 2 N(0, 1) for large N T 2 = ˆβ 2 1/var( ˆβ) F 1,N 2 χ 2 1 for large N var( ˆβ) = σ2 ɛ S XX σ2 ɛ Nσ 2 X = σ 2 ɛ 2Np(1 p) where S XX is the corrected sum of squares for the X i s

Statistical Power for Detecting QTL Y = β 0 + β 1 X + ɛ, so we have that σ 2 Y = β2 1σ 2 X + σ2 ɛ = β 2 12p(1 p) + σ 2 ɛ Recall that β 2 1 = α2 = [a + d(q p)] 2, so σ 2 Y = 2p(1 p)[a + d(q p)]2 + σ 2 ɛ = h 2 s + σ 2 ɛ where h 2 s = 2p(1 p)[a + d(q p)] 2. Interpret h 2 s (note that we assume that trait is standardized such that σ 2 Y = 1)

Statistical Power for Detecting QTL Also note that σ 2 ɛ = 1 h 2 s, so we can write Var( ˆβ 1 ) as the following: var( ˆβ 1 ) = σ 2 ɛ 2Np(1 p) = 1 h2 s 2Np(1 p) To calculate power of the test statistic T 2 for a given sample size N, we need to first obtain the expected value of the non-centrality parameter λ of the chi-squared distribution which is the expected value of the test statistic T squared: λ = [E(T )] 2 β2 1 2Np(1 p)[a + d(q p)]2 var( ˆβ = 1 ) 1 hs 2 = Nh2 s 1 h 2 s

Power Can also obtain the required sample size given type-i error α and power 1 β, where the type II error is β : N = 1 h2 s h 2 s ( z(1 α/2) + z (1 β ) ) 2 where z (1 α/2) and z (1 β) are the (1 α/2)th and (1 β)th quantiles, respectively, for the standard normal distribution.

Statistical Power for Detecting QTL

Genetic Power Calculator (PGC) http://pngu.mgh.harvard.edu/~purcell/gpc/ 23

!0,0-.6%2'=0(%O+?6*?+-'(%H$7+*,%2*(60??M 7--8CVV8,B*4:B747+(/+(;40;*Vm8*(60??VB86V

Missing Heritability Disease Number of loci Percent of Heritability Measure Explained Heritability Measure Age-related macular degeneration 5 50% Sibling recurrence risk Crohn s disease 32 20% Genetic risk (liability) Systemic lupus erythematosus 6 15% Sibling recurrence risk Type 2 diabetes 18 6% Sibling recurrence risk HDL cholesterol 7 5.2% Phenotypic variance Height 40 5% Phenotypic variance Early onset myocardial 9 2.8% Phenotypic infarction variance Fasting glucose 4 1.5% Phenotypic variance!"#$%='(a1 D&&06-%1.T01%+(0%-F8.6+??F%1:+?? >.10+10C%<a%mI4I%-'%mI4d h*+,-.-+-./0%-(+.-1c%n%/+(%0e8?+.,0;% ooin

LD Mapping of QTL For GWAS, the QTL will generally not be genotyped in a study #$%&'()*)+,$-$./$,0**********************#$%&'()*1$2)+,$-$./$,0 Q 1 M 1 Q 1 M 1 Q 2 M 1 Q 1 M 1 Q 2 M 2 Q 1 M 1 Q 2 M 2 Q 2 M 1 Q 2 M 2 Q 2 M 2 Q 1 M 2 Q 2 M 2 Q 2 M 2 Q 1 M 2 Q 1 M 1 Q 1 M 1

LD Mapping of QTL Linkage disequilibrium around an ancestral mutation 5 [Ardlie et al. 2002]

LD Mapping of QTL r 2 = LD correlation between QTL and genotyped SNP Proportion of variance of the trait explained at a SNP r 2 h 2 s Required sample size for detection is N 1 r 2 h 2 s r 2 h 2 s ( z(1 α/2) + z (1 γ) ) 2 Power of LD mapping depends on the experimental sample size, variance explained by the causal variant and LD with a genotyped SNP