DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to

Size: px
Start display at page:

Download "DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to"

Transcription

1 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution of the DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to variation of two interrelated traits without predefined distribution (i.e. could be quantitative and/or qualitative phenotypes). Our approach is an alternative of the liability-threshold concept (Falconer, 1), and it is based on the discrete models of genetic and familial effects. It can be helpful in testing the questions such as: Whether the observed two-dimensional phenotypic distribution may be explained without presuming common additive gene effects? What are the relative contributions of the specific and common environmental and genetic factors to variation of each of the two traits? Let and Y are two traits, where is a quantitative trait and Y is a binary trait (until further notice). Let G be a bi-allelic SNP (genotypes: aa, ab, bb) associated with both traits. In order to take into account additive effect of the other genes on the traits', we introduce three independent binary factors Z, Z Y, and Z Y (Fig. A1). In our model they represent genetic factors affecting variation of each trait separately (Z and Z Y ) and both traits simultaneously (Z Y ), pleiotropic effect. It is obvious that a binary factor can t express the entire variation caused by additive genetic effect. However, the specific portion of the variation explained by binary variable can be assigned. Gene-independent effects, caused by random or common familial effects on the phenotype variation are also taken into account in the model as provided below. 1

2 Fig. A1. Outlines of the model presenting factors that are taking into account in the analysis of two traits; and Y. Non-genetic factors are marked with rectangle. Model formalization and simplifications Binary model of additive genetic effect on trait variation The genetic component, in the proposed regression model (see below), is represented by random binary variables Z: Z=1 or Z=-1. The probability of a child value (Z C ) depends on the parental values, Z M and Z F, according to formulae presented in Table A1. Table A1. Probability of a child Z value conditioned by the parental values, P(Z C /Z F,Z M ). Z F Z M Z C P(Z C /Z F,Z M ) p* p +1 p + Δ -1 (1 - p) - Δ +1 p - Δ -1 (1 - p) + Δ *p = q + Δ(1- q), 0< q<1, Δ<1/. Where q is the frequency of positive genetic effect (Z=+1),

3 Δ is the extent of the heritability (transmittable factors) effect. This two-parametric model has the number of properties. The first parameter q is stationary, i.e. the expected value of q remains constant from generation to generation, like the allele frequency in the classical equilibrium model (see downmost Lemma 1). Correspondingly, the expected population s variance of Z doesn t depend on the second parameter Δ. The expected portion of Z C variance attributable to Z M and Z F values is Δ (see Lemma ). In case Δ=0. the correlation between the relatives (Table A) corresponds to the classical model of additive genetic effects (Falconer and Mackay, 1). Table A. Intra-family Z value correlations* Relatives with known Z value Explained variance Explained variance (h =0.) One of the parents 0. h 0. Both of the parents h 0.0 One of the sibs h 0. One of the monozygotic twins *h =Δ The proportion of the phenotypic variation of the study trait attributable to Z components of the model are estimated implementing the below presented bivariate regression model. Bivariate regression model In addition to SNP genotype (G), the discrete components of the regression model include three genetic components (Z & Z Y & Z Y), defined above. They could be interpreted in terms of the additive genetic and three common family environment components (F & F Y & F Y ). The Z, F and Z Y, F Y represent phenotype specific genetic and environmental factors, Z Y and F Y - represent genetic an environmental factors shared by both phenotypes. The regression analysis performed under the following simplifications (S1-S): S1. Family distribution of Z values (Z & Z Y & Z Y) is defined by the - binary model (see above) under Δ=0., while monozygotic twins shared the same Z values. The common family environment factors are

4 represented by binary variables (F & F Y & F Y). The F values are equally probable, i.e. P(F=+1) = P(F=- 1) = 0., and the offspring of the same parents share the same F values. Population's allele-frequency of the SNP is known, the genotype frequencies and the transmission probabilities follow the Hardy- Weinberg and Mendelian expectations, respectively. S. The trait has a normal distribution, P()=N[M, ]; where the average, M, depends on the individual genetic (G, Z and Z Y) and familial (F and F Y) factors, is the trait variation independent on the above factors. The sample average and the standard deviation are m = 0 and σ =1, respectively. Effect of G, Z, Z Y, F and F Y on the -average is additive and defined by linear regression model: M = m + G A α 1 + Z α + Z Y α + F α + F Y α G ( g p)/ p(1 p), where g is the SNP genotype (namely, the minor allele dose: 0, 1 or ), p is A the minor allele frequency; α 1, α, α, α and α unknown parameters. S. The Y trait has a binomial distribution P(Y=µ) = (M Y ) µ (1- M Y ) 1-µ ; where 0<M Y <1, µ=0 or µ=1. Effect of individual genetic (G, Z Y, Z Y) and family factors (F Y, F Y) on Y average (M Y), is additive and defined by logit model: ln(p(y=1)/p(y=0)) ln(m y/(1-m y)) +G Z Z F F Y1 Y Y Y Y Y Y Y Y Where m Y is the sample frequency of individuals with Y=1; β Y1, β Y, β Y, β Y, β Y are unknown parameters. Similar model of binary trait variation was previously described (Mukhopadhyay et al, 0). S. The factor-independent correlation between and Y was modeled with the following equations: P( Y, ) PY ( ) P ( / Y); P ( / Y) NM [ '( ), ' ]; ' 1 r M'( / Y1) M( ) r PY ( 1) / PY ( 1) ; M'( / Y1) M( ) r PY ( 1) / PY ( 1) ;

5 Where M() = M and P(Y=1) = M Y are defined above (the equations S1-S); r is the correlation parameter (-1<r< 1). S. In case of two quantitative traits, the two-dimensional normal distribution was applied: P(,Y)=N[M(),M(Y),, Y,r]. Where M(Y) and Y are defined by the same way as described in S1-S above. In case of two binary traits, the two-dimensional binomial distribution was applied: PY (, ) P ( ) PY ( ) sr Y. Where P() is defined by the same way as described in S above; s sign[( P ) (Y PY )] ; Y and Y are standard deviation of the traits: (1 P( 1)) P( 1), Y (1 PY ( 1)) PY ( 1). In case of mixture of traits (quantitative and binary) the model includes twelve parameters: α 1, α, α, α, α,, β Y1, β Y, β Y, β Y, β Y and r. In case of two quantitative or two binary traits the model includes 1 and parameters, respectively. Variance component estimation Since in our discrete factors model the genetic variation is approximated by using binary factors (predictors), the genetic effect on the trait s variation can be underestimated, while the environmental effect - overestimated accordingly. This source of systematic bias could be taking into account and corrected. Let Z N, F N and effects, respectively; parameters S N are unknown additive genetic, common family and individual environment Z B, F B and NormalDistribution.html). It defines the first equation: S B are the variance component estimates (the squares of the regression coefficients) obtained using binary approximation of additive genetic component. In case of a sample of twins the following three equations define the relations between the first and the second sets of values. Assuming the normal distribution of the additive genetic component, the proportion of its variation explained by the binary approximation is /π (

6 Z B Z N Remaining part of the additive genetic variance, 1-/π, is the mixture in F B and in variance estimates. S B On average, additive genetic variance shared by sibs equals to (1+P MZ)/; where P MZ is frequency of monozygotic twins among sib-pairs. The second equation is F (1 ) 1 P MZ / B F N Z N The residual, i.e. equation is 1 1+P / 1P /, is the mixture in MZ MZ S (1 ) 1 P MZ / B S N Z N S B variance estimate. Then the third Thus there is reciprocation between variance estimates obtained in our binary approximation model and the Z, N F N and S N values, i.e. ; ; Z N Z B FN FB 1 PMZ SN S 1 PMZ. B The last equations should be applied to both common and trait specific variance component estimates obtained using binary model of additive genetic effect. Limitation on input data The present version of the software has a limited option for simultaneous testing and adjustment for covariates and reserved for examination of the SNP effect. Currently, adjustment for other covariates could be done prior to analysis, taking into account familial structure of the sample. There are several available software, such as MAN (Malkin and Ginsburg, 01) that could manage this procedure. Adjustment of quantitative traits is usually achieved by using polynomial functions, while the qualitative phenotypes could be adjusted using logistic regression and/or the Gompertz-Makeham functions (e.g.

7 Korostishevsky et al. 01). Correspondent regression functions could replace m and/or m Y in formulae S and S. Likelihood evaluation and statistical inferences Consider first unrelated individuals. In case of n unrelated individuals the model-based likelihood (LH) of the observation ({ & Y & G}, i 1,,, n) is calculated by formula: i i i LH P( Z ) P( F) ( / G & Z & F & Y) P( Y / G & Z & F) i Z, F i i i i i i i i i i i 1 1 Where Z and F are three- and one-dimensional variables, respectively; ( / G & Z & F & Y) is the density of the normal distribution: N[ M( i / Gi & Zi & Fi & Yi), ]. i i i i i In case of pedigree data, the LH value also depends on correlations between relatives caused by genetic and environmental factors. In our model these correlations are defined by the following three discrete functions: P(G C/G F,G M), P(Z C/Z F,Z M) and P(F C/F SIB). The LH computation on pedigree data was previously repeatedly described (Ott J, 1; Lange K, Elston RC. 1). Comparison of the general and restricted models is based on the likelihood ratio test (LRT). SD of the maximum likelihood estimates (MLE) is evaluated following the asymptotic approximation: ( Lemma 1. * * / LRT The binary model of genetic variation (see Table A1 for definitions) is stationary by the q parameter. Proof: Let q be the frequency of individuals possessing Z=1 in the parent generation. Then, the expected frequency of individuals possessing Z=1 in the offspring generation (Q), satisfies the following equation: Q= q (1 - q) p+ q (p + Δ) + (1 - q) (p Δ) Where p = q + Δ(1- q). The direct substitution leads to the following equation: Q = q

8 Lemma. The expected portion of the Z variance attributable to parents, h, equals to Δ. Proof: The population variance of a binary trait Z (e.g. Z=1 or Z=0) is defined in the ordinary way: Var pop = q (1 - q) For known parent combinations of Z values (01, or 00), the child variance of Z value is defined according the binary model of genetic variation (Table A1): Var 01 = p(1 - p); Var = (p + Δ)(1 - p - Δ); Var 00 = (p + Δ)(1 - p - Δ) For random family, the expected child variance is: Var exp = q(1-q)var 01+q Var + (1 - q) Var 00 = q (1-q) (1- Δ ) The direct substitution leads to Var exp = q (1-q) (1- Δ ) Accordingly to this, the portion of Z variance explained by the parents values is h = = 1 Var exp /Var pop Δ In the same way, one can find out the portions of Z variance explained by one of the parents values, by the sib s value and by the twin s value (see Table A). In this context, h for generally used quasi genetic quantitative trait - number of specific SNP allele copies (0, 1, ), is equal to 0.. 1

9 1 1 1 References Falconer DS. Introduction to Quantitative Genetics, rd ed. UK/New York: Longmans Green/John Wiley & Sons, Harlow, Essex; 1. Korostishevsky M, Williams F, Hart D, Blumenfeld O, Spector T, Livshits G. Implementation of the simplified stochastic model of ageing for longitudinal osteoarthritis data assessment. Ann Hum Biol. 01;():1-. Lange K, Elston RC. Extensions to pedigree analysis I. Likehood calculations for simple and complex pedigrees. Hum Hered. 1;():. Mukhopadhyay I, Saha S, Ghosh S. Integrating binary traits with quantitative phenotypes for association mapping of multivariate phenotypes. BMC Proceedings, 0;.Suppl :S Ott J. Estimation of the recombination fraction in human pedigrees: efficient computation of the likelihood for human linkage studies. Am J Hum Genet. 1;():. Weisstein, Eric W. "Half-Normal Distribution." From MathWorld--A Wolfram Web Resource. Half-NormalDistribution.html

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE NORMAL DISTRIBUTIONS OF PHENOTYPES Mice Fruit Flies In:Introduction to Quantitative Genetics Falconer & Mackay 1996 CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE Mean and variance are two quantities

More information

Variance Component Models for Quantitative Traits. Biostatistics 666

Variance Component Models for Quantitative Traits. Biostatistics 666 Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond

More information

Affected Sibling Pairs. Biostatistics 666

Affected Sibling Pairs. Biostatistics 666 Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD

More information

Lecture WS Evolutionary Genetics Part I 1

Lecture WS Evolutionary Genetics Part I 1 Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in

More information

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17 Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared

More information

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017 Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping

More information

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics 1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

More information

Lecture 11: Multiple trait models for QTL analysis

Lecture 11: Multiple trait models for QTL analysis Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple

More information

Linear Regression (1/1/17)

Linear Regression (1/1/17) STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression

More information

Biometrical Genetics. Lindon Eaves, VIPBG Richmond. Boulder CO, 2012

Biometrical Genetics. Lindon Eaves, VIPBG Richmond. Boulder CO, 2012 Biometrical Genetics Lindon Eaves, VIPBG Richmond Boulder CO, 2012 Biometrical Genetics How do genes contribute to statistics (e.g. means, variances,skewness, kurtosis)? Some Literature: Jinks JL, Fulker

More information

Biometrical Genetics

Biometrical Genetics Biometrical Genetics 2016 International Workshop on Statistical Genetic Methods for Human Complex Traits Boulder, CO. Lindon Eaves, VIPBG, Richmond VA. March 2016 Biometrical Genetics How do genes contribute

More information

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by

More information

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

2. Map genetic distance between markers

2. Map genetic distance between markers Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,

More information

The concept of breeding value. Gene251/351 Lecture 5

The concept of breeding value. Gene251/351 Lecture 5 The concept of breeding value Gene251/351 Lecture 5 Key terms Estimated breeding value (EB) Heritability Contemporary groups Reading: No prescribed reading from Simm s book. Revision: Quantitative traits

More information

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013 Lecture 9 Short-Term Selection Response: Breeder s equation Bruce Walsh lecture notes Synbreed course version 3 July 2013 1 Response to Selection Selection can change the distribution of phenotypes, and

More information

Quantitative characters - exercises

Quantitative characters - exercises Quantitative characters - exercises 1. a) Calculate the genetic covariance between half sibs, expressed in the ij notation (Cockerham's notation), when up to loci are considered. b) Calculate the genetic

More information

I Have the Power in QTL linkage: single and multilocus analysis

I Have the Power in QTL linkage: single and multilocus analysis I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department

More information

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion

More information

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification, Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability

More information

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015 Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.

More information

MIXED MODELS THE GENERAL MIXED MODEL

MIXED MODELS THE GENERAL MIXED MODEL MIXED MODELS This chapter introduces best linear unbiased prediction (BLUP), a general method for predicting random effects, while Chapter 27 is concerned with the estimation of variances by restricted

More information

Resemblance among relatives

Resemblance among relatives Resemblance among relatives Introduction Just as individuals may differ from one another in phenotype because they have different genotypes, because they developed in different environments, or both, relatives

More information

On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease

On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease Yuehua Cui 1 and Dong-Yun Kim 2 1 Department of Statistics and Probability, Michigan State University,

More information

Association studies and regression

Association studies and regression Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration

More information

Quantitative Genetics

Quantitative Genetics Bruce Walsh, University of Arizona, Tucson, Arizona, USA Almost any trait that can be defined shows variation, both within and between populations. Quantitative genetics is concerned with the analysis

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm

More information

SNP Association Studies with Case-Parent Trios

SNP Association Studies with Case-Parent Trios SNP Association Studies with Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health September 3, 2009 Population-based Association Studies Balding (2006). Nature

More information

Major Genes, Polygenes, and

Major Genes, Polygenes, and Major Genes, Polygenes, and QTLs Major genes --- genes that have a significant effect on the phenotype Polygenes --- a general term of the genes of small effect that influence a trait QTL, quantitative

More information

Prediction of the Confidence Interval of Quantitative Trait Loci Location

Prediction of the Confidence Interval of Quantitative Trait Loci Location Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28

More information

Combining dependent tests for linkage or association across multiple phenotypic traits

Combining dependent tests for linkage or association across multiple phenotypic traits Biostatistics (2003), 4, 2,pp. 223 229 Printed in Great Britain Combining dependent tests for linkage or association across multiple phenotypic traits XIN XU Program for Population Genetics, Harvard School

More information

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013 Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic

More information

Genetics and Natural Selection

Genetics and Natural Selection Genetics and Natural Selection Darwin did not have an understanding of the mechanisms of inheritance and thus did not understand how natural selection would alter the patterns of inheritance in a population.

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA

More information

The Quantitative TDT

The Quantitative TDT The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus

More information

Case-Control Association Testing. Case-Control Association Testing

Case-Control Association Testing. Case-Control Association Testing Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

Variation and its response to selection

Variation and its response to selection and its response to selection Overview Fisher s 1 is the raw material of evolution no natural selection without phenotypic variation no evolution without genetic variation Link between natural selection

More information

(Genome-wide) association analysis

(Genome-wide) association analysis (Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by

More information

Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics

Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics Lee H. Dicker Rutgers University and Amazon, NYC Based on joint work with Ruijun Ma (Rutgers),

More information

Breeding Values and Inbreeding. Breeding Values and Inbreeding

Breeding Values and Inbreeding. Breeding Values and Inbreeding Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A

More information

SNP-SNP Interactions in Case-Parent Trios

SNP-SNP Interactions in Case-Parent Trios Detection of SNP-SNP Interactions in Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health June 2, 2009 Karyotypes http://ghr.nlm.nih.gov/ Single Nucleotide Polymphisms

More information

Quantitative characters II: heritability

Quantitative characters II: heritability Quantitative characters II: heritability The variance of a trait (x) is the average squared deviation of x from its mean: V P = (1/n)Σ(x-m x ) 2 This total phenotypic variance can be partitioned into components:

More information

Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control

Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control Xiaoquan Wen Department of Biostatistics, University of Michigan A Model

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Frequency Estimation Karin S. Dorman Department of Statistics Iowa State University August 28, 2006 Fundamental rules of genetics Law of Segregation a diploid parent is equally

More information

A Parametric Copula Model for Analysis of Familial Binary Data

A Parametric Copula Model for Analysis of Familial Binary Data Am. J. Hum. Genet. 64:886 893, 1999 A Parametric Copula Model for Analysis of Familial Binary Data David-Alexandre Trégouët, 1 Pierre Ducimetière, 1 Valéry Bocquet, 1 Sophie Visvikis, 3 Florent Soubrier,

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities

More information

Theoretical and computational aspects of association tests: application in case-control genome-wide association studies.

Theoretical and computational aspects of association tests: application in case-control genome-wide association studies. Theoretical and computational aspects of association tests: application in case-control genome-wide association studies Mathieu Emily November 18, 2014 Caen mathieu.emily@agrocampus-ouest.fr - Agrocampus

More information

Overview. Background

Overview. Background Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems

More information

Analytic power calculation for QTL linkage analysis of small pedigrees

Analytic power calculation for QTL linkage analysis of small pedigrees (2001) 9, 335 ± 340 ã 2001 Nature Publishing Group All rights reserved 1018-4813/01 $15.00 www.nature.com/ejhg ARTICLE for QTL linkage analysis of small pedigrees FruÈhling V Rijsdijk*,1, John K Hewitt

More information

Recent advances in statistical methods for DNA-based prediction of complex traits

Recent advances in statistical methods for DNA-based prediction of complex traits Recent advances in statistical methods for DNA-based prediction of complex traits Mintu Nath Biomathematics & Statistics Scotland, Edinburgh 1 Outline Background Population genetics Animal model Methodology

More information

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.

More information

Variance Components: Phenotypic, Environmental and Genetic

Variance Components: Phenotypic, Environmental and Genetic Variance Components: Phenotypic, Environmental and Genetic You should keep in mind that the Simplified Model for Polygenic Traits presented above is very simplified. In many cases, polygenic or quantitative

More information

Multidimensional heritability analysis of neuroanatomical shape. Jingwei Li

Multidimensional heritability analysis of neuroanatomical shape. Jingwei Li Multidimensional heritability analysis of neuroanatomical shape Jingwei Li Brain Imaging Genetics Genetic Variation Behavior Cognition Neuroanatomy Brain Imaging Genetics Genetic Variation Neuroanatomy

More information

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope Oct 2017 1 / 28 Minimum MSE Y is the response variable, X the predictor variable, E(X) = E(Y) = 0. BLUP of Y minimizes average discrepancy var (Y ux) = C YY 2u C XY + u 2 C XX This is minimized when u

More information

3. Properties of the relationship matrix

3. Properties of the relationship matrix 3. Properties of the relationship matrix 3.1 Partitioning of the relationship matrix The additive relationship matrix, A, can be written as the product of a lower triangular matrix, T, a diagonal matrix,

More information

THEORETICAL ASPECTS OF PEDIGREE ANALYSIS

THEORETICAL ASPECTS OF PEDIGREE ANALYSIS THEORETICAL ASPECTS OF PEDIGREE ANALYSIS E. Ginsburg, I. Malkin, R.C. Elston THEORETICAL ASPECTS OF PEDIGREE ANALYSIS RAMOT PUBLISHING HOUSE TEL-AVIV UNIVERSITY, ISRAEL E. Ginsburg, I. Malkin, R.C. Elston

More information

Estimating Breeding Values

Estimating Breeding Values Estimating Breeding Values Principle how is it estimated? Properties Accuracy Variance Prediction Error Selection Response select on EBV GENE422/522 Lecture 2 Observed Phen. Dev. Genetic Value Env. Effects

More information

Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Uppsala EQG course version 31 Jan 2012

Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Uppsala EQG course version 31 Jan 2012 Short-Term Selection Response: Breeder s equation Bruce Walsh lecture notes Uppsala EQG course version 31 Jan 2012 Response to Selection Selection can change the distribution of phenotypes, and we typically

More information

Partitioning the Genetic Variance

Partitioning the Genetic Variance Partitioning the Genetic Variance 1 / 18 Partitioning the Genetic Variance In lecture 2, we showed how to partition genotypic values G into their expected values based on additivity (G A ) and deviations

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 3. The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 3. The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs INTRODUCTION TO ANIMAL BREEDING Lecture Nr 3 The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs Etienne Verrier INA Paris-Grignon, Animal Sciences Department

More information

Resemblance between relatives

Resemblance between relatives Resemblance between relatives 1 Key concepts Model phenotypes by fixed effects and random effects including genetic value (additive, dominance, epistatic) Model covariance of genetic effects by relationship

More information

Objectives. Announcements. Comparison of mitosis and meiosis

Objectives. Announcements. Comparison of mitosis and meiosis Announcements Colloquium sessions for which you can get credit posted on web site: Feb 20, 27 Mar 6, 13, 20 Apr 17, 24 May 15. Review study CD that came with text for lab this week (especially mitosis

More information

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Quantitative Genetics The analysis of traits whose variation is determined by both a

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Lecture 2. Basic Population and Quantitative Genetics

Lecture 2. Basic Population and Quantitative Genetics Lecture Basic Population and Quantitative Genetics Bruce Walsh. Aug 003. Nordic Summer Course Allele and Genotype Frequencies The frequency p i for allele A i is just the frequency of A i A i homozygotes

More information

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8 The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as

More information

A simple genetic model with non-equilibrium dynamics

A simple genetic model with non-equilibrium dynamics J. Math. Biol. (1998) 36: 550 556 A simple genetic model with non-equilibrium dynamics Michael Doebeli, Gerdien de Jong Zoology Institute, University of Basel, Rheinsprung 9, CH-4051 Basel, Switzerland

More information

heritable diversity feb ! gene 8840 biol 8990

heritable diversity feb ! gene 8840 biol 8990 heritable diversity feb 25 2015! gene 8840 biol 8990 D. Gordon E. Robertson - photo from Wikipedia HERITABILITY DEPENDS ON CONTEXT heritability: how well does parent predict offspring phenotype? how much

More information

Continuously moderated effects of A,C, and E in the twin design

Continuously moderated effects of A,C, and E in the twin design Continuously moderated effects of A,C, and E in the twin design Conor V Dolan & Sanja Franić Boulder Twin Workshop March 8, 2016 Includes slides by Sophie van der Sluis & Marleen de Moor 1977 Acta Genet

More information

Lecture 7 Correlated Characters

Lecture 7 Correlated Characters Lecture 7 Correlated Characters Bruce Walsh. Sept 2007. Summer Institute on Statistical Genetics, Liège Genetic and Environmental Correlations Many characters are positively or negatively correlated at

More information

Notes on Population Genetics

Notes on Population Genetics Notes on Population Genetics Graham Coop 1 1 Department of Evolution and Ecology & Center for Population Biology, University of California, Davis. To whom correspondence should be addressed: gmcoop@ucdavis.edu

More information

SUPPLEMENTARY SIMULATIONS & FIGURES

SUPPLEMENTARY SIMULATIONS & FIGURES Supplementary Material: Supplementary Material for Mixed Effects Models for Resampled Network Statistics Improve Statistical Power to Find Differences in Multi-Subject Functional Connectivity Manjari Narayan,

More information

population when only records from later

population when only records from later Original article Estimation of heritability in the base population when only records from later generations are available L Gomez-Raya LR Schaeffer EB Burnside University of Guelph, Centre for Genetic

More information

contents: BreedeR: a R-package implementing statistical models specifically suited for forest genetic resources analysts

contents: BreedeR: a R-package implementing statistical models specifically suited for forest genetic resources analysts contents: definitions components of phenotypic correlations causal components of genetic correlations pleiotropy versus LD scenarios of correlation computing genetic correlations why genetic correlations

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

Bayesian Methods with Monte Carlo Markov Chains II

Bayesian Methods with Monte Carlo Markov Chains II Bayesian Methods with Monte Carlo Markov Chains II Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University hslu@stat.nctu.edu.tw http://tigpbp.iis.sinica.edu.tw/courses.htm 1 Part 3

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA

More information

Mixture Models. Pr(i) p i (z) i=1. It is usually assumed that the underlying distributions are normals, so this becomes. 2 i

Mixture Models. Pr(i) p i (z) i=1. It is usually assumed that the underlying distributions are normals, so this becomes. 2 i Mixture Models The Distribution under a Mixture Model Assume the distribution of interest results from a weighted mixture of several underlying distributions. If there are i = 1,,n underlying distributions,

More information

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, Ph.D. Computer Science, Kennesaw State University Problems

More information

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals.

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals. 1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals. Which statement best explains how these processes often produce offspring that have traits not

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012 Mixed-Model Estimation of genetic variances Bruce Walsh lecture notes Uppsala EQG 01 course version 8 Jan 01 Estimation of Var(A) and Breeding Values in General Pedigrees The above designs (ANOVA, P-O

More information

The Mystery of Missing Heritability: Genetic interactions create phantom heritability - Supplementary Information

The Mystery of Missing Heritability: Genetic interactions create phantom heritability - Supplementary Information The Mystery of Missing Heritability: Genetic interactions create phantom heritability - Supplementary Information 1 Contents List of Figures 4 List of Tables 5 List of Symbols 6 1 Calculating the top-down

More information

Bayesian Nonparametric Meta-Analysis Model George Karabatsos University of Illinois-Chicago (UIC)

Bayesian Nonparametric Meta-Analysis Model George Karabatsos University of Illinois-Chicago (UIC) Bayesian Nonparametric Meta-Analysis Model George Karabatsos University of Illinois-Chicago (UIC) Collaborators: Elizabeth Talbott, UIC. Stephen Walker, UT-Austin. August 9, 5, 4:5-4:45pm JSM 5 Meeting,

More information

An introduction to quantitative genetics

An introduction to quantitative genetics An introduction to quantitative genetics 1. What is the genetic architecture and molecular basis of phenotypic variation in natural populations? 2. Why is there phenotypic variation in natural populations?

More information

Lecture 2: Introduction to Quantitative Genetics

Lecture 2: Introduction to Quantitative Genetics Lecture 2: Introduction to Quantitative Genetics Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Basic model of Quantitative Genetics Phenotypic value --

More information

Lecture 28: BLUP and Genomic Selection. Bruce Walsh lecture notes Synbreed course version 11 July 2013

Lecture 28: BLUP and Genomic Selection. Bruce Walsh lecture notes Synbreed course version 11 July 2013 Lecture 28: BLUP and Genomic Selection Bruce Walsh lecture notes Synbreed course version 11 July 2013 1 BLUP Selection The idea behind BLUP selection is very straightforward: An appropriate mixed-model

More information

Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics

Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics Thore Egeland (1),(2) (1) Norwegian University of Life Sciences, (2) Oslo University Hospital Workshop. Monterrey, Mexico, Nov

More information

Statistical inference on the penetrances of rare genetic mutations based on a case family design

Statistical inference on the penetrances of rare genetic mutations based on a case family design Biostatistics (2010), 11, 3, pp. 519 532 doi:10.1093/biostatistics/kxq009 Advance Access publication on February 23, 2010 Statistical inference on the penetrances of rare genetic mutations based on a case

More information

Partitioning Genetic Variance

Partitioning Genetic Variance PSYC 510: Partitioning Genetic Variance (09/17/03) 1 Partitioning Genetic Variance Here, mathematical models are developed for the computation of different types of genetic variance. Several substantive

More information

Generalized, Linear, and Mixed Models

Generalized, Linear, and Mixed Models Generalized, Linear, and Mixed Models CHARLES E. McCULLOCH SHAYLER.SEARLE Departments of Statistical Science and Biometrics Cornell University A WILEY-INTERSCIENCE PUBLICATION JOHN WILEY & SONS, INC. New

More information

MACAU 2.0 User Manual

MACAU 2.0 User Manual MACAU 2.0 User Manual Shiquan Sun, Jiaqiang Zhu, and Xiang Zhou Department of Biostatistics, University of Michigan shiquans@umich.edu and xzhousph@umich.edu April 9, 2017 Copyright 2016 by Xiang Zhou

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

On Computation of P-values in Parametric Linkage Analysis

On Computation of P-values in Parametric Linkage Analysis On Computation of P-values in Parametric Linkage Analysis Azra Kurbašić Centre for Mathematical Sciences Mathematical Statistics Lund University p.1/22 Parametric (v. Nonparametric) Analysis The genetic

More information

BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section. A/a ; B/B ; d/d X A/a ; b/b ; D/d

BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section. A/a ; B/B ; d/d X A/a ; b/b ; D/d BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section 1. In the following cross, all genes are on separate chromosomes. A is dominant to a, B is dominant to b and D is dominant

More information

Name: Per: Task: To create a model that explains how bi-racial parents can have black and white twins

Name: Per: Task: To create a model that explains how bi-racial parents can have black and white twins Name: Per: Genetics Test Review Task: To create a model that explains how bi-racial parents can have black and white twins Part 1: DNA to Protein to Trait LT15 (Protein and Traits) - Proteins express inherited

More information