Distinctive aspects of non-parametric fitting

Size: px
Start display at page:

Download "Distinctive aspects of non-parametric fitting"

Transcription

1 5. Introduction to nonparametric curve fitting: Loess, kernel regression, reproducing kernel methods, neural networks Distinctive aspects of non-parametric fitting Objectives: investigate patterns free of strictures imposed by parametric models Can produce surprising results Regression coefficients appear but (typically) do not have an obvious interpretation Often have very good predictive performance in cross-validation Tuning methods similar to those for parametric methods 1

2 Example: thin-plate splines Risk of heart attack after 19 years as a function of cholesterol level and blood pressure. Left: logistic regression model. Right: thin plate spline fit. Wahba (007) log ) ( j i j i N j j i j i j i i i x x x x x x x x x x f x LOESS REGRESSION: Non-parametric exploration of inbreeding depression for yield and somatic cell count in Jersey cattle

3 AN OVERVIEW OF LOWESS REGRESSION 1) DATA POINTS x i, y i ; i 1,,...,n ) SPANNING PARAMETER f; 0 f 1 k fn; k LARGEST INTEGER fn 3) FOR EACH x 0 FIND k POINTS x i CLOSEST TO x 0 Nx 0 NEIGHBORHOOD OF k POINTS 4) COMPUTE Δx 0 max x i Nx 0 x o x i 5) TO EACH x i, y i ; x i Nx 0 ASSIGN WEIGHT w i x 0 1 x o x i Δx ) FIT BY WEIGHTED LEAST-SQUARES k w i x 0y i 0 1 x i x i i1 RETURN yx x i x i 7) REPEAT FOR EACH OF THE x 0 3

4 ROBUST LOWESS STANDARD LOWESS NOT ROBUST BASED ON LEAST-SQUARES WEIGHTS BI-SQUARE LOWESS RE-WEIGHT POINTS ACCORDING TO RESIDUAL IF RESIDUAL LARGE, WEIGHT IS DECREASED 1) FIT DATA USING STANDARD LOESS ) CALCULATE LOESS RESIDUALS y i y i 3) COMPUTE q 1 median y i y i r i 4) CALCULATE BI-SQUARE ROBUST WEIGHTS 1 y i y i 6 q 1 5) REPEAT LOESS WITH WEIGHTS r i w ix 0 6) REPEAT -5 UNTIL LOESS CURVE CONVERGES 4

5 Example Birth rate in US population (U. S. Department of Health, Education and Welfare) n=96 births per 1000 US population during Top > Ordinary Least Squares with 1 st, nd & 3 rd degree polynomial Bottom > LOWESS fit with f =., f=.4 & f=.6 5

6 GALTON S BEND (Wachsmuth et al. 003, Am. Stat.) A possibility is that Galton ignored concealed heterogeneity 6

7 Does the bend disappear by disaggregation of the sample? Analysis of data from Pearson and Lee (1903) BEND STILL THERE! Wachsmuth et al. (003) write: 7

8 INBREEDING DEPRESSION Examine relationships of yield (milk, protein, fat) and somatic cell score (SCS) with inbreeding coefficient (F) using field data from US Jerseys Use REML, BLUP and local regression method (LOESS) for this purpose LEVEL OF INBREEDING IN HOLSTEINS, USA 8

9 Relationship between mean value of a quantitative trait and inbreeding coefficient (F) expected to be linear under dominance Not so ifepistatic interactions between dominance effects exist (Crow & Kimura, 1970) ONE-LOCUS MODEL GENOTYPE X A 1 A 1 A 1 A A A FREQUENCY p 1 1 F p 1 F p 1 p 1 F p 1 F p F PHENOTYPE A D A EX Ap p 1 p 1 p D p 1 p DF F 1 F 1 %Heterozygosity ADDITIVE MODEL WITH F (or H) AS COVARIATECONTRADICTORY 9

10 TWO (UNLINKED) LOCI: NO EPISTASIS Joint frequencies are product of marginal frequencies GENOTYPE A 1 A 1 A 1 A A A FREQUENCY p 1 1 F p 1 F p 1 p 1 F p 1 F p F B 1 B 1 r 1 1 F r 1 F A B D A B A B B 1 B r 1 r 1 F A DB D A DB A DB B B r 1 F r F A B D A B A B EX Ap p 1 Br r 1 p 1 p D A r 1 r DB p 1 p D A r 1 r D BF F TWO (UNLINKED) LOCI: EPISTASIS GENOTYPE A 1 A 1 A 1 A A A FREQUENCY p 1 1 F p 1 F p 1 p 1 F p 1 F p F B 1 B 1 r 1 1 F r 1 F A B I D A B L A B I B 1 B r 1 r 1 F A D B K D A D B J A D B K B B r 1 F r F A B I D A B L A B I ALLELES AT A and B LOCI SAME SUBSCRIPTADD I (ADDITIVE X ADDITIVE) HOMOZYGOUS AT A HETEROZYGOUS AT BSUBSTRACT AND ADD K HOMOZYGOUS AT B HETEROZYGOUS AT ASUBSTRACT AND ADD L (ADDITIVE X DOMINANCE) HETEROZYGOUS AT A AND BADD J (DOMINANCE X DOMINANCE) I,J,K,L: parameters (4 d. freedom) 10

11 Mean value under dominance x dominance epistasis EX Ap p 1 Br r 1 p 1 p D A r 1 r D B Ip 1 p r 1 r Lp 1 p r 1 r Kr 1 r p 1 p 4Jp 1 p r 1 r p 1 p D A r 1 r D B Lp 1 p r 1 r Kr 1 r p 1 p 4Jp 1 p r 1 r F 4Jp 1 p r 1 r F F F Dominance, additive x dominance, and dominance x dominance intervene in linear regression Epistasis without dominance does not enter into mean-f relationship Dominance x dominance intervenes in second-order regression DATA First lactation records (herds) on 59,778 (1,14) Jersey cows 6 generations of known pedigree First calving between 1995 and

12 Distribution of F F calculated from all known pedigree information F ranged between 0 and 34% Median F = 6.5% Histogram of F values F(%) 1

13 Procedures Fit linear models without F as covariate Compute EBLUP residuals from these models Fit nonparametric regression to EBLUP residuals in order to obtain nonparametric lines describing relationship between performance and inbreeding level Linear Models Model yijk HYSi AGEj 1 ( Dijk D) ak eijk y ijk = somatic cell score (SCS), milk, protein, or fat yield; HYS i = fixed effect of herd-year-season (i = 1,,.,176 for DS; for DS4 or 6406 for DS6, with seasons classes January April, May August, September December); AGE j = fixed effect of age at calving class; j = 1,,.,6 (< 617, , , , , or >1016 days of age); = fixed regression coefficient of performance on days in milk; 1 D ijk = days in milk for animal k in herd-year-season i and age of calving class j; D = 63; a k e ijk = random additive genetic effect of animal k, and = random residual. 13

14 Linear Model Assumptions Genetic and residual effects assumed mutually independent, with e ~ N( 0, I e ) and a ~ N( 0, Aσ a ) where A is the additive relationship matrix (1 + F k in the k th diagonal position, F k is the inbreeding coefficient of animal k) Nonparametric regression Fit LOESS regression to BLUP residuals with F as covariate Vary spanning parameter & degree of local polynomial Plot fitted values of residuals against F 14

15 ~ ij LOESS (Fitting done by locally weighted least squares) is LOESS fit using only residuals in the neighborhood of F i, i=1,, n (i=1,,,n animals; j=1,,4 traits) Size of neighborhood determined by f q = number of points in neighborhood n = total number of points q n Robust LOESS Weights assigned to : ˆijk I) II) w w [ t 1] [ t ] [ t ] ijk ijk ijk t=1,,3,4 [1] Fk Fi 3 3 w ijk [1 ( ) ] l 1,,... q max( Fl Fi ) ~ ijk ˆ [ t] ijk ijk [1 ( ) ] 6 med ~ med median of all ( ˆ ) ijk ijk 15

16 Cows withat least6 generations of known pedigree f 1 residual σ a nd degree local polynomial F(%) Robust original (black) with bootstrap (light blue) LOESS curves of yields for US Jerseys with at least 6 generations of known pedigree, based on medians of EBLUP residuals (y-axis = eˆ / ˆ ) ijk a f=0.9 f=0.5 f=0.9 nd degree local polynomial 16

17 Conclusions LOESS analysis suggested local relationships. Effects of inbreeding seem nil, until for F values up to ~7% Effects of inbreeding not accounted well by additive models Results may be confounded by effects of selection that are unaccounted for Kernel Regression 17

18 y i gx i e i ; i 1,,...,n where: y i is the measurement taken on individual i x i is a p 1 vector of observed SNP genotypes g. is some unknown function relating genotypes to phenotypes. Set gx i Ey i x i conditional expectation function e i 0, is a random residual 18

19 19

20 0

21 1

22

Procedure 2 of Section 2 of ICAR Guidelines Computing of Accumulated Lactation Yield. Computing Lactation Yield

Procedure 2 of Section 2 of ICAR Guidelines Computing of Accumulated Lactation Yield. Computing Lactation Yield of ICAR Guidelines Computing of Accumulated Lactation Yield Table of Contents 1 The Test Interval Method (TIM) (Sargent, 1968)... 4 2 Interpolation using Standard Lactation Curves (ISLC) (Wilmink, 1987)...

More information

Genetic relationships and trait comparisons between and within lines of local dual purpose cattle

Genetic relationships and trait comparisons between and within lines of local dual purpose cattle 67 th Annual meeting of the European Association for Animal Production Belfast, 2016 Genetic relationships and trait comparisons between and within lines of local dual purpose cattle M. Jaeger, K. Brügemann,

More information

Effects of inbreeding on milk production, fertility, and somatic cell count in Norwegian Red

Effects of inbreeding on milk production, fertility, and somatic cell count in Norwegian Red NORWEGIAN UNIVERSITY OF LIFE SCIENCES Effects of inbreeding on milk production, fertility, and somatic cell count in Norwegian Red K. Hov Martinsen 1, E. Sehested 2 and B. Heringstad* 1,2 1, Norwegian

More information

Simulation Study on Heterogeneous Variance Adjustment for Observations with Different Measurement Error Variance

Simulation Study on Heterogeneous Variance Adjustment for Observations with Different Measurement Error Variance Simulation Study on Heterogeneous Variance Adjustment for Observations with Different Measurement Error Variance Pitkänen, T. 1, Mäntysaari, E. A. 1, Nielsen, U. S., Aamand, G. P 3., Madsen 4, P. and Lidauer,

More information

MIXED MODELS THE GENERAL MIXED MODEL

MIXED MODELS THE GENERAL MIXED MODEL MIXED MODELS This chapter introduces best linear unbiased prediction (BLUP), a general method for predicting random effects, while Chapter 27 is concerned with the estimation of variances by restricted

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by

More information

Case-Control Association Testing. Case-Control Association Testing

Case-Control Association Testing. Case-Control Association Testing Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies

More information

Prediction of Future Milk Yield with Random Regression Model Using Test-day Records in Holstein Cows

Prediction of Future Milk Yield with Random Regression Model Using Test-day Records in Holstein Cows 9 ` Asian-Aust. J. Anim. Sci. Vol. 19, No. 7 : 9-921 July 26 www.ajas.info Prediction of Future Milk Yield with Random Regression Model Using Test-day Records in Holstein Cows Byoungho Park and Deukhwan

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,

More information

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013 Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic

More information

Breeding Values and Inbreeding. Breeding Values and Inbreeding

Breeding Values and Inbreeding. Breeding Values and Inbreeding Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A

More information

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50 LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if

More information

(Genome-wide) association analysis

(Genome-wide) association analysis (Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by

More information

Genetic parameters for various random regression models to describe total sperm cells per ejaculate over the reproductive lifetime of boars

Genetic parameters for various random regression models to describe total sperm cells per ejaculate over the reproductive lifetime of boars Published December 8, 2014 Genetic parameters for various random regression models to describe total sperm cells per ejaculate over the reproductive lifetime of boars S. H. Oh,* M. T. See,* 1 T. E. Long,

More information

Animal Models. Sheep are scanned at maturity by ultrasound(us) to determine the amount of fat surrounding the muscle. A model (equation) might be

Animal Models. Sheep are scanned at maturity by ultrasound(us) to determine the amount of fat surrounding the muscle. A model (equation) might be Animal Models 1 Introduction An animal model is one in which there are one or more observations per animal, and all factors affecting those observations are described including an animal additive genetic

More information

Animal Model. 2. The association of alleles from the two parents is assumed to be at random.

Animal Model. 2. The association of alleles from the two parents is assumed to be at random. Animal Model 1 Introduction In animal genetics, measurements are taken on individual animals, and thus, the model of analysis should include the animal additive genetic effect. The remaining items in the

More information

Contrasting Models for Lactation Curve Analysis

Contrasting Models for Lactation Curve Analysis J. Dairy Sci. 85:968 975 American Dairy Science Association, 2002. Contrasting Models for Lactation Curve Analysis F. Jaffrezic,*, I. M. S. White,* R. Thompson, and P. M. Visscher* *Institute of Cell,

More information

Overview. Background

Overview. Background Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems

More information

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative

More information

Quantitative Genetics I: Traits controlled my many loci. Quantitative Genetics: Traits controlled my many loci

Quantitative Genetics I: Traits controlled my many loci. Quantitative Genetics: Traits controlled my many loci Quantitative Genetics: Traits controlled my many loci So far in our discussions, we have focused on understanding how selection works on a small number of loci (1 or 2). However in many cases, evolutionary

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary

More information

I Have the Power in QTL linkage: single and multilocus analysis

I Have the Power in QTL linkage: single and multilocus analysis I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department

More information

Genetic Parameter Estimation for Milk Yield over Multiple Parities and Various Lengths of Lactation in Danish Jerseys by Random Regression Models

Genetic Parameter Estimation for Milk Yield over Multiple Parities and Various Lengths of Lactation in Danish Jerseys by Random Regression Models J. Dairy Sci. 85:1596 1606 American Dairy Science Association, 2002. Genetic Parameter Estimation for Milk Yield over Multiple Parities and Various Lengths of Lactation in Danish Jerseys by Random Regression

More information

Quantitative characters - exercises

Quantitative characters - exercises Quantitative characters - exercises 1. a) Calculate the genetic covariance between half sibs, expressed in the ij notation (Cockerham's notation), when up to loci are considered. b) Calculate the genetic

More information

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April

More information

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables Causal inference in biomedical sciences: causal models involving genotypes Causal models for observational data Instrumental variables estimation and Mendelian randomization Krista Fischer Estonian Genome

More information

GBLUP and G matrices 1

GBLUP and G matrices 1 GBLUP and G matrices 1 GBLUP from SNP-BLUP We have defined breeding values as sum of SNP effects:! = #$ To refer breeding values to an average value of 0, we adopt the centered coding for genotypes described

More information

Impact of Using Reduced Rank Random Regression Test-Day Model on Genetic Evaluation

Impact of Using Reduced Rank Random Regression Test-Day Model on Genetic Evaluation Impact of Using Reduced Rank Random Regression Test-Day on Genetic Evaluation H. Leclerc 1, I. Nagy 2 and V. Ducrocq 2 1 Institut de l Elevage, Département Génétique, Bât 211, 78 352 Jouy-en-Josas, France

More information

Methods for Cryptic Structure. Methods for Cryptic Structure

Methods for Cryptic Structure. Methods for Cryptic Structure Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases

More information

Genotyping strategy and reference population

Genotyping strategy and reference population GS cattle workshop Genotyping strategy and reference population Effect of size of reference group (Esa Mäntysaari, MTT) Effect of adding females to the reference population (Minna Koivula, MTT) Value of

More information

Lecture 6. QTL Mapping

Lecture 6. QTL Mapping Lecture 6 QTL Mapping Bruce Walsh. Aug 2003. Nordic Summer Course MAPPING USING INBRED LINE CROSSES We start by considering crosses between inbred lines. The analysis of such crosses illustrates many of

More information

Genetic parameters for female fertility in Nordic dairy cattle

Genetic parameters for female fertility in Nordic dairy cattle Genetic parameters for female fertility in Nordic dairy cattle K.Muuttoranta 1, A-M. Tyrisevä 1, E.A. Mäntysaari 1, J.Pösö 2, G.P. Aamand 3, J-Å. Eriksson 4, U.S. Nielsen 5, and M. Lidauer 1 1 Natural

More information

Lecture 11: Multiple trait models for QTL analysis

Lecture 11: Multiple trait models for QTL analysis Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple

More information

Multiple-Trait Across-Country Evaluations Using Singular (Co)Variance Matrix and Random Regression Model

Multiple-Trait Across-Country Evaluations Using Singular (Co)Variance Matrix and Random Regression Model Multiple-rait Across-Country Evaluations Using Singular (Co)Variance Matrix and Random Regression Model Esa A. Mäntysaari M Agrifood Research Finland, Animal Production, SF 31600 Jokioinen 1. Introduction

More information

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype.

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype. Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype. please read pages 38-47; 49-55;57-63. Slide 1 of Chapter 2 1 Extension sot Mendelian Behavior of Genes Single gene inheritance

More information

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Quantitative Genetics The analysis of traits whose variation is determined by both a

More information

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017 Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping

More information

Chap 1. Overview of Statistical Learning (HTF, , 2.9) Yongdai Kim Seoul National University

Chap 1. Overview of Statistical Learning (HTF, , 2.9) Yongdai Kim Seoul National University Chap 1. Overview of Statistical Learning (HTF, 2.1-2.6, 2.9) Yongdai Kim Seoul National University 0. Learning vs Statistical learning Learning procedure Construct a claim by observing data or using logics

More information

Genetic Parameters for Stillbirth in the Netherlands

Genetic Parameters for Stillbirth in the Netherlands Genetic Parameters for Stillbirth in the Netherlands Arnold Harbers, Linda Segeren and Gerben de Jong CR Delta, P.O. Box 454, 68 AL Arnhem, The Netherlands Harbers.A@CR-Delta.nl 1. Introduction Stillbirth

More information

Examining the Relative Influence of Familial, Genetic and Covariate Information In Flexible Risk Models. Grace Wahba

Examining the Relative Influence of Familial, Genetic and Covariate Information In Flexible Risk Models. Grace Wahba Examining the Relative Influence of Familial, Genetic and Covariate Information In Flexible Risk Models Grace Wahba Based on a paper of the same name which has appeared in PNAS May 19, 2009, by Hector

More information

Mixed-Models. version 30 October 2011

Mixed-Models. version 30 October 2011 Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector

More information

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition Lecture 3 Introduction on Quantitative Genetics: I Fisher s Variance Decomposition Bruce Walsh. Aug 004. Royal Veterinary and Agricultural University, Denmark Contribution of a Locus to the Phenotypic

More information

Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics

Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics Lee H. Dicker Rutgers University and Amazon, NYC Based on joint work with Ruijun Ma (Rutgers),

More information

Lecture 4 Multiple linear regression

Lecture 4 Multiple linear regression Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters

More information

Quantitative characters II: heritability

Quantitative characters II: heritability Quantitative characters II: heritability The variance of a trait (x) is the average squared deviation of x from its mean: V P = (1/n)Σ(x-m x ) 2 This total phenotypic variance can be partitioned into components:

More information

B) Describe the structures and functions of a Paramecium. Draw a Paramecium.

B) Describe the structures and functions of a Paramecium. Draw a Paramecium. Living Organisms & Genetics Final Exam Review Answers A) Describe the structures and functions of a Euglena. Draw a Euglena. Eyespot Helps the Euglena to detect light so it can move to a bright area to

More information

Case Studies in Ecology and Evolution

Case Studies in Ecology and Evolution 3 Non-random mating, Inbreeding and Population Structure. Jewelweed, Impatiens capensis, is a common woodland flower in the Eastern US. You may have seen the swollen seed pods that explosively pop when

More information

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012 Mixed-Model Estimation of genetic variances Bruce Walsh lecture notes Uppsala EQG 01 course version 8 Jan 01 Estimation of Var(A) and Breeding Values in General Pedigrees The above designs (ANOVA, P-O

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Best unbiased linear Prediction: Sire and Animal models

Best unbiased linear Prediction: Sire and Animal models Best unbiased linear Prediction: Sire and Animal models Raphael Mrode Training in quantitative genetics and genomics 3 th May to th June 26 ILRI, Nairobi Partner Logo Partner Logo BLUP The MME of provided

More information

Lecture 32: Infinite-dimensional/Functionvalued. Functions and Random Regressions. Bruce Walsh lecture notes Synbreed course version 11 July 2013

Lecture 32: Infinite-dimensional/Functionvalued. Functions and Random Regressions. Bruce Walsh lecture notes Synbreed course version 11 July 2013 Lecture 32: Infinite-dimensional/Functionvalued Traits: Covariance Functions and Random Regressions Bruce Walsh lecture notes Synbreed course version 11 July 2013 1 Longitudinal traits Many classic quantitative

More information

Evaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters

Evaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters J. Dairy Sci. 88:2632 2642 American Dairy Science Association, 2005. Evaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters R. M. Sawalha, 1 J.

More information

Just to review Genetics and Cells? How do Genetics and Cells Relate? The cell s NUCLEUS contains all the genetic information.

Just to review Genetics and Cells? How do Genetics and Cells Relate? The cell s NUCLEUS contains all the genetic information. Just to review Genetics and Cells? How do Genetics and Cells Relate? The cell s NUCLEUS contains all the genetic information. It s called: DNA A. Describe what Gregor Mendel discovered in his experiments

More information

A Modern Look at Classical Multivariate Techniques

A Modern Look at Classical Multivariate Techniques A Modern Look at Classical Multivariate Techniques Yoonkyung Lee Department of Statistics The Ohio State University March 16-20, 2015 The 13th School of Probability and Statistics CIMAT, Guanajuato, Mexico

More information

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification, Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability

More information

Robustness and Reproducing Kernel Hilbert Spaces. Grace Wahba

Robustness and Reproducing Kernel Hilbert Spaces. Grace Wahba Robustness and Reproducing Kernel Hilbert Spaces Grace Wahba Part 1. Regularized Kernel Estimation RKE. (Robustly) Part 2. Smoothing Spline ANOVA SS-ANOVA and RKE. Part 3. Partly missing covariates SS-ANOVA

More information

Legend: S spotted Genotypes: P1 SS & ss F1 Ss ss plain F2 (with ratio) 1SS :2 WSs: 1ss. Legend W white White bull 1 Ww red cows ww ww red

Legend: S spotted Genotypes: P1 SS & ss F1 Ss ss plain F2 (with ratio) 1SS :2 WSs: 1ss. Legend W white White bull 1 Ww red cows ww ww red On my honor, this is my work GENETICS 310 EXAM 1 June 8, 2018 I. Following are 3 sets of data collected from crosses: 1. Spotted by Plain gave all spotted in the F1 and 9 spotted and 3 plain in the F2.

More information

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to 1 1 1 1 1 1 1 1 0 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution

More information

Estimates of genetic parameters for total milk yield over multiple ages in Brazilian Murrah buffaloes using different models

Estimates of genetic parameters for total milk yield over multiple ages in Brazilian Murrah buffaloes using different models Estimates of genetic parameters for total milk yield over multiple ages in Brazilian Murrah buffaloes using different models R.C. Sesana 1, F. Baldi 1, R.R.A. Borquis 1, A.B. Bignardi 1, N.A. Hurtado-Lugo

More information

LINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES SECOND EDITION

LINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES SECOND EDITION LINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES SECOND EDITION LINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES Second Edition R.A. Mrode, PhD Scottish Agricultural College Sir Stephen

More information

Causal Graphical Models in Quantitative Genetics and Genomics

Causal Graphical Models in Quantitative Genetics and Genomics Causal Graphical Models in Quantitative Genetics and Genomics Guilherme J. M. Rosa Department of Animal Sciences Department of Biostatistics & Medical Informatics OUTLINE Introduction: Correlation and

More information

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next genetics the study of heredity heredity sequence of DNA that codes for a protein and thus determines a trait genome a specific characteristic that varies from one individual to another gene trait the passing

More information

Bayesian QTL mapping using skewed Student-t distributions

Bayesian QTL mapping using skewed Student-t distributions Genet. Sel. Evol. 34 00) 1 1 1 INRA, EDP Sciences, 00 DOI: 10.1051/gse:001001 Original article Bayesian QTL mapping using skewed Student-t distributions Peter VON ROHR a,b, Ina HOESCHELE a, a Departments

More information

Statistical issues in QTL mapping in mice

Statistical issues in QTL mapping in mice Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES: .5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the

More information

EXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series:

EXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series: Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 7 Exercise 7.. Derive the two scales of relation for each of the two following recurrent series: u: 0, 8, 6, 48, 46,L 36 7

More information

The concept of breeding value. Gene251/351 Lecture 5

The concept of breeding value. Gene251/351 Lecture 5 The concept of breeding value Gene251/351 Lecture 5 Key terms Estimated breeding value (EB) Heritability Contemporary groups Reading: No prescribed reading from Simm s book. Revision: Quantitative traits

More information

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015 Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.

More information

Crosses. Computation APY Sherman-Woodbury «hybrid» model. Unknown parent groups Need to modify H to include them (Misztal et al., 2013) Metafounders

Crosses. Computation APY Sherman-Woodbury «hybrid» model. Unknown parent groups Need to modify H to include them (Misztal et al., 2013) Metafounders Details in ssgblup Details in SSGBLUP Storage Inbreeding G is not invertible («blending») G might not explain all genetic variance («blending») Compatibility of G and A22 Assumption p(u 2 )=N(0,G) If there

More information

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics 1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

More information

Linkage and Linkage Disequilibrium

Linkage and Linkage Disequilibrium Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies

More information

Lecture 7 Correlated Characters

Lecture 7 Correlated Characters Lecture 7 Correlated Characters Bruce Walsh. Sept 2007. Summer Institute on Statistical Genetics, Liège Genetic and Environmental Correlations Many characters are positively or negatively correlated at

More information

Inversion Base Height. Daggot Pressure Gradient Visibility (miles)

Inversion Base Height. Daggot Pressure Gradient Visibility (miles) Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:

More information

Linear Models for the Prediction of Animal Breeding Values

Linear Models for the Prediction of Animal Breeding Values Linear Models for the Prediction of Animal Breeding Values R.A. Mrode, PhD Animal Data Centre Fox Talbot House Greenways Business Park Bellinger Close Chippenham Wilts, UK CAB INTERNATIONAL Preface ix

More information

Repeated Records Animal Model

Repeated Records Animal Model Repeated Records Animal Model 1 Introduction Animals are observed more than once for some traits, such as Fleece weight of sheep in different years. Calf records of a beef cow over time. Test day records

More information

Prediction of genetic Values using Neural Networks

Prediction of genetic Values using Neural Networks Prediction of genetic Values using Neural Networks Paulino Perez 1 Daniel Gianola 2 Jose Crossa 1 1 CIMMyT-Mexico 2 University of Wisconsin, Madison. September, 2014 SLU,Sweden Prediction of genetic Values

More information

3 Comparison with Other Dummy Variable Methods

3 Comparison with Other Dummy Variable Methods Stats 300C: Theory of Statistics Spring 2018 Lecture 11 April 25, 2018 Prof. Emmanuel Candès Scribe: Emmanuel Candès, Michael Celentano, Zijun Gao, Shuangning Li 1 Outline Agenda: Knockoffs 1. Introduction

More information

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important? Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population

More information

Resemblance between relatives

Resemblance between relatives Resemblance between relatives 1 Key concepts Model phenotypes by fixed effects and random effects including genetic value (additive, dominance, epistatic) Model covariance of genetic effects by relationship

More information

3. Properties of the relationship matrix

3. Properties of the relationship matrix 3. Properties of the relationship matrix 3.1 Partitioning of the relationship matrix The additive relationship matrix, A, can be written as the product of a lower triangular matrix, T, a diagonal matrix,

More information

RESTRICTED M A X I M U M LIKELIHOOD TO E S T I M A T E GENETIC P A R A M E T E R S - IN PRACTICE

RESTRICTED M A X I M U M LIKELIHOOD TO E S T I M A T E GENETIC P A R A M E T E R S - IN PRACTICE RESTRICTED M A X I M U M LIKELIHOOD TO E S T I M A T E GENETIC P A R A M E T E R S - IN PRACTICE K. M e y e r Institute of Animal Genetics, Edinburgh University, W e s t M a i n s Road, Edinburgh EH9 3JN,

More information

Generalized Additive Models

Generalized Additive Models Generalized Additive Models The Model The GLM is: g( µ) = ß 0 + ß 1 x 1 + ß 2 x 2 +... + ß k x k The generalization to the GAM is: g(µ) = ß 0 + f 1 (x 1 ) + f 2 (x 2 ) +... + f k (x k ) where the functions

More information

NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157)

NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157) NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157) Evidence Statement Q1 Expected coverage Merit Excellence

More information

Lecture 02 Linear classification methods I

Lecture 02 Linear classification methods I Lecture 02 Linear classification methods I 22 January 2016 Taylor B. Arnold Yale Statistics STAT 365/665 1/32 Coursewebsite: A copy of the whole course syllabus, including a more detailed description of

More information

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 4. The efficiency of selection The selection programmes

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 4. The efficiency of selection The selection programmes INTRODUCTION TO ANIMAL BREEDING Lecture Nr 4 The efficiency of selection The selection programmes Etienne Verrier INA Paris-Grignon, Animal Sciences Department Verrier@inapg.fr The genetic gain and its

More information

Lecture 2: Linear and Mixed Models

Lecture 2: Linear and Mixed Models Lecture 2: Linear and Mixed Models Bruce Walsh lecture notes Introduction to Mixed Models SISG, Seattle 18 20 July 2018 1 Quick Review of the Major Points The general linear model can be written as y =

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

Common Mating Designs in Agricultural Research and Their Reliability in Estimation of Genetic Parameters

Common Mating Designs in Agricultural Research and Their Reliability in Estimation of Genetic Parameters IOSR Journal of Agriculture and Veterinary Science (IOSR-JAVS) e-issn: 2319-2380, p-issn: 2319-2372. Volume 11, Issue 7 Ver. II (July 2018), PP 16-36 www.iosrjournals.org Common Mating Designs in Agricultural

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

Prediction of the Confidence Interval of Quantitative Trait Loci Location

Prediction of the Confidence Interval of Quantitative Trait Loci Location Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28

More information

Partitioning Genetic Variance

Partitioning Genetic Variance PSYC 510: Partitioning Genetic Variance (09/17/03) 1 Partitioning Genetic Variance Here, mathematical models are developed for the computation of different types of genetic variance. Several substantive

More information

Need for Several Predictor Variables

Need for Several Predictor Variables Multiple regression One of the most widely used tools in statistical analysis Matrix expressions for multiple regression are the same as for simple linear regression Need for Several Predictor Variables

More information

Bayesian Inference of Interactions and Associations

Bayesian Inference of Interactions and Associations Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,

More information

Model Selection for Multiple QTL

Model Selection for Multiple QTL Model Selection for Multiple TL 1. reality of multiple TL 3-8. selecting a class of TL models 9-15 3. comparing TL models 16-4 TL model selection criteria issues of detecting epistasis 4. simulations and

More information

Introduction to Nonparametric Regression

Introduction to Nonparametric Regression Introduction to Nonparametric Regression Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)

More information

Chapter 7 The Genetic Model for Quantitative Traits

Chapter 7 The Genetic Model for Quantitative Traits Chapter 7 The Genetic Model for Quantitative Traits I. The Basic Model II. Breeding Value III. Gene Combination Value IV. Producing Ability Chapter 7 The Genetic Model for Quantitative Traits Learning

More information

The Quantitative TDT

The Quantitative TDT The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus

More information