PREDICTION OF BREEDING VALUES FOR UNMEASURED TRAITS FROM MEASURED TRAITS

Similar documents
Mixed-Models. version 30 October 2011

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012

GENERALIZED LINEAR MIXED MODELS: AN APPLICATION

Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

Best unbiased linear Prediction: Sire and Animal models

Models with multiple random effects: Repeated Measures and Maternal effects

Quantitative characters - exercises

MIXED MODELS THE GENERAL MIXED MODEL

Multiple random effects. Often there are several vectors of random effects. Covariance structure

Animal Model. 2. The association of alleles from the two parents is assumed to be at random.

RESTRICTED M A X I M U M LIKELIHOOD TO E S T I M A T E GENETIC P A R A M E T E R S - IN PRACTICE

3. Properties of the relationship matrix

Genetic Parameters for Stillbirth in the Netherlands

Reduced Animal Models

Lecture 9 Multi-Trait Models, Binary and Count Traits

Maternal Genetic Models

Likelihood Methods. 1 Likelihood Functions. The multivariate normal distribution likelihood function is

VARIANCE COMPONENT ESTIMATION & BEST LINEAR UNBIASED PREDICTION (BLUP)

Best Linear Unbiased Prediction: an Illustration Based on, but Not Limited to, Shelf Life Estimation

ASPECTS OF SELECTION FOR PERFORMANCE IN SEVERAL ENVIRONMENTS WITH HETEROGENEOUS VARIANCES

Estimation of Variance Components in Animal Breeding

RECENT DEVELOPMENTS IN VARIANCE COMPONENT ESTIMATION

Animal Models. Sheep are scanned at maturity by ultrasound(us) to determine the amount of fat surrounding the muscle. A model (equation) might be

NONLINEAR REGRESSION FOR SPLIT PLOT EXPERIMENTS

Linear Models for the Prediction of Animal Breeding Values

SIMULATION STUDY OF SPATIAL- POISSON DATA ASSESSING INCLUSION OF SPATIAL CORRELATION AND NON- NORMALITY IN THE ANALYSIS

EVALUATION OF FOUR COVARIATE TYPES USED FOR ADJUSTMENT OF SPATIAL VARIABILITY

Should genetic groups be fitted in BLUP evaluation? Practical answer for the French AI beef sire evaluation

Genetic grouping for direct and maternal effects with differential assignment of groups

Chapter 5 Prediction of Random Variables

THE EFFECT OF MONENSIN ON LACTATION DAIRY COWS: A DOSE RESPONSE EVALUATION

Repeated Records Animal Model

Contemporary Groups for Genetic Evaluations

Best linear unbiased prediction when error vector is correlated with other random vectors in the model

A simple method of computing restricted best linear unbiased prediction of breeding values

Multiple Trait Evaluation of Bulls for Calving Ease

Evaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters

Alternative implementations of Monte Carlo EM algorithms for likelihood inferences

of Nebraska - Lincoln

Multiple-Trait Across-Country Evaluations Using Singular (Co)Variance Matrix and Random Regression Model

Lecture 5 Basic Designs for Estimation of Genetic Parameters

Multiple regression. Partial regression coefficients

GBLUP and G matrices 1

Heterogeneity of variances by herd production level and its effect on dairy cow and sire evaluation

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 3. The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs

Prediction of breeding values with additive animal models for crosses from 2 populations

Chapter 11 MIVQUE of Variances and Covariances

Genetic evaluation for three way crossbreeding

BLUP without (inverse) relationship matrix

5. Best Linear Unbiased Prediction

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 4. The efficiency of selection The selection programmes

Modification of negative eigenvalues to create positive definite matrices and approximation of standard errors of correlation estimates

MEXICAN SIMMENTAL-BRAHMAN GENETIC CHARACTERIZATION, GENETIC PARAMETERS AND GENETIC TRENDS

I22. EARLE W. KLOSTERMAN Ohio A g r i c u l t u r a l Research and Development Center Wooster, Ohio

OPTIMUM DESIGN ON STEP-STRESS LIFE TESTING

REDUCED ANIMAL MODEL FOR MARKER ASSISTED SELECTION USING BEST LINEAR UNBIASED PREDICTION. R.J.C.Cantet 1 and C.Smith

RANDOM REGRESSION IN ANIMAL BREEDING

Genetic parameters for female fertility in Nordic dairy cattle

AN INTRODUCTION TO GENERALIZED LINEAR MIXED MODELS. Stephen D. Kachman Department of Biometry, University of Nebraska Lincoln

Raphael Mrode. Training in quantitative genetics and genomics 30 May 10 June 2016 ILRI, Nairobi. Partner Logo. Partner Logo

G-BLUP without inverting the genomic relationship matrix

A reduced animal model with elimination of quantitative trait loci equations for marker-assisted selection

Genetic Heterogeneity of Environmental Variance - estimation of variance components using Double Hierarchical Generalized Linear Models

Genetic Parameter Estimation for Milk Yield over Multiple Parities and Various Lengths of Lactation in Danish Jerseys by Random Regression Models

Estimation of the Proportion of Genetic Variation Accounted for by DNA Tests

Prediction of Future Milk Yield with Random Regression Model Using Test-day Records in Holstein Cows

Solving Systems of Linear Equations Using Matrices

Comparison of computing properties of derivative and derivative-free algorithms in variance component estimation by REML.

Chapter 12 REML and ML Estimation

Covariance functions and random regression models for cow weight in beef cattle

ANALYSING BINARY DATA IN A REPEATED MEASUREMENTS SETTING USING SAS

Recent advances in statistical methods for DNA-based prediction of complex traits

MULTIBREED ANIMAL EVALUATION AND ITS APPLICATION TO THE THAI ENVIRONMENT. Numbers of Sires. Multibreed Population. Numbers of Calves.

Lecture 28: BLUP and Genomic Selection. Bruce Walsh lecture notes Synbreed course version 11 July 2013

HOTELLING'S T 2 APPROXIMATION FOR BIVARIATE DICHOTOMOUS DATA

Phantom Groups LRS. July-Aug 2012 CGIL. LRS (CGIL) Summer Course July-Aug / 34

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

BIOAVAILABILITY AND CV COMPONENT COMPARISON IN A CROSSOVER

Supplementary File 3: Tutorial for ASReml-R. Tutorial 1 (ASReml-R) - Estimating the heritability of birth weight

Maine-Anjou. PO Box 1100 Platte City, MO (816) fax (816)

Solving Large Test-Day Models by Iteration on Data and Preconditioned Conjugate Gradient

Bayesian estimation in animal breeding using the Dirichlet process prior for correlated random effects

Contrasting Models for Lactation Curve Analysis

Lecture 13 Family Selection. Bruce Walsh lecture notes Synbreed course version 4 July 2013

MESSY EXPERIMENTAL DESIGNS

1 Arabidopsis growth curves

Lecture 32: Infinite-dimensional/Functionvalued. Functions and Random Regressions. Bruce Walsh lecture notes Synbreed course version 11 July 2013

A relationship matrix including full pedigree and genomic information

An indirect approach to the extensive calculation of relationship coefficients

Quantitative genetic (animal) model example in R

A simple method to separate base population and segregation effects in genomic relationship matrices

Estimating genetic parameters using

TWO-FACTOR AGRICULTURAL EXPERIMENT WITH REPEATED MEASURES ON ONE FACTOR IN A COMPLETE RANDOMIZED DESIGN

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation

Lecture 7 Correlated Characters

ANALYSIS OF SPATIAL VARIABILITY USING PROC MIXED

Method % Variance Components Procedure: Application on the Simple Breeding Value

Introduction to Econometrics Final Examination Fall 2006 Answer Sheet

Distinctive aspects of non-parametric fitting

Use of sparse matrix absorption in animal breeding

Transcription:

Libraries Annual Conference on Applied Statistics in Agriculture 1994-6th Annual Conference Proceedings PREDICTION OF BREEDING VALUES FOR UNMEASURED TRAITS FROM MEASURED TRAITS Kristin L. Barkhouse L. Dale Van Vleck Follow this and additional works at: http://newprairiepress.org/agstatconference Part of the Agriculture Commons, and the Applied Statistics Commons This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License. Recommended Citation Barkhouse, Kristin L. and Van Vleck, L. Dale (1994). "PREDICTION OF BREEDING VALUES FOR UNMEASURED TRAITS FROM MEASURED TRAITS," Annual Conference on Applied Statistics in Agriculture. http://newprairiepress.org/agstatconference/ 1994/proceedings/22 This is brought to you for free and open access by the Conferences at. It has been accepted for inclusion in Annual Conference on Applied Statistics in Agriculture by an authorized administrator of. For more information, please contact cads@k-state.edu.

Applied Statistics in Agriculture 289 PREDICTION OF BREEDING V ALVES FOR UNMEASURED TRAITS FROM MEASURED TRAITS Kristin L. Barkhouse * and L. Dale Van Vleck University of Nebraska, Lincoln ARS-USDA, U.S. Meat Animal Research Center Clay Center, NE ABSTRACT Henderson (1977, 1984) described a method for prediction of breeding values for traits not in the model for records. This method may be practical for animal or sire models for the case when no measurements can be obtained on any animals for some traits to be evaluated. The least squares equations are augmented with A -1 ON1 rather than with A -1 OOl where A is the numerator relationship and 00 and ON are the genetic covariance matrices for measured and for all traits. This method can be used for each unmeasured trait or simultaneously for measured and all unmeasured traits. An option in the MTDFRUN module of the Multiple Trait Derivative Free REML (MTDFREML) program of Boldman et al. (1993) is to obtain solutions for breeding values and their prediction error variances. However, the preparation program (MTDFPREP) must be tricked to set-up equation numbers for breeding values of the unmeasured traits. Adding dummy records for the unmeasured traits but with missing records for the measured traits for dummy animals to the data file of animals with measured traits but with missing unmeasurable traits will result in the needed equations. At least two dummy records are needed to avoid a divide by zero error in calculating the sample standard deviation. The dummy records need to be associated with a level of at least one f!xed factor. The dummy animals also must be added to the pedigree file with unknown sires and dams before running the program to obtain the inverse of the numerator relationship matrix (MTDFNRM). In the program to obtain solutions to the multiple trait mixed model equations (MTDFRUN), the full genetic (co)vaiiance matrix, ON' for both measured and unmeasured traits is needed. The residual (co)variance matrix must have zero covariances between pairs of measured and unmeasured traits but the variance of the unmeasured trait must not be zero. This procedure provides direct solutions for breeding values of unmeasured traits based on mixed model predictions of breeding values of the measured traits and also allows calculation of standard errors of prediction for the solutions directly from elements of the inverse of the augmented coefficient matrix. For example, this procedure can be used to predict breeding values of bulls (which have tenderness measurements) for the correlated trait of tenderness as a steer or heifer which cannot be measured on the bull.

290 INTRODUCTION Breeding values of animals for traits which are not measured are often wanted; for example, breeding values of bulls for maternal traits such as milking ability. Although bulls do not have records, breeding values may be obtained through relationships with daughters that have records. If genetic and environmental covariances between measured trait(s) and unmeasured trait(s) are known, the mixed model equations (MME) can be augmented as described by Henderson (1977, 1984) to include unmeasured as well as measured traits. Henderson (1977) showed that augmenting the MME for unmeasured trait(s) results in solutions for breeding values which are best linear unbiased predictors, BLUP, and are independent of the genetic variances for the unmeasured traits. Prediction error variances as well as predicted breeding values can be obtained directly from the augmented MME. THE GENERAL MIXED MODEL The model for the measured traits is y = XB + Zu + e, with E(y) = XB and y V ZO R V [ u = OZ' o. 0 where e R 0 R y is the vector of observations for measured traits, B is the vector of fixed effects with incidence matrix X, u is the vector of breeding values for measured traits with incidence matrix Z, and e is the vector of random residuals. BLUP of u is known to be ft = OZ'V -ley - x(3) where fi is the vector of solutions to the mixed model equations for the measured traits or equivalently of solutions to the generalized least squares equations. As shown by Henderson (1977) for the vector of breeding values for the unmeasured trait, un' with E(un) = 0, V(un) = On' and Cov(u, ud = C, BLUP of un is = = = CZ'V-l (y - XB) CO-l OZ'V-l(y - Xfi) CO-1ft

Applied Statistics in Agriculture 291 AUGMENTED MIXED MODEL EQUATIONS The augmented MME as described by Henderson (1977) are X'R -IX X'R -IZ 0 /3 X'R -Iy where Z'R -IX Z'R -IZ + W ll W I2 U = Z'R -ly, 0 W IZ W Z2 un 0 (:~1 W u )" (G cr W1Z Wzz C On If the equations for ~ are absorbed, then because W 11 - W IZ W;I W;z = 0 -I, the results are the usual MME for U, i.e., U for the augmented equations is the same as BLUP for u from the usual MME. The solutions for ~ from the augmented MME can be shown to be BLUP (Henderson, 1977) as follows: From the last row of the augmented MME, un = - W;IW;2U. From multiplication of a matrix by its inverse W;zO + WzzC' = O. Thus, W;z = - WZZC'O-l, so that un = - Wzi(- W ZZ C'O-I)U = C'O-IU which is BLUP and does not depend on On' A PROGRAM TO FORM THE AUGMENTED EQUATIONS MTDFREML (Multiple Trait Derivative Free Restricted Maximum Likelihood) is a set of programs developed by Boldman et al: (1993) to solve the MME and to obtain REML estimates of variance components with a derivative-free algorithm. To augment the MME for the unmeasured traits requires modifications to input of two of the three MTDFREML programs. First, the preparation program (MTDFPREP) must be tricked to set up equations for breeding values of the unmeasured traits. Adding dummy records for the unmeasured traits but with missing records for the measured traits for dummy animals to the data file of animals with measured traits but with missing unmeasurable traits will result in the needed equations. The dummy animals also must be added to the pedigree file with urtknown sires and dams before running the program (MTDFNRM) to obtain the inverse of the numerator relationship matrix. The objective of this paper is to demonstrate the modifications to input to the MTDFREML programs that are required to implement Henderson's method of augmentation.

292 MATERIALS AND METHODS An example is used to demonstrate how the MME can be augmented as described by Henderson to include unmeasured traits. Various tenderness measures on meat of bulls will be used to predict breeding values for market animal tenderness. The MME are augmented to include market animal tenderness, the unmeasured trait. Data came from the Germ Plasm Evaluation Project of the U.S. Meat Animal Research Center, Clay Center, NE, where 127 bulls and 1400 steers and heifers were measured for taste panel tenderness (T) and Warner Bratzler Shear Force (SF). Genetic and environmental covariance matrices including measured and unmeasured traits are needed and must be previously estimated, in this case from 1400 measures on related steers and heifers (market animals). These covariance matrices will then provide the basis for the following analysis. Covariances for this example are between two measured traits, T and SF, and the unmeasured trait, T*. The MME in MTDFREML must be augmented to include equations for each bull for T*, the unmeasured trait. The original observation vector, y, includes only SF and T: The model included fixed effects for birth year, breed, covariates for fraction heterozygosity, weaning age and number of days on feed. AUGMENTING THE MME IN MTDFREML To augment the MME to include equations for T*, two of the MTDFREML programs must be tricked. At least two dummy records with T* measurements but with missing values for SF and T are needed to set up the needed equations. The following are the modillcations to the input for MTDFPREP, MTDFNRM and MTDFRUN. INPUT TO MTDFNRM 1) Add to pedigree file, the identification numbers for the two dummy animals with zeroes for each sire and dam. INPUT TO MTDFPREP 2) Make sure there is a field in the data file for the unmeasured trait. a) For real animals (bulls) with measured traits, the field for the UIuneasured trait must contain the missing observation indicator, e.g., -99.

Applied Statistics in Agriculture 293 b) For the dummy animals i) The fields for traits measured on the real animals (bulls) must contain the missing observation indicator, e.g., -99. ii) iii) The field for the trait not measured on the real animals must contain a dummy measurement. At least one field corresponding to fixed effects must contain a level to correspond to a f!xed effect for the dummy measurements on the dummy animals. INPUT TO MTDFRUN 3) Elements of covariance matrices (use option 4 for solutions to MME) a) Elements of the genetic covariance matrix among all measured traits and the unmeasured trait must be entered. The correct genetic variance of the unmeasured trait is needed for correct prediction error variances. Otherwise any non-zero entry will work if the full genetic covariance matrix is positive defrnite. b) Environmental covariances between unmeasured trait and each measured trait are entered as ZEROs. Environmental variance of unmeasured trait can be any positive value. The correct matrix of covariances among environmental effects of measured traits is needed. These modifications to the input of the MTDFREML programs result in the following equations for breeding values for the measured traits as well as the unmeasured traits: Shear force for bulls (records) Tenderness for bulls (records) Tenderness for bulls as market animals (unmeasured) Shear force as a bull for the dummy animals (unmeasured) Tenderness as a bull for the dummy animals (unmeasured) Tenderness for the dummy animals (dummy records)

294 This procedure leads to two independent sets of equations: the augmented mixed model equations and an independent set of equations for the dummy animals. TIlls independence is shown symbolically. Left hand sides of augmented and other set of MME are shown here symbolically before adding (G C )-1. C G. n X~f 0 0 o x; 0 o 0 x;. Z~f 0 0 o ~ 0 000 000 000 r~ =0 o 0 ~ ~ dummy market animals O's act as placeholders Cor unmeasured traits ~ o ZSF 0 roool ~o ] o 0 Zrloool XT* 0 0 L~~~I The equations for SF and T are linked by ~F.T, but equations for T* are not linked to either bull trait (rsf,t*, rt,t* = 0). As a result, the augmented left hand sides (LHS) are two independent sets of equations, one for the measured bull traits and the other representing the dummy animals with dummy market animal tenderness records.

Applied Statistics in Agriculture 295 The following portion of the C)-l LHS represents Henderson's augmented ( G equat~ons after adding C G to the highlighted box in the previous equations. n The zeros represent coefficients of equations for bull tenderness as a market animal (unmeasured). Z'.SF rsf7 -SF z;. rsf,t~f o 03x3 o o 03x3 o 0 o 0 +(G:rt~1 o Z~rT*z,.. The other elements result from tricking the programs to include the unmeasured traits.

296 RIGHT HAND SIDES (RHS) OF MME CORRESPONDING TO THE LHS:. XSF 0 0 0 X; 0 rsf~ rsf.~ rsf,t'~ YSF rt,sf~ 0 0 X~o r~ r T,TO~ YT. r T.,SF~ r T.,T~ rto~ YTo ZSF 0 0 0 ~ 0 0 0 0 0 0 0 0 0 0 0 0 00 ~o After symbolic multiplication, the right hand 'sides and corresponding solution vector, become:... SF XSFr YSF + X~pSF,TYT f3 SF X'rSF,Ty + T SF ' T XTr YT... f3t, T XT.r 'YT* f3t o, SF ZSFr YSF + Z~pSF,TYT z;.r SF,T Y SF + z;.rtyt bulls USF [ ut 0 0 USF [ " dummies" 0 ut, T 0.r 'YTo uto ~I * *where ut* represents the desired breeding values of bulls for T*, which was not measured on the bulls. Examination of these equations reveals the augmented equations to predict breeding values for unmeasured trait and an independent set of equations resulting from the dummy records of the dummy animals.

Applied Statistics in Agriculture 297 THE ESTIMATES OF COVARIANCES AMONG SF, T, AND T* USED FOR THE EXAMPLE ARE: G~ = SF T T*.619 -.082 -.122 -.082.015.040 -.122.040.174 and ~+ = SF T T* 1.383 -.686 0 -.686.643 0 0 0 1.527 RESULTS Range of predicted breeding values of the bulls for market animal tenderness was small due to the lack of genetic variation in tenderness, -.11 to.08. A typical prediction error variance from option 4 of MTDFRUN was (.39)2 =.1521 so that the average squared correlation between true and predicted breedin~ value for market animal tenderness of bulls measured for tenderness and shear force is rtr = 1 - (.1521/.174) =.13 with accuracy =.36. CONCLUSIONS Input to the MTDFREML programs, can be modified to predict breeding values of animals for traits not in the model for records, e.g., prediction of breeding values of bulls for market animal tenderness. REFERENCES Boldman, K. G., L. A. Kriese, L. D. Van Vleck, and S. D. Kachman. 1993. A Manual for use of MTDFREML. A set of programs to obtain estimates of variances and covariances [DRAFT]. U.S. Department of Agriculture, Agricultural Research Service. pp 120. Henderson, C. R. 1977. Best linear unbiased prediction of breeding values not in the model for records. 1. Dairy Sci. 60:783. Henderson, C. R. 1984. Application of linear models in animal breeding. Univ. Guelph, Guelph, ON, Canada.