Distinctive aspects of non-parametric fitting
|
|
- Chad Gilbert
- 5 years ago
- Views:
Transcription
1 5. Introduction to nonparametric curve fitting: Loess, kernel regression, reproducing kernel methods, neural networks Distinctive aspects of non-parametric fitting Objectives: investigate patterns free of strictures imposed by parametric models Can produce surprising results Regression coefficients appear but (typically) do not have an obvious interpretation Often have very good predictive performance in cross-validation Tuning methods similar to those for parametric methods 1
2 Example: thin-plate splines Risk of heart attack after 19 years as a function of cholesterol level and blood pressure. Left: logistic regression model. Right: thin plate spline fit. Wahba (007) log ) ( j i j i N j j i j i j i i i x x x x x x x x x x f x LOESS REGRESSION: Non-parametric exploration of inbreeding depression for yield and somatic cell count in Jersey cattle
3 AN OVERVIEW OF LOWESS REGRESSION 1) DATA POINTS x i, y i ; i 1,,...,n ) SPANNING PARAMETER f; 0 f 1 k fn; k LARGEST INTEGER fn 3) FOR EACH x 0 FIND k POINTS x i CLOSEST TO x 0 Nx 0 NEIGHBORHOOD OF k POINTS 4) COMPUTE Δx 0 max x i Nx 0 x o x i 5) TO EACH x i, y i ; x i Nx 0 ASSIGN WEIGHT w i x 0 1 x o x i Δx ) FIT BY WEIGHTED LEAST-SQUARES k w i x 0y i 0 1 x i x i i1 RETURN yx x i x i 7) REPEAT FOR EACH OF THE x 0 3
4 ROBUST LOWESS STANDARD LOWESS NOT ROBUST BASED ON LEAST-SQUARES WEIGHTS BI-SQUARE LOWESS RE-WEIGHT POINTS ACCORDING TO RESIDUAL IF RESIDUAL LARGE, WEIGHT IS DECREASED 1) FIT DATA USING STANDARD LOESS ) CALCULATE LOESS RESIDUALS y i y i 3) COMPUTE q 1 median y i y i r i 4) CALCULATE BI-SQUARE ROBUST WEIGHTS 1 y i y i 6 q 1 5) REPEAT LOESS WITH WEIGHTS r i w ix 0 6) REPEAT -5 UNTIL LOESS CURVE CONVERGES 4
5 Example Birth rate in US population (U. S. Department of Health, Education and Welfare) n=96 births per 1000 US population during Top > Ordinary Least Squares with 1 st, nd & 3 rd degree polynomial Bottom > LOWESS fit with f =., f=.4 & f=.6 5
6 GALTON S BEND (Wachsmuth et al. 003, Am. Stat.) A possibility is that Galton ignored concealed heterogeneity 6
7 Does the bend disappear by disaggregation of the sample? Analysis of data from Pearson and Lee (1903) BEND STILL THERE! Wachsmuth et al. (003) write: 7
8 INBREEDING DEPRESSION Examine relationships of yield (milk, protein, fat) and somatic cell score (SCS) with inbreeding coefficient (F) using field data from US Jerseys Use REML, BLUP and local regression method (LOESS) for this purpose LEVEL OF INBREEDING IN HOLSTEINS, USA 8
9 Relationship between mean value of a quantitative trait and inbreeding coefficient (F) expected to be linear under dominance Not so ifepistatic interactions between dominance effects exist (Crow & Kimura, 1970) ONE-LOCUS MODEL GENOTYPE X A 1 A 1 A 1 A A A FREQUENCY p 1 1 F p 1 F p 1 p 1 F p 1 F p F PHENOTYPE A D A EX Ap p 1 p 1 p D p 1 p DF F 1 F 1 %Heterozygosity ADDITIVE MODEL WITH F (or H) AS COVARIATECONTRADICTORY 9
10 TWO (UNLINKED) LOCI: NO EPISTASIS Joint frequencies are product of marginal frequencies GENOTYPE A 1 A 1 A 1 A A A FREQUENCY p 1 1 F p 1 F p 1 p 1 F p 1 F p F B 1 B 1 r 1 1 F r 1 F A B D A B A B B 1 B r 1 r 1 F A DB D A DB A DB B B r 1 F r F A B D A B A B EX Ap p 1 Br r 1 p 1 p D A r 1 r DB p 1 p D A r 1 r D BF F TWO (UNLINKED) LOCI: EPISTASIS GENOTYPE A 1 A 1 A 1 A A A FREQUENCY p 1 1 F p 1 F p 1 p 1 F p 1 F p F B 1 B 1 r 1 1 F r 1 F A B I D A B L A B I B 1 B r 1 r 1 F A D B K D A D B J A D B K B B r 1 F r F A B I D A B L A B I ALLELES AT A and B LOCI SAME SUBSCRIPTADD I (ADDITIVE X ADDITIVE) HOMOZYGOUS AT A HETEROZYGOUS AT BSUBSTRACT AND ADD K HOMOZYGOUS AT B HETEROZYGOUS AT ASUBSTRACT AND ADD L (ADDITIVE X DOMINANCE) HETEROZYGOUS AT A AND BADD J (DOMINANCE X DOMINANCE) I,J,K,L: parameters (4 d. freedom) 10
11 Mean value under dominance x dominance epistasis EX Ap p 1 Br r 1 p 1 p D A r 1 r D B Ip 1 p r 1 r Lp 1 p r 1 r Kr 1 r p 1 p 4Jp 1 p r 1 r p 1 p D A r 1 r D B Lp 1 p r 1 r Kr 1 r p 1 p 4Jp 1 p r 1 r F 4Jp 1 p r 1 r F F F Dominance, additive x dominance, and dominance x dominance intervene in linear regression Epistasis without dominance does not enter into mean-f relationship Dominance x dominance intervenes in second-order regression DATA First lactation records (herds) on 59,778 (1,14) Jersey cows 6 generations of known pedigree First calving between 1995 and
12 Distribution of F F calculated from all known pedigree information F ranged between 0 and 34% Median F = 6.5% Histogram of F values F(%) 1
13 Procedures Fit linear models without F as covariate Compute EBLUP residuals from these models Fit nonparametric regression to EBLUP residuals in order to obtain nonparametric lines describing relationship between performance and inbreeding level Linear Models Model yijk HYSi AGEj 1 ( Dijk D) ak eijk y ijk = somatic cell score (SCS), milk, protein, or fat yield; HYS i = fixed effect of herd-year-season (i = 1,,.,176 for DS; for DS4 or 6406 for DS6, with seasons classes January April, May August, September December); AGE j = fixed effect of age at calving class; j = 1,,.,6 (< 617, , , , , or >1016 days of age); = fixed regression coefficient of performance on days in milk; 1 D ijk = days in milk for animal k in herd-year-season i and age of calving class j; D = 63; a k e ijk = random additive genetic effect of animal k, and = random residual. 13
14 Linear Model Assumptions Genetic and residual effects assumed mutually independent, with e ~ N( 0, I e ) and a ~ N( 0, Aσ a ) where A is the additive relationship matrix (1 + F k in the k th diagonal position, F k is the inbreeding coefficient of animal k) Nonparametric regression Fit LOESS regression to BLUP residuals with F as covariate Vary spanning parameter & degree of local polynomial Plot fitted values of residuals against F 14
15 ~ ij LOESS (Fitting done by locally weighted least squares) is LOESS fit using only residuals in the neighborhood of F i, i=1,, n (i=1,,,n animals; j=1,,4 traits) Size of neighborhood determined by f q = number of points in neighborhood n = total number of points q n Robust LOESS Weights assigned to : ˆijk I) II) w w [ t 1] [ t ] [ t ] ijk ijk ijk t=1,,3,4 [1] Fk Fi 3 3 w ijk [1 ( ) ] l 1,,... q max( Fl Fi ) ~ ijk ˆ [ t] ijk ijk [1 ( ) ] 6 med ~ med median of all ( ˆ ) ijk ijk 15
16 Cows withat least6 generations of known pedigree f 1 residual σ a nd degree local polynomial F(%) Robust original (black) with bootstrap (light blue) LOESS curves of yields for US Jerseys with at least 6 generations of known pedigree, based on medians of EBLUP residuals (y-axis = eˆ / ˆ ) ijk a f=0.9 f=0.5 f=0.9 nd degree local polynomial 16
17 Conclusions LOESS analysis suggested local relationships. Effects of inbreeding seem nil, until for F values up to ~7% Effects of inbreeding not accounted well by additive models Results may be confounded by effects of selection that are unaccounted for Kernel Regression 17
18 y i gx i e i ; i 1,,...,n where: y i is the measurement taken on individual i x i is a p 1 vector of observed SNP genotypes g. is some unknown function relating genotypes to phenotypes. Set gx i Ey i x i conditional expectation function e i 0, is a random residual 18
19 19
20 0
21 1
22
Procedure 2 of Section 2 of ICAR Guidelines Computing of Accumulated Lactation Yield. Computing Lactation Yield
of ICAR Guidelines Computing of Accumulated Lactation Yield Table of Contents 1 The Test Interval Method (TIM) (Sargent, 1968)... 4 2 Interpolation using Standard Lactation Curves (ISLC) (Wilmink, 1987)...
More informationGenetic relationships and trait comparisons between and within lines of local dual purpose cattle
67 th Annual meeting of the European Association for Animal Production Belfast, 2016 Genetic relationships and trait comparisons between and within lines of local dual purpose cattle M. Jaeger, K. Brügemann,
More informationEffects of inbreeding on milk production, fertility, and somatic cell count in Norwegian Red
NORWEGIAN UNIVERSITY OF LIFE SCIENCES Effects of inbreeding on milk production, fertility, and somatic cell count in Norwegian Red K. Hov Martinsen 1, E. Sehested 2 and B. Heringstad* 1,2 1, Norwegian
More informationSimulation Study on Heterogeneous Variance Adjustment for Observations with Different Measurement Error Variance
Simulation Study on Heterogeneous Variance Adjustment for Observations with Different Measurement Error Variance Pitkänen, T. 1, Mäntysaari, E. A. 1, Nielsen, U. S., Aamand, G. P 3., Madsen 4, P. and Lidauer,
More informationMIXED MODELS THE GENERAL MIXED MODEL
MIXED MODELS This chapter introduces best linear unbiased prediction (BLUP), a general method for predicting random effects, while Chapter 27 is concerned with the estimation of variances by restricted
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationMODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES
MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by
More informationCase-Control Association Testing. Case-Control Association Testing
Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies
More informationPrediction of Future Milk Yield with Random Regression Model Using Test-day Records in Holstein Cows
9 ` Asian-Aust. J. Anim. Sci. Vol. 19, No. 7 : 9-921 July 26 www.ajas.info Prediction of Future Milk Yield with Random Regression Model Using Test-day Records in Holstein Cows Byoungho Park and Deukhwan
More informationComputational Systems Biology: Biology X
Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,
More informationLecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013
Lecture 5: BLUP (Best Linear Unbiased Predictors) of genetic values Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 013 1 Estimation of Var(A) and Breeding Values in General Pedigrees The classic
More informationBreeding Values and Inbreeding. Breeding Values and Inbreeding
Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A
More informationLECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50
LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if
More information(Genome-wide) association analysis
(Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by
More informationGenetic parameters for various random regression models to describe total sperm cells per ejaculate over the reproductive lifetime of boars
Published December 8, 2014 Genetic parameters for various random regression models to describe total sperm cells per ejaculate over the reproductive lifetime of boars S. H. Oh,* M. T. See,* 1 T. E. Long,
More informationAnimal Models. Sheep are scanned at maturity by ultrasound(us) to determine the amount of fat surrounding the muscle. A model (equation) might be
Animal Models 1 Introduction An animal model is one in which there are one or more observations per animal, and all factors affecting those observations are described including an animal additive genetic
More informationAnimal Model. 2. The association of alleles from the two parents is assumed to be at random.
Animal Model 1 Introduction In animal genetics, measurements are taken on individual animals, and thus, the model of analysis should include the animal additive genetic effect. The remaining items in the
More informationContrasting Models for Lactation Curve Analysis
J. Dairy Sci. 85:968 975 American Dairy Science Association, 2002. Contrasting Models for Lactation Curve Analysis F. Jaffrezic,*, I. M. S. White,* R. Thompson, and P. M. Visscher* *Institute of Cell,
More informationOverview. Background
Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems
More informationAssociation Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5
Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative
More informationQuantitative Genetics I: Traits controlled my many loci. Quantitative Genetics: Traits controlled my many loci
Quantitative Genetics: Traits controlled my many loci So far in our discussions, we have focused on understanding how selection works on a small number of loci (1 or 2). However in many cases, evolutionary
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary
More informationI Have the Power in QTL linkage: single and multilocus analysis
I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department
More informationGenetic Parameter Estimation for Milk Yield over Multiple Parities and Various Lengths of Lactation in Danish Jerseys by Random Regression Models
J. Dairy Sci. 85:1596 1606 American Dairy Science Association, 2002. Genetic Parameter Estimation for Milk Yield over Multiple Parities and Various Lengths of Lactation in Danish Jerseys by Random Regression
More informationQuantitative characters - exercises
Quantitative characters - exercises 1. a) Calculate the genetic covariance between half sibs, expressed in the ij notation (Cockerham's notation), when up to loci are considered. b) Calculate the genetic
More informationProportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power
Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April
More informationCausal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables
Causal inference in biomedical sciences: causal models involving genotypes Causal models for observational data Instrumental variables estimation and Mendelian randomization Krista Fischer Estonian Genome
More informationGBLUP and G matrices 1
GBLUP and G matrices 1 GBLUP from SNP-BLUP We have defined breeding values as sum of SNP effects:! = #$ To refer breeding values to an average value of 0, we adopt the centered coding for genotypes described
More informationImpact of Using Reduced Rank Random Regression Test-Day Model on Genetic Evaluation
Impact of Using Reduced Rank Random Regression Test-Day on Genetic Evaluation H. Leclerc 1, I. Nagy 2 and V. Ducrocq 2 1 Institut de l Elevage, Département Génétique, Bât 211, 78 352 Jouy-en-Josas, France
More informationMethods for Cryptic Structure. Methods for Cryptic Structure
Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases
More informationGenotyping strategy and reference population
GS cattle workshop Genotyping strategy and reference population Effect of size of reference group (Esa Mäntysaari, MTT) Effect of adding females to the reference population (Minna Koivula, MTT) Value of
More informationLecture 6. QTL Mapping
Lecture 6 QTL Mapping Bruce Walsh. Aug 2003. Nordic Summer Course MAPPING USING INBRED LINE CROSSES We start by considering crosses between inbred lines. The analysis of such crosses illustrates many of
More informationGenetic parameters for female fertility in Nordic dairy cattle
Genetic parameters for female fertility in Nordic dairy cattle K.Muuttoranta 1, A-M. Tyrisevä 1, E.A. Mäntysaari 1, J.Pösö 2, G.P. Aamand 3, J-Å. Eriksson 4, U.S. Nielsen 5, and M. Lidauer 1 1 Natural
More informationLecture 11: Multiple trait models for QTL analysis
Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple
More informationMultiple-Trait Across-Country Evaluations Using Singular (Co)Variance Matrix and Random Regression Model
Multiple-rait Across-Country Evaluations Using Singular (Co)Variance Matrix and Random Regression Model Esa A. Mäntysaari M Agrifood Research Finland, Animal Production, SF 31600 Jokioinen 1. Introduction
More informationChapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype.
Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype. please read pages 38-47; 49-55;57-63. Slide 1 of Chapter 2 1 Extension sot Mendelian Behavior of Genes Single gene inheritance
More informationLecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011
Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Quantitative Genetics The analysis of traits whose variation is determined by both a
More informationLecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017
Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping
More informationChap 1. Overview of Statistical Learning (HTF, , 2.9) Yongdai Kim Seoul National University
Chap 1. Overview of Statistical Learning (HTF, 2.1-2.6, 2.9) Yongdai Kim Seoul National University 0. Learning vs Statistical learning Learning procedure Construct a claim by observing data or using logics
More informationGenetic Parameters for Stillbirth in the Netherlands
Genetic Parameters for Stillbirth in the Netherlands Arnold Harbers, Linda Segeren and Gerben de Jong CR Delta, P.O. Box 454, 68 AL Arnhem, The Netherlands Harbers.A@CR-Delta.nl 1. Introduction Stillbirth
More informationExamining the Relative Influence of Familial, Genetic and Covariate Information In Flexible Risk Models. Grace Wahba
Examining the Relative Influence of Familial, Genetic and Covariate Information In Flexible Risk Models Grace Wahba Based on a paper of the same name which has appeared in PNAS May 19, 2009, by Hector
More informationMixed-Models. version 30 October 2011
Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector
More informationLecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition
Lecture 3 Introduction on Quantitative Genetics: I Fisher s Variance Decomposition Bruce Walsh. Aug 004. Royal Veterinary and Agricultural University, Denmark Contribution of a Locus to the Phenotypic
More informationHeritability estimation in modern genetics and connections to some new results for quadratic forms in statistics
Heritability estimation in modern genetics and connections to some new results for quadratic forms in statistics Lee H. Dicker Rutgers University and Amazon, NYC Based on joint work with Ruijun Ma (Rutgers),
More informationLecture 4 Multiple linear regression
Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters
More informationQuantitative characters II: heritability
Quantitative characters II: heritability The variance of a trait (x) is the average squared deviation of x from its mean: V P = (1/n)Σ(x-m x ) 2 This total phenotypic variance can be partitioned into components:
More informationB) Describe the structures and functions of a Paramecium. Draw a Paramecium.
Living Organisms & Genetics Final Exam Review Answers A) Describe the structures and functions of a Euglena. Draw a Euglena. Eyespot Helps the Euglena to detect light so it can move to a bright area to
More informationCase Studies in Ecology and Evolution
3 Non-random mating, Inbreeding and Population Structure. Jewelweed, Impatiens capensis, is a common woodland flower in the Eastern US. You may have seen the swollen seed pods that explosively pop when
More informationMixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012
Mixed-Model Estimation of genetic variances Bruce Walsh lecture notes Uppsala EQG 01 course version 8 Jan 01 Estimation of Var(A) and Breeding Values in General Pedigrees The above designs (ANOVA, P-O
More informationPopulation Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda
1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;
More informationBest unbiased linear Prediction: Sire and Animal models
Best unbiased linear Prediction: Sire and Animal models Raphael Mrode Training in quantitative genetics and genomics 3 th May to th June 26 ILRI, Nairobi Partner Logo Partner Logo BLUP The MME of provided
More informationLecture 32: Infinite-dimensional/Functionvalued. Functions and Random Regressions. Bruce Walsh lecture notes Synbreed course version 11 July 2013
Lecture 32: Infinite-dimensional/Functionvalued Traits: Covariance Functions and Random Regressions Bruce Walsh lecture notes Synbreed course version 11 July 2013 1 Longitudinal traits Many classic quantitative
More informationEvaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters
J. Dairy Sci. 88:2632 2642 American Dairy Science Association, 2005. Evaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters R. M. Sawalha, 1 J.
More informationJust to review Genetics and Cells? How do Genetics and Cells Relate? The cell s NUCLEUS contains all the genetic information.
Just to review Genetics and Cells? How do Genetics and Cells Relate? The cell s NUCLEUS contains all the genetic information. It s called: DNA A. Describe what Gregor Mendel discovered in his experiments
More informationA Modern Look at Classical Multivariate Techniques
A Modern Look at Classical Multivariate Techniques Yoonkyung Lee Department of Statistics The Ohio State University March 16-20, 2015 The 13th School of Probability and Statistics CIMAT, Guanajuato, Mexico
More informationNormal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,
Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability
More informationRobustness and Reproducing Kernel Hilbert Spaces. Grace Wahba
Robustness and Reproducing Kernel Hilbert Spaces Grace Wahba Part 1. Regularized Kernel Estimation RKE. (Robustly) Part 2. Smoothing Spline ANOVA SS-ANOVA and RKE. Part 3. Partly missing covariates SS-ANOVA
More informationLegend: S spotted Genotypes: P1 SS & ss F1 Ss ss plain F2 (with ratio) 1SS :2 WSs: 1ss. Legend W white White bull 1 Ww red cows ww ww red
On my honor, this is my work GENETICS 310 EXAM 1 June 8, 2018 I. Following are 3 sets of data collected from crosses: 1. Spotted by Plain gave all spotted in the F1 and 9 spotted and 3 plain in the F2.
More informationDNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to
1 1 1 1 1 1 1 1 0 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution
More informationEstimates of genetic parameters for total milk yield over multiple ages in Brazilian Murrah buffaloes using different models
Estimates of genetic parameters for total milk yield over multiple ages in Brazilian Murrah buffaloes using different models R.C. Sesana 1, F. Baldi 1, R.R.A. Borquis 1, A.B. Bignardi 1, N.A. Hurtado-Lugo
More informationLINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES SECOND EDITION
LINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES SECOND EDITION LINEAR MODELS FOR THE PREDICTION OF ANIMAL BREEDING VALUES Second Edition R.A. Mrode, PhD Scottish Agricultural College Sir Stephen
More informationCausal Graphical Models in Quantitative Genetics and Genomics
Causal Graphical Models in Quantitative Genetics and Genomics Guilherme J. M. Rosa Department of Animal Sciences Department of Biostatistics & Medical Informatics OUTLINE Introduction: Correlation and
More informationgenome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next
genetics the study of heredity heredity sequence of DNA that codes for a protein and thus determines a trait genome a specific characteristic that varies from one individual to another gene trait the passing
More informationBayesian QTL mapping using skewed Student-t distributions
Genet. Sel. Evol. 34 00) 1 1 1 INRA, EDP Sciences, 00 DOI: 10.1051/gse:001001 Original article Bayesian QTL mapping using skewed Student-t distributions Peter VON ROHR a,b, Ina HOESCHELE a, a Departments
More informationStatistical issues in QTL mapping in mice
Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping
More informationStatistics 203: Introduction to Regression and Analysis of Variance Course review
Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationEXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series:
Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 7 Exercise 7.. Derive the two scales of relation for each of the two following recurrent series: u: 0, 8, 6, 48, 46,L 36 7
More informationThe concept of breeding value. Gene251/351 Lecture 5
The concept of breeding value Gene251/351 Lecture 5 Key terms Estimated breeding value (EB) Heritability Contemporary groups Reading: No prescribed reading from Simm s book. Revision: Quantitative traits
More informationLecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015
Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.
More informationCrosses. Computation APY Sherman-Woodbury «hybrid» model. Unknown parent groups Need to modify H to include them (Misztal et al., 2013) Metafounders
Details in ssgblup Details in SSGBLUP Storage Inbreeding G is not invertible («blending») G might not explain all genetic variance («blending») Compatibility of G and A22 Assumption p(u 2 )=N(0,G) If there
More information1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics
1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
More informationLinkage and Linkage Disequilibrium
Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies
More informationLecture 7 Correlated Characters
Lecture 7 Correlated Characters Bruce Walsh. Sept 2007. Summer Institute on Statistical Genetics, Liège Genetic and Environmental Correlations Many characters are positively or negatively correlated at
More informationInversion Base Height. Daggot Pressure Gradient Visibility (miles)
Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:
More informationLinear Models for the Prediction of Animal Breeding Values
Linear Models for the Prediction of Animal Breeding Values R.A. Mrode, PhD Animal Data Centre Fox Talbot House Greenways Business Park Bellinger Close Chippenham Wilts, UK CAB INTERNATIONAL Preface ix
More informationRepeated Records Animal Model
Repeated Records Animal Model 1 Introduction Animals are observed more than once for some traits, such as Fleece weight of sheep in different years. Calf records of a beef cow over time. Test day records
More informationPrediction of genetic Values using Neural Networks
Prediction of genetic Values using Neural Networks Paulino Perez 1 Daniel Gianola 2 Jose Crossa 1 1 CIMMyT-Mexico 2 University of Wisconsin, Madison. September, 2014 SLU,Sweden Prediction of genetic Values
More information3 Comparison with Other Dummy Variable Methods
Stats 300C: Theory of Statistics Spring 2018 Lecture 11 April 25, 2018 Prof. Emmanuel Candès Scribe: Emmanuel Candès, Michael Celentano, Zijun Gao, Shuangning Li 1 Outline Agenda: Knockoffs 1. Introduction
More informationEXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?
Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population
More informationResemblance between relatives
Resemblance between relatives 1 Key concepts Model phenotypes by fixed effects and random effects including genetic value (additive, dominance, epistatic) Model covariance of genetic effects by relationship
More information3. Properties of the relationship matrix
3. Properties of the relationship matrix 3.1 Partitioning of the relationship matrix The additive relationship matrix, A, can be written as the product of a lower triangular matrix, T, a diagonal matrix,
More informationRESTRICTED M A X I M U M LIKELIHOOD TO E S T I M A T E GENETIC P A R A M E T E R S - IN PRACTICE
RESTRICTED M A X I M U M LIKELIHOOD TO E S T I M A T E GENETIC P A R A M E T E R S - IN PRACTICE K. M e y e r Institute of Animal Genetics, Edinburgh University, W e s t M a i n s Road, Edinburgh EH9 3JN,
More informationGeneralized Additive Models
Generalized Additive Models The Model The GLM is: g( µ) = ß 0 + ß 1 x 1 + ß 2 x 2 +... + ß k x k The generalization to the GAM is: g(µ) = ß 0 + f 1 (x 1 ) + f 2 (x 2 ) +... + f k (x k ) where the functions
More informationNCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157)
NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157) Evidence Statement Q1 Expected coverage Merit Excellence
More informationLecture 02 Linear classification methods I
Lecture 02 Linear classification methods I 22 January 2016 Taylor B. Arnold Yale Statistics STAT 365/665 1/32 Coursewebsite: A copy of the whole course syllabus, including a more detailed description of
More informationINTRODUCTION TO ANIMAL BREEDING. Lecture Nr 4. The efficiency of selection The selection programmes
INTRODUCTION TO ANIMAL BREEDING Lecture Nr 4 The efficiency of selection The selection programmes Etienne Verrier INA Paris-Grignon, Animal Sciences Department Verrier@inapg.fr The genetic gain and its
More informationLecture 2: Linear and Mixed Models
Lecture 2: Linear and Mixed Models Bruce Walsh lecture notes Introduction to Mixed Models SISG, Seattle 18 20 July 2018 1 Quick Review of the Major Points The general linear model can be written as y =
More informationCausal Inference Basics
Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,
More informationCommon Mating Designs in Agricultural Research and Their Reliability in Estimation of Genetic Parameters
IOSR Journal of Agriculture and Veterinary Science (IOSR-JAVS) e-issn: 2319-2380, p-issn: 2319-2372. Volume 11, Issue 7 Ver. II (July 2018), PP 16-36 www.iosrjournals.org Common Mating Designs in Agricultural
More informationSTA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).
STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population
More informationPrediction of the Confidence Interval of Quantitative Trait Loci Location
Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28
More informationPartitioning Genetic Variance
PSYC 510: Partitioning Genetic Variance (09/17/03) 1 Partitioning Genetic Variance Here, mathematical models are developed for the computation of different types of genetic variance. Several substantive
More informationNeed for Several Predictor Variables
Multiple regression One of the most widely used tools in statistical analysis Matrix expressions for multiple regression are the same as for simple linear regression Need for Several Predictor Variables
More informationBayesian Inference of Interactions and Associations
Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,
More informationModel Selection for Multiple QTL
Model Selection for Multiple TL 1. reality of multiple TL 3-8. selecting a class of TL models 9-15 3. comparing TL models 16-4 TL model selection criteria issues of detecting epistasis 4. simulations and
More informationIntroduction to Nonparametric Regression
Introduction to Nonparametric Regression Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)
More informationChapter 7 The Genetic Model for Quantitative Traits
Chapter 7 The Genetic Model for Quantitative Traits I. The Basic Model II. Breeding Value III. Gene Combination Value IV. Producing Ability Chapter 7 The Genetic Model for Quantitative Traits Learning
More informationThe Quantitative TDT
The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus
More information