On Computation of P-values in Parametric Linkage Analysis
|
|
- Jonas Cross
- 5 years ago
- Views:
Transcription
1 On Computation of P-values in Parametric Linkage Analysis Azra Kurbašić Centre for Mathematical Sciences Mathematical Statistics Lund University p.1/22
2 Parametric (v. Nonparametric) Analysis The genetic model is assumed to be known. (Although there are parametric models that are mode of inheritance free). p.2/22
3 Parametric (v. Nonparametric) Analysis The genetic model is assumed to be known. (Although there are parametric models that are mode of inheritance free). The parametric analysis is more powerful than nonparametric when the underlying genetic model is known and true, cf. Sham (1998). p.2/22
4 The lod score One marker I Z( ; P(Y, M, ) = log 10 P(Y, M 0.5, ) ) is used for testing H 0 : H 1 : [0, 0.5). = 0.5 against : the genetic model parameters : the recombination fraction (distance) between the marker and disease locus. Y and M denote the collection of disease phenotypes and marker data, respectively p.3/22
5 One marker II The maximal lod score Z max ( ) = sup Z( ; ) = (2 ln 10) 1 2 ln sup P(Y, M, P(Y, M 0.5, ) ) p.4/22
6 One marker II The maximal lod score Z max ( ) = sup Z( ; ) = (2 ln 10) 1 2 ln sup P(Y, M, P(Y, M 0.5, ) ) is asymptotically (2 ln 10) 1 ( (0) (1)) distributed under H 0. p.4/22
7 One marker II The maximal lod score Z max ( ) = sup Z( ; ) = (2 ln 10) 1 2 ln sup P(Y, M, P(Y, M 0.5, ) ) is asymptotically (2 ln 10) 1 ( (0) (1)) distributed under H 0. A mixture of 2 distributions arises because of the structure of the parameter space, cf. Self and Lian (1987) p.4/22
8 One marker III The same asymptotic limit distribution also appears when is misspecified. p.5/22
9 One marker III The same asymptotic limit distribution also appears when is misspecified. The significance level (T ) = P H0 (Z max ( ) T ) is asymptotically independent of the nuisance parameter. T is a given threshold. p.5/22
10 One marker III The same asymptotic limit distribution also appears when is misspecified. The significance level (T ) = P H0 (Z max ( ) T ) is asymptotically independent of the nuisance parameter. T is a given threshold. There is asymptotically a one-to-one correspondence between Z max and the p-value (Z max ). p.5/22
11 Multipoint Analysis I A simultaneous analysis of two/several linked markers on a chromosome is called a multipoint analysis. p.6/22
12 Multipoint Analysis I A simultaneous analysis of two/several linked markers on a chromosome is called a multipoint analysis. The parameter of interest in multipoint analysis is the disease locus x measured in Morgans. p.6/22
13 Multipoint Analysis I A simultaneous analysis of two/several linked markers on a chromosome is called a multipoint analysis. The parameter of interest in multipoint analysis is the disease locus x measured in Morgans. Lod Score is defined as Z(x; P(Y, M x, ) = log 10 P(Y, M, ) ), where x = corresponds to H 0. p.6/22
14 Multipoint Analysis I A simultaneous analysis of two/several linked markers on a chromosome is called a multipoint analysis. The parameter of interest in multipoint analysis is the disease locus x measured in Morgans. Lod Score is defined as Z(x; P(Y, M x, ) = log 10 P(Y, M, ) ), where x = corresponds to H 0. The maximal lod score, Z max ( used as measure of linkage. ) = max x H1 Z(x; ), is p.6/22
15 Multipoint Analysis II The parameter space is no longer connected!... l l l l k } H 1 H0 K : the number of chromosomes l i : the genetic length of chromosome p.7/22
16 Multipoint Analysis III The maximal lod score does no longer converge to a limiting distribution under H 0 as the number of pedigrees grows. p.8/22
17 Multipoint Analysis III The maximal lod score does no longer converge to a limiting distribution under H 0 as the number of pedigrees grows. It has a degenerated limit distribution at as the number of families N. p.8/22
18 Multipoint Analysis III The maximal lod score does no longer converge to a limiting distribution under H 0 as the number of pedigrees grows. It has a degenerated limit distribution at as the number of families N. The significance for a given threshold T depends heavily on, the number of pedigrees, their structure, and the disease and marker phenotypes. p.8/22
19 Multipoint Analysis III The maximal lod score does no longer converge to a limiting distribution under H 0 as the number of pedigrees grows. It has a degenerated limit distribution at as the number of families N. The significance for a given threshold T depends heavily on, the number of pedigrees, their structure, and the disease and marker phenotypes. Let us take a clooser look at the distribution! p.8/22
20 Pointwise Distribution I Multipoint analysis for a family with m meioses: p.9/22
21 Pointwise Distribution I Multipoint analysis for a family with m meioses: Scoring function, cf. Kruglyak et al (1996): S(v) = P(Y v) w V P uniform(w)p(y w) = P(Y v) w V 2 m P(Y w) p.9/22
22 Pointwise Distribution I Multipoint analysis for a family with m meioses: Scoring function, cf. Kruglyak et al (1996): S(v) = P(Y v) w V P uniform(w)p(y w) = P(Y v) w V 2 m P(Y w) Distribution: P x (w) = P(v(x) = w M) p.9/22
23 Pointwise Distribution I Multipoint analysis for a family with m meioses: Scoring function, cf. Kruglyak et al (1996): S(v) = P(Y v) w V P uniform(w)p(y w) = P(Y v) w V 2 m P(Y w) Distribution: P x (w) = P(v(x) = w M) Lod score: Z(x) = log 10 LR(x) = log 10 w S(w)P x (w) p.9/22
24 Pointwise Distribution II Two extreme cases of marker informativity (for one pedigree): perfect marker data: LR(x) = S(v(x)) no marker information at all: P x = P uniform Total multipoint lod score for N families is defined as N Z(x) = Z i (x). i=1 p.10/22
25 Pointwise Distribution II Two extreme cases of marker informativity (for one pedigree): perfect marker data: LR(x) = S(v(x)) no marker information at all: P x = P uniform Total multipoint lod score for N families is defined as N Z(x) = Z i (x). i=1 For two-point analysis we just replace x by Kurbasic & Hössjer (2003)., cf. p.10/22
26 Some Properties of Lod Score From the fact that the H 0 distribution of v(x) is P uniform and LR(x) = E(LR p (x) M) we get (i) E H0 (LR(x)) = E H0 (LR p (x)) = 1 (ii) Var H0 (LR(x)) Var H0 (LR p (x)) when the var. exist (iii) E H0 (Z p (x)) E H0 (Z(x)) 0, p.11/22
27 Some Properties of Lod Score From the fact that the H 0 distribution of v(x) is P uniform and LR(x) = E(LR p (x) M) we get (i) E H0 (LR(x)) = E H0 (LR p (x)) = 1 (ii) Var H0 (LR(x)) Var H0 (LR p (x)) when the var. exist (iii) E H0 (Z p (x)) E H0 (Z(x)) 0, (ii) and (iii) are deduced from Jensens inequality, cf. Royden (1968). p.11/22
28 Some Properties of Lod Score From the fact that the H 0 distribution of v(x) is P uniform and LR(x) = E(LR p (x) M) we get (i) E H0 (LR(x)) = E H0 (LR p (x)) = 1 (ii) Var H0 (LR(x)) Var H0 (LR p (x)) when the var. exist (iii) E H0 (Z p (x)) E H0 (Z(x)) 0, (ii) and (iii) are deduced from Jensens inequality, cf. Royden (1968). We conjecture that in most cases (iv) Var H0 (Z(x)) Var H0 (Z p (x)). p.11/22
29 Model for Lod Score Distribution I Equation (i) is true if the following is fulfilled 1 = E H0 [e ln 10Z ] = M(ln 10) = e c ln 10 (1 a ln 10) b c = b ln(1 a ln 10) ln 10 where M(t) = E H0 (e tz ) = e ct (1 at) b. =: c(a, b), p.12/22
30 Model for Lod Score Distribution I Equation (i) is true if the following is fulfilled 1 = E H0 [e ln 10Z ] = M(ln 10) = e c ln 10 (1 a ln 10) b c = b ln(1 a ln 10) ln 10 where M(t) = E H0 (e tz ) = e ct (1 at) b. =: c(a, b), This gives us a model Z(x) H 0 free parameters: (a, b, c) with two p.12/22
31 Model for Lod Score Distribution I Equation (i) is true if the following is fulfilled 1 = E H0 [e ln 10Z ] = M(ln 10) = e c ln 10 (1 a ln 10) b c = b ln(1 a ln 10) ln 10 where M(t) = E H0 (e tz ) = e ct (1 at) b. =: c(a, b), This gives us a model Z(x) H 0 (a, b, c) with two free parameters: the strength of the genetic model 0 < a < (ln 10) 1 p.12/22
32 Model for Lod Score Distribution I Equation (i) is true if the following is fulfilled 1 = E H0 [e ln 10Z ] = M(ln 10) = e c ln 10 (1 a ln 10) b c = b ln(1 a ln 10) ln 10 where M(t) = E H0 (e tz ) = e ct (1 at) b. =: c(a, b), This gives us a model Z(x) H 0 (a, b, c) with two free parameters: the strength of the genetic model 0 < a < (ln 10) 1 the proportion to the effective number of families in data b > 0 p.12/22
33 Significance Levels for Multipoint Scores Monte Carlo method: very general but can be slow, especially for large pedigrees and low marker informativity. Can be used for both perfect and non perfect data. p.13/22
34 Significance Levels for Multipoint Scores Monte Carlo method: very general but can be slow, especially for large pedigrees and low marker informativity. Can be used for both perfect and non perfect data. Perfect marker data: use analytical methods based on Gaussian extreme value theory i.e. normal approximation and adjusted normal approximation, cf. Lander & Kruglyak (1995), and Ängquist & Hössjer (2003). p.13/22
35 Significance Levels for Multipoint Scores Monte Carlo method: very general but can be slow, especially for large pedigrees and low marker informativity. Can be used for both perfect and non perfect data. Perfect marker data: use analytical methods based on Gaussian extreme value theory i.e. normal approximation and adjusted normal approximation, cf. Lander & Kruglyak (1995), and Ängquist & Hössjer (2003). When using the analytical methods above one should use standardized lod scores, i.e. Z(x) = N Z N i=1 i i (x)/ 2 i=1 i where Z i = (Z i (x) i)/ i for family i. p.13/22
36 Pedigrees used in the study. 1) ) ICEL80002) ) unaffected affected unknown Squares: males Circles: females " " " " " " # # # # # # $ $ $ $ $ $ $ $ & & & & 4) ICEL80004) * * Monozygotic twins are replaced by a single contrived individual p.14/22
37 Results Standardized lod scores Z(x) turned out very useful when calculating p-values in parametric linkage analysis. p.15/22
38 Results Standardized lod scores Z(x) turned out very useful when calculating p-values in parametric linkage analysis. Perfect marker data: stronger genetic model/ larger pedigree/ increased sample size gives a flatter p value curve and lower p value. p.15/22
39 Results Standardized lod scores Z(x) turned out very useful when calculating p-values in parametric linkage analysis. Perfect marker data: stronger genetic model/ larger pedigree/ increased sample size gives a flatter p value curve and lower p value. Imperfect marker data: The effect of marker informativity is stronger for the larger pedigrees. p.15/22
40 Results Standardized lod scores Z(x) turned out very useful when calculating p-values in parametric linkage analysis. Perfect marker data: stronger genetic model/ larger pedigree/ increased sample size gives a flatter p value curve and lower p value. Imperfect marker data: The effect of marker informativity is stronger for the larger pedigrees. Comparisons show that the perfect marker data approximations for lod score p-values can be either conservative or anticonservative, depending on the genetic model. p.15/22
41 Summary As a practical implication of our work, we suggest, if traditional lod scores are to be used in multipoint linkage analysis, that they are reported together with p-values. Alternatively, the standardized version Z could be used. p.16/22
42 Summary As a practical implication of our work, we suggest, if traditional lod scores are to be used in multipoint linkage analysis, that they are reported together with p-values. Alternatively, the standardized version Z could be used. For imperfect data we suggest to use nonparametric scores defined in Lander & Kruglyak (1995) or the approach in Kong & Cox (1997). p.16/22
43 Summary As a practical implication of our work, we suggest, if traditional lod scores are to be used in multipoint linkage analysis, that they are reported together with p-values. Alternatively, the standardized version Z could be used. For imperfect data we suggest to use nonparametric scores defined in Lander & Kruglyak (1995) or the approach in Kong & Cox (1997). How Z compares with other scores S in terms of power and robustness to model misspecification will depend on both the true and assumed genetic model, pedigree structure(s), sample size, etc. This is certainly a topic for future research. p.16/22
44 Acknowledgements Ola Hössjer, Pär-Ola Bendahl, Adalgeir Arason, Rosa Barkardottir, Lars Ängquist, Henrik Bengtsson, James Hakim and Joakim Lübeck. p.17/22
45 Acknowledgements Ola Hössjer, Pär-Ola Bendahl, Adalgeir Arason, Rosa Barkardottir, Lars Ängquist, Henrik Bengtsson, James Hakim and Joakim Lübeck. Thank you for your attention! p.17/22
46 References Kong A and Cox NJ: Allele-Sharing Models: LOD Scores and Accurate Linkage Tests. Am J Hum Genet 1997;61: Kruglyak L, Daly MJ, Reeve-Daly MP and Lander ES: Parametric and non-parametric Linkage Analysis: A Unified Multipoint Approach. Am J Hum Genet 1996;58: Kurbašić A and Hössjer O: On Computation of P-values in Parametric Linkage Analysis. To appear in Human Heredity. Lander E and Kruglyak L: Genetic dissection of Complex Traits: Guidelines for Interpreting and Reporting Linkage Results. Nature Genetics 1995;11: p.18/22
47 Self SG and Lian KY: Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests Under Nonstandard Conditions. Journal of the American Statistical Association 1987;82: Sham P: Statistics in Human Genetics. Arnold, a Member of the Holder Headline group Royden HL: Real analysis, 2d ed. Macmillan Publishing, New York Ängquist L and Hössjer O: Improving the Calculation of Statistical Significance in Genome-Wide Scans. Preprints in Mathematical Sciences, Centre for Mathematical Sciences, Lund University p.19/22
48 p.20/22 Appendix
49 Model for Lod Score Distribution II The addition of the lod score stays in the model, i.e. Z = Z 1 + Z 2 H 0 (a, b 1 + b 2, c(a, b 1 + b 2 ) The expectation under the H 0 0 = E H0 (Z) = c + ab = b(a + ln(1 a ln 10) ln 10 ) < 0 Further: 0 = a b 1 = ELOD = E H1 (Z) = zdp H1 (z) = zlr(z)dp H0 (z) = ze ln 10z dp H0 (z) = E H0 (Ze ln 10Z ) = M (ln 10) = M(ln 10)(c + ab ln(1 a ln 10) a 1 a ln 10 ) = b( ln a ln 10 ) p.21/22
50 Model for Lod Score Distribution III Imperfect Data: 1 Let a (0, ln 10 ) be a genetic model A weakened genetic model is a, 0 < 1 Equations (ii) (iv) must always be fullfilled. It follows from the fact that Z H 0 ( a, b, c( a, b)) Var H0 (LR) = E H0 (LR 2 ) E H0 (LR) 2 = E H0 (e 2 ln 10Z ) 1 2 = M Z (2 ln 10) 1 = e c( a,b)2 ln 10 (1 2 a ln 10) b 1 = (1 a ln 10) 2b (1 2 a ln 10) b 1 < (1 a ln 10) 2b (1 2a ln 10) b 1 = V H0 (LR p ) This assumes that both variances exist, that is 0 < a < (2 ln 10) 1. In the inequality we used V H0 (LR) = (1 x) 2b /(1 2x) b 1, where x = ln 10, which is an increasing function of x. p.22/22
The Lander-Green Algorithm. Biostatistics 666 Lecture 22
The Lander-Green Algorithm Biostatistics 666 Lecture Last Lecture Relationship Inferrence Likelihood of genotype data Adapt calculation to different relationships Siblings Half-Siblings Unrelated individuals
More informationTutorial Session 2. MCMC for the analysis of genetic data on pedigrees:
MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation
More informationVariance Component Models for Quantitative Traits. Biostatistics 666
Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond
More informationIntroduction to QTL mapping in model organisms
Human vs mouse Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] www.daviddeen.com
More informationThe Admixture Model in Linkage Analysis
The Admixture Model in Linkage Analysis Jie Peng D. Siegmund Department of Statistics, Stanford University, Stanford, CA 94305 SUMMARY We study an appropriate version of the score statistic to test the
More informationAffected Sibling Pairs. Biostatistics 666
Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm
More informationFor 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.
STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).
More informationStatistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 4, Issue 1 2005 Article 11 Combined Association and Linkage Analysis for General Pedigrees and Genetic Models Ola Hössjer University of
More information1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics
1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
More informationAnalytic power calculation for QTL linkage analysis of small pedigrees
(2001) 9, 335 ± 340 ã 2001 Nature Publishing Group All rights reserved 1018-4813/01 $15.00 www.nature.com/ejhg ARTICLE for QTL linkage analysis of small pedigrees FruÈhling V Rijsdijk*,1, John K Hewitt
More informationPrediction of the Confidence Interval of Quantitative Trait Loci Location
Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28
More informationA Robust Identity-by-Descent Procedure Using Affected Sib Pairs: Multipoint Mapping for Complex Diseases
Original Paper Hum Hered 001;51:64 78 Received: May 1, 1999 Revision received: September 10, 1999 Accepted: October 6, 1999 A Robust Identity-by-Descent Procedure Using Affected Sib Pairs: Multipoint Mapping
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationOptimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives
Genetic Epidemiology 16:225 249 (1999) Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Mary Sara McPeek* Department of Statistics, University of Chicago, Chicago, Illinois
More informationGene mapping in model organisms
Gene mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Goal Identify genes that contribute to common human diseases. 2
More informationExpression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia
Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.
More informationUNIT 8 BIOLOGY: Meiosis and Heredity Page 148
UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular
More informationTesting for Homogeneity in Genetic Linkage Analysis
Testing for Homogeneity in Genetic Linkage Analysis Yuejiao Fu, 1, Jiahua Chen 2 and John D. Kalbfleisch 3 1 Department of Mathematics and Statistics, York University Toronto, ON, M3J 1P3, Canada 2 Department
More informationModeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17
Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationLinkage and Linkage Disequilibrium
Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies
More informationStatistical issues in QTL mapping in mice
Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping
More informationMCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES
MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES Elizabeth A. Thompson Department of Statistics, University of Washington Box 354322, Seattle, WA 98195-4322, USA Email: thompson@stat.washington.edu This
More informationPower and Robustness of Linkage Tests for Quantitative Traits in General Pedigrees
Johns Hopkins University, Dept. of Biostatistics Working Papers 1-5-2004 Power and Robustness of Linkage Tests for Quantitative Traits in General Pedigrees Weimin Chen Johns Hopkins Bloomberg School of
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Backcross P 1 P 2 P 1 F 1 BC 4
More informationThe universal validity of the possible triangle constraint for Affected-Sib-Pairs
The Canadian Journal of Statistics Vol. 31, No.?, 2003, Pages???-??? La revue canadienne de statistique The universal validity of the possible triangle constraint for Affected-Sib-Pairs Zeny Z. Feng, Jiahua
More informationDNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to
1 1 1 1 1 1 1 1 0 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution
More informationLinkage Mapping. Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6.
Linkage Mapping Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6. Genetic maps The relative positions of genes on a chromosome can
More informationBS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section. A/a ; B/B ; d/d X A/a ; b/b ; D/d
BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section 1. In the following cross, all genes are on separate chromosomes. A is dominant to a, B is dominant to b and D is dominant
More informationQTL mapping under ascertainment
QTL mapping under ascertainment J. PENG Department of Statistics, University of California, Davis, CA 95616 D. SIEGMUND Department of Statistics, Stanford University, Stanford, CA 94305 February 15, 2006
More informationAFFECTED RELATIVE PAIR LINKAGE STATISTICS THAT MODEL RELATIONSHIP UNCERTAINTY
AFFECTED RELATIVE PAIR LINKAGE STATISTICS THAT MODEL RELATIONSHIP UNCERTAINTY by Amrita Ray BSc, Presidency College, Calcutta, India, 2001 MStat, Indian Statistical Institute, Calcutta, India, 2003 Submitted
More informationI Have the Power in QTL linkage: single and multilocus analysis
I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department
More informationMapping quantitative trait loci in oligogenic models
Biostatistics (2001), 2, 2,pp. 147 162 Printed in Great Britain Mapping quantitative trait loci in oligogenic models HSIU-KHUERN TANG, D. SIEGMUND Department of Statistics, 390 Serra Mall, Sequoia Hall,
More informationQTL Mapping I: Overview and using Inbred Lines
QTL Mapping I: Overview and using Inbred Lines Key idea: Looking for marker-trait associations in collections of relatives If (say) the mean trait value for marker genotype MM is statisically different
More informationSolutions to Problem Set 4
Question 1 Solutions to 7.014 Problem Set 4 Because you have not read much scientific literature, you decide to study the genetics of garden peas. You have two pure breeding pea strains. One that is tall
More informationLesson 4: Understanding Genetics
Lesson 4: Understanding Genetics 1 Terms Alleles Chromosome Co dominance Crossover Deoxyribonucleic acid DNA Dominant Genetic code Genome Genotype Heredity Heritability Heritability estimate Heterozygous
More informationNatural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate
Natural Selection Population Dynamics Humans, Sickle-cell Disease, and Malaria How does a population of humans become resistant to malaria? Overproduction Environmental pressure/competition Pre-existing
More informationBinary trait mapping in experimental crosses with selective genotyping
Genetics: Published Articles Ahead of Print, published on May 4, 2009 as 10.1534/genetics.108.098913 Binary trait mapping in experimental crosses with selective genotyping Ani Manichaikul,1 and Karl W.
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationObjectives. Announcements. Comparison of mitosis and meiosis
Announcements Colloquium sessions for which you can get credit posted on web site: Feb 20, 27 Mar 6, 13, 20 Apr 17, 24 May 15. Review study CD that came with text for lab this week (especially mitosis
More informationGamma frailty model for linkage analysis. with application to interval censored migraine data
Gamma frailty model for linkage analysis with application to interval censored migraine data M.A. Jonker 1, S. Bhulai 1, D.I. Boomsma 2, R.S.L. Ligthart 2, D. Posthuma 2, A.W. van der Vaart 1 1 Department
More informationGene mapping, linkage analysis and computational challenges. Konstantin Strauch
Gene mapping, linkage analysis an computational challenges Konstantin Strauch Institute for Meical Biometry, Informatics, an Epiemiology (IMBIE) University of Bonn E-mail: strauch@uni-bonn.e Genetics an
More informationConstructing a Pedigree
Constructing a Pedigree Use the appropriate symbols: Unaffected Male Unaffected Female Affected Male Affected Female Male carrier of trait Mating of Offspring 2. Label each generation down the left hand
More informationCausal Model Selection Hypothesis Tests in Systems Genetics
1 Causal Model Selection Hypothesis Tests in Systems Genetics Elias Chaibub Neto and Brian S Yandell SISG 2012 July 13, 2012 2 Correlation and Causation The old view of cause and effect... could only fail;
More informationMapping multiple QTL in experimental crosses
Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationMODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES
MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by
More informationAsymptotic properties of the likelihood ratio test statistics with the possible triangle constraint in Affected-Sib-Pair analysis
The Canadian Journal of Statistics Vol.?, No.?, 2006, Pages???-??? La revue canadienne de statistique Asymptotic properties of the likelihood ratio test statistics with the possible triangle constraint
More informationUse of hidden Markov models for QTL mapping
Use of hidden Markov models for QTL mapping Karl W Broman Department of Biostatistics, Johns Hopkins University December 5, 2006 An important aspect of the QTL mapping problem is the treatment of missing
More informationUnit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th
Name: Date: Block: Chapter 6 Meiosis and Mendel Section 6.1 Chromosomes and Meiosis 1. How do gametes differ from somatic cells? Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February
More informationCausal Graphical Models in Systems Genetics
1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation
More informationPowerful Regression-Based Quantitative-Trait Linkage Analysis of General Pedigrees
Am. J. Hum. Genet. 71:38 53, 00 Powerful Regression-Based Quantitative-Trait Linkage Analysis of General Pedigrees Pak C. Sham, 1 Shaun Purcell, 1 Stacey S. Cherny, 1, and Gonçalo R. Abecasis 3 1 Institute
More information2. Map genetic distance between markers
Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,
More informationMultiple QTL mapping
Multiple QTL mapping Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] 1 Why? Reduce residual variation = increased power
More informationInference on pedigree structure from genome screen data. Running title: Inference on pedigree structure. Mary Sara McPeek. The University of Chicago
1 Inference on pedigree structure from genome screen data Running title: Inference on pedigree structure Mary Sara McPeek The University of Chicago Address for correspondence: Department of Statistics,
More informationComputation of Multilocus Prior Probability of Autozygosity for Complex Inbred Pedigrees
Genetic Epidemiology 14:1 15 (1997) Computation of Multilocus Prior Probability of Autozygosity for Complex Inbred Pedigrees Sun-Wei Guo* Department of Biostatistics, University of Michigan, Ann Arbor
More informationConstructing Confidence Intervals for QTL Location. B. Mangin, B. Goffinet and A. Rebai
Copyright 0 1994 by the Genetics Society of America Constructing Confidence Intervals for QTL Location B. Mangin, B. Goffinet and A. Rebai Znstitut National de la Recherche Agronomique, Station de Biomitrie
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationLecture 11: Multiple trait models for QTL analysis
Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple
More informationCh 11.4, 11.5, and 14.1 Review. Game
Ch 11.4, 11.5, and 14.1 Review Game What happens to the chromosome number during meiosis? A It doubles B It stays the same C It halves D It becomes diploid Ans: C Gametes are A Sex cells B Sperm and eggs
More informationp(d g A,g B )p(g B ), g B
Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)
More informationAssociation Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5
Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative
More informationLecture WS Evolutionary Genetics Part I 1
Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in
More informationEstimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty
Estimation of Parameters in Random Effect Models with Incidence Matrix Uncertainty Xia Shen 1,2 and Lars Rönnegård 2,3 1 The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden; 2 School
More informationPopulation Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda
1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;
More informationMethods for Cryptic Structure. Methods for Cryptic Structure
Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases
More informationThe E-M Algorithm in Genetics. Biostatistics 666 Lecture 8
The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as
More informationUnit 7 Genetics. Meiosis
NAME: 1 Unit 7 Genetics 1. Gregor Mendel- was responsible for our 2. What organism did Mendel study? 3. Mendel stated that physical traits were inherited as 4. Today we know that particles are actually
More informationIntroduc)on to Gene)cs How to Analyze Your Own Genome Fall 2013
Introduc)on to Gene)cs 02-223 How to Analyze Your Own Genome Fall 2013 Overview Primer on gene
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationPrinciples of Genetics
Principles of Genetics Snustad, D ISBN-13: 9780470903599 Table of Contents C H A P T E R 1 The Science of Genetics 1 An Invitation 2 Three Great Milestones in Genetics 2 DNA as the Genetic Material 6 Genetics
More informationCombining dependent tests for linkage or association across multiple phenotypic traits
Biostatistics (2003), 4, 2,pp. 223 229 Printed in Great Britain Combining dependent tests for linkage or association across multiple phenotypic traits XIN XU Program for Population Genetics, Harvard School
More informationSection 11 1 The Work of Gregor Mendel
Chapter 11 Introduction to Genetics Section 11 1 The Work of Gregor Mendel (pages 263 266) What is the principle of dominance? What happens during segregation? Gregor Mendel s Peas (pages 263 264) 1. The
More informationQUANTITATIVE trait analysis has many applica- several crosses, since the correlations may not be the
Copyright 001 by the Genetics Society of America Statistical Issues in the Analysis of Quantitative Traits in Combined Crosses Fei Zou, Brian S. Yandell and Jason P. Fine Department of Statistics, University
More informationPRINCIPLES OF MENDELIAN GENETICS APPLICABLE IN FORESTRY. by Erich Steiner 1/
PRINCIPLES OF MENDELIAN GENETICS APPLICABLE IN FORESTRY by Erich Steiner 1/ It is well known that the variation exhibited by living things has two components, one hereditary, the other environmental. One
More informationAnalogy Principle. Asymptotic Theory Part II. James J. Heckman University of Chicago. Econ 312 This draft, April 5, 2006
Analogy Principle Asymptotic Theory Part II James J. Heckman University of Chicago Econ 312 This draft, April 5, 2006 Consider four methods: 1. Maximum Likelihood Estimation (MLE) 2. (Nonlinear) Least
More informationMultipoint Quantitative-Trait Linkage Analysis in General Pedigrees
Am. J. Hum. Genet. 6:9, 99 Multipoint Quantitative-Trait Linkage Analysis in General Pedigrees Laura Almasy and John Blangero Department of Genetics, Southwest Foundation for Biomedical Research, San Antonio
More informationGSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION
FIFTH EDITION IV I ^HHk ^ttm IZTI/^Q i I II MPHBBMWBBIHB '-llwmpbi^hbwm^^pfc ' GSBHSRSBRSRRk LlML I I \l 1MB ^HP'^^MMMP" jflp^^^^^^^^st I Iv^O FROM GENES TO GENOMES %^MiM^PM^^MWi99Mi$9i0^^ ^^^^^^^^^^^^^V^^^fii^^t^i^^^^^
More informationR/qtl workshop. (part 2) Karl Broman. Biostatistics and Medical Informatics University of Wisconsin Madison. kbroman.org
R/qtl workshop (part 2) Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Example Sugiyama et al. Genomics 71:70-77, 2001 250 male
More informationMapping QTL to a phylogenetic tree
Mapping QTL to a phylogenetic tree Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman Human vs mouse www.daviddeen.com 3 Intercross
More informationSupporting Information for Estimating restricted mean. treatment effects with stacked survival models
Supporting Information for Estimating restricted mean treatment effects with stacked survival models Andrew Wey, David Vock, John Connett, and Kyle Rudser Section 1 presents several extensions to the simulation
More informationMultidimensional heritability analysis of neuroanatomical shape. Jingwei Li
Multidimensional heritability analysis of neuroanatomical shape Jingwei Li Brain Imaging Genetics Genetic Variation Behavior Cognition Neuroanatomy Brain Imaging Genetics Genetic Variation Neuroanatomy
More informationarxiv: v4 [stat.me] 2 May 2015
Submitted to the Annals of Statistics arxiv: arxiv:0000.0000 BAYESIAN NONPARAMETRIC TESTS VIA SLICED INVERSE MODELING By Bo Jiang, Chao Ye and Jun S. Liu Harvard University, Tsinghua University and Harvard
More informationMOLECULAR MAPS AND MARKERS FOR DIPLOID ROSES
MOLECULAR MAPS AND MARKERS FOR DIPLOID ROSES Patricia E Klein, Mandy Yan, Ellen Young, Jeekin Lau, Stella Kang, Natalie Patterson, Natalie Anderson and David Byrne Department of Horticultural Sciences,
More informationThe Quantitative TDT
The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus
More informationMethods for QTL analysis
Methods for QTL analysis Julius van der Werf METHODS FOR QTL ANALYSIS... 44 SINGLE VERSUS MULTIPLE MARKERS... 45 DETERMINING ASSOCIATIONS BETWEEN GENETIC MARKERS AND QTL WITH TWO MARKERS... 45 INTERVAL
More informationLecture 8. QTL Mapping 1: Overview and Using Inbred Lines
Lecture 8 QTL Mapping 1: Overview and Using Inbred Lines Bruce Walsh. jbwalsh@u.arizona.edu. University of Arizona. Notes from a short course taught Jan-Feb 2012 at University of Uppsala While the machinery
More informationThe phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.
Series 1: Cross Diagrams There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome: When both
More informationName Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.
Section 1: Chromosomes and Meiosis KEY CONCEPT Gametes have half the number of chromosomes that body cells have. VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous
More informationLinkage analysis and QTL mapping in autotetraploid species. Christine Hackett Biomathematics and Statistics Scotland Dundee DD2 5DA
Linkage analysis and QTL mapping in autotetraploid species Christine Hackett Biomathematics and Statistics Scotland Dundee DD2 5DA Collaborators John Bradshaw Zewei Luo Iain Milne Jim McNicol Data and
More informationLecture 7 (FW) February 11, 2009 Phenotype and Genotype Reading: pp
Lecture 7 (FW) February 11, 2009 Phenotype and Genotype Reading: pp. 51-62 Annoucement: A review session for the first mid term will be held on Tuesday, 2/24, from 5-6:30 PM in 159 Mulford Hall. The mid
More information... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE
NORMAL DISTRIBUTIONS OF PHENOTYPES Mice Fruit Flies In:Introduction to Quantitative Genetics Falconer & Mackay 1996 CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE Mean and variance are two quantities
More informationCONVERTING OBSERVED LIKELIHOOD FUNCTIONS TO TAIL PROBABILITIES. D.A.S. Fraser Mathematics Department York University North York, Ontario M3J 1P3
CONVERTING OBSERVED LIKELIHOOD FUNCTIONS TO TAIL PROBABILITIES D.A.S. Fraser Mathematics Department York University North York, Ontario M3J 1P3 N. Reid Department of Statistics University of Toronto Toronto,
More informationQTL model selection: key players
Bayesian Interval Mapping. Bayesian strategy -9. Markov chain sampling 0-7. sampling genetic architectures 8-5 4. criteria for model selection 6-44 QTL : Bayes Seattle SISG: Yandell 008 QTL model selection:
More informationSimple, Robust Linkage Tests for Affected Sibs
Am. J. Hum. Genet. 6:8 4, 998 Simple, Robust Linkage Tests for Affected Sibs Alice S. Whittemore and I-Ping Tu Department of Health Research and Policy, Stanford University School of Medicine; and Stanford
More informationHow robust are the predictions of the W-F Model?
How robust are the predictions of the W-F Model? As simplistic as the Wright-Fisher model may be, it accurately describes the behavior of many other models incorporating additional complexity. Many population
More informationOn the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease
On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease Yuehua Cui 1 and Dong-Yun Kim 2 1 Department of Statistics and Probability, Michigan State University,
More informationQuantitative Genetics
Bruce Walsh, University of Arizona, Tucson, Arizona, USA Almost any trait that can be defined shows variation, both within and between populations. Quantitative genetics is concerned with the analysis
More information