SNP Association Studies with Case-Parent Trios

Size: px
Start display at page:

Download "SNP Association Studies with Case-Parent Trios"

Transcription

1 SNP Association Studies with Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health September 3, 2009 Population-based Association Studies Balding (2006). Nature Reviews Genetics 7(0): 78-9.

2 Case-control Design Balding (2006). Nature Review Genetics 7(0): Family-based Designs Laird and Lange (2006), Nature Reviews Genetics 7,

3 Linkage From Goncalo Abecasis Linkage versus Association From Goncalo Abecasis

4 Family-based Designs Benyamin, Visscher, and McRae (2009), Pharmacogenomics 0(2), Parent-of-Origin Sull et al (2008), Genetic Epidemiology 32:

5 De-novo Deletion Homozygous and Hemizygous Deletions GF GM M A B S Grandfather Grandmother Mother Aunt Brother Sister AB 0.0 BB 0.05 AB -0. AB AB 0.06 BB AA 0.27 NC AA AA AB 0.2 BB AB 0.5 NC AA AA AB BB AB NC AA AA AA 0.30 AA BB 0.20 NC NC BB BB 0.22 BB AB 0.03 NC -6.4 BB BB AB 0.09 AA AB NC BB -0.7 BB AB AA -.06 BB 0.0 NC BB 0.04 BB BB 0.4 NC AB 0.7 NC AA AA AA AA AB 0.0 NC AA AA AA 0.6 AA AB -0.0 NC BB BB AB 0.3 AA AB 0.0 NC BB -0.7 BB -0.5 AB -0.5 AA BB 0.6 NC BB BB BB 0.0 BB AB 0.06 NC AA AA AB 0.07 BB AA 0.7 NC AA AA AB BB BB 0.02 NC BB BB AB 0.3 AA AA 0.2 NC -6.0 AA AA AB 0.20 BB BB 0.05 BB 0. BB 0.5 BB 0.29 AB 0.3 AB -0.0

6 Family-based Designs - Advantages Robust to possible effects of population stratification and genetic heterogeneity. Parent-of-origin effects (imprinting) can be assessed. Data quality control is usually more thorough (e. g. genotyping errors and sample swaps are easier to catch). Distinction between de-novo and inherited events (copy number changes) is possible. Logistically easier for childhood diseases. In case-parent data, low minor allele frequencies are of less worry (genotyping errors are still possible). Case-parent designs do not require phenotyping parents. Linkage information from previous family studies can be employed in association studies. Family-based Designs - Disdvantages Typically less power per SNP typed than case-control studies. Pedigrees maybe hard to get except for childhood diseases, and may not be feasible for late-onset diseases. Can be a lot more expensive. Highly sensitive to genotyping errors. Might be computationally more demanding, especially for studies with large pedigrees. Software may be an issue.

7 Summary Pearson and Manolio (2008). JAMA. 299(): Family versus Case-control IGES 2009 (Hawaii) Monday, October 9 - Kuilima Ballroom Session 3: Family Studies & Family Data :30 2:30 Family Studies: Are they still relevant? Pro vs Con Discussion Nan Laird (Pro) & David Balding (Con)

8 TDT - Allelic The transmission disequilibrium test measures the over-transmission of an allele from parents to affected offsprings. For a set of n parents with alleles and 2 at a genetic locus, each parent can be summarized by the transmitted and the non-transmitted allele: Non-TA P 2 a b a + b TA 2 c d c + d P a + c b + d 2n Only the heterozygous parents contribute information! Under the null of no association, Even better, use binom.test() in R. (b c)2 b + c χ 2 TDT - Genotypic Assume that at a certain locus the father has alleles and the mother has alleles 2. The four Mendelian children thus have alleles, 2,, and 2. Assume the affected proband has genotype. The three Pseudo controls then have the genotypes, 2, and 2. Y X Affected proband Pseudo control # 0 Pseudo control #2 0 2 Pseudo control #3 0 2 We can use conditional logistic regression to analyze the data.

9 TDT Fallin MD et al (2006). Genetic Epidemiology 23:4586. International Cleft Consortium CL trios CLP trios CP trios Total Recruitment Site C NC C NC C NC C NC Utah 68 (6) 96 (20) 52 (2) 26 (48) Norway 06 (4) 74 (8) 07 (3) 387 (5) Korea 9 (0) 40 (2) 5 (0) 64 (2) Maryland 9 (2) 7 (42) 25 (8) 5 (72) Pittsburgh 26 (2) 70 (28) (4) 07 (34) Singapore 5 () 45 (7) 53 (4) 3 (2) Taiwan 42 (4) 76 () 74 (5) 292 (20) Iowa 6 (9) 29 () 24 (7) 69 (37) Denmark 6 (5) 5 (2) 5 (8) 26 (35) Philippines 0 (0) 94 (4) 0 (0) 94 (4) WuHan 39 (3) 36 (9) 42 (3) 29 (5)* Shandong Prov. 54 (2) 29 (70) 30 (8) 25 (0)* Western China 43 (3) 63 (3) 38 (2) 44 (8) Total 453 (90) 38 (227) 466 (84) 2060 (403)* *total includes probands of indeterminate cleft type: 2 in WuHan; 3 in Shandong Prov.

10 International Cleft Consortium International Cleft Consortium

11 Schizophrenia Study Case-parent trios of Ashkenazi Jewish descents. Diagnosis of SZ and SZA based on DSM IV. Dense coverage of 64 candidate genes. 375 SNPs on chromosomes genotyped for 32 trios. Original analysis through single marker TDT. Fallin MD et al (2005), Am J Hum Genet 77(6): Goal: Explore SNP-SNP interactions for association with SZ and SZA. Biological and Statistical Interactions BB Bb bb AA Aa aa (SNPA D SNPB R ) (SNPB D SNPA R ) Statistical interaction: Deviation from additivity in a linear statistical model. Epistasis: Masking of phenotype expressed by one gene by the effects of another gene. How can we detect models such as logit(p) =α + β Ind{(SNP24 D SNP80 R ) (SNP24 R SNP80 D )}...?

12 Logic Regression The predictors are the SNPs in dominant and recessive coding. SNP X X.R X.D AA 0 0 AT 0 TT Let X,...,X k be binary (0/) predictors, Y a response variable. Fit a model g(e(y)) = b 0 + t b j L j, j= where L j is a Boolean combination of the covariates, e.g. L j = (X X 2 ) X c 4. Determine the logic terms L j and estimate the b j simultaneously. Ruczinski et al (2003), Journal of Computational and Graphical Statistics 2(3): Trio Logic Regression The rough idea is as follows: Pseudo controls are generated from the trio data, taking the LD block structure into account. Missing data are handled using haplotype-based imputation. The conditional logistic likelihood is used in logic regression to assess differences in cases and pseudo controls.

13 Trio Logic Regression Trio Logic Regression The steps in more detail: Estimate the haplotype blocks and the haplotype frequencies using the parents genotypes. 2 For each block and each trio, sample haplotype pairs for the parents and the offspring consistent with the observed genotypes in the trio, allowing for missing data. 3 Generate the probands genotype data from the haplotypes that were passed from the parents. 4 For each block and each trio, generate genotypes for three pseudo-controls (PC, PC2, PC3) using the parents haplotypes that were not passed to the proband. The assignment to PC, PC2, and PC3 is random. 5 Assemble three pseudo-controls for each trio by augmenting the genotypes from the blocks. 6 For each locus, translate the genotype data into two binary variables in dominant and recessive coding.

14 Trio Logic Regression The conditional logistic likelihood is L(β) = ( ) N exp(x i0 β) i= exp(x i0 β)+σ 3 m= exp(x imβ) In this setting, i refers to a trio, X i0 refers to the exposure of the proband in trio i, and (X i, X i2, X i3 ) are the exposures of the 3 pseudo-controls in trio i. Does the likelihood always have a maximum? Trio Logic Regression Number of trios where k pseudo-controls match the proband k X i0 = 0 a 0 b 0 c 0 d 0 X i0 = a b c d Likelihood contributions for X i0 = 0 a 0 b 0 c 0 d 0 X i X i X i2 0 0 X i3 0 L i (β) +3 exp(β) 2+2 exp(β) 3+exp(β) 4 Likelihood contributions for X i0 = a b c d X i0 X i 0 X i2 0 0 X i L i (β) exp(β) exp(β)+3 exp(β) 2 exp(β)+2 exp(β) 3 exp(β)+ 4

15 Trio Logic Regression The log-likelihood is: log(l(β)) = a {β log(exp(β)+3)} + b {β log(2 exp(β)+2)} + c {β log(3 exp(β)+)} + d 4 a 0 log{ + 3 exp(β)} b 0 log{2 + 2 exp(β)} c 0 log{3 + exp(β)} d 0 4 Trio Logic Regression The first derivative of the log-likelihood is: log(l(β)) β = a (a + c 0 ) exp(β) exp(β)+3 + b (b + b 0 ) c (c + a 0 ) 2 exp(β) 2 exp(β) exp(β) 3 exp(β)+ The log-likelihood is monotonically decreasing in β. Further: lim β log(l(β)) β = a + b + c 0 and lim β + log(l(β)) β = (a 0 + b 0 + c 0 ) 0

16 Trio Logic Regression Since the derivative of the log likelihood function is a continuous function in β, it follows that the likelihood function has a maximum unless a + b + c = 0 or a 0 + b 0 + c 0 = 0. In other words, for either the exposure or the non-exposure groups, all pseudo-controls are equal to the respective probands. This might occur in a particular sample where the number of trios is small and the (or some of the) SNPs contributing to the exposure definition have extremely low minor allele frequency, however, it is not plausible that such a separation would exist in the population in truth. Missing Data

17 Simulation G SNP R 3 2 SNP R 4 2 SNPR (SNP R SNPD 6 2 ) SNPR 3 4 (SNP R 2 3 SNPD 5 2 ) SNPR (SNP D 4 SNPR 8 2 ) (SNPR 5 SNPD 6 ) 6 ((SNP R 4 2 SNPR 7 2 ) SNPR 8 3 ) (SNPR 9 2 SNPR 6 ) 7 (SNP R 3 SNPR 2 2 ) (SNPR 5 4 SNPR 5 ) (SNPR 9 2 SNPR 8 3 ) P(G) Haplotypes / block # of haplotypes # mating table rows , , 3, , 5, , 5, 5, , 8, 5, 3, 3 2, , 4, 5, 5, 3, 5 4, Simulation L/ L2/22 L/ Haplotype Frequencies population parents proband Haplotype Frequencies

18 Simulation When using conditional logistic regression to compare cases and pseudo-controls, the expected value of the parameter estimates is not the logs odds ratio β, but the log relative risk (Schaid 996). RR = P(D I G = ) P(D I G = 0) = exp(α + β)/( + exp(α + β)) exp(α)/( + exp(α)) = exp(β) ( ) + exp(α + β) + exp(α) Therefore log(rr) = β log ( ) + exp(α + β) + exp(α) Simulation log(rr) =β log! + exp(α + β) + exp(α) 4 3 2!^ 0 "!!5!3!!5!3!!5!3!!5!3!!5!3!

19 Simulation log(rr) =β log! + exp(α + β) + exp(α) 4 3 2!^ 0 "!!5!3!!5!3!!5!3!!5!3!!5!3! Software The trio logic regression methods are implemented as an augmentation in the logic regression R package LogicReg. 2 The R package trio contains functions to generate logic regression input from pedigree or genotype files, to check for Mendelian errors, to impute missing data, and to simulate case-parent trios. 3 A software vignette is also available.

20 Results Logic model exp( ˆβ) 0.67 I n o 302 D I n o 302 D 66 D I n o 302 D 66 D 48 D I n o 302 D 66 D 48 D 368 R 3.65 SNP 302 Chromosome 2 NOS SNP 66 Chromosome 8 CHRNB SNP 48 Chromosome 8 PNOC SNP 368 Chromosome 22 COMT Results Logic model exp( ˆβ) 0.67 I n o 302 D I n o 302 D 66 D I n o 302 D 66 D 48 D I n o 302 D 66 D 48 D 368 R 3.65 SNP 302 Chromosome 2 NOS SNP 66 Chromosome 8 CHRNB SNP 48 Chromosome 8 PNOC SNP 368 Chromosome 22 COMT

21 Results exp( ˆβ) ˆβ (se) z p Marginal 302 D (0.6) 4.2 4e D (0.22) Logic 302 D 66 D (0.8) 4.89 e-06 Additive 302 D (0.6) 4.5 3e D (0.22) D (0.9) 4.86 e-06 Additive 66 D (0.33) D : 66 D (0.34) Results exp( ˆβ) ˆβ (se) z p Marginal 302 D (0.6) 4.2 4e D (0.22) Logic 302 D 66 D (0.8) 4.89 e-06 Additive 302 D (0.6) 4.5 3e D (0.22) D (0.9) 4.86 e-06 Additive 66 D (0.33) D : 66 D (0.34)

22 Results exp( ˆβ) ˆβ (se) z p Marginal 302 D (0.6) 4.2 4e D (0.22) Logic 302 D 66 D (0.8) 4.89 e-06 Additive 302 D (0.6) 4.5 3e D (0.22) D (0.9) 4.86 e-06 Additive 66 D (0.33) D : 66 D (0.34) Results 0.67 I n 302 Do I {66D } 302 D 302 D D main effects only

23 Results 0.90 I n 302 Do I {66D }.09 In 302 D :66 Do 302 D 302 D D main effects + interaction Results 0.89 I n 302 D 66 Do 302 D 302 D D logic regression

24 Results 302 D 302 D 302 D 302 D 302 D 302 D D 66 D 66 D main effects only main effects + interaction logic regression Results exp( ˆβ) ˆβ(se) z p 302 D (0.6) 4.2 4e-05 Marginal 66 D (0.22) D (0.25) Logic 302 D 66 D 48 D (0.20) e-08

SNP-SNP Interactions in Case-Parent Trios

SNP-SNP Interactions in Case-Parent Trios Detection of SNP-SNP Interactions in Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health June 2, 2009 Karyotypes http://ghr.nlm.nih.gov/ Single Nucleotide Polymphisms

More information

SNP Association Studies with Case-Parent Trios

SNP Association Studies with Case-Parent Trios SNP Association Studies with Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health July, 00 Acknowledgments Collaborators: Qing Li, Rob Scharpf, Holger Schwender,

More information

Some New Methods for Family-Based Association Studies

Some New Methods for Family-Based Association Studies Some New Methods for Family-Based Association Studies Ingo Ruczinski Department of Biostatistics Johns Hopkins Bloomberg School of Public Health April 8, 20 http: //biostat.jhsph.edu/ iruczins/ Topics

More information

The Quantitative TDT

The Quantitative TDT The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus

More information

Preparing Case-Parent Trio Data and Detecting Disease-Associated SNPs, SNP Interactions, and Gene-Environment Interactions with trio

Preparing Case-Parent Trio Data and Detecting Disease-Associated SNPs, SNP Interactions, and Gene-Environment Interactions with trio Preparing Case-Parent Trio Data and Detecting Disease-Associated SNPs, SNP Interactions, and Gene-Environment Interactions with trio Holger Schwender, Qing Li, and Ingo Ruczinski Contents 1 Introduction

More information

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics 1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

More information

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017 Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping

More information

Hidden Markov Models for the Assessment of Chromosomal Alterations using High-throughput SNP Arrays

Hidden Markov Models for the Assessment of Chromosomal Alterations using High-throughput SNP Arrays Hidden Markov Models for the Assessment of Chromosomal Alterations using High-throughput SNP Arrays Department of Biostatistics Johns Hopkins Bloomberg School of Public Health November 18, 2008 Acknowledgments

More information

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative

More information

2. Map genetic distance between markers

2. Map genetic distance between markers Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,

More information

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular

More information

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015 Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm

More information

Linear Regression (1/1/17)

Linear Regression (1/1/17) STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression

More information

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo

Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo Friday Harbor 2017 From Genetics to GWAS (Genome-wide Association Study) Sept 7 2017 David Fardo Purpose: prepare for tomorrow s tutorial Genetic Variants Quality Control Imputation Association Visualization

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion

More information

Case-Control Association Testing. Case-Control Association Testing

Case-Control Association Testing. Case-Control Association Testing Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

Missing Data. On Missing Data and Interactions in SNP Association Studies. Missing Data - Approaches. Missing Data - Approaches. Multiple Imputation

Missing Data. On Missing Data and Interactions in SNP Association Studies. Missing Data - Approaches. Missing Data - Approaches. Multiple Imputation October, 6 @ U of British Columbia Missing Data On Missing Data Interactions in SNP Association Studies 5 SNPs Ingo Ruczinski Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health

More information

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES: .5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the

More information

p(d g A,g B )p(g B ), g B

p(d g A,g B )p(g B ), g B Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)

More information

Lesson 4: Understanding Genetics

Lesson 4: Understanding Genetics Lesson 4: Understanding Genetics 1 Terms Alleles Chromosome Co dominance Crossover Deoxyribonucleic acid DNA Dominant Genetic code Genome Genotype Heredity Heritability Heritability estimate Heterozygous

More information

Outline. P o purple % x white & white % x purple& F 1 all purple all purple. F purple, 224 white 781 purple, 263 white

Outline. P o purple % x white & white % x purple& F 1 all purple all purple. F purple, 224 white 781 purple, 263 white Outline - segregation of alleles in single trait crosses - independent assortment of alleles - using probability to predict outcomes - statistical analysis of hypotheses - conditional probability in multi-generation

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Complexity and Approximation of the Minimum Recombination Haplotype Configuration Problem

Complexity and Approximation of the Minimum Recombination Haplotype Configuration Problem Complexity and Approximation of the Minimum Recombination Haplotype Configuration Problem Lan Liu 1, Xi Chen 3, Jing Xiao 3, and Tao Jiang 1,2 1 Department of Computer Science and Engineering, University

More information

TOPICS IN STATISTICAL METHODS FOR HUMAN GENE MAPPING

TOPICS IN STATISTICAL METHODS FOR HUMAN GENE MAPPING TOPICS IN STATISTICAL METHODS FOR HUMAN GENE MAPPING by Chia-Ling Kuo MS, Biostatstics, National Taiwan University, Taipei, Taiwan, 003 BBA, Statistics, National Chengchi University, Taipei, Taiwan, 001

More information

Bayesian Inference of Interactions and Associations

Bayesian Inference of Interactions and Associations Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,

More information

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees:

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees: MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation

More information

Patterns of inheritance

Patterns of inheritance Patterns of inheritance Learning goals By the end of this material you would have learnt about: How traits and characteristics are passed on from one generation to another The different patterns of inheritance

More information

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to

DNA polymorphisms such as SNP and familial effects (additive genetic, common environment) to 1 1 1 1 1 1 1 1 0 SUPPLEMENTARY MATERIALS, B. BIVARIATE PEDIGREE-BASED ASSOCIATION ANALYSIS Introduction We propose here a statistical method of bivariate genetic analysis, designed to evaluate contribution

More information

Ch 11.Introduction to Genetics.Biology.Landis

Ch 11.Introduction to Genetics.Biology.Landis Nom Section 11 1 The Work of Gregor Mendel (pages 263 266) This section describes how Gregor Mendel studied the inheritance of traits in garden peas and what his conclusions were. Introduction (page 263)

More information

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8 The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as

More information

Objectives. Announcements. Comparison of mitosis and meiosis

Objectives. Announcements. Comparison of mitosis and meiosis Announcements Colloquium sessions for which you can get credit posted on web site: Feb 20, 27 Mar 6, 13, 20 Apr 17, 24 May 15. Review study CD that came with text for lab this week (especially mitosis

More information

The Lander-Green Algorithm. Biostatistics 666 Lecture 22

The Lander-Green Algorithm. Biostatistics 666 Lecture 22 The Lander-Green Algorithm Biostatistics 666 Lecture Last Lecture Relationship Inferrence Likelihood of genotype data Adapt calculation to different relationships Siblings Half-Siblings Unrelated individuals

More information

BIOLOGY LTF DIAGNOSTIC TEST MEIOSIS & MENDELIAN GENETICS

BIOLOGY LTF DIAGNOSTIC TEST MEIOSIS & MENDELIAN GENETICS 016064 BIOLOGY LTF DIAGNOSTIC TEST MEIOSIS & MENDELIAN GENETICS TEST CODE: 016064 Directions: Each of the questions or incomplete statements below is followed by five suggested answers or completions.

More information

Non-iterative, regression-based estimation of haplotype associations

Non-iterative, regression-based estimation of haplotype associations Non-iterative, regression-based estimation of haplotype associations Benjamin French, PhD Department of Biostatistics and Epidemiology University of Pennsylvania bcfrench@upenn.edu National Cancer Center

More information

Genetics (patterns of inheritance)

Genetics (patterns of inheritance) MENDELIAN GENETICS branch of biology that studies how genetic characteristics are inherited MENDELIAN GENETICS Gregory Mendel, an Augustinian monk (1822-1884), was the first who systematically studied

More information

10. How many chromosomes are in human gametes (reproductive cells)? 23

10. How many chromosomes are in human gametes (reproductive cells)? 23 Name: Key Block: Define the following terms: 1. Dominant Trait-characteristics that are expressed if present in the genotype 2. Recessive Trait-characteristics that are masked by dominant traits unless

More information

Tilburg University. Published in: American Journal of Human Genetics. Document version: Peer reviewed version. Publication date: 2000

Tilburg University. Published in: American Journal of Human Genetics. Document version: Peer reviewed version. Publication date: 2000 Tilburg University Testing for linkage disequilibrium, maternal effects, and imprinting with (in)complete case-parent triads using the computer program LEM van den Oord, E.; Vermunt, Jeroen Published in:

More information

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17 Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared

More information

A novel fuzzy set based multifactor dimensionality reduction method for detecting gene-gene interaction

A novel fuzzy set based multifactor dimensionality reduction method for detecting gene-gene interaction A novel fuzzy set based multifactor dimensionality reduction method for detecting gene-gene interaction Sangseob Leem, Hye-Young Jung, Sungyoung Lee and Taesung Park Bioinformatics and Biostatistics lab

More information

BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section. A/a ; B/B ; d/d X A/a ; b/b ; D/d

BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section. A/a ; B/B ; d/d X A/a ; b/b ; D/d BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section 1. In the following cross, all genes are on separate chromosomes. A is dominant to a, B is dominant to b and D is dominant

More information

Breeding Values and Inbreeding. Breeding Values and Inbreeding

Breeding Values and Inbreeding. Breeding Values and Inbreeding Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A

More information

1. Understand the methods for analyzing population structure in genomes

1. Understand the methods for analyzing population structure in genomes MSCBIO 2070/02-710: Computational Genomics, Spring 2016 HW3: Population Genetics Due: 24:00 EST, April 4, 2016 by autolab Your goals in this assignment are to 1. Understand the methods for analyzing population

More information

Guided Notes Unit 6: Classical Genetics

Guided Notes Unit 6: Classical Genetics Name: Date: Block: Chapter 6: Meiosis and Mendel I. Concept 6.1: Chromosomes and Meiosis Guided Notes Unit 6: Classical Genetics a. Meiosis: i. (In animals, meiosis occurs in the sex organs the testes

More information

Heredity and Genetics WKSH

Heredity and Genetics WKSH Chapter 6, Section 3 Heredity and Genetics WKSH KEY CONCEPT Mendel s research showed that traits are inherited as discrete units. Vocabulary trait purebred law of segregation genetics cross MAIN IDEA:

More information

Lecture 11: Multiple trait models for QTL analysis

Lecture 11: Multiple trait models for QTL analysis Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple

More information

Power and Design Considerations for a General Class of Family-Based Association Tests: Quantitative Traits

Power and Design Considerations for a General Class of Family-Based Association Tests: Quantitative Traits Am. J. Hum. Genet. 71:1330 1341, 00 Power and Design Considerations for a General Class of Family-Based Association Tests: Quantitative Traits Christoph Lange, 1 Dawn L. DeMeo, and Nan M. Laird 1 1 Department

More information

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013 Lecture 9 Short-Term Selection Response: Breeder s equation Bruce Walsh lecture notes Synbreed course version 3 July 2013 1 Response to Selection Selection can change the distribution of phenotypes, and

More information

Reinforcement Unit 3 Resource Book. Meiosis and Mendel KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Reinforcement Unit 3 Resource Book. Meiosis and Mendel KEY CONCEPT Gametes have half the number of chromosomes that body cells have. 6.1 CHROMOSOMES AND MEIOSIS KEY CONCEPT Gametes have half the number of chromosomes that body cells have. Your body is made of two basic cell types. One basic type are somatic cells, also called body cells,

More information

Affected Sibling Pairs. Biostatistics 666

Affected Sibling Pairs. Biostatistics 666 Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD

More information

Biol. 303 EXAM I 9/22/08 Name

Biol. 303 EXAM I 9/22/08 Name Biol. 303 EXAM I 9/22/08 Name -------------------------------------------------------------------------------------------------------------- This exam consists of 40 multiple choice questions worth 2.5

More information

Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th

Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th Name: Date: Block: Chapter 6 Meiosis and Mendel Section 6.1 Chromosomes and Meiosis 1. How do gametes differ from somatic cells? Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February

More information

Genetics Studies of Multivariate Traits

Genetics Studies of Multivariate Traits Genetics Studies of Multivariate Traits Heping Zhang Department of Epidemiology and Public Health Yale University School of Medicine Presented at Southern Regional Council on Statistics Summer Research

More information

Régression en grande dimension et épistasie par blocs pour les études d association

Régression en grande dimension et épistasie par blocs pour les études d association Régression en grande dimension et épistasie par blocs pour les études d association V. Stanislas, C. Dalmasso, C. Ambroise Laboratoire de Mathématiques et Modélisation d Évry "Statistique et Génome" 1

More information

EM algorithm. Rather than jumping into the details of the particular EM algorithm, we ll look at a simpler example to get the idea of how it works

EM algorithm. Rather than jumping into the details of the particular EM algorithm, we ll look at a simpler example to get the idea of how it works EM algorithm The example in the book for doing the EM algorithm is rather difficult, and was not available in software at the time that the authors wrote the book, but they implemented a SAS macro to implement

More information

Cell Division: the process of copying and dividing entire cells The cell grows, prepares for division, and then divides to form new daughter cells.

Cell Division: the process of copying and dividing entire cells The cell grows, prepares for division, and then divides to form new daughter cells. Mitosis & Meiosis SC.912.L.16.17 Compare and contrast mitosis and meiosis and relate to the processes of sexual and asexual reproduction and their consequences for genetic variation. 1. Students will describe

More information

Observing Patterns in Inherited Traits

Observing Patterns in Inherited Traits Observing Patterns in Inherited Traits Chapter 10 Before you go on Review the answers to the following questions to test your understanding of previous material. 1. Most organisms are diploid. What does

More information

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have. Section 1: Chromosomes and Meiosis KEY CONCEPT Gametes have half the number of chromosomes that body cells have. VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous

More information

Unit 3 Test 2 Study Guide

Unit 3 Test 2 Study Guide Unit 3 Test 2 Study Guide How many chromosomes are in the human body cells? 46 How many chromosomes are in the sex cells? 23 What are sex cells also known as? gametes What is fertilization? Union of the

More information

Linkage and Linkage Disequilibrium

Linkage and Linkage Disequilibrium Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies

More information

Solutions to Problem Set 4

Solutions to Problem Set 4 Question 1 Solutions to 7.014 Problem Set 4 Because you have not read much scientific literature, you decide to study the genetics of garden peas. You have two pure breeding pea strains. One that is tall

More information

F1 Parent Cell R R. Name Period. Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes

F1 Parent Cell R R. Name Period. Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes Name Period Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes 1. What is the chromosome theory of inheritance? 2. Explain the law of segregation. Use two different

More information

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next genetics the study of heredity heredity sequence of DNA that codes for a protein and thus determines a trait genome a specific characteristic that varies from one individual to another gene trait the passing

More information

Methods for Cryptic Structure. Methods for Cryptic Structure

Methods for Cryptic Structure. Methods for Cryptic Structure Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases

More information

The Chromosomal Basis of Inheritance

The Chromosomal Basis of Inheritance The Chromosomal Basis of Inheritance Mitosis and meiosis were first described in the late 800s. The chromosome theory of inheritance states: Mendelian genes have specific loci (positions) on chromosomes.

More information

Department of Forensic Psychiatry, School of Medicine & Forensics, Xi'an Jiaotong University, Xi'an, China;

Department of Forensic Psychiatry, School of Medicine & Forensics, Xi'an Jiaotong University, Xi'an, China; Title: Evaluation of genetic susceptibility of common variants in CACNA1D with schizophrenia in Han Chinese Author names and affiliations: Fanglin Guan a,e, Lu Li b, Chuchu Qiao b, Gang Chen b, Tinglin

More information

Introduction to Genetics

Introduction to Genetics Introduction to Genetics We ve all heard of it, but What is genetics? Genetics: the study of gene structure and action and the patterns of inheritance of traits from parent to offspring. Ancient ideas

More information

Introduction to Linkage Disequilibrium

Introduction to Linkage Disequilibrium Introduction to September 10, 2014 Suppose we have two genes on a single chromosome gene A and gene B such that each gene has only two alleles Aalleles : A 1 and A 2 Balleles : B 1 and B 2 Suppose we have

More information

On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease

On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease Yuehua Cui 1 and Dong-Yun Kim 2 1 Department of Statistics and Probability, Michigan State University,

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,

More information

Introduction to Genetics

Introduction to Genetics Introduction to Genetics The Work of Gregor Mendel B.1.21, B.1.22, B.1.29 Genetic Inheritance Heredity: the transmission of characteristics from parent to offspring The study of heredity in biology is

More information

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) 12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²

More information

Backward Genotype-Trait Association. in Case-Control Designs

Backward Genotype-Trait Association. in Case-Control Designs Backward Genotype-Trait Association (BGTA)-Based Dissection of Complex Traits in Case-Control Designs Tian Zheng, Hui Wang and Shaw-Hwa Lo Department of Statistics, Columbia University, New York, New York,

More information

Gene mapping in model organisms

Gene mapping in model organisms Gene mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Goal Identify genes that contribute to common human diseases. 2

More information

Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives

Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Genetic Epidemiology 16:225 249 (1999) Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Mary Sara McPeek* Department of Statistics, University of Chicago, Chicago, Illinois

More information

Association studies and regression

Association studies and regression Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration

More information

BTRY 7210: Topics in Quantitative Genomics and Genetics

BTRY 7210: Topics in Quantitative Genomics and Genetics BTRY 7210: Topics in Quantitative Genomics and Genetics Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine jgm45@cornell.edu February 12, 2015 Lecture 3:

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA

More information

The universal validity of the possible triangle constraint for Affected-Sib-Pairs

The universal validity of the possible triangle constraint for Affected-Sib-Pairs The Canadian Journal of Statistics Vol. 31, No.?, 2003, Pages???-??? La revue canadienne de statistique The universal validity of the possible triangle constraint for Affected-Sib-Pairs Zeny Z. Feng, Jiahua

More information

Lecture WS Evolutionary Genetics Part I 1

Lecture WS Evolutionary Genetics Part I 1 Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in

More information

A General and Accurate Approach for Computing the Statistical Power of the Transmission Disequilibrium Test for Complex Disease Genes

A General and Accurate Approach for Computing the Statistical Power of the Transmission Disequilibrium Test for Complex Disease Genes Genetic Epidemiology 2:53 67 (200) A General and Accurate Approach for Computing the Statistical Power of the Transmission Disequilibrium Test for Complex Disease Genes Wei-Min Chen and Hong-Wen Deng,2

More information

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M. STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).

More information

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM Life Cycles, Meiosis and Genetic Variability iclicker: 1. A chromosome just before mitosis contains two double stranded DNA molecules. 2. This replicated chromosome contains DNA from only one of your parents

More information

Chapter 11 Meiosis and Genetics

Chapter 11 Meiosis and Genetics Chapter 11 Meiosis and Genetics Chapter 11 Meiosis and Genetics Grade:«grade» Subject:Biology Date:«date» 1 What are homologous chromosomes? A two tetrads, both from mom or both from dad B a matching pair

More information

Binomial Mixture Model-based Association Tests under Genetic Heterogeneity

Binomial Mixture Model-based Association Tests under Genetic Heterogeneity Binomial Mixture Model-based Association Tests under Genetic Heterogeneity Hui Zhou, Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 April 30,

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA

More information

Statistical issues in QTL mapping in mice

Statistical issues in QTL mapping in mice Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping

More information

Introduction to Genetics

Introduction to Genetics Introduction to Genetics We ve all heard of it, but What is genetics? Genetics: the study of gene structure and action and the patterns of inheritance of traits from parent to offspring. Ancient ideas

More information

Resemblance between relatives

Resemblance between relatives Resemblance between relatives 1 Key concepts Model phenotypes by fixed effects and random effects including genetic value (additive, dominance, epistatic) Model covariance of genetic effects by relationship

More information

Prediction of the Confidence Interval of Quantitative Trait Loci Location

Prediction of the Confidence Interval of Quantitative Trait Loci Location Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28

More information

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Quantitative Genetics The analysis of traits whose variation is determined by both a

More information

DNA Structure and Function

DNA Structure and Function DNA Structure and Function Nucleotide Structure 1. 5-C sugar RNA ribose DNA deoxyribose 2. Nitrogenous Base N attaches to 1 C of sugar Double or single ring Four Bases Adenine, Guanine, Thymine, Cytosine

More information

Section 11 1 The Work of Gregor Mendel

Section 11 1 The Work of Gregor Mendel Chapter 11 Introduction to Genetics Section 11 1 The Work of Gregor Mendel (pages 263 266) What is the principle of dominance? What happens during segregation? Gregor Mendel s Peas (pages 263 264) 1. The

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April

More information

Cover Requirements: Name of Unit Colored picture representing something in the unit

Cover Requirements: Name of Unit Colored picture representing something in the unit Name: Period: Cover Requirements: Name of Unit Colored picture representing something in the unit Biology B1 1 Target # Biology Unit B1 (Genetics & Meiosis) Learning Targets Genetics & Meiosis I can explain

More information