Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Size: px
Start display at page:

Download "Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia"

Transcription

1 Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia

2 Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything. A gene position on a chromosome is called a locus. The alternate forms of the gene at a locus are called alleles.

3 Definitions, continued: A causative locus for a trait is a locus with DNA sequence variation between individuals that contributes to variation in the trait between them. A quantitative trait locus (QTL) is a causative locus for a quantitative trait (e.g. height, IQ).

4 Definitions: Genetic Linkage Genes on the same chromosome tend to be inherited together. This is called linkage. The closer together two genes lie, the more often they are inherited together.

5 Genetic Markers A genetic marker is a known and detectable variation in a gene or other stretch of DNA. The presence of a marker can be detected in an individual organism s genome and its inheritance can be followed. A marker that is linked to a locus affecting a trait will appear more often in individuals with that trait.

6 Gene Mapping Goal of gene mapping is to locate genetic loci that are responsible for variation in traits of interest (e.g. occurrence of disease, intelligence). Based on searching for correlations between markers and trait of interest.

7 Gene Mapping Successes Mendelian traits have inheritance patterns consistent with a single causative locus. Hundreds of genes affecting known Mendelian traits (e.g. cystic fibrosis, breast cancer) in humans have been mapped. Some cases have led to new treatments and/or screening methods.

8 Gene Mapping Failures Complex traits do not follow simple one- locus Mendelian expectations. Assumed to be caused by mutations at multiple loci. Examples include many common diseases: asthma, bipolar disorder, diabetes, prostate cancer, etc.

9 Mapping of complex disease genes little success Altmuller et al, 2001: Reviewed 101 full genome scans of 31 complex diseases. - 67% did not show significant linkage to any marker. - of the significant linkages, very few have been reproduced.

10 Low power for mapping complex trait loci Power to detect genes decreases as the number of loci affecting the trait increases. By definition, complex diseases should be harder to map. We don t know how many genes underlie complex diseases, so don t know whether to be surprised by lack of success.

11 The central problem of human disease genetics is to solving this dilemma.

12 New Buzzword: Genetical Genomics: : the combination of molecular marker data and genome-wide expression data to elucidate the genetics of complex traits

13 Expression QTLs (eqtls) Gene expression levels can be treated as quantitative traits and their quantitative trait loci (QTLs) mapped. Transcript: : gene whose expression level is being measured. An eqtl for that transcript refers to a genetic locus with DNA sequence variation causing variation in the expression level.

14 Expression QTLs (eqtls) The eqtl could be at the transcript locus itself, or elsewhere in the genome. By combining microarrays and marker data, eqtls can be simultaneously generated for thousands of gene expression levels

15 Recent eqtl surveys Numerous recent eqtl surveys have found that expression levels frequently exhibit intermediate to high heritabilities. E.g. Schadt et al 2003 found 4339 eqtls over 3701 genes with log-of of-odd odd scores>4.3 and 11,021 genes with eqtl LOD scores>3.0

16 Recent eqtl surveys Thus, it is relatively easy to map QTLs for many expression levels.

17 Will eqtls be useful for dissecting complex physiological traits (e.g. disease)? Presumably, disease causative loci are also eqtls for some gene expression levels. Two task must be accomplished: 1. Show that transcript is related to disease. 2. Map eqtl for transcript. Then this eqtl should be causative locus for disease.

18 Causative Loci Exp. Lev. Disease Events Exp. Lev. Exp. Lev. Disease

19 Model for Gene Expression Assume L disease causative loci. M expression levels ( (X 1 X M ) tested. Each expression level X i depends on some subset of the L causative loci (often empty). T = indicator variable for disease status.

20 Model for Gene Expression P ( X T = 1) = = P ( T = 1 X ) P( X ) P( T = 1) ( T = 1 G ) P( X G ) P( G ) P G K K = Disease prevalence

21 Expression Levels Assume expression levels are normally distributed on some scale (e.g. log scale). Mean and variance determined by genotype at one or more of the L causative loci. Expression level distribution is then a mixture of normal distributions with weights determined by genotype distribution conditioned on disease status.

22 P Multiplicative Model ( T = G) = P( T = 1 G ) P( T = 1 G )... ( 1 ) P P T L = G L ( = 0) = ( ) ( ) ( ) ( ) P X T g, g... g 1 2 Can show: ( X T = 1) c = g, g 1 2 P,... g c ( X g, g,... g ) u ( g ) u ( g )... u ( g ) P( g ) P( g )... P( g ) 1 2 c 1 1 K ( + ) ( ) ( ) P X g, g,... g 1 u G u G... u G K... K P g... P g 1 2 c c c c 1 L 1 c 1 K 1 2 K K c c c 1 2 c Where K i = P T G i ( = 1 G ) P( G ) and u ( G ) = P( T = 1 G ) i i i i i i i

23 If c=1 (expression controlled by one locus) and Hardy-Weinberg Equilibrium (alleles independent): P X T = 1 = 1 p u dd ϕ X μ, σ 2 ( ) ( ) ( ) ( ) p 1 p u Dd ϕ X μ, σ + ( ) ( ) ( ) pu DD ϕ X μ, σ Where p 1 is the allele frequency D=disease allele and d= normal allele ( X, ) φ μ σ is normal pdf DD DD dd ( ) ( ) DD DD dd Dd Dd

24 Additive Model P ( T = G ) = P( T = 1 G ) + P( T = 1 G ) P( 1 ) P T L = G L Can show: ( X T 1) ( X G )( u ( G ) + K K ) P( G ) P G1 = = K L 1 P ( X T = 0) = G 1 P ( X G )( 1 u ( G ) K K... K ) P( G ) K 2 3 L 1

25 Expression Level Mean Expression level mean assumed to have either multiplicative or additive dependence on genotype: μ g,..., g = μ g μ g... μ g or ( ) ( ) ( ) ( ) 1 c μ g,..., g = μ g + μ g μ g ( ) ( ) ( ) ( ) 1 c c c

26 Power Calculations Two groups: disease affected and unaffected. t-test test for group difference conducted for each gene. Calculate power to detect differential expression. Interested in relationship between power and genetic model.

27

28 Causative Loci Exp. Lev. Disease Events Exp. Lev. Exp. Lev. C=1 C=5 C=2 Disease

29 Additive Model

30 Detecting expression level differences: Conclusions Power to detect expression level differences is poor for a multiplicative model with c=1, but reasonable for c=2-5. Power to detect expression level differences is very poor for an additive model.

31 Power for Mapping eqtls Power for mapping QTLs in natural populations deteriorates quickly as the number of QTLs increases. Power poor even for c=2.

32 Conclusions Natural Populations Power to detect expression level differences extremely poor if disease probability is additive. For multiplicative model there is no value of c (number of controlling loci) where power is good for detecting both linkage and expression level differences.

33 Many Unknowns Distribution of relationships between expression level and causative loci. Dominance relationships. Form of gene interactions. Allele frequencies.

34 Much Potential For Improved Power There is much information in the joint behavior of expression levels. Better experimental designs are possible (e.g. utilize family structure).

35 How can we use joint expression information? Schadt et al 2003: Crossed two mouse strains: one susceptible to diabetes and weight gain on high fat diet and the other not. 111 offspring from cross were put on high fat diet for 4 months. Liver tissue from each was profiled using microarrays. Genotypes for several hundred markers were obtained for each mouse.

36 From Schadt et al., 2003 Genetics of gene expression surveyed in maize, mouse and man. Nature 422:

37 Results: Microarray data splits high FPM data into two groups. Gene mapping comparing separate high- FPM groups to low-fpm produced significant linkage for a obesity locus. Gene mapping comparing combined high- FPM groups to low-fpm groups do not find significant linkage.

38 Genetic Heterogeneity Affected Individuals Locus x x x 2 x x x x 3 x x 4 x x 5 x x x 6 x x Major variation in which disease mutations are carried by affected individuals

39 Affected Individuals Locus x x x 2 x x x x 3 x x 4 x x 5 x x x x 6 x x x

40 Goal: A method for identifying the genetic heterogeneity using genomewide expression data Gene mapping power could be dramatically improved if this heterogeneity could be accounted for

41 Proposed EM Algorithm Based Method: Data: N A affected and N U unaffected individuals. Microarray for each individual Disease status for each individual

42 Sample Indiv. Disease Status Transcript Y 1 X 11 X 12 X 13 X 14 X 15 X 16 2 Y 2 X 21 X 22 X 23 X 24 X 25 X 26 3 Y 3 X 31 X 32 X 33 X 34 X 35 X 36 4 Y 4 X 41 X 42 X 43 X 44 X 45 X 46 5 Y- 5 X 51 X 52 X 53 X 54 X 55 X 56

43 Unobserved Variables Z =controlling i locus for transcript i, i=1 to M V jk =genotype on controlling locus j in individual k. Controlling loci 1 to L are the disease loci. Controlling loci L+1 to L+c are null loci that do not affect disease.

44 More specific goal: Cluster sample individuals by their values of V (that jk is, their genotype on the disease loci. ).

45 Likelihood L( θ X, Y, Z, V) = P X ZV PY V P ZV (,, θ ) (, θ ) (, θ ) P X Z V P Y V P Z V N M N (,, ) (, ) (, ) ij j iz θ k k θ θ = i= 1 j= 1 k= 1 j

46 P X Z, V, = Expression level distn. for gene j ( θ ) ij j iz j in indiv. i P( Y, ) k Vk θ = Disease probability for indiv. k P Z, V = Controlling locus and genotype probs. ( θ )

47 Parameter Estimates: Expression level mean for transcript a with controlling locus genotype b: ˆ μ L N = t= 1 i= 1 ab L N t= 1 i= 1 x P Z t V b X Y ( =, =,, θ ) 0 ia a it P Z t V b X Y ( =, =,, θ ) a it 0

48 ˆ σ Expression level variance for transcript a with controlling locus = L N 2 t= 1 i= 1 ab L N genotype b: ( ) 2 x (,,, ) ia μab P Za = t Vit = b X Y θ0 t= 1 i= 1 P Z t V b X Y ( ) a =, it =,, θ0

49 Probability that the expression of gene r has controlling locus q) ˆ = P Z =,, α rq ( q X Y θ ) r 0

50 Probability that controlling locus s has genotype t N ˆ λ st = j= 1 ( = t XY,, θ ) js 0 PV N

51 Disease penetrance parameters The genotype specific risk parameters are found by numerical maximization.

52 Assigning Genotypes: ( k Y, W ) P v W ij = = = ij i ij P Y v k P W v k ( ) ( ) i ij = ij ij = PYW (, ) vector containing the values of expression levels in individual i that are controlled by locus j. i ij

53 Transcripts assigned to same controlling locus if their expression is correlated in a way that is consistent with genetic model. Sample individuals assigned to same genotype at controlling locus if controlled expression levels are similar.

54 How well will it work? Potentially much information about genotype available in expression levels. On the other hand: Highly assumption laden. Many parameters to estimate. High-level transcripts may overwhelm analysis.

55 Causative Loci Exp. Lev. Disease Events Exp. Lev. Exp. Lev. C=1 C=5 C=2 Disease

56 Mapping Power - multiplicative

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics 1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

More information

Major Genes, Polygenes, and

Major Genes, Polygenes, and Major Genes, Polygenes, and QTLs Major genes --- genes that have a significant effect on the phenotype Polygenes --- a general term of the genes of small effect that influence a trait QTL, quantitative

More information

2. Map genetic distance between markers

2. Map genetic distance between markers Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,

More information

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011

Lecture 6: Introduction to Quantitative genetics. Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Lecture 6: Introduction to Quantitative genetics Bruce Walsh lecture notes Liege May 2011 course version 25 May 2011 Quantitative Genetics The analysis of traits whose variation is determined by both a

More information

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by

More information

Affected Sibling Pairs. Biostatistics 666

Affected Sibling Pairs. Biostatistics 666 Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD

More information

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees:

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees: MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation

More information

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17 Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H* ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION FIFTH EDITION IV I ^HHk ^ttm IZTI/^Q i I II MPHBBMWBBIHB '-llwmpbi^hbwm^^pfc ' GSBHSRSBRSRRk LlML I I \l 1MB ^HP'^^MMMP" jflp^^^^^^^^st I Iv^O FROM GENES TO GENOMES %^MiM^PM^^MWi99Mi$9i0^^ ^^^^^^^^^^^^^V^^^fii^^t^i^^^^^

More information

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion

More information

Lesson 4: Understanding Genetics

Lesson 4: Understanding Genetics Lesson 4: Understanding Genetics 1 Terms Alleles Chromosome Co dominance Crossover Deoxyribonucleic acid DNA Dominant Genetic code Genome Genotype Heredity Heritability Heritability estimate Heterozygous

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Human vs mouse Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] www.daviddeen.com

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Backcross P 1 P 2 P 1 F 1 BC 4

More information

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015 Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA

More information

Gene mapping in model organisms

Gene mapping in model organisms Gene mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Goal Identify genes that contribute to common human diseases. 2

More information

Inferring Genetic Architecture of Complex Biological Processes

Inferring Genetic Architecture of Complex Biological Processes Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison http://www.stat.wisc.edu/~yandell/statgen

More information

The universal validity of the possible triangle constraint for Affected-Sib-Pairs

The universal validity of the possible triangle constraint for Affected-Sib-Pairs The Canadian Journal of Statistics Vol. 31, No.?, 2003, Pages???-??? La revue canadienne de statistique The universal validity of the possible triangle constraint for Affected-Sib-Pairs Zeny Z. Feng, Jiahua

More information

Lecture WS Evolutionary Genetics Part I 1

Lecture WS Evolutionary Genetics Part I 1 Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in

More information

Probability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies

Probability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies Probability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies Ruth Pfeiffer, Ph.D. Mitchell Gail Biostatistics Branch Division of Cancer Epidemiology&Genetics National

More information

Case-Control Association Testing. Case-Control Association Testing

Case-Control Association Testing. Case-Control Association Testing Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies

More information

Classical Selection, Balancing Selection, and Neutral Mutations

Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection Perspective of the Fate of Mutations All mutations are EITHER beneficial or deleterious o Beneficial mutations are selected

More information

Genetic Association Studies in the Presence of Population Structure and Admixture

Genetic Association Studies in the Presence of Population Structure and Admixture Genetic Association Studies in the Presence of Population Structure and Admixture Purushottam W. Laud and Nicholas M. Pajewski Division of Biostatistics Department of Population Health Medical College

More information

The Admixture Model in Linkage Analysis

The Admixture Model in Linkage Analysis The Admixture Model in Linkage Analysis Jie Peng D. Siegmund Department of Statistics, Stanford University, Stanford, CA 94305 SUMMARY We study an appropriate version of the score statistic to test the

More information

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017 Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM Life Cycles, Meiosis and Genetic Variability iclicker: 1. A chromosome just before mitosis contains two double stranded DNA molecules. 2. This replicated chromosome contains DNA from only one of your parents

More information

Legend: S spotted Genotypes: P1 SS & ss F1 Ss ss plain F2 (with ratio) 1SS :2 WSs: 1ss. Legend W white White bull 1 Ww red cows ww ww red

Legend: S spotted Genotypes: P1 SS & ss F1 Ss ss plain F2 (with ratio) 1SS :2 WSs: 1ss. Legend W white White bull 1 Ww red cows ww ww red On my honor, this is my work GENETICS 310 EXAM 1 June 8, 2018 I. Following are 3 sets of data collected from crosses: 1. Spotted by Plain gave all spotted in the F1 and 9 spotted and 3 plain in the F2.

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

Phasing via the Expectation Maximization (EM) Algorithm

Phasing via the Expectation Maximization (EM) Algorithm Computing Haplotype Frequencies and Haplotype Phasing via the Expectation Maximization (EM) Algorithm Department of Computer Science Brown University, Providence sorin@cs.brown.edu September 14, 2010 Outline

More information

1. Understand the methods for analyzing population structure in genomes

1. Understand the methods for analyzing population structure in genomes MSCBIO 2070/02-710: Computational Genomics, Spring 2016 HW3: Population Genetics Due: 24:00 EST, April 4, 2016 by autolab Your goals in this assignment are to 1. Understand the methods for analyzing population

More information

Coding sequence array Office hours Wednesday 3-4pm 304A Stanley Hall

Coding sequence array Office hours Wednesday 3-4pm 304A Stanley Hall Coding sequence array Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100 Fig. 1.13 RNA-seq Expression effects of cancer AAAAA AAAAA AAAAA Solexa sequencing counts

More information

Solutions to Problem Set 4

Solutions to Problem Set 4 Question 1 Solutions to 7.014 Problem Set 4 Because you have not read much scientific literature, you decide to study the genetics of garden peas. You have two pure breeding pea strains. One that is tall

More information

p(d g A,g B )p(g B ), g B

p(d g A,g B )p(g B ), g B Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)

More information

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative

More information

Evolution of phenotypic traits

Evolution of phenotypic traits Quantitative genetics Evolution of phenotypic traits Very few phenotypic traits are controlled by one locus, as in our previous discussion of genetics and evolution Quantitative genetics considers characters

More information

Backward Genotype-Trait Association. in Case-Control Designs

Backward Genotype-Trait Association. in Case-Control Designs Backward Genotype-Trait Association (BGTA)-Based Dissection of Complex Traits in Case-Control Designs Tian Zheng, Hui Wang and Shaw-Hwa Lo Department of Statistics, Columbia University, New York, New York,

More information

Principles of Genetics

Principles of Genetics Principles of Genetics Snustad, D ISBN-13: 9780470903599 Table of Contents C H A P T E R 1 The Science of Genetics 1 An Invitation 2 Three Great Milestones in Genetics 2 DNA as the Genetic Material 6 Genetics

More information

HERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA)

HERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA) BIRS 016 1 HERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA) Malka Gorfine, Tel Aviv University, Israel Joint work with Li Hsu, FHCRC, Seattle, USA BIRS 016 The concept of heritability

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities

More information

Introduction to QTL mapping in model organisms

Introduction to QTL mapping in model organisms Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA

More information

KEY: Chapter 9 Genetics of Animal Breeding.

KEY: Chapter 9 Genetics of Animal Breeding. KEY: Chapter 9 Genetics of Animal Breeding. Answer each question using the reading assigned to you. You can access this information by clicking on the following URL: https://drive.google.com/a/meeker.k12.co.us/file/d/0b1yf08xgyhnad08xugxsnfvba28/edit?usp=sh

More information

Part 2- Biology Paper 2 Inheritance and Variation Knowledge Questions

Part 2- Biology Paper 2 Inheritance and Variation Knowledge Questions Part 2- Biology Paper 2 Inheritance and Variation Knowledge Questions AQA TRILOGY Biology (8464) from 2016 Topic T4.6 Inheritance, variation and evolution Topic Student Checklist R A G Describe features

More information

Quantitative Genetics & Evolutionary Genetics

Quantitative Genetics & Evolutionary Genetics Quantitative Genetics & Evolutionary Genetics (CHAPTER 24 & 26- Brooker Text) May 14, 2007 BIO 184 Dr. Tom Peavy Quantitative genetics (the study of traits that can be described numerically) is important

More information

Linkage and Linkage Disequilibrium

Linkage and Linkage Disequilibrium Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies

More information

Latent Variable models for GWAs

Latent Variable models for GWAs Latent Variable models for GWAs Oliver Stegle Machine Learning and Computational Biology Research Group Max-Planck-Institutes Tübingen, Germany September 2011 O. Stegle Latent variable models for GWAs

More information

NIH Public Access Author Manuscript Stat Sin. Author manuscript; available in PMC 2013 August 15.

NIH Public Access Author Manuscript Stat Sin. Author manuscript; available in PMC 2013 August 15. NIH Public Access Author Manuscript Published in final edited form as: Stat Sin. 2012 ; 22: 1041 1074. ON MODEL SELECTION STRATEGIES TO IDENTIFY GENES UNDERLYING BINARY TRAITS USING GENOME-WIDE ASSOCIATION

More information

Use of hidden Markov models for QTL mapping

Use of hidden Markov models for QTL mapping Use of hidden Markov models for QTL mapping Karl W Broman Department of Biostatistics, Johns Hopkins University December 5, 2006 An important aspect of the QTL mapping problem is the treatment of missing

More information

QTL model selection: key players

QTL model selection: key players Bayesian Interval Mapping. Bayesian strategy -9. Markov chain sampling 0-7. sampling genetic architectures 8-5 4. criteria for model selection 6-44 QTL : Bayes Seattle SISG: Yandell 008 QTL model selection:

More information

Variance Component Models for Quantitative Traits. Biostatistics 666

Variance Component Models for Quantitative Traits. Biostatistics 666 Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond

More information

Multiple QTL mapping

Multiple QTL mapping Multiple QTL mapping Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] 1 Why? Reduce residual variation = increased power

More information

Mapping QTL to a phylogenetic tree

Mapping QTL to a phylogenetic tree Mapping QTL to a phylogenetic tree Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman Human vs mouse www.daviddeen.com 3 Intercross

More information

I Have the Power in QTL linkage: single and multilocus analysis

I Have the Power in QTL linkage: single and multilocus analysis I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department

More information

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M. STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).

More information

Causal Graphical Models in Systems Genetics

Causal Graphical Models in Systems Genetics 1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation

More information

Nature Genetics: doi: /ng Supplementary Figure 1. Number of cases and proxy cases required to detect association at designs.

Nature Genetics: doi: /ng Supplementary Figure 1. Number of cases and proxy cases required to detect association at designs. Supplementary Figure 1 Number of cases and proxy cases required to detect association at designs. = 5 10 8 for case control and proxy case control The ratio of controls to cases (or proxy cases) is 1.

More information

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype.

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype. Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype. please read pages 38-47; 49-55;57-63. Slide 1 of Chapter 2 1 Extension sot Mendelian Behavior of Genes Single gene inheritance

More information

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/

More information

Overview. Background

Overview. Background Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems

More information

Prediction of the Confidence Interval of Quantitative Trait Loci Location

Prediction of the Confidence Interval of Quantitative Trait Loci Location Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28

More information

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate Natural Selection Population Dynamics Humans, Sickle-cell Disease, and Malaria How does a population of humans become resistant to malaria? Overproduction Environmental pressure/competition Pre-existing

More information

Biology. Revisiting Booklet. 6. Inheritance, Variation and Evolution. Name:

Biology. Revisiting Booklet. 6. Inheritance, Variation and Evolution. Name: Biology 6. Inheritance, Variation and Evolution Revisiting Booklet Name: Reproduction Name the process by which body cells divide:... What kind of cells are produced this way? Name the process by which

More information

Stationary Distribution of the Linkage Disequilibrium Coefficient r 2

Stationary Distribution of the Linkage Disequilibrium Coefficient r 2 Stationary Distribution of the Linkage Disequilibrium Coefficient r 2 Wei Zhang, Jing Liu, Rachel Fewster and Jesse Goodman Department of Statistics, The University of Auckland December 1, 2015 Overview

More information

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm

More information

1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c.

1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c. 1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c. Changes in allele frequencies in a population across generations

More information

Genetics (patterns of inheritance)

Genetics (patterns of inheritance) MENDELIAN GENETICS branch of biology that studies how genetic characteristics are inherited MENDELIAN GENETICS Gregory Mendel, an Augustinian monk (1822-1884), was the first who systematically studied

More information

Module 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline

Module 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline Module 4: Bayesian Methods Lecture 9 A: Default prior selection Peter Ho Departments of Statistics and Biostatistics University of Washington Outline Je reys prior Unit information priors Empirical Bayes

More information

Evolutionary Genetics Midterm 2008

Evolutionary Genetics Midterm 2008 Student # Signature The Rules: (1) Before you start, make sure you ve got all six pages of the exam, and write your name legibly on each page. P1: /10 P2: /10 P3: /12 P4: /18 P5: /23 P6: /12 TOT: /85 (2)

More information

BIOL Evolution. Lecture 9

BIOL Evolution. Lecture 9 BIOL 432 - Evolution Lecture 9 J Krause et al. Nature 000, 1-4 (2010) doi:10.1038/nature08976 Selection http://www.youtube.com/watch?v=a38k mj0amhc&feature=playlist&p=61e033 F110013706&index=0&playnext=1

More information

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) 12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²

More information

Statistical issues in QTL mapping in mice

Statistical issues in QTL mapping in mice Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping

More information

Network Biology-part II

Network Biology-part II Network Biology-part II Jun Zhu, Ph. D. Professor of Genomics and Genetic Sciences Icahn Institute of Genomics and Multi-scale Biology The Tisch Cancer Institute Icahn Medical School at Mount Sinai New

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

EM algorithm. Rather than jumping into the details of the particular EM algorithm, we ll look at a simpler example to get the idea of how it works

EM algorithm. Rather than jumping into the details of the particular EM algorithm, we ll look at a simpler example to get the idea of how it works EM algorithm The example in the book for doing the EM algorithm is rather difficult, and was not available in software at the time that the authors wrote the book, but they implemented a SAS macro to implement

More information

Problems for 3505 (2011)

Problems for 3505 (2011) Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.

More information

Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo

Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo Friday Harbor 2017 From Genetics to GWAS (Genome-wide Association Study) Sept 7 2017 David Fardo Purpose: prepare for tomorrow s tutorial Genetic Variants Quality Control Imputation Association Visualization

More information

(Genome-wide) association analysis

(Genome-wide) association analysis (Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by

More information

THE WORK OF GREGOR MENDEL

THE WORK OF GREGOR MENDEL GENETICS NOTES THE WORK OF GREGOR MENDEL Genetics-. - Austrian monk- the father of genetics- carried out his work on. Pea flowers are naturally, which means that sperm cells fertilize the egg cells in

More information

Quantitative trait evolution with mutations of large effect

Quantitative trait evolution with mutations of large effect Quantitative trait evolution with mutations of large effect May 1, 2014 Quantitative traits Traits that vary continuously in populations - Mass - Height - Bristle number (approx) Adaption - Low oxygen

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Exam 5 Review Questions and Topics

Exam 5 Review Questions and Topics Exam 5 Review Questions and Topics 1. In your own words, define Methodological Naturalism and what it can and cannot test: 2. Fill out the following table: Discovery Science What it does? Hypothesis-based

More information

Miller & Levine Biology

Miller & Levine Biology A Correlation of To the Science Biology A Correlation of, 2014 to the, Table of Contents From Molecules to Organisms: Structures and Processes... 3 Ecosystems: Interactions, Energy, and Dynamics... 4 Heredity:

More information

Joint Analysis of Multiple Gene Expression Traits to Map Expression Quantitative Trait Loci. Jessica Mendes Maia

Joint Analysis of Multiple Gene Expression Traits to Map Expression Quantitative Trait Loci. Jessica Mendes Maia ABSTRACT MAIA, JESSICA M. Joint Analysis of Multiple Gene Expression Traits to Map Expression Quantitative Trait Loci. (Under the direction of Professor Zhao-Bang Zeng). The goal of this dissertation is

More information

Linear Regression (1/1/17)

Linear Regression (1/1/17) STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression

More information

List the five conditions that can disturb genetic equilibrium in a population.(10)

List the five conditions that can disturb genetic equilibrium in a population.(10) List the five conditions that can disturb genetic equilibrium in a population.(10) The five conditions are non-random mating, small population size, immigration or emigration, mutations, and natural selection.

More information

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012 Quantile based Permutation Thresholds for QTL Hotspots Brian S Yandell and Elias Chaibub Neto 17 March 2012 2012 Yandell 1 Fisher on inference We may at once admit that any inference from the particular

More information

Binomial Mixture Model-based Association Tests under Genetic Heterogeneity

Binomial Mixture Model-based Association Tests under Genetic Heterogeneity Binomial Mixture Model-based Association Tests under Genetic Heterogeneity Hui Zhou, Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 April 30,

More information

Parts 2. Modeling chromosome segregation

Parts 2. Modeling chromosome segregation Genome 371, Autumn 2018 Quiz Section 2 Meiosis Goals: To increase your familiarity with the molecular control of meiosis, outcomes of meiosis, and the important role of crossing over in generating genetic

More information

Statistical testing. Samantha Kleinberg. October 20, 2009

Statistical testing. Samantha Kleinberg. October 20, 2009 October 20, 2009 Intro to significance testing Significance testing and bioinformatics Gene expression: Frequently have microarray data for some group of subjects with/without the disease. Want to find

More information

Binary trait mapping in experimental crosses with selective genotyping

Binary trait mapping in experimental crosses with selective genotyping Genetics: Published Articles Ahead of Print, published on May 4, 2009 as 10.1534/genetics.108.098913 Binary trait mapping in experimental crosses with selective genotyping Ani Manichaikul,1 and Karl W.

More information

GREENWOOD PUBLIC SCHOOL DISTRICT Genetics Pacing Guide FIRST NINE WEEKS Semester 1

GREENWOOD PUBLIC SCHOOL DISTRICT Genetics Pacing Guide FIRST NINE WEEKS Semester 1 FIRST NINE WEEKS Semester 1 1 Aug. 4 1 Introduction to Course Aug. 7 11 5 2 Aug. 14 18 5 Overarching Science Engineering Practices (SEPs) These concepts and skills should be continuously embedded during

More information

On Computation of P-values in Parametric Linkage Analysis

On Computation of P-values in Parametric Linkage Analysis On Computation of P-values in Parametric Linkage Analysis Azra Kurbašić Centre for Mathematical Sciences Mathematical Statistics Lund University p.1/22 Parametric (v. Nonparametric) Analysis The genetic

More information

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have. Section 1: Chromosomes and Meiosis KEY CONCEPT Gametes have half the number of chromosomes that body cells have. VOCABULARY somatic cell autosome fertilization gamete sex chromosome diploid homologous

More information

Science Unit Learning Summary

Science Unit Learning Summary Learning Summary Inheritance, variation and evolution Content Sexual and asexual reproduction. Meiosis leads to non-identical cells being formed while mitosis leads to identical cells being formed. In

More information

Linkage analysis and QTL mapping in autotetraploid species. Christine Hackett Biomathematics and Statistics Scotland Dundee DD2 5DA

Linkage analysis and QTL mapping in autotetraploid species. Christine Hackett Biomathematics and Statistics Scotland Dundee DD2 5DA Linkage analysis and QTL mapping in autotetraploid species Christine Hackett Biomathematics and Statistics Scotland Dundee DD2 5DA Collaborators John Bradshaw Zewei Luo Iain Milne Jim McNicol Data and

More information

Resultants in genetic linkage analysis

Resultants in genetic linkage analysis Journal of Symbolic Computation 41 (2006) 125 137 www.elsevier.com/locate/jsc Resultants in genetic linkage analysis Ingileif B. Hallgrímsdóttir a,, Bernd Sturmfels b a Department of Statistics, University

More information

SOLUTIONS TO EXERCISES FOR CHAPTER 9

SOLUTIONS TO EXERCISES FOR CHAPTER 9 SOLUTIONS TO EXERCISES FOR CHPTER 9 gronomy 65 Statistical Genetics W. E. Nyquist March 00 Exercise 9.. a. lgebraic method for the grandparent-grandoffspring covariance (see parent-offspring covariance,

More information