Inferring Genetic Architecture of Complex Biological Processes

Similar documents
Model Selection for Multiple QTL

contact information & resources

Causal Graphical Models in Systems Genetics

Mapping QTL to a phylogenetic tree

Multiple QTL mapping

Gene mapping in model organisms

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012

Bayesian Model Selection for Multiple QTL. Brian S. Yandell

QTL model selection: key players

Introduction to QTL mapping in model organisms

Statistical issues in QTL mapping in mice

Causal Model Selection Hypothesis Tests in Systems Genetics

Mapping multiple QTL in experimental crosses

QTL Model Search. Brian S. Yandell, UW-Madison January 2017

Introduction to QTL mapping in model organisms

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Fast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets

A Statistical Framework for Expression Trait Loci (ETL) Mapping. Meng Chen

Evolution of phenotypic traits

QTL model selection: key players

Inferring Causal Phenotype Networks from Segregating Populat

Lecture WS Evolutionary Genetics Part I 1

what is Bayes theorem? posterior = likelihood * prior / C

Joint Analysis of Multiple Gene Expression Traits to Map Expression Quantitative Trait Loci. Jessica Mendes Maia

Mapping multiple QTL in experimental crosses

Overview. Background

Introduction to QTL mapping in model organisms

Major Genes, Polygenes, and

Introduction to QTL mapping in model organisms

TECHNICAL REPORT NO December 1, 2008

Principles of QTL Mapping. M.Imtiaz

Genotype Imputation. Biostatistics 666

Linear Regression (1/1/17)

Proportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power

Latent Variable models for GWAs

Introduction to QTL mapping in model organisms

R/qtl workshop. (part 2) Karl Broman. Biostatistics and Medical Informatics University of Wisconsin Madison. kbroman.org

QTL Mapping I: Overview and using Inbred Lines

Quantile-based permutation thresholds for QTL hotspot analysis: a tutorial

(Genome-wide) association analysis

CSS 350 Midterm #2, 4/2/01

Lecture 8. QTL Mapping 1: Overview and Using Inbred Lines

BAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI

Lecture 9. QTL Mapping 2: Outbred Populations

Calculation of IBD probabilities

Anumber of statistical methods are available for map- 1995) in some standard designs. For backcross populations,

Bayesian Partition Models for Identifying Expression Quantitative Trait Loci

Causal Network Models for Correlated Quantitative Traits. outline

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION

Use of hidden Markov models for QTL mapping

NIH Public Access Author Manuscript Ann Appl Stat. Author manuscript; available in PMC 2011 January 7.

Causal Model Selection Hypothesis Tests. in Systems Genetics

1. what is the goal of QTL study?

Calculation of IBD probabilities

Cheng-Ruei Lee 1 *, Jill T. Anderson 2, Thomas Mitchell-Olds 1,3. Abstract. Introduction

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.

Lecture 11: Multiple trait models for QTL analysis

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

Eiji Yamamoto 1,2, Hiroyoshi Iwata 3, Takanari Tanabata 4, Ritsuko Mizobuchi 1, Jun-ichi Yonemaru 1,ToshioYamamoto 1* and Masahiro Yano 5,6

Prediction of the Confidence Interval of Quantitative Trait Loci Location

Partitioning Genetic Variance

Chapter 2: Extensions to Mendel: Complexities in Relating Genotype to Phenotype.

Solutions to Problem Set 4

DISSECTION OF COMPLEX GENE EXPRESSION USING THE COMBINED ANALYSIS OF PLEIOTROPY AND EPISTASIS

The genomes of recombinant inbred lines

Seattle Summer Institute : Systems Genetics for Experimental Crosses

Identifying the molecular basis of QTLs: eqtls add a new dimension

A new simple method for improving QTL mapping under selective genotyping

Network Biology-part II

MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES

Thematic review series: Systems Biology Approaches to Metabolic and Cardiovascular Disorders

Linkage and Chromosome Mapping

Quantitative Genetics I: Traits controlled my many loci. Quantitative Genetics: Traits controlled my many loci

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE

An Introduction to the spls Package, Version 1.0

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Theoretical and computational aspects of association tests: application in case-control genome-wide association studies.

STUDIES of genotype phenotype association are central to. Multivariate Analysis of Genotype Phenotype Association HIGHLIGHTED ARTICLE INVESTIGATION

Quantitative Genetics

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

A partial regression coefficient analysis framework to infer candidate genes potentially causal to traits in recombinant inbred lines

MANY complex human diseases and traits of bio- tive corrections for multiple testing. Non-Bayesian model

AP Biology Essential Knowledge Cards BIG IDEA 1

Variance Component Models for Quantitative Traits. Biostatistics 666

Natural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate

AP Biology Curriculum Framework

A CAUSAL GENE NETWORK WITH GENETIC VARIATIONS INCORPORATING BIOLOGICAL KNOWLEDGE AND LATENT VARIABLES. By Jee Young Moon

arxiv: v1 [stat.ap] 7 Oct 2010

Learning in Bayesian Networks

Hierarchical Generalized Linear Models for Multiple QTL Mapping

Computational Approaches to Statistical Genetics

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

THE problem of identifying the genetic factors un- QTL models because of their ability to separate linked

Introduction to Bioinformatics

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

Bayesian Inference of Interactions and Associations

MULTIPLE-TRAIT MULTIPLE-INTERVAL MAPPING OF QUANTITATIVE-TRAIT LOCI ROBY JOEHANES

An applied statistician does probability:

Transcription:

Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison http://www.stat.wisc.edu/~yandell/statgen fri 14 jul 2006 BTCI Yandell 2006 1 Multiple Traits & Microarrays 1. why study multiple traits together? 2-10 diabetes case study 2. design issues 11-13 selective phenotyping 3. why are traits correlated? 14-17 close linkage or pleiotropy? 4. modern high throughput 18-31 principal components & discriminant analysis 5. graphical models 32-36 building causal biochemical networks fri 14 jul 2006 BTCI Yandell 2006 2

1. why study multiple traits together? avoid reductionist approach to biology address physiological/biochemical mechanisms Schmalhausen (1942); Falconer (1952) separate close linkage from pleiotropy 1 locus or 2 linked loci? identify epistatic interaction or canalization influence of genetic background establish QTL x environment interactions decompose genetic correlation among traits increase power to detect QTL fri 14 jul 2006 BTCI Yandell 2006 3 Type 2 Diabetes Mellitus fri 14 jul 2006 BTCI Yandell 2006 4

Insulin Requirement decompensation from fri Unger 14 jul & 2006 Orci FASEB J. (2001) 15,312 BTCI Yandell 2006 5 glucose insulin (courtesy AD Attie) fri 14 jul 2006 BTCI Yandell 2006 6

studying diabetes in an F2 segregating cross of inbred lines B6.ob x BTBR.ob F1 F2 selected mice with ob/ob alleles at leptin gene (chr 6) measured and mapped body weight, insulin, glucose at various ages (Stoehr et al. 2000 Diabetes) sacrificed at 14 weeks, tissues preserved gene expression data Affymetrix microarrays on parental strains, F1 (Nadler et al. 2000 PNAS; Ntambi et al. 2002 PNAS) RT-PCR for a few mrna on 108 F2 mice liver tissues (Lan et al. 2003 Diabetes; Lan et al. 2003 Genetics) Affymetrix microarrays on 60 F2 mice liver tissues design (Jin et al. 2004 Genetics tent. accept) analysis (work in prep.) fri 14 jul 2006 BTCI Yandell 2006 7 The intercross (from K Broman) λ fri 14 jul 2006 BTCI Yandell 2006 8

why map gene expression as a quantitative trait? cis- or trans-action? does gene control its own expression? or is it influenced by one or more other genomic regions? evidence for both modes (Brem et al. 2002 Science) simultaneously measure all mrna in a tissue ~5,000 mrna active per cell on average ~30,000 genes in genome use genetic recombination as natural experiment mechanics of gene expression mapping measure gene expression in intercross (F2) population map expression as quantitative trait (QTL) adjust for multiple testing fri 14 jul 2006 BTCI Yandell 2006 9 LOD map for PDI: cis-regulation (Lan et al. 2003) fri 14 jul 2006 BTCI Yandell 2006 10

mrna expression as phenotype: interval mapping for SCD1 is complicated effect (add=blue, dom=red) -0.5 0.0 0.5 1.0 LOD 0 2 4 6 8 0 50 100 150 200 250 300 chr2 chr5 chr9 0 50 100 150 200 250 300 chr2 chr5 chr9 fri 14 jul 2006 BTCI Yandell 2006 11 Bayesian model assessment: number of QTL for SCD1 with R/bim QTL posterior Bayes factor ratios QTL posterior 0.00 0.05 0.10 0.15 0.20 0.25 posterior / prior 1 5 10 50 500 strong moderate weak 1 2 3 4 5 6 7 8 9 11 13 1 2 3 4 5 6 7 8 9 11 13 number of QTL number of QTL fri 14 jul 2006 BTCI Yandell 2006 12

Bayesian model assessment genetic architecture: chromosome pattern model posterior 0.00 0.05 0.10 0.15 3:1,2,3 4:2*1,2,3 pattern posterior 4:1,2,2*3 4:1,2*2,3 5:3*1,2,3 5:2*1,2,2*3 5:2*1,2*2,3 6:3*1,2,2*3 6:3*1,2*2,3 5:1,2*2,2*3 6:4*1,2,3 6:2*1,2*2,2*3 2:1,3 3:2*1,2 2:1,2 1 3 5 7 9 11 13 15 model index posterior / prior 0.2 0.4 0.6 0.8 3 4 4 4 Bayes factor ratios 5 6 moderate 6 5 5 6 5 6 weak 2 3 2 1 3 5 7 9 11 13 15 model index fri 14 jul 2006 BTCI Yandell 2006 13 Bayesian LOD and h 2 for SCD1 (summaries from R/bim) density 0.00 0.10 LOD 5 10 15 20 0 5 10 15 20 marginal LOD, m 1 1 2 3 4 5 6 7 8 9 10 12 14 LOD conditional on number of QTL density 0 1 2 3 4 heritability 0.1 0.3 0.5 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 marginal heritability, m 1 1 2 3 4 5 6 7 8 9 10 12 14 heritability conditional on number of QTL fri 14 jul 2006 BTCI Yandell 2006 14

trans-acting QTL for SCD1 Bayesian model averaging with R/bim additive QTL? dominant QTL? fri 14 jul 2006 BTCI Yandell 2006 15 1-D and 2-D marginals pr(qtl at λ Y,X, m) unlinked loci linked loci fri 14 jul 2006 BTCI Yandell 2006 16

2-D scan: assumes only 2 QTL! epistasis LOD peaks joint LOD peaks fri 14 jul 2006 BTCI Yandell 2006 17 sub-peaks can be easily overlooked! fri 14 jul 2006 BTCI Yandell 2006 18

epistatic model fit fri 14 jul 2006 BTCI Yandell 2006 19 Cockerham epistatic effects fri 14 jul 2006 BTCI Yandell 2006 20

mapping microarray data single gene expression as trait (single QTL) Dumas et al. (2000 J Hypertens) overview, wish lists Jansen, Nap (2001 Trends Gen); Cheung, Spielman (2002); Doerge (2002 Nat Rev Gen); Bochner (2003 Nat Rev Gen) microarray scan via 1 QTL interval mapping Brem et al. (2002 Science); Schadt et al. (2003 Nature); Yvert et al. (2003 Nat Gen) found putative cis- and trans- acting genes multivariate and multiple QTL approach Lan et al. (2003 Genetics) fri 14 jul 2006 BTCI Yandell 2006 21 fri 14 jul 2006 BTCI Yandell 2006 22

2. design issues for expensive phenotypes (thanks to CF Amy Jin) microarray analysis ~ $1000 per mouse can only afford to assay 60 of 108 in panel wish to not lose much power to detect QTL selective phenotyping genotype all individuals in panel select subset for phenotyping previous studies can provide guide fri 14 jul 2006 BTCI Yandell 2006 23 selective phenotyping emphasize additive effects in F2 F2 design: 1QQ:2Qq:1qq best design for additive only: 1QQ:1Qq drop heterozygotes (Qq) reduce sample size by half with no power loss emphasize general effects in F2 best design: 1QQ:1Qq:1qq drop half of heterozygotes (25% reduction) multiple loci same idea but care is needed drop 7/16 of sample for two unlinked loci fri 14 jul 2006 BTCI Yandell 2006 24

is this relevant to large QTL studies? why not phenotype entire mapping panel? selectively phenotype subset of 50-67% may capture most effects with little loss of power two-stage selective phenotyping? genotype & phenotype subset of 100-300 could selectively phenotype using whole genome QTL map to identify key genomic regions selectively phenotype subset using key regions fri 14 jul 2006 BTCI Yandell 2006 25 3. why are traits correlated? environmental correlation non-genetic, controllable by design historical correlation (learned behavior) physiological correlation (same body) genetic correlation pleiotropy one gene, many functions common biochemical pathway, splicing variants close linkage two tightly linked genes genotypes Q are collinear fri 14 jul 2006 BTCI Yandell 2006 26

interplay of pleiotropy & correlation pleiotropy only correlation only Korol et al. (2001) both fri 14 jul 2006 BTCI Yandell 2006 27 3 correlated traits (Jiang Zeng 1995) ellipses centered on genotypic value width for nominal frequency main axis angle environmental correlation 3 QTL, F2 ρ P = 0.3, ρ G = 0.54, ρ E = 0.2 27 genotypes note signs of genetic and environmental correlation jiang1-2 -1 0 1 2-3 -2-1 0 1 2 3-3 -2-1 0 1 2 3 jiang3 ρ P = -0.07, ρ G = -0.22, ρ E = 0 jiang2 jiang3 fri 14 jul 2006 BTCI Yandell 2006 28 jiang2-3 -2-1 0 1 2 3 jiang1-2 -1 0 1 2 ρ P = 0.06, ρ G = 0.68, ρ E = -0.2-2 -1 0 1 2

pleiotropy or close linkage? 2 traits, 2 qtl/trait pleiotropy @ 54cM linkage @ 114,128cM Jiang Zeng (1995) fri 14 jul 2006 BTCI Yandell 2006 29