contact information & resources
|
|
- Agatha Webster
- 6 years ago
- Views:
Transcription
1 Seattle Summer Institute 006 Advanced QTL Brian S. Yandell University of Wisconsin-Madison Bayesian QTL mapping & model selection data examples in detail multiple phenotypes & microarrays software demo & automated strategy QTL : Overview Seattle SISG: Yandell contact information & resources byandell@wisc.edu web: QTL & microarray resources references, software, people thanks: students: Jaya Satagopan, Pat Gaffney, Fei Zou, Amy Jin, W. Whipple Neely faculty/staff: Alan Attie, Michael Newton, Nengjun Yi, Gary Churchill, Hong Lan, Christina Kendziorski, Tom Osborn, Jason Fine, Tapan Mehta, Hao Wu, Samprit Banerjee, Daniel Shriner QTL : Overview Seattle SISG: Yandell 006
2 Bayesian Interval Mapping 1. what is goal of QTL study? -8. Bayesian QTL mapping Markov chain sampling sampling across architectures epistatic interactions comparing models QTL : Bayes Seattle SISG: Yandell what is the goal of QTL study? uncover underlying biochemistry identify how networks function, break down find useful candidates for (medical) intervention epistasis may play key role statistical goal: maximize number of correctly identified QTL basic science/evolution how is the genome organized? identify units of natural selection additive effects may be most important (Wright/Fisher debate) statistical goal: maximize number of correctly identified QTL select elite individuals predict phenotype (breeding value) using suite of characteristics (phenotypes) translated into a few QTL statistical goal: mimimize prediction error QTL : Bayes Seattle SISG: Yandell 006
3 cross two inbred lines linkage disequilibrium associations linked segregating QTL (after Gary Churchill) QTL Marker Trait QTL : Bayes Seattle SISG: Yandell pragmatics of multiple QTL evaluate some objective for model given data classical likelihood Bayesian posterior search over possible genetic architectures (models) number and positions of loci gene action: additive, dominance, epistasis estimate features of model means, variances & covariances, confidence regions marginal or conditional distributions art of model selection how select best or better model(s)? how to search over useful subset of possible models? QTL : Bayes Seattle SISG: Yandell 006 4
4 advantages of multiple QTL approach improve statistical power, precision increase number of QTL detected better estimates of loci: less bias, smaller intervals improve inference of complex genetic architecture patterns and individual elements of epistasis appropriate estimates of means, variances, covariances asymptotically unbiased, efficient assess relative contributions of different QTL improve estimates of genotypic values less bias (more accurate) and smaller variance (more precise) mean squared error = MSE = (bias) + variance QTL : Bayes Seattle SISG: Yandell major QTL on linkage map 3 Pareto diagram of QTL effects additive effect major QTL (modifiers) minor QTL polygenes rank order of QTL QTL : Bayes Seattle SISG: Yandell 006 6
5 limits of multiple QTL? limits of statistical inference power depends on sample size, heritability, environmental variation best model balances fit to data and complexity (model size) genetic linkage = correlated estimates of gene effects limits of biological utility sampling: only see some patterns with many QTL marker assisted selection (Bernardo 001 Crop Sci) 10 QTL ok, 50 QTL are too many phenotype better predictor than genotype when too many QTL increasing sample size may not give multiple QTL any advantage hard to select many QTL simultaneously 3 m possible genotypes to choose from QTL : Bayes Seattle SISG: Yandell QTL below detection level? problem of selection bias QTL of modest effect only detected sometimes their effects are biased upwards when detected probability that QTL detected avoids sharp in/out dichotomy avoid pitfalls of one best model examine better models with more probable QTL build m = number of QTL detected into QTL model directly allow uncertainty in genetic architecture model selection over genetic architecture QTL : Bayes Seattle SISG: Yandell 006 8
6 . Bayesian QTL mapping Reverend Thomas Bayes ( ) part-time mathematician buried in Bunhill Cemetary, Moongate, London famous paper in 1763 Phil Trans Roy Soc London was Bayes the first with this idea? (Laplace?) basic idea (from Bayes original example) two billiard balls tossed at random (uniform) on table where is first ball if the second is to its left? prior: anywhere on the table posterior: more likely toward right end of table QTL : Bayes Seattle SISG: Yandell Bayes posterior for normal data n large n small prior prior mean actual mean n large n small prior prior mean actual mean y = phenotype values y = phenotype values small prior variance large prior variance QTL : Bayes Seattle SISG: Yandell
7 Bayes posterior for normal data model y i = µ + e i environment e ~ N( 0, σ ), σ known likelihood y~n( µ, σ ) prior µ ~N( µ 0, κσ ), κ known posterior: mean tends to sample mean single individual µ ~N( µ 0 + b 1 (y 1 µ 0 ), b 1 σ ) sample of n individuals µ ~ N with y ( bn y + (1 bn ) µ 0, bnσ / n) = sum y / n { i= 1,..., n} i fudge factor (shrinks to 1) κn bn = 1 κn + 1 QTL : Bayes Seattle SISG: Yandell Bayesian QTL: key players observed measurements y = phenotypic trait m = markers & linkage map i = individual index (1,,n) missing data missing marker data missing q = QT genotypes alleles QQ, Qq, or qq at locus unknown quantities λ = QT locus (or loci) µ = phenotype model parameters H = QTL model/genetic architecture pr(q m,λ,h) genotype model grounded by linkage map, experimental cross recombination yields multinomial for q given m pr(y q,µ,h) phenotype model distribution shape (assumed normal here) unknown parameters µ (could be non-parametric) observed mx Yy Qq unknown λ θµ H after Sen Churchill (001) QTL : Bayes Seattle SISG: Yandell 006 1
8 pr(y q,µ) phenotype model n large n small prior qq Qq QQ y = phenotype values QTL : Bayes Seattle SISG: Yandell Bayes posterior QTL means posterior centered on sample genotypic mean but shrunken slightly toward overall mean prior: µ q ~ N ( y, κσ ) posterior: µ ~ N q ( bq yq + (1 bq ) y, bqσ / nq ) fudge factor: n b q q = count{ q κnq = 1 κn + 1 q i = q}, y q = sum y { qi = q} i / n q QTL : Bayes Seattle SISG: Yandell
9 partition of multiple QTL effects partition genotype-specific mean into QTL effects µ q = mean + main effects + epistatic interactions µ q = µ + β q = µ +sum j in H β qj priors on mean and effects µ ~ N(µ 0, κ 0 σ ) grand mean β q ~ N(0, κ 1 σ ) model-independent genotypic effect β qj ~ N(0, κ 1 σ / H ) effects down-weighted by size of H determine hyper-parameters via empirical Bayes h 1 h σ σ G µ 0 Y and κ1 = QTL : Bayes Seattle SISG: Yandell pr(q m,λ) recombination model pr(q m,λ) = pr(geno map, locus) pr(geno flanking markers, locus) q? m1 m m 3 m m 4 5 m6 markers λ distance along chromosome QTL : Bayes Seattle SISG: Yandell
10 how does phenotype Y improve posterior for genotype Q? D4Mit41 D4Mit14 bp what are probabilities for genotype Q between markers? recombinants AA:AB all 1:1 if ignore Y and if we use Y? AA AA AB AA AA AB AB AB Genotype QTL : Bayes Seattle SISG: Yandell posterior on QTL genotypes full conditional for q depends data for individual i proportional to prior pr(q m i, λ ) weight toward q that agrees with flanking markers proportional to likelihood pr(y i q,µ) weight toward q so that group mean µ q y i phenotype and prior recombination may conflict posterior recombination balances these two weights this is E step in EM for classical QTL analysis pr( q mi, λ)pr( yi q, µ ) pr( q yi, mi, µ, λ) = pr( y m, µ, λ) i i QTL : Bayes Seattle SISG: Yandell
11 Bayesian model posterior augment data (y,m) with unknowns q study unknowns (µ,λ,q) given data (y,m) properties of posterior pr(µ,λ,q y,m ) sample from posterior in some clever way multiple imputation or MCMC pr( y q, µ )pr( q m, λ)pr( µ )pr( λ m) pr( q, µ, λ y, m) = pr( y m) pr( µ, λ y, m) = sum q pr( q, µ, λ y, n) QTL : Bayes Seattle SISG: Yandell Bayesian priors for QTL missing genotypes q pr( q m, λ ) recombination model is formally a prior effects ( µ, σ ) prior = pr( µ q σ ) pr(σ ) use conjugate priors for normal phenotype pr( µ q σ ) = normal pr(σ ) = inverse chi-square each locus λ may be uniform over genome pr(λ m ) = 1 / length of genome combined prior pr( q, µ, λ m ) = pr( q m, λ ) pr( µ ) pr(λ m ) QTL : Bayes Seattle SISG: Yandell 006 0
12 3. Markov chain sampling of architectures construct Markov chain around posterior want posterior as stable distribution of Markov chain in practice, the chain tends toward stable distribution initial values may have low posterior probability burn-in period to get chain mixing well hard to sample (q, µ, λ, H) from joint posterior update (q,µ,λ) from full conditionals for model H update genetic architecture H ( q, µ, λ, H) ~ pr( q, µ, λ, H y, m) ( q, µ, λ, H) 1 ( q, µ, λ, H) L ( q, µ, λ, H) N QTL : Bayes Seattle SISG: Yandell MCMC sampling of (λ,q,µ) Gibbs sampler genotypes q effects µ not loci λ q ~ pr( q y, m, µ, λ) pr( y q, µ )pr( µ ) µ ~ pr( y q) Metropolis-Hastings sampler extension of Gibbs sampler does not require normalization pr( q m ) = sum λ pr( q m, λ ) pr(λ ) i pr( q m, λ)pr( λ m) λ ~ pr( q m) i QTL : Bayes Seattle SISG: Yandell 006
13 full conditional for locus cannot easily sample from locus full conditional pr(λ y,m,µ,q) = pr( λ m,q) = pr( q m, λ ) pr(λ ) / constant constant is very difficult to compute explicitly must average over all possible loci λ over genome must do this for every possible genotype q Gibbs sampler will not work in general but can use method based on ratios of probabilities Metropolis-Hastings is extension of Gibbs sampler QTL : Bayes Seattle SISG: Yandell Gibbs sampler idea toy problem want to study two correlated effects could sample directly from their bivariate distribution instead use Gibbs sampler: sample each effect from its full conditional given the other pick order of sampling at random repeat many times µ ~ N, µ 0 ρ µ ~ N 1 µ ~ N ( ρµ,1 ρ ) ( ρµ,1 ρ ) QTL : Bayes Seattle SISG: Yandell ρ 1
14 Gibbs sampler samples: ρ = 0.6 N = 50 samples N = 00 samples Gibbs: mean Markov chain index Gibbs: mean 1 Gibbs: mean Gibbs: mean Gibbs: mean Gibbs: mean Markov chain index Gibbs: mean 1 Gibbs: mean Gibbs: mean Gibbs: mean Markov chain index Gibbs: mean Markov chain index Gibbs: mean 1 QTL : Bayes Seattle SISG: Yandell Metropolis-Hastings idea want to study distribution f(λ) take Monte Carlo samples unless too complicated take samples using ratios of f Metropolis-Hastings samples: propose new value λ * near (?) current value λ from some distribution g accept new value with prob a Gibbs sampler: a = 1 always a = * * f ( λ ) g( λ λ) min 1, * f ( λ) g( λ λ ) QTL : Bayes Seattle SISG: Yandell f(λ) g(λ λ * )
15 Metropolis-Hastings samples mcmc sequence N = 00 samples N = 1000 samples narrow g wide g narrow g wide g mcmc sequence mcmc sequence mcmc sequence histogram pr(θ Y) θ λ θλ θ λ θ λ histogram pr(θ Y) histogram pr(θ Y) histogram pr(θ Y) θ θ θ θ QTL : Bayes Seattle SISG: Yandell sampling across architectures search across genetic architectures M of various sizes allow change in number of QTL allow change in types of epistatic interactions methods for search reversible jump MCMC Gibbs sampler with loci indicators complexity of epistasis Fisher-Cockerham effects model general multi-qtl interaction & limits of inference QTL : Bayes Seattle SISG: Yandell 006 8
16 model selection in regression consider known genotypes q at known loci λ models with 1 or QTL jump between 1-QTL and -QTL models adjust parameters when model changes β q1 estimate changes between models 1 and due to collinearity of QTL genotypes m = 1 : µ q = µ + β q1 m = : µ q = µ + β q1 + β q QTL : Bayes Seattle SISG: Yandell geometry of reversible jump Move Between Models Reversible Jump Sequence b β m= m=1 c1 = 0.7 b β β b1 b1 1 β 1 QTL : Bayes Seattle SISG: Yandell
17 geometry allowing q and λ to change a short sequence first 1000 with m<3 β b b1 b β b1 β 1 β 1 QTL : Bayes Seattle SISG: Yandell collinear QTL = correlated effects 4-week 8-week additive effect cor = additive effect cor = additive 1 effect additive 1 effect 1 linked QTL = collinear genotypes correlated estimates of effects (negative if in coupling phase) sum of linked effects usually fairly constant QTL : Bayes Seattle SISG: Yandell 006 3
18 reversible jump MCMC idea 0 λ 1 λ m+1 λ λ m L Metropolis-Hastings updates: draw one of three choices update m-qtl model with probability 1-b(m+1)-d(m) update current model using full conditionals sample m QTL loci, effects, and genotypes add a locus with probability b(m+1) propose a new locus and innovate new genotypes & genotypic effect decide whether to accept the birth of new locus drop a locus with probability d(m) propose dropping one of existing loci decide whether to accept the death of locus Satagopan Yandell (1996, 1998); Sillanpaa Arjas (1998); Stevens Fisch (1998) these build on RJ-MCMC idea of Green (1995); Richardson Green (1997) QTL : Bayes Seattle SISG: Yandell Gibbs sampler with loci indicators consider only QTL at pseudomarkers every 1- cm modest approximation with little bias use loci indicators in each pseudomarker δ = 1 if QTL present δ = 0 if no QTL present Gibbs sampler on loci indicators δ relatively easy to incorporate epistasis Yi, Yandell, Churchill, Allison, Eisen, Pomp (005 Genetics) (see earlier work of Nengjun Yi and Ina Hoeschele) µ = µ + δ β + δ q β 1 q1 q QTL : Bayes Seattle SISG: Yandell
19 5. Gene Action and Epistasis additive, dominant, recessive, general effects of a single QTL (Gary Churchill) QTL : Bayes Seattle SISG: Yandell additive effects of two QTL (Gary Churchill) µ q = µ + β q1 + β q QTL : Bayes Seattle SISG: Yandell
20 Epistasis (Gary Churchill) The allelic state at one locus can mask or uncover the effects of allelic variation at another. - W. Bateson, QTL : Bayes Seattle SISG: Yandell epistasis in parallel pathways (GAC) Z keeps trait value low neither E 1 nor E is rate limiting X Y E 1 E Z loss of function alleles are segregating from parent A at E 1 and from parent B at E QTL : Bayes Seattle SISG: Yandell
21 epistasis in a serial pathway (GAC) Z keeps trait value high neither E 1 nor E is rate limiting X E 1 Y E Z loss of function alleles are segregating from parent B at E 1 and from parent A at E QTL : Bayes Seattle SISG: Yandell y = µ + e,var( e) = σ G q1 var( µ ) = σ h QTL with epistasis same phenotype model overview q q partition of genotypic value with epistasis µ = µ + β q G σg = σ + σ + β q = h QTL : Bayes Seattle SISG: Yandell β + h q1 partition of genetic variance & heritability = σ + σ + σ + h 1 1
22 epistatic interactions model space issues -QTL interactions only? or general interactions among multiple QTL? partition of effects Fisher-Cockerham or tree-structured or? model search issues epistasis between significant QTL check all possible pairs when QTL included? allow higher order epistasis? epistasis with non-significant QTL whole genome paired with each significant QTL? pairs of non-significant QTL? Yi Xu (000) Genetics; Yi, Xu, Allison (003) Genetics; Yi et al. (005) Genetics QTL : Bayes Seattle SISG: Yandell limits of epistatic inference power to detect effects epistatic model size grows exponentially H = 3 nqtl for general interactions power depends on ratio of n to model size want n / H to be fairly large (say > 5) n = 100, nqtl = 3, n / H 4 empty cells mess up adjusted (Type 3) tests missing q 1 Q / q 1 Q or q 1 Q q 3 / q 1 Q q 3 genotype null hypotheses not what you would expect can confound main effects and interactions can bias AA, AD, DA, DD partition QTL : Bayes Seattle SISG: Yandell 006 4
23 6. comparing QTL models balance model fit with model "complexity want maximum likelihood without too complicated a model information criteria quantifies the balance Bayes information criteria (BIC) for likelihood Bayes factors for Bayesian approach QTL : Bayes Seattle SISG: Yandell Bayes factors & BIC pr( model1 Y ) / pr( model Y ) pr( Y model1) B 1 = = pr( model ) / pr( model ) pr( Y model ) what is a Bayes factor? ratio of posterior odds to prior odds ratio of model likelihoods BF is equivalent to LR statistic when comparing two nested models simple hypotheses (e.g. 1 vs QTL) BF is equivalent to Bayes Information Criteria (BIC) for general comparison of any models want Bayes factor to be substantially larger than 1 (say 10 or more) 1 1 log( B1) = log( LR) ( p p ) log( n) QTL : Bayes Seattle SISG: Yandell
24 Bayes factors and genetic model H H = number of QTL prior pr(h) chosen by user posterior pr(h y,m) sampled marginal histogram shape affected by prior pr(h) pr( H y, m) / pr( H ) + 1 = pr( H + 1 y, m) / pr( H + 1) BF H,H prior probability pattern of QTL across genome gene action and epistasis e e exponential p Poisson e u uniform p p u u p eu u pu u u e e p p e e p e p e p e u u pu p e u m = number of QTL QTL : Bayes Seattle SISG: Yandell issues in computing Bayes factors BF insensitive to shape of prior on nqtl geometric, Poisson, uniform precision improves when prior mimics posterior BF sensitivity to prior variance on effects θ prior variance should reflect data variability resolved by using hyper-priors automatic algorithm; no need for user tuning easy to compute Bayes factors from samples sample posterior using MCMC posterior pr(nqtl y,m) is marginal histogram QTL : Bayes Seattle SISG: Yandell
25 examples in detail simulation study (after Stephens & Fisch (1998) days to flower for Brassica napus (plant) (n = 108) single chromosome with linked loci whole genome gonad shape in Drosophila spp. (insect) (n = 1000) multiple traits reduced by PC many QTL and epistasis expression phenotype (SCD1) in mice (n = 108) multiple QTL and epistasis obesity in mice (n = 41) epistatic QTLs with no main effects QTL : Data Seattle SISG: Yandell simulation with 8 QTL simulated F intercross, 8 QTL (Stephens, Fisch 1998) n=00, heritability = 50% detected 3 QTL increase to detect all 8 n=500, heritability to 97% QTL chr loci effect ch1 ch ch3 ch4 ch5 ch6 ch7 ch8 ch9 ch10 frequency in % Genetic map posterior number of QTL QTL : Data Seattle SISG: Yandell 006
26 loci pattern across genome notice which chromosomes have persistent loci best pattern found 4% of the time Chromosome m Count of QTL : Data Seattle SISG: Yandell Brassica napus: 1 chromosome 4-week & 8-week vernalization effect log(days to flower) genetic cross of Stellar (annual canola) Major (biennial rapeseed) 105 F1-derived double haploid (DH) lines homozygous at every locus (QQ or qq) 10 molecular markers (RFLPs) on LG9 two QTLs inferred on LG9 (now chromosome N) corroborated by Butruille (1998) exploiting synteny with Arabidopsis thaliana QTL : Data Seattle SISG: Yandell 006 4
27 Brassica 4- & 8-week data 8-week week week vernalization summaries of raw data joint scatter plots (identity line) separate histograms week vernalization QTL : Data Seattle SISG: Yandell Brassica credible regions 4-week 8-week additive additive distance (cm) distance (cm) QTL : Data Seattle SISG: Yandell 006 6
28 B. napus 8-week vernalization whole genome study 108 plants from double haploid similar genetics to backcross: follow 1 gamete parents are Major (biennial) and Stellar (annual) 300 markers across genome 19 chromosomes average 6cM between markers median 3.8cM, max 34cM 83% markers genotyped phenotype is days to flowering after 8 weeks of vernalization (cooling) Stellar parent requires vernalization to flower Ferreira et al. (1994); Kole et al. (001); Schranz et al. (00) QTL : Data Seattle SISG: Yandell row 1: # QTL row : pattern Bayesian model assessment col 1: posterior col : Bayes factor note error bars on bf evidence suggests 4-5 QTL N(-3),N3,N16 QTL posterior model posterior QTL posterior number of QTL * pattern posterior :,1 3:*,1 :,13 :,3 :,16 :,11 3:*,3 :,15 4:*,3,16 3:*,13 :, model index model index QTL : Data Seattle SISG: Yandell posterior / prior posterior / prior 5 e-01 1 e+01 5 e+0 Bayes factor ratios strong moderate weak number of QTL 1 Bayes factor ratios 3 strong moderate weak 3 4 3
29 Bayesian estimates of loci & effects histogram of loci blue line is density red lines at estimates loci histogram napus8 summaries with pattern 1,1,,3 and m 4 N N3 N estimate additive effects (red circles) grey points sampled from posterior blue line is cubic spline dashed line for SD additive N N3 N QTL : Data Seattle SISG: Yandell Bayesian model diagnostics pattern: N(),N3,N16 col 1: density col : boxplots by m environmental variance σ =.008, σ =.09 heritability h = 5% LOD = 16 (highly significant) but note change with m density density density marginal envvar, m marginal heritability, m marginal LOD, m envvar conditional on number of QTL QTL : Data Seattle SISG: Yandell envvar heritability LOD heritability conditional on number of QT LOD conditional on number of QTL
30 shape phenotype in BC study indexed by PC1 Liu et al. (1996) Genetics QTL : Data Seattle SISG: Yandell shape phenotype via PC Liu et al. (1996) Genetics QTL : Data Seattle SISG: Yandell 006 1
31 Zeng et al. (000) CIM vs. MIM composite interval mapping (Liu et al. 1996) narrow peaks miss some QTL multiple interval mapping (Zeng et al. 000) triangular peaks both conditional 1-D scans fixing all other "QTL" QTL : Data Seattle SISG: Yandell cim CIM, MIM and IM pairscan mim QTL : Data Seattle SISG: Yandell
32 QTL + epistasis: IM versus multiple imputation IM pairscan multiple imputation QTL : Data Seattle SISG: Yandell multiple QTL: CIM, MIM and BIM cim mim bim QTL : Data Seattle SISG: Yandell
33 studying diabetes in an F segregating cross of inbred lines B6.ob x BTBR.ob F1 F selected mice with ob/ob alleles at leptin gene (chr 6) measured and mapped body weight, insulin, glucose at various ages (Stoehr et al. 000 Diabetes) sacrificed at 14 weeks, tissues preserved gene expression data Affymetrix microarrays on parental strains, F1 key tissues: adipose, liver, muscle, β-cells novel discoveries of differential expression (Nadler et al. 000 PNAS; Lan et al. 00 in review; Ntambi et al. 00 PNAS) RT-PCR on 108 F mice liver tissues 15 genes, selected as important in diabetes pathways SCD1, PEPCK, ACO, FAS, GPAT, PPARgamma, PPARalpha, G6Pase, PDI, QTL : Data Seattle SISG: Yandell Multiple Interval Mapping (QTLCart) SCD1: multiple QTL plus epistasis! effect (add=blue, dom=red) LOD chr chr5 chr chr chr5 chr9 QTL : Data Seattle SISG: Yandell
34 Bayesian model assessment: number of QTL for SCD1 QTL posterior Bayes factor ratios QTL posterior posterior / prior strong moderate weak number of QTL number of QTL QTL : Data Seattle SISG: Yandell Bayesian LOD and h for SCD1 density LOD marginal LOD, m LOD conditional on number of QTL density heritability marginal heritability, m heritability conditional on number of QTL QTL : Data Seattle SISG: Yandell 006 0
35 Bayesian model assessment: chromosome QTL pattern for SCD1 model posterior :1,,3 4:*1,,3 pattern posterior 4:1,,*3 4:1,*,3 5:3*1,,3 5:*1,,*3 5:*1,*,3 6:3*1,,*3 6:3*1,*,3 5:1,*,*3 6:4*1,,3 6:*1,*,*3 :1,3 3:*1, :1, model index posterior / prior Bayes factor ratios 5 6 moderate weak model index QTL : Data Seattle SISG: Yandell trans-acting QTL for SCD1 (no epistasis yet: see Yi, Xu, Allison 003) dominance? QTL : Data Seattle SISG: Yandell 006
36 -D scan: assumes only QTL! epistasis LOD peaks joint LOD peaks QTL : Data Seattle SISG: Yandell sub-peaks can be easily overlooked! QTL : Data Seattle SISG: Yandell 006 4
37 epistatic model fit QTL : Data Seattle SISG: Yandell Cockerham epistatic effects QTL : Data Seattle SISG: Yandell 006 6
38 obesity in CAST/Ei BC onto M16i 41 mice (Daniel Pomp) (13 male, 08 female) 9 microsatellites on 19 chromosomes 114 cm map subcutaneous fat pads pre-adjusted for sex and dam effects Yi, Yandell, Churchill, Allison, Eisen, Pomp (005) Genetics (in press) QTL : Data Seattle SISG: Yandell non-epistatic analysis LOD score Bayes factor single QTL LOD profile multiple QTL Bayes factor profile QTL : Data Seattle SISG: Yandell 006 8
39 posterior profile of main effects in epistatic analysis Heritability Main effect Bayes factor main effects & heritability profile Bayes factor profile QTL : Data Seattle SISG: Yandell posterior profile of main effects in epistatic analysis QTL : Data Seattle SISG: Yandell
40 model selection via Bayes factors for epistatic model number of QTL QTL pattern QTL : Data Seattle SISG: Yandell posterior probability of effects Posterior probability Chr13(0,4)*Chr15(1,31) Chr7(50,75)*Chr19(15,45) Chr(7,85)*Chr14(1,41) Chr15(1,31)*Chr19(15,45) Chr(7,85)*Chr13(0,4) Chr1(6,54)*Chr18(43,71) Chr14(1,41) Chr7(50,75) Chr19(15,45) Chr1(6,54) Chr18(43,71) Chr15(1,31) Chr13(0,4) Chr(7,85) QTL : Data Seattle SISG: Yandell 006 3
41 scatterplot estimates of epistatic loci QTL : Data Seattle SISG: Yandell stronger epistatic effects QTL : Data Seattle SISG: Yandell
42 model selection for pairs QTL : Data Seattle SISG: Yandell our RJ-MCMC software R: freely available statistical computing application R library(bim) builds on Broman s library(qtl) QTLCart: statgen.ncsu.edu/qtlcart Bmapqtl incorporated into QTLCart (S Wang 003) R/bim initially designed by JM Satagopan (1996) major revision and extension by PJ Gaffney (001) whole genome, multivariate and long range updates speed improvements, pre-burnin built as official R library (H Wu, Yandell, Gaffney, CF Jin 003) R/bmqtl collaboration with N Yi, H Wu, GA Churchill initial working module: Winter 005 improved module and official release: Summer/Fall 005 major NIH grant (PI: Yi) QTL : Data Seattle SISG: Yandell
43 Multiple Traits & Microarrays 1. why study multiple traits together? -10 diabetes case study. design issues selective phenotyping 3. why are traits correlated? close linkage or pleiotropy? 4. modern high throughput principal components & discriminant analysis 5. graphical models 3-36 building causal biochemical networks Traits NCSU QTL II: Yandell why study multiple traits together? avoid reductionist approach to biology address physiological/biochemical mechanisms Schmalhausen (194); Falconer (195) separate close linkage from pleiotropy 1 locus or linked loci? identify epistatic interaction or canalization influence of genetic background establish QTL x environment interactions decompose genetic correlation among traits increase power to detect QTL Traits NCSU QTL II: Yandell 005
44 Type Diabetes Mellitus Traits NCSU QTL II: Yandell Insulin Requirement decompensation from Traits Unger & Orci FASEB J. (001) 15,31 NCSU QTL II: Yandell 005 4
45 glucose insulin (courtesy AD Attie) Traits NCSU QTL II: Yandell studying diabetes in an F segregating cross of inbred lines B6.ob x BTBR.ob F1 F selected mice with ob/ob alleles at leptin gene (chr 6) measured and mapped body weight, insulin, glucose at various ages (Stoehr et al. 000 Diabetes) sacrificed at 14 weeks, tissues preserved gene expression data Affymetrix microarrays on parental strains, F1 (Nadler et al. 000 PNAS; Ntambi et al. 00 PNAS) RT-PCR for a few mrna on 108 F mice liver tissues (Lan et al. 003 Diabetes; Lan et al. 003 Genetics) Affymetrix microarrays on 60 F mice liver tissues design (Jin et al. 004 Genetics tent. accept) analysis (work in prep.) Traits NCSU QTL II: Yandell 005 6
46 why map gene expression as a quantitative trait? cis- or trans-action? does gene control its own expression? or is it influenced by one or more other genomic regions? evidence for both modes (Brem et al. 00 Science) simultaneously measure all mrna in a tissue ~5,000 mrna active per cell on average ~30,000 genes in genome use genetic recombination as natural experiment mechanics of gene expression mapping measure gene expression in intercross (F) population map expression as quantitative trait (QTL) adjust for multiple testing Traits NCSU QTL II: Yandell LOD map for PDI: cis-regulation (Lan et al. 003) Traits NCSU QTL II: Yandell 005 8
47 mapping microarray data single gene expression as trait (single QTL) Dumas et al. (000 J Hypertens) overview, wish lists Jansen, Nap (001 Trends Gen); Cheung, Spielman (00); Doerge (00 Nat Rev Gen); Bochner (003 Nat Rev Gen) microarray scan via 1 QTL interval mapping Brem et al. (00 Science); Schadt et al. (003 Nature); Yvert et al. (003 Nat Gen) found putative cis- and trans- acting genes multivariate and multiple QTL approach Lan et al. (003 Genetics) Traits NCSU QTL II: Yandell Traits NCSU QTL II: Yandell
48 . design issues for expensive phenotypes (thanks to CF Amy Jin) microarray analysis ~ $1000 per mouse can only afford to assay 60 of 108 in panel wish to not lose much power to detect QTL selective phenotyping genotype all individuals in panel select subset for phenotyping previous studies can provide guide Traits NCSU QTL II: Yandell selective phenotyping emphasize additive effects in F F design: 1QQ:Qq:1qq best design for additive only: 1QQ:1Qq drop heterozygotes (Qq) reduce sample size by half with no power loss emphasize general effects in F best design: 1QQ:1Qq:1qq drop half of heterozygotes (5% reduction) multiple loci same idea but care is needed drop 7/16 of sample for two unlinked loci Traits NCSU QTL II: Yandell 005 1
49 is this relevant to large QTL studies? why not phenotype entire mapping panel? selectively phenotype subset of 50-67% may capture most effects with little loss of power two-stage selective phenotyping? genotype & phenotype subset of could selectively phenotype using whole genome QTL map to identify key genomic regions selectively phenotype subset using key regions Traits NCSU QTL II: Yandell why are traits correlated? environmental correlation non-genetic, controllable by design historical correlation (learned behavior) physiological correlation (same body) genetic correlation pleiotropy one gene, many functions common biochemical pathway, splicing variants close linkage two tightly linked genes genotypes Q are collinear Traits NCSU QTL II: Yandell
50 interplay of pleiotropy & correlation pleiotropy only correlation only Korol et al. (001) both Traits NCSU QTL II: Yandell correlated traits (Jiang Zeng 1995) ellipses centered on genotypic value width for nominal frequency main axis angle environmental correlation 3 QTL, F ρ P = 0.3, ρ G = 0.54, ρ E = 0. 7 genotypes note signs of genetic and environmental correlation jiang jiang3 ρ P = -0.07, ρ G = -0., ρ E = 0 jiang jiang3 Traits NCSU QTL II: Yandell jiang jiang ρ P = 0.06, ρ G = 0.68, ρ E =
51 pleiotropy or close linkage? traits, qtl/trait 54cM 114,18cM Jiang Zeng (1995) Traits NCSU QTL II: Yandell modern high throughput biology measuring the molecular dogma of biology DNA RNA protein metabolites measured one at a time only a few years ago massive array of measurements on whole systems ( omics ) thousands measured per individual (experimental unit) all (or most) components of system measured simultaneously whole genome of DNA: genes, promoters, etc. all expressed RNA in a tissue or cell all proteins all metabolites systems biology: focus on network interconnections chains of behavior in ecological community underlying biochemical pathways genetics as one experimental tool perturb system by creating new experimental cross each individual is a unique mosaic Traits NCSU QTL II: Yandell
52 coordinated expression in mouse genome (Schadt et al. 003) expression pleiotropy in yeast genome (Brem et al. 00) Traits NCSU QTL II: Yandell finding heritable traits (from Christina Kendziorski) reduce 30,000 traits to 300-3,000 heritable traits probability a trait is heritable pr(h Y,Q) = pr(y Q,H) pr(h Q) / pr(y Q) Bayes rule pr(y Q) = pr(y Q,H) pr(h Q) + pr(y Q, not H) pr(not H Q) phenotype averaged over genotypic mean µ pr(y Q, not H) = f 0 (Y) = f(y G ) pr(g) dg pr(y Q, H) = f 1 (Y Q) = q f 0 (Y q ) if not H if heritable Y q = {Y i Q i =q} = trait values with genotype Q=q Traits NCSU QTL II: Yandell 005 0
53 hierarchical model for expression phenotypes (EB arrays: Christina Kendziorski) f ( ) Y ~ G QQ QQ ~ f Qq ( G Qq ) mrna phenotype models given genotypic mean G q Y Y f ( ) qq ~ G qq G QQ G Qq G qq G q ~ pr ( ) common prior on G q across all mrna (use empirical Bayes to estimate prior) G QQ G Qq G qq Traits NCSU QTL II: Yandell expression meta-traits: pleiotropy reduce 3,000 heritable traits to 3 meta-traits(!) what are expression meta-traits? pleiotropy: a few genes can affect many traits transcription factors, regulators weighted averages: Z = YW principle components, discriminant analysis infer genetic architecture of meta-traits model selection issues are subtle missing data, non-linear search what is the best criterion for model selection? time consuming process heavy computation load for many traits subjective judgement on what is best Traits NCSU QTL II: Yandell 005
54 PC for two correlated mrna ettf PC (7%) etif3s6 PC1 (93%) Traits NCSU QTL II: Yandell PC across microarray functional groups Affy chips on 60 mice ~40,000 mrna 500+ mrna show DE (via EB arrays with marker regression) organized in 85 functional groups -35 mrna / group which are interesting? examine PC1, PC circle size = # unique mrna Traits NCSU QTL II: Yandell 005 4
55 84 PC meta-traits by functional group focus on interesting groups Traits NCSU QTL II: Yandell red lines: peak for PC meta-trait black/blue: peaks for mrna traits arrows: cis-action? Traits NCSU QTL II: Yandell 005 6
56 (portion of) chr 4 region chr 15 region? Traits NCSU QTL II: Yandell interaction plots for DA meta-traits DA for all pairs of markers: separate 9 genotypes based on markers (a) same locus pair found with PC meta-traits (b) Chr region interesting from biochemistry (Jessica Byers) (c) Chr 5 & Chr 9 identified as important for insulin, SCD Traits NCSU QTL II: Yandell 005 8
57 comparison of PC and DA meta-traits on mrna traits genotypes from Chr 4/Chr 15 locus pair (circle=centroid) PC captures spread without genotype DA creates best separation by genotype PC (1%) DA1 (37%) H.H A.B B.B A.B A.B H.B H.H H.H A.H A.B H.H B.A H.A H.H H.H H.A A.A H.HB.B H.H H.B H.A H.H H.B H.A H.B A.B H.H H.A B.A A.A A.A H.B PC1 (5%) H.H A.B B.B A.B H.H H.BA.B H.H PC ignores genotype A.H A.B B.A H.H H.A B.B H.HH.A H.H H.B A.A H.H.A H.H H.H H.B A.B H.A H.AH.H H.BH.H H.H H.A A.B correlation of PC and DA meta-traits A.B A.A H.B H.B A.H A.H A.A B.AH.A H.B H.A A.H A.A A.H A.H B.A A.H A.H A.H H.H A.H A.H A.H A.H H.H A.H A.A H.A A.H B.A A.H A.H DA (18%) DA (18%) H.H A.B H.B H.B H.AH.B B.A H.A H.H A.H.A B.A H.B H.A H.B H.H H.A A.H H.H H.B H.A A.A H.H H.H H.A H.H B.A H.H H.H H.H A.B B.B A.B H.H A.A A.A A.A H.H A.B A.B B.B A.B A.H A.H DA1 (37%) H.B H.H A.H H.B H.A H.A H.A H.H H.B H.H A.HH.B H.B H.H H.H H.H B.B H.H A.A A.B DA uses genotype A.A A.A A.B H.H A.B H.B A.H A.H H.A B.A H.A H.A H.HH.A A.A H.H B.A A.H A.H A.H A.HH.H A.H A.B B.B B.A A.H A.B A.H A.H A.H H.H A.B A.H note better spread of circles PC1 (5%) PC (1%) Traits NCSU QTL II: Yandell relating meta-traits to mrna traits SCD trait log expression DA meta-trait standard units Traits NCSU QTL II: Yandell
58 DA: a cautionary tale (184 mrna with cor > 0.5; mouse 13 drives heritability) Traits NCSU QTL II: Yandell building graphical models infer genetic architecture of meta-trait E(Z Q, M) = µ q = β 0 + {q in M} β qk find mrna traits correlated with meta-trait Z YW for modest number of traits Y extend meta-trait genetic architecture M = genetic architecture for Y expect subset of QTL to affect each mrna may be additional QTL for some mrna Traits NCSU QTL II: Yandell 005 3
59 posterior for graphical models posterior for graph given multivariate trait & architecture pr(g Y, Q, M) = pr(y Q, G) pr(g M) / pr(y Q) pr(g M) = prior on valid graphs given architecture multivariate phenotype averaged over genotypic mean µ pr(y Q, G) = f 1 (Y Q, G) = q f 0 (Yq G) f 0 (Y q G) = f(y q µ, G) pr(µ) dµ graphical model G implies correlation structure on Y genotype mean prior assumed independent across traits pr(µ) = t pr(µ t ) Traits NCSU QTL II: Yandell from graphical models to pathways build graphical models QTL RNA1 RNA class of possible models best model = putative biochemical pathway parallel biochemical investigation candidate genes in QTL regions laboratory experiments on pathway components Traits NCSU QTL II: Yandell
60 graphical models (with Elias Chaibub) f 1 (Y Q, G=g) = f 1 (Y 1 Q) f 1 (Y Q, Y 1 ) QTL DNA RNA protein unobservable meta-trait QTL D1 R1 P1 observable cis-action? D R P observable trans-action Traits NCSU QTL II: Yandell summary expression QTL are complicated need to consider multiple interacting QTL coherent approach for high-throughput traits identify heritable traits dimension reduction to meta-traits mapping genetic architecture extension via graphical models to networks many open questions model selection computation efficiency inference on graphical models Traits NCSU QTL II: Yandell
Inferring Genetic Architecture of Complex Biological Processes
Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison http://www.stat.wisc.edu/~yandell/statgen
More informationQTL model selection: key players
Bayesian Interval Mapping. Bayesian strategy -9. Markov chain sampling 0-7. sampling genetic architectures 8-5 4. criteria for model selection 6-44 QTL : Bayes Seattle SISG: Yandell 008 QTL model selection:
More informationModel Selection for Multiple QTL
Model Selection for Multiple TL 1. reality of multiple TL 3-8. selecting a class of TL models 9-15 3. comparing TL models 16-4 TL model selection criteria issues of detecting epistasis 4. simulations and
More informationQTL model selection: key players
QTL Model Selection. Bayesian strategy. Markov chain sampling 3. sampling genetic architectures 4. criteria for model selection Model Selection Seattle SISG: Yandell 0 QTL model selection: key players
More informationwhat is Bayes theorem? posterior = likelihood * prior / C
who was Bayes? Reverend Thomas Bayes (70-76) part-time mathematician buried in Bunhill Cemetary, Moongate, London famous paper in 763 Phil Trans Roy Soc London was Bayes the first with this idea? (Laplace?)
More informationBayesian Model Selection for Multiple QTL. Brian S. Yandell
Bayesian Model Selection for Multiple QTL Brian S. Yandell University of Wisconsin-Madison www.stat.wisc.edu/~yandell/statgen Jackson Laboratory, October 005 October 005 Jax Workshop Brian S. Yandell 1
More information1. what is the goal of QTL study?
NSF UAB Course 008 Bayesian Interval Mapping Brian S. Yandell, UW-Madison www.stat.wisc.edu/~yandell/statgen overview: multiple QTL approaches Bayesian QTL mapping & model selection data examples in detail
More informationMultiple QTL mapping
Multiple QTL mapping Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] 1 Why? Reduce residual variation = increased power
More informationCausal Graphical Models in Systems Genetics
1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation
More informationStatistical issues in QTL mapping in mice
Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping
More informationMapping multiple QTL in experimental crosses
Human vs mouse Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman www.daviddeen.com
More informationIntroduction to QTL mapping in model organisms
Human vs mouse Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] www.daviddeen.com
More informationGene mapping in model organisms
Gene mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Goal Identify genes that contribute to common human diseases. 2
More informationSeattle Summer Institute : Systems Genetics for Experimental Crosses
Seattle Summer Institute 2012 15: Systems Genetics for Experimental Crosses Brian S. Yandell, UW-Madison Elias Chaibub Neto, Sage Bionetworks www.stat.wisc.edu/~yandell/statgen/sisg Real knowledge is to
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationFast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets
Fast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets Yu-Ling Chang A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationInferring Causal Phenotype Networks from Segregating Populat
Inferring Causal Phenotype Networks from Segregating Populations Elias Chaibub Neto chaibub@stat.wisc.edu Statistics Department, University of Wisconsin - Madison July 15, 2008 Overview Introduction Description
More informationQTL Model Search. Brian S. Yandell, UW-Madison January 2017
QTL Model Search Brian S. Yandell, UW-Madison January 2017 evolution of QTL models original ideas focused on rare & costly markers models & methods refined as technology advanced single marker regression
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Backcross P 1 P 2 P 1 F 1 BC 4
More informationMapping multiple QTL in experimental crosses
Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationOverview. Background
Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems
More informationQuantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012
Quantile based Permutation Thresholds for QTL Hotspots Brian S Yandell and Elias Chaibub Neto 17 March 2012 2012 Yandell 1 Fisher on inference We may at once admit that any inference from the particular
More informationCausal Model Selection Hypothesis Tests in Systems Genetics
1 Causal Model Selection Hypothesis Tests in Systems Genetics Elias Chaibub Neto and Brian S Yandell SISG 2012 July 13, 2012 2 Correlation and Causation The old view of cause and effect... could only fail;
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationQTL Mapping I: Overview and using Inbred Lines
QTL Mapping I: Overview and using Inbred Lines Key idea: Looking for marker-trait associations in collections of relatives If (say) the mean trait value for marker genotype MM is statisically different
More informationBAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI
BAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI By DÁMARIS SANTANA MORANT A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR
More informationR/qtl workshop. (part 2) Karl Broman. Biostatistics and Medical Informatics University of Wisconsin Madison. kbroman.org
R/qtl workshop (part 2) Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Example Sugiyama et al. Genomics 71:70-77, 2001 250 male
More informationExpression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia
Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.
More informationDetection of multiple QTL with epistatic effects under a mixed inheritance model in an outbred population
Genet. Sel. Evol. 36 (2004) 415 433 415 c INRA, EDP Sciences, 2004 DOI: 10.1051/gse:2004009 Original article Detection of multiple QTL with epistatic effects under a mixed inheritance model in an outbred
More informationUse of hidden Markov models for QTL mapping
Use of hidden Markov models for QTL mapping Karl W Broman Department of Biostatistics, Johns Hopkins University December 5, 2006 An important aspect of the QTL mapping problem is the treatment of missing
More informationBTRY 4830/6830: Quantitative Genomics and Genetics
BTRY 4830/6830: Quantitative Genomics and Genetics Lecture 23: Alternative tests in GWAS / (Brief) Introduction to Bayesian Inference Jason Mezey jgm45@cornell.edu Nov. 13, 2014 (Th) 8:40-9:55 Announcements
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationMapping QTL to a phylogenetic tree
Mapping QTL to a phylogenetic tree Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman Human vs mouse www.daviddeen.com 3 Intercross
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationA Statistical Framework for Expression Trait Loci (ETL) Mapping. Meng Chen
A Statistical Framework for Expression Trait Loci (ETL) Mapping Meng Chen Prelim Paper in partial fulfillment of the requirements for the Ph.D. program in the Department of Statistics University of Wisconsin-Madison
More informationEvolution of phenotypic traits
Quantitative genetics Evolution of phenotypic traits Very few phenotypic traits are controlled by one locus, as in our previous discussion of genetics and evolution Quantitative genetics considers characters
More informationQuantile-based permutation thresholds for QTL hotspot analysis: a tutorial
Quantile-based permutation thresholds for QTL hotspot analysis: a tutorial Elias Chaibub Neto and Brian S Yandell September 18, 2013 1 Motivation QTL hotspots, groups of traits co-mapping to the same genomic
More informationLecture WS Evolutionary Genetics Part I 1
Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in
More informationTutorial Session 2. MCMC for the analysis of genetic data on pedigrees:
MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation
More informationLecture 8. QTL Mapping 1: Overview and Using Inbred Lines
Lecture 8 QTL Mapping 1: Overview and Using Inbred Lines Bruce Walsh. jbwalsh@u.arizona.edu. University of Arizona. Notes from a short course taught Jan-Feb 2012 at University of Uppsala While the machinery
More informationLatent Variable models for GWAs
Latent Variable models for GWAs Oliver Stegle Machine Learning and Computational Biology Research Group Max-Planck-Institutes Tübingen, Germany September 2011 O. Stegle Latent variable models for GWAs
More informationBayesian Partition Models for Identifying Expression Quantitative Trait Loci
Journal of the American Statistical Association ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Bayesian Partition Models for Identifying Expression Quantitative
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationBustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #
Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationLinear Regression (1/1/17)
STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression
More informationAssociation Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5
Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative
More informationBayesian Inference of Interactions and Associations
Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,
More informationCausal Network Models for Correlated Quantitative Traits. outline
Causal Network Models for Correlated Quantitative Traits Brian S. Yandell UW Madison October 2012 www.stat.wisc.edu/~yandell/statgen Jax SysGen: Yandell 2012 1 outline Correlation and causation Correlatedtraitsinorganized
More informationLecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017
Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate
More informationMULTIPLE-TRAIT MULTIPLE-INTERVAL MAPPING OF QUANTITATIVE-TRAIT LOCI ROBY JOEHANES
MULTIPLE-TRAIT MULTIPLE-INTERVAL MAPPING OF QUANTITATIVE-TRAIT LOCI by ROBY JOEHANES B.S., Universitas Pelita Harapan, Indonesia, 1999 M.S., Kansas State University, 2002 A REPORT submitted in partial
More informationMajor Genes, Polygenes, and
Major Genes, Polygenes, and QTLs Major genes --- genes that have a significant effect on the phenotype Polygenes --- a general term of the genes of small effect that influence a trait QTL, quantitative
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters
More informationStatistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014
Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/
More informationMODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES
MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary
More informationJoint Analysis of Multiple Gene Expression Traits to Map Expression Quantitative Trait Loci. Jessica Mendes Maia
ABSTRACT MAIA, JESSICA M. Joint Analysis of Multiple Gene Expression Traits to Map Expression Quantitative Trait Loci. (Under the direction of Professor Zhao-Bang Zeng). The goal of this dissertation is
More informationBayesian model selection: methodology, computation and applications
Bayesian model selection: methodology, computation and applications David Nott Department of Statistics and Applied Probability National University of Singapore Statistical Genomics Summer School Program
More informationBayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson
Bayesian variable selection via penalized credible regions Brian Reich, NC State Joint work with Howard Bondell and Ander Wilson Brian Reich, NCSU Penalized credible regions 1 Motivation big p, small n
More informationExpression Data Exploration: Association, Patterns, Factors & Regression Modelling
Expression Data Exploration: Association, Patterns, Factors & Regression Modelling Exploring gene expression data Scale factors, median chip correlation on gene subsets for crude data quality investigation
More informationTECHNICAL REPORT NO December 1, 2008
DEPARTMENT OF STATISTICS University of Wisconsin 300 University Avenue Madison, WI 53706 TECHNICAL REPORT NO. 46 December, 2008 Revised on January 27, 2009 Causal Graphical Models in System Genetics: a
More informationMethods for QTL analysis
Methods for QTL analysis Julius van der Werf METHODS FOR QTL ANALYSIS... 44 SINGLE VERSUS MULTIPLE MARKERS... 45 DETERMINING ASSOCIATIONS BETWEEN GENETIC MARKERS AND QTL WITH TWO MARKERS... 45 INTERVAL
More informationBayesian construction of perceptrons to predict phenotypes from 584K SNP data.
Bayesian construction of perceptrons to predict phenotypes from 584K SNP data. Luc Janss, Bert Kappen Radboud University Nijmegen Medical Centre Donders Institute for Neuroscience Introduction Genetic
More informationAn Introduction to the spls Package, Version 1.0
An Introduction to the spls Package, Version 1.0 Dongjun Chung 1, Hyonho Chun 1 and Sündüz Keleş 1,2 1 Department of Statistics, University of Wisconsin Madison, WI 53706. 2 Department of Biostatistics
More informationBayesian reversible-jump for epistasis analysis in genomic studies
Balestre and de Souza BMC Genomics (2016) 17:1012 DOI 10.1186/s12864-016-3342-6 RESEARCH ARTICLE Bayesian reversible-jump for epistasis analysis in genomic studies Marcio Balestre 1* and Claudio Lopes
More informationPrinciples of QTL Mapping. M.Imtiaz
Principles of QTL Mapping M.Imtiaz Introduction Definitions of terminology Reasons for QTL mapping Principles of QTL mapping Requirements For QTL Mapping Demonstration with experimental data Merit of QTL
More informationI Have the Power in QTL linkage: single and multilocus analysis
I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department
More informationHierarchical Generalized Linear Models for Multiple QTL Mapping
Genetics: Published Articles Ahead of Print, published on January 1, 009 as 10.1534/genetics.108.099556 Hierarchical Generalized Linear Models for Multiple QTL Mapping Nengun Yi 1,* and Samprit Baneree
More informationPrediction of the Confidence Interval of Quantitative Trait Loci Location
Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28
More informationMCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES
MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES Elizabeth A. Thompson Department of Statistics, University of Washington Box 354322, Seattle, WA 98195-4322, USA Email: thompson@stat.washington.edu This
More information1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics
1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationHomework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:
Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships
More informationLong-Term Response and Selection limits
Long-Term Response and Selection limits Bruce Walsh lecture notes Uppsala EQG 2012 course version 5 Feb 2012 Detailed reading: online chapters 23, 24 Idealized Long-term Response in a Large Population
More informationEstimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty
Estimation of Parameters in Random Effect Models with Incidence Matrix Uncertainty Xia Shen 1,2 and Lars Rönnegård 2,3 1 The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden; 2 School
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April
More informationBayesian Nonparametric Regression for Diabetes Deaths
Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,
More informationModule 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline
Module 4: Bayesian Methods Lecture 9 A: Default prior selection Peter Ho Departments of Statistics and Biostatistics University of Washington Outline Je reys prior Unit information priors Empirical Bayes
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationComputational Genomics. Systems biology. Putting it together: Data integration using graphical models
02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput
More informationFor 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.
STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).
More informationSupplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control
Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control Xiaoquan Wen Department of Biostatistics, University of Michigan A Model
More information(Genome-wide) association analysis
(Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More informationLecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis
Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,
More informationCheng-Ruei Lee 1 *, Jill T. Anderson 2, Thomas Mitchell-Olds 1,3. Abstract. Introduction
Unifying Genetic Canalization, Genetic Constraint, and Genotype-by-Environment Interaction: QTL by Genomic Background by Environment Interaction of Flowering Time in Boechera stricta Cheng-Ruei Lee 1 *,
More informationMOST complex traits are influenced by interacting
Copyright Ó 2009 by the Genetics Society of America DOI: 10.1534/genetics.108.099556 Hierarchical Generalized Linear Models for Multiple Quantitative Trait Locus Mapping Nengjun Yi*,1 and Samprit Banerjee
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationLocating multiple interacting quantitative trait. loci using rank-based model selection
Genetics: Published Articles Ahead of Print, published on May 16, 2007 as 10.1534/genetics.106.068031 Locating multiple interacting quantitative trait loci using rank-based model selection, 1 Ma lgorzata
More informationDoing Bayesian Integrals
ASTR509-13 Doing Bayesian Integrals The Reverend Thomas Bayes (c.1702 1761) Philosopher, theologian, mathematician Presbyterian (non-conformist) minister Tunbridge Wells, UK Elected FRS, perhaps due to
More information6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008
MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationMaking rating curves - the Bayesian approach
Making rating curves - the Bayesian approach Rating curves what is wanted? A best estimate of the relationship between stage and discharge at a given place in a river. The relationship should be on the
More informationPartitioning Genetic Variance
PSYC 510: Partitioning Genetic Variance (09/17/03) 1 Partitioning Genetic Variance Here, mathematical models are developed for the computation of different types of genetic variance. Several substantive
More informationLecture 8 Genomic Selection
Lecture 8 Genomic Selection Guilherme J. M. Rosa University of Wisconsin-Madison Mixed Models in Quantitative Genetics SISG, Seattle 18 0 Setember 018 OUTLINE Marker Assisted Selection Genomic Selection
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More information