Bayesian Model Selection for Multiple QTL. Brian S. Yandell

Brian S. Yandell, University of Wisconsin-Madison
Jackson Laboratory, October 2005 (Jax Workshop)

outline
1. what is the goal of QTL study?
2. Bayesian priors & posteriors
3. model search using MCMC
  Gibbs sampler and Metropolis-Hastings
4. model assessment
  Bayes factors & model averaging
5. data examples in detail
  plants & animals

1. what is the goal of QTL study?
[Figure: major QTL on linkage map; Pareto diagram of QTL effects (additive effect vs. rank order of QTL), running from major QTL through minor QTL to polygenes]

interval mapping basics
observed measurements
  y = phenotypic trait
  m = markers & linkage map
  i = individual index (1,...,n)
missing data
  missing marker data
  q = QT genotypes
    alleles QQ, Qq, or qq at locus
unknown quantities
  λ = QT locus (or loci)
  µ = phenotype model parameters
  H = QTL model/genetic architecture
pr(q | m,λ,H) genotype model
  grounded by linkage map, experimental cross
  recombination yields multinomial for q given m
pr(y | q,µ,H) phenotype model
  distribution shape (assumed normal here)
  unknown parameters µ (could be non-parametric)
[Diagram, after Sen Churchill (2001): observed m (X) and y (Y); missing q (Q); unknown λ and µ (θ)]

2. Bayesian priors & posteriors
augment data (y,m) with missing genotypes q
study model parameters (µ,λ) given data (y,m,q)
  properties of posterior pr(µ,λ,q | y,m)
average posterior over missing genotypes q
  pr(µ,λ | y,m) = sum over q of posterior
sample from posterior in some clever way
  multiple imputation (Sen Churchill 2002)
  Markov chain Monte Carlo (MCMC)
posterior = likelihood * prior / constant

$$\mathrm{pr}(\mu,\lambda,q \mid y,m) = \frac{\mathrm{pr}(y \mid q,\mu)\,\mathrm{pr}(q \mid m,\lambda)\,\mathrm{pr}(\mu)\,\mathrm{pr}(\lambda \mid m)}{\mathrm{pr}(y \mid m)}$$

Bayes posterior vs. maximum likelihood
classical approach maximizes likelihood
Bayesian posterior averages over other parameters

$$\mathrm{LOD}(\lambda) = \log_{10}\{\max_{\mu}\,\mathrm{pr}(y \mid m,\mu,\lambda)\} + c$$
$$\mathrm{LPD}(\lambda) = \log_{10}\{\mathrm{pr}(\lambda \mid m)\int \mathrm{pr}(y \mid m,\mu,\lambda)\,\mathrm{pr}(\mu)\,d\mu\} + C$$
$$\mathrm{pr}(y \mid m,\mu,\lambda) = \sum_{q} \mathrm{pr}(y \mid q,\mu)\,\mathrm{pr}(q \mid m,\lambda)$$

BC with 1 QTL: IM vs. BIM
[Figure: LOD and LPD profiles and substitution effect vs. map position (cM), QTL at 100; black = BIM, purple = IM, blue = ideal; secondary bumps marked "2nd QTL?"]

how does phenotype y improve posterior for genotype q?
what are probabilities for genotype q between markers?
  recombinants AA:AB
  all 1:1 if ignore y
  and if we use y?
[Figure: phenotype bp by genotype at flanking markers D4Mit41 and D4Mit14; genotype classes AA/AA, AB/AA, AA/AB, AB/AB]

posterior on QTL genotypes
full conditional of q given data, parameters
  proportional to prior pr(q | m,λ)
    weight toward q that agrees with flanking markers
  proportional to likelihood pr(y | q,µ)
    weight toward q with similar phenotype values
phenotype and flanking markers may conflict
  posterior recombination balances these two weights
this is the E-step of EM computations

$$\mathrm{pr}(q \mid y,m,\mu,\lambda) = \frac{\mathrm{pr}(y \mid q,\mu)\,\mathrm{pr}(q \mid m,\lambda)}{\mathrm{pr}(y \mid m,\mu,\lambda)}$$

prior & posteriors: genotypic means µ_q
[Figure: phenotype values y by genotype (qq, Qq, QQ), showing data means, the prior mean, and posteriors for n large and n small]
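The full conditional for a QTL genotype can be illustrated with a small sketch. It assumes a backcross with two genotypes, a normal phenotype model with common standard deviation, and a prior pr(q | m,λ) that has already been computed from the flanking markers. The function names and the numbers are hypothetical, chosen only for illustration.

```python
import math

def normal_pdf(y, mean, sd):
    """Density of N(mean, sd^2) at y."""
    z = (y - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def genotype_posterior(y, prior, means, sd):
    """pr(q | y, m, mu, lambda) for one individual.

    prior[q] stands in for pr(q | m, lambda); means[q] is mu_q.
    """
    weights = {q: prior[q] * normal_pdf(y, means[q], sd) for q in prior}
    total = sum(weights.values())  # this is pr(y | m, mu, lambda)
    return {q: w / total for q, w in weights.items()}

# Hypothetical individual: flanking markers favor QQ (prior 0.9),
# but the phenotype value sits near the Qq mean.
prior = {"QQ": 0.9, "Qq": 0.1}
means = {"QQ": 10.0, "Qq": 12.0}
posterior = genotype_posterior(12.5, prior, means, sd=1.0)
print(posterior)
```

In this made-up case the phenotype and the flanking markers conflict, and the posterior shifts most of the weight to Qq even though the marker-based prior favored QQ.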

prior & posteriors: genotypic means µ_q
shrink posterior from sample genotypic mean toward overall mean
partition individuals into sets S_q by genotype q
hyperparameter κ is related to heritability

prior:
$$\mu_q \sim N(\bar{y},\ \kappa\sigma^2)$$
posterior given data:
$$\mu_q \sim N\!\left(b_q \bar{y}_q + (1-b_q)\,\bar{y},\ \frac{b_q\,\sigma^2}{n_q}\right), \qquad n_q = \mathrm{count}\{S_q\}, \quad \bar{y}_q = \frac{1}{n_q}\sum_{i \in S_q} y_i$$
shrinkage factor:
$$b_q = \frac{\kappa n_q}{\kappa n_q + 1}$$

what if variance σ² is unknown?
sample variance is proportional to chi-square
  n s² / σ² ~ χ²(n)
  likelihood of sample variance s² given n, σ²
conjugate prior is inverse chi-square
  ν τ² / σ² ~ χ²(ν)
  prior of population variance σ² given ν, τ²
posterior is weighted average of likelihood and prior
  (ν τ² + n s²) / σ² ~ χ²(ν+n)
  posterior of population variance σ² given n, s², ν, τ²
empirical choice of hyper-parameters
  τ² = s²/3, ν = 6
  E(σ² | ν,τ²) = s²/2, Var(σ² | ν,τ²) = s⁴/4
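A minimal sketch of the shrinkage arithmetic, assuming the normal prior with hyperparameter κ and the inverse chi-square prior with hyperparameters ν and τ² described on these slides. The helper names and the toy data are hypothetical.

```python
def shrunken_means(y, q, kappa):
    """Posterior mean of each genotypic mean mu_q under the N(ybar, kappa*sigma^2) prior.

    Returns {genotype: (posterior_mean, b_q, n_q)}.
    The posterior variance of mu_q is b_q * sigma^2 / n_q.
    """
    ybar = sum(y) / len(y)
    out = {}
    for g in sorted(set(q)):
        y_g = [yi for yi, qi in zip(y, q) if qi == g]
        n_g = len(y_g)
        ybar_g = sum(y_g) / n_g
        b_g = kappa * n_g / (kappa * n_g + 1.0)  # shrinkage factor
        out[g] = (b_g * ybar_g + (1.0 - b_g) * ybar, b_g, n_g)
    return out

def inv_chisq_mean_var(nu, tau2):
    """Mean and variance of sigma^2 when nu*tau2/sigma^2 ~ chi-square(nu); needs nu > 4."""
    mean = nu * tau2 / (nu - 2.0)
    var = 2.0 * nu ** 2 * tau2 ** 2 / ((nu - 2.0) ** 2 * (nu - 4.0))
    return mean, var

# Hypothetical toy data: four QQ and two Qq individuals.
y = [9.0, 10.0, 11.0, 12.0, 13.0, 14.0]
q = ["QQ", "QQ", "QQ", "QQ", "Qq", "Qq"]
fits = shrunken_means(y, q, kappa=1.0)

# Empirical hyper-parameters from the slide: tau^2 = s^2/3, nu = 6.
s2 = 3.0
prior_mean, prior_var = inv_chisq_mean_var(nu=6, tau2=s2 / 3.0)
print(fits, prior_mean, prior_var)
```

With the slide's empirical choice, the prior mean of σ² works out to s²/2 and the prior variance to s⁴/4, which the last two values reproduce for s² = 3. The genotype with fewer individuals (Qq) has the smaller b_q and is pulled harder toward the overall mean.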

multiple QTL phenotype model
phenotype affected by genotype & environment
  pr(y | q,µ) ~ N(µ_q, σ²)
  y = µ_q + environment
  µ_q = β_0 + sum_{j in H} β_j(q)
number of terms in QTL model H ≤ 2^nqtl (3^nqtl for F2)
partition genotypic mean into QTL effects
  µ_q = β_0 + β_1(q_1) + β_2(q_2) + β_12(q_1,q_2)
  µ_q = mean + main effects + epistatic interactions
partition prior and posterior (details omitted)

prior & posterior on QTL model H
index model H by number of QTL
what prior on number of QTL?
  uniform over some range
  Poisson with prior mean
  geometric with prior mean
prior influences posterior
  good: reflects prior belief
    push data in discovery process
  bad: skeptic revolts!
    answer depends on guess
[Figure: prior probability vs. m = number of QTL for exponential (e), Poisson (p), and uniform (u) priors]
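The three candidate priors on the number of QTL can be tabulated with a short sketch. The parameterizations below are assumptions (the slides give only the family names): the geometric prior lives on m = 0, 1, 2, ... with the stated mean, and the uniform prior covers 0 through a hypothetical maximum.

```python
import math

def poisson_prior(m, mean):
    """Poisson probability of m QTL with the given prior mean."""
    return math.exp(-mean) * mean ** m / math.factorial(m)

def geometric_prior(m, mean):
    """Geometric probability on m = 0, 1, 2, ... with the given prior mean."""
    p = 1.0 / (1.0 + mean)
    return p * (1.0 - p) ** m

def uniform_prior(m, max_qtl):
    """Uniform probability over 0..max_qtl (inclusive)."""
    return 1.0 / (max_qtl + 1) if 0 <= m <= max_qtl else 0.0

# Hypothetical settings: prior mean of 3 QTL, at most 10 QTL for the uniform.
for m in range(11):
    print(m,
          round(geometric_prior(m, 3.0), 3),
          round(poisson_prior(m, 3.0), 3),
          round(uniform_prior(m, 10), 3))
```

Printing the three columns side by side shows how differently the priors weight small and large models, which is why the posterior on the number of QTL can depend on this choice.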

epistatic interactions
model space issues
  2-QTL interactions only?
  Fisher-Cockerham partition vs. tree-structured?
  general interactions among multiple QTL
model search issues
  epistasis between significant QTL
    check all possible pairs when QTL included?
    allow higher order epistasis?
  epistasis with non-significant QTL
    whole genome paired with each significant QTL?
    pairs of non-significant QTL?
Yi Xu (2000) Genetics; Yi, Xu, Allison (2003) Genetics; Yi (2004)

3. QTL Model Search using MCMC
construct Markov chain around posterior
  want posterior as stable distribution of Markov chain
  in practice, the chain tends toward stable distribution
    initial values may have low posterior probability
    burn-in period to get chain mixing well
update QTL model components from full conditionals
  update locus λ given q,H (using Metropolis-Hastings step)
  update genotypes q given λ,µ,y,H (using Gibbs sampler)
  update effects µ given q,y,H (using Gibbs sampler)
  update QTL model H given λ,µ,y,q (using Gibbs or M-H)

$$(\lambda,q,\mu,H) \sim \mathrm{pr}(\lambda,q,\mu,H \mid Y,X)$$
$$(\lambda,q,\mu,H)_1 \to (\lambda,q,\mu,H)_2 \to \cdots \to (\lambda,q,\mu,H)_N$$

Gibbs sampler idea
two correlated normals (genotypic means in BC)
  could draw samples from both together
  but easier to sample one at a time
Gibbs sampler:
  sample each from its full conditional
  pick order of sampling at random
  repeat N times

$$\mu_{QQ},\ \mu_{Qq} \sim N(0,1) \quad\text{but}\quad \mathrm{cor}(\mu_{QQ},\mu_{Qq}) = \rho$$
$$\mu_{QQ} \text{ given } \mu_{Qq} \sim N(\rho\,\mu_{Qq},\ 1-\rho^2)$$
$$\mu_{Qq} \text{ given } \mu_{QQ} \sim N(\rho\,\mu_{QQ},\ 1-\rho^2)$$

Gibbs sampler samples: ρ = 0.6
[Figure: Markov chain traces (Gibbs: mean 1 and Gibbs: mean 2 vs. Markov chain index) and scatterplots (mean 2 vs. mean 1) for N = 50 samples and N = 200 samples]
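The two-normal Gibbs sampler above can be sketched in a few lines. This is an illustration under the slide's setup (standard normal margins, correlation ρ); the function names, starting values, and seed are arbitrary choices made here.

```python
import math
import random

def gibbs_bivariate_normal(rho, n_samples, seed=0):
    """Gibbs sampler for two N(0,1) variables with correlation rho.

    Each variable is drawn from its full conditional N(rho * other, 1 - rho^2);
    the update order is picked at random on every sweep.
    """
    rng = random.Random(seed)
    sd = math.sqrt(1.0 - rho * rho)
    mu_qq, mu_hq = 0.0, 0.0  # arbitrary starting values for mu_QQ and mu_Qq
    samples = []
    for _ in range(n_samples):
        if rng.random() < 0.5:
            mu_qq = rng.gauss(rho * mu_hq, sd)
            mu_hq = rng.gauss(rho * mu_qq, sd)
        else:
            mu_hq = rng.gauss(rho * mu_qq, sd)
            mu_qq = rng.gauss(rho * mu_hq, sd)
        samples.append((mu_qq, mu_hq))
    return samples

def correlation(pairs):
    """Sample correlation of a list of (x, y) pairs."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    syy = sum((y - my) ** 2 for _, y in pairs)
    return sxy / math.sqrt(sxx * syy)

samples = gibbs_bivariate_normal(rho=0.6, n_samples=20000)
print(round(correlation(samples), 2))
```

With a short run such as N = 50 the sampled points cover the target unevenly; with a long run the sample correlation settles close to ρ.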

How to sample a locus λ?
cannot easily sample from locus full conditional
  pr(λ | m,q) = pr(λ) pr(q | m,λ) / constant
constant determined by averaging
  over all possible genotypes
  over entire map
Gibbs sampler will not work in general
  but can use method based on ratios of probabilities
  Metropolis-Hastings is extension of Gibbs sampler

Metropolis-Hastings idea
want to study distribution f(λ)
  take Monte Carlo samples
    unless too complicated
pick an arbitrary distribution g(λ,λ*)
draw Metropolis-Hastings samples:
  current sample value λ
  propose new value λ* from g(λ,λ*)
  accept new value with prob A

$$A = \min\left\{1,\ \frac{f(\lambda^*)\,g(\lambda^*,\lambda)}{f(\lambda)\,g(\lambda,\lambda^*)}\right\}$$

Gibbs sampler is special case
  g(λ,λ*) = f(λ*)
  always accept new proposal (A = 1)
[Figure: target f(λ) and proposal g(λ - λ*)]
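A minimal Metropolis-Hastings sketch for a single locus follows. The target here is a made-up, unnormalized bump centered at 100 cM on a 0 to 200 cM map, standing in for the locus full conditional; the proposal is a symmetric uniform window, so the g terms in A cancel. All names and numbers are assumptions for illustration, not the sampler used in R/bim.

```python
import math
import random

def target(lam):
    """Unnormalized f(lambda): hypothetical bump at 100 cM on a 0-200 cM map."""
    if lam < 0.0 or lam > 200.0:
        return 0.0
    return math.exp(-0.5 * ((lam - 100.0) / 10.0) ** 2)

def metropolis_hastings(n_samples, width, start=20.0, seed=1):
    """Random-walk M-H chain for lambda with a uniform(-width, width) proposal."""
    rng = random.Random(seed)
    lam = start
    chain = []
    for _ in range(n_samples):
        proposal = lam + rng.uniform(-width, width)
        # Symmetric proposal: g(lam*, lam) = g(lam, lam*), so A = min(1, f(lam*)/f(lam)).
        accept_prob = min(1.0, target(proposal) / target(lam))
        if rng.random() < accept_prob:
            lam = proposal
        chain.append(lam)  # a rejected proposal repeats the current value
    return chain

chain = metropolis_hastings(n_samples=20000, width=20.0)
kept = chain[2000:]  # drop burn-in: the chain starts far from the mode
mean = sum(kept) / len(kept)
sd = math.sqrt(sum((x - mean) ** 2 for x in kept) / len(kept))
print(round(mean, 1), round(sd, 1))
```

Only the ratio f(λ*)/f(λ) is ever needed, so the normalizing constant of the full conditional never has to be computed. Changing `width` gives the narrow-g and wide-g behavior: a narrow window accepts often but moves slowly, a wide window moves far but is rejected more.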

MCMC realization
[Figure: MCMC sequence of sampled λ shown alongside the posterior pr(θ | Y)]
added twist: occasionally propose from whole domain

Metropolis-Hastings samples
[Figure: MCMC sequences and histograms of pr(θ | Y) for N = 200 samples and N = 1000 samples, each with narrow g and wide g]

sampling across QTL models H
[Diagram: genome from 0 to L with loci λ_1, λ_2, ..., λ_m and a proposed new locus λ_{m+1}]
action steps: draw one of three choices
update QTL model H with probability 1-b(H)-d(H)
  update current model using full conditionals
  sample QTL loci, effects, and genotypes
add a locus with probability b(H)
  propose a new locus along genome
  innovate new genotypes at locus and phenotype effect
  decide whether to accept the "birth" of new locus
drop a locus with probability d(H)
  propose dropping one of existing loci
  decide whether to accept the "death" of locus

reversible jump MCMC
consider known genotypes q at 2 known loci λ
  models with 1 or 2 QTL
M-H step between 1-QTL and 2-QTL models
  model changes dimension (via careful bookkeeping)
  consider mixture over QTL models H

nqtl = 1: Y = β_0 + β_1(q_1) + e
nqtl = 2: Y = β_0 + β_1(q_1) + β_2(q_2) + e

Gibbs sampler with loci indicators
partition genome into intervals
  at most one QTL per interval
  interval = 1 cM in length
  assume QTL in middle of interval
use loci to indicate presence/absence of QTL in each interval
  γ = 1 if QTL in interval
  γ = 0 if no QTL
Gibbs sampler on loci indicators
  see work of Nengjun Yi (and earlier work of Ina Hoeschele)

Y = β_0 + γ_1 β_1(q_1) + γ_2 β_2(q_2) + e

Bayesian shrinkage estimation
soft loci indicators
  strength of evidence for λ_j depends on variance of β_j
  similar to γ > 0 on grey scale
include all possible loci in model
  pseudo-markers at 1 cM intervals
Wang et al. (2005 Genetics)
  Shizhong Xu group at U CA Riverside

Y = β_0 + β_1(q_1) + β_2(q_2) + ... + e
β_j(q_j) ~ N(0, σ_j²), σ_j² ~ inverse-chisquare

geometry of effect samples
(collinearity of QTL yields correlated estimates)
[Figure: sampled effects β_2 vs. β_1; panels "a short sequence" and "first 1000 with m<3"]

4. Model Assessment
balance model fit against model complexity

                  smaller model        bigger model
  model fit       miss key features    fits better
  prediction      may be biased        no bias
  interpretation  easier               more complicated
  parameters      low variance         high variance

information criteria: penalize L by model size |H|
  compare IC = -2 log L(H | y) + penalty(H)
Bayes factors: balance posterior by prior choice
  compare pr(data y | model H)

QTL Bayes factors
BF = posterior odds / prior odds
BF equivalent to BIC
  simple comparison: 1 vs 2 QTL
  same as LOD test
  general comparison of models
  want Bayes factor >> 1
nqtl = number of QTL
  indexes model complexity
  genetic architecture also important

$$BF_{nqtl,\,nqtl+1} = \frac{\mathrm{pr}(nqtl \mid \mathrm{data}) / \mathrm{pr}(nqtl)}{\mathrm{pr}(nqtl+1 \mid \mathrm{data}) / \mathrm{pr}(nqtl+1)}$$

[Figure: prior probability vs. m = number of QTL for exponential (e), Poisson (p), and uniform (u) priors]

Bayes factors to assess models
Bayes factor: which model best supports the data?
  ratio of posterior odds to prior odds
  ratio of model likelihoods
equivalent to LR statistic when
  comparing two nested models
  simple hypotheses (e.g. 1 vs 2 QTL)
Bayes Information Criteria (BIC)
  Schwarz introduced for model selection in general settings
  penalty to balance model size (p = number of parameters)

$$B_{12} = \frac{\mathrm{pr}(\mathrm{model}_1 \mid Y) / \mathrm{pr}(\mathrm{model}_2 \mid Y)}{\mathrm{pr}(\mathrm{model}_1) / \mathrm{pr}(\mathrm{model}_2)} = \frac{\mathrm{pr}(Y \mid \mathrm{model}_1)}{\mathrm{pr}(Y \mid \mathrm{model}_2)}$$
$$-2\log(B_{12}) = -2\log(LR) - (p_2 - p_1)\log(n)$$
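Both forms of the Bayes factor can be computed with a short sketch: one from MCMC samples of the number of QTL (posterior over prior), the other from the BIC approximation. The function names, the toy sample counts, and the prior values are hypothetical; natural logarithms are assumed in the BIC form, with LR = L1/L2.

```python
import math
from collections import Counter

def posterior_prior_ratios(nqtl_samples, prior):
    """Posterior/prior ratio for each sampled number of QTL."""
    counts = Counter(nqtl_samples)
    n = len(nqtl_samples)
    return {m: (counts[m] / n) / prior[m] for m in sorted(counts)}

def bayes_factor(ratios, m1, m2):
    """Bayes factor of model m1 against model m2 from posterior/prior ratios."""
    return ratios[m1] / ratios[m2]

def minus_two_log_bf_bic(loglik1, loglik2, p1, p2, n):
    """BIC approximation: -2 log(B12) = -2 log(LR) - (p2 - p1) log(n)."""
    return -2.0 * (loglik1 - loglik2) - (p2 - p1) * math.log(n)

# Hypothetical MCMC output: 100 sampled values of nqtl and an assumed prior.
samples = [1] * 10 + [2] * 60 + [3] * 30
prior = {1: 0.2, 2: 0.3, 3: 0.5}
ratios = posterior_prior_ratios(samples, prior)
print(ratios)
print(bayes_factor(ratios, 2, 1))

# Hypothetical maximized log-likelihoods for a 1-QTL and a 2-QTL fit.
print(minus_two_log_bf_bic(loglik1=-100.0, loglik2=-95.0, p1=2, p2=3, n=100))
```

In the toy numbers the 2-QTL model has the largest posterior/prior ratio, and the BIC form gives a positive value of -2 log(B12), which also favors the larger model after the log(n) penalty.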

5. simulations and data studies
simulated F2 intercross, 8 QTL (Stephens, Fisch 1998)
  n=200, heritability = 50%
  detected 3 QTL
increase to detect all 8
  n=500, heritability to 97%
[Table: QTL, chr, loci, effect]
[Figure: posterior number of QTL (frequency in %); genetic map for ch1 to ch10]

loci pattern across genome
notice which chromosomes have persistent loci
best pattern found 4% of the time
[Table: count of loci by chromosome and m]

B. napus 8-week vernalization whole genome study
108 plants from double haploid
  similar genetics to backcross: follow 1 gamete
  parents are Major (biennial) and Stellar (annual)
300 markers across genome
  19 chromosomes
  average 6 cM between markers
    median 3.8 cM, max 34 cM
  83% markers genotyped
phenotype is days to flowering
  after 8 weeks of vernalization (cooling)
  Stellar parent requires vernalization to flower
available in R/bim package
Ferreira et al. (1994); Kole et al. (2001); Schranz et al. (2002)

Markov chain Monte Carlo sequence
burnin (sets up chain)
mcmc sequence
  number of QTL
  environmental variance
  h² = heritability (genetic/total variance)
  LOD = likelihood
[Figure: burnin sequence and mcmc sequence traces for nqtl, envvar, herit, and LOD]

MCMC sampled loci
subset of chromosomes N2, N3, N16
points jittered for view
blue lines at markers
note concentration on chromosome N2
includes all models
[Figure: MCMC sampled loci by chromosome (N2, N3, N16), with marker names along each chromosome]

Bayesian model assessment
row 1: # QTL
row 2: pattern
col 1: posterior
col 2: Bayes factor
note error bars on bf
evidence suggests 4-5 QTL
  N2(2-3),N3,N16
[Figure: QTL posterior vs. number of QTL; pattern posterior vs. model index; Bayes factor ratios (posterior / prior) with strong, moderate, and weak reference levels]

Bayesian estimates of loci & effects
model averaging: at least 4 QTL
histogram of loci
  blue line is density
  red lines at estimates
estimate additive effects (red circles)
  grey points sampled from posterior
  blue line is cubic spline
  dashed line for 2 SD
[Figure: napus8 summaries with pattern 1,1,2,3 and m ≥ 4; loci histogram and additive effects for N2, N3, N16]

Bayesian model diagnostics
pattern: N2(2),N3,N16
col 1: density
col 2: boxplots by m
environmental variance σ² = .008, σ = .09
heritability h² = 52%
LOD = 16 (highly significant)
but note change with m
[Figure: marginal density and boxplots conditional on number of QTL for envvar, heritability, and LOD]

studying diabetes in an F2
segregating cross of inbred lines
  B6.ob x BTBR.ob → F1 → F2
  selected mice with ob/ob alleles at leptin gene (chr 6)
  measured and mapped body weight, insulin, glucose at various ages (Stoehr et al. 2000 Diabetes)
  sacrificed at 14 weeks, tissues preserved
gene expression data
  Affymetrix microarrays on parental strains, F1
    key tissues: adipose, liver, muscle, β-cells
    novel discoveries of differential expression (Nadler et al. 2000 PNAS; Lan et al. 2002 in review; Ntambi et al. 2002 PNAS)
  RT-PCR on 108 F2 mice liver tissues
    15 genes, selected as important in diabetes pathways
    SCD1, PEPCK, ACO, FAS, GPAT, PPARgamma, PPARalpha, G6Pase, PDI,

Multiple Interval Mapping (QTLCart)
SCD1: multiple QTL plus epistasis!
[Figure: LOD and effect (add = blue, dom = red) profiles on chr2, chr5, chr9]

Bayesian model assessment: number of QTL for SCD1
[Figure: QTL posterior vs. number of QTL; Bayes factor ratios (posterior / prior) vs. number of QTL with strong, moderate, and weak reference levels]

Bayesian LOD and h² for SCD1
[Figure: marginal density and boxplots conditional on number of QTL for LOD and heritability]

Bayesian model assessment: chromosome QTL pattern for SCD1
[Figure: pattern posterior vs. model index; Bayes factor ratios (posterior / prior) vs. model index with moderate and weak reference levels]

trans-acting QTL for SCD1
(no epistasis yet: see Yi, Xu, Allison 2003)
dominance?

2-D scan: assumes only 2 QTL!
[Figure: epistasis LOD peaks; joint LOD peaks]

sub-peaks can be easily overlooked!

epistatic model fit

Cockerham epistatic effects

marginal heritability by locus
black = total; blue = additive; red = dominant; purple = add x add

marginal LOD by locus
black = total; blue = additive; red = dominant; purple = add x add; green = scanone

Bayesian & classical (HK)

sex-adjusted anova for SCD1 (only 3-way sex interactions significant)
Summary for fit QTL
Method is: imp
Number of observations: 107
Full model result
Model formula is: y ~ sex * (Q1 + Q2 + Q3 + Q1:Q3 + Q2:Q3)
[Table: df, SS, MS, LOD, %var, Pvalue(Chi), Pvalue(F) for Model, Error, Total]
Drop one QTL at a time ANOVA table:
[Table: df, Type III SS, LOD, %var, F value, Pvalue for sex, Chr2@105, Chr5@67, Chr9@67, Chr2@105:Chr9@67, Chr5@67:Chr9@67, and the interaction of each of these with sex]

anova for SCD1 (sex terms n.s.)
Model: y ~ Chr2@105 + Chr5@67 + Chr9@67 + Chr2@80 + Chr9@67^2 + Chr2@105:Chr9@67 + Chr5@67:Chr9@67^2
[Table: Df, Sum Sq, Mean Sq, F value, Pr(>F) for Model, Error, Total]
Single term deletions
[Table: Df, Sum of Sq, RSS, AIC, F value, Pr(F) for each model term]

epistatic interaction plots
(chr 5x9 p=0.007; chr 2x9 p<0.0001)
[Figure: two interaction plots, one mostly axd and one mostly axa]

SCD1 on chr2x9 by sex

obesity in CAST/Ei BC onto M16i
421 mice (Daniel Pomp)
  (213 male, 208 female)
92 microsatellites on 19 chromosomes
  1214 cM map
subcutaneous fat pads
  pre-adjusted for sex and dam effects
Yi, Yandell, Churchill, Allison, Eisen, Pomp (2005) Genetics (in press)

non-epistatic analysis
[Figure: single QTL LOD profile (LOD score); multiple QTL Bayes factor profile (Bayes factor)]

posterior profile of main effects in epistatic analysis
[Figure: main effects & heritability profile (heritability, main effect); Bayes factor profile (Bayes factor)]

posterior profile of main effects in epistatic analysis

model selection via Bayes factors for epistatic model
[Figure: Bayes factors by number of QTL and by QTL pattern]

posterior probability of effects
[Figure: posterior probability of main effects and pairwise epistatic effects, each labeled by chromosome and interval]

scatterplot estimates of epistatic loci

stronger epistatic effects

Bayesian software for QTLs
R/bim: Bayesian IM (Satagopan Yandell 1996; Gaffney 2001)*
  contributed package to CRAN
  version available within WinQTLCart (statgen.ncsu.edu/qtlcart)
R/bmqtl: Bayesian Multiple QTL (N Yi, T Mehta & BS Yandell)*
  epistasis, new MCMC algorithms, extensive graphics
  extension of R/bim; due out Fall 2005
R/qtl (Broman et al. 2003 Bioinformatics)*
  biosun01.biostat.jhsph.edu/~kbroman/software
  contributed package
Pseudomarker (Sen, Churchill 2002 Genetics)
Bayesian QTL / Multimapper
  Sillanpää Arjas (1998)
Stephens & Fisch (1998 Biometrics)
R/bqtl (C Berry, hacuna.ucsd.edu/bqtl)
* Hao Wu, Jackson Labs, provide crucial computing support

R/bim: our software
R contributed library
  library(bim) is cross-compatible with library(qtl)
Bayesian module within WinQTLCart
  WinQTLCart output can be processed using R library
Software history
  initially designed by JM Satagopan (1996)
  major revision and extension by PJ Gaffney (2001)
    whole genome
    multivariate update of effects; long range position updates
    substantial improvements in speed, efficiency
    pre-burnin: initial prior number of QTL very large
  upgrade (H Wu, PJ Gaffney, CF Jin, BS Yandell 2003)
  epistasis in progress (H Wu, BS Yandell, N Yi 2004)

many thanks
Jackson Labs: Gary Churchill, Hao Wu
U AL Birmingham: David Allison, Nengjun Yi, Tapan Mehta
Tom Osborn, David Butruille, Marcio Ferrera, Josh Udahl, Pablo Quijada
Alan Attie, Jonathan Stoehr, Hong Lan, Susie Clee, Jessica Byers
Michael Newton, Daniel Sorensen, Daniel Gianola, Liang Li
my students: Jaya Satagopan, Fei Zou, Patrick Gaffney, Chunfang Jin, Elias Chaibub Neto
USDA Hatch, NIH/NIDDK (Attie), NIH/R01 (Yi)


More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

MANY complex human diseases and traits of bio- tive corrections for multiple testing. Non-Bayesian model

MANY complex human diseases and traits of bio- tive corrections for multiple testing. Non-Bayesian model Copyright 2005 by the Genetics Society of America DOI: 10.1534/genetics.104.040386 Bayesian Model Selection for Genome-Wide Epistatic Quantitative Trait Loci Analysis Nengjun Yi,*,,1 Brian S. Yandell,

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate

More information

Use of hidden Markov models for QTL mapping

Use of hidden Markov models for QTL mapping Use of hidden Markov models for QTL mapping Karl W Broman Department of Biostatistics, Johns Hopkins University December 5, 2006 An important aspect of the QTL mapping problem is the treatment of missing

More information

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012

Quantile based Permutation Thresholds for QTL Hotspots. Brian S Yandell and Elias Chaibub Neto 17 March 2012 Quantile based Permutation Thresholds for QTL Hotspots Brian S Yandell and Elias Chaibub Neto 17 March 2012 2012 Yandell 1 Fisher on inference We may at once admit that any inference from the particular

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

Consistent high-dimensional Bayesian variable selection via penalized credible regions

Consistent high-dimensional Bayesian variable selection via penalized credible regions Consistent high-dimensional Bayesian variable selection via penalized credible regions Howard Bondell bondell@stat.ncsu.edu Joint work with Brian Reich Howard Bondell p. 1 Outline High-Dimensional Variable

More information

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.

More information

BTRY 4830/6830: Quantitative Genomics and Genetics

BTRY 4830/6830: Quantitative Genomics and Genetics BTRY 4830/6830: Quantitative Genomics and Genetics Lecture 23: Alternative tests in GWAS / (Brief) Introduction to Bayesian Inference Jason Mezey jgm45@cornell.edu Nov. 13, 2014 (Th) 8:40-9:55 Announcements

More information

MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES

MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES Elizabeth A. Thompson Department of Statistics, University of Washington Box 354322, Seattle, WA 98195-4322, USA Email: thompson@stat.washington.edu This

More information

MOST complex traits are influenced by interacting

MOST complex traits are influenced by interacting Copyright Ó 2009 by the Genetics Society of America DOI: 10.1534/genetics.108.099556 Hierarchical Generalized Linear Models for Multiple Quantitative Trait Locus Mapping Nengjun Yi*,1 and Samprit Banerjee

More information

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

Bayesian Partition Models for Identifying Expression Quantitative Trait Loci

Bayesian Partition Models for Identifying Expression Quantitative Trait Loci Journal of the American Statistical Association ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Bayesian Partition Models for Identifying Expression Quantitative

More information

Bayesian QTL mapping using skewed Student-t distributions

Bayesian QTL mapping using skewed Student-t distributions Genet. Sel. Evol. 34 00) 1 1 1 INRA, EDP Sciences, 00 DOI: 10.1051/gse:001001 Original article Bayesian QTL mapping using skewed Student-t distributions Peter VON ROHR a,b, Ina HOESCHELE a, a Departments

More information

Lecture 8 Genomic Selection

Lecture 8 Genomic Selection Lecture 8 Genomic Selection Guilherme J. M. Rosa University of Wisconsin-Madison Mixed Models in Quantitative Genetics SISG, Seattle 18 0 Setember 018 OUTLINE Marker Assisted Selection Genomic Selection

More information

A Statistical Framework for Expression Trait Loci (ETL) Mapping. Meng Chen

A Statistical Framework for Expression Trait Loci (ETL) Mapping. Meng Chen A Statistical Framework for Expression Trait Loci (ETL) Mapping Meng Chen Prelim Paper in partial fulfillment of the requirements for the Ph.D. program in the Department of Statistics University of Wisconsin-Madison

More information

Principles of Bayesian Inference

Principles of Bayesian Inference Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters

More information

Bayesian Inference of Interactions and Associations

Bayesian Inference of Interactions and Associations Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,

More information

Variance Component Models for Quantitative Traits. Biostatistics 666

Variance Component Models for Quantitative Traits. Biostatistics 666 Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond

More information

Bayesian reversible-jump for epistasis analysis in genomic studies

Bayesian reversible-jump for epistasis analysis in genomic studies Balestre and de Souza BMC Genomics (2016) 17:1012 DOI 10.1186/s12864-016-3342-6 RESEARCH ARTICLE Bayesian reversible-jump for epistasis analysis in genomic studies Marcio Balestre 1* and Claudio Lopes

More information

Locating multiple interacting quantitative trait. loci using rank-based model selection

Locating multiple interacting quantitative trait. loci using rank-based model selection Genetics: Published Articles Ahead of Print, published on May 16, 2007 as 10.1534/genetics.106.068031 Locating multiple interacting quantitative trait loci using rank-based model selection, 1 Ma lgorzata

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Bayesian linear regression

Bayesian linear regression Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding

More information

Estimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty

Estimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty Estimation of Parameters in Random Effect Models with Incidence Matrix Uncertainty Xia Shen 1,2 and Lars Rönnegård 2,3 1 The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden; 2 School

More information

CSC 2541: Bayesian Methods for Machine Learning

CSC 2541: Bayesian Methods for Machine Learning CSC 2541: Bayesian Methods for Machine Learning Radford M. Neal, University of Toronto, 2011 Lecture 3 More Markov Chain Monte Carlo Methods The Metropolis algorithm isn t the only way to do MCMC. We ll

More information

MCMC: Markov Chain Monte Carlo

MCMC: Markov Chain Monte Carlo I529: Machine Learning in Bioinformatics (Spring 2013) MCMC: Markov Chain Monte Carlo Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Contents Review of Markov

More information

GENOMIC SELECTION WORKSHOP: Hands on Practical Sessions (BL)

GENOMIC SELECTION WORKSHOP: Hands on Practical Sessions (BL) GENOMIC SELECTION WORKSHOP: Hands on Practical Sessions (BL) Paulino Pérez 1 José Crossa 2 1 ColPos-México 2 CIMMyT-México September, 2014. SLU,Sweden GENOMIC SELECTION WORKSHOP:Hands on Practical Sessions

More information

Stat 516, Homework 1

Stat 516, Homework 1 Stat 516, Homework 1 Due date: October 7 1. Consider an urn with n distinct balls numbered 1,..., n. We sample balls from the urn with replacement. Let N be the number of draws until we encounter a ball

More information

Bayesian Regression Linear and Logistic Regression

Bayesian Regression Linear and Logistic Regression When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we

More information

Bayesian shrinkage approach in variable selection for mixed

Bayesian shrinkage approach in variable selection for mixed Bayesian shrinkage approach in variable selection for mixed effects s GGI Statistics Conference, Florence, 2015 Bayesian Variable Selection June 22-26, 2015 Outline 1 Introduction 2 3 4 Outline Introduction

More information

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An

More information

Markov Chain Monte Carlo methods

Markov Chain Monte Carlo methods Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As

More information

Principles of QTL Mapping. M.Imtiaz

Principles of QTL Mapping. M.Imtiaz Principles of QTL Mapping M.Imtiaz Introduction Definitions of terminology Reasons for QTL mapping Principles of QTL mapping Requirements For QTL Mapping Demonstration with experimental data Merit of QTL

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is

More information

Bayesian Nonparametric Regression for Diabetes Deaths

Bayesian Nonparametric Regression for Diabetes Deaths Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,

More information

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data Faming Liang, Chuanhai Liu, and Naisyin Wang Texas A&M University Multiple Hypothesis Testing Introduction

More information

MULTIPLE-TRAIT MULTIPLE-INTERVAL MAPPING OF QUANTITATIVE-TRAIT LOCI ROBY JOEHANES

MULTIPLE-TRAIT MULTIPLE-INTERVAL MAPPING OF QUANTITATIVE-TRAIT LOCI ROBY JOEHANES MULTIPLE-TRAIT MULTIPLE-INTERVAL MAPPING OF QUANTITATIVE-TRAIT LOCI by ROBY JOEHANES B.S., Universitas Pelita Harapan, Indonesia, 1999 M.S., Kansas State University, 2002 A REPORT submitted in partial

More information

Selecting explanatory variables with the modified version of Bayesian Information Criterion

Selecting explanatory variables with the modified version of Bayesian Information Criterion Selecting explanatory variables with the modified version of Bayesian Information Criterion Institute of Mathematics and Computer Science, Wrocław University of Technology, Poland in cooperation with J.K.Ghosh,

More information

Lecture 13: Population Structure. October 8, 2012

Lecture 13: Population Structure. October 8, 2012 Lecture 13: Population Structure October 8, 2012 Last Time Effective population size calculations Historical importance of drift: shifting balance or noise? Population structure Today Course feedback The

More information

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017

Lecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017 Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping

More information

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M. STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).

More information

TECHNICAL REPORT NO December 1, 2008

TECHNICAL REPORT NO December 1, 2008 DEPARTMENT OF STATISTICS University of Wisconsin 300 University Avenue Madison, WI 53706 TECHNICAL REPORT NO. 46 December, 2008 Revised on January 27, 2009 Causal Graphical Models in System Genetics: a

More information

The Bayesian Choice. Christian P. Robert. From Decision-Theoretic Foundations to Computational Implementation. Second Edition.

The Bayesian Choice. Christian P. Robert. From Decision-Theoretic Foundations to Computational Implementation. Second Edition. Christian P. Robert The Bayesian Choice From Decision-Theoretic Foundations to Computational Implementation Second Edition With 23 Illustrations ^Springer" Contents Preface to the Second Edition Preface

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm

More information

Evolution of phenotypic traits

Evolution of phenotypic traits Quantitative genetics Evolution of phenotypic traits Very few phenotypic traits are controlled by one locus, as in our previous discussion of genetics and evolution Quantitative genetics considers characters

More information

One-week Course on Genetic Analysis and Plant Breeding January 2013, CIMMYT, Mexico LOD Threshold and QTL Detection Power Simulation

One-week Course on Genetic Analysis and Plant Breeding January 2013, CIMMYT, Mexico LOD Threshold and QTL Detection Power Simulation One-week Course on Genetic Analysis and Plant Breeding 21-2 January 213, CIMMYT, Mexico LOD Threshold and QTL Detection Power Simulation Jiankang Wang, CIMMYT China and CAAS E-mail: jkwang@cgiar.org; wangjiankang@caas.cn

More information

Binary trait mapping in experimental crosses with selective genotyping

Binary trait mapping in experimental crosses with selective genotyping Genetics: Published Articles Ahead of Print, published on May 4, 2009 as 10.1534/genetics.108.098913 Binary trait mapping in experimental crosses with selective genotyping Ani Manichaikul,1 and Karl W.

More information

Forward Problems and their Inverse Solutions

Forward Problems and their Inverse Solutions Forward Problems and their Inverse Solutions Sarah Zedler 1,2 1 King Abdullah University of Science and Technology 2 University of Texas at Austin February, 2013 Outline 1 Forward Problem Example Weather

More information

I Have the Power in QTL linkage: single and multilocus analysis

I Have the Power in QTL linkage: single and multilocus analysis I Have the Power in QTL linkage: single and multilocus analysis Benjamin Neale 1, Sir Shaun Purcell 2 & Pak Sham 13 1 SGDP, IoP, London, UK 2 Harvard School of Public Health, Cambridge, MA, USA 3 Department

More information

Genome-wide Multiple Loci Mapping in Experimental Crosses by the Iterative Adaptive Penalized Regression

Genome-wide Multiple Loci Mapping in Experimental Crosses by the Iterative Adaptive Penalized Regression Genetics: Published Articles Ahead of Print, published on February 15, 2010 as 10.1534/genetics.110.114280 Genome-wide Multiple Loci Mapping in Experimental Crosses by the Iterative Adaptive Penalized

More information

Or How to select variables Using Bayesian LASSO

Or How to select variables Using Bayesian LASSO Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO On Bayesian Variable Selection

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES: .5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the

More information

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5

Association Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative

More information

Quantitative characters II: heritability

Quantitative characters II: heritability Quantitative characters II: heritability The variance of a trait (x) is the average squared deviation of x from its mean: V P = (1/n)Σ(x-m x ) 2 This total phenotypic variance can be partitioned into components:

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary

More information

The Quantitative TDT

The Quantitative TDT The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus

More information

Bayesian model selection in graphs by using BDgraph package

Bayesian model selection in graphs by using BDgraph package Bayesian model selection in graphs by using BDgraph package A. Mohammadi and E. Wit March 26, 2013 MOTIVATION Flow cytometry data with 11 proteins from Sachs et al. (2005) RESULT FOR CELL SIGNALING DATA

More information

A Bayesian Approach to Detect Quantitative Trait Loci Using Markov Chain Monte Carlo

A Bayesian Approach to Detect Quantitative Trait Loci Using Markov Chain Monte Carlo Copyright 0 1996 by the Genetics Society of America A Bayesian Approach to Detect Quantitative Trait Loci Using Markov Chain Monte Carlo Jaya M. Satagopan," Brian S. Yandell,t Michael A. Newtont and Thomas

More information

Linkage Mapping. Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6.

Linkage Mapping. Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6. Linkage Mapping Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6. Genetic maps The relative positions of genes on a chromosome can

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

THE problem of identifying the genetic factors un- QTL models because of their ability to separate linked

THE problem of identifying the genetic factors un- QTL models because of their ability to separate linked Copyright 2001 by the Genetics Society of America A Statistical Framework for Quantitative Trait Mapping Śaunak Sen and Gary A. Churchill The Jackson Laboratory, Bar Harbor, Maine 04609 Manuscript received

More information

Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control

Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control Xiaoquan Wen Department of Biostatistics, University of Michigan A Model

More information