QTL model selection: key players
|
|
- David Richards
- 6 years ago
- Views:
Transcription
1 QTL Model Selection. Bayesian strategy. Markov chain sampling 3. sampling genetic architectures 4. criteria for model selection Model Selection Seattle SISG: Yandell 0 QTL model selection: key players observed measurements y phenotypic trait m markers & linkage map i individual index (,,n missing i data missing marker data QT genotypes alleles QQ, Q, or at locus unknown uantities λ QT locus (or loci μ phenotype model parameters γ QTL model/genetic architecture pr( m,λ,γ genotype model observed mx Yy missing grounded ddby lik linkage map, experimental tlcross recombination yields multinomial for given m pr(y,γ phenotype model distribution shape (assumed normal here unknown parameters μ (could be non-parametric Q unknown λλ θμ γ after Sen Churchill (00 Model Selection Seattle SISG: Yandell 0
2 QTL mapping (from ZB Zeng phenotype model pr(y,γ genotypes Q pr( m,λ,γ λ markers M Model Selection Seattle SISG: Yandell 0 3 classical likelihood approach genotype model pr( m,λ,γ missing genotypes depend on observed markers m across genome phenotype model pr(y,γ link phenotypes y to genotypes LOD( λ log 0{max pr( y m, μ, λ } + c ( 0 μ likelihood mixes over missing QTL genotypes: pr( y m, λ pr( y, μpr( m, λ Model Selection Seattle SISG: Yandell 0 4
3 EM approach Iterate E and M steps expectation (E: geno prob s pr( m,λ,γ maximization (M: pheno model parameters mean, effects, variance careful attention when many QTL present Multiple papers by Zhao-Bang Zeng and others Start with simple initial model Add QTL, epistatic effects seuentially Model Selection Seattle SISG: Yandell 0 5 classic model search initial model from single QTL analysis search for additional QTL search for epistasis between pairs of QTL Both in model? One in model? Neither? Refine model Update QTL positions Check if existing QTL can be dropped Analogous to stepwise regression Model Selection Seattle SISG: Yandell 0 6
4 comparing models (details later balance model fit against model complexity want to fit data well (maximum likelihood without getting too complicated a model smaller model bigger model fit model miss key features fits better estimate phenotype may be biased no bias predict new data may be biased no bias interpret model easier more complicated estimate effects low variance high variance SysGen: Overview Seattle SISG: Yandell 0 7. Bayesian strategy for QTL study augment data (y,m with missing genotypes study unknowns (λ,γ given augmented data (y,m, find better genetic architectures γ find most likely genomic regions QTL λ estimate phenotype parameters genotype means μ sample from posterior in some clever way multiple imputation (Sen Churchill 00 Markov chain Monte Carlo (MCMC (Satagopan et al. 996; Yi et al. 005, 007 likelihood* prior posterior constant phenotype likelihood*[prior for, λ, γ ] posterior for, λ, γ constant pr( y, γ *[pr( m, λ, γ pr( μ γ pr( λ m, γ pr( γ ] pr(, λ, γ y, m pr( y m Model Selection Seattle SISG: Yandell 0 8
5 Bayes posterior for normal data n lar rge n small prior r prior mean actual mean y phenotype values prior rge n small n lar prior mean actual me an y phenotype values small prior variance large prior variance Model Selection Seattle SISG: Yandell 0 9 Posterior on genotypic means? phenotype model pr(y,μ data means prior mean data mean n small prior posterior means n large Q QQ y phenotype values Model Selection Seattle SISG: Yandell 0 0
6 posterior centered on sample genotypic mean but shrunken slightly toward overall mean Bayes posterior QTL means phenotype mean: genotypic prior: posterior: / ( ( ( ( ( ( ( + n b y V y b y b y E V y E y V y E σ μ μ κσ μ μ σ μ QTL : Bayes Seattle SISG: Yandell 00 shrinkage: / sum } count{ } { + i i n n b n y y n i κ κ pr( m,λ recombination model pr( m,λ pr(geno map, locus pr(geno flanking markers, locus m m 3 m 4 m 5 m 6 m? markers Model Selection Seattle SISG: Yandell 0 λ distance along chromosome
7 Model Selection Seattle SISG: Yandell 0 3 what are likely QTL genotypes? how does phenotype y improve guess? D4Mit4 D4Mit4 bp what are probabilities for genotype between markers? recombinants AA:AB all : if ignore y and if we use y? AA AA AB AA AA AB AB AB Genotype Model Selection Seattle SISG: Yandell 0 4
8 posterior on QTL genotypes full conditional of given data, parameters proportional to prior pr( m, λ weight toward that agrees with flanking markers proportional to likelihood pr(y, μ weight toward with similar phenotype values posterior recombination model balances these two this is the E-step of EM computations pr( y, m, λ pr( y, μ * pr( m, λ pr( y m, λ Model Selection Seattle SISG: Yandell 0 5 Where are the loci λ on the genome? prior over genome for QTL positions flat prior no prior idea of loci or use prior studies to give more weight to some regions posterior depends on QTL genotypes pr(λ m, pr(λ pr( m,λ / constant constant determined by averaging over all possible genotypes over all possible loci λ on entire map no easy way to write down posterior Model Selection Seattle SISG: Yandell 0 6
9 what is the genetic architecture γ? which positions correspond to QTLs? priors on loci (previous slide which QTL have main effects? priors for presence/absence of main effects same prior for all QTL can put prior on each d.f. ( for BC, for F which pairs of QTL have epistatic interactions? prior for presence/absence of epistatic pairs depends on whether 0,, QTL have main effects epistatic effects less probable than main effects Model Selection Seattle SISG: Yandell 0 7 γ genetic architecture: loci: main QTL epistatic pairs effects: add, dom aa, ad, dd Model Selection Seattle SISG: Yandell 0 8
10 Bayesian priors & posteriors augmenting with missing genotypes prior is recombination model posterior is (formally E step of EM algorithm sampling phenotype model parameters μ prior is flat normal at grand mean (no information posterior shrinks genotypic means toward grand mean (details for unexplained variance omitted here sampling QTL loci λ prior is flat across genome (all loci eually likely sampling QTL genetic architecture model γ number of QTL prior is Poisson with mean from previous IM study genetic architecture of main effects and epistatic interactions priors on epistasis depend on presence/absence of main effects Model Selection Seattle SISG: Yandell 0 9. Markov chain sampling construct Markov chain around posterior want posterior as stable distribution of Markov chain in practice, the chain tends toward stable distribution initial values may have low posterior probability burn-in period to get chain mixing well sample QTL model components from full conditionals sample locus λ given,γ (using Metropolis-Hastings step sample genotypes given λ,y,γ (using Gibbs sampler sample effects μ given,y,γ (using Gibbs sampler sample QTL model γ given λ,y, (using Gibbs or M-H ( λ,, γ ~ pr( λ,, γ y, m ( λ,, γ ( λ,, γ L ( λ,, γ N Model Selection Seattle SISG: Yandell 0 0
11 MCMC sampling of unknowns (,µ,λ for given genetic architecture γ Gibbs sampler genotypes effects µ not loci λ ~ pr( y, m, λ i pr( y, μpr( μ μ ~ pr( y pr( m, λpr( λ m λ ~ pr( m i Metropolis-Hastings sampler extension of Gibbs sampler does not reuire normalization pr( m sum λ pr( m, λ pr(λ Model Selection Seattle SISG: Yandell 0 Gibbs sampler for two genotypic means want to study two correlated effects could sample directly from their bivariate distribution assume correlation ρ is known instead use Gibbs sampler: sample each effect from its full conditional given the other pick order of sampling at random repeat many times μ 0 ρ ~ N, μ 0 ρ μ ~ N μ ~ N ( ρ ρ ( ρμ, ρ Model Selection Seattle SISG: Yandell 0
12 Gibbs sampler samples: ρ 0.6 N 50 samples N 00 samples Gibbs: mean Markov chain index Gibbs: mean 3 n 3 n Gibbs: mean Gibbs: mean Markov chain index Gibbs: mean Gibbs: mea Gibbs: mea Gibbs: mean Gibbs: mean n n Gibbs: mean Markov chain index Gibbs: mean Markov chain index Gibbs: mean Model Selection Seattle SISG: Yandell 0 3 full conditional for locus cannot easily sample from locus full conditional pr(λ ymµ y,m,µ, pr(λ λ m m, pr( m, λ pr(λ / constant constant is very difficult to compute explicitly must average over all possible loci λ over genome must do this for every possible genotype Gibbs sampler will not work in general but can use method based on ratios of probabilities Metropolis-Hastings is extension of Gibbs sampler Model Selection Seattle SISG: Yandell 0 4
13 Metropolis-Hastings idea want to study distribution f(λ take Monte Carlo samples unless too complicated take samples using ratios of f Metropolis-Hastings samples: propose new value λ * near (? current value λ from some distribution g accept new value with prob a Gibbs sampler: a always f(λ g(λ λ λ * a * * f ( λ g( λ λ min, * f ( λ g( λ λ Model Selection Seattle SISG: Yandell mcmc seuence Metropolis-Hastings for locus λ 0.6 pr(θ Y θ θ λ λ added twist: occasionally propose from entire genome Model Selection Seattle SISG: Yandell 0 6
14 Metropolis-Hastings samples mcmc seuence N 00 samples N 000 samples narrow g wide g narrow g wide g mcmc seuence θ λ θλ θ λ θ λ. mcmc seuence mcmc seuence histogram pr(θ Y 0 4 histogram pr(θ Y histogram pr(θ Y histogram pr(θ Y θ θ θ θ Model Selection Seattle SISG: Yandell sampling genetic architectures search across genetic architectures γ of various sizes allow change in number of QTL allow change in types of epistatic interactions methods for search reversible jump MCMC Gibbs sampler with loci indicators complexity of epistasis Fisher-Cockerham effects model general multi-qtl interaction & limits of inference Model Selection Seattle SISG: Yandell 0 8
15 reversible jump MCMC consider known genotypes at known loci λ models with or QTL M-H step between -QTL and -QTL models model changes dimension (via careful bookkeeping consider mixture over QTL models H γ QTL : Y β 0 + β ( + e γ QTL : Y β 0 + β ( + β ( + e Model Selection Seattle SISG: Yandell 0 9 geometry of reversible jump Move Between Models Reversible Jump Seuence b m c 0.7 b m β b b β Model Selection Seattle SISG: Yandell 0 30
16 geometry allowing and λ to change a short seuence first 000 with m<3 b b b b Model Selection Seattle SISG: Yandell β β collinear QTL correlated effects 4-week 8-week 0.0 cor cor -0.7 additive additive additive additive effect effect linked QTL collinear genotypes correlated estimates of effects (negative if in coupling phase sum of linked effects usually fairly constant Model Selection Seattle SISG: Yandell 0 3
17 sampling across QTL models γ 0 λ λ m+ λ λ m L action steps: draw one of three choices update QTL model γ with probability -b(γ-d(γ update current model using full conditionals sample QTL loci, effects, and genotypes add a locus with probability b(γ propose a new locus along genome innovate new genotypes at locus and phenotype effect decide whether to accept the birth of new locus drop a locus with probability d(γ propose dropping one of existing loci decide whether to accept the death of locus Model Selection Seattle SISG: Yandell 0 33 Gibbs sampler with loci indicators consider only QTL at pseudomarkers every - cm modest approximation with little bias use loci indicators in each pseudomarker γ if QTL present γ 0 if no QTL present Gibbs sampler on loci indicators γ relatively easy to incorporate epistasis Yi, Yandell, Churchill, hill Allison, Eisen, Pomp (005 Genetics (see earlier work of Nengjun Yi and Ina Hoeschele μ μ + γ β ( + γ β (, γ k 0, Model Selection Seattle SISG: Yandell 0 34
18 Bayesian shrinkage estimation soft loci indicators strength of evidence for λ j depends on γ 0 γ (grey scale shrink most γs to zero Wang et al. (005 Genetics Shizhong Xu group at U CA Riverside μ β 0 + γ β ( + γ β (, 0 γ k Model Selection Seattle SISG: Yandell 0 35 other model selection approaches include all potential loci in model assume true model is sparse in some sense Sparse partial least suares Chun, Keles (009 Genetics; 00 JRSSB LASSO model selection Foster (006; Foster Verbyla Pitchford (007 JABES Xu (007 Biometrics; Yi Xu (007 Genetics Shi Wahba Wright Klein Klein (008 Stat & Infer Model Selection Seattle SISG: Yandell 0 36
19 4. criteria for model selection balance fit against complexity classical information criteria penalize likelihood L by model size γ IC log L(γ y + penalty(γ maximize over unknowns Bayes factors marginal posteriors pr(y γ average over unknowns Model Selection Seattle SISG: Yandell 0 37 classical information criteria start with likelihood L(γ y, m measures fit of architecture (γ to phenotype (y given marker data (m genetic architecture (γ depends on parameters have to estimate loci (µ and effects (λ complexity related to number of parameters γ size of genetic architecture BC: γ + n.tl + n.tl(n.tl F: γ + n.tl +4n.tl(n.tl Model Selection Seattle SISG: Yandell 0 38
20 classical information criteria construct information criteria balance fit to complexity Akaike AIC log(l + γ Bayes/Schwartz BIC log(l + γ log(n Broman BIC δ log(l + δ γ log(n general form: IC log(l + γ D(n compare models hypothesis testing: designed for one comparison log[lr(γ, γ ] L(y m, γ L(y m, γ model selection: penalize complexity IC(γ, γ log[lr(γ, γ ] + ( γ γ D(n Model Selection Seattle SISG: Yandell 0 39 information criteria vs. model size WinQTL.0 SCD data on F AAIC BIC( BIC( dbic(δ models,,3,4 QTL epistasis : AD information criteria model parameters p epistasis Model Selection Seattle SISG: Yandell 0 40 d d d d d A A A A A A A A d d d
21 Bayes factors ratio of model likelihoods ratio of posterior to prior odds for architectures averaged over unknowns pr( γ y, m / pr( γ y, m pr( y m, γ B pr( γ / pr( γ pr( y m, γ roughly euivalent to BIC BIC maximizes over unknowns BF averages over unknowns log( B log( LR ( γ γ log( n Model Selection Seattle SISG: Yandell 0 4 scan of marginal Bayes factor & effect Model Selection Seattle SISG: Yandell 0 4
22 issues in computing Bayes factors BF insensitive to shape of prior on γ geometric, Poisson, uniform precision improves when prior mimics posterior BF sensitivity to prior variance on effects θ prior variance should reflect data variability resolved by using hyper-priors automatic algorithm; no need for user tuning easy to compute Bayes factors from samples sample posterior using MCMC posterior pr(γ y, m is marginal histogram Model Selection Seattle SISG: Yandell 0 43 Bayes factors & genetic architecture γ γ number of QTL prior pr(γ chosen by user posterior pr(γ y,m sampled marginal histogram shape affected by prior pr(a pr( γ y, m / pr( γ BF γ, γ pr( γ y, m / pr( γ bability prior prob pattern of QTL across genome gene action and epistasis e e p p e p u exponential Poisson uniform u u p eu u pu u u e e p p e p e p e e p e u u pu e pu m number of QTL Model Selection Seattle SISG: Yandell 0 44
23 BF sensitivity to fixed prior for effects Bayes factors B45 B34 B3 B 4 3 ( σ / m, β ~ N 0, σ h σ j hyper-prior heritability h G G, h fixed Model Selection Seattle SISG: Yandell 0 45 total BF insensitivity to random effects prior hyper-prior density *Beta(a,b insensitivity to hyper-prior 3.0 density , ,9.5,9,0,3,.0 Bayes factors B34 B3 B hyper-parameter heritability h E(h ( σ / m, β ~ N 0, σ h σ j G G total, h ~ Beta( a, b Model Selection Seattle SISG: Yandell 0 46
QTL model selection: key players
Bayesian Interval Mapping. Bayesian strategy -9. Markov chain sampling 0-7. sampling genetic architectures 8-5 4. criteria for model selection 6-44 QTL : Bayes Seattle SISG: Yandell 008 QTL model selection:
More information1. what is the goal of QTL study?
NSF UAB Course 008 Bayesian Interval Mapping Brian S. Yandell, UW-Madison www.stat.wisc.edu/~yandell/statgen overview: multiple QTL approaches Bayesian QTL mapping & model selection data examples in detail
More informationModel Selection for Multiple QTL
Model Selection for Multiple TL 1. reality of multiple TL 3-8. selecting a class of TL models 9-15 3. comparing TL models 16-4 TL model selection criteria issues of detecting epistasis 4. simulations and
More informationMultiple QTL mapping
Multiple QTL mapping Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] 1 Why? Reduce residual variation = increased power
More informationSeattle Summer Institute : Systems Genetics for Experimental Crosses
Seattle Summer Institute 2012 15: Systems Genetics for Experimental Crosses Brian S. Yandell, UW-Madison Elias Chaibub Neto, Sage Bionetworks www.stat.wisc.edu/~yandell/statgen/sisg Real knowledge is to
More informationBayesian Model Selection for Multiple QTL. Brian S. Yandell
Bayesian Model Selection for Multiple QTL Brian S. Yandell University of Wisconsin-Madison www.stat.wisc.edu/~yandell/statgen Jackson Laboratory, October 005 October 005 Jax Workshop Brian S. Yandell 1
More informationcontact information & resources
Seattle Summer Institute 006 Advanced QTL Brian S. Yandell University of Wisconsin-Madison Bayesian QTL mapping & model selection data examples in detail multiple phenotypes & microarrays software demo
More informationQTL Model Search. Brian S. Yandell, UW-Madison January 2017
QTL Model Search Brian S. Yandell, UW-Madison January 2017 evolution of QTL models original ideas focused on rare & costly markers models & methods refined as technology advanced single marker regression
More informationMapping multiple QTL in experimental crosses
Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationMapping multiple QTL in experimental crosses
Human vs mouse Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman www.daviddeen.com
More informationwhat is Bayes theorem? posterior = likelihood * prior / C
who was Bayes? Reverend Thomas Bayes (70-76) part-time mathematician buried in Bunhill Cemetary, Moongate, London famous paper in 763 Phil Trans Roy Soc London was Bayes the first with this idea? (Laplace?)
More informationQTL Mapping I: Overview and using Inbred Lines
QTL Mapping I: Overview and using Inbred Lines Key idea: Looking for marker-trait associations in collections of relatives If (say) the mean trait value for marker genotype MM is statisically different
More informationOverview. Background
Overview Implementation of robust methods for locating quantitative trait loci in R Introduction to QTL mapping Andreas Baierl and Andreas Futschik Institute of Statistics and Decision Support Systems
More informationInferring Genetic Architecture of Complex Biological Processes
Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison http://www.stat.wisc.edu/~yandell/statgen
More informationTutorial Session 2. MCMC for the analysis of genetic data on pedigrees:
MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation
More informationCausal Graphical Models in Systems Genetics
1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation
More informationR/qtl workshop. (part 2) Karl Broman. Biostatistics and Medical Informatics University of Wisconsin Madison. kbroman.org
R/qtl workshop (part 2) Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Example Sugiyama et al. Genomics 71:70-77, 2001 250 male
More informationGene mapping in model organisms
Gene mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Goal Identify genes that contribute to common human diseases. 2
More informationBayesian linear regression
Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding
More informationBAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI
BAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI By DÁMARIS SANTANA MORANT A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationDetection of multiple QTL with epistatic effects under a mixed inheritance model in an outbred population
Genet. Sel. Evol. 36 (2004) 415 433 415 c INRA, EDP Sciences, 2004 DOI: 10.1051/gse:2004009 Original article Detection of multiple QTL with epistatic effects under a mixed inheritance model in an outbred
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters
More informationStatistical issues in QTL mapping in mice
Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate
More informationStat 516, Homework 1
Stat 516, Homework 1 Due date: October 7 1. Consider an urn with n distinct balls numbered 1,..., n. We sample balls from the urn with replacement. Let N be the number of draws until we encounter a ball
More informationUse of hidden Markov models for QTL mapping
Use of hidden Markov models for QTL mapping Karl W Broman Department of Biostatistics, Johns Hopkins University December 5, 2006 An important aspect of the QTL mapping problem is the treatment of missing
More informationBTRY 4830/6830: Quantitative Genomics and Genetics
BTRY 4830/6830: Quantitative Genomics and Genetics Lecture 23: Alternative tests in GWAS / (Brief) Introduction to Bayesian Inference Jason Mezey jgm45@cornell.edu Nov. 13, 2014 (Th) 8:40-9:55 Announcements
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Backcross P 1 P 2 P 1 F 1 BC 4
More informationIntroduction to QTL mapping in model organisms
Human vs mouse Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] www.daviddeen.com
More informationLecture 8. QTL Mapping 1: Overview and Using Inbred Lines
Lecture 8 QTL Mapping 1: Overview and Using Inbred Lines Bruce Walsh. jbwalsh@u.arizona.edu. University of Arizona. Notes from a short course taught Jan-Feb 2012 at University of Uppsala While the machinery
More informationDoing Bayesian Integrals
ASTR509-13 Doing Bayesian Integrals The Reverend Thomas Bayes (c.1702 1761) Philosopher, theologian, mathematician Presbyterian (non-conformist) minister Tunbridge Wells, UK Elected FRS, perhaps due to
More informationLecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis
Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,
More informationModule 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline
Module 4: Bayesian Methods Lecture 9 A: Default prior selection Peter Ho Departments of Statistics and Biostatistics University of Washington Outline Je reys prior Unit information priors Empirical Bayes
More informationBayesian Inference and MCMC
Bayesian Inference and MCMC Aryan Arbabi Partly based on MCMC slides from CSC412 Fall 2018 1 / 18 Bayesian Inference - Motivation Consider we have a data set D = {x 1,..., x n }. E.g each x i can be the
More informationBayesian construction of perceptrons to predict phenotypes from 584K SNP data.
Bayesian construction of perceptrons to predict phenotypes from 584K SNP data. Luc Janss, Bert Kappen Radboud University Nijmegen Medical Centre Donders Institute for Neuroscience Introduction Genetic
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationA Bayesian Approach to Phylogenetics
A Bayesian Approach to Phylogenetics Niklas Wahlberg Based largely on slides by Paul Lewis (www.eeb.uconn.edu) An Introduction to Bayesian Phylogenetics Bayesian inference in general Markov chain Monte
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is
More informationSupplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control
Supplementary Materials for Molecular QTL Discovery Incorporating Genomic Annotations using Bayesian False Discovery Rate Control Xiaoquan Wen Department of Biostatistics, University of Michigan A Model
More informationBayesian Inference of Interactions and Associations
Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,
More information(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis
Summarizing a posterior Given the data and prior the posterior is determined Summarizing the posterior gives parameter estimates, intervals, and hypothesis tests Most of these computations are integrals
More informationLecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011
Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector
More informationReminder of some Markov Chain properties:
Reminder of some Markov Chain properties: 1. a transition from one state to another occurs probabilistically 2. only state that matters is where you currently are (i.e. given present, future is independent
More informationMarkov Chain Monte Carlo methods
Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As
More informationNormal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,
Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability
More informationEvolution of phenotypic traits
Quantitative genetics Evolution of phenotypic traits Very few phenotypic traits are controlled by one locus, as in our previous discussion of genetics and evolution Quantitative genetics considers characters
More informationSelecting explanatory variables with the modified version of Bayesian Information Criterion
Selecting explanatory variables with the modified version of Bayesian Information Criterion Institute of Mathematics and Computer Science, Wrocław University of Technology, Poland in cooperation with J.K.Ghosh,
More information. Also, in this case, p i = N1 ) T, (2) where. I γ C N(N 2 2 F + N1 2 Q)
Supplementary information S7 Testing for association at imputed SPs puted SPs Score tests A Score Test needs calculations of the observed data score and information matrix only under the null hypothesis,
More informationFast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets
Fast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets Yu-Ling Chang A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment
More informationMonte Carlo in Bayesian Statistics
Monte Carlo in Bayesian Statistics Matthew Thomas SAMBa - University of Bath m.l.thomas@bath.ac.uk December 4, 2014 Matthew Thomas (SAMBa) Monte Carlo in Bayesian Statistics December 4, 2014 1 / 16 Overview
More informationBayesian Phylogenetics
Bayesian Phylogenetics Paul O. Lewis Department of Ecology & Evolutionary Biology University of Connecticut Woods Hole Molecular Evolution Workshop, July 27, 2006 2006 Paul O. Lewis Bayesian Phylogenetics
More informationCSC 2541: Bayesian Methods for Machine Learning
CSC 2541: Bayesian Methods for Machine Learning Radford M. Neal, University of Toronto, 2011 Lecture 3 More Markov Chain Monte Carlo Methods The Metropolis algorithm isn t the only way to do MCMC. We ll
More informationBayesian QTL mapping using skewed Student-t distributions
Genet. Sel. Evol. 34 00) 1 1 1 INRA, EDP Sciences, 00 DOI: 10.1051/gse:001001 Original article Bayesian QTL mapping using skewed Student-t distributions Peter VON ROHR a,b, Ina HOESCHELE a, a Departments
More informationGENOMIC SELECTION WORKSHOP: Hands on Practical Sessions (BL)
GENOMIC SELECTION WORKSHOP: Hands on Practical Sessions (BL) Paulino Pérez 1 José Crossa 2 1 ColPos-México 2 CIMMyT-México September, 2014. SLU,Sweden GENOMIC SELECTION WORKSHOP:Hands on Practical Sessions
More informationBustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #
Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either
More informationMonte Carlo Methods. Leon Gu CSD, CMU
Monte Carlo Methods Leon Gu CSD, CMU Approximate Inference EM: y-observed variables; x-hidden variables; θ-parameters; E-step: q(x) = p(x y, θ t 1 ) M-step: θ t = arg max E q(x) [log p(y, x θ)] θ Monte
More informationLocating multiple interacting quantitative trait. loci using rank-based model selection
Genetics: Published Articles Ahead of Print, published on May 16, 2007 as 10.1534/genetics.106.068031 Locating multiple interacting quantitative trait loci using rank-based model selection, 1 Ma lgorzata
More informationLecture 11: Multiple trait models for QTL analysis
Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple
More informationMultiple interval mapping for ordinal traits
Genetics: Published Articles Ahead of Print, published on April 3, 2006 as 10.1534/genetics.105.054619 Multiple interval mapping for ordinal traits Jian Li,,1, Shengchu Wang and Zhao-Bang Zeng,, Bioinformatics
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationMOST complex traits are influenced by interacting
Copyright Ó 2009 by the Genetics Society of America DOI: 10.1534/genetics.108.099556 Hierarchical Generalized Linear Models for Multiple Quantitative Trait Locus Mapping Nengjun Yi*,1 and Samprit Banerjee
More informationAffected Sibling Pairs. Biostatistics 666
Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD
More informationMarkov Chain Monte Carlo
Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationMarkov Chain Monte Carlo, Numerical Integration
Markov Chain Monte Carlo, Numerical Integration (See Statistics) Trevor Gallen Fall 2015 1 / 1 Agenda Numerical Integration: MCMC methods Estimating Markov Chains Estimating latent variables 2 / 1 Numerical
More informationForward Problems and their Inverse Solutions
Forward Problems and their Inverse Solutions Sarah Zedler 1,2 1 King Abdullah University of Science and Technology 2 University of Texas at Austin February, 2013 Outline 1 Forward Problem Example Weather
More informationHierarchical Generalized Linear Models for Multiple QTL Mapping
Genetics: Published Articles Ahead of Print, published on January 1, 009 as 10.1534/genetics.108.099556 Hierarchical Generalized Linear Models for Multiple QTL Mapping Nengun Yi 1,* and Samprit Baneree
More informationPackage LBLGXE. R topics documented: July 20, Type Package
Type Package Package LBLGXE July 20, 2015 Title Bayesian Lasso for detecting Rare (or Common) Haplotype Association and their interactions with Environmental Covariates Version 1.2 Date 2015-07-09 Author
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary
More informationMANY complex human diseases and traits of bio- tive corrections for multiple testing. Non-Bayesian model
Copyright 2005 by the Genetics Society of America DOI: 10.1534/genetics.104.040386 Bayesian Model Selection for Genome-Wide Epistatic Quantitative Trait Loci Analysis Nengjun Yi,*,,1 Brian S. Yandell,
More informationCausal Model Selection Hypothesis Tests in Systems Genetics
1 Causal Model Selection Hypothesis Tests in Systems Genetics Elias Chaibub Neto and Brian S Yandell SISG 2012 July 13, 2012 2 Correlation and Causation The old view of cause and effect... could only fail;
More informationBayesian Partition Models for Identifying Expression Quantitative Trait Loci
Journal of the American Statistical Association ISSN: 0162-1459 (Print) 1537-274X (Online) Journal homepage: http://www.tandfonline.com/loi/uasa20 Bayesian Partition Models for Identifying Expression Quantitative
More informationLecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1
Lecture 5 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,
More informationBayesian model selection in graphs by using BDgraph package
Bayesian model selection in graphs by using BDgraph package A. Mohammadi and E. Wit March 26, 2013 MOTIVATION Flow cytometry data with 11 proteins from Sachs et al. (2005) RESULT FOR CELL SIGNALING DATA
More informationReview Article Statistical Methods for Mapping Multiple QTL
Hindawi Publishing Corporation International Journal of Plant Genomics Volume 2008, Article ID 286561, 8 pages doi:10.1155/2008/286561 Review Article Statistical Methods for Mapping Multiple QTL Wei Zou
More informationTHE data in the QTL mapping study are usually composed. A New Simple Method for Improving QTL Mapping Under Selective Genotyping INVESTIGATION
INVESTIGATION A New Simple Method for Improving QTL Mapping Under Selective Genotyping Hsin-I Lee,* Hsiang-An Ho,* and Chen-Hung Kao*,,1 *Institute of Statistical Science, Academia Sinica, Taipei 11529,
More informationPetr Volf. Model for Difference of Two Series of Poisson-like Count Data
Petr Volf Institute of Information Theory and Automation Academy of Sciences of the Czech Republic Pod vodárenskou věží 4, 182 8 Praha 8 e-mail: volf@utia.cas.cz Model for Difference of Two Series of Poisson-like
More informationBayesian reversible-jump for epistasis analysis in genomic studies
Balestre and de Souza BMC Genomics (2016) 17:1012 DOI 10.1186/s12864-016-3342-6 RESEARCH ARTICLE Bayesian reversible-jump for epistasis analysis in genomic studies Marcio Balestre 1* and Claudio Lopes
More informationIntroduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016
Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An
More informationMCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES
MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES Elizabeth A. Thompson Department of Statistics, University of Washington Box 354322, Seattle, WA 98195-4322, USA Email: thompson@stat.washington.edu This
More informationCausal Network Models for Correlated Quantitative Traits. outline
Causal Network Models for Correlated Quantitative Traits Brian S. Yandell UW Madison October 2012 www.stat.wisc.edu/~yandell/statgen Jax SysGen: Yandell 2012 1 outline Correlation and causation Correlatedtraitsinorganized
More informationPattern Recognition and Machine Learning
Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability
More informationA Review of Bayesian Variable Selection Methods: What, How and Which
Bayesian Analysis (2009) 4, Number 1, pp. 85 118 A Review of Bayesian Variable Selection Methods: What, How and Which R.B. O Hara and M. J. Sillanpää Abstract. The selection of variables in regression
More informationA new simple method for improving QTL mapping under selective genotyping
Genetics: Early Online, published on September 22, 2014 as 10.1534/genetics.114.168385 A new simple method for improving QTL mapping under selective genotyping Hsin-I Lee a, Hsiang-An Ho a and Chen-Hung
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationThe E-M Algorithm in Genetics. Biostatistics 666 Lecture 8
The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as
More informationAdvanced Machine Learning
Advanced Machine Learning Nonparametric Bayesian Models --Learning/Reasoning in Open Possible Worlds Eric Xing Lecture 7, August 4, 2009 Reading: Eric Xing Eric Xing @ CMU, 2006-2009 Clustering Eric Xing
More informationStat 535 C - Statistical Computing & Monte Carlo Methods. Lecture 18-16th March Arnaud Doucet
Stat 535 C - Statistical Computing & Monte Carlo Methods Lecture 18-16th March 2006 Arnaud Doucet Email: arnaud@cs.ubc.ca 1 1.1 Outline Trans-dimensional Markov chain Monte Carlo. Bayesian model for autoregressions.
More informationMethods for QTL analysis
Methods for QTL analysis Julius van der Werf METHODS FOR QTL ANALYSIS... 44 SINGLE VERSUS MULTIPLE MARKERS... 45 DETERMINING ASSOCIATIONS BETWEEN GENETIC MARKERS AND QTL WITH TWO MARKERS... 45 INTERVAL
More informationLecture 6: Markov Chain Monte Carlo
Lecture 6: Markov Chain Monte Carlo D. Jason Koskinen koskinen@nbi.ku.dk Photo by Howard Jackman University of Copenhagen Advanced Methods in Applied Statistics Feb - Apr 2016 Niels Bohr Institute 2 Outline
More informationVariable Selection in Structured High-dimensional Covariate Spaces
Variable Selection in Structured High-dimensional Covariate Spaces Fan Li 1 Nancy Zhang 2 1 Department of Health Care Policy Harvard University 2 Department of Statistics Stanford University May 14 2007
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm
More informationEstimating Evolutionary Trees. Phylogenetic Methods
Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent
More informationMCMC: Markov Chain Monte Carlo
I529: Machine Learning in Bioinformatics (Spring 2013) MCMC: Markov Chain Monte Carlo Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Contents Review of Markov
More informationExpression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia
Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.
More informationMH I. Metropolis-Hastings (MH) algorithm is the most popular method of getting dependent samples from a probability distribution
MH I Metropolis-Hastings (MH) algorithm is the most popular method of getting dependent samples from a probability distribution a lot of Bayesian mehods rely on the use of MH algorithm and it s famous
More information