Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees:
|
|
- Magnus Spencer
- 5 years ago
- Views:
Transcription
1 MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation Monte Carlo estimation of linkage lod scores 1
2 GENETIC MARKERS Human genome: bp of DNA. DNA variants that can be typed in individuals. Allele type of the DNA at position on chromosome Have been mapped: known locations on the genome. Locus position on a chromosome, or DNA at that position Idea: map genes for traits relative to these markers. Microsatellites; lots of alleles; 350 in a genome scan One every 10 7 bp SNPs: typically only two alleles; lots more exist; 1 per 1000 bp 2
3 THE STRUCTURE OF A GENETIC MODEL Population model: parameters q, provide probabilities for latent A allelic types of FGL at each j Inheritance model: parameters ρ, provide probabilities for latent S inheritance of FGL at j, jointly over j. Individual genotypes G is deterministic function of (S, A) penetrance model, parameters β relates G (and perhaps observable covariates) to observable data Y ξ = (q, ρ, β). 3
4 FROM RECOMBINATION TO LOCATION Recall model for Si, = (Si,1,..., S i,l ): Pr(Si,j S i,j+1 ) = ρj: assumed same i (convenience). Si,j assumed Markov in j: no genetic interference. Genetic distance d is expected number of crossover events on underlying chromosome: an additive measure. Crossovers arise as a Poisson process rate 1 (per Morgan). There is a recombination between two loci if there is an odd number W of crossovers between them: W (d) P(d). Hence the Haldane map function: ρ(d) = (1/2)(1 exp( 2d)). The key thing is the model: the map function just puts loci onto a linear location map. (See later: MCMC under interference.) 4
5 WHAT AND WHY THE LOCATION LOD SCORE γ Λ M = ((q i, λ i ); i = 1,..., L) M1 M2 M3 M4 M5 β YT Parameter ξ = (β, γ, Λ M ). Data Y = (Y M, Y T ) lod(γ) = log10 ( Pr(Y; Λ M, β, γ) Pr(Y; Λ M, β, γ = ) Trait locus location γ is parameter of interest: γ = is no linkage. Exact computation is infeasible ) 5
6 AN EXAMPLE PEDIGREE: APPROXIMATED SIMPED: disease status and marker availability Marker data are SIMULATED at 10 linked markers on Chr 1. Trait is close to M6 6
7 Chromosome Position (cm) lod score AN EXAMPLE MULTIPOINT LOD SCORE
8 MONTE CARLO LIKELIHOODS ON PEDIGREES Monte Carlo estimates expectations. L(ξ) = Pξ (Y) = P ξ (S, Y) = P ξ (Y S) P ξ (S) S S for parameters ξ and latent variables S. Simple (but not useful) example: L(ξ) = E ξ (P ξ (Y S)) More generally L(ξ) = ( ) ( Pξ (S, Y) P S P Pξ (S, Y) (S) = E P (S) P (S) provided P (S) > 0 if P ξ (S, Y) > 0. ) 8
9 SEQUENTIAL IMPUTATION OVER LOCI Choose the sampling distribution: loci Now: P (S,j ) = P ξ (S 0,j S (j 1), Y (j) ) = P ξ (S 0,j S,1,... S,j 1, Y,1,..., Y,j 1, Y,j ) = P ξ (S 0,j S,j 1, Y,j) data j i Y,j meioses P ξ 0 (S,j S (j 1), Y (j) ) = P ξ0 (S,j, Y,j S (j 1), Y (j 1) ) P ξ 0 (Y,j S (j 1), Y (j 1) ) = P ξ0 (S,j, Y,j S (j 1), Y (j 1) ) wj. where, by pedigree-peeling, we can compute wj = P ξ 0 (Y,j Y (j 1), S (j 1) ) = P ξ 0 (Y,j S,j 1 ). 9
10 MONTE CARLO LIKELIHOOD ESTIMATE Thus sequential imputation distribution is P (S ) = L j=1 P ξ 0 (S,j S (j 1), Y (j) ) = P ξ0 (S, Y) W L (S ) where W L (S ) = L j=1 wj. Now ( ) Pξ L(ξ0) = P ξ (Y) = E (S, Y) 0 P P (S) = E P (W L (S )) Given N realizations S (τ) the estimate of L(ξ0) is N 1 τ W L (S (τ) ). 10
11 THE IDEAL SAMPLING DISTRIBUTION We want P (S) close to proportional to P ξ (Y, S) 0 that is P (S) P ξ (S Y). 0 Of course we cannot achieve this, else Monte Carlo would be unnecessary. Suppose we use MCMC to sample S from P ξ 0 (S Y). P ξ (Y) = S = E ξ 0 P ξ (Y, S) = S ( P ξ (Y, S) = P ξ 0 (Y) E ξ0 P ξ 0 (S Y) Y ( Pξ (Y, S) P ξ (Y, S) P ξ (S Y)P (S Y) ξ0 0 ) P ξ 0 (Y, S) Y ) 11
12 LIKELIHOOD RATIO ESTIMATION Thus we have L(ξ) L(ξ0) = P ξ(y) P ξ (Y) = E ξ0 0 ( Pξ (Y, S) P ξ 0 (Y, S) Y S is the random variable, Y is fixed. S P ξ ( Y). 0 If S (τ), τ = 1,..., N, are realized from P ξ ( Y) then the likelihood 0 ratio can be estimated by 1 N N τ=1 P ξ(y, S (τ) ) P ξ 0 (Y, S(τ) ) ) 12
13 LINKAGE LOCATION LIKELIHOOD RATIO The form for linkage lod that follows directly from this is ( L(β, γ1, Λ M ) Pξ L(β, γ0, Λ M ) = E (Y 1 T, Y M, S T, S M ) ) ξ0 P ξ (Y 0 T, Y M, S T, S M ) Y T, Y M for two hypothesized trait locus positions γ1 and γ0. Now P ξ (Y, S) = P β (Y T S T )P ΛM (Y M, S M )Pγ(S T S M ) so ratio reduces to ( L(β, γ1, Λ M ) P L(β, γ0, Λ M ) = E γ1 (S ) T S M ) ξ0 Pγ0 (S T S M ) Y T, Y M 13
14 LOCAL ESTIMATE IS VERY SIMPLE: GLOBAL IS HARD i... l T r... Pγ1 (S T S M ) Pγ0 (S T S M ) = i ρ1l ( ρ 0l ( ρ 1r ρ0r ) Si,T S i,l ( 1 ρ 1l 1 ρ 0l ) Si,T Si,r ( 1 ρ1r 1 ρ0r ) 1 Si,T S i,l ) 1 Si,T Si,r The above works well only for γ1 γ0, and for γ0, γ1 with same l and r. When likelihoods are not smooth, combining LR estimates does not work well especially across markers. 14
15 AN MCMC IMPORTANCE SAMPLING ESTIMATE Lange and Sobel (1996) write the likelihood in the form L(β, γ, Λ M ) = P β,γ,λm (Y M, Y T ) P β,γ,λm (Y T Y M ) = S M P β,γ (Y T S M )P ΛM (S M Y M ) = E ΛM (P β,γ (Y T S M ) Y M ). Sample S M given Y M : compute P (Y T S M ) β, γ a form of Rao-Blackwellization integrate over S T. Also importance sampling: maybe P (S M Y M ) P (S M Y M, Y T ) For fuzzy traits it works quite well. 15
16 METROPOLIS HASTINGS FOR INTERFERENCE Suppose we have interference model P (I) (S) in place of Haldane model P (H) (S) we have used so far. Use block-gibbs update of meiosis i (Si, ) to propose S. Hastings ratio is for current S and proposed S is h(s ; S) = P (I) (S, Y) P (I) (S, Y) P (H) (Si, S k,, k i, Y) P (H) (S i, S k,, k i, Y) = P (I) (S, Y)P (H) (S, Y) P (I) (S, Y)P (H) (S, Y) = P (Y S )P (I) (S )P (Y S)P (H) (S) P (Y S)P (I) (S)P (Y S )P (H) (S ) 16
17 INTERFERENCE ctd. h(s ; S) = m k=1 P (I) (S k, ) P (H) (S k, ) P (I) (S k, ) P (H) (S k, ) = P (I) (S i, ) P (I) (Si, ) P (H) (Si, ) P (H) (S i, ). Pr(S = S ) = a = min(1, h). Pr(S = S) = 1 a. Question: better to sample under H and reweight, or use M-H to sample under model I? 17
MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES
MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES Elizabeth A. Thompson Department of Statistics, University of Washington Box 354322, Seattle, WA 98195-4322, USA Email: thompson@stat.washington.edu This
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities
More information1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics
1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
More informationStatistical issues in QTL mapping in mice
Statistical issues in QTL mapping in mice Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Outline Overview of QTL mapping The X chromosome Mapping
More informationQTL model selection: key players
Bayesian Interval Mapping. Bayesian strategy -9. Markov chain sampling 0-7. sampling genetic architectures 8-5 4. criteria for model selection 6-44 QTL : Bayes Seattle SISG: Yandell 008 QTL model selection:
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationGene mapping in model organisms
Gene mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman Goal Identify genes that contribute to common human diseases. 2
More informationExpression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia
Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.
More informationThe Lander-Green Algorithm. Biostatistics 666 Lecture 22
The Lander-Green Algorithm Biostatistics 666 Lecture Last Lecture Relationship Inferrence Likelihood of genotype data Adapt calculation to different relationships Siblings Half-Siblings Unrelated individuals
More informationFor 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.
STAT 550 Howework 6 Anton Amirov 1. This question relates to the same study you saw in Homework-4, by Dr. Arno Motulsky and coworkers, and published in Thompson et al. (1988; Am.J.Hum.Genet, 42, 113-124).
More informationStatistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014
Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/
More informationAffected Sibling Pairs. Biostatistics 666
Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD
More informationOn Computation of P-values in Parametric Linkage Analysis
On Computation of P-values in Parametric Linkage Analysis Azra Kurbašić Centre for Mathematical Sciences Mathematical Statistics Lund University p.1/22 Parametric (v. Nonparametric) Analysis The genetic
More informationThe genomes of recombinant inbred lines
The genomes of recombinant inbred lines Karl W Broman Department of Biostatistics Johns Hopkins University http://www.biostat.jhsph.edu/~kbroman C57BL/6 2 1 Recombinant inbred lines (by sibling mating)
More informationUse of hidden Markov models for QTL mapping
Use of hidden Markov models for QTL mapping Karl W Broman Department of Biostatistics, Johns Hopkins University December 5, 2006 An important aspect of the QTL mapping problem is the treatment of missing
More informationAdvanced Algorithms and Models for Computational Biology -- a machine learning approach
Advanced Algorithms and Models for Computational Biology -- a machine learning approach Population Genetics: meiosis and recombination Eric Xing Lecture 15, March 8, 2006 Reading: DTW book, Meiosis Meiosis
More informationMultiple QTL mapping
Multiple QTL mapping Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] 1 Why? Reduce residual variation = increased power
More informationThe universal validity of the possible triangle constraint for Affected-Sib-Pairs
The Canadian Journal of Statistics Vol. 31, No.?, 2003, Pages???-??? La revue canadienne de statistique The universal validity of the possible triangle constraint for Affected-Sib-Pairs Zeny Z. Feng, Jiahua
More informationIntroduc)on to Gene)cs How to Analyze Your Own Genome Fall 2013
Introduc)on to Gene)cs 02-223 How to Analyze Your Own Genome Fall 2013 Overview Primer on gene
More informationThe Admixture Model in Linkage Analysis
The Admixture Model in Linkage Analysis Jie Peng D. Siegmund Department of Statistics, Stanford University, Stanford, CA 94305 SUMMARY We study an appropriate version of the score statistic to test the
More informationQTL Mapping I: Overview and using Inbred Lines
QTL Mapping I: Overview and using Inbred Lines Key idea: Looking for marker-trait associations in collections of relatives If (say) the mean trait value for marker genotype MM is statisically different
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Backcross P 1 P 2 P 1 F 1 BC 4
More informationStatistics 246 Spring 2006
Statistics 246 Spring 2006 Meiosis and Recombination Week 3, Lecture 1 1 - the process which starts with a diploid cell having one set of maternal and one of paternal chromosomes, and ends up with four
More informationIntroduction to QTL mapping in model organisms
Human vs mouse Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University www.biostat.jhsph.edu/~kbroman [ Teaching Miscellaneous lectures] www.daviddeen.com
More informationGene mapping, linkage analysis and computational challenges. Konstantin Strauch
Gene mapping, linkage analysis an computational challenges Konstantin Strauch Institute for Meical Biometry, Informatics, an Epiemiology (IMBIE) University of Bonn E-mail: strauch@uni-bonn.e Genetics an
More informationVariance Component Models for Quantitative Traits. Biostatistics 666
Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationMapping multiple QTL in experimental crosses
Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics and Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman [ Teaching Miscellaneous lectures]
More informationQTL model selection: key players
QTL Model Selection. Bayesian strategy. Markov chain sampling 3. sampling genetic architectures 4. criteria for model selection Model Selection Seattle SISG: Yandell 0 QTL model selection: key players
More informationSolutions to Problem Set 4
Question 1 Solutions to 7.014 Problem Set 4 Because you have not read much scientific literature, you decide to study the genetics of garden peas. You have two pure breeding pea strains. One that is tall
More informationNormal approximation to Binomial
Normal approximation to Binomial 24.10.2007 GE02: day 3 part 3 Yurii Aulchenko Erasmus MC Rotterdam Binomial distribution at different n and p 0.3 N=10 0.2 N=25 0.1 N=100 P=0.5 0.2 0.1 0.1 0 0 0 k k k
More informationLinkage and Linkage Disequilibrium
Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies
More informationComputation of Multilocus Prior Probability of Autozygosity for Complex Inbred Pedigrees
Genetic Epidemiology 14:1 15 (1997) Computation of Multilocus Prior Probability of Autozygosity for Complex Inbred Pedigrees Sun-Wei Guo* Department of Biostatistics, University of Michigan, Ann Arbor
More informationBAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI
BAYESIAN MAPPING OF MULTIPLE QUANTITATIVE TRAIT LOCI By DÁMARIS SANTANA MORANT A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR
More informationGenetic Association Studies in the Presence of Population Structure and Admixture
Genetic Association Studies in the Presence of Population Structure and Admixture Purushottam W. Laud and Nicholas M. Pajewski Division of Biostatistics Department of Population Health Medical College
More informationDepartment of Forensic Psychiatry, School of Medicine & Forensics, Xi'an Jiaotong University, Xi'an, China;
Title: Evaluation of genetic susceptibility of common variants in CACNA1D with schizophrenia in Han Chinese Author names and affiliations: Fanglin Guan a,e, Lu Li b, Chuchu Qiao b, Gang Chen b, Tinglin
More informationUNIT 8 BIOLOGY: Meiosis and Heredity Page 148
UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular
More informationBayesian construction of perceptrons to predict phenotypes from 584K SNP data.
Bayesian construction of perceptrons to predict phenotypes from 584K SNP data. Luc Janss, Bert Kappen Radboud University Nijmegen Medical Centre Donders Institute for Neuroscience Introduction Genetic
More information1. Understand the methods for analyzing population structure in genomes
MSCBIO 2070/02-710: Computational Genomics, Spring 2016 HW3: Population Genetics Due: 24:00 EST, April 4, 2016 by autolab Your goals in this assignment are to 1. Understand the methods for analyzing population
More informationRobert Collins CSE586, PSU Intro to Sampling Methods
Robert Collins Intro to Sampling Methods CSE586 Computer Vision II Penn State Univ Robert Collins A Brief Overview of Sampling Monte Carlo Integration Sampling and Expected Values Inverse Transform Sampling
More informationEvaluating the Performance of a Block Updating McMC Sampler in a Simple Genetic Application
Evaluating the Performance of a Block Updating McMC Sampler in a Simple Genetic Application N.A. SHEEHAN B. GULDBRANDTSEN 1 D.A. SORENSEN 1 1 Markov chain Monte Carlo (McMC) methods have provided an enormous
More informationOptimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives
Genetic Epidemiology 16:225 249 (1999) Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Mary Sara McPeek* Department of Statistics, University of Chicago, Chicago, Illinois
More informationLinkage Mapping. Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6.
Linkage Mapping Reading: Mather K (1951) The measurement of linkage in heredity. 2nd Ed. John Wiley and Sons, New York. Chapters 5 and 6. Genetic maps The relative positions of genes on a chromosome can
More informationThe problem Lineage model Examples. The lineage model
The lineage model A Bayesian approach to inferring community structure and evolutionary history from whole-genome metagenomic data Jack O Brien Bowdoin College with Daniel Falush and Xavier Didelot Cambridge,
More informationPopulation Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda
1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;
More informationFriday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo
Friday Harbor 2017 From Genetics to GWAS (Genome-wide Association Study) Sept 7 2017 David Fardo Purpose: prepare for tomorrow s tutorial Genetic Variants Quality Control Imputation Association Visualization
More informationMODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES
MODEL-FREE LINKAGE AND ASSOCIATION MAPPING OF COMPLEX TRAITS USING QUANTITATIVE ENDOPHENOTYPES Saurabh Ghosh Human Genetics Unit Indian Statistical Institute, Kolkata Most common diseases are caused by
More informationBayesian Inference of Interactions and Associations
Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,
More informationtheta H H H H H H H H H H H K K K K K K K K K K centimorgans
Linkage Phase Recall that the recombination fraction ρ for two loci denotes the probability of a recombination event between those two loci. For loci on different chromosomes, ρ = 1=2. For loci on the
More informationNatural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate
Natural Selection Population Dynamics Humans, Sickle-cell Disease, and Malaria How does a population of humans become resistant to malaria? Overproduction Environmental pressure/competition Pre-existing
More informationp(d g A,g B )p(g B ), g B
Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationMapping multiple QTL in experimental crosses
Human vs mouse Mapping multiple QTL in experimental crosses Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman www.daviddeen.com
More informationThe E-M Algorithm in Genetics. Biostatistics 666 Lecture 8
The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as
More informationIntroduction to QTL mapping in model organisms
Introduction to QTL mapping in model organisms Karl W Broman Department of Biostatistics Johns Hopkins University kbroman@jhsph.edu www.biostat.jhsph.edu/ kbroman Outline Experiments and data Models ANOVA
More informationSNP Association Studies with Case-Parent Trios
SNP Association Studies with Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health September 3, 2009 Population-based Association Studies Balding (2006). Nature
More informationObjectives. Announcements. Comparison of mitosis and meiosis
Announcements Colloquium sessions for which you can get credit posted on web site: Feb 20, 27 Mar 6, 13, 20 Apr 17, 24 May 15. Review study CD that came with text for lab this week (especially mitosis
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationAsymptotic properties of the likelihood ratio test statistics with the possible triangle constraint in Affected-Sib-Pair analysis
The Canadian Journal of Statistics Vol.?, No.?, 2006, Pages???-??? La revue canadienne de statistique Asymptotic properties of the likelihood ratio test statistics with the possible triangle constraint
More informationRunaway. demogenetic model for sexual selection. Louise Chevalier. Jacques Labonne
Runaway demogenetic model for sexual selection Louise Chevalier Master 2 thesis UPMC, Specialization Oceanography and Marine Environments Jacques Labonne UMR Ecobiop INRA - National Institute for Agronomic
More informationLecture WS Evolutionary Genetics Part I 1
Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in
More informationPrinciples of Genetics
Principles of Genetics Snustad, D ISBN-13: 9780470903599 Table of Contents C H A P T E R 1 The Science of Genetics 1 An Invitation 2 Three Great Milestones in Genetics 2 DNA as the Genetic Material 6 Genetics
More informationR/qtl workshop. (part 2) Karl Broman. Biostatistics and Medical Informatics University of Wisconsin Madison. kbroman.org
R/qtl workshop (part 2) Karl Broman Biostatistics and Medical Informatics University of Wisconsin Madison kbroman.org github.com/kbroman @kwbroman Example Sugiyama et al. Genomics 71:70-77, 2001 250 male
More informationCh 11.4, 11.5, and 14.1 Review. Game
Ch 11.4, 11.5, and 14.1 Review Game What happens to the chromosome number during meiosis? A It doubles B It stays the same C It halves D It becomes diploid Ans: C Gametes are A Sex cells B Sperm and eggs
More informationModels for Meiosis. Chapter The meiosis process
Chapter 5 Models for Meiosis 5.1 The meiosis process In section 4.1, we introduced recombination as the process of crossing over between the two homologous parental chromosomes in the formation of an offspring
More informationLecture 7 (FW) February 11, 2009 Phenotype and Genotype Reading: pp
Lecture 7 (FW) February 11, 2009 Phenotype and Genotype Reading: pp. 51-62 Annoucement: A review session for the first mid term will be held on Tuesday, 2/24, from 5-6:30 PM in 159 Mulford Hall. The mid
More information(Genome-wide) association analysis
(Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by
More informationProportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power
Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion
More informationLesson 4: Understanding Genetics
Lesson 4: Understanding Genetics 1 Terms Alleles Chromosome Co dominance Crossover Deoxyribonucleic acid DNA Dominant Genetic code Genome Genotype Heredity Heritability Heritability estimate Heterozygous
More information2. Map genetic distance between markers
Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,
More informationGenotype Imputation. Class Discussion for January 19, 2016
Genotype Imputation Class Discussion for January 19, 2016 Intuition Patterns of genetic variation in one individual guide our interpretation of the genomes of other individuals Imputation uses previously
More informationMCMC 2: Lecture 2 Coding and output. Phil O Neill Theo Kypraios School of Mathematical Sciences University of Nottingham
MCMC 2: Lecture 2 Coding and output Phil O Neill Theo Kypraios School of Mathematical Sciences University of Nottingham Contents 1. General (Markov) epidemic model 2. Non-Markov epidemic model 3. Debugging
More informationLecture 11: Multiple trait models for QTL analysis
Lecture 11: Multiple trait models for QTL analysis Julius van der Werf Multiple trait mapping of QTL...99 Increased power of QTL detection...99 Testing for linked QTL vs pleiotropic QTL...100 Multiple
More informationFast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets
Fast Bayesian Methods for Genetic Mapping Applicable for High-Throughput Datasets Yu-Ling Chang A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment
More informationPrediction of the Confidence Interval of Quantitative Trait Loci Location
Behavior Genetics, Vol. 34, No. 4, July 2004 ( 2004) Prediction of the Confidence Interval of Quantitative Trait Loci Location Peter M. Visscher 1,3 and Mike E. Goddard 2 Received 4 Sept. 2003 Final 28
More informationSNP-SNP Interactions in Case-Parent Trios
Detection of SNP-SNP Interactions in Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health June 2, 2009 Karyotypes http://ghr.nlm.nih.gov/ Single Nucleotide Polymphisms
More informationAssociation Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5
Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative
More informationBinomial Mixture Model-based Association Tests under Genetic Heterogeneity
Binomial Mixture Model-based Association Tests under Genetic Heterogeneity Hui Zhou, Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 April 30,
More informationCausal Graphical Models in Systems Genetics
1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation
More informationChapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)
12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²
More informationMapping QTL to a phylogenetic tree
Mapping QTL to a phylogenetic tree Karl W Broman Department of Biostatistics & Medical Informatics University of Wisconsin Madison www.biostat.wisc.edu/~kbroman Human vs mouse www.daviddeen.com 3 Intercross
More informationLinear Regression (1/1/17)
STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression
More informationLecture 6. QTL Mapping
Lecture 6 QTL Mapping Bruce Walsh. Aug 2003. Nordic Summer Course MAPPING USING INBRED LINE CROSSES We start by considering crosses between inbred lines. The analysis of such crosses illustrates many of
More informationEstimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty
Estimation of Parameters in Random Effect Models with Incidence Matrix Uncertainty Xia Shen 1,2 and Lars Rönnegård 2,3 1 The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden; 2 School
More informationModelling Linkage Disequilibrium, And Identifying Recombination Hotspots Using SNP Data
Modelling Linkage Disequilibrium, And Identifying Recombination Hotspots Using SNP Data Na Li and Matthew Stephens July 25, 2003 Department of Biostatistics, University of Washington, Seattle, WA 98195
More informationMarkov Chain Monte Carlo
Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).
More informationMarkov Chain Monte Carlo, Numerical Integration
Markov Chain Monte Carlo, Numerical Integration (See Statistics) Trevor Gallen Fall 2015 1 / 1 Agenda Numerical Integration: MCMC methods Estimating Markov Chains Estimating latent variables 2 / 1 Numerical
More informationOutline. P o purple % x white & white % x purple& F 1 all purple all purple. F purple, 224 white 781 purple, 263 white
Outline - segregation of alleles in single trait crosses - independent assortment of alleles - using probability to predict outcomes - statistical analysis of hypotheses - conditional probability in multi-generation
More informationLecture 8. QTL Mapping 1: Overview and Using Inbred Lines
Lecture 8 QTL Mapping 1: Overview and Using Inbred Lines Bruce Walsh. jbwalsh@u.arizona.edu. University of Arizona. Notes from a short course taught Jan-Feb 2012 at University of Uppsala While the machinery
More informationCSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation
CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation Instructor: Arindam Banerjee November 26, 2007 Genetic Polymorphism Single nucleotide polymorphism (SNP) Genetic Polymorphism
More information56th Annual EAAP Meeting Uppsala, 2005
56th Annual EAAP Meeting Uppsala, 2005 G7.5 The Map Expansion Obtained with Recombinant Inbred Strains and Intermated Recombinant Inbred Populations for Finite Generation Designs F. Teuscher *, V. Guiard
More informationDetection of multiple QTL with epistatic effects under a mixed inheritance model in an outbred population
Genet. Sel. Evol. 36 (2004) 415 433 415 c INRA, EDP Sciences, 2004 DOI: 10.1051/gse:2004009 Original article Detection of multiple QTL with epistatic effects under a mixed inheritance model in an outbred
More informationEstimating Evolutionary Trees. Phylogenetic Methods
Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent
More informationMarkov Chain. Edited by. Andrew Gelman. Xiao-Li Meng. CRC Press. Taylor & Francis Croup. Boca Raton London New York. an informa business
Chapman & Hall/CRC Handbooks of Modern Statistical Methods Handbook of Markov Chain Monte Carlo Edited by Steve Brooks Andrew Gelman Galin L. Jones Xiao-Li Meng CRC Press Taylor & Francis Croup Boca Raton
More informationExpectations, Markov chains, and the Metropolis algorithm
Expectations, Markov chains, and the Metropolis algorithm Peter Hoff Departments of Statistics and Biostatistics and the Center for Statistics and the Social Sciences University of Washington 7-27-05 1
More informationMissing Data. On Missing Data and Interactions in SNP Association Studies. Missing Data - Approaches. Missing Data - Approaches. Multiple Imputation
October, 6 @ U of British Columbia Missing Data On Missing Data Interactions in SNP Association Studies 5 SNPs Ingo Ruczinski Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
More informationMCMC: Markov Chain Monte Carlo
I529: Machine Learning in Bioinformatics (Spring 2013) MCMC: Markov Chain Monte Carlo Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Contents Review of Markov
More informationLECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50
LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if
More information