The statistical evaluation of DNA crime stains in R
|
|
- Wesley Wiggins
- 6 years ago
- Views:
Transcription
1 The statistical evaluation of DNA crime stains in R Miriam Marušiaková Department of Statistics, Charles University, Prague Institute of Computer Science, CBI, Academy of Sciences of CR UseR! 2008, Dortmund Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
2 Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
3 Introduction Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
4 Introduction Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
5 Introduction Single crime scene stain Blood stain at the crime scene Believed it was left by offender Suspect arrested for reasons unconnected with his DNA profile Crime sample, suspect sample Hypothesis H p (prosecution): The suspect left the crime stain. H d (defense): Some other person left the crime stain. Notation DNA typing results E = {G C, G S } non-dna evidence I Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
6 Introduction Evidence interpretation Prior odds Posterior odds Bayes theorem Pr(H p I) Pr(H d I) Pr(H p E, I) Pr(H d E, I) Pr(H p E, I) Pr(H d E, I) = Pr(E H p, I) Pr(H p I) Pr(E H d, I) Pr(H }{{} d I) LR Balding and Donelly (1995), Robertson and Vignaux (1995) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
7 Introduction Evidence interpretation LR = Pr(E H p, I) Pr(E H d, I) = Pr(G S, G C H p, I) Pr(G S, G C H d, I) = Pr(G C G S, H p, I) Pr(G C G S, H d, I) Pr(G S H p, I) Pr(G S H d, I) = Pr(G C G S, H p, I) Pr(G C G S, H d, I) 1 = 1 Pr(G C G S, H d, I) = 1 Pr(G C H d, I) if independence assumed Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
8 Introduction Errors and fallacies Statement The probability of observing this type if the blood came from someone other than suspect is 1 in 100. Pr(G C H d, I) = 1/100 Common error The probability that the blood came from someone else is 1 in 100. Pr(H d G C, I) = 1/100 There is a 99% probability that it came from the suspect. Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
9 Introduction Errors and fallacies (cont. d) Statement The evidence is 100 times more probable if the suspect left the crime stain than if some unknown person left it. 1 Pr(G C H d, I) = 100 Common error It is 100 times more probable that the suspect left the crime stain than some unknown person. Pr(H p G C, I) Pr(H d G C, I) = 100 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
10 Single Crime Scene Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
11 Single Crime Scene Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
12 Single Crime Scene Product rule Model of an ideal population Infinite size Random mating Model reliable for most real-world problems Hardy-Weinberg equilibrium Likelihood ratio LR = Pr(G = A i A i ) = p 2 i Pr(G = A i A j ) = 2p i p j, i j 1 Pr(G C G S, H d, I) = 1 Pr(G C H d, I) = 1 p 2 i ( 1 2p i p j ) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
13 Single Crime Scene Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
14 Single Crime Scene F - measure of uncertainty about allele proportions in the population of suspects Genetic interpretation of F (Wright, 1951) How to estimate F (Weir and Cockerham, 1984) Recommendations (National Research Council, 1996) F = 0.01 large subpopulations (USA) F = 0.03 small isolated subpopulations Match probabilities (Balding and Nichols, 1994) P(G C = A i A i G S = A i A i ) = [2F + (1 F)p i] [3F + (1 F)p i ] (1 + F)(1 + 2F) P(G C = A i A j G S = A i A j ) = 2 [F + (1 F)p i] [F + (1 F)p j ] (1 + F)(1 + 2F) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
15 Single Crime Scene Effects of F corrections Likelihood ratio - some numerical values Heterozygotes A i A j, p i = p j = p F = 0 F = F = 0.01 F = 0.03 p = p = p = Homozygotes A i A i, p i = p F = 0 F = F = 0.01 F = 0.03 p = p = p = Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
16 Single Crime Scene Relatedness Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
17 Single Crime Scene Relatedness Inbreeding Individuals with common ancestors - related Their children - inbred Alleles are ibd (identical by descent) - copies of the same allele Example Alleles h 1, h 2 transmitted from parent H to X, Y who transmit a, b to I h 1 X a H h 2 Y b I Pr(h 1 is ibd to h 2 ) = Pr(h 1 h 2 ) = 0.5 Pr(a b) = Pr(a h 1, b h 2 h 1 h 2 )Pr(h 1 h 2 ) = = Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
18 Single Crime Scene Relatedness Match probabilities for close relatives Balding and Nichols (1994) { ( k0 p 4 i + k 1 p 3 i + k 2 p 2 ) i /p 2 i Pr(G C G S, H d, I) = ) (4k 0 p 2 i p2 j + k 1 p i p j (p i + p j ) + 2k 2 p i p j /2p i p j Kinship coefficients Relationship k 0 k 1 k 2 Parent - child Siblings 1/4 1/2 1/4 Grandparent - grandchild 1/2 1/2 0 Uncle - nephew 1/2 1/2 0 Cousins 3/4 1/4 0 Unrelated k i - probability that two persons will share i alleles ibd i = 0, 1, 2 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
19 Single Crime Scene R package forensic Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
20 Single Crime Scene R package forensic R function Pmatch Usage Pmatch(prob, k = c(1, 0, 0), theta = 0) Example > p <- c(0.057, 0.160, 0.182, 0, 0.024, 0.122) > Pmatch(p) $prob locus 1 locus 2 locus 3 allele allele $match locus 1 locus 2 locus 3 [1,] $total_match [1] e-06 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
21 Mixed stain Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
22 Mixed stain Mixtures Prosecution and defense hypothesis H p : Contributors were the victim and the suspect. H d : Contributors were the victim and an unknown person. Likelihood ratio for the mixture LR = Pr(E C, G V, G S H p, I) Pr(E C, G V, G S H d, I) = Pr(E C G V, G S, H p, I) Pr(E C G V, G S, H d, I) Pr(G V, G S H p, I) Pr(G V, G S H d, I) = Pr(E C G V, G S, H p, I) Pr(E C G V, G S, H d, I) = 1 Pr(E C G V, G S, H d, I) 1 = (if independence assumed) Pr(E C G V, H d, I) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
23 Mixed stain Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
24 Mixed stain Match probability (independence assumption) Weir et al (1997) P x (U, C) = (T 0 ) 2x i U (T 1i ) 2x + T 0 = l C p l T 1i = l C\{i} p l, i U T 2ij = l C\{i,j} p l, i, j U, i < j i,j U:i<j (T 2ij ) 2x... U - set of alleles from the crime sample C not carried by known contributors x - number of unknown contributors R Pevid.ind(alleles, prob, x, u = NULL) LR.ind(alleles, prob, x1, x2, u1 = NULL, u2 = NULL) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
25 Mixed stain Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
26 Mixed stain Match probability for structured population Assumption All the involved people come from the same subpopulation with parameter F Fung and Hu (2000), Zoubková and Zvárová (2004) MP = r r r 1 r 1 =0 r 2 =0 r r 1 r c 2 r c 1 =0 ti +u i +v i 1 i=1 j=t i +v i (2n U )! c [(1 F)p i + jf] c i=1 u i! 2n T +2n U +2n V 1 j=2n T +2n V [(1 F) + jf] R Pevid.gen(alleles, prob, x, T = NULL, V = NULL, theta = 0) T, V - genotypes of known contributors, known non-contributors Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
27 Mixed stain People versus Simpson Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
28 Mixed stain People versus Simpson People versus Simpson (Los Angeles County Case) DNA evidence Material at the crime scene - alleles a, b, c (locus D2S44) Suspect s genotype ab Victim s genotype ac Hypotheses (Weir et al, 1997, Fung and Hu, 2000) H p : Contributors were the victim, suspect and m unknowns. H d : Contributors were n unknowns. R a = c( a, b, c ) p = c(0.0316, , ) suspect <- a/b, victim <- a/c Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
29 Mixed stain People versus Simpson Likelihood ratios for the Simpson case Defense n = 2 n = 3 Prosecution F = F = m = m = Prosecution proposition (F = 0) Pevid.ind(alleles = a, prob = p, x = m) Pevid.gen(alleles = a, prob = p, x = m, T = c(victim, suspect)) Defense proposition (F = 0) Pevid.ind(alleles = a, prob = p, x = n, u = c( a, b, c )) Pevid.gen(alleles = a, prob = p, x = n) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
30 Mixed stain More general situation Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
31 Mixed stain More general situation More general situation Contributors from different ethnic groups Fung and Hu (2001) Pevid.ind, LR.ind (independence within and between ethnic groups) Presence of related people Hu and Fung (2003) Example Suspect not typed, his relative is tested Two related people among unknown contributors Pevid.rel Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
32 Conclusion Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
33 Conclusion Conclusion Single crime stain Suspect and offender are unrelated are members of the same subpopulation are close relatives Pmatch Mixed crime stain Contributors unrelated members of the same subpopulation may be related from different ethnic groups Pevid.ind, LR.ind Pevid.gen Pevid.rel Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
34 Conclusion Thank you for your attention! Acknowledgement The work was supported by GAČR 201/05/H007 and the project 1M06014 of the Ministry of Education, Youth and Sports of the Czech Republic. Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
35 Important publications BALDING AND NICHOLS(1994) DNA profile match probability calculation Forensic Science International 64, p WEIR ET AL(1997) Interpreting DNA mixtures Journal of Forensic Sciences 42, p FUNG AND HU(2000) Interpreting forensic DNA mixtures: allowing for uncertainty in population substructure and dependence Journal of Royal Statistical Society A 163, p ZOUBKOVÁ, SUPERVISOR ZVÁROVÁ (2004) Master thesis (in Czech) Charles University, Prague Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
36 Software R DEVELOPMENT CORE TEAM (2006) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria ISBN , URL MARUŠIAKOVÁ, M (2007) forensic: Statistical methods in forensic genetics R package version 0.2 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36
THE RARITY OF DNA PROFILES 1. BY BRUCE S. WEIR University of Washington
The Annals of Applied Statistics 2007, Vol. 1, No. 2, 358 370 DOI: 10.1214/07-AOAS128 Institute of Mathematical Statistics, 2007 THE RARITY OF DNA PROFILES 1 BY BRUCE S. WEIR University of Washington It
More informationarxiv: v1 [stat.ap] 7 Dec 2007
The Annals of Applied Statistics 2007, Vol. 1, No. 2, 358 370 DOI: 10.1214/07-AOAS128 c Institute of Mathematical Statistics, 2007 THE RARITY OF DNA PROFILES 1 arxiv:0712.1099v1 [stat.ap] 7 Dec 2007 By
More informationStochastic Models for Low Level DNA Mixtures
Original Article en5 Stochastic Models for Low Level DNA Mixtures Dalibor Slovák 1,, Jana Zvárová 1, 1 EuroMISE Centre, Institute of Computer Science AS CR, Prague, Czech Republic Institute of Hygiene
More informationInterpreting DNA Mixtures in Structured Populations
James M. Curran, 1 Ph.D.; Christopher M. Triggs, 2 Ph.D.; John Buckleton, 3 Ph.D.; and B. S. Weir, 1 Ph.D. Interpreting DNA Mixtures in Structured Populations REFERENCE: Curran JM, Triggs CM, Buckleton
More informationBreeding Values and Inbreeding. Breeding Values and Inbreeding
Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A
More informationWorkshop: Kinship analysis First lecture: Basics: Genetics, weight of evidence. I.1
Workshop: Kinship analysis First lecture: Basics: Genetics, weight of evidence. I.1 Thore Egeland (1),(2), Klaas Slooten (3),(4) (1) Norwegian University of Life Sciences, (2) NIPH, (3) Netherlands Forensic
More informationExamensarbete. Rarities of genotype profiles in a normal Swedish population. Ronny Hedell
Examensarbete Rarities of genotype profiles in a normal Swedish population Ronny Hedell LiTH - MAT - EX - - 2010 / 25 - - SE Rarities of genotype profiles in a normal Swedish population Department of
More informationForensic Genetics. Summer Institute in Statistical Genetics July 26-28, 2017 University of Washington. John Buckleton:
Forensic Genetics Summer Institute in Statistical Genetics July 26-28, 2017 University of Washington John Buckleton: John.Buckleton@esr.cri.nz Bruce Weir: bsweir@uw.edu 1 Contents Topic Genetic Data, Evidence,
More informationStatistical Methods and Software for Forensic Genetics. Lecture I.1: Basics
Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics Thore Egeland (1),(2) (1) Norwegian University of Life Sciences, (2) Oslo University Hospital Workshop. Monterrey, Mexico, Nov
More informationLECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50
LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if
More informationRelatedness 1: IBD and coefficients of relatedness
Relatedness 1: IBD and coefficients of relatedness or What does it mean to be related? Magnus Dehli Vigeland NORBIS course, 8 th 12 th of January 2018, Oslo Plan Introduction What does it mean to be related?
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationThe Lander-Green Algorithm. Biostatistics 666 Lecture 22
The Lander-Green Algorithm Biostatistics 666 Lecture Last Lecture Relationship Inferrence Likelihood of genotype data Adapt calculation to different relationships Siblings Half-Siblings Unrelated individuals
More information1 Introduction. Probabilistic evaluation of low-quality DNA profiles. K. Ryan, D. Gareth Williams a, David J. Balding b,c
Probabilistic evaluation of low-quality DNA profiles K. Ryan, D. Gareth Williams a, David J. Balding b,c a. DGW Software Consultants LTD b. UCL Genetics Institute, University College, London WC1E 6BT c.
More informationPopulation Structure
Ch 4: Population Subdivision Population Structure v most natural populations exist across a landscape (or seascape) that is more or less divided into areas of suitable habitat v to the extent that populations
More informationModeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17
Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared
More informationAbstract. BEECHAM Jr., GARY WAYNE. Statistical methods for the Analysis of Forensic DNA Mixtures. (Under the direction of Bruce S. Weir.
Abstract BEECHAM Jr., GARY WAYNE. Statistical methods for the Analysis of Forensic DNA Mixtures. (Under the direction of Bruce S. Weir.) As forensic DNA typing technologies become more sensitive, there
More informationPopulation genetics snippets for genepop
Population genetics snippets for genepop Peter Beerli August 0, 205 Contents 0.Basics 0.2Exact test 2 0.Fixation indices 4 0.4Isolation by Distance 5 0.5Further Reading 8 0.6References 8 0.7Disclaimer
More informationCovariance between relatives
UNIVERSIDADE DE SÃO PAULO ESCOLA SUPERIOR DE AGRICULTURA LUIZ DE QUEIROZ DEPARTAMENTO DE GENÉTICA LGN5825 Genética e Melhoramento de Espécies Alógamas Covariance between relatives Prof. Roberto Fritsche-Neto
More informationA consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation
Ann. Hum. Genet., Lond. (1975), 39, 141 Printed in Great Britain 141 A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation BY CHARLES F. SING AND EDWARD D.
More informationAdventures in Forensic DNA: Cold Hits, Familial Searches, and Mixtures
Adventures in Forensic DNA: Cold Hits, Familial Searches, and Mixtures Departments of Mathematics and Statistics Northwestern University JSM 2012, July 30, 2012 The approach of this talk Some simple conceptual
More informationReporting LRs - principles
Simone Gittelson Ian Evett Reporting LRs - principles Tacha Hicks-Champod Franco Taroni DNA: http://www.formation-continueunil-epfl.ch/en/essentials-dnainterpretation 1 Steven Myers Tim Kalafut Gittelson
More informationSTRmix - MCMC Study Page 1
SDPD Crime Laboratory Forensic Biology Unit Validation of the STRmix TM Software MCMC Markov Chain Monte Carlo Introduction The goal of DNA mixture interpretation should be to identify the genotypes of
More informationProblems for 3505 (2011)
Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.
More information2. Map genetic distance between markers
Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,
More informationThe problem of Y-STR rare haplotype
The problem of Y-STR rare haplotype February 17, 2014 () Short title February 17, 2014 1 / 14 The likelihood ratio When evidence is found on a crime scene, the expert is asked to provide a Likelihood ratio
More information19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:
9. Genetic Drift Genetic drift is the alteration of gene frequencies due to sampling variation from one generation to the next. It operates to some degree in all finite populations, but can be significant
More informationIdentification of Pedigree Relationship from Genome Sharing
INVESTIGATION Identification of Pedigree Relationship from Genome Sharing William G. Hill 1 and Ian M. S. White Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh,
More informationY-STR: Haplotype Frequency Estimation and Evidence Calculation
Calculating evidence Further work Questions and Evidence Mikkel, MSc Student Supervised by associate professor Poul Svante Eriksen Department of Mathematical Sciences Aalborg University, Denmark June 16
More informationAffected Sibling Pairs. Biostatistics 666
Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD
More informationNotes on Population Genetics
Notes on Population Genetics Graham Coop 1 1 Department of Evolution and Ecology & Center for Population Biology, University of California, Davis. To whom correspondence should be addressed: gmcoop@ucdavis.edu
More informationBayesian Updating: Odds Class 12, Jeremy Orloff and Jonathan Bloom
Learning Goals Bayesian Updating: Odds Class 2, 8.05 Jeremy Orloff and Jonathan Bloom. Be able to convert between odds and probability. 2. Be able to update prior odds to posterior odds using Bayes factors.
More informationLecture 13: Population Structure. October 8, 2012
Lecture 13: Population Structure October 8, 2012 Last Time Effective population size calculations Historical importance of drift: shifting balance or noise? Population structure Today Course feedback The
More information8. Genetic Diversity
8. Genetic Diversity Many ways to measure the diversity of a population: For any measure of diversity, we expect an estimate to be: when only one kind of object is present; low when >1 kind of objects
More informationMechanisms of Evolution
Mechanisms of Evolution 36-149 The Tree of Life Christopher R. Genovese Department of Statistics 132H Baker Hall x8-7836 http://www.stat.cmu.edu/ ~ genovese/. Plan 1. Two More Generations 2. The Hardy-Weinberg
More informationLecture 2: Introduction to Quantitative Genetics
Lecture 2: Introduction to Quantitative Genetics Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Basic model of Quantitative Genetics Phenotypic value --
More informationIntroductory seminar on mathematical population genetics
Exercises Sheets Introductory seminar on mathematical population genetics WS 20/202 Kristan Schneider, Ada Akerman Ex Assume a single locus with alleles A and A 2 Denote the frequencies of the three (unordered
More informationEXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?
Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population
More informationMajor Genes, Polygenes, and
Major Genes, Polygenes, and QTLs Major genes --- genes that have a significant effect on the phenotype Polygenes --- a general term of the genes of small effect that influence a trait QTL, quantitative
More informationForensic Genetics Module. Evaluating DNA evidence
Summer Institute in Statistical Genetics University of Washington, Seattle Forensic Genetics Module Evaluating DNA evidence David Balding Professor of Statistical Genetics University of Melbourne and UCL
More informationCharacterization of Error Tradeoffs in Human Identity Comparisons: Determining a Complexity Threshold for DNA Mixture Interpretation
Characterization of Error Tradeoffs in Human Identity Comparisons: Determining a Complexity Threshold for DNA Mixture Interpretation Boston University School of Medicine Program in Biomedical Forensic
More informationChem 4331 Name : Final Exam 2008
Chem 4331 Name : Final Exam 2008 Answer any seven of questions 1-8. Each question is worth 15 points for a total of 105 points. 1. A crime has been committed, and a blood sample has been found at the crime
More informationMatch probabilities in a finite, subdivided population
Match probabilities in a finite, subdivided population Anna-Sapfo Malaspinas a, Montgomery Slatkin a, Yun S. Song b, a Department of Integrative Biology, University of California, Berkeley, CA 94720, USA
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm
More informationarxiv: v3 [stat.ap] 4 Nov 2014
The multivarte Dirichlet-multinoml distribution and its application in forensic genetics to adjust for sub-population effects using the θ-correction T. TVEDEBRNK P. S. ERKSEN arxiv:1406.6508v3 [stat.p]
More informationStatistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014
Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/
More informationCase Studies in Ecology and Evolution
3 Non-random mating, Inbreeding and Population Structure. Jewelweed, Impatiens capensis, is a common woodland flower in the Eastern US. You may have seen the swollen seed pods that explosively pop when
More informationSTAT 536: Migration. Karin S. Dorman. October 3, Department of Statistics Iowa State University
STAT 536: Migration Karin S. Dorman Department of Statistics Iowa State University October 3, 2006 Migration Introduction Migration is the movement of individuals between populations. Until now we have
More information... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE
NORMAL DISTRIBUTIONS OF PHENOTYPES Mice Fruit Flies In:Introduction to Quantitative Genetics Falconer & Mackay 1996 CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE Mean and variance are two quantities
More informationVariance Component Models for Quantitative Traits. Biostatistics 666
Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond
More informationGBLUP and G matrices 1
GBLUP and G matrices 1 GBLUP from SNP-BLUP We have defined breeding values as sum of SNP effects:! = #$ To refer breeding values to an average value of 0, we adopt the centered coding for genotypes described
More informationBayesian analysis of the Hardy-Weinberg equilibrium model
Bayesian analysis of the Hardy-Weinberg equilibrium model Eduardo Gutiérrez Peña Department of Probability and Statistics IIMAS, UNAM 6 April, 2010 Outline Statistical Inference 1 Statistical Inference
More informationA gamma model for DNA mixture analyses
Bayesian Analysis (2007) 2, Number 2, pp. 333 348 A gamma model for DNA mixture analyses R. G. Cowell, S. L. Lauritzen and J. Mortera Abstract. We present a new methodology for analysing forensic identification
More information1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c.
1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c. Changes in allele frequencies in a population across generations
More informationDNA commission of the International Society of Forensic Genetics: Recommendations on the interpretation of mixtures
Forensic Science International 160 (2006) 90 101 www.elsevier.com/locate/forsciint DNA commission of the International Society of Forensic Genetics: Recommendations on the interpretation of mixtures P.
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationEvaluation of the STR typing kit PowerPlexk 16 with respect to technical performance and population genetics: a multicenter study
International Congress Series 1239 (2003) 789 794 Evaluation of the STR typing kit PowerPlexk 16 with respect to technical performance and population genetics: a multicenter study L. Henke a, *, A. Aaspollu
More informationAnalyzing the genetic structure of populations: a Bayesian approach
Analyzing the genetic structure of populations: a Bayesian approach Introduction Our review of Nei s G st and Weir and Cockerham s θ illustrated two important principles: 1. It s essential to distinguish
More informationInference on pedigree structure from genome screen data. Running title: Inference on pedigree structure. Mary Sara McPeek. The University of Chicago
1 Inference on pedigree structure from genome screen data Running title: Inference on pedigree structure Mary Sara McPeek The University of Chicago Address for correspondence: Department of Statistics,
More information2018 SISG Module 20: Bayesian Statistics for Genetics Lecture 2: Review of Probability and Bayes Theorem
2018 SISG Module 20: Bayesian Statistics for Genetics Lecture 2: Review of Probability and Bayes Theorem Jon Wakefield Departments of Statistics and Biostatistics University of Washington Outline Introduction
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 009 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results
More informationEXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series:
Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 7 Exercise 7.. Derive the two scales of relation for each of the two following recurrent series: u: 0, 8, 6, 48, 46,L 36 7
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 007 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results
More informationDNA Evidence in the Ferguson Grand Jury Proceedings
DNA Evidence in the Ferguson Grand Jury Proceedings Marcello Di Bello!! Lehman College CUNY!! PHI 169 - Spring 2015 01 How Trustworthy is DNA Evidence?! Is it More Trustworthy than Eyewitness Evidence?
More informationPopulation Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda
1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;
More informationA paradigm shift in DNA interpretation John Buckleton
A paradigm shift in DNA interpretation John Buckleton Specialist Science Solutions Manaaki Tangata Taiao Hoki protecting people and their environment through science I sincerely acknowledge conversations
More informationQuestion: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?
October 12, 2009 Bioe 109 Fall 2009 Lecture 8 Microevolution 1 - selection The Hardy-Weinberg-Castle Equilibrium - consider a single locus with two alleles A 1 and A 2. - three genotypes are thus possible:
More informationThe E-M Algorithm in Genetics. Biostatistics 666 Lecture 8
The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as
More informationOptimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives
Genetic Epidemiology 16:225 249 (1999) Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Mary Sara McPeek* Department of Statistics, University of Chicago, Chicago, Illinois
More informationBayesian Regression (1/31/13)
STA613/CBB540: Statistical methods in computational biology Bayesian Regression (1/31/13) Lecturer: Barbara Engelhardt Scribe: Amanda Lea 1 Bayesian Paradigm Bayesian methods ask: given that I have observed
More informationA maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes
Genetics: Published Articles Ahead of Print, published on July 30, 2012 as 10.1534/genetics.112.139519 A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate
More informationAppendix A Evolutionary and Genetics Principles
Appendix A Evolutionary and Genetics Principles A.1 Genetics As a start, we need to distinguish between learning, which we take to mean behavioral adaptation by an individual, and evolution, which refers
More informationMethods for Cryptic Structure. Methods for Cryptic Structure
Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases
More informationProcesses of Evolution
15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection
More informationMatch Likelihood Ratio for Uncertain Genotypes
Match Likelihood Ratio for Uncertain Genotypes Mark W. Perlin *,1, PhD, MD, PhD Joseph B. Kadane 2, PhD Robin W. Cotton 3, PhD 1 Cybergenetics, Pittsburgh, PA, USA 2 Statistics Department, Carnegie Mellon
More informationAlignment Algorithms. Alignment Algorithms
Midterm Results Big improvement over scores from the previous two years. Since this class grade is based on the previous years curve, that means this class will get higher grades than the previous years.
More informationABSTRACT. ZAYKIN, DMITRI V. Statistical Analysis of Genetic Associations (Advisor: Bruce S. Weir)
ABSTRACT ZAYKIN, DMITRI V. Statistical Analysis of Genetic Associations (Advisor: Bruce S. Weir) There is an increasing need for a statistical treatment of genetic data prompted by recent advances in molecular
More informationGoodness of Fit Goodness of fit - 2 classes
Goodness of Fit Goodness of fit - 2 classes A B 78 22 Do these data correspond reasonably to the proportions 3:1? We previously discussed options for testing p A = 0.75! Exact p-value Exact confidence
More informationThe Wright-Fisher Model and Genetic Drift
The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population
More informationOutline for today s lecture (Ch. 14, Part I)
Outline for today s lecture (Ch. 14, Part I) Ploidy vs. DNA content The basis of heredity ca. 1850s Mendel s Experiments and Theory Law of Segregation Law of Independent Assortment Introduction to Probability
More informationDemography April 10, 2015
Demography April 0, 205 Effective Population Size The Wright-Fisher model makes a number of strong assumptions which are clearly violated in many populations. For example, it is unlikely that any population
More informationLecture 6. QTL Mapping
Lecture 6 QTL Mapping Bruce Walsh. Aug 2003. Nordic Summer Course MAPPING USING INBRED LINE CROSSES We start by considering crosses between inbred lines. The analysis of such crosses illustrates many of
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities
More informationSTAT 536: Genetic Statistics
STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,
More informationEstimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty
Estimation of Parameters in Random Effect Models with Incidence Matrix Uncertainty Xia Shen 1,2 and Lars Rönnegård 2,3 1 The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden; 2 School
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationarxiv: v1 [q-bio.qm] 26 Feb 2016
Non-Identifiable Pedigrees and a Bayesian Solution B. Kirkpatrick arxiv:1602.08183v1 [q-bio.qm] 26 Feb 2016 University of British Columbia Abstract. Some methods aim to correct or test for relationships
More informationLecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency
Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Outline Genetics of complex
More informationSelection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION
Selection Page 1 sur 11 Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION * I- Introduction II- Modeling and selective values III- Basic model IV- Equation of the recurrence of allele
More informationObservation: we continue to observe large amounts of genetic variation in natural populations
MUTATION AND GENETIC VARIATION Observation: we continue to observe large amounts of genetic variation in natural populations Problem: How does this variation arise and how is it maintained. Here, we look
More informationSOLUTIONS TO EXERCISES FOR CHAPTER 9
SOLUTIONS TO EXERCISES FOR CHPTER 9 gronomy 65 Statistical Genetics W. E. Nyquist March 00 Exercise 9.. a. lgebraic method for the grandparent-grandoffspring covariance (see parent-offspring covariance,
More informationHomework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:
Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships
More informationBig Idea #1: The process of evolution drives the diversity and unity of life
BIG IDEA! Big Idea #1: The process of evolution drives the diversity and unity of life Key Terms for this section: emigration phenotype adaptation evolution phylogenetic tree adaptive radiation fertility
More informationMathematical Population Genetics II
Mathematical Population Genetics II Lecture Notes Joachim Hermisson June 9, 2018 University of Vienna Mathematics Department Oskar-Morgenstern-Platz 1 1090 Vienna, Austria Copyright (c) 2013/14/15/18 Joachim
More informationApplications of Mathematics
Applications of Mathematics Tomáš Cipra Exponential smoothing for irregular data Applications of Mathematics Vol. 5 2006) No. 6 597--604 Persistent URL: http://dml.cz/dmlcz/34655 Terms of use: Institute
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 013 Population genetics Outline of lectures 3-6 1. We ant to kno hat theory says about the reproduction of genotypes in a population. This results
More informationModule 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline
Module 4: Bayesian Methods Lecture 9 A: Default prior selection Peter Ho Departments of Statistics and Biostatistics University of Washington Outline Je reys prior Unit information priors Empirical Bayes
More information. Also, in this case, p i = N1 ) T, (2) where. I γ C N(N 2 2 F + N1 2 Q)
Supplementary information S7 Testing for association at imputed SPs puted SPs Score tests A Score Test needs calculations of the observed data score and information matrix only under the null hypothesis,
More informationLecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015
Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.
More informationBootstrap Procedures for Testing Homogeneity Hypotheses
Journal of Statistical Theory and Applications Volume 11, Number 2, 2012, pp. 183-195 ISSN 1538-7887 Bootstrap Procedures for Testing Homogeneity Hypotheses Bimal Sinha 1, Arvind Shah 2, Dihua Xu 1, Jianxin
More information