The statistical evaluation of DNA crime stains in R

Size: px
Start display at page:

Download "The statistical evaluation of DNA crime stains in R"

Transcription

1 The statistical evaluation of DNA crime stains in R Miriam Marušiaková Department of Statistics, Charles University, Prague Institute of Computer Science, CBI, Academy of Sciences of CR UseR! 2008, Dortmund Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

2 Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

3 Introduction Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

4 Introduction Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

5 Introduction Single crime scene stain Blood stain at the crime scene Believed it was left by offender Suspect arrested for reasons unconnected with his DNA profile Crime sample, suspect sample Hypothesis H p (prosecution): The suspect left the crime stain. H d (defense): Some other person left the crime stain. Notation DNA typing results E = {G C, G S } non-dna evidence I Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

6 Introduction Evidence interpretation Prior odds Posterior odds Bayes theorem Pr(H p I) Pr(H d I) Pr(H p E, I) Pr(H d E, I) Pr(H p E, I) Pr(H d E, I) = Pr(E H p, I) Pr(H p I) Pr(E H d, I) Pr(H }{{} d I) LR Balding and Donelly (1995), Robertson and Vignaux (1995) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

7 Introduction Evidence interpretation LR = Pr(E H p, I) Pr(E H d, I) = Pr(G S, G C H p, I) Pr(G S, G C H d, I) = Pr(G C G S, H p, I) Pr(G C G S, H d, I) Pr(G S H p, I) Pr(G S H d, I) = Pr(G C G S, H p, I) Pr(G C G S, H d, I) 1 = 1 Pr(G C G S, H d, I) = 1 Pr(G C H d, I) if independence assumed Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

8 Introduction Errors and fallacies Statement The probability of observing this type if the blood came from someone other than suspect is 1 in 100. Pr(G C H d, I) = 1/100 Common error The probability that the blood came from someone else is 1 in 100. Pr(H d G C, I) = 1/100 There is a 99% probability that it came from the suspect. Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

9 Introduction Errors and fallacies (cont. d) Statement The evidence is 100 times more probable if the suspect left the crime stain than if some unknown person left it. 1 Pr(G C H d, I) = 100 Common error It is 100 times more probable that the suspect left the crime stain than some unknown person. Pr(H p G C, I) Pr(H d G C, I) = 100 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

10 Single Crime Scene Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

11 Single Crime Scene Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

12 Single Crime Scene Product rule Model of an ideal population Infinite size Random mating Model reliable for most real-world problems Hardy-Weinberg equilibrium Likelihood ratio LR = Pr(G = A i A i ) = p 2 i Pr(G = A i A j ) = 2p i p j, i j 1 Pr(G C G S, H d, I) = 1 Pr(G C H d, I) = 1 p 2 i ( 1 2p i p j ) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

13 Single Crime Scene Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

14 Single Crime Scene F - measure of uncertainty about allele proportions in the population of suspects Genetic interpretation of F (Wright, 1951) How to estimate F (Weir and Cockerham, 1984) Recommendations (National Research Council, 1996) F = 0.01 large subpopulations (USA) F = 0.03 small isolated subpopulations Match probabilities (Balding and Nichols, 1994) P(G C = A i A i G S = A i A i ) = [2F + (1 F)p i] [3F + (1 F)p i ] (1 + F)(1 + 2F) P(G C = A i A j G S = A i A j ) = 2 [F + (1 F)p i] [F + (1 F)p j ] (1 + F)(1 + 2F) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

15 Single Crime Scene Effects of F corrections Likelihood ratio - some numerical values Heterozygotes A i A j, p i = p j = p F = 0 F = F = 0.01 F = 0.03 p = p = p = Homozygotes A i A i, p i = p F = 0 F = F = 0.01 F = 0.03 p = p = p = Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

16 Single Crime Scene Relatedness Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

17 Single Crime Scene Relatedness Inbreeding Individuals with common ancestors - related Their children - inbred Alleles are ibd (identical by descent) - copies of the same allele Example Alleles h 1, h 2 transmitted from parent H to X, Y who transmit a, b to I h 1 X a H h 2 Y b I Pr(h 1 is ibd to h 2 ) = Pr(h 1 h 2 ) = 0.5 Pr(a b) = Pr(a h 1, b h 2 h 1 h 2 )Pr(h 1 h 2 ) = = Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

18 Single Crime Scene Relatedness Match probabilities for close relatives Balding and Nichols (1994) { ( k0 p 4 i + k 1 p 3 i + k 2 p 2 ) i /p 2 i Pr(G C G S, H d, I) = ) (4k 0 p 2 i p2 j + k 1 p i p j (p i + p j ) + 2k 2 p i p j /2p i p j Kinship coefficients Relationship k 0 k 1 k 2 Parent - child Siblings 1/4 1/2 1/4 Grandparent - grandchild 1/2 1/2 0 Uncle - nephew 1/2 1/2 0 Cousins 3/4 1/4 0 Unrelated k i - probability that two persons will share i alleles ibd i = 0, 1, 2 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

19 Single Crime Scene R package forensic Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

20 Single Crime Scene R package forensic R function Pmatch Usage Pmatch(prob, k = c(1, 0, 0), theta = 0) Example > p <- c(0.057, 0.160, 0.182, 0, 0.024, 0.122) > Pmatch(p) $prob locus 1 locus 2 locus 3 allele allele $match locus 1 locus 2 locus 3 [1,] $total_match [1] e-06 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

21 Mixed stain Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

22 Mixed stain Mixtures Prosecution and defense hypothesis H p : Contributors were the victim and the suspect. H d : Contributors were the victim and an unknown person. Likelihood ratio for the mixture LR = Pr(E C, G V, G S H p, I) Pr(E C, G V, G S H d, I) = Pr(E C G V, G S, H p, I) Pr(E C G V, G S, H d, I) Pr(G V, G S H p, I) Pr(G V, G S H d, I) = Pr(E C G V, G S, H p, I) Pr(E C G V, G S, H d, I) = 1 Pr(E C G V, G S, H d, I) 1 = (if independence assumed) Pr(E C G V, H d, I) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

23 Mixed stain Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

24 Mixed stain Match probability (independence assumption) Weir et al (1997) P x (U, C) = (T 0 ) 2x i U (T 1i ) 2x + T 0 = l C p l T 1i = l C\{i} p l, i U T 2ij = l C\{i,j} p l, i, j U, i < j i,j U:i<j (T 2ij ) 2x... U - set of alleles from the crime sample C not carried by known contributors x - number of unknown contributors R Pevid.ind(alleles, prob, x, u = NULL) LR.ind(alleles, prob, x1, x2, u1 = NULL, u2 = NULL) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

25 Mixed stain Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

26 Mixed stain Match probability for structured population Assumption All the involved people come from the same subpopulation with parameter F Fung and Hu (2000), Zoubková and Zvárová (2004) MP = r r r 1 r 1 =0 r 2 =0 r r 1 r c 2 r c 1 =0 ti +u i +v i 1 i=1 j=t i +v i (2n U )! c [(1 F)p i + jf] c i=1 u i! 2n T +2n U +2n V 1 j=2n T +2n V [(1 F) + jf] R Pevid.gen(alleles, prob, x, T = NULL, V = NULL, theta = 0) T, V - genotypes of known contributors, known non-contributors Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

27 Mixed stain People versus Simpson Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

28 Mixed stain People versus Simpson People versus Simpson (Los Angeles County Case) DNA evidence Material at the crime scene - alleles a, b, c (locus D2S44) Suspect s genotype ab Victim s genotype ac Hypotheses (Weir et al, 1997, Fung and Hu, 2000) H p : Contributors were the victim, suspect and m unknowns. H d : Contributors were n unknowns. R a = c( a, b, c ) p = c(0.0316, , ) suspect <- a/b, victim <- a/c Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

29 Mixed stain People versus Simpson Likelihood ratios for the Simpson case Defense n = 2 n = 3 Prosecution F = F = m = m = Prosecution proposition (F = 0) Pevid.ind(alleles = a, prob = p, x = m) Pevid.gen(alleles = a, prob = p, x = m, T = c(victim, suspect)) Defense proposition (F = 0) Pevid.ind(alleles = a, prob = p, x = n, u = c( a, b, c )) Pevid.gen(alleles = a, prob = p, x = n) Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

30 Mixed stain More general situation Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

31 Mixed stain More general situation More general situation Contributors from different ethnic groups Fung and Hu (2001) Pevid.ind, LR.ind (independence within and between ethnic groups) Presence of related people Hu and Fung (2003) Example Suspect not typed, his relative is tested Two related people among unknown contributors Pevid.rel Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

32 Conclusion Outline Introduction Single Crime Scene Relatedness R package forensic Mixed stain People versus Simpson More general situation Conclusion Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

33 Conclusion Conclusion Single crime stain Suspect and offender are unrelated are members of the same subpopulation are close relatives Pmatch Mixed crime stain Contributors unrelated members of the same subpopulation may be related from different ethnic groups Pevid.ind, LR.ind Pevid.gen Pevid.rel Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

34 Conclusion Thank you for your attention! Acknowledgement The work was supported by GAČR 201/05/H007 and the project 1M06014 of the Ministry of Education, Youth and Sports of the Czech Republic. Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

35 Important publications BALDING AND NICHOLS(1994) DNA profile match probability calculation Forensic Science International 64, p WEIR ET AL(1997) Interpreting DNA mixtures Journal of Forensic Sciences 42, p FUNG AND HU(2000) Interpreting forensic DNA mixtures: allowing for uncertainty in population substructure and dependence Journal of Royal Statistical Society A 163, p ZOUBKOVÁ, SUPERVISOR ZVÁROVÁ (2004) Master thesis (in Czech) Charles University, Prague Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

36 Software R DEVELOPMENT CORE TEAM (2006) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria ISBN , URL MARUŠIAKOVÁ, M (2007) forensic: Statistical methods in forensic genetics R package version 0.2 Miriam Marušiaková (Prague) The statistical evaluation of DNA crime stains UseR! 2008, August / 36

THE RARITY OF DNA PROFILES 1. BY BRUCE S. WEIR University of Washington

THE RARITY OF DNA PROFILES 1. BY BRUCE S. WEIR University of Washington The Annals of Applied Statistics 2007, Vol. 1, No. 2, 358 370 DOI: 10.1214/07-AOAS128 Institute of Mathematical Statistics, 2007 THE RARITY OF DNA PROFILES 1 BY BRUCE S. WEIR University of Washington It

More information

arxiv: v1 [stat.ap] 7 Dec 2007

arxiv: v1 [stat.ap] 7 Dec 2007 The Annals of Applied Statistics 2007, Vol. 1, No. 2, 358 370 DOI: 10.1214/07-AOAS128 c Institute of Mathematical Statistics, 2007 THE RARITY OF DNA PROFILES 1 arxiv:0712.1099v1 [stat.ap] 7 Dec 2007 By

More information

Stochastic Models for Low Level DNA Mixtures

Stochastic Models for Low Level DNA Mixtures Original Article en5 Stochastic Models for Low Level DNA Mixtures Dalibor Slovák 1,, Jana Zvárová 1, 1 EuroMISE Centre, Institute of Computer Science AS CR, Prague, Czech Republic Institute of Hygiene

More information

Interpreting DNA Mixtures in Structured Populations

Interpreting DNA Mixtures in Structured Populations James M. Curran, 1 Ph.D.; Christopher M. Triggs, 2 Ph.D.; John Buckleton, 3 Ph.D.; and B. S. Weir, 1 Ph.D. Interpreting DNA Mixtures in Structured Populations REFERENCE: Curran JM, Triggs CM, Buckleton

More information

Breeding Values and Inbreeding. Breeding Values and Inbreeding

Breeding Values and Inbreeding. Breeding Values and Inbreeding Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A

More information

Workshop: Kinship analysis First lecture: Basics: Genetics, weight of evidence. I.1

Workshop: Kinship analysis First lecture: Basics: Genetics, weight of evidence. I.1 Workshop: Kinship analysis First lecture: Basics: Genetics, weight of evidence. I.1 Thore Egeland (1),(2), Klaas Slooten (3),(4) (1) Norwegian University of Life Sciences, (2) NIPH, (3) Netherlands Forensic

More information

Examensarbete. Rarities of genotype profiles in a normal Swedish population. Ronny Hedell

Examensarbete. Rarities of genotype profiles in a normal Swedish population. Ronny Hedell Examensarbete Rarities of genotype profiles in a normal Swedish population Ronny Hedell LiTH - MAT - EX - - 2010 / 25 - - SE Rarities of genotype profiles in a normal Swedish population Department of

More information

Forensic Genetics. Summer Institute in Statistical Genetics July 26-28, 2017 University of Washington. John Buckleton:

Forensic Genetics. Summer Institute in Statistical Genetics July 26-28, 2017 University of Washington. John Buckleton: Forensic Genetics Summer Institute in Statistical Genetics July 26-28, 2017 University of Washington John Buckleton: John.Buckleton@esr.cri.nz Bruce Weir: bsweir@uw.edu 1 Contents Topic Genetic Data, Evidence,

More information

Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics

Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics Statistical Methods and Software for Forensic Genetics. Lecture I.1: Basics Thore Egeland (1),(2) (1) Norwegian University of Life Sciences, (2) Oslo University Hospital Workshop. Monterrey, Mexico, Nov

More information

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50 LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if

More information

Relatedness 1: IBD and coefficients of relatedness

Relatedness 1: IBD and coefficients of relatedness Relatedness 1: IBD and coefficients of relatedness or What does it mean to be related? Magnus Dehli Vigeland NORBIS course, 8 th 12 th of January 2018, Oslo Plan Introduction What does it mean to be related?

More information

Lecture 9. QTL Mapping 2: Outbred Populations

Lecture 9. QTL Mapping 2: Outbred Populations Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred

More information

The Lander-Green Algorithm. Biostatistics 666 Lecture 22

The Lander-Green Algorithm. Biostatistics 666 Lecture 22 The Lander-Green Algorithm Biostatistics 666 Lecture Last Lecture Relationship Inferrence Likelihood of genotype data Adapt calculation to different relationships Siblings Half-Siblings Unrelated individuals

More information

1 Introduction. Probabilistic evaluation of low-quality DNA profiles. K. Ryan, D. Gareth Williams a, David J. Balding b,c

1 Introduction. Probabilistic evaluation of low-quality DNA profiles. K. Ryan, D. Gareth Williams a, David J. Balding b,c Probabilistic evaluation of low-quality DNA profiles K. Ryan, D. Gareth Williams a, David J. Balding b,c a. DGW Software Consultants LTD b. UCL Genetics Institute, University College, London WC1E 6BT c.

More information

Population Structure

Population Structure Ch 4: Population Subdivision Population Structure v most natural populations exist across a landscape (or seascape) that is more or less divided into areas of suitable habitat v to the extent that populations

More information

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17 Modeling IBD for Pairs of Relatives Biostatistics 666 Lecture 7 Previously Linkage Analysis of Relative Pairs IBS Methods Compare observed and expected sharing IBD Methods Account for frequency of shared

More information

Abstract. BEECHAM Jr., GARY WAYNE. Statistical methods for the Analysis of Forensic DNA Mixtures. (Under the direction of Bruce S. Weir.

Abstract. BEECHAM Jr., GARY WAYNE. Statistical methods for the Analysis of Forensic DNA Mixtures. (Under the direction of Bruce S. Weir. Abstract BEECHAM Jr., GARY WAYNE. Statistical methods for the Analysis of Forensic DNA Mixtures. (Under the direction of Bruce S. Weir.) As forensic DNA typing technologies become more sensitive, there

More information

Population genetics snippets for genepop

Population genetics snippets for genepop Population genetics snippets for genepop Peter Beerli August 0, 205 Contents 0.Basics 0.2Exact test 2 0.Fixation indices 4 0.4Isolation by Distance 5 0.5Further Reading 8 0.6References 8 0.7Disclaimer

More information

Covariance between relatives

Covariance between relatives UNIVERSIDADE DE SÃO PAULO ESCOLA SUPERIOR DE AGRICULTURA LUIZ DE QUEIROZ DEPARTAMENTO DE GENÉTICA LGN5825 Genética e Melhoramento de Espécies Alógamas Covariance between relatives Prof. Roberto Fritsche-Neto

More information

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation Ann. Hum. Genet., Lond. (1975), 39, 141 Printed in Great Britain 141 A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation BY CHARLES F. SING AND EDWARD D.

More information

Adventures in Forensic DNA: Cold Hits, Familial Searches, and Mixtures

Adventures in Forensic DNA: Cold Hits, Familial Searches, and Mixtures Adventures in Forensic DNA: Cold Hits, Familial Searches, and Mixtures Departments of Mathematics and Statistics Northwestern University JSM 2012, July 30, 2012 The approach of this talk Some simple conceptual

More information

Reporting LRs - principles

Reporting LRs - principles Simone Gittelson Ian Evett Reporting LRs - principles Tacha Hicks-Champod Franco Taroni DNA: http://www.formation-continueunil-epfl.ch/en/essentials-dnainterpretation 1 Steven Myers Tim Kalafut Gittelson

More information

STRmix - MCMC Study Page 1

STRmix - MCMC Study Page 1 SDPD Crime Laboratory Forensic Biology Unit Validation of the STRmix TM Software MCMC Markov Chain Monte Carlo Introduction The goal of DNA mixture interpretation should be to identify the genotypes of

More information

Problems for 3505 (2011)

Problems for 3505 (2011) Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.

More information

2. Map genetic distance between markers

2. Map genetic distance between markers Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,

More information

The problem of Y-STR rare haplotype

The problem of Y-STR rare haplotype The problem of Y-STR rare haplotype February 17, 2014 () Short title February 17, 2014 1 / 14 The likelihood ratio When evidence is found on a crime scene, the expert is asked to provide a Likelihood ratio

More information

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift: 9. Genetic Drift Genetic drift is the alteration of gene frequencies due to sampling variation from one generation to the next. It operates to some degree in all finite populations, but can be significant

More information

Identification of Pedigree Relationship from Genome Sharing

Identification of Pedigree Relationship from Genome Sharing INVESTIGATION Identification of Pedigree Relationship from Genome Sharing William G. Hill 1 and Ian M. S. White Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh,

More information

Y-STR: Haplotype Frequency Estimation and Evidence Calculation

Y-STR: Haplotype Frequency Estimation and Evidence Calculation Calculating evidence Further work Questions and Evidence Mikkel, MSc Student Supervised by associate professor Poul Svante Eriksen Department of Mathematical Sciences Aalborg University, Denmark June 16

More information

Affected Sibling Pairs. Biostatistics 666

Affected Sibling Pairs. Biostatistics 666 Affected Sibling airs Biostatistics 666 Today Discussion of linkage analysis using affected sibling pairs Our exploration will include several components we have seen before: A simple disease model IBD

More information

Notes on Population Genetics

Notes on Population Genetics Notes on Population Genetics Graham Coop 1 1 Department of Evolution and Ecology & Center for Population Biology, University of California, Davis. To whom correspondence should be addressed: gmcoop@ucdavis.edu

More information

Bayesian Updating: Odds Class 12, Jeremy Orloff and Jonathan Bloom

Bayesian Updating: Odds Class 12, Jeremy Orloff and Jonathan Bloom Learning Goals Bayesian Updating: Odds Class 2, 8.05 Jeremy Orloff and Jonathan Bloom. Be able to convert between odds and probability. 2. Be able to update prior odds to posterior odds using Bayes factors.

More information

Lecture 13: Population Structure. October 8, 2012

Lecture 13: Population Structure. October 8, 2012 Lecture 13: Population Structure October 8, 2012 Last Time Effective population size calculations Historical importance of drift: shifting balance or noise? Population structure Today Course feedback The

More information

8. Genetic Diversity

8. Genetic Diversity 8. Genetic Diversity Many ways to measure the diversity of a population: For any measure of diversity, we expect an estimate to be: when only one kind of object is present; low when >1 kind of objects

More information

Mechanisms of Evolution

Mechanisms of Evolution Mechanisms of Evolution 36-149 The Tree of Life Christopher R. Genovese Department of Statistics 132H Baker Hall x8-7836 http://www.stat.cmu.edu/ ~ genovese/. Plan 1. Two More Generations 2. The Hardy-Weinberg

More information

Lecture 2: Introduction to Quantitative Genetics

Lecture 2: Introduction to Quantitative Genetics Lecture 2: Introduction to Quantitative Genetics Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Basic model of Quantitative Genetics Phenotypic value --

More information

Introductory seminar on mathematical population genetics

Introductory seminar on mathematical population genetics Exercises Sheets Introductory seminar on mathematical population genetics WS 20/202 Kristan Schneider, Ada Akerman Ex Assume a single locus with alleles A and A 2 Denote the frequencies of the three (unordered

More information

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important? Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population

More information

Major Genes, Polygenes, and

Major Genes, Polygenes, and Major Genes, Polygenes, and QTLs Major genes --- genes that have a significant effect on the phenotype Polygenes --- a general term of the genes of small effect that influence a trait QTL, quantitative

More information

Forensic Genetics Module. Evaluating DNA evidence

Forensic Genetics Module. Evaluating DNA evidence Summer Institute in Statistical Genetics University of Washington, Seattle Forensic Genetics Module Evaluating DNA evidence David Balding Professor of Statistical Genetics University of Melbourne and UCL

More information

Characterization of Error Tradeoffs in Human Identity Comparisons: Determining a Complexity Threshold for DNA Mixture Interpretation

Characterization of Error Tradeoffs in Human Identity Comparisons: Determining a Complexity Threshold for DNA Mixture Interpretation Characterization of Error Tradeoffs in Human Identity Comparisons: Determining a Complexity Threshold for DNA Mixture Interpretation Boston University School of Medicine Program in Biomedical Forensic

More information

Chem 4331 Name : Final Exam 2008

Chem 4331 Name : Final Exam 2008 Chem 4331 Name : Final Exam 2008 Answer any seven of questions 1-8. Each question is worth 15 points for a total of 105 points. 1. A crime has been committed, and a blood sample has been found at the crime

More information

Match probabilities in a finite, subdivided population

Match probabilities in a finite, subdivided population Match probabilities in a finite, subdivided population Anna-Sapfo Malaspinas a, Montgomery Slatkin a, Yun S. Song b, a Department of Integrative Biology, University of California, Berkeley, CA 94720, USA

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm

More information

arxiv: v3 [stat.ap] 4 Nov 2014

arxiv: v3 [stat.ap] 4 Nov 2014 The multivarte Dirichlet-multinoml distribution and its application in forensic genetics to adjust for sub-population effects using the θ-correction T. TVEDEBRNK P. S. ERKSEN arxiv:1406.6508v3 [stat.p]

More information

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/

More information

Case Studies in Ecology and Evolution

Case Studies in Ecology and Evolution 3 Non-random mating, Inbreeding and Population Structure. Jewelweed, Impatiens capensis, is a common woodland flower in the Eastern US. You may have seen the swollen seed pods that explosively pop when

More information

STAT 536: Migration. Karin S. Dorman. October 3, Department of Statistics Iowa State University

STAT 536: Migration. Karin S. Dorman. October 3, Department of Statistics Iowa State University STAT 536: Migration Karin S. Dorman Department of Statistics Iowa State University October 3, 2006 Migration Introduction Migration is the movement of individuals between populations. Until now we have

More information

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE NORMAL DISTRIBUTIONS OF PHENOTYPES Mice Fruit Flies In:Introduction to Quantitative Genetics Falconer & Mackay 1996 CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE Mean and variance are two quantities

More information

Variance Component Models for Quantitative Traits. Biostatistics 666

Variance Component Models for Quantitative Traits. Biostatistics 666 Variance Component Models for Quantitative Traits Biostatistics 666 Today Analysis of quantitative traits Modeling covariance for pairs of individuals estimating heritability Extending the model beyond

More information

GBLUP and G matrices 1

GBLUP and G matrices 1 GBLUP and G matrices 1 GBLUP from SNP-BLUP We have defined breeding values as sum of SNP effects:! = #$ To refer breeding values to an average value of 0, we adopt the centered coding for genotypes described

More information

Bayesian analysis of the Hardy-Weinberg equilibrium model

Bayesian analysis of the Hardy-Weinberg equilibrium model Bayesian analysis of the Hardy-Weinberg equilibrium model Eduardo Gutiérrez Peña Department of Probability and Statistics IIMAS, UNAM 6 April, 2010 Outline Statistical Inference 1 Statistical Inference

More information

A gamma model for DNA mixture analyses

A gamma model for DNA mixture analyses Bayesian Analysis (2007) 2, Number 2, pp. 333 348 A gamma model for DNA mixture analyses R. G. Cowell, S. L. Lauritzen and J. Mortera Abstract. We present a new methodology for analysing forensic identification

More information

1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c.

1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c. 1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c. Changes in allele frequencies in a population across generations

More information

DNA commission of the International Society of Forensic Genetics: Recommendations on the interpretation of mixtures

DNA commission of the International Society of Forensic Genetics: Recommendations on the interpretation of mixtures Forensic Science International 160 (2006) 90 101 www.elsevier.com/locate/forsciint DNA commission of the International Society of Forensic Genetics: Recommendations on the interpretation of mixtures P.

More information

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES: .5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the

More information

Evaluation of the STR typing kit PowerPlexk 16 with respect to technical performance and population genetics: a multicenter study

Evaluation of the STR typing kit PowerPlexk 16 with respect to technical performance and population genetics: a multicenter study International Congress Series 1239 (2003) 789 794 Evaluation of the STR typing kit PowerPlexk 16 with respect to technical performance and population genetics: a multicenter study L. Henke a, *, A. Aaspollu

More information

Analyzing the genetic structure of populations: a Bayesian approach

Analyzing the genetic structure of populations: a Bayesian approach Analyzing the genetic structure of populations: a Bayesian approach Introduction Our review of Nei s G st and Weir and Cockerham s θ illustrated two important principles: 1. It s essential to distinguish

More information

Inference on pedigree structure from genome screen data. Running title: Inference on pedigree structure. Mary Sara McPeek. The University of Chicago

Inference on pedigree structure from genome screen data. Running title: Inference on pedigree structure. Mary Sara McPeek. The University of Chicago 1 Inference on pedigree structure from genome screen data Running title: Inference on pedigree structure Mary Sara McPeek The University of Chicago Address for correspondence: Department of Statistics,

More information

2018 SISG Module 20: Bayesian Statistics for Genetics Lecture 2: Review of Probability and Bayes Theorem

2018 SISG Module 20: Bayesian Statistics for Genetics Lecture 2: Review of Probability and Bayes Theorem 2018 SISG Module 20: Bayesian Statistics for Genetics Lecture 2: Review of Probability and Bayes Theorem Jon Wakefield Departments of Statistics and Biostatistics University of Washington Outline Introduction

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 009 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

EXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series:

EXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series: Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 7 Exercise 7.. Derive the two scales of relation for each of the two following recurrent series: u: 0, 8, 6, 48, 46,L 36 7

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 007 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results

More information

DNA Evidence in the Ferguson Grand Jury Proceedings

DNA Evidence in the Ferguson Grand Jury Proceedings DNA Evidence in the Ferguson Grand Jury Proceedings Marcello Di Bello!! Lehman College CUNY!! PHI 169 - Spring 2015 01 How Trustworthy is DNA Evidence?! Is it More Trustworthy than Eyewitness Evidence?

More information

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda 1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;

More information

A paradigm shift in DNA interpretation John Buckleton

A paradigm shift in DNA interpretation John Buckleton A paradigm shift in DNA interpretation John Buckleton Specialist Science Solutions Manaaki Tangata Taiao Hoki protecting people and their environment through science I sincerely acknowledge conversations

More information

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation? October 12, 2009 Bioe 109 Fall 2009 Lecture 8 Microevolution 1 - selection The Hardy-Weinberg-Castle Equilibrium - consider a single locus with two alleles A 1 and A 2. - three genotypes are thus possible:

More information

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8 The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as

More information

Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives

Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Genetic Epidemiology 16:225 249 (1999) Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives Mary Sara McPeek* Department of Statistics, University of Chicago, Chicago, Illinois

More information

Bayesian Regression (1/31/13)

Bayesian Regression (1/31/13) STA613/CBB540: Statistical methods in computational biology Bayesian Regression (1/31/13) Lecturer: Barbara Engelhardt Scribe: Amanda Lea 1 Bayesian Paradigm Bayesian methods ask: given that I have observed

More information

A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes

A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate genotypes Genetics: Published Articles Ahead of Print, published on July 30, 2012 as 10.1534/genetics.112.139519 A maximum likelihood method to correct for allelic dropout in microsatellite data with no replicate

More information

Appendix A Evolutionary and Genetics Principles

Appendix A Evolutionary and Genetics Principles Appendix A Evolutionary and Genetics Principles A.1 Genetics As a start, we need to distinguish between learning, which we take to mean behavioral adaptation by an individual, and evolution, which refers

More information

Methods for Cryptic Structure. Methods for Cryptic Structure

Methods for Cryptic Structure. Methods for Cryptic Structure Case-Control Association Testing Review Consider testing for association between a disease and a genetic marker Idea is to look for an association by comparing allele/genotype frequencies between the cases

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Match Likelihood Ratio for Uncertain Genotypes

Match Likelihood Ratio for Uncertain Genotypes Match Likelihood Ratio for Uncertain Genotypes Mark W. Perlin *,1, PhD, MD, PhD Joseph B. Kadane 2, PhD Robin W. Cotton 3, PhD 1 Cybergenetics, Pittsburgh, PA, USA 2 Statistics Department, Carnegie Mellon

More information

Alignment Algorithms. Alignment Algorithms

Alignment Algorithms. Alignment Algorithms Midterm Results Big improvement over scores from the previous two years. Since this class grade is based on the previous years curve, that means this class will get higher grades than the previous years.

More information

ABSTRACT. ZAYKIN, DMITRI V. Statistical Analysis of Genetic Associations (Advisor: Bruce S. Weir)

ABSTRACT. ZAYKIN, DMITRI V. Statistical Analysis of Genetic Associations (Advisor: Bruce S. Weir) ABSTRACT ZAYKIN, DMITRI V. Statistical Analysis of Genetic Associations (Advisor: Bruce S. Weir) There is an increasing need for a statistical treatment of genetic data prompted by recent advances in molecular

More information

Goodness of Fit Goodness of fit - 2 classes

Goodness of Fit Goodness of fit - 2 classes Goodness of Fit Goodness of fit - 2 classes A B 78 22 Do these data correspond reasonably to the proportions 3:1? We previously discussed options for testing p A = 0.75! Exact p-value Exact confidence

More information

The Wright-Fisher Model and Genetic Drift

The Wright-Fisher Model and Genetic Drift The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population

More information

Outline for today s lecture (Ch. 14, Part I)

Outline for today s lecture (Ch. 14, Part I) Outline for today s lecture (Ch. 14, Part I) Ploidy vs. DNA content The basis of heredity ca. 1850s Mendel s Experiments and Theory Law of Segregation Law of Independent Assortment Introduction to Probability

More information

Demography April 10, 2015

Demography April 10, 2015 Demography April 0, 205 Effective Population Size The Wright-Fisher model makes a number of strong assumptions which are clearly violated in many populations. For example, it is unlikely that any population

More information

Lecture 6. QTL Mapping

Lecture 6. QTL Mapping Lecture 6 QTL Mapping Bruce Walsh. Aug 2003. Nordic Summer Course MAPPING USING INBRED LINE CROSSES We start by considering crosses between inbred lines. The analysis of such crosses illustrates many of

More information

Calculation of IBD probabilities

Calculation of IBD probabilities Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

Estimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty

Estimation of Parameters in Random. Effect Models with Incidence Matrix. Uncertainty Estimation of Parameters in Random Effect Models with Incidence Matrix Uncertainty Xia Shen 1,2 and Lars Rönnegård 2,3 1 The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden; 2 School

More information

Genotype Imputation. Biostatistics 666

Genotype Imputation. Biostatistics 666 Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives

More information

arxiv: v1 [q-bio.qm] 26 Feb 2016

arxiv: v1 [q-bio.qm] 26 Feb 2016 Non-Identifiable Pedigrees and a Bayesian Solution B. Kirkpatrick arxiv:1602.08183v1 [q-bio.qm] 26 Feb 2016 University of British Columbia Abstract. Some methods aim to correct or test for relationships

More information

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Outline Genetics of complex

More information

Selection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION

Selection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION Selection Page 1 sur 11 Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION * I- Introduction II- Modeling and selective values III- Basic model IV- Equation of the recurrence of allele

More information

Observation: we continue to observe large amounts of genetic variation in natural populations

Observation: we continue to observe large amounts of genetic variation in natural populations MUTATION AND GENETIC VARIATION Observation: we continue to observe large amounts of genetic variation in natural populations Problem: How does this variation arise and how is it maintained. Here, we look

More information

SOLUTIONS TO EXERCISES FOR CHAPTER 9

SOLUTIONS TO EXERCISES FOR CHAPTER 9 SOLUTIONS TO EXERCISES FOR CHPTER 9 gronomy 65 Statistical Genetics W. E. Nyquist March 00 Exercise 9.. a. lgebraic method for the grandparent-grandoffspring covariance (see parent-offspring covariance,

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

Big Idea #1: The process of evolution drives the diversity and unity of life

Big Idea #1: The process of evolution drives the diversity and unity of life BIG IDEA! Big Idea #1: The process of evolution drives the diversity and unity of life Key Terms for this section: emigration phenotype adaptation evolution phylogenetic tree adaptive radiation fertility

More information

Mathematical Population Genetics II

Mathematical Population Genetics II Mathematical Population Genetics II Lecture Notes Joachim Hermisson June 9, 2018 University of Vienna Mathematics Department Oskar-Morgenstern-Platz 1 1090 Vienna, Austria Copyright (c) 2013/14/15/18 Joachim

More information

Applications of Mathematics

Applications of Mathematics Applications of Mathematics Tomáš Cipra Exponential smoothing for irregular data Applications of Mathematics Vol. 5 2006) No. 6 597--604 Persistent URL: http://dml.cz/dmlcz/34655 Terms of use: Institute

More information

Outline of lectures 3-6

Outline of lectures 3-6 GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 013 Population genetics Outline of lectures 3-6 1. We ant to kno hat theory says about the reproduction of genotypes in a population. This results

More information

Module 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline

Module 4: Bayesian Methods Lecture 9 A: Default prior selection. Outline Module 4: Bayesian Methods Lecture 9 A: Default prior selection Peter Ho Departments of Statistics and Biostatistics University of Washington Outline Je reys prior Unit information priors Empirical Bayes

More information

. Also, in this case, p i = N1 ) T, (2) where. I γ C N(N 2 2 F + N1 2 Q)

. Also, in this case, p i = N1 ) T, (2) where. I γ C N(N 2 2 F + N1 2 Q) Supplementary information S7 Testing for association at imputed SPs puted SPs Score tests A Score Test needs calculations of the observed data score and information matrix only under the null hypothesis,

More information

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015

Lecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015 Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.

More information

Bootstrap Procedures for Testing Homogeneity Hypotheses

Bootstrap Procedures for Testing Homogeneity Hypotheses Journal of Statistical Theory and Applications Volume 11, Number 2, 2012, pp. 183-195 ISSN 1538-7887 Bootstrap Procedures for Testing Homogeneity Hypotheses Bimal Sinha 1, Arvind Shah 2, Dihua Xu 1, Jianxin

More information