The Lander-Green Algorithm. Biostatistics 666 Lecture 22

Similar documents
Calculation of IBD probabilities

Calculation of IBD probabilities

Affected Sibling Pairs. Biostatistics 666

Breeding Values and Inbreeding. Breeding Values and Inbreeding

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

Lander-Green Algorithm. Biostatistics 666

Genotype Imputation. Biostatistics 666

Variance Component Models for Quantitative Traits. Biostatistics 666

Solutions to Problem Set 4

The genomes of recombinant inbred lines

MCMC IN THE ANALYSIS OF GENETIC DATA ON PEDIGREES

Hidden Markov models in population genetics and evolutionary biology

Tutorial Session 2. MCMC for the analysis of genetic data on pedigrees:

Inference on pedigree structure from genome screen data. Running title: Inference on pedigree structure. Mary Sara McPeek. The University of Chicago

Use of hidden Markov models for QTL mapping

On Computation of P-values in Parametric Linkage Analysis

Computation of Multilocus Prior Probability of Autozygosity for Complex Inbred Pedigrees

Chapter 4 Lesson 1 Heredity Notes

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

Modeling IBD for Pairs of Relatives. Biostatistics 666 Lecture 17

Interactive Biology Multimedia Courseware Mendel's Principles of Heredity. Copyright 1998 CyberEd Inc.

LECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50

HEREDITY: Objective: I can describe what heredity is because I can identify traits and characteristics

Chapter 13 Meiosis and Sexual Reproduction

Outline for today s lecture (Ch. 14, Part I)

Dropping Your Genes. A Simulation of Meiosis and Fertilization and An Introduction to Probability

Family resemblance can be striking!

Optimal Allele-Sharing Statistics for Genetic Mapping Using Affected Relatives

Heredity and Genetics WKSH

2. Map genetic distance between markers

Outline. P o purple % x white & white % x purple& F 1 all purple all purple. F purple, 224 white 781 purple, 263 white

Lecture 11: Multiple trait models for QTL analysis

Designer Genes C Test

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

Lecture 9. QTL Mapping 2: Outbred Populations

Parts 2. Modeling chromosome segregation

1 Errors in mitosis and meiosis can result in chromosomal abnormalities.

F1 Parent Cell R R. Name Period. Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes

genome a specific characteristic that varies from one individual to another gene the passing of traits from one generation to the next

EXERCISES FOR CHAPTER 7. Exercise 7.1. Derive the two scales of relation for each of the two following recurrent series:

Gene mapping, linkage analysis and computational challenges. Konstantin Strauch

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Statistics 246 Spring 2006

Yesterday s Picture UNIT 3D

BS 50 Genetics and Genomics Week of Oct 3 Additional Practice Problems for Section. A/a ; B/B ; d/d X A/a ; b/b ; D/d

Gene mapping in model organisms

Objectives. Announcements. Comparison of mitosis and meiosis

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.

Lecture WS Evolutionary Genetics Part I 1

Statistical issues in QTL mapping in mice

The history of Life on Earth reflects an unbroken chain of genetic continuity and transmission of genetic information:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Parts 2. Modeling chromosome segregation

Labs 7 and 8: Mitosis, Meiosis, Gametes and Genetics

NCEA Level 2 Biology (91157) 2017 page 1 of 5 Assessment Schedule 2017 Biology: Demonstrate understanding of genetic variation and change (91157)

Biology. Revisiting Booklet. 6. Inheritance, Variation and Evolution. Name:

Week 7.2 Ch 4 Microevolutionary Proceses

Unit 2 Lesson 4 - Heredity. 7 th Grade Cells and Heredity (Mod A) Unit 2 Lesson 4 - Heredity

Ch 11.4, 11.5, and 14.1 Review. Game

Big Idea 3: Living systems store, retrieve, transmit, and respond to information essential to life processes.

REVISION: GENETICS & EVOLUTION 20 MARCH 2013

The statistical evaluation of DNA crime stains in R

Which of these best predicts the outcome of the changes illustrated in the diagrams?

Outline of lectures 3-6

10. How many chromosomes are in human gametes (reproductive cells)? 23

The Quantitative TDT

Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012

Lecture 6. QTL Mapping

Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes

Learning gene regulatory networks Statistical methods for haplotype inference Part I

6.6 Meiosis and Genetic Variation. KEY CONCEPT Independent assortment and crossing over during meiosis result in genetic diversity.

Evolution of Populations. Chapter 17

Binary trait mapping in experimental crosses with selective genotyping

When one gene is wild type and the other mutant:

AP Biology Unit 6 Practice Test 1. A group of cells is assayed for DNA content immediately following mitosis and is found to have an average of 8

Big Idea #1: The process of evolution drives the diversity and unity of life

New imputation strategies optimized for crop plants: FILLIN (Fast, Inbred Line Library ImputatioN) FSFHap (Full Sib Family Haplotype)

A Robust Identity-by-Descent Procedure Using Affected Sib Pairs: Multipoint Mapping for Complex Diseases

arxiv: v1 [q-bio.qm] 26 Feb 2016

7.014 Problem Set 6 Solutions

Unit 7 Genetics. Meiosis

Ch 11.Introduction to Genetics.Biology.Landis

Meiosis and Mendel. Chapter 6

Part 2- Biology Paper 2 Inheritance and Variation Knowledge Questions

Meiosis. ~ fragmentation - pieces split off and each piece becomes a new organism - starfish

BENCHMARK 1 STUDY GUIDE SPRING 2017

SNP Association Studies with Case-Parent Trios

Advance Organizer. Topic: Mendelian Genetics and Meiosis

Lesson 4: Understanding Genetics

Mathematical models in population genetics II

The universal validity of the possible triangle constraint for Affected-Sib-Pairs

Genotype Imputation. Class Discussion for January 19, 2016

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014

Introduction to Genetics

Name Date Class. Meiosis I and Meiosis II

Outline of lectures 3-6

Mendelian Genetics. Introduction to the principles of Mendelian Genetics

Segregation versus mitotic recombination APPENDIX

CELL BIOLOGY - CLUTCH CH MEIOSIS AND SEXUAL REPRODUCTION.

Darwinian Selection. Chapter 7 Selection I 12/5/14. v evolution vs. natural selection? v evolution. v natural selection

Transcription:

The Lander-Green Algorithm Biostatistics 666 Lecture

Last Lecture Relationship Inferrence Likelihood of genotype data Adapt calculation to different relationships Siblings Half-Siblings Unrelated individuals Importance of modeling error

Today The Lander-Green Algorithm Multipoint analysis in general pedigrees The basis of modern pedigree analysis packages

Hidden Markov Model X X X 1 X M P X 1 I P X I P X I P X M I ( 1 ( ( ( M I 1 I I I M P I I P I I P(... ( 1 ( The final ingredient connects IBD states along the chromosome

Fundamental Calculations Enumerate possible IBD states Transition probability for neighboring IBD states Probability of genotype data given IBD state

Lander-Green Algorithm More general definition for I, the "IBD vector" Probability of genotypes given IBD vector Transition probabilities for the IBD vectors = = = 1 1 1 1 ( ( (... I m i i i I m i i i I G P I I P I P L m

Part I IBD Vectors Inheritance Vectors Descent Graphs Gene Flow Pattern

IBD Vector Specifications Specify IBD between all individuals Must be compact Must allow calculation of: Conditional probabilities for neighboring markers Probability of observed genotypes

IBD Vector Specify the outcome of each meiosis Which of the two parental alleles transmitted? Implies founder allele carried by each individual Implies whether a pair of chromosomes is identical-by-descent

For any pedigree, consider What are the meioses? What are the possible outcomes for the entire set of meioses?

Example

What we are doing... Listing meioses Alternating outcomes The outcomes of all meioses define our IBD vector Meiosis 1 Meiosis Meiosis V1 V V V4

So far A set of n binary digits specifies IBD in a pedigree with n non-founders There are n such sets Next, must calculate the probability of the observed genotypes for each one

Part II Probability of Observed Genotypes Founder Allele Graph Founder Allele Frequencies

Founder Allele Graphs / Sets Calculated for each marker individually List of founder alleles compatible with: Observed genotypes for all individuals A particular gene flow pattern Likelihood of each set is a product of allele frequencies

Observed Genotypes For each family For each marker Some pattern of observed genotypes???? 1 1 1 1

Gene flow pattern In turn, specify gene flow throughout the pedigree For each individual, we know precisely what founder allele they carry

Combine the two Conditional on gene flow Founder allele states are restricted In this case, there is only one founder allele set: {1, 1, 1,?} 1 1 1? Likelihood is a product of allele frequencies P(allele 1 P(any allele 1 1 1 1

Finding founder allele sets Group founder alleles transmitted to the same genotyped individuals If a founder allele passes through an homozygote or different heterozygotes Its state is either fixed or impossible Fixes state of other alleles in the group

No. of Possible States for Grouped Founder Alleles No compatible states One Possible State If at least founder allele passes through different homozygotes or incompatible heterozygotes Two Possible States For Each Allele Observed genotypes are all identical and heterozygous Every marker allele is possible Only for unconnected founder alleles

Example: Observed Genotypes 1 1 1 1 1 4

Example Descent Graph C D E F A B G H {1,} {1,} {1,} {1,} {1,} {,4}

Possible founder allele states Founder Alleles in Corresponding Group Allele States Probability (B (any allele 1 (A,C,E,,1 or (,1, P²P(+P(P² (D,F,G,H,,,4 PP(P(P(4

Lander-Green inheritance vector n elements Meiotic outcomes specified in index bit Stores probability of genotypes for each set of meiotic outcomes 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110 1111 L L L L L L L L L L L L L L L L

So far Generalized the IBD vector Probability of observed genotypes Next step: Transition probabilities HMM to combine information along the genome

Part III Transition Probabilities Recombination Fraction Changes in IBD Along Chromosome

With one meiosis T =

With two meiosis T = T T T T

With two meiosis = T

With three meiosis T T = T T T

With three meiosis = T

In general Transition matrix is patterned Transition probability depends on: No. of meiosis were outcome changed No. of meiosis were outcome did not change Product of powers of and

Recursive Formulation T n+ 1 n = n n T T T n T

Lander-Green Markov Model Transition matrix T n T = 1 1 v l 1..l = v l-1 1..l-1 T n v l v l l..m = v l+1 l+1..m T n v l v l 1..m = (v 1..l-1 T n v l (v l+1..m T n

All The Ingredients To Single Marker Left Conditional Right Conditional Full Likelihood

Appropriate Problems Large number of markers Analysis of >5,000 markers possible Relatively small pedigrees 0-0 individuals x larger pedigrees for the X chromosome. Why?

So far Key components for Lander-Green Extending definition of IBD vector Probability of genotypes given IBD Transition probabilities Next: Practical applications!

Lander-Green Algorithm More general definition for I, the "IBD vector" Probability of genotypes given IBD vector Transition probabilities for the IBD vectors = = = 1 1 1 1 ( ( (... I m i i i I m i i i I G P I I P I P L m

Reading Historically, two key papers: Lander and Green 987 PNAS 84:6-7 Kruglyak, Daly, Reeve-Daly, Lander 996 Am J Hum Genet 58:147-6