Lab 12. Linkage Disequilibrium. November 28, 2012

Similar documents
1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:

Introduction to Linkage Disequilibrium

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Phasing via the Expectation Maximization (EM) Algorithm

Microevolution Changing Allele Frequencies

Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.

Problems for 3505 (2011)

Linkage and Linkage Disequilibrium

Recombina*on and Linkage Disequilibrium (LD)

EXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

2. Map genetic distance between markers

Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012

Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo

Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014

Introduction. Key Concepts I: Mitosis. AP Biology Laboratory 3 Mitosis & Meiosis

Processes of Evolution

Lecture 2. Basic Population and Quantitative Genetics

When one gene is wild type and the other mutant:

Meiosis vs Mitosis. How many times did it go through prophase-metaphase-anaphase-telophase?

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

A. Correct! Genetically a female is XX, and has 22 pairs of autosomes.

Population Genetics I. Bio

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

The Wright-Fisher Model and Genetic Drift

The phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.

8. Genetic Diversity

MEIOSIS, THE BASIS OF SEXUAL REPRODUCTION

Case Studies in Ecology and Evolution

STAT 536: Migration. Karin S. Dorman. October 3, Department of Statistics Iowa State University

1. Understand the methods for analyzing population structure in genomes

The Chromosomal Basis of Inheritance

STAT 536: Genetic Statistics

Outline of lectures 3-6

Outline of lectures 3-6

Genetics (patterns of inheritance)

Tutorial on Theoretical Population Genetics

1. T/F: Genetic variation leads to evolution. 2. What is genetic equilibrium? 3. What is speciation? How does it occur?

The E-M Algorithm in Genetics. Biostatistics 666 Lecture 8

Question: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?

Mechanisms of Evolution

Evolutionary Genetics Midterm 2008

Life Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM

Ch 11.Introduction to Genetics.Biology.Landis

Chapter 16. Table of Contents. Section 1 Genetic Equilibrium. Section 2 Disruption of Genetic Equilibrium. Section 3 Formation of Species

Lecture 13: Population Structure. October 8, 2012

What is a sex cell? How are sex cells made? How does meiosis help explain Mendel s results?

Effective population size and patterns of molecular evolution and variation

Chapter 6 Meiosis and Mendel

Reinforcement Unit 3 Resource Book. Meiosis and Mendel KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

The Quantitative TDT

Reproduction and Evolution Practice Exam

How robust are the predictions of the W-F Model?

Section 11 1 The Work of Gregor Mendel

Evolutionary quantitative genetics and one-locus population genetics

Population Genetics & Evolution

AUTHORIZATION TO LEND AND REPRODUCE THE THESIS. Date Jong Wha Joanne Joo, Author

Population genetics snippets for genepop

Introduction to Genetics

Lecture WS Evolutionary Genetics Part I 1

Mathematical models in population genetics II

The Genetics of Natural Selection

Darwin s Observations & Conclusions The Struggle for Existence

Background Selection in Partially Selfing Populations

I. Short Answer Questions DO ALL QUESTIONS

Introduction to Genetics

KEY: Chapter 9 Genetics of Animal Breeding.

The Mechanisms of Evolution

Name Class Date. KEY CONCEPT Gametes have half the number of chromosomes that body cells have.

Ladies and Gentlemen.. The King of Rock and Roll

Linear Regression (1/1/17)

Enduring Understanding: Change in the genetic makeup of a population over time is evolution Pearson Education, Inc.

Case-Control Association Testing. Case-Control Association Testing

Segregation versus mitotic recombination APPENDIX

List the five conditions that can disturb genetic equilibrium in a population.(10)

Evolution of Populations

Selection Page 1 sur 11. Atlas of Genetics and Cytogenetics in Oncology and Haematology SELECTION

Biology 20 Evolution

BIRS workshop Sept 6 11, 2009

Labs 7 and 8: Mitosis, Meiosis, Gametes and Genetics

Theory of Natural Selection

Lecture 9. Short-Term Selection Response: Breeder s equation. Bruce Walsh lecture notes Synbreed course version 3 July 2013

Modeling Genetic Variation in Gametes PSI AP Biology

(Genome-wide) association analysis

For 5% confidence χ 2 with 1 degree of freedom should exceed 3.841, so there is clear evidence for disequilibrium between S and M.

F SR = (H R H S)/H R. Frequency of A Frequency of a Population Population

Lecture 13: Variation Among Populations and Gene Flow. Oct 2, 2006

Expression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia

AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity,

opulation genetics undamentals for SNP datasets

BIO Lab 5: Paired Chromosomes

Introduction to Genetics. Why do biological relatives resemble one another?

p(d g A,g B )p(g B ), g B

Name Class Date. Pearson Education, Inc., publishing as Pearson Prentice Hall. 33

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

I. GREGOR MENDEL - father of heredity

Computational Approaches to Statistical Genetics

Transcription:

Lab 12. Linkage Disequilibrium November 28, 2012

Goals 1. Es<ma<on of LD in terms of D, D and r 2. 2. Determine effect of random and non- random ma<ng on LD. 3. Es<mate LD from diploid genotype data using EM- algorithm.

LD es<ma<on in two- locus (A&B) and two- allele (1 & 2) model A1 p1 A1 p1 A2 p2 A2 p2 B1 q1 B2 q2 B1 q1 B2 q2 Gamete Observed gametic frequency Expected gametic frequency under linkage equilibrium Allele Allele frequency A1B1 x 11 p 1 q 1 A1 p1=x11+x12 A1B2 x 12 p 1 q 2 A2 p2= x21+x22 A2B1 x 21 p 2 q 1 B1 q1= x11+x21 A2B2 x 22 p 2 q 2 B2 q2= x12+x22 D = x p q = x p q = p q x = p q x = x x x x 11 1 1 22 2 2 1 2 12 1 2 12 11 22 12 21

Different measures of LD If D > 0, D max = min(p 1 q 2, p 2 q 1 ) If D < 0, D max = min(p 1 q 1, p 2 q 2 ). D ' = D D max r = D p p q q 1 2 1 2 2 χ = 2 Nr = p 1 ND p q 2 2 1 q 2

Allele history Time High dri/ or Selec2ve sweep

LD Broken by recombina2on A 1 B 1 A 2 B 2 A 1 B 2 A 2 B 1 A 1 B 1 A 1 B 2 A 1 B 2 A 1 B 1

LD Broken by recombina2on Closer proximity - > less recombina2on - > stronger LD

Decay of LD 0 0 ) 1 ( D e D c D ct t t = = 0 ln 1 D D c t t c effective = c 1! S 2! S " # $ % & ' Recombina<on rate for self- fer<lizing organisms:

Problem 1. In most conifers, gamete frequencies and the linkage phase of diploid genotypes can be determined directly because seeds contain relatively large amounts of haploid nutritional tissue (called endosperm or megagametophyte), which originates from the maternal gamete. As part of a study of the linkage relationship among allozyme loci in loblolly pine (Pinus taeda), Adams and Joly (1980) sampled 456 gametes at loci phosphoglucose isomerase 2 (PGI2, for simplicity, let this be locus A) and glutamate-oxaloacetate transaminase 1 (GOT1, let this be locus B) and observed the following numbers of gametes.(15 minutes) Gamete Count A1B1 138 A1B2 88 A2B1 78 A2B2 152 Total 456 a.)calculate D, D, and r 2, and test the statistical significance of the gametic disequilibrium between the two loci. b.)because the linkage phase of each mother tree was known, Adams and Joly were able to estimate that the recombination rate between the two loci is c = 0.044. i. What is the expected value of D in the next generation (i.e., in the offspring of the seeds that were included in the study)? ii. How many generations of random mating will it take for D to decay below 0.005? iii. What is the expected value of D in the next generation if: S = 0.1? S = 0.5? S = 0.9? c.) Repeat the calculations from b) assuming c = 0.5 (i.e., assuming that the two loci are physically unlinked). d.) Discuss the relative importance of rates of recombination and self-fertilization in determining the rate of LD decay

Problem 2. Compare rates of decay of r 2 with physical distance in sequences from the phytochrome B2 (PHYB2) gene in European aspen (Populus tremula) and the phytochrome C (PHYC) gene in Arabidopsis thaliana. a) Show scatter plots with trend lines illustrating the decay of r 2 with physical distance for each gene b) How do the patterns of LD differ between these two species, and why? c) GRADUATE STUDENTS: Provide facts and citations supporting your biological explanation

Haplotypes through EM When we genotype, we o/en don t know the actual haplotypes Unphased haplotypes Can use a maximum likelihood method to obtain haplotype frequencies Expecta2on Maximiza2on (EM)

Haplotypes through EM 1. Ini2alize Guess the gamete frequencies 2. Expecta2on Step Find expected frequencies of known phase genotypes given gamete frequencies 3. Maximiza2on Step Find expected frequencies of all unphased genotypes given gamete frequencies a. Use to make new gamete frequency es2mates L = P( Data haplotype frequencies) = 1! n2! n3! n4! n5! where n= # of unphased genotypes in the samples, n1, n2.n5, are the # of 2mes each unphased genotype was observed in the sample, and P1, P2,., P5 are the expected frequencies of the unphased genotypes in the sample. n n! P n 1 1 P n2 2 P n 3 3 P n 4 4 P n 5 5

Problem 3. File human_ld.arp contains data for humans from two populations (Han and Melanesian) genotyped for the same loci you have analyzed for departures from Hardy-Weinberg Equilibrium and population structure. The Han sample includes individuals from a broad geographic area in China, whereas the Melanesian sample only includes individuals from the Bougainville Island. Use Arlequin to test for significant linkage disequilibrium among the 10 loci in each of these populations. a.) How do you interpret the difference in the number of linked loci in the two populations? b.) GRAD STUDENTS: How many pairs of loci are expected to show significant LD at α=0.05 by chance? c.) GRAD STUDENTS: Provide facts and citations supporting your biological claim.

Han hyp://en.wikipedia.org/wiki/melanesia