Friday Harbor From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo
|
|
- Camron Bishop
- 5 years ago
- Views:
Transcription
1 Friday Harbor 2017 From Genetics to GWAS (Genome-wide Association Study) Sept David Fardo
2 Purpose: prepare for tomorrow s tutorial Genetic Variants Quality Control Imputation Association Visualization Prioritization
3 OUTLINE Goal: be able to answer the following questions What are some of the historical landmarks of GWAS? What is unique about GWAS data and data quality considerations? How do you test for genetic association?
4 TOWARDS GWAS Evidence for genetic role? Population differences Familial aggregation Linkage?
5 LINKAGE
6 One cent Use properties of recombination to localize Track transmissions through families Linkage Analysis: a two cent version Second cent Use principle of similarity Sib-pairs that are phenotypically similar should also be genotypically similar -Penrose, 1935 Identity by state / descent (IBS/IBD) Thomas Hunt Morgan Nobel "for his discoveries concerning the role played by the chromosome in heredity".
7 Recombination Two Loci: A and B Two Alleles at each Locus: {A 1, A 2 }, {B 1, B 2 } Four Possible Haplotypes: A 1 B 1 A 1 B 2 A 2 B 1 A 2 B 2 Ten Possible Diploid Genotypes (sometimes called diplotypes): A 1 B 1 A 1 B 1 A 1 B 1 A 1 B 2 A 1 B 1 A 2 B 1 A 1 B 1 A 2 B 2 A 1 B 2 A 1 B 2 A 1 B 2 A 2 B 1 A 1 B 2 A 2 B 2 A 2 B 1 A 2 B 1 A 2 B 1 A 2 B 2 A 2 B 2 A 2 B 2
8 Diploid Genotypes A 1 B 1 A 1 B 1 A 1 B 1 A 1 B 2 A 1 B 1 A 2 B 1 A 1 B 1 A 2 B 2 A 1 B 2 A 1 B 2 A 1 B 2 A 2 B 1 A 1 B 2 A 2 B 2 A 2 B 1 A 2 B 1 A 2 B 1 A 2 B 2 A 2 B 2 A 2 B 2 Recombination only detectable in the double heterozygotes
9 Double Heterozygotes A 1 B 1 A 2 B 2 gametes A 1 B 1 A 1 B 2 A 2 B 1 A 2 B 2 1- q 2 transmission probability q 2 q 2 1- q 2 q = recombination rate (ranges from 0 to 0.5) q = 0 : no recombination q = 0.5 : unlinked
10
11 Cardon & Bell, Nat Rev Gen (2001)
12 LINKAGE DISEQUILIBRIUM (LD)
13 A definition Linkage Disequilibrium allelic association between two genetic loci
14 What you need to know about LD It can be defined several ways mathematically, each definition with its own pros/cons (I will show a couple briefly) It degrades over generations Its properties are used for GWAS
15 Linkage Disequilibrium A 1 B 1 A 2 B 2 gametes A 1 B 1 A 1 B 2 A 2 B 1 A 2 B 2 transmission frequency x 1 x 2 x 3 x 4
16 Linkage Disequilibrium Gametes A 1 B 1 A 1 B 2 A 2 B 1 A 2 B 2 Frequency x 1 x 2 x 3 x 4 Allele A 1 A 2 B 1 B 2 Frequency p A1 =x 1 +x 2 p A2 =x 3 +x 4 p B1 =x 1 +x 3 p B2 =x 2 +x 4 D = Observed - Expected D = x 1 - p A1 p B1 D = x 1 - (x 1 + x 2 )(x 1 + x 3 ) D = x 1 x 4 - x 2 x 3
17 Linkage Disequilibrium After one generation of random mating: x 1 x 2 x 3 x 4 = = = = x x x 1 x qd + qd + qd -qd D t=1 = x 1 x 4 - x 2 x 3 D t=1 = (1-q)D After t generations: D t = (1-q) t D 0
18 What does this mean? D t = (1-q) t D 0 D 0 theta t D
19 Normalized LD Parameters D = D D max D max = min(p A1 p B2,p A2 p B1 ) if D is positive = min(p A1 p B1,p A2 p B2 ) if D is negative Now, LD ranges from -1 to +1
20 r 2 Most commonly used LD measure -- squared correlation coefficient -- r 2 = D 2 p A1 p A 2 p B1 p B 2
21 LD take home points It can be defined several ways mathematically, each definition with its own pros/cons It degrades over generations Its properties are used for GWAS
22 IMPUTATION
23 Historical context of some large-scale initiatives à towards imputation Human Genome Project 2003 (kind of) 2 males, 2 females HapMap 2005 / 2007 / 2009 initially 269; expanded to ~ Genomes Project 2010 / 2012 / 2015 guess? Haplotype Reference Consortium st release is ~65k haplotypes All of Us (PMI Initiative)?
24 Human Genome Project Goals: identify all the approximate 30,000 genes in human DNA, determine the sequences of the 3 billion chemical base pairs that make up human DNA, store this information in databases, improve tools for data analysis, transfer related technologies to the private sector, and address the ethical, legal, and social issues (ELSI) that may arise from the project. Milestones: 1990: Project initiated as joint effort of U.S. Department of Energy and the National Institutes of Health June 2000: Completion of a working draft of the entire human genome February 2001: Analyses of the working draft are published April 2003: HGP sequencing is completed and Project is declared finished two years ahead of schedule U.S. Department of Energy Genome Programs, Genomics and Its Impact on Science and Society, 2003
25
26 HapMap An NIH program to chart genetic variation within the human genome Begun in 2002, the project is a 3-year effort to construct a map of the patterns of SNPs (single nucleotide polymorphisms) that occur across populations in Africa, Asia, and the United States. Consortium of researchers from six countries Researchers hope that dramatically decreasing the number of individual SNPs to be scanned will provide a shortcut for identifying the DNA regions associated with common complex diseases Map may also be useful in understanding how genetic variation contributes to responses in environmental factors
27 What is the 1000 Genomes Project? International multi-center collaboration building on HapMap data to establish the most comprehensive catalogue of human genetic variation available Phase I: 1,092 complete genomes from 14 populations published in Nature, October 2012 Freely accessible public databases Final phase of project brings total genotyped to 2504 individuals from 26 populations worldwide
28
29
30 From Where?
31 Haplotype Reference Consortium
32 Take home points Many subjects From many populations Assayed for many variants Quality of reference haplotypes continues to improve Data are publicly available
33 QUALITY CONTROL
34 Quality Control As in ANY analysis, we want quality data Garbage in à garbage out So what here is unique? Mendelian inheritance Lab-based protocols Sample duplication for concordance Call rates Chromosomal anomalies Population genetics, e.g., Hardy-Weinberg Equilibrium testing Much research in this area, updated protocols
35 Hardy-Weinberg Equilibrium Allele and genotype frequencies remain constant over time, when Large population Random mating Sex-independent genotype frequencies No natural selection No migration No mutation No inbreeding Implications Can derive expected genotype frequencies from allele frequencies If these deviate from realized genotype frequencies, then one of the assumptions may not hold OR Genotyping error Association? OR
36
37
38 Modes of Inheritance
39 Coding Genotypes Assume a biallelic marker (SNP) A or G Each chromosome will have one of the two possible alleles FID IID A1 A A A A G G G A A
40 Mode of inheritance (MOI) A pattern of how a disease is transmitted in families Additive Carrier Carrier Phenotype value Dominant Recessive Non-carrier Carrier Carrier Carrier
41 Additive Mode b 1 Y b X
42 Dominant Mode Y X
43 Recessive Mode Y X
44 FID IID A1 A A A A G G G A A A: risk allele Additive Dominant Recessive FID IID G1 FID IID G1 FID IID G
45 Tests for Association
46 Tests for Association Discrete Traits Cochrane-Armitage Trend Test Alleles Test General RxC Contingency Table (Chi-square) Other Types Continuous Time-to-event Multivariate
47 Cochrane-Armitage Copies of Allele Case A 0 A 1 A 2 M 1 Control U 0 U 1 U 2 M 0 N 0 N 1 N 2 N c 1 2 = N[N(A 1 + 2A 2 ) - M 1 (N 1 + 2N 2 )] 2 M 1 (N - M 1 )[N(N 1 + 4N 2 ) - (N 1 + 2N 2 ) 2 ]
48 Alleles Test + - Case A 1 +2A 2 A 1 +2A 0 2M 1 Control U 1 +2U 2 U 1 +2U 0 2M 0 N 1 +2N 2 N 1 +2N 0 2N c 1 2 = 2N[2N(A 1 + 2A 2 ) - 2M 1 (N 1 + 2N 2 )] 2 2M 1 2(N - M 1 )[2N(N 1 + 2N 2 ) - (N 1 + 2N 2 ) 2 ] Note: Variance (denominator) assumes HWE!!!
49 General Chi-Square Copies of Allele Case A 0 A 1 A 2 M 1 Control U 0 U 1 U 2 M 0 N 0 N 1 N 2 N c 2 2 = [A 0 - E(A 0 )]2 E(A 0 ) + [A 1 - E(A 1 )]2 E(A 1 ) + [A 2 - E(A 2 )]2 E(A 2 ) + [U 0 - E(U 0 )]2 E(U 0 ) + [U 1 - E(U 1 )]2 E(U 1 ) + [U 2 - E(U 2 )]2 E(U 2 )
50 Logistic Regression æ p ln x ç è 1- p x ö = b 0 + b 1 X ø logit(p x ) = b 0 + b 1 X
51 Model Interpretation Additive model (X=0, 1 or 2) æ p ln x ö ç = b 0 + b 1 X è 1- p x ø ln(b 1 ) = one - unit increase Note: This is analogous to an odds ratio (OR) from a 2x3 table Genotype Model (indicator variables G i = 0 or 1) æ p ln x ö ç = b 0 + b 1 G 1 + b 2 G 2 è 1- p x ø Subjects with G 0 =1 are the reference group OR for subjects with G 2 compared to G 0 = e b 2
52 SNP SNP SNP SNP SNP E(y % ) = β * + β, X %. P-value or log p % 1 p % = β * + β, X %. P-value -log 10 (p-value) Top hitter Chromosomal position
53 Association take home points Many ways to seek out and test for a genetic association Mode of inheritance, while somewhat a misnomer in complex disease genetics, reflects our assumptions on how genotype influences phenotype We will focus largely on the flexible frameworks of linear and logistic regression
54 Population Stratification
55 Genetic Associations Truth Causal locus (direct) In LD with causal locus (indirect) Chance If you test 100 times, you ll see ~ 5 tests < 0.05 The association is due to chance - no causal underpinning Bias Association is not causal Yellow fingers associated with lung cancer e.g. Population stratification
56 Genetic Associations Truth Causal locus (direct) In LD with causal locus (indirect) Chance If you test 100 times, you ll see ~ 5 tests < 0.05 The association is due to chance - no causal underpinning Bias Association is not causal Yellow fingers associated with lung cancer e.g. Population stratification
57 Truth A genetic association test finds A Phenotype
58 Truth A genetic association test finds A Phenotype LD: highly correlated A Phenotype G?
59 Genetic Associations Truth Causal locus (direct) In LD with causal locus (indirect) Chance If you test 100 times, you ll see ~ 5 tests < 0.05 The association is due to chance - no causal underpinning Bias Association is not causal Yellow fingers associated with lung cancer e.g. Population stratification
60 Chance Each point represents the association for each locus. There are more than 6 million points. Genome wide significance level P = p = 0.05 = Manhattan plot: It is a scatter plot used to display the p-values in genome-wide association studies (GWAS)
61
62
63 Genetic Associations Truth Causal locus (direct) In LD with causal locus (indirect) Chance If you test 100 times, you ll see ~ 5 tests < 0.05 The association is due to chance - no causal underpinning Bias Association is not causal Yellow fingers associated with lung cancer e.g. Population stratification
64 Quantile-quantile (QQ) Plots Good way of seeing what s going on overall Any real hits? Any systematic problems? In GWAS, MOST SNPs will not be associated with whatever phenotype is examined, i.e., they are from the null distribution
65 Quantile-quantile (QQ) Plots
66 Quantile-quantile (QQ) Plots
67 Quantile-quantile (QQ) Plots
68 Bias A genetic association test finds A Phenotype
69 Bias A genetic association test finds A Spurious association Ancestry Phenotype True risk factor
70 Stratification Essentially a confounder! Yellow fingers associated with lung cancer How does it happen?
71 Famous Example Knowler et al (1988)
72 Cardon et al (2003)
73 Stratification Happens Historical strategies to deal with it Self-Reported Ancestry Match (design) or Adjust (analysis) Use other genetic markers (ancestry informative) Genomic Control STRUCTURE PCA/Eigenstrat Use a family-based design More later
74 Thank you! Questions?
75 Acknowledgements Harvard T. F. Chan School of Public Health Christoph Lange Pete Kraft University of Colorado Matt McQueen University of Kentucky Yuri Katsumata Jennifer Daddysman University of Florida Tyler Smith University of Washington Paul Crane Joey Mukherjee Vanderbilt University Tim Hohman Brigham Young University Keoni Kauwe National Institute of Aging R01 AG K25 AG R01 AG P30 AG R01 AG054060
1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics
1 Springer Nan M. Laird Christoph Lange The Fundamentals of Modern Statistical Genetics 1 Introduction to Statistical Genetics and Background in Molecular Genetics 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
More information(Genome-wide) association analysis
(Genome-wide) association analysis 1 Key concepts Mapping QTL by association relies on linkage disequilibrium in the population; LD can be caused by close linkage between a QTL and marker (= good) or by
More informationUNIT 8 BIOLOGY: Meiosis and Heredity Page 148
UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular
More informationChapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)
12/5/14 Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination) Linkage Disequilibrium Genealogical Interpretation of LD Association Mapping 1 Linkage and Recombination v linkage equilibrium ²
More informationLecture 2: Genetic Association Testing with Quantitative Traits. Summer Institute in Statistical Genetics 2017
Lecture 2: Genetic Association Testing with Quantitative Traits Instructors: Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 29 Introduction to Quantitative Trait Mapping
More informationProportional Variance Explained by QLT and Statistical Power. Proportional Variance Explained by QTL and Statistical Power
Proportional Variance Explained by QTL and Statistical Power Partitioning the Genetic Variance We previously focused on obtaining variance components of a quantitative trait to determine the proportion
More informationPopulation Genetics I. Bio
Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn
More informationBinomial Mixture Model-based Association Tests under Genetic Heterogeneity
Binomial Mixture Model-based Association Tests under Genetic Heterogeneity Hui Zhou, Wei Pan Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 April 30,
More informationLecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency
Lecture 1 Hardy-Weinberg equilibrium and key forces affecting gene frequency Bruce Walsh lecture notes Introduction to Quantitative Genetics SISG, Seattle 16 18 July 2018 1 Outline Genetics of complex
More informationPopulation Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda
1 Population Genetics with implications for Linkage Disequilibrium Chiara Sabatti, Human Genetics 6357a Gonda csabatti@mednet.ucla.edu 2 Hardy-Weinberg Hypotheses: infinite populations; no inbreeding;
More informationLecture WS Evolutionary Genetics Part I 1
Quantitative genetics Quantitative genetics is the study of the inheritance of quantitative/continuous phenotypic traits, like human height and body size, grain colour in winter wheat or beak depth in
More informationAssociation Testing with Quantitative Traits: Common and Rare Variants. Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5
Association Testing with Quantitative Traits: Common and Rare Variants Timothy Thornton and Katie Kerr Summer Institute in Statistical Genetics 2014 Module 10 Lecture 5 1 / 41 Introduction to Quantitative
More informationLinear Regression (1/1/17)
STA613/CBB540: Statistical methods in computational biology Linear Regression (1/1/17) Lecturer: Barbara Engelhardt Scribe: Ethan Hada 1. Linear regression 1.1. Linear regression basics. Linear regression
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 007 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results
More informationEXERCISES FOR CHAPTER 3. Exercise 3.2. Why is the random mating theorem so important?
Statistical Genetics Agronomy 65 W. E. Nyquist March 004 EXERCISES FOR CHAPTER 3 Exercise 3.. a. Define random mating. b. Discuss what random mating as defined in (a) above means in a single infinite population
More informationIntroduction to Linkage Disequilibrium
Introduction to September 10, 2014 Suppose we have two genes on a single chromosome gene A and gene B such that each gene has only two alleles Aalleles : A 1 and A 2 Balleles : B 1 and B 2 Suppose we have
More informationThe Quantitative TDT
The Quantitative TDT (Quantitative Transmission Disequilibrium Test) Warren J. Ewens NUS, Singapore 10 June, 2009 The initial aim of the (QUALITATIVE) TDT was to test for linkage between a marker locus
More informationLinkage and Linkage Disequilibrium
Linkage and Linkage Disequilibrium Summer Institute in Statistical Genetics 2014 Module 10 Topic 3 Linkage in a simple genetic cross Linkage In the early 1900 s Bateson and Punnet conducted genetic studies
More informationNature Genetics: doi: /ng Supplementary Figure 1. Number of cases and proxy cases required to detect association at designs.
Supplementary Figure 1 Number of cases and proxy cases required to detect association at designs. = 5 10 8 for case control and proxy case control The ratio of controls to cases (or proxy cases) is 1.
More informationF1 Parent Cell R R. Name Period. Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes
Name Period Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes 1. What is the chromosome theory of inheritance? 2. Explain the law of segregation. Use two different
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 009 Population genetics Outline of lectures 3-6 1. We want to know what theory says about the reproduction of genotypes in a population. This results
More information1.5.1 ESTIMATION OF HAPLOTYPE FREQUENCIES:
.5. ESTIMATION OF HAPLOTYPE FREQUENCIES: Chapter - 8 For SNPs, alleles A j,b j at locus j there are 4 haplotypes: A A, A B, B A and B B frequencies q,q,q 3,q 4. Assume HWE at haplotype level. Only the
More informationProbability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies
Probability of Detecting Disease-Associated SNPs in Case-Control Genome-Wide Association Studies Ruth Pfeiffer, Ph.D. Mitchell Gail Biostatistics Branch Division of Cancer Epidemiology&Genetics National
More informationProblems for 3505 (2011)
Problems for 505 (2011) 1. In the simplex of genotype distributions x + y + z = 1, for two alleles, the Hardy- Weinberg distributions x = p 2, y = 2pq, z = q 2 (p + q = 1) are characterized by y 2 = 4xz.
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary
More informationLecture 9. QTL Mapping 2: Outbred Populations
Lecture 9 QTL Mapping 2: Outbred Populations Bruce Walsh. Aug 2004. Royal Veterinary and Agricultural University, Denmark The major difference between QTL analysis using inbred-line crosses vs. outbred
More informationUnit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February 5 th /6 th
Name: Date: Block: Chapter 6 Meiosis and Mendel Section 6.1 Chromosomes and Meiosis 1. How do gametes differ from somatic cells? Unit 6 Reading Guide: PART I Biology Part I Due: Monday/Tuesday, February
More informationQuestion: If mating occurs at random in the population, what will the frequencies of A 1 and A 2 be in the next generation?
October 12, 2009 Bioe 109 Fall 2009 Lecture 8 Microevolution 1 - selection The Hardy-Weinberg-Castle Equilibrium - consider a single locus with two alleles A 1 and A 2. - three genotypes are thus possible:
More informationComputational Systems Biology: Biology X
Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#5:(Mar-21-2010) Genome Wide Association Studies 1 Experiments on Garden Peas Statistical Significance 2 The law of causality...
More informationThe phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.
Series 1: Cross Diagrams There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome: When both
More information- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.
February 8, 2005 Bio 107/207 Winter 2005 Lecture 11 Mutation and transposable elements - the term mutation has an interesting history. - as far back as the 17th century, it was used to describe any drastic
More informationSNP Association Studies with Case-Parent Trios
SNP Association Studies with Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health September 3, 2009 Population-based Association Studies Balding (2006). Nature
More informationThe E-M Algorithm in Genetics. Biostatistics 666 Lecture 8
The E-M Algorithm in Genetics Biostatistics 666 Lecture 8 Maximum Likelihood Estimation of Allele Frequencies Find parameter estimates which make observed data most likely General approach, as long as
More informationBTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014
BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014 Homework 4 (version 3) - posted October 3 Assigned October 2; Due 11:59PM October 9 Problem 1 (Easy) a. For the genetic regression model: Y
More informationGenetics (patterns of inheritance)
MENDELIAN GENETICS branch of biology that studies how genetic characteristics are inherited MENDELIAN GENETICS Gregory Mendel, an Augustinian monk (1822-1884), was the first who systematically studied
More informationTutorial Session 2. MCMC for the analysis of genetic data on pedigrees:
MCMC for the analysis of genetic data on pedigrees: Tutorial Session 2 Elizabeth Thompson University of Washington Genetic mapping and linkage lod scores Monte Carlo likelihood and likelihood ratio estimation
More informationConstructing a Pedigree
Constructing a Pedigree Use the appropriate symbols: Unaffected Male Unaffected Female Affected Male Affected Female Male carrier of trait Mating of Offspring 2. Label each generation down the left hand
More informationTheoretical and computational aspects of association tests: application in case-control genome-wide association studies.
Theoretical and computational aspects of association tests: application in case-control genome-wide association studies Mathieu Emily November 18, 2014 Caen mathieu.emily@agrocampus-ouest.fr - Agrocampus
More information2. Map genetic distance between markers
Chapter 5. Linkage Analysis Linkage is an important tool for the mapping of genetic loci and a method for mapping disease loci. With the availability of numerous DNA markers throughout the human genome,
More informationLearning Your Identity and Disease from Research Papers: Information Leaks in Genome-Wide Association Study
Learning Your Identity and Disease from Research Papers: Information Leaks in Genome-Wide Association Study Rui Wang, Yong Li, XiaoFeng Wang, Haixu Tang and Xiaoyong Zhou Indiana University at Bloomington
More informationStatistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014
Overview - 1 Statistical Genetics I: STAT/BIOST 550 Spring Quarter, 2014 Elizabeth Thompson University of Washington Seattle, WA, USA MWF 8:30-9:20; THO 211 Web page: www.stat.washington.edu/ thompson/stat550/
More informationp(d g A,g B )p(g B ), g B
Supplementary Note Marginal effects for two-locus models Here we derive the marginal effect size of the three models given in Figure 1 of the main text. For each model we assume the two loci (A and B)
More informationNotes on Population Genetics
Notes on Population Genetics Graham Coop 1 1 Department of Evolution and Ecology & Center for Population Biology, University of California, Davis. To whom correspondence should be addressed: gmcoop@ucdavis.edu
More informationSNP-SNP Interactions in Case-Parent Trios
Detection of SNP-SNP Interactions in Case-Parent Trios Department of Biostatistics Johns Hopkins Bloomberg School of Public Health June 2, 2009 Karyotypes http://ghr.nlm.nih.gov/ Single Nucleotide Polymphisms
More informationEVOLUTION ALGEBRA Hartl-Clark and Ayala-Kiger
EVOLUTION ALGEBRA Hartl-Clark and Ayala-Kiger Freshman Seminar University of California, Irvine Bernard Russo University of California, Irvine Winter 2015 Bernard Russo (UCI) EVOLUTION ALGEBRA 1 / 10 Hartl
More informationProcesses of Evolution
15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection
More informationGENETICS - CLUTCH CH.1 INTRODUCTION TO GENETICS.
!! www.clutchprep.com CONCEPT: HISTORY OF GENETICS The earliest use of genetics was through of plants and animals (8000-1000 B.C.) Selective breeding (artificial selection) is the process of breeding organisms
More informationOn the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease
On the limiting distribution of the likelihood ratio test in nucleotide mapping of complex disease Yuehua Cui 1 and Dong-Yun Kim 2 1 Department of Statistics and Probability, Michigan State University,
More informationComputational Systems Biology: Biology X
Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture16: Population structure and logistic regression I Jason Mezey jgm45@cornell.edu April 11, 2017 (T) 8:40-9:55 Announcements I April
More informationDropping Your Genes. A Simulation of Meiosis and Fertilization and An Introduction to Probability
Dropping Your Genes A Simulation of Meiosis and Fertilization and An Introduction to To fully understand Mendelian genetics (and, eventually, population genetics), you need to understand certain aspects
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans and Stacey Cherny University of Oxford Wellcome Trust Centre for Human Genetics This Session IBD vs IBS Why is IBD important? Calculating IBD probabilities
More informationExpression QTLs and Mapping of Complex Trait Loci. Paul Schliekelman Statistics Department University of Georgia
Expression QTLs and Mapping of Complex Trait Loci Paul Schliekelman Statistics Department University of Georgia Definitions: Genes, Loci and Alleles A gene codes for a protein. Proteins due everything.
More informationOutline for today s lecture (Ch. 14, Part I)
Outline for today s lecture (Ch. 14, Part I) Ploidy vs. DNA content The basis of heredity ca. 1850s Mendel s Experiments and Theory Law of Segregation Law of Independent Assortment Introduction to Probability
More informationSolutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin
Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele
More informationWhen one gene is wild type and the other mutant:
Series 2: Cross Diagrams Linkage Analysis There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome:
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 18: Introduction to covariates, the QQ plot, and population structure II + minimal GWAS steps Jason Mezey jgm45@cornell.edu April
More informationAEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity,
AEC 550 Conservation Genetics Lecture #2 Probability, Random mating, HW Expectations, & Genetic Diversity, Today: Review Probability in Populatin Genetics Review basic statistics Population Definition
More informationCase-Control Association Testing. Case-Control Association Testing
Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits. Technological advances have made it feasible to perform case-control association studies
More informationGenotype Imputation. Biostatistics 666
Genotype Imputation Biostatistics 666 Previously Hidden Markov Models for Relative Pairs Linkage analysis using affected sibling pairs Estimation of pairwise relationships Identity-by-Descent Relatives
More informationObjectives. Announcements. Comparison of mitosis and meiosis
Announcements Colloquium sessions for which you can get credit posted on web site: Feb 20, 27 Mar 6, 13, 20 Apr 17, 24 May 15. Review study CD that came with text for lab this week (especially mitosis
More informationBreeding Values and Inbreeding. Breeding Values and Inbreeding
Breeding Values and Inbreeding Genotypic Values For the bi-allelic single locus case, we previously defined the mean genotypic (or equivalently the mean phenotypic values) to be a if genotype is A 2 A
More informationMicroevolution Changing Allele Frequencies
Microevolution Changing Allele Frequencies Evolution Evolution is defined as a change in the inherited characteristics of biological populations over successive generations. Microevolution involves the
More informationNIH Public Access Author Manuscript Stat Sin. Author manuscript; available in PMC 2013 August 15.
NIH Public Access Author Manuscript Published in final edited form as: Stat Sin. 2012 ; 22: 1041 1074. ON MODEL SELECTION STRATEGIES TO IDENTIFY GENES UNDERLYING BINARY TRAITS USING GENOME-WIDE ASSOCIATION
More informationNatural Selection. Population Dynamics. The Origins of Genetic Variation. The Origins of Genetic Variation. Intergenerational Mutation Rate
Natural Selection Population Dynamics Humans, Sickle-cell Disease, and Malaria How does a population of humans become resistant to malaria? Overproduction Environmental pressure/competition Pre-existing
More informationLife Cycles, Meiosis and Genetic Variability24/02/2015 2:26 PM
Life Cycles, Meiosis and Genetic Variability iclicker: 1. A chromosome just before mitosis contains two double stranded DNA molecules. 2. This replicated chromosome contains DNA from only one of your parents
More informationTOPICS IN STATISTICAL METHODS FOR HUMAN GENE MAPPING
TOPICS IN STATISTICAL METHODS FOR HUMAN GENE MAPPING by Chia-Ling Kuo MS, Biostatstics, National Taiwan University, Taipei, Taiwan, 003 BBA, Statistics, National Chengchi University, Taipei, Taiwan, 001
More informationLesson 4: Understanding Genetics
Lesson 4: Understanding Genetics 1 Terms Alleles Chromosome Co dominance Crossover Deoxyribonucleic acid DNA Dominant Genetic code Genome Genotype Heredity Heritability Heritability estimate Heterozygous
More informationLab 12. Linkage Disequilibrium. November 28, 2012
Lab 12. Linkage Disequilibrium November 28, 2012 Goals 1. Es
More informationBiology. Revisiting Booklet. 6. Inheritance, Variation and Evolution. Name:
Biology 6. Inheritance, Variation and Evolution Revisiting Booklet Name: Reproduction Name the process by which body cells divide:... What kind of cells are produced this way? Name the process by which
More informationCalculation of IBD probabilities
Calculation of IBD probabilities David Evans University of Bristol This Session Identity by Descent (IBD) vs Identity by state (IBS) Why is IBD important? Calculating IBD probabilities Lander-Green Algorithm
More informationQ Expected Coverage Achievement Merit Excellence. Punnett square completed with correct gametes and F2.
NCEA Level 2 Biology (91157) 2018 page 1 of 6 Assessment Schedule 2018 Biology: Demonstrate understanding of genetic variation and change (91157) Evidence Q Expected Coverage Achievement Merit Excellence
More informationBiology 211 (1) Exam 4! Chapter 12!
Biology 211 (1) Exam 4 Chapter 12 1. Why does replication occurs in an uncondensed state? 1. 2. A is a single strand of DNA. When DNA is added to associated protein molecules, it is referred to as. 3.
More informationLecture 1: Case-Control Association Testing. Summer Institute in Statistical Genetics 2015
Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2015 1 / 1 Introduction Association mapping is now routinely being used to identify loci that are involved with complex traits.
More informationBayesian Inference of Interactions and Associations
Bayesian Inference of Interactions and Associations Jun Liu Department of Statistics Harvard University http://www.fas.harvard.edu/~junliu Based on collaborations with Yu Zhang, Jing Zhang, Yuan Yuan,
More informationNOTES CH 17 Evolution of. Populations
NOTES CH 17 Evolution of Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Populations 17.1 Genes & Variation Darwin
More informationPatterns of inheritance
Patterns of inheritance Learning goals By the end of this material you would have learnt about: How traits and characteristics are passed on from one generation to another The different patterns of inheritance
More informationGenetics and Natural Selection
Genetics and Natural Selection Darwin did not have an understanding of the mechanisms of inheritance and thus did not understand how natural selection would alter the patterns of inheritance in a population.
More informationEffective population size and patterns of molecular evolution and variation
FunDamental concepts in genetics Effective population size and patterns of molecular evolution and variation Brian Charlesworth Abstract The effective size of a population,, determines the rate of change
More informationIntroduction to population genetics & evolution
Introduction to population genetics & evolution Course Organization Exam dates: Feb 19 March 1st Has everybody registered? Did you get the email with the exam schedule Summer seminar: Hot topics in Bioinformatics
More informationHomework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:
Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships
More informationConcept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes
r Chapter 15: The Chromosomal Basis of Inheritance Name Period Chapter 15: The Chromosomal Basis of Inheritance Concept 15.1 Mendelian inheritance has its physical basis in the behavior of chromosomes
More informationA. Correct! Genetically a female is XX, and has 22 pairs of autosomes.
MCAT Biology - Problem Drill 08: Meiosis and Genetic Variability Question No. 1 of 10 1. A human female has pairs of autosomes and her sex chromosomes are. Question #01 (A) 22, XX. (B) 23, X. (C) 23, XX.
More informationThe phenotype of this worm is wild type. When both genes are mutant: The phenotype of this worm is double mutant Dpy and Unc phenotype.
Series 2: Cross Diagrams - Complementation There are two alleles for each trait in a diploid organism In C. elegans gene symbols are ALWAYS italicized. To represent two different genes on the same chromosome:
More informationOutline of lectures 3-6
GENOME 453 J. Felsenstein Evolutionary Genetics Autumn, 013 Population genetics Outline of lectures 3-6 1. We ant to kno hat theory says about the reproduction of genotypes in a population. This results
More informationIntroduction to Genetics
Introduction to Genetics The Work of Gregor Mendel B.1.21, B.1.22, B.1.29 Genetic Inheritance Heredity: the transmission of characteristics from parent to offspring The study of heredity in biology is
More informationGSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION
FIFTH EDITION IV I ^HHk ^ttm IZTI/^Q i I II MPHBBMWBBIHB '-llwmpbi^hbwm^^pfc ' GSBHSRSBRSRRk LlML I I \l 1MB ^HP'^^MMMP" jflp^^^^^^^^st I Iv^O FROM GENES TO GENOMES %^MiM^PM^^MWi99Mi$9i0^^ ^^^^^^^^^^^^^V^^^fii^^t^i^^^^^
More informationLECTURE # How does one test whether a population is in the HW equilibrium? (i) try the following example: Genotype Observed AA 50 Aa 0 aa 50
LECTURE #10 A. The Hardy-Weinberg Equilibrium 1. From the definitions of p and q, and of p 2, 2pq, and q 2, an equilibrium is indicated (p + q) 2 = p 2 + 2pq + q 2 : if p and q remain constant, and if
More informationHow robust are the predictions of the W-F Model?
How robust are the predictions of the W-F Model? As simplistic as the Wright-Fisher model may be, it accurately describes the behavior of many other models incorporating additional complexity. Many population
More informationThe Chromosomal Basis of Inheritance
The Chromosomal Basis of Inheritance Mitosis and meiosis were first described in the late 800s. The chromosome theory of inheritance states: Mendelian genes have specific loci (positions) on chromosomes.
More informationName: Period: EOC Review Part F Outline
Name: Period: EOC Review Part F Outline Mitosis and Meiosis SC.912.L.16.17 Compare and contrast mitosis and meiosis and relate to the processes of sexual and asexual reproduction and their consequences
More informationStatistical Analysis of Haplotypes, Untyped SNPs, and CNVs in Genome-Wide Association Studies
Statistical Analysis of Haplotypes, Untyped SNPs, and CNVs in Genome-Wide Association Studies by Yijuan Hu A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in
More informationSolutions to Problem Set 4
Question 1 Solutions to 7.014 Problem Set 4 Because you have not read much scientific literature, you decide to study the genetics of garden peas. You have two pure breeding pea strains. One that is tall
More informationX Chromosome Association Testing in Genome-Wide Association Studies
X Chromosome Association Testing in Genome-Wide Association Studies Honours Thesis November 6, 009 Peter Hickey Department of Mathematics and Statistics, The University of Melbourne Bioinformatics Division,
More informationOutline. P o purple % x white & white % x purple& F 1 all purple all purple. F purple, 224 white 781 purple, 263 white
Outline - segregation of alleles in single trait crosses - independent assortment of alleles - using probability to predict outcomes - statistical analysis of hypotheses - conditional probability in multi-generation
More informationLecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012
Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012 Last Time Sequence data and quantification of variation Infinite sites model Nucleotide diversity (π) Sequence-based
More informationEvolutionary Genetics Midterm 2008
Student # Signature The Rules: (1) Before you start, make sure you ve got all six pages of the exam, and write your name legibly on each page. P1: /10 P2: /10 P3: /12 P4: /18 P5: /23 P6: /12 TOT: /85 (2)
More informationUnit 3 - Molecular Biology & Genetics - Review Packet
Name Date Hour Unit 3 - Molecular Biology & Genetics - Review Packet True / False Questions - Indicate True or False for the following statements. 1. Eye color, hair color and the shape of your ears can
More informationHeterozygosity is variance. How Drift Affects Heterozygosity. Decay of heterozygosity in Buri s two experiments
eterozygosity is variance ow Drift Affects eterozygosity Alan R Rogers September 17, 2014 Assumptions Random mating Allele A has frequency p N diploid individuals Let X 0,1, or 2) be the number of copies
More informationCh 11.Introduction to Genetics.Biology.Landis
Nom Section 11 1 The Work of Gregor Mendel (pages 263 266) This section describes how Gregor Mendel studied the inheritance of traits in garden peas and what his conclusions were. Introduction (page 263)
More information