Introduction to Advanced Population Genetics

Similar documents
Genetic Drift in Human Evolution

Population Genetics I. Bio

Mathematical models in population genetics II

Processes of Evolution

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009

Estimating Evolutionary Trees. Phylogenetic Methods

Frequency Spectra and Inference in Population Genetics

Classical Selection, Balancing Selection, and Neutral Mutations

1. Understand the methods for analyzing population structure in genomes

Big Idea #1: The process of evolution drives the diversity and unity of life

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

Coalescent based demographic inference. Daniel Wegmann University of Fribourg

Selection and Population Genetics

Learning ancestral genetic processes using nonparametric Bayesian models

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Enduring Understanding: Change in the genetic makeup of a population over time is evolution Pearson Education, Inc.

Notes for MCTP Week 2, 2014

Evolutionary Genetics: Part 0.2 Introduction to Population genetics

Introduction to population genetics & evolution

e.g. population: 500, two alleles: Red (R) and White (r). Total: 1000 genes for flower color in the population

The Wright-Fisher Model and Genetic Drift

Supporting Information Text S1

Supporting Information

Neutral Theory of Molecular Evolution

Chapter 6 Linkage Disequilibrium & Gene Mapping (Recombination)

Diffusion Models in Population Genetics

Mechanisms of Evolution Microevolution. Key Concepts. Population Genetics

NOTES CH 17 Evolution of. Populations

AP Biology Evolution Review Slides

THE EVOLUTION OF POPULATIONS THE EVOLUTION OF POPULATIONS

Population Structure

Endowed with an Extra Sense : Mathematics and Evolution

6 Introduction to Population Genetics

Unit 7: Evolution Guided Reading Questions (80 pts total)

Demography April 10, 2015

CSci 8980: Advanced Topics in Graphical Models Analysis of Genetic Variation

The Mechanisms of Evolution

Effective population size and patterns of molecular evolution and variation

6 Introduction to Population Genetics

Effect of Genetic Divergence in Identifying Ancestral Origin using HAPAA

Ecology and Evolutionary Biology 2245/2245W Exam 3 April 5, 2012

Notes on Population Genetics

p(d g A,g B )p(g B ), g B

How robust are the predictions of the W-F Model?

Population Genetics & Evolution

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

List the five conditions that can disturb genetic equilibrium in a population.(10)

Evolution Problem Drill 10: Human Evolution

1. What is the definition of Evolution? a. Descent with modification b. Changes in the heritable traits present in a population over time c.

Evolution AP Biology

Quantitative Trait Variation

Introduction to Natural Selection. Ryan Hernandez Tim O Connor

Genetic Variation in Finite Populations

Breeding Values and Inbreeding. Breeding Values and Inbreeding

19. Genetic Drift. The biological context. There are four basic consequences of genetic drift:

Chapter 16: Evolutionary Theory

- point mutations in most non-coding DNA sites likely are likely neutral in their phenotypic effects.

Review of molecular biology

MULTIPLE CHOICE QUESTIONS

Evolution of Populations. Chapter 17

Observation: we continue to observe large amounts of genetic variation in natural populations

Introduction to Evolution

Lecture 18 : Ewens sampling formula

Gene Pool Genetic Drift Geographic Isolation Fitness Hardy-Weinberg Equilibrium Natural Selection

1 Errors in mitosis and meiosis can result in chromosomal abnormalities.

Chapters Objectives

This course is about VARIATION: its causes, effects, and history.

Mechanisms of Evolution. Adaptations. Old Ideas about Evolution. Behavioral. Structural. Biochemical. Physiological

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

1.1: Natural selection is a major mechanism of evolution 1. NATURAL SELECTION

AUTHORIZATION TO LEND AND REPRODUCE THE THESIS. Date Jong Wha Joanne Joo, Author

Chapter 16. Table of Contents. Section 1 Genetic Equilibrium. Section 2 Disruption of Genetic Equilibrium. Section 3 Formation of Species

Robust demographic inference from genomic and SNP data

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

Darwinian Selection. Chapter 7 Selection I 12/5/14. v evolution vs. natural selection? v evolution. v natural selection

Evolu&on, Popula&on Gene&cs, and Natural Selec&on Computa.onal Genomics Seyoung Kim

Detecting selection from differentiation between populations: the FLK and hapflk approach.

Population Genetics. with implications for Linkage Disequilibrium. Chiara Sabatti, Human Genetics 6357a Gonda

Evidence of Evolution

D. Incorrect! That is what a phylogenetic tree intends to depict.

Chapter 02 Population Genetics

Modern Evolutionary Classification. Section 18-2 pgs

Processes of Evolution

Genetics and Natural Selection

How to Use This Presentation

Processes of Evolution

Linkage and Linkage Disequilibrium

Levels of genetic variation for a single gene, multiple genes or an entire genome

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.

Genes Within Populations

The genomes of recombinant inbred lines

AP Biology Review Chapters Review Questions Chapter 15: Darwin Chapter 16-17: Evolution

Computational Approaches to Statistical Genetics

Primate Diversity & Human Evolution (Outline)

Darwin s Observations & Conclusions The Struggle for Existence

EVOLUTION UNIT. 3. Unlike his predecessors, Darwin proposed a mechanism by which evolution could occur called.

CONGEN Population structure and evolutionary histories

Figure S1: The model underlying our inference of the age of ancient genomes

Transcription:

Introduction to Advanced Population Genetics

Learning Objectives Describe the basic model of human evolutionary history Describe the key evolutionary forces How demography can influence the site frequency spectrum Be able to interpret a site frequency spectrum Understand how the SFS is affected by evolutionary forces

Review: What are the assumptions of Hardy-Weinberg? 1)There must be no mutation 2)There must be no migration 3)Individuals must mate at random with respect to genotype 4)There must be no selection 5)The population must be infinitely large How do these affect allele frequencies?

Drift, mutation, migration, and Selection Coefficient 0.10000 selection Natural Selection 0.01000 0.00100 0 1000 2000 3000 4000 1/(2N e ) N e = 500 0.00010 0.00001 Genetic Drift

Genetic drift: Serial founder effect

Out of Africa Model We now have an excellent road map of how humans evolved in Africa and migrated to populate the rest of the earth. Africa Europe Americas Asia North Australia

Heterozygosity is correlated with distance from East Africa Ramachandran et al. 2005 PNAS

Mutation: How often do mutations arise? Table 1. Estimated per generation mutation rates in mice Homozygous Heterozygous Final generation Overall No. Rate ( 10 9 ) (95% CI) No. Rate ( 10 9 ) (95% CI) Rate ( 10 9 ) Rate (95% CI) SNV cona 63.3 a 3.4 (2.6 4.4) a 101.5 5.7 (4.4 7.3) 6.9 5.4 10 9 (4.6 6.5, 10 9 ) conb 117.5 a 5.1 (4.3 6.2) a 92.7 5.2 (4.0 6.7) 6.8 mutc 1304 84.3 (76.1 94.5) 1944 110.6 (94.0 132.5) 150 9.4 10 8 (9.0 9.8, 10 8 ) mutd 1472 86.9 (79.1 96.6) 1633 92.3 (78.6 110.5) 90 Indel cona 6.7 a 0.57 (0.22 1.20) a 4 0.35 (0.10 0.91) 3.1 10 10 (1.2 6.4, 10 10 ) conb 4 a 0.28 (0.08 0.71) a 3 0.26 (0.05 0.77) mutc 28 2.9 (1.9 4.1) 28 2.5 (1.7 3.6) 2.7 10 9 (2.2 3.2, 10 9 ) mutd 21 2.0 (1.2 3.0) 37 3.3 (2.3 4.5) Mutation rates per nucleotide per generation were estimated using the number of homozygous or heterozygous de novo mutations in cona, conb, mutc, and mutd. The estimates for SNVs were validated by counting newly arisen mutations in the final generation. The number of de novo mutations in cona and conb was partly adjusted for the frequency of true de novo variants; 95% confidence intervals (CI) were calculated by computer simulation or Poisson distribution error analysis of the number of mutations (details in Supplemental Methods). a Note that homozygous variant numbers in control lines were uncertain due to the low ability to discriminate between de novo and initial variants; these values were not used in the estimates for the overall rate. Uchimura et al. (2015) Genome Research

Mutation is a major source of phenotypic variation Uchimura et al. (2015) Genome Research Figure 2. Frequency of visible phenotypic anomalies in breeding lines. Frequency of visible anomalies in each successive generation. Circles indicate observed frequencies with 90% CI, determined by Fisher s exact test. Since fewer than 20 mice were screened in the early-generation (fewer than seven generations) populations of control mice, mean phenotypic frequencies are shown for generations 0 3 and 4 6. Solid lines show the fit with a binomial linear model.

Mutation: How often do mutations arise in Humans? study loci considered per-generation mean mutation rate (10-8 bp -1 generation -1 ) yearly mean mutation rate (10-9 bp -1 y -1 ) t gen = 30 y t gen = 25 y Kondrashov (2003) disease 1.85 (0.00 3.65) 0.62 (0.00 1.22) 0.74 (0.00 1.46) Lynch (2010) disease 1.28 (0.68 1.88) 0.42 (0.23 0.63) 0.51 (0.27 0.75) Roach et al. (2010) WG 1.10 (0.68 1.70) 0.37 (0.23 0.57) 0.44 (0.27 0.68) Awadalla et al. (2010) WG 1.36 (0.34 2.72) 0.45 (0.11 0.91) 0.54 (0.14 1.09) 1000 Genomes Project WG 1.17 (0.94 1.73) 0.39 (0.31 0.57) 0.47 (0.38 0.69) (2010), CEU 1000 Genomes Project WG 0.97 (0.72 1.44) 0.32 (0.24 0.48) 0.39 (0.29 0.58) (2010), YRI Sanders et al. (2012) exome 1.28 (1.05 1.50) 0.43 (0.35 0.50) 0.51 (0.42 0.60) O Roak et al. (2012) exome 1.57 (1.05 2.26) 0.52 (0.35 0.75) 0.63 (0.42 0.90) Kong et al. (2012) WG 1.20 0.40 0.48 Scally and Durbin (2012) Nature Rev. Genet.

What are the effects of paternal age on mutation rate? Kong et al. (2012) Nature

Evolutionary time effects on the mutation rate a b Base-substitution mutation rate per nucleotide site per generation (u) 10 8 10 9 10 10 10 11 23 21 35 20 24 22 Lynch et al. (2016) Nature Rev. Genet. 26 32 34 27 29 17 18 Base-substitution mutation rate per nucleotide site per generation (u) 10 0 10 1 10 2 10 3 10 4 10 4 10 5 10 6 10 7 10 8 10 9 enome size ( 4 2 15 6 3 13 1 12 5 7 10 8 9 11 10 8 10 9 10 10 10 11 8 11 10 9 16 2 3 4 5 26 6 7 12 19 17 18 13 25 32 20 21 24 30 33 22 34 29 15 Effective population size (N e ) Eubacteria Unicellular eukaryotes Multicellular eukaryotes

Fu et al. (2013) Nature When did most variation arise?

Most SNVs are very rare 57% singletons 72% singletons+doubletons+tripletons Tennessen et al. (2012) Science

How has our population size grown? Tennessen et al. (2012) Science

Most SNVs are population specific A. AA (n = 1,089) EA (n = 1,351) 177,419 91,703 234,359 Proportion Proportion of Population-specific Private SNVs SNVs 1.00 0.75 0.50 0.25 0.00 AA EA 0.5 0.5<x 1 >1 503,481 SNVs MAF(%) Tennessen et al. (2012) Science

Figure 4. Haplotype Sharing between Breeds from Different Phylogenetic Clades The circos plot is ordered and colored to match the tree in Figure 1. Ribbons connecting breeds indicate a median haplotype sharing between all dogs of each breed in excess of 95% of all haplotype sharing across clades. Definitions of the breed abbreviations can be found in Table S1.

Mathias et al. (2016) Nature Comm. Migration: Admixture is migration between diverged populations

100% - 80% - ]$ 60% - 40% - 20% - 0% - Estimates of global ancestry CAAPA Nigeria African American CAAPA Reference Colombian Barbados Brazil Dominican Republic Honduras Jamaica PuertoRico ReferencePopula0ons European (CEU) African (YRI) n ) e n n ) Native American Mathias et al. 2016 Nature Comm.

Local ancestry of a single individual C D Bryc et al. 2009 PNAS

Ancient admixture: Neanderthals are still among us Recent genetic data suggests that 1-4% of non-african genomes are derived from Neanderthals

Neanderthals are still among us

Adaptive (Darwinian) Selection I have called this principle, by which each slight variation, if useful, is preserved, by the term Natural Selection. Charles Darwin from "The Origin of Species, 1859

Antibiotic resistance is an example of adaptive evolution Initial Population After Antibiotic Final Population G Not Resistant T Resistant

Reading the genome for signatures of positive selection Neutral Advantageous Mutation Fixation This process imparts signatures on patterns of genetic variation that we can use to find adaptively evolving genes

Genes that influence physical traits have been targets of recent selection Eye Color Skin Pigmentation Hair Texture HERC2 SLC24A5, OCA2, TYRP1 EDAR

Mio et al. (2017) Molecular Biology and Evolution FIG. 3. Evidence of introgression in the EPAS1 locus. (A) The distribution of almost fixed ABBA/BA significantly excessive ABBA sites (D ¼ 0.955 6 0.02) and significant Fst between TM and YJD common haplotypes in the locus. The haplotypes are defined from 29 representative SNPs across Non-independence of evolutionary forces: Adaptive-migration (introgression) Miao et al.. doi:10.1093/molbev/msw274 B MBE D A C B D

Outcrossing Selfing Non-independence of evolutionary forces: Drift (Selfing) and Selection Hartfield et al. (2017) Trends in Genetics Later common ancestor with recombina on: more neutral grnralogy Earlier common ancestor without recombina on: star like genealogy Recombina on occurs at rate r eff (1 p t ) Mean trait value Outcrossing Genera ons of selec on Selfing Fixed popula on size Time Frequency of beneficial variant (gray shading)

These forces all affect the Site Frequency Spectrum (SFS)

Primer on coalescent

E(T i )= 2 i(i 1) Primer on coalescent To generate a3 genealogy4of i genes under Kingman s coalescent: 2 3 4 V ar(t i )= Fewer lineage means longer expected time to coalescence. 2 i(i 1) Draw an observation from an exponential distribution with mean µ =2/(i(i 1)). This will be the time of the first coalescent event (looking from the present backwards in time). Pick two lineages at random to coalescence. Decrease i by 1. If i =1, stop. Otherwise, repeat these steps [8, 9]. 3 4 2 ns longer expected time to coalescence

Site Frequency Spectrum (SFS)

Joint Site Frequency Spectrum (JSFS)

Genetic Drift can profoundly shape allele frequencies Gutenkunst et al. 2009 PLoS Genet.

Useful equations Time: t = T/(4*N ref *Gen) N ref = reference or ancestral population size Gen = number of years per generation T = chronological years θ = 4*N ref *μ*length; μ = mutation rate Length is the bp of the segment simulated (aka nsites for recombination) Growth: N(t) = N(0)e tα Recombination: ρ = 4N ref r r is the recombination rate between the ends of a unit length sequence Migration: M ij = 4N ref m ij m ij is the fraction of subpopulation i that is made up of migrants from subpopulation j in forward time.

Concluding Summary Four main evolutionary forces are: Mutation, migration, selection, and drift. These forces interact and rarely act independently. These forces change the site frequency spectrum in informative ways that we can use for both demographic analysis and simulation.