Coalescent based demographic inference. Daniel Wegmann University of Fribourg

Size: px
Start display at page:

Download "Coalescent based demographic inference. Daniel Wegmann University of Fribourg"

Transcription

1 Coalescent based demographic inference Daniel Wegmann University of Fribourg

2 Introduction The current genetic diversity is the outcome of past evolutionary processes. Hence, we can use genetic diversity to tell stories about the past.

3 Introduction The current genetic diversity is the outcome of past evolutionary processes. Hence, we can use genetic diversity to tell stories about the past. But this is a challenging task! The history of natural populations is usually complex. Several evolutionary processes can leave similar footprints (bottleneck vs. selection). Loci are not independent, but correlated realizations of the same process. Novembre & Ramachandran (2011)

4 Qualitative inference Traditionally, we have relied on qualitative inference Example: out of Africa expansion via sequential founder effects in humans. Heterozygosity decays with distance from East Africa Ramachandran et al. (2005)

5 Model-based inference Patterns of genetic diversity may serve as evidence for or against stories of the evolutionary past. Such stories are usually vague ( Serial founder effects ). While the evidence may be strong, the argument remains verbal and is potentially subjective. Model-based inference provides statistical support

6 Model-based inference Patterns of genetic diversity may serve as evidence for or against stories of the evolutionary past. Such stories are usually vague ( Serial founder effects ). While the evidence may be strong, the argument remains verbal and is potentially subjective. Model-based inference provides statistical support Essentially, all models are wrong, but some are useful. George E. Box Qualitative inference is key when constructing sensible models!

7 Rejection of a Null Model The same as hypothesis testing in frequentist statistics: A null model M is rejected using a summary statistics s if P ( s M ) s s obs By convention, α = 0.05 Often the Null model is an isolated Wright-Fisher population of constant size

8 Rejection of a Null Model: F-Statistics F ST may be used to reject a panmictic population in favor of a specific structure. F IS may be used to reject a panmictic population in favor of non-random mating (inbreeding or substructure) The significance of F-Statistics is usually assessed usign permutation or randomization approaches.

9 Rejection of a Null Model: F-Statistics Western Eastern Central Bonobo Wegmann & Excoffier, MBE, 2010

10 Rejection of a Null Model: Tajima s D Tajima s D compares two estimates of θ=4n for a Wright-Fisher population of constant size: one based on the number segregating sites S one based on the average number of pairwise differences These estimates may differ when assumptions of the Wright-Fisher population are violated. Witgh-Fisher population An expanding population, for instance, leads to a negative D Significance is usually assessed via simulations. expanding population

11 Felsenstein Equation The Felsenstein Equation The Likelihood Function The probability of the data D given the parameters of the model Θ: P(D Θ) Maximum Likelihood Inference The maximum likelihood estimates are the values of Θ for which the likelihood P(D Θ) is maximized. 1 / 14

12 Felsenstein Equation The Felsenstein Equation The Likelihood Function The probability of the data D given the parameters of the model Θ: P(D Θ) Maximum Likelihood Inference The maximum likelihood estimates are the values of Θ for which the likelihood P(D Θ) is maximized. Bayesian Statistics The goal is to infer the probability of the parameters Θ given the data D. According to probability theory, Here, P(Θ D) = P(D Θ)P(Θ) P(D) P(D Θ)P(Θ) = P(D Θ)P(Θ)d Θ Θ P(Θ) is the prior probability, the probability of the parameter before looking at the data (yes, this is subjective!). P(Θ D) is the posterior probability of the parameter after considering the data. 2 / 14

13 Felsenstein Equation Mutation Model Likelihood of sequence data given a Genealogy The link between sequencing data D and some demographic parameters Θ is the underlying, unknown genealogy. Given a genealogy G i and a mutation model µ, the likelihood of the data is straight forward to calculate. Ind 1 : aagacacaga gatagaccag Ind 1 Ind 2 Ind 3 Ind 2 : aagacgcaga gatagaccag Ind 3 : aagacacaga tatagacaag Assuming all mutations to occur with rate µ: P(D G i, µ) = P(# mutations on b length(b), µ) b {Branches} Ind 1 Ind 2 Ind 3 3 / 14

14 Felsenstein Equation The Felsenstein Equation The Felsenstein Equation Calculating P(D Θ) requires to integrate over all possible genealogies and weighting each by their probability. P(D Θ, µ) = P(D G, µ)p(g Θ)dG G 4 / 14

15 Felsenstein Equation The Felsenstein Equation The Felsenstein Equation Calculating P(D Θ) requires to integrate over all possible genealogies and weighting each by their probability. P(D Θ, µ) = P(D G, µ)p(g Θ)dG G The Felsenstein Equation in practice Unfortunately, this integral is impossible to solve analytically in all but some extremely simple models. In practice, we thus approximate this integral using a random sample of coalescent trees. P(D Θ, µ) 1 N N P(D G i, µ) where g i P(G Θ) i=1 5 / 14

16 Felsenstein Equation Primer in Coalescent Theory Coalescent theory A population genetic theory that considers the history of a sample backward in time. Coalescent event If two sampled lineages have the same parent in the previous generation. 6 / 14

17 Felsenstein Equation Primer in Coalescent Theory Coalescent theory A population genetic theory that considers the history of a sample backward in time. Coalescent event If two sampled lineages have the same parent in the previous generation. Probability to coalesce Under random mating in a constant population, two lineages coalesce in the previous generation with probability Pr(2 individuals coalesce) = 1 2N 7 / 14

18 Felsenstein Equation Primer in Coalescent Theory Coalescent theory A population genetic theory that considers the history of a sample backward in time. Coalescent event If two sampled lineages have the same parent in the previous generation. Probability to coalesce Under random mating in a constant population, two lineages coalesce in the previous generation with probability Pr(2 individuals coalesce) = 1 2N Expected time t 2 until two lineages coalesce (time to Most Recent Common Ancestor, MRCA): E[t 2] = 2N generations. 8 / 14

19 Felsenstein Equation Coalescence with multiple samples Probability of coalescent Intuitive explanation ( ) k 1 k(k 1) Pr(at least one coalescent event) = = 2 2N 4N Probability of coalescence among k lineages = probability of coalescence among two lineages 1 2N the number of possible pairs ( k 2). times 9 / 14

20 Felsenstein Equation Coalescence with multiple samples Probability of coalescent Intuitive explanation ( ) k 1 k(k 1) Pr(at least one coalescent event) = = 2 2N 4N Probability of coalescence among k lineages = probability of coalescence among two lineages 1 2N the number of possible pairs ( k 2). times Expected time t k until k lineages coalesce 10 / 14

21 Felsenstein Equation Coalescence with multiple samples Probability of coalescent Intuitive explanation ( ) k 1 k(k 1) Pr(at least one coalescent event) = = 2 2N 4N Probability of coalescence among k lineages = probability of coalescence among two lineages 1 2N the number of possible pairs ( k 2). times Expected time t k until k lineages coalesce The expected waiting time until an event occurs the first time is given by the inverse of the probability of the event! E[t k ] = 1 ( k 2) 1 2N = ( 2N 4N k = 2) k(k 1) 11 / 14

22 Felsenstein Equation Expected genealogy of n samples (lineages) Height versus length of a genealogy of n samples ( E[T n] = 4N 1 1 ) n n 1 1 E[L n] = 4N k k=1 E[ L n ] or E[ T n ] 28N 24N 20N 16N 12N 8N 4N 0N E[ L n ] E[ T n ] Sample size n Note: Adding additional samples does increase the expected tree height only marginally, but increases the tree length a lot. Actually, doubling of the sample size increases the tree length by about 1.5 N. 12 / 14

23 Deep resequencing data set Data set: 202 known or prospective drug target genes 14,002 individuals, of which 12,514 Europeans Median coverage of 27x and a call rate of 90.7% Extensive quality control John Novembre Matt Nelson Heterozygous concordance 99.1% in 130 sample duplicates 99.0% in comparison to 1000G Trios Singleton concordance 98.5% in 130 sample duplicates 98.3% of 245 validated via Sanger Wegmann & Nelson et al. 2012

24 Rare variants are only weakly affected by selection Expected number of Alleles with frequency x Advantageous alleles Neutral alleles Disadvantageous alleles Messer 2009

25 Phenotypic Effect of Rare Variants Rare variants have a strong, negative impact on the phenotype 85% of NS mutations are deleterious enough never to get fixed 75% never to never get common (MAF of 5%) Similar patterns found by PolyPhen Wegmann & Nelson et al. 2012

26 Joint inference of demography and mutation rates Mutation rate and population size N have similar effects on genetic diversity. large population small population low mutation rate large mutation rate Wakeley and Takahashi 2002

27 Joint inference of demography and mutation rates Mutation rate and population size N have similar effects on genetic diversity. large population small population low mutation rate large mutation rate If sample size > effective population size, the rate of recent coalescent events is independent of, which rensers estimation of and N individually possible. Wakeley and Takahashi 2002

28 Joint inference of demography and mutation rates Mutation rate and population size N have similar effects on genetic diversity. large population small population low mutation rate large mutation rate If sample size > effective population size, the rate of recent coalescent events is independent of, which rensers estimation of and N individually possible. Problem: Likelihood calculation is intractable! Wakeley and Takahashi 2002

29 Joint inference of demography and mutation rates Using Monte Carlo simulations to approximate P(SFS,N): Simulate genealogies with fixed parameter values Africa Asia Europe Exponential growth in Europe All other parameters fixed to Schaffner estimates Nielsen 2000; Coventry et al. 2010

30 Joint inference of demography and mutation rates Using Monte Carlo simulations to approximate P(SFS,N): Simulate genealogies with fixed parameter values Compute average likelihood of the SFS across genealogies Africa Asia Europe Exponential growth in Europe All other parameters fixed to Schaffner estimates Likelihood 1 Likelihood 2 Likelihood 3 Average Likelihood Nielsen 2000; Coventry et al. 2010

31 Mutation rate Joint inference of demography and mutation rates Rapid population growth in Europe Variable mutation rates across genes (p ) Median mutation rate of 1.2x10-8 Lower than divergence based estimates (2.5x10-8 ) But in good agreement with recent estimates from pedigrees Population size (millions)

32 Mode of Speciation in Rose Finches In the classic view, geographic isolation was considered essential for speciation. However, recent evidence suggests that local adaptation and speciation may occur in the presence of gene flow if ecological selection is strong. In Birds, the Z-chromosome is known to play a vital role is speciation Haldanes Rule: In hybrids, fintness is lower in the hemizygous sex (females) Male sexually selected traits and female preference was mapped to the Z- chromosome in several species. Prediction If selection against hybrids is a driving force in speciation, gene flow will be interrupted ealier on the Z-chromosome than on autosomes.

33 Mode of Speciation in Rose Finches Inferring isolation times for Z-linked and autosomal markers seperately. Shou-Hsien Li Carpodacus vinaceus (Himalaya) Carpodacus formosa (Taiwan)

34 Two major difficulties For realistic evolutionary models, analytical solutions of the likelihood function are usually very hard and often impossible to obtain. We will use two tricks: 1) Using summary statistics S instead of the full data D The hope is that P(D θ) is proportional to P(S θ) 2) Using simulations to approximate the likelihood function P(S θ) Apply in a Bayesian setting: P(θ D) P(D θ) P(θ) Posterior Approximate Bayesian Computation (ABC) Likelihood Prior

35 Tavaré et al. (1997); Weiss & von Haeseler (1998) Approximate Bayesian Computation ABC defining statistics S,, F ST, D,... Data Summary statistics

36 Tavaré et al. (1997); Weiss & von Haeseler (1998) Standard ABC Algorithm defining statistics generating simulations according to prior

37 Tavaré et al. (1997); Weiss & von Haeseler (1998) Approximate Bayesian Computation ABC defining statistics generating simulations according to prior accepting close simulations

38 Tavaré et al. (1997); Weiss & von Haeseler (1998) Approximate Bayesian Computation ABC defining statistics generating simulations according to prior accepting close simulations estimating posterior distribution

39 Tavaré et al. (1997); Weiss & von Haeseler (1998) Approximate Bayesian Computation ABC defining statistics generating simulations according to prior accepting close simulations estimating posterior distribution

40 Beaumont et al. (2002); Blum & François (2009) Approximate Bayesian Computation ABC defining statistics generating simulations according to prior Regression to project points to s obs Assumption: no change in prior weight accepting close simulations post sampling regression adjustment estimating posterior distribution

41 ABC-GLM defining statistics generating simulations according to prior It is easy to show that where is the truncated likelihood accepting close simulations fitting a simple likelihood model estimating posterior distribution and the truncated prior Leuenberger & Wegmann (2010) Chris Leuenberger

42 ABC-GLM defining statistics generating simulations according to prior accepting close simulations fitting a simple likelihood model Assume GLM (estimate via OLS) with From retained sample using Gaussian peaks estimating posterior distribution Leuenberger & Wegmann (2010) Note: other models could be used, GLM was chosen due to laziness...

43 Leuenberger & Wegmann (2010) ABC-GLM defining statistics generating simulations according to prior accepting close simulations fitting a simple likelihood model estimating posterior distribution

44 Mode of Speciation in Rose Finches

45 Mode of Speciation in Rose Finches

46 Mode of Speciation in Rose Finches Joint posterior asymmetry observed in simulated data sets 51.5%

47 Cross River Gorilla (Thalmann et al., 2011) Olaf Thalmann Thalmann et al. (2011)

48 Hybridizing ABC with Full Likelihood Example: Estimating continuous trait evolution on phylogenetic trees Backbone tree Clades with unknown phylogenetic relationships Graham Slater L ( D a, 2,,, ) Trait values mean and variance within clade Brownian model of trait evolution a = root state of trait 2 = rate of trait evolution Phylogenetic birth-death process = species birthrate = species death rate Slater et al. (2011)

49 Hybridizing ABC with Full Likelihood Example: Estimating continuous trait evolution on phylogenetic trees Backbone tree Clades with unknown phylogenetic relationships L ( D a, 2,,, ) 2 L ( D a,, T ) P ( T,, ) Trait values mean and variance within clade T Brownian model of trait evolution a = root state of trait 2 = rate of trait evolution G Phylogenetic birth-death process = species birthrate = species death rate Slater et al. (2011)

50 Hybridizing ABC with Full Likelihood Example: Estimating continuous trait evolution on phylogenetic trees Backbone tree Clades with unknown phylogenetic relationships ABC-MCMC Metropolis-Hastings L ( D a, 2,,, ) 2 L ( D a,, T ) P ( T,, ) Trait values mean and variance within clade T Brownian model of trait evolution a = root state of trait 2 = rate of trait evolution G Phylogenetic birth-death process = species birthrate = species death rate Slater et al. (2011)

51 Application to Body Size Evolution in Carnivora Several members of the semiaquatic Pinnipedia attain very large body sizes. Did body size evolve faster among Pinnipedia than all other Carnivora? Southern Elephant Seal up to 4,000 Kg Walrus up to 1,800 Kg Slater et al. (2011)

52 Several members of the semiaquatic Pinnipedia attain very large body sizes. Did body size evolve faster among Pinnipedia than all other Carnivora? Slater et al. (2011)

53 Several members of the semiaquatic Pinnipedia attain very large body sizes. Did body size evolve faster among Pinnipedia than all other Carnivora? Slater et al. (2011)

54 Conclusions While often preferred, model based inference in biology is challenging due to the stochasticity and complexity of realistic models. As a consequence, we often rely on approximate inference schemes... It may help to replace the full data with summary statistics. Approximate Bayesian Computation is an extremely flexible but crude approach.... or approximate models. Approximating models such that they fit standard inference schemes. On the bright side: Such techniques allow us to estimate what we are really interested in, rather than require us to shift to problems for which analytical solutions are available.

Estimating Evolutionary Trees. Phylogenetic Methods

Estimating Evolutionary Trees. Phylogenetic Methods Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent

More information

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009 Gene Genealogies Coalescence Theory Annabelle Haudry Glasgow, July 2009 What could tell a gene genealogy? How much diversity in the population? Has the demographic size of the population changed? How?

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

Demography April 10, 2015

Demography April 10, 2015 Demography April 0, 205 Effective Population Size The Wright-Fisher model makes a number of strong assumptions which are clearly violated in many populations. For example, it is unlikely that any population

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Genetic Drift in Human Evolution

Genetic Drift in Human Evolution Genetic Drift in Human Evolution (Part 2 of 2) 1 Ecology and Evolutionary Biology Center for Computational Molecular Biology Brown University Outline Introduction to genetic drift Modeling genetic drift

More information

Introduction to Advanced Population Genetics

Introduction to Advanced Population Genetics Introduction to Advanced Population Genetics Learning Objectives Describe the basic model of human evolutionary history Describe the key evolutionary forces How demography can influence the site frequency

More information

Taming the Beast Workshop

Taming the Beast Workshop Workshop and Chi Zhang June 28, 2016 1 / 19 Species tree Species tree the phylogeny representing the relationships among a group of species Figure adapted from [Rogers and Gibbs, 2014] Gene tree the phylogeny

More information

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page. EVOLUTIONARY BIOLOGY EXAM #1 Fall 2017 There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page. Part I. True (T) or False (F) (2 points each). Circle

More information

Mathematical models in population genetics II

Mathematical models in population genetics II Mathematical models in population genetics II Anand Bhaskar Evolutionary Biology and Theory of Computing Bootcamp January 1, 014 Quick recap Large discrete-time randomly mating Wright-Fisher population

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

Lecture 14 Chapter 11 Biology 5865 Conservation Biology. Problems of Small Populations Population Viability Analysis

Lecture 14 Chapter 11 Biology 5865 Conservation Biology. Problems of Small Populations Population Viability Analysis Lecture 14 Chapter 11 Biology 5865 Conservation Biology Problems of Small Populations Population Viability Analysis Minimum Viable Population (MVP) Schaffer (1981) MVP- A minimum viable population for

More information

Robust demographic inference from genomic and SNP data

Robust demographic inference from genomic and SNP data Robust demographic inference from genomic and SNP data Laurent Excoffier Isabelle Duperret, Emilia Huerta-Sanchez, Matthieu Foll, Vitor Sousa, Isabel Alves Computational and Molecular Population Genetics

More information

Challenges when applying stochastic models to reconstruct the demographic history of populations.

Challenges when applying stochastic models to reconstruct the demographic history of populations. Challenges when applying stochastic models to reconstruct the demographic history of populations. Willy Rodríguez Institut de Mathématiques de Toulouse October 11, 2017 Outline 1 Introduction 2 Inverse

More information

CONSERVATION AND THE GENETICS OF POPULATIONS

CONSERVATION AND THE GENETICS OF POPULATIONS CONSERVATION AND THE GENETICS OF POPULATIONS FredW.Allendorf University of Montana and Victoria University of Wellington and Gordon Luikart Universite Joseph Fourier, CNRS and University of Montana With

More information

How robust are the predictions of the W-F Model?

How robust are the predictions of the W-F Model? How robust are the predictions of the W-F Model? As simplistic as the Wright-Fisher model may be, it accurately describes the behavior of many other models incorporating additional complexity. Many population

More information

Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles

Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles Supplemental Information Likelihood-based inference in isolation-by-distance models using the spatial distribution of low-frequency alleles John Novembre and Montgomery Slatkin Supplementary Methods To

More information

7. Tests for selection

7. Tests for selection Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info

More information

The theory of evolution continues to be refined as scientists learn new information.

The theory of evolution continues to be refined as scientists learn new information. Section 3: The theory of evolution continues to be refined as scientists learn new information. K What I Know W What I Want to Find Out L What I Learned Essential Questions What are the conditions of the

More information

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #

Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information # Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either

More information

Approximate Bayesian Computation: a simulation based approach to inference

Approximate Bayesian Computation: a simulation based approach to inference Approximate Bayesian Computation: a simulation based approach to inference Richard Wilkinson Simon Tavaré 2 Department of Probability and Statistics University of Sheffield 2 Department of Applied Mathematics

More information

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Evolutionary Theory Mathematical and Conceptual Foundations Sean H. Rice Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Contents Preface ix Introduction 1 CHAPTER 1 Selection on One

More information

Frequency Spectra and Inference in Population Genetics

Frequency Spectra and Inference in Population Genetics Frequency Spectra and Inference in Population Genetics Although coalescent models have come to play a central role in population genetics, there are some situations where genealogies may not lead to efficient

More information

Neutral Theory of Molecular Evolution

Neutral Theory of Molecular Evolution Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation

More information

Microevolution (Ch 16) Test Bank

Microevolution (Ch 16) Test Bank Microevolution (Ch 16) Test Bank Multiple Choice Identify the letter of the choice that best completes the statement or answers the question. 1. Which of the following statements describes what all members

More information

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall

More information

Stochastic Demography, Coalescents, and Effective Population Size

Stochastic Demography, Coalescents, and Effective Population Size Demography Stochastic Demography, Coalescents, and Effective Population Size Steve Krone University of Idaho Department of Mathematics & IBEST Demographic effects bottlenecks, expansion, fluctuating population

More information

An introduction to Approximate Bayesian Computation methods

An introduction to Approximate Bayesian Computation methods An introduction to Approximate Bayesian Computation methods M.E. Castellanos maria.castellanos@urjc.es (from several works with S. Cabras, E. Ruli and O. Ratmann) Valencia, January 28, 2015 Valencia Bayesian

More information

6 Introduction to Population Genetics

6 Introduction to Population Genetics Grundlagen der Bioinformatik, SoSe 14, D. Huson, May 18, 2014 67 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,

More information

Name Period. 3. How many rounds of DNA replication and cell division occur during meiosis?

Name Period. 3. How many rounds of DNA replication and cell division occur during meiosis? Name Period GENERAL BIOLOGY Second Semester Study Guide Chapters 3, 4, 5, 6, 11, 14, 16, 17, 18 and 19. SEXUAL REPRODUCTION AND MEIOSIS 1. What is the purpose of meiosis? 2. Distinguish between diploid

More information

Selection and Population Genetics

Selection and Population Genetics Selection and Population Genetics Evolution by natural selection can occur when three conditions are satisfied: Variation within populations - individuals have different traits (phenotypes). height and

More information

Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates

Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates JOSEPH FELSENSTEIN Department of Genetics SK-50, University

More information

I. Short Answer Questions DO ALL QUESTIONS

I. Short Answer Questions DO ALL QUESTIONS EVOLUTION 313 FINAL EXAM Part 1 Saturday, 7 May 2005 page 1 I. Short Answer Questions DO ALL QUESTIONS SAQ #1. Please state and BRIEFLY explain the major objectives of this course in evolution. Recall

More information

The Combinatorial Interpretation of Formulas in Coalescent Theory

The Combinatorial Interpretation of Formulas in Coalescent Theory The Combinatorial Interpretation of Formulas in Coalescent Theory John L. Spouge National Center for Biotechnology Information NLM, NIH, DHHS spouge@ncbi.nlm.nih.gov Bldg. A, Rm. N 0 NCBI, NLM, NIH Bethesda

More information

There are 3 parts to this exam. Take your time and be sure to put your name on the top of each page.

There are 3 parts to this exam. Take your time and be sure to put your name on the top of each page. EVOLUTIONARY BIOLOGY BIOS 30305 EXAM #2 FALL 2011 There are 3 parts to this exam. Take your time and be sure to put your name on the top of each page. Part I. True (T) or False (F) (2 points each). 1)

More information

The Origin of Species

The Origin of Species The Origin of Species Introduction A species can be defined as a group of organisms whose members can breed and produce fertile offspring, but who do not produce fertile offspring with members of other

More information

6 Introduction to Population Genetics

6 Introduction to Population Genetics 70 Grundlagen der Bioinformatik, SoSe 11, D. Huson, May 19, 2011 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,

More information

From Individual-based Population Models to Lineage-based Models of Phylogenies

From Individual-based Population Models to Lineage-based Models of Phylogenies From Individual-based Population Models to Lineage-based Models of Phylogenies Amaury Lambert (joint works with G. Achaz, H.K. Alexander, R.S. Etienne, N. Lartillot, H. Morlon, T.L. Parsons, T. Stadler)

More information

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!

Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis

More information

Big Idea #1: The process of evolution drives the diversity and unity of life

Big Idea #1: The process of evolution drives the diversity and unity of life BIG IDEA! Big Idea #1: The process of evolution drives the diversity and unity of life Key Terms for this section: emigration phenotype adaptation evolution phylogenetic tree adaptive radiation fertility

More information

A Bayesian Approach to Phylogenetics

A Bayesian Approach to Phylogenetics A Bayesian Approach to Phylogenetics Niklas Wahlberg Based largely on slides by Paul Lewis (www.eeb.uconn.edu) An Introduction to Bayesian Phylogenetics Bayesian inference in general Markov chain Monte

More information

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Primate Diversity & Human Evolution (Outline)

Primate Diversity & Human Evolution (Outline) Primate Diversity & Human Evolution (Outline) 1. Source of evidence for evolutionary relatedness of organisms 2. Primates features and function 3. Classification of primates and representative species

More information

Ch. 16 Evolution of Populations

Ch. 16 Evolution of Populations Ch. 16 Evolution of Populations Gene pool the combined genetic information of all the members of a population. There are typically 2 or more alleles for a certain trait. (dominant or recessive) Allele

More information

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution 15.2 Intro In biology, evolution refers specifically to changes in the genetic makeup of populations over time.

More information

STABILIZING SELECTION ON HUMAN BIRTH WEIGHT

STABILIZING SELECTION ON HUMAN BIRTH WEIGHT STABILIZING SELECTION ON HUMAN BIRTH WEIGHT See Box 8.2 Mapping the Fitness Landscape in Z&E FROM: Cavalli-Sforza & Bodmer 1971 STABILIZING SELECTION ON THE GALL FLY, Eurosta solidaginis GALL DIAMETER

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

Discrete & continuous characters: The threshold model

Discrete & continuous characters: The threshold model Discrete & continuous characters: The threshold model Discrete & continuous characters: the threshold model So far we have discussed continuous & discrete character models separately for estimating ancestral

More information

Darwinian Selection. Chapter 7 Selection I 12/5/14. v evolution vs. natural selection? v evolution. v natural selection

Darwinian Selection. Chapter 7 Selection I 12/5/14. v evolution vs. natural selection? v evolution. v natural selection Chapter 7 Selection I Selection in Haploids Selection in Diploids Mutation-Selection Balance Darwinian Selection v evolution vs. natural selection? v evolution ² descent with modification ² change in allele

More information

Markov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree

Markov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree Markov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree Nicolas Salamin Department of Ecology and Evolution University of Lausanne

More information

Name Period. 2. Name the 3 parts of interphase AND briefly explain what happens in each:

Name Period. 2. Name the 3 parts of interphase AND briefly explain what happens in each: Name Period GENERAL BIOLOGY Second Semester Study Guide Chapters 3, 4, 5, 6, 11, 10, 13, 14, 15, 16, and 17. SEXUAL REPRODUCTION AND MEIOSIS 1. The cell cycle consists of a growth stage and a division

More information

Molecular Epidemiology Workshop: Bayesian Data Analysis

Molecular Epidemiology Workshop: Bayesian Data Analysis Molecular Epidemiology Workshop: Bayesian Data Analysis Jay Taylor and Ananias Escalante School of Mathematical and Statistical Sciences Center for Evolutionary Medicine and Informatics Arizona State University

More information

- point mutations in most non-coding DNA sites likely are likely neutral in their phenotypic effects.

- point mutations in most non-coding DNA sites likely are likely neutral in their phenotypic effects. January 29 th, 2010 Bioe 109 Winter 2010 Lecture 10 Microevolution 3 - random genetic drift - one of the most important shifts in evolutionary thinking over the past 30 years has been an appreciation of

More information

Gene Pool The combined genetic material for all the members of a population. (all the genes in a population)

Gene Pool The combined genetic material for all the members of a population. (all the genes in a population) POPULATION GENETICS NOTES Gene Pool The combined genetic material for all the members of a population. (all the genes in a population) Allele Frequency The number of times a specific allele occurs in a

More information

Quantitative Trait Variation

Quantitative Trait Variation Quantitative Trait Variation 1 Variation in phenotype In addition to understanding genetic variation within at-risk systems, phenotype variation is also important. reproductive fitness traits related to

More information

Reproduction and Evolution Practice Exam

Reproduction and Evolution Practice Exam Reproduction and Evolution Practice Exam Topics: Genetic concepts from the lecture notes including; o Mitosis and Meiosis, Homologous Chromosomes, Haploid vs Diploid cells Reproductive Strategies Heaviest

More information

Bayesian Phylogenetics:

Bayesian Phylogenetics: Bayesian Phylogenetics: an introduction Marc A. Suchard msuchard@ucla.edu UCLA Who is this man? How sure are you? The one true tree? Methods we ve learned so far try to find a single tree that best describes

More information

122 9 NEUTRALITY TESTS

122 9 NEUTRALITY TESTS 122 9 NEUTRALITY TESTS 9 Neutrality Tests Up to now, we calculated different things from various models and compared our findings with data. But to be able to state, with some quantifiable certainty, that

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA Human Population Genomics Outline 1 2 Damn the Human Genomes. Small initial populations; genes too distant; pestered with transposons;

More information

The Mechanisms of Evolution

The Mechanisms of Evolution The Mechanisms of Evolution Figure.1 Darwin and the Voyage of the Beagle (Part 1) 2/8/2006 Dr. Michod Intro Biology 182 (PP 3) 4 The Mechanisms of Evolution Charles Darwin s Theory of Evolution Genetic

More information

NOTES CH 17 Evolution of. Populations

NOTES CH 17 Evolution of. Populations NOTES CH 17 Evolution of Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Populations 17.1 Genes & Variation Darwin

More information

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory

More information

Fundamentals and Recent Developments in Approximate Bayesian Computation

Fundamentals and Recent Developments in Approximate Bayesian Computation Syst. Biol. 66(1):e66 e82, 2017 The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. This is an Open Access article distributed under the terms of

More information

Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium

Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium Rapid speciation following recent host shift in the plant pathogenic fungus Rhynchosporium Tiziana Vonlanthen, Laurin Müller 27.10.15 1 Second paper: Origin and Domestication of the Fungal Wheat Pathogen

More information

Phenotypic Evolution. and phylogenetic comparative methods. G562 Geometric Morphometrics. Department of Geological Sciences Indiana University

Phenotypic Evolution. and phylogenetic comparative methods. G562 Geometric Morphometrics. Department of Geological Sciences Indiana University Phenotypic Evolution and phylogenetic comparative methods Phenotypic Evolution Change in the mean phenotype from generation to generation... Evolution = Mean(genetic variation * selection) + Mean(genetic

More information

Concepts and Methods in Molecular Divergence Time Estimation

Concepts and Methods in Molecular Divergence Time Estimation Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks

More information

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics: Homework Assignment, Evolutionary Systems Biology, Spring 2009. Homework Part I: Phylogenetics: Introduction. The objective of this assignment is to understand the basics of phylogenetic relationships

More information

ABCME: Summary statistics selection for ABC inference in R

ABCME: Summary statistics selection for ABC inference in R ABCME: Summary statistics selection for ABC inference in R Matt Nunes and David Balding Lancaster University UCL Genetics Institute Outline Motivation: why the ABCME package? Description of the package

More information

Demographic inference reveals African and European. admixture in the North American Drosophila. melanogaster population

Demographic inference reveals African and European. admixture in the North American Drosophila. melanogaster population Genetics: Advance Online Publication, published on November 12, 2012 as 10.1534/genetics.112.145912 1 2 3 4 5 6 7 Demographic inference reveals African and European admixture in the North American Drosophila

More information

Notes on Population Genetics

Notes on Population Genetics Notes on Population Genetics Graham Coop 1 1 Department of Evolution and Ecology & Center for Population Biology, University of California, Davis. To whom correspondence should be addressed: gmcoop@ucdavis.edu

More information

(Write your name on every page. One point will be deducted for every page without your name!)

(Write your name on every page. One point will be deducted for every page without your name!) POPULATION GENETICS AND MICROEVOLUTIONARY THEORY FINAL EXAMINATION (Write your name on every page. One point will be deducted for every page without your name!) 1. Briefly define (5 points each): a) Average

More information

Supporting Information

Supporting Information Supporting Information Hammer et al. 10.1073/pnas.1109300108 SI Materials and Methods Two-Population Model. Estimating demographic parameters. For each pair of sub-saharan African populations we consider

More information

Distinguishing between population bottleneck and population subdivision by a Bayesian model choice procedure

Distinguishing between population bottleneck and population subdivision by a Bayesian model choice procedure Molecular Ecology (2010) 19, 4648 4660 doi: 10.1111/j.1365-294X.2010.04783.x Distinguishing between population bottleneck and population subdivision by a Bayesian model choice procedure BENJAMIN M. PETER,*

More information

Notes for MCTP Week 2, 2014

Notes for MCTP Week 2, 2014 Notes for MCTP Week 2, 2014 Lecture 1: Biological background Evolutionary biology and population genetics are highly interdisciplinary areas of research, with many contributions being made from mathematics,

More information

UNIT V. Chapter 11 Evolution of Populations. Pre-AP Biology

UNIT V. Chapter 11 Evolution of Populations. Pre-AP Biology UNIT V Chapter 11 Evolution of Populations UNIT 4: EVOLUTION Chapter 11: The Evolution of Populations I. Genetic Variation Within Populations (11.1) A. Genetic variation in a population increases the chance

More information

The Wright-Fisher Model and Genetic Drift

The Wright-Fisher Model and Genetic Drift The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population

More information

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Lecture 5 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

Applications of Genetics to Conservation Biology

Applications of Genetics to Conservation Biology Applications of Genetics to Conservation Biology Molecular Taxonomy Populations, Gene Flow, Phylogeography Relatedness - Kinship, Paternity, Individual ID Conservation Biology Population biology Physiology

More information

Population Genetics & Evolution

Population Genetics & Evolution The Theory of Evolution Mechanisms of Evolution Notes Pt. 4 Population Genetics & Evolution IMPORTANT TO REMEMBER: Populations, not individuals, evolve. Population = a group of individuals of the same

More information

Evolution of Populations. Chapter 17

Evolution of Populations. Chapter 17 Evolution of Populations Chapter 17 17.1 Genes and Variation i. Introduction: Remember from previous units. Genes- Units of Heredity Variation- Genetic differences among individuals in a population. New

More information

A (short) introduction to phylogenetics

A (short) introduction to phylogenetics A (short) introduction to phylogenetics Thibaut Jombart, Marie-Pauline Beugin MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis with PR Statistics, Millport Field

More information

Approximate Bayesian Computation

Approximate Bayesian Computation Approximate Bayesian Computation Michael Gutmann https://sites.google.com/site/michaelgutmann University of Helsinki and Aalto University 1st December 2015 Content Two parts: 1. The basics of approximate

More information

that of Phylotree.org, mtdna tree Build 1756 (Supplementary TableS2). is resulted in 78 individuals allocated to the hg B4a1a1 and three individuals to hg Q. e control region (nps 57372 and nps 1602416526)

More information

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Statistical Methods in Particle Physics Lecture 1: Bayesian methods Statistical Methods in Particle Physics Lecture 1: Bayesian methods SUSSP65 St Andrews 16 29 August 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

ESTIMATION of recombination fractions using ped- ber of pairwise differences (Hudson 1987; Wakeley

ESTIMATION of recombination fractions using ped- ber of pairwise differences (Hudson 1987; Wakeley Copyright 2001 by the Genetics Society of America Estimating Recombination Rates From Population Genetic Data Paul Fearnhead and Peter Donnelly Department of Statistics, University of Oxford, Oxford, OX1

More information

Lecture 11 Friday, October 21, 2011

Lecture 11 Friday, October 21, 2011 Lecture 11 Friday, October 21, 2011 Phylogenetic tree (phylogeny) Darwin and classification: In the Origin, Darwin said that descent from a common ancestral species could explain why the Linnaean system

More information

Genetics: Early Online, published on February 26, 2016 as /genetics Admixture, Population Structure and F-statistics

Genetics: Early Online, published on February 26, 2016 as /genetics Admixture, Population Structure and F-statistics Genetics: Early Online, published on February 26, 2016 as 10.1534/genetics.115.183913 GENETICS INVESTIGATION Admixture, Population Structure and F-statistics Benjamin M Peter 1 1 Department of Human Genetics,

More information

Processes of Evolution

Processes of Evolution Processes of Evolution Microevolution Processes of Microevolution How Species Arise Macroevolution Microevolution Population: localized group of individuals belonging to the same species with the potential

More information

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences Indian Academy of Sciences A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences ANALABHA BASU and PARTHA P. MAJUMDER*

More information

The problem Lineage model Examples. The lineage model

The problem Lineage model Examples. The lineage model The lineage model A Bayesian approach to inferring community structure and evolutionary history from whole-genome metagenomic data Jack O Brien Bowdoin College with Daniel Falush and Xavier Didelot Cambridge,

More information

Mechanisms of Evolution. Adaptations. Old Ideas about Evolution. Behavioral. Structural. Biochemical. Physiological

Mechanisms of Evolution. Adaptations. Old Ideas about Evolution. Behavioral. Structural. Biochemical. Physiological Mechanisms of Evolution Honors Biology 2012 1 Adaptations Behavioral Structural Biochemical Physiological 2 Old Ideas about Evolution Aristotle (viewed species perfect and unchanging) Lamarck suggested

More information

I of a gene sampled from a randomly mating popdation,

I of a gene sampled from a randomly mating popdation, Copyright 0 1987 by the Genetics Society of America Average Number of Nucleotide Differences in a From a Single Subpopulation: A Test for Population Subdivision Curtis Strobeck Department of Zoology, University

More information

Modern Evolutionary Classification. Section 18-2 pgs

Modern Evolutionary Classification. Section 18-2 pgs Modern Evolutionary Classification Section 18-2 pgs 451-455 Modern Evolutionary Classification In a sense, organisms determine who belongs to their species by choosing with whom they will mate. Taxonomic

More information

Bayesian Inference and MCMC

Bayesian Inference and MCMC Bayesian Inference and MCMC Aryan Arbabi Partly based on MCMC slides from CSC412 Fall 2018 1 / 18 Bayesian Inference - Motivation Consider we have a data set D = {x 1,..., x n }. E.g each x i can be the

More information

Introduction into Bayesian statistics

Introduction into Bayesian statistics Introduction into Bayesian statistics Maxim Kochurov EF MSU November 15, 2016 Maxim Kochurov Introduction into Bayesian statistics EF MSU 1 / 7 Content 1 Framework Notations 2 Difference Bayesians vs Frequentists

More information

Surfing genes. On the fate of neutral mutations in a spreading population

Surfing genes. On the fate of neutral mutations in a spreading population Surfing genes On the fate of neutral mutations in a spreading population Oskar Hallatschek David Nelson Harvard University ohallats@physics.harvard.edu Genetic impact of range expansions Population expansions

More information

Population Structure

Population Structure Ch 4: Population Subdivision Population Structure v most natural populations exist across a landscape (or seascape) that is more or less divided into areas of suitable habitat v to the extent that populations

More information

Statistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution?

Statistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution? Statistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution? 30 July 2011. Joe Felsenstein Workshop on Molecular Evolution, MBL, Woods Hole Statistical nonmolecular

More information