The Combinatorial Interpretation of Formulas in Coalescent Theory

Size: px
Start display at page:

Download "The Combinatorial Interpretation of Formulas in Coalescent Theory"

Transcription

1 The Combinatorial Interpretation of Formulas in Coalescent Theory John L. Spouge National Center for Biotechnology Information NLM, NIH, DHHS Bldg. A, Rm. N 0 NCBI, NLM, NIH Bethesda MD 09

2 Motivation Evolution is fundamental to understanding biological networks. The infinite alleles model was arguably the most important model of evolution until about 990. yields many simple solutions, mostly proved by induction. still provides useful approximations (e.g., in LDHat, a program for estimating recombination rates along human chromosomes) If a difficult problem has a solution with a simple form, the solution had to be simple, and something is missing. In 990, a simple solution was discovered, but not pursued. The solution M Kimura & JF Crow (9) Genetics 9: P Joyce & S Tavare (990) Stoch Proc Appl : might indicate techniques useful in newer evolutionary realistic models. provides people of a combinatorial bent with an instant understanding of a significant part of almost thirty years of evolutionary literature.

3 Overview Population Genetics Models Forward Time Wright-Fisher Model Moran Model Yule Model The Death Process for the Moran Model Backward Time The waiting times are independent of the combinatorics. Invariance theorems: the ancestral tree of a population sample converges to a single limit for many genetics models (the coalescent process). The death process maps to random permutations, so counting permutations provides many results for the infinite alleles model. The mapping to random permutations also gives insight into deep results for the entire population.

4 Life Reproduction Mutation Selection C Neuhauser & SM Krone (99) Genetics :9 F Crick (9) Life Itself: Its Origin and Nature

5 Wright-Fisher Model Without Mutation The population is: composed of haploid individuals; of constant size. M (No sex!) Individuals reproduce simultaneously. Every offspring chooses a single parent uniformly at random. RA Fisher (90) The Genetical Theory of Natural Selection S Wright (9) Genetics :9

6 Wright-Fisher Model Without Mutation M t 0

7 Wright-Fisher Model Without Mutation Some lineages die out by chance. Some lineages expand by chance. Some lineages do nothing much at all. M t 0

8 Moran Model Without Mutation The population is: composed of haploid individuals; of constant size. At each step Two individuals are chosen uniformly at random with replacement. One reproduces; the other (possibly the same individual) dies.

9 Moran Model Without Mutation M 9 t 0

10 Moran Model With Mutation Infinite Alleles Model The population is: composed of haploid individuals; of constant size. At each step Two individuals are chosen uniformly at random with replacement. One reproduces; the other (possibly the same individual) dies. With probability u, the new individual is a mutant type never seen before. 0

11 Infinite Alleles & Infinite Sites Models Infinite alleles models gel Every mutation is a type never seen before, but nothing is said about the structure of the mutation. Infinite sites models DNA Every mutation is a type never seen before, occuring in a new place on the DNA.

12 Moran Model Without Mutation M t 0

13 Moran Model With Mutation u u M t 0

14 Moran Model With Mutation Looking Backward a Death Process Let us temporarily ignore changes that do not affect the ancestry of the present population. set of rooted trees ordered by height M nt 0

15 Moran Model With Mutation Looking Backward a Death Process M Ancestral Tree n 0

16 Moran Model With Mutation Looking Backward a Death Process M canonical representation of a permutation by its cycle structure n 0

17 Moran Model With Mutation Looking Backward a Death Process M n 0

18 Moran Model With Mutation Looking Backward a Death Process M Each permutation on M objects corresponds to exactly one realization of the death process. n 0

19 Moran Model With Mutation 9 Looking Backward a Death Process M Ancestral Permutation n 0

20 Moran Model With Mutation Looking Backward M 0 k n k n tj t u u u u j j ( j ) M M j= M M P t The variates Π and T are independent. ( Π, T) = ( π, t) { } t t j+ t k cycles k = t t t t t π

21 P Moran Model With Mutation ( Π, T) = ( π, t) { } Looking Backward k n k n tj t u u u u j j ( j ) M M j= M M The variates Π and T are independent. j+ P{ Π=π} π P k k k u u Mu k θ = = M M u k { Π= π } θ S n k cycles k =

22 Moran Model With Mutation Looking Backward from a Population M n 0

23 Moran Model With Mutation Mn Looking Backward from a Sample n Each permutation on n objects corresponds to exactly one realization of the death process. 0

24 Moran Model With Mutation Looking Backward from a Sample k n k n tj t u u u u j j ( j ) M M j= M M P ( Π, T) = ( π, t) { } M u = θ M + θ Fix P j+ { T= t} Mu θ = u By suitably scaling time, the waiting times between deaths are independent exponential variates. There are invariance theorems showing that when their times are suitably scaled, the Wright-Fisher Model, the Moran Model, and many other genetics models converge to limits, so-called coalescent processes. π JFC Kingman (9) Stoch Proc Appl :

25 Moran Model With Mutation n M The Death Process The distribution of the death process for the population at any time has the same form as for a sample. 0 WJ Ewens & GA Watterson, p. in NM Bingham & CM Goldie (00) Probability and Mathematical Genetics

26 Sample Path of the Death Process & the Cycle Structure of Permutations π = ( )( )( )( ) π The Fundamental Theorem P of Infinite Alleles Mutation Models k { Π= π } θ To calculate the probability of any particular ancestral tree, enumerate the corresponding ancestral permutations with θ marking the cycles and then normalize to a probability. In a coalescent limit (infinite population with suitable scaling of time), the waiting times between events are independent exponential variates. S n P Joyce & S Tavare (990) Stoch Proc Appl :

27 The Ancestral Permutation and the Sampling Permutation π = ( )( )( )( ) In practice, individuals are sampled from the population before they are placed into the ancestral tree. Without loss of generality, let the sampling be sequential, and without replacement. The sampling order of the n individuals requires a second, sampling permutation, to put the individuals into the unique order required for the correspondence between the ancestral graph and the ancestral permutation. P { Σ= σ S } = / n! σ = n The variates Π, Σ, and T are independent. π

28 A Light Warm-up

29 Moran Model Without Mutation (θ=0) Looking Backward a Death Process 9 n M n 0

30 Yule Process Without Mutation (θ=0) Looking Forward What is the distribution of the number of leaves on either side of the first split? 0 n M n 0

31 Moran Model Without Mutation (θ=0) Looking Backward a Death Process n M n The Each distribution cyclic permutation of the number of n of leaves objects on corresponds either side of to the exactly first one realization split is uniform. of the death process (or the Yule process). 0 J Hein, MH Schierup, and C Wiuf (00) Gene Genealogies, Variation, and Evolution p.

32 Combinatorial Notations

33 Combinatorial Notations n θ = θ( θ + )...( θ + n ) = θ θ k= 0 k = π Sn n n k k Rising factorial Stirling number of the first kind Stirling cycle number The number of permutations of n objects with k cycles The rising factorial is the ordinary generating function of permutations, with θ marking the number of cycles. ( )...( n ) n θ θ θ θ = + Falling factorial ξ #(elements in a set ξ) R Graham, D Knuth, and O Patashnik (99) Concrete Mathematics: A Foundation for Computer Science

34 Something a Little Heavier WJ Ewens (9) Theor Pop Biol : P Donnelly & S Tavare (9) Adv Appl Prob : J Hein et al. (00) Gene Genealogies, Variation and Evolution

35 Probability of k Types in the Sample With Mutation n θ P { k types} n k = θ k The number of permutations of n objects with k cycles, marked with θ k Stirling number of the first kind Stirling cycle number k cycles k = π P Donnelly & S Tavare (9) Adv Appl Prob :

36 Ewens Sampling Formula n θ P { a } a an... n = n i= partition by types n! ( a! ) i ai i k θ π k n = a i= i n n = ia i= i k cycles k = WJ Ewens (9) Theor Pop Biol :

37 Most Recent Common Ancestor (MRCA) Without Mutation What is the probably that in a sample of n individuals, a random subset of m coalesces to its MRCA before coalescing with any other individuals? In the (circular) sample permutation, the individuals are consecutive. In the (circular) ancestral permutation, relative to the m individuals in the sample permutation, both end numbers are smaller than the internal numbers. n n m m + m = n = J Hein et al. (00) Gene Genealogies, Variation and Evolution π

38 Something Heavy

39 Distribution of Ancestors Without Mutation π k= ancestors 9 0

40 Distribution of Ancestors 0 ( ) P = { } Without Mutation { } n! n! A ξ,..., ξ k #(realizations producing A k ) ( ) ( ) ξ!...!!!! ξk k k n k k= ancestors k 0 π JFC Kingman (9) Stoch Proc Appl :

41 Distribution of Ancestors With Mutation { = ({ } ( ))} ( ) ( ) n n! θ P A k ξ,..., ξ k, η,..., η j #(realizations marked by θ producing A k ) ξ!... ξ! k! η!... η! k ( ) ( j) j θ ( n k) ( η( ) η ) ( )... η j ( j) k= ancestors! k θ oldest to youngest 0 π P Donnelly & S Tavare (9) Adv Appl Prob :

42 Distribution of Sample Types P { A ( ) } 0 = ( ),..., ( ) plus k j younger types j n θ η η #(realizations marked by θ producing A 0 ) s j n n sj k θ n n s n s k j ( )...( ) j oldest to youngest π si = η η ( ) ( i) total length of first i cycles P Donnelly & S Tavare (9) Adv Appl Prob :

43 Distribution of All Sample Types { A ( )} 0 = ( ),..., ( k ) n θ P η η #(realizations producing A 0, marked by θ) n! n n s n s ( )...( ) k k θ ( η,..., η ) = (,,, ) ( ) ( k ) oldest to youngest π si = η η ( ) ( i) P Donnelly & S Tavare (9) Adv Appl Prob :

44 Something Really Heavy

45 Moran Model With Mutation Nested Samples Embed a death process for a known sample of size n in the death process for a super-sample of size M. Use the correspondence with random permutations in S M to derive results for the super-sample, conditioned on the sample. It will be expected that various exact results hold for the Moran model WJ Ewens & GA Watterson in NM Bingham & CM Goldie (00) Probability and Mathematical Genetics, p. Let M to derive results relating the sample to an infinite population. Because of invariance theorems, such results often pertain to several different genetics models.

46 Distribution of Sample Types Within an Infinite Population P P n! { A ( )} 0 = η( ),..., η ( ) = k n( n s )...( n s ) k oldest to youngest types within sample k θ n θ k n! θ n θ θ θ θ { A ( )} 0 = η( ),..., η ( ) = k ( + n)( + n s )...( + n s ) k oldest to youngest types within sample are in fact the k oldest types in the population k = P Donnelly & S Tavare (9) Adv Appl Prob : si = η η ( ) ( i) Species Deletion Property

47

48 Summary Population Genetics Models Forward Time Models without mutation and infinite-alleles infinite-sites models Wright-Fisher Model Moran Model Yule Model The Death Process for the Models Backward Time The waiting times are independent of the combinatorics. Invariance theorems: the ancestral tree of a population sample converges to a single limit for many genetics models (the coalescent process). The death process maps to random permutations, so counting permutations provides many results for genetic samples. The mapping to random permutations also gives insight into results for the entire population Species Deletion Property

49 Laszlo Szekely 9 Eva Czabarka

6 Introduction to Population Genetics

6 Introduction to Population Genetics Grundlagen der Bioinformatik, SoSe 14, D. Huson, May 18, 2014 67 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,

More information

6 Introduction to Population Genetics

6 Introduction to Population Genetics 70 Grundlagen der Bioinformatik, SoSe 11, D. Huson, May 19, 2011 6 Introduction to Population Genetics This chapter is based on: J. Hein, M.H. Schierup and C. Wuif, Gene genealogies, variation and evolution,

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA Human Population Genomics Outline 1 2 Damn the Human Genomes. Small initial populations; genes too distant; pestered with transposons;

More information

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences

A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences Indian Academy of Sciences A comparison of two popular statistical methods for estimating the time to most recent common ancestor (TMRCA) from a sample of DNA sequences ANALABHA BASU and PARTHA P. MAJUMDER*

More information

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009

Gene Genealogies Coalescence Theory. Annabelle Haudry Glasgow, July 2009 Gene Genealogies Coalescence Theory Annabelle Haudry Glasgow, July 2009 What could tell a gene genealogy? How much diversity in the population? Has the demographic size of the population changed? How?

More information

How robust are the predictions of the W-F Model?

How robust are the predictions of the W-F Model? How robust are the predictions of the W-F Model? As simplistic as the Wright-Fisher model may be, it accurately describes the behavior of many other models incorporating additional complexity. Many population

More information

Rare Alleles and Selection

Rare Alleles and Selection Theoretical Population Biology 59, 8796 (001) doi:10.1006tpbi.001.153, available online at http:www.idealibrary.com on Rare Alleles and Selection Carsten Wiuf Department of Statistics, University of Oxford,

More information

Lecture 18 : Ewens sampling formula

Lecture 18 : Ewens sampling formula Lecture 8 : Ewens sampling formula MATH85K - Spring 00 Lecturer: Sebastien Roch References: [Dur08, Chapter.3]. Previous class In the previous lecture, we introduced Kingman s coalescent as a limit of

More information

ON COMPOUND POISSON POPULATION MODELS

ON COMPOUND POISSON POPULATION MODELS ON COMPOUND POISSON POPULATION MODELS Martin Möhle, University of Tübingen (joint work with Thierry Huillet, Université de Cergy-Pontoise) Workshop on Probability, Population Genetics and Evolution Centre

More information

Mathematical models in population genetics II

Mathematical models in population genetics II Mathematical models in population genetics II Anand Bhaskar Evolutionary Biology and Theory of Computing Bootcamp January 1, 014 Quick recap Large discrete-time randomly mating Wright-Fisher population

More information

Endowed with an Extra Sense : Mathematics and Evolution

Endowed with an Extra Sense : Mathematics and Evolution Endowed with an Extra Sense : Mathematics and Evolution Todd Parsons Laboratoire de Probabilités et Modèles Aléatoires - Université Pierre et Marie Curie Center for Interdisciplinary Research in Biology

More information

Stochastic Demography, Coalescents, and Effective Population Size

Stochastic Demography, Coalescents, and Effective Population Size Demography Stochastic Demography, Coalescents, and Effective Population Size Steve Krone University of Idaho Department of Mathematics & IBEST Demographic effects bottlenecks, expansion, fluctuating population

More information

Evolution in a spatial continuum

Evolution in a spatial continuum Evolution in a spatial continuum Drift, draft and structure Alison Etheridge University of Oxford Joint work with Nick Barton (Edinburgh) and Tom Kurtz (Wisconsin) New York, Sept. 2007 p.1 Kingman s Coalescent

More information

Lecture 19 : Chinese restaurant process

Lecture 19 : Chinese restaurant process Lecture 9 : Chinese restaurant process MATH285K - Spring 200 Lecturer: Sebastien Roch References: [Dur08, Chapter 3] Previous class Recall Ewens sampling formula (ESF) THM 9 (Ewens sampling formula) Letting

More information

Genetic Variation in Finite Populations

Genetic Variation in Finite Populations Genetic Variation in Finite Populations The amount of genetic variation found in a population is influenced by two opposing forces: mutation and genetic drift. 1 Mutation tends to increase variation. 2

More information

Selection and Population Genetics

Selection and Population Genetics Selection and Population Genetics Evolution by natural selection can occur when three conditions are satisfied: Variation within populations - individuals have different traits (phenotypes). height and

More information

The Moran Process as a Markov Chain on Leaf-labeled Trees

The Moran Process as a Markov Chain on Leaf-labeled Trees The Moran Process as a Markov Chain on Leaf-labeled Trees David J. Aldous University of California Department of Statistics 367 Evans Hall # 3860 Berkeley CA 94720-3860 aldous@stat.berkeley.edu http://www.stat.berkeley.edu/users/aldous

More information

The mathematical challenge. Evolution in a spatial continuum. The mathematical challenge. Other recruits... The mathematical challenge

The mathematical challenge. Evolution in a spatial continuum. The mathematical challenge. Other recruits... The mathematical challenge The mathematical challenge What is the relative importance of mutation, selection, random drift and population subdivision for standing genetic variation? Evolution in a spatial continuum Al lison Etheridge

More information

Dynamics of the evolving Bolthausen-Sznitman coalescent. by Jason Schweinsberg University of California at San Diego.

Dynamics of the evolving Bolthausen-Sznitman coalescent. by Jason Schweinsberg University of California at San Diego. Dynamics of the evolving Bolthausen-Sznitman coalescent by Jason Schweinsberg University of California at San Diego Outline of Talk 1. The Moran model and Kingman s coalescent 2. The evolving Kingman s

More information

Frequency Spectra and Inference in Population Genetics

Frequency Spectra and Inference in Population Genetics Frequency Spectra and Inference in Population Genetics Although coalescent models have come to play a central role in population genetics, there are some situations where genealogies may not lead to efficient

More information

The Wright-Fisher Model and Genetic Drift

The Wright-Fisher Model and Genetic Drift The Wright-Fisher Model and Genetic Drift January 22, 2015 1 1 Hardy-Weinberg Equilibrium Our goal is to understand the dynamics of allele and genotype frequencies in an infinite, randomlymating population

More information

Population Genetics I. Bio

Population Genetics I. Bio Population Genetics I. Bio5488-2018 Don Conrad dconrad@genetics.wustl.edu Why study population genetics? Functional Inference Demographic inference: History of mankind is written in our DNA. We can learn

More information

classes with respect to an ancestral population some time t

classes with respect to an ancestral population some time t A GENEALOGICAL DESCRIPTION OF THE INFINITELY-MANY NEUTRAL ALLELES MODEL P. J. Donnelly S. Tavari Department of Statistical Science Department of Mathematics University College London University of Utah

More information

BIRS workshop Sept 6 11, 2009

BIRS workshop Sept 6 11, 2009 Diploid biparental Moran model with large offspring numbers and recombination Bjarki Eldon New mathematical challenges from molecular biology and genetics BIRS workshop Sept 6, 2009 Mendel s Laws The First

More information

An introduction to mathematical modeling of the genealogical process of genes

An introduction to mathematical modeling of the genealogical process of genes An introduction to mathematical modeling of the genealogical process of genes Rikard Hellman Kandidatuppsats i matematisk statistik Bachelor Thesis in Mathematical Statistics Kandidatuppsats 2009:3 Matematisk

More information

Populations in statistical genetics

Populations in statistical genetics Populations in statistical genetics What are they, and how can we infer them from whole genome data? Daniel Lawson Heilbronn Institute, University of Bristol www.paintmychromosomes.com Work with: January

More information

Wald Lecture 2 My Work in Genetics with Jason Schweinsbreg

Wald Lecture 2 My Work in Genetics with Jason Schweinsbreg Wald Lecture 2 My Work in Genetics with Jason Schweinsbreg Rick Durrett Rick Durrett (Cornell) Genetics with Jason 1 / 42 The Problem Given a population of size N, how long does it take until τ k the first

More information

Demography April 10, 2015

Demography April 10, 2015 Demography April 0, 205 Effective Population Size The Wright-Fisher model makes a number of strong assumptions which are clearly violated in many populations. For example, it is unlikely that any population

More information

Statistical population genetics

Statistical population genetics Statistical population genetics Lecture 7: Infinite alleles model Xavier Didelot Dept of Statistics, Univ of Oxford didelot@stats.ox.ac.uk Slide 111 of 161 Infinite alleles model We now discuss the effect

More information

CLOSED-FORM ASYMPTOTIC SAMPLING DISTRIBUTIONS UNDER THE COALESCENT WITH RECOMBINATION FOR AN ARBITRARY NUMBER OF LOCI

CLOSED-FORM ASYMPTOTIC SAMPLING DISTRIBUTIONS UNDER THE COALESCENT WITH RECOMBINATION FOR AN ARBITRARY NUMBER OF LOCI Adv. Appl. Prob. 44, 391 407 (01) Printed in Northern Ireland Applied Probability Trust 01 CLOSED-FORM ASYMPTOTIC SAMPLING DISTRIBUTIONS UNDER THE COALESCENT WITH RECOMBINATION FOR AN ARBITRARY NUMBER

More information

Processes of Evolution

Processes of Evolution 15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection

More information

J. Stat. Mech. (2013) P01006

J. Stat. Mech. (2013) P01006 Journal of Statistical Mechanics: Theory and Experiment Genealogies in simple models of evolution Éric Brunet and Bernard Derrida Laboratoire de Physique Statistique, École ormale Supérieure, UPMC, Université

More information

Theoretical Population Biology

Theoretical Population Biology Theoretical Population Biology 74 008 104 114 Contents lists available at ScienceDirect Theoretical Population Biology journal homepage: www.elsevier.com/locate/tpb A coalescent process with simultaneous

More information

A phase-type distribution approach to coalescent theory

A phase-type distribution approach to coalescent theory A phase-type distribution approach to coalescent theory Tuure Antonangeli Masteruppsats i matematisk statistik Master Thesis in Mathematical Statistics Masteruppsats 015:3 Matematisk statistik Maj 015

More information

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin

Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin Solutions to Even-Numbered Exercises to accompany An Introduction to Population Genetics: Theory and Applications Rasmus Nielsen Montgomery Slatkin CHAPTER 1 1.2 The expected homozygosity, given allele

More information

Evolution & Natural Selection

Evolution & Natural Selection Evolution & Natural Selection Learning Objectives Know what biological evolution is and understand the driving force behind biological evolution. know the major mechanisms that change allele frequencies

More information

Haploid & diploid recombination and their evolutionary impact

Haploid & diploid recombination and their evolutionary impact Haploid & diploid recombination and their evolutionary impact W. Garrett Mitchener College of Charleston Mathematics Department MitchenerG@cofc.edu http://mitchenerg.people.cofc.edu Introduction The basis

More information

A CHARACTERIZATION OF ANCESTRAL LIMIT PROCESSES ARISING IN HAPLOID. Abstract. conditions other limit processes do appear, where multiple mergers of

A CHARACTERIZATION OF ANCESTRAL LIMIT PROCESSES ARISING IN HAPLOID. Abstract. conditions other limit processes do appear, where multiple mergers of A CHARACTERIATIO OF ACESTRAL LIMIT PROCESSES ARISIG I HAPLOID POPULATIO GEETICS MODELS M. Mohle, Johannes Gutenberg-University, Mainz and S. Sagitov 2, Chalmers University of Technology, Goteborg Abstract

More information

The effects of a weak selection pressure in a spatially structured population

The effects of a weak selection pressure in a spatially structured population The effects of a weak selection pressure in a spatially structured population A.M. Etheridge, A. Véber and F. Yu CNRS - École Polytechnique Motivations Aim : Model the evolution of the genetic composition

More information

Ancestral Inference in Molecular Biology. Simon Tavaré. Saint-Flour, 2001

Ancestral Inference in Molecular Biology. Simon Tavaré. Saint-Flour, 2001 Ancestral Inference in Molecular Biology This is page i Printer: Opaque this Simon Tavaré Departments of Biological Sciences, Mathematics and Preventive Medicine, University of Southern California Saint-Flour,

More information

The two-parameter generalization of Ewens random partition structure

The two-parameter generalization of Ewens random partition structure The two-parameter generalization of Ewens random partition structure Jim Pitman Technical Report No. 345 Department of Statistics U.C. Berkeley CA 94720 March 25, 1992 Reprinted with an appendix and updated

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Australian bird data set comparison between Arlequin and other programs

Australian bird data set comparison between Arlequin and other programs Australian bird data set comparison between Arlequin and other programs Peter Beerli, Kevin Rowe March 7, 2006 1 Data set We used a data set of Australian birds in 5 populations. Kevin ran the program

More information

I of a gene sampled from a randomly mating popdation,

I of a gene sampled from a randomly mating popdation, Copyright 0 1987 by the Genetics Society of America Average Number of Nucleotide Differences in a From a Single Subpopulation: A Test for Population Subdivision Curtis Strobeck Department of Zoology, University

More information

Challenges when applying stochastic models to reconstruct the demographic history of populations.

Challenges when applying stochastic models to reconstruct the demographic history of populations. Challenges when applying stochastic models to reconstruct the demographic history of populations. Willy Rodríguez Institut de Mathématiques de Toulouse October 11, 2017 Outline 1 Introduction 2 Inverse

More information

Chapter 16: Evolutionary Theory

Chapter 16: Evolutionary Theory Chapter 16: Evolutionary Theory Section 1: Developing a Theory Evolution: Artificial Selection: Evolution: I. A Theory to Explain Change Over Time B. Charles Darwin C. Theory: D. Modern evolutionary theory

More information

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.

GENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory

More information

A representation for the semigroup of a two-level Fleming Viot process in terms of the Kingman nested coalescent

A representation for the semigroup of a two-level Fleming Viot process in terms of the Kingman nested coalescent A representation for the semigroup of a two-level Fleming Viot process in terms of the Kingman nested coalescent Airam Blancas June 11, 017 Abstract Simple nested coalescent were introduced in [1] to model

More information

ANCESTRAL PROCESSES WITH SELECTION: BRANCHING AND MORAN MODELS

ANCESTRAL PROCESSES WITH SELECTION: BRANCHING AND MORAN MODELS **************************************** BANACH CENTER PUBLICATIONS, VOLUME ** INSTITUTE OF MATHEMATICS Figure POLISH ACADEMY OF SCIENCES WARSZAWA 2* ANCESTRAL PROCESSES WITH SELECTION: BRANCHING AND MORAN

More information

Notes for MCTP Week 2, 2014

Notes for MCTP Week 2, 2014 Notes for MCTP Week 2, 2014 Lecture 1: Biological background Evolutionary biology and population genetics are highly interdisciplinary areas of research, with many contributions being made from mathematics,

More information

Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates

Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimates JOSEPH FELSENSTEIN Department of Genetics SK-50, University

More information

Coalescent based demographic inference. Daniel Wegmann University of Fribourg

Coalescent based demographic inference. Daniel Wegmann University of Fribourg Coalescent based demographic inference Daniel Wegmann University of Fribourg Introduction The current genetic diversity is the outcome of past evolutionary processes. Hence, we can use genetic diversity

More information

Crump Mode Jagers processes with neutral Poissonian mutations

Crump Mode Jagers processes with neutral Poissonian mutations Crump Mode Jagers processes with neutral Poissonian mutations Nicolas Champagnat 1 Amaury Lambert 2 1 INRIA Nancy, équipe TOSCA 2 UPMC Univ Paris 06, Laboratoire de Probabilités et Modèles Aléatoires Paris,

More information

URN MODELS: the Ewens Sampling Lemma

URN MODELS: the Ewens Sampling Lemma Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 3, 2014 1 2 3 4 Mutation Mutation: typical values for parameters Equilibrium Probability of fixation 5 6 Ewens Sampling

More information

The nested Kingman coalescent: speed of coming down from infinity. by Jason Schweinsberg (University of California at San Diego)

The nested Kingman coalescent: speed of coming down from infinity. by Jason Schweinsberg (University of California at San Diego) The nested Kingman coalescent: speed of coming down from infinity by Jason Schweinsberg (University of California at San Diego) Joint work with: Airam Blancas Benítez (Goethe Universität Frankfurt) Tim

More information

Similarity Measures and Clustering In Genetics

Similarity Measures and Clustering In Genetics Similarity Measures and Clustering In Genetics Daniel Lawson Heilbronn Institute for Mathematical Research School of mathematics University of Bristol www.paintmychromosomes.com Talk outline Introduction

More information

Properties of normal phylogenetic networks

Properties of normal phylogenetic networks Properties of normal phylogenetic networks Stephen J. Willson Department of Mathematics Iowa State University Ames, IA 50011 USA swillson@iastate.edu August 13, 2009 Abstract. A phylogenetic network is

More information

Counting All Possible Ancestral Configurations of Sample Sequences in Population Genetics

Counting All Possible Ancestral Configurations of Sample Sequences in Population Genetics 1 Counting All Possible Ancestral Configurations of Sample Sequences in Population Genetics Yun S. Song, Rune Lyngsø, and Jotun Hein Abstract Given a set D of input sequences, a genealogy for D can be

More information

Computational statistics

Computational statistics Computational statistics Combinatorial optimization Thierry Denœux February 2017 Thierry Denœux Computational statistics February 2017 1 / 37 Combinatorial optimization Assume we seek the maximum of f

More information

The Genealogy of a Sequence Subject to Purifying Selection at Multiple Sites

The Genealogy of a Sequence Subject to Purifying Selection at Multiple Sites The Genealogy of a Sequence Subject to Purifying Selection at Multiple Sites Scott Williamson and Maria E. Orive Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence We investigate

More information

Closed-form sampling formulas for the coalescent with recombination

Closed-form sampling formulas for the coalescent with recombination 0 / 21 Closed-form sampling formulas for the coalescent with recombination Yun S. Song CS Division and Department of Statistics University of California, Berkeley September 7, 2009 Joint work with Paul

More information

π b = a π a P a,b = Q a,b δ + o(δ) = 1 + Q a,a δ + o(δ) = I 4 + Qδ + o(δ),

π b = a π a P a,b = Q a,b δ + o(δ) = 1 + Q a,a δ + o(δ) = I 4 + Qδ + o(δ), ABC estimation of the scaled effective population size. Geoff Nicholls, DTC 07/05/08 Refer to http://www.stats.ox.ac.uk/~nicholls/dtc/tt08/ for material. We will begin with a practical on ABC estimation

More information

Mathematical Population Genetics II

Mathematical Population Genetics II Mathematical Population Genetics II Lecture Notes Joachim Hermisson March 20, 2015 University of Vienna Mathematics Department Oskar-Morgenstern-Platz 1 1090 Vienna, Austria Copyright (c) 2013/14/15 Joachim

More information

From Individual-based Population Models to Lineage-based Models of Phylogenies

From Individual-based Population Models to Lineage-based Models of Phylogenies From Individual-based Population Models to Lineage-based Models of Phylogenies Amaury Lambert (joint works with G. Achaz, H.K. Alexander, R.S. Etienne, N. Lartillot, H. Morlon, T.L. Parsons, T. Stadler)

More information

RW in dynamic random environment generated by the reversal of discrete time contact process

RW in dynamic random environment generated by the reversal of discrete time contact process RW in dynamic random environment generated by the reversal of discrete time contact process Andrej Depperschmidt joint with Matthias Birkner, Jiří Černý and Nina Gantert Ulaanbaatar, 07/08/2015 Motivation

More information

Microevolution Changing Allele Frequencies

Microevolution Changing Allele Frequencies Microevolution Changing Allele Frequencies Evolution Evolution is defined as a change in the inherited characteristics of biological populations over successive generations. Microevolution involves the

More information

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals.

1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals. 1. The diagram below shows two processes (A and B) involved in sexual reproduction in plants and animals. Which statement best explains how these processes often produce offspring that have traits not

More information

LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS

LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS LIFE SCIENCE CHAPTER 5 & 6 FLASHCARDS Why were ratios important in Mendel s work? A. They showed that heredity does not follow a set pattern. B. They showed that some traits are never passed on. C. They

More information

Computing likelihoods under Λ-coalescents

Computing likelihoods under Λ-coalescents Computing likelihoods under Λ-coalescents Matthias Birkner LMU Munich, Dept. Biology II Joint work with Jochen Blath and Matthias Steinrücken, TU Berlin New Mathematical Challenges from Molecular Biology,

More information

Mathematical Population Genetics II

Mathematical Population Genetics II Mathematical Population Genetics II Lecture Notes Joachim Hermisson June 9, 2018 University of Vienna Mathematics Department Oskar-Morgenstern-Platz 1 1090 Vienna, Austria Copyright (c) 2013/14/15/18 Joachim

More information

On The Mutation Parameter of Ewens Sampling. Formula

On The Mutation Parameter of Ewens Sampling. Formula On The Mutation Parameter of Ewens Sampling Formula ON THE MUTATION PARAMETER OF EWENS SAMPLING FORMULA BY BENEDICT MIN-OO, B.Sc. a thesis submitted to the department of mathematics & statistics and the

More information

Introduction to population genetics & evolution

Introduction to population genetics & evolution Introduction to population genetics & evolution Course Organization Exam dates: Feb 19 March 1st Has everybody registered? Did you get the email with the exam schedule Summer seminar: Hot topics in Bioinformatics

More information

Pathwise construction of tree-valued Fleming-Viot processes

Pathwise construction of tree-valued Fleming-Viot processes Pathwise construction of tree-valued Fleming-Viot processes Stephan Gufler November 9, 2018 arxiv:1404.3682v4 [math.pr] 27 Dec 2017 Abstract In a random complete and separable metric space that we call

More information

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT

NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE. Manuscript received September 17, 1973 ABSTRACT NATURAL SELECTION FOR WITHIN-GENERATION VARIANCE IN OFFSPRING NUMBER JOHN H. GILLESPIE Department of Biology, University of Penmyluania, Philadelphia, Pennsyluania 19174 Manuscript received September 17,

More information

Stirling Numbers of the 1st Kind

Stirling Numbers of the 1st Kind Daniel Reiss, Colebrook Jackson, Brad Dallas Western Washington University November 28, 2012 Introduction The set of all permutations of a set N is denoted S(N), while the set of all permutations of {1,

More information

The Evolution of Gene Dominance through the. Baldwin Effect

The Evolution of Gene Dominance through the. Baldwin Effect The Evolution of Gene Dominance through the Baldwin Effect Larry Bull Computer Science Research Centre Department of Computer Science & Creative Technologies University of the West of England, Bristol

More information

WXML Final Report: Chinese Restaurant Process

WXML Final Report: Chinese Restaurant Process WXML Final Report: Chinese Restaurant Process Dr. Noah Forman, Gerandy Brito, Alex Forney, Yiruey Chou, Chengning Li Spring 2017 1 Introduction The Chinese Restaurant Process (CRP) generates random partitions

More information

It is indeed a pleasure for me to contribute to this dedicatory volume

It is indeed a pleasure for me to contribute to this dedicatory volume CHAPTER THREE The Genealogy of the Birth, Death, and Immigration Process Simon. Tavark 3.1. Introduction It is indeed a pleasure for me to contribute to this dedicatory volume for Professor Samuel Karlin.

More information

Incomplete Lineage Sorting: Consistent Phylogeny Estimation From Multiple Loci

Incomplete Lineage Sorting: Consistent Phylogeny Estimation From Multiple Loci University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1-2010 Incomplete Lineage Sorting: Consistent Phylogeny Estimation From Multiple Loci Elchanan Mossel University of

More information

Introduction to Digital Evolution Handout Answers

Introduction to Digital Evolution Handout Answers Introduction to Digital Evolution Handout Answers Note to teacher: The questions in this handout and the suggested answers (in red, below) are meant to guide discussion, not be an assessment. It is recommended

More information

The Λ-Fleming-Viot process and a connection with Wright-Fisher diffusion. Bob Griffiths University of Oxford

The Λ-Fleming-Viot process and a connection with Wright-Fisher diffusion. Bob Griffiths University of Oxford The Λ-Fleming-Viot process and a connection with Wright-Fisher diffusion Bob Griffiths University of Oxford A d-dimensional Λ-Fleming-Viot process {X(t)} t 0 representing frequencies of d types of individuals

More information

Lecture 1. References

Lecture 1. References There follows an annotated bibliography. It is extensive, but still far from complete. I have separated the primary sources which form a less intimidating list. If I have the energy I ll add a section

More information

Evolution. Species Changing over time

Evolution. Species Changing over time Evolution Species Changing over time Objectives I can differentiate between natural selection and artificial selection and I can give examples of each. I can explain several reasons for genetic variation

More information

The coalescent process

The coalescent process The coalescent process Introduction Random drift can be seen in several ways Forwards in time: variation in allele frequency Backwards in time: a process of inbreeding//coalescence Allele frequencies Random

More information

Modeling Evolution DAVID EPSTEIN CELEBRATION. John Milnor. Warwick University, July 14, Stony Brook University

Modeling Evolution DAVID EPSTEIN CELEBRATION. John Milnor. Warwick University, July 14, Stony Brook University Modeling Evolution John Milnor Stony Brook University DAVID EPSTEIN CELEBRATION Warwick University, July 14, 2007 A First Model for Evolution: The Phylogenetic Tree, U time V W X A B C D E F G 1500 1000

More information

Concepts and Methods in Molecular Divergence Time Estimation

Concepts and Methods in Molecular Divergence Time Estimation Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks

More information

REVIEW 6: EVOLUTION. 1. Define evolution: Was not the first to think of evolution, but he did figure out how it works (mostly).

REVIEW 6: EVOLUTION. 1. Define evolution: Was not the first to think of evolution, but he did figure out how it works (mostly). Name: REVIEW 6: EVOLUTION 1. Define evolution: 2. Modern Theory of Evolution: a. Charles Darwin: Was not the first to think of evolution, but he did figure out how it works (mostly). However, Darwin didn

More information

REVERSIBLE MARKOV STRUCTURES ON DIVISIBLE SET PAR- TITIONS

REVERSIBLE MARKOV STRUCTURES ON DIVISIBLE SET PAR- TITIONS Applied Probability Trust (29 September 2014) REVERSIBLE MARKOV STRUCTURES ON DIVISIBLE SET PAR- TITIONS HARRY CRANE, Rutgers University PETER MCCULLAGH, University of Chicago Abstract We study k-divisible

More information

Some mathematical models from population genetics

Some mathematical models from population genetics Some mathematical models from population genetics 5: Muller s ratchet and the rate of adaptation Alison Etheridge University of Oxford joint work with Peter Pfaffelhuber (Vienna), Anton Wakolbinger (Frankfurt)

More information

mrna Codon Table Mutant Dinosaur Name: Period:

mrna Codon Table Mutant Dinosaur Name: Period: Mutant Dinosaur Name: Period: Intro Your dinosaur is born with a new genetic mutation. Your job is to map out the genes that are influenced by the mutation and to discover how the new dinosaurs interact

More information

Regents Biology REVIEW 6: EVOLUTION. 1. Define evolution:

Regents Biology REVIEW 6: EVOLUTION. 1. Define evolution: Period Date REVIEW 6: EVOLUTION 1. Define evolution: 2. Modern Theory of Evolution: a. Charles Darwin: Was not the first to think of evolution, but he did figure out how it works (mostly). However, Darwin

More information

Tree sets. Reinhard Diestel

Tree sets. Reinhard Diestel 1 Tree sets Reinhard Diestel Abstract We study an abstract notion of tree structure which generalizes treedecompositions of graphs and matroids. Unlike tree-decompositions, which are too closely linked

More information

Coalescent Theory for Seed Bank Models

Coalescent Theory for Seed Bank Models Coalescent Theory for Seed Bank Models June 8, 001 Ingemar Kaj 1, Stephen M. Krone, Martin Lascoux 3 1 Department of Mathematics, Uppsala University, S-751 06 Uppsala, Sweden. e-mail: ikaj@math.uu.se Department

More information

Name Period. 3. How many rounds of DNA replication and cell division occur during meiosis?

Name Period. 3. How many rounds of DNA replication and cell division occur during meiosis? Name Period GENERAL BIOLOGY Second Semester Study Guide Chapters 3, 4, 5, 6, 11, 14, 16, 17, 18 and 19. SEXUAL REPRODUCTION AND MEIOSIS 1. What is the purpose of meiosis? 2. Distinguish between diploid

More information

Chapter 5 Evolution of Biodiversity. Sunday, October 1, 17

Chapter 5 Evolution of Biodiversity. Sunday, October 1, 17 Chapter 5 Evolution of Biodiversity CHAPTER INTRO: The Dung of the Devil Read and Answer Questions Provided Module 14 The Biodiversity of Earth After reading this module you should be able to understand

More information

Follow links Class Use and other Permissions. For more information, send to:

Follow links Class Use and other Permissions. For more information, send  to: COPYRIGHT NOTICE: Sarah P. Otto & Troy Day: A Biologist's Guide to Mathematical Modeling in Ecology and Evolution is published by Princeton University Press and copyrighted, 2007, by Princeton University

More information

Neutral behavior of shared polymorphism

Neutral behavior of shared polymorphism Proc. Natl. Acad. Sci. USA Vol. 94, pp. 7730 7734, July 1997 Colloquium Paper This paper was presented at a colloquium entitled Genetics and the Origin of Species, organized by Francisco J. Ayala (Co-chair)

More information

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.

Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall

More information

Chapter 13- Reproduction, Meiosis, and Life Cycles. Many plants and other organisms depend on sexual reproduction.

Chapter 13- Reproduction, Meiosis, and Life Cycles. Many plants and other organisms depend on sexual reproduction. Chapter 13- Reproduction, Meiosis, and Life Cycles Many plants and other organisms depend on sexual reproduction. Flowers are the sexual reproductive organ systems of angiosperms. Sexual reproduction gametes

More information

Evolutionary dynamics on graphs

Evolutionary dynamics on graphs Evolutionary dynamics on graphs Laura Hindersin May 4th 2015 Max-Planck-Institut für Evolutionsbiologie, Plön Evolutionary dynamics Main ingredients: Fitness: The ability to survive and reproduce. Selection

More information