Bayesian models for macroevolutionary studies
|
|
- Antonia Cobb
- 5 years ago
- Views:
Transcription
1 Bayesian models for macroevolutionary studies Nicolas Lartillot February 28, 2012 Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
2 The molecular clock 8%&9%:&'9%+.*;:-(+ "#$%&$'()*+'()+,'"*-(./+0123/+4-5"&'+/ ρ = µ f 0 substitution rate = mutation rate fraction of neutral mutations
3 Variation of the substitution rate among lineages AARDVARK GOLDENMOLE TENRECID ELEPHANTULUS MACROSCELIDES ELEPHANT HYRAX SIRENIAN ANTEATER SLOTH ARMADILLO CAT DOG PANGOLIN FLYINGFOX PHYLLOSTOMID HORSE RHINO TAPIR COW DELPHINOID WHALE HIPPO PIG LLAMA HEDGEHOG SHREW MOLE CAVIOMORPH MOUSE RAT SCIURID PIKA RABBIT FLYINGLEMUR HUMAN LEMUR TREESHREW 0.1 subs per site concatenation of 13 nuclear genes, 38 placentals
4 Variation of the substitution rate among lineages Possible causes generation-time effect time metabolic rate effects selection for longevity (reviewed in Lanfear et al, 2010) = AARDVARK GOLDENMOLE TENRECID ELEPHANTULUS MACROSCELIDES ELEPHANT HYRAX SIRENIAN ANTEATER SLOTH ARMADILLO CAT DOG PANGOLIN FLYINGFOX PHYLLOSTOMID HORSE RHINO TAPIR COW DELPHINOID WHALE HIPPO PIG LLAMA HEDGEHOG SHREW MOLE CAVIOMORPH MOUSE RAT SCIURID PIKA RABBIT FLYINGLEMUR HUMAN LEMUR TREESHREW 0.1 subs per site concatenation of 13 nuclear genes, 38 placentals
5 Introduction Tree shape and diversification patterns Nee et al, 1994 modeling speciation-extinction processes using time-calibrated phylogenies to fit and compare models Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
6 tion event, a new 10 l 2 is drawn from the dis- (ii) Cope s rule with tinction alone, and (v timal body size, has the best empirical support a probability calculated only from the species size. produced size distrib [model selection by likelihood ratio test theandcretaceous-tertiary We specified a basal extinction boundary rate b by(19, assuming 20, 25). close to the empirical Bayesian information criterion (24)]. ThisFurther, model that a limit the number in this of vicinity Recent terrestrial is supported mammal species by both dicts a particular exti includes both a strengthening of Cope s rule for small-bodied species (x ~ < 32 g) and a small but is close to a putative carrying capacity. We then let extinction risk per unit time increase logarithmically that could be tested data. with body size (24, 28). uniformly positive bias for larger species, resulting Diffusion This model and leaves To further discrim in an average body-size growth of 4.1 ± 1.0% only the rate r by which riskshort term increases with size planations for the sp selective advantages tested simpler diffus rameters estimated fr A B ing (i) unbiased diffu 10 (iii) Cope s rule alon that omits the incre 2 Long term risk species (x < ~ 32 g). We x A of extinction 1 predicted size distrib Lower 0.5 times dramatically so x D boundary bution (Fig. 3, B and effects Additionally, we found for large-bodied sp Ancestor descendant data extinction risk increa Estimated model (µ, σ) Body size (g) results support the i lower limit, the diffu Ancestor body (g) increasing risk of ext Clauset Fig. 1. et Smoothed al, 2008 species body-size distribution of Fig. 2. (A) A schematic illustrating a simple4002 cladogenetic Recent diffusion terrestrial model of mammals species body-size [dataevolution, from (21)], an increased bias tow where the size of a descendant species x D is related to its ancestor s size x A by a multiplicative factor l.(b) bodied species (x ~ < 3 Empirical data on 1106 changes in North American showing mammalian the three body size macroevolutionary [data from (20)], as a function processesthus, the shape of ancestor size, overlaid with the estimated that model shape of within-lineage the relative changes, abundances where the average of different logchange can be interpreted in log l varies piecewise as a function of body size (24). macroevolutionary pr n body size (17) could also produce a simape. Furthermore, different mechanisms ive body-size evolution on spatial and temscales (3), and the importance of interic competition to the macroevolutionary ics of species body size is not known. developed a generalized diffusion model ies body-size evolution, in which the size ution is the product of three macroevoluprocesses (Fig. 1). We combine these ses, each of which has been independently Extinction and body size d (1, 2, 17, 18), in a single quantitative ork, estimate its parameters from fossil n extinct terrestrial mammals from before te Quaternary (19, 20), and determine er this model, or simpler variants, can ree the sizes of the 4002 known extant and terrestrial mammal species from the late nary (Recent species) (21, 22). is model makes three assumptions: (i) s size varies over evolutionary time as a enetic Equilibrium multiplicative between diffusion process ; the size of a descendant species x D is the t of a stochastic growth factor l and its or s size x A, that is, x D = lx A. For each coupling between tree shape and body size evolution A Empirical data intra-lineage trend towards larger 1 body B size Diffusion model higher risk of extinction in larger animals ion Multiplicative change in Log body Density size, λ sizes. The left tail of the distribution is created by diffusion in the vicinity of a taxon-specific lower limit near 2 g, whereas the long right tail is produced by the interaction of diffusion over evolutionary time (including trends like Cope s rule) and the long-term risk of extinction 10 2 from increased body size. ion ion 10 1 C 10 2 Downloaded
7 Introduction Motivation Macro-evolutionary studies reconstructing divergence times testing diversification models testing models about life-history evolution understanding driving forces of molecular evolution correlation diversification / life-history / substitution patterns Potential methodological problems current approaches proceed sequentially problems of bias / error propagation lack of flexibility calls for more integrative approaches Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
8 Estimating divergence times: the relaxed clock model r t sequence alignment &'(#'('#('&#')'#***# &'&#'&'#('&#')'#***# &'(#'&'#()&#')'#***# &'(#'&'#()&#')'#***# %# $# "# Data and constraints multiple alignment D (here, nuclear coding genes in mammals) tree topology T, and fossil calibrations Φ
9 Estimating divergence times: the relaxed clock model r t sequence alignment &'(#'('#('&#')'#***# &'&#'&'#('&#')'#***# &'(#'&'#()&#')'#***# &'(#'&'#()&#')'#***# %# $# "# Hierarchial Bayesian model diversification process (e.g. birth-death, parameters λ, µ, ρ) substitution rate: Brownian log-normal process (variance σ 2 ) substitution process (4x4 substitution matrix Q) parameters: θ = (λ, µ, ρ, σ, t, r)
10 Estimating divergence times: the relaxed clock model r t sequence alignment &'(#'('#('&#')'#***# &'&#'&'#('&#')'#***# &'(#'&'#()&#')'#***# &'(#'&'#()&#')'#***# %# $# "# Diversification process diversification process (e.g. birth-death, parameters λ, µ, ρ) tree topology T, fossil constraints Φ t: vector of divergence times p(t λ, µ, ρ, Φ, T ) = p(t λ, µ, ρ) Kendall 1948, Nee et al 1994, Yang and Rannala, 2006
11 Molecular dating Brownian process lnr j ~ N(ln r jup,"#t) r j r j up l = r "t %# $# "# "t = t 2 # t 1 joint probability: p(d T,l,Q)p(r t,")p(t)p(")p(q) x t = ln r t dx t = σdb t x t N(x 0, σ 2 t) P taxa, j = 1..2P 2 branches p(r r 0, σ 2, t) = j p(r j r jup, t j, σ 2 ) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
12 Molecular dating Substitution process sequence alignment r j &'(#'('#('&#')'#***# r j up l = r "t &'&#'&'#('&#')'#***# &'(#'&'#()&#')'#***# &'(#'&'#()&#')'#***# %# $# "# "t = t 2 # t 1 probability of transition between nucleotides a and b over time t: r = 1 t t1 r t dt r j + r jup t 2 2 p(a b r, t, Q) = [exp(r t Q)] ab Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
13 Molecular dating Relaxed clock model lnr j ~ N(ln r jup,"#t) r j r j up l = r "t %# $# "# "t = t 2 # t 1 joint probability: joint probability: p(d T,l,Q)p(r t,")p(t)p(")p(q) p(t λ, µ, ρ) p(r t, σ 2 ) p(d r, t, Q) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
14 Molecular dating Bayes theorem: general formulation p(θ D) = p(d) = p(d θ)p(θ) p(d) p(d θ)p(θ)dθ θ model parameters D Data p(θ) prior distribution p(d θ) likelihood p(d) evidence, or marginal likelihood p(θ D) posterior distribution Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
15 Sampling from the posterior: the Metropolis algorithm Histogram of out2[[1]] 1 "out2" Density Metropolis algorithm draw U Uniform[0, 1] random walk set θk = θ k + δ(u 0.5), possibly reflecting back into [0, 1] { } accept with probability q = min iterate 1, p(θ k D) p(θ k D) (θ k ) k N is a Markov chain with stationary distirbution p(θ f )
16 Molecular dating Relaxed clock model lnr j ~ N(ln r jup,"#t) r j r j up l = r "t %# $# "# "t = t 2 # t 1 joint probability: posterior proportional to joint probability: p(d T,l,Q)p(r t,")p(t)p(")p(q) p(λ, µ, ρ) p(σ 2 ) p(t λ, µ, ρ) p(r t, σ 2 ) p(d r, t, Q) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
17 Molecular dating Relaxed clock model r j " r j # %# $# "# Metropolis Hastings on rates α = p(d r, t, Q) p(r t, σ 2 ) p(d r, t, Q) p(r t, σ 2 ) α > 1: accept move α < 1: accept with prob. α Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
18 Molecular dating Relaxed clock model %# $# "# t j " t # j Metropolis Hastings on divergence times α = p(d r, t, Q) p(r t, σ 2 ) p(t λ, µ, ρ) p(d r, t, Q) p(r t, σ 2 ) p(t λ, µ, ρ) α > 1: accept move α < 1: accept with prob. α Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
19 Molecular dating Relaxed clock model lnr j ~ N(ln r jup,"#t) r j r j up l = r "t %# $# "# "t = t 2 # t 1 joint probability: p(d T,l,Q)p(r t,")p(t)p(")p(q) Metropolis Hastings on σ 2 α = p(r t, σ 2 ) p(σ 2 ) p(r t, σ 2 ) p(σ 2 ) α > 1: accept move α < 1: accept with prob. α Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
20 Molecular dating Relaxed clock model lnr j ~ N(ln r jup,"#t) r j r j up l = r "t %# $# "# "t = t 2 # t 1 joint probability: Metropolis Hastings on λ, µ, ρ p(d T,l,Q)p(r t,")p(t)p(")p(q) α = p(t λ, µ, ρ ) p(λ, µ, ρ ) p(t λ, µ, ρ) p(λ, µ, ρ) α > 1: accept move α < 1: accept with prob. α Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
21 Posterior mean times and rates Divergence times and substitution rat rate time CAT DOG PANGOLIN FLYINGFOX PHYLLOSTOM HORSE RHINO TAPIR COW DELPHINOID WHALE HIPPO PIG LLAMA HEDGEHOG SHREW MOLE CAVIOMORPH MOUSE RAT SCIURID PIKA RABBIT FLYINGLEMU HUMAN STREPSIRRH TREESHREW GOLDENMOLE TENRECID LOEARELESH SHEARELESH AARDVARK ELEPHANT HYRAX SIRENIAN ANTEATER SLOTH ARMADILLO DIDELPHIS MONODELPHI PLATYPUS 100 KT 0 Myrs (Thorne et al 1998, Lepage et al 2007, Rannala and Yang 2007) Multidivtime, PhyloBayes, MCMCtree
22 Substitution-calibrated versus time-calibrated trees AARDVARK GOLDENMOLE TENRECID ELEPHANTULUS MACROSCELIDES ELEPHANT HYRAX SIRENIAN ANTEATER SLOTH ARMADILLO CAT DOG PANGOLIN FLYINGFOX PHYLLOSTOMID HORSE RHINO TAPIR COW DELPHINOID WHALE HIPPO PIG LLAMA HEDGEHOG SHREW MOLE CAVIOMORPH MOUSE RAT SCIURID PIKA RABBIT FLYINGLEMUR HUMAN LEMUR TREESHREW 0.1 subs per site
23 Correlating rates and life-history traits rate / mass regression log mass log subs. rate correcting for phylogenetic inertia (independent contrasts) sequential method: error propagation no feedback of rate variations on life-history evolution
24 Coupling substitution process with life-history evolution &'#"()# *+,-#./00# 01*02#3/4# r 3 r 2 r 1 sequence alignment $ " = 2 #1 ' & ) %#1 1 ( covariance matrix l 2 = r 2 "t "()# )2(5# 5)# 789#898#987#8:8#222# 787#878#987#8:8#222# 789#878#9:7#8:8#222# (6))# 789#878#9:7#8:8#222# %# $# "# kg (Lartillot and Poujol, 2011, Molecular Biology and Evolution) Data and constraints multiple alignment D (nuclear coding genes in mammals) matrix of quantitative characters (C) (life-history traits) tree topology (T ), and fossil calibrations (Φ)
25 Coupling substitution process with life-history evolution &'#"()# *+,-#./00# 01*02#3/4# r 3 r 2 r 1 sequence alignment $ " = 2 #1 ' & ) %#1 1 ( covariance matrix l 2 = r 2 "t "()# )2(5# 5)# 789#898#987#8:8#222# 787#878#987#8:8#222# 789#878#9:7#8:8#222# (6))# 789#878#9:7#8:8#222# %# $# "# kg (Lartillot and Poujol, 2011, Molecular Biology and Evolution) Hierarchial Bayesian model (parameter estimation by MCMC) diversification process t (birth-death, parameters λ, µ, ρ) Brownian multivariate process X (covariance matrix Σ) time-dependent codon model Q
26 Codon model Mutation matrix U (4 x 4) A C G T A γ κγ 1 C 1 γ κ G κ γ 1 T 1 κγ γ κ: transition-transversion ratio γ = GC /(1 GC ) (GC : equilibrium GC) Codon substitution matrix Q (61 x 61) Q c1 c 2 = U n1 n 2, Q c1 c 2 = U n1 n 2 ω, c 1, c 2 synonymous, c 1, c 2 non-synonymous. Muse and Gaut 1994
27 Phylogenetic covariance model Generalization time-dependent substitution parameters rate of synonymous substitution (r) non-synonymous / synonymous ratio (ω) equilibrium GC (γ) time-dependent quantitative traits sexual maturity (proxy of generation time) adult body mass maximum recorded lifespan (proxy of longevity) metabolic rate genome size karyotypic number (number of chromosomes 2n) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
28 Coupling substitution process with life-history evolution &'#"()# *+,-#./00# 01*02#3/4# r 3 r 2 r 1 sequence alignment $ " = 2 #1 ' & ) %#1 1 ( covariance matrix l 2 = r 2 "t "()# )2(5# 5)# 789#898#987#8:8#222# 787#878#987#8:8#222# 789#878#9:7#8:8#222# (6))# 789#878#9:7#8:8#222# %# $# "# kg (Lartillot and Poujol, 2011, Molecular Biology and Evolution) posterior proportional to joint probability: p(λ, µ, ρ) p(t λ, µ, ρ) p(σ) p(x t, Σ) p(d X, t)
29 Phylogenetic covariance model Conjugate sampling Data augmentation: substitution mappings resample mapping conditional on current model parameters resample parameters conditional on current mapping Lartillot, 2006, J. Comput Biol. Conjugacy on covariance matrix - Brownian multivariate process Σ W 1 (Σ 0, q) p(σ Σ 0, q) Σ 0 q 2 Σ q+m+1 2 e 1 2 tr(σ 0Σ 1), p(σ): Inverse-Wishart of parameters Σ 0 = κi, q d.f. p(x Σ, t): Multivariate normal p(σ X, t): Inverse-Wishart of parameters Σ + A, q + 2P 2 d.f. Lartillot and Poujol 2011, Mol. Biol. Evol. Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
30 Phylogenetic covariance model Conjugate sampling Conjugacy on covariance matrix - Brownian multivariate process p(σ): Inverse-Wishart of parameters Σ 0 = κi, q d.f. p(x Σ, t): Multivariate normal p(σ X, t): Inverse-Wishart of parameters Σ + A, q + 2P 2 d.f. A: empirical cov. matrix of branchwise contrasts: (X j X jup )/ t j. Integrating the joint probability over Σ: p(λ, µ, ρ) p(t λ, µ, ρ) ( ) p(x t, Σ)p(Σ)dΣ p(d X, t) MCMC marginalized over Σ for each point θ k of the MCMC: Σ k p(σ k X) Lartillot and Poujol 2011, Mol. Biol. Evol. Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
31 Results Rates, dates and traits 1. Nuclear data: correlates of synonymous rate ds ds dn/ds mat. long. mass met. dn/ds maturity longevity mass red: blue: light shade: positive negative not significant metabolic rate strong correlations between life-history traits ds correlates negatively with body mass, gen. time and longevity R 2 : life-history variations explain 35% of synonymous rate. dn/ds positively correlated with body size (nearly neutral effect?) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
32 Inferring divergence times and body size evolution Manis Ailuropoda Canis Felis Panthera Equus Ceratotherium Tapirus Tadarida Antrozous Myotis Artibeus Nycteris Megaderma Pteropus Rousettus Sus Bos Tragelaphus Hippopotamus Megaptera Tursiops Lama Vicugna Solenodon Erinaceus Sorex Galemys Talpa Tarsius Callithrix Macaca Pongo Gorilla Homo Pan Otolemur Lemur Microcebus Cynocephalus Galeopterus Ptilocercus Tupaia Castor Dipodomys Pedetes Mus Rattus Hystrix Erethizon Cavia Hydrochoerus Muscardinus Spermophilus Tamias Ochotona Oryctolagus Sylvilagus Orycteropus Amblysomus Echinops Elephantulus Macroscelides Loxodonta Procavia Trichechus Dasypus Chaetophractus Euphractus Choloepusdi Choloepusho Myrmecophaga Tamandua Myrs KT
33 The evolution of body size CAT 1 kg 10 kg DOG PANGOLIN FLYINGFOX PHYLLOSTOMID HORSE Pakicetids 100 kg RHINO TAPIR 1000 kg COW DELPHINOID WHALE HIPPO PIG LLAMA HEDGEHOG SHREW MOLE (Thewissen et al, 2001) CAVIOMORPH MOUSE RAT SCIURID PIKA RABBIT Hippo Whale ancestor FLYINGLEMUR HUMAN LEMUR TREESHREW GOLDENMOLE TENRECID ELEPHANTULUS MACROSCELIDES AARDVARK ELEPHANT HYRAX SIRENIAN ANTEATER SLOTH ARMADILLO DIDELPHIS post. density coupled < KT uncoupled MONODELPHIS PLATYPUS log 10 Mass (g) 100 KT 0 Myrs
34 Systematic trends substitution rate body mass Cope s or Stanley s rule intra-lineage drive towards larger body size more frequent extinction of large-bodied mammals needs to be explicitely modeled (directed Brownian motion) possible impact in estimated divergence times (Welch 2008) connections with mass-dependent extinction (FitzJohn, 2010).
35 Results GC landscapes and biased genes conversion 2. Equilibrium GC (GC ) in nuclear genomes ds dn/ds GC* mat. mass long. # chrom. ds dn/ds GC* maturity red: blue: light shade: positive negative not significant mass longevity # chromosomes negative correlation between GC and body size positive correlation between GC and number of chromosomes Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
36 Results GC landscapes and biased genes conversion Biased gene conversion (BGC) during meiosis GC is overtransmitted GC-biased mismatch repair A A T = T A T C G C G C G C G C G adapted from Duret and Galtier 2009 x GC = 1 + b 2 x AT = 1 b 2 meiotic distorsion bias b positive selection for GC b proportional to local recombination rate Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
37 Biased gene conversion explains variations of GC ds ds dn/ds gc mat. mass long. # chrom. dn/ds gc maturity red: blue: light shade: positive negative not significant mass longevity # chromosomes Negative correlation GC / body size larger animals = smaller population = less efficient selection also less efficient BGC (lower GC )
38 Biased gene conversion explains variations of GC F. Pardo-Manuel de Villena, C. Sapienza: Recombination and chromosome arms ds ds dn/ds gc mat. mass long. # chrom. dn/ds gc maturity mass longevity # chromosomes Fig. 1. Plot of mammalian recombination frequency as a function of haploid number of chromosome arms (FN/2). a) Recombination estimated as number of chiasma de Villena (NC). Each and circlesapienza, represents a species: , Dasyuroides byrnei; 2, Dasyurus viverrinus; 3, Sarcophilus harrissi; 4, Smithopsis crassicaudata; 5, Paremeles gunnii; 6, Isoodon macrourus; 7, Dasypus novemcinctus; 8, Oryctolagus cuniculus; 9, Cricetus cricetus; 10, Lagurus lagurus; 11, Meriones ungiculatus; 12, Apodemus sylvaticus; 13, Rattus norvegicus; 14, Mus musculus; 15, Cebuella pygmamaea; 16, Sanguinus oedipus; 17, Macaca fuscata; 18, Macaca mulatta; 19, Macaca nemestrina; 20, Pan troglodytes; 21, Homo sapiens; 22, Homo sapiens; 23, Mandrillux sphinx; 24, Cebus capucinus; 25, Lemur fulvus; 26, Lemur fulvus collaris; 27, Lemur fulvus albocollaris; 28, Akodon arviculoides; 29; Akodon arviculoides; 30, Akodon sp; 31, Akodon nigris; 32, Zygodontomys lasiurus; 33, Clyomys laticeps; 34; Nectomys squamipes; 35, Nectomys squamypes; 36, Oxymicterus sp; 37, Euryzygomalomys guiara; 38, Proechimys iheringi; 39, Mus musculus; 40, Mesocricetus auratus; 41, Positive correlation GC / chromosome number conversion bias proportional to recombination rate 1 recombination event per chromosome arm per meiosis recombination rate inversely proportional to chromosome size linkage maps are provisional, they are in good agreement with the regression line. Both the regression line for all species in Fig. 1a (NC 0.85(NF/2) + 6.2; R ; F < "17, in Fig. 1a) and the average NC per chromosome arm (1.06 and 1.16 in Dutrillaux (1986) and Burt and Bell (1987), respectively) suggest that there is stronger gene conversion bias in more fragmented karyotypes Cricetulus griseus data in species re data reported by number of haploid data in species rep the regression lin data is denoted as map. Each filled c 1, cattle; 2, dog; species not includ 9, cat; 10, horse; a each species is as which we estimate correcting for the used in their stud ingly, the centro combination an and human (Do Mathani and W establishes imp crossovers for a
39 Results GC landscapes and biased genes conversion Mutation selection model Kimura 1982 Substitution process (low mutation approx.) Substitution rate = mutation rate x fixation probability r = 2Nu p = u 2Np = up u: mutation rate p: fixation probability N: effective population size P = 2Np: scaled fixation probability (relative to neutral) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
40 Results GC landscapes and biased genes conversion Fixation probability of a mutation CARTWRIGHT, LARTILLOT, AND THORNE STUDYING ANCESTRAL LINEAGES 23 FIGURES (a) Ancestral Lineage (b) Ancestral Lineages Relating Three Species FIGURE 1. neutral case p 0 = 1 2N general case p = 2s 1 e 4Ns Scaled fixation probability P = 2Np = 4Ns 1 e 4Ns = S 1 e S S = 4Ns: scaled selection coefficient Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
41 Results GC landscapes and biased genes conversion Scaled fixation probability as a function of S = 4Ns P = 2Np = S 1 e S P S neutral case S = 0: P = 1 deleterious S < 0: P < 1 advantageous S > 0: P > 1 Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
42 Neutral fixation probability in the presence of BGC CARTWRIGHT, LARTILLOT, AND THORNE STUDYING ANCESTRAL LINEAGES 23 FIGURES (a) Ancestral Lineage (b) Ancestral Lineages Relating Three Species FIGURE 1. GC overtransmission A T A T C G C G A T C G C G C G x GC = 1 + b 2 x AT = 1 b 2 Scaled conversion coefficient B = 4Nb mutation from AT to GC P = B 1 e B > 1 mutation from GC to AT P = B 1 e B < 1
43 A mechanistic phylogenetic covariance model substitution rate = mutation rate x fixation probability µ AC µ AG µ AT µ CA µ CG µ CT µ GA µ GC µ GT µ TA µ TC µ TG + B = µ AC B 1 e B µ AG B 1 e B µ AT µ CA B 1 e B µ CG µ CT B 1 e B µ GA B 1 e B µ GC µ GT B 1 e B µ TA µ TC B 1 e B µ TG B 1 e B B = 4N e b : scaled conversion coefficient
44 A mechanistic phylogenetic covariance model substitution rate = mutation rate x fixation probability µ AC µ AG µ AT µ CA µ CG µ CT µ GA µ GC µ GT µ TA µ TC µ TG + B = µ AC B 1 e B µ AG B 1 e B µ AT µ CA B 1 e B µ CG µ CT B 1 e B µ GA B 1 e B µ GC µ GT B 1 e B µ TA µ TC B 1 e B µ TG B 1 e B B = 4N e b : scaled conversion coefficient Predicted scaling of B = 4N e b with mass (M) and # chrom (2n) N e M γ M, (γ M < 0) b = b 0 r r 2n therefore, B M γ M 2n. ln B = γ M ln M + ln 2n + N(0, σ 2 t)
45 9#:;<'*2# &'()#*+,,# -./0&# M (kg) 2n sequence alignment (syn. positions) " = [ 3# 3] covariance matrix B 1 B 2 B 3 "/1# 12/3# 31# %=# 3=# $/# #222# #222# #222# /411# /$# #222# %# $# "# overall modeling strategy only 4-fold degenerate third codon positions modeling joint variation in B, body mass (M) and karyotype (2n) modeling among gene variation (recombination landscapes)
46 Results GC landscapes and biased genes conversion Allometric scaling of B = 4N e b B M γ M 2n. ln B = γ M ln M + γ n ln 2n + N(0, σ 2 t) predicted: γ M < 0 and γ n = 1. Estimated scaling coefficients γ M γ n 73 taxa 17 genes (-0.19, -0.03) 1.28 ( 0.54, 2.03) 33 taxa 115 genes (-0.52, -0.01) 0.21 (-1.20, 1.56) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
47 A history of biased gene conversion in placentals reconstruction of B = 4N e b Dasypus Choloepus Procavia Loxodonta Echinops Oryctolagus Ochotona Rattus Mus Dipodomys Cavia Spermophilus Otolemur Microcebus Pan Homo Gorilla Pongo B < 1: effectivelymacaca neutral Callithrix Tarsius Tupaia Sorex Erinaceus B > 1: selective regime B > 10: likely deleterious Canis Felis Equus Pteropus Myotis Tursiops Bos Sus Vicugna Dasypus Choloepus Procavia Loxodonta Echinops Oryctolagus Ochotona Rattus Mus Dipodomys Cavia Spermophilus Otolemur Microcebus Pan Homo Gorilla Pongo Macaca Callithrix Tarsius Tupaia Sorex Erinaceus Felis Canis Equus Pteropus Myotis Tursiops Bos Sus Vicugna BGC very weak (B 0.2) in anthropoids BGC above the nearly neutral threshold (B > 1) in many taxa in some lineages (microbats, insectivores) > 5% exons under B > 10
48 GC and genome rearrangements GC and genome rearrangements Biased clustered substitutions in the human genome atterns of substitution bias are nearly identical in human and chimp. (A) Unexpected biased clustered substitutions (UBCS) (fa Dreszer et al, 2007 Figure 3. Exceptions to genome-wide patterns of bias. (A) There is little evidence of UBCS on either human or chimp sex chromosomes. There is also poor agreement between the human and chimp smoothed UBCS curves ( = 0.25 on chrx). The pseudo-autosomal regions (PAR) are shown in green. (B) Human chromosome 2 shows an atypical central peak, which is likely due to the fusion of two ancestral chromosomes (red, human chr2; blue, chimp chr2a and chr2b). The two smoothed curves are in remarkable agreement ( = 0.84). The UBCS signal for hypothetical telomeres (if no fusion had occurred) was predicted (green) by using 16 Mb of distal sequence (yellow). chromosome fusion dated at 0.74 Myr in the human lineage. Dreszer et al, 2007 beit reduced relative to the autosomes) both the human and significant (P 0). This level of correlation is high, especially
49 Perspectives Perspectives on biased gene conversion Perspectives joint reconstruction of GC and genome rearrangements teasing out population size, recombination landscapes and BGC modeling overdispersion due to recombination hotspots turnover understanding the (mal)adaptive value of BGC population genetics models (modifier theory) is there a selective regulation (buffering) of BGC intensity? Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
50 Perspectives Conclusions integrative approach for correlating substitution patterns and quantitative traits can yield mechanistic insights about causes of molecular evolution potential source of information for reconstructing evolution of life-history, population size, karyotype, and genetic systems Perspectives further into mechanistic modeling (dn/ds, BGC) including data about body size of fossil taxa modeling bursts (punctuated equilibria) and trends (Cope s rule) including other diversification models modeling trait-dependent speciation and extinction modeling correlation with discrete characters Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
51 Perspectives Acknowledgments Raphael Poujol Nicole Uwimana Frédéric Delsuc Nicolas Rodrigue Hervé Philippe many others... Software availability (coevol) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
52 Perspectives Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
53 Perspectives dn/ds, Kr/Kc and the nearly neutral theory 2. Mitochondrial data: correlates of dn/ds ds dn/ds mat. mass long. ds dn/ds maturity mass fit unfit coding sequence 1/N 2 1/N 1 longevity positive correlation between dn/ds and body size compatible with a nearly-neutral interpretation via negative correlation body size population size (N) (Ohta, 1972, Kimura, 1979, Popadin, 2007) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
54 Radical-conservative amino-acid replacement model (adapted from Livington and Barton, 1993) ω = K r /K c Q ab = R ab if a b conservative, Q ab = R ab ω if a b radical. R ab : a general time reversible 20x20 process. conservative = conserving volume and/or polarity (and/or charge)
55 Perspectives dn/ds, Kr/Kc and the nearly neutral theory Mitochondrial data K r /K c (volume + polarity) Kc Kr/Kc mat. mass long. Kc Kr/Kc maturity mass red: blue: light shade: positive negative not significant longevity positive correlation between K r /K c and body size similar to that observed for dn/ds (but higher R 2 ) charge: no significant effect polarity + volume : strongest correlation (highest R 2 ) Nicolas Lartillot (Universite de Montréal) Integrative models of macroevolution February 28, / 47
56 Reconstructed variations of K r /K c Acinonyx jubatus Felis catus Herpestes javanicus Arctocephalus forsteri Eumetopias jubatus Odobenus rosmarus rosmarus Halichoerus grypus Phoca vitulina Ursus americanus Ursus arctos Ursus maritimus Canis familiaris Ceratotherium simum Rhinoceros unicornis Tapirus terrestris Equus asinus Equus caballus Manis tetradactyla Balaena mysticetus Eubalaena australis Eubalaena japonica Balaenoptera acutorostrata Balaenoptera bonaerensis Balaenoptera borealis Balaenoptera brydei Balaenoptera musculus Balaenoptera physalus Megaptera novaeangliae Eschrichtius robustus Caperea marginata Berardius bairdii Hyperoodon ampullatus Platanista minor Inia geoffrensis Pontoporia blainvillei Lagenorhynchus albirostris Monodon monoceros Phocoena phocoena Kogia breviceps Physeter catodon Hippopotamus amphibius Bos grunniens Bos indicus Bos taurus Bubalus bubalis Capra hircus Ovis aries Cervus nippon centralis Cervus nippon yesoensis Muntiacus crinifrons Muntiacus muntjak Muntiacus reevesi Lama pacos Sus scrofa Artibeus jamaicensis Mystacina tuberculata Chalinolobus tuberculatus Pipistrellus abramus Pteropus dasymallus Pteropus scapulatus Rhinolophus monoceros Rhinolophus pumilus Crocidura russula Episoriculus fumidus Sorex unguiculatus Mogera wogura Talpa europaea Urotrichus talpoides Echinosorex gymnura Erinaceus europaeus Hemiechinus auritus Cavia porcellus Thryonomys swinderianus Jaculus jaculus Mus musculus Mus musculus molossinus Rattus norvegicus Volemys kikuchii Nannospalax ehrenbergi Myoxus glis Sciurus vulgaris Lepus europaeus Oryctolagus cuniculus Ochotona collaris Ochotona princeps Cebus albifrons Cercopithecus aethiops Macaca mulatta Macaca sylvanus Papio hamadryas Colobus guereza Trachypithecus obscurus Gorilla gorilla Homo sapiens Pan paniscus Pan troglodytes Pongo pygmaeus Pongo pygmaeus abelii Hylobates lar Cynocephalus variegatus Tarsius bancanus Lemur catta Nycticebus coucang Tupaia belangeri Bradypus tridactylus Choloepus didactylus Tamandua tetradactyla Dasypus novemcinctus Chrysochloris asiatica Orycteropus afer Elephantulus sp Macroscelides proboscideus Echinops telfairi Dugong dugon Elephas maximus Loxodonta africana Procavia capensis "#$%&'#"( 4)#%55',"-*./"( )*"#+',"-*./"( 0%#'1*)#"( 23/.1'*.10/"( 6',)$+"( 7"8'9'#10"( 4#%9"*)5( :)$"#*0#"( ;<#'*0)#%"(
Bayesian models for molecular evolution Rates, rates and traits
Bayesian models for molecular evolution Rates, rates and traits Nicolas Lartillot March 20, 2012 Nicolas Lartillot (Universite de Montréal) BCM2004: Molecular evolution March 20, 2012 1 / 38 1 Molecular
More informationComparing Genomes! Homologies and Families! Sequence Alignments!
Comparing Genomes! Homologies and Families! Sequence Alignments! Allows us to achieve a greater understanding of vertebrate evolution! Tells us what is common and what is unique between different species
More informationMolecules, divergence times and the evolution of life-histories
Molecules, divergence times and the evolution of life-histories Nicolas Lartillot, Raphael Poujol, Frederic Delsuc September 2010 Nicolas Lartillot (Universite de Montréal) Life history evolution September
More informationRELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG
RELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG Department of Biology (Galton Laboratory), University College London, 4 Stephenson
More informationUsing phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)
Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures
More informationMammalogy: the study of the evolution, ecology, physiology, and anatomy of members of the Class Mammalia (Chordata, Vertebrata).
Mammalogy: the study of the evolution, ecology, physiology, and anatomy of members of the Class Mammalia (Chordata, Vertebrata). Mammalogy has been of practical interest to humans since our ancestors evolved
More informationInferring Species Phylogenies From Multiple Genes: Concatenated Sequence Tree Versus Consensus Gene Tree
JOURNAL OF EXPERIMENTAL ZOOLOGY (MOL DEV EVOL) 34B:64 74 (25) Inferring Species Phylogenies From Multiple Genes: Concatenated Sequence Tree Versus Consensus Gene Tree SUDHINDRA R. GADAGKAR, 1 * MICHAEL
More informationEVOLUTIONARY DISTANCES
EVOLUTIONARY DISTANCES FROM STRINGS TO TREES Luca Bortolussi 1 1 Dipartimento di Matematica ed Informatica Università degli studi di Trieste luca@dmi.units.it Trieste, 14 th November 2007 OUTLINE 1 STRINGS:
More informationTaming the Beast Workshop
Workshop and Chi Zhang June 28, 2016 1 / 19 Species tree Species tree the phylogeny representing the relationships among a group of species Figure adapted from [Rogers and Gibbs, 2014] Gene tree the phylogeny
More informationMolecules consolidate the placental mammal tree.
Molecules consolidate the placental mammal tree. The morphological concensus mammal tree Two decades of molecular phylogeny Rooting the placental mammal tree Parallel adaptative radiations among placental
More informationConcepts and Methods in Molecular Divergence Time Estimation
Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks
More informationEstimation of species divergence dates with a sloppy molecular clock
Estimation of species divergence dates with a sloppy molecular clock Ziheng Yang Department of Biology University College London Date estimation with a clock is easy. t 2 = 13my t 3 t 1 t 4 t 5 Node Distance
More informationInferring Speciation Times under an Episodic Molecular Clock
Syst. Biol. 56(3):453 466, 2007 Copyright c Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150701420643 Inferring Speciation Times under an Episodic Molecular
More informationGenome Visualization in Space
Genome Visualization in Space Leandro S. Marcolino 1, Brá ulio R. G. M. Couto 2 and Marcos A. dos Santos 1 Departamento de Ciência da Computação, Universidade Federal de Minas Gerais / UFMG 1 ; Programa
More informationOrthologous loci for phylogenomics from raw NGS data
Orthologous loci for phylogenomics from raw NS data Rachel Schwartz The Biodesign Institute Arizona State University Rachel.Schwartz@asu.edu May 2, 205 Big data for phylogenetics Phylogenomics requires
More informationCONSTRUCTION OF PHYLOGENETIC TREE FROM MULTIPLE GENE TREES USING PRINCIPAL COMPONENT ANALYSIS
INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) Proceedings of the International Conference on Emerging Trends in Engineering and Management (ICETEM14) ISSN 0976
More informationPhylogenetics: Bayesian Phylogenetic Analysis. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Bayesian Phylogenetic Analysis COMP 571 - Spring 2015 Luay Nakhleh, Rice University Bayes Rule P(X = x Y = y) = P(X = x, Y = y) P(Y = y) = P(X = x)p(y = y X = x) P x P(X = x 0 )P(Y = y X
More informationPhylogenetic Tree Reconstruction
I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven
More informationSupplementary Materials for
advances.sciencemag.org/cgi/content/full/3/7/e1602878/dc1 Supplementary Materials for Inactivation of thermogenic UCP1 as a historical contingency in multiple placental mammal clades Michael J. Gaudry,
More informationAmira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut
Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological
More informationMolecular evidence for multiple origins of Insectivora and for a new order of endemic African insectivore mammals
Project Report Molecular evidence for multiple origins of Insectivora and for a new order of endemic African insectivore mammals By Shubhra Gupta CBS 598 Phylogenetic Biology and Analysis Life Science
More informationUnderstanding relationship between homologous sequences
Molecular Evolution Molecular Evolution How and when were genes and proteins created? How old is a gene? How can we calculate the age of a gene? How did the gene evolve to the present form? What selective
More informationDr. Amira A. AL-Hosary
Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological
More informationApplication of new distance matrix to phylogenetic tree construction
Application of new distance matrix to phylogenetic tree construction P.V.Lakshmi Computer Science & Engg Dept GITAM Institute of Technology GITAM University Andhra Pradesh India Allam Appa Rao Jawaharlal
More informationMarkov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree
Markov chain Monte-Carlo to estimate speciation and extinction rates: making use of the forest hidden behind the (phylogenetic) tree Nicolas Salamin Department of Ecology and Evolution University of Lausanne
More informationReconstructing Mammalian Phylogenies: A Detailed Comparison of the Cytochrome b and Cytochrome Oxidase Subunit I Mitochondrial Genes
Reconstructing Mammalian Phylogenies: A Detailed Comparison of the Cytochrome b and Cytochrome Oxidase Subunit I Mitochondrial Genes Shanan S. Tobe 1 *, Andrew C. Kitchener 2,3, Adrian M. T. Linacre 4
More informationBio 1B Lecture Outline (please print and bring along) Fall, 2007
Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution
More informationMolecular Evolution & Phylogenetics
Molecular Evolution & Phylogenetics Heuristics based on tree alterations, maximum likelihood, Bayesian methods, statistical confidence measures Jean-Baka Domelevo Entfellner Learning Objectives know basic
More informationWHAT fraction of new mutations in the genome are
Copyright Ó 2011 by the Genetics Society of America DOI: 10.1534/genetics.110.124073 Inference of Mutation Parameters and Selective Constraint in Mammalian Coding Sequences by Approximate Bayesian Computation
More informationPhylogenetics. BIOL 7711 Computational Bioscience
Consortium for Comparative Genomics! University of Colorado School of Medicine Phylogenetics BIOL 7711 Computational Bioscience Biochemistry and Molecular Genetics Computational Bioscience Program Consortium
More informationReanalysis of Murphy et al. s Data Gives Various Mammalian Phylogenies and Suggests Overcredibility of Bayesian Trees
J Mol Evol (2003) 57:S290 S296 DOI: 10.1007/s00239-003-0039-7 Reanalysis of Murphy et al. s Data Gives Various Mammalian Phylogenies and Suggests Overcredibility of Bayesian Trees Kazuharu Misawa, Masatoshi
More informationRNA-based Phylogenetic Methods: Application to Mammalian Mitochondrial RNA Sequences.
RNA-based Phylogenetic Methods: Application to Mammalian Mitochondrial RNA Sequences. Cendrine Hudelot 1, Vivek Gowri-Shankar 2, Howsun Jow 2, Magnus Rattray 2 and Paul G. Higgs 3 1 School of Biological
More informationLetter to the Editor. Temperature Hypotheses. David P. Mindell, Alec Knight,? Christine Baer,$ and Christopher J. Huddlestons
Letter to the Editor Slow Rates of Molecular Evolution Temperature Hypotheses in Birds and the Metabolic Rate and Body David P. Mindell, Alec Knight,? Christine Baer,$ and Christopher J. Huddlestons *Department
More informationAutomated Removal of Noisy Data in Phylogenomic Analyses
J Mol Evol (2010) 71:319 331 DOI 10.7/s00239-010-9398-z Automated Removal of Noisy Data in Phylogenomic Analyses Vadim V. Goremykin Svetlana V. Nikiforova Olaf R. P. Bininda-Emonds Received: 14 April 2010
More informationCongruent Mammalian Trees from Mitochondrial and Nuclear Genes Using Bayesian Methods
Congruent Mammalian Trees from Mitochondrial and Nuclear Genes Using Bayesian Methods Aurelio Reyes,* Carmela Gissi, Francois Catzeflis,à Eviatar Nevo, Graziano Pesole, and Cecilia Saccone *Sezione di
More informationInference of mutation parameters and selective constraint in mammalian. coding sequences by approximate Bayesian computation
Genetics: Published Articles Ahead of Print, published on February 14, 2011 as 10.1534/genetics.110.124073 Inference of mutation parameters and selective constraint in mammalian coding sequences by approximate
More information(2010) : 5 (11). ISSN
Tobe, Shanan and Kitchener, Andrew and Linacre, A.M.T. (2010) Reconstructing mammalian phylogenies : a detailed comparison of the cytochrome b and cytochrome oxidase subunit I mitochondrial genes. PLoS
More informationFrom Individual-based Population Models to Lineage-based Models of Phylogenies
From Individual-based Population Models to Lineage-based Models of Phylogenies Amaury Lambert (joint works with G. Achaz, H.K. Alexander, R.S. Etienne, N. Lartillot, H. Morlon, T.L. Parsons, T. Stadler)
More informationMolecular Clocks. The Holy Grail. Rate Constancy? Protein Variability. Evidence for Rate Constancy in Hemoglobin. Given
Molecular Clocks Rose Hoberman The Holy Grail Fossil evidence is sparse and imprecise (or nonexistent) Predict divergence times by comparing molecular data Given a phylogenetic tree branch lengths (rt)
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods
More informationDetermining the Null Model for Detecting Adaptive Convergence from Genomic Data: A Case Study using Echolocating Mammals
Determining the Null Model for Detecting Adaptive Convergence from Genomic Data: A Case Study using Echolocating Mammals Gregg W.C. Thomas 1 and Matthew W. Hahn*,1,2 1 School of Informatics and Computing,
More informationEvidence of Evolution by Natural Selection. Dodo bird
Evidence of Evolution by Natural Selection Dodo bird 2007-2008 Evidence supporting evolution Fossil record transition species Anatomical record homologous & vestigial structures embryology & development
More informationDiscrete & continuous characters: The threshold model
Discrete & continuous characters: The threshold model Discrete & continuous characters: the threshold model So far we have discussed continuous & discrete character models separately for estimating ancestral
More informationMolecular Evolution & the Origin of Variation
Molecular Evolution & the Origin of Variation What Is Molecular Evolution? Molecular evolution differs from phenotypic evolution in that mutations and genetic drift are much more important determinants
More informationMolecular Evolution & the Origin of Variation
Molecular Evolution & the Origin of Variation What Is Molecular Evolution? Molecular evolution differs from phenotypic evolution in that mutations and genetic drift are much more important determinants
More informationRapid birth-and-death evolution of the xenobiotic metabolizing NAT gene family in vertebrates with evidence of adaptive selection
Sabbagh et al. BMC Evolutionary Biology 2013, 13:62 RESEARCH ARTICLE Open Access Rapid birth-and-death evolution of the xenobiotic metabolizing NAT gene family in vertebrates with evidence of adaptive
More informationEstimating Evolutionary Trees. Phylogenetic Methods
Estimating Evolutionary Trees v if the data are consistent with infinite sites then all methods should yield the same tree v it gets more complicated when there is homoplasy, i.e., parallel or convergent
More informationI. Short Answer Questions DO ALL QUESTIONS
EVOLUTION 313 FINAL EXAM Part 1 Saturday, 7 May 2005 page 1 I. Short Answer Questions DO ALL QUESTIONS SAQ #1. Please state and BRIEFLY explain the major objectives of this course in evolution. Recall
More informationLecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30
Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 27. Phylogeny methods, part 7 (Bootstraps, etc.) p.1/30 A non-phylogeny
More informationPhylogenetic Trees. Phylogenetic Trees Five. Phylogeny: Inference Tool. Phylogeny Terminology. Picture of Last Quagga. Importance of Phylogeny 5.
Five Sami Khuri Department of Computer Science San José State University San José, California, USA sami.khuri@sjsu.edu v Distance Methods v Character Methods v Molecular Clock v UPGMA v Maximum Parsimony
More informationIntroduction to Molecular Evolution (BIOL 3046) Course website:
Introduction to Molecular Evolution (BIOL 3046) Course website: www.biol3046.info What is evolution? Evolution: from Latin evolvere, to unfold broad sense, to change e.g., stellar evolution, cultural evolution,
More informationAnatomy of a species tree
Anatomy of a species tree T 1 Size of current and ancestral Populations (N) N Confidence in branches of species tree t/2n = 1 coalescent unit T 2 Branch lengths and divergence times of species & populations
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 2009 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More information9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)
I9 Introduction to Bioinformatics, 0 Phylogenetic ree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & omputing, IUB Evolution theory Speciation Evolution of new organisms is driven by
More informationChapter 26 Phylogeny and the Tree of Life
Chapter 26 Phylogeny and the Tree of Life Biologists estimate that there are about 5 to 100 million species of organisms living on Earth today. Evidence from morphological, biochemical, and gene sequence
More informationEvidence for Evolution by Natural Selection. Raven Chapters 1 & 22
Evidence for Evolution by Natural Selection Raven Chapters 1 & 22 2006-2007 Science happens within a culture What was the doctrine of the time? TINTORETTO The Creation of the Animals 1550 Then along comes
More informationDating r8s, multidistribute
Phylomethods Fall 2006 Dating r8s, multidistribute Jun G. Inoue Software of Dating Molecular Clock Relaxed Molecular Clock r8s multidistribute r8s Michael J. Sanderson UC Davis Estimation of rates and
More informationLikelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution
Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions
More informationBayesian Inference and MCMC
Bayesian Inference and MCMC Aryan Arbabi Partly based on MCMC slides from CSC412 Fall 2018 1 / 18 Bayesian Inference - Motivation Consider we have a data set D = {x 1,..., x n }. E.g each x i can be the
More informationPhylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26
Phylogeny Chapter 26 Taxonomy Taxonomy: ordered division of organisms into categories based on a set of characteristics used to assess similarities and differences Carolus Linnaeus developed binomial nomenclature,
More informationarxiv: v2 [q-bio.pe] 18 Oct 2013
arxiv:131.2968 The Fossilized Birth-Death Process: arxiv:131.2968v2 [q-bio.pe] 18 Oct 213 A Coherent Model of Fossil Calibration for Divergence Time Estimation Tracy A. Heath 1,2, John P. Huelsenbeck 1,3,
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 200 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationChromosomal rearrangements in mammalian genomes : characterising the breakpoints. Claire Lemaitre
PhD defense Chromosomal rearrangements in mammalian genomes : characterising the breakpoints Claire Lemaitre Laboratoire de Biométrie et Biologie Évolutive Université Claude Bernard Lyon 1 6 novembre 2008
More informationCHAPTERS 24-25: Evidence for Evolution and Phylogeny
CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology
More informationBiology Keystone (PA Core) Quiz Theory of Evolution - (BIO.B ) Theory Of Evolution, (BIO.B ) Scientific Terms
Biology Keystone (PA Core) Quiz Theory of Evolution - (BIO.B.3.2.1 ) Theory Of Evolution, (BIO.B.3.3.1 ) Scientific Terms Student Name: Teacher Name: Jared George Date: Score: 1) Evidence for evolution
More informationSeuqence Analysis '17--lecture 10. Trees types of trees Newick notation UPGMA Fitch Margoliash Distance vs Parsimony
Seuqence nalysis '17--lecture 10 Trees types of trees Newick notation UPGM Fitch Margoliash istance vs Parsimony Phyogenetic trees What is a phylogenetic tree? model of evolutionary relationships -- common
More informationEvaluating Purifying Selection in the Mitochondrial DNA of Various Mammalian Species
Evaluating Purifying Selection in the Mitochondrial DNA of Various Mammalian Species Pedro Soares 1 *., Diogo Abrantes 1., Teresa Rito 1, Noel Thomson 2, Predrag Radivojac 3, Biao Li 3, Vincent Macaulay
More informationOne of the striking features that has emerged from the study
Positive Darwinian selection drives the evolution of several female reproductive proteins in mammals Willie J. Swanson*, Ziheng Yang, Mariana F. Wolfner*, and Charles F. Aquadro* *Department of Molecular
More informationCHAPTER 26 PHYLOGENY AND THE TREE OF LIFE Connecting Classification to Phylogeny
CHAPTER 26 PHYLOGENY AND THE TREE OF LIFE Connecting Classification to Phylogeny To trace phylogeny or the evolutionary history of life, biologists use evidence from paleontology, molecular data, comparative
More informationSelection and Population Genetics
Selection and Population Genetics Evolution by natural selection can occur when three conditions are satisfied: Variation within populations - individuals have different traits (phenotypes). height and
More informationHow should we organize the diversity of animal life?
How should we organize the diversity of animal life? The difference between Taxonomy Linneaus, and Cladistics Darwin What are phylogenies? How do we read them? How do we estimate them? Classification (Taxonomy)
More informationPOPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics
POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the
More informationAn Introduction to Bayesian Inference of Phylogeny
n Introduction to Bayesian Inference of Phylogeny John P. Huelsenbeck, Bruce Rannala 2, and John P. Masly Department of Biology, University of Rochester, Rochester, NY 427, U.S.. 2 Department of Medical
More informationEstimating the Distribution of Selection Coefficients from Phylogenetic Data with Applications to Mitochondrial and Viral DNA
Estimating the Distribution of Selection Coefficients from Phylogenetic Data with Applications to Mitochondrial and Viral DNA Rasmus Nielsen* and Ziheng Yang *Department of Biometrics, Cornell University;
More informationStatistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution?
Statistical nonmolecular phylogenetics: can molecular phylogenies illuminate morphological evolution? 30 July 2011. Joe Felsenstein Workshop on Molecular Evolution, MBL, Woods Hole Statistical nonmolecular
More informationBayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies
Bayesian Inference using Markov Chain Monte Carlo in Phylogenetic Studies 1 What is phylogeny? Essay written for the course in Markov Chains 2004 Torbjörn Karfunkel Phylogeny is the evolutionary development
More informationGene expression differences in human and chimpanzee cerebral cortex
Evolution of the human genome by natural selection What you will learn in this lecture (1) What are the human genome and positive selection? (2) How do we analyze positive selection? (3) How is positive
More informationA (short) introduction to phylogenetics
A (short) introduction to phylogenetics Thibaut Jombart, Marie-Pauline Beugin MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis with PR Statistics, Millport Field
More informationIntegrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley
Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler Feb. 14, 2018. Phylogenetic trees VI: Dating in the 21st century: clocks, & calibrations;
More informationInferring evolutionary processes from phylogenies
Pergamon Zoologica Scripta, Vol. 26, No. 4, pp. 331-348, 1997 Elsevier Science Ltd 0 1998 The Norwegian Academy of Science and Letters All rights reserved. Printed in Great Britain PII: S0300-3256(98)00004-X
More informationExpanded View Figures
Marie-Thérèse El-aher et al non-polyq fragments and Huntington s disease The EMO Journal Expanded View Figures HTT-Q TEV rec + + + + αtub 15 5 37 sh-htt cells 167/586 167/586 TEV-Q -Q23 Particle volume
More informationReading for Lecture 13 Release v10
Reading for Lecture 13 Release v10 Christopher Lee November 15, 2011 Contents 1 Evolutionary Trees i 1.1 Evolution as a Markov Process...................................... ii 1.2 Rooted vs. Unrooted Trees........................................
More informationUncertainty in phylogenetic tree estimates
Uncertainty in phylogenetic tree estimates Amy Willis Department of Biostatistics, University of Washington and Rayna Bell Smithsonian Institution, National Museum of Natural History October 16, 2017 arxiv:1611.03456v2
More informationModeling continuous trait evolution in the absence of phylogenetic information
Modeling continuous trait evolution in the absence of phylogenetic information Mariia Koroliuk GU university, EM in complex systems under the supervision of Daniele Silvestro June 29, 2014 Contents 1 Introduction
More informationAn Evaluation of Different Partitioning Strategies for Bayesian Estimation of Species Divergence Times
Syst. Biol. 67(1):61 77, 2018 The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. This is an Open Access article distributed under the terms of
More informationSequence Divergence & The Molecular Clock. Sequence Divergence
Sequence Divergence & The Molecular Clock Sequence Divergence v simple genetic distance, d = the proportion of sites that differ between two aligned, homologous sequences v given a constant mutation/substitution
More informationPhylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center
Phylogenetic Analysis Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center Outline Basic Concepts Tree Construction Methods Distance-based methods
More informationEstimating the Rate of Evolution of the Rate of Molecular Evolution
Estimating the Rate of Evolution of the Rate of Molecular Evolution Jeffrey L. Thorne,* Hirohisa Kishino, and Ian S. Painter* *Program in Statistical Genetics, Statistics Department, North Carolina State
More informationThe Tree of Life. Phylogeny
The Tree of Life Phylogeny Phylogenetics Phylogenetic trees illustrate the evolutionary relationships among groups of organisms, or among a family of related nucleic acid or protein sequences Each branch
More informationPhylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?
Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species
More informationArrival and Diversification of Caviomorph Rodents and Platyrrhine Primates in South America
Syst. Biol. 55(2):228-244,2006 Copyright Society of Systematic Biologists ISSN: 1063-5157 print / 1076-836X online DOI: 10.1080/10635150500481390 Arrival and Diversification of Caviomorph Rodents and Platyrrhine
More informationAP Biology. Evolution is "so overwhelmingly established that it has become irrational to call it a theory." Evidence of Evolution by Natural Selection
Evidence of Evolution by Natural Selection Evolution is "so overwhelmingly established that it has become irrational to call it a theory." -- Ernst Mayr What Evolution Is 2001 Professor Emeritus, Evolutionary
More informationCh. 7 Evolution and the fossil record
Ch. 7 Evolution and the fossil record Evolution (popular definition) = descent with modification Evolution (technical definition) = change in gene frequencies or gene combinations in a series of populations,
More informationNature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1
Supplementary Figure 1 Zn 2+ -binding sites in USP18. (a) The two molecules of USP18 present in the asymmetric unit are shown. Chain A is shown in blue, chain B in green. Bound Zn 2+ ions are shown as
More informationLecture 11 Friday, October 21, 2011
Lecture 11 Friday, October 21, 2011 Phylogenetic tree (phylogeny) Darwin and classification: In the Origin, Darwin said that descent from a common ancestral species could explain why the Linnaean system
More informationpart 4: phenomenological load and biological inference. phenomenological load review types of models. Gαβ = 8π Tαβ. Newton.
2017-07-29 part 4: and biological inference review types of models phenomenological Newton F= Gm1m2 r2 mechanistic Einstein Gαβ = 8π Tαβ 1 molecular evolution is process and pattern process pattern MutSel
More informationMechanisms of Evolution Darwinian Evolution
Mechanisms of Evolution Darwinian Evolution Descent with modification by means of natural selection All life has descended from a common ancestor The mechanism of modification is natural selection Concept
More informationFitness landscapes and seascapes
Fitness landscapes and seascapes Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen: Cross-species analysis of bacterial promoters, Nonequilibrium evolution of
More informationHow should we go about modeling this? Model parameters? Time Substitution rate Can we observe time or subst. rate? What can we observe?
How should we go about modeling this? gorilla GAAGTCCTTGAGAAATAAACTGCACACACTGG orangutan GGACTCCTTGAGAAATAAACTGCACACACTGG Model parameters? Time Substitution rate Can we observe time or subst. rate? What
More information