Natural selection on the molecular level
|
|
- Heather York
- 5 years ago
- Views:
Transcription
1 Natural selection on the molecular level
2 Fundamentals of molecular evolution How DNA and protein sequences evolve?
3 Genetic variability in evolution } Mutations } forming novel alleles } Inversions } change gene order } may impede recombinatios and fix a haplotype } Duplications } may involve genes or gene fragments } or entire chromosomes and genomes } the main source of evolutionary innovations } Horizontal gene transfer } including symbiotic events 3
4 Substitutions } Why transitions are more frequent? } there are more possible transversions } observed ts/tv ratio from ~2 (ndna) to ~15 (human mtdna) } exception plant mtdna } the ts/tv ratio varies significantly between lineages 4
5 Transitions and transversions } Why are the transitions more frequent? } Selection hypothesis (transitions are more likely to be silent and neutral) } But: } transitions are more frequent in rrna genes, pseudogees and noncoding regions } transitions are more frequent in 4-fold degenerate positions (codons like CUN Leu) 5
6 Transitions and transversions } Why are the transitions more frequent? } Mechanistic explanation mutagenesis and repair mechanisms } Transitions are the result of: } tautomeric shift of bases } deamination (e.g. oxidative) } Transitions cause less distortion of the double helix structure } less likely to be detected and repaired by the MMR system 6
7 Modeling DNA evolution } Ancestral sequence usually not available } The number of mutations has to be inferred from differences between present sequences } Requires correcting for multiple hits, particularly for more distant sequences 7
8 Modeling sequence evolution } Markov models the state of generation n +1 depends only on the state of generation n and the rule set (character substitution probability matrix) } There are many models with varying complexity } Parameters may include: } multiple hits (Poisson correction) } different substitution probabilities for various mutations } different substitution probabilities for various ositions in sequence } nucleotide frequencies 8
9 DNA evolution -the simplest model (Jukes- Cantor) A C G T A 1-3α α α α C α 1-3α α α G α α 1-3α α T α α α 1-3α } } } } equal nucleotide frequencies (f=0.25) equal substitution probabilities corrected distance: D JC = 3 4 ln(1 4 3 D) Example: D OBS =0.43 => D JC =0.64 9
10 Other models } Kimura (K80, two-parameter) - different probabilities for transitions and transversions } Felsenstein (F81), Hasegawa-Kishino-Yano (HKY85) - different nucleotide frequencies (F81) + different probabilities for transitions and transversions (HKY85) } GTR (General Time Reversible, Tavare 86) 10
11 The GTR model } Different probabilities for each substitution (but symmetrical, e.g. A T = T A) - 6 parameters } Different nucleotide frequencies - 4 parameters 11
12 The gamma distribution } In simple models each position in sequence has the same substitution probability - unrealistic } Different classes of substitution probabilities are modeled using the gamma distribution 12
13 On the level of the genetic code } A mutation can: } change the codon to another codon encoding the same aa } synonymous } change the codon to a codon encoding a different aa } nonsynonymous } change the codon to a STOP codon } nonsens } cause a frameshift } change gene expression 13
14 Mutations and the natural selection } We do not observe mutations } we observe differences between populations (species) } or intra-population variation (polymorphism) } Alleles formed by mutation are subject to selection } Allele frequencies can be changed by genetic drift } We observe mutations that are fixed entirely or partially (polymorphisms) in the gene pool 14
15 The fundamental question of molecular evolution } What is the role of genetic drift and natural selection in shaping sequence variation? } intrapopulation (polymorphisms) } interspecific } The question concerns quantitative differences } There is no doubt that evolutionary adaptations are a result of natural selection } But, how many of the differences we observe are adaptive? } And which ones? 15
16 Selection or drift? } Selectionism } the majority of fixed mutations were positively selected } most polymorphisms are maintained by selection } balancing selection, overdominance, frequency-based selection } Neutralism (Kimura, 1968) } the majority of fixed mutations were fixed by genetic drift (by chance) } the majority of polymorphisms are the result of drift } positively selected mutations are rare and are not significant for the quantitative analysis of molecular variation 16
17 Mutations and natural selection } deleterious } s < 0 } eliminated by selection (purifying/negative) } neutral } s 0 (more precisely, s 1/4N) } can be fixed or lost by genetic drift } beneficial } s > 0 } fixed by positive selection (influenced by drift for small s) 17
18 Selectionism vs neutralism } Selectionism: } most mutations are deleterious } most fixed mutations are beneficial } neutral mutations are rare (not more frequent than beneficial) } Neutralism } most mutations are deleterious or neutral } most fixed mutations are neutral } beneficial mutations are rare (much less frequent than neutral), but still important for adaptation 18
19 Selectionism vs neutralism selectionism neutralism pan-neutralism Neutralism is not pan-neutralism, nobody denies the selective importance of mutations 19
20 Neutral theory - indications } The rate of substitution and the degree of polymorphism are too high to be explained by selection alone } Constant rate of molecular evolution (molecular clock) } Less important sequences (pseudogenes, less important fragments of proteins) change faster than key functional sequences 20
21 The constant rate of molecular evolution } Many sequences evolve at a constant rate } The rate varies for different sequences, but for the same sequence remains constant between lineages } Tzw. zegar molekularny Amino acid differences for vertebrate globins 21
22 Drift and the rate of evolution } Genetic drift is a random process, but its rate is equal over long time } Depends only on mutation rate (one fixed change per 1/µ generations) } For selection a constant rate only if the environment changes at a constant rate (unlikely) } The rate of adaptive changes is not constant 22
23 Molecular clock } A consequence of the neutral model } Sequences diverge at a constant rate for a given gene } different rates for different genes } Relative rate test K AC - K BC = 0 A B C } tests for constant rate between lineages, but not ove time 23
24 Molecular clock - the generation problem } In the neutral model the fixation rate is: } Should be constant per generation } Different organisms have different generation times } The rate of evolution should not be constant over time 1 2Nµ 2N = µ } But that s what was observed 24
25 The constant rate of molecular evolution } Tzw. zegar molekularny Amino acid differences for vertebrate globins 25
26 The generation problem } Generation time varies among different organisms ~0,03 generations/year ~3 generations/year } Why does the evolution rate remain constant? 26
27 The near-neutral model } The original neutral model concerns purey neutral changes (s = 0), which are rare } Mutations behave like neutral if: s 1 4N e } Mutations with a small selection coefficient s will behave like neutral mutations in small populations } in large populations they will be subject to selection 27
28 Near-neutral model } There is a negative correlation between generation time and population size 28
29 The near neutral model ~0,03 generations/year s 1 4N e ~3 generations/year Long generation time Less mutations per year Short generation time More mutations per year Small population (small N e ) Large population (large N e ) More mutations behave like neutral and are fixed by drift More mutations are subject to selection Generation time and population size cancel each other, resulting in a constant rate (Ohta & Kimura, 1971). 29
30 The molecular clock } Rate constant over time for proteins and nonsynonymous substitutions } In DNA, } for synonymous mutations } pseudogenes } some noncoding sequences } the rate depends on the generation time 30
31 Eveolutionary rate and function Less important sequences (pseudogenes, less important fragments of proteins) change faster than key functional sequences 31
32 Degeneracy (redundancy) in the code 32 4-fold degenerate 2-fold degenerate
33 Eveolutionary rate and function Less important sequences (pseudogenes, less important fragments of proteins) change faster than key functional sequences 33
34 Evolutionary rate unit: PAM/10 8 years Evolutionary unit time: how long (in millions of years, 10 6 ) does it take to fix 1 mutation/100 aa (1 PAM) 34
35 Evolutionary rate } Negative (purifying selection) is the main factor influencing the rate of evolution } in more important sequences a mutation is more likely to be deleterious (and eliminated by selection) } in more important sequences a mutation is more likely to be neutral (and may be fixed by drift) } mutations that don t influence function will be neutral } pseudogenes } noncoding regions? } synonymous substitutions? 35
36 Neutralism - the current status } A foundation that explains many observations } high DNA and protein polymorphism } } molecular clock } many deviations, there is no global clock, local clocks can be found slower evolution of more important sequences } It is a null hypothesis for testing for the positive selection at the molecular level 36
37 Neutralism - the current status } verified by sequencing, currently on the genomic scale } Kimura: 1968 before DNA sequencing was invented 37
38 Neutralism - the current status } Smith & Eyre-Walker % amino acid substitutions in Drosophila sp. fixed by selection } Andolfatto 2005 between D. melanogaster and D. simulans postive selection responsible for: } 20% DNA substitutions in introns and intragenic sequences } 60% DNA substitutions in UTR sequences 38
39 Neutralism - the current status } The main achievement of the neutral theory is the development of a mathematical frameworkto sudy the effects of selection and drift } Allowed to develop methods of testing for positive selection (using the neutral model as a null hypothesis) } A significant portion of the genome evolves according to the neutral model } a molecular clock can be found 39
40 40 Testing selection
41 Looking for traces of selection } Most sequences evolve in a constant, clocklike fashion } Deviation from the constant rate in a given lineage lineage-specific selection Constant rate Accelerated changes Slowed changes
42 Testing for selection } Assumption: synonymous mutations are neutral } For pairwise comparisons, calculate Ka (dn) number of nonsynonymous changes per number of nonsynonymous sites Ks (ds) number of synonymous changes per number of synonymous sites The rate Ka/Ks (ω) reflects selection ω=1 purely neutraln ω<1 negative (purifying) selection ω>1 positive selection
43 Testing for selection } The ω rate is rarely larger than 1 globally for the entire gene (exceptions, e.g. MHC genes) } Average ω between primates and rodents is 0.2, between human and chimp: 0.4 } Deviations from average ω for a given gene in a given lineage indicate selection } In a gene there can be sites with different ω, indicating selection acting on particular regions of the sequence
44 Testing for selection II } Comparing synonymous and nonsynonymous rates for intraspecific and interspecific comparisons
45 The McDonald-Kreitman test } Comparing synonymous and nonsynonymous rates for intraspecific and interspecific comparisons } In a neutral model both rates should be equal
46 Example: the ADH gene in 3 Drosophila species synonymous nonsynonymous rate intraspecific 42 2 ~0.05 interspecific 17 7 ~0.41 Conclusion: nonsynonymous changes favoured during speciation - not neutral
47 Comparing hypotheses } The likelihood criterion } PAML (Phylogenetic Analysis by Maximum Likelihood) author Ziheng Yang }
48 Likelihood } } } Likelihood = the probability of obtaining the observed data under a given model and its parameters: P(data model) Probability relates to future events in a known model - e.g. for a symmetrical coin toss, probability of heads = 1/2 (two heads 1/4 etc.) Likelihood explains observations (past) with an unknown model - e.g. observing 50% heads and 50% tails is most likely for a symmetrical coin toss
49 The likelihood ratio test } LRT (Likelihood Ratio Test) } Comparing nested models with different parameter count } For each model calculate likelihoods (L1, L2) LR = 2 [ln(l1) ln(l2)] } The significance of LR is determined by comparing to χ2 for df (degrees of freedom) equal to the difference in parameter count
50 Example #1 Model 0 ω the same for the whole tree: likelihood L0 Model 1 ω for lineage #1 different, than for the rest of the tree: likelihood L1 Is the rate of evolution different for lineage #1? LR = 2 [ln(l1) ln(l0)] Is LR larger than χ 2 for df=1?
51 Input files for PAML } Sequences: PHYLIP format (e.g. from Clustal) } Tree: Newick format, with optional branch labels } Control file: codeml.ctl
52 Parentheses (Newick) tree format (Macaca,Cerco,(Pongo,(Gorilla,(Homo,Chimp)))); (Macaca,Cerco,(Pongo,(Gorilla,(Homo#1,Chimp)))); #1 #1 #1 #1 #1 #1 (Macaca,Cerco,(Pongo,(Gorilla,(Homo,Chimp))$1)); #1 #1 #1 #2 #1 (Macaca,Cerco,(Pongo,(Gorilla,(Homo#2,Chimp))$1));
53 seqfile = treefile = outfile = noisy = 9 verbose = 2 runmode = 0 * 0,1,2,3,9: how much rubbish on the screen * 1: detailed output, 0: concise output * 0: user tree; 1: semi-automatic; 2: automatic * 3: StepwiseAddition; (4,5):PerturbationNNI seqtype = 1 * 1:codons; 2:AAs; 3:codons-->AAs CodonFreq = 2 * 0:1/61 each, 1:F1X4, 2:F3X4, 3:codon table clock = 0 * 0: no clock, unrooted tree, 1: clock, rooted tree model = 0 * models for codons: * 0:one, 1:b, 2:2 or more dn/ds ratios for branches NSsites = 0 icode = 0 * dn/ds among sites. 0:no variation, 1:neutral, 2:positive * 0:standard genetic code; 1:mammalian mt; 2-10:see below fix_kappa = 0 * 1: kappa fixed, 0: kappa to be estimated kappa = 2 * initial or fixed kappa fix_omega = 0 * 1: omega or omega_1 fixed, 0: estimate omega = 1 * initial or fixed omega, for codons or codon-transltd AAs fix_alpha = 1 * 0: estimate gamma shape parameter; 1: fix it at alpha alpha =.0 * initial or fixed alpha, 0:infinity (constant rate) Malpha = 0 * different alphas for genes ncatg = 4 * # of categories in the dg or AdG models of rates getse = 0 * 0: don't want them, 1: want S.E.s of estimates RateAncestor = 1 * (1/0): rates (alpha>0) or ancestral states (alpha=0) fix_blength = 1 * 0: ignore, -1: random, 1: initial, 2: fixed method = 0 * 0: simultaneous; 1: one branch at a time
54 Models } Branch models - is ω for a given branch (or a group of branches) different from the rest of the tree? } Site models - how many codon classes with respect to ω are there (is the selection similar for all the sites)? } Branch and site models - are there differently selected codon classes in a specific branch?
55 Brach models } Null hypothesis - ω is the same in all the branches } Test hypothesis - there are branches where ω is significantly different from the rest of the tree } The free model - a specific ω value for each branch } df = difference in parameter count (each specified branch adds 1)
56 Branch models seqfile = treefile = outfile = noisy = 9 verbose = 2 runmode = 0 * 0,1,2,3,9: how much rubbish on the screen * 1: detailed output, 0: concise output * 0: user tree; 1: semi-automatic; 2: automatic * 3: StepwiseAddition; (4,5):PerturbationNNI seqtype = 1 * 1:codons; 2:AAs; 3:codons-->AAs CodonFreq = 2 * 0:1/61 each, 1:F1X4, 2:F3X4, 3:codon table clock = 0 * 0: no clock, unrooted tree, 1: clock, rooted tree model = 0 * models for codons: * 0:one, 1:b, 2:2 or more dn/ds ratios for branches NSsites = 0 icode = 0 * dn/ds among sites. 0:no variation, 1:neutral, 2:positive * 0:standard genetic code; 1:mammalian mt; 2-10:see below fix_kappa = 0 * 1: kappa fixed, 0: kappa to be estimated kappa = 2 * initial or fixed kappa fix_omega = 0 * 1: omega or omega_1 fixed, 0: estimate omega = 1 * initial or fixed omega, for codons or codon-transltd AAs fix_alpha = 1 * 0: estimate gamma shape parameter; 1: fix it at alpha alpha =.0 * initial or fixed alpha, 0:infinity (constant rate) Malpha = 0 * different alphas for genes ncatg = 4 * # of categories in the dg or AdG models of rates getse = 0 * 0: don't want them, 1: want S.E.s of estimates RateAncestor = 1 * (1/0): rates (alpha>0) or ancestral states (alpha=0) fix_blength = 1 * 0: ignore, -1: random, 1: initial, 2: fixed method = 0 * 0: simultaneous; 1: one branch at a time
57 Site models } Assume constant ω in all branches } Null hypothesis (M1a): two classes of codons: } } neutral (ω=1) under purifying selection (ω<1) } Test hypothesis (M2a): three classes of codons: } } } neutral (ω=1) under purifying selection (ω<1) under positive selection (ω>1) } Also other, more complex models (see PAML documentation)
58 Site models seqfile = treefile = outfile = noisy = 9 verbose = 2 runmode = 0 * 0,1,2,3,9: how much rubbish on the screen * 1: detailed output, 0: concise output * 0: user tree; 1: semi-automatic; 2: automatic * 3: StepwiseAddition; (4,5):PerturbationNNI seqtype = 1 * 1:codons; 2:AAs; 3:codons-->AAs CodonFreq = 2 * 0:1/61 each, 1:F1X4, 2:F3X4, 3:codon table clock = 0 * 0: no clock, unrooted tree, 1: clock, rooted tree model = 0 * models for codons: * 0:one, 1:b, 2:2 or more dn/ds ratios for branches NSsites = 0 icode = 0 * dn/ds among sites. 0:no variation, 1:neutral, 2:positive * 0:standard genetic code; 1:mammalian mt; 2-10:see below fix_kappa = 0 * 1: kappa fixed, 0: kappa to be estimated kappa = 2 * initial or fixed kappa fix_omega = 0 * 1: omega or omega_1 fixed, 0: estimate omega = 1 * initial or fixed omega, for codons or codon-transltd AAs fix_alpha = 1 * 0: estimate gamma shape parameter; 1: fix it at alpha alpha =.0 * initial or fixed alpha, 0:infinity (constant rate) Malpha = 0 * different alphas for genes ncatg = 4 * # of categories in the dg or AdG models of rates getse = 0 * 0: don't want them, 1: want S.E.s of estimates RateAncestor = 1 * (1/0): rates (alpha>0) or ancestral states (alpha=0) fix_blength = 1 * 0: ignore, -1: random, 1: initial, 2: fixed method = 0 * 0: simultaneous; 1: one branch at a time
59 Branch and site models } In specified branches an additional class of codons with ω>1 } Three classes of codons: } } } neutral (ω=1) under purifying selection (ω<1) under positive selection (ω>1) } Null hypothesis: in the third class ω=1 (no positive selection)
60 Branch and site models seqfile = treefile = outfile = noisy = 9 verbose = 2 runmode = 0 * 0,1,2,3,9: how much rubbish on the screen * 1: detailed output, 0: concise output * 0: user tree; 1: semi-automatic; 2: automatic * 3: StepwiseAddition; (4,5):PerturbationNNI seqtype = 1 * 1:codons; 2:AAs; 3:codons-->AAs CodonFreq = 2 * 0:1/61 each, 1:F1X4, 2:F3X4, 3:codon table clock = 0 * 0: no clock, unrooted tree, 1: clock, rooted tree model = 2 * models for codons: * 0:one, 1:b, 2:2 or more dn/ds ratios for branches NSsites = 2 icode = 0 * dn/ds among sites. 0:no variation, 1:neutral, 2:positive * 0:standard genetic code; 1:mammalian mt; 2-10:see below fix_kappa = 0 * 1: kappa fixed, 0: kappa to be estimated kappa = 2 * initial or fixed kappa fix_omega = 0 * 1: omega or omega_1 fixed, 0: estimate omega = 1 * initial or fixed omega, for codons or codon-transltd AAs fix_alpha = 1 * 0: estimate gamma shape parameter; 1: fix it at alpha alpha =.0 * initial or fixed alpha, 0:infinity (constant rate) Malpha = 0 * different alphas for genes ncatg = 4 * # of categories in the dg or AdG models of rates getse = 0 * 0: don't want them, 1: want S.E.s of estimates RateAncestor = 1 * (1/0): rates (alpha>0) or ancestral states (alpha=0) fix_blength = 1 * 0: ignore, -1: random, 1: initial, 2: fixed method = 0 * 0: simultaneous; 1: one branch at a time
Understanding relationship between homologous sequences
Molecular Evolution Molecular Evolution How and when were genes and proteins created? How old is a gene? How can we calculate the age of a gene? How did the gene evolve to the present form? What selective
More information7. Tests for selection
Sequence analysis and genomics 7. Tests for selection Dr. Katja Nowick Group leader TFome and Transcriptome Evolution Bioinformatics group Paul-Flechsig-Institute for Brain Research www. nowicklab.info
More informationCodon-model based inference of selection pressure. (a very brief review prior to the PAML lab)
Codon-model based inference of selection pressure (a very brief review prior to the PAML lab) an index of selection pressure rate ratio mode example dn/ds < 1 purifying (negative) selection histones dn/ds
More informationProbabilistic modeling and molecular phylogeny
Probabilistic modeling and molecular phylogeny Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical University of Denmark (DTU) What is a model? Mathematical
More informationLecture Notes: BIOL2007 Molecular Evolution
Lecture Notes: BIOL2007 Molecular Evolution Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk) Introduction By now we all are familiar and understand, or think we understand, how evolution works on traits
More informationStatistics for Biology and Health. Series Editors K. Dietz, M. Gail, K. Krickeberg, A. Tsiatis, J. Samet
Statistics for Biology and Health Series Editors K. Dietz, M. Gail, K. Krickeberg, A. Tsiatis, J. Samet Rasmus Nielsen Editor Statistical Methods in Molecular Evolution 5 Maximum Likelihood Methods for
More informationSEQUENCE DIVERGENCE,FUNCTIONAL CONSTRAINT, AND SELECTION IN PROTEIN EVOLUTION
Annu. Rev. Genomics Hum. Genet. 2003. 4:213 35 doi: 10.1146/annurev.genom.4.020303.162528 Copyright c 2003 by Annual Reviews. All rights reserved First published online as a Review in Advance on June 4,
More informationHow Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building
How Molecules Evolve Guest Lecture: Principles and Methods of Systematic Biology 11 November 2013 Chris Simon Approaching phylogenetics from the point of view of the data Understanding how sequences evolve
More information"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky
MOLECULAR PHYLOGENY "Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky EVOLUTION - theory that groups of organisms change over time so that descendeants differ structurally
More informationDrosophila melanogaster and D. simulans, two fruit fly species that are nearly
Comparative Genomics: Human versus chimpanzee 1. Introduction The chimpanzee is the closest living relative to humans. The two species are nearly identical in DNA sequence (>98% identity), yet vastly different
More informationBio 1B Lecture Outline (please print and bring along) Fall, 2007
Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution
More informationFebuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution. Classical vs. balanced views of genome structure
Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution Classical vs. balanced views of genome structure - the proposal of the neutral theory by Kimura in 1968 led to the so-called neutralist-selectionist
More informationC3020 Molecular Evolution. Exercises #3: Phylogenetics
C3020 Molecular Evolution Exercises #3: Phylogenetics Consider the following sequences for five taxa 1-5 and the known outgroup O, which has the ancestral states (note that sequence 3 has changed from
More informationFitness landscapes and seascapes
Fitness landscapes and seascapes Michael Lässig Institute for Theoretical Physics University of Cologne Thanks Ville Mustonen: Cross-species analysis of bacterial promoters, Nonequilibrium evolution of
More informationLecture 7 Mutation and genetic variation
Lecture 7 Mutation and genetic variation Thymidine dimer Natural selection at a single locus 2. Purifying selection a form of selection acting to eliminate harmful (deleterious) alleles from natural populations.
More informationMassachusetts Institute of Technology Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution
Massachusetts Institute of Technology 6.877 Computational Evolutionary Biology, Fall, 2005 Notes for November 7: Molecular evolution 1. Rates of amino acid replacement The initial motivation for the neutral
More informationMaximum Likelihood Until recently the newest method. Popularized by Joseph Felsenstein, Seattle, Washington.
Maximum Likelihood This presentation is based almost entirely on Peter G. Fosters - "The Idiot s Guide to the Zen of Likelihood in a Nutshell in Seven Days for Dummies, Unleashed. http://www.bioinf.org/molsys/data/idiots.pdf
More informationMaximum Likelihood Tree Estimation. Carrie Tribble IB Feb 2018
Maximum Likelihood Tree Estimation Carrie Tribble IB 200 9 Feb 2018 Outline 1. Tree building process under maximum likelihood 2. Key differences between maximum likelihood and parsimony 3. Some fancy extras
More informationMolecular Evolution & the Origin of Variation
Molecular Evolution & the Origin of Variation What Is Molecular Evolution? Molecular evolution differs from phenotypic evolution in that mutations and genetic drift are much more important determinants
More informationMolecular Evolution & the Origin of Variation
Molecular Evolution & the Origin of Variation What Is Molecular Evolution? Molecular evolution differs from phenotypic evolution in that mutations and genetic drift are much more important determinants
More informationFUNDAMENTALS OF MOLECULAR EVOLUTION
FUNDAMENTALS OF MOLECULAR EVOLUTION Second Edition Dan Graur TELAVIV UNIVERSITY Wen-Hsiung Li UNIVERSITY OF CHICAGO SINAUER ASSOCIATES, INC., Publishers Sunderland, Massachusetts Contents Preface xiii
More informationLecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) p.1/26
Lecture 27. Phylogeny methods, part 4 (Models of DNA and protein change) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 27. Phylogeny methods, part 4 (Models of DNA and
More informationSequence Divergence & The Molecular Clock. Sequence Divergence
Sequence Divergence & The Molecular Clock Sequence Divergence v simple genetic distance, d = the proportion of sites that differ between two aligned, homologous sequences v given a constant mutation/substitution
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: istance-based methods Ultrametric Additive: UPGMA Transformed istance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood
More informationAmira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut
Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological
More informationMaximum Likelihood in Phylogenetics
Maximum Likelihood in Phylogenetics June 1, 2009 Smithsonian Workshop on Molecular Evolution Paul O. Lewis Department of Ecology & Evolutionary Biology University of Connecticut, Storrs, CT Copyright 2009
More informationQ1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate.
OEB 242 Exam Practice Problems Answer Key Q1) Explain how background selection and genetic hitchhiking could explain the positive correlation between genetic diversity and recombination rate. First, recall
More informationConstructing Evolutionary/Phylogenetic Trees
Constructing Evolutionary/Phylogenetic Trees 2 broad categories: Distance-based methods Ultrametric Additive: UPGMA Transformed Distance Neighbor-Joining Character-based Maximum Parsimony Maximum Likelihood
More informationEVOLUTIONARY DISTANCES
EVOLUTIONARY DISTANCES FROM STRINGS TO TREES Luca Bortolussi 1 1 Dipartimento di Matematica ed Informatica Università degli studi di Trieste luca@dmi.units.it Trieste, 14 th November 2007 OUTLINE 1 STRINGS:
More informationLecture 4. Models of DNA and protein change. Likelihood methods
Lecture 4. Models of DNA and protein change. Likelihood methods Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 4. Models of DNA and protein change. Likelihood methods p.1/36
More informationPhylogenetic Tree Reconstruction
I519 Introduction to Bioinformatics, 2011 Phylogenetic Tree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Evolution theory Speciation Evolution of new organisms is driven
More informationLikelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution
Likelihood Ratio Tests for Detecting Positive Selection and Application to Primate Lysozyme Evolution Ziheng Yang Department of Biology, University College, London An excess of nonsynonymous substitutions
More informationDr. Amira A. AL-Hosary
Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological
More informationEstimating Phylogenies (Evolutionary Trees) II. Biol4230 Thurs, March 2, 2017 Bill Pearson Jordan 6-057
Estimating Phylogenies (Evolutionary Trees) II Biol4230 Thurs, March 2, 2017 Bill Pearson wrp@virginia.edu 4-2818 Jordan 6-057 Tree estimation strategies: Parsimony?no model, simply count minimum number
More informationCHAPTERS 24-25: Evidence for Evolution and Phylogeny
CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology
More informationBustamante et al., Supplementary Nature Manuscript # 1 out of 9 Information #
Bustamante et al., Supplementary Nature Manuscript # 1 out of 9 Details of PRF Methodology In the Poisson Random Field PRF) model, it is assumed that non-synonymous mutations at a given gene are either
More informationPOPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics
POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics - in deriving a phylogeny our goal is simply to reconstruct the historical relationships between a group of taxa. - before we review the
More informationProcesses of Evolution
15 Processes of Evolution Forces of Evolution Concept 15.4 Selection Can Be Stabilizing, Directional, or Disruptive Natural selection can act on quantitative traits in three ways: Stabilizing selection
More informationLecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium. November 12, 2012
Lecture 22: Signatures of Selection and Introduction to Linkage Disequilibrium November 12, 2012 Last Time Sequence data and quantification of variation Infinite sites model Nucleotide diversity (π) Sequence-based
More informationEvolutionary Analysis of Viral Genomes
University of Oxford, Department of Zoology Evolutionary Biology Group Department of Zoology University of Oxford South Parks Road Oxford OX1 3PS, U.K. Fax: +44 1865 271249 Evolutionary Analysis of Viral
More informationKaKs Calculator: Calculating Ka and Ks Through Model Selection and Model Averaging
Method KaKs Calculator: Calculating Ka and Ks Through Model Selection and Model Averaging Zhang Zhang 1,2,3#, Jun Li 2#, Xiao-Qian Zhao 2,3, Jun Wang 1,2,4, Gane Ka-Shu Wong 2,4,5, and Jun Yu 1,2,4 * 1
More informationNeutral Theory of Molecular Evolution
Neutral Theory of Molecular Evolution Kimura Nature (968) 7:64-66 King and Jukes Science (969) 64:788-798 (Non-Darwinian Evolution) Neutral Theory of Molecular Evolution Describes the source of variation
More informationOutline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/8/16
Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection
More informationPOPULATION GENETICS Biology 107/207L Winter 2005 Lab 5. Testing for positive Darwinian selection
POPULATION GENETICS Biology 107/207L Winter 2005 Lab 5. Testing for positive Darwinian selection A growing number of statistical approaches have been developed to detect natural selection at the DNA sequence
More informationC.DARWIN ( )
C.DARWIN (1809-1882) LAMARCK Each evolutionary lineage has evolved, transforming itself, from a ancestor appeared by spontaneous generation DARWIN All organisms are historically interconnected. Their relationships
More information8/23/2014. Phylogeny and the Tree of Life
Phylogeny and the Tree of Life Chapter 26 Objectives Explain the following characteristics of the Linnaean system of classification: a. binomial nomenclature b. hierarchical classification List the major
More informationSequence Analysis 17: lecture 5. Substitution matrices Multiple sequence alignment
Sequence Analysis 17: lecture 5 Substitution matrices Multiple sequence alignment Substitution matrices Used to score aligned positions, usually of amino acids. Expressed as the log-likelihood ratio of
More informationHow should we go about modeling this? Model parameters? Time Substitution rate Can we observe time or subst. rate? What can we observe?
How should we go about modeling this? gorilla GAAGTCCTTGAGAAATAAACTGCACACACTGG orangutan GGACTCCTTGAGAAATAAACTGCACACACTGG Model parameters? Time Substitution rate Can we observe time or subst. rate? What
More informationPhylogenetics: Building Phylogenetic Trees
1 Phylogenetics: Building Phylogenetic Trees COMP 571 Luay Nakhleh, Rice University 2 Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary model should
More informationLecture 24. Phylogeny methods, part 4 (Models of DNA and protein change) p.1/22
Lecture 24. Phylogeny methods, part 4 (Models of DNA and protein change) Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 24. Phylogeny methods, part 4 (Models of DNA and
More informationGroup activities: Making animal model of human behaviors e.g. Wine preference model in mice
Lecture schedule 3/30 Natural selection of genes and behaviors 4/01 Mouse genetic approaches to behavior 4/06 Gene-knockout and Transgenic technology 4/08 Experimental methods for measuring behaviors 4/13
More informationThe Causes and Consequences of Variation in. Evolutionary Processes Acting on DNA Sequences
The Causes and Consequences of Variation in Evolutionary Processes Acting on DNA Sequences This dissertation is submitted for the degree of Doctor of Philosophy at the University of Cambridge Lee Nathan
More informationThe Phylogenetic Handbook
The Phylogenetic Handbook A Practical Approach to DNA and Protein Phylogeny Edited by Marco Salemi University of California, Irvine and Katholieke Universiteit Leuven, Belgium and Anne-Mieke Vandamme Rega
More informationMaximum Likelihood in Phylogenetics
Maximum Likelihood in Phylogenetics 26 January 2011 Workshop on Molecular Evolution Český Krumlov, Česká republika Paul O. Lewis Department of Ecology & Evolutionary Biology University of Connecticut,
More informationPhylogenetics: Building Phylogenetic Trees. COMP Fall 2010 Luay Nakhleh, Rice University
Phylogenetics: Building Phylogenetic Trees COMP 571 - Fall 2010 Luay Nakhleh, Rice University Four Questions Need to be Answered What data should we use? Which method should we use? Which evolutionary
More informationPhylogenetics: Distance Methods. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Distance Methods COMP 571 - Spring 2015 Luay Nakhleh, Rice University Outline Evolutionary models and distance corrections Distance-based methods Evolutionary Models and Distance Correction
More informationAccuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution
Accuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution Maria Anisimova, Joseph P. Bielawski, and Ziheng Yang Department of Biology, Galton Laboratory, University College
More information7.36/7.91 recitation CB Lecture #4
7.36/7.91 recitation 2-19-2014 CB Lecture #4 1 Announcements / Reminders Homework: - PS#1 due Feb. 20th at noon. - Late policy: ½ credit if received within 24 hrs of due date, otherwise no credit - Answer
More informationLecture 4: Evolutionary Models and Substitution Matrices (PAM and BLOSUM)
Bioinformatics II Probability and Statistics Universität Zürich and ETH Zürich Spring Semester 2009 Lecture 4: Evolutionary Models and Substitution Matrices (PAM and BLOSUM) Dr Fraser Daly adapted from
More informationEdward Susko Department of Mathematics and Statistics, Dalhousie University. Introduction. Installation
1 dist est: Estimation of Rates-Across-Sites Distributions in Phylogenetic Subsititution Models Version 1.0 Edward Susko Department of Mathematics and Statistics, Dalhousie University Introduction The
More informationLecture 4. Models of DNA and protein change. Likelihood methods
Lecture 4. Models of DNA and protein change. Likelihood methods Joe Felsenstein Department of Genome Sciences and Department of Biology Lecture 4. Models of DNA and protein change. Likelihood methods p.1/39
More informationLecture Notes: Markov chains
Computational Genomics and Molecular Biology, Fall 5 Lecture Notes: Markov chains Dannie Durand At the beginning of the semester, we introduced two simple scoring functions for pairwise alignments: a similarity
More informationMolecular Evolution, course # Final Exam, May 3, 2006
Molecular Evolution, course #27615 Final Exam, May 3, 2006 This exam includes a total of 12 problems on 7 pages (including this cover page). The maximum number of points obtainable is 150, and at least
More informationT R K V CCU CG A AAA GUC T R K V CCU CGG AAA GUC. T Q K V CCU C AG AAA GUC (Amino-acid
Lecture 11 Increasing Model Complexity I. Introduction. At this point, we ve increased the complexity of models of substitution considerably, but we re still left with the assumption that rates are uniform
More informationClassical Selection, Balancing Selection, and Neutral Mutations
Classical Selection, Balancing Selection, and Neutral Mutations Classical Selection Perspective of the Fate of Mutations All mutations are EITHER beneficial or deleterious o Beneficial mutations are selected
More informationRecombina*on and Linkage Disequilibrium (LD)
Recombina*on and Linkage Disequilibrium (LD) A B a b r = recombina*on frac*on probability of an odd Number of crossovers occur Between our markers 0
More informationProcesses of Evolution
15 Processes of Evolution Chapter 15 Processes of Evolution Key Concepts 15.1 Evolution Is Both Factual and the Basis of Broader Theory 15.2 Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom
More information- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.
February 8, 2005 Bio 107/207 Winter 2005 Lecture 11 Mutation and transposable elements - the term mutation has an interesting history. - as far back as the 17th century, it was used to describe any drastic
More informationMATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME
MATHEMATICAL MODELING AND THE HUMAN GENOME Hilary S. Booth Australian National University, Australia Keywords: Human genome, DNA, bioinformatics, sequence analysis, evolution. Contents 1. Introduction:
More informationGene Families part 2. Review: Gene Families /727 Lecture 8. Protein family. (Multi)gene family
Review: Gene Families Gene Families part 2 03 327/727 Lecture 8 What is a Case study: ian globin genes Gene trees and how they differ from species trees Homology, orthology, and paralogy Last tuesday 1
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 2009 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationIntegrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley
Integrative Biology 200 "PRINCIPLES OF PHYLOGENETICS" Spring 2018 University of California, Berkeley B.D. Mishler Feb. 14, 2018. Phylogenetic trees VI: Dating in the 21st century: clocks, & calibrations;
More informationChapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships
Chapter 26: Phylogeny and the Tree of Life You Must Know The taxonomic categories and how they indicate relatedness. How systematics is used to develop phylogenetic trees. How to construct a phylogenetic
More informationEstimating the Distribution of Selection Coefficients from Phylogenetic Data with Applications to Mitochondrial and Viral DNA
Estimating the Distribution of Selection Coefficients from Phylogenetic Data with Applications to Mitochondrial and Viral DNA Rasmus Nielsen* and Ziheng Yang *Department of Biometrics, Cornell University;
More informationCladistics and Bioinformatics Questions 2013
AP Biology Name Cladistics and Bioinformatics Questions 2013 1. The following table shows the percentage similarity in sequences of nucleotides from a homologous gene derived from five different species
More informationPhylogeny Estimation and Hypothesis Testing using Maximum Likelihood
Phylogeny Estimation and Hypothesis Testing using Maximum Likelihood For: Prof. Partensky Group: Jimin zhu Rama Sharma Sravanthi Polsani Xin Gong Shlomit klopman April. 7. 2003 Table of Contents Introduction...3
More informationGENETICS - CLUTCH CH.22 EVOLUTIONARY GENETICS.
!! www.clutchprep.com CONCEPT: OVERVIEW OF EVOLUTION Evolution is a process through which variation in individuals makes it more likely for them to survive and reproduce There are principles to the theory
More informationUoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)
- Phylogeny? - Systematics? The Phylogenetic Systematics (Phylogeny and Systematics) - Phylogenetic systematics? Connection between phylogeny and classification. - Phylogenetic systematics informs the
More informationMolecular phylogeny How to infer phylogenetic trees using molecular sequences
Molecular phylogeny How to infer phylogenetic trees using molecular sequences ore Samuelsson Nov 200 Applications of phylogenetic methods Reconstruction of evolutionary history / Resolving taxonomy issues
More informationPreliminaries. Download PAUP* from: Tuesday, July 19, 16
Preliminaries Download PAUP* from: http://people.sc.fsu.edu/~dswofford/paup_test 1 A model of the Boston T System 1 Idea from Paul Lewis A simpler model? 2 Why do models matter? Model-based methods including
More informationOutline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18
Genome Evolution Outline 1. What: Patterns of Genome Evolution Carol Eunmi Lee Evolution 410 University of Wisconsin 2. Why? Evolution of Genome Complexity and the interaction between Natural Selection
More informationPhylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?
Phylogeny and systematics Why are these disciplines important in evolutionary biology and how are they related to each other? Phylogeny and systematics Phylogeny: the evolutionary history of a species
More informationPhylogenetics. BIOL 7711 Computational Bioscience
Consortium for Comparative Genomics! University of Colorado School of Medicine Phylogenetics BIOL 7711 Computational Bioscience Biochemistry and Molecular Genetics Computational Bioscience Program Consortium
More informationLab 9: Maximum Likelihood and Modeltest
Integrative Biology 200A University of California, Berkeley "PRINCIPLES OF PHYLOGENETICS" Spring 2010 Updated by Nick Matzke Lab 9: Maximum Likelihood and Modeltest In this lab we re going to use PAUP*
More informationRELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG
RELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG Department of Biology (Galton Laboratory), University College London, 4 Stephenson
More informationAdditive distances. w(e), where P ij is the path in T from i to j. Then the matrix [D ij ] is said to be additive.
Additive distances Let T be a tree on leaf set S and let w : E R + be an edge-weighting of T, and assume T has no nodes of degree two. Let D ij = e P ij w(e), where P ij is the path in T from i to j. Then
More informationMolecular Clocks. The Holy Grail. Rate Constancy? Protein Variability. Evidence for Rate Constancy in Hemoglobin. Given
Molecular Clocks Rose Hoberman The Holy Grail Fossil evidence is sparse and imprecise (or nonexistent) Predict divergence times by comparing molecular data Given a phylogenetic tree branch lengths (rt)
More informationREVIEWS. The evolution of gene duplications: classifying and distinguishing between models
The evolution of gene duplications: classifying and distinguishing between models Hideki Innan* and Fyodor Kondrashov bstract Gene duplications and their subsequent divergence play an important part in
More information9/30/11. Evolution theory. Phylogenetic Tree Reconstruction. Phylogenetic trees (binary trees) Phylogeny (phylogenetic tree)
I9 Introduction to Bioinformatics, 0 Phylogenetic ree Reconstruction Yuzhen Ye (yye@indiana.edu) School of Informatics & omputing, IUB Evolution theory Speciation Evolution of new organisms is driven by
More informationConcepts and Methods in Molecular Divergence Time Estimation
Concepts and Methods in Molecular Divergence Time Estimation 26 November 2012 Prashant P. Sharma American Museum of Natural History Overview 1. Why do we date trees? 2. The molecular clock 3. Local clocks
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationInferring Molecular Phylogeny
Dr. Walter Salzburger he tree of life, ustav Klimt (1907) Inferring Molecular Phylogeny Inferring Molecular Phylogeny 55 Maximum Parsimony (MP): objections long branches I!! B D long branch attraction
More informationSome of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks!
Some of these slides have been borrowed from Dr. Paul Lewis, Dr. Joe Felsenstein. Thanks! Paul has many great tools for teaching phylogenetics at his web site: http://hydrodictyon.eeb.uconn.edu/people/plewis
More informationD. Incorrect! That is what a phylogenetic tree intends to depict.
Genetics - Problem Drill 24: Evolutionary Genetics No. 1 of 10 1. A phylogenetic tree gives all of the following information except for. (A) DNA sequence homology among species. (B) Protein sequence similarity
More informationThe abundance of deleterious polymorphisms in humans
Genetics: Published Articles Ahead of Print, published on February 23, 2012 as 10.1534/genetics.111.137893 Note February 3, 2011 The abundance of deleterious polymorphisms in humans Sankar Subramanian
More informationPhylogenetic inference
Phylogenetic inference Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, March 7 th 016 After this lecture, you can discuss (dis-) advantages of different information types
More informationQuantifying sequence similarity
Quantifying sequence similarity Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, February 16 th 2016 After this lecture, you can define homology, similarity, and identity
More informationMaximum Likelihood Estimation on Large Phylogenies and Analysis of Adaptive Evolution in Human Influenza Virus A
J Mol Evol (2000) 51:423 432 DOI: 10.1007/s002390010105 Springer-Verlag New York Inc. 2000 Maximum Likelihood Estimation on Large Phylogenies and Analysis of Adaptive Evolution in Human Influenza Virus
More informationEarly History up to Schedule. Proteins DNA & RNA Schwann and Schleiden Cell Theory Charles Darwin publishes Origin of Species
Schedule Bioinformatics and Computational Biology: History and Biological Background (JH) 0.0 he Parsimony criterion GKN.0 Stochastic Models of Sequence Evolution GKN 7.0 he Likelihood criterion GKN 0.0
More informationSubstitution = Mutation followed. by Fixation. Common Ancestor ACGATC 1:A G 2:C A GAGATC 3:G A 6:C T 5:T C 4:A C GAAATT 1:G A
GAGATC 3:G A 6:C T Common Ancestor ACGATC 1:A G 2:C A Substitution = Mutation followed 5:T C by Fixation GAAATT 4:A C 1:G A AAAATT GAAATT GAGCTC ACGACC Chimp Human Gorilla Gibbon AAAATT GAAATT GAGCTC ACGACC
More information