Protein Mistranslation is Unlikely to Ease a Population s Transit across a Fitness Valley. Matt Weisberg May, 2012

Similar documents
Repeated Occurrences of the Baldwin Effect Can Guide Evolution on Rugged Fitness Landscapes

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

COMP598: Advanced Computational Biology Methods and Research

There are 3 parts to this exam. Use your time efficiently and be sure to put your name on the top of each page.

Microevolution 2 mutation & migration

DARWIN: WHICH MATHEMATICS?

Exercise 3 Exploring Fitness and Population Change under Selection

Objective 3.01 (DNA, RNA and Protein Synthesis)

The dynamics of complex adaptation

Population Genetics: a tutorial

Processes of Evolution

Evolutionary Theory. Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Molecular Evolution & the Origin of Variation

Molecular Evolution & the Origin of Variation

The Evolution of Gene Dominance through the. Baldwin Effect

Microevolution Changing Allele Frequencies

The genome encodes biology as patterns or motifs. We search the genome for biologically important patterns.

UNIT 5. Protein Synthesis 11/22/16

Biological Evolution

arxiv: v1 [q-bio.qm] 7 Aug 2017

Big Idea 3: Living systems store, retrieve, transmit, and respond to information essential to life processes.

Genetical theory of natural selection

HEREDITY AND VARIATION

Lecture Notes: BIOL2007 Molecular Evolution

Genetics and Natural Selection

D. Incorrect! That is what a phylogenetic tree intends to depict.

Science Unit Learning Summary

The Derivative of a Function

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Chapter 16. Table of Contents. Section 1 Genetic Equilibrium. Section 2 Disruption of Genetic Equilibrium. Section 3 Formation of Species

Curriculum Links. AQA GCE Biology. AS level

Complexity in social dynamics : from the. micro to the macro. Lecture 4. Franco Bagnoli. Lecture 4. Namur 7-18/4/2008

CHAPTER 23 THE EVOLUTIONS OF POPULATIONS. Section C: Genetic Variation, the Substrate for Natural Selection

Lab I: Three-Point Mapping in Drosophila melanogaster

THE EVOLUTION OF DUPLICATED GENES CONSIDERING PROTEIN STABILITY CONSTRAINTS

mrna Codon Table Mutant Dinosaur Name: Period:

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

How robust are the predictions of the W-F Model?

Reproduction- passing genetic information to the next generation

Tigard-Tualatin School District Science Grade Level Priority Standards

arxiv: v2 [q-bio.pe] 13 Jul 2011

BIOLOGY 432 Midterm I - 30 April PART I. Multiple choice questions (3 points each, 42 points total). Single best answer.

Why transcription factor binding sites are ten nucleotides long

Classical Selection, Balancing Selection, and Neutral Mutations

Febuary 1 st, 2010 Bioe 109 Winter 2010 Lecture 11 Molecular evolution. Classical vs. balanced views of genome structure

Introduction to Digital Evolution Handout Answers

BIOLOGY STANDARDS BASED RUBRIC

1.A- Natural Selection

4. Identify one bird that would most likely compete for food with the large tree finch. Support your answer. [1]

Why EvoSysBio? Combine the rigor from two powerful quantitative modeling traditions: Molecular Systems Biology. Evolutionary Biology

BIOL Evolution. Lecture 9

SNAP Centre Workshop. Solving Systems of Equations

Evidence of Evolution

Chapter 7. Evolutionary Game Theory


I N N O V A T I O N L E C T U R E S (I N N O l E C) Petr Kuzmič, Ph.D. BioKin, Ltd. WATERTOWN, MASSACHUSETTS, U.S.A.

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

I. Short Answer Questions DO ALL QUESTIONS

Genetic Algorithms. Donald Richards Penn State University

Evolution of Phenotype as selection of Dynamical Systems 1 Phenotypic Fluctuation (Plasticity) versus Evolution

Mutation, Selection, Gene Flow, Genetic Drift, and Nonrandom Mating Results in Evolution

OVERVIEW. L5. Quantitative population genetics

ADVANCED PLACEMENT BIOLOGY

Statistical mechanics of fitness landscapes

Chapter 22: Descent with Modification: A Darwinian View of Life

Chapter 17. From Gene to Protein. Biology Kevin Dees

Topic 9 Examples of Mass Functions and Densities

Relations in epidemiology-- the need for models

The concept of the adaptive landscape

The Theory of Evolution

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

A Simple Haploid-Diploid Evolutionary Algorithm

Endowed with an Extra Sense : Mathematics and Evolution

The Use of Tall Tower Field Data for Estimating Wind Turbine Power Performance

Reading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype

INFORMATION-THEORETIC BOUNDS OF EVOLUTIONARY PROCESSES MODELED AS A PROTEIN COMMUNICATION SYSTEM. Liuling Gong, Nidhal Bouaynaya and Dan Schonfeld

Unit 3 - Molecular Biology & Genetics - Review Packet

Solutions to Problem Set 4


7.36/7.91 recitation CB Lecture #4

Gene Expression as a Stochastic Process: From Gene Number Distributions to Protein Statistics and Back

(Write your name on every page. One point will be deducted for every page without your name!)

Functional divergence 1: FFTNS and Shifting balance theory

Full file at CHAPTER 2 Genetics

3. DISCRETE PROBABILITY DISTRIBUTIONS

Math 3 Variable Manipulation Part 3 Polynomials A

Partitioning Genetic Variance

The inevitability of unconditionally deleterious substitutions during adaptation arxiv: v2 [q-bio.pe] 17 Nov 2013

This is DUE: Come prepared to share your findings with your group.

Written Exam 15 December Course name: Introduction to Systems Biology Course no

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

Molecular evolution - Part 1. Pawan Dhar BII

chatper 17 Multiple Choice Identify the choice that best completes the statement or answers the question.

David B. Lukatsky and Ariel Afek Department of Chemistry, Ben-Gurion University of the Negev, Beer-Sheva Israel

Supplementary Figure 1. Phenotype of the HI strain.

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

Natural Selection results in increase in one (or more) genotypes relative to other genotypes.

The Evolution of Sex Chromosomes through the. Baldwin Effect

Exploring the Effect of Sex on Empirical Fitness Landscapes

Mathematical statistics

Transcription:

Protein Mistranslation is Unlikely to Ease a Population s Transit across a Fitness Valley Matt Weisberg May, 2012

Abstract Recent research has shown that protein synthesis errors are much higher than previously determined. Furthermore, mistranslation has been pinpointed as the most error-prone step in protein synthesis (Meyerovich et. al, 2010, Wilke and Drummond 2009). Prompted by this information, we explored the effect of mistranslation on evolution. Some mutations that have deleterious effects might have beneficial mutant neighbors in genotypic space. We wondered if higher levels of mistranslation would act as a smoothing effect on fitness landscapes and would decrease the expected time for a population to cross a fitness valley by mistranslating deleterious proteins into their beneficial protein neighbors. Ultimately it does not seem that mistranslation has a large effect on this process. Methods In order to view the effect of mistranslation in the easiest way possible, we adopted the theoretical fitness landscape from Weinreich and Chao 2005. This model depicts genotypic space as follows: an individual is at genotype ab with fitness 1. The two possible mutations located one nucleotide away are named ab and Ab and have fitness 1-s del. If these two mutations occurred in the same individual it would have genotype AB and would have fitness 1+s ben. Their model for traversing this type of fitness vally is given as Which is a combination of and N = Population Size µ = mutation rate u(x) = the probability of fixation given N and s ben and s del.

Our model is a slight variation on the Weinreich and Chao model, in which we introduce a modification of s del entitled s del *. S del * tempers s del by the amount of mistranslationbased change into a more fit neighboring genotype. s del * is a transformation of our attempt to model a dosage response curve. A dosage response represents the biological notion that the relationship between effect of a protein and number of proteins in the cell is not necessarily linear. In fact a few number of copies of a protein may provide the cell with all the functionality it needs from that protein. We first modeled the dosage response curve as d(x) = (1-e -αx ), where x is the proportion of proteins in the cell that display the mistranslated phenotype. and α is the difference from linearity of the dosage response curve, and d(x) is the fitness of the individual. This gave us the shape but not the size we wanted, so in order to scale all values to 1 we normalized all results by dividing by d(1), resulting in the final function d(x) = (1-e -αx )/(1-e -α) Here are some examples of the dosage response curve in action: \ X in this equation is itself an expression of the probability mistranslation producing the double mutant, so we set out to determine the function underlying this probability.

We decided the probability of getting a mistranslation error would likely be distributed with a Poisson distribution, which has distribution: P(X) = λe -λ X / (X! ). λ represents the frequency of the desired event, so we set it to νl, where ν = the mistranslation rate per amino acid and L = the length of the protein. νl is the frequency of a mistranslation event given the size of a protein. X above is the number of events for which are determining the probability of their occurrence. In our case we are looking for 1 mistranslation error, therefore P(1) = λe -λ / (1! ) = νle -ν L This is the probability of seeing exactly 1 mistranslation error over the length of a protein length L amino acids given the mistranslation per amino acid rate ν. Next, because each amino acid has 19 different amino acids to which it can change, there are 19*L total number of mistranslation products that are one amino acid away from a protein of length L. Since we are looking for one mistranslation product in specific, our final probability of receiving the double mutant, beneficial neighbor is: νle -ν L / 19*L. Again, the shape of the function was now accurate, but the size was not bounded by s del and s ben. To address this problem, we multiplied this expression by s del +s ben to scale the expression such that its maximum is at sben and then added 1-s del to ensure that at the lowest the function was at sdel. We then subtracted this total from 1 in order to derive a value that represented a mutation with a fitness different from the wild-type s fitness of 1. Lastly, the above calculation returns the fitness of the individual containing the mutants, and we want the difference between that individuals fitness from the wild type fitness. To achieve this we subtract the above calculation from the wild type fitness of 1. IN SUMMARY THE CALCULATION OF sdel* IS: s del * = 1 - ((1-e -αν Le^(- ν L)/(19L) ) / (1-e -α )*(s del +s ben )+(1-s del ))

Results For all of the results calculated here we used the values : mu = 1x10-9 sben =.1 sdel =.01 The results according to Weinreich and Chao 2005 are in black and the results from our trials are in blue. The appearance of only black circles indicates no difference between the datasets. Only the combination of nu =.01 and Alpha = 1000 gave us a value that was QUALITATIVELY different from the results in Weinreich and Chao 2005. We had previously reasoned that the power of mistranslation might be felt in particular if it managed to turn a deleterious mutation into a beneficial mutation, i.e. if sdel were to become negative and thus making the fitness of the individual greater than 1. In fact, sdel* for this combination is -.0093, confirming our reasoning. However an alpha

value of 1000 connotes an extremely strong dosage response curve, so we tested to see what combinations of alpha, nu, and L would result in a negative sdel*. The following combinations resulted in a negative sdel*: Alpha L Nu sdel* 100 10.1-0.0370 1000 10.01-0.0413 1000 10.1-0.3727 1000 100.1-0.0370 It is interesting to note that increasing Nu does not guarantee a higher sdel*. The reason we see it maximized in the eighth panel is related to the Poisson distribution driving the probability of getting a double mutant. P(1) is maximized when νl = 1. Above that the probability of getting one mistranslation product begins to decrease and the probability of getting two mistranslation products begins to increase, attenuating the gain in sdel*. Discussion These values indicate that mistranslation is likely to be important for a range of values that are biologically extreme and therefore most likely does not influence the dynamics of fitness valley traversal. We set out to determine the importance of this phenomenon, but from our standpoint it does not seem that it is likely to be a big factor. Additionally, we have since received information that makes us believe that our dosage response curve is stronger than biology indicates, and that we instead should use a model of X/(1+X) instead of (1-e -αx )/(1-e -α ) (Cherry 2009). This would further decrease the efficacy of mistranslation in aiding the crossing of fitness valleys. Using this dosage response curve, none of the above combinations generate sdel* < 0. It seems that mistranslation does not affect this process greatly because of the large pool of possible genotypic mistranslation neighbors from which we are hoping to pull one specific error. To address this, we discussed the possibility of calculating the probability of finding our beneficial double mutant as νle -ν L / 19 instead of νle -ν L / 19*L. This redefinition still does not broaden the biological relevance of mistranslation in fitness valley traversal. However, if sben is great enough (e.g. sben =1), this redefinition brings mistranslation s power into a possibly biologically relevant zone. Still it takes manipulation on our part and extreme cases before mistranslation takes on importance for crossing fitness valleys and is likely not a strong player in evolutionary dynamics.

Further Research Figure 2 in Weissman et al. 2010 showed that the time to fixation for mutations is affected by recombination. The algebra there suggests that given shallower fitness valleys, increased recombination leads to shorter valley crossing time. In this model, we have not included the effect of recombination combined with mistranslation and are further investigating the effect of these two forces in tandem. We question whether mistranslation might temper the effect of sdel enough to enter the process into the regime that is helped by recombination. This area is of great interest to us now. Works Cited Cherry, Joshua L. Should We Expect Substitution Rate to Depend on Population Size? Genetics 150, no. 2 (October 1, 1998): 911 919. Drummond, Allen D., and Claus O. Wilke. The Evolutionary Consequences of Erroneous Protein Synthesis. Nat Rev Genet 10, no. 10 (October 2009): 715 724. Meyerovich, Mor, Gideon Mamou, and Sigal Ben-Yehuda. Visualizing High Error Levels During Gene Expression in Living Bacterial Cells. Proceedings of the National Academy of Sciences (June 7, 2010). Weinreich, Daniel M., and Lin Chao. Rapid Evolutionary Escape by Large Populations from Local Fitness Peaks Is Likely in Nature. Evolution 59, no. 6 (June 1, 2005): 1175 1182. Weissman, Daniel B., Marcus W. Feldman, and Daniel S. Fisher. The Rate of Fitness-Valley Crossing in Sexual Populations. Genetics 186, no. 4 (December 1, 2010): 1389 1410.