Lecture Notes: BIOL2007 Molecular Evolution

Kanchon Dasmahapatra (k.dasmahapatra@ucl.ac.uk)

Introduction

By now we are all familiar with, and understand (or think we understand), how evolution works on traits and characters: survival of the fittest and stuff like that. Heritable changes in traits and characters are a result of underlying changes at the level of DNA. What we are probably less acquainted with is how DNA itself changes and evolves. In this lecture we will explore the theme of molecular evolution, taking a look at how and why nucleotide and amino acid sequences evolve.

Background information

But first, some background information. DNA has many different roles. Most of our DNA does not code for any protein and is therefore different in character from the sections of DNA that do. In eukaryotes, genes (stretches of DNA that code for a particular protein) frequently consist of exons and introns. After the DNA is transcribed into messenger RNA (mRNA), the intron regions are removed (spliced out) and the exons are joined back together. This modified mRNA is then translated into the protein. Therefore, while exons represent protein-coding regions, introns do not.

Translation of the mRNA into the protein is carried out according to the genetic code, which describes how each of the 20 amino acids is specified by a consecutive sequence of three of the four bases comprising DNA. These triplets of bases are called codons. As there are 64 possible codons (three of which are stop signals) and only 20 amino acids, the same amino acid is often encoded by more than one codon. Codons that specify the same amino acid are called synonymous codons.

Point mutations (replacement of one base by another) that do not change the amino acid coded by a codon are known as synonymous or silent mutations. This is often the case for point mutations occurring in the third position of codons. Point mutations that do change the coded amino acid are called non-synonymous mutations.

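The distinction between synonymous and non-synonymous point mutations can be made concrete with a short Python sketch. It assumes the standard nuclear genetic code and DNA codons written with T rather than U. The function names are illustrative only and do not come from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; amino acids are listed with codon positions 1, 2, 3
# cycling through T, C, A, G (position 3 fastest). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_point_mutation(codon, position, new_base):
    """Classify a single-base change at `position` (0, 1 or 2) of a DNA codon."""
    if new_base == codon[position]:
        raise ValueError("new_base must differ from the original base")
    mutant = codon[:position] + new_base + codon[position + 1:]
    same_amino_acid = CODON_TABLE[codon] == CODON_TABLE[mutant]
    return "synonymous" if same_amino_acid else "non-synonymous"


def synonymous_fraction_by_position():
    """Fraction of all possible point mutations that are synonymous,
    computed separately for codon positions 1, 2 and 3 (all 64 codons)."""
    fractions = []
    for position in range(3):
        synonymous = total = 0
        for codon in CODON_TABLE:
            for base in BASES:
                if base == codon[position]:
                    continue
                total += 1
                if classify_point_mutation(codon, position, base) == "synonymous":
                    synonymous += 1
        fractions.append(synonymous / total)
    return fractions


fractions = synonymous_fraction_by_position()
print(classify_point_mutation("GGT", 2, "C"))  # Gly -> Gly
print(classify_point_mutation("AAA", 0, "G"))  # Lys -> Glu
print(fractions)
```

Under these assumptions, most third-position changes turn out to be silent, while silent changes at the first and second positions are rare.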
Modes of molecular evolution

There are a number of different ways, or modes, by which molecular evolution takes place. Most of you will be familiar with these mutational modes, so we won't go into a great deal of detail with them. Here is a list of some of these modes:

- Single base pair changes (substitutions or point mutations)
- Insertions or deletions, also known as indels
- Gene duplications: formation of multigene families and pseudogenes
- Slippage: microsatellite length changes
- Chromosomal mutations
- Transposition

The Classical vs. the Balance Schools

Before the 1960s (in the days before there were any data about protein or DNA variation) there were two schools of population geneticists: the classical and balance schools.

The classical school believed that polymorphisms (the existence of more than one allele at a locus in a population) were rare. They argued that natural selection was mainly a purifying force that removed any deleterious alleles that arose, or else drove any advantageous alleles to fixation. Therefore, they believed that individuals were homozygous at most loci.

In contrast, the balance school believed that polymorphisms were common. Polymorphisms at the various loci were thought to be maintained by different forms of balancing selection that favoured heterozygotes over homozygotes. Both schools of thought agreed that natural selection was the force driving molecular evolution.

In the mid-1960s the technique of protein electrophoresis was introduced, allowing investigation into levels of enzyme polymorphism. The results showed that large amounts of genetic variation were present in natural populations, appearing to vindicate the balance school's beliefs. The balance school held that these high levels of polymorphism were maintained by balancing selection. However, others argued that maintaining such high levels of polymorphism at thousands of loci by balancing selection would be very costly. Summed over multiple loci, the resulting genetic load would be large enough to drive populations to extinction!

The neutral theory of molecular evolution

However, the high levels of polymorphism can be explained without encountering excessive genetic load simply by dropping the assumption that natural selection is the driving force of molecular evolution, and instead allowing the majority of fixed mutations to be neutral, that is, to have no effect on fitness.

Two papers, by Kimura in 1968 and by King and Jukes in 1969, first proposed this neutral theory of molecular evolution. Since then it has become one of the most important and controversial theories in evolutionary biology. In his paper, Kimura made some simple calculations. If

$\mu$ = mutation rate per gene per generation, and
$N$ = effective population size,

then

Number of alleles (gene copies) at a locus in a diploid population = $2N$
Number of new mutations per generation = $2N\mu$

Most of the time a new neutral allele will be quickly lost from the population by genetic drift. But sometimes it will drift through the population and become fixed, that is, it will replace (or substitute) the original allele in the population.
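The fate of a new neutral allele under drift can be explored with a small simulation. The Python sketch below assumes a Wright-Fisher model (constant population size, non-overlapping generations, random sampling of 2N gene copies each generation). The notes do not specify a model, so this is an illustrative choice, and the function names are hypothetical. The fraction of runs ending in fixation can be compared with 1/(2N), the standard expectation for a single new neutral copy among 2N.

```python
import random


def new_mutation_fixes(n_diploid, rng):
    """Follow one new neutral allele (a single copy among 2N) until it is
    either lost (0 copies) or fixed (2N copies). Returns True if fixed."""
    total_copies = 2 * n_diploid
    count = 1
    while 0 < count < total_copies:
        freq = count / total_copies
        # Wright-Fisher step: each of the 2N gene copies in the next
        # generation independently descends from the new allele with
        # probability equal to its current frequency.
        count = sum(rng.random() < freq for _ in range(total_copies))
    return count == total_copies


def fixation_fraction(n_diploid, trials, seed=0):
    """Fraction of independent new neutral mutations that reach fixation."""
    rng = random.Random(seed)
    fixed = sum(new_mutation_fixes(n_diploid, rng) for _ in range(trials))
    return fixed / trials


n = 10
estimate = fixation_fraction(n, trials=4000, seed=1)
print(f"simulated: {estimate:.3f}   expected 1/(2N): {1 / (2 * n):.3f}")
```

Most runs end with the new allele being lost within a few generations, and only a small minority end in fixation. Any single estimate will wander around the expected value because the number of trials is finite.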

The probability that the new allele will drift to fixation = $1/(2N)$ (this is equivalent to the probability of reaching into a bag of $2N$ marbles, all black except one, and pulling out the only red marble). Therefore, the rate of substitution of an allele by a new allele is

$2N\mu \times \frac{1}{2N} = \mu$

This means that the rate of neutral molecular evolution is independent of population size and is simply equal to the neutral mutation rate.

The average time for a neutral mutation to drift to fixation (for those that do get fixed) is $4N$ generations. Therefore, while the rate of origin and fixation of new mutations ($\mu$) is independent of population size, the time a mutation takes to pass through the population is proportional to population size. Under the neutral theory, polymorphisms in a large population are therefore simply a result of lots of neutral mutations arising and passing through the population slowly, such that at any one time there are several different alleles at a particular locus drifting through the population.

According to the neutralists, most mutations are either deleterious and are selectively removed, or are effectively neutral, in which case there is a small probability that they are fixed. Natural selection is incorporated, but as a purifying force, removing deleterious mutations and playing only a small role in fixing new mutations. As we have seen above, the probability of fixation of a neutral allele by drift is $1/(2N)$. If this probability is bigger than the selection coefficient acting on the mutation, the influence of drift is greater than that of selection and the mutation is effectively neutral. So, the neutral theory does not argue that most mutations are completely neutral, but that any selection pressures are outweighed by the effects of drift.

In contrast, according to the selectionists, mutations are fixed because they confer a selective advantage, and neutral mutations are rare.

Some predictions from the neutral theory

1. There is a constant rate, or molecular clock, of sequence evolution.
2. There is an inverse relationship between the rate of substitution and the degree of functional constraint acting on a gene, such that functionally constrained genes or gene regions evolve at a lower rate and vice versa.

The molecular clock

A molecular clock is compatible with the neutral theory, as the rate of substitution of neutral mutations is $\mu$ and is not affected by population size or selective pressures. As long as $\mu$ is constant across species and most molecular evolution is neutral, the rate of evolution should be constant across lineages.
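The arithmetic behind the neutral substitution rate can be checked numerically. The Python sketch below simply restates the quantities from the notes ($2N\mu$ new mutations per generation, fixation probability $1/(2N)$, mean fixation time $4N$ generations). The mutation rate and population sizes are hypothetical values chosen only for illustration.

```python
def neutral_substitution_rate(n_diploid, mu):
    """Substitutions per gene per generation under strict neutrality:
    (new mutations per generation) x (probability each one is fixed)."""
    new_mutations_per_generation = 2 * n_diploid * mu
    fixation_probability = 1 / (2 * n_diploid)
    return new_mutations_per_generation * fixation_probability


def mean_fixation_time(n_diploid):
    """Average number of generations a neutral mutation takes to fix,
    given that it does fix (the 4N result quoted in the notes)."""
    return 4 * n_diploid


def expected_substitutions(mu, generations):
    """Expected neutral substitutions along one lineage over a number of
    generations, assuming a constant rate mu (the clock assumption)."""
    return mu * generations


mu = 1e-6  # hypothetical neutral mutation rate per gene per generation
for n in (100, 10_000, 1_000_000):  # hypothetical population sizes
    print(n, neutral_substitution_rate(n, mu), mean_fixation_time(n))
```

The population size cancels out of the substitution rate but not out of the fixation time, which is why large populations are expected to carry more neutral alleles in transit at any one moment.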

At first glance the rate of sequence evolution does indeed appear to be constant over time. However, on closer inspection, significant variation among lineages becomes evident. One way of testing the molecular clock is by using the relative rate test. A number of explanations have been put forward for deviations from the molecular clock, such as differences between lineages in generation time, metabolic rate, DNA repair efficiency and even a bit of natural selection.

Functional constraints and the rate of substitution

According to the neutral theory most mutations are deleterious and the rest are neutral (advantageous mutations are very rare). However, genes will differ in the proportion of mutations that are deleterious. The higher the functional constraint on a gene, the greater the strength of negative selection removing mutations. In a gene with high functional constraints the vast majority of mutants will be deleterious and be removed by selection, leaving only a small fraction of neutral mutations, which results in a low rate of substitution. In a less constrained gene a larger fraction of the mutations will be neutral, leading to a higher substitution rate.

Examples:

- Variation in rates between and within genes
- Substitution rates in non-coding regions: pseudogenes, introns
- Synonymous vs. non-synonymous substitution rates

Testing the neutrality of mutations using $d_N/d_S$:

1) Sequence copies of the gene of interest from a variety of species.
2) Construct a phylogeny of the species using the sequence or other data.
3) Identify synonymous and non-synonymous mutations.
4) Calculate the average synonymous rate of substitution, $d_S$, the average non-synonymous rate of substitution, $d_N$, and the ratio $\omega = d_N/d_S$.

We assume that synonymous mutations are neutral. As we have seen, due to functional constraints, in most genes $d_N < d_S$ and $\omega < 1$. If $d_N > d_S$, so that $\omega > 1$, coding changes are occurring more rapidly than silent changes.

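The counting involved in a $d_N/d_S$ calculation can be illustrated with a deliberately simplified Python sketch. It is not the procedure used by published software. It compares just two aligned, gap-free coding sequences rather than working on a phylogeny. It counts synonymous and non-synonymous sites in the style of Nei and Gojobori, and it skips codons that differ at more than one position. It reports uncorrected proportions of differences per site as stand-ins for $d_S$ and $d_N$, with no correction for multiple hits. All function names are hypothetical.

```python
import math
from itertools import product

BASES = "TCAG"
# Standard genetic code, codon positions cycling through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_sites(codon):
    """Number of synonymous sites in a codon: at each position, the fraction
    of the three possible base changes that leave the amino acid unchanged."""
    sites = 0.0
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            if CODON_TABLE[mutant] == CODON_TABLE[codon]:
                sites += 1 / 3
    return sites


def dn_ds(seq_a, seq_b):
    """Return (pN, pS, omega) for two aligned coding sequences (A, C, G, T)."""
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be aligned and a multiple of 3 long")
    syn_sites = syn_diffs = nonsyn_diffs = 0.0
    n_codons = 0
    for i in range(0, len(seq_a), 3):
        codon_a, codon_b = seq_a[i:i + 3], seq_b[i:i + 3]
        if "*" in (CODON_TABLE[codon_a], CODON_TABLE[codon_b]):
            continue  # ignore stop codons
        n_diff = sum(x != y for x, y in zip(codon_a, codon_b))
        if n_diff > 1:
            continue  # simplification: skip codons with several differences
        n_codons += 1
        syn_sites += (synonymous_sites(codon_a) + synonymous_sites(codon_b)) / 2
        if n_diff == 1:
            if CODON_TABLE[codon_a] == CODON_TABLE[codon_b]:
                syn_diffs += 1
            else:
                nonsyn_diffs += 1
    if syn_sites == 0:
        raise ValueError("no synonymous sites available for comparison")
    nonsyn_sites = 3 * n_codons - syn_sites
    p_s = syn_diffs / syn_sites
    p_n = nonsyn_diffs / nonsyn_sites
    if p_s == 0:
        omega = math.inf if p_n > 0 else math.nan
    else:
        omega = p_n / p_s
    return p_n, p_s, omega


# Toy example: one silent change (GGT -> GGC) and one coding change (AAA -> GAA).
p_n, p_s, omega = dn_ds("GGTAAACTG", "GGCGAACTG")
print(p_n, p_s, omega)
```

In the toy example there is one difference of each kind, yet $\omega$ comes out below 1 because there are far more non-synonymous sites than synonymous ones. This is why differences are scaled by the number of sites of each type rather than counted raw.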
A ratio $\omega > 1$ is indicative of positive selection to change the amino acid sequence.

Positive selection: evidence against the neutral model?

Examples: substitution rates within the major histocompatibility complex and HIV envelope proteins.

However, the procedure described above for detecting positive selection is insensitive. It calculates a single value of $\omega$ for a gene, averaging over the whole gene. It is possible that only a few parts of a protein are under strong positive selection. If this is the case, averaging over the whole gene will mean that the $\omega > 1$ signal from the parts under positive selection is swamped by the $\omega < 1$ signal coming from the majority of the gene. Some improvements have been made to methods for detecting positive selection, many of them coming from Ziheng Yang here at UCL (e.g. lysine, see OHP).

The nearly neutral model

By the early 1970s it was becoming clear that the neutral theory was too simplistic. There was evidence for positive selection acting on mutations, and the molecular clock did not tick at a perfectly constant rate. This gave rise to the nearly neutral model of molecular evolution. According to this theory, mutations in non-coding DNA and at synonymous sites are still strictly neutral. However, non-synonymous mutations are no longer regarded as neutral and are instead nearly neutral, being either slightly deleterious or slightly advantageous. Therefore, the nearly neutral model includes weak selection as well as genetic drift.

So who is correct, the neutralists or the selectionists?

It seems that both genetic drift and natural selection determine the evolution of mutations. Neutralists are probably correct in that most mutations are neutral, especially in non-coding DNA and at synonymous sites. However, natural selection is sometimes evident at non-synonymous sites when molecular evolution over short evolutionary time periods is examined.

Reading

Chapter 7 (Models of molecular evolution) in: Page, R. D. M. and E. C. Holmes. 1998. Molecular Evolution: a phylogenetic approach. Blackwell Publishing.
Yang, Z. and J. P. Bielawski. 2000. Statistical methods for detecting molecular adaptation. Trends in Ecology and Evolution 15:496-503.
Kimura, M. 1968. Evolutionary rate at the molecular level. Nature 217:624-626.
King, J. L. and T. H. Jukes. 1969. Non-Darwinian evolution. Science 164:788-798.
Li, W.-H. 1993. So, what about the molecular clock hypothesis? Current Opinion in Genetics and Development 3:896-901.