
Chapter 12 (Strickberger): Molecular Phylogenies and Evolution

METHODS FOR DETERMINING PHYLOGENY

In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task. Modern molecular methods of classification can overcome many of the pitfalls associated with traditional methods.

Molecular methods compare antibodies, DNA, RNA, or amino acid sequences to give insight into the relatedness of organisms. For example, evolutionary changes are indicated by substitutions in nucleotide and amino acid sequences. Such comparisons are possible even when no morphological, behavioral, or ecological links are present.

Amino acid sequences

Comparing amino acid sequences in a homologous protein can provide information about the relationships between different organisms. Hemoglobin was the first protein investigated. Its structure includes a porphyrin (heme, which can reversibly bind oxygen) attached to a globin polypeptide chain (>140 aa long).

Hemoglobin is found in animals (hemoglobin-like molecules are also found in plants, fungi, and invertebrates). The conserved nature of this molecule implies an early place in evolution.

In a normal adult: four polypeptide chains, two α and two β (α₂β₂)
In some adults: α₂δ₂
Fetal hemoglobin: α₂γ₂
Myoglobin and ε chains of hemoglobin are present in some tissues.

Differences in hemoglobin structures indicate two kinds of evolution.

1. Differing globin chains (α, β, γ, δ, ε) arose, producing the variety carried by a particular organism. Why might differing structures of globin chains arise? Each was a variation of the same globin theme:
   1. The chains are the same length.
   2. The sequences are similar at many positions.
   3. The 3-D structures are similar, which gives similar function.
   4. The β, γ, and δ genes are closely linked on chromosome 11. What does this close linkage tell you about the evolution of hemoglobin?

2. Once produced, each globin chain followed its own evolutionary path. This led to changes in its amino acid sequence in different species.

GENE DUPLICATION AND DIVERGENCE

The differing globins likely did not evolve independently and then accidentally converge in sequence and function.

Question: How did differing molecules of such similar structure and function arise?

Linkage studies suggest that gene duplication of an original globin-type gene took place. Once copies of the gene were present, each could theoretically undergo independent evolution, leading to today's α, β, γ, δ, and ε chains.

Question: How could you tell which one came first? And which one is the most recent?

The temporal order of hemoglobin chain evolution can be deduced by comparing amino acid sequences. The greater the sequence difference, the longer the time to their common ancestor (and, consequently, the greater the evolutionary distance).

We know:
The myoglobin chain differs most from the others (different amino acids at > 100 sites).
α differs from β at 77 sites.
β differs from γ at 39 sites but differs from δ at only 10 sites.

This implies that:
1. The myoglobin gene formed from an early duplication.
2. A later duplication separated the α and β genes.
3. β and δ represent the latest duplication.
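The ordering argument above can be sketched in a few lines of Python. This is only an illustration, assuming that more sequence differences mean an older duplication. The names `differences` and `duplication_order` are hypothetical, and because the notes give myoglobin only as "> 100 sites", the value 100 is used as a lower bound.

```python
# Pairwise amino acid differences quoted in the notes.
# Myoglobin is given only as "> 100 sites"; 100 is a lower bound (assumption).
differences = {
    ("myoglobin", "other globins"): 100,
    ("alpha", "beta"): 77,
    ("beta", "gamma"): 39,
    ("beta", "delta"): 10,
}

def duplication_order(diffs):
    """Return chain pairs from oldest to most recent inferred duplication.

    Assumes that more sequence differences mean more time since the split.
    """
    return sorted(diffs, key=diffs.get, reverse=True)

order = duplication_order(differences)
for rank, pair in enumerate(order, start=1):
    print(rank, pair, differences[pair])
```

The sorted list places the myoglobin split first and the β/δ split last, matching the order inferred in the notes.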

How do gene duplications arise? Through unequal crossing over, which results in increased chromosome material.

Some duplicated genes have evolved completely different functions although they share common amino acid sequences. The amino acid sequence of the α-lactalbumin protein is similar to that of lysozyme. Further evolutionary relationships include: both products are secreted by ducts, and a sugar is the substrate in both cases.

Once genes are duplicated, how long does it take for divergence to occur? This depends on the number of amino acid substitutions necessary to produce a differing function. However, even one amino acid change can drive drastic changes.

Example: A single amino acid substitution can convert lactate dehydrogenase (LDH) to malate dehydrogenase (MDH).
LDH is used in glycolysis (pyruvate → lactate).
MDH is used in the Krebs cycle (malate → oxaloacetate).
The change is generated by a substitution of arginine for glutamine at the 102nd polypeptide position.

Is it ironic that this simple aa change drives a major functional alteration, but in two closely related cycles?

DETERMINING MOLECULAR PHYLOGENIES

One can estimate the evolutionary similarity between two genes by determining the minimum number of mutations necessary to transform an amino acid in one sequence into the amino acid at the same position in the other sequence. Minimizing the number of mutations necessary to drive a given change is referred to as parsimony.

Example: it is easier to explain that a phenylalanine codon (UUU) arose from a single nucleotide substitution in a serine codon (UCU → UUU) than from a triple nucleotide substitution in a glutamic acid codon (GAA → UUU).

Once parsimoniously determined evolutionary distances between species are established, phylogenetic relationships can be resolved.

Example: Assume the most parsimonious mutational distance between a protein in species A and B is 25, between A and C is 20, and between B and C is 30. Which two are most closely related? Assign legs x (leading to A), y (leading to C), and z (leading to B) to represent the numbers of mutations responsible for their divergence.

The phylogenetic relationship can be portrayed as a three-branched tree in which legs x, y, and z lead from a common node to A, C, and B, respectively.

The length of the legs can be calculated. The A-B distance (25) is 5 mutations less than the C-B distance (30). Therefore, x is 5 mutations less than y:

   y + z = 30
 -(x + z = 25)
   y - x = 5

Since y + x = 20 (the A-C distance) and y - x = 5, we can determine y:

   y + x = 20
 +(y - x = 5)
   2y = 25

y = 12.5 and, by substitution, x = 7.5
z = (A-B distance) - x = 25 - 7.5 = 17.5

These values give the estimated branch position.
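The same three-taxon calculation can be written as a short Python sketch. It assumes, as the equations above imply, that x is the leg to A, y the leg to C, and z the leg to B, so that x + z = A-B, x + y = A-C, and y + z = B-C. The function name `three_taxon_branches` is hypothetical.

```python
def three_taxon_branches(d_ab, d_ac, d_bc):
    """Solve x + z = d_ab, x + y = d_ac, y + z = d_bc.

    Returns (x, y, z): the legs leading to A, C and B respectively.
    """
    x = (d_ab + d_ac - d_bc) / 2
    y = (d_ac + d_bc - d_ab) / 2
    z = (d_ab + d_bc - d_ac) / 2
    return x, y, z

# Distances from the worked example in the notes
x, y, z = three_taxon_branches(d_ab=25, d_ac=20, d_bc=30)
print(x, y, z)  # 7.5 12.5 17.5
```

Solving the three equations in closed form gives the same leg lengths as the elimination steps in the notes.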

Using mutational data, we can generate phylogenetic trees that display relationships among varying organisms. The trees are calculated with complex mathematical algorithms. Many trees are possible, but only one will represent the true phylogeny.

How do we know when a tree is the best one? Bootstrapping calculates the proportion of acceptable trees in which a node appears when the data are repeatedly sampled with replacement.

Example: Resample the positions that feature sequence differences one hundred times to produce one hundred trees. In any one resampling, some differences are omitted and some appear more than once. Each sampling generates a tree in which a particular node (position) may or may not occur. The bootstrap value is the frequency (% of the time) with which the same branch appears.

NUCLEIC ACID PHYLOGENIES BASED ON DNA-DNA HYBRIDIZATIONS

Homology among genes from different organisms can be calculated by measuring the degree to which homologous nucleotide sequences in single strands pair up to form double strands. This is referred to as DNA reassociation.

DNA is isolated from two organisms (X and Y) and dissociated into single strands, then allowed to reassociate into X-Y hybrid double strands.

The reassociation process can be monitored by noting the A260 (absorbance at 260 nm) on a spectrophotometer. As the DNA reassociates (becomes double stranded), the A260 will decrease. The rate of reassociation is proportional to the homology of the DNA strands in the mixture.

The method can be used to compare simple or complex mixtures of DNA comprising billions of nucleotides.

DNA reassociation has been used to deduce the phylogenetic relationship of primates.

Note: paleontological evidence suggests that the lineages of Old World monkeys and apes-humans diverged 33 million years ago. A 7.7 °C change in the thermal stability of DNA from humans and Old World monkeys has also been observed. This implies that every 1 °C shift in DNA thermal stability represents a 4.3 million year interval in the evolution of primates (33 / 7.7 ≈ 4.3). This calibration allows the placement of the bottom x-axis in Fig. 12-11.

DNA hybridization techniques have their detractors: DNA hybridization compresses all divergence information into a single distance measurement.

NUCLEIC ACID PHYLOGENIES BASED ON RESTRICTION ENZYME SITES

Restriction enzymes recognize short (4 to 8 nucleotides), specific nucleotide sequences and cleave the DNA at these sites.

Example: EcoRI recognizes the sequence

5'-GAATTC-3'
3'-CTTAAG-5'

and will cut (restrict) the DNA between the G and A.
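To make the EcoRI example concrete, the sketch below cuts a linear top-strand sequence at every GAATTC site, between the G and the first A, and reports the fragments. It is a minimal illustration only: the function `digest` and the test sequence are made up for this example, and only one strand is modeled.

```python
ECORI_SITE = "GAATTC"
CUT_OFFSET = 1  # EcoRI cuts between the G and the first A of the site

def digest(sequence, site=ECORI_SITE, offset=CUT_OFFSET):
    """Cut a linear top-strand sequence at every occurrence of `site`.

    Returns the list of fragments in 5' to 3' order.
    """
    fragments = []
    start = 0
    pos = sequence.find(site)
    while pos != -1:
        cut = pos + offset
        fragments.append(sequence[start:cut])
        start = cut
        pos = sequence.find(site, pos + 1)
    fragments.append(sequence[start:])
    return fragments

# Made-up sequence containing two EcoRI sites (illustration only)
example = "ATGCGAATTCTTAGGCCGAATTCAA"
fragments = digest(example)
print(fragments)                    # ['ATGCG', 'AATTCTTAGGCCG', 'AATTCAA']
print([len(f) for f in fragments])  # [5, 13, 7]
```

A sequence with a mutated site would yield fewer, longer fragments, so the list of fragment lengths changes with the position and number of sites.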

Since the DNA from different species exhibits differing sequences, the placement of restriction sites will be species-specific (sometimes strain-specific). Therefore, each species' DNA will have fragments of characteristic length following enzyme restriction.

There are many restriction enzymes available with which DNA can be restricted. Therefore, complex mixtures of DNA fragment lengths can be generated, each representative of a differing species. These are restriction fragment length polymorphisms (RFLPs).

Restriction maps can allow a comparison between species.

Example: mitochondrial DNA from humans and apes was restricted with 19 different enzymes. The enzymes cleaved the DNA at approximately 50 sites. Comparison of site placement can yield evolutionary data.

In agreement with previously determined phylogenies, humans share many more restriction sites with chimpanzees and gorillas than with orangutans and gibbons. However, the branching is unclear.

NUCLEIC ACID PHYLOGENIES BASED ON NUCLEOTIDE SEQUENCE COMPARISONS AND HOMOLOGIES

The most accurate method for determining phylogenetic relationships between different organisms is the direct comparison of DNA sequences of the same (or homologous) gene.

Databases archive volumes of sequence data, for example GenBank (NCBI).

Sequence information has suggested several evolutionary events:
1. Extensive horizontal gene transfer between genomes.
2. A considerable amount of gene duplication (25% of the Bacillus subtilis genome).
3. Many Archaea protein sequences are more similar to those of Bacteria than to those of eukaryotes.

4. Proteins used in replication, transcription, and translation show greater similarity between Archaea and eukaryotes.
5. 50% of genes have no known function.
6. Roughly 480 genes might be the minimum required for life, based on sequencing of the Mycoplasma genitalium genome.

Primary targets for sequencing analyses to determine phylogeny are the ribosomal RNA genes. The gold standard today is the 16S rRNA gene (prokaryotes) and the 18S rRNA gene (eukaryotes); however, Strickberger focuses on the 5S rRNA gene.

Why the rRNAs?
1. They are universally distributed in all organisms. As a component of the protein synthesis machinery, they have a similar function in different organisms.
2. The gene features regions that are highly conserved as well as regions that are variable. The constant secondary structure allows for alignment and comparison of DNA sequences.

RATES OF MOLECULAR CHANGE: EVOLUTIONARY CLOCKS

Inherent in all phylogenies is that evolutionary differences arose due to mutational differences. The greater the number of differences, the greater the evolutionary distance between organisms.

In Figure 12-11, the investigators used a time scale in which 1 °C of DNA thermal stability change reflected 4.3 million years of evolutionary time. In doing this, they assumed that mutations occurred at a fixed rate over time.
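The time scale described here is a simple proportional calibration, which the following sketch reproduces from the two figures given earlier in the notes (a 33 million year divergence and a 7.7 °C thermal stability change). The function name `divergence_time` and the 2.0 °C input are hypothetical, and the conversion is only as good as the fixed-rate assumption.

```python
DIVERGENCE_MY = 33.0  # Old World monkey vs. ape-human split, million years (from the notes)
DELTA_T_C = 7.7       # change in DNA thermal stability, degrees C (from the notes)

# Calibration: million years represented by each 1 degree C shift
my_per_degree = DIVERGENCE_MY / DELTA_T_C

def divergence_time(delta_t_c, rate=my_per_degree):
    """Convert a thermal stability difference (degrees C) to million years.

    Assumes mutations accumulate at a fixed rate over time.
    """
    return delta_t_c * rate

print(round(my_per_degree, 1))  # 4.3
# Hypothetical 2.0 degree difference between two primate species
print(round(divergence_time(2.0), 1))
```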

A fixed rate implies that an evolutionary clock determines the rate at which many mutations occur. Evidence can be found in the amino acid sequence of hemoglobin.

Compare the α-hemoglobin amino acid sequence of several different organisms to that of sharks. How many differences are detected?

Organism      Differences from shark
Carp          85
Salamander    84
Chicken       83
Mouse         79
Human         79

The results show that, although these organisms have undergone considerable morphological change, each has accumulated a similar number of differences from the shark, which suggests that mutations occurred at a roughly constant rate.

The number of sequence differences in β-hemoglobin chains correlates with the time to a common ancestor for many organism pairs:

Organism pair                 aa changes per 100 codons   Time to common ancestor (million years)
Human/monkey                   5                           30
Human/cattle                  18                           90
Marsupial/placental mammal    27                          130
Bird/mammal                   32                          250
Shark/bony vertebrate         65                          500

If evolutionary clocks exist, then two consequences can be expected:
1. The lines of descent leading from a common ancestor to all contemporary descendants should have similar rates of fixed mutations.
2. The proportional rate of fixation that occurs in one gene relative to the rates of fixation in other genes stays the same throughout any line of descent.

These expectations were tested:
The amino acid sequences of seven proteins in 17 vertebrate taxa were examined.
An evolutionary clock was calibrated using known and accepted dates of divergence.
The temporal length of each line of descent was calculated.

The number of nucleotide substitutions that occurred over a given length of time was compared among the 17 taxa.

The results suggested that the rate at which the individual proteins changed varied significantly among the differing lines of descent. This indicated that molecular changes were not uniform for a specific protein or taxon.

However, when nucleotide substitutions are averaged over all seven proteins for each branching point in the phylogeny, the rate of molecular change over time is constant. This procedure calibrates an evolutionary clock for these particular proteins and allows us to link change to time.

The need to average the nucleotide substitutions among the different genes in order to achieve a linear relationship between time and nucleotide substitutions tells us that no one evolutionary clock applies to every nucleotide sequence. Why? Perhaps selection intensity fixes some mutations more securely than others.
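As a rough illustration of what calibrating a clock involves, the sketch below takes the β-hemoglobin figures from the table earlier in this section, computes a rate for each organism pair, and fits one straight line through the origin. The through-origin least-squares fit is an assumption made for this example and is not the procedure used in the seven-protein study. The names `beta_globin`, `rates`, and `slope` are hypothetical.

```python
# (organism pair, aa changes per 100 codons, time to common ancestor in million years)
beta_globin = [
    ("Human/monkey", 5, 30),
    ("Human/cattle", 18, 90),
    ("Marsupial/placental mammal", 27, 130),
    ("Bird/mammal", 32, 250),
    ("Shark/bony vertebrate", 65, 500),
]

def rate_per_100_my(changes, time_my):
    """aa changes per 100 codons per 100 million years for one organism pair."""
    return 100.0 * changes / time_my

rates = {pair: rate_per_100_my(c, t) for pair, c, t in beta_globin}

# Least-squares slope of a line through the origin: sum(t*c) / sum(t*t).
# Units: aa changes per 100 codons per million years.
slope = sum(t * c for _, c, t in beta_globin) / sum(t * t for _, _, t in beta_globin)

for pair, rate in rates.items():
    print(f"{pair}: {rate:.1f} changes per 100 codons per 100 My")
print(f"single-clock estimate: {slope:.3f} changes per 100 codons per My")
```

The per-pair rates range from about 13 to about 21 changes per 100 codons per 100 million years, so a single line is an approximation, which is consistent with the point that one protein alone does not give a perfectly uniform clock.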