Molecular Evolution and DNA systematics

Similar documents
Comparing whole genomes

THEORY. Based on sequence Length According to the length of sequence being compared it is of following two types

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

Sequence Analysis 17: lecture 5. Substitution matrices Multiple sequence alignment

"Nothing in biology makes sense except in the light of evolution Theodosius Dobzhansky

Sara C. Madeira. Universidade da Beira Interior. (Thanks to Ana Teresa Freitas, IST for useful resources on this subject)

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

PHYLOGENY AND SYSTEMATICS

Phylogenetic analyses. Kirsi Kostamo

Bioinformatics Exercises

Evolutionary Analysis of Viral Genomes

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

Introduction to Bioinformatics Introduction to Bioinformatics

Bioinformatics tools for phylogeny and visualization. Yanbin Yin

Sequence analysis and comparison

Sequence Alignment: A General Overview. COMP Fall 2010 Luay Nakhleh, Rice University

5. MULTIPLE SEQUENCE ALIGNMENT BIOINFORMATICS COURSE MTAT

How to read and make phylogenetic trees Zuzana Starostová

Chapter 26 Phylogeny and the Tree of Life

Chapter 26. Phylogeny and the Tree of Life. Lecture Presentations by Nicole Tunbridge and Kathleen Fitzpatrick Pearson Education, Inc.

Practical Bioinformatics

BIOINFORMATICS LAB AP BIOLOGY

Phylogenetic Trees. How do the changes in gene sequences allow us to reconstruct the evolutionary relationships between related species?

3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT

Dr. Amira A. AL-Hosary

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Name. Ecology & Evolutionary Biology 2245/2245W Exam 2 1 March 2014

POPULATION GENETICS Winter 2005 Lecture 17 Molecular phylogenetics

Bioinformatics. Part 8. Sequence Analysis An introduction. Mahdi Vasighi

C.DARWIN ( )

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

CONCEPT OF SEQUENCE COMPARISON. Natapol Pornputtapong 18 January 2018

C3020 Molecular Evolution. Exercises #3: Phylogenetics

Systematics Lecture 3 Characters: Homology, Morphology

ISIS/Draw "Quick Start"

Phylogenetic analysis. Characters

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Cladistics and Bioinformatics Questions 2013

Using Bioinformatics to Study Evolutionary Relationships Instructions

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

How Molecules Evolve. Advantages of Molecular Data for Tree Building. Advantages of Molecular Data for Tree Building

mrna Codon Table Mutant Dinosaur Name: Period:

Draft document version 0.6; ClustalX version 2.1(PC), (Mac); NJplot version 2.3; 3/26/2012

Classification of Organisms

Phylogenetic inference

Algorithms in Bioinformatics

Bioinformatics 1 -- lecture 9. Phylogenetic trees Distance-based tree building Parsimony

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Introduction to Bioinformatics Online Course: IBT

Computational Biology: Basics & Interesting Problems

Skulls & Evolution. Procedure In this lab, groups at the same table will work together.

Phylogeny and Molecular Evolution. Introduction

Quantifying sequence similarity

PHYLOGENY WHAT IS EVOLUTION? 1/22/2018. Change must occur in a population via allele

Advanced topics in bioinformatics

Welcome to Evolution 101 Reading Guide

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

The practice of naming and classifying organisms is called taxonomy.

Seuqence Analysis '17--lecture 10. Trees types of trees Newick notation UPGMA Fitch Margoliash Distance vs Parsimony

Biochemistry 324 Bioinformatics. Pairwise sequence alignment

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Introduction to characters and parsimony analysis

BIOINFORMATICS: An Introduction

Study and Implementation of Various Techniques Involved in DNA and Protein Sequence Analysis

Phylogenetic Trees. What They Are Why We Do It & How To Do It. Presented by Amy Harris Dr Brad Morantz

Module: Sequence Alignment Theory and Applications Session: Introduction to Searching and Sequence Alignment

Chapter 16: Reconstructing and Using Phylogenies

Multiple Sequence Alignment, Gunnar Klau, December 9, 2005, 17:

Single alignment: Substitution Matrix. 16 march 2017

9/19/2012. Chapter 17 Organizing Life s Diversity. Early Systems of Classification

Historical Biogeography. Historical Biogeography. Systematics

BIO 285/CSCI 285/MATH 285 Bioinformatics Programming Lecture 8 Pairwise Sequence Alignment 2 And Python Function Instructor: Lei Qian Fisk University

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

Phylogenetic Tree Reconstruction

Lecture Notes: Markov chains

Theory of Evolution Charles Darwin

Constructing Evolutionary/Phylogenetic Trees

Name: Class: Date: ID: A

Biology 559R: Introduction to Phylogenetic Comparative Methods Topics for this week (Jan 27 & 29):

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Constructing Evolutionary/Phylogenetic Trees

Lecture 4: Evolutionary Models and Substitution Matrices (PAM and BLOSUM)

Chapter 19. Microbial Taxonomy

PHYLOGENY & THE TREE OF LIFE

ESS 345 Ichthyology. Systematic Ichthyology Part II Not in Book

Phylogeny: building the tree of life

Outline. Classification of Living Things

Phylogenetic methods in molecular systematics

Chapter 26: Phylogeny and the Tree of Life

Station 1 Fossil Record

Homework Assignment, Evolutionary Systems Biology, Spring Homework Part I: Phylogenetics:

MegAlign Pro Pairwise Alignment Tutorials

Evolutionary trees. Describe the relationship between objects, e.g. species or genes

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

A Phylogenetic Network Construction due to Constrained Recombination

COMPARING DNA SEQUENCES TO UNDERSTAND EVOLUTIONARY RELATIONSHIPS WITH BLAST

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 4. Bioinformatics 1: Biology, Sequences, Phylogenetics

Sequence analysis and Genomics

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

8/23/2014. Phylogeny and the Tree of Life

Transcription:

Biology 4505 - Biogeography & Systematics Dr. Carr Molecular Evolution and DNA systematics Ultimately, the source of all organismal variation that we have examined in this course is the genome, written in the four letters of the DNA genetic code. In recent years, DNA has become a powerful systematic tool to investigate evolutionary relationships within and among species. In this lab, we will investigate three basic aspects of molecular systematics, (1) data collection, in this case the analysis of DNA sequence data from an automated DNA sequencer, (2) analysis of molecular evolution: patterns of mutational change between more and more distantly related creatures, and (3) DNA systematics, the use of DNA sequence variation to ascertain phylogenetic relationships among species. 1. Collecting the DNA data Chromatographic data from the Applied Biosystems automated DNA sequencer can be analyzed in PC-DOS format with the program CHROMAS (v1.4) by Conor McCarthy (Griffith Univ., Australia). [available at http://trishul.sci.gu.edu.au/~conor/chromas.html ] In the POPULATION BIOLOGY group, click on the CHROMAS 1.4 icon. Open any file with the extension *.ABI: OPEN / FILE / wtdeer.abi The available files includes whales, fish, sharks, deer, and frogs. Under OPTIONS, choose CONTINUOUS EDIT and EDITED SEQUENCE, and adjust the X-scale for clarity: X_ZOOM / 4 (these may already be selected for you). You will see a chromatogram with a series of coloured peaks marked N. To read the sequence, change N to the correct base call, using lower-case letters: green = a, blue = c, black = g, and red = t. The correct spacing has been done for you: you should not have to insert or delete bases. Note that sometimes two colored peaks will have similar intensity: leave these as N, or use the ambiguity code: c or t = Y (pyrimidine), and a or g = R (purine). Try to read at least 300 nucleotides. Within the first 20-30 nucleotides, you should see an ATG sequence (corresponding to an AUG start codon in the mrna): mark this with capital letters (this will make it easier to align). SAVE your file to a new *.ABI file: FILE / SAVE AS / mybeast.abi {for example} DO NOT save your base calls to the original *.ABI files EXPORT your sequence to a *.SEQ file: FILE / EXPORT / PLAIN TEXT / mybeast.seq {for example}

2. Analyzing your sequence data DNA sequences can be manipulated with the Eyeball Sequence Editor (ESEE 3.0S) by Eric Cabot. In the POPULATION BIOLOGY program group, click on the ESEE 3.0 icon. Restore a previously-saved set of 14 reference species: FILE / RESTORE SAVE FILE / 3900C.SAV Note the blinking cursor: this indicates the sequence and nucleotide being edited. Choose sequences by moving the cursor with the up/down/left/right arrows. Insert your sequence: FILE / INPUT ASCII / C:\CHROMAS\mybeast.SEQ Move your sequence above the most closely related sequence: LAYOUT / MOVE SEQ / number ALIGN your sequence with a closely related reference sequence. In your sequence, delete all bases before the ATG start codon; insert a --- triplet to align with non-xenopus sequences Any uncalled nucleotides will show up as a string of NNNNN s at the end of your sequence, move to the first N and delete the rest of the string with the ctrl-z key Place the cursor on the reference sequence: hit F6, or COMMAND / MATCH Check the alignment: vertical lines ( ) indicate matches, gaps ( ) indicate mismatches The % Similarity will be indicated in a box at the top of the screen EDIT your sequence. If your sequence suddenly goes out of alignment, you may have added or deleted nucleotides during editing. Errors occur towards the 3' end: a single broad peak is read as two bases, or two peaks run together are read as a single base. Edit by hand; if time permits, go back to CHROMAS and correct. TRANSLATE your edited sequence into amino acids. Change the genetic code to the mammalian mitochondrial code: FORMAT / GENETIC CODE / MAMMALIAN MITO Move the cursor to the first letter of your sequence: translate: hit F4 or COMMAND / TRANSLATE The amino acid sequence is given in the IUPAC single-letter code. Re-name this line: FORMAT/ NAME / MybeastAA Translate the reference sequence & re-name it; MOVE your translation next to it in the same way you did the nucleotide sequence COMPARE the % Similarity of the two amino acid sequences in the same way you did the nucleotide sequences.

3. Patterns of Molecular Evolution A. Nucleotide and amino acid similarity Delete all but the following sequences with Shift-F2 Xenopus, Gadus, Alces, Rangifer, Lynx, and Canis Compare nucleotide and amino acid differences for the following pairs of sequences: moose (Alces) vs. caribou (Rangifer) (different genera) lynx (Lynx) vs. coyote (Canis) (different suborders) frog (Xenopus) vs. codfish (Gadus) (different classes) Move each pair of sequences adjacent to each other and MATCH: A box will indicate the % (sequence) similarity: RECORD this number in the tables below. Translate each sequence as indicated above, re-name the amino acid translation, move adjacent, and MATCH: Again note the % (amino acid) similarity: RECORD this number in the tables below. B. Patterns of mutational substitution For each of the three pairs of species listed above: In each sequence, move the cursor back to the first A of the start codon: space the DNA and amino acid sequences in codon triplets hit F1 ; re-do the MATCH command. spacing can be removed by hitting F2 For each comparison, RECORD the Number of 1st, 2nd, and 3rd codon position changes in the tables on the data sheet..

4. Evolutionary analysis: Evolutionary relationships of the Giant Panda A long-standing controversy in mammalogy concerns the evolutionary relationships and systematic position of the Giant Panda (Ailuropoda). General morphology seems to link it to bears (Ursus) (Ursidae); some details of the skull and its peculiar diet (bamboo) and biogeographic distribution (China) seem to link it to lesser pandas (Ailurus) and raccoons (Procyon) (both Procyonidae). DNA sequence data can be used to resolve this question. A. Phenetic (UPGMA distance) analysis Restore the original 3900C.SAV file; Remove all sequences except Canis, Ailuropoda, Ursus, Procyon, and marten(martes) Translate all five sequences; re-label the translations. Determine the nucleotide and amino acid similarities for all 10 pairwise comparisons [Complete the Tables 1 & 2 on the next page: put similarities above the diagonal] Convert the similarities to differences (%D = 100% - %S) [Put difference values below the diagonal] CALCULATE a UPGMA dendrogram from the data in Table 1. B. Cladistic (Maximum Parsimony) analysis [See Notes on Parsimony Analysis on the webpage]. Remove the Canis sequence (leaving Ailuropoda, Ursus, Procyon, and Martes). Move Ailuropoda to the top line. Make a 'dot comparison' matrix: COMMAND / GLOBAL COMPARE / Yes the latter three sequence are identical to Ailuropoda, except where indicated. Inspect each nucleotide position to see if it is phylogenetically informative: An informative character is one for which two species are homologous (they share a character state, in this case a nucleotide), and the other two species share a different character state (they share a different nucleotide). Such a character suggests that the two species have a common ancestor, from whom they inherited the particular character state. In your data set, COUNT the number of positions where: panda = bear & raccoon = marten : positions of type 5 panda = raccoon & bear = marten : " " " 6 panda = marten & bear = raccoon : " " " 7 Ai Ur Pr Ma aat tcg ctt cta gga atc tgc cta atc ctg.....a..g..a.t....... t.......a.....a..c..c.....t......... t.a.....a..a..g..g..t... t.t..t t.. 1 2 3 4 5 6 7 COMPLETE Table 3: the cladistic "maximum parsimony" answer is the one with the highest score.

Biology 4505 - Biogeography & Systematics NAME Dr. Carr 1. Is the Giant Panda more similar to the Bear or to the Raccoon? Explain. 2. Is the Giant Panda more closely related to the Bear or to the Raccoon? Explain. 3. FOR THE FINAL EXAM: From a four-taxon DNA data set, you should be able to 1. From a matrix of differences, calculate a UPGMA tree. 2. Recognize all phylogenetically informative sites. 3. Identify:the most similar pair of species (by the phenetic [distance] criterion), and the most closely related pair of species (by the cladistic [parsimony] criterion).

Biology 4505 - Biogeography & Systematics Dr. Carr Molecular Systematics Data %nuc %aa #1st #2nd #3rd Rangifer vs. Alces %nuc %aa #1st #2nd #3rd Lynx vs. Canis %nuc %aa #1st #2nd #3rd Gadus vs. Xenopus Tables 1 & 2. Nucleotide & amino acid comparison among caniform carnivores Record % similarities above diagonal, % differences below diagonal Table 1: Nucleotides Table 2. Amino acids Ca Ai Ur Pr Ma Ca Ai Ur Pr Ma Canis - - Ailuropoda - - Ursus - - Procyon - - Martes - - Table 3. Parsimony sites favouring three hypotheses of Ailuropoda relationships: Ai Pr Ai Ur Ai Ur Ur Ma Pr Ma Ma Pr Ailuropoda X X X Ursus X Y Y Procyon Y X Y Martes Y Y X