Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST

Similar documents
COMPARING DNA SEQUENCES TO UNDERSTAND EVOLUTIONARY RELATIONSHIPS WITH BLAST

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

Cladistics and Bioinformatics Questions 2013

BIOINFORMATICS LAB AP BIOLOGY

Open a Word document to record answers to any italicized questions. You will the final document to me at

UoN, CAS, DBSC BIOL102 lecture notes by: Dr. Mustafa A. Mansi. The Phylogenetic Systematics (Phylogeny and Systematics)

Using Bioinformatics to Study Evolutionary Relationships Instructions

PHYLOGENY AND SYSTEMATICS

Bioinformatics Exercises

Organizing Life s Diversity

CREATING PHYLOGENETIC TREES FROM DNA SEQUENCES

Phylogeny 9/8/2014. Evolutionary Relationships. Data Supporting Phylogeny. Chapter 26

Bioinformatics: Investigating Molecular/Biochemical Evidence for Evolution

Name: Class: Date: ID: A

Emily Blanton Phylogeny Lab Report May 2009

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

Organizing Life on Earth

RELATIONSHIPS BETWEEN GENES/PROTEINS HOMOLOGUES

Thursday, January 14. Teaching Point: SWBAT. assess their knowledge to prepare for the Evolution Summative Assessment. (TOMORROW) Agenda:

Mechanisms of Evolution Darwinian Evolution

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

9/19/2012. Chapter 17 Organizing Life s Diversity. Early Systems of Classification

GENERAL BIOLOGY LABORATORY EXERCISE Amino Acid Sequence Analysis of Cytochrome C in Bacteria and Eukarya Using Bioinformatics

Phylogenetic Trees. How do the changes in gene sequences allow us to reconstruct the evolutionary relationships between related species?

Chapter 16: Reconstructing and Using Phylogenies

AP Biology Evolution Review Slides

Pogil Phylogenetic Trees Answer Key Ap Biology

#Evolution. Nothing in Biology makes sense except in the light of evolution.

CHAPTER 26 PHYLOGENY AND THE TREE OF LIFE Connecting Classification to Phylogeny

BIOINFORMATICS: An Introduction

PSI Biology Classification Classification

Hands-On Nine The PAX6 Gene and Protein

Concept Modern Taxonomy reflects evolutionary history.

b. In Table 1 (question #2 on the Answer Sheet describe the function of each set of bones and answer the question.)

Introduction to characters and parsimony analysis

Chapter 26 Phylogeny and the Tree of Life

Introduction to Bioinformatics Online Course: IBT

Phylogeny and the Tree of Life

Chapter 7: Covalent Structure of Proteins. Voet & Voet: Pages ,

Evidence of Common Ancestry Stations

Biology 211 (2) Week 1 KEY!

Station 1: Evidence from Current Examples

Chapter 19: Taxonomy, Systematics, and Phylogeny

Bioinformatics. Dept. of Computational Biology & Bioinformatics

R.S. Kittrell Biology Wk 10. Date Skill Plan

SPECIATION. REPRODUCTIVE BARRIERS PREZYGOTIC: Barriers that prevent fertilization. Habitat isolation Populations can t get together

Genomes and Their Evolution

8/23/2014. Phylogeny and the Tree of Life

The Classification of Plants and Other Organisms. Chapter 18

The Theory of Universal Common Ancestry suggests that organisms on Earth evolved from a single common ancestor.

Station A: #3. If two organisms belong to the same order, they must also belong to the same

Chapter What is a Plant? Biology. Slide 1 of 33. End Show. Copyright Pearson Prentice Hall

CLASSIFICATION NOTES

Biology. Slide 1 / 47. Slide 2 / 47. Slide 3 / 47. Classification

Outline. Classification of Living Things

Studying Life. Lesson Overview. Lesson Overview. 1.3 Studying Life

Evolution. Changes over Time

The practice of naming and classifying organisms is called taxonomy.

Journal of Proteomics & Bioinformatics - Open Access

Biology Unit 1 Warm Ups. Mrs. Hilliard

Topic 7: Evolution. 1. The graph below represents the populations of two different species in an ecosystem over a period of several years.

Evidence of Evolution by Natural Selection. Dodo bird

1. Wings 5.. Jumping legs 2. 6 Legs 6. Crushing mouthparts 3. Segmented Body 7. Legs 4. Double set of wings 8. Curly antennae

The Environment and Change Over Time

What is the purpose of the Classifying System? To allow the accurate identification of a particular organism

Processes of Evolution

Creating a Dichotomous Key

Phylogeny & Systematics: The Tree of Life

Draft document version 0.6; ClustalX version 2.1(PC), (Mac); NJplot version 2.3; 3/26/2012

Computational Structural Bioinformatics

Convergent evolution:

Homology and Information Gathering and Domain Annotation for Proteins

Warm Up. What are some examples of living things? Describe the characteristics of living things

Chapters AP Biology Objectives. Objectives: You should know...

PHYLOGENY & THE TREE OF LIFE

OMICS Journals are welcoming Submissions

ADVANCED PLACEMENT BIOLOGY

CLADOGRAMS & GENETIC PHYLOGENIES

Prereq: Concurrent 3 CH

Name: Date: Answer: Answer:

CHAPTER 10 Taxonomy and Phylogeny of Animals

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution.

Skulls & Evolution. Procedure In this lab, groups at the same table will work together.

Miller & Levine Biology 2014

EVIDENCE OF EVOLUTION

Characteristics of Life

A A A A B B1

Proteins: Historians of Life on Earth Garry A. Duncan, Eric Martz, and Sam Donovan

Phylogeny and the Tree of Life

Valley Central School District 944 State Route 17K Montgomery, NY Telephone Number: (845) ext Fax Number: (845)

Cladis'c method of phylogene'c reconstruc'on

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

How should we organize the diversity of animal life?

Biology Keystone (PA Core) Quiz Theory of Evolution - (BIO.B ) Theory Of Evolution, (BIO.B ) Scientific Terms

name: Worksheets for Ch 14, 15, 16 Evolution

Classification Chapter 18

Section 18-1 Finding Order in Diversity

AP Curriculum Framework with Learning Objectives

Campbell Biology AP Edition 11 th Edition, 2018

Phylogenies & Classifying species (AKA Cladistics & Taxonomy) What are phylogenies & cladograms? How do we read them? How do we estimate them?

Transcription:

Investigation 3: Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST Introduction Bioinformatics is a powerful tool which can be used to determine evolutionary relationships and better understand genetic diseases. You are going to use this tool to explore the conservation of a popular enzyme, cytochrome C, and how it is present in different eukaryotic organisms. Background The Human Genome Project (HGP) was completed by scientists in 2003 and was coordinated by the U.S. Department of Energy and National Institutes of Health. The goals of the project were to: Identify all of the approximately 20,000-25,000 genes in human DNA. Store the genetic sequences in databases. Improve tools for data analysis. Transfer related technologies to the private sector. Address ethical, legal, and social issues arising from the identification of genetic data. The project mapped not only the genome of humans but also of other species such as Drosophila melanogaster (fruit fly), mouse, and Escherichia coli. The locations and complete sequences of the genes in each of these species are available for anyone in the world to access on the Internet. This information is important because the ability to identify the precise location and sequence of human genes will allow greater understanding of genetic diseases. Also, learning about the sequence of genes in other species helps us to understand evolutionary relationships among organisms. Many of our genes are similar if not identical to those found in other species. For example, a gene in fruit flies is found to be responsible for a particular disease. Scientists might wonder is this gene found in humans and does it cause a similar disease. It would take years to read through the human genome to locate the same sequence of base pairs. Given time constraints, this is not practical so a technological method was developed. Bioinformatics is a study that combines statistics, mathematical modeling, and computer science to analyze biological data. Through bioinformatics, entire genomes may be quickly compared in order to detect and analyze their similarities and differences. BLAST (Basic Local Alignment Search Tool) is an extremely useful bioinformatics tool which allows users to input a gene sequence of interest and search entire genomic libraries for identical or similar sequences.

Classification of organisms based on evolutionary history is called phylogenetic systematics. Scientists study how different organisms are related to determine if they have common ancestry. Today most scientists practice cladistics. Cladistics is a taxonomic approach that classifies organisms according to the order in time at which branches arise along a phylogenetic tree without considering the degree of morphological divergence. A phylogenetic diagram based on cladistics is called a cladogram. It is a tree constructed from a series of two-way branch points. Each branch point represents the divergence of a common ancestor. The cladogram is treelike where the endpoint of each branch represents a specific species (see Figure 1 below). Wolf Leopard Domestic Cat Figure 1. Sample Cladogram The cladogram featured in Figure 2 includes additional details such as the evolution of particular physical structures called derived characters. Note that the placement of the derived characters corresponds to that character having evolved. Every species above the character label possesses that structure. For example, lizards, tigers, and gorillas all have dry skin, whereas salamanders, sharks, and lampreys do not have dry skin. Gorilla Tiger Lizard Salamander Shark No Tail Lamprey Hair Dry Skin Lungs Jaws Figure 2. Cladogram of six animal species Evolutionary changes stemming from random mutations events can alter a protein s primary structure. Some mutations do not allow the organism to survive. In order the change to propagate, the mutation must either allow the organism to have the same evolutionary ability as it had previously or increase its probability to survive and reproduce. Sometimes a mutation can improve the fitness of a host in its natural environment. A classic Darwinian example is sickle cell anemia. This is a result of a single mutation whose adaptive consequences turned out to be beneficial to combat malaria. Normal hemoglobin cells have a high potassium concentration whereas hemoglobin sickle cells do not contain as much potassium. In order for a malaria parasite to survive it needs cells with a high potassium concentration. Thus they do not survive in sickle cells.

Pre-Lab Questions 1. Cytochrome c is a highly conserved protein, found in plants, animals, and many unicellular organisms. Do a little research to determine why it is so highly conserved. In your explanation, be sure to include the function of cytochrome C. In order to receive any credit, you must also cite the source where you found this information. (Note: Use appropriate scientific resources, not websites like Wikipedia or ask.com) Source: 2. Draw a cladogram using the information provided in the table below. TAXA Pine Trees Mosses Flowering Plants Ferns Vascular Tissue 1 0 1 1 Characteristics Flowers 0 0 1 0 Seeds 1 0 1 0 Table 1. Character Table. A zero (0) indicates that a character is absent; a one (1) indicates that a character is present.

Procedure Part A. Gathering cytochrome C sequences from NCBI 1. Log in to your school gmail and create a Google Drive document in order to save the sequences you will be gathering. Save the document as yourlastname.cytochrome C sequences 2. Go to the National Center for Biotechnology Information Website: http://ncbi.nlm.nih.gov 3. In the search box type in cytochrome C and click the search button Make sure that the drop-down menu to the left of the search box is set to All databases. 4. Scroll down until you see the brown heading Proteins and click the blue link for Protein It says protein sequences to its right 5. You will then be directed to the protein database. In the upper left region of the page locate the drop-down menu and make sure it is selected on protein. 6. In the search box type cytochrome C horse. Click the search button. 7. Many choices will appear. Select the record with the label that includes RecName: Full=Cytochrome C before the organism s scientific name For the horse, the accession number is P00004.2 8. There is a plethora of information on this page. Locate the menu in the upper left portion of the page that says display settings. Click the drop-down menu and select FASTA and then click Apply. This will list the amino acid sequence in a simple format. 9. Copy and paste the amino acid sequence into your Google Drive document Be sure to label the species name and accession number used 10. Go back to the NCBI Website and arrow back until you arrive at the protein search page again. Backspace to remove the organism horse and then type the next organism from the list below Note: Cytochrome C should still precede the organism in the search box. 11. Continue copying and pasting the cytochrome C sequences until you have gathered all 13 of the following organisms Horse Chicken Tuna Human Chimpanzee Rabbit Rattlesnake Bullfrog Dog Bee Cow Ateles (monkey) Arabidopsis thaliana (plant) 12. Be sure to label the name and accession number for each organism in your Google Drive document Note: When choosing which record to copy and paste, select the first entry with the label that includes RecName: Full=Cytochrome C before the organism s scientific name 13. Print a copy of your document (and be sure to turn that page in with your completed lab) and also share your document with me (pamos@fortcherry.org) 14. Manually compare the amino acid sequences of the following 4 organisms to that of the human and count the number of differences between the human cytochrome C sequence and the four others and record in the table in the results section Horse Ateles/monkey Cow Chimpanzee

Part B. Comparative Genomics and Bioinformatics 1. Set up a free account at Biology Workbench, which serves to allocate hard disk space for the session you will run. Go to http://workbench.sdsc.edu and click register Complete the requested information 2. Once you have logged into the site, scroll down until you see a series of buttons near the middle of the page and click the one that says Protein Tools. 3. Select the Add new protein sequence option and click run. 4. Two boxes will appear type the name of the sequence in the smaller of the two and paste the sequence (from your Google Drive document created in Part A) into the larger box. For the label, include the name of the gene/protein AND the name of the organism For the sequence, copy only the amino acids, not the identifying information Example: o Label: cytochrome C horse o Sequence:MGDVEKGKKIFVQKCAQCHTVEKGGKHKTGPNLHGLFGR KTGQAPGFTYTDANKNKGITWKEETLMEYLENPKKYIPGTKMIFAGI KKKTEREDLIAYLKKATNE 5. Scroll down to the bottom of the page and click save. 6. The site will default back to the add sequence page but now you should see a check box for the sequence you saved 7. Repeat until all 13 organisms have been added 8. Click the boxes next to all 13 organisms so they are checked 9. Scroll down in the menu box until you see Clustal W-Multiple Sequence Alignment. Click on this option to highlight it and press run. 10. Find the line that says Guided tree display and pull down the menu next to it until it says Rooted and Unrooted Trees and then click submit. 11. Draw both the unrooted and rooted trees in the space provided in your results section 12. Answer the analysis questions based on your results 13. Complete the Extension activity using the gene assigned to you Results Organism Horse Ateles/monkey Cow Chimpanzee Number of Different Amino Acids compared to cytochrome C sequence of Humans

Unrooted Tree Rooted Tree

Analysis Compare the results from your table (where you manually counted the number of differences in the amino acid sequences) with the unrooted and rooted trees that were produced using the BLAST program. 1. Which of the 4 organisms has the most similarities in its cytochrome c amino acid sequence compared to humans? 2. Which organism has the next most similar cytochrome c amino acid sequence compared to humans? 3. How are these similarities reflected in the resulting phylogenetic trees? 4. Which 2 organisms have the most differences in their cytochrome c amino acid sequence compared to humans? 5. How is this reflected in the resulting phylogenetic trees?

Extension: Comparing Other Genes What other genes are conserved among different species and what does this suggest about their evolutionary relationships? You will investigate these questions by researching the function of the protein created from an assigned gene and comparing its amino acid sequence in 6 total organisms. Record your assigned gene in the space provided and research the function of the resulting protein, making sure to cite your source. In addition to humans, select 5 more of the 13 organisms used when analyzing the cytochrome C sequences and record their names below. Repeat the procedure used in Parts A and B for those 6 total organisms o Create a new Google Drive document titled yourlastname.yourassignedgene sequences Save, print, and share the sequences for humans and the 5 other organisms you selected as well as pasting them into the Biology Workbench site to produce phylogenetic trees o Draw the resulting unrooted and rooted phylogenetic trees on the next page Assigned Gene: Function (be sure to cite the source): Other organisms compared (in addition to humans): 1. 2. 3. 4. 5.

Unrooted Tree Rooted Tree