What can sequences tell us?

Similar documents
Introduction to Evolutionary Concepts

Genomes and Their Evolution

A. Incorrect! In the binomial naming convention the Kingdom is not part of the name.

METHODS FOR DETERMINING PHYLOGENY. In Chapter 11, we discovered that classifying organisms into groups was, and still is, a difficult task.

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

8/23/2014. Phylogeny and the Tree of Life

Microbial Taxonomy and the Evolution of Diversity

Unit 5: Taxonomy. KEY CONCEPT Organisms can be classified based on physical similarities.

Chapter 26 Phylogeny and the Tree of Life

Computational Biology: Basics & Interesting Problems

Chapter 18 Active Reading Guide Genomes and Their Evolution

Chapter 19. Microbial Taxonomy

2. What was the Avery-MacLeod-McCarty experiment and why was it significant? 3. What was the Hershey-Chase experiment and why was it significant?

Reconstruction of Human Genome Evolution Predicts an Extensively Changed Neurodevelopmental Gene

SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology

GSBHSRSBRSRRk IZTI/^Q. LlML. I Iv^O IV I I I FROM GENES TO GENOMES ^^^H*" ^^^^J*^ ill! BQPIP. illt. goidbkc. itip31. li4»twlil FIFTH EDITION

Pairwise & Multiple sequence alignments

Bioinformatics Exercises

CHAPTERS 24-25: Evidence for Evolution and Phylogeny

Science Unit Learning Summary

What examples can you think of?

UNIT 5. Protein Synthesis 11/22/16

Sec$on 9. Evolu$onary Rela$onships

Chapters 25 and 26. Searching for Homology. Phylogeny

Multiple Choice Review- Eukaryotic Gene Expression

Cladistics and Bioinformatics Questions 2013

Genomics and Bioinformatics

Macroevolution Part I: Phylogenies

Phylogeny and systematics. Why are these disciplines important in evolutionary biology and how are they related to each other?

Origins of Life. Fundamental Properties of Life. Conditions on Early Earth. Evolution of Cells. The Tree of Life

Microbial Diversity and Assessment (II) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Molecular phylogeny - Using molecular sequences to infer evolutionary relationships. Tore Samuelsson Feb 2016

Sequence Alignment: A General Overview. COMP Fall 2010 Luay Nakhleh, Rice University

Modern cellular organisms. From

Evolution of Protein Structure in the Aminoacyl-tRNA Synthetases

Warm-Up- Review Natural Selection and Reproduction for quiz today!!!! Notes on Evidence of Evolution Work on Vocabulary and Lab

Molecular evolution - Part 1. Pawan Dhar BII

CISC 889 Bioinformatics (Spring 2004) Sequence pairwise alignment (I)

Microbial Taxonomy. Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.

Rapid Learning Center Chemistry :: Biology :: Physics :: Math

Microbiome: 16S rrna Sequencing 3/30/2018

USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES

Early History up to Schedule. Proteins DNA & RNA Schwann and Schleiden Cell Theory Charles Darwin publishes Origin of Species

This is a repository copy of Microbiology: Mind the gaps in cellular evolution.

Genomics and bioinformatics summary. Finding genes -- computer searches

Chapter 7: Covalent Structure of Proteins. Voet & Voet: Pages ,

Chapter 15 Active Reading Guide Regulation of Gene Expression

and just what is science? how about this biology stuff?

Biochemistry 324 Bioinformatics. Pairwise sequence alignment

Exploring Evolution & Bioinformatics

The Complete Set Of Genetic Instructions In An Organism's Chromosomes Is Called The

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Sequencing alignment Ameer Effat M. Elfarash

MACROEVOLUTION Student Packet SUMMARY EVOLUTION IS A CHANGE IN THE GENETIC MAKEUP OF A POPULATION OVER TIME Macroevolution refers to large-scale

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

full file at

Genetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

PHYLOGENY AND SYSTEMATICS

Genetic transcription and regulation

3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT

MiGA: The Microbial Genome Atlas

BINF6201/8201. Molecular phylogenetic methods

Interpreting the Molecular Tree of Life: What Happened in Early Evolution? Norm Pace MCD Biology University of Colorado-Boulder

From soup to cells the origin of life

BIOINFORMATICS LAB AP BIOLOGY

1. In most cases, genes code for and it is that

Horizontal transfer and pathogenicity

Microbes usually have few distinguishing properties that relate them, so a hierarchical taxonomy mainly has not been possible.

Microbial Taxonomy. Slowly evolving molecules (e.g., rrna) used for large-scale structure; "fast- clock" molecules for fine-structure.

Computational approaches for functional genomics

Bioinformatics Chapter 1. Introduction

Chapter 27: Evolutionary Genetics

Research Proposal. Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family.

Lecture Notes: BIOL2007 Molecular Evolution

Molecular evolution. Joe Felsenstein. GENOME 453, Autumn Molecular evolution p.1/49

Sequencing alignment Ameer Effat M. Elfarash

Taxonomy. Content. How to determine & classify a species. Phylogeny and evolution

9/19/2012. Chapter 17 Organizing Life s Diversity. Early Systems of Classification

Sugars, such as glucose or fructose are the basic building blocks of more complex carbohydrates. Which of the following

Chapter 26: Phylogeny and the Tree of Life Phylogenies Show Evolutionary Relationships

Horizontal Gene Transfer and the Emergence of Darwinian Evolution

Curriculum Links. AQA GCE Biology. AS level

Evolution Problem Drill 09: The Tree of Life

Review of molecular biology

HORIZONTAL TRANSFER IN EUKARYOTES KIMBERLEY MC GRAIL FERNÁNDEZ GENOMICS

Biology 211 (2) Week 1 KEY!

Reading for Lecture 13 Release v10

Page 1. Evolutionary Trees. Why build evolutionary tree? Outline

Biology 10 th Grade. Textbook: Biology, Miller and Levine, Pearson (2010) Prerequisite: None

Sequence analysis and comparison

Quantifying sequence similarity

O 3 O 4 O 5. q 3. q 4. Transition

Biology 112 Practice Midterm Questions

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

Biodiversity. The Road to the Six Kingdoms of Life

1

Biology. Slide 1 of 36. End Show. Copyright Pearson Prentice Hall

Outline. Genome Evolution. Genome. Genome Architecture. Constraints on Genome Evolution. New Evolutionary Synthesis 11/1/18

Transcription:

Bioinformatics

What can sequences tell us? AGACCTGAGATAACCGATAC By themselves? Not a heck of a lot...* *Indeed, one of the key results learned from the Human Genome Project is that disease is much more complicated than a simple appeal to genomebased therapeutics as was originally promised However, through comparison and analysis, combined with molecular and structural biology, they can reveal vast amounts of evolutionary information hidden away within them (Francis Crick, the less vocal eugenics advocate of the pair) annotated sequence of human X chromosome

How to compare sequences ith position of sequence 1,2 S tot (σ i,σ i)= N S i (σ i,σ i) i scoring function replacement frequency S ij = log p ij q i q j amino-acid frequency Empirically derived based on real sequences BLOSUM62 scoring matrix PBoC 21.2.2

Phylogenetic analysis hemoglobin sequences phylogenetic tree sequence similarity can be used to trace ancestral lineages PBoC 21.4.1

Tree of life -based on 16S rrna taxonomy -demonstrates most diversity is in the microbes -first proof of archaea as a separate evolutionary domain (only accepted a decade after first published!) -determined by Carl Woese, who was dubbed Microbiology's Scarred Revolutionary Woese, C. R.; G. E. Fox (1977-11-01). "Phylogenetic structure of the prokaryotic domain: The primary kingdoms".pnas 74 (11): 5088 5090.

Neutral mutations and evolution frequency of neutral amino acid (and codon redundant) mutations also agree with expectations presence or absence of retroviral sequences inserted into DNA match phylogenetic tree (similar for transposons) Chimp 2p all great apes have 24 pairs of chromosomes, while humans have 23 pairs genetic analysis shows human chromosome 2 resulted from ancestral fusion of two chromosomes Chimp 2q Human 2

From sequence to structure sequences of actin-like proteins in bacteria (MreB, ParM) and eukaryotes (actin) - almost ZERO similarity...yet the structures look nearly identical! sequence conservation structure conservation (BUT NOT THE CONVERSE) PBoC 21.2

How to compare structures Simple RMSD no longer works when sequence lengths differ QH scores alignment based on residue-residue distances AND gaps (no sequence information!) Sequence-based and structure-based phylogenetic trees are in agreement structure encodes evolutionary information as well!!! Patrick O'Donoghue, Zaida Luthey-Schulten, Evolutionary Profiles Derived from the QR Factorization of Multiple Structural Alignments Gives an Economy of Information, (2005) JMB, 346: 875-894,

Molecular paleontology ancient protein sequences can be reconstructed via phylogenetic analysis (NOT the same as Jurassic Park, but close!) absorbance spectra of dinosaur rhodopsin demonstrates what it could see PBoC 21.4.1 Eric Gaucher at GT structurally characterized 3-4 billion-year-old versions of a antibiotic-resistance protein (www.gauchergroup.biology.gatech.edu/) Beta-lactamase (modern, ancestor 1, ancestor 2) Valeria A. Risso et al. 2013. Hyperstability and Substrate Promiscuity in Laboratory Resurrections of Precambrian β- Lactamases. J. Am. Chem. Soc., 135 (8), pp. 2899 2902

Horizontal gene transfer genes are shared horizontally between species instead of solely vertically common (even dominant?) among bacteria; can lead to, e.g., extremely fast spread of antibiotic resistance genes even eukaryotes may acquire some genes horizontally, including entire cells (mitochondria and chloroplasts) complicates attempts to draw a universal tree of life with a unique common ancestor (but does not erase it completely!)

Human accelerated regions If chimps and humans share > 98% of our DNA, where are the important differences? In the so-called Human Accelerated Regions (HARs) ~200 identified so far, mostly in non-coding regions, NOT genes for proteins For example, HAR1, the most accelerated region, codes for a novel RNA gene expressed during neocortical development codes it s all about regulation! Pollard KS, Salama SR, Lambert N, Lambot MA, Coppens S, Pedersen JS, Katzman S, King B, Onodera C, Siepel A, Kern AD, Dehay C, Igel H, Ares M Jr, Vanderhaeghen P, Haussler D (2006). "An RNA gene expressed during cortical development evolved rapidly in humans". Nature 443: 167 172.

How to sequence DNA? shotgun sequencing multiple copies of genome are broken up into fragments of 2-10k bases PCR-like method can read 0.5-1k bases of each fragment (from both directions) random short sequences examined for overlaps and computationally reassembled into one long sequence Human Genome Project (public) used hierarchical shotgun, where libraries of 100-300k bases were first created and then shotgun-sequenced Celera Genomics project (private) used whole-genome shotgun sequencing

Next (and next)-gen sequencing methods Human Genome Project cost $2.7 billion and took 10 years goal for personalized medicine is (was) $1000 challenge now met, new goal is < $100 One promising technique: nanopore sequencing, e.g., using alpha-hemolysin (left) or MsbA (above) computational modeling/simulation necessary to interpret experiments (group of Alek Aksimentiev, UIUC)

Epigenetics - beyond sequence alone Modifications to DNA other than sequence changes can also influence expression in some cases are even heritable (some definitions include heritability as a requirement)

Epigenetics DNA methylation a key example of epigenetic control same genes, different tail kink hypermethylation involved in some cancers methylation can both increase and decrease stability of DNA strands depending on spacing, frequency Recognition of methylated DNA through methyl- CpG binding domain proteins Xueqing Zou, Wen Ma, Ilia Solov'yov, Christophe Chipot and Klaus Schulten Nucleic Acids Research, 40:2747-2758,2012