Genome Browsers And Genome Databases. Andy Conley Computational Genomics 2009



What is a Genome Browser? Genome browsers facilitate genomic analysis by presenting alignment, experimental and annotation data in the context of genomic DNA sequences. (Melissa S. Cline & W. James Kent, 2009)

Current Browsers. A number of genome browser platforms are available: UCSC, Ensembl, NCBI, JGI and GBrowse. Reference: Cline, M. S. and W. J. Kent (2009). "Understanding genome browsing." Nat Biotechnol 27(2): 153-5. (This paper covers mainly the UCSC Genome Browser.)

UCSC Genome Browser. Huge and very functional. Contains many, many genomes. Lots of additional tools, e.g. table browsing.

GBrowse. The Generic Genome Browser. Part of GMOD. Free and open source. Flexible. Widely used (60+ projects).

WormBase. Database of various Caenorhabditis genomes and other nematodes (little worm things). Uses GBrowse as its genome viewer. One of the more mature GMOD installations.

FlyBase. Another (perhaps the most) mature GMOD installation. Currently home to the genomes of 12 Drosophila species.

NeisseriaBase. Our version of GBrowse. Contains our four strains of N. meningitidis from last year. Also includes four previously sequenced strains and two N. gonorrhoeae strains.

What do they all have in common? These browsers look different, host different organisms, and so on, but they all do one important thing: they combine different sources of data. They also present the data in an easy-to-access format.

Functionality of a Browser. What does the browser do in terms of our data? It allows navigation of the genome, shows features, shows annotations and shows comparisons.

Tracks. What are tracks? Tracks are data! Each track is one data set drawn against the genomic coordinates.

More Tracks. Genes: generally the most important track. You also have ESTs and mRNAs. You have expression tracks.
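To make "tracks are data" concrete, here is a minimal sketch of how feature lines in GFF3 (the tab-separated annotation format that GBrowse can load) could be grouped into tracks by feature type, and how a browser-style region query could then pick out what to draw. The names `Feature`, `parse_gff3` and `features_in_region`, and the example records, are hypothetical and not part of GBrowse or any other browser.

```python
from collections import defaultdict
from typing import NamedTuple


class Feature(NamedTuple):
    seqid: str
    ftype: str
    start: int  # 1-based, inclusive (GFF3 convention)
    end: int    # 1-based, inclusive
    strand: str
    attributes: dict


def parse_gff3(text):
    """Group GFF3 feature lines into tracks keyed by feature type."""
    tracks = defaultdict(list)
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines, comments and directives
        cols = line.split("\t")
        if len(cols) != 9:
            continue  # not a well-formed feature line
        seqid, _source, ftype, start, end, _score, strand, _phase, attrs = cols
        attributes = dict(
            pair.split("=", 1) for pair in attrs.split(";") if "=" in pair
        )
        tracks[ftype].append(
            Feature(seqid, ftype, int(start), int(end), strand, attributes)
        )
    return dict(tracks)


def features_in_region(track, seqid, start, end):
    """Return the features of one track that overlap seqid:start-end."""
    return [
        f for f in track
        if f.seqid == seqid and f.start <= end and f.end >= start
    ]


# Hypothetical example data, not taken from any real genome.
EXAMPLE_GFF3 = "\n".join([
    "##gff-version 3",
    "contig1\texample\tgene\t100\t900\t.\t+\t.\tID=gene1;Name=abcA",
    "contig1\texample\tgene\t1200\t2000\t.\t-\t.\tID=gene2;Name=abcB",
    "contig1\texample\ttRNA\t950\t1025\t.\t+\t.\tID=trna1",
])

tracks = parse_gff3(EXAMPLE_GFF3)
visible = features_in_region(tracks["gene"], "contig1", 800, 1300)
print([f.attributes["ID"] for f in visible])
```

Each key of `tracks` ("gene", "tRNA") corresponds to one row of glyphs in a browser view; the region query is the data-side equivalent of zooming to a window.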

WormBase, FlyBase and ToxoDB. (Figures: FlyBase, WormBase, ToxoDB.)

Looking Around in NeisseriaBase

Functional Annotation. Looking at genes is great, but we want more information about them. When you click on a feature (gene, tRNA, etc.), you want to learn more about it. This is the core of the browser functionality.

UCSC. The UCSC Genome Browser integrates a lot of information for a given gene. (Figure: GABRA3 on the UCSC Browser.)

FlyBase & WormBase. FlyBase shows even more information than the UCSC browser. (Figures: Dmel\cnn on FlyBase; K04D7 on WormBase.)

NeisseriaBase. We have integrated a decent amount of functional and comparative information into NeisseriaBase. (Figure: D-Amino Acid Dehydrogenase.)

Comparative Genomics. The other major source of data in a browser. The whole reason to sequence different closely related species is to compare them.

Conservation. What parts of the genomes are conserved across species? What parts of the genomes are syntenic across species? What genes are present or absent across species?
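The presence/absence question can be sketched with plain set operations once every gene has been assigned to an ortholog group. The sketch below assumes that assignment has already been done by some other tool; the strain names and group IDs are hypothetical, and `presence_absence` and `core_genes` are illustrative names rather than functions from any browser.

```python
def presence_absence(gene_sets):
    """Build a gene -> {strain: present?} table from per-strain gene sets."""
    all_genes = sorted(set().union(*gene_sets.values()))
    return {
        gene: {strain: gene in genes for strain, genes in gene_sets.items()}
        for gene in all_genes
    }


def core_genes(gene_sets):
    """Return the ortholog groups found in every strain."""
    if not gene_sets:
        return set()
    return set.intersection(*(set(genes) for genes in gene_sets.values()))


# Hypothetical ortholog-group IDs per strain.
strains = {
    "strainA": {"og1", "og2", "og3"},
    "strainB": {"og1", "og3"},
    "strainC": {"og1", "og2", "og4"},
}

table = presence_absence(strains)
core = core_genes(strains)
print(core)
print(table["og2"])
```

A browser would typically surface such a table per gene, for example as a list of the strains in which an ortholog was found.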

Comparative Genomics in the UCSC Browser. Conservation at the base level. Aligned regions. Conserved coding sequences.
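As a rough illustration of what a base-level conservation track encodes, the sketch below scores each column of a multiple alignment by the fraction of sequences that share the most common base. This is a deliberately simple identity score chosen for illustration, not the model-based scoring a production browser uses; the function name and the toy alignment are hypothetical.

```python
from collections import Counter


def column_conservation(aligned):
    """Score each alignment column by the share of its most common base.

    `aligned` is a non-empty list of equal-length strings; "-" marks a gap.
    Gaps count against conservation because the score divides by the
    number of sequences, not the number of ungapped bases.
    """
    if len(set(map(len, aligned))) != 1:
        raise ValueError("need a non-empty list of equal-length sequences")
    scores = []
    for column in zip(*aligned):
        bases = [base for base in column if base != "-"]
        if not bases:
            scores.append(0.0)  # all-gap column
            continue
        top_count = Counter(bases).most_common(1)[0][1]
        scores.append(top_count / len(column))
    return scores


# Hypothetical toy alignment of three sequences.
alignment = ["ACGT-A", "ACGTTA", "ACCTTA"]
scores = column_conservation(alignment)
print([round(s, 2) for s in scores])
```

Plotting `scores` along the reference coordinates gives the familiar histogram-style conservation track.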

WormBase. WormBase provides alignments of C. briggsae to the C. elegans genome, and vice versa. It also provides alignments of ESTs from other species to the genomes. dev.wormbase.org had more.

FlyBase. Provides alignments of other organisms to the Drosophila species. Also has a list of orthologs for each gene.

Synteny. The ordering of features in the genome. Genomes, bacterial ones in particular, are fluid: the ordering can change over time. Synteny comparison can tell you about the evolution of the organism's genome.
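A synteny comparison can be sketched as a comparison of gene order. The example below takes two genomes as ordered lists of shared ortholog IDs and reports runs of genes that are adjacent in both, either in the same order ("+") or reversed ("-", as after an inversion). It assumes each ID occurs at most once per genome and skips genes missing from the second genome; `collinear_blocks` and the gene names are hypothetical, and real synteny tools work from alignments rather than bare ID lists.

```python
def collinear_blocks(order_a, order_b, min_len=2):
    """Find runs of genes that are consecutive in both gene orders.

    Returns a list of (genes, orientation) tuples, where orientation is
    "+" if genome B keeps the order of genome A and "-" if it is reversed.
    """
    pos_b = {gene: i for i, gene in enumerate(order_b)}
    shared = [gene for gene in order_a if gene in pos_b]
    blocks = []
    current, step = [], 0

    def close_block():
        # Keep the run only if it is long enough to count as a block.
        if len(current) >= min_len:
            blocks.append((list(current), "-" if step == -1 else "+"))

    for gene in shared:
        if current:
            delta = pos_b[gene] - pos_b[current[-1]]
            # Extend the run while genome B moves one gene at a time
            # in a consistent direction.
            if delta in (1, -1) and step in (0, delta):
                current.append(gene)
                step = delta
                continue
            close_block()
        current, step = [gene], 0
    close_block()
    return blocks


# Hypothetical gene orders: g4..g6 are inverted in genome B.
genome_a = ["g1", "g2", "g3", "g4", "g5", "g6", "g7"]
genome_b = ["g1", "g2", "g3", "g6", "g5", "g4", "g7"]

blocks = collinear_blocks(genome_a, genome_b)
print(blocks)
```

Each block is the kind of region a synteny viewer would draw as one connected ribbon between the two genomes.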

GBrowse_syn. A synteny viewer based on GBrowse. Allows the display of syntenic regions and the features in them. WormBase's GBrowse_syn is down and FlyBase hasn't implemented it yet. The Pseudomonas Genome Database does have GBrowse_syn implemented.

WormBase

Is a Browser Useful?

Volumes of Data. We deal with huge volumes of data. We want things to be high throughput. We want to analyze. A genome browser does none of this.

So why make one? Not everybody cares about high throughput. A lot of wet-lab biologists have never heard of Perl. They may have only a few genes of interest that they study experimentally. A genome browser provides a way for them to easily access information. (Figure: Thalassiosira pseudonana.)

We Still Need Browsers. It always helps to see your data. Browsers help you make sure your data are correct. Plus, they make pretty pictures.

In Summary. The purpose of a browser is to: visualize data, provide annotations for the data, show comparative genomics, and be usable by non-programmers.