This document describes the process by which operons are predicted for genes within the BioHealthBase database.
|
|
- Samson Lawrence
- 5 years ago
- Views:
Transcription
1 1. Purpose This document describes the process by which operons are predicted for genes within the BioHealthBase database. 2. Methods Description An operon is a coexpressed set of genes, transcribed onto a single mrna. Operon finding software (OFS) can identify operons using only the genetic code. This is an important goal because gene coexpression controls intracellular enzyme concentrations, influences the operation of metabolic pathways, impacts drug discovery, and is essential in understanding microarray and other gene regulation data associated both with the target micro-organism and the host it infects. Operons are predicted for the BHB prokaryotes: Francisella and Mycobacterium. Operons in BHB are generated from three sources. Two in-house software methods are used predictively: the SRI Pathway Tools operon finding algorithm ( Ptools-OFS ) and the Washington University-St Louis operon finding software package ( WUStL-OFS ). These computational methods first gather information about genetic open reading frames (ORFs), determine the genes directionality (plus or minus strand), calculates the base pair distance between flanking open reading frames, detects the presence of those genes in the same metabolic pathway (PTools only), and determine the presence of common GeneOntology (GO) terms within the two protein Genbank records (WUStL only). The data is then compiled and used to make a prediction about which genes are present in operon units. A third source of data is the operon prediction set from TBDB.org. These operons are for Mycobacterium tuberculosis H37Rv only. TBDB shares this data, and to make it they use a modification of the methods described in Moreno-Hagelseib et al (2003) and in Salgado (2001). The TBDB operons are predicted without using pathway data, however the method is tuned to the most common intergenic distances (IGD) of Mycobacteria and E. coli (Salgado et al), the -1 or -4bp IGD (overlap) of ATGA or TAATG/TGATG (Figure 1 Right, Salgado Figure 2). Functional annotations of E. coli genes developed by Riley (1996) were also used to tune the method so the appropriate IGD was used as cutoff. For this prediction method, known operons from E.coli and B subtilis were used as truth standards to tune the algorithm. The odds of being in an operon (log-
2 odds value) for every intergenic distance was plotted, using the within-operon (WO) and transcription-unit boundary (TUB) distances of all gene pairs. Taking the log of the fraction of WO over TUB, according to equation 1: The log-likelihood (log-odds, LogL) of a pair of neighboring genes being in the same operon as a function of distance was calculated with the formula: where N op and N nop are pairs of genes in operons and at transcriptional boundaries, where TN op and TN nop are the total number of pairs of genes in operons and at the transcription unit boundaries, respectively. The most accurate intergenic distance value for predicting the occurrence in an operon was demonstrated to be a universal constant among tested prokaryotes (logl threshold=0.05) Figure 2 Right (Salgado 5b). At this peak accuracy, over 90% of the gene pairs are present in operons, where Accuracy is the sum of true positives and true negative, divided by the sum of all predictions true, false, positive, and negative (trues over totals). Thus given the intergenic distance in base pairs, the method reads the log-odds value at that intergenic distance, from the E. coli/b. subtilis training set standard curve, using the boundary cutoff of 0.05 for WO/TUB pairs. Operation of the prediction tools Actual operation of the two prediction tools is dramatically different (below). While the WUStL tool uses precompiled protein table data files (PTT), it also uses different algorithmic weighting parameters and a novel genbank file mining step, so that the operon prediction decisions made by WUStL are sometimes different than PTools-OFS. The WUStL-OFS tool is operated as a shell script in which informant genomes are concatenated, run through formatdb; and then BLASTed against the target genomes. The algorithm next calculates orthologs, base pair distances between ORFs, directionality of ORFs, and clusters the output into pairs and operon groups. The PTools-OFS tool is run graphically and provided with genome data, generates a pathway map for the organism, finds holes in the built pathway using BLAST, compares the pathway maps of target and informant organisms, and predicts operons.
3 3. Input Data Preparation PTools-OFS and WUStL-OFS both require input files to predict operons. These data files are collected for each organism under study, in the format provided at the NCBI FTP site 1. The OFS algorithms use Genbank (*.gbk), fasta amino acid (*.faa), fasta nucleotide (*.fna), and pathway tools (*.ptt) files. a) WUStL-OFS The WUStL-OFS tool is run using only the PTT and FAA files as input. These files need to be generated for the target and informant organisms, and the shell script takes care of all downstream processing in an automated fashion. b) PTools-OFS The preparation for making PTools-OFS input files is more complicated. After the pathologic tools application is started up in its GUI, a new organism database is created as specified. A bacterial taxon number is entered using the NC**** identifier, and the phyla control is set to bacteria. Genbank (gbk) and nucleotide files (fna) are then loaded into the PTools system, and the software application is set to run overnight, thus building a pathway map. After a trial parse, a pathway hole filler is run, name matching is verified, consistency is checked, and an automated pathway genome database (PGDB) is finally built. This PDGB and the four input files above form the collection of data files needed to obtain operon predictions from PTools. The intermediate file produced in this process is as follows: Directon: (MT MT0001 MT0002 MT0003 MT0004 MT0005 MT0006 MT0007 TRNA-ILE-1 TRNA-ALA-1) Known TUs: None Predicted TU 1: (MT4041.2) MT hypothetical protein Predicted TU 2: (MT0001) MT0001 chromosomal DnaA Predicted TU 3: (MT0002 MT0003 MT0004) MT0002 DNA polymerase III, beta subunit MT0003 recf protein MT0004 conserved hypothetical protein (etc) 4. Output Data Post-Processing and Display Output of the operon predictions is a single tab-delimited file. The WUSTL output is a single file specifying the likelihood that juxtaposed, codirected genes are present on the same operon: Operon findings at probability cutoff ,MAV_0003,MAV_0004, ,MAV_0005,MAV_0006, ,MAV_0017,MAV_0019, ftp://ftp.ncbi.nlm.nih.gov/genomes/bacteria/
4 4,MAV_0020,MAV_0021, ,MAV_0021,MAV_0022, ,MAV_0029,MAV_0030,0.902 There is one line for each pair of genes belonging to the same operon. The leftmost number is the number of the operon, where all ORFs led by the same first column number are [physically present in the same operon. The 6 data rows in this example above represent 5 operons. The PTools output operon prediction report contains no scores, and is postparsed using a perl script, opr_parse_codeonly.pl. It looks like this: 1,MSMEG_0001a,MSMEG_0002 1,MSMEG_0002,MSMEG_0003 1,MSMEG_0003,MSMEG_0004 2,MSMEG_0005a,MSMEG_0006 3,MSMEG_0008a,MSMEG_0009 3,MSMEG_0009,MSMEG_0010 4,MSMEG_0011a,MSMEG_0012 Post processing of the TBDB data is a matter of loading the data into a data warehouse testbed, retrieving the insert errors, and then manually curating the data at fault. Most of the data that is not inserted into the warehouse is the result of two locus_tags or names being used for separate ORFS, pseudogenes being called differently (or not at all) between algorithms used at Genbank and TBDB, or redundant BHB table locations where it is not clear how to place the insertable data since two ORFS exist with the same name. In all cases the errors are expunged by submitting queries to the GBrowse BHB interface, to locate the correct gene name, and eliminating it if no ORF is called as a gene. As example from the June 2008 data load: TBDB Operons:~ 2525 Inserted without error: 2508 (99.3%) Load Errors: 17 (0.7%) Corrected in name: 12 (99.8%) No ORF/ pseudogene call (5 ; unloaded) Total: % In the instances where TBDB gene names or locus_tags did not match BHB, the names were compared. Where untranscribed pseudogenes, typos or obsolete names occurred, or where there was a redundant name (e.g. two 'erb1' homologs which were edited in Genbank after creation of the operon prediction, the new name was substituted to be consistent with BHB, and the entire file was reloaded to the BHB testbed without error.
5 5. References Ermolaeva, M.D., White, O., and Salzberg, S. (2001) Prediction of operons in microbial genomes. Nucleic Acid Res. 29, Karp, P.D., Paley, S., and Romero, P. (2002) The Pathway Tools software. Bioinformatics 18 Suppl 1:S Moreno-Hagelseib, G., and Collado-Vides, J. (2003) A powerful nonhomology method for the prediction of operons in prokaryotes. Bioinformatics18, S Riley, M. & Labedan, B. (1996) in: Escherichia coli and Salmonella: Cellular and Molecular Biology, eds. Neidhardt, F. N., Curtiss, R. I., Lin, E. C. C., Ingraham, J. L., Low, K. B., Magasanik, B., Resnikoff, W., Riley, M., Schaechter, M. & Umbarger, E. (Am. Soc. Microbiol., Washington, DC), pp H. Salgado, S. Gama-Castro, A. Martinez-Antonio, E. Diaz-Peredo, F. Sanchez-Solano, M. Peralta-Gil, D. Garcia-Alonso, V. Jimenez- JacinSalgado, H., Moreno-Hagelseib, G., Smith, T., and Collado-Vides, J. (2000) Operons in Escherichia coli: Genomic analyses and predictions. Nucleic Acid Res. 29,72. Westover, B.P., Buhler, J.D., Sonnenburg, J.L., Gordon, J.I. (2005) Operon prediction without a training set. Bioinformatics 21(7): 880. NCBI ftp site: URL ftp://ftp.ncbi.nlm.nih.gov/genomes/bacteria/
The EcoCyc Database. January 25, de Nitrógeno, UNAM,Cuernavaca, A.P. 565-A, Morelos, 62100, Mexico;
The EcoCyc Database Peter D. Karp, Monica Riley, Milton Saier,IanT.Paulsen +, Julio Collado-Vides + Suzanne M. Paley, Alida Pellegrini-Toole,César Bonavides ++, and Socorro Gama-Castro ++ January 25, 2002
More informationComputational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem
University of Groningen Computational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's
More informationThe advent of the genomic era has opened up the doors to the
Operons in Escherichia coli: Genomic analyses and predictions Heladia Salgado*, Gabriel Moreno-Hagelsieb*, Temple F. Smith, and Julio Collado-Vides Centro de Investigacion sobre Fijacion de Nitrogeno,
More informationIdentification and annotation of promoter regions in microbial genome sequences on the basis of DNA stability
Annotation of promoter regions in microbial genomes 851 Identification and annotation of promoter regions in microbial genome sequences on the basis of DNA stability VETRISELVI RANGANNAN and MANJU BANSAL*
More informationMicrobial computational genomics of gene regulation*
Pure Appl. Chem., Vol. 74, No. 6, pp. 899 905, 2002. 2002 IUPAC Microbial computational genomics of gene regulation* Julio Collado-Vides, Gabriel Moreno-Hagelsieb, and Arturo Medrano-Soto Program of Computational
More informationA genomic-scale search for regulatory binding sites in the integration host factor regulon of Escherichia coli K12
The integration host factor regulon of E. coli K12 genome 783 A genomic-scale search for regulatory binding sites in the integration host factor regulon of Escherichia coli K12 M. Trindade dos Santos and
More informationSupplemental Materials
JOURNAL OF MICROBIOLOGY & BIOLOGY EDUCATION, May 2013, p. 107-109 DOI: http://dx.doi.org/10.1128/jmbe.v14i1.496 Supplemental Materials for Engaging Students in a Bioinformatics Activity to Introduce Gene
More informationGenomic Arrangement of Regulons in Bacterial Genomes
l Genomes Han Zhang 1,2., Yanbin Yin 1,3., Victor Olman 1, Ying Xu 1,3,4 * 1 Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics,
More informationComparative genomics: Overview & Tools + MUMmer algorithm
Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first
More informationCISC 636 Computational Biology & Bioinformatics (Fall 2016)
CISC 636 Computational Biology & Bioinformatics (Fall 2016) Predicting Protein-Protein Interactions CISC636, F16, Lec22, Liao 1 Background Proteins do not function as isolated entities. Protein-Protein
More informationINTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA
INTERACTIVE CLUSTERING FOR EXPLORATION OF GENOMIC DATA XIUFENG WAN xw6@cs.msstate.edu Department of Computer Science Box 9637 JOHN A. BOYLE jab@ra.msstate.edu Department of Biochemistry and Molecular Biology
More informationComputational methods for predicting protein-protein interactions
Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational
More informationSUPPLEMENTARY METHODS
SUPPLEMENTARY METHODS M1: ALGORITHM TO RECONSTRUCT TRANSCRIPTIONAL NETWORKS M-2 Figure 1: Procedure to reconstruct transcriptional regulatory networks M-2 M2: PROCEDURE TO IDENTIFY ORTHOLOGOUS PROTEINSM-3
More informationIntegration of Omics Data to Investigate Common Intervals
2011 International Conference on Bioscience, Biochemistry and Bioinformatics IPCBEE vol.5 (2011) (2011) IACSIT Press, Singapore Integration of Omics Data to Investigate Common Intervals Sébastien Angibaud,
More informationBioinformatics. Dept. of Computational Biology & Bioinformatics
Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS
More informationChapter 15 Active Reading Guide Regulation of Gene Expression
Name: AP Biology Mr. Croft Chapter 15 Active Reading Guide Regulation of Gene Expression The overview for Chapter 15 introduces the idea that while all cells of an organism have all genes in the genome,
More informationCOMPARATIVE PATHWAY ANNOTATION WITH PROTEIN-DNA INTERACTION AND OPERON INFORMATION VIA GRAPH TREE DECOMPOSITION
COMPARATIVE PATHWAY ANNOTATION WITH PROTEIN-DNA INTERACTION AND OPERON INFORMATION VIA GRAPH TREE DECOMPOSITION JIZHEN ZHAO, DONGSHENG CHE AND LIMING CAI Department of Computer Science, University of Georgia,
More informationName: SBI 4U. Gene Expression Quiz. Overall Expectation:
Gene Expression Quiz Overall Expectation: - Demonstrate an understanding of concepts related to molecular genetics, and how genetic modification is applied in industry and agriculture Specific Expectation(s):
More informationGenome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.
Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction
More informationMultiple Choice Review- Eukaryotic Gene Expression
Multiple Choice Review- Eukaryotic Gene Expression 1. Which of the following is the Central Dogma of cell biology? a. DNA Nucleic Acid Protein Amino Acid b. Prokaryote Bacteria - Eukaryote c. Atom Molecule
More informationRGP finder: prediction of Genomic Islands
Training courses on MicroScope platform RGP finder: prediction of Genomic Islands Dynamics of bacterial genomes Gene gain Horizontal gene transfer Gene loss Deletion of one or several genes Duplication
More informationConservation of Gene Co-Regulation between Two Prokaryotes: Bacillus subtilis and Escherichia coli
116 Genome Informatics 16(1): 116 124 (2005) Conservation of Gene Co-Regulation between Two Prokaryotes: Bacillus subtilis and Escherichia coli Shujiro Okuda 1 Shuichi Kawashima 2 okuda@kuicr.kyoto-u.ac.jp
More informationEvaluation of the relative contribution of each STRING feature in the overall accuracy operon classification
Evaluation of the relative contribution of each STRING feature in the overall accuracy operon classification B. Taboada *, E. Merino 2, C. Verde 3 blanca.taboada@ccadet.unam.mx Centro de Ciencias Aplicadas
More informationTransitioning BioCyc to a Subscription Model
Transitioning BioCyc to a Subscription Model Peter D. Karp SRI International ecocyc.org biocyc.org metacyc.org BioCyc.org Collection of 9,300 Pathway/Genome Databases Pathway/Genome Database (PGDB) combines
More informationUniversity of Groningen. The relative value of operon predictions Brouwer, Rutger W. W.; Kuipers, Oscar; van Hijum, Sacha A. F. T.
University of Groningen The relative value of operon predictions Brouwer, Rutger W. W.; Kuipers, Oscar; van Hijum, Sacha A. F. T. Published in: Briefings in Bioinformatics DOI: 10.1093/bib/bbn019 IMPORTANT
More information3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.
3.B.1 Gene Regulation Gene regulation results in differential gene expression, leading to cell specialization. We will focus on gene regulation in prokaryotes first. Gene regulation accounts for some of
More informationAN EVIDENCE ONTOLOGY FOR USE IN PATHWAY/GENOME DATABASES
AN EVIDENCE ONTOLOGY FOR USE IN PATHWAY/GENOME DATABASES Appears in Pacific Symposium on Biocomputing 2004 Pages 190-201, World Scientific Publishers, Singapore P.D. KARP, S. PALEY, C.J. KRIEGER SRI International,
More informationchapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis
chapter 5 the mammalian cell entry 1 (mce1) operon of Mycobacterium Ieprae and Mycobacterium tuberculosis chapter 5 Harald G. Wiker, Eric Spierings, Marc A. B. Kolkman, Tom H. M. Ottenhoff, and Morten
More informationIntroduction. Gene expression is the combined process of :
1 To know and explain: Regulation of Bacterial Gene Expression Constitutive ( house keeping) vs. Controllable genes OPERON structure and its role in gene regulation Regulation of Eukaryotic Gene Expression
More informationSequence Alignment Techniques and Their Uses
Sequence Alignment Techniques and Their Uses Sarah Fiorentino Since rapid sequencing technology and whole genomes sequencing, the amount of sequence information has grown exponentially. With all of this
More informationRegulation of Gene Expression
Chapter 18 Regulation of Gene Expression Edited by Shawn Lester PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley
More informationGrundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson
Grundlagen der Bioinformatik, SS 10, D. Huson, April 12, 2010 1 1 Introduction Grundlagen der Bioinformatik Summer semester 2010 Lecturer: Prof. Daniel Huson Office hours: Thursdays 17-18h (Sand 14, C310a)
More informationThiago M Venancio and L Aravind
Minireview Reconstructing prokaryotic transcriptional regulatory networks: lessons from actinobacteria Thiago M Venancio and L Aravind Address: National Center for Biotechnology Information, National Library
More informationTopic 4 - #14 The Lactose Operon
Topic 4 - #14 The Lactose Operon The Lactose Operon The lactose operon is an operon which is responsible for the transport and metabolism of the sugar lactose in E. coli. - Lactose is one of many organic
More informationIntroduction to Bioinformatics
CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics
More information(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.
1. A change that makes a polypeptide defective has been discovered in its amino acid sequence. The normal and defective amino acid sequences are shown below. Researchers are attempting to reproduce the
More informationIntroduction to Bioinformatics Integrated Science, 11/9/05
1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction
More informationBacterial Genetics & Operons
Bacterial Genetics & Operons The Bacterial Genome Because bacteria have simple genomes, they are used most often in molecular genetics studies Most of what we know about bacterial genetics comes from the
More informationBiology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p.
READING: 14.2 Bacterial Genomes p. 481 14.3 Gene Transfer Mechanisms in Bacteria p. 486 Suggested Problems: 1, 7, 13, 14, 15, 20, 22 BACTERIAL GENETICS AND GENOMICS We still consider the E. coli genome
More informationSUPPLEMENTARY INFORMATION
Supplementary information S3 (box) Methods Methods Genome weighting The currently available collection of archaeal and bacterial genomes has a highly biased distribution of isolates across taxa. For example,
More informationChapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes
Chapter 16 Lecture Concepts Of Genetics Tenth Edition Regulation of Gene Expression in Prokaryotes Chapter Contents 16.1 Prokaryotes Regulate Gene Expression in Response to Environmental Conditions 16.2
More informationGenomes and Their Evolution
Chapter 21 Genomes and Their Evolution PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from
More information1. In most cases, genes code for and it is that
Name Chapter 10 Reading Guide From DNA to Protein: Gene Expression Concept 10.1 Genetics Shows That Genes Code for Proteins 1. In most cases, genes code for and it is that determine. 2. Describe what Garrod
More informationBioinformatics Exercises
Bioinformatics Exercises AP Biology Teachers Workshop Susan Cates, Ph.D. Evolution of Species Phylogenetic Trees show the relatedness of organisms Common Ancestor (Root of the tree) 1 Rooted vs. Unrooted
More information2 Genome evolution: gene fusion versus gene fission
2 Genome evolution: gene fusion versus gene fission Berend Snel, Peer Bork and Martijn A. Huynen Trends in Genetics 16 (2000) 9-11 13 Chapter 2 Introduction With the advent of complete genome sequencing,
More informationComputational Cell Biology Lecture 4
Computational Cell Biology Lecture 4 Case Study: Basic Modeling in Gene Expression Yang Cao Department of Computer Science DNA Structure and Base Pair Gene Expression Gene is just a small part of DNA.
More informationControlling Gene Expression
Controlling Gene Expression Control Mechanisms Gene regulation involves turning on or off specific genes as required by the cell Determine when to make more proteins and when to stop making more Housekeeping
More informationL3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus
L3.1: Circuits: Introduction to Transcription Networks Cellular Design Principles Prof. Jenna Rickus In this lecture Cognitive problem of the Cell Introduce transcription networks Key processing network
More informationGENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications
1 GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications 2 DNA Promoter Gene A Gene B Termination Signal Transcription
More informationFrom Gene to Protein
From Gene to Protein Gene Expression Process by which DNA directs the synthesis of a protein 2 stages transcription translation All organisms One gene one protein 1. Transcription of DNA Gene Composed
More informationEssentiality in B. subtilis
Essentiality in B. subtilis 100% 75% Essential genes Non-essential genes Lagging 50% 25% Leading 0% non-highly expressed highly expressed non-highly expressed highly expressed 1 http://www.pasteur.fr/recherche/unites/reg/
More informationCHAPTER : Prokaryotic Genetics
CHAPTER 13.3 13.5: Prokaryotic Genetics 1. Most bacteria are not pathogenic. Identify several important roles they play in the ecosystem and human culture. 2. How do variations arise in bacteria considering
More information08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega
BLAST Multiple Sequence Alignments: Clustal Omega What does basic BLAST do (e.g. what is input sequence and how does BLAST look for matches?) Susan Parrish McDaniel College Multiple Sequence Alignments
More informationComparing whole genomes
BioNumerics Tutorial: Comparing whole genomes 1 Aim The Chromosome Comparison window in BioNumerics has been designed for large-scale comparison of sequences of unlimited length. In this tutorial you will
More informationName Period The Control of Gene Expression in Prokaryotes Notes
Bacterial DNA contains genes that encode for many different proteins (enzymes) so that many processes have the ability to occur -not all processes are carried out at any one time -what allows expression
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationOrganization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p
Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p.110-114 Arrangement of information in DNA----- requirements for RNA Common arrangement of protein-coding genes in prokaryotes=
More informationGCD3033:Cell Biology. Transcription
Transcription Transcription: DNA to RNA A) production of complementary strand of DNA B) RNA types C) transcription start/stop signals D) Initiation of eukaryotic gene expression E) transcription factors
More informationComputational approaches for functional genomics
Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding
More informationProteomics Systems Biology
Dr. Sanjeeva Srivastava IIT Bombay Proteomics Systems Biology IIT Bombay 2 1 DNA Genomics RNA Transcriptomics Global Cellular Protein Proteomics Global Cellular Metabolite Metabolomics Global Cellular
More informationBioinformatics Study On Mirror DNA In Mycobacterium Tuberculosis Using Fast Parallel Complement Blast Strategy.
INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 4, ISSUE 2, DECEMBER 205 ISSN 2277-866 Bioinformatics Study On Mirror DNA In Mycobacterium Tuberculosis Using Fast Parallel Complement Blast
More informationPathway Bioinformatics: Inference, Visualization, and Analysis. Peter D. Karp, Ph.D.
Pathway Bioinformatics: Inference, Visualization, and Analysis Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International pkarp@ai.sri.com BioCyc.org EcoCyc.org, MetaCyc.org, HumanCyc.org 1 SRI
More informationAlgorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment
Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot
More informationGenome reduction in prokaryotic obligatory intracellular parasites of humans: a comparative analysis
International Journal of Systematic and Evolutionary Microbiology (2004), 54, 1937 1941 DOI 10.1099/ijs.0.63090-0 Genome reduction in prokaryotic obligatory intracellular parasites of humans: a comparative
More informationProkaryotic Gene Expression (Learning Objectives)
Prokaryotic Gene Expression (Learning Objectives) 1. Learn how bacteria respond to changes of metabolites in their environment: short-term and longer-term. 2. Compare and contrast transcriptional control
More informationMathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007
-2 Transcript Alignment Assembly and Automated Gene Structure Improvements Using PASA-2 Mathangi Thiagarajan mathangi@jcvi.org Rice Genome Annotation Workshop May 23rd, 2007 About PASA PASA is an open
More informationCSCE555 Bioinformatics. Protein Function Annotation
CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The
More informationGENE REGULATION AND PROBLEMS OF DEVELOPMENT
GENE REGULATION AND PROBLEMS OF DEVELOPMENT By Surinder Kaur DIET Ropar Surinder_1998@ yahoo.in Mob No 9988530775 GENE REGULATION Gene is a segment of DNA that codes for a unit of function (polypeptide,
More informationSynteny Portal Documentation
Synteny Portal Documentation Synteny Portal is a web application portal for visualizing, browsing, searching and building synteny blocks. Synteny Portal provides four main web applications: SynCircos,
More informationGenetic Variation: The genetic substrate for natural selection. Horizontal Gene Transfer. General Principles 10/2/17.
Genetic Variation: The genetic substrate for natural selection What about organisms that do not have sexual reproduction? Horizontal Gene Transfer Dr. Carol E. Lee, University of Wisconsin In prokaryotes:
More informationDynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information -
Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes - Supplementary Information - Martin Bartl a, Martin Kötzing a,b, Stefan Schuster c, Pu Li a, Christoph Kaleta b a
More informationthe noisy gene Biology of the Universidad Autónoma de Madrid Jan 2008 Juan F. Poyatos Spanish National Biotechnology Centre (CNB)
Biology of the the noisy gene Universidad Autónoma de Madrid Jan 2008 Juan F. Poyatos Spanish National Biotechnology Centre (CNB) day III: noisy bacteria - Regulation of noise (B. subtilis) - Intrinsic/Extrinsic
More informationExploring molecular networks using MONET ontology
J.P.M. da Silva et al. 182 Exploring molecular networks using MONET ontology João Paulo Müller da Silva, Ney Lemke, José Carlos Mombach, José Guilherme Camargo de Souza, Marialva Sinigaglia and Renata
More informationSequence analysis and comparison
The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species
More informationRegulation of Gene Expression
Chapter 18 Regulation of Gene Expression PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from
More informationTopology. 1 Introduction. 2 Chromosomes Topology & Counts. 3 Genome size. 4 Replichores and gene orientation. 5 Chirochores.
Topology 1 Introduction 2 3 Genome size 4 Replichores and gene orientation 5 Chirochores 6 G+C content 7 Codon usage 27 marc.bailly-bechet@univ-lyon1.fr The big picture Eukaryota Bacteria Many linear chromosomes
More informationMolecular Biology, Genetic Engineering & Biotechnology Operons ???
1 Description of Module Subject Name?? Paper Name Module Name/Title XV- 04: 2 OPERONS OBJECTIVES To understand how gene is expressed and regulated in prokaryotic cell To understand the regulation of Lactose
More informationRevisiting the Central Dogma The role of Small RNA in Bacteria
Graduate Student Seminar Revisiting the Central Dogma The role of Small RNA in Bacteria The Chinese University of Hong Kong Supervisor : Prof. Margaret Ip Faculty of Medicine Student : Helen Ma (PhD student)
More informationVital Statistics Derived from Complete Genome Sequencing (for E. coli MG1655)
We still consider the E. coli genome as a fairly typical bacterial genome, and given the extensive information available about this organism and it's lifestyle, the E. coli genome is a useful point of
More informationHonors Biology Reading Guide Chapter 11
Honors Biology Reading Guide Chapter 11 v Promoter a specific nucleotide sequence in DNA located near the start of a gene that is the binding site for RNA polymerase and the place where transcription begins
More informationNetworks & pathways. Hedi Peterson MTAT Bioinformatics
Networks & pathways Hedi Peterson (peterson@quretec.com) MTAT.03.239 Bioinformatics 03.11.2010 Networks are graphs Nodes Edges Edges Directed, undirected, weighted Nodes Genes Proteins Metabolites Enzymes
More informationBioinformatics Chapter 1. Introduction
Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!
More informationNewly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:
m Eukaryotic mrna processing Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus: Cap structure a modified guanine base is added to the 5 end. Poly-A tail
More informationobjective functions...
objective functions... COFFEE (Notredame et al. 1998) measures column by column similarity between pairwise and multiple sequence alignments assumes that the pairwise alignments are optimal assumes a set
More informationPredicting Protein Functions and Domain Interactions from Protein Interactions
Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput
More informationREVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E
REVIEW SESSION Wednesday, September 15 5:30 PM SHANTZ 242 E Gene Regulation Gene Regulation Gene expression can be turned on, turned off, turned up or turned down! For example, as test time approaches,
More informationBiology 112 Practice Midterm Questions
Biology 112 Practice Midterm Questions 1. Identify which statement is true or false I. Bacterial cell walls prevent osmotic lysis II. All bacterial cell walls contain an LPS layer III. In a Gram stain,
More informationGene Regulation and Expression
THINK ABOUT IT Think of a library filled with how-to books. Would you ever need to use all of those books at the same time? Of course not. Now picture a tiny bacterium that contains more than 4000 genes.
More informationComplete all warm up questions Focus on operon functioning we will be creating operon models on Monday
Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday 1. What is the Central Dogma? 2. How does prokaryotic DNA compare to eukaryotic DNA? 3. How is DNA
More informationBio 119 Bacterial Genomics 6/26/10
BACTERIAL GENOMICS Reading in BOM-12: Sec. 11.1 Genetic Map of the E. coli Chromosome p. 279 Sec. 13.2 Prokaryotic Genomes: Sizes and ORF Contents p. 344 Sec. 13.3 Prokaryotic Genomes: Bioinformatic Analysis
More informationTranslation and Operons
Translation and Operons You Should Be Able To 1. Describe the three stages translation. including the movement of trna molecules through the ribosome. 2. Compare and contrast the roles of three different
More informationAS A SERVICE TO THE RESEARCH COMMUNITY, GENOME BIOLOGY PROVIDES A 'PREPRINT' DEPOSITORY
http://genomebiology.com/2002/3/12/preprint/0011.1 This information has not been peer-reviewed. Responsibility for the findings rests solely with the author(s). Deposited research article MRD: a microsatellite
More informationChapter 17. From Gene to Protein. Biology Kevin Dees
Chapter 17 From Gene to Protein DNA The information molecule Sequences of bases is a code DNA organized in to chromosomes Chromosomes are organized into genes What do the genes actually say??? Reflecting
More informationComparative Bioinformatics Midterm II Fall 2004
Comparative Bioinformatics Midterm II Fall 2004 Objective Answer, part I: For each of the following, select the single best answer or completion of the phrase. (3 points each) 1. Deinococcus radiodurans
More informationHomology Modeling. Roberto Lins EPFL - summer semester 2005
Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,
More informationDATA ACQUISITION FROM BIO-DATABASES AND BLAST. Natapol Pornputtapong 18 January 2018
DATA ACQUISITION FROM BIO-DATABASES AND BLAST Natapol Pornputtapong 18 January 2018 DATABASE Collections of data To share multi-user interface To prevent data loss To make sure to get the right things
More informationFUNCTION ANNOTATION PRELIMINARY RESULTS
FUNCTION ANNOTATION PRELIMINARY RESULTS FACTION I KAI YUAN KALYANI PATANKAR KIERA BERGER CAMILA MEDRANO HUBERT PAN JUNKE WANG YANXI CHEN AJAY RAMAKRISHNAN MRUNAL DEHANKAR OVERVIEW Introduction Previous
More informationSequences, Structures, and Gene Regulatory Networks
Sequences, Structures, and Gene Regulatory Networks Learning Outcomes After this class, you will Understand gene expression and protein structure in more detail Appreciate why biologists like to align
More informationReading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype
Lecture Series 7 From DNA to Protein: Genotype to Phenotype Reading Assignments Read Chapter 7 From DNA to Protein A. Genes and the Synthesis of Polypeptides Genes are made up of DNA and are expressed
More informationThe Gene The gene; Genes Genes Allele;
Gene, genetic code and regulation of the gene expression, Regulating the Metabolism, The Lac- Operon system,catabolic repression, The Trp Operon system: regulating the biosynthesis of the tryptophan. Mitesh
More information