Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.
|
|
- Lawrence Porter
- 5 years ago
- Views:
Transcription
1 Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction Annotation Genome Annotation Frank Oliver Glöckner 2 1
2 Genome annotation: From Sequence to Biology! Definition: Annotation of a DNA sequence is the assignment of biologically relevant features to certain regions of the sequence. Genome Annotation Frank Oliver Glöckner 3 Genome Annotation: Functional Assignment Translate the predicted coding region into the amino acid sequence Analyze the amino acid sequence 1. Step: Search in sequence databases for similar sequences! BLAST, FASTA Genome Annotation Frank Oliver Glöckner 4 2
3 Dogma of Similarity and Function Similarity Homology Function detect infer assign Challenge: Detection of Similarity! Genome Annotation Frank Oliver Glöckner 5 Similarity search - sequence databases Nucleotide databases GenBank, EMBL, DDBJ, [INSDC] Protein databases - curated and non-curated SWISSPROT, PIR, TrEMBL, [UniProt] NR (Non-Redundant) EST databases, e.g. NCBI Expressed sequence tags, for eukaryotic systems (valuable information to confirm the transcription and splicing of predicted ORFs) Genome Annotation Frank Oliver Glöckner 6 3
4 Non-redundant protein database GenBank/EMBL/DDBJ (translations) PIR SWISS-PROT nr : non-redundant protein database Complementary (redundant) to UniProt/UniRef Genome Annotation Frank Oliver Glöckner 7 Generation of functional evidences List of ORFs Patterns Profiles Fold Location BLAST PROSITE BLOCKS PFAM COG SCOP TMHMM SignalP Automatic annotation manually refined annotation Genome Annotation Frank Oliver Glöckner 8 4
5 Tools COG: Clusters of orthologous genes KEGG: Metabolic pathways GO: Gene Ontology SCOP: Characterize secondary structure elements and compare to database of protein folds Detect characteristics that indicate a cellular location of the protein SignalP for signal peptide prediction protein acts not in the cytosol can be located in the periplasm or outside the cell TMHMM for detection of transmembrane regions integral membrane protein or membrane associated protein Genome Annotation Frank Oliver Glöckner 9 Clusters of Orthologous Groups - COGs Definitions Orthologous Genes Direct common ancestor Found in different species assumed to have same functions Paralogous Genes Originates from a gene duplication event Found in the same species, can have different functions COGs Each COG consists of individual orthologous genes or orthologous groups of paralogs found in at least three Genomes Genome Annotation Frank Oliver Glöckner 10 5
6 COGs - Construction COGs Members of a COG have been evolved from a common ancestral gene by speciation [E.coli] Pairwise comparison of all proteins of all available genomes Determination of best hits BeTs for every protein Minimal COG: Triangle of symmetrical BeTs [Saccharomyces] [Synechocystis] Tatusov, 1997, Science, p Genome Annotation Frank Oliver Glöckner 11 COGs - Construction Example: Isoleucine trna-synthetase [Yeast cyto] [E.coli] [H. influenzae] YPL040c BeTs to all bacterial and the archaeal genes [Yeast mito] [M. genitalium] Symmetrical BeTs to all bacterial genes [M. jannaschii] [Synechocystis] [M. pneumoniae] YPL076c Symmetrical BeT to the M. jannaschii gene Tatusov, 1997, Science, p Genome Annotation Frank Oliver Glöckner 12 6
7 COGs - Status Version 1, Organisms Escherichia coli Haemophilus influenzae Saccharomyces cerevisiae Synechocystis sp. Mycoplasma genitalium Mycoplasma pneumoniae 17,967 Proteins 720 COGs 15 functional categories Genome Annotation Frank Oliver Glöckner 13 COGs Status Version 2, December Genomes Genome Annotation Frank Oliver Glöckner 14 7
8 COGs Status, unic. Genomes & 7 euk. Genomes Version 3, 2003 Tatusov, 2003, BMC Bioinformatics, 4(1):41 Genome Annotation Frank Oliver Glöckner 15 COGs Classification Version 2, 18 functional categories Genome Annotation Frank Oliver Glöckner 16 8
9 COG Classification Version 3, 25 functional categories Genome Annotation Frank Oliver Glöckner 17 COGs - Searching COGNITOR Assigns new proteins to existing COGs Classification of Proteins Prediction of function Provides a fast overview on the possible metabolism of an organism COG Website: Genome Annotation Frank Oliver Glöckner 18 9
10 COG is retired Genome Annotation Frank Oliver Glöckner 19 COGs - Results Genome Annotation Frank Oliver Glöckner 20 10
11 COG Results Genome Annotation Frank Oliver Glöckner 21 COG Result Genomic context Genome Annotation Frank Oliver Glöckner 22 11
12 COG: Functional cross genome comparisons Genome Annotation Frank Oliver Glöckner 23 Ontologies Why ontologies? Ontologies provide conceptualizations of domains of knowledge and facilitate both communication between researchers and the use of domain knowledge by computers for multiple purposes Genome Annotation Frank Oliver Glöckner 24 12
13 Genome Annotation Frank Oliver Glöckner 25 The Gene Ontology Consortium Started A joint project of three eukaryotic model databases FlyBase (Drosophila), Mouse Genome Informatics (MGI) and Saccharomyces Genome Database (SGD) The three major goals are: 1. Develop a set of controlled, structured vocabularies to describe key domains of molecular biology, including gene product attributes and biological sequences 2. To apply GO terms in annotation of sequences, genes or gene products 3. To provide a centralized public resource allowing universal access to ontologies, annotation data sets and software tools Genome Annotation Frank Oliver Glöckner 26 13
14 The GO consortium Genome Annotation Frank Oliver Glöckner 27 GO: Three Ontologies Biological process Refers to a biological objective to which the gene or gene product contributes e.g. Cell growth and maintenance (broad) or translation (more specific) Molecular function Defined as the biochemical activity of a gene product (including specific binding to ligands and structures) e.g. enzyme (broad) or adenylate cyclase (more specific) Cellular component Refers to the place in the cell where a gene product is active e.g. ribosome (multicomponent complex) or nuclear membrane (cellular structure) Genome Annotation Frank Oliver Glöckner 28 14
15 GO Structure Ontology Structure Directed acyclic graphs (DAGs) A child can have more than one parent Relationships is a = instance of the parent part of = component of the parent Every GO term has a: Unique identifier Annotation Oxford dictionary of Molecular Biology Source (literature or computational analysis) Evidence given in the source (e.g. experimental ) Genome Annotation Frank Oliver Glöckner 29 GO Network Genome Annotation Frank Oliver Glöckner 30 15
16 GO Biological Process One node can have more than one parent! The Gene Ontology Consortium 2000, Nature vol. 25, p. 25 Genome Annotation Frank Oliver Glöckner 31 GO Molecular Function The Gene Ontology Consortium 2000, Nature vol. 25, p. 25 Genome Annotation Frank Oliver Glöckner 32 16
17 GO Cellular Component The Gene Ontology Consortium 2000, Nature vol. 25, p. 25 Genome Annotation Frank Oliver Glöckner 33 EBI: The GOA-project See for recent accomplishments in classification efforts: mapping of external catalogues to GO EC numbers to GO (EC2go) SWISS-PROT keywords to GO (spkw2go) InterPro entries to GO (interpro2go) TIGR functional roles to GO (TIGR2go) GenProtEC functional categories to GO (genprotec2go) Genome Annotation Frank Oliver Glöckner 34 17
18 Mappings to GO Genome Annotation Frank Oliver Glöckner 35 Example BLASTP of ORF2114 Name? Functional classification? Genome Annotation Frank Oliver Glöckner 36 18
19 Swiss-Prot Genome Annotation Frank Oliver Glöckner 37 Example BLASTP of ORF2114 against COG COG Aconitase A C: Energy production and conversion TCA Cycle Genome Annotation Frank Oliver Glöckner 38 19
20 KEGG Genome Annotation Frank Oliver Glöckner 39 Send to GOhst Genome Annotation Frank Oliver Glöckner 40 20
21 Annotation of ORF2114 Aconitate hydratase or Citrate hydro-lyase, Aconitase Cis-aconitase Deinococcus radiodurans (strain R1) Gene: acnb or citb E.C COG: Aconitase A, C: Energy production and conversion, TCA Cycle KEGG Pathway Ko00020 Citrate cycle (TCA cycle) Ko00630 Glyoxylate and dicarboxylate metabolism Ko00720 Carbon fixation pathways in prokaryotes GO Recommended name: aconitate hydratase Function: aconitate hydratase GO: Process: tricarboxylic acid cycle GO: part of Genome Annotation Frank Oliver Glöckner 41 is a InterPro Genome Annotation Frank Oliver Glöckner 42 21
22 TIGRFAMs Genome Annotation Frank Oliver Glöckner 43 Rhodopirellula: Annotation Glöckner et al., PNAS, 2003 Genome Annotation Frank Oliver Glöckner 44 22
The Minimal-Gene-Set -Kapil PHY498BIO, HW 3
The Minimal-Gene-Set -Kapil Rajaraman(rajaramn@uiuc.edu) PHY498BIO, HW 3 The number of genes in organisms varies from around 480 (for parasitic bacterium Mycoplasma genitalium) to the order of 100,000
More informationComparative genomics: Overview & Tools + MUMmer algorithm
Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune 411 007. urmila@bioinfo.ernet.in Genome sequence: Fact file 1995: The first
More informationFunctional Annotation
Functional Annotation Outline Introduction Strategy Pipeline Databases Now, what s next? Functional Annotation Adding the layers of analysis and interpretation necessary to extract its biological significance
More informationEBI web resources II: Ensembl and InterPro
EBI web resources II: Ensembl and InterPro Yanbin Yin http://www.ebi.ac.uk/training/online/course/ 1 Homework 3 Go to http://www.ebi.ac.uk/interpro/training.htmland finish the second online training course
More informationEBI web resources II: Ensembl and InterPro. Yanbin Yin Spring 2013
EBI web resources II: Ensembl and InterPro Yanbin Yin Spring 2013 1 Outline Intro to genome annotation Protein family/domain databases InterPro, Pfam, Superfamily etc. Genome browser Ensembl Hands on Practice
More informationHomology. and. Information Gathering and Domain Annotation for Proteins
Homology and Information Gathering and Domain Annotation for Proteins Outline WHAT IS HOMOLOGY? HOW TO GATHER KNOWN PROTEIN INFORMATION? HOW TO ANNOTATE PROTEIN DOMAINS? EXAMPLES AND EXERCISES Homology
More informationWe have: We will: Assembled six genomes Made predictions of most likely gene locations. Add a layers of biological meaning to the sequences
Recap We have: Assembled six genomes Made predictions of most likely gene locations We will: Add a layers of biological meaning to the sequences Start with Biology This will motivate the choices we make
More informationHomology and Information Gathering and Domain Annotation for Proteins
Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The
More informationBioinformatics. Dept. of Computational Biology & Bioinformatics
Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS
More informationBMD645. Integration of Omics
BMD645 Integration of Omics Shu-Jen Chen, Chang Gung University Dec. 11, 2009 1 Traditional Biology vs. Systems Biology Traditional biology : Single genes or proteins Systems biology: Simultaneously study
More information-max_target_seqs: maximum number of targets to report
Review of exercise 1 tblastn -num_threads 2 -db contig -query DH10B.fasta -out blastout.xls -evalue 1e-10 -outfmt "6 qseqid sseqid qstart qend sstart send length nident pident evalue" Other options: -max_target_seqs:
More informationIntroduction to Bioinformatics Integrated Science, 11/9/05
1 Introduction to Bioinformatics Integrated Science, 11/9/05 Morris Levy Biological Sciences Research: Evolutionary Ecology, Plant- Fungal Pathogen Interactions Coordinator: BIOL 495S/CS490B/STAT490B Introduction
More informationGene Ontology and overrepresentation analysis
Gene Ontology and overrepresentation analysis Kjell Petersen J Express Microarray analysis course Oslo December 2009 Presentation adapted from Endre Anderssen and Vidar Beisvåg NMC Trondheim Overview How
More informationCS612 - Algorithms in Bioinformatics
Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available
More informationCSCE555 Bioinformatics. Protein Function Annotation
CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The
More information2 Genome evolution: gene fusion versus gene fission
2 Genome evolution: gene fusion versus gene fission Berend Snel, Peer Bork and Martijn A. Huynen Trends in Genetics 16 (2000) 9-11 13 Chapter 2 Introduction With the advent of complete genome sequencing,
More informationGENE ONTOLOGY (GO) Wilver Martínez Martínez Giovanny Silva Rincón
GENE ONTOLOGY (GO) Wilver Martínez Martínez Giovanny Silva Rincón What is GO? The Gene Ontology (GO) project is a collaborative effort to address the need for consistent descriptions of gene products in
More informationIn-Silico Approach for Hypothetical Protein Function Prediction
In-Silico Approach for Hypothetical Protein Function Prediction Shabanam Khatoon Department of Computer Science, Faculty of Natural Sciences Jamia Millia Islamia, New Delhi Suraiya Jabin Department of
More informationExample of Function Prediction
Find similar genes Example of Function Prediction Suggesting functions of newly identified genes It was known that mutations of NF1 are associated with inherited disease neurofibromatosis 1; but little
More informationLecture 2. The Blast2GO annotation framework
Lecture 2 The Blast2GO annotation framework Annotation steps Modulation of annotation intensity Export/Import Functions Sequence Selection Additional Tools Functional assignment Annotation Transference
More information2 GENE FUNCTIONAL SIMILARITY. 2.1 Semantic values of GO terms
Bioinformatics Advance Access published March 7, 2007 The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
More informationGene function annotation
Gene function annotation Paul D. Thomas, Ph.D. University of Southern California What is function annotation? The formal answer to the question: what does this gene do? The association between: a description
More informationHeuristic Methods. Heuristic methods for alignment Sequence databases Multiple alignment Gene and protein prediction
Heuristic methods for alignment Sequence databases Multiple alignment Gene and protein prediction Armstrong, 2010 Heuristic Methods! FASTA! BLAST! Gapped BLAST! PSI-BLAST Armstrong, 2010 1 Assumptions
More informationSupplementary Information 16
Supplementary Information 16 Cellular Component % of Genes 50 45 40 35 30 25 20 15 10 5 0 human mouse extracellular other membranes plasma membrane cytosol cytoskeleton mitochondrion ER/Golgi translational
More informationIntegration of functional genomics data
Integration of functional genomics data Laboratoire Bordelais de Recherche en Informatique (UMR) Centre de Bioinformatique de Bordeaux (Plateforme) Rennes Oct. 2006 1 Observations and motivations Genomics
More informationGCD3033:Cell Biology. Transcription
Transcription Transcription: DNA to RNA A) production of complementary strand of DNA B) RNA types C) transcription start/stop signals D) Initiation of eukaryotic gene expression E) transcription factors
More informationBio2. Heuristics, Databases ; Multiple Sequence Alignment ; Gene Finding. Biological Databases (sequences) Armstrong, 2007 Bioinformatics 2
Bio2 Heuristics, Databases ; Multiple Sequence Alignment ; Gene Finding Biological Databases (sequences) 1 Biological Databases Introduction to Sequence Databases Overview of primary query tools and the
More informationMeiothermus ruber Genome Analysis Project
Augustana College Augustana Digital Commons Meiothermus ruber Genome Analysis Project Biology 2018 Predicted ortholog pairs between E. coli and M. ruber are b3456 and mrub_2379, b3457 and mrub_2378, b3456
More informationComputational approaches for functional genomics
Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding
More information# shared OGs (spa, spb) Size of the smallest genome. dist (spa, spb) = 1. Neighbor joining. OG1 OG2 OG3 OG4 sp sp sp
Bioinformatics and Evolutionary Genomics: Genome Evolution in terms of Gene Content 3/10/2014 1 Gene Content Evolution What about HGT / genome sizes? Genome trees based on gene content: shared genes Haemophilus
More informationProtein function prediction based on sequence analysis
Performing sequence searches Post-Blast analysis, Using profiles and pattern-matching Protein function prediction based on sequence analysis Slides from a lecture on MOL204 - Applied Bioinformatics 18-Oct-2005
More informationComputational methods for predicting protein-protein interactions
Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational
More informationV19 Metabolic Networks - Overview
V19 Metabolic Networks - Overview There exist different levels of computational methods for describing metabolic networks: - stoichiometry/kinetics of classical biochemical pathways (glycolysis, TCA cycle,...
More informationV14 extreme pathways
V14 extreme pathways A torch is directed at an open door and shines into a dark room... What area is lighted? Instead of marking all lighted points individually, it would be sufficient to characterize
More informationSequences, Structures, and Gene Regulatory Networks
Sequences, Structures, and Gene Regulatory Networks Learning Outcomes After this class, you will Understand gene expression and protein structure in more detail Appreciate why biologists like to align
More informationUpdate on human genome completion and annotations: Protein information resource
UPDATE ON GENOME COMPLETION AND ANNOTATIONS Update on human genome completion and annotations: Protein information resource Cathy Wu 1 and Daniel W. Nebert 2 * 1 Director of PIR, Department of Biochemistry
More informationProcedure to Create NCBI KOGS
Procedure to Create NCBI KOGS full details in: Tatusov et al (2003) BMC Bioinformatics 4:41. 1. Detect and mask typical repetitive domains Reason: masking prevents spurious lumping of non-orthologs based
More informationBLAST. Varieties of BLAST
BLAST Basic Local Alignment Search Tool (1990) Altschul, Gish, Miller, Myers, & Lipman Uses short-cuts or heuristics to improve search speed Like speed-reading, does not examine every nucleotide of database
More informationChapter 5. Proteomics and the analysis of protein sequence Ⅱ
Proteomics Chapter 5. Proteomics and the analysis of protein sequence Ⅱ 1 Pairwise similarity searching (1) Figure 5.5: manual alignment One of the amino acids in the top sequence has no equivalent and
More informationRelated Courses He who asks is a fool for five minutes, but he who does not ask remains a fool forever.
CSE 527 Computational Biology http://www.cs.washington.edu/527 Lecture 1: Overview & Bio Review Autumn 2004 Larry Ruzzo Related Courses He who asks is a fool for five minutes, but he who does not ask remains
More informationHands-On Nine The PAX6 Gene and Protein
Hands-On Nine The PAX6 Gene and Protein Main Purpose of Hands-On Activity: Using bioinformatics tools to examine the sequences, homology, and disease relevance of the Pax6: a master gene of eye formation.
More informationBioinformatics Chapter 1. Introduction
Bioinformatics Chapter 1. Introduction Outline! Biological Data in Digital Symbol Sequences! Genomes Diversity, Size, and Structure! Proteins and Proteomes! On the Information Content of Biological Sequences!
More informationMiGA: The Microbial Genome Atlas
December 12 th 2017 MiGA: The Microbial Genome Atlas Jim Cole Center for Microbial Ecology Dept. of Plant, Soil & Microbial Sciences Michigan State University East Lansing, Michigan U.S.A. Where I m From
More informationComparative RNA-seq analysis of transcriptome dynamics during petal development in Rosa chinensis
Title Comparative RNA-seq analysis of transcriptome dynamics during petal development in Rosa chinensis Author list Yu Han 1, Huihua Wan 1, Tangren Cheng 1, Jia Wang 1, Weiru Yang 1, Huitang Pan 1* & Qixiang
More informationA Protein Ontology from Large-scale Textmining?
A Protein Ontology from Large-scale Textmining? Protege-Workshop Manchester, 07-07-2003 Kai Kumpf, Juliane Fluck and Martin Hofmann Instructive mistakes: a narrative Aim: Protein ontology that supports
More informationRiboflavin Metabolism: A study to see if Mrub_1256 is Orthologous to E. coli b0415, and if Mrub_1254 is Orthologous to E.
Augustana College Augustana Digital Commons Meiothermus ruber Genome Analysis Project Biology Winter 2-2016 Riboflavin Metabolism: A study to see if Mrub_1256 is Orthologous to E. coli b0415, and if Mrub_1254
More informationBioinformatics in the post-sequence era
Bioinformatics in the post-sequence era review Minoru Kanehisa 1 & Peer Bork 2 doi:10.1038/ng1109 In the past decade, bioinformatics has become an integral part of research and development in the biomedical
More informationFitness constraints on horizontal gene transfer
Fitness constraints on horizontal gene transfer Dan I Andersson University of Uppsala, Department of Medical Biochemistry and Microbiology, Uppsala, Sweden GMM 3, 30 Aug--2 Sep, Oslo, Norway Acknowledgements:
More informationProteins: Structure & Function. Ulf Leser
Proteins: Structure & Function Ulf Leser This Lecture Proteins Structure Function Databases Predicting Protein Secondary Structure Many figures from Zvelebil, M. and Baum, J. O. (2008). "Understanding
More informationSupplemental Materials
JOURNAL OF MICROBIOLOGY & BIOLOGY EDUCATION, May 2013, p. 107-109 DOI: http://dx.doi.org/10.1128/jmbe.v14i1.496 Supplemental Materials for Engaging Students in a Bioinformatics Activity to Introduce Gene
More informationGene Ontology. Shifra Ben-Dor. Weizmann Institute of Science
Gene Ontology Shifra Ben-Dor Weizmann Institute of Science Outline of Session What is GO (Gene Ontology)? What tools do we use to work with it? Combination of GO with other analyses What is Ontology? 1700s
More informationSequence Alignment Techniques and Their Uses
Sequence Alignment Techniques and Their Uses Sarah Fiorentino Since rapid sequencing technology and whole genomes sequencing, the amount of sequence information has grown exponentially. With all of this
More informationMotifs, Profiles and Domains. Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC
Motifs, Profiles and Domains Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC Comparing Two Proteins Sequence Alignment Determining the pattern of evolution and identifying conserved
More informationRELATIONSHIPS BETWEEN GENES/PROTEINS HOMOLOGUES
Molecular Biology-2018 1 Definitions: RELATIONSHIPS BETWEEN GENES/PROTEINS HOMOLOGUES Heterologues: Genes or proteins that possess different sequences and activities. Homologues: Genes or proteins that
More informationComputational Structural Bioinformatics
Computational Structural Bioinformatics ECS129 Instructor: Patrice Koehl http://koehllab.genomecenter.ucdavis.edu/teaching/ecs129 koehl@cs.ucdavis.edu Learning curve Math / CS Biology/ Chemistry Pre-requisite
More informationLarge-Scale Genomic Surveys
Bioinformatics Subtopics Fold Recognition Secondary Structure Prediction Docking & Drug Design Protein Geometry Protein Flexibility Homology Modeling Sequence Alignment Structure Classification Gene Prediction
More informationProteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?
Proteomics What is it? Reveal protein interactions Protein profiling in a sample Yeast two hybrid screening High throughput 2D PAGE Automatic analysis of 2D Page Yeast two hybrid Use two mating strains
More informationSyllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)
Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Course Name: Structural Bioinformatics Course Description: Instructor: This course introduces fundamental concepts and methods for structural
More informationReading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype
Lecture Series 7 From DNA to Protein: Genotype to Phenotype Reading Assignments Read Chapter 7 From DNA to Protein A. Genes and the Synthesis of Polypeptides Genes are made up of DNA and are expressed
More informationBME 5742 Biosystems Modeling and Control
BME 5742 Biosystems Modeling and Control Lecture 24 Unregulated Gene Expression Model Dr. Zvi Roth (FAU) 1 The genetic material inside a cell, encoded in its DNA, governs the response of a cell to various
More informationG4120: Introduction to Computational Biology
ICB Fall 2009 G4120: Introduction to Computational Biology Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology & Immunology Copyright 2008 Oliver Jovanovic, All Rights Reserved. Genome
More informationUpdate on genome completion and annotations: Protein Information Resource
UPDATE ON GENOME COMPLETION AND ANNOTATIONS Update on genome completion and annotations: Protein Information Resource Cathy Wu 1 and Daniel W. Nebert 2 * 1 Director of PIR, Department of Biochemistry and
More informationProtein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.
Protein Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet rickard.sandberg@ki.se sandberg.cmb.ki.se Outline Protein features motifs patterns profiles signals 2 Protein
More informationBioinformatics 2. Yeast two hybrid. Proteomics. Proteomics
GENOME Bioinformatics 2 Proteomics protein-gene PROTEOME protein-protein METABOLISM Slide from http://www.nd.edu/~networks/ Citrate Cycle Bio-chemical reactions What is it? Proteomics Reveal protein Protein
More informationGenomics and bioinformatics summary. Finding genes -- computer searches
Genomics and bioinformatics summary 1. Gene finding: computer searches, cdnas, ESTs, 2. Microarrays 3. Use BLAST to find homologous sequences 4. Multiple sequence alignments (MSAs) 5. Trees quantify sequence
More informationFrancisco M. Couto Mário J. Silva Pedro Coutinho
Francisco M. Couto Mário J. Silva Pedro Coutinho DI FCUL TR 03 29 Departamento de Informática Faculdade de Ciências da Universidade de Lisboa Campo Grande, 1749 016 Lisboa Portugal Technical reports are
More informationGenome Annotation Project Presentation
Halogeometricum borinquense Genome Annotation Project Presentation Loci Hbor_05620 & Hbor_05470 Presented by: Mohammad Reza Najaf Tomaraei Hbor_05620 Basic Information DNA Coordinates: 527,512 528,261
More informationBioinformatics Exercises
Bioinformatics Exercises AP Biology Teachers Workshop Susan Cates, Ph.D. Evolution of Species Phylogenetic Trees show the relatedness of organisms Common Ancestor (Root of the tree) 1 Rooted vs. Unrooted
More informationComparative Bioinformatics Midterm II Fall 2004
Comparative Bioinformatics Midterm II Fall 2004 Objective Answer, part I: For each of the following, select the single best answer or completion of the phrase. (3 points each) 1. Deinococcus radiodurans
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationRGP finder: prediction of Genomic Islands
Training courses on MicroScope platform RGP finder: prediction of Genomic Islands Dynamics of bacterial genomes Gene gain Horizontal gene transfer Gene loss Deletion of one or several genes Duplication
More informationBio 1B Lecture Outline (please print and bring along) Fall, 2007
Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution
More informationBayesian Hierarchical Classification. Seminar on Predicting Structured Data Jukka Kohonen
Bayesian Hierarchical Classification Seminar on Predicting Structured Data Jukka Kohonen 17.4.2008 Overview Intro: The task of hierarchical gene annotation Approach I: SVM/Bayes hybrid Barutcuoglu et al:
More informationWhat is the central dogma of biology?
Bellringer What is the central dogma of biology? A. RNA DNA Protein B. DNA Protein Gene C. DNA Gene RNA D. DNA RNA Protein Review of DNA processes Replication (7.1) Transcription(7.2) Translation(7.3)
More informationOrganization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p
Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p.110-114 Arrangement of information in DNA----- requirements for RNA Common arrangement of protein-coding genes in prokaryotes=
More informationHomology Modeling. Roberto Lins EPFL - summer semester 2005
Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,
More informationBiology 105/Summer Bacterial Genetics 8/12/ Bacterial Genomes p Gene Transfer Mechanisms in Bacteria p.
READING: 14.2 Bacterial Genomes p. 481 14.3 Gene Transfer Mechanisms in Bacteria p. 486 Suggested Problems: 1, 7, 13, 14, 15, 20, 22 BACTERIAL GENETICS AND GENOMICS We still consider the E. coli genome
More informationPhylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction. Lesser Tenrec (Echinops telfairi)
Phylogenetics - Orthology, phylogenetic experimental design and phylogeny reconstruction Lesser Tenrec (Echinops telfairi) Goals: 1. Use phylogenetic experimental design theory to select optimal taxa to
More informationobjective functions...
objective functions... COFFEE (Notredame et al. 1998) measures column by column similarity between pairwise and multiple sequence alignments assumes that the pairwise alignments are optimal assumes a set
More informationGiri Narasimhan. CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Evaluation. Course Homepage.
CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 389; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs06.html 1/12/06 CAP5510/CGS5166 1 Evaluation
More informationIntro Secondary structure Transmembrane proteins Function End. Last time. Domains Hidden Markov Models
Last time Domains Hidden Markov Models Today Secondary structure Transmembrane proteins Structure prediction NAD-specific glutamate dehydrogenase Hard Easy >P24295 DHE2_CLOSY MSKYVDRVIAEVEKKYADEPEFVQTVEEVL
More informationOrthologs Detection and Applications
Orthologs Detection and Applications Marcus Lechner Bioinformatics Leipzig 2009-10-23 Marcus Lechner (Bioinformatics Leipzig) Orthologs Detection and Applications 2009-10-23 1 / 25 Table of contents 1
More informationIntegrative Protein Function Transfer using Factor Graphs and Heterogeneous Data Sources
Integrative Protein Function Transfer using Factor Graphs and Heterogeneous Data Sources Antonina Mitrofanova New York University antonina@cs.nyu.edu Vladimir Pavlovic Rutgers University vladimir@cs.rutgers.edu
More informationComputational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem
University of Groningen Computational methods for the analysis of bacterial gene regulation Brouwer, Rutger Wubbe Willem IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's
More information10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison
10-810: Advanced Algorithms and Models for Computational Biology microrna and Whole Genome Comparison Central Dogma: 90s Transcription factors DNA transcription mrna translation Proteins Central Dogma:
More informationV14 Graph connectivity Metabolic networks
V14 Graph connectivity Metabolic networks In the first half of this lecture section, we use the theory of network flows to give constructive proofs of Menger s theorem. These proofs lead directly to algorithms
More informationToday. Last time. Secondary structure Transmembrane proteins. Domains Hidden Markov Models. Structure prediction. Secondary structure
Last time Today Domains Hidden Markov Models Structure prediction NAD-specific glutamate dehydrogenase Hard Easy >P24295 DHE2_CLOSY MSKYVDRVIAEVEKKYADEPEFVQTVEEVL SSLGPVVDAHPEYEEVALLERMVIPERVIE FRVPWEDDNGKVHVNTGYRVQFNGAIGPYK
More informationIntroduction to Evolutionary Concepts
Introduction to Evolutionary Concepts and VMD/MultiSeq - Part I Zaida (Zan) Luthey-Schulten Dept. Chemistry, Beckman Institute, Biophysics, Institute of Genomics Biology, & Physics NIH Workshop 2009 VMD/MultiSeq
More informationFrom gene to protein. Premedical biology
From gene to protein Premedical biology Central dogma of Biology, Molecular Biology, Genetics transcription replication reverse transcription translation DNA RNA Protein RNA chemically similar to DNA,
More informationPrediction of protein function from sequence analysis
Prediction of protein function from sequence analysis Rita Casadio BIOCOMPUTING GROUP University of Bologna, Italy The omic era Genome Sequencing Projects: Archaea: 74 species In Progress:52 Bacteria:
More informationNetAffx GPCR annotation database summary December 12, 2001
NetAffx GPCR annotation database summary December 12, 2001 Introduction Only approximately 51% of the human proteome can be annotated by the standard motif-based recognition systems [1]. These systems,
More informationNewly made RNA is called primary transcript and is modified in three ways before leaving the nucleus:
m Eukaryotic mrna processing Newly made RNA is called primary transcript and is modified in three ways before leaving the nucleus: Cap structure a modified guanine base is added to the 5 end. Poly-A tail
More information08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega
BLAST Multiple Sequence Alignments: Clustal Omega What does basic BLAST do (e.g. what is input sequence and how does BLAST look for matches?) Susan Parrish McDaniel College Multiple Sequence Alignments
More informationBiological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor
Biological Networks:,, and via Relative Description Length By: Tamir Tuller & Benny Chor Presented by: Noga Grebla Content of the presentation Presenting the goals of the research Reviewing basic terms
More informationCellular Function Prediction for Hypothetical Proteins Using High-Throughput Data
University of Tennessee, Knoxville Trace: Tennessee Research and Creative Exchange Masters Theses Graduate School 8-2003 Cellular Function Prediction for Hypothetical Proteins Using High-Throughput Data
More informationChapter 15 Active Reading Guide Regulation of Gene Expression
Name: AP Biology Mr. Croft Chapter 15 Active Reading Guide Regulation of Gene Expression The overview for Chapter 15 introduces the idea that while all cells of an organism have all genes in the genome,
More informationMetabolic modelling. Metabolic networks, reconstruction and analysis. Esa Pitkänen Computational Methods for Systems Biology 1 December 2009
Metabolic modelling Metabolic networks, reconstruction and analysis Esa Pitkänen Computational Methods for Systems Biology 1 December 2009 Department of Computer Science, University of Helsinki Metabolic
More informationImplications of Structural Genomics Target Selection Strategies: Pfam5000, Whole Genome, and Random Approaches
PROTEINS: Structure, Function, and Bioinformatics 58:166 179 (2005) Implications of Structural Genomics Target Selection Strategies: Pfam5000, Whole Genome, and Random Approaches John-Marc Chandonia 1
More informationOrthology Part I concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona
Orthology Part I concepts and implications Toni Gabaldón Centre for Genomic Regulation (CRG), Barcelona Toni Gabaldón Contact: tgabaldon@crg.es Group website: http://gabaldonlab.crg.es Science blog: http://treevolution.blogspot.com
More informationBio 119 Bacterial Genomics 6/26/10
BACTERIAL GENOMICS Reading in BOM-12: Sec. 11.1 Genetic Map of the E. coli Chromosome p. 279 Sec. 13.2 Prokaryotic Genomes: Sizes and ORF Contents p. 344 Sec. 13.3 Prokaryotic Genomes: Bioinformatic Analysis
More information