Хемоінформатика. Докінг. Дизайн ліків. Біоінформатика (3 курс) Лекція 4 (частина 1)

Similar documents
Networks & pathways. Hedi Peterson MTAT Bioinformatics

Applying the Semantic Web to Computational Chemistry

Chemoinformatics in Europe: Research and Teaching

Tautomerism in chemical information management systems

Chemical Data Retrieval and Management

Chemical Journal Publishing in an Online World. Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009

FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES

SABIO-RK Integration and Curation of Reaction Kinetics Data Ulrike Wittig

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

CDK-Taverna: an open workflow environment for cheminformatics

IUCLID Substance Data

Introducing a Bioinformatics Similarity Search Solution

Chemistry in Bioinformatics

International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February ISSN

InChI keys as standard global identifiers in chemistry web services. Russ Hillard ACS, Salt Lake City March 2009

Supporting Information. Kekule.js: An Open Source JavaScript Chemoinformatics Toolkit

Introduction to Chemoinformatics and Drug Discovery

Reaxys Managing Complexity

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann

QSAR in Green Chemistry

Open PHACTS Explorer: Compound by Name

Other Related IUPAC RDA Updates

Chemical Ontologies. Chemical Ontologies. ChemAxon UGM May 23, 2012

MACiE: A Database of Enzyme Reaction Mechanisms

Pipeline Pilot Integration

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

Creating Phase and Interface Models

The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services

InChI/InChIKey vs. NCI/CADD Structure Identifiers: A comparison

Catching the Drift Indexing Implicit Knowledge in Chemical Digital Libraries

Chemical File Format Conversion Tools : A n Overview

Organometallics & InChI. August 2017

Bioinformatics. Dept. of Computational Biology & Bioinformatics

86 Part 4 SUMMARY INTRODUCTION

CHEMOINFORMATICS: THEORY, PRACTICE, & PRODUCTS

cheminformatics toolkits: a personal perspective

BIOINFORMATICS LAB AP BIOLOGY

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Web Search of New Linearized Medical Drug Leads

A powerful site for all chemists CHOICE CRC Handbook of Chemistry and Physics

Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction

Chemical Knowledge for the Semantic Web

Canonical Line Notations

SBMLmerge, a System for Combining Biochemical Network Models

CDK & Mass Spectrometry

GeoPostcodes. Grecia

The BRENDA Enzyme Information System. Module B4. Ligand Search Substructure Search

GeoPostcodes. Litauen

CLRG Biocreative V

Chemical Markup, XML, and the World Wide Web. 6. CMLReact, an XML Vocabulary for Chemical Reactions

RInChI. International Chemical Identifier for Chemical Reactions (RInChI) Guenter Grethe, Jonathan Goodman, Chad Allen

GeoPostcodes. Denmark

Introduction to Chemoinformatics

Using Web Technologies for Integrative Drug Discovery

The NCI/CADD Group's InChI Usage and Analysis of Tautomerism for InChI V2

Biological Concepts and Information Technology (Systems Biology)

GeoPostcodes. Luxembourg

Text and multimedia languages and properties

The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemoand Bioinformatics

GeoPostcodes. Trinidad & Tobago

JOHN MAYFIELD EGON WILLIGHAGEN CHEMISTRY DEVELOPMENT KIT V2.0

Representation of molecular structures. Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal

ACD/Labs Software Impurity Resolution Management. Presented by Peter Russell

Dictionary of ligands

CSCE555 Bioinformatics. Protein Function Annotation

Data Mining in the Chemical Industry. Overview of presentation

CHEMISTRY COLLECTION Basic Chemistry Guide

PNmerger: a Cytoscape plugin to merge biological pathways and protein interaction networks

Marvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier

Pipeline Pilot Integration

Describing Geographical Objects

Navigating between patents, papers, abstracts and databases using public sources and tools

1. Introduction. * Equal Contributors

KATE2017 on NET beta version Operating manual

Application of Associative Matrices to Recognize DNA Sequences in Bioinformatics

MODELING HYDRATES AND THE GAS HYDRATE MARKUP

CS612 - Algorithms in Bioinformatics

Computing chemistry on the web. Igor V. Tetko

Dock Ligands from a 2D Molecule Sketch

GIS-based Smart Campus System using 3D Modeling

BRENDA Exercises Quick Search

The BRENDA Enzyme Information System. Computer-based access. Module B5

Structure and Reaction querying in Reaxys

Introduction Molecular Structure Script Console External resources Advanced topics. JMol tutorial. Giovanni Morelli.

BMD645. Integration of Omics

Transcription:

Хемоінформатика. Докінг. Дизайн ліків Біоінформатика (3 курс) Лекція 4 (частина 1)

Формати файлів в хемоінформатиці Chemical information is usually provided as files or streams and many formats have been created, with varying degrees of documentation. file extension (usually 3 letters). This is widely used, but fragile as common suffixes such as ".mol" and ".dat" are used by many systems, including non-chemical ones. self-describing files where the format information is included in the file. Examples are CIF and CML.

Перетворення між форматами файлів OpenBabel and JOELib are freely available open source tools specifically designed for converting between file formats. Their chemical expert systems support a large atom type conversion tables. A number of tools intended for viewing and editing molecular structures are able to read in files in a number of formats and write them out in other formats. The tools JChemPaint (based on the Chemistry Development Kit), XDrawChem (based on OpenBabel), Chime, Jmol, Mol2mol and Discovery Studio fit into this category.

Мови для машинного вводу хімічних формул

Chemical Machine Languages Interestingly, chemistry has defined three simple languages for encoding chemical information. InChI, SMILES, CML Can generate these by hand or automatically InChIs and SMILES can represent molecules as a single string/character array. Useful as keys for databases and for search queries in Google. You can convert between SMILES and InChIs OpenBabel, OELib, JOELib CML is an XML format, and more verbose, but benefits from XML community tools

A CML Example

SMILES: Simplified Molecular Input Line Entry Specification Language for describing the structure of chemical molecules using ASCII strings. http://www.daylight.com/dayhtml/doc/theory/theory.smiles.html

http://www.daylight.com/dayhtml/doc/theory/theory.smarts.html

SMIRKS http://www.daylight.com/dayhtml/doc/theory/theory.smirks.html

http://www.opensmiles.org/

InChI: International Chemical Identifier IUPAC and NIST Standard similar to SMILES Encodes structural information about compounds Based on open an standard and algorithms. http://wwmm.ch.cam.ac.uk/inchifaq/

InChI in Public Chemistry Databases US National Institute of Standards and Technology (NIST) - 150,000 structures NIH/NCBI/PubChem project - >3.2 million structures Thomson ISI - 2+ million structures US National Cancer Institute(NCI) Database - 23+ million structures US Environmental Protection Agency(EPA)-DSSToX Database - 1450 structures Kyoto Encyclopaedia of Genes and Genomes (KEGG) database - 9584 structures University of California at San Francisco ZINC - >3.3 million structures BRENDA enzyme information system (University of Cologne) - 36,000 structures Chemical Entities of Biological Interest (ChEBI) database of the European Bioinformatics Institute - 5000 structures University of California Carcinogenic Potency Project - 1447 structures Compendium of Pesticide Common Names - 1437 (2005-03-03) structures

Journals and Software Using InChI Journals Nature Chemical Biology. Beilstein Journal of Organic Chemistry Software ACD/Labs ACD/ChemSketch. ChemAxon Marvin. SciTegic Pipeline Pilot. CACTVS Chemoinformatics Toolkit by Xemistry, GmbH.

Chemistry Markup Language CML is an XML markup language for encoding chemical information. Developed by Peter Murray Rust, Henry Rzepa and others. Actually dates from the SGML days before XML More verbose than InChI and SMILES But inherits XML schema, namespaces, parsers, XPATH, language binding tools like XML Beans, etc. Not limited to structural information Has OpenBabel support. http://cml.sourceforge.net/, http://cml.sourceforge.net/wiki/index.php/main_page

Ресурси хемоінформатики

http://www.ebi.ac.uk/chebi/advancedsearchforward.do

http://www.ebi.ac.uk/chebi/advancedsearchforward.do

http://pubchem.ncbi.nlm.nih.gov/

http://www.ncbi.nlm.nih.gov/pccompound?tabcmd=limits

http://pubchem.ncbi.nlm.nih.gov/

http://pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?p=heat

http://www.emolecules.com/

http://www.smpdb.ca/

http://ctdbase.org/

http://zinc.docking.org/