Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction

Size: px
Start display at page:

Download "Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction"

Transcription

1 Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction Stefan Kuhn 1, Björn Egert 2, Steffen Neumann 2, Christoph Steinbeck 1European Bioinformatics Institute (EBI), Chemoinformatics and Metabolism Team, Wellcome Trust Genome Campus, Cambridge, CB10 1SD, United Kingdom 2Research Group for Molecular Informatics, Cologne University Bioinformatics Center (CUBIC), Zuelpicher Str. 47, D Cologne, Germany, steinbeck@ebi.ac.uk, Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 1 07:05:37

2 CUBIC Experiment: Fast quenching of metabolism Cell lysis and extraction Derivation Detection via GC/MS Trehalose Lactat Glutamat Signalintensit ä t Ca compounds visible in GC 400 derivatives can be reproducibly quantified 240 compounds identified t [min] Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 2 07:05:37

3 CUBIC Gas chromatography (GC) Mass spectrometry (MS) Procedure: Extraction of bacterial cells with methanol Mass spectrometer Derivatisation Separation of compounds by gas chromatography Analysis by massspectrometry after electron impact ionization Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 3 07:05:37

4 De novo Elucidation of Biomarkers and Metabolites: Computer Assisted Structure Elucidation (CASE) Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 4 07:05:38

5 The Chemistry Development Kit (CDK) Java library for chemoinformatics, Open Source, LGPL (permits commercial use) >50 developers, core team people >50 academic and industrial projects world wide Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 5 07:05:38

6 CDK Functionality Input/Output I/O (CML, MDL Molfile, SDF, PDB) SMILES InChI Modelling 3D Model Builder Atom Typing Force Field Representation of Biomolecular Structures Chemical Graphs Isomorphism detection Maximum Common Substructure Searches SMARTS and Substructure searches Ring searches Aromaticity detection Visualization Structure Diagram Layout (SDG) 2D Rendering 3D Rendering Library Enumeration Deterministic Isomer generator Stochastic Structure Generators via Simulated Annealing Genetic Algorithms Properties Fingerprinting > 70 QSAR Descriptors QSAR model building Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 6 07:05:38

7 Characterizing Biomarkers and Metabolites NMRShiftDB ( Open Access Open Submission Open Source [1] Steinbeck, C.; Kuhn, S.; Krause, S., J. Chem. Inf. Comput. Sci. 2003, 43, [2] Steinbeck, C.; Kuhn, S. Phytochemistry 2004, 65, Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 7 07:05:38

8 2D NMR Data for CASE Steinbeck, C. Computer Assisted Structure Elucidation. In Handbook on Chemoinformatics.; Gasteiger, J. Ed.; Wiley VCH: Weinheim, 2003; Vol. 2; pp Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 8 07:05:39

9 CASE with Simulated Annealing Fitness Evaluation (Scoring) S total = S NMR HMBC + S NMR HHCOSY + S NMR Shift + S Symmetry + O H S MassSpec... + S Features H O Polycarpol (C 30 H 48 O 2 ). Steinbeck, C.; Journal of Chemical Information & Computer Sciences 2001, 41, Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 9 07:05:39

10 How far do we get with 1D NMR? Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 10 07:05:39

11 Deterministic Structure Generators work quite nicely for small molecules even with very simple fitness functions For around 10 heavy atoms, we've been able to find the correct solutions just based on 13 C shift prediction and comparison with measured spectrum. Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 11 07:05:40

12 1D Proton NMR Prediction Methods trained based on CDK descriptors (random order) J48 HOSE codes Support Vector Machines M5' PRISM naïve Bayes Linear Regression K Means Clustering Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 12 07:05:40

13 Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 13 07:05:40

14 Descriptors (416/100%) Physicochemical (242/57,93%) Spatial (105/25,24%) Topological (66/15,86%) Exp. Conditions (3/0.72%) RDF G H,G D [9] Van der Waals [11] RDF G S [9] Distance [11] RDF G H topol[9] Picontact [11] Temperature Frequency Solvent Valence Electrons[11] Heavy Atom Hydrogen BondsToAtom [11] Min Avg Electronegativity [9] Sigma Pi Period [11] Charge [9] 330 descriptors in total Hybrization [11] Sigma Pi Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 14 07:05:40

15 Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 15 07:05:41

16 Random Forest, real vs predicted, protons Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 16 07:05:41

17 Kuhn S., Egert B., Neumann S. and Steinbeck C. (2008) BMC Bioinformatics Sep 25;9(1):400. Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 17 07:05:41

18 Acknowledgement Prof. Peter Murray Rust Stefan Kuhn Steffen Neumann Bjlörn Egert Egon Willighagen (Unilever Center for Molecular Informatics, Cambridge, UK) Dr. William Hull, Dr. Willi von der Lieth (DKFZ, Heidelberg) Dr. Kämpchen (Universität Marburg) All Collaborators at Cologne University Bioinformatics Center (CUBIC), EBI and the CDK team Dr. Heinz Kolshorn (Universität Mainz) DFG, BMBF, DAAD Roche Diagnostics, Penzberg Orion Pharma, Finnland Christoph Steinbeck European Bioinformatics Institute (EBI) Slide 18 07:05:42

CDK & Mass Spectrometry

CDK & Mass Spectrometry CDK & Mass Spectrometry October 3, 2011 1/18 Stephan Beisken October 3, 2011 EBI is an outstation of the European Molecular Biology Laboratory. Chemistry Development Kit (CDK) An Open Source Java TM Library

More information

Chemical Data Retrieval and Management

Chemical Data Retrieval and Management Chemical Data Retrieval and Management ChEMBL, ChEBI, and the Chemistry Development Kit Stephan A. Beisken What is EMBL-EBI? Part of the European Molecular Biology Laboratory International, non-profit

More information

JOHN MAYFIELD EGON WILLIGHAGEN CHEMISTRY DEVELOPMENT KIT V2.0

JOHN MAYFIELD EGON WILLIGHAGEN CHEMISTRY DEVELOPMENT KIT V2.0 JOHN MAYFIELD EGON WILLIGHAGEN CHEMISTRY DEVELOPMENT KIT V2.0 CINF 64: ACS Fall 2017, Washington, D.C. http://cdk.github.io http://efficientbits.blogspot.com INTRODUCTION HISTORY Project started in 2000

More information

Solved and Unsolved Problems in Chemoinformatics

Solved and Unsolved Problems in Chemoinformatics Solved and Unsolved Problems in Chemoinformatics Johann Gasteiger Computer-Chemie-Centrum University of Erlangen-Nürnberg D-91052 Erlangen, Germany Johann.Gasteiger@fau.de Overview objectives of lecture

More information

CDK-Taverna: an open workflow environment for cheminformatics

CDK-Taverna: an open workflow environment for cheminformatics BMC Bioinformatics This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. CDK-Taverna: an open workflow

More information

Machine Learning Concepts in Chemoinformatics

Machine Learning Concepts in Chemoinformatics Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics

More information

An Integrated Approach to in-silico

An Integrated Approach to in-silico An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc. 14600 Catalina St., San Leandro, CA 94577 Goals

More information

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure

More information

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov CADD Group Chemical Biology Laboratory Frederick National Laboratory for Cancer Research National Cancer Institute, National Institutes

More information

Use of CTI Index for Perception of Duplicated Chemical Structures in Large Chemical Databases

Use of CTI Index for Perception of Duplicated Chemical Structures in Large Chemical Databases MATCH Communications in Mathematical and in Computer Chemistry MATCH Commun. Math. Comput. Chem. 71 (2014) 645-656 ISSN 0340-6253 Use of CTI Index for Perception of Duplicated Chemical Structures in Large

More information

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures

More information

The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemoand Bioinformatics

The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemoand Bioinformatics This is an open access article published under a Creative Commons Attribution (CC-BY) License, which permits unrestricted use, distribution and reproduction in any medium, provided the author and source

More information

Large scale classification of chemical reactions from patent data

Large scale classification of chemical reactions from patent data Large scale classification of chemical reactions from patent data Gregory Landrum NIBR Informatics, Basel Novartis Institutes for BioMedical Research 10th International Conference on Chemical Structures/

More information

Orbital Development Kit

Orbital Development Kit Orbital Development Kit Egon Willighagen Bioclipse & Proteochemometric Group (Prof. Wikberg) (Until 2010-09-30) Department of Pharmaceutical Biosciences Uppsala University 2010-08-25 Representation Types

More information

Bioinformatics Workshop - NM-AIST

Bioinformatics Workshop - NM-AIST Bioinformatics Workshop - NM-AIST Day 3 Introduction to Drug/Small Molecule Discovery Thomas Girke July 25, 2012 Bioinformatics Workshop - NM-AIST Slide 1/44 Introduction CMP Structure Formats Similarity

More information

Structural biology and drug design: An overview

Structural biology and drug design: An overview Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved

More information

Data Mining in the Chemical Industry. Overview of presentation

Data Mining in the Chemical Industry. Overview of presentation Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry

More information

Analyzing Small Molecule Data in R

Analyzing Small Molecule Data in R Analyzing Small Molecule Data in R Tyler Backman and Thomas Girke December 12, 2011 Analyzing Small Molecule Data in R Slide 1/49 Introduction CMP Structure Formats Similarity Searching Background Fragment

More information

Chemical Space: Modeling Exploration & Understanding

Chemical Space: Modeling Exploration & Understanding verview Chemical Space: Modeling Exploration & Understanding Rajarshi Guha School of Informatics Indiana University 16 th August, 2006 utline verview 1 verview 2 3 CDK R utline verview 1 verview 2 3 CDK

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics

More information

Introduction to Chemoinformatics

Introduction to Chemoinformatics Introduction to Chemoinformatics www.dq.fct.unl.pt/cadeiras/qc Prof. João Aires-de-Sousa Email: jas@fct.unl.pt Recommended reading Chemoinformatics - A Textbook, Johann Gasteiger and Thomas Engel, Wiley-VCH

More information

Structure-Activity Modeling - QSAR. Uwe Koch

Structure-Activity Modeling - QSAR. Uwe Koch Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity

More information

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Roberto Todeschini Milano Chemometrics and QSAR Research Group - Dept. of

More information

The Schrödinger KNIME extensions

The Schrödinger KNIME extensions The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application

More information

AUTOMATIC GENERATION OF TAUTOMERS

AUTOMATIC GENERATION OF TAUTOMERS ПЛОВДИВСКИ УНИВЕРСИТЕТ ПАИСИЙ ХИЛЕНДАРСКИ БЪЛГАРИЯ НАУЧНИ ТРУДОВЕ, ТОМ 38, КН. 5, 2011 ХИМИЯ UNIVERSITY OF PLOVDIV PAISII HILENDARSKI BULGARIA SCIENTIFIC PAPERS, VOL. 38, BOOK 5, 2011 CHEMISTRY AUTOMATIC

More information

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann Information Extraction from Chemical Images Discovery Knowledge & Informatics April 24 th, 2006 Dr. Available Chemical Information Textbooks Reports Patents Databases Scientific journals and publications

More information

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015 Ignasi Belda, PhD CEO HPC Advisory Council Spain Conference 2015 Business lines Molecular Modeling Services We carry out computational chemistry projects using our selfdeveloped and third party technologies

More information

Introduction to Chemoinformatics

Introduction to Chemoinformatics Introduction to Chemoinformatics Dr. Igor V. Tetko Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) Institute of Bioinformatics & Systems Biology (HMGU) Kyiv, 10 August

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

QSAR in Green Chemistry

QSAR in Green Chemistry QSAR in Green Chemistry Activity Relationship QSAR is the acronym for Quantitative Structure-Activity Relationship Chemistry is based on the premise that similar chemicals will behave similarly The behavior/activity

More information

Machine-learning scoring functions for docking

Machine-learning scoring functions for docking Machine-learning scoring functions for docking Dr Pedro J Ballester MRC Methodology Research Fellow EMBL-EBI, Cambridge, United Kingdom EBI is an Outstation of the European Molecular Biology Laboratory.

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

JCICS Major Research Areas

JCICS Major Research Areas JCICS Major Research Areas Chemical Information Text Searching Structure and Substructure Searching Databases Patents George W.A. Milne C571 Lecture Fall 2002 1 JCICS Major Research Areas Chemical Computation

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

Machine learning for ligand-based virtual screening and chemogenomics!

Machine learning for ligand-based virtual screening and chemogenomics! Machine learning for ligand-based virtual screening and chemogenomics! Jean-Philippe Vert Institut Curie - INSERM U900 - Mines ParisTech In silico discovery of molecular probes and drug-like compounds:

More information

Chemical Databases: Encoding, Storage and Search of Chemical Structures

Chemical Databases: Encoding, Storage and Search of Chemical Structures Chemical Databases: Encoding, Storage and Search of Chemical Structures Dr. Timur I. Madzhidov Kazan Federal University, Department of Organic Chemistry * Ray, L.C. and R.A. Kirsch, Finding Chemical Records

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Natural Products. Innovation with Integrity. High Performance NMR Solutions for Analysis NMR

Natural Products. Innovation with Integrity. High Performance NMR Solutions for Analysis NMR Natural Products High Performance NMR Solutions for Analysis Innovation with Integrity NMR NMR Spectroscopy Continuous advancement in Bruker s NMR technology allows researchers to push the boundaries for

More information

CHEMISTRY Organic Chemistry Laboratory II Spring 2019 Lab #5: NMR Spectroscopy

CHEMISTRY Organic Chemistry Laboratory II Spring 2019 Lab #5: NMR Spectroscopy Team Members: Unknown # CHEMISTRY 244 - Organic Chemistry Laboratory II Spring 2019 Lab #5: NMR Spectroscopy Purpose: You will learn how to predict the NMR data for organic molecules, organize this data

More information

Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule

Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule Frank Hoonakker 1,3, Nicolas Lachiche 2, Alexandre Varnek 3, and Alain Wagner 3,4 1 Chemoinformatics laboratory,

More information

Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, Dr.

Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, Dr. Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, 2006 Dr. Overview Brief introduction Chemical Structure Recognition (chemocr) Manual conversion

More information

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells

More information

Statistical concepts in QSAR.

Statistical concepts in QSAR. Statistical concepts in QSAR. Computational chemistry represents molecular structures as a numerical models and simulates their behavior with the equations of quantum and classical physics. Available programs

More information

SENECA: A Platform-Independent, Distributed, and Parallel System for Computer-Assisted Structure Elucidation in Organic Chemistry

SENECA: A Platform-Independent, Distributed, and Parallel System for Computer-Assisted Structure Elucidation in Organic Chemistry 1500 J. Chem. Inf. Comput. Sci. 2001, 41, 1500-1507 SENECA: A Platform-Independent, Distributed, and Parallel System for Computer-Assisted Structure Elucidation in Organic Chemistry Christoph Steinbeck

More information

ACD/Structure Elucidator

ACD/Structure Elucidator ACD/Structure Elucidator ver a Decade of Development in Computer Assisted Structure Elucidation Visionary Software Advancing Research A Case for CASE Computer-Assisted Structure Elucidation (CASE) is an

More information

PaDEL-Descriptor: An Open Source Software to Calculate Molecular Descriptors and Fingerprints

PaDEL-Descriptor: An Open Source Software to Calculate Molecular Descriptors and Fingerprints Software News and Update PaDEL-Descriptor: An Open Source Software to Calculate Molecular Descriptors and Fingerprints CHUN WEI YAP Department of Pharmacy, Pharmaceutical Data Exploration Laboratory, National

More information

Dictionary of ligands

Dictionary of ligands Dictionary of ligands Some of the web and other resources Small molecules DrugBank: http://www.drugbank.ca/ ZINC: http://zinc.docking.org/index.shtml PRODRUG: http://www.compbio.dundee.ac.uk/web_servers/prodrg_down.html

More information

New developments on the cheminformatics open workflow environment CDK-Taverna

New developments on the cheminformatics open workflow environment CDK-Taverna SOFTWARE Open Access New developments on the cheminformatics open workflow environment CDK-Taverna Andreas Truszkowski 1, Kalai Vanii Jayaseelan 2, Stefan Neumann 3, Egon L Willighagen 4, Achim Zielesny

More information

Similarity Search. Uwe Koch

Similarity Search. Uwe Koch Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance

More information

Development of a Structure Generator to Explore Target Areas on Chemical Space

Development of a Structure Generator to Explore Target Areas on Chemical Space Development of a Structure Generator to Explore Target Areas on Chemical Space Kimito Funatsu Department of Chemical System Engineering, This materials will be published on Molecular Informatics Drug Development

More information

Tautomerism in chemical information management systems

Tautomerism in chemical information management systems Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4

More information

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics Marvin Sketching, viewing and predicting properties with Marvin - features, tips and tricks Gyorgy Pirok Solutions for Cheminformatics The Marvin family The Marvin toolkit provides web-enabled components

More information

Practical QSAR and Library Design: Advanced tools for research teams

Practical QSAR and Library Design: Advanced tools for research teams DS QSAR and Library Design Webinar Practical QSAR and Library Design: Advanced tools for research teams Reservationless-Plus Dial-In Number (US): (866) 519-8942 Reservationless-Plus International Dial-In

More information

Workshop II: Optimization, Search and Graph-Theoretical Algorithms for Chemical Compound Space

Workshop II: Optimization, Search and Graph-Theoretical Algorithms for Chemical Compound Space Markus Meringer Generation of Molecular Graphs and Applications in Chemistry Navigating Chemical Compound Space for Materials and Bio Design Workshop II: Optimization, Search and Graph-Theoretical Algorithms

More information

Bridging the Dimensions:

Bridging the Dimensions: Bridging the Dimensions: Seamless Integration of 3D Structure-based Design and 2D Structure-activity Relationships to Guide Medicinal Chemistry ACS Spring National Meeting. COMP, March 13 th 2016 Marcus

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available

More information

Integrative Metabolic Fate Prediction and Retrosynthetic Analysis the Semantic Way

Integrative Metabolic Fate Prediction and Retrosynthetic Analysis the Semantic Way Integrative Metabolic Fate Prediction and Retrosynthetic Analysis the Semantic Way Leonid L. Chepelev 1 and Michel Dumontier 1,2,3 1 Department of Biology, Carleton University, 1125 Colonel by Drive, Ottawa,

More information

Molecular Modeling of Inorganic Compounds

Molecular Modeling of Inorganic Compounds Peter Comba, Trevor W. Hambley and Bodo Martin Molecular Modeling of Inorganic Compounds Third Completely Revised and Enlarged Edition WILEY- VCH WILEY-VCH Verlag GmbH & Co. KGaA v Contents Preface to

More information

Learning Organic Chemistry

Learning Organic Chemistry Objective 1 Represent organic molecules with chemical formulas, expanded formulas, Lewis structures, skeletal structures. Determine shape (VSEPR), bond polarity, and molecule polarity. Identify functional

More information

Representation of molecular structures. Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal

Representation of molecular structures. Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal Representation of molecular structures Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal A hierarchy of structure representations Name (S)-Tryptophan 2D Structure 3D Structure Molecular

More information

OMG: Open Molecule Generator

OMG: Open Molecule Generator Peironcely et al. Journal of Cheminformatics 2012, 4:21 SOFTWARE OMG: Open Molecule Generator Julio E Peironcely 1,2,3,4, Miguel Rojas-Chertó 2,3, Davide Fichera 4, Theo Reijmers 2,3, Leon Coulier 1,3,

More information

Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility

Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility Dr. Wendy A. Warr http://www.warr.com Warr, W. A. A Short Review of Chemical Reaction Database Systems,

More information

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments BioSolveIT Biology Problems Solved using Information Technology A Combinatorial Approach for andling of Protonation and Tautomer Ambiguities in Docking Experiments Ingo Dramburg BioSolve IT Gmb An der

More information

De Novo molecular design with Deep Reinforcement Learning

De Novo molecular design with Deep Reinforcement Learning De Novo molecular design with Deep Reinforcement Learning @olexandr Olexandr Isayev, Ph.D. University of North Carolina at Chapel Hill olexandr@unc.edu http://olexandrisayev.com About me Ph.D. in Chemistry

More information

Computational Approaches towards Life Detection by Mass Spectrometry

Computational Approaches towards Life Detection by Mass Spectrometry Markus Meringer Computational Approaches towards Life Detection by Mass Spectrometry International Workshop on Life Detection Technology: For Mars, Encheladus and Beyond Earth-Life Science Institute, Tokyo

More information

Interaction Potentials in Molecules and Non-Local Information in Chemical Space

Interaction Potentials in Molecules and Non-Local Information in Chemical Space Interaction Potentials in Molecules and Non-Local Information in Chemical Space Katja Hansen 1, Franziska Biegler 2, O. Anatole von Lilienfeld 3,4, Klaus-Robert Müller 2,5, and Alexandre Tkatchenko 1 1

More information

Fragment-based de novo Design

Fragment-based de novo Design ragment-based de novo Design Gisbert Schneider & Uli echner gisbert.schneider@modlab.de www.modlab.de Beilstein Endowed Chair for Cheminformatics Institute of rganic Chemistry & Chemical Biology Johann

More information

Supporting Information. Kekule.js: An Open Source JavaScript Chemoinformatics Toolkit

Supporting Information. Kekule.js: An Open Source JavaScript Chemoinformatics Toolkit Supporting Information Kekule.js: An Open Source JavaScript Chemoinformatics Toolkit Chen Jiang, *, Xi Jin, Ying Dong and Ming Chen Department of Organic Chemistry, China Pharmaceutical University, Nanjing

More information

CHEMICAL CLASSIFICATION:

CHEMICAL CLASSIFICATION: CHEMICAL CLASSIFICATION: CLASSYFIRE S APPLICATIONS IN ENVIRONMENTAL HEALTH AND SAFETY Yannick Djoumbou Feunang University of Alberta, Canada 08/24/2016 ACS Meeting, Philadelphia (PA) DATA AND MORE DATA

More information

5/31/17. MS processing. Overview. Overview. my spectrum + my biological question. Annotation & Databanks. An organized collection of information

5/31/17. MS processing. Overview. Overview. my spectrum + my biological question. Annotation & Databanks. An organized collection of information MS processing Annotation & banks Franck Giacomoni, Yann Guitton 01/06/2017 v 1.0.0 CC by-nc-nd : http://creativecommons.org/licenses/by-nc-nd/2.0/fr/ Overview An organized collection of information Dilbert

More information

Application Note LCMS-116 What are we eating? MetaboScape Software; Enabling the De-replication and Identification of Unknowns in Food Metabolomics

Application Note LCMS-116 What are we eating? MetaboScape Software; Enabling the De-replication and Identification of Unknowns in Food Metabolomics Application Note LCMS-116 What are we eating? MetaboScape Software; Enabling the De-replication and Identification of Unknowns in Food Metabolomics Introduction Determining the structure of secondary metabolites

More information

Docking. GBCB 5874: Problem Solving in GBCB

Docking. GBCB 5874: Problem Solving in GBCB Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular

More information

Nuclear Magnetic Resonance H-NMR Part 1 Introduction to NMR, Instrumentation, Sample Prep, Chemical Shift. Dr. Sapna Gupta

Nuclear Magnetic Resonance H-NMR Part 1 Introduction to NMR, Instrumentation, Sample Prep, Chemical Shift. Dr. Sapna Gupta Nuclear Magnetic Resonance H-NMR Part 1 Introduction to NMR, Instrumentation, Sample Prep, Chemical Shift Dr. Sapna Gupta Introduction NMR is the most powerful tool available for organic structure determination.

More information

Хемоінформатика. Докінг. Дизайн ліків. Біоінформатика (3 курс) Лекція 4 (частина 1)

Хемоінформатика. Докінг. Дизайн ліків. Біоінформатика (3 курс) Лекція 4 (частина 1) Хемоінформатика. Докінг. Дизайн ліків Біоінформатика (3 курс) Лекція 4 (частина 1) Формати файлів в хемоінформатиці Chemical information is usually provided as files or streams and many formats have been

More information

Chemoinformatics An Introduction for Computer Scientists

Chemoinformatics An Introduction for Computer Scientists 8 Chemoinformatics An Introduction for Computer Scientists NATHAN BROWN Novartis Institutes for BioMedical Research Chemoinformatics is an interface science aimed primarily at discovering novel chemical

More information

Automated and accurate component detection using reference mass spectra

Automated and accurate component detection using reference mass spectra TECHNICAL NOTE 72703 Automated and accurate component detection using reference mass spectra Authors Barbara van Cann 1 and Amit Gujar 2 1 Thermo Fisher Scientific, Breda, NL 2 Thermo Fisher Scientific,

More information

Ligand Scout Tutorials

Ligand Scout Tutorials Ligand Scout Tutorials Step : Creating a pharmacophore from a protein-ligand complex. Type ke6 in the upper right area of the screen and press the button Download *+. The protein will be downloaded and

More information

CADASTER & MC ITN ECO

CADASTER & MC ITN ECO CADASTER & MC ITN ECO Igor V. Tetko Chemoinformatics group, IBIS, Helmholtz Zentrum München & eadmet GmbH (Germany) SPRING workshop, Beijing, July 2-3, 2011" Helmholtz Zentrum München Part of Helmholtz

More information

cheminformatics toolkits: a personal perspective

cheminformatics toolkits: a personal perspective cheminformatics toolkits: a personal perspective Roger Sayle Nextmove software ltd Cambridge uk 1 st RDKit User Group Meeting, London, UK, 4 th October 2012 overview Models of Chemistry Implicit and Explicit

More information

CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS

CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS 159 CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS 6.1 INTRODUCTION The purpose of this study is to gain on insight into structural features related the anticancer, antioxidant

More information

Scoring functions for of protein-ligand docking: New routes towards old goals

Scoring functions for of protein-ligand docking: New routes towards old goals 3nd Strasbourg Summer School on Chemoinformatics Strasbourg, June 25-29, 2012 Scoring functions for of protein-ligand docking: New routes towards old goals Christoph Sotriffer Institute of Pharmacy and

More information

The Chemistry Development Kit (CDK). 3. Atom typing, Rendering, Molecular Formula, and Substructure Searching

The Chemistry Development Kit (CDK). 3. Atom typing, Rendering, Molecular Formula, and Substructure Searching Willighagen et al. RESEARCH The Chemistry Development Kit (CDK). 3. Atom typing, Rendering, Molecular Formula, and Substructure Searching Egon L Willighagen 1*, John W May 2, Jonathan Alvarsson 3, Arvid

More information

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös KIME-based scoring functions in Muse 3.0 KIME User Group Meeting 2013 Fabian Bös Certara Mission: End-to-End Model-Based Drug Development Certara was formed by acquiring and integrating Tripos, Pharsight,

More information

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Plan Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Exercise: Example and exercise with herg potassium channel: Use of

More information

Hydrogen Bonding & Molecular Design Peter

Hydrogen Bonding & Molecular Design Peter Hydrogen Bonding & Molecular Design Peter Kenny(pwk.pub.2008@gmail.com) Hydrogen Bonding in Drug Discovery & Development Interactions between drug and water molecules (Solubility, distribution, permeability,

More information

Characterization of petroleum samples via thermal analysis coupled to APCI FTMS

Characterization of petroleum samples via thermal analysis coupled to APCI FTMS Characterization of petroleum samples via thermal analysis coupled to APCI FTMS Abstract Thermal analysis, by means of a thermo balance (TG), was coupled to an atmospheric pressure chemical ionization

More information

Interpretation of Organic Spectra. Chem 4361/8361

Interpretation of Organic Spectra. Chem 4361/8361 Interpretation of Organic Spectra Chem 4361/8361 Characteristics of Common Spectrometric Methods H-1 C-13 MS IR/RAMAN UV-VIS ORD/CD X- RAY Radiation type RF RF Not relevant IR UV to visible UV to visible

More information

Review 1 of Handbook of Chemoinformatics Algorithms by Faulon, Bender, eds. CRC Press, pages, hardcover

Review 1 of Handbook of Chemoinformatics Algorithms by Faulon, Bender, eds. CRC Press, pages, hardcover Review 1 of Handbook of Chemoinformatics Algorithms by Faulon, Bender, eds. CRC Press, 2010 440 pages, hardcover Review by Aaron Sterling (sterling@iastate.edu) Department of Computer Science, Iowa State

More information

Encoding molecular structures as ranks of models: A new, secure way for sharing chemical data and development of ADME/T models. Igor V.

Encoding molecular structures as ranks of models: A new, secure way for sharing chemical data and development of ADME/T models. Igor V. Encoding molecular structures as ranks of models: A new, secure way for sharing chemical data and development of ADME/T models Igor V. Tetko IBPC, Ukrainian Academy of Sciences, Kyiv, Ukraine and Institute

More information

Exploring the black box: structural and functional interpretation of QSAR models.

Exploring the black box: structural and functional interpretation of QSAR models. EMBL-EBI Industry workshop: In Silico ADMET prediction 4-5 December 2014, Hinxton, UK Exploring the black box: structural and functional interpretation of QSAR models. (Automatic exploration of datasets

More information

Introduction to GC/MS

Introduction to GC/MS Why Mass Spectrometry? Introduction to GC/MS A powerful analytical technique used to: 1.Identify unknown compounds 2. Quantify known materials down to trace levels 3. Elucidate the structure of molecules

More information

Quantitative Structure-Activity Relationship (QSAR) computational-drug-design.html

Quantitative Structure-Activity Relationship (QSAR)  computational-drug-design.html Quantitative Structure-Activity Relationship (QSAR) http://www.biophys.mpg.de/en/theoretical-biophysics/ computational-drug-design.html 07.11.2017 Ahmad Reza Mehdipour 07.11.2017 Course Outline 1. 1.Ligand-

More information

Translating Methods from Pharma to Flavours & Fragrances

Translating Methods from Pharma to Flavours & Fragrances Translating Methods from Pharma to Flavours & Fragrances CINF 27: ACS National Meeting, New Orleans, LA - 18 th March 2018 Peter Hunt, Edmund Champness, Nicholas Foster, Tamsin Mansley & Matthew Segall

More information

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction

More information

NUCLEAR MAGNETIC RESONANCE AND INTRODUCTION TO MASS SPECTROMETRY

NUCLEAR MAGNETIC RESONANCE AND INTRODUCTION TO MASS SPECTROMETRY NUCLEAR MAGNETIC RESONANCE AND INTRODUCTION TO MASS SPECTROMETRY A STUDENT SHOULD BE ABLE TO: 1. Identify and explain the processes involved in proton ( 1 H) and carbon-13 ( 13 C) nuclear magnetic resonance

More information

Plan. Day 2: Exercise on MHC molecules.

Plan. Day 2: Exercise on MHC molecules. Plan Day 1: What is Chemoinformatics and Drug Design? Methods and Algorithms used in Chemoinformatics including SVM. Cross validation and sequence encoding Example and exercise with herg potassium channel:

More information

Examples: 1. Draw the possible resonance structures for the following covalent compounds: a. O3

Examples: 1. Draw the possible resonance structures for the following covalent compounds: a. O3 AP Chemistry Ms. Ye Name Date Block 1. Draw a Lewis structure for each of these molecules. Identify the molecular shape, hybridization, and bond angles. Determine the total number of valence electrons

More information

Tips & Tricks GPC/SEC: Quantify and Get More Than Molar Mass Averages

Tips & Tricks GPC/SEC: Quantify and Get More Than Molar Mass Averages Tips & Tricks GPC/SEC: Quantify and Get More Than Molar Mass Averages Daniela Held, PSS Polymer Standards Service GmbH, Mainz, Germany Gel permeation chromatography/size-exclusion chromatography (GPC/SEC)

More information

AP Chemistry Chapter 7: Bonding

AP Chemistry Chapter 7: Bonding AP Chemistry Chapter 7: Bonding Types of Bonding I. holds everything together! I All bonding occurs because of! Electronegativity difference and bond character A. A difference in electronegativity between

More information