Navigating between patents, papers, abstracts and databases using public sources and tools

Size: px
Start display at page:

Download "Navigating between patents, papers, abstracts and databases using public sources and tools"

Transcription

1 Navigating between patents, papers, abstracts and databases using public sources and tools Christopher Southan 1 and Sean Ekins 2 TW2Informatics, Göteborg, Sweden, Collaborative Drug Discovery, North Carolina, USA ACS, April 2013 [1]

2 [2]

3 ACS Abstract Engaging with chemistry in the biosciences requires navigation between journals, patents, abstracts, databases, Google results and connecting across millions of structures specified only in text. The ability to do this in public sources has been revolutionised by several trends a) ChEMBL's capture of SAR from journals c) the deposition of three major automated patent extractions (SureChem, IBM and SCRIPDB) in PubChem for over 15 million structures, d) open tools such as chemicalize.org, OPSIN, and OSCAR that enable the conversion of IUPAC names or images to structures e) the indexing of chemical terms (e.g. InChIKeys) that turn Google searches into a merged global repository of 40 to 50 million structures. Details of these trends, including PubChem intersect statistics, will be presented, along with practical examples from selected tools. New structure sharing trends will also be considered such as patent crowdsourcing, dropbox, blogs, figshare and open lab notebooks. [3]

4 Getting chemistry out of text and linking to data: some is done but we have to dig for the rest [4]

5 Estimates for chemical text tombs Journal chemistry public extraction, ~10 to 20 million entombed? Majority of useful patent chemistry already publically extracted, but, ~5 to 10 million still to go? PubMed abstracts and MeSH chemistry ~ 0.5 million still entombed? Other unique, useful, text-only (i.e. no database cross-references) chemistry on the web ~ 0.1 to 0.5 million entombed? [5]

6 What s out there: publically disinterred structures InChIKey in Google ~ 50 million PubChem = 48 million PubChem ROF Mw (lead-like) = 31 million ChemSpider = 28 million PubChem all docs (papers & patents) = 16 million PubChem patents = 15 million SureChemOpen = 13 million PubChem journal sources (PubMed + ChEMBL) = 1 million ~90% of all structures in databases have their primary origin in text sources [6]

7 Medicinal chemistry patents (tombs with lids off) 18,777,229 patents, 2,208,422 WO s (i.e. ~ 9 per family) WO, C07 or A61= 469,856 WO, C07D or A61K = 235,854 WO, C07D = 72,737 (assignee vs. year plots below) [7]

8 PubMed at 22 mill: ~ 10% with chemistry (guarded tombs) Free full text = 575,513 (24%) [8]

9 Top-5 Med Chem journals (4% lids off tombs) Free full text = 2671 (4.3%) [9]

10 Growth: (escaping the tombs) Patent big bang (SureChem & SCRIPDB in 2012) Literature slow burn (ChEMBL 2009 jump) Paradox - patents:papers 15:1 (both sets of CIDs cumulative) [10]

11 Patents in PubChem: post-bang total vs. unique content PubChem at 47.3 million CIDs, 32% include patents, 20% patent-only [11]

12 Citations: connections between tombs but still need to disinter structures Papers Abstracts Patents PubMed "relatedness" heuristics [12]

13 Databases <> structures < > documents: links, but few reciprocal Papers Abstracts 0.8 mill (ChEMBL) 12K 0.2 mill (mainly MeSH) Patents 15 mill [13]

14 Post-document retrieval: basic questions 1. What is the name:iupac:image:other ratio in the document? 2. Which tools might be appropriate for first-pass extractions? 3. How many and what proportion of strucs can be extracted? 4. Which SAR /in vivo/clinical data is linked to strucs? 5. Which document sections include the key strucs? 6. Which database entries have links (back) to this document? 7. Which strucs have InChIKey matches in Google, & database entries? 8. Which strucs have synthesis data? 9. What other documents specify and/or cite this struc? 10. Which database records for this struc have links to other documents? 11. What realtionship connections can be made using similarity searches? 12. What intersects and differences are discernible within a document set? [14]

15 Triaging document or webpage chemistry Identify the structure specification types, e.g. Semantic names (all sources) Code names (press releases, papers and abstracts) IUPAC names (papers, patents and abstracts) Images (papers, patents, & Google images) SMILES (open lab books) InChi strings (open lab books) SDF files (open lab books, & github) Convert these to a structure (e.g. SDF, SMILES, InChI) then: Search InChIKey in Google Search major databases Search SureChemOpen Compare extracted sets for intersects and diffs Extend exact match connectivity with similarity searching [15]

16 Triage example: antimalarial starting point The MMV code name is linked to an image in press reports but is PubChem and PubMed -ve [16]

17 Images: convert and search Real chemists sketch them in a jiffy; the rest of us can use OSRA: Optical Structure Recognition Application (after editing, CS(=O)(=O)c3ccc(C2=CN=C(N)C(C1=CCC(C(F)(F)F)N=C1)C2)cc3) [17]

18 Making connections: image > strucure > database > documents CID > ChEMBL > PubMed SureChem or chemicalize.org > patent [18]

19 Patent SAR from WO : Collating activities via SureChemOpen CID > [19]

20 Patent SAR results: top-20 from 39 IC50s [20]

21 Results > figshare [21]

22 Structures > MyNCBI biougfudsdbhek5/. [22]

23 SAR Table: ios app from Molecular Materials Informatics SureChemOpen strucs -> manual data collation -> PubChem CIDs -> SDF -> Dropbox -> SAR Table -> edit in data, R-group decompose -> share [23]

24 InChIKey in Google: instant orthogonal joining [24]

25 Chemicalize.org: 413 strucs from WO CID > [25]

26 Using OPSIN and chemcalize.org to fix recalcitrant IUPACs from WO Can quasi-manually extract ~ 10 more split IUPAC examples [26]

27 Clustering document extraction sets: CheS-Mapper WO > chemicalize.org -> 413 cpds download -> CheS-Mapper -> cluster 8 -> export 53 cpds [27]

28 PubChem -> ChEMBL -> PMID -> assay -> strucs CHEMBL (structure) PMID (paper) CHEMBL (assay for 32 strucs from paper) The 32 CIDs all have patent matches [28]

29 Venny: intersects, diffs, de-dupes and merges 1) WO matches in PubCHem 2) CheS-Mapper cluster 8 from WO ) ChEMBL assayed cpds from PMID (handles any regular strings e.g. db IDs, SMILES, IChI or InChIKey) [29]

30 The open toolbox facilitates extraction and collation of 10 to 30 million structures entombed in text [30]

31 Conclusions The ability to extract chemical structures from text and web sources has been transformed by an expansion of the public toolbox The PubChem big-bang increases probability of extraction having database exact or similarity matches Paradoxically, the patent corpus is now completely open while access to journal text is still restricted However, ChEMBL has extracted ~ 0.8 mill. SAR-linked and target mapped structures from ~ 50K papers The submission of ~15 mill. patent structures to PubChem ensures at least representation from the majority of medicinal chemistry patents (many of which spawned the subsequent ChEMBL papers) Those who want to share their structures globally (e.g. OSDD) have an expanding set of options for surfacing their results. [31]

32 You can find CDD Booth 205 PAPER ID: PAPER TITLE: Dispensing processes profoundly impact biological assays and computational and statistical analyses April 8 th 8.35am Room 349 PAPER ID: PAPER TITLE: Enhancing High Throughput Screening For Mycobacterium tuberculosis Drug Discovery Using Bayesian Models April 9 th 1.30pm Room 353 PAPER ID: PAPER TITLE: Navigating between patents, papers, abstracts and databases using public sources and tools April 9 th 3.50pm Room 350 PAPER ID: PAPER TITLE: TB Mobile: Appifying Data on Anti-tuberculosis Molecule Targets April 10 th 8.30am Room 357 PAPER ID: PAPER TITLE: Challenges and recommendations for obtaining chemical structures of industry-provided repurposing candidates April 10 th 10.20am Room 350 PAPER ID: PAPER TITLE: Dual-event machine learning models to accelerate drug discovery April 10 th 3.05 pm Room 350 [32]

Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses

Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc,

More information

Expanding the scope of literature data with document to structure tools PatentInformatics applications at Aptuit

Expanding the scope of literature data with document to structure tools PatentInformatics applications at Aptuit Expanding the scope of literature data with document to structure tools PatentInformatics applications at Aptuit Alfonso Pozzan Computational and Analytical Chemistry Drug Design and Discovery Department

More information

A Journey from Data to Knowledge

A Journey from Data to Knowledge A Journey from Data to Knowledge Ian Bruno Cambridge Crystallographic Data Centre @ijbruno @ccdc_cambridge Experimental Data C 10 H 16 N +,Cl - Radspunk, CC-BY-SA CC-BY-SA Jeff Dahl, CC-BY-SA Experimentally

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

Chemical Ontologies. Chemical Ontologies. ChemAxon UGM May 23, 2012

Chemical Ontologies. Chemical Ontologies. ChemAxon UGM May 23, 2012 Chemical Ontologies ChemAxon UGM May 23, 2012 Chemical Ontologies OntoChem GmbH Heinrich-Damerow-Str. 4 06120 Halle (Saale) Germany Tel. +49 345 4780472 Fax: +49 345 4780471 mail: info(at)ontochem.com

More information

Open PHACTS Explorer: Compound by Name

Open PHACTS Explorer: Compound by Name Open PHACTS Explorer: Compound by Name This document is a tutorial for obtaining compound information in Open PHACTS Explorer (explorer.openphacts.org). Features: One-click access to integrated compound

More information

The Case for Use Cases

The Case for Use Cases The Case for Use Cases The integration of internal and external chemical information is a vital and complex activity for the pharmaceutical industry. David Walsh, Grail Entropix Ltd Costs of Integrating

More information

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE 1 SERVING THE LIFE SCIENCES SPACE ADDRESSING KEY CHALLENGES ACROSS THE R&D VALUE CHAIN Characterize targets & analyze disease

More information

Tutorial. Getting started. Sample to Insight. March 31, 2016

Tutorial. Getting started. Sample to Insight. March 31, 2016 Getting started March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com Getting started

More information

A powerful site for all chemists CHOICE CRC Handbook of Chemistry and Physics

A powerful site for all chemists CHOICE CRC Handbook of Chemistry and Physics Chemical Databases Online A powerful site for all chemists CHOICE CRC Handbook of Chemistry and Physics Combined Chemical Dictionary Dictionary of Natural Products Dictionary of Organic Dictionary of Drugs

More information

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS Cheminformatics Role in Pharmaceutical Industry Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS Agenda The big picture for pharmaceutical industry Current technological/scientific issues Types

More information

Use of data mining and chemoinformatics in the identification and optimization of high-throughput screening hits for NTDs

Use of data mining and chemoinformatics in the identification and optimization of high-throughput screening hits for NTDs Use of data mining and chemoinformatics in the identification and optimization of high-throughput screening hits for NTDs James Mills; Karl Gibson, Gavin Whitlock, Paul Glossop, Jean-Robert Ioset, Leela

More information

Chemical Journal Publishing in an Online World. Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009

Chemical Journal Publishing in an Online World. Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009 Chemical Journal Publishing in an Online World Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009 1 Overview Chemists and chemistry NPG and chemistry Nature Chemistry

More information

Chemical Data Retrieval and Management

Chemical Data Retrieval and Management Chemical Data Retrieval and Management ChEMBL, ChEBI, and the Chemistry Development Kit Stephan A. Beisken What is EMBL-EBI? Part of the European Molecular Biology Laboratory International, non-profit

More information

Data Quality Issues That Can Impact Drug Discovery

Data Quality Issues That Can Impact Drug Discovery Data Quality Issues That Can Impact Drug Discovery Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry,

More information

Reaxys Pipeline Pilot Components Installation and User Guide

Reaxys Pipeline Pilot Components Installation and User Guide 1 1 Reaxys Pipeline Pilot components for Pipeline Pilot 9.5 Reaxys Pipeline Pilot Components Installation and User Guide Version 1.0 2 Introduction The Reaxys and Reaxys Medicinal Chemistry Application

More information

Reaxys The Highlights

Reaxys The Highlights Reaxys The Highlights What is Reaxys? A brand new workflow solution for research chemists and scientists from related disciplines An extensive repository of reaction and substance property data A resource

More information

Imago: open-source toolkit for 2D chemical structure image recognition

Imago: open-source toolkit for 2D chemical structure image recognition Imago: open-source toolkit for 2D chemical structure image recognition Viktor Smolov *, Fedor Zentsev and Mikhail Rybalkin GGA Software Services LLC Abstract Different chemical databases contain molecule

More information

Organometallics & InChI. August 2017

Organometallics & InChI. August 2017 Organometallics & InChI August 2017 The Cambridge Structural Database 900,000+ small-molecule crystal structures Over 60,000 datasets deposited annually Enriched and annotated by experts Structures available

More information

Using Web Technologies for Integrative Drug Discovery

Using Web Technologies for Integrative Drug Discovery Using Web Technologies for Integrative Drug Discovery Qian Zhu 1 Sashikiran Challa 1 Yuying Sun 3 Michael S. Lajiness 2 David J. Wild 1 Ying Ding 3 1 School of Informatics and Computing, Indiana University,

More information

DRUG DISCOVERY TODAY ELN ELN. Chemistry. Biology. Known ligands. DBs. Generate chemistry ideas. Check chemical feasibility In-house.

DRUG DISCOVERY TODAY ELN ELN. Chemistry. Biology. Known ligands. DBs. Generate chemistry ideas. Check chemical feasibility In-house. DRUG DISCOVERY TODAY Known ligands Chemistry ELN DBs Knowledge survey Therapeutic target Generate chemistry ideas Check chemical feasibility In-house Analyze SAR Synthesize or buy Report Test Journals

More information

Experience with data handling in large chemical databases NEAL LANGERMAN

Experience with data handling in large chemical databases NEAL LANGERMAN Experience with data handling in large chemical databases NEAL LANGERMAN Advanced Chemical Safety San Diego, California, USA Provide integrated chemical health and safety information with a strong emphasis

More information

TRAINING REAXYS MEDICINAL CHEMISTRY

TRAINING REAXYS MEDICINAL CHEMISTRY TRAINING REAXYS MEDICINAL CHEMISTRY 1 SITUATION: DRUG DISCOVERY Knowledge survey Therapeutic target Known ligands Generate chemistry ideas Chemistry Check chemical feasibility ELN DBs In-house Analyze

More information

Handling Human Interpreted Analytical Data. Workflows for Pharmaceutical R&D. Presented by Peter Russell

Handling Human Interpreted Analytical Data. Workflows for Pharmaceutical R&D. Presented by Peter Russell Handling Human Interpreted Analytical Data Workflows for Pharmaceutical R&D Presented by Peter Russell 2011 Survey 88% of R&D organizations lack adequate systems to automatically collect data for reporting,

More information

The shortest path to chemistry data and literature

The shortest path to chemistry data and literature R&D SOLUTIONS Reaxys Fact Sheet The shortest path to chemistry data and literature Designed to support the full range of chemistry research, including pharmaceutical development, environmental health &

More information

Reaxys Medicinal Chemistry Fact Sheet

Reaxys Medicinal Chemistry Fact Sheet R&D SOLUTIONS FOR PHARMA & LIFE SCIENCES Reaxys Medicinal Chemistry Fact Sheet Essential data for lead identification and optimization Reaxys Medicinal Chemistry empowers early discovery in drug development

More information

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann Information Extraction from Chemical Images Discovery Knowledge & Informatics April 24 th, 2006 Dr. Available Chemical Information Textbooks Reports Patents Databases Scientific journals and publications

More information

Let s develop together an exchange virtual compounds computa6onal pla7orm

Let s develop together an exchange virtual compounds computa6onal pla7orm Let s develop together an exchange virtual compounds computa6onal pla7orm Francesco Ortuso University Magna Græcia of Catanzaro Italy Some (trivial) data Industrial research (a quasi-failure story) the

More information

LIBRARY DESIGN FOR COLLABORATIVE DRUG DISCOVERY: EXPANDING DRUGGABLE CHEMOGENOMIC SPACE

LIBRARY DESIGN FOR COLLABORATIVE DRUG DISCOVERY: EXPANDING DRUGGABLE CHEMOGENOMIC SPACE 5 th /June/2018@British Embassy in Tokyo LIBRARY DESIGN FOR COLLABORATIVE DRUG DISCOVERY: EXPANDING DRUGGABLE CHEMOGENOMIC SPACE Kazuyoshi Ikeda, Ph.D. Keio University SELF-INTRODUCTION Keio University,

More information

Internet Resource Guide for Undergraduate Organic Chemists

Internet Resource Guide for Undergraduate Organic Chemists Internet Resource Guide for Undergraduate Organic Chemists Cameron Incognito February 8 th, 2013 ENGL 202C Table of Contents Preface Contents and Scope of this Guide.... 3 Audience and Purpose of this

More information

Data Mining in the Chemical Industry. Overview of presentation

Data Mining in the Chemical Industry. Overview of presentation Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry

More information

Tautomerism in chemical information management systems

Tautomerism in chemical information management systems Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4

More information

Introducing a Bioinformatics Similarity Search Solution

Introducing a Bioinformatics Similarity Search Solution Introducing a Bioinformatics Similarity Search Solution 1 Page About the APU 3 The APU as a Driver of Similarity Search 3 Similarity Search in Bioinformatics 3 POC: GSI Joins Forces with the Weizmann Institute

More information

In Silico Investigation of Off-Target Effects

In Silico Investigation of Off-Target Effects PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics

More information

Patent Searching using Bayesian Statistics

Patent Searching using Bayesian Statistics Patent Searching using Bayesian Statistics Willem van Hoorn, Exscientia Ltd Biovia European Forum, London, June 2017 Contents Who are we? Searching molecules in patents What can Pipeline Pilot do for you?

More information

Canonical Line Notations

Canonical Line Notations Canonical Line otations InChI vs SMILES Krisztina Boda verview Compound naming InChI SMILES Molecular equivalency Isomorphism Kekule Tautomers Finding duplicates What s Your ame? 1. Unique numbers CAS

More information

CHEMICAL CLASSIFICATION:

CHEMICAL CLASSIFICATION: CHEMICAL CLASSIFICATION: CLASSYFIRE S APPLICATIONS IN ENVIRONMENTAL HEALTH AND SAFETY Yannick Djoumbou Feunang University of Alberta, Canada 08/24/2016 ACS Meeting, Philadelphia (PA) DATA AND MORE DATA

More information

Catching the Drift Indexing Implicit Knowledge in Chemical Digital Libraries

Catching the Drift Indexing Implicit Knowledge in Chemical Digital Libraries Catching the Drift Indexing Implicit Knowledge in Chemical Digital Libraries Benjamin Köhncke 1, Sascha Tönnies 1, Wolf-Tilo Balke 2 1 L3S Research Center; Hannover, Germany 2 TU Braunschweig, Germany

More information

CLRG Biocreative V

CLRG Biocreative V CLRG ChemTMiner @ Biocreative V Sobha Lalitha Devi., Sindhuja Gopalan., Vijay Sundar Ram R., Malarkodi C.S., Lakshmi S., Pattabhi RK Rao Computational Linguistics Research Group, AU-KBC Research Centre

More information

User Manual for Range Tool for Article 12 (Birds Directive) & Article 17 (Habitats Directive)

User Manual for Range Tool for Article 12 (Birds Directive) & Article 17 (Habitats Directive) EUROPEAN TOPIC CENTRE ON BIOLOGICAL DIVERSITY User Manual for Range Tool for Article 12 (Birds Directive) & Article 17 (Habitats Directive) Prepared by Brian Mac Sharry (MNHN) Version 1.1 Date: 30 th of

More information

Searching Substances in Reaxys

Searching Substances in Reaxys Searching Substances in Reaxys Learning Objectives Understand that substances in Reaxys have different sources (e.g., Reaxys, PubChem) and can be found in Document, Reaction and Substance Records Recognize

More information

STRUCTURAL BIOINFORMATICS I. Fall 2015

STRUCTURAL BIOINFORMATICS I. Fall 2015 STRUCTURAL BIOINFORMATICS I Fall 2015 Info Course Number - Classification: Biology 5411 Class Schedule: Monday 5:30-7:50 PM, SERC Room 456 (4 th floor) Instructors: Vincenzo Carnevale - SERC, Room 704C;

More information

Building a mobile reaction lab notebook

Building a mobile reaction lab notebook Building a mobile reaction lab notebook Alex M. Clark, Ph.D. March 2014 2014 Molecular Materials Informatics, Inc.! !2 Electronic Lab Notebooks Many shapes & sizes: big, small, hosted, desktop, mobile

More information

Introduction to Google Drive Objectives:

Introduction to Google Drive Objectives: Introduction to Google Drive Objectives: Learn how to access your Google Drive account Learn to create new documents using Google Drive Upload files to store on Google Drive Share files and folders with

More information

My Career in the Pharmaceutical Industry

My Career in the Pharmaceutical Industry SCI Career Options Seminar My Career in the Pharmaceutical Industry Simon Yates, AstraZeneca University of Sheffield 27 th November 2013 1998-2002 - Degree MChem. Undergraduate at University of York Final

More information

SABIO-RK Integration and Curation of Reaction Kinetics Data Ulrike Wittig

SABIO-RK Integration and Curation of Reaction Kinetics Data  Ulrike Wittig SABIO-RK Integration and Curation of Reaction Kinetics Data http://sabio.villa-bosch.de/sabiork Ulrike Wittig Overview Introduction /Motivation Database content /User interface Data integration Curation

More information

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure

More information

Introduction. OntoChem

Introduction. OntoChem Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Bio-Medical Text Mining with Machine Learning

Bio-Medical Text Mining with Machine Learning Sumit Madan Department of Bioinformatics - Fraunhofer SCAI Textual Knowledge PubMed Journals, Books Patents EHRs What is Bio-Medical Text Mining? Phosphorylation of glycogen synthase kinase 3 beta at Threonine,

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Large scale classification of chemical reactions from patent data

Large scale classification of chemical reactions from patent data Large scale classification of chemical reactions from patent data Gregory Landrum NIBR Informatics, Basel Novartis Institutes for BioMedical Research 10th International Conference on Chemical Structures/

More information

ople are co-authors if there edge to each of them. derived from the five largest PH, HEP TH, HEP PH, place a time-stamp on each h paper, and for each perission to the arxiv. The the period from April 1992

More information

Dock Ligands from a 2D Molecule Sketch

Dock Ligands from a 2D Molecule Sketch Dock Ligands from a 2D Molecule Sketch March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com

More information

Web Search of New Linearized Medical Drug Leads

Web Search of New Linearized Medical Drug Leads Web Search of New Linearized Medical Drug Leads Preprint Software Engineering Department The Jerusalem College of Engineering POB 3566, Jerusalem, 91035, Israel iaakov@jce.ac.il Categories and subject

More information

Chemistry For Engineering Students Brown Holme

Chemistry For Engineering Students Brown Holme We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with chemistry for engineering

More information

CHEM 333 Spring 2016 Organic Chemistry I California State University Northridge

CHEM 333 Spring 2016 Organic Chemistry I California State University Northridge CHEM 333 Spring 2016 Organic Chemistry I California State University Northridge Lecture: Instructor: Thomas Minehan Office: Science 2314 Office hours: MW 12:00-1:00 pm E.mail: thomas.minehan@csun.edu Class

More information

Christopher A. Lipinski Adjunct Senior Research Fellow Pfizer Global R&D, Groton Labs

Christopher A. Lipinski Adjunct Senior Research Fellow Pfizer Global R&D, Groton Labs Issues in Compound Storage in DMSO Christopher A. Lipinski Adjunct Senior Research Fellow Pfizer Global R&D, Groton Labs christopher_a_lipinski@groton.pfizer.com 1 Sample storage in DMSO - overview Centralized

More information

ImprovIng ADmE ScrEEnIng productivity In Drug DIScovEry

ImprovIng ADmE ScrEEnIng productivity In Drug DIScovEry Improving ADME Screening Productivity in Drug Discovery INTRODUCTION Improving the quality of the decisions made at the drug discovery stage can profoundly impact the efficiency of the drug discovery and

More information

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK Molecular Modelling Computational Chemistry Demystified Peter Bladon Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK John E. Gorton Gorton Systems, Glasgow, UK Robert B. Hammond Institute

More information

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations Capturing Chemistry What you see is what you get In the world of mechanism and chemical transformations Dr. Stephan Schürer ead of Intl. Sci. Content Libraria, Inc. sschurer@libraria.com Distribution of

More information

ChemBioNet: Chemical Biology supported by Networks of Chemists and Biologists. Affinity Proteomics Meeting Alpbach. Michael Lisurek 15.3.

ChemBioNet: Chemical Biology supported by Networks of Chemists and Biologists. Affinity Proteomics Meeting Alpbach. Michael Lisurek 15.3. ChemBioet: Chemical Biology supported by etworks of Chemists and Biologists Affinity Proteomics Meeting Alpbach Michael Lisurek 15.3.2007 - Introduction - Equipment Screening Unit - Screening Compound

More information

CHE 200 INFORMATION RESOURCES LIBRARY PRESENTATION

CHE 200 INFORMATION RESOURCES LIBRARY PRESENTATION CHE 200 INFORMATION RESOURCES LIBRARY PRESENTATION November 5, 2014 INTRODUCTION & AGENDA DAVE ZWICKY CHEMICAL INFORMATION SPECIALIST Chemistry & Chemical Engineering Librarian Background BS (CHE), Wisconsin

More information

Medicinal Chemistry 1

Medicinal Chemistry 1 Medicinal Chemistry 1 1-Basic Information Title: Medicinal Chemistry 1 Code: PHC - 415 Level: Third year (1 st Semester) Department: Medicinal Chemistry Unit: Lecture: 2(2hrs) Tutorial/ Practical:1(3hrs)

More information

OECD QSAR Toolbox v.3.3. Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists

OECD QSAR Toolbox v.3.3. Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists OECD QSAR Toolbox v.3.3 Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists Background Objectives Specific Aims Trend analysis The exercise

More information

SciFinder Premier CAS solutions to explore all chemistry MethodsNow, PatentPak, ChemZent, SciFinder n

SciFinder Premier CAS solutions to explore all chemistry MethodsNow, PatentPak, ChemZent, SciFinder n ACS International Ltd vhyttinen@cas.org SciFinder Premier CAS solutions to explore all chemistry MethodsNow, PatentPak, ChemZent, SciFinder n Veli-Pekka Hyttinen May 23, 2018 CAS Hyttinen CAS supports

More information

Challenges in Big Data Chemistry Using Publicly Available Chemical Information

Challenges in Big Data Chemistry Using Publicly Available Chemical Information The 250 th ACS ational Meeting (Boston, MA) Challenges in Big Data Chemistry Using Publicly Available Chemical Information Sunghwan Kim, Gang Fu, Volker D. Hähnke, Lianyi Han, Bo Yu, Lewis Geer, Benjamin

More information

Welcome! Mass Spectrometry meets ChemInformatics WCMC Metabolomics Course 2013 Tobias Kind. Course 3: Mass spectral and molecular database search

Welcome! Mass Spectrometry meets ChemInformatics WCMC Metabolomics Course 2013 Tobias Kind. Course 3: Mass spectral and molecular database search Biology Informatics Chemistry Welcome! Mass Spectrometry meets ChemInformatics WCMC Metabolomics Course 2013 Tobias Kind Course 3: Mass spectral and molecular database search http://fiehnlab.ucdavis.edu/staff/kind

More information

EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS

EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS PETER GUND Pharmacopeia Inc., CN 5350 Princeton, NJ 08543, USA pgund@pharmacop.com Empirical and theoretical approaches to drug discovery have often

More information

NASA/IPAC EXTRAGALACTIC DATABASE

NASA/IPAC EXTRAGALACTIC DATABASE GRITS 2009 NASA/IPAC EXTRAGALACTIC DATABASE 20 years, 163 million objects, 3.3 billion rows, and beyond The Team Rick Ebert presenting What is? Mission A brief history 20 years, 3 dbms, multiple uif The

More information

Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, Dr.

Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, Dr. Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, 2006 Dr. Overview Brief introduction Chemical Structure Recognition (chemocr) Manual conversion

More information

Information Retrieval Using Boolean Model SEEM5680

Information Retrieval Using Boolean Model SEEM5680 Information Retrieval Using Boolean Model SEEM5680 1 Unstructured (text) vs. structured (database) data in 1996 2 2 Unstructured (text) vs. structured (database) data in 2009 3 3 The problem of IR Goal

More information

Introduction to Chemoinformatics

Introduction to Chemoinformatics Introduction to Chemoinformatics Dr. Igor V. Tetko Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) Institute of Bioinformatics & Systems Biology (HMGU) Kyiv, 10 August

More information

Chemistry in Bioinformatics

Chemistry in Bioinformatics F R O N T M A T T E R B O D Y Chemistry in Bioinformatics Peter Murray Rust, 1 John B. O. Mitchell, 1 and Henry S. Rzepa 2 1 Unilever Centre for Molecular Science Informatics, Department of Chemistry,

More information

Analytical data, the web, and standards for unified laboratory informatics databases

Analytical data, the web, and standards for unified laboratory informatics databases Analytical data, the web, and standards for unified laboratory informatics databases Presented By Patrick D. Wheeler & Graham A. McGibbon ACS San Diego 16 March, 2016 Sources Process, Analyze Interfaces,

More information

ICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009

ICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist How-To Guide Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist HOW TO IMPORT, SKETCH AND EDIT CHEMICALS How to access the ICM Molecular Editor. 1. Click here 2. Start sketching How to sketch

More information

Chemists are from Mars, Biologists from Venus. Originally published 7th November 2006

Chemists are from Mars, Biologists from Venus. Originally published 7th November 2006 Chemists are from Mars, Biologists from Venus Originally published 7th November 2006 Chemists are from Mars, Biologists from Venus Andrew Lemon and Ted Hawkins, The Edge Software Consultancy Ltd Abstract

More information

FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES

FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES DIFFERENT LEVELS OF KNOWLEDGE REPRESENTATION IN CHEMISTRY Michael Braden, PhD ACS / San Diego/ 2016 Overview ChemAxon Who are we? Examples/use cases: Create

More information

The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services

The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services Marc C. Nicklaus Computer-Aided Drug Design Group, Chemical Biology Laboratory, Frederick National Laboratory

More information

Ákos Tarcsay CHEMAXON SOLUTIONS

Ákos Tarcsay CHEMAXON SOLUTIONS Ákos Tarcsay CHEMAXON SOLUTIONS FINDING NOVEL COMPOUNDS WITH IMPROVED OVERALL PROPERTY PROFILES Two faces of one world Structure Footprint Linked Data Reactions Analytical Batch Phys-Chem Assay Project

More information

Standardized Representations of ELN Reactions for Categorization and Duplicate/Variation Identification

Standardized Representations of ELN Reactions for Categorization and Duplicate/Variation Identification Standardized Representations of ELN Reactions for Categorization and Duplicate/Variation Identification Roger Sayle and daniel lowe NextMove Software, Cambridge, UK Overview Electronic Lab Notebooks (ELNs)

More information

A Textbook Of Analytical Geometry Of Three Dimensions 2nd

A Textbook Of Analytical Geometry Of Three Dimensions 2nd A Textbook Of Analytical Geometry Of Three Dimensions 2nd We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer,

More information

How IJC is Adding Value to a Molecular Design Business

How IJC is Adding Value to a Molecular Design Business How IJC is Adding Value to a Molecular Design Business James Mills Sexis LLP ChemAxon TechTalk Stevenage, ov 2012 james.mills@sexis.co.uk Overview Introduction to Sexis Sexis IJC use cases Data visualisation

More information

Early Stages of Drug Discovery in the Pharmaceutical Industry

Early Stages of Drug Discovery in the Pharmaceutical Industry Early Stages of Drug Discovery in the Pharmaceutical Industry Daniel Seeliger / Jan Kriegl, Discovery Research, Boehringer Ingelheim September 29, 2016 Historical Drug Discovery From Accidential Discovery

More information

OECD QSAR Toolbox v.3.0

OECD QSAR Toolbox v.3.0 OECD QSAR Toolbox v.3.0 Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists Background Objectives Specific Aims Trend analysis The exercise

More information

Chemistry Lab Manual Timberlake Answer Key

Chemistry Lab Manual Timberlake Answer Key We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with chemistry lab manual

More information

Application integration: Providing coherent drug discovery solutions

Application integration: Providing coherent drug discovery solutions Application integration: Providing coherent drug discovery solutions Mitch Miller, Manish Sud, LION bioscience, American Chemical Society 22 August 2002 Overview 2 Introduction: exploring application integration

More information

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS 179 Molecular Informatics: Confronting Complexity, May 13 th - 16 th 2002, Bozen, Italy PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS CARLETON R. SAGE, KEVIN R. HOLME, NIANISH

More information

OECD QSAR Toolbox v.3.4. Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding

OECD QSAR Toolbox v.3.4. Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding OECD QSAR Toolbox v.3.4 Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding Outlook Background Objectives Specific Aims The exercise Workflow

More information

OECD QSAR Toolbox v.3.3. Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding

OECD QSAR Toolbox v.3.3. Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding OECD QSAR Toolbox v.3.3 Step-by-step example of how to build and evaluate a category based on mechanism of action with protein and DNA binding Outlook Background Objectives Specific Aims The exercise Workflow

More information

Introduction To Spectroscopy Pavia 3rd Edition

Introduction To Spectroscopy Pavia 3rd Edition INTRODUCTION TO SPECTROSCOPY PAVIA 3RD EDITION PDF - Are you looking for introduction to spectroscopy pavia 3rd edition Books? Now, you will be happy that at this time introduction to spectroscopy pavia

More information

Evaluating Physical, Chemical, and Biological Impacts from the Savannah Harbor Expansion Project Cooperative Agreement Number W912HZ

Evaluating Physical, Chemical, and Biological Impacts from the Savannah Harbor Expansion Project Cooperative Agreement Number W912HZ Evaluating Physical, Chemical, and Biological Impacts from the Savannah Harbor Expansion Project Cooperative Agreement Number W912HZ-13-2-0013 Annual Report FY 2018 Submitted by Sergio Bernardes and Marguerite

More information

Internet Resource Guide. For Chemical Engineering Students at The Pennsylvania State University

Internet Resource Guide. For Chemical Engineering Students at The Pennsylvania State University Internet Resource Guide For Chemical Engineering Students at The Pennsylvania State University Faith Tran 29 FEBRUARY 2016 Table of Contents 1 Front Matter 1.1 What is in the Guide... 2 1.2 Who this Guide

More information

Reaxys Managing Complexity

Reaxys Managing Complexity June 2009 Reaxys Managing Complexity Dr. Jürgen Swienty-Busch (j.swienty-busch@elsevier.com) What is Reaxys? Reaxys is Chemistry Covering more than 200 years of organic, organometallic and inorganic chemistry

More information

Organic Chem Lab Survival Manual: A Student's Guide To Techniques By James W. Zubrick READ ONLINE

Organic Chem Lab Survival Manual: A Student's Guide To Techniques By James W. Zubrick READ ONLINE Organic Chem Lab Survival Manual: A Student's Guide To Techniques By James W. Zubrick READ ONLINE AbeBooks.com: The Organic Chem Lab Survival Manual: A Student's Guide to Techniques (9781118875780) by

More information

Molecular Graphics. Molecular Graphics Expt. 1 1

Molecular Graphics. Molecular Graphics Expt. 1 1 Molecular Graphics Expt. 1 1 Molecular Graphics The study of organic chemistry has for more than a century and a half focussed on the relationship between the structure of an organic molecule (its three-dimensional

More information

ChemSpider Reactions: Delivering a free community resource of chemical syntheses

ChemSpider Reactions: Delivering a free community resource of chemical syntheses ChemSpider Reactions: Delivering a free community resource of chemical syntheses Valery Tkachenko, Colin Batchelor, Daniel Lowe, Ken Karapetyan, David Sharpe and Antony Williams ACS New Orleans April 2013

More information

Discovering The World Of Chemistry

Discovering The World Of Chemistry Discovering The World Of Chemistry Dr. Dimitrios Tzalis IMI Open Info Day: Horizon 2020 - Health, demographic change and wellbeing 18th September 2015 Brussel Taros Chemicals GmbH & Co. KG Taros: Stability,

More information

Chapter 25 Nuclear Chemistry Worksheet Answer Key

Chapter 25 Nuclear Chemistry Worksheet Answer Key We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with chapter 25 nuclear chemistry

More information

Microscale Acid-Base Titration

Microscale Acid-Base Titration Microscale Acid-Base Titration Experiment 36 A titration is a process used to determine the volume of a solution needed to react with a given amount of another substance. In this experiment, you will titrate

More information