DRUG DISCOVERY TODAY ELN ELN. Chemistry. Biology. Known ligands. DBs. Generate chemistry ideas. Check chemical feasibility In-house.

Similar documents
The shortest path to chemistry data and literature

TRAINING REAXYS MEDICINAL CHEMISTRY

Reaxys Medicinal Chemistry Fact Sheet

Reaxys The Highlights

Searching Substances in Reaxys

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE

Structure and Reaction querying in Reaxys

DiscoveryGate. One place for the answers you need. Chulalongkorn University, Thailand

In Silico Investigation of Off-Target Effects

NOVEMBER 2014 REAXYS R101 BASIC REACTION QUERIES

Reaxys Managing Complexity

Erfassung und Speicherung von Forschungsdaten im Fachbereich Chemie

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

Reaxys Pipeline Pilot Components Installation and User Guide

Kalexsyn Overview Kalexsyn, Inc Campus Drive Kalamazoo, MI Phone: (269) Fax: (269)

Internet Resource Guide for Undergraduate Organic Chemists

Introduction. OntoChem

A powerful site for all chemists CHOICE CRC Handbook of Chemistry and Physics

Chemists are from Mars, Biologists from Venus. Originally published 7th November 2006

Overview. Database Overview Chart Databases. And now, a Few Words About Searching. How Database Content is Delivered

Reaxys Improved synthetic planning with Reaxys

DECEMBER 2014 REAXYS R201 ADVANCED STRUCTURE SEARCHING

MDL Databases. Solutions At Every Stage in the R&D Process. MDL Databases. Information Systems, Inc.

Finding the Needle - Reaxys Structure Searching

CSD. Unlock value from crystal structure information in the CSD

Geo-enabling a Transactional Real Estate Management System A case study from the Minnesota Dept. of Transportation

Patent Granted 2,899 Patent 14,484 Filed % 38%

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan

Chemically Intelligent Experiment Data Management

MSc Drug Design. Module Structure: (15 credits each) Lectures and Tutorials Assessment: 50% coursework, 50% unseen examination.

Demand Forecasting. for. Microsoft Dynamics 365 for Operations. User Guide. Release 7.1. April 2018

FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES

Introducing a Bioinformatics Similarity Search Solution

Integrated Electricity Demand and Price Forecasting

Integrated Cheminformatics to Guide Drug Discovery

Expanding the scope of literature data with document to structure tools PatentInformatics applications at Aptuit

Open PHACTS Explorer: Compound by Name

The Case for Use Cases

The MDL Discovery Framework: Data and Application Integration in the Life Sciences

Geodatabase Best Practices. Dave Crawford Erik Hoel

Introducing the New SciFinder. Veli Pekka Hyttinen Regional Marketing Manager, Central and Eastern Europe Jasna April 1, 2014

INTRODUCTION TO ARCGIS Version 10.*

Compounding insights Thermo Scientific Compound Discoverer Software

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

DiscoveryGate SM Version 1.4 Participant s Guide

SciFinder Content Update

IUCLID Substance Data

Science of Synthesis Guided Examples

GIS-T 2010 Building a Successful Geospatial Data Sharing Framework: A Ohio DOT Success Story

SpringerMaterials The Landolt-Börnstein Database

WALKUP LC/MS FOR PHARMACEUTICAL R&D

Structure Searching in CrossFire Beilstein. DiscoveryGate SM Version 1.4 Participant s Guide

Pharmaceutical e-learning at the University of Innsbruck

FORENSIC TOXICOLOGY SCREENING APPLICATION SOLUTION

Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, Dr.

Course Plan for Pharmacy (Bachelor Program) No.: (2016/2017) Approved by Deans Council by decision (09/26/2016) dated (03/08/2016) )160) Credit Hours

Solution Story: Dr. Carsten Schauerte, Managing Director of solid-chem GmbH

Chemistry Informatics in Academic Laboratories: Lessons Learned

Telecommunication Services Engineering (TSE) Lab. Chapter IX Presence Applications and Services.

SpyMeSat Mobile App. Imaging Satellite Awareness & Access

ESTABLISHMENT OF KARNATAKA GEOPORTAL AND ITS ROLE IN PLANNING

Canadian Board of Examiners for Professional Surveyors Core Syllabus Item C 5: GEOSPATIAL INFORMATION SYSTEMS

Welcome to Chemistry! Mr R Frankis and Dr C Walker

Analytical data, the web, and standards for unified laboratory informatics databases

Spatial Asset Management

PubChem data extraction and integration using Instant JChem. Oleg Ursu Cristian Bologa Tudor I. Oprea Division of Biocomputing

Administering your Enterprise Geodatabase using Python. Jill Penney

ATLAS of Biochemistry

Ákos Tarcsay CHEMAXON SOLUTIONS

Searching CrossFire Databases-

LABOTRON EXTRACTION & SYNTHESIS 2450 MHz

Introduction to Chemoinformatics and Drug Discovery

Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility

Introduction to Spark

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

The conceptual view. by Gerrit Muller University of Southeast Norway-NISE

Marvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier

for Effective Land Administration

Oak Ridge Environmental Information System (OREIS): Formalizing the Shapes of Things on the Oak Ridge Reservation 15393

Everyday NMR. Innovation with Integrity. Why infer when you can be sure? NMR

Application integration: Providing coherent drug discovery solutions

Navigating between patents, papers, abstracts and databases using public sources and tools

Web GIS Administration: Tips and Tricks

Application Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova

Searching CrossFire Beilstein Using DiscoveryGate. DiscoveryGate Version 2.2 Participant s Guide

Elsevier R&D Solutions. Tool Sheet. Exploring a chemical reaction

MeteoGroup FleetGuard. The world s most comprehensive SaaS fleet management system

Geog 469 GIS Workshop. Managing Enterprise GIS Geodatabases

Building a National Data Repository

CAPE FARM MAPPER - an integrated spatial portal

How to Create a Substance Answer Set

Basic Techniques in Structure and Substructure

Bentley Map Advancing GIS for the World s Infrastructure

Bentley Geospatial update

Chemical and Biological Sciences

Intelligent NMR productivity tools

Search for Substance Data

Since 1988 High Force Research has worked with organizations operating in practically all end use sectors requiring chemical synthesis input, and

A Novel Software Tool for Crystallization Process Development

Transcription:

DRUG DISCOVERY TODAY Known ligands Chemistry ELN DBs Knowledge survey Therapeutic target Generate chemistry ideas Check chemical feasibility In-house Analyze SAR Synthesize or buy Report Test Journals & Patents ELN Biology Check ADME/Tox Flatfiles Docs Journals 1

Dr. Sebastian Radestock Product Manager Reaxys Elsevier Information Systems GmbH Frankfurt am Main Germany MAKING HIDDEN DATA DISCOVERABLE: HOW TO BUILD EFFECTIVE DRUG DISCOVERY ENGINES? 2

DRUG DISCOVERY TOMORROW Biology integrator Knowledge survey Therapeutic target Known ligands Generate chemistry ideas Analyze SAR Chemistry In-house Federated search system Check chemical feasibility Synthesize or buy Chemistry integrator ELN DBs Report Test Journals & Patents Literature management tool Biology Integrator for drug safety Check ADME/Tox Integrator for biomedical data Docs ELN Flatfiles 3

DRUG DISCOVERY TOMORROW TWO APPROACHES TO SOLVING THE CHALLENGE OF DATA ACCESS FEDERATED MODEL WAREHOUSE APPROACH CHEMISTRY INTEGRATOR Storage and capture Analysis system Storage Capture ELN User ELN 4

EXAMPLE IMPLEMENTATION OF THE FEDERATED MODEL FEDERATED MODEL CHEMISTRY INTEGRATOR Expansion of the existing system by integrating Reaxys Storage and capture In-house In-house Analysis system Existing system for consolidating in-house structure and bioactivity data User 5

Scientific data model THE REAXYS DATA BASE CONTAINS ALL PUBLISHED AND HISTORICAL CHEMISTRY DATA Compounds, substance property data, preparations, reactions, and bibliographic information from 400 core chemistry journals from relevant chemistry patents Manual extraction of all the data Coverage of 500 property data fields, from basics like boiling point or melting point, via crystal data and magnetic properties, to spectra All together 750 million data points Historical chemistry data dating back to 1771 6

EXPANDED BIBLIOGRAPHIC CONTENT IN REAXYS SUPPORTING MULTI-DISCIPLINARY RESEARCH Bibliographic data from 16.000 periodicals covering chemistry and related sciences have been loaded into Reaxys This goes beyond journals and patents, it includes conference proceedings, business articles, reviews etc. old new AGRICULTURAL AND BIOLOGICAL SCIENCES BIOCHEMISTRY, GENETICS AND MOLECULAR BIOLOGY CHEMICAL ENGINEERING DENTISTRY EARTH AND PLANETARY SCIENCES ENERGY ENGINEERING ENVIRONMENTAL SCIENCE IMMUNOLOGY AND MICROBIOLOGY MATERIALS SCIENCE MEDICINE NEUROSCIENCE PHARMACOLOGY, TOXICOLOGY AND PHARMACEUTICS PHYSICS AND ASTRONOMY VETERINARY ETC. 7

REAXYS-TREE AND AUTOMATIC INDEXING THE NEXT STEP COMING 2014 Title/Abstract Step 1 Index for chemistry terms Step 2 Add chemistry relevant keywords and identified chemical entities The Biosynthesis of Aristeromycin. Conversion of Neplanocin A to Aristeromycin by a Novel Enzymatic Reduction Partially purified cell-free extracts of the aristeromycin producer Streptomyces citricolor have been shown to catalyze the NADPHdependent reduction of neplanocin A to aristeromycin. Stereochemical studies revealed that the reduction proceeds with anti-geometry and involves transfer of the 4 pro-r hydrogen atom of NADPH to the 6'β position of aristeromycin. Reaxys Keywords: Aristeromycin biosynthesis, enzymatic reduction, Neplanocin A Step 3 Translate chemical names into structures and make them searchable 8

FEDERATED SEARCH SYSTEM WITH REAXYS LOOK-UP ACCESS TO REAXYS IS VIA THE REAXYS APPLICATION PROGRAMMING INTERFACE (API) How should the API look like? 9

THE REAXYS APPLICATION PROGRAMMING INTERFACE (API) LESSONS LEARNED Customers want to have access to all data in Reaxys Substances and substance property data Reactions and reaction details Citations Customers want to have access to all functionality of Reaxys Exact structure and reaction searching, similarity and substructure searching Factual queries Further processing of hitsets The Reaxys API was designed to be based on exchanging XML code between the user and the Reaxys server via HTML POST requests Security and usage tracking is an issue Secure communication via HTTPS POST is supported The Reaxys API is stateful, and login is required 10

LIMITATIONS OF THE FEDERATED MODEL SOME DISADVANTAGES TO CONSIDER FEDERATED MODEL CHEMISTRY INTEGRATOR Performance and availability of the system is dependent on the source systems Storage and capture In-house In-house Analysis system No clean-up and normalization of the data ELN User Architecture allows easy expansion when new data source becomes available 11

EXAMPLE IMPLEMENTATION OF THE WAREHOUSE APPROACH Bioactivity data was normalized, and structures were deduplicated CHEMISTRY INTEGRATOR WAREHOUSE APPROACH Structures A customer set up a system that contains structure and bioactivity data, and IP information Analysis system Storage Capture All data was extracted, translated and loaded to fit into one unified data model (UDM) User In-house 12

PLATFORM FOR STRUCTURE AND BIOACTIVITY DATA STRUCTURE DATA FROM REAXYS COMES FROM THE REAXYS STRUCTURE FLAT FILE 13

LIMITATIONS OF THE OF THE WAREHOUSE APPROACH SOME DISADVANTAGES TO CONSIDER WAREHOUSE APPROACH CHEMISTRY INTEGRATOR Structures Long implementation time and associated high cost Analysis system Storage Capture Difficult to accommodate differences or changes in data types User In-house 14

EXAMPLE IMPLEMENTATION OF A FLEXIBLE APPROACH A CONTENT INTEGRATION SOLUTION THAT IS NOW AVAILABLE TO ALL REAXYS USERS FEDERATED MODEL WAREHOUSE APPROACH Real-time commercial availability and pricing information comes via an emolecules API Storage and capture CHEMISTRY INTEGRATOR Storage Capture Pricing Structures Structures Structures from PubChem and emolecules have been integrated into Reaxys User Using multiple storage systems eliminates the need for one UDM 15

Results are available from different data source tabs The substance crosslinking icon allows to switch to corresponding substances in other data sources 16

Results are available from different data source tabs Filter for substances that are/aren t contained in other data sources 17

Results are available from different data source tabs Filter for substances that are/aren t contained in other data sources 18

The commercial availability icon allows to check real-time pricing information from emolecules 19

PubChem property headers with direct links to PubChem 20

FLEXIBLE APPROACH FOR INTEGRATION LESSONS LEARNED Reaxys has proven to be extremely powerful as analysis and database system Separation of the data from different data sources into multiple storage systems is the way to go if a powerful crosslinking mechanism is in place Some pieces of information that are subject to frequent updates should be integrated using the federated model 21

EXAMPLE IMPLEMENTATION OF A FLEXIBLE APPROACH A CONTENT INTEGRATION SOLUTION THAT ELSEVIER BUILT FOR ROCHE FEDERATED MODEL WAREHOUSE APPROACH Reaxys with PubChem and emolecules integrated CHEMISTRY INTEGRATOR Storage and capture Storage Capture Pricing Structures Structures Integration of Roche proprietary data on chemistry experiments User ELN In-house Reactions 22

INTEGRATION OF ROCHE IN-HOUSE DATA THE SITUATION AT ROCHE YESTERDAY AND TODAY 23

https://reaxys.roche.com Up to four Roche reaction data sources are supported Roche-specific data is included in the Output (PDF, MS-Word etc.) Filters on Roche-specific data fields Data on references and/or experiments, including PDF links Normalized Roche-specific reaction data 24

https://reaxys.roche.com A Roche icon allows to switch to a Roche in-house repository The reaction crosslinking icon allows to switch to corresponding reactions in other data sources 25

https://reaxys.roche.com Start building a synthesis tree by clicking on the synthesize link 26

https://reaxys.roche.com Synthesis planner opens up The first step of the synthesis plan is selected from the Roche data source 27

https://reaxys.roche.com Add a second step One step has been added 28

https://reaxys.roche.com The second step of the synthesis plan is selected from Reaxys New reactions are loaded 29

https://reaxys.roche.com Another step has been added Roche Experimental details of the mixed synthesis plan are summarized in a table. 30

INTEGRATION OF ROCHE IN-HOUSE DATA CUSTOMER FEEDBACK Usability and acceptance tests by Roche showed: Increased productivity of researchers at Roche Increased discoverability of the Roche reaction content Reduced maintenance effort for Roche: Legacy systems were decommissioned Roche gets on-going maintenance and functionality improvements by Elsevier Not compromise in security Flexible approach: Additional data sources have been added 31

DRUG DISCOVERY TOMORROW Chemistry In-house DBs Knowledge survey Therapeutic target Known ligands Generate chemistry ideas Analyze SAR Federated search system Check chemical feasibility Synthesize or buy ELN Report Test Check ADME/Tox Biology Journals & Patents MedScan Docs ELN Flatfiles 32

THANK YOU QUESTIONS? Dr. Sebastian Radestock Product Manager Reaxys Elsevier Information Systems GmbH Frankfurt am Main, Germany s.radestock@elsevier.com 33