Capturing Chemistry: What you see is what you get in the world of mechanism and chemical transformations

Capturing Chemistry: What you see is what you get in the world of mechanism and chemical transformations
Dr. Stephan Schürer, Head of Intl. Sci. Content, Libraria, Inc.
sschurer@libraria.com

Distribution of Knowledge
- In the heads of the employees: 42%
- Paper documents: 26%
- Electronic documents: 20%
- Others: 12%
Source: Nachr. Chem. 2002, 12, 1416
In many companies only 20 to 40% of existing knowledge is utilized.

Pharmaceutical Industry Challenges
- Patent expiration of many drugs
- Increased R&D costs, but fewer drugs in the pipeline
- Uncertainty of success: high attrition rate
- Retirement of researchers and job fluctuation
How to activate and utilize (and preserve) knowledge as a resource?

Knowledge-Based Drug Discovery
[Diagram labels: Chemistry; Public and proprietary data archive; SAR; Libraries; Enumeration Software LUCIA; QSAR Models; ADME Models; Tox Models; Selection / Optimization]

"I think having breadth in chemistry is the way to go" (with respect to identifying multiple lead series). Chris Lipinski, DDT Jan 2003 interview
Enumeration Engine
- Interprets reaction mechanisms
- Multi-component reactions
- Rearrangement reactions
- Intramolecular reactions (cyclizations)
- Complex multi-step transformations
- Reaction step subsequences
- Generic stereochemistry
- Multiple reaction sites
- Multiple matches in building blocks
- Product mixtures
- Formation of stereoisomers
- Salt forms of reactants / products

"I think having breadth in chemistry is the way to go" (with respect to identifying multiple lead series) Chris Lipinski, DDT Jan 2003 interview Enumeration Engine Easy to use All interactions via intuitive GUI Web-browser and drawing tool Interactive graphical debugging functionality Failed BB analysis Synthetic sequences of individual compounds Fast 10K library 2 minutes 100K library 7 minutes

"Where computational chemistry tools really shine is capturing the types of functionality you want to avoid and they are pretty good at filtering by property, but they do a poor job of taking into account, chemical feasibility. Chris Lipinski, DDT Jan 2003 interview Chemistry Archive 14.600 Reaction transforms with detailed experimental procedures Reactions are categorized by mechanism and product type igh throughput chemistry Kinase-related heterocyclic chemistry 2.3 Million Compounds 77 % unique compounds 60 % actual 23 % duplicated compounds (alternative reaction pathways) 35 % polymer-bound 40 % virtual

Demo Enumeration Sequence
[Reaction scheme with Piv groups and R1, R3 substituents]
- Stereoselective multi-component coupling (stereochemical induction)
- Generic stereochemistry (chiral centers do not influence the reaction)

Demo Enumeration Sequence
[Reaction scheme with [Ru] catalyst, CO2Me and Ts groups, and R1 substituents]
- Yne-ene cross metathesis
- Diels-Alder and double elimination
- Formation of isomeric product mixtures

Demo Enumeration Sequence
[Reaction scheme with R and R1 substituents]
- Rearrangement via nitrene (stereochemical configuration is preserved)

Another Enumeration Sequence
[Reaction scheme with Fmoc-protected building blocks and R1, R3, R4 substituents; asterisks mark stereocenters]
- Generic stereochemistry in reactions
- Side chain modifications in synthetic steps
Houghten et al., J. Comb. Chem. 1999, 1, 195

Side-Chain Modifications
[Reaction scheme with Boc and t-Bu protecting groups, R2 and Me substituents]
- Steps shown: alkylation, reduction, cleavage
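The enumeration sequences above combine building blocks at defined reaction sites. As a toy illustration of the combinatorial step only, the sketch below joins string fragments at a single attachment point marked `*` and reports building blocks that do not have exactly one site. Everything here is hypothetical: the fragment notation, the names, and the joining rule are stand-ins, not the SMIRKS-based mechanism handling of the LUCIA software, and the code assumes the site sits at the end of the left fragment and the start of the right one.

```python
from itertools import product

# Hypothetical building blocks; "*" marks the open attachment point.
ACIDS = {
    "acetyl": "CC(=O)*",
    "benzoyl": "c1ccccc1C(=O)*",
    "diacid": "*C(=O)CC(=O)*",  # two sites: ambiguous, reported as failed
}
AMINES = {"methylamine": "*NC", "aniline": "*Nc1ccccc1"}


def one_site(fragment):
    """True if the fragment has exactly one attachment point."""
    return fragment.count("*") == 1


def enumerate_library(left_blocks, right_blocks):
    """Return (products, failed) for all left x right combinations."""
    failed = sorted(
        name
        for blocks in (left_blocks, right_blocks)
        for name, fragment in blocks.items()
        if not one_site(fragment)
    )
    products = {}
    pairs = product(left_blocks.items(), right_blocks.items())
    for (left_name, left), (right_name, right) in pairs:
        if one_site(left) and one_site(right):
            # Assumes the site is at the end of `left` and the start of `right`.
            joined = left.replace("*", "") + right.replace("*", "")
            products[f"{left_name}+{right_name}"] = joined
    return products, failed


products, failed = enumerate_library(ACIDS, AMINES)
for name, smiles in products.items():
    print(name, smiles)
print("failed building blocks:", failed)
```

A real engine would enumerate every match of a multi-site building block rather than reject it, which is how product mixtures arise; rejecting it here simply keeps the sketch short.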

"Another recent theme is how to efficiently capture chemical data from the literature, especially in terms of constructing quantitative structure-activity relationship datasets." Chris Lipinski, DDT Jan 2003 interview
SAR Archive
- ~6K assay protocols with detailed experimental assay procedures
- ~300 kinase-related targets, categorized by mechanism
- SAR organized by assay type, target, etc.
- SAR grouped by binding-mode assay conditions
- Technology can capture SAR on any target / gene family
- Kinase gene family: ~50K data points, ~16K unique compounds
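Organizing SAR by assay type and target amounts to grouping data points on those fields. The sketch below shows one way to do that; the record layout, field names, compound and target labels, and activity values are all hypothetical placeholders, not the schema of the archive described in the talk.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class DataPoint:
    """One hypothetical SAR measurement."""
    compound: str
    target: str
    assay_type: str
    ic50_nm: float


# Made-up example records for illustration only.
POINTS = [
    DataPoint("cmpd-1", "kinase_A", "binding", 12.0),
    DataPoint("cmpd-2", "kinase_A", "binding", 340.0),
    DataPoint("cmpd-1", "kinase_A", "cellular", 95.0),
    DataPoint("cmpd-1", "kinase_B", "binding", 2100.0),
]


def group_by(points, *fields):
    """Group data points by the given attribute names."""
    groups = defaultdict(list)
    for point in points:
        key = tuple(getattr(point, field) for field in fields)
        groups[key].append(point)
    return dict(groups)


by_target_and_assay = group_by(POINTS, "target", "assay_type")
unique_compounds = {point.compound for point in POINTS}

for key, members in by_target_and_assay.items():
    print(key, [m.compound for m in members])
print("unique compounds:", len(unique_compounds))
```

For scale, the kinase figures on the slide (~50K data points over ~16K unique compounds) work out to roughly three data points per compound.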

Integrated Computational Models
- E-screen / QSAR models
- 12 quantitative e-screen models developed
- 2 binary models
- Data for 20 more models of kinase gene family targets
- Other calculated descriptors
- Molecular properties
- Lipinski properties
- ADME models
- Tox
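The "Lipinski properties" listed above are commonly applied as the rule of five: molecular weight at most 500, logP at most 5, at most 5 hydrogen-bond donors, and at most 10 hydrogen-bond acceptors. The sketch below applies those thresholds to precomputed descriptors. The descriptor values and compound names are invented, the dictionary keys are an assumed layout, and allowing one violation is a common convention rather than anything stated in the talk.

```python
# Rule-of-five upper limits, keyed by hypothetical descriptor names.
RULE_OF_FIVE = {
    "mol_weight": 500,
    "logp": 5,
    "h_donors": 5,
    "h_acceptors": 10,
}

# Made-up descriptor values for illustration only.
COMPOUNDS = {
    "cmpd-1": {"mol_weight": 350.4, "logp": 2.1, "h_donors": 2, "h_acceptors": 5},
    "cmpd-2": {"mol_weight": 612.7, "logp": 6.3, "h_donors": 3, "h_acceptors": 9},
}


def violations(descriptors):
    """Return the names of the rule-of-five limits that are exceeded."""
    return [
        name for name, limit in RULE_OF_FIVE.items() if descriptors[name] > limit
    ]


def passes(descriptors, max_violations=1):
    """Apply the filter; one violation is tolerated by default (an assumption)."""
    return len(violations(descriptors)) <= max_violations


for name, descriptors in COMPOUNDS.items():
    print(name, "pass" if passes(descriptors) else "fail", violations(descriptors))
```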

Demo Enumeration Results and Analysis
- Enumerated products can be committed to the database or exported
- Compounds are analyzed by a variety of integrated models
- Results are automatically sent to the user by email with a live link to the server

Enabling Technologies
- DayCart 4.81 w/ program objects
- SMILES, SMARTS, SMIRKS
- Reaction toolkit 4.81
- Oracle 9i
- WebLogic 6.0
- Misc. tools and utilities
- QikProp from Schrödinger
- Linux
- RDF automapper (Infochem)
- CSFC (SMILES depiction)
- MDL Chime Pro plugin
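SMIRKS, one of the Daylight notations listed above, writes a reaction transform as `reactants>agents>products`, with mapped atoms such as `[C:1]` paired across the arrow. The sketch below is a plain-text sanity check that every atom map number appears exactly once on each side. It is not a SMIRKS parser and does not use any Daylight toolkit call; the example transforms are illustrative and have not been validated against a toolkit.

```python
import re

# Matches the map number in a mapped atom such as "[C:1]".
ATOM_MAP = re.compile(r":(\d+)\]")

# Illustrative transform: acyl chloride plus amine gives amide.
AMIDE_FORMATION = "[C:1](=[O:2])Cl.[N:3]>>[C:1](=[O:2])[N:3]"
# Deliberately unbalanced example: map 2 is missing on the product side.
UNBALANCED = "[C:1][O:2]>>[C:1]"


def split_reaction(smirks):
    """Split 'reactants>agents>products' into its three parts."""
    parts = smirks.split(">")
    if len(parts) != 3:
        raise ValueError("expected exactly two '>' separators")
    return tuple(parts)


def atom_maps(side):
    """Sorted list of atom map numbers found on one side of the transform."""
    return sorted(int(number) for number in ATOM_MAP.findall(side))


def maps_balanced(smirks):
    """True if both sides carry the same map numbers, each used once per side."""
    reactants, _agents, products = split_reaction(smirks)
    left, right = atom_maps(reactants), atom_maps(products)
    return left == right and len(set(left)) == len(left)


print(maps_balanced(AMIDE_FORMATION))
print(maps_balanced(UNBALANCED))
```

Unmapped atoms, such as the chlorine in the first transform, are ignored by this check because they may be added or removed by a transformation.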

Libraria's Discovery Platform
- Based upon an accepted industry-standard database platform and software (Oracle/Daylight)
- Fully web-based to make the technology available to bench chemists
- Scalable, open, and enterprise-wide architecture
- Integrated, searchable archive of chemistry and SAR data with increasingly valuable knowledge content
- Intuitive technology for leveraging the medicinal chemist's conceptual capabilities in drug discovery
The knowledgebase architecture is a growing repository of chemistry, biology, and derived computational models: a learning machine that automatically develops and utilizes its predictive capability.

Acknowledgements
- Daylight Chemical Information Systems
- Infochem GmbH
- Wolf-Dietrich Ihlenfeldt
- Peter Ertl
Thank you for listening!