Exploring the chemical space of screening results

Similar documents
Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds

Bridging the Dimensions:

Translating Methods from Pharma to Flavours & Fragrances

2009 Optibrium Ltd. Optibrium, STarDrop, Auto-Modeler and Glowing Molecule are trademarks of Optibrium Ltd.

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt

Relative Drug Likelihood: Going beyond Drug-Likeness

Integrated Cheminformatics to Guide Drug Discovery

Quantum Mechanical Models of P450 Metabolism to Guide Optimization of Metabolic Stability

Considering the Impact of Drug-like Properties on the Chance of Success

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration

Electrical and Computer Engineering Department University of Waterloo Canada

Introduction to Chemoinformatics

Rules for drug discovery: can simple property criteria help you to find a drug?

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

Introduction. OntoChem

N- and S-Oxidation Model of the Flavin-containing MonoOxygenases

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

Practical QSAR and Library Design: Advanced tools for research teams

Computational chemical biology to address non-traditional drug targets. John Karanicolas

Bioisosteres in Medicinal Chemistry

Machine learning for ligand-based virtual screening and chemogenomics!

Advances in Multi-parameter Optimisation Methods for de Novo Drug Design

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing

Molecular Modelling for Medicinal Chemistry (F13MMM) Room A36

One Size Doesn't Fit All: Tailoring Read-across Methodology for TSCA and Other Contexts

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS

DivCalc: A Utility for Diversity Analysis and Compound Sampling

Early Stages of Drug Discovery in the Pharmaceutical Industry

Medicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted?

De Novo molecular design with Deep Reinforcement Learning

In Search of Desirable Compounds

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Interpreting Deep Classifiers

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,..

MultiscaleMaterialsDesignUsingInformatics. S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA

Advanced Medicinal Chemistry SLIDES B

COMP 551 Applied Machine Learning Lecture 21: Bayesian optimisation

Using AutoDock for Virtual Screening

AMRI COMPOUND LIBRARY CONSORTIUM: A NOVEL WAY TO FILL YOUR DRUG PIPELINE

Data Mining in the Chemical Industry. Overview of presentation

Important Aspects of Fragment Screening Collection Design

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Validation of Experimental Crystal Structures

Clustering Ambiguity: An Overview

Building innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Hydrogen Bonding & Molecular Design Peter

Stephen McDonald and Mark D. Wrona Waters Corporation, Milford, MA, USA INT RO DU C T ION. Batch-from-Structure Analysis WAT E R S SO LU T IONS

Tell students that Earth is the only planet in our solar system known to have life. Ask:

Using Topological Data Analysis to find discrimination between microbial states in human microbiome data

In Silico Investigation of Off-Target Effects

An Integrated Approach to in-silico

Machine Learning Concepts in Chemoinformatics

Linear and Non-Linear Dimensionality Reduction

Introduction to Chemoinformatics and Drug Discovery

Structure-Activity Modeling - QSAR. Uwe Koch

Structural biology and drug design: An overview

Unsupervised Learning Methods

Similarity methods for ligandbased virtual screening

Similarity Search. Uwe Koch

DATA ANALYTICS IN NANOMATERIALS DISCOVERY

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

Neuronal Cell Health Assays. Real-time automated measurements of neuronal cell viability and neurite dynamics inside your incubator.

molecules ISSN

COMPARISON OF SIMILARITY METHOD TO IMPROVE RETRIEVAL PERFORMANCE FOR CHEMICAL DATA

The PhilOEsophy. There are only two fundamental molecular descriptors

BSc MATHEMATICAL SCIENCE

My Career in the Pharmaceutical Industry

Mixture of metrics optimization for machine learning problems

Reaxys Medicinal Chemistry Fact Sheet

Accelerating the Metabolite Identification Process Using High Resolution Q-TOF Data and Mass-MetaSite Software

How IJC is Adding Value to a Molecular Design Business

Monday May 12, :00 to 1:30 AM

Data Quality Issues That Can Impact Drug Discovery

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

Design Drugs Collaboratively Using Spotfire Visualization and Analysis

Universities of Leeds, Sheffield and York

Multi-Parameter Optimization: Identifying high quality compounds with a balance of properties

7.1 Basis for Boltzmann machine. 7. Boltzmann machines

Fast Hierarchical Clustering from the Baire Distance

Keywords: anti-coagulants, factor Xa, QSAR, Thrombosis. Introduction

JCICS Major Research Areas

Studying the effect of noise on Laplacian-modified Bayesian Analysis and Tanimoto Similarity

Patent Searching using Bayesian Statistics

Unsupervised Kernel Dimension Reduction Supplemental Material

Development of a Structure Generator to Explore Target Areas on Chemical Space

Structure-Based Drug Discovery An Overview

Institute for Functional Imaging of Materials (IFIM)

Case Study: Protein Folding using Homotopy Methods

Relationship between Least Squares Approximation and Maximum Likelihood Hypotheses

Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Evolutionary Algorithm for Drug Discovery Interim Design Report

New Synthetic Technologies in Medicinal Chemistry

Learning Tetris. 1 Tetris. February 3, 2009

Nonparametric Inference for Auto-Encoding Variational Bayes

Using Self-Organizing maps to accelerate similarity search

Quantitative structure activity relationship and drug design: A Review

Functional Group Fingerprints CNS Chemistry Wilmington, USA

Transcription:

Exploring the chemical space of screening results Edmund Champness, Matthew Segall, Chris Leeding, James Chisholm, Iskander Yusof, Nick Foster, Hector Martinez ACS Spring 2013, 7 th April 2013 Optibrium, StarDrop, Auto-Modeller and Glowing Molecule are trademarks of Optibrium Ltd.

Overview Introduction Part 1: Chemical Space Part 2: Balancing Properties in Drug Discovery Part 3: Exploring the Space Around Us Conclusions 2

Part 1: Chemical Space 3

Chemical Space: What is it? Chemical space is the space spanned by all possible (i.e. energetically stable) molecules and chemical compounds [1] estimated to exceed 10 60 an amount so vast when compared to the number of such molecules we have made, or indeed could ever hope to make, that it might as well be infinite how we should best direct our efforts towards regions of chemical space that are most likely to contain molecules with useful biological activity? [2] [1] Wikipedia [2] Kirkpatrick, P.; C. Ellis (2004). "Chemical space, Nature - Vol 432 No 7019 (Insight) pp823-865 4

Chemical Space: How do we think about it? Outer space Moon Landing 240,000 miles Solar system exploration 3.5 billion miles Galaxy mapping 30,000 light years Universe? Quite large! Chemical space Preclinical several compounds Lead optimisation Tens/hundreds of compounds Hit-finding Many thousands of compounds All theoretical molecules Too many! 5

Methodology: Chemical Space Chemical information Fingerprints Descriptors Similarity measure Tanimoto Euclidean Dimension reduction Similar compounds close together Expectation maximisation PCA [3] tsne (t-distributed Stochastic Neighbour Embedding) [4] [3] Sam Roweis (1998) Neural Information Processing Systems 10 (NIPS'97) pp.626-632 [4] van der Maaten, L., & Hinton, G. (2008). Visualizing High-Dimensional Data Using t-sne. Journal of Machine Learning Research, 9, 2579-2605. 6

Two Approaches to Visualisation PCA Well known Can be optimised for large sets Linear transformation results in information loss Biased towards keeping dissimilar compounds far apart rather than similar compounds close together tsne (t-distributed Stochastic Neighbour Embedding) Computationally expensive Optimised for visual representation at expense of some intracompound relationships 7

How They Compare...NK2 Compounds 8

How They Compare...Dopamine Actives 9

Clustering? Clustering - Tanimoto level of 0.7 using path based fingerprints [5] [5] Butina, D. (1999). Unsupervised Data Base Clustering Based on Daylight's fingerprint and Tanimoto Similarity: A fast and automated way to cluster small and large data set. J. Chem. Inf. Comput. Sci, 39, 747-750. 10

Comparing Chemical Families 11

Part 1: 5HT1A Hit Space pk i = 7 pk i = 9.5 Looking promising... 12

Part 2: Balancing Properties in Drug Discovery 13

Property 2 Property 2 The Objectives of Drug Discovery Multi-parameter optimisation Identify chemistries with an optimal balance of properties Hit X Solubility Potency X Drug Metabolic stability Safety Absorption Quickly identify situations when such a balance is not possible Fail fast, fail cheap Only when confident Solubility No good drug Property 1 Potency Property 1 Safety Metabolic stability Absorption 14

An Approach to MPO Probabilistic Scoring [6] [6] Segall et al. (2009) Chem. & Biodiv. 6 p. 2144 15

Score Probabilistic Scoring Property data Experimental or predicted Criteria for success Relative importance Uncertainties in data Data do not separate these as error bars overlap Experimental or statistical Score (Likelihood of Success) Confidence in score Error bars show confidence in overall score Bottom 50% may be rejected with confidence Best Worst

Part 1: 5HT1A Hit Space pk i = 7 pk i = 9.5 Looking promising... 17

Part 2: Potential Property Space Score = 0 Score = 0.4 Still promising? 18

Part 2: Property + Hit Space Score = 0 Score = 0.4 Still promising? 19

Part 3: Exploring the Surrounding Space Generating New Ideas... 20

Generating Compound Ideas Applying Med. Chem. Transformation Rules Compounds generated must make sense from a medicinal chemistry perspective Apply transformation rules, derived from medicinal chemistry experience, to initial compound [7][8] Library of >200 transformations Not only functional group replacement but also framework transformations Functional group addition: e.g. sulfonamide addition [7] Drug Guru: A computer software program for drug design using medicinal chemistry rules K.D. Stewart et. al. Bioorg. Med. Chem. 14 (2006) p. 7011 [8] Applying Medicinal Chemistry Transformations and Multiparameter Optimization to Guide the Search for High-Quality Leads and Candidates Matthew Segall, Ed Champness, Chris Leeding, Ryan Lilien, Ramgopal Mettu and Brian Stevens J. Chem. Inf. Model. (2011) 51(11) pp. 2967-2976 21

Generating Compound Ideas Applying Med. Chem. Transformation Rules Compounds generated must make sense from a medicinal chemistry perspective Apply transformation rules, derived from medicinal chemistry experience, to initial compound [7][8] Library of >200 transformations Not only functional group replacement but also framework transformations Linker modification: e.g. ester to amide [7] Drug Guru: A computer software program for drug design using medicinal chemistry rules K.D. Stewart et. al. Bioorg. Med. Chem. 14 (2006) p. 7011 [8] Applying Medicinal Chemistry Transformations and Multiparameter Optimization to Guide the Search for High-Quality Leads and Candidates Matthew Segall, Ed Champness, Chris Leeding, Ryan Lilien, Ramgopal Mettu and Brian Stevens J. Chem. Inf. Model. (2011) 51(11) pp. 2967-2976 22

Generating Compound Ideas Applying Med. Chem. Transformation Rules Compounds generated must make sense from a medicinal chemistry perspective Apply transformation rules, derived from medicinal chemistry experience, to initial compound [7][8] Library of >200 transformations Not only functional group replacement but also framework transformations Ring addition: e.g. benzene to indole [7] Drug Guru: A computer software program for drug design using medicinal chemistry rules K.D. Stewart et. al. Bioorg. Med. Chem. 14 (2006) p. 7011 [8] Applying Medicinal Chemistry Transformations and Multiparameter Optimization to Guide the Search for High-Quality Leads and Candidates Matthew Segall, Ed Champness, Chris Leeding, Ryan Lilien, Ramgopal Mettu and Brian Stevens J. Chem. Inf. Model. (2011) 51(11) pp. 2967-2976 23

Iterative Application Exponential Growth! 24

Part 1: 5HT1A Hit Space pk i = 7 pk i = 9.5 Looking promising... 25

Part 2: Potential Property Space Score = 0 Score = 0.4 Still promising? 26

Part 3: Exploring the Space Score = 0 Score = 0.4 Interesting? 27

Part 3: Exploring the Space Score = 0 Score = 0.4 Interesting... 28

Conclusions We can use chemical space visualisation to help identify potentially interesting areas of chemistry Using an appropriate approach to MPO we can make confident decisions despite the uncertain nature of drug discovery data We can explore a library to find chemistries with the best chance of having a good balance of properties while avoiding missed opportunities due to uncertainty By automatically generating new chemistry ideas we can determine the potential of different areas of the chemistry space around which to expand upon our hit data 29

Acknowledgements StarDrop group past and present, including: Matthew Segall Alan Beresford Chris Leeding Dawn Yates Iskander Yusof Dan Hawksley James Chisholm Joelle Gola Nick Foster Brett Saunders Hector Martinez Simon Lister Olga Obrezanova Mike Tarbit 30

...and if you would like to try it yourself... StarDrop Hands-on Workshop Tuesday 9 th, 12:00 2:30p.m. Hall B2-C, Exhibitor Workshop Room 1 Lunch provided... Register at Optibrium booth #708 31