Ultra High Throughput Screening using THINK on the Internet

Similar documents
PATTERN RECOGNITION AND DISTRIBUTED COMPUTING

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

CSD. Unlock value from crystal structure information in the CSD

Introduction to Chemoinformatics and Drug Discovery

Conformational Searching using MacroModel and ConfGen. John Shelley Schrödinger Fellow

Kd = koff/kon = [R][L]/[RL]

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

Integrated Cheminformatics to Guide Drug Discovery

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

Ligand Scout Tutorials

Introduction. OntoChem

Data Mining in the Chemical Industry. Overview of presentation

Structural biology and drug design: An overview

Using AutoDock for Virtual Screening

In silico pharmacology for drug discovery

Schrodinger ebootcamp #3, Summer EXPLORING METHODS FOR CONFORMER SEARCHING Jas Bhachoo, Senior Applications Scientist

The PhilOEsophy. There are only two fundamental molecular descriptors

BioSolveIT. A Combinatorial Docking Approach for Dealing with Protonation and Tautomer Ambiguities

Generating Small Molecule Conformations from Structural Data

Computational chemical biology to address non-traditional drug targets. John Karanicolas

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

5.1. Hardwares, Softwares and Web server used in Molecular modeling

An Integrated Approach to in-silico

Garib N Murshudov MRC-LMB, Cambridge

Design and Synthesis of the Comprehensive Fragment Library

Practical QSAR and Library Design: Advanced tools for research teams

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

Cheminformatics platform for drug discovery application

Biologically Relevant Molecular Comparisons. Mark Mackey

Structural Bioinformatics (C3210) Molecular Docking

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

Virtual Screening: How Are We Doing?

The Schrödinger KNIME extensions

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Retrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a

Identifying Interaction Hot Spots with SuperStar

Docking. GBCB 5874: Problem Solving in GBCB

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Unlocking the potential of your drug discovery programme

Drug Informatics for Chemical Genomics...

Introducing a Bioinformatics Similarity Search Solution

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

In silico generation of novel, drug-like chemical matter using the LSTM deep neural network

Analyzing Molecular Conformations Using the Cambridge Structural Database. Jason Cole Cambridge Crystallographic Data Centre

Receptor Based Drug Design (1)

Pipelining Ligands in PHENIX: elbow and REEL

Searching Substances in Reaxys

Autodock tutorial VINA with UCSF Chimera

Chemical library design

The Quantum Landscape

Docking with Water in the Binding Site using GOLD

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Dock Ligands from a 2D Molecule Sketch

Different conformations of the drugs within the virtual library of FDA approved drugs will be generated.

The Conformation Search Problem

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options for grouping with metabolism

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design

User Guide for LeDock

1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

The Long and Rocky Road from a PDB File to a Protein Ligand Docking Score. Protein Structures: The Starting Point for New Drugs 2

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS

Improving Protein Function Prediction with Molecular Dynamics Simulations. Dariya Glazer Russ Altman

Pose Prediction with GOLD

Using AutoDock 4 with ADT: A Tutorial

Lecture for Molekylär bioinformatik X3 Feb Computational Chemistry in Drug Discovery. Mats Kihlén. Head of Research Informatics.

Introduction to Structure Preparation and Visualization

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Data Quality Issues That Can Impact Drug Discovery

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Bridging the Dimensions:

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

Performing a Pharmacophore Search using CSD-CrossMiner

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

The Cambridge Structural Database (CSD) a Vital Resource for Structural Chemistry and Biology Stephen Maginn, CCDC, Cambridge, UK

Flexibility and Constraints in GOLD

In Silico Investigation of Off-Target Effects

Part 6. 3D Pharmacophore Modeling

Large Scale Evaluation of Chemical Structure Recognition 4 th Text Mining Symposium in Life Sciences October 10, Dr.

Molecular modeling with InsightII

Web tools for Monomer selection, Library Design and Compound Acquisition. Andrew Leach GlaxoSmithKline Research and Development Stevenage

Protein Structure Prediction and Protein-Ligand Docking

The reuse of structural data for fragment binding site prediction

DOCKING TUTORIAL. A. The docking Workflow

Tautomerism in chemical information management systems

MD Simulation in Pose Refinement and Scoring Using AMBER Workflows

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification

Building innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery

Tutorial. Getting started. Sample to Insight. March 31, 2016

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Principles of Drug Design

SCULPT 3.0. Using SCULPT to Gain Competitive Insights. Brings 3D Visualization to the Lab Bench SPECIAL REPORT. 4 Molecular Connection Fall 1999

Transcription:

Ultra High Throughput Screening using THINK on the Internet Keith Davies Central Chemistry Laboratory, Oxford University Cathy Davies Treweren Consultants, UK

Blue Sky Objectives Reduce Development Failures Real choice of leads candidates for follow-up Better decisions Reduce Drug Discovery Research Timescales Costs

CAN-DDO Project Cancer Research Project Organised by Oxford University United Devices (www.ud.com) Used THINK Software as a Screen-saver Funded by NFCR and Sponsored by Intel 12 Targets 3.5 Billion Molecules Largest Computational Chemistry Project

CAN-DDO Virtual Screening Suppliers Catalogues Property & Substructure Filtering Conformer Generation Combinatorial Libraries Dock & Score Saved Hits Protein Active Site Generation of Queries Pharmaco- Phores

Pharmacophores Centre Types H-bond Donor H-bond Acceptor Acid Base Positive Charge Negative Charge Aromatic Ring Lipophile 4 User Definable Options 3 or 4 Centre Types User Definable Bins Occurrence Frequency Output to File

Generating Pharmacophores A D H N O H A D N,O D O A D=Donor; A=Acceptor (Not to scale) H

Molecule Summary Reviewed 1.4 Million Catalogue Molecules 1 177 Million Library Molecules Drug-like 0.4 Million Catalogue Molecules 35 Million Library Molecules De Novo Derivatives 100 for Each Molecule Automatically Filtered Total of 3.5 Billion Molecules

De Novo Structure Generation Anneal Initial Structure Compute Diversity Determine Similarity to Actives & Inactives Apply Transforms & Select New Molecule Learn from Bio Data Check Suitability Calculate Properties & Apply Filters

Filtering (1) Properties Substructures Unstable Reactive Undesirable (eg toxic) Centres 2:9 Mass 150:800 XSA 20:240 Rot-Bonds <10 Conformers <1000000

Filtering (2) Properties Substructures Unstable Reactive Undesirable (eg toxic) NOC [ND]=O N[SSP2] OO NN N=N=N N=C=O N=C=N N[HAL] OC[HAL] C=COH HNO [NITR] O[SSP2] SS N#N N[CAK]O N=C=S O[HAL] S[HAL] NC[HAL]

Filtering (3) Properties Substructures Unstable Reactive Undesirable (eg toxic) CC(H)=C(H)C=O C =COH C=CNH O=C[HAL] N=C[HAL] S=C[HAL] [HAL]C[HAL] HOCOH O=COC=O COS(=O)(=O)C COS(=O)OC

Filtering (4) Properties Substructures Unstable Reactive Undesirable (eg toxic) [M] [Si]O C1XC1 SC#N NP C=P CN#C O=CC#N NC#N PP [Si]N SH C1SC1 C(=O)S PS P[HAL] PC#N OCC#N CC(H)=O

Conformer Generation Modes Systematic Random Sample Contact Check VdW CPK (0.6*VdW) Bond Rotations 3 Single 2 Conjugated 6 Alpha 0 Amide (ie Off) 0 Ring (ie Off)

Site Searching Generation of Complementary Centres Ligand Pharmacophores Matched to Site Ranking Hits Based on ChemScore Equation G = G 0 + G hbond *N hbond + G lipo *N lipo + G rot *N rot + E where G 0 G hbond G lipo G rot are constants (-5.48;-3.34; -0.117; 2.56) N hbond is the number of qualifying interactions (on geometric criteria) N lipo is the number of lipohilic contacts N rot is the number of frozen rotatable bonds in the ligand E is the VdW interaction energy and ligand torsional energy

THINK Architecture SMILES ISIS SD PDB 2D & 3D Builders 2D & 3D Visualisation Combinatorial Chemistry 2D & Similarity Searching Data Analysis De Novo Structure Generation Conformer Generation 3D Searching Pharmacophore Generation Active Site Searching Modules Core 2D 3D Pharmacophore

Data Bandwidth Issues Protein Query PDB Format Centres Site Residues 50-100Kbytes Molecule Data per Work Unit 100 Molecules per Job in SMILES 5-10 Kbytes Results SMILES on Internet ISIS SD on Intranet

Performance THINK 1.03 (used for CAN-DDO) 42,000,000,000 molecules 126,000 years 900 molecules per CPU day (excluding redundancy) THINK 1.2 (current release) Optimised with assistance from Intel Up to 40 times faster More centres useful for larger sites Refinement of docked geometry About 250,000 molecules per 2GHz CPU day

Summary Results Hit Summary Good Hits Crude Hits 12 11 10 Protein Targets 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 Number of Hits (Log10 scale)

Results to Date 7.2 Million Preliminary Crude Hits Post-Processing (using LigandFit) Eliminate more false positives Optimise X,Y,Z position, rotation and torsions Oxford University Spin-off - Inhibox

<0 <300 <600 <900 L8 L18 L23B L43 L2 L1 L36 C2 C3 C1 L7A L21 L39 L4 L37 L7 L32 L6 L20 L5 L41 L5A L4A 0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05 % Hit rate Hit Score Libraries Vascular Endothelial Growth Factor L8 L18 L23B L43 L2 L1 L36 C2 C3 C1 L7A L21 L39 L4 L37 L7 L32 L6 L20 L5 L41 L5A L4A

Member Statistics Motivating Volunteers Feedback & Recognition Counting Jobs, Molecules, Hits etc Teams Promoting Membership Sense of identity and dependence Points & Rewards Donations to Charities eg Make a Wish

Calculation Redundancy Validation Processors Tampering Non Returns Member Attrition Job Failures Job Scheduling Transaction Model Multiple Servers

Project Progress Meaningful Feedback Open Communication Project Comparisons

Find-a-Drug Multiple Servers Future Projects Cancer HIV Bioterrorism Proteome

Acknowledgements United Devices, TX NFCR Graham Richards, Oxford University