Practical QSAR and Library Design: Advanced tools for research teams

Similar documents
Integrated Cheminformatics to Guide Drug Discovery

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

The Conformation Search Problem

The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc

The Schrödinger KNIME extensions

Biologically Relevant Molecular Comparisons. Mark Mackey

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Everyday NMR. Innovation with Integrity. Why infer when you can be sure? NMR

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

Reaxys Medicinal Chemistry Fact Sheet

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,..

CSD. Unlock value from crystal structure information in the CSD

The shortest path to chemistry data and literature

In silico pharmacology for drug discovery

Building innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery

Reaxys Improved synthetic planning with Reaxys

User Guide for LeDock

Structural biology and drug design: An overview

COMBINATORIAL CHEMISTRY IN A HISTORICAL PERSPECTIVE

The Case for Use Cases

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Cheminformatics analysis and learning in a data pipelining environment

WALKUP LC/MS FOR PHARMACEUTICAL R&D

Life Science Webinar Series

The Schrödinger KNIME extensions

Ligand Scout Tutorials

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

Docking. GBCB 5874: Problem Solving in GBCB

Design and Synthesis of the Comprehensive Fragment Library

TRAINING REAXYS MEDICINAL CHEMISTRY

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering

Cheminformatics platform for drug discovery application

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

Receptor Based Drug Design (1)

Bridging the Dimensions:

An Integrated Approach to in-silico

JCICS Major Research Areas

Pipeline Pilot Integration

Introduction. OntoChem

Unlocking the potential of your drug discovery programme

Using AutoDock for Virtual Screening

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics

Structure-Activity Modeling - QSAR. Uwe Koch

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

The Schrödinger KNIME extensions

Introducing a Bioinformatics Similarity Search Solution

October 6 University Faculty of pharmacy Computer Aided Drug Design Unit

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Schrodinger ebootcamp #3, Summer EXPLORING METHODS FOR CONFORMER SEARCHING Jas Bhachoo, Senior Applications Scientist

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS

DivCalc: A Utility for Diversity Analysis and Compound Sampling

NMR Solutions for drug discovery

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration

Fragment-based drug discovery

A Rational Approach to Receptor-Flexible Docking: Method and Validation

Automated Compound Collection Enhancement: how Pipeline Pilot preserved our sanity. Darren Green GSK

Ultra High Throughput Screening using THINK on the Internet

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Molecular Modelling for Medicinal Chemistry (F13MMM) Room A36

A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery

Reaxys Pipeline Pilot Components Installation and User Guide

for XPS surface analysis

SCULPT 3.0. Using SCULPT to Gain Competitive Insights. Brings 3D Visualization to the Lab Bench SPECIAL REPORT. 4 Molecular Connection Fall 1999

Quantitative structure activity relationship and drug design: A Review

MultiscaleMaterialsDesignUsingInformatics. S. R. Kalidindi, A. Agrawal, A. Choudhary, V. Sundararaghavan AFOSR-FA

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Drug Informatics for Chemical Genomics...

has its own advantages and drawbacks, depending on the questions facing the drug discovery.

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

CSD Conformer Generator User Guide

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

Machine Learning Concepts in Chemoinformatics

In Silico Investigation of Off-Target Effects

5.1. Hardwares, Softwares and Web server used in Molecular modeling

Application Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova

Merck Virtual Library (MVL): Deployment, Application, and Future Enhancement

Highly automated protein formulation development: a case study

Exploring the chemical space of screening results

Farewell, PipelinePilot Migrating the Exquiron cheminformatics platform to KNIME and the ChemAxon technology

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Implementation of novel tools to facilitate fragment-based drug discovery by NMR:

How IJC is Adding Value to a Molecular Design Business

Progress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery

An automated synthesis programme for drug discovery

EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS

The PhilOEsophy. There are only two fundamental molecular descriptors

Notes of Dr. Anil Mishra at 1

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

All Ions MS/MS: Targeted Screening and Quantitation Using Agilent TOF and Q-TOF LC/MS Systems

CHEMOINFORMATICS: THEORY, PRACTICE, & PRODUCTS

Statistical concepts in QSAR.

Leveraging Web GIS: An Introduction to the ArcGIS portal

ABI 3200 Q TRAP LC/MS/MS System

Chemical Data Retrieval and Management

Transcription:

DS QSAR and Library Design Webinar Practical QSAR and Library Design: Advanced tools for research teams Reservationless-Plus Dial-In Number (US): (866) 519-8942 Reservationless-Plus International Dial-In Number: (770) 790-2186 Conference Code: 8587995760

Discovery Studio DS QSAR and DS Library Design David M. Lorber, MBA, Ph.D. This presentation and/or any related documents contains statements regarding our plans or expectations for future features, enhancements or functionalities of current or future products (collectively "Enhancements"). Our plans or expectations are subject to change at any time at our discretion. Accordingly, Accelrys is making no representation, undertaking no commitment or legal obligation to create, develop or license any product or Enhancements. The presentation, documents or any related statements are not intended to, nor shall, create any legal obligation upon Accelrys, and shall not be relied upon in purchasing any product. Any such obligation shall only result from a written agreement executed by both parties. In addition, information disclosed in this presentation and related documents, whether oral or written, is confidential or proprietary information of Accelrys. It shall be used only for the purpose of furthering our business relationship, and shall not be disclosed to third parties. November 6, 2007

DS QSAR+ Property calculation protocols Calculate Properties Calculate all calculable properties in Pipeline Pilot Calculate semi-empirical QM descriptors (VAMP) Calculate density functional QM descriptors (DMOL3) Structure-activity model building protocols Create Bayesian Model Create Multiple Linear Regression Model Create Partial Least Squares Model Create Back Propagation Neural Network (BPNN) models Create Genetic Function Approximation (GFA) model Includes multiobjective Pareto optimiser

Calculate Molecular Properties All calculable properties in Pipeline Pilot Lipinski and Molecular Properties Topological Element and Molecular Property Counts - Estate Keys Surface Area and Volume, and Jurs - Shadow Indices Fingerprints Extended connectivity Functional class Path-based Atom Environment MDL Public Keys Semi-empirical QM descriptors (VAMP) energy, orbitals (HOMO/LUMO), multipoles Density functional QM descriptors (DMOL3) Statistics

Appropriate Descriptors are Critical About 20 descriptors in C2.QSAR+ Over 100 descriptors in C2.Descriptor+ Over 600 descriptors in DS

Genetic Function Approximation Evolutionary algorithm generates population of statistically valid SAR models, rather than single model Multiple models provide different insights into system Ensemble model also included Friedman's lack-of fit (LOF) error measure to control number of terms in model whilst minimising least squares error Include higher order polynomials and spline functions Creation of non-linear models Automatic outlier removal (spline) Statistic Monitor variable usage

Bayesian and Back-Propagation Neural Network Builds models with non-linear relationships More tolerant of noise and outliers BNN allows descriptors to be chosen as both response and predictor variables Replicator architecture allows non-linear modes of PCA and PLS Sensitivity analysis for determining the importance of each variable Pre- and post processing features for inputs and outputs; permits splits of fingerprint keys Categorical, numeric and 2D fingerprint data handled Any number of targets (multiple Y), and allows numerical Y variables

QSAR Modeling and Deployment Capabilities Integrated work environment Validated Statistical Methods GFA, BNN, multiple linear regression, Partial Least Squares (PLS), Bayesian modeling Compound Clustering and 3D graphing QSAR models added to descriptor list

Discovery Studio DS Library Design This presentation and/or any related documents contains statements regarding our plans or expectations for future features, enhancements or functionalities of current or future products (collectively "Enhancements"). Our plans or expectations are subject to change at any time at our discretion. Accordingly, Accelrys is making no representation, undertaking no commitment or legal obligation to create, develop or license any product or Enhancements. The presentation, documents or any related statements are not intended to, nor shall, create any legal obligation upon Accelrys, and shall not be relied upon in purchasing any product. Any such obligation shall only result from a written agreement executed by both parties. In addition, information disclosed in this presentation and related documents, whether oral or written, is confidential or proprietary information of Accelrys. It shall be used only for the purpose of furthering our business relationship, and shall not be disclosed to third parties.

The Compound Acquisition Challenge Novelty Identical (or similar) compounds should not exist in the corporate collection Lead-likeness or Drug-likeness Characterize the hit Identify the good physiochemical properties Absence of reactive functionalities Procurement Availability from a reliable supplier at an appropriate price and quantity Series Compounds can be acquired in series to establish SAR Medicinal Chemists Review Compounds required for inspection

Diverse Selection Compare Libraries Focused Selection Enumerate Virtual Library Cluster Restrained Selection Combinatorial Selection

An Example Compound Acquisition Protocol in Pipeline Pilot

Library Design & Analysis Need to compare and prioritize various chemical libraries Which of the possible chemical libraries best meets the requirements for the current project? When considering multiple libraries, which set will provide the most information about the system? Need to select library members to optimize various properties within a proposed chemical library. Of the thousands of compounds that could go into a proposed library, which will: be the most representative? have the best combination of properties?

Library Design in DS 2.0 1. Select a Library Enumeration protocol 2. Set the library parameters 3. Create the library

Library Design Book-Keeping Complete, accurate, and automatic logging of all Discovery Studio experiments in a standard easy to search and access format.

Tools for Library Optimization Common selection tools in DS Library Design: Compare libraries Compute diversity metrics on a single library Cluster molecules Find a diverse set of molecules Find similar molecules Perform a principle component analysis Perform R Group sub-setting Need to optimize multiple properties simultaneously

Optimizing Multiple Library Properties Weighted Sum Optimization Create a function as a weighted sum of properties: O library = w u * uniqueness / s u w sol. * Opt.Sol. / s sol. Drawbacks Provides only a single solution Would just a small change in optimal solubility, have a large effect on uniqueness? Weighting values are largely arbitrary Small changes in weights may dramatically change results There must be a better way

Vilfredo Pareto: Italian economist (1848-1923) Discovered 80/20 rule (80% of land in Italy owned by 20% of people) since generalized to many other contexts His idea of efficiency forms basis of Pareto optimization Pareto optimal solution: A set of solutions where any change in allocation of solutions will result in other solutions becoming worse

Pareto Optimization Puts all goals on an equal footing (no need to assign weights) Discard potential solutions that can t possibly be of interest (given the goals) Yields a family of solutions on the Pareto-optimal surface Fingerprint Uniqueness Ideal library 1,575 compounds - Create libraries of 40 compounds - What is the best library? Optimal solubility

Specific Library Design Functions Based on QSAR Pareto optimization and sorting creates libraries with optimal sampling of multiple properties DS Library design provides multiple methods for calculating diversity metrics to guide the prioritization of libraries Integrated clustering, diversity and similarity tools help select specific compounds from within the Discovery Studio work environment DS DS Library Design implements advanced descriptors and and models to to create optimal libraries

Additional New Functionality Enhanced 3D graphing Better visualization Interactive manipulation of 3D graphs Instant access to information behind graph Perl scripting capabilities Enhanced flexibility, supported world wide Support for clusters, multiple parallel processing (MPI) Faster performance, more options, innovative software design Integration with Pipeline Pilot Much greater flexibility, customizability than any other molecular modeling software product

All tools Integrated into Discovery Studio Single integrated work environment improves productivity Ligand-based and Structure-based design tools are together in a powerful environment The ability to use Pipeline Pilot protocols provides unprecedented capabilities to customize and automate workflows. Seamlessly integrate the steps of the drug development process. Our Our modeling modeling experts experts were were pleased pleased with with the the time time savings savingsrealized by by having having a single single environment environmentin in which which to to generate generate conformers, conformers, dock dock poses, poses, and and model model pharmacophores. pharmacophores. -- --Dr. Dr. Jeff Jeff Zablocki Zablocki (Head (Head of of Chemistry, Chemistry, CV CV Therapeutics) Therapeutics) Just a few of the new capabilities added to Discovery Studio 2.0 Flexible docking Loop modeling for creating better protein models; protien pk prediction De Novo Ligand Builder for creating new compounds from pharmacophores. Ligand Profiler for understanding the environment of a drug. CAESAR for improved modeling of bioactive geometries Pareto Optimization for creating ideal libraries

Integrated Work Environment Improves Productivity Legacy Tools for Drug Discovery Many disjointed programs Time consuming to change applications. New Accelrys Integrated interface High quality, new science!

Data Exchange Among Scientist is Critical Example: Deploy pharmacophore and ADMET screening tools so that chemists have the ability to easily add, predict, and visualize new compounds. Create Create a workflow around around your your tools tools Allow Allow the the Chemists to to use use it it

Conclusion DS QSAR includes an extensive descriptor set and powerful tools for building predictive models QM descriptors, GFA DS Library Design provides the ability to quickly and easily generate targeted chemical libraries R-group and reaction-based library enumeration Pareto optimization Discovery Studio integrates these applications into a complete drug discovery solution

Acknowledgements TienLuu Shikha Varma-O Brien Dana Honeycutt Jon Sutter and the R&D Team Thank You