Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Similar documents
Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom

Structural biology and drug design: An overview

Similarity Search. Uwe Koch

Coefficient Symbol Equation Limits

Description of Molecules with Molecular Interaction Fields (MIF)

In silico pharmacology for drug discovery

Nonlinear QSAR and 3D QSAR

Structure-Activity Modeling - QSAR. Uwe Koch

Computational chemical biology to address non-traditional drug targets. John Karanicolas

BIOINF Drug Design 2. Jens Krüger and Philipp Thiel Summer Lecture 5: 3D Structure Comparison Part 1: Rigid Superposition, Pharmacophores

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Identification of Active Ligands. Identification of Suitable Descriptors (molecular fingerprint)

Design and Synthesis of the Comprehensive Fragment Library

Introduction. OntoChem

Plan. Day 2: Exercise on MHC molecules.

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Machine Learning Concepts in Chemoinformatics

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part I: Motivation, Basics, Descriptors

Using AutoDock for Virtual Screening

5.1. Hardwares, Softwares and Web server used in Molecular modeling

Docking. GBCB 5874: Problem Solving in GBCB

A Review on Computational Methods in Developing Quantitative Structure-Activity Relationship (QSAR)

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015

The PhilOEsophy. There are only two fundamental molecular descriptors

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Universities of Leeds, Sheffield and York

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Receptor Based Drug Design (1)

Performing a Pharmacophore Search using CSD-CrossMiner

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies

Chapter 8: Introduction to QSAR

Molecular Modeling Study of Some Anthelmintic 2-phenyl Benzimidazole-1- Acetamides as β-tubulin Inhibitor

Keywords: anti-coagulants, factor Xa, QSAR, Thrombosis. Introduction

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

Using Self-Organizing maps to accelerate similarity search

Biologically Relevant Molecular Comparisons. Mark Mackey

Relative Drug Likelihood: Going beyond Drug-Likeness

Machine learning for ligand-based virtual screening and chemogenomics!

Universities of Leeds, Sheffield and York

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Data Quality Issues That Can Impact Drug Discovery

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Practical QSAR and Library Design: Advanced tools for research teams

Efficient overlay of molecular 3-D pharmacophores

Structural Bioinformatics (C3210) Molecular Docking

Universities of Leeds, Sheffield and York

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner

Hydrogen Bonding & Molecular Design Peter

Author Index Volume

QSAR METHODS DEVELOPMENT, VIRTUAL AND EXPERIMENTAL SCREENING FOR CANNABINOID LIGAND DISCOVERY

Bridging the Dimensions:

Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors. Robert D. Clark 2004 Tripos, Inc.

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Statistical concepts in QSAR.

Virtual screening in drug discovery

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

International Journal of Research and Development in Pharmacy and Life Sciences. Review Article

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface

T. J. Hou, Z. M. Li, Z. Li, J. Liu, and X. J. Xu*,

Tutorials on Library Design E. Lounkine and J. Bajorath (University of Bonn) C. Muller and A. Varnek (University of Strasbourg)

Supplementary information

Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses

Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design

Introduction to Chemoinformatics and Drug Discovery

Identifying Interaction Hot Spots with SuperStar

Molecular Descriptors Theory and tips for real-world applications

Ping-Chiang Lyu. Institute of Bioinformatics and Structural Biology, Department of Life Science, National Tsing Hua University.

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Hit Finding and Optimization Using BLAZE & FORGE

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

QSAR of Microtubule Stabilizing Dictyostatins

Research Article. Chemical compound classification based on improved Max-Min kernel

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Cheminformatics analysis and learning in a data pipelining environment

Why do we need so many chemical similarity search methods?

Targeting protein-protein interactions: A hot topic in drug discovery

has its own advantages and drawbacks, depending on the questions facing the drug discovery.

EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS

Analyzing Building Blocks Diversity for DNA Encoded Library Design. Cresset User Group Meeting Nik Stiefl & Finton Sirockin, Novartis

3D QSAR analysis of quinolone based s- triazines as antimicrobial agent

3D QSAR and pharmacophore modeling studies on C-2 and C-4 substituted Quinazoline series as Anti-tubercular agents.

Different conformations of the drugs within the virtual library of FDA approved drugs will be generated.

An Integrated Approach to in-silico

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

* Author to whom correspondence should be addressed; Tel.: ; Fax:

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Patrick: An Introduction to Medicinal Chemistry 5e Chapter 01

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

Kernel-based Machine Learning for Virtual Screening

A reliable computational workflow for the selection of optimal screening libraries

Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening

Spacer conformation in biologically active molecules*

Transcription:

Drug Design 2 Oliver Kohlbacher Winter 2009/2010 11. QSAR Part 4: Selected Chapters Abt. Simulation biologischer Systeme WSI/ZBIT, Eberhard-Karls-Universität Tübingen

Overview GRIND GRid-INDependent Descriptors Idea Construction Results How good is QSAR really?

Problems in 3D QSAR Core problem in 3D QSAR studies is the correct alignment of the correct conformations Which conformers should be aligned? Same problems as for structure comparison The larger the number of degrees of freedom in the molecule, the more difficult it gets to align the molecules properly Bultnick et al., p. 589

Problems in 3D QSAR Core problem in 3D QSAR studies is the correct alignment of the correct conformations Meaningful alignments can only be constructed if the there is structural knowledge of the binding mode It is often not clear, which property is most relevant for the superposition (hydrophobic interactions? electrostatics?) Wrong alignments lead to wrong energies in the grids and thus to incorrect QSAR models Bultnick et al., p. 589

GRIND/ALMOND GRid-INDependent descriptors for 3D QSAR aim at alleviating the alignment problem Idea Compute virtual description of the receptor in the form of interaction points Encode the geometry of the interaction points in an translation- and rotation-invariant representation Use this representation as descriptors for PLS GRIND is implemented in the (commercial) software package ALMOND Pastor et al., J. Med. Chem., 2000, 43, 3233

VRS Virtual Receptor Sites GRIND uses the GRID potentials of six different probe groups to build a virtual representation of the receptor From this grid-based representation a sparse representation based on interaction points is computed Filter algorithms thin out this point set to about 100 points per probe group Pastor et al., J. Med. Chem., 2000, 43, 3233

MIFs Compute MIFs Molecular Interaction Fields, i.e. GRID potentials for various probe groups around the ligand In CoMFA: grid entries are directly used as matrix entries -> alignment issues Instead: reduce grid values to interaction points (similar to what LUDI does, see lecture Drug Design 1) Filtering reduces number of interaction points further MIF of an amide probe around caffeine Pastor et al., J. Med. Chem., 2000, 43, 3233

VRS Virtual Receptor Sites Areas in the MIFs with significant negative energies are interesting We search for a sparse representation of these areas Points in these regions can be considered places, where the receptor might have interaction sites They are thus called virtual receptor sites (VRSs) Bultnick et al., p. 593

VRS Virtual Receptor Sites In contrast to the position of a VRS, the distance between pairs of VRSs is rotation- and translationinvariant We use a representation based on distances alone Compute distribution of distances between certain VRSs In analogy to a radial distribution function Disadvantages Loss of information Not invertible Bultnick et al., p. 593

Recap: 3D Descriptors RDF The distribution of all intramolecular distances is captured by the radial distribution function (RDF) Because the distances are typically discrete, they are turned into a density function: B is a constant describing the width of the Gaussian contribution to the density function RDF is a one-dimensional description of the threedimensional geometry of a molecule

Recap: 3D Descriptors RDF GE, p.416

Recap: 3D Descriptors RDF Isomers and even conformers (which, by definition, yield identical 2D descriptors) can be distinguished using the RDF Disadvantage: RDF is a vector, not a single number Similar to RigFit, physicochemical properties can also be integrated into RDFs: where p i, p j are the respective values for the property of atoms i and j

GRIND - Correlations GRIND applies the RDF idea to interaction points Instead of correlating atom coordinates, we use the interaction points Correlations can be computed just like RDFs Where RDFs multiply properties, GRIND multiplies the respective interaction energies Pastor et al., J. Med. Chem., 2000, 43, 3233

Auto- and Cross-Correlations ALMOND typically uses six different correlograms that are concatenated This results in a vector of scalar values used to encode the molecule Similar interaction options in similar distances will result in similar correlograms Probe 1 Probe 2 Interaction between DRY DRY Hydrophobic O O Hydrogen bond (HB) donor N1 N1 Hydrogen bond acceptor DRY O Hydrophobic and HB donor DRY N1 Hydrophobic and HB acc. O N1 HB acceptor and donor Pastor et al., J. Med. Chem., 2000, 43, 3233

Auto- and Cross-Correlations Pastor et al., J. Med. Chem., 2000, 43, 3233

Results Set of 10 superimposed correlograms (N1-N1) of glucose analogs The compounds are inhibitors of glycogen phosphorylase Points are color-coded according to biological activity (active = red, intermediate = white, inactive = green) Interaction energies correlate with activity! Pastor et al., J. Med. Chem., 2000, 43, 3233

Results Several cpds with serotoninergic (5- HT2A) activity GRIND model based on PLS Model: 3 LV R 2 = 0.93 Q 2 = 0.81 Average error: 0.35 Pastor et al., J. Med. Chem., 2000, 43, 3233

Benchmarking Descriptors It is often very hard to decide, what type of descriptors should be chosen There are also very few studies systematically comparing different descriptor sets on large independent datasets One study was published in 2006 by Gedeck et al. Datasets 944 in-house datasets (Novartis) 570,000 data points in total 143,000 different compounds Gedeck et al, J. Chem. Inf. Model., 2006, 46:1924-36

Benchmarking Descriptors Descriptors GRIND AlogP atom counts Avalon fingerprints (hashed, graph-based fingerprints) Pipeline Pilot fragment counts (hashed structurebased fingerprints) Hologram QSAR (graph based fragment counts) MDL public keys (binary structural fingerprint, 166 bit) Similog descriptors (pharmacophoric triplet counts) Gedeck et al, J. Chem. Inf. Model., 2006, 46:1924-36

Comparing Descriptors Gedeck et al, J. Chem. Inf. Model., 2006, 46:1924-36

Results Gedeck et al, J. Chem. Inf. Model., 2006, 46:1924-36

Caveats So far no consistent single set of descriptors has been found Most of these studies yield contradictory results Descriptor choice also varies with the target/scaffold: descriptors good on one target might be poor on another Also, careful data preparation plays a key role: Gedeck et al. write: Interestingly, the computationally most involved model based on the Almond descriptors, which uses the three-dimensional structure of the compound for the prediction, performed poorest. This might be due to the fact that the description of the three-dimensional structure of compounds is difficult, in general [...], and that the protocol to generate the structures in the present study was inappropriate, in particular. Gedeck et al, J. Chem. Inf. Model., 2006, 46:1924-36

References Books Bultnick et al. (Hrsg.), Computational Medicinal Chemistry for Drug Discovery, Marcel Dekker Inc., New York, 2004 Papers Pastor M, Cruciani G, McLay I, Pickett S, Clementi S. GRid- INdependent descriptors (GRIND): a novel class of alignmentindependent three-dimensional molecular descriptors. J Med Chem. 2000, 43(17):3233-43. Fontaine F, Pastor M, Zamora I, Sanz F. Anchor-GRIND: filling the gap between standard 3D QSAR and the GRid-INdependent descriptors. J Med Chem. 2005,48(7):2687-94. Gedeck P, Rohde B, Bartels C. QSAR how good is it in practice? Comparison of descriptor sets on an unbiased cross section of corporate data sets. J Chem Inf Model. 2006, 46(5):1924-36.