Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

Similar documents
Using AutoDock for Virtual Screening

Introduction to Chemoinformatics and Drug Discovery

Easier and Better Exploitation of PhysChem Properties in Medicinal Chemistry

JCICS Major Research Areas

Introduction. OntoChem

Supporting Information (Part II) for ACS Combinatorial Science

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences

Structural biology and drug design: An overview

Evolutionary Algorithm for Drug Discovery Interim Design Report

Enamine Golden Fragment Library

Lead- and drug-like compounds: the rule-of-five revolution

Advanced Medicinal Chemistry SLIDES B

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part I: Motivation, Basics, Descriptors

Chemical library design

Drug Informatics for Chemical Genomics...

Rules for drug discovery: can simple property criteria help you to find a drug?

ASSESSING THE DRUG ABILITY OF CHALCONES USING IN- SILICO TOOLS

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

Data Mining in the Chemical Industry. Overview of presentation

QSAR in Green Chemistry

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Relative Drug Likelihood: Going beyond Drug-Likeness

Identifying Promising Compounds in Drug Discovery: Genetic Algorithms and Some New Statistical Techniques

Practical QSAR and Library Design: Advanced tools for research teams

Chemoinformatics and Drug Discovery

Ultra High Throughput Screening using THINK on the Internet

Plan. Day 2: Exercise on MHC molecules.

Unlocking the potential of your drug discovery programme

Bridging the Dimensions:

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

International Journal of Pharma and Bio Sciences CONCEPT OF DRUG LIKENESS IN PHARMACEUTICAL RESEARCH ABSTRACT

Targeting protein-protein interactions: A hot topic in drug discovery

The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc

Hydrogen Bonding & Molecular Design Peter

Introduction to Chemoinformatics

A Framework For Genetic-Based Fusion Of Similarity Measures In Chemical Compound Retrieval

Flow of information in a drug discovery pipeline. 7th lecture Modern Methods in Drug Discovery WS16/17 1

Analyzing Small Molecule Data in R

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

An Integrated Approach to in-silico

Alkane/water partition coefficients and hydrogen bonding. Peter Kenny

Multi-Parameter Optimization: Identifying high quality compounds with a balance of properties

ChemDiv Beyond the Flatland 3D-Fragment Library for Fragment-Based Drug Discovery (FBDD)

Exploring the chemical space of screening results

Machine learning for ligand-based virtual screening and chemogenomics!

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Bioinformatics Workshop - NM-AIST

FRAUNHOFER IME SCREENINGPORT

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Building predictive unbound brain-to-plasma concentration ratio (K p,uu,brain ) models

molecules ISSN

AMRI COMPOUND LIBRARY CONSORTIUM: A NOVEL WAY TO FILL YOUR DRUG PIPELINE

Advances in Multi-parameter Optimisation Methods for de Novo Drug Design

Considering the Impact of Drug-like Properties on the Chance of Success

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Receptor Based Drug Design (1)

Compound Acquisition Strategies

2009 Optibrium Ltd. Optibrium, STarDrop, Auto-Modeler and Glowing Molecule are trademarks of Optibrium Ltd.

Computational Models for the Prediction of Intestinal Membrane Permeability

DATA FUSION APPROACHES IN LIGAND-BASED VIRTUAL SCREENING: RECENT DEVELOPMENTS OVERVIEW

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

Design and Synthesis of the Comprehensive Fragment Library

Similarity Search. Uwe Koch

Web tools for Monomer selection, Library Design and Compound Acquisition. Andrew Leach GlaxoSmithKline Research and Development Stevenage

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann

Application integration: Providing coherent drug discovery solutions

Open PHACTS Explorer: Compound by Name

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

DivCalc: A Utility for Diversity Analysis and Compound Sampling

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom

Building innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery

Translating Methods from Pharma to Flavours & Fragrances

Quest for the Rings. In Silico Exploration of Ring Universe To Identify Novel Bioactive Heteroaromatic Scaffolds

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing

In Silico Design of New Drugs for Myeloid Leukemia Treatment

Evolution of toxic properties of Anti Alzheimer s Drugs through Lipinski s rule of five

QSAR of Microtubule Stabilizing Dictyostatins

Structure-Activity Modeling - QSAR. Uwe Koch

molecules ISSN

In silico pharmacology for drug discovery

Retrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a

Reaxys The Highlights

A reliable computational workflow for the selection of optimal screening libraries

Bioisosteres in Medicinal Chemistry

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection

De Novo molecular design with Deep Reinforcement Learning

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt

In Silico Investigation of Off-Target Effects

Background: Imagine it is time for your lunch break, you take your sandwich outside and you sit down to enjoy your lunch with a beautiful view of

Lecture for Molekylär bioinformatik X3 Feb Computational Chemistry in Drug Discovery. Mats Kihlén. Head of Research Informatics.

Simplifying Drug Discovery with JMP

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS

Comparing Multi-Label Classification Methods for Provisional Biopharmaceutics Class Prediction

Data Warehousing & Data Mining

Transcription:

Computational Methods and Drug-Likeness Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

The Problem Drug development in pharmaceutical industry: >8-12 years time ~$800m costs >90% failure rate PK 39 % Animal Tox 11 % Other 5 % Adverse effects 10 % Lack of Efficacy 30 % Business 5 % Kennedy (1997) DDT 2:436-444 Provided by Dr. Andreas Reichel, Schering AG Page 2

Overview General Problem Drug-likeness, mathematical formulation and realization Molecular Properties Calculation and Approximation Countable Properties Complex Properties Practical Application Databases and Data Mining Methods Counting Methods Knowledge-based Methods Computational Methods: PCA, genetic algortihms, decision trees and other Summary, Conclusion and References Page 3

General Problem Summary Drug-likeness: Profile of common chemical, physical and/or physiological properties of successful drugs. Positive match of candidate molecule properties with such a profile points to drug-likeness. Problem: Mathematical formulation of property profiles. Specification of scoring function. Algorithmic implementation. Page 4

Realization of Drug-likeness Mathematics: N-dimensional space S spanned by molecule properties. f: S R (Score) Score may express general drug-likeness or may approximate a value for a property correlated with a drug effect. Example: Prediction of colon permeability from molecular weight and a structural property. Classification by scoring function. Page 5

Calculation and Approximation of Properties Looking for molecular properties which: 1. allow an assertion about physiological behavior of the molecule. 2. give hints towards the qualification of a molecule as Lead or drug. 3. can be calculated efficiently or readily approximated. Page 6

Elementary/Countable Properties Molecular weight (MW) Functional groups H-bonds donors and -acceptors Rotatable bonds (single covalent bonds) Page 7

Complex Properties (1) Molecular properties, which can not be deduced from a single structural feature, approximation with heuristic formulas computational methods bioinformatics Page 8

Complex Properties (2) LogP Lipophilicity of a molecule Different schemes for calculation: MlogP: Calculation with rule-based function of 13 elementary properties MlogP = 1.244 * Num_CX^0.6-1.017 * Num_NO^0.9 + 0.406 * N_O_Prox - 0.145 * Num_UB^0.8... ClogP: Calculation by structure via fragmentation of isolated C- atoms (combinatorial libraries) Page 9

Complex Properties (3) Polar surface area (PSA) Relative share of polar surface to the total molecule surface Page 10

Complex Properties (4) Molecular Fingerprints Bit-vector representation of molecule structure Daylight method: patterns for paths of different lengths within the molecule Example: OC=CN 0-bond paths: C O N 1-bond paths: OC C=C CN 2-bond paths: OC=C C=CN 3-bond paths: OC=CN Each pattern is used as a seed in a PRNG to obtain bits Summation of the bits for each pattern yields fingerprint Page 11

Complex Properties (5) Properties of combinatorial libraries: Diversity, Potency, Selectivity Diversity: Similarity measure among components of the library. Potency: Measure of efficacy of a compound. Selectivity: Measure of target specificity. Page 12

Practice: Databases World Drug Index (WDI) 60,000 drugs Comprehensive Medicinal Chemistry Database CMC 8,400 drugs with 3D models and properties Available Chemicals Database ACD 238,000 commercially available compounds Page 13

Practice: Structure Data Formats SMART / SMILE: Regular Expressions for Molecules http://www.daylight.com/ - WDI entry for VIAGRA Page 14

Practice: Data Mining (CMC Screenshots) Page 15

Counting Methods Pfizer Rule of Five USAN library: 2,245 drug-like compounds extracted from WDI database Criteria: Existing USAN or INN Name No polymeres or peptides clinical exposure Rules: A compound is not drug-like, if >= 5 H-bond donors >= 10 H-bond acceptors MW >= 500 Log P <= 5 Page 16

Knowledge-based Methods (1) Approximation of target properties with simple functions on known properties. QSPR (Quantitative Structure-Property Relationship) QSAR (Quantitative Structure-Activity Relationship) Examples: Caco2 permeability: Caco2 logp = 0.008 x MW 0.043 x PSA 5165 Human effective permeability: logp = -2.546 0.011 x PSA 0.278 x HBD Blood-Brain-Barrier permeability: logbb = -0.0148 x PSA + 0.152 x clogp + 0.139 Page 17

Knowledge-based Methods (2) Filters for functional Groups (SMART Pattern) Groups with toxic properties, undesired reactivity or chemical properties Example: Page 18

Knowledge-based Methods (3) Search for drug-like structure patterns (Frameworks) Data set: 5,120 CMC compounds Framework definition: Result: 32 frameworks describe 50% of the data. Page 19

Knowledge-based Methods (4) Frameworks Page 20

Computational: Principal Components Analysis 21_A 15_A 21_A 18_B 11_B 18_B 15_A 11_B Principal Components Analysis: Each compound is characterized by certain (selected) properties. A compound could be represented as a point in an n-dimensional space, where n is the total number of properties. PCA is used to project this high-dimensional space into three-dimensional space. The set is broken down to three principal components, where each principal component is a linear combination of defined values of the properties. The result are three components maintaining as much of the variability of the experiments as possible. Page 21

Computational: Genetic Algorithms (1) Principle: Given: Chromosomen-Space Fitness-function Algorithm: Create random initial population P Iteration until convergence: compute fitness values in P, f : discard chromosomes with unsufficient fitness apply evolutionary operations on P (mutations, crossing over) X X R f ( p), p P Page 22

Computational: Genetic Algorithms (2) Example: Optimisation of combinatorial libraries (Multi-property Optimization) A Chromosome = components of a combinatorial library f ( A) = w (1 D( A)) + w f + K Fitness function: d f1 1 D( A) = B sim( Ai, Aj) 2 ( f1( A) b X b) i j b = 1 2 f1 = N B Page 23

Computational: Decision Trees Training with 3,500 molecules from WDI and ACD each. 70-80% classification success Page 24

Computational: Other Approaches The generic nature of the problem setting allows for numerous different approaches: - Simulated Annealing - Neural Networks - Bayesian Networks Page 25

Conclusion (1) Advantges: faster and cheaper to the goal rough classification of compounds available very quickly fewer experiments early indication of possible success high troughput screening possible reduced failure rate in later stages of drug development Page 26

Conclusion (2) Disadvantages: learning from known compounds may prevent innovation drugs may not be useful to find new leads models are strongly simplified physiological interpretation is not always possible binary results of some computational methods are less useful for Lead-Design beyond first indication Page 27

References -Clark DE, Pickett SD (2000) Computational methods for the prediction of 'druglikeness' Drug Discov Today 5(2), 49 45, -Walters WP, Murcko MA (2002) Prediction of 'drug-likeness'. AdvDrug DelivRev 54(3), 255 257 -Gillet, V.J., Willett, P., Bradshaw, J. Green (1999) Selecting Combinatorial Libraries to Optimise Diversity and Physical Properties. Journal of Chemical Information and Computer Sciences. 39: 169-177 -Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46(1-3), 3 2 Page 28