Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom


Overview: 2D-QSAR

Peter Gedeck
Novartis Institutes for BioMedical Research, NHRC, Horsham, UK

Overview
- Definition
- Examples: feature counts; topological indices; 2D fingerprints and fragment counts; R-group descriptors
- How good are 2D descriptors in practice?
- Summary

Definition: What are descriptors?
Described object: atom, group, molecule.
Descriptor value: number, vector, function.
Kinds of descriptor:
- Physicochemical property (log P)
- Derived properties (distribution of surface electrostatic potential)
- Abstract properties (fingerprint = fragment count)

2D-QSARs are based on descriptors derived from a two-dimensional graph representation of a molecule:
- 1D: molecular formula
- 2D: molecular connectivity / topology
- 3D: molecular geometry / stereochemistry
- 4D/5D/...: conformational ensembles

Good descriptors should characterize molecular properties important for molecular interactions: hydrophobic, electronic, steric / size / shape, hydrogen bonding.

A recently published encyclopaedia describes several thousand molecular descriptors used in QSAR and molecular modelling: R. Todeschini, V. Consonni, Handbook of Molecular Descriptors, Wiley, 2000.

We cannot cover all! So, here is a selection.

Feature counts
- Hydrogen bond donors
- Hydrogen bond acceptors
- Number of rings
- Number of rotatable bonds
Features are usually defined using substructures or SMARTS patterns. A SMILES and SMARTS tutorial is referenced on the slide; the link is not preserved in the transcript.

Feature counts, application: ALogP
Ghose and Crippen developed an atom-based model for logP (ALogP).
- Ghose AK, Crippen GM. Atomic physicochemical parameters for three-dimensional structure-directed quantitative structure-activity relationships. I. Partition coefficients as a measure of hydrophobicity. J Comput Chem 7 (1986) 565-577.
- Wildman SA, Crippen GM. Prediction of physicochemical parameters by atomic contributions. J Chem Inf Comput Sci 39 (1999) 868-873.
The atoms of a molecule are classified into different atom types (aromatic carbon; primary and secondary aliphatic carbon; ...). logP is then a linear model over the atom types:

$$\log P = \sum_i f_i \, n_i$$

where n_i is the number of atoms of type i and f_i is the contribution of that atom type. The approach has been extended to molar refractivity.

Feature counts, application: Polar Surface Area (PSA)
PSA is the sum of the surface contributions of polar atoms (usually oxygens, nitrogens and attached hydrogens). This descriptor is easy to interpret and, most importantly, it correlates very well with drug transport properties.
- Ertl P, Rohde B, Selzer P. J Med Chem 43 (2000) 3714.
Plot: PSA vs. 3D-PSA (the number of molecules n and the correlation r are not preserved in the transcript).

2D fingerprints: generalisation of feature counts
Fragment dictionary fingerprints:
- Defined structural features, e.g. (public) keys
- Pre-defined fragments may not be suitable for the dataset
Hashed fingerprints:
- Automatically generated fragments
- Convert each fragment to a unique number to obtain the fingerprint
- Either fold the large fingerprint into a short representation (Daylight, HQSAR) or use it as is (SciTegic)
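The linear atom-contribution model for logP described above can be sketched in a few lines. This is a minimal illustration only: the atom-type names and the contribution values f_i below are hypothetical placeholders, not the published Ghose-Crippen or Wildman-Crippen parameters, and the atom typing itself (normally done with substructure or SMARTS matching) is assumed to have happened already.

```python
from collections import Counter

# Hypothetical contributions f_i per atom type (illustrative values only,
# NOT the published ALogP parameters).
CONTRIBUTIONS = {
    "C_aromatic": 0.30,
    "C_aliphatic": 0.15,
    "O_hydroxyl": -0.30,
    "H": 0.10,
}


def linear_logp(atom_types, contributions=CONTRIBUTIONS):
    """Return sum_i f_i * n_i, where n_i counts the atoms of type i."""
    counts = Counter(atom_types)
    unknown = set(counts) - set(contributions)
    if unknown:
        raise KeyError(f"no contribution defined for atom types: {sorted(unknown)}")
    return sum(contributions[atom_type] * n for atom_type, n in counts.items())


# A phenol-like molecule, already classified into atom types by hand.
phenol_like = ["C_aromatic"] * 6 + ["O_hydroxyl"] + ["H"] * 6
estimate = linear_logp(phenol_like)
```

Because the model is a plain weighted count, the same function could serve for molar refractivity by swapping in a different contribution table.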

HQSAR: fragment-based fingerprints
- Break the structure into fragments and count the occurrences of each unique fragment
- Convert each fragment to a unique number
- Combine the counts for all possible fragments into a vector of numbers = hologram
- Interpret the numbers as bits or counts
- Reduce the length of the vector by folding = reduced hologram

HQSAR application: parameters
- Level of detail encoded: atoms; atoms/bonds; atoms/bonds/connections; atoms/bonds/connections/chirality (R/S)
- Minimum and maximum size of fragments
- Length of reduced hologram
A large-scale QSAR study comparing various methods is described later.

R-group descriptors
Example of a descriptor developed for a very specific application.
- Hirons L, Holliday JD, Jelfs SP, Willett P, Gedeck P. Use of the R-Group Descriptor for Alignment-Free QSAR. QSAR Comb. Sci. (2005).

Lead optimisation datasets
- Series of compounds with a common core structure
- Systematic variation of substituents
- Modification often localised at a small part of the molecule

Scientific rationale: What determines binding?
Figure: ligand with substituents R1, R2, ... interacting with glutamate, glutamine and isoleucine residues (Muszynski IC et al., QSAR).
- Protein-ligand interactions: hydrophobic, hydrogen bond, electrostatic
- The position of the pharmacophore in space is important
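The hologram construction described above (fragment, count, hash, fold) can be sketched as follows. This is a simplified illustration under stated assumptions: only linear fragments are enumerated, fragments are encoded by atom symbols alone (the "atoms" level of detail), and `zlib.crc32` is used as a stand-in hash function. The function names `linear_fragments` and `fold_hologram` are hypothetical and do not correspond to any HQSAR software API.

```python
import zlib
from collections import Counter


def linear_fragments(atoms, bonds, min_size=2, max_size=4):
    """Count linear fragments (paths) containing min_size..max_size atoms."""
    neighbours = {i: set() for i in range(len(atoms))}
    for a, b in bonds:
        neighbours[a].add(b)
        neighbours[b].add(a)
    counts = Counter()

    def extend(path):
        # Count each path once: only when traversed from its lower-index end.
        if len(path) >= min_size and (len(path) == 1 or path[0] < path[-1]):
            labels = tuple(atoms[i] for i in path)
            canonical = min(labels, labels[::-1])  # direction-independent key
            counts["-".join(canonical)] += 1
        if len(path) < max_size:
            for nxt in neighbours[path[-1]]:
                if nxt not in path:
                    extend(path + [nxt])

    for start in range(len(atoms)):
        extend([start])
    return counts


def fold_hologram(fragment_counts, length=16):
    """Hash each fragment to a bin and add up counts (the folding step)."""
    hologram = [0] * length
    for fragment, n in fragment_counts.items():
        hologram[zlib.crc32(fragment.encode("utf-8")) % length] += n
    return hologram


# Heavy-atom skeleton of ethanol: C-C-O
fragments = linear_fragments(["C", "C", "O"], [(0, 1), (1, 2)])
hologram = fold_hologram(fragments, length=16)
```

Folding means different fragments can collide in the same bin, which is why the hologram length is a tunable parameter of the method.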

Scientific rationale: How to capture binding information?
- The influence of the core on binding is constant for a lead series
- Differences in substituent properties cause differences in binding
- The influence of substituents for the same binding mode is almost additive
The descriptor needs to
- encode the position of pharmacophoric features in space
- have properties that correlate with binding interactions:
  hydrophobic: atomic polarisability
  hydrogen bond: hydrogen bond donor/acceptor counts, polar surface area
  electrostatic: atomic charge
Further considerations:
- The distance of functional groups from the core is important for binding
- Only substituents with a single attachment point are of interest
- Substituents are fairly small

R-group descriptor
- Assign properties to the atoms of the R-group
- Determine the distance of each atom to the attachment point
- Combine properties and distances to form the descriptor
Figure: example descriptors (atomic polarisability and atomic charge) for phenyl, aminophenyl, fluorophenyl and hydroxycyclohexyl substituents; the substituent positions and descriptor values are not preserved in the transcript.

Variations
Atom-based, based upon the sum of atomic properties:
- Atomic weights and partial charges
- Atomic contributions to LogP, MR and PSA
- H-bond acceptor and donor counts (HBA and HBD)
Surface-based, based upon maximum-positive and minimum-negative surface potentials:
- Molecular Electrostatic Potentials (MEP)
- Molecular Lipophilicity Potentials (MLP)
Field-based, based upon molecular interaction fields (MIF, GRID):
- Dry probe: hydrophobic interactions
- Carbonyl oxygen probe: HBD interactions
- Amide nitrogen probe: HBA interactions

QSAR
Diagram: data, structure, descriptors, model, predictions. Descriptors represent properties of the structure.
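The three steps of the R-group descriptor (assign atomic properties, measure distance to the attachment point, combine) can be sketched as below. The slides do not state how properties and distances are combined, so this sketch assumes the simplest option: summing the atomic property over all atoms at the same bond distance from the attachment atom. The function names and the property values are hypothetical.

```python
from collections import deque


def bond_distances(n_atoms, bonds, attachment):
    """Breadth-first search: number of bonds from the attachment atom to each atom."""
    neighbours = [[] for _ in range(n_atoms)]
    for a, b in bonds:
        neighbours[a].append(b)
        neighbours[b].append(a)
    distance = {attachment: 0}
    queue = deque([attachment])
    while queue:
        atom = queue.popleft()
        for nxt in neighbours[atom]:
            if nxt not in distance:
                distance[nxt] = distance[atom] + 1
                queue.append(nxt)
    return distance


def r_group_descriptor(properties, bonds, attachment=0, max_distance=3):
    """Sum an atomic property per bond distance (assumed combination rule)."""
    descriptor = [0.0] * (max_distance + 1)
    distances = bond_distances(len(properties), bonds, attachment)
    for atom, d in distances.items():
        if d <= max_distance:
            descriptor[d] += properties[atom]
    return descriptor


# Hypothetical partial charges for a small branched substituent;
# atom 0 is bonded to the core, atoms 2 and 3 both hang off atom 1.
charges = [0.1, -0.3, 0.2, 0.2]
bonds = [(0, 1), (1, 2), (1, 3)]
descriptor = r_group_descriptor(charges, bonds, attachment=0, max_distance=3)
```

A fixed `max_distance` gives every substituent a vector of the same length, which is what allows substituents of different sizes to be compared without alignment.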

R-group QSAR development
Descriptor generation: compounds are split into R-groups, atomic properties are assigned, and one descriptor is generated per R-group position (R1, R2, R3, ...). The descriptors for all positions of a compound form its row of property variables.

QSAR
A QSAR is a function relating descriptors to biological activity:

activity = f(molecular descriptors)

QSAR models
- explain which molecular features are responsible for activity
- help to design new compounds with enhanced features

R-group QSAR: data sets
Four data sets selected from the literature: benzodiazepines, serotonin, triazines, tropanes.

R-group QSAR: results
Table: PLS R², Q² and pred-r² for the benzodiazepine, serotonin, triazine and tropane (three sets) data, comparing R-group QSAR with HQSAR and CoMFA. The values are not preserved in the transcript.

Serotonin data set
- Exception: the serotonin data set (q² value not preserved); this is not surprising
- A literature result using CoMFA is quoted (r² and q² values not preserved)
- Substituents are large and structurally very diverse
- Demonstrates the limitation of R-group descriptors

Simulated lead optimisation
Flow chart:
- Initialisation: retrieve initial lead compounds
- Remaining compounds? If false, stop
- If true: generate QSAR
- Optimisation: select best predictions, then repeat
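The simulated lead-optimisation flow chart above can be sketched as a loop. This is an illustrative sketch only: a one-descriptor least-squares line stands in for the QSAR model (the study itself used PLS on R-group descriptors), and the function names, batch size and data are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares line through (xs, ys); a stand-in for the QSAR model."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    return lambda x: mean_y + slope * (x - mean_x)


def simulate_lead_optimisation(descriptors, activities, initial, batch_size=2):
    """Return the compound indices chosen in each iteration."""
    selected = list(initial)                      # Initialisation
    remaining = [i for i in range(len(descriptors)) if i not in selected]
    iterations = [list(initial)]
    while remaining:                              # Remaining compounds?
        model = fit_line([descriptors[i] for i in selected],
                         [activities[i] for i in selected])  # Generate QSAR
        remaining.sort(key=lambda i: model(descriptors[i]), reverse=True)
        batch, remaining = remaining[:batch_size], remaining[batch_size:]
        selected.extend(batch)                    # Select best predictions
        iterations.append(batch)
    return iterations


# Hypothetical data: one descriptor value and one measured activity per compound.
descriptors = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
activities = [5.0, 5.5, 6.1, 6.4, 7.0, 7.6]
iterations = simulate_lead_optimisation(descriptors, activities, initial=[0, 1])
```

In a retrospective analysis the activities of the selected batch are already known, so they can be added to the training data immediately, which is what `selected.extend(batch)` models.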

Simulated lead optimisation
- Retrospective analysis using three in-house datasets (programmes) with known time course
- Plots: distribution of activities (pIC50 values) per iteration, with a fixed number of compounds per iteration
- Box-plots improve the clarity of the visualisation (outliers, upper adjacent value, upper quartile, median, lower quartile, lower adjacent value)

Simulated lead optimisation: results
Two strategies:
- Chronological starting point
- Diverse starting point
Plots: activity per iteration for each programme, comparing the chemist's series with the chronological (chronos) and diverse starting points.
QSAR-supported lead optimisation identifies potent compounds more rapidly.

How good are 2D descriptors in practice? Datasets
- Datasets extracted from the corporate database; sizes range from tens to thousands of datapoints (exact counts not preserved in the transcript)
- Full DS: datasets that contain estimated data (e.g. activities reported only as greater than a concentration limit)
- Pruned DS: datasets that contain only exact measurements
- The slide also reports the overlap between the two groups, the average number of datapoints and the average activity spread in log(mol/l)

Descriptors
Several different descriptors were studied.
3D descriptors (GRID):
- Single conformation used (Concord); default settings
- DRY, O and N1 probes
2D descriptors:
- Counts of atom types
- FCFCx (three values of x; SciTegic): counts of extended connectivity fragments using pharmacophore atom typing; three levels of complexity
- HQSAR: count of fragment occurrences; default settings and hologram length
- Similog: descriptor based on counts of pharmacophore triplets
- A public key fingerprint
- A Novartis-developed fingerprint, optimised for searching/filtering in the corporate database
PCA is required for FCFCx and Similog due to the large number of descriptors.

Model building
Datasets are sorted by activity and split into training and test set:
- 50-50 experiment: every other data point is used for the test set (interpolation)
- Extrapolation experiment: the top and bottom of the dataset (percentage not preserved) are used for testing
PLS model (Sybyl implementation); the optimal number of components is determined using cross-validation of the training set.

Characterisation of model performance
Predictive performance of the model on the test set is measured by the multivariate predictive r²_pred and by the correlation of actual versus predicted values, r²_corr:

$$r^2_{\mathrm{pred}} = 1 - \frac{\sum_{i \in \mathrm{test}} \left(y_i^{\mathrm{pred}} - y_i^{\mathrm{act}}\right)^2}{\sum_{i \in \mathrm{test}} \left(y_i^{\mathrm{act}} - \bar{y}^{\mathrm{act}}\right)^2}$$

$$r^2_{\mathrm{corr}} = \frac{\left[\sum_{i \in \mathrm{test}} \left(y_i^{\mathrm{pred}} - \bar{y}^{\mathrm{pred}}\right)\left(y_i^{\mathrm{act}} - \bar{y}^{\mathrm{act}}\right)\right]^2}{\sum_{i \in \mathrm{test}} \left(y_i^{\mathrm{pred}} - \bar{y}^{\mathrm{pred}}\right)^2 \, \sum_{i \in \mathrm{test}} \left(y_i^{\mathrm{act}} - \bar{y}^{\mathrm{act}}\right)^2}$$

Validation through randomisation experiments
- A subset of the datasets was used with all descriptors
- Random test/training set splits: the median standard deviation of the r²_pred values is reported on the slide (value not preserved)
- y-scrambling: the r²_pred values dropped markedly (values not preserved)

Results
1. Performance of individual descriptors
2. Dependence on dataset characteristics
3. Comparing descriptors
4. r²_pred or r²_corr?

1. Performance of individual descriptors
50-50 experiment, full dataset.
- The percentage of good models depends on the r²_pred cut-off chosen for good models
- None of the descriptors is best all the time
- HQSAR and one other descriptor (name not preserved) perform best; these descriptors are biased towards features of the dataset
- FCFCx should be similar, but too many features introduce too much noise; one FCFC level is slightly better than the other two
- AlogP, the FCFC descriptors and Similog occupy the middle ground
- One descriptor (name not preserved) performs worst
Plot: percentage of good models against the r²_pred cut-off for good models, one curve per method and one for the total.
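The two performance measures and the "every other data point" split can be sketched directly from the definitions above. This is a minimal sketch with one assumption made explicit: the mean activity in r²_pred is taken over the test set, since the transcript does not say whether the training-set or test-set mean is used. Function names and numbers are illustrative.

```python
def r2_pred(actual, predicted):
    """1 - SS_residual / SS_total, using the test-set mean (assumption)."""
    mean_actual = sum(actual) / len(actual)
    ss_residual = sum((p - a) ** 2 for a, p in zip(actual, predicted))
    ss_total = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_residual / ss_total


def r2_corr(actual, predicted):
    """Squared Pearson correlation between actual and predicted values."""
    mean_actual = sum(actual) / len(actual)
    mean_predicted = sum(predicted) / len(predicted)
    covariance = sum((p - mean_predicted) * (a - mean_actual)
                     for a, p in zip(actual, predicted))
    var_predicted = sum((p - mean_predicted) ** 2 for p in predicted)
    var_actual = sum((a - mean_actual) ** 2 for a in actual)
    return covariance ** 2 / (var_predicted * var_actual)


def split_every_other(activities):
    """Sort by activity, then alternate points between training and test set."""
    ordered = sorted(activities)
    return ordered[0::2], ordered[1::2]


# Predictions that are perfectly correlated but shifted by one log unit.
actual = [5.0, 6.0, 7.0, 8.0]
predicted = [6.0, 7.0, 8.0, 9.0]
training, test = split_every_other([7.1, 5.2, 6.3, 8.4])
```

With these numbers r²_corr is 1 while r²_pred is only about 0.2: the correlation measure ignores a systematic offset, whereas r²_pred penalises it. That is the practical difference between the two statistics.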

1. Performance of individual descriptors: number of good models
Good models are those with r²_pred above a cut-off (value not preserved).
- For all descriptors, the slide reports the percentage and number of good models for the 50-50 and the extrapolation experiment, each for the pruned and the full datasets (values not preserved)
- Adding estimated data (= inactives) improves QSAR models (50-50 experiment)
Plot: percentage of good models per descriptor (Similog, HQSAR, FCFC levels and others) for pruned and full datasets in both experiments.

2. Descriptor dependence: dataset size
50-50 experiment. The red line is a local LOESS regression.
- Similar results were obtained for the extrapolation experiment
- The trend is as expected: good models are easier to achieve for larger datasets

2. Descriptor dependence: spread of biological activities
50-50 experiment. The red line is a local LOESS regression.
- Similar results were obtained for the extrapolation experiment
- The trend is as expected: a spread of about 1 log(mol/l) is the minimum requirement for good models; a larger spread is better

3. Comparing descriptors
Example: one descriptor versus another (names not preserved); 50-50 experiment, full dataset, negative r²_pred values set to 0.
- Highly correlated, yet the more complex descriptors are consistently better

3. Comparing descriptors: density plots
Graphs compare the r²_pred values calculated for different descriptors using densities; 50-50 experiment, full dataset.
- Two of the descriptors show high correlation, but a shifted curve
- The overview shows that Similog and one other descriptor are different
- Two of the FCFC levels behave very similarly
- The visualisation is too complex

3. Comparing descriptors: visualisation of correlation matrices
r²_pred is used to cluster the descriptors and to calculate the correlation matrix. Descriptors are reordered in each graph using hierarchical clustering. Colours (blue, green, yellow, orange) correspond to increasing correlation coefficients; the thresholds are not preserved.
- FCFC, Similog and one other descriptor are very different
- Two descriptors are highly correlated, but the quality of their models is very different

4. r²_pred or r²_corr?
HQSAR, full dataset. The black line is the identity. Similar results were obtained for the other descriptors.
- 50-50 experiment: only little difference between the two statistical measures
- Extrapolation experiment: accurate prediction of extrapolated activity data is difficult

Summary
- The study compares different QSAR methods using several hundred real-life datasets
- Nine different types of descriptors were used
- Some descriptors are better than others, but none is perfect
- Why is the percentage of good models so low? Quality of biological data; small-dataset QSAR is unreliable (but not useless!)
- Maybe it looks worse than it is: the percentage of good models depends on the r²_pred cut-off
- The 3D descriptor's performance was disappointing, but it may be improved if better care is taken to identify the correct conformations
- For a new dataset, try HQSAR (and one other descriptor, name not preserved) first: fast and often a good performance

Acknowledgements
- Christian Bartels, Bernd Rohde (GPS, NIBR, Basel, Switzerland): large-scale QSAR study
- Peter Ertl, Paul Selzer (GPS, NIBR, Basel, Switzerland): PSA model
- Steven Jelfs, Prof. Peter Willett, Dr. John Holliday, Linda Hirons (University of Sheffield, UK): R-group descriptors


More information

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression APPLICATION NOTE QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression GAINING EFFICIENCY IN QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIPS ErbB1 kinase is the cell-surface receptor

More information

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS DRUG DEVELOPMENT Drug development is a challenging path Today, the causes of many diseases (rheumatoid arthritis, cancer, mental diseases, etc.)

More information

CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS

CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS 159 CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS 6.1 INTRODUCTION The purpose of this study is to gain on insight into structural features related the anticancer, antioxidant

More information

Molecular Descriptors Theory and tips for real-world applications

Molecular Descriptors Theory and tips for real-world applications Molecular Descriptors Theory and tips for real-world applications Francesca Grisoni University of Milano-Bicocca, Dept. of Earth and Environmental Sciences, Milan, Italy ETH Zurich, Dept. of Chemistry

More information

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No. 12 1801 Ligand-based QSAR Studies on the Indolinones Derivatives as Inhibitors of the Protein Tyrosine

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

Data Quality Issues That Can Impact Drug Discovery

Data Quality Issues That Can Impact Drug Discovery Data Quality Issues That Can Impact Drug Discovery Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry,

More information

Bioinformatics Workshop - NM-AIST

Bioinformatics Workshop - NM-AIST Bioinformatics Workshop - NM-AIST Day 3 Introduction to Drug/Small Molecule Discovery Thomas Girke July 25, 2012 Bioinformatics Workshop - NM-AIST Slide 1/44 Introduction CMP Structure Formats Similarity

More information

C. Correct! The abbreviation Ar stands for an aromatic ring, sometimes called an aryl ring.

C. Correct! The abbreviation Ar stands for an aromatic ring, sometimes called an aryl ring. Organic Chemistry - Problem Drill 05: Drawing Organic Structures No. 1 of 10 1. What does the abbreviation Ar stand for? (A) Acetyl group (B) Benzyl group (C) Aromatic or Aryl group (D) Benzoyl group (E)

More information

This doctoral thesis is based on the following papers, which will be referred to in the text by their Roman numerals (I-V):

This doctoral thesis is based on the following papers, which will be referred to in the text by their Roman numerals (I-V): To my family List of Papers This doctoral thesis is based on the following papers, which will be referred to in the text by their Roman numerals (I-V): I Larsson, J., Gottfries, J., Bohlin, L., Backlund,

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.

More information

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options of the structure similarity

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options of the structure similarity OECD QSAR Toolbox v.4.1 Tutorial illustrating new options of the structure similarity Outlook Background Aims PubChem features The exercise Workflow 2 Background This presentation is designed to familiarize

More information

Biologically Relevant Molecular Comparisons. Mark Mackey

Biologically Relevant Molecular Comparisons. Mark Mackey Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist

More information

1. (18) Multiple choice questions. Please place your answer on the line preceding each question.

1. (18) Multiple choice questions. Please place your answer on the line preceding each question. CEM 5720 ame KEY Exam 2 ctober 21, 2015 Read all questions carefully and attempt those questions you are sure of first. Remember to proof your work; art work carries as much importance as written responses.

More information

Practical QSAR and Library Design: Advanced tools for research teams

Practical QSAR and Library Design: Advanced tools for research teams DS QSAR and Library Design Webinar Practical QSAR and Library Design: Advanced tools for research teams Reservationless-Plus Dial-In Number (US): (866) 519-8942 Reservationless-Plus International Dial-In

More information

CHAPTER-2. Drug discovery is a comprehensive approach wherein several disciplines

CHAPTER-2. Drug discovery is a comprehensive approach wherein several disciplines 36 CAPTER-2 Molecular Modeling Analog Based Studies Drug discovery is a comprehensive approach wherein several disciplines are used to design or discover the drugs. The R&D expenditure incurred to bring

More information

Web tools for Monomer selection, Library Design and Compound Acquisition. Andrew Leach GlaxoSmithKline Research and Development Stevenage

Web tools for Monomer selection, Library Design and Compound Acquisition. Andrew Leach GlaxoSmithKline Research and Development Stevenage Web tools for Monomer selection, Library Design and Compound Acquisition Andrew Leach GlaxoSmithKline Research and Development Stevenage Historical perspective Bench scientists unused to dealing with and

More information

Docking. GBCB 5874: Problem Solving in GBCB

Docking. GBCB 5874: Problem Solving in GBCB Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular

More information

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013 Chemical Space Space, Diversity, and Synthesis Jeremy Henle, 4/23/2013 Computational Modeling Chemical Space As a diversity construct Outline Quantifying Diversity Diversity Oriented Synthesis Wolf and

More information

Cheminformatics analysis and learning in a data pipelining environment

Cheminformatics analysis and learning in a data pipelining environment Molecular Diversity (2006) 10: 283 299 DOI: 10.1007/s11030-006-9041-5 c Springer 2006 Review Cheminformatics analysis and learning in a data pipelining environment Moises Hassan 1,, Robert D. Brown 1,

More information

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design Bela Torok Department of Chemistry University of Massachusetts Boston Boston, MA 1 Computer Aided Drug Design - Introduction Development

More information

3D QSAR analysis of quinolone based s- triazines as antimicrobial agent

3D QSAR analysis of quinolone based s- triazines as antimicrobial agent International Journal of PharmTech Research CODEN (USA): IJPRIF ISSN : 0974-4304 Vol.4, No.3, pp 1096-1100, July-Sept 2012 3D QSAR analysis of quinolone based s- triazines as antimicrobial agent Ramesh

More information

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships QSAR/QSPR modeling Alexandre Varnek Faculté de Chimie, ULP, Strasbourg, FRANCE QSAR/QSPR models Development Validation

More information

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration Chris Luscombe, Computational Chemistry GlaxoSmithKline Summary of Talk Traditional approaches SAR Free-Wilson

More information

molecules ISSN

molecules ISSN Molecules 2004, 9, 1004-1009 molecules ISSN 1420-3049 http://www.mdpi.org Performance of Kier-Hall E-state Descriptors in Quantitative Structure Activity Relationship (QSAR) Studies of Multifunctional

More information

Design and Synthesis of the Comprehensive Fragment Library

Design and Synthesis of the Comprehensive Fragment Library YOUR INNOVATIVE CHEMISTRY PARTNER IN DRUG DISCOVERY Design and Synthesis of the Comprehensive Fragment Library A 3D Enabled Library for Medicinal Chemistry Discovery Warren S Wade 1, Kuei-Lin Chang 1,

More information

BIOINF Drug Design 2. Jens Krüger and Philipp Thiel Summer Lecture 5: 3D Structure Comparison Part 1: Rigid Superposition, Pharmacophores

BIOINF Drug Design 2. Jens Krüger and Philipp Thiel Summer Lecture 5: 3D Structure Comparison Part 1: Rigid Superposition, Pharmacophores BIOINF 472 Drug Design 2 Jens Krüger and Philipp Thiel Summer 2014 Lecture 5: D Structure Comparison Part 1: Rigid Superposition, Pharmacophores Overview Comparison of D structures Rigid superposition

More information

Fast similarity searching making the virtual real. Stephen Pickett, GSK

Fast similarity searching making the virtual real. Stephen Pickett, GSK Fast similarity searching making the virtual real Stephen Pickett, GSK Introduction Introduction to similarity searching Use cases Why is speed so crucial? Why MadFast? Some performance stats Implementation

More information

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004

Computational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004 Computational Methods and Drug-Likeness Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004 The Problem Drug development in pharmaceutical industry: >8-12 years time ~$800m costs >90% failure

More information

A Review on Computational Methods in Developing Quantitative Structure-Activity Relationship (QSAR)

A Review on Computational Methods in Developing Quantitative Structure-Activity Relationship (QSAR) Navdeep Singh Sethi: A Review on Computational Methods in Developing Quantitative Structure-Activity 815 International Journal of Drug Design and Discovery Volume 3 Issue 3 July September 2012. 815-836

More information

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data

More information

Analyzing Small Molecule Data in R

Analyzing Small Molecule Data in R Analyzing Small Molecule Data in R Tyler Backman and Thomas Girke December 12, 2011 Analyzing Small Molecule Data in R Slide 1/49 Introduction CMP Structure Formats Similarity Searching Background Fragment

More information

Kinome-wide Activity Models from Diverse High-Quality Datasets

Kinome-wide Activity Models from Diverse High-Quality Datasets Kinome-wide Activity Models from Diverse High-Quality Datasets Stephan C. Schürer*,1 and Steven M. Muskal 2 1 Department of Molecular and Cellular Pharmacology, Miller School of Medicine and Center for

More information

Alkane/water partition coefficients and hydrogen bonding. Peter Kenny

Alkane/water partition coefficients and hydrogen bonding. Peter Kenny Alkane/water partition coefficients and hydrogen bonding Peter Kenny (pwk.pub.2008@gmail.com) Neglect of hydrogen bond strength: A recurring theme in medicinal chemistry Rule of 5 Rule of 3 Scoring functions

More information

Non-linear Prediction of Quantitative Structure Activity Relationships

Non-linear Prediction of Quantitative Structure Activity Relationships Non-linear Prediction of Quantitative Structure Activity Relationships Peter Tiňo, School of Computer Science, Birmingham University, Birmingham B15 2TT, UK Ian T. Nabney, Neural Computing Research Group,

More information

Table 8.2 Detailed Table of Characteristic Infrared Absorption Frequencies

Table 8.2 Detailed Table of Characteristic Infrared Absorption Frequencies Table 8.2 Detailed Table of Characteristic Infrared Absorption Frequencies The hydrogen stretch region (3600 2500 cm 1 ). Absorption in this region is associated with the stretching vibration of hydrogen

More information

Kernel-based Machine Learning for Virtual Screening

Kernel-based Machine Learning for Virtual Screening Kernel-based Machine Learning for Virtual Screening Dipl.-Inf. Matthias Rupp Beilstein Endowed Chair for Chemoinformatics Johann Wolfgang Goethe-University Frankfurt am Main, Germany 2008-04-11, Helmholtz

More information

Hologram and Receptor-Guided 3D QSAR Analysis of Anilinobipyridine JNK3 Inhibitors

Hologram and Receptor-Guided 3D QSAR Analysis of Anilinobipyridine JNK3 Inhibitors 3D QSAR Analysis of Anilinobipyridine JK3 Inhibitors Bull. Korean Chem. Soc. 2009, Vol. 30, o. 11 2739 Hologram and Receptor-Guided 3D QSAR Analysis of Anilinobipyridine JK3 Inhibitors Jae Yoon Chung,,

More information

Chemical library design

Chemical library design Chemical library design Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery Today,

More information

The reuse of structural data for fragment binding site prediction

The reuse of structural data for fragment binding site prediction The reuse of structural data for fragment binding site prediction Richard Hall 1 Motivation many examples of fragments binding in a phenyl shaped pocket or a kinase slot good shape complementarity between

More information

Xia Ning,*, Huzefa Rangwala, and George Karypis

Xia Ning,*, Huzefa Rangwala, and George Karypis J. Chem. Inf. Model. XXXX, xxx, 000 A Multi-Assay-Based Structure-Activity Relationship Models: Improving Structure-Activity Relationship Models by Incorporating Activity Information from Related Targets

More information

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,..

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,.. 3 Conformational Search Molecular Docking Simulate Annealing Ab Initio QM Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,.. Rino Ragno:

More information