Exploring the black box: structural and functional interpretation of QSAR models.


1 EMBL-EBI Industry workshop: In Silico ADMET prediction, 4-5 December 2014, Hinxton, UK
Exploring the black box: structural and functional interpretation of QSAR models (automatic exploration of datasets using QSAR)
Pavel Polishchuk, A.V. Bogatsky Physico-Chemical Institute of NAS of Ukraine, Odessa, Ukraine

2 Outline
- Introduction and existing approaches
- Structural QSAR interpretation: theory and practical examples
- Functional QSAR interpretation: theory, practical examples and comparison with docking studies
- Automatic exploration of chemical datasets

3 QSAR interpretation: interpretability vs. complexity
[Figure: model interpretability plotted against model complexity, labelled "Popular misbelief". Models shown along the complexity axis: MLR, PLS, DT, kNN, RF, SVM, ANN, model ensembles.]

4 Importance of QSAR structural interpretation
Extract SAR information in a chemically meaningful way:
- detection of structural alerts, creation of structural filters or sets of rules
- fragment-based drug design
Model validation:
- interpretation results should not contradict experimental observations

5 QSAR interpretation approaches
Model-specific approaches:
- Rule-based (decision tree)
- Regression coefficients (MLR, PLS)
- Latent variables (PLS)
- Weights and biases (ANN)
Model-independent approaches:
- Local gradients or partial derivatives:
  $C_i = \frac{\partial f(x)}{\partial x_i} \approx \frac{f(x_i + \Delta x_i) - f(x_i)}{\Delta x_i}$
  I.I. Baskin et al., SAR QSAR Environ Sci, 2002,
  G. Marcou et al., Molecular informatics, 2012,
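
The local-gradient idea can be sketched in a few lines of Python. This is a minimal illustration that assumes a forward-difference approximation of the partial derivative; `local_gradients` and `toy_model` are hypothetical names, and the toy function only stands in for a trained QSAR model.

```python
from typing import Callable, List, Sequence


def local_gradients(
    f: Callable[[Sequence[float]], float],
    x: Sequence[float],
    dx: float = 1e-4,
) -> List[float]:
    """Forward-difference estimate of C_i = df/dx_i at the point x."""
    base = f(x)
    contributions = []
    for i in range(len(x)):
        shifted = list(x)
        shifted[i] += dx  # perturb only descriptor i
        contributions.append((f(shifted) - base) / dx)
    return contributions


# Hypothetical stand-in for a trained model (not taken from the slides).
def toy_model(x: Sequence[float]) -> float:
    return 2.0 * x[0] - 0.5 * x[1] ** 2 + 1.0


# Gradients of the toy model at the descriptor vector (1.0, 3.0).
grads = local_gradients(toy_model, [1.0, 3.0])
```

Because only `f` is called, the same function works for any model type, which is what makes the approach model-independent.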

6 QSAR interpretation: common workflow
[Figure: model f(x) -> contributions of variables (Var_1, Var_2, ...) for each molecule -> structure-property relationship.]

7 Matched molecular pairs & molecular transformations
[Figure: two matched molecular pairs with their logS values (transformation label: H, OH); ΔlogS = 2.58 for one pair and ΔlogS = 1.59 for the other.]
Leach A.G. et al., J. Med. Chem. 2006, 49,
Sheridan R.P. et al., J. Chem. Inf. Model. 2006, 46,

8 Example dataset

9 Structural QSAR interpretation
[Figure: two examples of removing a fragment from a molecule; each shows logS_pred of the parent compound, logS_pred of the counterpart and their difference ΔlogS_pred.]
Polishchuk P.G. et al., Molecular Informatics, 2013, 32,

10 Structural QSAR interpretation
[Figure: a further fragment-removal example with ΔlogS_pred = 2.39.]
Polishchuk P.G. et al., Molecular Informatics, 2013, 32,
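
The fragment-removal scheme on slides 9 and 10 can be sketched as follows. This is a toy illustration under stated assumptions: a molecule is represented as a `Counter` of fragment labels rather than a molecular graph, and `toy_logs_model` is a hypothetical stand-in for a trained model. A real implementation would edit the structure and recompute descriptors for the counterpart compound.

```python
from collections import Counter
from typing import Callable

# Toy representation (an assumption): fragment label -> number of occurrences.
Molecule = Counter


def fragment_contribution(
    predict: Callable[[Counter], float],
    molecule: Counter,
    fragment: str,
) -> float:
    """Contribution(C) = f(A) - f(B), where B is A with one copy of C removed."""
    if molecule[fragment] < 1:
        raise ValueError(f"{fragment!r} is not present in the molecule")
    counterpart = molecule - Counter({fragment: 1})
    return predict(molecule) - predict(counterpart)


# Hypothetical non-additive model standing in for RF, SVM, etc.
def toy_logs_model(mol: Counter) -> float:
    value = -2.0 + 1.5 * mol["OH"] - 0.8 * mol["phenyl"]
    if mol["OH"] and mol["COOH"]:
        value += 0.4  # interaction term between two fragments
    return value


mol = Counter({"phenyl": 1, "OH": 1, "COOH": 1})
oh_contribution = fragment_contribution(toy_logs_model, mol, "OH")
```

The contribution is obtained purely from two predictions, so the interaction term of the toy model is included in it automatically, without access to the model internals.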

11 Limitations of existing descriptors (Dragon, etc.)
[Figure: fragment-removal examples, one where Dragon fails and one where Dragon is OK; computational MMP (H, COOH).]
Polishchuk P.G. et al., Molecular Informatics, 2013, 32,

12 Simplex representation of molecular structure (SiRMS)
[Figure: simplex generation example; atom-property labeling.]
Kuz'min V.E. et al., Journal of Molecular Modeling, 2005, 11,
Kuz'min V.E. et al., Journal of Computer-Aided Molecular Design, 2008, 22,

13 Local and global interpretation
Local interpretation: analysis of single compounds
Global interpretation: reveals trends across the dataset
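
One way to move from local to global interpretation is to pool the per-compound contributions of each fragment and summarize them. The sketch below assumes the median as the summary statistic, which is a choice made here for illustration and not stated on the slide; the compound names and values are hypothetical.

```python
from collections import defaultdict
from statistics import median
from typing import Dict, Iterable, List, Tuple


def global_trends(
    local: Iterable[Tuple[str, str, float]],
) -> Dict[str, Tuple[float, int]]:
    """Map each fragment to (median contribution, number of occurrences)."""
    grouped: Dict[str, List[float]] = defaultdict(list)
    for _compound, fragment, contribution in local:
        grouped[fragment].append(contribution)
    return {frag: (median(vals), len(vals)) for frag, vals in grouped.items()}


# Hypothetical local results: (compound id, fragment, contribution).
local_results = [
    ("mol_1", "nitro", 0.9),
    ("mol_2", "nitro", 0.7),
    ("mol_3", "nitro", 0.8),
    ("mol_1", "methyl", 0.0),
    ("mol_4", "methyl", 0.1),
]
trends = global_trends(local_results)
```

Keeping the occurrence count next to the median helps to judge how much weight a trend deserves.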

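14 Interpretation: fragmentation
Case 1. Specific ligand-target interactions exist or are important: NO (e.g. passive diffusion through membranes, solubility, lipophilicity, etc.). Orientation of the ligand relative to its target known: not relevant. Fragment selection and grouping: can be done by the researcher based on their own knowledge.
Case 2. Specific ligand-target interactions exist or are important: YES (ligand-receptor interactions, host-guest complexes, etc.). Orientation of the ligand relative to its target known: YES. Fragment selection and grouping: consider fragment positions relative to the target and observed or predicted interactions.
Case 3. Specific ligand-target interactions exist or are important: YES (as in case 2). Orientation of the ligand relative to its target known: NO. Fragment selection and grouping: MMP can be applied; it is silently assumed that all compounds have the same interaction mode.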

15 Examples of structural interpretation

16 Solubility (1033 compounds)
Endpoint: solubility, logS
5-fold external cross-validation results
[Table: R2_CV and RMSE of PLS, RF and SVM models built on SiRMS and Dragon descriptors; the numeric values are missing from this transcript.]
Polishchuk P.G. et al., Molecular Informatics, 2013,
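
For readers unfamiliar with the validation scheme, the sketch below shows one common way to obtain pooled out-of-fold predictions and compute R2 and RMSE from them. It is an illustration under assumptions: the exact protocol and R2 definition used for the slide are not given here, and `fit_mean` is a hypothetical baseline "model" that always predicts the training mean.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple


def k_fold_indices(n: int, k: int = 5, seed: int = 0) -> List[List[int]]:
    """Shuffle indices 0..n-1 and deal them into k folds of near-equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def out_of_fold_predictions(x, y, fit: Callable, k: int = 5, seed: int = 0) -> List[float]:
    """Predict every compound with a model that did not see it in training."""
    preds = [0.0] * len(y)
    for fold in k_fold_indices(len(y), k, seed):
        held_out = set(fold)
        train = [i for i in range(len(y)) if i not in held_out]
        model = fit([x[i] for i in train], [y[i] for i in train])
        for i in fold:
            preds[i] = model(x[i])
    return preds


def r2_and_rmse(y_true: Sequence[float], y_pred: Sequence[float]) -> Tuple[float, float]:
    """R2 = 1 - SS_res / SS_tot and RMSE = sqrt(SS_res / n)."""
    n = len(y_true)
    mean = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot, math.sqrt(ss_res / n)


# Hypothetical baseline: always predicts the mean of the training responses.
def fit_mean(x_train, y_train):
    mu = sum(y_train) / len(y_train)
    return lambda _x: mu


y = [float(i) for i in range(20)]
x = [[v] for v in y]
pred = out_of_fold_predictions(x, y, fit_mean)
r2_cv, rmse_cv = r2_and_rmse(y, pred)
```

A mean-only baseline cannot score above zero in this setting, which makes it a convenient sanity check before comparing real models.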

17 Mutagenicity (Ames, 4361 compounds)
5-fold external cross-validation results
[Table: balanced accuracy of RF and SVM models built on SiRMS and Dragon descriptors; the numeric values are missing from this transcript.]
Polishchuk P.G. et al., Molecular Informatics, 2013,
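
Balanced accuracy is conventionally the mean of sensitivity and specificity, which makes it robust to unequal class sizes. A minimal sketch for binary labels follows; the example labels are hypothetical and not taken from the Ames dataset.

```python
from typing import Sequence


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of sensitivity and specificity for binary labels (1 = active)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Hypothetical labels: 4 mutagens and 2 non-mutagens.
observed = [1, 1, 1, 1, 0, 0]
predicted = [1, 1, 1, 0, 0, 1]
ba = balanced_accuracy(observed, predicted)
```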

18 Combined contribution (effect) of fragments
RF + SiRMS
[Figure: three fragment examples labelled Contribution = 0 (non-mutagen), Contribution = 0 (non-mutagen) and Contribution = 1 (mutagen).]

19 Questions
How does the fragment influence the property? WHY?!

20 Functional interpretation of QSAR models
Structural interpretation (A - C = B):
  A: $\mathrm{pIC}_{50} = f(A_1, A_2, A_3) = x$
  B: $\mathrm{pIC}_{50} = f(B_1, B_2, B_3) = y$
  $\mathrm{Contribution}(C) = x - y$
Functional interpretation (A - C = B):
  $\mathrm{pIC}_{50} = f(A_1, A_2, A_3) = x$
  $\mathrm{pIC}_{50} = f(A_1, A_2, B_3) = y$
  $\mathrm{Contribution}_3(C) = x - y$
1, 2, 3 are groups of descriptors representing different physico-chemical factors (charge, H-bonding, etc.) of compounds A and B.
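
The descriptor-swapping step of functional interpretation can be sketched as below. The sketch assumes descriptors are stored per group in a dictionary; the group names, descriptor values and `toy_pic50_model` are hypothetical and only illustrate the mechanics.

```python
from typing import Callable, Dict, List

Descriptors = Dict[str, List[float]]  # descriptor group -> values


def functional_contributions(
    predict: Callable[[Descriptors], float],
    desc_a: Descriptors,
    desc_b: Descriptors,
) -> Dict[str, float]:
    """For each group k: f(A) - f(A with group k taken from B)."""
    x = predict(desc_a)
    result = {}
    for group in desc_a:
        mixed = dict(desc_a)
        mixed[group] = desc_b[group]  # swap a single descriptor group
        result[group] = x - predict(mixed)
    return result


# Hypothetical model: weighted sum over three descriptor groups.
def toy_pic50_model(d: Descriptors) -> float:
    return (
        2.0 * sum(d["electrostatic"])
        + 1.0 * sum(d["hbond"])
        - 0.5 * sum(d["hydrophobic"])
    )


# A is the parent compound, B the counterpart without fragment C.
desc_a = {"electrostatic": [1.0, 0.5], "hbond": [2.0], "hydrophobic": [3.0]}
desc_b = {"electrostatic": [0.5, 0.5], "hbond": [1.0], "hydrophobic": [3.0]}
by_factor = functional_contributions(toy_pic50_model, desc_a, desc_b)
```

In this toy case the fragment contributes through the electrostatic and H-bonding groups and not through hydrophobicity, which is the kind of statement functional interpretation aims to support.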

21 Antagonists of fibrinogen receptor (functional interpretation example)

22 Antagonists of fibrinogen receptor: dataset
338 compounds
[Figure: fragment examples for Arg-Gly-Asp: Arg-mimetic, linker, Asp-mimetic.]

23 Antagonists of fibrinogen receptor: models
5-fold external cross-validation results
[Table: R2 and RMSE for RF, SVM (RBF kernel), SVM (linear) and PLS; the numeric values are missing from this transcript.]

24 Structural interpretation (global)
[Figure: contributions of linker, Asp-mimetic and Arg-mimetic fragments.]

25 Functional interpretation (global)
[Figure: contributions of Arg-mimetic, linker and Asp-mimetic fragments.]

26 Functional interpretation of RF model (local)
[Figure: RF model; labels Asp224, Phe160, Tyr190, Arg, Ser225, Mg; factors: electrostatic, H-bonding, hydrophobicity, polarizability.]

27 Functional interpretation of SVM model (local)
[Figure: SVM-RBF model; labels Asp224, Phe160, Tyr190, Arg, Ser225, Mg; factors: electrostatic, H-bonding, hydrophobicity, polarizability.]

28 Automatic exploration of datasets of chemical compounds (dataset mining)

29 SiRMS-QSAR software: QSAR model building
[Screenshot with callouts 1 and 2; note: utilizes ncpu]

30 SiRMS-QSAR software: calculation of fragment contributions
[Screenshot; note: not implemented yet]

31 SiRMS-QSAR software: plot of fragment contributions
Structural interpretation; functional interpretation

32 SiRMS-QSAR software: external visualization tool

33 Interpretation workflow scheme
1. Create an sdf file with property values
2. Build models (regression or classification)
3. Look at the model statistics (if all models are bad, reconsider the dataset)
4. Calculate fragment contributions
5. Plot contributions for the desired models, selected from the statistically significant ones
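
The model-selection and plotting steps of this workflow can be approximated in code as a filter followed by averaging. The sketch below is not the SiRMS-QSAR implementation: the R2 threshold of 0.6, the simple mean used as consensus and all numbers are assumptions made for illustration.

```python
from statistics import mean
from typing import Dict


def consensus_contributions(
    model_stats: Dict[str, float],
    contributions: Dict[str, Dict[str, float]],
    min_r2: float = 0.6,
) -> Dict[str, float]:
    """Average fragment contributions over models that pass the R2 filter."""
    selected = [name for name, r2 in model_stats.items() if r2 >= min_r2]
    if not selected:
        raise ValueError("no acceptable models; reconsider the dataset")
    fragments = contributions[selected[0]].keys()
    return {
        frag: mean(contributions[name][frag] for name in selected)
        for frag in fragments
    }


# Hypothetical cross-validation R2 per model.
stats = {"RF": 0.80, "GBM": 0.75, "SVM": 0.40}
# Hypothetical fragment contributions per model.
per_model = {
    "RF": {"OH": 1.0, "Cl": -0.4},
    "GBM": {"OH": 0.8, "Cl": -0.6},
    "SVM": {"OH": 0.1, "Cl": 0.3},
}
consensus = consensus_contributions(stats, per_model)
```

Here the SVM model fails the filter, so only RF and GBM contribute to the consensus values that would be plotted.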

34 ADME/Tox examples (SAR trends, global interpretation)
Datasets taken from:
1) Cheng W. et al., J. Chem. Inf. Model., 2012,
2) Kovdienko N.A. et al., Molecular informatics, 2010,
3) Polishchuk P.G. et al., J. Chem. Inf. Model., 2009,
4) in-house data

35 Permeability (structural interpretation)
Consensus of RF, GBM and SVM models

36 Permeability (functional interpretation)
Consensus of RF, GBM and SVM models

37 Toxicity (structural interpretation)
Consensus of RF, GBM and SVM models

38 Toxicity (functional interpretation)
Consensus of RF, GBM and SVM models

39 Summary
                                          Descriptors
                                          SiRMS    Others (Dragon, CDK, etc.)
Models          Regression                +        +
                Classification            +        +
Fragments       Terminal (substituent)    +        +
                Scaffold/linker           +        -
Interpretation  Structural                +        +
                Functional                +        ?

40 Conclusions
- Almost any QSAR model can be interpreted using the proposed schemes.
- Results of structural and functional interpretation obtained from different models correlate well with each other and correspond to observed trends.
- Structural interpretation makes it possible to reveal SAR trends, rank fragments, find potential structural alerts, etc.
- Functional interpretation may suggest which factors dominate and influence the investigated property.

41 Perspectives
- Smart automatic fragmentation approaches
- Detection of potential activity cliffs in local interpretation
- Testing on other types of descriptors
- Use of datasets that include mixtures of compounds
- Application of this approach to a wider range of structurally diverse datasets with different end-points, and comparison with MMP

42 Useful web links
A.V. Bogatsky Physico-Chemical Institute, Chemoinformatic group:
SiRMS project on GitHub:
SiRMS-QSAR (dataset analysis):
External web-based visualization:

43 Acknowledgement
A.V. Bogatsky Physico-Chemical Institute (Odessa, Ukraine): Prof. V. Kuz'min, Dr. T. Khristova, Dr. L. Ognichenko, A. Kosinskaya, E. Mokshina, M. Kulinskiy
Strasbourg University (France): Prof. A. Varnek, Dr. D. Horvath

Structural interpretation of QSAR models a universal approach

Structural interpretation of QSAR models a universal approach Methods and Applications of Computational Chemistry - 5 Kharkiv, Ukraine, 1 5 July 2013 Structural interpretation of QSAR models a universal approach Victor Kuz min, Pavel Polishchuk, Anatoly Artemenko,

More information

Interpretation of QSAR models

Interpretation of QSAR models BIGCHEM, online lecture, 7 Febuary 2018 Interpretation of QSAR models Pavel Polishchuk Institute of Molecular and Translational Medicine Faculty of Medicine and Dentistry Palacky University pavlo.polishchuk@upol.cz

More information

Structure-Activity Modeling - QSAR. Uwe Koch

Structure-Activity Modeling - QSAR. Uwe Koch Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity

More information

Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule

Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule Frank Hoonakker 1,3, Nicolas Lachiche 2, Alexandre Varnek 3, and Alain Wagner 3,4 1 Chemoinformatics laboratory,

More information

UniStra activities within the BigChem project:

UniStra activities within the BigChem project: UniStra activities within the Bighem project: data visualization and modeling using GTM approach; chemical reactions mining with ondensed Graphs of Reactions Alexandre Varnek Laboratory of hemoinformatics,

More information

Machine learning for ligand-based virtual screening and chemogenomics!

Machine learning for ligand-based virtual screening and chemogenomics! Machine learning for ligand-based virtual screening and chemogenomics! Jean-Philippe Vert Institut Curie - INSERM U900 - Mines ParisTech In silico discovery of molecular probes and drug-like compounds:

More information

In Silico Prediction of ADMET properties with confidence: potential to speed-up drug discovery

In Silico Prediction of ADMET properties with confidence: potential to speed-up drug discovery In Silico Prediction of ADMET properties with confidence: potential to speed-up drug discovery Igor V. Tetko Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) Institute

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

In silico pharmacology for drug discovery

In silico pharmacology for drug discovery In silico pharmacology for drug discovery In silico drug design In silico methods can contribute to drug targets identification through application of bionformatics tools. Currently, the application of

More information

An Integrated Approach to in-silico

An Integrated Approach to in-silico An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc. 14600 Catalina St., San Leandro, CA 94577 Goals

More information

Screening and prioritisation of substances of concern: A regulators perspective within the JANUS project

Screening and prioritisation of substances of concern: A regulators perspective within the JANUS project Für Mensch & Umwelt LIFE COMBASE workshop on Computational Tools for the Assessment and Substitution of Biocidal Active Substances of Ecotoxicological Concern Screening and prioritisation of substances

More information

Virtual screening in drug discovery

Virtual screening in drug discovery Virtual screening in drug discovery Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery

More information

Chemical Space: Modeling Exploration & Understanding

Chemical Space: Modeling Exploration & Understanding verview Chemical Space: Modeling Exploration & Understanding Rajarshi Guha School of Informatics Indiana University 16 th August, 2006 utline verview 1 verview 2 3 CDK R utline verview 1 verview 2 3 CDK

More information

Machine Learning Concepts in Chemoinformatics

Machine Learning Concepts in Chemoinformatics Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics

More information

OCHEM. Product features and highlights

OCHEM. Product features and highlights OCHEM Product features and highlights Content OCHEM at a glance (components and Data upload) How to run models for ADME prediction? How to build models (Regression, Classification) and get Applicability

More information

Plan. Day 2: Exercise on MHC molecules.

Plan. Day 2: Exercise on MHC molecules. Plan Day 1: What is Chemoinformatics and Drug Design? Methods and Algorithms used in Chemoinformatics including SVM. Cross validation and sequence encoding Example and exercise with herg potassium channel:

More information

ADMET property estimation, oral bioavailability predictions, SAR elucidation, & QSAR model building software www.simulations-plus.com +1-661-723-7723 What is? is an advanced computer program that enables

More information

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt ACS Spring National Meeting. COMP, March 16 th 2016 Matthew Segall, Peter Hunt, Ed Champness matt.segall@optibrium.com Optibrium,

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Introduction. OntoChem

Introduction. OntoChem Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads

More information

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships QSAR/QSPR modeling Alexandre Varnek Faculté de Chimie, ULP, Strasbourg, FRANCE QSAR/QSPR models Development Validation

More information

Chemical library design

Chemical library design Chemical library design Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery Today,

More information

E. Muratov 1, E. Varlamova 2, A. Artemenko 2, D. Fourches 1, V. Kuz'min 2, A. Tropsha 1

E. Muratov 1, E. Varlamova 2, A. Artemenko 2, D. Fourches 1, V. Kuz'min 2, A. Tropsha 1 E. Muratov 1, E. Varlamova 2, A. Artemenko 2, D. Fourches 1, V. Kuz'min 2, A. Tropsha 1 1 University of orth Carolina, Chapel Hill, C, UA; 2 A.V. Bogatsky Physical-Chemical Institute AU, dessa, Ukraine;

More information

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov CADD Group Chemical Biology Laboratory Frederick National Laboratory for Cancer Research National Cancer Institute, National Institutes

More information

Translating Methods from Pharma to Flavours & Fragrances

Translating Methods from Pharma to Flavours & Fragrances Translating Methods from Pharma to Flavours & Fragrances CINF 27: ACS National Meeting, New Orleans, LA - 18 th March 2018 Peter Hunt, Edmund Champness, Nicholas Foster, Tamsin Mansley & Matthew Segall

More information

OECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model

OECD QSAR Toolbox v.4.1. Step-by-step example for building QSAR model OECD QSAR Toolbox v.4.1 Step-by-step example for building QSAR model Background Objectives The exercise Workflow of the exercise Outlook 2 Background This is a step-by-step presentation designed to take

More information

Predicting Binding Affinity of CSAR Ligands Using Both Structure- Based and Ligand-Based Approaches

Predicting Binding Affinity of CSAR Ligands Using Both Structure- Based and Ligand-Based Approaches pubs.acs.org/jcim Predicting Binding Affinity of CSAR Ligands Using Both Structure- Based and Ligand-Based Approaches Denis Fourches, Eugene Muratov,, Feng Ding, Nikolay V. Dokholyan, and Alexander Tropsha*,

More information

DOCKING TUTORIAL. A. The docking Workflow

DOCKING TUTORIAL. A. The docking Workflow 2 nd Strasbourg Summer School on Chemoinformatics VVF Obernai, France, 20-24 June 2010 E. Kellenberger DOCKING TUTORIAL A. The docking Workflow 1. Ligand preparation It consists in the standardization

More information

Structural biology and drug design: An overview

Structural biology and drug design: An overview Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved

More information

Hierarchical QSAR technology based on the Simplex representation of molecular structure

Hierarchical QSAR technology based on the Simplex representation of molecular structure J Comput Aided Mol Des (2008) 22:403 421 DI 10.1007/s10822-008-9179-6 Hierarchical QSAR technology based on the Simplex representation of molecular structure V. E. Kuz min Æ A. G. Artemenko Æ E.. Muratov

More information

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression APPLICATION NOTE QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression GAINING EFFICIENCY IN QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIPS ErbB1 kinase is the cell-surface receptor

More information

Applications of multi-class machine

Applications of multi-class machine Applications of multi-class machine learning models to drug design Marvin Waldman, Michael Lawless, Pankaj R. Daga, Robert D. Clark Simulations Plus, Inc. Lancaster CA, USA Overview Applications of multi-class

More information

Materials Informatics: Statistical Modeling in Material Science

Materials Informatics: Statistical Modeling in Material Science Materials Informatics: Statistical Modeling in Material Science Hanoch Senderowitz Bar-Ilan University, Israel Strasbourg Summer School in Cheminformatics, June 2016, Strasbourg, France Presentation Goals

More information

Emerging patterns mining and automated detection of contrasting chemical features

Emerging patterns mining and automated detection of contrasting chemical features Emerging patterns mining and automated detection of contrasting chemical features Alban Lepailleur Centre d Etudes et de Recherche sur le Médicament de Normandie (CERMN) UNICAEN EA 4258 - FR CNRS 3038

More information

Docking. GBCB 5874: Problem Solving in GBCB

Docking. GBCB 5874: Problem Solving in GBCB Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular

More information

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös KIME-based scoring functions in Muse 3.0 KIME User Group Meeting 2013 Fabian Bös Certara Mission: End-to-End Model-Based Drug Development Certara was formed by acquiring and integrating Tripos, Pharsight,

More information

Estimating Predictive Uncertainty for Ensemble Regression Models by Gamma Error Analysis

Estimating Predictive Uncertainty for Ensemble Regression Models by Gamma Error Analysis Estimating Predictive Uncertainty for Ensemble Regression Models by Gamma Error Analysis Bob Clark & Marvin Waldman Simulations Plus, Inc. Lancaster CA USA bob@simulations-plus.com EuroQSAR 2018 The Standard

More information

Ligand-receptor interactions

Ligand-receptor interactions University of Silesia, Katowice, Poland 11 22 March 2013 Ligand-receptor interactions Dr. Pavel Polishchuk A.V. Bogatsky Physico-Chemical Institute of National Academy of Sciences of Ukraine Odessa, Ukraine

More information

Advanced Medicinal Chemistry SLIDES B

Advanced Medicinal Chemistry SLIDES B Advanced Medicinal Chemistry Filippo Minutolo CFU 3 (21 hours) SLIDES B Drug likeness - ADME two contradictory physico-chemical parameters to balance: 1) aqueous solubility 2) lipid membrane permeability

More information

The PhilOEsophy. There are only two fundamental molecular descriptors

The PhilOEsophy. There are only two fundamental molecular descriptors The PhilOEsophy There are only two fundamental molecular descriptors Where can we use shape? Virtual screening More effective than 2D Lead-hopping Shape analogues are not graph analogues Molecular alignment

More information

Quantitative Structure-Activity Relationship (QSAR) computational-drug-design.html

Quantitative Structure-Activity Relationship (QSAR)  computational-drug-design.html Quantitative Structure-Activity Relationship (QSAR) http://www.biophys.mpg.de/en/theoretical-biophysics/ computational-drug-design.html 07.11.2017 Ahmad Reza Mehdipour 07.11.2017 Course Outline 1. 1.Ligand-

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Matrix Data: Prediction Instructor: Yizhou Sun yzsun@ccs.neu.edu September 14, 2014 Today s Schedule Course Project Introduction Linear Regression Model Decision Tree 2 Methods

More information

Modeling Mutagenicity Status of a Diverse Set of Chemical Compounds by Envelope Methods

Modeling Mutagenicity Status of a Diverse Set of Chemical Compounds by Envelope Methods Modeling Mutagenicity Status of a Diverse Set of Chemical Compounds by Envelope Methods Subho Majumdar School of Statistics, University of Minnesota Envelopes in Chemometrics August 4, 2014 1 / 23 Motivation

More information

(e.g.training and prediction set, algorithm, ecc...). 2.9.Availability of another QMRF for exactly the same model: No other information available

(e.g.training and prediction set, algorithm, ecc...). 2.9.Availability of another QMRF for exactly the same model: No other information available QMRF identifier (JRC Inventory):To be entered by JRC QMRF Title: Insubria QSAR PaDEL-Descriptor model for prediction of NitroPAH mutagenicity. Printing Date:Jan 20, 2014 1.QSAR identifier 1.1.QSAR identifier

More information

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom verview D-QSAR Definition Examples Features counts Topological indices D fingerprints and fragment counts R-group descriptors ow good are D descriptors in practice? Summary Peter Gedeck ovartis Institutes

More information

QSAR in Green Chemistry

QSAR in Green Chemistry QSAR in Green Chemistry Activity Relationship QSAR is the acronym for Quantitative Structure-Activity Relationship Chemistry is based on the premise that similar chemicals will behave similarly The behavior/activity

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Biologically Relevant Molecular Comparisons. Mark Mackey

Biologically Relevant Molecular Comparisons. Mark Mackey Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist

More information

User Guide for LeDock

User Guide for LeDock User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,

More information

CheS-Mapper 2.0 for visual validation of (Q)SAR models

CheS-Mapper 2.0 for visual validation of (Q)SAR models Gütlein et al. Journal of Cheminformatics 2014, 6:41 SOFTWARE Open Access CheS-Mapper 2.0 for visual validation of (Q)SAR models Martin Gütlein 1, Andreas Karwath 2 and Stefan Kramer 2* Abstract Background:

More information

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft Pose and affinity prediction by ICM in D3R GC3 Max Totrov Molsoft Pose prediction method: ICM-dock ICM-dock: - pre-sampling of ligand conformers - multiple trajectory Monte-Carlo with gradient minimization

More information

Qsar study of anthranilic acid sulfonamides as inhibitors of methionine aminopeptidase-2 using different chemometrics tools

Qsar study of anthranilic acid sulfonamides as inhibitors of methionine aminopeptidase-2 using different chemometrics tools Qsar study of anthranilic acid sulfonamides as inhibitors of methionine aminopeptidase-2 using different chemometrics tools RAZIEH SABET, MOHSEN SHAHLAEI, AFSHIN FASSIHI a Department of Medicinal Chemistry,

More information

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.

More information

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction

More information

Feature combination networks for the interpretation of statistical machine learning models: application to Ames mutagenicity

Feature combination networks for the interpretation of statistical machine learning models: application to Ames mutagenicity Webb et al. Journal of Cheminformatics 2014, 6:8 RESEARCH ARTICLE Feature combination networks for the interpretation of statistical machine learning models: application to Ames mutagenicity Samuel J Webb

More information

Towards Physics-based Models for ADME/Tox. Tyler Day

Towards Physics-based Models for ADME/Tox. Tyler Day Towards Physics-based Models for ADME/Tox Tyler Day Overview Motivation Application: P450 Site of Metabolism Application: Membrane Permeability Future Directions and Applications Motivation Advantages

More information

A Review on Computational Methods in Developing Quantitative Structure-Activity Relationship (QSAR)

A Review on Computational Methods in Developing Quantitative Structure-Activity Relationship (QSAR) Navdeep Singh Sethi: A Review on Computational Methods in Developing Quantitative Structure-Activity 815 International Journal of Drug Design and Discovery Volume 3 Issue 3 July September 2012. 815-836

More information

RSC Publishing. Principles and Applications. In Silico Toxicology. Liverpool John Moores University, Liverpool, Edited by

RSC Publishing. Principles and Applications. In Silico Toxicology. Liverpool John Moores University, Liverpool, Edited by In Silico Toxicology Principles and Applications Edited by Mark T. D. Cronin and Judith C. Madden Liverpool John Moores University, Liverpool, UK RSC Publishing Contents Chapter 1 In Silico Toxicology

More information

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009 ICM-Chemist-Pro How-To Guide Version 3.6-1h Last Updated 12/29/2009 ICM-Chemist-Pro ICM 3D LIGAND EDITOR: SETUP 1. Read in a ligand molecule or PDB file. How to setup the ligand in the ICM 3D Ligand Editor.

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

OECD QSAR Toolbox v.3.3. Step-by-step example of how to build a userdefined

OECD QSAR Toolbox v.3.3. Step-by-step example of how to build a userdefined OECD QSAR Toolbox v.3.3 Step-by-step example of how to build a userdefined QSAR Background Objectives The exercise Workflow of the exercise Outlook 2 Background This is a step-by-step presentation designed

More information

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments BioSolveIT Biology Problems Solved using Information Technology A Combinatorial Approach for andling of Protonation and Tautomer Ambiguities in Docking Experiments Ingo Dramburg BioSolve IT Gmb An der

More information

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure

More information
