Validation and Predictivity of QSAR Models

Size: px
Start display at page:

Download "Validation and Predictivity of QSAR Models"

Transcription

1 Validation and Predictivity of QSAR Models Hugo Kubinyi, Germany home/kubinyi 15th EURO-QSAR Symposium Istanbul, Turkey, Sept Validation of QSAR Models Statistical Parameters r, s and F values, confidence intervals Topliss criteria number of tested and included variables Hansch-Unger criteria meaningful parameters, statistical significance, Ockham s razor, biophysical model Crossvalidation: Q 2 and s PRESS values (Bootstrapping) Lateral Validation Y scrambling External Predictions

2 A Common Situation A chemist synthesizes about 30 compounds. The biologist determines the activity values. Both ask the chemoinformatician to derive a QSAR model. The chemoinformatician loads 1500 variables (e.g. from the program DRAGON, Roberto Todeschini) and derives a QSAR model, containing only a few variables, which meets all statistical criteria. Chemist, biologist and chemoinformatician publish the results. Everybody is happy. The Selwood Data Set n = 31 compounds and k = 53 independent variables. Theoretically, there are: 53 one-variable models, 1,378 two-variable models, 23,426 three-variable models, 292,825 four-variable models,..., 22,957,480 six-variable models,..., in total 7,160,260,814,092,303 regression models, containing one to 29 variables, selected from 53 X-variables.

3 The 53 X Variables of the Selwood Data Set ATCH1 - ATCH10 = partial atomic charges DIPV_X, DIPV_Y and DIPV_Z = dipole vectors DIPMOM = dipole moment ESDL1 - ESDL10 = electrophilic superdelocalizability NSDL1 - NSDL10 = nucleophilic superdelocalizability VDWVOL = van der Waals volume SURF_A = surface area MOFI_X, MOFI_Y and MOFI_Z = moments of inertia PEAX_X, PEAX_Y and PEAX_Z = ellipsoid axes MOL_WT = molecular weight S8_1DX, S8_1DY and S8_1DZ = substituent dimensions S8_1CX, S8_1CY and S8_1CZ = substituent centers LOGP = partition coefficient M_PNT = melting point SUM_F and SUM_R = sums of the F and R constants Ockham s Razor - Keep Things Simple! only models with up to four variables are considered in the following simulations (317,682 different models = complete coverage) Pluralitas non est ponenda sine necessitate ( avoid complexity if not necessary)

4 Questions Can we derive good (statistically valid) models: yes Do our models have internal predictivity (Q 2 values): yes Are these models better than models from scrambled or random data (y, x, y and x): yes Are 53 X variables too many to select from: no, fine Can our models predict a test set (r 2 pred value): not at all Is there a relationship between internal and external predictivity: by no means Results of PLS and Regression Analyses a) PLS, all variables (5 components) r = 0.929; s = 0.335; F = Q 2 = 0.279; s PRESS = b) Regression (best 3-variable model) r = 0.849; s = 0.460; F = Q 2 = 0.647; s PRESS = c) PLS, reduced variable set (5 components) r = 0.909; s = 0.376; F = Q 2 = 0.671; s PRESS = 0.519

5 Y Scrambling - Random Permutations of Y Values will y vs. y correlations disturb the result? 99% r 2 < % r 2 < 0.12 X block remains unchanged scrambled y vector scrambling original y vector Scrambling and Random Y and X Values 25 F F = 23.3 (best 3-variable model) original model new models 1000 runs for each group, best models y-scrambling, y y x x y+x y+x trial scr rnd scr rnd scr rnd 90% 95% 99% below F value

6 F Models Selected from Random X Variables F = 23.3 (best 3-variable model) 1000 runs for each group, best models from variables 90% 95% 99% below F value The Real Situation A chemist prepares some 20 compounds. The biologist determines the activity values. They both ask the chemoinformatician to derive a QSAR model. The resulting model does not contain more than four variables, is selected from about fifty variables and is validated by all statistical criteria, including LOO cross-validation and y scrambling. How good is the internal predictivity, how good is the external predicitivity for a test set of 10 compounds?

7 100 % Training Sets, Internal Predictivity (LOO) 1000 runs for each group. New model selected for every run = 1.9 billion runs Q 2 > n = Q 2 > 0.5 Q 2 > 0 External vs. Internal Predictivity

8 External vs. Internal Predictivity The Kubinyi Paradox J. H. van Drie, Curr. Pharm. Des. 9, (2003); J. H. van Drie, in: Computational Medicinal Chemistry for Drug Discovery, P. Bultinck et al., Eds., Marcel Dekker, 2004, pp Data from H. Kubinyi et al., J. Med. Chem. 41, (1998). Test vs. Training Set Predictivity (A. Doweyko, ACS 2004) Test Set (r 2 pred) Training Set (q 2 )

9 100 % 80 Test Sets, External Predictivity n = 1 n = 2 n = 3 n = 5 n = 10 n = runs for each group r 2 pred > 0.6 r 2 pred > 0.5 r 2 pred > 0 External vs. Internal Predictivity, Selwood Data r 2 pred 56 models Test set (n = 10) predictions from best training set models (n = 21) red area: 56 models (out of 1000) The best fit models are not the best ones for prediction! Q 2

10 Answers to Our Questions 1) We can derive good (statistically valid) models 2) The models have good internal predictivity 3) These models are significantly better than models from scrambled or random data (y, x, y and x) 4) 53 X variables are not too many to select from 5) The models have no external predictivity at all! 6) There is no relationship between internal and external predictivity Reasons? Explanations? Help? S. H. Unger and C. Hansch J. Med. Chem. 16, (1973) One must rely heavily on statistics in formulating a quantitative model but, at each critical step in constructing the model, one must set aside statistics and ask questions.... without a qualitative perspective one is apt to generate statistical unicorns, beasts that exist on paper but not in reality.... it has recently become all too clear that one can correlate a set of dependent variables using random numbers as dependent variables. Such correlations meet the usual criteria of high significance...

11 Good and Bad Guys in Regression Analysis Y outlier in the test set: r 2, Q 2 good r 2 pred poor the bad guy outlier in the training set: r 2, Q 2 poor r 2 pred good X External vs. Internal Predictivity Corticosteroid-binding globulin affinities of steroids log 1/CBG = (±0.46) [4,5 >C=C<] (±0.36) (n = 31; r = 0.838; s = 0.600; F = 68.28; Q 2 = 0.667; s PRESS = 0.634) Training set # 1-21; test set # Q 2 = 0.726; r 2 pred = 0.477; s PRED = Training set # 1-12 and 23-31; test set # Q 2 = 0.454; r 2 pred = 0.909; s PRED = H. Kubinyi, in: Computer-Assisted Lead Finding and Optimization van de Waterbeemd, H., Testa, B., and Folkers, G., Eds.; VHChA and VCH, Basel, Weinheim, 1997; pp. 9-28

12 Summary, Conclusions and Recommendations Apply the Unger and Hansch recommendations: 1. Selection of meaningful variables 2. Elimination of interrelated variables 3. Justification of variable choices by statistics 4. Principle of parsimony (Ockham s Razor) 5. Number of variables to choose from 6. Number of variables in the model 7. Qualitative biophysical model Additional recommendations: 8. Beware of Q 2 (Alex Tropsha) 9. Search for outliers in the test set 10. Do not expect your model to be predictive Summary, Conclusions and Recommendations La Trahison des Images (The Perfidy of Images) R. Magritte "All Models Are Wrong But Some Are Useful." George E. P. Box, 1979

International Journal of Research and Development in Pharmacy and Life Sciences. Review Article

International Journal of Research and Development in Pharmacy and Life Sciences. Review Article International Journal of Research and Development in Pharmacy and Life Sciences Available online at http//www.ijrdpl.com October - November, 2012, Vol. 1, No.4, pp 167-175 ISSN: 2278-0238 Review Article

More information

Nonlinear QSAR and 3D QSAR

Nonlinear QSAR and 3D QSAR onlinear QSAR and 3D QSAR Hugo Kubinyi Germany E-Mail kubinyi@t-online.de HomePage www.kubinyi.de onlinear Lipophilicity-Activity Relationships drug receptor Possible Reasons for onlinear Lipophilicity-Activity

More information

Detecting bad regression models: multicriteria fitness functions in regression analysis

Detecting bad regression models: multicriteria fitness functions in regression analysis Analytica Chimica Acta 515 (4) 199 8 Detecting bad regression models: multicriteria fitness functions in regression analysis Roberto Todeschini, Viviana Consonni, Andrea Mauri, Manuela Pavan Milano Chemometrics

More information

Quantitative Structure-Activity Relationship (QSAR) computational-drug-design.html

Quantitative Structure-Activity Relationship (QSAR)  computational-drug-design.html Quantitative Structure-Activity Relationship (QSAR) http://www.biophys.mpg.de/en/theoretical-biophysics/ computational-drug-design.html 07.11.2017 Ahmad Reza Mehdipour 07.11.2017 Course Outline 1. 1.Ligand-

More information

QSAR Parameters. Hugo Kubinyi. What is QSAR? Germany. HomePage

QSAR Parameters. Hugo Kubinyi. What is QSAR? Germany.  HomePage QSA Parameters Hugo Kubinyi Germany E-Mail kubinyi@t-online.de HomePage www.kubinyi.de What is QSA? QSA (quantitative structure-activity relationships) includes all statistical methods, by which biological

More information

Journal of Chemical and Pharmaceutical Research

Journal of Chemical and Pharmaceutical Research Available on line www.jocpr.com Journal of Chemical and Pharmaceutical Research ISS o: 0975-7384 CDE(USA): JCPRC5 J. Chem. Pharm. Res., 2010, 2(5):682-689 QSAR analysis of indanone and aurone derivatives

More information

MRP. parallel artificial membrane permeation assay (PAMPA) Caco-,

MRP. parallel artificial membrane permeation assay (PAMPA) Caco-, parallel artificial membrane permeation assay (PAMPA) Caco-, Caco-, Prediction of Human Oral Absorption of Chemicals in the Environment Miki AKAMATSU, Masaaki FUJIKAWA, Kazuya NAKAO, Ryo SHIMIZU, +, P-

More information

QSAR STUDY OF ANTIMALARIA OF XANTHONE DERIVATIVES USING MULTIPLE LINEAR REGRESSION METHODS

QSAR STUDY OF ANTIMALARIA OF XANTHONE DERIVATIVES USING MULTIPLE LINEAR REGRESSION METHODS PROCEEDING OF 3 RD INTERNATIONAL CONFERENCE ON RESEARCH, IMPLEMENTATION AND EDUCATION OF MATHEMATICS AND SCIENCE YOGYAKARTA, 16 17 MAY 2016 QSAR STUDY OF ANTIMALARIA OF XANTHONE DERIVATIVES USING MULTIPLE

More information

Medicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted?

Medicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted? Medicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted? J. Guy Breitenbucher Streamlining Drug Discovery Conference San Francisco, CA ct. 25, 2018 Additivity as the

More information

Validation of QSAR Models - Strategies and Importance

Validation of QSAR Models - Strategies and Importance Ravichandran Veerasamy, et al : Validation of QSAR Models-Strategies and Importance 511 International Journal of Drug Design and Discovery Volume Issue 3 July September 011. 511-519 Validation of QSAR

More information

An A statement: The year 1967 is 49 years in the past. A B statement: The year 1967 is 49 years before the year 2016

An A statement: The year 1967 is 49 years in the past. A B statement: The year 1967 is 49 years before the year 2016 McTaggart Ted Sider Intro Metaphysics 1. The A series and the B series Positions in time, as time appears to us prima facie, are distinguished in two ways. Each position is Earlier than some and Later

More information

Priority Setting of Endocrine Disruptors Using QSARs

Priority Setting of Endocrine Disruptors Using QSARs Priority Setting of Endocrine Disruptors Using QSARs Weida Tong Manager of Computational Science Group, Logicon ROW Sciences, FDA s National Center for Toxicological Research (NCTR), U.S.A. Thanks for

More information

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,..

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,.. 3 Conformational Search Molecular Docking Simulate Annealing Ab Initio QM Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,.. Rino Ragno:

More information

International Journal of Pharma and Bio Sciences V1(1)2010. Abhilash M

International Journal of Pharma and Bio Sciences V1(1)2010. Abhilash M Abhilash M Department of Biotechnology,The Oxford college of Engineering,Bangalore,INDIA. Corresponding Author abhibiotek@gmail.com Abstract Quantitative structure activity relationship (QSAR) represents

More information

CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS

CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS 159 CHAPTER 6 QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) ANALYSIS 6.1 INTRODUCTION The purpose of this study is to gain on insight into structural features related the anticancer, antioxidant

More information

Prediction of Acute Toxicity of Emerging Contaminants on the Water Flea Daphnia magna by Ant Colony Optimization - Support Vector Machine QSTR models

Prediction of Acute Toxicity of Emerging Contaminants on the Water Flea Daphnia magna by Ant Colony Optimization - Support Vector Machine QSTR models Electronic Supplementary Material (ESI) for Environmental Science: Processes & Impacts. This journal is The Royal Society of Chemistry 017 Prediction of Acute Toxicity of Emerging Contaminants on the Water

More information

Identification of Active Ligands. Identification of Suitable Descriptors (molecular fingerprint)

Identification of Active Ligands. Identification of Suitable Descriptors (molecular fingerprint) Introduction to Ligand-Based Drug Design Chimica Farmaceutica Identification of Active Ligands Identification of Suitable Descriptors (molecular fingerprint) Establish Mathematical Expression Relating

More information

PHILOSOPHY OF MODELING

PHILOSOPHY OF MODELING PHILOSOPHY OF MODELING 1B-1 Overview Modeling Scientific method Karl Popper Scientific method Problems in modeling Stages in model development Model complexity Model complexity Conclusions "After all,

More information

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS DRUG DEVELOPMENT Drug development is a challenging path Today, the causes of many diseases (rheumatoid arthritis, cancer, mental diseases, etc.)

More information

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom verview D-QSAR Definition Examples Features counts Topological indices D fingerprints and fragment counts R-group descriptors ow good are D descriptors in practice? Summary Peter Gedeck ovartis Institutes

More information

Qsar study of anthranilic acid sulfonamides as inhibitors of methionine aminopeptidase-2 using different chemometrics tools

Qsar study of anthranilic acid sulfonamides as inhibitors of methionine aminopeptidase-2 using different chemometrics tools Qsar study of anthranilic acid sulfonamides as inhibitors of methionine aminopeptidase-2 using different chemometrics tools RAZIEH SABET, MOHSEN SHAHLAEI, AFSHIN FASSIHI a Department of Medicinal Chemistry,

More information

SCIENCES (Int. J. of Pharm. Life Sci.) Melting point Models of Alkanes by using Physico-chemical Properties

SCIENCES (Int. J. of Pharm. Life Sci.) Melting point Models of Alkanes by using Physico-chemical Properties INTERNATIONAL JOURNAL OF PHARMACY & LIFE SCIENCES (Int. J. of Pharm. Life Sci.) Melting point s of Alkanes by using Physico-chemical Properties Suman Rawat 1* and Om Prakash Sati 2 1, Department of Chemistry,

More information

QSAR Study of Quinazoline Derivatives as Inhibitor of Epidermal Growth Factor Receptor-Tyrosine Kinase (EGFR-TK)

QSAR Study of Quinazoline Derivatives as Inhibitor of Epidermal Growth Factor Receptor-Tyrosine Kinase (EGFR-TK) 3rd International Conference on Computation for cience and Technology (ICCT-3) QAR tudy of Quinazoline Derivatives as Inhibitor of Epidermal Growth Factor Receptor-Tyrosine Kinase (EGFR-TK) La de Aman1*,

More information

International Journal of PharmTech Research CODEN (USA): IJPRIF, ISSN: Vol.8, No.3, pp , 2015

International Journal of PharmTech Research CODEN (USA): IJPRIF, ISSN: Vol.8, No.3, pp , 2015 International Journal of PharmTech Research CDE (USA): IJPRIF, ISS: 0974-4304 Vol.8, o.3, pp 408-414, 2015 AlogP calculation of octanol/water partition coefficient of ferrocene derivatives Ridha Ahmedi,

More information

Three-dimensional molecular descriptors and a novel QSAR method

Three-dimensional molecular descriptors and a novel QSAR method Journal of Molecular Graphics and Modelling 21 (2002) 161 170 Three-dimensional molecular descriptors and a novel QSAR method Scott A. Wildman 1, Gordon M. Crippen College of Pharmacy, University of Michigan,

More information

In silico pharmacology for drug discovery

In silico pharmacology for drug discovery In silico pharmacology for drug discovery In silico drug design In silico methods can contribute to drug targets identification through application of bionformatics tools. Currently, the application of

More information

Description of Molecules with Molecular Interaction Fields (MIF)

Description of Molecules with Molecular Interaction Fields (MIF) Description of Molecules with Molecular Interaction Fields (MIF) Introduction to Ligand-Based Drug Design Chimica Farmaceutica 2 Reduction of Dimensionality into Few New Highly Informative Entities -----

More information

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering Fragment based drug discovery in teams of medicinal and computational chemists Carsten Detering BioSolveIT Quick Facts Founded in 2001 by the developers of FlexX ~20 people Core expertise: docking, screening,

More information

QSAR of Microtubule Stabilizing Dictyostatins

QSAR of Microtubule Stabilizing Dictyostatins QSAR of Microtubule Stabilizing Dictyostatins Kia Montgomery BBSI 2007- University of Pittsburgh Department of Chemistry, Grambling State University Billy Day, Ph.D. Department of Pharmaceutical Sciences,

More information

Quantitative structure activity relationship and drug design: A Review

Quantitative structure activity relationship and drug design: A Review International Journal of Research in Biosciences Vol. 5 Issue 4, pp. (1-5), October 2016 Available online at http://www.ijrbs.in ISSN 2319-2844 Research Paper Quantitative structure activity relationship

More information

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships

QSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships QSAR/QSPR modeling Alexandre Varnek Faculté de Chimie, ULP, Strasbourg, FRANCE QSAR/QSPR models Development Validation

More information

Medicinal Chemistry/ CHEM 458/658 Chapter 3- SAR and QSAR

Medicinal Chemistry/ CHEM 458/658 Chapter 3- SAR and QSAR Medicinal Chemistry/ CHEM 458/658 Chapter 3- SAR and QSAR Bela Torok Department of Chemistry University of Massachusetts Boston Boston, MA 1 Introduction Structure-Activity Relationship (SAR) - similar

More information

REVIEW ON: QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) MODELING

REVIEW ON: QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) MODELING REVIEW ON: QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIP (QSAR) MODELING Umma Muhammad 1, Adamu Uzairu and David Ebuka Arthur 1 Department of Pre-nd Sci& Tech, School of General Studies,Kano State Polytechnic.

More information

Sorana-Daniela BOLBOACĂ a, Lorentz JÄNTSCHI b. Abstract

Sorana-Daniela BOLBOACĂ a, Lorentz JÄNTSCHI b. Abstract Computer-Aided Chemical Engineering, ISSN 1570-7946, 24(2007) 2007 Elsevier B.V./Ltd. All rights reserved. 965 Modelling the Inhibition Activity on Carbonic Anhydrase I of Some Substituted Thiadiazole-

More information

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Plan Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Exercise: Example and exercise with herg potassium channel: Use of

More information

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No. 12 1801 Ligand-based QSAR Studies on the Indolinones Derivatives as Inhibitors of the Protein Tyrosine

More information

Coefficient Symbol Equation Limits

Coefficient Symbol Equation Limits 1 Coefficient Symbol Equation Limits Squared Correlation Coefficient R 2 or r 2 0 r 2 N 1 2 ( Yexp, i Ycalc, i ) 2 ( Yexp, i Y ) i= 1 2 Cross-Validated R 2 q 2 r 2 or Q 2 or q 2 N 2 ( Yexp, i Ypred, i

More information

Virtual screening for drug discovery. Markus Lill Purdue University

Virtual screening for drug discovery. Markus Lill Purdue University Virtual screening for drug discovery Markus Lill Purdue University mlill@purdue.edu Lecture material http://people.pharmacy.purdue.edu/~mlill/teaching/eidelberg/ I.1 Drug discovery Cl N Disease I.1 Drug

More information

Available online Journal of Chemical and Pharmaceutical Research, 2014, 6(4): Research Article

Available online   Journal of Chemical and Pharmaceutical Research, 2014, 6(4): Research Article Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(4):780-785 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 A novel series of Jumonji domain-containing protein

More information

Translating Methods from Pharma to Flavours & Fragrances

Translating Methods from Pharma to Flavours & Fragrances Translating Methods from Pharma to Flavours & Fragrances CINF 27: ACS National Meeting, New Orleans, LA - 18 th March 2018 Peter Hunt, Edmund Champness, Nicholas Foster, Tamsin Mansley & Matthew Segall

More information

Exploring the black box: structural and functional interpretation of QSAR models.

Exploring the black box: structural and functional interpretation of QSAR models. EMBL-EBI Industry workshop: In Silico ADMET prediction 4-5 December 2014, Hinxton, UK Exploring the black box: structural and functional interpretation of QSAR models. (Automatic exploration of datasets

More information

Introduction to Chemoinformatics

Introduction to Chemoinformatics Introduction to Chemoinformatics Dr. Igor V. Tetko Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) Institute of Bioinformatics & Systems Biology (HMGU) Kyiv, 10 August

More information

MODELING ANGIOTENSIN II ANTAGONIST ACTIVITY OF 4H-1, 2, 4-TRIAZOLES

MODELING ANGIOTENSIN II ANTAGONIST ACTIVITY OF 4H-1, 2, 4-TRIAZOLES Singh et.al/journal of Engineering, Science and Management Education/Vol., 010/57-66 MODELING ANGIOTENSIN II ANTAGONIST ACTIVITY OF 4H-1,, 4-TRIAZOLES Jyoti Singh*, Santosh Kumar Shukla**, Basheerulla

More information

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration

The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration Chris Luscombe, Computational Chemistry GlaxoSmithKline Summary of Talk Traditional approaches SAR Free-Wilson

More information

QSAR MODELS FOR PREDICTING TOXICITIES OF MICROCYSTINS IN CYANOBACTERIA USING GETAWAY DESCRIPTORS

QSAR MODELS FOR PREDICTING TOXICITIES OF MICROCYSTINS IN CYANOBACTERIA USING GETAWAY DESCRIPTORS QSAR MODELS FOR PREDICTING TOXICITIES OF MICROCYSTINS IN CYANOBACTERIA USING GETAWAY DESCRIPTORS 2 Alex A. Tardaguila, 2 Jennifer C. Sy, and 1 Eric R. Punzalan 1 Chemistry Department, De La Salle University,

More information

QSAR STUDY OF ASIATIC ACID AND ITS DERIVATIVES AS POTENTIAL INDUCIBLE NITRIC OXIDE SYNTHASE (inos) INHIBITORS

QSAR STUDY OF ASIATIC ACID AND ITS DERIVATIVES AS POTENTIAL INDUCIBLE NITRIC OXIDE SYNTHASE (inos) INHIBITORS Int. J. Chem. Sci.: 1(), 2015, 1459-147 ISSN 0972-768X www.sadgurupublications.com QSAR STUDY OF ASIATIC ACID AND ITS DERIVATIVES AS POTENTIAL INDUCIBLE NITRIC OXIDE SYNTHASE (inos) INHIBITORS IDA MUSFIROH

More information

Structure-Activity Modeling - QSAR. Uwe Koch

Structure-Activity Modeling - QSAR. Uwe Koch Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity

More information

Structural interpretation of QSAR models a universal approach

Structural interpretation of QSAR models a universal approach Methods and Applications of Computational Chemistry - 5 Kharkiv, Ukraine, 1 5 July 2013 Structural interpretation of QSAR models a universal approach Victor Kuz min, Pavel Polishchuk, Anatoly Artemenko,

More information

Molecular Descriptors for Chemoinformatics

Molecular Descriptors for Chemoinformatics Roberto Todeschini and Viviana Consonni Molecular Descriptors for Chemoinformatics Volume I: Alphabetical Listing Second, Revised and Enlarged Edition Roberto Todeschini and Viviana Consonni Molecular

More information

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Roberto Todeschini Milano Chemometrics and QSAR Research Group - Dept. of

More information

ADRIANA.Code and SONNIA. Tutorial

ADRIANA.Code and SONNIA. Tutorial ADRIANA.Code and SONNIA Tutorial Modeling Corticosteroid Binding Globulin Receptor Activity Molecular Networks GmbH Computerchemie July 2008 http://www.molecular-networks.com Henkestr. 91 91052 Erlangen

More information

Spring Frost Prediction Models in Cranberry

Spring Frost Prediction Models in Cranberry University of Massachusetts Amherst ScholarWorks@UMass Amherst North American Cranberry Researcher and Extension Workers Conference NACREW 2017 Aug 28th, 9:30 AM - 9:45 AM Spring Frost Prediction Models

More information

Integrated Cheminformatics to Guide Drug Discovery

Integrated Cheminformatics to Guide Drug Discovery Integrated Cheminformatics to Guide Drug Discovery Matthew Segall, Ed Champness, Peter Hunt, Tamsin Mansley CINF Drug Discovery Cheminformatics Approaches August 23 rd 2017 Optibrium, StarDrop, Auto-Modeller,

More information

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters Drug Design 2 Oliver Kohlbacher Winter 2009/2010 11. QSAR Part 4: Selected Chapters Abt. Simulation biologischer Systeme WSI/ZBIT, Eberhard-Karls-Universität Tübingen Overview GRIND GRid-INDependent Descriptors

More information

De Novo molecular design with Deep Reinforcement Learning

De Novo molecular design with Deep Reinforcement Learning De Novo molecular design with Deep Reinforcement Learning @olexandr Olexandr Isayev, Ph.D. University of North Carolina at Chapel Hill olexandr@unc.edu http://olexandrisayev.com About me Ph.D. in Chemistry

More information

Hydrogen Bonding & Molecular Design Peter

Hydrogen Bonding & Molecular Design Peter Hydrogen Bonding & Molecular Design Peter Kenny(pwk.pub.2008@gmail.com) Hydrogen Bonding in Drug Discovery & Development Interactions between drug and water molecules (Solubility, distribution, permeability,

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics

More information

DivCalc: A Utility for Diversity Analysis and Compound Sampling

DivCalc: A Utility for Diversity Analysis and Compound Sampling Molecules 2002, 7, 657-661 molecules ISSN 1420-3049 http://www.mdpi.org DivCalc: A Utility for Diversity Analysis and Compound Sampling Rajeev Gangal* SciNova Informatics, 161 Madhumanjiri Apartments,

More information

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt ACS Spring National Meeting. COMP, March 16 th 2016 Matthew Segall, Peter Hunt, Ed Champness matt.segall@optibrium.com Optibrium,

More information

Molecular Descriptors Family on Structure Activity Relationships 5. Antimalarial Activity of 2,4-Diamino-6-Quinazoline Sulfonamide Derivates

Molecular Descriptors Family on Structure Activity Relationships 5. Antimalarial Activity of 2,4-Diamino-6-Quinazoline Sulfonamide Derivates Leonardo Journal of Sciences ISSN 1583-0233 Issue 8, January-June 2006 p. 77-88 Molecular Descriptors Family on Structure Activity Relationships 5. Antimalarial Activity of 2,4-Diamino-6-Quinazoline Sulfonamide

More information

molecules ISSN

molecules ISSN Molecules 2004, 9, 1004-1009 molecules ISSN 1420-3049 http://www.mdpi.org Performance of Kier-Hall E-state Descriptors in Quantitative Structure Activity Relationship (QSAR) Studies of Multifunctional

More information

Physical Chemistry Final Take Home Fall 2003

Physical Chemistry Final Take Home Fall 2003 Physical Chemistry Final Take Home Fall 2003 Do one of the following questions. These projects are worth 30 points (i.e. equivalent to about two problems on the final). Each of the computational problems

More information

ADME evaluation in drug discovery

ADME evaluation in drug discovery J Mol Model (2002) 8:337 349 DOI 10.1007/s00894-002-0101-1 ORIGINAL PAPER Tingjun Hou Xiaojie Xu ADME evaluation in drug discovery 1. Applications of genetic algorithms to the prediction of blood brain

More information

Improved Descriptors for the Quantitative Structure Activity Relationship Modeling of Peptides and Proteins

Improved Descriptors for the Quantitative Structure Activity Relationship Modeling of Peptides and Proteins Cite This: pubs.acs.org/jcim Improved Descriptors for the Quantitative Structure Activity Relationship Modeling of Peptides and Proteins Mark H. Barley, Nicholas J. Turner, and Royston Goodacre* School

More information

Relative Drug Likelihood: Going beyond Drug-Likeness

Relative Drug Likelihood: Going beyond Drug-Likeness Relative Drug Likelihood: Going beyond Drug-Likeness ACS Fall National Meeting, August 23rd 2012 Matthew Segall, Iskander Yusof Optibrium, StarDrop, Auto-Modeller and Glowing Molecule are trademarks of

More information

Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds

Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds Applying Bioisosteric Transformations to Predict Novel, High Quality Compounds Dr James Chisholm,* Dr John Barnard, Dr Julian Hayward, Dr Matthew Segall*, Mr Edmund Champness*, Dr Chris Leeding,* Mr Hector

More information

Keywords: anti-coagulants, factor Xa, QSAR, Thrombosis. Introduction

Keywords: anti-coagulants, factor Xa, QSAR, Thrombosis. Introduction PostDoc Journal Vol. 2, No. 3, March 2014 Journal of Postdoctoral Research www.postdocjournal.com QSAR Study of Thiophene-Anthranilamides Based Factor Xa Direct Inhibitors Preetpal S. Sidhu Department

More information

Statistical concepts in QSAR.

Statistical concepts in QSAR. Statistical concepts in QSAR. Computational chemistry represents molecular structures as a numerical models and simulates their behavior with the equations of quantum and classical physics. Available programs

More information

QSAR Models for the Prediction of Plasma Protein Binding

QSAR Models for the Prediction of Plasma Protein Binding doi: 10.5681/bi.2013.011 http://bi.tbzmed.ac.ir/ QSAR Models for the Prediction of Plasma Protein Binding Taravat Ghafourian *, Zeshan Amin Medway School of Pharmacy, Universities of Kent and Greenwich,

More information

1.QSAR identifier 1.1.QSAR identifier (title): QSAR for Relative Binding Affinity to Estrogen Receptor 1.2.Other related models:

1.QSAR identifier 1.1.QSAR identifier (title): QSAR for Relative Binding Affinity to Estrogen Receptor 1.2.Other related models: QMRF identifier (ECB Inventory): QMRF Title: QSAR for Relative Binding Affinity to Estrogen Receptor Printing Date:Feb 16, 2010 1.QSAR identifier 1.1.QSAR identifier (title): QSAR for Relative Binding

More information

Encoding molecular structures as ranks of models: A new, secure way for sharing chemical data and development of ADME/T models. Igor V.

Encoding molecular structures as ranks of models: A new, secure way for sharing chemical data and development of ADME/T models. Igor V. Encoding molecular structures as ranks of models: A new, secure way for sharing chemical data and development of ADME/T models Igor V. Tetko IBPC, Ukrainian Academy of Sciences, Kyiv, Ukraine and Institute

More information

Internally defined distances in 3D-quantitative structure-activity relationships

Internally defined distances in 3D-quantitative structure-activity relationships Journal of Computer-Aided Molecular Design, 16: 79 93, 2002. KLUWER/ESCOM 2002 Kluwer Academic Publishers. Printed in the Netherlands. 79 Internally defined distances in 3D-quantitative structure-activity

More information

Column Liquid Chromatography Experiment Adapted for Use in Secondary Schools

Column Liquid Chromatography Experiment Adapted for Use in Secondary Schools Column Liquid Chromatography Experiment Adapted for Use in Secondary Schools Mark Langella WISTA The most modern and sophisticated methods of separating mixtures that the organic chemist has available

More information

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National

More information

Comparison of log P/D with bio-mimetic properties

Comparison of log P/D with bio-mimetic properties Comparison of log P/D with bio-mimetic properties Klara Valko Bio-Mimetic Chromatography Consultancy for Successful Drug Discovery Solvation process First we need to create a cavity Solute molecule Requires

More information

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes Introduction Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes The production of new drugs requires time for development and testing, and can result in large prohibitive costs

More information

Combinatorial Chemistry Technology

Combinatorial Chemistry Technology ugo Kubinyi, www.kubinyi.de ombinatorial hemistry Technology ugo Kubinyi Germany E-Mail kubinyi@t-online.de omepage www.kubinyi.de ugo Kubinyi, www.kubinyi.de ombinatorial Diversity in ature 20 natural

More information

(e.g.training and prediction set, algorithm, ecc...). 2.9.Availability of another QMRF for exactly the same model: No

(e.g.training and prediction set, algorithm, ecc...). 2.9.Availability of another QMRF for exactly the same model: No QMRF identifier (JRC Inventory):To be entered by JRC QMRF Title: Insubria QSAR PaDEL-Descriptor model for prediction of Endocrine Disruptors Chemicals (EDC) Estrogen Receptor (ER)-binding affinity. Printing

More information

3D QSAR studies of Substituted Benzamides as Nonacidic Antiinflammatory Agents by knn MFA Approach

3D QSAR studies of Substituted Benzamides as Nonacidic Antiinflammatory Agents by knn MFA Approach INTERNATIONAL JOURNAL OF ADVANCES IN PARMACY, BIOLOGY AND CEMISTRY Research Article 3D QSAR studies of Substituted Benzamides as Nonacidic Antiinflammatory Agents by knn MFA Approach Anupama A. Parate*

More information

Advanced Studies in Organic Chemistry

Advanced Studies in Organic Chemistry Advanced Studies in Organic Chemistry Organic Chemistry of Drug Compounds 783614S 20 After this course the student is familiar with basic principles and techniques of medicinal chemistry, tactics and tools

More information

Math 111, Introduction to the Calculus, Fall 2011 Midterm I Practice Exam 1 Solutions

Math 111, Introduction to the Calculus, Fall 2011 Midterm I Practice Exam 1 Solutions Math 111, Introduction to the Calculus, Fall 2011 Midterm I Practice Exam 1 Solutions For each question, there is a model solution (showing you the level of detail I expect on the exam) and then below

More information

Chemical Space: Modeling Exploration & Understanding

Chemical Space: Modeling Exploration & Understanding verview Chemical Space: Modeling Exploration & Understanding Rajarshi Guha School of Informatics Indiana University 16 th August, 2006 utline verview 1 verview 2 3 CDK R utline verview 1 verview 2 3 CDK

More information

Effect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds

Effect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds IOSR Journal of Applied Chemistry (IOSR-JAC) e-issn: 2278-5736. Volume 6, Issue 3 (Nov. Dec. 2013), PP 09-17 Effect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds Anita K* 1,Vijay

More information

Notes of Dr. Anil Mishra at 1

Notes of Dr. Anil Mishra at   1 Introduction Quantitative Structure-Activity Relationships QSPR Quantitative Structure-Property Relationships What is? is a mathematical relationship between a biological activity of a molecular system

More information

Homework 2 Solutions

Homework 2 Solutions Homework 2 Solutions by the respectable Asa Levi p134 # 1 (10 pts) (a) Too low! - in this plot the point of averages is around (60, 60), way lower than (100, 100). (b) Too small! - in this plot the SD

More information

Decision Trees (Cont.)

Decision Trees (Cont.) Decision Trees (Cont.) R&N Chapter 18.2,18.3 Side example with discrete (categorical) attributes: Predicting age (3 values: less than 30, 30-45, more than 45 yrs old) from census data. Attributes (split

More information

TRAINING REAXYS MEDICINAL CHEMISTRY

TRAINING REAXYS MEDICINAL CHEMISTRY TRAINING REAXYS MEDICINAL CHEMISTRY 1 SITUATION: DRUG DISCOVERY Knowledge survey Therapeutic target Known ligands Generate chemistry ideas Chemistry Check chemical feasibility ELN DBs In-house Analyze

More information

Exploring Structural Requirements For PPAR Modulation Of Benzoyl 2-Methyl Indoles Using Physico-Chemical Parameters

Exploring Structural Requirements For PPAR Modulation Of Benzoyl 2-Methyl Indoles Using Physico-Chemical Parameters International Journal of PharmTech Research CODEN (USA): IJPRIF ISSN : 0974-4304 Vol.4, No.4, pp 1834-1839, Oct-Dec 2012 Exploring Structural Requirements For PPAR Modulation Of Benzoyl 2-Methyl Indoles

More information

Easier and Better Exploitation of PhysChem Properties in Medicinal Chemistry

Easier and Better Exploitation of PhysChem Properties in Medicinal Chemistry www.acdlabs.com Easier and Better Exploitation of PhysChem Properties in Medicinal Chemistry Dr. Sanjivanjit K. Bhal Advanced Chemistry Development, Inc. (ACD/Labs) Evolution of MedChem

More information

Stats notes Chapter 5 of Data Mining From Witten and Frank

Stats notes Chapter 5 of Data Mining From Witten and Frank Stats notes Chapter 5 of Data Mining From Witten and Frank 5 Credibility: Evaluating what s been learned Issues: training, testing, tuning Predicting performance: confidence limits Holdout, cross-validation,

More information

2.4.QMRF update(s): 3.Defining the endpoint - OECD Principle 1

2.4.QMRF update(s): 3.Defining the endpoint - OECD Principle 1 QMRF identifier (JRC Inventory): QMRF Title: Nonlinear ANN QSAR Model for Repeated dose 90-day oral toxicity study in rodents Printing Date:19.05.2011 1.QSAR identifier 1.1.QSAR identifier (title): Nonlinear

More information

Thanks to Paul Lewis and Joe Felsenstein for the use of slides

Thanks to Paul Lewis and Joe Felsenstein for the use of slides Thanks to Paul Lewis and Joe Felsenstein for the use of slides Review Hennigian logic reconstructs the tree if we know polarity of characters and there is no homoplasy UPGMA infers a tree from a distance

More information

QMRF identifier (JRC Inventory): QMRF Title: QSAR model for blood-brain barrier (BBB) partition Printing Date:

QMRF identifier (JRC Inventory): QMRF Title: QSAR model for blood-brain barrier (BBB) partition Printing Date: QMRF identifier (JRC Inventory): QMRF Title: QSAR model for blood-brain barrier (BBB) partition Printing Date:21.01.2010 1.QSAR identifier 1.1.QSAR identifier (title): QSAR model for blood-brain barrier

More information

EXPERIMENTAL DESIGN. u Science answers questions with experiments.

EXPERIMENTAL DESIGN. u Science answers questions with experiments. EXPERIMENTAL DESIGN u Science answers questions with experiments. Define the Problem u Begin by asking a question about your topic u Represented by a problem statement u What is a good question for an

More information

Quantitative Structure Relaxant Activity Relationship of Benzopyran Derivatives: A GA-PLS-MLR Approach

Quantitative Structure Relaxant Activity Relationship of Benzopyran Derivatives: A GA-PLS-MLR Approach BI Biomedicine International 2011; 2:. ORIGINAL ARTICLE Quantitative Structure Relaxant Activity Relationship of Benzopyran Derivatives: A GA-PLS-MLR Approach Behzad Jafari, 1 Somaieh Soltani, 2 Hossein

More information

1.3.Software coding the model: QSARModel Molcode Ltd., Turu 2, Tartu, 51014, Estonia

1.3.Software coding the model: QSARModel Molcode Ltd., Turu 2, Tartu, 51014, Estonia QMRF identifier (ECB Inventory):Q8-10-14-176 QMRF Title: QSAR for acute oral toxicity (in vitro) Printing Date:Mar 29, 2011 1.QSAR identifier 1.1.QSAR identifier (title): QSAR for acute oral toxicity (in

More information

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites Introduction J. Andrew Surface Hampden-Sydney College / Virginia Commonwealth University In the past several decades

More information

STRUCTURE-AFFINITY MODELING OF AZO DYE ADSORPTION ON CELLULOSE FIBRE BY MLR

STRUCTURE-AFFINITY MODELING OF AZO DYE ADSORPTION ON CELLULOSE FIBRE BY MLR TUCTUE-AFFIITY MODELIG OF AZO DYE ADOPTIO O CELLULOE FIBE BY ML IMOA FUA-TIMOFEI a, LUDOVIC KUUCZI b, WALTE M.F. FABIA c, DAIELA IOECU b a Institute of Chemistry of the omanian Academy, 4 Mihai Viteazul

More information

The Development of QSAR Models To Predict and Interpret the Biological Activity of Artemisinin Analogues

The Development of QSAR Models To Predict and Interpret the Biological Activity of Artemisinin Analogues 151 Chapter 6 The Development of QSAR Models To Predict and Interpret the Biological Activity of Artemisinin Analogues 6.1 Introduction Qinghao (Artemisia annua) is an herb that has been used for over

More information

FUNDAMENTAL CONCEPTS IN MEASUREMENT & EXPERIMENTATION

FUNDAMENTAL CONCEPTS IN MEASUREMENT & EXPERIMENTATION FUNDAMENTAL CONCEPTS IN MEASUREMENT & EXPERIMENTATION What is Measurement? In general, it is the acquisition of information. Classification of information leads to two different types of measurements:

More information