Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors. Robert D. Clark 2004 Tripos, Inc.

Size: px
Start display at page:

Download "Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors. Robert D. Clark 2004 Tripos, Inc."

Transcription

1 Characterization of Pharmacophore Multiplet Fingerprints as Molecular Descriptors Robert D. Clark Tripos, Inc Tripos, Inc.

2 Outline Background o history o mechanics Finding appropriate binning ranges o biased conformer generation Similarity measures o stochastic similarity Hypothesis generation o asymmetric similarity Conclusions

3 History of Pharmacophore Multiplets A.C. Good and I.D. Kuntz; J. Comput.-Aided Mol. Design 1995, 9, X. Chen, A. Rusinko, and S.S. Young; J. Chem. Inf. Comput. Sci. 1998, 38, J.S. Mason, I. Morize, P.R. Menard, D.L. Cheney, C. Hulme & R.F. Labaudiniere; J. Med. Chem. 1999, 42, M.J. McGregor & S.M. Muskal; J. Chem. Inf. Comput. Sci. 1999, 39, H. Matter and T. Pötter; J. Chem. Inf. Comput. Sci. 1999, 39, J.S. Mason and B.R. Beno; J. Mol. Graphics Mod. 2000, 18, E. Abrahamian, P.C. Fox, L. Nærum, I.T. Christensen, H. Thøgersen & R.D. Clark; J. Chem. Inf. Comput. Sci. 2003, 43,

4 Novo Nordisk / Tripos Tuplets Collaboration 2 year collaboration to develop and extend existing SYBYL triplet (PDT) technology Incorporate pair, triplet and quartet ( Tuplet) technology Augmented Tuplets and support for privileged substructures Conformers generated on-the-fly or retrieved Bitmaps created, stored and manipulated in compressed format o four 1.8 x 10 9 bit bitmaps stored as ~80kb file o seconds/molecule

5 Type III antiarrhythmic: UK donor atom acceptor atoms positive nitrogen hydrophobic center hydrophobic center donor/acceptor atoms

6 Multiplet Fingerprints

7 Indexing Triplets 2 D 3 Vertex joining longest and shortest edges A 5 H Bin: 5, 3, 2 Triplet: H-A-D

8 Indexing Tetrahedra Problems: Need a unique mapping Must deal with chirality Literally dozens of possible permutations Mapping must be based on bins and features A 4 4 C 2 3 D 2 C Plane of symmetry implies no chirality D 2 C 2 3 C 4 4 A 4 C 3 2 D 2 Chiral tetrahedra D C 4 A 4 B B 4 A

9 Mapping Quartet Bits Mapping for 7 bins and 3 features (D, A, H) * DDDD DDDA DDDH HHHH Bitmap Size = 7 6 * 3 4 = 9,529,569 bits * specifies the + enantiomer; specifies the - enantiomer +

10 Distribution of Distances Between Features frequency frequency frequency beta blockers K + channel openers Type I antiarrythmics edge length (Å)

11 1 Conformer By Class Cumulative Distributions across Classes frequency frequency Conformer By Class Estrogen Antagonists Type III Antiarrythmics Benzamides Phenothiazines Beta Blockers Type I Antiarrythmics K Channel Openers Estrogen Antagonists Type III Antiarrythmics Benzamides Phenothiazines Beta Blockers Type I Antiarrythmics K Channel Openers edge length (Å)

12 100 Confort Conformer By Class Effect of Biased Conformer Generation frequency frequency Estrogen Antagonists Type III Antiarrythmics Benzamides Phenothiazines Beta Blockers Type I Antiarrythmics K Channel Openers Systematic Search Conformers By Class Estrogen Antagonists Type III Antiarrythmics Benzamides Phenothiazines Beta Blockers Type I Antiarrythmics K Channel Openers edge length (Å)

13 Hypothesis Fingerprint Creation Binary Compound Fingerprints DDD 000 DDD 001 DDA 200 DAA 210 DDH 210 DAH 331 DHH 333 HHH

14 Hypothesis Fingerprint Creation Binary Compound Fingerprints Vector Sum Fingerprint DDD 000 DDD 001 DDA 200 DAA 210 DDH 210 DAH 331 DHH 333 HHH

15 Hypothesis Fingerprint Creation Binary Compound Fingerprints Vector Sum Fingerprint Feature Weights Bin Weights Bit Score DDD 111 DDD 211 DDA 311 DAA 321 DDH 321 DAH 442 DHH 444 HHH

16 Weighting Bits for Hypothesis Generation nf nd S b = f b i= 1 fw i j= 1 dw j f 1 S b is the score for the bit f b is the frequency of the bit fw i is the weight of the feature type dw j is the weight of the distance bin f 2 d 1 d 3 d 2 f 3 Construct an hypothesis from the highest scoring bits.

17 Hypothesis Fingerprint Creation Binary Compound Fingerprints Vector Sum Fingerprint Feature Weights Bin Weights Bit Score DDD 111 DDD 211 DDA 311 DAA 321 DDH 321 DAH 442 DHH 444 HHH

18 S = tn N N t Sanity Checker

19 Similarity Measures Tanimoto coefficient t( A, B) = Cosine coefficient pdt( A) pdt( A) pdt( B) pdt( B) Cc( a, b) = pdt( a) pdt( b) pdt( a) pdt( b) Stochastic cosine coefficient s( A, B) = E * E[ pdt( A) pdt ( B) ] * * [ pdt( A) pdt ( A) ] E[ pdt( B) pdt ( B) ]

20 Effect of Conformer Count on Stochastic Cosine Similarity 0.6 similarity Estrogen_Antagonist Class Similarity Estrogen_Antagonist Non-Class Similarity K_openers Class Similarity K_openers Non-Class Similarity benzamides Class Similarity 0.1 benzamides Non-Class Similarity conformer count (max)

21 Effect of Conformer Count on Stochastic Cosine Discrimination discrimination ratio I_Antiarrythmics III_Antiarrythmics Phenothiazines beta Blocker Benzamides K_openers Estrogen_Antagonist conformer count (max)

22 Discrimination and Similarity Measure discrimination ratio discrimination ratio simple cosine Tanimoto I_Antiarrythmics III_Antiarrythmics Phenothiazines beta Blocker Benzamides K_openers Estrogen_Antagonist I_Antiarrythmics III_Antiarrythmics Phenothiazines beta Blocker Benzamides K_openers Estrogen_Antagonist conformer count (max)

23 Discrimiantion and Conformer Bias discrimination ratio discrimination ratio CONFORT systematic search I_Antiarrythmics III_Antiarrythmics Phenothiazines beta Blocker Benzamides K_openers Estrogen_Antagonist I_Antiarrythmics III_Antiarrythmics Phenothiazines beta Blocker Benzamides K_openers Estrogen_Antagonist conformer count (max)

24 Symmetric Similarity Measures Symmetric stochastic cosine s( A, B) = E [ ] * E pdt( A) pdt ( B) [ ( ) ( ) ] [ ( ) ( ) ] * * pdt A pdt A E pdt B pdt B Asymmetric stochastic cosine s*( h, t) = [ ( ) pdt( t) ] [ ( ) *( )] E pdt h E pdt h pdt h

25 Effect of Hypoothesis Size (Type III antiarrhythmics) average similarity average similarity symmetric cosine asymmetric stochastic cosine CONFORT within class 100 Conformers without class systematic search within class 1000 Conformers without class bits in hypothesis

26 Conclusions Compression is cool Natural binning does make sense o >15Å o at least for triplets Systematic bias increases discrimination o rule-based conformational bias can be useful o caveat: it may limit lead-hopping More is not necessarily better o true in terms of conformation count o true in terms of multiplet hypothesis size A little asymmetry can be a good thing Compression is still cool

27 Acknowledgements Novo Nordisk A/S (Denmark) Lars Nærum * Henning Thøgersen* Tripos, Inc. Edmond Abrahamian Peter Fox Trevor Heritage

28 May the multiplets be with you...

29

30 What a Protein Sees (electrostatic field at 0.5 Å resolution, 80 and 30% contours)

31 What the Chemist Sees H 3 C O S O Cl O N O F H 3 C N N O O H 3 C N H 3 C N H O CF 3 tetrahydrophthalimide (American Cyanamide) trifluorotoluidide pyrazole ether (Monsanto)

32 Pharmacophoric Features hydrogen bond acceptors H 3 C O S O Cl O N O F H 3 C N N O H 3 C N hydrophobic centers H 3 C O N H O hydrogen bond donor CF 3

33 Conformational Sampling* *diverse conformers obtained using CONFORT

34 Mapping Multiplets Mapping for 7 bins and 3 features (D, A, H)* bit... DDD DDA DDH HHH Bitmap Size = 7 3 * 3 3 = 9261 bits * Features are handled in the order supplied by the application.

35 Hypothesis Generation Multiple methods implemented for hypothesis generation o o o From a collection of known actives From a user defined UNITY query From a single molecule pharmacophore map a) Single or multiple generated conformers o From user specified residues in receptor cavity

36 Privileged Substructures: Augmented Triplets HY DS HY # name mnemonic xref weight min_dist max_dist DONOR_SITE DS AA =NULL.

37 Effect of Conformer Count on Cosine Coefficient Similarity 0.6 discrimination similarity ratio conformer count (max) Estrogen_Antagonist Class Similarity Estrogen_Antagonist Non-Class Similarity I_Antiarrythmics K_openers Class Similarity III_Antiarrythmics Phenothiazines K_openers Non-Class Similarity beta Blocker benzamides Class Similarity Benzamides K_openers benzamides Non-Class Similarity Estrogen_Antagonist

Similarity Search. Uwe Koch

Similarity Search. Uwe Koch Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance

More information

Structural biology and drug design: An overview

Structural biology and drug design: An overview Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved

More information

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom

Overview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom verview D-QSAR Definition Examples Features counts Topological indices D fingerprints and fragment counts R-group descriptors ow good are D descriptors in practice? Summary Peter Gedeck ovartis Institutes

More information

Universities of Leeds, Sheffield and York

Universities of Leeds, Sheffield and York promoting access to White Rose research papers Universities of Leeds, Sheffield and York http://eprints.whiterose.ac.uk/ This is an author produced version of a paper published in Organic & Biomolecular

More information

Pacific Symposium on Biocomputing 6: (2001)

Pacific Symposium on Biocomputing 6: (2001) Molecular Fingerprinting on the SIMD Parallel Processor Kestrel Eric Rice and Richard Hughey Department of Computer Engineering University of California, Santa Cruz, CA 95064 felrice,rphg@cse.ucsc.edu

More information

The PhilOEsophy. There are only two fundamental molecular descriptors

The PhilOEsophy. There are only two fundamental molecular descriptors The PhilOEsophy There are only two fundamental molecular descriptors Where can we use shape? Virtual screening More effective than 2D Lead-hopping Shape analogues are not graph analogues Molecular alignment

More information

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters

Drug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters Drug Design 2 Oliver Kohlbacher Winter 2009/2010 11. QSAR Part 4: Selected Chapters Abt. Simulation biologischer Systeme WSI/ZBIT, Eberhard-Karls-Universität Tübingen Overview GRIND GRid-INDependent Descriptors

More information

Jonathan S. Mason,, Isabelle Morize, Paul R. Menard,*, Daniel L. Cheney, Christopher Hulme, and Richard F. Labaudiniere

Jonathan S. Mason,, Isabelle Morize, Paul R. Menard,*, Daniel L. Cheney, Christopher Hulme, and Richard F. Labaudiniere J. Med. Chem. 1999, 42, 3251-3264 3251 New 4-Point Pharmacophore Method for Molecular Similarity and Diversity Applications: Overview of the Method and Applications, Including a Novel Approach to the Design

More information

Introduction. OntoChem

Introduction. OntoChem Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads

More information

User Guide for LeDock

User Guide for LeDock User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,

More information

Studying the effect of noise on Laplacian-modified Bayesian Analysis and Tanimoto Similarity

Studying the effect of noise on Laplacian-modified Bayesian Analysis and Tanimoto Similarity Studying the effect of noise on Laplacian-modified Bayesian nalysis and Tanimoto Similarity David Rogers, Ph.D. SciTegic, Inc. (Division of ccelrys, Inc.) drogers@scitegic.com Description of: nalysis methods

More information

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design Bela Torok Department of Chemistry University of Massachusetts Boston Boston, MA 1 Computer Aided Drug Design - Introduction Development

More information

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013 Chemical Space Space, Diversity, and Synthesis Jeremy Henle, 4/23/2013 Computational Modeling Chemical Space As a diversity construct Outline Quantifying Diversity Diversity Oriented Synthesis Wolf and

More information

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites Introduction J. Andrew Surface Hampden-Sydney College / Virginia Commonwealth University In the past several decades

More information

Spatial chemical distance based on atomic property fields

Spatial chemical distance based on atomic property fields J Comput Aided Mol Des (2010) 24:173 182 DOI 10.1007/s10822-009-9316-x Spatial chemical distance based on atomic property fields A. V. Grigoryan I. Kufareva M. Totrov R. A. Abagyan Received: 20 September

More information

Molecular Similarity Searching Using Inference Network

Molecular Similarity Searching Using Inference Network Molecular Similarity Searching Using Inference Network Ammar Abdo, Naomie Salim* Faculty of Computer Science & Information Systems Universiti Teknologi Malaysia Molecular Similarity Searching Search for

More information

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös KIME-based scoring functions in Muse 3.0 KIME User Group Meeting 2013 Fabian Bös Certara Mission: End-to-End Model-Based Drug Development Certara was formed by acquiring and integrating Tripos, Pharsight,

More information

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre Dr. Sander B. Nabuurs Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre The road to new drugs. How to find new hits? High Throughput

More information

Chemical Space: Modeling Exploration & Understanding

Chemical Space: Modeling Exploration & Understanding verview Chemical Space: Modeling Exploration & Understanding Rajarshi Guha School of Informatics Indiana University 16 th August, 2006 utline verview 1 verview 2 3 CDK R utline verview 1 verview 2 3 CDK

More information

Similarity methods for ligandbased virtual screening

Similarity methods for ligandbased virtual screening Similarity methods for ligandbased virtual screening Peter Willett, University of Sheffield Computers in Scientific Discovery 5, 22 nd July 2010 Overview Molecular similarity and its use in virtual screening

More information

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells

More information

Generating Small Molecule Conformations from Structural Data

Generating Small Molecule Conformations from Structural Data Generating Small Molecule Conformations from Structural Data Jason Cole cole@ccdc.cam.ac.uk Cambridge Crystallographic Data Centre 1 The Cambridge Crystallographic Data Centre About us A not-for-profit,

More information

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures

More information

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering Fragment based drug discovery in teams of medicinal and computational chemists Carsten Detering BioSolveIT Quick Facts Founded in 2001 by the developers of FlexX ~20 people Core expertise: docking, screening,

More information

Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design

Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design J. Chem. Inf. Comput. Sci. 1999, 39, 569-574 569 Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design Malcolm J. McGregor and Steven M. Muskal* Affymax Research Institute, 3410

More information

5.1. Hardwares, Softwares and Web server used in Molecular modeling

5.1. Hardwares, Softwares and Web server used in Molecular modeling 5. EXPERIMENTAL The tools, techniques and procedures/methods used for carrying out research work reported in this thesis have been described as follows: 5.1. Hardwares, Softwares and Web server used in

More information

Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening

Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening J. Chem. Inf. Model. 2008, 48, 1773 1791 1773 Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening I-Jen Chen* and Nicolas

More information

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors

A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.

More information

Computational chemical biology to address non-traditional drug targets. John Karanicolas

Computational chemical biology to address non-traditional drug targets. John Karanicolas Computational chemical biology to address non-traditional drug targets John Karanicolas Our computational toolbox Structure-based approaches Ligand-based approaches Detailed MD simulations 2D fingerprints

More information

Structure-Activity Modeling - QSAR. Uwe Koch

Structure-Activity Modeling - QSAR. Uwe Koch Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity

More information

Machine Learning Concepts in Chemoinformatics

Machine Learning Concepts in Chemoinformatics Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics

More information

has its own advantages and drawbacks, depending on the questions facing the drug discovery.

has its own advantages and drawbacks, depending on the questions facing the drug discovery. 2013 First International Conference on Artificial Intelligence, Modelling & Simulation Comparison of Similarity Coefficients for Chemical Database Retrieval Mukhsin Syuib School of Information Technology

More information

Conformational Searching using MacroModel and ConfGen. John Shelley Schrödinger Fellow

Conformational Searching using MacroModel and ConfGen. John Shelley Schrödinger Fellow Conformational Searching using MacroModel and ConfGen John Shelley Schrödinger Fellow Overview Types of conformational searching applications MacroModel s conformation generation procedure General features

More information

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No

Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No Ligand-based QSAR Studies on the Indolinones Derivatives Bull. Korean Chem. Soc. 2004, Vol. 25, No. 12 1801 Ligand-based QSAR Studies on the Indolinones Derivatives as Inhibitors of the Protein Tyrosine

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing

Analysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning and Simulated Annealing Student: Ke Zhang MBMA Committee: Dr. Charles E. Smith (Chair) Dr. Jacqueline M. Hughes-Oliver

More information

Alkane/water partition coefficients and hydrogen bonding. Peter Kenny

Alkane/water partition coefficients and hydrogen bonding. Peter Kenny Alkane/water partition coefficients and hydrogen bonding Peter Kenny (pwk.pub.2008@gmail.com) Neglect of hydrogen bond strength: A recurring theme in medicinal chemistry Rule of 5 Rule of 3 Scoring functions

More information

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics

More information

1. Some examples of coping with Molecular informatics data legacy data (accuracy)

1. Some examples of coping with Molecular informatics data legacy data (accuracy) Molecular Informatics Tools for Data Analysis and Discovery 1. Some examples of coping with Molecular informatics data legacy data (accuracy) 2. Database searching using a similarity approach fingerprints

More information

Data Mining in the Chemical Industry. Overview of presentation

Data Mining in the Chemical Industry. Overview of presentation Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry

More information

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments BioSolveIT Biology Problems Solved using Information Technology A Combinatorial Approach for andling of Protonation and Tautomer Ambiguities in Docking Experiments Ingo Dramburg BioSolve IT Gmb An der

More information

Extended examples and description of various jcompoundmapper fingerprints in string format

Extended examples and description of various jcompoundmapper fingerprints in string format Extended examples and description of various jcompoundmapper fingerprints in string format Figure 1: The geometry and topology of Oxaceprol. Pharmacophore types shown in the 3D structure are 1=[L], 3=[A

More information

Fast similarity searching making the virtual real. Stephen Pickett, GSK

Fast similarity searching making the virtual real. Stephen Pickett, GSK Fast similarity searching making the virtual real Stephen Pickett, GSK Introduction Introduction to similarity searching Use cases Why is speed so crucial? Why MadFast? Some performance stats Implementation

More information

Supplementary information

Supplementary information Electronic Supplementary Material (ESI) for MedChemComm. This journal is The Royal Society of Chemistry 2017 Supplementary information Identification of steroid-like natural products as potent antiplasmodial

More information

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK

Chemoinformatics and information management. Peter Willett, University of Sheffield, UK Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics

More information

Data Quality Issues That Can Impact Drug Discovery

Data Quality Issues That Can Impact Drug Discovery Data Quality Issues That Can Impact Drug Discovery Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry,

More information

The Schrödinger KNIME extensions

The Schrödinger KNIME extensions The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application

More information

Chemical Similarity Searching

Chemical Similarity Searching J. Chem. Inf. Comput. Sci. 1998, 38, 983-996 983 Chemical Similarity Searching Peter Willett* Krebs Institute for Biomolecular Research and Department of Information Studies, University of Sheffield, Sheffield

More information

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA Pharmacophore Model Development for the Identification of Novel Acetylcholinesterase Inhibitors Edwin Kamau Dept Chem & Biochem Kennesa State Uni ersit Kennesa GA 30144 Dept. Chem. & Biochem. Kennesaw

More information

Biologically Relevant Molecular Comparisons. Mark Mackey

Biologically Relevant Molecular Comparisons. Mark Mackey Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist

More information

Hydrogen Bonding & Molecular Design Peter

Hydrogen Bonding & Molecular Design Peter Hydrogen Bonding & Molecular Design Peter Kenny(pwk.pub.2008@gmail.com) Hydrogen Bonding in Drug Discovery & Development Interactions between drug and water molecules (Solubility, distribution, permeability,

More information

tconcoord-gui: Visually Supported Conformational Sampling of Bioactive Molecules

tconcoord-gui: Visually Supported Conformational Sampling of Bioactive Molecules Software News and Updates tconcoord-gui: Visually Supported Conformational Sampling of Bioactive Molecules DANIEL SEELIGER, BERT L. DE GROOT Computational Biomolecular Dynamics Group, Max-Planck-Institute

More information

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification Chris Radoux www.ccdc.cam.ac.uk radoux@ccdc.cam.ac.uk 1 Introduction Hotspots Strongly attractive to organic molecules Organic molecules

More information

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies

Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Dissertation zur Erlangung des Doktorgrades (Dr. rer. nat.) der Mathematisch-aturwissenschaftlichen Fakultät der Rheinischen

More information

CHEM1102 Worksheet 4 Answers to Critical Thinking Questions Model 1: Infrared (IR) Spectroscopy

CHEM1102 Worksheet 4 Answers to Critical Thinking Questions Model 1: Infrared (IR) Spectroscopy CEM1102 Worksheet 4 Answers to Critical Thinking Questions Model 1: Infrared (IR) Spectroscopy 1. See below. Model 2: UV-Visible Spectroscopy 1. See below. 2. All of the above. 3. Restricted to the identification

More information

Spin-Spin Coupling. H b1 H 3 C C Br. Review: 1 H- 1 H Coupling

Spin-Spin Coupling. H b1 H 3 C C Br. Review: 1 H- 1 H Coupling Review: 1-1 Coupling b1 3 C C Br b2 multiplicity: n + 1 rule can determine peak intensities by considering nuclear spin probabilities on adjacent hydrogens or use Pascal's triangle Coupling Constants (J)

More information

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data

More information

Exploring symmetry related bias in conformational data from the Cambridge Structural Database: A rare phenomenon?

Exploring symmetry related bias in conformational data from the Cambridge Structural Database: A rare phenomenon? Exploring symmetry related bias in conformational data from the Cambridge Structural Database: A rare phenomenon? Aim To explore some well known cases where symmetry effects bias the distribution of conformational

More information

Clustering Ambiguity: An Overview

Clustering Ambiguity: An Overview Clustering Ambiguity: An Overview John D. MacCuish Norah E. MacCuish 3 rd Joint Sheffield Conference on Chemoinformatics April 23, 2004 Outline The Problem: Clustering Ambiguity and Chemoinformatics Preliminaries:

More information

Lossless Compression of Chemical Fingerprints Using Integer Entropy Codes Improves Storage and Retrieval

Lossless Compression of Chemical Fingerprints Using Integer Entropy Codes Improves Storage and Retrieval Lossless Compression of Chemical Fingerprints Using Integer Entropy Codes Improves Storage and Retrieval Pierre Baldi,*,, Ryan W. Benz, Daniel S. Hirschberg, and S. Joshua Swamidass Institute for Genomics

More information

DOCKING TUTORIAL. A. The docking Workflow

DOCKING TUTORIAL. A. The docking Workflow 2 nd Strasbourg Summer School on Chemoinformatics VVF Obernai, France, 20-24 June 2010 E. Kellenberger DOCKING TUTORIAL A. The docking Workflow 1. Ligand preparation It consists in the standardization

More information

Virtual screening in drug discovery

Virtual screening in drug discovery Virtual screening in drug discovery Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery

More information

Spacer conformation in biologically active molecules*

Spacer conformation in biologically active molecules* Pure Appl. Chem., Vol. 76, No. 5, pp. 959 964, 2004. 2004 IUPAC Spacer conformation in biologically active molecules* J. Karolak-Wojciechowska and A. Fruziński Institute of General and Ecological Chemistry,

More information

Universities of Leeds, Sheffield and York

Universities of Leeds, Sheffield and York promoting access to White Rose research papers Universities of Leeds, Sheffield and York http://eprints.whiterose.ac.uk/ This is an author produced version of a paper published in Quantitative structure

More information

Exploring the black box: structural and functional interpretation of QSAR models.

Exploring the black box: structural and functional interpretation of QSAR models. EMBL-EBI Industry workshop: In Silico ADMET prediction 4-5 December 2014, Hinxton, UK Exploring the black box: structural and functional interpretation of QSAR models. (Automatic exploration of datasets

More information

Using AutoDock for Virtual Screening

Using AutoDock for Virtual Screening Using AutoDock for Virtual Screening CUHK Croucher ASI Workshop 2011 Stefano Forli, PhD Prof. Arthur J. Olson, Ph.D Molecular Graphics Lab Screening and Virtual Screening The ultimate tool for identifying

More information

Cheminformatics analysis and learning in a data pipelining environment

Cheminformatics analysis and learning in a data pipelining environment Molecular Diversity (2006) 10: 283 299 DOI: 10.1007/s11030-006-9041-5 c Springer 2006 Review Cheminformatics analysis and learning in a data pipelining environment Moises Hassan 1,, Robert D. Brown 1,

More information

Chemical Databases: Encoding, Storage and Search of Chemical Structures

Chemical Databases: Encoding, Storage and Search of Chemical Structures Chemical Databases: Encoding, Storage and Search of Chemical Structures Dr. Timur I. Madzhidov Kazan Federal University, Department of Organic Chemistry * Ray, L.C. and R.A. Kirsch, Finding Chemical Records

More information

Developing CAS Products for Substructure Searching by Chemists. Linda Toler

Developing CAS Products for Substructure Searching by Chemists. Linda Toler Developing CAS Products for Substructure Searching by Chemists Linda Toler Developing CAS Products for Substructure Searching Evolution of the CAS Registry Development of substructure searching for CAS

More information

Bioinformatics Workshop - NM-AIST

Bioinformatics Workshop - NM-AIST Bioinformatics Workshop - NM-AIST Day 3 Introduction to Drug/Small Molecule Discovery Thomas Girke July 25, 2012 Bioinformatics Workshop - NM-AIST Slide 1/44 Introduction CMP Structure Formats Similarity

More information

NUCLEAR MAGNETIC RESONANCE AND INTRODUCTION TO MASS SPECTROMETRY

NUCLEAR MAGNETIC RESONANCE AND INTRODUCTION TO MASS SPECTROMETRY NUCLEAR MAGNETIC RESONANCE AND INTRODUCTION TO MASS SPECTROMETRY A STUDENT SHOULD BE ABLE TO: 1. Identify and explain the processes involved in proton ( 1 H) and carbon-13 ( 13 C) nuclear magnetic resonance

More information

Analyzing Molecular Conformations Using the Cambridge Structural Database. Jason Cole Cambridge Crystallographic Data Centre

Analyzing Molecular Conformations Using the Cambridge Structural Database. Jason Cole Cambridge Crystallographic Data Centre Analyzing Molecular Conformations Using the Cambridge Structural Database Jason Cole Cambridge Crystallographic Data Centre 1 The Cambridge Structural Database (CSD) 905,284* USOPEZ a natural product intermediate,

More information

This is a repository copy of Chemoinformatics techniques for data mining in files of two-dimensional and three-dimensional chemical molecules.

This is a repository copy of Chemoinformatics techniques for data mining in files of two-dimensional and three-dimensional chemical molecules. This is a repository copy of Chemoinformatics techniques for data mining in files of two-dimensional and three-dimensional chemical molecules. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/8425/

More information

Ultra High Throughput Screening using THINK on the Internet

Ultra High Throughput Screening using THINK on the Internet Ultra High Throughput Screening using THINK on the Internet Keith Davies Central Chemistry Laboratory, Oxford University Cathy Davies Treweren Consultants, UK Blue Sky Objectives Reduce Development Failures

More information

ChemAxon. Content. By György Pirok. D Standardization D Virtual Reactions. D Fragmentation. ChemAxon European UGM Visegrad 2008

ChemAxon. Content. By György Pirok. D Standardization D Virtual Reactions. D Fragmentation. ChemAxon European UGM Visegrad 2008 Transformers f off ChemAxon By György Pirok Content Standardization Virtual Reactions Metabolism M b li P Prediction di i Fragmentation 2 1 Standardization http://www.chemaxon.com/jchem/doc/user/standardizer.html

More information

Functional Group Fingerprints CNS Chemistry Wilmington, USA

Functional Group Fingerprints CNS Chemistry Wilmington, USA Functional Group Fingerprints CS Chemistry Wilmington, USA James R. Arnold Charles L. Lerman William F. Michne James R. Damewood American Chemical Society ational Meeting August, 2004 Philadelphia, PA

More information

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC The precise definition of a dihedral or torsion angle can be found in spatial geometry Angle between to planes Dihedral

More information

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions Van der Waals Interactions

More information

Using NMR and IR Spectroscopy to Determine Structures Dr. Carl Hoeger, UCSD

Using NMR and IR Spectroscopy to Determine Structures Dr. Carl Hoeger, UCSD Using NMR and IR Spectroscopy to Determine Structures Dr. Carl Hoeger, UCSD The following guidelines should be helpful in assigning a structure from NMR (both PMR and CMR) and IR data. At the end of this

More information

In Silico Design of New Drugs for Myeloid Leukemia Treatment

In Silico Design of New Drugs for Myeloid Leukemia Treatment In Silico Design of New Drugs for Myeloid Leukemia Treatment Washington Pereira and Ihosvany Camps Computational Modeling Laboratory LaModel Exact Science Institute ICEx Federal University of Alfenas -

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

Canonical Line Notations

Canonical Line Notations Canonical Line otations InChI vs SMILES Krisztina Boda verview Compound naming InChI SMILES Molecular equivalency Isomorphism Kekule Tautomers Finding duplicates What s Your ame? 1. Unique numbers CAS

More information

Performing a Pharmacophore Search using CSD-CrossMiner

Performing a Pharmacophore Search using CSD-CrossMiner Table of Contents Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Searching with a Pharmacophore... 4 Performing a Pharmacophore Search using CSD-CrossMiner Version 2.0

More information

Chemical library design

Chemical library design Chemical library design Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery Today,

More information

Virtual affinity fingerprints in drug discovery: The Drug Profile Matching method

Virtual affinity fingerprints in drug discovery: The Drug Profile Matching method Ágnes Peragovics Virtual affinity fingerprints in drug discovery: The Drug Profile Matching method PhD Theses Supervisor: András Málnási-Csizmadia DSc. Associate Professor Structural Biochemistry Doctoral

More information

Ligand-receptor interactions

Ligand-receptor interactions University of Silesia, Katowice, Poland 11 22 March 2013 Ligand-receptor interactions Dr. Pavel Polishchuk A.V. Bogatsky Physico-Chemical Institute of National Academy of Sciences of Ukraine Odessa, Ukraine

More information

Different conformations of the drugs within the virtual library of FDA approved drugs will be generated.

Different conformations of the drugs within the virtual library of FDA approved drugs will be generated. Chapter 3 Molecular Modeling 3.1. Introduction In this study pharmacophore models will be created to screen a virtual library of FDA approved drugs for compounds that may inhibit MA-A and MA-B. The virtual

More information

A single crystal investigation of L-tryptophan with Z = 16

A single crystal investigation of L-tryptophan with Z = 16 1 A single crystal investigation of L-tryptophan with Z = 16 Carl Henrik Görbitz, Karl Wilhelm Törnroos and Graeme Day Supplementary material 1. Figure 1S (below). Overlay of the eight molecules A, B,

More information

Expanded Interaction Fingerprint Method for Analyzing Ligand Binding Modes in Docking and Structure-Based Drug Design

Expanded Interaction Fingerprint Method for Analyzing Ligand Binding Modes in Docking and Structure-Based Drug Design 1942 J. Chem. Inf. Comput. Sci. 2004, 44, 1942-1951 Expanded Interaction Fingerprint Method for Analyzing Ligand Binding Modes in Docking and Structure-Based Drug Design Matthew D. Kelly and Ricardo L.

More information

Tautomerism in chemical information management systems

Tautomerism in chemical information management systems Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4

More information

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences Fragment based lead discovery - introduction György M. Keserű H2020 FRAGET etwork Hungarian Academy of Sciences www.fragnet.eu Hit discovery from screening Druglike library Fragment library Large molecules

More information

Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses

Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc,

More information

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK Molecular Modelling Computational Chemistry Demystified Peter Bladon Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK John E. Gorton Gorton Systems, Glasgow, UK Robert B. Hammond Institute

More information

Chemistry 605 (Reich)

Chemistry 605 (Reich) Chemistry 60 (Reich) SECD UR EXAM Sat. April 9, 0 Question/oints R-A /0 R-B / R-C /0 R-D / R-E /0 R-F /0 Total /00 Average 6 i 9 Mode Median 6 AB 7 BC Distribution from grade list (average: 6.; count:

More information

Efficient overlay of molecular 3-D pharmacophores

Efficient overlay of molecular 3-D pharmacophores Efficient overlay of molecular 3D pharmacophores Gerhard Wolber*, Alois A. Dornhofer & Thierry Langer * EMail: wolber@inteligand.com Superposition of molecules 1 Alignment: Outline Scope, design goals

More information

Docking. GBCB 5874: Problem Solving in GBCB

Docking. GBCB 5874: Problem Solving in GBCB Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular

More information

Analyzing Building Blocks Diversity for DNA Encoded Library Design. Cresset User Group Meeting Nik Stiefl & Finton Sirockin, Novartis

Analyzing Building Blocks Diversity for DNA Encoded Library Design. Cresset User Group Meeting Nik Stiefl & Finton Sirockin, Novartis Analyzing Building Blocks Diversity for DA Encoded Library Design Cresset User Group Meeting ik Stiefl & Finton Sirockin, ovartis 2016.06.16 Outline DA Encoded Libraries (DEL) Building blocks selection

More information

Chapter 6 Principles of Stereochemistry

Chapter 6 Principles of Stereochemistry 6.1 (a) This compound is chiral. Methane is achiral. Instructor Supplemental Solutions to Problems 2010 Roberts and Company Publishers Chapter 6 Principles of Stereochemistry Solutions to In-Text Problems

More information

Evaluation of Molecular Similarity and Molecular Diversity Methods Using Biological Activity Data

Evaluation of Molecular Similarity and Molecular Diversity Methods Using Biological Activity Data 2 Evaluation of Molecular Similarity and Molecular Diversity Methods Using Biological Activity Data Peter Willett Abstract This chapter reviews the techniques available for quantifying the effectiveness

More information

Analyzing Small Molecule Data in R

Analyzing Small Molecule Data in R Analyzing Small Molecule Data in R Tyler Backman and Thomas Girke December 12, 2011 Analyzing Small Molecule Data in R Slide 1/49 Introduction CMP Structure Formats Similarity Searching Background Fragment

More information