An Integrated Approach to in-silico
|
|
- Robert Stanley
- 5 years ago
- Views:
Transcription
1 An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc Catalina St., San Leandro, CA 94577
2 Goals of in-silico screening Lead discovery/design Lead optimization Maximize potency Minimize toxicity Optimize ADME characteristics Fail-early paradigm - try to predict clinical success/failure Maximize use of prior information
3 Challenges of in-silico screening Combinatorial explosion of chemical and biological data Must insure data currency, validity, cleanliness Must accommodate multiple heterogeneoous sources of data Wide expanse of chemical space dictates a wide variety of tools to analyze them Need an integrated chemical informatics framework to utilize the tools efficiently and conveniently
4 Chemical Informatics Framework Databases, data marts, and data warehouses of chemical structure information - and the software to build and manage them (including in-house, commercial, and internet data sources) Software tools to extract, normalize, enumerate, and compute properties from structures Analysis applications to visualize, model, and predict activity from structure (structural data mining)
5 Chemical Informatics: : Database Chemical Warehousing - e.g., Reagent Selector, Compound Warehouse Oracle 8i architecture Star Schema - One or more large fact table(s) linked to several smaller dimension tables Serviced by schedulers and database daemon processes Scalable from project data mart (100K structures) to corporate warehouse (10M structures) Multiple structure sources, including commercial databases (MDDR, ACD, Beilstein), in-house, and online data sources
6 Chemical Informatics: : Software Tools Cheshire - MDL chemical scripting language Object-oriented, Java-like syntax - associative arrays for structures, queries, data; collections Perceive and set atom, bond properties, rings, fragments, sgroup data Supports mapping and structure-walking applications Exposes MDL s SSKeys; allows user-defined similarity searches
7 Cheshire: An Example Purpose: normalize nitro group representation function no2s() { var q1=createmol( N+(=O)-O) ); var ic = 0; if ((c=map(q1)).count() > 0 ) { c.set(a_charge, 0); c.set(b_type,b_type_double); ic = ic + 1; } return (ic); }
8 Topological Distances
9 Gasteiger Charges
10 Common Substructures Algorithm O O C H C O C O O O C H C C O N O N O O Find the Stereocenters Collect the Stereocenters, atoms alpha to them, rings connected to them Compare the new structure with those already collected. Is it unique? (No: Add a hit to the list) Is it a substructure of an already existing structure (or vise versa)? (Yes: Add a hit to the substructure's list, remove the "superstructure"; No: Add it to the list)
11 Structural Diversity
12 SSKeys Structure-based keys 166 bits - subset of unique and identified features 960 bits - optimized for searching; contains many-to-one mappings 2275 bits - many-to-one mappings removed
13 Modal Fingerprints Capture information about key bits set by a collection of molecules threshold weighting Similarity searches N(q&t)/N(q) ( sub -similarity) N(q&t)/N(q t) (Tanimoto) see also: Shemetulskis, et al., J. Chem. Inf. Comput. Sci. 1996, 36,
14 Chemical Informatics: : Analysis Application Visualization Unsupervised data mining - clustering Supervised data mining - classification, decision trees, recursive partitioning QSAR modeling - regression, neural networks, genetic algorithms Pharmacophore searching and docking
15 Chemical Informatics: : Clustering Goal: rapid, simple UI, select cluster count Possibilities: hierarchical, Jarvis-Patrick, monothetic, kmeans, binning Choice: Sample-based kmeans Fast, scalable, controllable, use 166 keys Published algorithm: L. Kauffman & P. J. Rousseeuw, Finding Groups in Data, Wiley, 1991 (win-
16 Chemical Informatics: : Classification Goal: rapid, simple UI, interpretable results Possibilities: C4.5, FIRM, CART, various Bayesian methods, etc. Choices: C4.5/C4.5rules, FIRM Available, fast, simple I/O Published algorithms: C4.5: J. R. Quinlan, C4.5, Morgan-Kaufmann, FIRM: D. Hawkins, UMN Statistics Dept. (ftp://ftp.stat.umn.edu/pub/firm/ibmfirm.exe) Others: WEKA - I. H. Witten et al (pure JAVA) (
17 Example: 5-HT Antagonists 3 Bind to ligand-gated ion-channel receptor for serotonin. Therapeutically useful in treatment of emesis, anxiety, schizophrenia, drug dependence, and age-related memory loss. Often show mixed agonist/antagonist properties, depending on site of action. Example refs: Cappelli, et.al., J. Med. Chem., 1998, 41, Rival, et.al., J. Med. Chem., 1998, 41,
18 Approaches to Library Construction Reagent Based Product Based
19 Reagent Based Library Construction Build around a known scaffold N N N + Y. Rival, et al., J. Med. Chem. 1998, 41,
20 Arylpiperazine Interactions Limited Space Attractive Short- Range interactions Aromatic Interactions Limited Space N H O N O N + O Limited Space
21 One-Step Chemistry + N Cl N N N N N
22 Diversity Around The Ring ACD Available Chemicals Directory R2 R1 70 Imidoyl chlorides 26 Mw < 250 Small molecular volume substitution at R2 & R3 Electron withdrawing & donating substitution at R1 R3 N Cl
23 Substructure Search Selecting Reagents Filter on M. Wt. Cluster Filtered Reagents
24 Product Based Library Construction Use modal fingerprints to characterize active compounds Choose a backbone (quinoline) for the compounds Search ACDSC for quinoline-containing molecules; refine the search with the modal fingerprint
25 Clustering Extracted Structures Cluster Analysis on substructure keys Drilldown into cluster
26 Decision Tree Classification Method: Select HT3 antagonists from MDDR20001 Select an equal number of most similar structures lacking reported 5-HT3 activity Use 166 User keys as descriptors Run a variety of classification programs, using 10-fold cross validation Example results: Method % correct Number of Rules C C45Rules 68 2 Ripper 85 6 Naïve Bayes Regression 77 --
27 Classification Results Training Set Classification Results: 166 Keys 960 Keys Method correct # rules correct # rules C4.5 81% 80 84% 67 C4.5Rules FIRM structures (204 active, 198 inactive ) Default program parameters 10-fold cross-validation results
28 Library Evaluation ACDSC - 1.4M structure screening database Pick structures similar to 960-key modal f.p. and those similar to 2275-key modal f.p. Test structures against decision tree models Number of structures predicted to be active: Similarity basis Model 960 key m.f.p key m.f.p. C / /3067 FIRM 193/ /3067
29 Conclusions 166 and 960-key sets performed similarly; 960 keys were marginally better for training set predictions. Decision tree performance correlated with rule-size. FIRM gave the best performance/rule-size ratio. Very little overlap was observed between keys selected by the decision tree programs and those selected in the modal fingerprints - suggesting the approaches are complementary. The 960-key set can be expanded to 2275 unrolled keys; similarity to these keys and the 960-key set yields different selections of structures, which behave differently in decision tree predictions.
30 Future Work Run classifications using the 2275-key set. Investigate other classifiers (knn, Bayesian, CART, etc.) Investigate relationship between similarity to modal fingerprint and level of biological activity. Develop classification and fingerprint models for ADME properties. Utilize query features in modal fingerprints....
31 Acknowledgments Doug Henry, David Evans, and Maurizio Bronzetti Richard Briggs, Burt Leland, and the rest of the Cheshire Team Ali Ozkabak and the rest of the Reagent Selector team
Machine Learning Concepts in Chemoinformatics
Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics
More informationIntroduction. OntoChem
Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads
More informationData Mining in the Chemical Industry. Overview of presentation
Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry
More informationVirtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME
Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells
More informationStructure-Activity Modeling - QSAR. Uwe Koch
Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity
More informationA Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors
A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.
More informationMachine learning for ligand-based virtual screening and chemogenomics!
Machine learning for ligand-based virtual screening and chemogenomics! Jean-Philippe Vert Institut Curie - INSERM U900 - Mines ParisTech In silico discovery of molecular probes and drug-like compounds:
More informationComputational chemical biology to address non-traditional drug targets. John Karanicolas
Computational chemical biology to address non-traditional drug targets John Karanicolas Our computational toolbox Structure-based approaches Ligand-based approaches Detailed MD simulations 2D fingerprints
More informationChemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller
Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics
More informationNavigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland
Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures
More informationNext Generation Computational Chemistry Tools to Predict Toxicity of CWAs
Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National
More informationDrug Informatics for Chemical Genomics...
Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison
More informationChemoinformatics and information management. Peter Willett, University of Sheffield, UK
Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics
More informationRapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value
Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction
More informationSCULPT 3.0. Using SCULPT to Gain Competitive Insights. Brings 3D Visualization to the Lab Bench SPECIAL REPORT. 4 Molecular Connection Fall 1999
SPECIAL REPORT SCULPT 3.0 Brings 3D Visualization to the Lab Bench ith the acquisition of Interactive Simulations, Inc., last April, MDL added to its arsenal of discovery informatics tools a powerful analysis
More informationDivCalc: A Utility for Diversity Analysis and Compound Sampling
Molecules 2002, 7, 657-661 molecules ISSN 1420-3049 http://www.mdpi.org DivCalc: A Utility for Diversity Analysis and Compound Sampling Rajeev Gangal* SciNova Informatics, 161 Madhumanjiri Apartments,
More informationSimilarity Search. Uwe Koch
Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance
More informationKNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös
KIME-based scoring functions in Muse 3.0 KIME User Group Meeting 2013 Fabian Bös Certara Mission: End-to-End Model-Based Drug Development Certara was formed by acquiring and integrating Tripos, Pharsight,
More informationhas its own advantages and drawbacks, depending on the questions facing the drug discovery.
2013 First International Conference on Artificial Intelligence, Modelling & Simulation Comparison of Similarity Coefficients for Chemical Database Retrieval Mukhsin Syuib School of Information Technology
More informationQSAR in Green Chemistry
QSAR in Green Chemistry Activity Relationship QSAR is the acronym for Quantitative Structure-Activity Relationship Chemistry is based on the premise that similar chemicals will behave similarly The behavior/activity
More informationContents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics
Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure
More informationJCICS Major Research Areas
JCICS Major Research Areas Chemical Information Text Searching Structure and Substructure Searching Databases Patents George W.A. Milne C571 Lecture Fall 2002 1 JCICS Major Research Areas Chemical Computation
More informationCondensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule
Condensed Graph of Reaction: considering a chemical reaction as one single pseudo molecule Frank Hoonakker 1,3, Nicolas Lachiche 2, Alexandre Varnek 3, and Alain Wagner 3,4 1 Chemoinformatics laboratory,
More informationThe Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc
The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company Sally Rose BioFocus plc Overview History of BioFocus and acquisition of CDD Biological
More informationExploring the black box: structural and functional interpretation of QSAR models.
EMBL-EBI Industry workshop: In Silico ADMET prediction 4-5 December 2014, Hinxton, UK Exploring the black box: structural and functional interpretation of QSAR models. (Automatic exploration of datasets
More informationIntroduction to Chemoinformatics and Drug Discovery
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.
More informationAnalysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing
Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning and Simulated Annealing Student: Ke Zhang MBMA Committee: Dr. Charles E. Smith (Chair) Dr. Jacqueline M. Hughes-Oliver
More informationChemical library design
Chemical library design Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery Today,
More informationDevelopment of a Structure Generator to Explore Target Areas on Chemical Space
Development of a Structure Generator to Explore Target Areas on Chemical Space Kimito Funatsu Department of Chemical System Engineering, This materials will be published on Molecular Informatics Drug Development
More information1. Some examples of coping with Molecular informatics data legacy data (accuracy)
Molecular Informatics Tools for Data Analysis and Discovery 1. Some examples of coping with Molecular informatics data legacy data (accuracy) 2. Database searching using a similarity approach fingerprints
More informationThe Schrödinger KNIME extensions
The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application
More informationPractical QSAR and Library Design: Advanced tools for research teams
DS QSAR and Library Design Webinar Practical QSAR and Library Design: Advanced tools for research teams Reservationless-Plus Dial-In Number (US): (866) 519-8942 Reservationless-Plus International Dial-In
More informationIn Silico Investigation of Off-Target Effects
PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics
More informationBuilding innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery
Building innovative drug discovery alliances Just in KIME: Successful Process Driven Drug Discovery Berlin KIME Spring Summit, Feb 2016 Research Informatics @ Evotec Evotec s worldwide operations 2 Pharmaceuticals
More informationChemical Space: Modeling Exploration & Understanding
verview Chemical Space: Modeling Exploration & Understanding Rajarshi Guha School of Informatics Indiana University 16 th August, 2006 utline verview 1 verview 2 3 CDK R utline verview 1 verview 2 3 CDK
More informationPlan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.
Plan Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Exercise: Example and exercise with herg potassium channel: Use of
More informationFast similarity searching making the virtual real. Stephen Pickett, GSK
Fast similarity searching making the virtual real Stephen Pickett, GSK Introduction Introduction to similarity searching Use cases Why is speed so crucial? Why MadFast? Some performance stats Implementation
More informationDe Novo molecular design with Deep Reinforcement Learning
De Novo molecular design with Deep Reinforcement Learning @olexandr Olexandr Isayev, Ph.D. University of North Carolina at Chapel Hill olexandr@unc.edu http://olexandrisayev.com About me Ph.D. in Chemistry
More informationPriority Setting of Endocrine Disruptors Using QSARs
Priority Setting of Endocrine Disruptors Using QSARs Weida Tong Manager of Computational Science Group, Logicon ROW Sciences, FDA s National Center for Toxicological Research (NCTR), U.S.A. Thanks for
More informationMedicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted?
Medicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted? J. Guy Breitenbucher Streamlining Drug Discovery Conference San Francisco, CA ct. 25, 2018 Additivity as the
More informationStructural biology and drug design: An overview
Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved
More informationest Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today
est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today Sign up for FREE GPU Test Drive on remotely hosted clusters www.nvidia.com/gputestd rive Shape Searching
More informationDesign and Synthesis of the Comprehensive Fragment Library
YOUR INNOVATIVE CHEMISTRY PARTNER IN DRUG DISCOVERY Design and Synthesis of the Comprehensive Fragment Library A 3D Enabled Library for Medicinal Chemistry Discovery Warren S Wade 1, Kuei-Lin Chang 1,
More informationBiologically Relevant Molecular Comparisons. Mark Mackey
Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist
More informationEMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS
EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS PETER GUND Pharmacopeia Inc., CN 5350 Princeton, NJ 08543, USA pgund@pharmacop.com Empirical and theoretical approaches to drug discovery have often
More informationEasySDM: A Spatial Data Mining Platform
EasySDM: A Spatial Data Mining Platform (User Manual) Authors: Amine Abdaoui and Mohamed Ala Al Chikha, Students at the National Computing Engineering School. Algiers. June 2013. 1. Overview EasySDM is
More informationUltra High Throughput Screening using THINK on the Internet
Ultra High Throughput Screening using THINK on the Internet Keith Davies Central Chemistry Laboratory, Oxford University Cathy Davies Treweren Consultants, UK Blue Sky Objectives Reduce Development Failures
More informationComputational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004
Computational Methods and Drug-Likeness Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004 The Problem Drug development in pharmaceutical industry: >8-12 years time ~$800m costs >90% failure
More informationUsing Self-Organizing maps to accelerate similarity search
YOU LOGO Using Self-Organizing maps to accelerate similarity search Fanny Bonachera, Gilles Marcou, Natalia Kireeva, Alexandre Varnek, Dragos Horvath Laboratoire d Infochimie, UM 7177. 1, rue Blaise Pascal,
More informationAMRI COMPOUND LIBRARY CONSORTIUM: A NOVEL WAY TO FILL YOUR DRUG PIPELINE
AMRI COMPOUD LIBRARY COSORTIUM: A OVEL WAY TO FILL YOUR DRUG PIPELIE Muralikrishna Valluri, PhD & Douglas B. Kitchen, PhD Summary The creation of high-quality, innovative small molecule leads is a continual
More informationXia Ning,*, Huzefa Rangwala, and George Karypis
J. Chem. Inf. Model. XXXX, xxx, 000 A Multi-Assay-Based Structure-Activity Relationship Models: Improving Structure-Activity Relationship Models by Incorporating Activity Information from Related Targets
More informationIgnasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015
Ignasi Belda, PhD CEO HPC Advisory Council Spain Conference 2015 Business lines Molecular Modeling Services We carry out computational chemistry projects using our selfdeveloped and third party technologies
More informationHit Finding and Optimization Using BLAZE & FORGE
Hit Finding and Optimization Using BLAZE & FORGE Kevin Cusack,* Maria Argiriadi, Eric Breinlinger, Jeremy Edmunds, Michael Hoemann, Michael Friedman, Sami Osman, Raymond Huntley, Thomas Vargo AbbVie, Immunology
More informationComputational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007
Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based
More informationChemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility
Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility Dr. Wendy A. Warr http://www.warr.com Warr, W. A. A Short Review of Chemical Reaction Database Systems,
More informationDrug Design 2. Oliver Kohlbacher. Winter 2009/ QSAR Part 4: Selected Chapters
Drug Design 2 Oliver Kohlbacher Winter 2009/2010 11. QSAR Part 4: Selected Chapters Abt. Simulation biologischer Systeme WSI/ZBIT, Eberhard-Karls-Universität Tübingen Overview GRIND GRid-INDependent Descriptors
More informationStatistical concepts in QSAR.
Statistical concepts in QSAR. Computational chemistry represents molecular structures as a numerical models and simulates their behavior with the equations of quantum and classical physics. Available programs
More informationIn silico pharmacology for drug discovery
In silico pharmacology for drug discovery In silico drug design In silico methods can contribute to drug targets identification through application of bionformatics tools. Currently, the application of
More informationResearch Article. Chemical compound classification based on improved Max-Min kernel
Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(2):368-372 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Chemical compound classification based on improved
More informationThe Conformation Search Problem
Jon Sutter Senior Manager Life Sciences R&D jms@accelrys.com Jiabo Li Senior Scientist Life Sciences R&D jli@accelrys.com CAESAR: Conformer Algorithm based on Energy Screening and Recursive Buildup The
More information(S1) (S2) = =8.16
Formulae for Attributes As described in the manuscript, the first step of our method to create a predictive model of material properties is to compute attributes based on the composition of materials.
More informationKinome-wide Activity Models from Diverse High-Quality Datasets
Kinome-wide Activity Models from Diverse High-Quality Datasets Stephan C. Schürer*,1 and Steven M. Muskal 2 1 Department of Molecular and Cellular Pharmacology, Miller School of Medicine and Center for
More informationCOMPARISON OF SIMILARITY METHOD TO IMPROVE RETRIEVAL PERFORMANCE FOR CHEMICAL DATA
http://www.ftsm.ukm.my/apjitm Asia-Pacific Journal of Information Technology and Multimedia Jurnal Teknologi Maklumat dan Multimedia Asia-Pasifik Vol. 7 No. 1, June 2018: 91-98 e-issn: 2289-2192 COMPARISON
More informationThe use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration
The use of Design of Experiments to develop Efficient Arrays for SAR and Property Exploration Chris Luscombe, Computational Chemistry GlaxoSmithKline Summary of Talk Traditional approaches SAR Free-Wilson
More informationChemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013
Chemical Space Space, Diversity, and Synthesis Jeremy Henle, 4/23/2013 Computational Modeling Chemical Space As a diversity construct Outline Quantifying Diversity Diversity Oriented Synthesis Wolf and
More informationA Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery
AtomNet A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery Izhar Wallach, Michael Dzamba, Abraham Heifets Victor Storchan, Institute for Computational and
More informationFragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering
Fragment based drug discovery in teams of medicinal and computational chemists Carsten Detering BioSolveIT Quick Facts Founded in 2001 by the developers of FlexX ~20 people Core expertise: docking, screening,
More informationCS145: INTRODUCTION TO DATA MINING
CS145: INTRODUCTION TO DATA MINING 4: Vector Data: Decision Tree Instructor: Yizhou Sun yzsun@cs.ucla.edu October 10, 2017 Methods to Learn Vector Data Set Data Sequence Data Text Data Classification Clustering
More informationChemoinformatics and Drug Discovery
Molecules 2002, 7, 566-600 molecules ISSN 1420-3049 http://www.mdpi.org Review: Chemoinformatics and Drug Discovery Jun Xu* and Arnold Hagler Discovery Partners International, Inc., 9640 Towne Center Drive,
More informationClassification Using Decision Trees
Classification Using Decision Trees 1. Introduction Data mining term is mainly used for the specific set of six activities namely Classification, Estimation, Prediction, Affinity grouping or Association
More informationJonathan S. Mason,, Isabelle Morize, Paul R. Menard,*, Daniel L. Cheney, Christopher Hulme, and Richard F. Labaudiniere
J. Med. Chem. 1999, 42, 3251-3264 3251 New 4-Point Pharmacophore Method for Molecular Similarity and Diversity Applications: Overview of the Method and Applications, Including a Novel Approach to the Design
More informationMolecular Complexity Effects and Fingerprint-Based Similarity Search Strategies
Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Dissertation zur Erlangung des Doktorgrades (Dr. rer. nat.) der Mathematisch-aturwissenschaftlichen Fakultät der Rheinischen
More informationBuilding blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction
Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction Stefan Kuhn 1, Björn Egert 2, Steffen Neumann 2, Christoph Steinbeck 1European Bioinformatics Institute
More informationInformation Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann
Information Extraction from Chemical Images Discovery Knowledge & Informatics April 24 th, 2006 Dr. Available Chemical Information Textbooks Reports Patents Databases Scientific journals and publications
More informationFarewell, PipelinePilot Migrating the Exquiron cheminformatics platform to KNIME and the ChemAxon technology
Farewell, PipelinePilot Migrating the Exquiron cheminformatics platform to KNIME and the ChemAxon technology Serge P. Parel, PhD ChemAxon User Group Meeting, Budapest 21 st May, 2014 Outline Exquiron Who
More informationInduction of Decision Trees
Induction of Decision Trees Peter Waiganjo Wagacha This notes are for ICS320 Foundations of Learning and Adaptive Systems Institute of Computer Science University of Nairobi PO Box 30197, 00200 Nairobi.
More informationOverview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom
verview D-QSAR Definition Examples Features counts Topological indices D fingerprints and fragment counts R-group descriptors ow good are D descriptors in practice? Summary Peter Gedeck ovartis Institutes
More informationQSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression
APPLICATION NOTE QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression GAINING EFFICIENCY IN QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIPS ErbB1 kinase is the cell-surface receptor
More informationSimilarity methods for ligandbased virtual screening
Similarity methods for ligandbased virtual screening Peter Willett, University of Sheffield Computers in Scientific Discovery 5, 22 nd July 2010 Overview Molecular similarity and its use in virtual screening
More informationCheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS
Cheminformatics Role in Pharmaceutical Industry Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS Agenda The big picture for pharmaceutical industry Current technological/scientific issues Types
More informationPROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS
179 Molecular Informatics: Confronting Complexity, May 13 th - 16 th 2002, Bozen, Italy PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS CARLETON R. SAGE, KEVIN R. HOLME, NIANISH
More informationProgress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery
21 th /June/2018@CUGM Progress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery Kaz Ikeda, Ph.D. Keio University Self Introduction Keio University, Tokyo, Japan (Established
More informationWill it rain tomorrow?
Will it rain tomorrow? Bilal Ahmed - 561539 Department of Computing and Information Systems, The University of Melbourne, Victoria, Australia bahmad@student.unimelb.edu.au Abstract With the availability
More informationTautomerism in chemical information management systems
Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4
More informationDocking. GBCB 5874: Problem Solving in GBCB
Docking Benzamidine Docking to Trypsin Relationship to Drug Design Ligand-based design QSAR Pharmacophore modeling Can be done without 3-D structure of protein Receptor/Structure-based design Molecular
More informationPharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design
J. Chem. Inf. Comput. Sci. 1999, 39, 569-574 569 Pharmacophore Fingerprinting. 1. Application to QSAR and Focused Library Design Malcolm J. McGregor and Steven M. Muskal* Affymax Research Institute, 3410
More informationOctober 6 University Faculty of pharmacy Computer Aided Drug Design Unit
October 6 University Faculty of pharmacy Computer Aided Drug Design Unit CADD@O6U.edu.eg CADD Computer-Aided Drug Design Unit The development of new drugs is no longer a process of trial and error or strokes
More informationIntroducing a Bioinformatics Similarity Search Solution
Introducing a Bioinformatics Similarity Search Solution 1 Page About the APU 3 The APU as a Driver of Similarity Search 3 Similarity Search in Bioinformatics 3 POC: GSI Joins Forces with the Weizmann Institute
More informationQSAR/QSPR modeling. Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships
Quantitative Structure-Activity Relationships Quantitative Structure-Property-Relationships QSAR/QSPR modeling Alexandre Varnek Faculté de Chimie, ULP, Strasbourg, FRANCE QSAR/QSPR models Development Validation
More informationRetrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a
Retrieving hits through in silico screening and expert assessment M.. Drwal a,b and R. Griffith a a: School of Medical Sciences/Pharmacology, USW, Sydney, Australia b: Charité Berlin, Germany Abstract:
More informationText Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University
Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data
More informationLigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer
LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data
More informationGenerating Small Molecule Conformations from Structural Data
Generating Small Molecule Conformations from Structural Data Jason Cole cole@ccdc.cam.ac.uk Cambridge Crystallographic Data Centre 1 The Cambridge Crystallographic Data Centre About us A not-for-profit,
More informationAn Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection
American Journal of Applied Sciences 8 (4): 368-373, 2011 ISSN 1546-9239 2010 Science Publications An Enhancement of Bayesian Inference Network for Ligand-Based Virtual Screening using Features Selection
More informationRelative Drug Likelihood: Going beyond Drug-Likeness
Relative Drug Likelihood: Going beyond Drug-Likeness ACS Fall National Meeting, August 23rd 2012 Matthew Segall, Iskander Yusof Optibrium, StarDrop, Auto-Modeller and Glowing Molecule are trademarks of
More informationInteractive Feature Selection with
Chapter 6 Interactive Feature Selection with TotalBoost g ν We saw in the experimental section that the generalization performance of the corrective and totally corrective boosting algorithms is comparable.
More informationStudying the effect of noise on Laplacian-modified Bayesian Analysis and Tanimoto Similarity
Studying the effect of noise on Laplacian-modified Bayesian nalysis and Tanimoto Similarity David Rogers, Ph.D. SciTegic, Inc. (Division of ccelrys, Inc.) drogers@scitegic.com Description of: nalysis methods
More informationPlan. Day 2: Exercise on MHC molecules.
Plan Day 1: What is Chemoinformatics and Drug Design? Methods and Algorithms used in Chemoinformatics including SVM. Cross validation and sequence encoding Example and exercise with herg potassium channel:
More informationQSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov
QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov CADD Group Chemical Biology Laboratory Frederick National Laboratory for Cancer Research National Cancer Institute, National Institutes
More informationIntroduction to Machine Learning Midterm Exam
10-701 Introduction to Machine Learning Midterm Exam Instructors: Eric Xing, Ziv Bar-Joseph 17 November, 2015 There are 11 questions, for a total of 100 points. This exam is open book, open notes, but
More information