Data Mining in the Chemical Industry. Overview of presentation
Transcription
1 Data Mining in the Chemical Industry
Glenn J. Myatt, Ph.D., Partner, Myatt & Johnson, Inc.

Overview of presentation
- Overview of the chemical industry
  - Example of the pharmaceutical industry
- Chemistry-based data mining
  - Parsing, representation, matching, generating descriptors, data mining approaches
- Examples of how data mining is used within the chemical industry
  - Identifying diverse compounds
  - Analysis of high throughput screening
  - Safety prediction
2 Overview of the chemical industry

Chemical industry overview
- The chemical industry refers to an industry involved in the production of chemicals
- The industry includes:
  - petrochemicals
  - agrochemicals
  - pharmaceuticals
  - polymers
  - paints
  - oleochemicals
3 Pharmaceutical industry
- World's largest manufacturing industry
- Pharmaceutical sales: $357 billion (2)
  - USA(4%); Europe(24%); Japan(3%)
- R&D time and costs
  - 3.6 years to discover a new drug
  - Rising costs of drug discovery: $54 million $23 million 22 - $82 million
- Only 1/3 of known diseases can be treated effectively

Examples of drugs
- Zyflo by Abbott: anti-asthmatic agent
- Cozaar by Merck: treatment for hypertension and congestive heart failure
- Protonix by Wyeth: anti-ulcer agent
- Prozac by Lilly: antidepressant agent
4 Drug discovery process
[Figure: timeline of the process. Stages: Discovery, Development 6.5 years, Approval .9 years. Milestones: IND submission, NDA submission. Discovery steps: target discovery (2 years), lead identification (9 months), lead optimization (8 months), pre-clinical (2 years). Development steps: Phase I, Phase II, Phase III, Phase IV. Progression: target, hits, leads, candidates]

Target discovery
- A target is a protein that plays a role in the disease
- The goal of drug discovery is to identify a new chemical (drug) that modifies the behavior of the protein
5 Lead identification
- Pharmaceutical companies have historical collections of chemicals ( million approx.)
- These chemicals will be screened against the target assay (a test that indicates whether a chemical modifies the behavior of the target)
- A chemical showing a positive response is considered a hit at this stage
- Lead series (sets of similar chemicals) will be uncovered through an analysis of the data (both chemical and biological screening data)

High throughput screening
- The test is performed on small plates
- The results of the testing are automatically read
- The process of screening the chemicals is automated
6 Screening results
- By analyzing the screening data, lead series can be identified
[Figure: table of chemicals with their class and biological screening data]

Lead optimization
[Figure: lead optimization cycle with steps labeled Lead series, Design new compounds, Screen, Synthesize or acquire compounds]
7 Lead optimization
- Synthesize close analogs to determine how substituents affect the biological response data

Lead optimization
- Optimize over multiple rows of data:
  (1) primary screening data
  (2) selected screening data
  (3) ADME screening data
8 Pre-clinical
- Use of model system
  - Absorption
  - Distribution
  - Metabolism
  - Elimination
  - Toxicity (safety)

IND submission and patent application
- Prior to clinical trials, submit IND to regulatory agencies (FDA)
  - Allows company to conduct a clinical trial
- Patent
  - Exclusivity period from approval date
9 Drug development

Issues
[Figure: NCE approvals (with fit line) and R&D expenditures (billions $)]
10 Compound withdrawals
Source: IBM

Why do drugs fail?
11 Issues
- Making sense of large volumes of data
  - Genomics, HTS, lead optimization, ADME(T), clinical trials
- Integration of data across silos
- Accelerating the pace of drug discovery
- Reducing compound attrition

Chemistry-based data mining
12 Data mining in chemistry - Chemoinformatics
- Journals
  - Journal of Chemical Information and Computer Sciences
  - Journal of Computer-Aided Molecular Design
  - Journal of Molecular Graphics and Modelling
- Books
  - An Introduction to Chemoinformatics (Hardcover) by Andrew R. Leach, Valerie J. Gillet
  - Chemoinformatics: A Textbook (Hardcover) by Johann Gasteiger (Editor), Thomas Engel (Editor)
- Meetings
  - International Conference on Chemoinformatics
  - Discovery Knowledge & Informatics 27
  - The Fourth Joint Sheffield Conference on Chemoinformatics
  - Virtual Discovery. Computer-Aided Drug Design and Screening
- Web sites

Issues to consider when data mining chemical data
- Parsing
  - Chemical data
  - Related information
- Matching
  - Exact
  - Substructure
  - Similarity
- Descriptors
- Data mining methods
13 How to convert a chemical into a form to be read by a computer
[Figure: example chemical structure]

Computer readable representations of chemicals
- Describe the connection table in a computer readable form: MOLFILE and SD File
  - Contain information on the atoms (including coordinates), bonds, connections and associated information
- SMILES, WLN, CML, ...
14 MolFile example
[Figure: example structure and its MOLFILE listing (ACD/Labs header, atom and bond blocks, terminated by M END)]

Matching chemical structures
- Aromaticity
- Representation
- Stereochemistry
- Tautomerism
[Figure: each issue illustrated by a pair of structures, A and B]
15 Need to annotate atoms and bonds (and the whole compound) to effectively search
[Figure: example structure with atoms annotated as acyclic, cyclic or aromatic and with hydrogen counts (Hs)]
- All atoms and bonds can be annotated with additional information

Graphs
- Nodes (atoms) and edges (bonds) have properties
  - Non-calculated, e.g. charge, bond type, atom type, ...
  - Calculated: cyclic, number of hydrogens, ...
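The graph view described above, with atoms as nodes and bonds as edges that each carry properties, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class and field names (`Atom`, `Bond`, `Molecule`, `cyclic`, `hydrogens`) are invented for the example and do not come from the presentation or from any particular toolkit.

```python
from dataclasses import dataclass, field


@dataclass
class Atom:
    element: str          # non-calculated: atom type, e.g. "C" or "Cl"
    charge: int = 0       # non-calculated: formal charge
    cyclic: bool = False  # calculated: filled in by ring perception
    hydrogens: int = 0    # calculated: filled in by hydrogen perception


@dataclass
class Bond:
    a: int                # index of the first atom
    b: int                # index of the second atom
    order: int = 1        # non-calculated: 1 single, 2 double, 3 triple
    cyclic: bool = False  # calculated: filled in by ring perception


@dataclass
class Molecule:
    atoms: list = field(default_factory=list)
    bonds: list = field(default_factory=list)

    def add_atom(self, element, charge=0):
        """Append an atom and return its index."""
        self.atoms.append(Atom(element, charge))
        return len(self.atoms) - 1

    def add_bond(self, a, b, order=1):
        self.bonds.append(Bond(a, b, order))

    def neighbors(self, i):
        """Indices of the atoms bonded to atom i."""
        found = []
        for bond in self.bonds:
            if bond.a == i:
                found.append(bond.b)
            elif bond.b == i:
                found.append(bond.a)
        return found


# Heavy-atom skeleton of chloroethane: C-C-Cl (hypothetical example input)
mol = Molecule()
c1 = mol.add_atom("C")
c2 = mol.add_atom("C")
cl = mol.add_atom("Cl")
mol.add_bond(c1, c2)
mol.add_bond(c2, cl)
```

The calculated fields start out empty on purpose: they are the kind of annotation that perception steps would fill in once the connection table has been parsed.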
16 Perception of chemical features
- Rings and chains
- Aromaticity
- Stereocenters
- Olefinic double bonds
- Hydrogens
- Hybridisation levels
- Canonicalization
- Symmetry
- ...

Hydrogen perception example
- Now that the compound is in a graph, we can easily determine the number of hydrogens using the atom type, attached bonds and charge
[Table: atom number, type and hydrogen count for each atom of the example structure]
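Hydrogen perception from atom type, attached bonds and charge can be sketched as a valence calculation. The version below is a simplification and not the presenter's algorithm: the valence table, the function name `implicit_hydrogens`, and the charge rule (carbon loses one valence per unit of charge, other atoms gain or lose valence with the sign of the charge) are assumptions, and aromatic rings are assumed to be written with alternating single and double bonds.

```python
# Typical valences for a few common elements (an assumed, incomplete table).
DEFAULT_VALENCE = {"C": 4, "N": 3, "O": 2, "S": 2,
                   "F": 1, "Cl": 1, "Br": 1, "I": 1}


def implicit_hydrogens(element, bond_orders, charge=0):
    """Estimate the hydrogens on an atom.

    element     -- atom type, e.g. "C"
    bond_orders -- orders of the bonds to non-hydrogen neighbors
    charge      -- formal charge on the atom
    """
    valence = DEFAULT_VALENCE[element]
    if element == "C":
        valence -= abs(charge)   # both C+ and C- bind three partners
    else:
        valence += charge        # e.g. N+ binds four, O- binds one
    # Atoms drawn with expanded valence simply get zero hydrogens here.
    return max(valence - sum(bond_orders), 0)


methyl = implicit_hydrogens("C", [1])         # CH3 attached to one neighbor
aromatic_ch = implicit_hydrogens("C", [1, 2])  # ring CH in Kekule form
ammonium = implicit_hydrogens("N", [], charge=1)
```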
17 Exact matching
[Figure: exact search, showing a query structure and the chemicals selected from a database]

Substructure queries
- Structure drawing packages
- Add additional restrictions on the atoms and bonds:
  - Cyclic/acyclic
  - Number of hydrogens
  - Closed to substitution
  - ...
- MOLFILE
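18 MOLFILE representation of a query
[Figure: query MOLFILE listing (ISIS header, terminated by M END) with bond annotations: aromatic bond; Ch, bond in chain only]

Substructure search example
[Figure: query with bond annotations (aromatic bond; Ch, bond in chain only) and candidate structures A, B, C, D, E]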
19 Substructure search example
[Figure: query structure]

Molecular Descriptors
- Number of hydrogen bond acceptors
- Number of hydrogen bond donors
- Number of rings
- Number of rotatable bonds
- Molecular weight
- Hydrophobicity
- Molar refractivity
- Topological indices
- Kappa shape indices
- Electrotopological state indices
- Polar surface area
- 2D fingerprints
- Atom-pairs and topological torsions
- Pharmacophore keys
20 Rotatable Bonds
- Single
- Acyclic
- Non-terminal
[Figure: structure with a rotatable bond highlighted]

Calculating rotatable bonds
[Figure: five example structures with their rotatable bond counts]
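The three criteria for a rotatable bond (single, acyclic, non-terminal) translate directly into a counting routine over the heavy-atom graph. The sketch below is an illustration and not the presenter's implementation: the function names and the `(atom_a, atom_b, order)` bond tuples are assumptions, a bond is treated as cyclic when its two atoms stay connected after the bond is removed, and refinements used by some tools (such as excluding amide bonds) are deliberately left out.

```python
def _connected_without(adjacency, start, goal, skipped):
    """True if goal is reachable from start without using the bond `skipped`."""
    seen, stack = {start}, [start]
    while stack:
        node = stack.pop()
        for nxt in adjacency[node]:
            if {node, nxt} == skipped:
                continue
            if nxt == goal:
                return True
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return False


def count_rotatable_bonds(n_atoms, bonds):
    """Count single, acyclic, non-terminal bonds between heavy atoms."""
    adjacency = {i: set() for i in range(n_atoms)}
    for a, b, _ in bonds:
        adjacency[a].add(b)
        adjacency[b].add(a)
    count = 0
    for a, b, order in bonds:
        single = order == 1
        terminal = len(adjacency[a]) == 1 or len(adjacency[b]) == 1
        cyclic = _connected_without(adjacency, a, b, {a, b})
        if single and not terminal and not cyclic:
            count += 1
    return count


# n-butane, C-C-C-C: only the central bond is non-terminal
butane = [(0, 1, 1), (1, 2, 1), (2, 3, 1)]

# ethylbenzene: ring atoms 0-5 (Kekule form), ethyl group on atoms 6 and 7
ethylbenzene = [(0, 1, 2), (1, 2, 1), (2, 3, 2), (3, 4, 1), (4, 5, 2),
                (5, 0, 1), (0, 6, 1), (6, 7, 1)]
```

With these inputs, `count_rotatable_bonds(4, butane)` and `count_rotatable_bonds(8, ethylbenzene)` both return 1: the ring bonds are excluded as cyclic and the bonds to the end carbons as terminal.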
21 2D Fingerprints
- Makes use of a dictionary of fragments (pre-defined substructures)
[Figure: example fragment definitions]

Describing chemicals using fragment descriptors
[Figure: fragment dictionary and a table marking which fragments occur in each chemical]
22 Data Mining Methods
- Clustering
- Decision trees
- Principal component analysis
- Support vector machines
- kNN
- Decision forests
- Bayesian networks
- Genetic algorithms

Identifying diverse chemicals
- Selecting diverse chemicals from commercially available chemical collections is often used to supplement in-house screening sets
23 Approach to identifying diverse chemicals
- Select and calculate chemical descriptors
- Determine the similarity between chemicals
- Cluster chemicals based on these descriptors
- Select representative chemicals from each group generated

Describing chemicals using fragment descriptors
[Figure: fragment dictionary and a table marking which fragments occur in each chemical]
24 Similarity
- Tanimoto is one example
  S_AB = c / (a + b - c)
  - a is the number of bits set to one in A
  - b is the number of bits set to one in B
  - c is the number of bits set to one in both A and B

Calculating similarity
[Figure: fragment dictionary and the fingerprints of the two chemicals being compared]
S = c / (a + b - c)
S = .4
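The Tanimoto formula is short enough to check by hand. In the sketch below each fingerprint is stored as the set of fragment ids whose bit is set to one, so a, b and c become set sizes. The fragment ids are made up for illustration and are not the ones on the slide, and returning 0.0 for two empty fingerprints is an assumed convention.

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto similarity S = c / (a + b - c) for two sets of on-bits."""
    a = len(fp_a)          # bits set to one in A
    b = len(fp_b)          # bits set to one in B
    c = len(fp_a & fp_b)   # bits set to one in both A and B
    if a + b - c == 0:
        return 0.0         # both fingerprints empty (assumed convention)
    return c / (a + b - c)


# Hypothetical fingerprints: ids of the dictionary fragments each chemical contains
chem_a = {1, 2, 3, 5}
chem_b = {2, 3, 7}

similarity = tanimoto(chem_a, chem_b)   # a = 4, b = 3, c = 2, so S = 2 / 5
```

Identical fingerprints give 1.0 and fingerprints with no shared fragments give 0.0, which is what makes the coefficient convenient as a similarity score for clustering.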
25 Using 7 chemicals to illustrate
[Figure: seven example structures labeled A to G]
- This type of analysis is usually performed on tens of thousands of chemicals

Fingerprint table
[Table: chemicals (rows) against fragment dictionary ids (columns)]
26 Hierarchical agglomerative clustering based on the fingerprints
[Figure: dendrogram with leaves in the order C, D, A, B, E, F, G]
- Uses the Euclidean distance and clusters using the average linkage joining rule

Generating clusters
- Cluster 1: A, B
- Cluster 2: E, F, G
- Cluster 3: C, D
- Adjust the distance cut-off to change the number of clusters
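Agglomerative clustering with Euclidean distance, average linkage and a distance cut-off can be sketched in plain Python. The code is a small, unoptimized illustration rather than the software used in the presentation; the function names and the four hypothetical fingerprints are assumptions, and the cut-off values in the example are not on the same scale as the one quoted on the slides.

```python
from itertools import combinations
from math import dist


def average_linkage(names_a, names_b, fingerprints):
    """Mean Euclidean distance over all pairs drawn from the two clusters."""
    total = sum(dist(fingerprints[p], fingerprints[q])
                for p in names_a for q in names_b)
    return total / (len(names_a) * len(names_b))


def agglomerate(fingerprints, cutoff):
    """Merge the closest clusters until the next merge would exceed cutoff."""
    clusters = [[name] for name in fingerprints]
    while len(clusters) > 1:
        # Find the closest pair of clusters under average linkage.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: average_linkage(
                clusters[ij[0]], clusters[ij[1]], fingerprints),
        )
        if average_linkage(clusters[i], clusters[j], fingerprints) > cutoff:
            break
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return sorted(sorted(c) for c in clusters)


# Hypothetical bit-vector fingerprints (one position per dictionary fragment)
fps = {
    "A": (1, 1, 0, 0),
    "B": (1, 1, 1, 0),
    "C": (0, 0, 1, 1),
    "D": (0, 0, 0, 1),
}

two_groups = agglomerate(fps, cutoff=1.5)
one_group = agglomerate(fps, cutoff=2.0)
```

Raising the cut-off from 1.5 to 2.0 lets the final merge happen, so the four chemicals go from two clusters to one. This is the "adjust the cut-off to change the number of clusters" behaviour in miniature.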
27 Cluster results at .7 cut-off
- Cluster 1: B, A
- Cluster 2: G, F, E
- Cluster 3: D, C

Selecting a representative from each cluster
- B, G, D
28 Analyzing HTS data
- Use of decision trees
  - Supervised learning approach
  - Partitions the set based on the descriptors
  - Uses the biological data to determine the groups
  - Used to quickly identify biologically interesting groups of chemicals
29 Generating decision trees
1. Find the most significant feature to split (Feature-A)
2. Partition the set according to those compounds containing the feature and those without
3. Partition each child node until a threshold is reached
[Figure: example tree. The whole set is split on Feature-A (highest score) into chemicals with and without Feature-A; the child nodes are split on Feature-B and Feature-C (each the highest-scoring feature in its node) into compounds with and without the feature]

Using 7 chemicals to illustrate
[Figure: seven example structures labeled A to G]
- This type of analysis is usually performed on hundreds of thousands of chemicals
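The three steps for generating a decision tree can be sketched as a small recursive partitioning routine. The slides do not say how a feature's significance is scored, so this sketch assumes one common choice: the reduction in the sum of squared deviations of the response when the node is split. The function names, the `min_size` stopping threshold and the six example compounds are all hypothetical.

```python
def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def best_feature(rows, features):
    """Step 1: pick the feature whose split most reduces the SSE, if any."""
    parent = sse([resp for _, _, resp in rows])
    best, best_gain = None, 0.0
    for feature in features:
        with_f = [resp for _, frags, resp in rows if feature in frags]
        without_f = [resp for _, frags, resp in rows if feature not in frags]
        if not with_f or not without_f:
            continue  # the feature does not divide this node
        gain = parent - sse(with_f) - sse(without_f)
        if gain > best_gain:
            best, best_gain = feature, gain
    return best


def grow(rows, features, min_size=3):
    """Steps 2 and 3: partition on the best feature, then recurse."""
    feature = best_feature(rows, features) if len(rows) >= min_size else None
    if feature is None:
        return {"leaf": sorted(name for name, _, _ in rows)}
    return {
        "split": feature,
        "with": grow([r for r in rows if feature in r[1]], features, min_size),
        "without": grow([r for r in rows if feature not in r[1]],
                        features, min_size),
    }


# Hypothetical data: (compound id, fragments present, measured response)
rows = [
    ("m1", {"nitro"}, 8.9),
    ("m2", {"nitro", "alcohol"}, 9.1),
    ("m3", {"alcohol"}, 5.0),
    ("m4", {"pyridine"}, 5.2),
    ("m5", {"pyridine", "alcohol"}, 3.4),
    ("m6", set(), 4.6),
]
features = ["nitro", "alcohol", "pyridine"]

tree = grow(rows, features)
```

For this made-up set the first split is on `nitro`, which isolates the two most potent compounds in a single leaf. Finding that kind of biologically interesting group quickly is the purpose the slides give for using decision trees on screening data.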
30 Potency data
Structure ID   Data
A              4.44
B              3.5
C              5.2
D              5.4
E              5.23
F              9.33
G              8.7

Fingerprints
[Table: chemicals A to G (rows) against the fragments acridine; sec-amine(h); nitro; pyridine; alcohol; thiane(h), 4-oxo-; imidazole (columns)]
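31 Decision Tree

RP Tree
[Figure: recursive partitioning tree; visible labels include G (8.7) and F]
32 RP Tree
[Figure: recursive partitioning tree; visible labels include A (4.44) and B (3.5)]

Predicting chemical safety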
33 Safety prediction example
- Prepare data
  - Integrate, normalize data, generate descriptors
- Prune descriptors
  - Remove constants, descriptors lacking in information
- Understand chemical space
  - Subset data
  - Understand mechanisms of action
- Build and optimize model(s)
- Assess models
  - Evaluate quality
- Combine models
- Applicability domain
  - Apply to untested chemicals, where the chemical is within the domain of the model

Building model (dev tox)
- Structural descriptors
- Response: positive/negative/equivocal/unknown
- Use a classification tree model
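The descriptor pruning step (removing constants and descriptors lacking in information) can be illustrated with a short filter. The slides do not define "lacking in information", so this sketch assumes it means that one value covers more than a chosen fraction of the chemicals. The function name, the `max_fraction` parameter and the example table are hypothetical, and every descriptor column is assumed to be non-empty.

```python
from collections import Counter


def prune_descriptors(table, max_fraction=0.95):
    """Drop descriptors that are constant or nearly constant.

    table        -- dict mapping descriptor name to a list of values,
                    one value per chemical
    max_fraction -- a descriptor is dropped when its most common value
                    covers more than this fraction of the chemicals
    """
    kept = {}
    for name, values in table.items():
        most_common_count = Counter(values).most_common(1)[0][1]
        if most_common_count / len(values) > max_fraction:
            continue  # constant, or too uninformative under the threshold
        kept[name] = values
    return kept


# Hypothetical fragment descriptors for four chemicals
table = {
    "has_carbon": [1, 1, 1, 1],   # constant: always dropped
    "nitro": [0, 1, 0, 1],
    "rare_group": [0, 0, 0, 1],
}

kept_default = list(prune_descriptors(table))
kept_strict = list(prune_descriptors(table, max_fraction=0.7))
```

With the default threshold only the constant column is removed. Tightening `max_fraction` to 0.7 also removes `rare_group`, which is set for just one of the four chemicals.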
34 Table used to build prediction model

Classification tree
35 Classification tree assessment
- The quality of the predictive model is ultimately dependent on the quality of the data. In this example, the model is not very good at predicting safe chemicals since the original data lacked negative data points.

Summary
- The chemical industry generates information about chemicals and their relationship to drug potency, safety, agrochemicals, ...
- Data mining is used extensively to accelerate the development of new products
- Representing and describing chemicals is a large part of the challenge of data mining chemical information
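The point about negative data can be made concrete by computing sensitivity and specificity separately instead of relying on overall accuracy. The sketch below assumes a binary positive/negative response (the equivocal and unknown classes are ignored), and the labels are invented for illustration; they are not results from the presentation.

```python
def assess(actual, predicted, positive="positive"):
    """Return sensitivity, specificity and accuracy for binary labels."""
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if a == positive and p == positive:
            tp += 1
        elif a == positive:
            fn += 1
        elif p == positive:
            fp += 1
        else:
            tn += 1
    return {
        # None means there were no examples of that class to judge from
        "sensitivity": tp / (tp + fn) if tp + fn else None,
        "specificity": tn / (tn + fp) if tn + fp else None,
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
    }


# Hypothetical test set dominated by positive (unsafe) chemicals
actual = ["positive"] * 8 + ["negative"] * 2
predicted = ["positive"] * 7 + ["negative"] + ["positive"] * 2

report = assess(actual, predicted)
```

Here the accuracy of 0.7 looks tolerable, yet the specificity is 0.0 because neither of the two safe chemicals is recognized. A model trained on data with few negatives can fail in this way without the headline accuracy showing it.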
Chemoinformatics and information management. Peter Willett, University of Sheffield, UK
Chemoinformatics and information management Peter Willett, University of Sheffield, UK verview What is chemoinformatics and why is it necessary Managing structural information Typical facilities in chemoinformatics
More informationVirtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME
Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells
More informationIntroduction to Chemoinformatics and Drug Discovery
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.
More informationMachine Learning Concepts in Chemoinformatics
Machine Learning Concepts in Chemoinformatics Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-Universität Bonn BigChem Winter School 2017 25. October Data Mining in Chemoinformatics
More informationJCICS Major Research Areas
JCICS Major Research Areas Chemical Information Text Searching Structure and Substructure Searching Databases Patents George W.A. Milne C571 Lecture Fall 2002 1 JCICS Major Research Areas Chemical Computation
More informationAn Integrated Approach to in-silico
An Integrated Approach to in-silico Screening Joseph L. Durant Jr., Douglas. R. Henry, Maurizio Bronzetti, and David. A. Evans MDL Information Systems, Inc. 14600 Catalina St., San Leandro, CA 94577 Goals
More informationSolved and Unsolved Problems in Chemoinformatics
Solved and Unsolved Problems in Chemoinformatics Johann Gasteiger Computer-Chemie-Centrum University of Erlangen-Nürnberg D-91052 Erlangen, Germany Johann.Gasteiger@fau.de Overview objectives of lecture
More informationComputational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007
Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based
More informationEarly Stages of Drug Discovery in the Pharmaceutical Industry
Early Stages of Drug Discovery in the Pharmaceutical Industry Daniel Seeliger / Jan Kriegl, Discovery Research, Boehringer Ingelheim September 29, 2016 Historical Drug Discovery From Accidential Discovery
More informationNavigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland
Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures
More informationComputational Methods and Drug-Likeness. Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004
Computational Methods and Drug-Likeness Benjamin Georgi und Philip Groth Pharmakokinetik WS 2003/2004 The Problem Drug development in pharmaceutical industry: >8-12 years time ~$800m costs >90% failure
More informationIntroduction. OntoChem
Introduction ntochem Providing drug discovery knowledge & small molecules... Supporting the task of medicinal chemistry Allows selecting best possible small molecule starting point From target to leads
More informationSimilarity methods for ligandbased virtual screening
Similarity methods for ligandbased virtual screening Peter Willett, University of Sheffield Computers in Scientific Discovery 5, 22 nd July 2010 Overview Molecular similarity and its use in virtual screening
More informationIntroduction to Chemoinformatics
Introduction to Chemoinformatics www.dq.fct.unl.pt/cadeiras/qc Prof. João Aires-de-Sousa Email: jas@fct.unl.pt Recommended reading Chemoinformatics - A Textbook, Johann Gasteiger and Thomas Engel, Wiley-VCH
More informationData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug Discovery Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry,
More informationAMRI COMPOUND LIBRARY CONSORTIUM: A NOVEL WAY TO FILL YOUR DRUG PIPELINE
AMRI COMPOUD LIBRARY COSORTIUM: A OVEL WAY TO FILL YOUR DRUG PIPELIE Muralikrishna Valluri, PhD & Douglas B. Kitchen, PhD Summary The creation of high-quality, innovative small molecule leads is a continual
More informationEMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS
EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS PETER GUND Pharmacopeia Inc., CN 5350 Princeton, NJ 08543, USA pgund@pharmacop.com Empirical and theoretical approaches to drug discovery have often
More informationIn silico pharmacology for drug discovery
In silico pharmacology for drug discovery In silico drug design In silico methods can contribute to drug targets identification through application of bionformatics tools. Currently, the application of
More informationBiologically Relevant Molecular Comparisons. Mark Mackey
Biologically Relevant Molecular Comparisons Mark Mackey Agenda > Cresset Technology > Cresset Products > FieldStere > FieldScreen > FieldAlign > FieldTemplater > Cresset and Knime About Cresset > Specialist
More informationData Analysis in the Life Sciences - The Fog of Data -
ALTAA Chair for Bioinformatics & Information Mining Data Analysis in the Life Sciences - The Fog of Data - Michael R. Berthold ALTAA-Chair for Bioinformatics & Information Mining Konstanz University, Germany
More informationFRAUNHOFER IME SCREENINGPORT
FRAUNHOFER IME SCREENINGPORT Design of screening projects General remarks Introduction Screening is done to identify new chemical substances against molecular mechanisms of a disease It is a question of
More informationUsing AutoDock for Virtual Screening
Using AutoDock for Virtual Screening CUHK Croucher ASI Workshop 2011 Stefano Forli, PhD Prof. Arthur J. Olson, Ph.D Molecular Graphics Lab Screening and Virtual Screening The ultimate tool for identifying
More informationThe Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc
The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company Sally Rose BioFocus plc Overview History of BioFocus and acquisition of CDD Biological
More informationFROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES
FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES DIFFERENT LEVELS OF KNOWLEDGE REPRESENTATION IN CHEMISTRY Michael Braden, PhD ACS / San Diego/ 2016 Overview ChemAxon Who are we? Examples/use cases: Create
More informationAnalysis of a Large Structure/Biological Activity. Data Set Using Recursive Partitioning and. Simulated Annealing
Analysis of a Large Structure/Biological Activity Data Set Using Recursive Partitioning and Simulated Annealing Student: Ke Zhang MBMA Committee: Dr. Charles E. Smith (Chair) Dr. Jacqueline M. Hughes-Oliver
More informationChemoinformatics and Drug Discovery
Molecules 2002, 7, 566-600 molecules ISSN 1420-3049 http://www.mdpi.org Review: Chemoinformatics and Drug Discovery Jun Xu* and Arnold Hagler Discovery Partners International, Inc., 9640 Towne Center Drive,
More informationReceptor Based Drug Design (1)
Induced Fit Model For more than 100 years, the behaviour of enzymes had been explained by the "lock-and-key" mechanism developed by pioneering German chemist Emil Fischer. Fischer thought that the chemicals
More informationThe Case for Use Cases
The Case for Use Cases The integration of internal and external chemical information is a vital and complex activity for the pharmaceutical industry. David Walsh, Grail Entropix Ltd Costs of Integrating
More informationNext Generation Computational Chemistry Tools to Predict Toxicity of CWAs
Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs William (Bill) Welsh welshwj@umdnj.edu Prospective Funding by DTRA/JSTO-CBD CBIS Conference 1 A State-wide, Regional and National
More informationStructural biology and drug design: An overview
Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved
More informationStructure-Activity Modeling - QSAR. Uwe Koch
Structure-Activity Modeling - QSAR Uwe Koch QSAR Assumption: QSAR attempts to quantify the relationship between activity and molecular strcucture by correlating descriptors with properties Biological activity
More informationChemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller
Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics
More informationDrug Informatics for Chemical Genomics...
Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison
More informationBuilding innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery
Building innovative drug discovery alliances Just in KIME: Successful Process Driven Drug Discovery Berlin KIME Spring Summit, Feb 2016 Research Informatics @ Evotec Evotec s worldwide operations 2 Pharmaceuticals
More informationOverview. Descriptors. Definition. Descriptors. Overview 2D-QSAR. Number Vector Function. Physicochemical property (log P) Atom
verview D-QSAR Definition Examples Features counts Topological indices D fingerprints and fragment counts R-group descriptors ow good are D descriptors in practice? Summary Peter Gedeck ovartis Institutes
More informationSimilarity Search. Uwe Koch
Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance
More informationQSAR in Green Chemistry
QSAR in Green Chemistry Activity Relationship QSAR is the acronym for Quantitative Structure-Activity Relationship Chemistry is based on the premise that similar chemicals will behave similarly The behavior/activity
More informationIntroduction to Chemoinformatics
Introduction to Chemoinformatics Dr. Igor V. Tetko Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) Institute of Bioinformatics & Systems Biology (HMGU) Kyiv, 10 August
More informationCapturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations
Capturing Chemistry What you see is what you get In the world of mechanism and chemical transformations Dr. Stephan Schürer ead of Intl. Sci. Content Libraria, Inc. sschurer@libraria.com Distribution of
More informationBioisosteres in Medicinal Chemistry
Edited by Nathan Brown Bioisosteres in Medicinal Chemistry VCH Verlag GmbH & Co. KGaA Contents List of Contributors Preface XV A Personal Foreword XI XVII Part One Principles 1 Bioisosterism in Medicinal
More informationRetrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a
Retrieving hits through in silico screening and expert assessment M.. Drwal a,b and R. Griffith a a: School of Medical Sciences/Pharmacology, USW, Sydney, Australia b: Charité Berlin, Germany Abstract:
More informationIntroducing a Bioinformatics Similarity Search Solution
Introducing a Bioinformatics Similarity Search Solution 1 Page About the APU 3 The APU as a Driver of Similarity Search 3 Similarity Search in Bioinformatics 3 POC: GSI Joins Forces with the Weizmann Institute
More informationIn Silico Investigation of Off-Target Effects
PHARMA & LIFE SCIENCES WHITEPAPER In Silico Investigation of Off-Target Effects STREAMLINING IN SILICO PROFILING In silico techniques require exhaustive data and sophisticated, well-structured informatics
More informationRapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value
Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction
More informationContents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics
Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure
More informationDispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses
Dispensing Processes Profoundly Impact Biological, Computational and Statistical Analyses Sean Ekins 1, Joe Olechno 2 Antony J. Williams 3 1 Collaborations in Chemistry, Fuquay Varina, NC. 2 Labcyte Inc,
More informationComprehensive Chemoinformatics since Web-based, client/server, and toolkit approaches. Native Oracle (cartridge) and Microsoft technology.
CambridgeSoft Solutions CambridgeSoft Research Informatics Louis Culot Executive Director, Research Informatics Division Informatics Overview ChemDraw since 1986. Comprehensive Chemoinformatics since 1998.
More informationDivCalc: A Utility for Diversity Analysis and Compound Sampling
Molecules 2002, 7, 657-661 molecules ISSN 1420-3049 http://www.mdpi.org DivCalc: A Utility for Diversity Analysis and Compound Sampling Rajeev Gangal* SciNova Informatics, 161 Madhumanjiri Apartments,
More informationUltra High Throughput Screening using THINK on the Internet
Ultra High Throughput Screening using THINK on the Internet Keith Davies Central Chemistry Laboratory, Oxford University Cathy Davies Treweren Consultants, UK Blue Sky Objectives Reduce Development Failures
More informationCheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS
Cheminformatics Role in Pharmaceutical Industry Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS Agenda The big picture for pharmaceutical industry Current technological/scientific issues Types
More informationCOMPARISON OF SIMILARITY METHOD TO IMPROVE RETRIEVAL PERFORMANCE FOR CHEMICAL DATA
http://www.ftsm.ukm.my/apjitm Asia-Pacific Journal of Information Technology and Multimedia Jurnal Teknologi Maklumat dan Multimedia Asia-Pasifik Vol. 7 No. 1, June 2018: 91-98 e-issn: 2289-2192 COMPARISON
More informationPooling Experiments for High Throughput Screening in Drug Discovery
Pooling Experiments for High Throughput Screening in Drug Discovery Jacqueline M. Hughes-Oliver hughesol@stat.ncsu.edu Department of Statistics North Carolina State University Spring Research Conference,
More informationChemical library design
Chemical library design Pavel Polishchuk Institute of Molecular and Translational Medicine Palacky University pavlo.polishchuk@upol.cz Drug development workflow Vistoli G., et al., Drug Discovery Today,
More informationAnalysis of Activity Landscapes, Activity Cliffs, and Selectivity Cliffs. Jürgen Bajorath Life Science Informatics University of Bonn
Analysis of Activity Landscapes, Activity Cliffs, and Selectivity Cliffs Jürgen Bajorath Life Science Informatics University of Bonn Concept of Activity Landscapes Activity landscapes : biological activity
More informationhas its own advantages and drawbacks, depending on the questions facing the drug discovery.
2013 First International Conference on Artificial Intelligence, Modelling & Simulation Comparison of Similarity Coefficients for Chemical Database Retrieval Mukhsin Syuib School of Information Technology
More informationQSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov
QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov CADD Group Chemical Biology Laboratory Frederick National Laboratory for Cancer Research National Cancer Institute, National Institutes
More informationInformation Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann
Information Extraction from Chemical Images Discovery Knowledge & Informatics April 24 th, 2006 Dr. Available Chemical Information Textbooks Reports Patents Databases Scientific journals and publications
More informationSearching Substances in Reaxys
Searching Substances in Reaxys Learning Objectives Understand that substances in Reaxys have different sources (e.g., Reaxys, PubChem) and can be found in Document, Reaction and Substance Records Recognize
More informationIgnasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015
Ignasi Belda, PhD CEO HPC Advisory Council Spain Conference 2015 Business lines Molecular Modeling Services We carry out computational chemistry projects using our selfdeveloped and third party technologies
More informationInteractive Feature Selection with
Chapter 6 Interactive Feature Selection with TotalBoost g ν We saw in the experimental section that the generalization performance of the corrective and totally corrective boosting algorithms is comparable.
More informationDevelopment of a Structure Generator to Explore Target Areas on Chemical Space
Development of a Structure Generator to Explore Target Areas on Chemical Space Kimito Funatsu Department of Chemical System Engineering, This materials will be published on Molecular Informatics Drug Development
More informationUsing Self-Organizing maps to accelerate similarity search
YOU LOGO Using Self-Organizing maps to accelerate similarity search Fanny Bonachera, Gilles Marcou, Natalia Kireeva, Alexandre Varnek, Dragos Horvath Laboratoire d Infochimie, UM 7177. 1, rue Blaise Pascal,
More informationLarge scale classification of chemical reactions from patent data
Large scale classification of chemical reactions from patent data Gregory Landrum NIBR Informatics, Basel Novartis Institutes for BioMedical Research 10th International Conference on Chemical Structures/
More informationDesign and Synthesis of the Comprehensive Fragment Library
YOUR INNOVATIVE CHEMISTRY PARTNER IN DRUG DISCOVERY Design and Synthesis of the Comprehensive Fragment Library A 3D Enabled Library for Medicinal Chemistry Discovery Warren S Wade 1, Kuei-Lin Chang 1,
More informationAnalyzing Small Molecule Data in R
Analyzing Small Molecule Data in R Tyler Backman and Thomas Girke December 12, 2011 Analyzing Small Molecule Data in R Slide 1/49 Introduction CMP Structure Formats Similarity Searching Background Fragment
More informationQSAR of Microtubule Stabilizing Dictyostatins
QSAR of Microtubule Stabilizing Dictyostatins Kia Montgomery BBSI 2007- University of Pittsburgh Department of Chemistry, Grambling State University Billy Day, Ph.D. Department of Pharmaceutical Sciences,
More informationA Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors
A Tiered Screen Protocol for the Discovery of Structurally Diverse HIV Integrase Inhibitors Rajarshi Guha, Debojyoti Dutta, Ting Chen and David J. Wild School of Informatics Indiana University and Dept.
More informationDISCOVERING new drugs is an expensive and challenging
1036 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 17, NO. 8, AUGUST 2005 Frequent Substructure-Based Approaches for Classifying Chemical Compounds Mukund Deshpande, Michihiro Kuramochi, Nikil
More informationMining Molecular Fragments: Finding Relevant Substructures of Molecules
Mining Molecular Fragments: Finding Relevant Substructures of Molecules Christian Borgelt, Michael R. Berthold Proc. IEEE International Conference on Data Mining, 2002. ICDM 2002. Lecturers: Carlo Cagli
More informationDr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre
Dr. Sander B. Nabuurs Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre The road to new drugs. How to find new hits? High Throughput
More informationPerforming a Pharmacophore Search using CSD-CrossMiner
Table of Contents Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Searching with a Pharmacophore... 4 Performing a Pharmacophore Search using CSD-CrossMiner Version 2.0
More informationBioinformatics Workshop - NM-AIST
Bioinformatics Workshop - NM-AIST Day 3 Introduction to Drug/Small Molecule Discovery Thomas Girke July 25, 2012 Bioinformatics Workshop - NM-AIST Slide 1/44 Introduction CMP Structure Formats Similarity
More informationFast similarity searching making the virtual real. Stephen Pickett, GSK
Fast similarity searching making the virtual real Stephen Pickett, GSK Introduction Introduction to similarity searching Use cases Why is speed so crucial? Why MadFast? Some performance stats Implementation
More informationControl Strategies for Small Molecule Components of Antibody-Drug Conjugates
Control Strategies for Small Molecule Components of Antibody-Drug Conjugates Nathan C. Ihle, PhD Executive Director, Process Chemistry Seattle Genetics, Inc WCBP 2012 Antibody-Drug Conjugates: Balancing
More informationStructure-Based Drug Discovery An Overview
Structure-Based Drug Discovery An Overview Edited by Roderick E. Hubbard University of York, Heslington, York, UK and Vernalis (R&D) Ltd, Abington, Cambridge, UK RSC Publishing Contents Chapter 1 3D Structure
Representation of molecular structures Courtesy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal A hierarchy of structure representations Name (S)-Tryptophan 2D Structure 3D Structure Molecular
APPLICATION NOTE QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression GAINING EFFICIENCY IN QUANTITATIVE STRUCTURE ACTIVITY RELATIONSHIPS ErbB1 kinase is the cell-surface receptor
Chemical Databases: Encoding, Storage and Search of Chemical Structures Dr. Timur I. Madzhidov Kazan Federal University, Department of Organic Chemistry * Ray, L.C. and R.A. Kirsch, Finding Chemical Records
Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction Stefan Kuhn 1, Björn Egert 2, Steffen Neumann 2, Christoph Steinbeck 1 European Bioinformatics Institute
The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application
This is a repository copy of Chemoinformatics techniques for data mining in files of two-dimensional and three-dimensional chemical molecules. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/8425/
Use of data mining and chemoinformatics in the identification and optimization of high-throughput screening hits for NTDs James Mills; Karl Gibson, Gavin Whitlock, Paul Glossop, Jean-Robert Ioset, Leela
Chemical Reaction Databases Computer-Aided Synthesis Design Reaction Prediction Synthetic Feasibility Dr. Wendy A. Warr http://www.warr.com Warr, W. A. A Short Review of Chemical Reaction Database Systems,
R&D SOLUTIONS FOR PHARMA & LIFE SCIENCES Reaxys Medicinal Chemistry Fact Sheet Essential data for lead identification and optimization Reaxys Medicinal Chemistry empowers early discovery in drug development
NUE FEATURE TRANSFORMING CHALLENGES INTO MEDICINE Nuevolution Feature no. 1 October 2015 Technical Information COMBINATORIAL CHEMISTRY IN A HISTORICAL PERSPECTIVE A PROMISING
Chemoinformatics: the first half century Peter Willett Presented at the Third Strasbourg Summer School on Chemoinformatics, 25th June 2012 Overview Introduction to chemoinformatics What it is How it has
Molecular Graphics Expt. 1 1 Molecular Graphics The study of organic chemistry has for more than a century and a half focussed on the relationship between the structure of an organic molecule (its three-dimensional
Molecular Informatics: Confronting Complexity, May 13th - 16th 2002, Bozen, Italy PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS CARLETON R. SAGE, KEVIN R. HOLME, NIANISH
Canonical Line Notations InChI vs SMILES Krisztina Boda Overview Compound naming InChI SMILES Molecular equivalency Isomorphism Kekule Tautomers Finding duplicates What's Your Name? 1. Unique numbers CAS
Clustering Ambiguity: An Overview John D. MacCuish Norah E. MacCuish 3rd Joint Sheffield Conference on Chemoinformatics April 23, 2004 Outline The Problem: Clustering Ambiguity and Chemoinformatics Preliminaries:
Statistical concepts in QSAR. Computational chemistry represents molecular structures as numerical models and simulates their behavior with the equations of quantum and classical physics. Available programs
Exploring the chemical space of screening results Edmund Champness, Matthew Segall, Chris Leeding, James Chisholm, Iskander Yusof, Nick Foster, Hector Martinez ACS Spring 2013, 7th April 2013 Optibrium,
Molecular Complexity Effects and Fingerprint-Based Similarity Search Strategies Dissertation zur Erlangung des Doktorgrades (Dr. rer. nat.) der Mathematisch-Naturwissenschaftlichen Fakultät der Rheinischen
Advanced Medicinal Chemistry Filippo Minutolo CFU 3 (21 hours) SLIDES B Drug likeness - ADME two contradictory physico-chemical parameters to balance: 1) aqueous solubility 2) lipid membrane permeability
Patent Searching using Bayesian Statistics Willem van Hoorn, Exscientia Ltd Biovia European Forum, London, June 2017 Contents Who are we? Searching molecules in patents What can Pipeline Pilot do for you?
Translating Methods from Pharma to Flavours & Fragrances CINF 27: ACS National Meeting, New Orleans, LA - 18th March 2018 Peter Hunt, Edmund Champness, Nicholas Foster, Tamsin Mansley & Matthew Segall
INTERNATIONAL JOURNAL OF RESEARCH IN PHARMACY AND CHEMISTRY Available online at www.ijrpc.com Review Article RECENT TRENDS IN PHARMACEUTICAL CHEMISTRY FOR DRUG DISCOVERY Sathyaraj A Department of Chemistry,
Plan. Day 2: Exercise on MHC molecules.
Plan Day 1: What is Chemoinformatics and Drug Design? Methods and Algorithms used in Chemoinformatics including SVM. Cross validation and sequence encoding Example and exercise with hERG potassium channel:
Developing CAS Products for Substructure Searching by Chemists Linda Toler Developing CAS Products for Substructure Searching Evolution of the CAS Registry Development of substructure searching for CAS
21 June 2018 @ CUGM Progress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery Kaz Ikeda, Ph.D. Keio University Self Introduction Keio University, Tokyo, Japan (Established