The NCI/CADD Group's InChI Usage and Analysis of Tautomerism for InChI V2

Size: px
Start display at page:

Download "The NCI/CADD Group's InChI Usage and Analysis of Tautomerism for InChI V2"

Transcription

1 The NCI/CADD Group's InChI Usage and Analysis of Tautomerism for InChI V2 Marc C. Nicklaus Computer-Aided Drug Design (CADD) Group Chemical Biology Laboratory Center for Cancer Research National Cancer Institute National Institutes of Health Frederick, Maryland 21702

2 Chemical Identifier Resolver (CIR) Resolves structure identifiers or representations, i.e. converts one structure identifier/representation into another Usable by humans; but optimized for communication between computers (returns MIME/text pages wherever possible) Launched in June 2009 Planned: major update of services and underlying database, the CADD Group s Chemical Structure DataBase (CSDB) 2

3 Chemical Identifier Resolver (CIR) Flowchart identifier representation http request detection of the identifier type identifier is a hashed structure representation (e.g. InChIKey), or chemical name etc. identifier is a full structure representation (e.g. SMILES, InChI) structure calculation of the requested structure representation e.g. InChI, GIF image http response e.g. CAS number, chemical name database lookup

4 Chemical Identifier Resolver (CIR) Examples: Naproxen sodium MIME type: text/plain Buckyball XMWRBQBLMFGWIX-UHFFFAYSA-N/image?height=300&width=300&bgcolor=black&bondcolor=white

5 Chemical Structure Database (CSDB) in CIR Currently available in CIR: 140 chemical structure databases 120 million structure records 84.6 million unique structures by FICuS 110 million unique Standard InChIKeys for lookup (includes ~25 million derived scaffold and ring structures) ChemNavigator/Sigma iresearch Library compilation of commercially available screening compounds from ~300 international chemistry suppliers PubChem database including pen NCI database, EPA DSSTox databases, NIAID HIV database, NIST Webbook, NLM ChemIDplus, ChemSpider, Commercial Sources / others Asinex, Comgenex, emolecules, ChemNav. iresearch Lib. ~56% PubChem ~38% ~6% others

6 NCI/CADD Structure Identifiers NCI/CADD Structure Identifiers Based on CACTVS hashcodes; 16-digit hex numbers (64 bit unsigned) Fragments Isotopes Charges Tautomers Stereochemistry sensitive sensitive sensitive sensitive sensitive - Na + D D D D D D - + NH 3 H = H C H NH 2 H C H H NH 2 F I C T S = u u u u u H NH 2 H C H NH 2 un-sensitive un-sensitive un-sensitive un-sensitive un-sensitive FICuS identifier: comes closest to how a chemist perceives a compound Sitzmann et al. SAR QSAR Environ. Res. 2008, 19, 1 9 Tautomerism in large small-molecule databases

7 tautomer N NH H NH 2 HNDVDQJCIGZPN-UHFFFAYSA-N 9850FD9F9E2B4E25-FICuS HN N NH 2 HNDVDQJCIGZPN-RXMQYKEDSA-N E92E4BA2869F3611-FICuS H stereoisomers HN H N NH 2 HNDVDQJCIGZPN-YFKPBYRVSA-N 8A7AD1EB498CC76A-FICuS salt Na + HN - N NH 2 UHPNKBYGGMJTIM-UHFFFAYSA-M E5F83F10C5DB080A-FICuS HNDVDQJCIGZPN-UHFFFAYSA-N Histidine Std. InChIKey HN H N NH FD9F9E2B4E25-FICuS FICuS HN Na errors N NH2 HN H N NH charged form HN - + N NH 3 HNDVDQJCIGZPN-UHFFFAYSA-N A3DAE DDE4-FICuS isotope NH N 15 NH 2 H UHPNKBYGGMJTIM-UHFFFAYSA-M E5F83F10C5DB080A-FICuS HNDVDQJCIGZPN-UHFFFAYSA-N 9850FD9F9E2B4E25-FICuS HNDVDQJCIGZPN-CDYZYAPPSA-N B2FDA68AEDA06DB9-FICuS Tautomerism in large small-molecule databases

8 Chemical Structure Database (current version) calculation of Standard InChIKey and NCI/CADD identifiers (from ~350 million original raw records including multiple database versions) Total unique records: 168,015,387 (includes ~40M derived scaffold and ring structures) original structure record structure normalization parent structure hashcode calculation E_HASHISY NCI/CADD Identifier InChIKeys Standard: 167,722,852 Set 1: 167,722,850 Set 2: 167,722,852 Set 3: 167,698,426 any unique: 167,723,824 FICTS FICuS union set: uuuuu Standard InChIKey 1.04 NCI/CADD Identifiers (unique counts) FICTS: 125,009,738 FICuS: 121,429,689 uuuuu: 108,993,792 (before normalization: ~127M) Standard Set 1 Set 2 Set 3

9 Tautomers Tautomers are isomers that can transform into each other through chemical equilibrium reactions - Prototropic tautomerism: intramolecular movement of a hydrogen atom enol form keto form Strongly environmentdependent (ph, solvent, T, time,... ) - Ring-chain tautomerism: movement of the proton accompanied by opening/closing of a ring cyclic form acyclic form 9

10 Importance of Getting Tautomers Right The existence of multiple tautomeric forms of the same molecule can create problems! Property calculation Variations across tautomers by several orders of magnitude. Eg. pka, logp Registration in databases May lead to duplicate registration, missed molecules in searches, or incorrect identification of two structures as the same compound Clustering diversity (Tanimoto) similarity between tautomers can be very low Ligand docking Hydrogen bonding interactions are different for different tautomers X-ray crystallographic Most important for InChI What is the tautomeric form present in ligandprotein complexes? May impact the success of drug discovery 10

11 NCI/CADD Chemical Structure Database Tautomer Analysis Asinex 100 ChemBridge 90 ComGenex ChemNavigator 80 Columbia University 70 Molecular Screening Center 60 number EPA DSSTox database 50 Specs releases Ambinter BIND BindingDB ChemNavigator KEGG NCI pen Database NIST WebBook NLM ChemIDplus NMRShiftDB Thomson Pharma Wombat Average tautomeric overlap per DB ~0.3% Tautomeric overlap across all DBs: ~10% Structures capable of tautomerism: ~68% tautomeric overlap within each individual database release NCI/DTP PASS Training Set SGC-x ChemDB ZINC percentage of actual duplicates FICTS - FICuS parent structure in each database release frequency ChEBI ChemSpider Sitzmann M, Ihlenfeldt WD, Nicklaus MC. J Comput Aided Mol Des Jun;24(6-7): Tautomerism in large small-molecule databases

12 Tautomerism in Chemoinformatics In reality, tautomerism is a quantum-mechanical effect (orbital changes) In principle calculable at the QM level But incorporating the conditions (solvent, ph,...) is not easy...and these calculations can take weeks for one molecule In chemoinformatics: rule-based (you have maybe 100 ms per structure!) These rules will be correct only in a statistical sense Whether QM or rule-based: one has to agree on what set and ranges of conditions to use to define tautomerism in a practical application, e.g. for identifiers used for compound registration in a database or repository 12

13 Tautomerism and Identifiers Different chemical identifiers are sensitive to tautomerism to different degrees: InChI/InChIKey [IUPAC International Chemical Identifier] one identifier with a layer structure in principle designed to be tautomer-invariant but not all types of common tautomerism used by default (e.g. not invariant by default to keto/enol tautomerism in v. 1.04) NCI/CADD identifiers 1 several identifiers with different sensitivities to chemical features; most important one: FICuS tautomer-invariant 1 Sitzmann et al. SAR QSAR Environ. Res. 2008, 19,

14 Example of Known Issues in InChI 1,4-oxime/nitroso tautomerism not currently handled by InChI. Adding it will break current InChI. H N H N - H N + InChI=1S/C5H5N2/c (5)8/h1-4,7H InChI=1S/C5H5N2/c (5)8/h1-4,8H Dmitrii Tchekhovskoi, IUPAC InChI Committee Meeting, March

15 Tautomerism Transform Rules CACTVS: rule set of 21 transforms encoded as (CACTVS-extended) SMIRKS Types of tautomerism covered: rule 1: 1.3 (thio)keto/(thio)enol * rule 2: 1.5 (thio)keto/(thio)enol rule 3: simple (aliphatic) imine rule 4: special imine rule 5: 1.3 aromatic heteroatom H shift rule 6: 1.3 heteroatom H shift rule 7: 1.5 (aromatic) heteroatom H shift (1) rule 8: 1.5 (aromatic) heteroatom H shift (2) rule 9: 1.7 (aromatic) heteroatom H shift rule 10: 1.9 (aromatic) heteroatom H shift rule 11: 1.11 (aromatic) heteroatom H shift rule 12: furanones rule 13: keten/ynol exchange rule 14: ionic nitro/aci-nitro rule 15: pentavalent nitro/aci-nitro rule 16: oxim/nitroso rule 17: oxim/nitroso via phenol rule 18: cyanic/iso-cyanic acids rule 19: formamidinesulfinic acids rule 20: isocyanides rule 21: phosphonic acids * rule 1 has been merged with rule 6 CACTVS by: Wolf-Dietrich Ihlenfeldt, Xemistry GmbH 15

16 Tautomer verlap in Commercial Catalogs Aldrich Market Select (AMS) database of commercially available samples: 5,755,574 molecules ( version) Examples (prices per 1 g): 31,156 conflicts 62,872 molecules n-tuples Conflicts 2 30,619 $313 $ Same original supplier! $188 $300 16

17 Quadruple Tautomeric Case 17

18 But wait, there is more: Ring-Chain Tautomerism exocyclic endocyclic XH: Nucleophilic center, YZ: Electrophilic center Prototropic tautomerism handled by most chemoinformatics tools:...but not ring-chain tautomerism: The need for computer programs that predict ringchain tautomerization, a capability absent from the current tautomer generation programs etc. Y.C. Martin, J Comput Aided Mol Des. 2009: 23: Let s not forget tautomers UR GAL 18

19 Example for Ring-Chain Tautomerism: Warfarin Anticoagulant drug used in the prevention of thrombosis Introduced in 1948 as a pesticide against rats and mice Approved in 1954 for use as a medication Most widely prescribed oral anticoagulant drug in the U.S. Inhibits vitamin K epoxide reductase (recycles oxidized vitamin K1 to its reduced form) 19

20 Tautomerism of Warfarin What to Expect Can exist in principle in as many as 40 topologically distinct tautomeric forms! H H H H H H H H H H H H H H Submitted to PubChem Mentioned in literature Confirmed experimentally Valente E.J. et al., J. Med. Chem. 1977, 20, Karlsson, B.C.G. et al., J. Phys. Chem. B 2007, 111, Porter, R.P., J. Comput. Aided Mol. Des. 2010, 24, Nicholls, I.A. et al., J. Mol. Recognit. 2010, 23,

21 Tautomerism of Warfarin FICuS Identifier H H H H H H D76B88C F1-FICuS D76B88C F1-FICuS D76B88C F1-FICuS 09BB2FAADA1508A7-FICuS H H H D76B88C F1-FICuS D76B88C F1-FICuS D76B88C F1-FICuS 8F5519DD1E62B6B2-FICuS H H H H H D76B88C F1-FICuS D76B88C F1-FICuS D76B88C F1-FICuS Not covered by current rule set! prototropic tautomerism ring-chain 21 tautomerism 21

22 Tautomerism of Warfarin InChIKey Identifier H H H H H QTXVAVXCBMYBJW-UHFFFAYSA-N VWSXIGYSLWNCBN-VAWYXSNFSA-N GRAAPKVUSREWIL-UHFFFAYSA-N H LSCYDZJASSKSMJ-UHFFFAYSA-N H H H FQEPJULUDFINX-UHFFFAYSA-N UCKRWKACBKRIKB-VAWYXSNFSA-N NNLYDNMZCAHUV-UHFFFAYSA-N PIBBXWKSPNJFI-UHFFFAYSA-N H H H H H PJVWKTKQMNHTI-UHFFFAYSA-N FVSFCRPKSVCTBA-VAWYXSNFSA-N BBSKMPTDUUMKL-UHFFFAYSA-N prototropic tautomerism All InChIKey identifiers are different!! Revamp handling of tautomerism in InChI V.2 ring-chain 22 tautomerism 22

23 Definition of Ring-Chain Tautomerism Rules Baldwin's Rules as starting point J.E. Baldwin, J Chem Soc Chem Comun. 1976: Ring sizes being formed: 3 7 Breaking bonds: exocyclic (exo) or endocyclic (endo) Geometry of carbon atoms: sp 3 (tet), sp 2 (trig) or sp (dig) Disfavoured / favoured ring closures Baldwin s Rules ur Rules 11 ring-chain rules each rule is encoded as a SMIRKS string Guasch, L. et al.. JCIM (under revision) 23

24 6-exo-Trig exo-Dig endo-Trig Examples: Ring-chain tautomerism rules 24

25 Tautomers per Structure in the AMS Database Prototropic Tautomerism 21 Rules Ring-chain Tautomerism 11 Rules Count % Count % no tautomers (single molecule) 1,393, ,297, one tautomer 1,235, , tautomers 833, , tautomers 483, , tautomers 223, , tautomers 889, , tautomers 584, , tautomers 72, , Ring-chain tautomersim is in the minority relative to prototropic tautomerism but it is not an exotic occurrence either tautomers 35, , tautomers 3, , tautomers

26 Procedure: Tautomerism Analysis by Experimental Chemoinformatics Select tautomer tuples from AMS by coverage of types of tautomer transforms chemical diversity solubility availability from same original supplier likelihood to be distinguishable by NMR price Purchase samples Analyze by NMR spectroscopy Goal: measure as function of temperature, solvent, ph, shelf time... Investigate prevalence of tautomeric overlap in a real commercial catalog Test which of the tautomer transform rules may be too aggressive 26

27 NMR Experiments 127 prototropic tautomeric pairs 5 prototropic tautomeric triples 34 ring-chain tautomeric pairs 338 molecules from AMS Solvent: DMS-d6 Room temperature 1 H and 13 C NMR Spectra Bruker AVANCE TM 500 Autosampler (24) 27

28 Keto/enol Tautomerism (conflict 26) 1 H NMR Spectrum 26_1 Same 1 H and 13 C NMR spectrum between samples. Assignment of chemical shifts indicates enol form is present in both samples InChI=1S/C14H12N4S/c1-9-10( )13(19)18(17-9) (11)20-14/h2-3,6-7,19H,4-5H2,1H3 InChIKey=KTGFJMXHLHDQA-UHFFFAYSA-N 13 C NMR Spectrum 26_2 InChI=1S/C14H12N4S/c1-9-10( )13(19)18(17-9) (11)20-14/h2-3,6-7,17H,4-5H2,1H3 InChIKey=HGTVTJWWZHYAQS-UHFFFAYSA-N 28

29 Heteroaromatic Tautomerism (conflict 36) 1 H NMR Spectrum Same 1 H and 13 C NMR spectrum between samples. Double number of peaks, both samples have the same mixture of tautomers. InChI=1S/C9H13ClN4/c1-9(2,3)14-7-6( ) (10)13-7/h5H,4H2,1-3H3,(H,1,13) InChIKey=XRAYTMYJALHNQU-UHFFFAYSA-N 13 C NMR Spectrum InChI=1S/C9H13ClN4/c1-9(2,3)14-7-6( )4-11-8(10)13-7/h5,12H,4H2,1-3H3 InChIKey=PCZNHIHWFBLCGG-UHFFFAYSA-N 29

30 Ring-Chain Tautomerism (conflict 652) 1 H NMR Spectrum Same 1 H and 13 C NMR spectrum between samples. Assignment of chemical shifts indicates open form is present in both samples InChI=1S/C14H14N22S/c (8)19-14(11)16-12 (15-13) /h3,5,7,12,16H,1-2,4,6H2,(H,15,17)/t12-/m0/s1 InChIKey=XMZKSUZSNNZEDZ-LBPRGKRZSA-N 13 C NMR Spectrum InChI=1S/C14H14N22S/c15-13(17) (10)19-14 (12) /h3-4,7-8H,1-2,5-6H2,(H2,15,17) InChIKey=VKEHQTAMAUSRIS-UHFFFAYSA-N 30

31 Preliminary Results More than 200 spectra have been analyzed so far. VERY PRELIMINARY conclusions: Around 80% of prototropic tautomeric cases and 50% of ring-chain tautomeric cases show the same 1 H and 13 C NMR spectra Usually the same visual appearance of the samples of a pair (texture, color etc.) corresponds to identical NMR results. We have assigned the chemical shifts of some spectra to determine which tautomer is present in the samples. Some tautomeric conflicts, e.g. involving triazole, imidazole or pyrazole moieties, are practically indistinguishable by standard NMR experiments: 12_1 12_2 InChI=1S/C6H4BrN3/c (3-4) /h1-3H,(H,8,9,10) InChIKey=BQCIJWPKDPZNHD-UHFFFAYSA-N InChI=1S/C6H4BrN3/c (3-4) /h1-3H,(H,8,9,10) InChIKey=BQCIJWPKDPZNHD-UHFFFAYSA-N We are starting to review the tautomeric rules based on the NMR results. 31

32 Tautomerism ngoing and Planned Activities Database of tautomerism data (structures; ratios, interconversion rates, relative energies...) from literature, both experimental and computational IUPAC Working Group: Redesign of Handling of Tautomerism for InChI V2 Agree on conditions for tautomerism in InChI Addition of ring-chain tautomerism rule set Recommendation for a tautomerism rule set for InChI V2 Definition of a canonical tautomer 32

33 Acknowledgements NCI/CADD Team Alexey Zakharov Laura Guasch Megan Peach Marc Nicklaus Markus Sitzmann ChemNavigator Scott Hutton PubChem Xemistry GmbH Wolf-Dietrich Ihlenfeldt All other database providers InChI Team

34

35 Unique Representation of Chemical Structures NCI/CADD Structure Identifiers based on hashcodes calculated by the chemoinformatics toolkit CACTVS HN N NH 2 H CACTVS hashcodes: 9850FD9F9E2B4E25 represent a chemical structure uniquely as 16-digit hexadecimal number (64-bit unsigned) high sensitivity to structural features of a compound change if connectivity changes

36 tautomer N NH H NH 2 HNDVDQJCIGZPN-UHFFFAYSA-N 9850FD9F9E2B4E25-FICuS HN N NH 2 HNDVDQJCIGZPN-RXMQYKEDSA-N E92E4BA2869F3611-FICuS H stereoisomers HN H N NH 2 HNDVDQJCIGZPN-YFKPBYRVSA-N 8A7AD1EB498CC76A-FICuS salt Na + HN - N NH 2 UHPNKBYGGMJTIM-UHFFFAYSA-M E5F83F10C5DB080A-FICuS HNDVDQJCIGZPN-UHFFFAYSA-N Std. InChIKey HN H N NH FD9F9E2B4E25-FICuS FICuS HN Na errors N NH2 HN H N NH charged form HN - + N NH 3 HNDVDQJCIGZPN-UHFFFAYSA-N A3DAE DDE4-FICuS isotope NH N 15 NH 2 H UHPNKBYGGMJTIM-UHFFFAYSA-M E5F83F10C5DB080A-FICuS HNDVDQJCIGZPN-UHFFFAYSA-N 9850FD9F9E2B4E25-FICuS HNDVDQJCIGZPN-CDYZYAPPSA-N B2FDA68AEDA06DB9-FICuS

37 Chemical Identifier Resolver (CIR) examples: MIME type: text/plain Buckyball XMWRBQBLMFGWIX-UHFFFAYSA-N/image?height=300&width=300&bgcolor=black&bondcolor=white

38 Partial InChIKey Lookup (in preparation) resolve Standard InChIKey into full structure representation: Ethanol CC CC CC[H2+] C(C()([2H])[2H])[2H] CC()([2H])[2H] C(C)([2H])([2H])[2H] CC[17H] C(C)[2H] [14CH3]C CC

39 Chemical Structure Database (current version) InChI/InChIKey (Version 1.04) calculated with four InChI flag sets: Standard Set, Set 1 & Set 2: addition of hydrogen atoms by CACTVS Set 3: addition of hydrogen atoms by the InChI library CACTVS Standard : Add H Standard InChIKey Set 1 : Add H DNTADDH W0 FIXEDH RECMET NEWPS SPXYZ SAsXYZ Fb Fnud KET 15T Set 2 : Add H DNTADDH W0 FIXEDH RECMET NEWPS SPXYZ SAsXYZ Fb Fnud KET 15T Set 3 : Add H DNTADDH W0 FIXEDH RECMET NEWPS SPXYZ SAsXYZ Fb Fnud KET 15T

40 Unique Representation of Chemical Structures NCI/CADD Structure Identifiers MDL Molfile MDL SDF SMILES ChemDraw cdx PDB original structure record structure normalization parent structure hashcode calculation E_HASHISY NCI/CADD Identifier MDL SDF SMILES database FICTS FICuS uuuuu we calculate a set of parent structures with different sensitivity to chemical features fine grained representation of chemical structures

41 NCI/CADD Chemical Structure Database Tautomer Analysis 30 occurrence of tautomerism-critical molecules within each individual database release (%) number database releases frequency average: ~9.5% of FICuS parent structures Sitzmann M, Ihlenfeldt WD, Nicklaus MC. J Comput Aided Mol Des Jun;24(6-7):

42 Ring-chain tautomerism in a real database Most commonly applicable ring-chain tautomerism rules in the AMS Database SMIRKS rule Count % 3-exo-Trig 65, exo-Trig 10, exo-Trig 7,506, exo-Trig 5,289, exo-Trig 4,185, exo-Dig 179, exo-Dig 472, exo-Dig 3,293, endo-Trig 169, endo-Trig 156, endo-Trig 65, Most common 42

43 Imine/amine tautomerism (conflict 5) 1 H NMR Spectrum 5_1 Same 1 H and 13 C NMR spectrum between samples. Assignation of chemical shifts indicates imine form is present in both samples 5_2 InChI=1S/C14H11FN6/c ( (15)5-3-8) (6-16)13(17)21(14)20-11/h2-5,17,19H,7H2,1H3 InChIKey=ZDHCKDWMHUPE-UHFFFAYSA-N 13 C NMR Spectrum InChI=1S/C14H11FN6/c ( (15)5-3-8) (6-16)13(17)21(14)20-11/h2-5H,7,17H2,1H3 InChIKey=ZUXILZLWKBIETJ-UHFFFAYSA-N 43

44 InChI/InChIKey Resolver

45 InChI/InChIKey Resolver loose coupling of InChI resolvers provided by different organizations central list of resolvers each resolver must provide a specific protocol.

46 InChI/InChIKey Resolver Evan Bolton (NCBI, NLM, NIH) Valery Tkachenko (RSC/ChemSpider) Marc Nicklaus (CADD Group, NCI, NIH) Steven Bachrach (Trinity University) Antony Williams (RSC/ChemSpider) Markus Sitzmann (CADD Group, NCI, NIH)

47

InChI/InChIKey vs. NCI/CADD Structure Identifiers: A comparison

InChI/InChIKey vs. NCI/CADD Structure Identifiers: A comparison InChI/InChIKey vs. CI/CADD Structure Identifiers: A comparison Markus Sitzmann Computer-Aided Drug Design Group (CI/CADD), Laboratory of Medicinal Chemistry, CI-Frederick, I, DS Comparison Standard InChI/InChIKeys

More information

The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services

The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services The IBM Patent Data Donation to NIH, and its Integration in the NCI/CADD Database and Web Services Marc C. Nicklaus Computer-Aided Drug Design Group, Chemical Biology Laboratory, Frederick National Laboratory

More information

Tautomerism in chemical information management systems

Tautomerism in chemical information management systems Tautomerism in chemical information management systems Dr. Wendy A. Warr http://www.warr.com Tautomerism in chemical information management systems Author: Wendy A. Warr DOI: 10.1007/s10822-010-9338-4

More information

Introduction to Chemoinformatics and Drug Discovery

Introduction to Chemoinformatics and Drug Discovery Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013 The Chemical Space There are atoms and space. Everything else is opinion. Democritus (ca.

More information

AUTOMATIC GENERATION OF TAUTOMERS

AUTOMATIC GENERATION OF TAUTOMERS ПЛОВДИВСКИ УНИВЕРСИТЕТ ПАИСИЙ ХИЛЕНДАРСКИ БЪЛГАРИЯ НАУЧНИ ТРУДОВЕ, ТОМ 38, КН. 5, 2011 ХИМИЯ UNIVERSITY OF PLOVDIV PAISII HILENDARSKI BULGARIA SCIENTIFIC PAPERS, VOL. 38, BOOK 5, 2011 CHEMISTRY AUTOMATIC

More information

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov

QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov QSAR Modeling of Human Liver Microsomal Stability Alexey Zakharov CADD Group Chemical Biology Laboratory Frederick National Laboratory for Cancer Research National Cancer Institute, National Institutes

More information

InChI keys as standard global identifiers in chemistry web services. Russ Hillard ACS, Salt Lake City March 2009

InChI keys as standard global identifiers in chemistry web services. Russ Hillard ACS, Salt Lake City March 2009 InChI keys as standard global identifiers in chemistry web services Russ Hillard ACS, Salt Lake City March 2009 Context of this talk We have created a web service That aggregates sources built independently

More information

Synthetically Accessible Virtual Inventory (SAVI)

Synthetically Accessible Virtual Inventory (SAVI) Synthetically Accessible Virtual Inventory (SAVI) Yuri Pevzner, Wolf-Dietrich Ihlenfeldt 1, and Marc C. Nicklaus Computer-Aided Drug Design (CADD) Group 1 Xemistry GmbH, Germany Chemical Biology Laboratory

More information

Canonical Line Notations

Canonical Line Notations Canonical Line otations InChI vs SMILES Krisztina Boda verview Compound naming InChI SMILES Molecular equivalency Isomorphism Kekule Tautomers Finding duplicates What s Your ame? 1. Unique numbers CAS

More information

Chemical Databases: Encoding, Storage and Search of Chemical Structures

Chemical Databases: Encoding, Storage and Search of Chemical Structures Chemical Databases: Encoding, Storage and Search of Chemical Structures Dr. Timur I. Madzhidov Kazan Federal University, Department of Organic Chemistry * Ray, L.C. and R.A. Kirsch, Finding Chemical Records

More information

Introduction to Chemoinformatics

Introduction to Chemoinformatics Introduction to Chemoinformatics www.dq.fct.unl.pt/cadeiras/qc Prof. João Aires-de-Sousa Email: jas@fct.unl.pt Recommended reading Chemoinformatics - A Textbook, Johann Gasteiger and Thomas Engel, Wiley-VCH

More information

Хемоінформатика. Докінг. Дизайн ліків. Біоінформатика (3 курс) Лекція 4 (частина 1)

Хемоінформатика. Докінг. Дизайн ліків. Біоінформатика (3 курс) Лекція 4 (частина 1) Хемоінформатика. Докінг. Дизайн ліків Біоінформатика (3 курс) Лекція 4 (частина 1) Формати файлів в хемоінформатиці Chemical information is usually provided as files or streams and many formats have been

More information

DECEMBER 2014 REAXYS R201 ADVANCED STRUCTURE SEARCHING

DECEMBER 2014 REAXYS R201 ADVANCED STRUCTURE SEARCHING DECEMBER 2014 REAXYS R201 ADVANCED STRUCTURE SEARCHING 1 NOTES ON REAXYS R201 THIS PRESENTATION COMMENTS AND SUMMARY Outlines how to: a. Perform Substructure and Similarity searches b. Use the functions

More information

Bioinformatics Workshop - NM-AIST

Bioinformatics Workshop - NM-AIST Bioinformatics Workshop - NM-AIST Day 3 Introduction to Drug/Small Molecule Discovery Thomas Girke July 25, 2012 Bioinformatics Workshop - NM-AIST Slide 1/44 Introduction CMP Structure Formats Similarity

More information

5. Composition and Connectivity Does the formula always represent the complete composition of the substance?

5. Composition and Connectivity Does the formula always represent the complete composition of the substance? InChI FAQ InChI FAQ... 1 1. FAQ Overview... 5 1.1. What is this FAQ?... 5 1.2. Who is responsible for InChI?... 5 1.3. Where can I find out more?... 6 1.4. Is there an InChI mailing list?... 6 1.5. Are

More information

A Journey from Data to Knowledge

A Journey from Data to Knowledge A Journey from Data to Knowledge Ian Bruno Cambridge Crystallographic Data Centre @ijbruno @ccdc_cambridge Experimental Data C 10 H 16 N +,Cl - Radspunk, CC-BY-SA CC-BY-SA Jeff Dahl, CC-BY-SA Experimentally

More information

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre Dr. Sander B. Nabuurs Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre The road to new drugs. How to find new hits? High Throughput

More information

Analyzing Small Molecule Data in R

Analyzing Small Molecule Data in R Analyzing Small Molecule Data in R Tyler Backman and Thomas Girke December 12, 2011 Analyzing Small Molecule Data in R Slide 1/49 Introduction CMP Structure Formats Similarity Searching Background Fragment

More information

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME Iván Solt Solutions for Cheminformatics Drug Discovery Strategies for known targets High-Throughput Screening (HTS) Cells

More information

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann

Information Extraction from Chemical Images. Discovery Knowledge & Informatics April 24 th, Dr. Marc Zimmermann Information Extraction from Chemical Images Discovery Knowledge & Informatics April 24 th, 2006 Dr. Available Chemical Information Textbooks Reports Patents Databases Scientific journals and publications

More information

On InChI and evaluating the quality of cross-reference links

On InChI and evaluating the quality of cross-reference links Galgonek and Vondrášek Journal of Cheminformatics 2014, 6:15 RESEARCH ARTICLE Open Access On InChI and evaluating the quality of cross-reference links Jakub Galgonek * and Jiří Vondrášek * Abstract Background:

More information

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland

Navigation in Chemical Space Towards Biological Activity. Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Navigation in Chemical Space Towards Biological Activity Peter Ertl Novartis Institutes for BioMedical Research Basel, Switzerland Data Explosion in Chemistry CAS 65 million molecules CCDC 600 000 structures

More information

STEREOCHEMISTRY AND STEREOELECTRONICS NOTES

STEREOCHEMISTRY AND STEREOELECTRONICS NOTES - 1 - STEREOCHEMISTRY AND STEREOELECTRONICS NOTES Stereochemistry in Organic Molecules Conventions used in drawing molecules Also, Fischer projections can sometimes be useful for acyclic molecules with

More information

Completions Multiple Enrollment in same semester. 2. Mode of Instruction (Hours per Unit are defaulted) Hegis Code(s) (Provided by the Dean)

Completions Multiple Enrollment in same semester. 2. Mode of Instruction (Hours per Unit are defaulted) Hegis Code(s) (Provided by the Dean) CALIFORNIA STATE UNIVERSITY CHANNEL ISLANDS COURSE MODIFICATION PROPOSAL Courses must be submitted by November 2, 2009, to make the next catalog (2010--2011) production DATE (CHANGE DATE EACH TIME REVISED):

More information

Organometallics & InChI. August 2017

Organometallics & InChI. August 2017 Organometallics & InChI August 2017 The Cambridge Structural Database 900,000+ small-molecule crystal structures Over 60,000 datasets deposited annually Enriched and annotated by experts Structures available

More information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007 Computational Chemistry in Drug Design Xavier Fradera Barcelona, 17/4/2007 verview Introduction and background Drug Design Cycle Computational methods Chemoinformatics Ligand Based Methods Structure Based

More information

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE Examples of Protein Modeling Protein Modeling Visualization Examination of an experimental structure to gain insight about a research question Dynamics To examine the dynamics of protein structures To

More information

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017 Using Phase for Pharmacophore Modelling 5th European Life Science Bootcamp March, 2017 Phase: Our Pharmacohore generation tool Significant improvements to Phase methods in 2016 New highly interactive interface

More information

Spring Term 2012 Dr. Williams (309 Zurn, ex 2386)

Spring Term 2012 Dr. Williams (309 Zurn, ex 2386) Chemistry 242 Organic Chemistry II Spring Term 2012 Dr. Williams (309 Zurn, ex 2386) Web Page: http://math.mercyhurst.edu/~jwilliams/ jwilliams@mercyhurst.edu (or just visit Department web site and look

More information

CDK & Mass Spectrometry

CDK & Mass Spectrometry CDK & Mass Spectrometry October 3, 2011 1/18 Stephan Beisken October 3, 2011 EBI is an outstation of the European Molecular Biology Laboratory. Chemistry Development Kit (CDK) An Open Source Java TM Library

More information

IUPAC International Chemical Identifier (InChI) Subcommittee

IUPAC International Chemical Identifier (InChI) Subcommittee IUPAC International Chemical Identifier (InChI) Subcommittee Minutes of the meeting on March 23rd 2009 at the Sheraton City Centre, Salt Lake City, UT, USA Present: Subcommittee members: Steve Heller (Chairman)

More information

DEPARTMENT: Chemistry

DEPARTMENT: Chemistry CODE: CHEM 203 TITLE: Organic Chemistry I INSTITUTE: STEM DEPARTMENT: Chemistry COURSE DESCRIPTION: Students will apply many concepts from general chemistry to a study of organic chemistry. They will be

More information

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations Capturing Chemistry What you see is what you get In the world of mechanism and chemical transformations Dr. Stephan Schürer ead of Intl. Sci. Content Libraria, Inc. sschurer@libraria.com Distribution of

More information

Reaction mechanisms offer us insights into how reactions work / how molecules react with one another.

Reaction mechanisms offer us insights into how reactions work / how molecules react with one another. Introduction 1) Lewis Structures 2) Representing Organic Structures 3) Geometry and Hybridization 4) Electronegativities and Dipoles 5) Resonance Structures (a) Drawing Them (b) Rules for Resonance 6)

More information

Structural biology and drug design: An overview

Structural biology and drug design: An overview Structural biology and drug design: An overview livier Taboureau Assitant professor Chemoinformatics group-cbs-dtu otab@cbs.dtu.dk Drug discovery Drug and drug design A drug is a key molecule involved

More information

Representation of molecular structures. Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal

Representation of molecular structures. Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal Representation of molecular structures Coutersy of Prof. João Aires-de-Sousa, University of Lisbon, Portugal A hierarchy of structure representations Name (S)-Tryptophan 2D Structure 3D Structure Molecular

More information

cheminformatics toolkits: a personal perspective

cheminformatics toolkits: a personal perspective cheminformatics toolkits: a personal perspective Roger Sayle Nextmove software ltd Cambridge uk 1 st RDKit User Group Meeting, London, UK, 4 th October 2012 overview Models of Chemistry Implicit and Explicit

More information

Enumeration of Ring Chain Tautomers Based on SMIRKS Rules

Enumeration of Ring Chain Tautomers Based on SMIRKS Rules This is an open access article published under an ACS AuthorChoice License, which permits copying and redistribution of the article or any adaptations for non-commercial purposes. pubs.acs.org/jcim Enumeration

More information

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller Chemogenomic: Approaches to Rational Drug Design Jonas Skjødt Møller Chemogenomic Chemistry Biology Chemical biology Medical chemistry Chemical genetics Chemoinformatics Bioinformatics Chemoproteomics

More information

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences

György M. Keserű H2020 FRAGNET Network Hungarian Academy of Sciences Fragment based lead discovery - introduction György M. Keserű H2020 FRAGET etwork Hungarian Academy of Sciences www.fragnet.eu Hit discovery from screening Druglike library Fragment library Large molecules

More information

UNIT 4 REVISION CHECKLIST CHEM 4 AS Chemistry

UNIT 4 REVISION CHECKLIST CHEM 4 AS Chemistry UNIT 4 REVISION CHECKLIST CHEM 4 AS Chemistry Topic 4.1 Kinetics a) Define the terms: rate of a reaction, rate constant, order of reaction and overall order of reaction b) Deduce the orders of reaction

More information

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK Molecular Modelling Computational Chemistry Demystified Peter Bladon Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK John E. Gorton Gorton Systems, Glasgow, UK Robert B. Hammond Institute

More information

Command-line tools of ChemAxon: tips and tricks

Command-line tools of ChemAxon: tips and tricks Command-line tools of ChemAxon: tips and tricks György Pirok Solutions for Cheminformatics Command-line interface A command-line interface (CLI) is a mechanism for interacting with a computer operating

More information

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value Anthony Arvanites Daylight User Group Meeting March 10, 2005 Outline 1. Company Introduction

More information

Topic 9. Aldehydes & Ketones

Topic 9. Aldehydes & Ketones Chemistry 2213a Fall 2012 Western University Topic 9. Aldehydes & Ketones A. Structure and Nomenclature The carbonyl group is present in aldehydes and ketones and is the most important group in bio-organic

More information

Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan

Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan Database Speaks Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan Email: liuu@chem.sinica.edu.tw 1 OUTLINES -- Personal experiences Publication types Secondary publication

More information

Navigating between patents, papers, abstracts and databases using public sources and tools

Navigating between patents, papers, abstracts and databases using public sources and tools Navigating between patents, papers, abstracts and databases using public sources and tools Christopher Southan 1 and Sean Ekins 2 TW2Informatics, Göteborg, Sweden, Collaborative Drug Discovery, North Carolina,

More information

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics

Marvin. Sketching, viewing and predicting properties with Marvin - features, tips and tricks. Gyorgy Pirok. Solutions for Cheminformatics Marvin Sketching, viewing and predicting properties with Marvin - features, tips and tricks Gyorgy Pirok Solutions for Cheminformatics The Marvin family The Marvin toolkit provides web-enabled components

More information

(07) 2 (c) 2 (c) (i) Calculate the ph of this buffer solution at 25 oc (3 marks) (Extra space)

(07) 2 (c) 2 (c) (i) Calculate the ph of this buffer solution at 25 oc (3 marks) (Extra space) 7 The value of Ka for methanoic acid is 1.78 10 4 mol dm 3 at 25 oc. A buffer solution is prepared containing 2.35 10 2 mol of methanoic acid and Question 1: N/A 1.84 10 2 mol of sodium methanoate in 1.00

More information

The IUPAC InChI Chemical Structure Standard

The IUPAC InChI Chemical Structure Standard www.inchi-trust.org www.inchi-trust.org The IUPAC InChI Chemical Structure Standard Stephen Heller The main web sites for the IUPAC InChI project are: http://www.iupac.org/inchi and http://www.inchi-trust.org

More information

Tautomerism in Alkyl and -OH Derivatives of Heterocycles containing Two Heteroatoms

Tautomerism in Alkyl and -OH Derivatives of Heterocycles containing Two Heteroatoms Tautomerism in Alkyl and - Derivatives of eterocycles containing Two eteroatoms Colette Jermann Master in chemistry - Year eterocyclic Chemistry Coursework 1 Summary Type of tautomerism S Isoxazol Isothiazole

More information

2-1 The Nature of Matter

2-1 The Nature of Matter Biology 1 of 40 2 of 40 The study of chemistry begins with the basic unit of matter, the atom. The Greek philosopher Democritus called the smallest fragment of matter the atom, from the Greek word atomos.

More information

Dock Ligands from a 2D Molecule Sketch

Dock Ligands from a 2D Molecule Sketch Dock Ligands from a 2D Molecule Sketch March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com

More information

Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction

Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction Stefan Kuhn 1, Björn Egert 2, Steffen Neumann 2, Christoph Steinbeck 1European Bioinformatics Institute

More information

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics... 1 1.1 Chemoinformatics... 2 1.1.1 Open-Source Tools... 2 1.1.2 Introduction to Programming Languages... 3 1.2 Chemical Structure

More information

Pipeline Pilot Integration

Pipeline Pilot Integration Scientific & technical Presentation Pipeline Pilot Integration Szilárd Dóránt July 2009 The Component Collection: Quick facts Provides access to ChemAxon tools from Pipeline Pilot Free of charge Open source

More information

Data Mining in the Chemical Industry. Overview of presentation

Data Mining in the Chemical Industry. Overview of presentation Data Mining in the Chemical Industry Glenn J. Myatt, Ph.D. Partner, Myatt & Johnson, Inc. glenn.myatt@gmail.com verview of presentation verview of the chemical industry Example of the pharmaceutical industry

More information

ICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009

ICM-Chemist How-To Guide. Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist How-To Guide Version 3.6-1g Last Updated 12/01/2009 ICM-Chemist HOW TO IMPORT, SKETCH AND EDIT CHEMICALS How to access the ICM Molecular Editor. 1. Click here 2. Start sketching How to sketch

More information

Tentative Daily Schedule CHEM 125 Fall 2012

Tentative Daily Schedule CHEM 125 Fall 2012 Tentative Daily Schedule CHEM 125 Fall 2012 Atomic Structure and Periodic Trends Thursday, Aug 30 th, Day 2 Resources: o Text and Connect Passcode (available at bookstore): Chemical Structure and Properties,

More information

EASTERN ARIZONA COLLEGE General Organic Chemistry I

EASTERN ARIZONA COLLEGE General Organic Chemistry I EASTERN ARIZONA COLLEGE General Organic Chemistry I Course Design 2015-2016 Course Information Division Science Course Number CHM 235 (SUN# CHM 2235) Title General Organic Chemistry I Credits 4 Developed

More information

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Plan Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics. Exercise: Example and exercise with herg potassium channel: Use of

More information

The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc

The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company Sally Rose BioFocus plc Overview History of BioFocus and acquisition of CDD Biological

More information

Classifying Compounds in Public Databases

Classifying Compounds in Public Databases Classifying Compounds in Public Databases ACS San Diego, March 15, 2016 OntoChem IT Solutions GmbH Blücherstr. 24 06120 Halle (Saale) Germany Tel. +49 345 4780470 Fax: +49 345 4780471 mail: info(at)ontochem.com

More information

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS

COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS COMPUTER AIDED DRUG DESIGN (CADD) AND DEVELOPMENT METHODS DRUG DEVELOPMENT Drug development is a challenging path Today, the causes of many diseases (rheumatoid arthritis, cancer, mental diseases, etc.)

More information

18.1 Intro to Aromatic Compounds

18.1 Intro to Aromatic Compounds 18.1 Intro to Aromatic Compounds AROMATIC compounds or ARENES include benzene and benzene derivatives. Aromatic compounds are quite common. Many aromatic compounds were originally isolated from fragrant

More information

Medicinal Chemistry/ CHEM 458/658 Chapter 3- SAR and QSAR

Medicinal Chemistry/ CHEM 458/658 Chapter 3- SAR and QSAR Medicinal Chemistry/ CHEM 458/658 Chapter 3- SAR and QSAR Bela Torok Department of Chemistry University of Massachusetts Boston Boston, MA 1 Introduction Structure-Activity Relationship (SAR) - similar

More information

The Schrödinger KNIME extensions

The Schrödinger KNIME extensions The Schrödinger KNIME extensions Computational Chemistry and Cheminformatics in a workflow environment Jean-Christophe Mozziconacci Volker Eyrich Topics What are the Schrödinger extensions? Workflow application

More information

The IUPAC Chemical Identifier

The IUPAC Chemical Identifier The IUPAC Chemical Identifier Steve Stein, Steve eller, Dmitrii Tchekhovskoi ational Institute of Standards and Technology Gaithersburg, MD, USA CAS/IUPAC Conference on Chemical Identifiers and XML for

More information

Paper No. 1: ORGANIC CHEMISTRY- I (Nature of Bonding and Stereochemistry) MODULE No. 5: Tautomerism

Paper No. 1: ORGANIC CHEMISTRY- I (Nature of Bonding and Stereochemistry) MODULE No. 5: Tautomerism Subject Chemistry Paper No and Title Paper 1: ORGANIC - I (Nature of Bonding Module No and Module 5: Tautomerism Title Module Tag CHE_P1_M5 Paper No. 1: ORGANIC - I (Nature of Bonding TABLE OF CONTENTS

More information

Methods for tautomer enumeration, -searching and -duplicate filtering

Methods for tautomer enumeration, -searching and -duplicate filtering Methods for tautomer enumeration, -searching and -duplicate filtering József Szegezdi, Zsolt Mohácsi, Tamás Csizmazia, Szilárd Dóránt, Ákos Papp, György Pirok, Szabolcs Csepregi, Ferenc Csizmadia Solutions

More information

O N N. electrons in ring

O N N. electrons in ring ame I. ( points) Page 1 A. The compound shown below is a commonly prescribed antifungal drug. It belongs to a category of compounds known as triazoles. Analyze the geometry and other properties for various

More information

Biology. Slide 1 of 40. End Show. Copyright Pearson Prentice Hall

Biology. Slide 1 of 40. End Show. Copyright Pearson Prentice Hall Biology 1 of 40 2-1 The Nature of Matter 2 of 40 2-1 The Nature of Matter Atoms Atoms The study of chemistry begins with the basic unit of matter, the atom. 3 of 40 2-1 The Nature of Matter Atoms Placed

More information

Identifying Interaction Hot Spots with SuperStar

Identifying Interaction Hot Spots with SuperStar Identifying Interaction Hot Spots with SuperStar Version 1.0 November 2017 Table of Contents Identifying Interaction Hot Spots with SuperStar... 2 Case Study... 3 Introduction... 3 Generate SuperStar Maps

More information

User Guide for LeDock

User Guide for LeDock User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,

More information

Fragment-based de novo Design

Fragment-based de novo Design ragment-based de novo Design Gisbert Schneider & Uli echner gisbert.schneider@modlab.de www.modlab.de Beilstein Endowed Chair for Cheminformatics Institute of rganic Chemistry & Chemical Biology Johann

More information

Open PHACTS Explorer: Compound by Name

Open PHACTS Explorer: Compound by Name Open PHACTS Explorer: Compound by Name This document is a tutorial for obtaining compound information in Open PHACTS Explorer (explorer.openphacts.org). Features: One-click access to integrated compound

More information

Drug Informatics for Chemical Genomics...

Drug Informatics for Chemical Genomics... Drug Informatics for Chemical Genomics... An Overview First Annual ChemGen IGERT Retreat Sept 2005 Drug Informatics for Chemical Genomics... p. Topics ChemGen Informatics The ChemMine Project Library Comparison

More information

pka Prediction for Organic Acids and Bases

pka Prediction for Organic Acids and Bases pka Prediction for Organic Acids and Bases pka Prediction for Organic Acids and Bases D. D. Perrin John Curtin School of Medical Research Australian National Universi~oy Canberra Boyd Dempsey and E. P.

More information

Homework - Review of Chem 2310

Homework - Review of Chem 2310 omework - Review of Chem 2310 Chapter 1 - Atoms and Molecules Name 1. What is organic chemistry? 2. Why is there an entire one year course devoted to the study of organic compounds? 3. Give 4 examples

More information

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer LigandScout Automated Structure-Based Pharmacophore Model Generation Gerhard Wolber* and Thierry Langer * E-Mail: wolber@inteligand.com Pharmacophores from LigandScout Pharmacophores & the Protein Data

More information

2.0 h; 240 points please print clearly Dr. Kathleen Nolta Signature. Problem Points Score GSI I 42 II 35 III 36 IV 46 V 42 VI 39 Total 240

2.0 h; 240 points please print clearly Dr. Kathleen Nolta Signature. Problem Points Score GSI I 42 II 35 III 36 IV 46 V 42 VI 39 Total 240 hemistry 10 irst ame inal Examination Last ame.0 h; 0 points please print clearly Dr. Kathleen olta ignature UM ID # Problem Points core GI I II 5 III IV V VI 9 Total 0 Precision in drawing counts. heck

More information

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design Bela Torok Department of Chemistry University of Massachusetts Boston Boston, MA 1 Computer Aided Drug Design - Introduction Development

More information

Similarity Search. Uwe Koch

Similarity Search. Uwe Koch Similarity Search Uwe Koch Similarity Search The similar property principle: strurally similar molecules tend to have similar properties. However, structure property discontinuities occur frequently. Relevance

More information

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments BioSolveIT Biology Problems Solved using Information Technology A Combinatorial Approach for andling of Protonation and Tautomer Ambiguities in Docking Experiments Ingo Dramburg BioSolve IT Gmb An der

More information

August 10, Prospective Chemistry 5511 Students. SUBJECT: Course Syllabus for Chemistry 5511 Fall 2011

August 10, Prospective Chemistry 5511 Students. SUBJECT: Course Syllabus for Chemistry 5511 Fall 2011 TO: FROM: Prospective Chemistry 5511 Students Peter Gaspar August 10, 2011 SUBJECT: Course Syllabus for Chemistry 5511 Fall 2011 Chemistry 5511 Mechanistic Organic Chemistry is the first semester of a

More information

Similarity methods for ligandbased virtual screening

Similarity methods for ligandbased virtual screening Similarity methods for ligandbased virtual screening Peter Willett, University of Sheffield Computers in Scientific Discovery 5, 22 nd July 2010 Overview Molecular similarity and its use in virtual screening

More information

*Assignments could be reversed. *

*Assignments could be reversed. * Name Key 5 W-Exam No. Page I. (6 points) Identify the indicated pairs of hydrogens in each of the following compounds as (i) homotopic, (ii) enantiotopic, or (iii) diastereotopic s. Write the answers as

More information

More information can be found in Chapter 12 in your textbook for CHEM 3750/ 3770 and on pages in your laboratory manual.

More information can be found in Chapter 12 in your textbook for CHEM 3750/ 3770 and on pages in your laboratory manual. CHEM 3780 rganic Chemistry II Infrared Spectroscopy and Mass Spectrometry Review More information can be found in Chapter 12 in your textbook for CHEM 3750/ 3770 and on pages 13-28 in your laboratory manual.

More information

Synthesis of 2,5,7-triamino[1,2,4]triazolo[1,5-a][1,3,5]triazines as potential antifolate agents

Synthesis of 2,5,7-triamino[1,2,4]triazolo[1,5-a][1,3,5]triazines as potential antifolate agents Anton V. Dolzhenko, Anna V. Dolzhenko and Wai-Keung Chui Synthesis of 2,5,7-triamino[1,2,4]triazolo[1,5-a][1,3,5]triazines as potential antifolate agents Department of Pharmacy, Faculty of Science, ational

More information

Amines Reading Study Problems Key Concepts and Skills Lecture Topics: Amines: structure and nomenclature

Amines Reading Study Problems Key Concepts and Skills Lecture Topics: Amines: structure and nomenclature Amines Reading: Wade chapter 19, sections 19-1-19-19 Study Problems: 19-37, 19-39, 19-40, 19-41, 19-44, 19-46, 19-47, 19-48, 19-51, 19-54 Key Concepts and Skills: Explain how the basicity of amines varies

More information

2-1 The Nature of Matter

2-1 The Nature of Matter 2-1 The Nature of Matter Small Atoms Placed side by side, 100 million atoms would make a row only about 1 centimeter long. contain subatomic particles Atoms What three subatomic particles make up atoms?

More information

Organic Chemistry 112 A B C - Syllabus Addendum for Prospective Teachers

Organic Chemistry 112 A B C - Syllabus Addendum for Prospective Teachers Chapter Organic Chemistry 112 A B C - Syllabus Addendum for Prospective Teachers Ch 1-Structure and bonding Ch 2-Polar covalent bonds: Acids and bases McMurry, J. (2004) Organic Chemistry 6 th Edition

More information

Reaxys Pipeline Pilot Components Installation and User Guide

Reaxys Pipeline Pilot Components Installation and User Guide 1 1 Reaxys Pipeline Pilot components for Pipeline Pilot 9.5 Reaxys Pipeline Pilot Components Installation and User Guide Version 1.0 2 Introduction The Reaxys and Reaxys Medicinal Chemistry Application

More information

Organic Chemistry. Alkynes

Organic Chemistry. Alkynes For updated version, please click on http://ocw.ump.edu.my Organic Chemistry Alkynes by Dr. Seema Zareen & Dr. Izan Izwan Misnon Faculty Industrial Science & Technology seema@ump.edu.my & iezwan@ump.edu.my

More information

DOCKING TUTORIAL. A. The docking Workflow

DOCKING TUTORIAL. A. The docking Workflow 2 nd Strasbourg Summer School on Chemoinformatics VVF Obernai, France, 20-24 June 2010 E. Kellenberger DOCKING TUTORIAL A. The docking Workflow 1. Ligand preparation It consists in the standardization

More information

Chemical Journal Publishing in an Online World. Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009

Chemical Journal Publishing in an Online World. Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009 Chemical Journal Publishing in an Online World Jason Wilde, Publisher Physical Sciences Nature Publishing Group ACS Spring Meeting 2009 1 Overview Chemists and chemistry NPG and chemistry Nature Chemistry

More information

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013

Chemical Space. Space, Diversity, and Synthesis. Jeremy Henle, 4/23/2013 Chemical Space Space, Diversity, and Synthesis Jeremy Henle, 4/23/2013 Computational Modeling Chemical Space As a diversity construct Outline Quantifying Diversity Diversity Oriented Synthesis Wolf and

More information

Basic principles of multidimensional NMR in solution

Basic principles of multidimensional NMR in solution Basic principles of multidimensional NMR in solution 19.03.2008 The program 2/93 General aspects Basic principles Parameters in NMR spectroscopy Multidimensional NMR-spectroscopy Protein structures NMR-spectra

More information

The IUPAC InChI Open Source Chemical Structure Standard. Stephen Heller InChI-Trust Project Director

The IUPAC InChI Open Source Chemical Structure Standard. Stephen Heller InChI-Trust Project Director www.inchi-trust.org www.inchi-trust.org The IUPAC InChI Open Source Chemical Structure Standard Stephen Heller InChI-Trust Project Director steve@inchi-trust.org The main web sites for the IUPAC InChI

More information

DEPARTMENT: Chemistry

DEPARTMENT: Chemistry CODE CHEM 204 TITLE: Organic Chemistry II INSTITUTE: STEM DEPARTMENT: Chemistry COURSE DESCRIPTION: A continuation of CHEM-203, students will extend their studies into topics including aromatic hydrocarbons,

More information