Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan

Similar documents
SciFinder Content Update

Information Literacy for Undergraduate Chemistry Students

The shortest path to chemistry data and literature

CSD. Unlock value from crystal structure information in the CSD

Introducing the New SciFinder. Veli Pekka Hyttinen Regional Marketing Manager, Central and Eastern Europe Jasna April 1, 2014

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE

The National Chemical Database Service. Serin Dabb Executive Editor, Data

A Journey from Data to Knowledge

OF ALL THE CHEMISTRY RELATED SOFTWARE

Reaxys Managing Complexity

Reaxys (Beilstein) Assignment. Preliminary Report Due: October 13th Final Report Due: November 17th

Reaxys Medicinal Chemistry Fact Sheet

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

new interface and features

How to Create a Substance Answer Set

Internet Resource Guide for Undergraduate Organic Chemists

SciFinder Premier CAS solutions to explore all chemistry MethodsNow, PatentPak, ChemZent, SciFinder n

Reaxys (Beilstein) Assignment. Preliminary Report Due: October 1st Final Report Due: November 4th

Marvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier

Dr. LeGrande M. Slaughter Chemistry Building Rm. 307E Office phone: ; Tues, Thurs 11:00 am-12:20 pm, CHEM 331D

October 6 University Faculty of pharmacy Computer Aided Drug Design Unit

Reaxys (Beilstein) Assignment. Preliminary Report Due: October 2nd Final Report Due: November 4th

Reaxys Improved synthetic planning with Reaxys

Chemical Information Retrieval CAS & SciFinder Searching for Substances

Where do all those small molecules come from? What the data reveal

Searching Substances in Reaxys

DECEMBER 2014 REAXYS R201 ADVANCED STRUCTURE SEARCHING

In Silico Investigation of Off-Target Effects

Basic Techniques in Structure and Substructure

SciFinder introduction and a view behind the scene how this tool is made

Reaxys The Highlights

Springer Materials ABC. The world s largest resource for physical and chemical data in materials science. springer.com. Consult an Expert!

CHEMICAL ABSTRACTS SERVICE: THE GOLD STANDARD FOR CHEMICAL SUBSTANCE INFORMATION

Biologically Relevant Molecular Comparisons. Mark Mackey

QSAR Modeling of ErbB1 Inhibitors Using Genetic Algorithm-Based Regression

DiscoveryGate SM Version 1.4 Participant s Guide

Extending the scope of journal articles: Certifying and publishing experimental data

DRUG DISCOVERY TODAY ELN ELN. Chemistry. Biology. Known ligands. DBs. Generate chemistry ideas. Check chemical feasibility In-house.

Structure and Reaction querying in Reaxys

Developing CAS Products for Substructure Searching by Chemists. Linda Toler

NOVEMBER 2014 REAXYS R101 BASIC REACTION QUERIES

Reaxys Pipeline Pilot Components Installation and User Guide

Tautomerism in chemical information management systems

SpringerMaterials The Landolt-Börnstein Database

catena-poly[[[bis(cyclohexyldiphenylphosphine-»p)silver(i)]-μ-cyano-» 2 N:C-silver(I)-μ-cyano-» 2 C:N] dichloromethane solvate]

Thieme Chemistry E-Books

Search for Substance Data

SciFinder Scholar Guide to Getting Started

CHEMISTRY (CHEM) CHEM 5800 Principles Of Materials Chemistry. Tutorial in selected topics in materials chemistry. S/U grading only.

SciFinder Scholar Guide to Getting Started

DiscoveryGate. One place for the answers you need. Chulalongkorn University, Thailand

1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

A powerful site for all chemists CHOICE CRC Handbook of Chemistry and Physics

Department of Chemistry and Biochemistry Approved Learning Outcomes Approved May 2017 at Departmental Retreat

CHEMISTRY (CHE) CHE 104 General Descriptive Chemistry II 3

electronic reprint 5,12-Bis(4-tert-butylphenyl)-6,11-diphenylnaphthacene

The Cambridge Structural Database (CSD) a Vital Resource for Structural Chemistry and Biology Stephen Maginn, CCDC, Cambridge, UK

Crystallographic Databases: Using the Knowledge Available. Amy Sarjeant CCCW-16, Saint Mary s University, Halifax NS

CHEMISTRY (CHEM) CHEM 208. Introduction to Chemical Analysis II - SL

All measurement has a limit of precision and accuracy, and this must be taken into account when evaluating experimental results.

Receptor Based Drug Design (1)

Inorganic Chemistry. Taro Saito

Erfassung und Speicherung von Forschungsdaten im Fachbereich Chemie

Introduction to Spark

Solved and Unsolved Problems in Chemoinformatics

Scientific Integrity: A crystallographic perspective

Chemical Samples Recycling: The MDPI Samples Preservation and Exchange Project

ChemIndustry.com serves the investigative and sourcing needs of up to 350,000 site visitors each month.

Chemistry (CHEM) Chemistry (CHEM) 1

Stereoselective Synthesis of Ezetimibe via Cross-Metathesis of Homoallylalcohols and α- Methylidene-β-Lactams

Iowa State University Library Collection Development Policy. Chemistry

Quality Assurance Plan. March 30, Chemical Crystallography Laboratory


Chemistry Courses -1

electronic reprint (P)-Tetra-μ 3 -iodido-tetrakis[(cyclohexyldiphenylphosphine-»p)silver(i)] John F. Young and Glenn P. A. Yap

CHE 121 Fundamentals of General, Organic and Biological Chemistry I 2010 SYLLABUS AND COURSE OUTLINE

Evaluation of Chemistry Journals at IIT Kharagpur, India: Use and Citation Analysis

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Supplementary Material. An improved, gram-scale synthesis of protected 3-haloazetidines: Rapid diversified synthesis of azetidine-3-carboxylic acids

Supporting information. Double Reformatsky Reaction: Divergent Synthesis of δ-hydroxy-β-ketoesters

PASA - an Electronic-only Model for a Professional Society-owned Astronomy Journal

Programme Specification MSc in Cancer Chemistry

Chemistry Courses. Courses. Chemistry Courses 1

Chemical Databases: Encoding, Storage and Search of Chemical Structures

Application of Information Science and Technology in Chemical Research

HOW TO FIND CHEMICAL INFORMATION

User Guide for LeDock

Jimmy U. Franco, Marilyn M. Olmstead and Justin C. Hammons

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

JCICS Major Research Areas

Introduction to single crystal X-ray analysis VI. About CIFs Alerts and how to handle them

Information Retrieval: SciFinder

THIS IS NOT A PRESENTATION

CHEMISTRY 413 Spring 2014

Geospatial Services in Special Libraries: A Needs Assessment Perspective

Introduction to FBDD Fragment screening methods and library design

Contributions should be sent to the Editorial Board of the Russian language Journal (not Springer), at the following address:

Experiment 1 - Literature Searching

MSc Drug Design. Module Structure: (15 credits each) Lectures and Tutorials Assessment: 50% coursework, 50% unseen examination.

Cheminformatics platform for drug discovery application

Transcription:

Database Speaks Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan Email: liuu@chem.sinica.edu.tw 1

OUTLINES -- Personal experiences Publication types Secondary publication From primary to secondary Databases Types of chemical databases CAS registry Reaxys Inverted structures of database Location of exact reference 2

Publication Types Full-length papers Rapid communications Short communications Letters to the editor Case reports Technical or Laboratory notes Methods etc. Overlapping publications Duplicate submission Duplicate publication Acceptable secondary publication Manuscripts based on the same database 3

Acceptable Secondary Publication Secondary publication of material published in other journals or online may be justifiable and beneficial, especially when intended to disseminate important information to the widest possible audience (e.g., guidelines produced by government agencies and professional organizations in the same or a different language). Approval from the editors of both journals; secondary publication for a different group of readers; abbreviated version; secondary version citing the primary reference. 4

From Primary to Secondary Preparation of an article Clear communication and impact factor Structure of a scientific paper Peer review Copy right, etc. Footnotes etc. Society publisher vs professional publisher Abstracts vs Concentrates etc. 5

60+ Free Chemistry Databases http://depth-first.com/articles/2011/10/12/sixty-four-free-chemistry-databases/ The open Web offers a rich collection of diverse chemical data. It's of course likely that more services will be created and/or retired in the coming years. Although many of the old databases are no longer active, a number of them continue to run and even prosper. Examples follow 6

ZINC "... a free database of commercially-available compounds for virtual screening. ZINC contains over 13 million purchasable compounds in ready-to-dock, 3D formats." 7

emolecules discovers sources of chemical data by searching the Internet, and receives submissions from data providers such as chemical suppliers and academic research institutions. 8

Organic Syntheses "... Procedure is written in detail as compared to typical procedures in journals, and reaction and characterization data has been checked for reproducibility." 9

Chemical Databases A chemical database is a database specifically designed to store chemical information. This information is about chemical and crystal structures, spectra, reactions and syntheses, thermophysical data, etc. 10

Types of Chemical Databases Chemical structures are traditionally represented using bonds between atoms and drawn as 2D structural formulae. While these are ideal visual representations for the chemist, they are unsuitable for computational search and storage. Small molecules (also called ligands in drug design applications), are usually represented using lists of atoms and their connections. Large molecules such as proteins are however more compactly represented using the sequences of their amino acid building blocks.!!! Terabytes of physical memory!!! 11

Types of Chemical Databases Chemical Literature databases correlate structures or other chemical information to relevant references such as academic papers or patents. This type of database includes STN, Scifinder and Reaxys. Crystallographic databases: noted examples are Protein Data Bank and Cambridge Structural Database. NMR spectra databases often include FTIR and mass data. Reactions databases store products, intermediates and temporarily created molecules, and also reaction mechanisms. Thermophysical databases 12

CAS REGISTRY SM More than 95 million small molecules information, including the structures, names, predicted and experimental properties, tags, and spectra literature from 1957 up to 10 digits, divided by hyphens into 3 parts 15,000 new substances added /day Chemical Database numbers assigned in sequential order to unique, new substances identified by CAS scientists for inclusion 66,068,339 sequences 13

CAS Number CAS REGISTRY SM CAS Number, up to 10 digits long with format xxxxxxx-yy-z, is assigned to a compound as CAS registers a new compound. CAS Number [64-17-5] refers to ethanol, also to ethyl alcohol, ethyl hydrate, absolute alcohol, grain alcohol, hydroxyethane. D-glucose is also called dextrose and has CAS number [50-99-7]. L-glucose is the mirror image of D-glucose and has a CAS Number of [921-60-8]. Water has a CAS Number [7732-18-5]; the control 5 from (8 1 + 1 2 + 2 3 + 3 4 + 7 5 + 7 6) mod 10 = 105 mod 10 = 5. 14

Reaxys Databases Reaxys and Reaxys Medicinal Chemistry combine comprehensive databases of chemistry literature and data with a powerful interface. They return relevant extracted data and citations in optimal formats. Validate promising leads, investigate chemical synthesis possibilities, and understand relationships between chemicals and bioactivity data. Access over 16,000 periodicals with 500 million experimental facts Find structures, properties, synthesis possibilities, reactions and bioactivity data Get inorganic, organic and organometallic chemistry and bioactivity data through one interface Search from various intuitive options such as Ask Reaxys and Reaxys Tree 15

Inverted Structures of Database Sequential processing Sorting Nodes linkers Atoms bonds Parallel processing 16

Reaxys Reaction Search Search and Location of References 17

Reaxys Reaction Search Search and Location of References 18

Reaxys Reaction Search Search and Location of References 19

Search and Location of References Scifinder Reaction Search 20

Search and Location of References Scifinder Reaction Search 21

Search and Location of References Author search Journal search Limited by years 22

addenda and errata 23

Dr. Haohao Li, Professor of Business School; Xiaoqing He and Shuai Wang, Postgraduate students Business School 24

25

Search and location of references Journal impact factor is the ratio of number of citations in one year to papers published during the 2 preceding years For Acta Cryst. A, the 2009 calculation was as follows: Total number of citations in 2009 to items published in 2007 and 2008 (A) = 6091 Total number of papers published in 2007-2008 (B) = 122 Impact factor = A/B = 6091/122 = 49.9 At 49.9, the impact factor of Acta Cryst. A is the second highest for a scientific journal for 2009, with only CA: A Cancer Journal for Clinicians scoring higher. Acta Cryst. A covers diffraction physics and the theory of crystallographic structure determination by diffraction methods using X-rays, neutrons and electrons. 26

Search and location of references Thomson Reuters' Journal Citation Report for 2009: Acta Cryst. A: Foundations of Crystallography I.F. = 49.9 Primary cause is a single feature article by George Sheldrick, A short history of SHELX, published in a special issue to celebrate 60 years of journal and IUCr (Acta Cryst. (2008). A64, 112-122; doi:10.1107/s0108767307043930). The article has provided the crystallographic community with a citable reference when one or more of the programs, amongst the most widely used in structure determination, are employed in the course of a crystal structure study. 43,000 articles published in IUCr journals alone have referenced SHELX programs. The journal impact factor is normal at 2.0-2.5. 27

Thank you 28