Where do all those small molecules come from? What the data reveal

Similar documents
SciFinder Content Update

SciFinder introduction and a view behind the scene how this tool is made

Demonstration: Searching patents based on chemical structure using SciFinder

SciFinder Premier CAS solutions to explore all chemistry MethodsNow, PatentPak, ChemZent, SciFinder n

Introducing the New SciFinder. Veli Pekka Hyttinen Regional Marketing Manager, Central and Eastern Europe Jasna April 1, 2014

SciFinder Essential Content. Proven Results.

Database Speaks. Ling-Kang Liu ( 劉陵崗 ) Institute of Chemistry, Academia Sinica Nangang, Taipei 115, Taiwan

SciFinder The choice for chemistry research

Reaxys Improved synthetic planning with Reaxys

new interface and features

The shortest path to chemistry data and literature

Information Retrieval: SciFinder

CHEMICAL ABSTRACTS SERVICE: THE GOLD STANDARD FOR CHEMICAL SUBSTANCE INFORMATION

Reaxys (Beilstein) Assignment. Preliminary Report Due: October 2nd Final Report Due: November 4th

SciFinder Scholar Guide to Getting Started

SciFinder Scholar Guide to Getting Started

Reaxys (Beilstein) Assignment. Preliminary Report Due: October 1st Final Report Due: November 4th

Issue of the Name of the Sea of Japan. Overview of study of maps possessed by the Bibliotheque Nationale de France

Chemical Databases: Encoding, Storage and Search of Chemical Structures

Reaxys (Beilstein) Assignment. Preliminary Report Due: October 13th Final Report Due: November 17th

How to Create a Substance Answer Set

Thieme Chemistry E-Books

October 6 University Faculty of pharmacy Computer Aided Drug Design Unit

CHE 200 INFORMATION RESOURCES LIBRARY PRESENTATION

Finding Polymer Information Part 2: Advanced. Dr. Thomas Haubenreich

Chemical Abstracts Service. Tina Jayroe. University of Denver

FORENSIC TOXICOLOGY SCREENING APPLICATION SOLUTION

Developing CAS Products for Substructure Searching by Chemists. Linda Toler

Perseverance. Experimentation. Knowledge.

ACS SHANGHAI INTERNATIONAL CHEMICAL SCIENCES CHAPTER 2014 ANNUAL REPORT

EMPIRICAL VS. RATIONAL METHODS OF DISCOVERING NEW DRUGS

Information Literacy for Undergraduate Chemistry Students

HOW TO FIND CHEMICAL INFORMATION

Reconciling the Protection of GIs and Trademarks. The Point of View of GI Producers

2017 Scientific Ocean Drilling Bibliographic Database Report

The Case for Use Cases

WIPO s Chemical Search Function

Basic Techniques in Structure and Substructure

Introducing a Bioinformatics Similarity Search Solution

HOW TO FIND CHEMICAL INFORMATION

Continental Divide National Scenic Trail GIS Program

For Excellence in Organic Chemistry

Internet Resource Guide for Undergraduate Organic Chemists

InChI keys as standard global identifiers in chemistry web services. Russ Hillard ACS, Salt Lake City March 2009

Communicating Chemistry:

Organic Spectroscopy Workbook

DiscoveryGate. One place for the answers you need. Chulalongkorn University, Thailand

China Chemical Regulations Overview

Overview. Database Overview Chart Databases. And now, a Few Words About Searching. How Database Content is Delivered

ADMINISTRATION OF GEOGRAPHICAL NAMES IN URUGUAY

ASTM International Annual Business Meeting

Receptor Based Drug Design (1)

Economic and Social Council 2 July 2015

Writing Patent Specifications

Ontology Summit Framing the Conversation: Ontologies within Semantic Interoperability Ecosystems

(For Tyre) Additives & Chemicals Technical Group 2

The Fundamentals of Moisture Calibration

Advancing Green Chemistry Practices in Business

Information Retrieval: SciFinder

Application and Development of Geographical Indication Protection in China

PIOTR GOLKIEWICZ LIFE SCIENCES SOLUTIONS CONSULTANT CENTRAL-EASTERN EUROPE

Agenda Item B /20 Strategic Budget Development Phase I Preliminary Recommendations

Chemical Samples Recycling: The MDPI Samples Preservation and Exchange Project

Concept Formulation of Geospatial Infrastructure. Hidenori FUJIMURA*

ACS PUBLICATIONS. Over a Century of Essential Chemistry on Your Desktop

INDIAN RIVER STATE COLLEGE INTRODUCTORY CHEMISTRY FALL 2016 PRUITT CAMPUS

Developments in Science and Technology: Autonomous Systems and Artificial Intelligence in Chemistry

SciFinder (Web version)

Chemical Safety as a Core ACS Value: Report on the 2018 Safety Summit

INTERNATIONAL BULLETIN ON ATOMIC AND MOLECULAR FOR FUSION

FROM MOLECULAR FORMULAS TO MARKUSH STRUCTURES

Central Valley School District Social Studies Curriculum Map Grade 7. August - September

Implementation Status & Results Indonesia Improving Governance for Sustainable Indigenous Community Livelihoods in Forested Areas (P130632)

Topographic Mapping in Australia: The Future State

Reaxys Pipeline Pilot Components Installation and User Guide

Searching Substances in Reaxys

Application of Information Science and Technology in Chemical Research

You are Building Your Organization s Geographic Knowledge

Extraction of structural information from ChemDraw CDX files: easy, or an underestimated, difficult challenge?

Ministry of Health and Long-Term Care Geographic Information System (GIS) Strategy An Overview of the Strategy Implementation Plan November 2009

Slovenian Chemical Society. Venčeslav Kaučič

Chemical Information Retrieval CAS & SciFinder Searching for Substances

Structure-based approaches to the indexing and retrieval of patent chemistry. Tim Miller Head of Research May 2010

U.S. Army Topographic Engineering Center (TEC) Alexandria, VA. Army Geo-Referenced PDF (GeoPDF) Project

Identifying Disinfection Byproducts in Treated Water

Teaching GIS Technology at UW-Superior. Volume 9, Number 8: May 23, 2003

IN QUALITATIVE ANALYSIS,

CHAPTER 22 GEOGRAPHIC INFORMATION SYSTEMS

ATheme. Capital Connections. Claudia Crump, Workshop Consultant Indiana University, Southeast

RSC Analytical Division Strategy

The rest of topic 11 INTRODUCTION TO ORGANIC SPECTROSCOPY

Gravity Analysis of Regional Economic Interdependence: In case of Japan

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

1. Omit Human and Physical Geography electives (6 credits) 2. Add GEOG 677:Internet GIS (3 credits) 3. Add 3 credits to GEOG 797: Final Project

Government GIS and its Application for Decision Support

7 Product-related Environmental Activities

Kalexsyn Overview Kalexsyn, Inc Campus Drive Kalamazoo, MI Phone: (269) Fax: (269)

Increasing Speed of UHPLC-MS Analysis Using Single-stage Orbitrap Mass Spectrometer

DOWNLOAD OR READ : THE HANDBOOK OF ORGANIC COMPOUNDS PDF EBOOK EPUB MOBI

Applying the Semantic Web to Computational Chemistry

Transcription:

Where do all those small molecules come from? What the data reveal Paul Peters Director, EMEA Sales CAS SIIL, Netherlands ICIC, Vienna, Austria October 27 th, 2010

CAS is a division of the American Chemical Society ACS Vision Improving people s lives through the transforming power of chemistry 2

CAS supports the mission of the ACS ACS Mission To advance the broader chemistry enterprise and its practitioners for the benefit of Earth and its people. CAS Commitment To support the ACS mission by providing secure access to high-quality, comprehensive chemical information. 3

Three major services provide access to CAS database information October 28, 2010 4

CAS mission drives development of the CAS databases 5

Growth in published chemistry has stayed strong in the last decade Published articles and patents in chemical science by CAS 2003-2010 1400000 1200000 1000000 800000 600000 400000 200000 0 2003 2004 2005 2006 2007 2008 2009 2010 Projection 6

Worldwide patenting of new chemical research continues to accelerate CAS Patents Covered each Year 1,200,000 1,000,000 800,000 600,000 400,000 200,000 0 1972 1976 1980 1984 1988 1992 1996 2000 2004 2008 7

CAS analyzes global chemical information, including publications from Asia Each year, CAS covers 10,000 serial journal titles and 61 patent authorities worldwide 2,100 Asian serial journal titles All major Asian patent authorities, including offices in People s Republic of China South Korea Japan India October 28, 2010 Chinese, Japanese, and Korean language publications account for 33% of new CAplus database records 8

For complex chemistry, CAS chemists classify substance information and verify graphical processes and structures 1. Review reaction and structure 2. Create registration record 9

Accurate and timely, CAS substance analysis for CAS REGISTRY ensures scientists can find information 1. Scientist asks structure question 3. Finds the PCT patent application 2. Finds 433 compounds and 3 publications 10

CAS chemists interpret when compounds are described in terms other than singular structures or names 11

From published patent to completed indexing in SciFinder : 15 days A typical chemistry patent: PCT application A new antibacterial 250 pages 24 claims 12

From published patent to completed indexing in SciFinder: 15 days Dr. Mark Willi from CAS completes substance indexing 15 days after WIPO publishes the patent CAS chemist analysis of single PCT application 917 indexed compounds from Examples and Claims 576 new compounds added to CAS REGISTRY 613 single-step reactions 5,394 reactions with multiple steps 1,029 reaction participants MARPAT Markush structure with 2,119 substituent definitions 13

Challenges in substance and reaction indexing show the value of intellectual substance identification 10 R=Me 2 R=H 3 R=Me OMe Scheme 1. Reagents and conditions: (a) HCO 2 Et, NaH; (b) H 2 NCH 2 CN, NaOAc, 80%; (c) ethyl chloroformate, DBN, 0 C, then DBN, 0 C to rt; (d) Na 2 C0 3> MeOH, 71%; (e) HCO 2 H, reflux, 75%; (0 POC1 3, 100 C; (g) (MeO) 2 SO 2, K 2 CO 3, 75%; (h) hydrazine hydrate, EtOH, 80 C, 82%; (i) pyridine-4- carboxaldehyde, pyrrolidine (cat.), EtOH, 80 C, 80%. Since products of step b, d, e, and h were characterized by their yield, CAS policy demands these are to be registered and indexed for CASREACT 14

CAS specialists in many fields of chemistry interpret author terminology to register compounds October 28, 2010 Author identified this compound only as D4GlcUA-GlcNAc- (GlcUA-GlcNAc)5-PA 15

Patents regularly describe substances in ambiguous ways: In WO 2007089907 this desired product is fully characterized 16

(Relatively) new substance classes can be registered Metal-organic frameworks show great potential for capture of H 2 or CO 2 or in other gas separation processes 17

By the end of 2010, CAS REGISTRY will contain more than 56 million compounds Total Small Molecules Registered Compounds (millions) 60 50 40 30 20 10 0 CAS REGISTRY CAS Registry 65 69 73 77 81 85 89 93 97 01 05 09 18

CAS REGISTRY substances are enriched with spectra, numeric properties, tags, and published sources Spectra More than 42M calculated NMR spectra (H1, C13, hetero), with 26M added in 2009 More than 700,000 experimental spectra (MS, NMR, IR, Raman), with another 300,000 newly acquired MS, IR, and NMR to be added in 2010 Numeric More than 4.5M experimental property values (m.p., b.p., optical rotary power, etc.) 7.5M data tags linked to indexed documents 2.68B calculated bio- evaluation metrics (bio-concentration, Log P, Lipinski, etc.) 19

CAS continues to register new small molecules in significant numbers CAS REGISTRY Growth, 2003-2010 Millions 60 50 40 30 20 10 0 51.3 56.0 41.5 33.3 30.0 22.3 25.0 27.0 2003 2004 2005 2006 2007 2008 2009 2010 Projection 20

Patenting has increasingly "monetized" new chemical information October 28, 2010 Percentage of New Compounds added to CAS REGISTRY from Patents Percentage of total % *CA Database annual average is 23% patents 21

CAS has indexed more than 10.1M exemplified prophetic substances from patent documents since 1993 22

CHEMCATS continues to grow and remains a source of new small molecules Number of Catalog Products and Unique Substances in the CAS CHEMCATS Database 23

Chemical substances from web-based sources provide a moderate addition to the small molecules in REGISTRY 1.6M substances have been captured from Internet substance collections (They must be from a reputable source and contain characterizing data) The following chart illustrates some of the larger collections: 400,000 350,000 300,000 250,000 200,000 150,000 100,000 50,000 0 ZINC ChemSpider ChemDB Broad Inst Ambinter NIST Mass Spec NCI 3D 24

Conclusions: What the data reveal about small molecules Information professionals and scientists from around the world rely on CAS to provide the most authoritative, current, and complete collection of substances Substances are mainly from the journals and patents covered by CAS databases and indexed by CAS scientists Intellectual processing of substance identification provides superior content Unique substances are found in chemical catalogs and chemical libraries as long as there is proof they exist Internet sources provide some complementary pointers to disclosed substance information October 28, 2010 25