Extending the scope of journal articles: Certifying and publishing experimental data. 237th ACS Meeting, 2009/03/25. Irina Sens (1), Jan Brase (1), Susanne Haak (2), Guido F. Herrmann (2). (1) German National Library of Science and Technology, (2) Thieme Chemistry
The Partners TIB and Thieme (1) > German National Library of Science and Technology (TIB) > Architecture, Chemistry, Computer Science, Mathematics, Physics, Engineering Technology > Financed by the Federal Government and all Federal States > Member of the Leibniz Association (Independent Research Organization) > Global supplier of scientific and technical information > All information types: text, numeric data, audio, video, etc. > All of this information is accessible to the customer > Target groups: Science and Industry
The Partners TIB and Thieme (2) > Thieme Chemistry > Part of the Thieme publishing group, based in Stuttgart (Germany) > Has published highly evaluated information about synthetic and general chemistry for professional chemists and advanced students since 1909 > Products: SYNLETT, SYNTHESIS and SYNFACTS, Science of Synthesis, RÖMPP, Pharmaceutical Substances, as well as a selected range of monographs and textbooks > The entire content is available in electronic format and is used by chemists worldwide in industry and academia > Close cooperation with scientific societies (International Union of Pure and Applied Chemistry, German Chemical Society)
Background - Data in Chemistry (1) Typical research data are created: > Using the vast array of analytical techniques (GC, HPLC, etc.) > Employing spectroscopic methods (NMR, MS, UV/VIS, IR, X-ray, etc.) > As a result of theoretical calculations (quantum mechanics, simulation of spectra, etc.) > Or by using the various high-throughput technologies in medicinal chemistry
Background - Data in Chemistry (2) > Journal Angewandte Chemie: > Roughly 1,700 articles in 2006 > On average 40 spectra per article > Last 25 years (at that rate): roughly 1,700 × 40 × 25 = 1,700,000 spectra = 1,700,000 research data sets But:
Problem: The research trajectory [Diagram: Data are analysed, displayed and interpreted and become Information; Information is published and becomes Knowledge. The Publication is accessible and traceable, but the Data are lost!]
What do we need? Answer by NSF: "The establishment of strong, reciprocal, international, interagency and public-private partnerships is essential to ensure all stakeholders are engaged in the stewardship of valuable data assets." Cyberinfrastructure Vision for 21st Century Discovery. National Science Foundation, Cyberinfrastructure Council, March 2007
What do we need - A possible solution > Creation of new and strengthening of existing data centers. Responsible for: > Quality assurance > Storage of the content and accessibility > Creation of metadata > Long-term archiving > Global access to data sets and their metadata through existing catalogues/portals/platforms: TIB stores the metadata and keeps it searchable via its portal. > Use of persistent identifiers also for data (DOI = Digital Object Identifier) TIB registers research data worldwide from a scientific, technical or medical background.
Data and article > The DOI system offers an easy way to connect the article with the underlying data and vice versa: Example
Example [Screenshot: research data set record, with links for access to the article and access to the data]
Data access
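The link between article and data in both directions can be sketched in a few lines of Python. This is only an illustration, not the TIB or Thieme implementation: the DOI names, the `links` table and the helper functions are hypothetical, and the only external fact assumed is that the public DOI proxy at `https://doi.org/` resolves a DOI name appended to it.

```python
from urllib.parse import quote

DOI_RESOLVER = "https://doi.org/"  # public DOI proxy


def doi_to_url(doi: str) -> str:
    """Turn a DOI name into a URL that the DOI proxy can resolve."""
    # Keep the "/" between prefix and suffix; escape anything unsafe in a URL.
    return DOI_RESOLVER + quote(doi.strip(), safe="/")


# Hypothetical DOI names, for illustration only.
ARTICLE_DOI = "10.1234/example-article.2009"
DATA_DOI = "10.1234/example-data.2009.1"

# Each record names its counterpart, so a reader can move in either direction.
links = {
    ARTICLE_DOI: {"kind": "article", "related": DATA_DOI},
    DATA_DOI: {"kind": "research data", "related": ARTICLE_DOI},
}


def related_url(doi: str) -> str:
    """Return the resolvable URL of the object linked to the given DOI name."""
    return doi_to_url(links[doi]["related"])


if __name__ == "__main__":
    print("From the article to the data:", related_url(ARTICLE_DOI))
    print("From the data to the article:", related_url(DATA_DOI))
```

Because both objects carry a persistent identifier, neither link depends on where the article or the data set is currently hosted.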
What are the values and key benefits? (1) > Citability of research data > High visibility of the data > Easy re-use and verification of the data sets > Increasing quality of published papers
What are the values and key benefits? (2) > Supporting the Brussels Declaration on STM publishing: "Raw research data should be made freely available to all researchers. Publishers encourage the public posting of the raw data outputs of research. Sets or sub-sets of data that are submitted with a paper to a journal should wherever possible be made freely accessible to other scholars." > Avoiding duplications > Saving costs > Motivation for new research
The role of the TIB > Since 2005, TIB has been established as a non-commercial DOI registration agency for scientific research data, as a member of the International DOI Foundation (IDF) > Registered more than 600,000 objects > Since March 2, this has been enlarged to a European consortium: "A group of Europe's leading research libraries and technical information providers have established a partnership to improve access to research data on the internet. The goal of this cooperation is to establish a not-for-profit agency that enables organisations to register research datasets and assign persistent identifiers to them, so that research datasets can be handled as independent, citable, unique scientific objects."
Workflow with Thieme [Diagram; actors: Author, Thieme electronic journal platform, Data Center; objects: Article, Research Data] > In the article, the research data are cited as reference items with their DOI names > The data are freely accessible in machine-readable format > A DOI is assigned to the data
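The workflow can be made concrete with a small Python sketch. It is a toy model under stated assumptions, not the actual Thieme or TIB system: the classes `DataCenter` and `Article`, the DOI prefix `10.1234`, the suffix scheme and the citation format are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from itertools import count


@dataclass
class DataCenter:
    """Toy data center: stores data sets and records the DOI name of each."""
    prefix: str = "10.1234"  # hypothetical DOI prefix
    _holdings: dict = field(default_factory=dict)
    _serial: count = field(default_factory=lambda: count(1))

    def deposit(self, title: str, payload: bytes) -> str:
        """Store a data set and return the DOI name assigned to it."""
        doi = f"{self.prefix}/data.{next(self._serial)}"  # hypothetical suffix scheme
        self._holdings[doi] = (title, payload)
        return doi

    def fetch(self, doi: str) -> bytes:
        """Return the stored data (freely accessible, machine-readable)."""
        return self._holdings[doi][1]


@dataclass
class Article:
    """Toy journal article with a reference list."""
    title: str
    references: list = field(default_factory=list)

    def cite_data(self, creator: str, data_title: str, doi: str) -> None:
        """Cite a data set as a reference item carrying its DOI name."""
        self.references.append(f"{creator}. {data_title}. doi:{doi}")


if __name__ == "__main__":
    center = DataCenter()
    doi = center.deposit("NMR spectrum of compound 1", b"raw spectrum bytes")
    article = Article("A hypothetical synthesis paper")
    article.cite_data("A. Author", "NMR spectrum of compound 1", doi)
    print(article.references[0])
```

The point of the sketch is the order of steps: the data set receives its DOI name on deposit, and only then can the article cite it like any other reference.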
The research trajectory (Data are analysed, displayed and interpreted and become Information; Information is published and becomes Knowledge; the Publication is accessible and traceable, but the Data are lost!) becomes
An ideal cycle of research [Cycle diagram; labels: Research, Experiment, Accumulation, Data (analysed, displayed, interpreted), Publication (DOI), Information, Peer-Review, Journal article, Publication (DOI), linking, Catalogue, Data and Journal archive, Publishers, Inspiration]
Vision "Our strategic vision is a digital scientific data universe [...] that digital data realize their full potential as catalysts for progress in our global information society." Harnessing the Power of Digital Data for Science and Society. Report of the Interagency Working Group on Digital Data to the Committee on Science of the National Science and Technology Council (USA), January 2009
Thank you for your attention!