Chemistry Informatics in Academic Laboratories: Lessons Learned

Similar documents
Integrated Cheminformatics to Guide Drug Discovery

No. of Days. Building 3D cities Using Esri City Engine ,859. Creating & Analyzing Surfaces Using ArcGIS Spatial Analyst 1 7 3,139

Why GIS & Why Internet GIS?

Lesson 6: Accuracy Assessment

Reaxys Managing Complexity

DEKDIV: A Linked-Data-Driven Web Portal for Learning Analytics Data Enrichment, Interactive Visualization, and Knowledge Discovery

Supporting Information. Thermodynamics of Bisphosphonate Binding to Human Bone: A Two-Site Model

CARTOGRAPHY in a Web World

Building innovative drug discovery alliances. Just in KNIME: Successful Process Driven Drug Discovery

CHAPTER 22 GEOGRAPHIC INFORMATION SYSTEMS

CS4445 Data Mining and Knowledge Discovery in Databases. B Term 2014 Solutions Exam 2 - December 15, 2014

Enabling ENVI. ArcGIS for Server

Ákos Tarcsay CHEMAXON SOLUTIONS

No. of Days. ArcGIS 3: Performing Analysis ,431. Building 3D cities Using Esri City Engine ,859

No. of Days. ArcGIS Pro for GIS Professionals ,431. Building 3D cities Using Esri City Engine ,859

Product Guide. Thermo Scientific Cellomics HCS Solution

Large scale classification of chemical reactions from patent data

WEB-BASED SPATIAL DECISION SUPPORT: TECHNICAL FOUNDATIONS AND APPLICATIONS

Management of Geological Information for Mining Sector Development and Investment Attraction Examples from Uganda and Tanzania

Introduction to Portal for ArcGIS. Hao LEE November 12, 2015

Web GIS Administration: Tips and Tricks

JCICS Major Research Areas

ArcGIS Enterprise: Administration Workflows STUDENT EDITION

DRUG DISCOVERY TODAY ELN ELN. Chemistry. Biology. Known ligands. DBs. Generate chemistry ideas. Check chemical feasibility In-house.

Introduction to Portal for ArcGIS

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Geog 469 GIS Workshop. Managing Enterprise GIS Geodatabases

Esri and GIS Education

A Model of GIS Interoperability Based on JavaRMI

Geo-enabling a Transactional Real Estate Management System A case study from the Minnesota Dept. of Transportation

In Silico Investigation of Off-Target Effects

Performance Evaluation

Structure and Reaction querying in Reaxys

Greater Portland Pulse: an Evolution

Available online at I-SEEC Proceeding - Science and Engineering (2013)

Bentley Map Advancing GIS for the World s Infrastructure

A Service Architecture for Processing Big Earth Data in the Cloud with Geospatial Analytics and Machine Learning

Paper UC1351. Conference: User Conference Date: 08/10/2006 Time: 8:30am-9:45am Room: Room 23-B (SDCC)

Pharmaceutical e-learning at the University of Innsbruck

Land Board, NW Services and SDI Tambet Tiits, FRICS

Develop a Spatial Information Management System: A Case Study for Faculty of Agriculture, University of Ruhuna, Sri Lanka

Web-based Interactive Landform Simulation Model (WILSIM)

ARCHAVE : A Virtual Reality Interface for Archaeological 3D GIS. Master s Thesis Proposal by Daniel Acevedo Feliz

Inverse Functions. Say Thanks to the Authors Click (No sign in required)

Marvin 5.4 A new generation of structure indexing at Elsevier. Dr. Michael Maier, Dr. Heike Nau, Elsevier

MetConsole AWOS. (Automated Weather Observation System) Make the most of your energy SM

SocViz: Visualization of Facebook Data

GIS Geographical Information Systems. GIS Management

CSD. Unlock value from crystal structure information in the CSD

PaikkaOppi - a Virtual Learning Environment on Geographic Information for Upper Secondary School

Presentation of the Cooperation Project goals. Nicola Ferrè

Lesson 16: Technology Trends and Research

Introduction to Chemoinformatics

Programme Specification MSc in Cancer Chemistry

Evaluating Physical, Chemical, and Biological Impacts from the Savannah Harbor Expansion Project Cooperative Agreement Number W912HZ

for Effective Land Administration

Portal for ArcGIS: An Introduction

Meridian Environmental Technology, Inc.

University of Illinois at Urbana-Champaign. Midterm Examination

4D information management system for road maintenance using GIS

History of the Atom. Say Thanks to the Authors Click (No sign in required)

Spatial Data Availability Energizes Florida s Citizens

CWMS Modeling for Real-Time Water Management

The MDL Discovery Framework: Data and Application Integration in the Life Sciences

Errors, and What to Do. CS 188: Artificial Intelligence Fall What to Do About Errors. Later On. Some (Simplified) Biology

CONCEPTS OF GENETICS BY ROBERT BROOKER DOWNLOAD EBOOK : CONCEPTS OF GENETICS BY ROBERT BROOKER PDF

Ministry of Health and Long-Term Care Geographic Information System (GIS) Strategy An Overview of the Strategy Implementation Plan November 2009

CS 188: Artificial Intelligence Fall 2011

Kalexsyn Overview Kalexsyn, Inc Campus Drive Kalamazoo, MI Phone: (269) Fax: (269)

Qualitative Spatio-Temporal Reasoning & Spatial Database Design

GIS for Crime Analysis. Building Better Analysis Capabilities with the ArcGIS Platform

THIS IS NOT A PRESENTATION

Oregon Department of Transportation. Geographic Information Systems Strategic Plan

Design and implementation of a new meteorology geographic information system

Integration of ArcFM UT with SCADA, SAP, MAXIMO and Network Calculation

MULTIVARIABLE CALCULUS BRIGGS PDF

Free and Open Source Software for Cadastre and Land Registration : A Hidden Treasure? Gertrude Pieper Espada. Overview

The Geodetic Infrastructure Management Via Web-Based Mapping Technology in Morocco

CARTOGRAPHY in a Web World

SPACE TIME ANALYSIS IN AN ENTERPRISE GIS Ritesh Agrawal, University of Illinois Urbana-Champaign

EasySDM: A Spatial Data Mining Platform

Metropolitan Wi-Fi Research Network at the Los Angeles State Historic Park

Agenda. Status of GI activities. NGII Framework. SDI from the national policy perspective

IMS4 ARWIS. Airport Runway Weather Information System. Real-time data, forecasts and early warnings

Drug Informatics for Chemical Genomics...

Incorporating ArcGIS Pro in your Curriculum

Practical teaching of GIS at University of Liège

Test and Evaluation of an Electronic Database Selection Expert System

Performance Evaluation

Graduate Education in Institute of Chemistry, Chinese Academy of Sciences

CADASTER & MC ITN ECO

POSITION DESCRIPTION. Position Title: Geographic Information Systems (GIS) Coordinator Department: Engineering

Solving Absolute Value Equations and Inequalities

Discovering The World Of Chemistry

Reaxys Medicinal Chemistry Fact Sheet

An intelligent client application for on-line astronomical information

Real-time Geographic Information System (GIS) for Monitoring the Area of Potential Water Level Using Rule Based System

DP Project Development Pvt. Ltd.

Your Virtual Workforce. On Demand. Worldwide. COMPANY PRESENTATION. clickworker GmbH 2017

Rapid Application Development using InforSense Open Workflow and Daylight Technologies Deliver Discovery Value

Transcription:

Chemistry Informatics in Academic Laboratories: Lessons Learned Michael Hudock Center for Biophysics & Computational Biology University of Illinois at Urbana-Champaign

My Background Ph.D. candidate, Biophysics & Computational Biology, University of Illinois at Urbana- Champaign. Associate Research Scientist, Discovery Technologies group at Bristol-Myers Squibb prior to graduate school. Strong interest in the interface of computers and chemistry, graduate work in computational modeling with chemoinformatics.

Talk Outline Chemoinformatics System Our Basic Requirements Registration / Results / Reports / Research Build vs. Buy Infrastructure / Cost / Maintenance / Development Results & Lessons Learned Short-term impact / long-term impact Future Directions New, advanced SAR modules

Our Laboratory

Our Basic Requirements Registration ~50 assays Results Reports Research Y= c + a b + c d +

A Decision Point Commercial Solution "Out of the box" functionality Restrictive Infrastructure Requirements Expensive, Perhaps Recurring Costs Completely Customizable? Programming Expertise Testing & Deployment Data Backup Custom Solution Decision to develop a custom solution that would meet, at first, our most basic requirements, with capability to expand at a later date.

Client-Server Architecture Multiple client platforms supported All code resides on the server Data all stored in one location

Specific Implementation Modular architecture allows new components to be quickly and easily added.

Database Architecture

Compound Registration

Input Results

Structures & Data United Using ChemAxon Marvin Java Applet

Retrieve Data Easily

Real-Time Data Analysis New analysis tools can be added quickly in response to user requests

Finding Patterns in a Few Clicks === Stratified cross-validation === === Summary === Correctly Classified Instances 23 88.4615 % Incorrectly Classified Instances 3 11.5385 % Kappa statistic 0.7692 Mean absolute error 0.1839 Root mean squared error 0.3543 Relative absolute error 36.4346 % Root relative squared error 70.1844 % Total Number of Instances 26 Provide SAR tools to all users, help detect trends. === Detailed Accuracy By Class === TP Rate FP Rate Precision Recall F-Measure Class 0.923 0.154 0.857 0.923 0.889 cluster1 0.846 0.077 0.917 0.846 0.88 cluster2 === Confusion Matrix === a b <-- classified as 12 1 a = cluster1 2 11 b = cluster2

Additional Modules Easily Added Additional modules added over time as needed

Initial Impact Initial Development: 1 FTE, 1 month Updates & New Code: 1 FTE, 3 days/month Intuitive interface, short end user training Pre- Chemoinformatics Chemoinformatics What is the structure of compound 700? 20 sec. 20 min. Correlate assay A with assay B 5 sec. 30 min. for compounds 65% similar to cpd 700 10 sec. 45 min. or instead, with assays B N 15 sec. 5 hours Will addition of CH 2 to 352 decrease activity? 10 sec. 25 min. Is assay A activity related to TPSA? 5 sec. 20 min. An informatics solution, commercial or custom, can have large positive impact on productivity - even for relatively small amounts of data.

Longer-Term Impact >1,000 unique compounds, >11,000 fittings Used daily by group members (~30) Data easily shared with entire group Trends now routinely identified publications Mindset: paper to electronic

How can I do this? Identify and implement basic requirements first, don t go overboard Programming typically requires functional understanding of databases and programming language such as PHP*. CS students, temporary help or computer savvy graduate students might be able help Use third-party components when appropriate, e.g. for plotting, displaying structures System can evolve over time, with sophisticated capabilities added with additional experience *Good Books: PHP and MySQL Web Development, Welling & Thompson, 2007. Web Database Applications with PHP & MySQL, Williams & Lane, 2004.

Acknowledgements CINF Division for the invitation to present National Institutes of Health Professor Eric Oldfield and members of the Oldfield Research Group, Department of Chemistry, University of Illinois at Urbana-Champaign Professor Eric Oldfield Yongcheng Song Yonghui Zhang Fenglin Yin Kilannin Krysiak Sujoy Mukherjee Dushyant Mukkamala Rong Cao Kyle Bergan