Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

Similar documents
The Schrödinger KNIME extensions

Integrated Cheminformatics to Guide Drug Discovery

Ligand Scout Tutorials

Introducing a Bioinformatics Similarity Search Solution

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

Computational chemical biology to address non-traditional drug targets. John Karanicolas

The Schrödinger KNIME extensions

BMD645. Integration of Omics

Contents 1 Open-Source Tools, Techniques, and Data in Chemoinformatics

Networks & pathways. Hedi Peterson MTAT Bioinformatics

The Schrödinger KNIME extensions

Cheminformatics Role in Pharmaceutical Industry. Randal Chen Ph.D. Abbott Laboratories Aug. 23, 2004 ACS

Cheminformatics analysis and learning in a data pipelining environment

In silico pharmacology for drug discovery

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Chemical Data Retrieval and Management

Introduction to Bioinformatics

Next Generation Computational Chemistry Tools to Predict Toxicity of CWAs

User Guide for LeDock

Receptor Based Drug Design (1)

ATLAS of Biochemistry

Retrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a

CSCE555 Bioinformatics. Protein Function Annotation

Drug Informatics for Chemical Genomics...

Machine learning for ligand-based virtual screening and chemogenomics!

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Using AutoDock for Virtual Screening

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Docking. GBCB 5874: Problem Solving in GBCB

The PhilOEsophy. There are only two fundamental molecular descriptors

Biologically Relevant Molecular Comparisons. Mark Mackey

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

Practical QSAR and Library Design: Advanced tools for research teams

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Mechanistic insight into inhibition of two-component system signaling

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

MM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors

PubChem data extraction and integration using Instant JChem. Oleg Ursu Cristian Bologa Tudor I. Oprea Division of Biocomputing

proteins Comparison of structure-based and threading-based approaches to protein functional annotation Michal Brylinski, and Jeffrey Skolnick*

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Structure to Function. Molecular Bioinformatics, X3, 2006

COMPARISON OF SIMILARITY METHOD TO IMPROVE RETRIEVAL PERFORMANCE FOR CHEMICAL DATA

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

Cheminformatics platform for drug discovery application

Mathangi Thiagarajan Rice Genome Annotation Workshop May 23rd, 2007

PROVIDING CHEMINFORMATICS SOLUTIONS TO SUPPORT DRUG DISCOVERY DECISIONS

MSc Drug Design. Module Structure: (15 credits each) Lectures and Tutorials Assessment: 50% coursework, 50% unseen examination.

Lecture Notes for Fall Network Modeling. Ernest Fraenkel

Structural Bioinformatics (C3210) Molecular Docking

Introduction. OntoChem

OpenDiscovery: Automated Docking of Ligands to Proteins and Molecular Simulation

PDB : 1ZTB Ligand : ZINC

Preparing a PDB File

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Application integration: Providing coherent drug discovery solutions

Progress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery

CLRG Biocreative V

Solved and Unsolved Problems in Chemoinformatics

Structure Investigation of Fam20C, a Golgi Casein Kinase

Molecular Dynamics Graphical Visualization 3-D QSAR Pharmacophore QSAR, COMBINE, Scoring Functions, Homology Modeling,..

Hit Finding and Optimization Using BLAZE & FORGE

Gene Ontology and overrepresentation analysis

De Novo molecular design with Deep Reinforcement Learning

Capturing Chemistry. What you see is what you get In the world of mechanism and chemical transformations

Statistical Clustering of Vesicle Patterns Practical Aspects of the Analysis of Large Datasets with R

JCICS Major Research Areas

Studying the effect of noise on Laplacian-modified Bayesian Analysis and Tanimoto Similarity

Supplementary Material

The Changing Requirements for Informatics Systems During the Growth of a Collaborative Drug Discovery Service Company. Sally Rose BioFocus plc

Docking with Water in the Binding Site using GOLD

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

Detection of Protein Binding Sites II

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Ákos Tarcsay CHEMAXON SOLUTIONS

Integration of functional genomics data

DOCKING TUTORIAL. A. The docking Workflow

Virtual Screening: How Are We Doing?

Development of Pharmacophore Model for Indeno[1,2-b]indoles as Human Protein Kinase CK2 Inhibitors and Database Mining

BIOINFORMATICS LAB AP BIOLOGY

Handling Human Interpreted Analytical Data. Workflows for Pharmaceutical R&D. Presented by Peter Russell

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Using Web Technologies for Integrative Drug Discovery

Identifying Interaction Hot Spots with SuperStar

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Chemogenomic: Approaches to Rational Drug Design. Jonas Skjødt Møller

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

MassHunter Software Overview

Ultra High Throughput Screening using THINK on the Internet

Genome Annotation. Qi Sun Bioinformatics Facility Cornell University

ALL LECTURES IN SB Introduction

Computational Biology From The Perspective Of A Physical Scientist

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Ignasi Belda, PhD CEO. HPC Advisory Council Spain Conference 2015

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

CS612 - Algorithms in Bioinformatics

Binding Response: A Descriptor for Selecting Ligand Binding Site on Protein Surfaces

Introduction to Chemoinformatics and Drug Discovery

DATA FUSION APPROACHES IN LIGAND-BASED VIRTUAL SCREENING: RECENT DEVELOPMENTS OVERVIEW

UNIVERSITY OF YORK. BA, BSc, and MSc Degree Examinations Department : BIOLOGY. Title of Exam: Molecular microbiology

Transcription:

Cross Discipline Analysis made possible with Data Pipelining J.R. Tozer SciTegic

System Genesis Pipelining tool created to automate data processing in cheminformatics Modular system built with generic data types to accommodate diversity and change in the industry Validated in cheminformatics Approach equally applicable to bioinformatics and beyond

Data Pipelining is Graphical linking of independent modular steps Readers Calculators Filters Viewers etc

Data Pipelining is Nearly 300 different components available Easily extensible through integration components SOAP ODBC Perl VBScript Run Command-line program

Chemo-Genomics Ways of joining cheminformatics and bioinformatics: 1. Experimental data E.g., NCI cancer cell screen: Gene expression and compound activity data for the same set of cells 2. Pathways E.g., Kegg pathway database relates genes and compounds 3. Protein-Ligand interactions E.g., Virtual docking technologies match genes with compounds that might bind

Clustering Genes by Compound Activity NCI 60 Cancer Cell Screen Expression levels for 10K genes Growth inhibiting activity of 30K compounds Cluster genes according to the correlation of their expression level with the inhibiting activity of compounds. i.e. If Genes A and B are always over-expressed in cells that are responsive to compounds X and Y, they would cluster together. Annotate cluster members with PubMed references

Clustering Genes by Compound Activity

Clustering Genes by Compound Activity

Clustering Genes by Compound Activity 10K genes x 125 compounds x 60 cells Protocol composed in an afternoon Executes in <5 minutes

Assemble a library of compounds known to interact with genes of interest Kegg database catalogs all the genes and small molecules involved in known pathways Provides a bridge: Gene <-> Pathways <-> Compounds Given a list of interesting gene targets, assemble all the compounds that interact in the pathways they are involved in. Or, given a list of interesting compounds, determine what genes are involved in the pathways that contain them.

Assemble a library of compounds known to interact with genes of interest

Assemble a library of compounds known to interact with genes of interest

Find targets that might respond to compounds of interest ask more of your data Docking programs can evaluate the fit of compounds into binding pockets of proteins Given a list of interesting compounds, find which might bind to protein targets with known structures

Automated Protein-Ligand Docking Retrieve, prepare, dock & score ligands Calculate second score Display docked conformers in protein Calculate consensus score for ligands and display statistics

Gold is integrated as a SOAP service Parameters exposed Server location Batch size Protein file Mode Scoring Active site location

Display resulting poses DS.Viewer integration is an out-of-the-box component Uses VB/COM automation method

Docking Results

High-Throughput Docking Results

Next Steps Data Pipelining provides an ideal framework for linking the data and tools from different scientific domains. By eliminating the data and tool integration barriers, the challenge is to think of good scientific questions Looking for collaborators Unique data collections Unique domain-bridging ideas

Thank You Acknowledgements: David Rogers (fingerprints and clustering) Scott Markel (Kegg integration) Rob Brown, Andrei Caracoti (Gold Integration) Questions? (See more at SciTegic s Booth #127-129)