Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Similar documents
Docking. GBCB 5874: Problem Solving in GBCB

What is Protein-Ligand Docking?

Kd = koff/kon = [R][L]/[RL]

MOLECULAR RECOGNITION DOCKING ALGORITHMS. Natasja Brooijmans 1 and Irwin D. Kuntz 2

Protein-Ligand Docking Evaluations

Protein-Ligand Docking Methods

Structural Bioinformatics (C3210) Molecular Docking

Protein-Ligand Docking

Using AutoDock 4 with ADT: A Tutorial

Protein-Ligand Docking Methods

Protein Structure Prediction and Protein-Ligand Docking

Structural Bioinformatics (C3210) Molecular Mechanics

BCB410 Protein-Ligand Docking Exercise Set Shirin Shahsavand December 11, 2011

Cheminformatics platform for drug discovery application

High Throughput In-Silico Screening Against Flexible Protein Receptors

Molecular Mechanics, Dynamics & Docking

Using AutoDock With AutoDockTools: A Tutorial

User Guide for LeDock

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

Ranking of HIV-protease inhibitors using AutoDock

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

The PhilOEsophy. There are only two fundamental molecular descriptors

Virtual Screening: How Are We Doing?

Assessing Scoring Functions for Protein-Ligand Interactions

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

Computational modeling of G-Protein Coupled Receptors (GPCRs) has recently become

Small Molecule & Protein Docking CHEM 430

Pedro Alexandrino Fernandes, Dep. Chemistry & Biochemistry, University of Porto, Portugal

From Small Molecules to Biological Molecules: Modelling Interactions. Dr. Antonio Chana Milano, 22 nd April 2008

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

New techniques of molecular modelling and structural chemistry for the development of bioactive compounds

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Introduction to AutoDock and AutoDock Tools

Using AutoDock 4 with ADT: A Tutorial

Schrodinger ebootcamp #3, Summer EXPLORING METHODS FOR CONFORMER SEARCHING Jas Bhachoo, Senior Applications Scientist

Supporting Online Material for

In silico pharmacology for drug discovery

Portal. User Guide Version 1.0. Contributors

Principles of Drug Design

Receptor Based Drug Design (1)

Using Molecular Dynamics and Enhanced Sampling to Predict Binding Poses Beyond The Rigid Docking Approximation

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

Crystal Structure Prediction using CRYSTALG program

OpenDiscovery: Automated Docking of Ligands to Proteins and Molecular Simulation

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

A genetic algorithm for the ligand-protein docking problem

Challenges, Applications, and Recent Advances of Protein-Ligand Docking in Structure-Based Drug Design

Computational chemical biology to address non-traditional drug targets. John Karanicolas

Towards Physics-based Models for ADME/Tox. Tyler Day

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

MM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors

Virtual screening in drug discovery

Computational protein design

Computational Analyses of Protein-Ligand Interactions

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

Author Index Volume

Studying complex macromolecules

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Computational Identification of Inhibitors of Protein-Protein Interactions

Molecular Dynamics. A very brief introduction

Presenter: She Zhang

Structural biology and drug design: An overview

ALL LECTURES IN SB Introduction

ONETEP PB/SA: Application to G-Quadruplex DNA Stability. Danny Cole

Molecular Simulation III

Structure Investigation of Fam20C, a Golgi Casein Kinase

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Investigation of physiochemical interactions in

Chimica Farmaceutica

Generating Small Molecule Conformations from Structural Data

DOCKING TUTORIAL. A. The docking Workflow

Biologically Relevant Molecular Comparisons. Mark Mackey

Advanced in silico Drug Design KFC/ADD

Hit Finding and Optimization Using BLAZE & FORGE

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft

FDS: Flexible Ligand and Receptor Docking with a Continuum Solvent Model and Soft-Core Energy Function

Introduction to" Protein Structure

Scoring functions. Talk Overview. Eran Eyal. Scoring functions what and why

BME Engineering Molecular Cell Biology. Structure and Dynamics of Cellular Molecules. Basics of Cell Biology Literature Reading

Automatic Epitope Recognition in Proteins Oriented to the System for Macromolecular Interaction Assessment MIAX

SCREENED CHARGE ELECTROSTATIC MODEL IN PROTEIN-PROTEIN DOCKING SIMULATIONS

NIH Public Access Author Manuscript J Comput Chem. Author manuscript; available in PMC 2011 February 18.

5.1. Hardwares, Softwares and Web server used in Molecular modeling

SAM Teacher s Guide Protein Partnering and Function

Advanced in silico drug design

Using AutoDock for Virtual Screening

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

All-atom Molecular Mechanics. Trent E. Balius AMS 535 / CHE /27/2010

Docking with Water in the Binding Site using GOLD

Virtual Libraries and Virtual Screening in Drug Discovery Processes using KNIME

CE 530 Molecular Simulation

Transcription:

Softwares for Molecular Docking Lokesh P. Tripathi NCBS 17 December 2007

Molecular Docking Attempt to predict structures of an intermolecular complex between two or more molecules Receptor-ligand (or drug) Enzyme-substrate Protein-DNA (or RNA) Protein-protein

Brief History of Docking Crick (1953) suggested that complementarity in helical coils could be modelled as knobs fitting into holes DOCK (Kuntz, 1982) pioneered the field of molecular docking GRID (Goodford, 1985) too became a part of many subsequent softwares

General considerations Molecular representations Abstract or atoms Fixed or flexible Juxtaposition of molecules Interactive or automated Search algorithm to create conformations Evaluating complementarity (ranking) Scoring function Force field energy functions

Search Algorithms Potentially several ways of putting two molecules together; possibilities increase exponentially with size of molecules involved Attempt to locate the most stable state in the energy landscape Broadly two types: 1) full solution space search; 2) guided search through solution space

Search Algorithms Random Genetic algorithms Monte Carlo methods Tabu search Systematic Fragment-based methods Point complementary methods Distance geometry methods Database Simulation Molecular dynamics Energy minimisation Multiple methods Algorithms

Docking Softwares Virtual screening AutoDock DOCK FlexX/E SLIDE Surflex ICM GOLD De novo design LUDI GRID MCSS SMoG GrowMol SPROUT

Random methods Sample the conformation space by making single change to a ligand or a population of ligands Alteration performed at each step and accepted or rejected based on a predetermined probability function Include Monte Carlo (MC) methods; Genetic Algorithm (GA) methods; Tabu search methods

Monte Carlo methods Use a simple energy function Makes random moves and accepting or rejecting based on Boltzmann probability function More efficient in stepping over energy barriers, allowing more complete searches of conformation space PRODOCK, MC-DOCK, ICM, DockVision, QXP, GLIDE; too slow for extensive flexible docking

Energy global minimum conformers generated by Monte Carlo method

Genetic Algorithm methods Apply ideas of genetics and evolution in docking Start with an initial population of random ligand conformers wrt protein, each defined by a set of variables called genes Genetic operators (mutations, crossovers) applied to sample conformation space till optimal population is derived AUTODOCK, GOLD, DIVALI, DARWIN; too slow for extensive flexible docking

Autodock Suite of automated docking tools Designed to predict how small molecules (ligands drug candidates) bind to a receptor; AMBER force field Three constituent programs -Autotors- define torsions in the ligand -Autogrid- calculate grids -Autodock- docking tool -AutoDockTools (ADT)- GUI to facilitate above and other modules accompanying AutoDock

Autodock Lamarckian GA LGA encompasses a genotypic and phenotypic phase i.e. genetic operations and energy function to be optimised Energy minimisation performed after genotypic changes and these phenotypic changes mapped back onto genes (by changing ligand coordinates. Most efficient and reliable of random methods

Autodock Grid maps Pre-calculated Grid for each atom type (e.g. C, H, O, N) Consists of 3D lattice of regularly spaced points, surrounding and centered on region of interest in the macromolecule Typical spacing is 0.375 Å Probe atom placed at each grid point and energy calculated

GOLD Genetic Optimisation and Ligand Docking, uses multiple subpopulations of ligand Force-field based scoring function, includes three terms: H-bonding term, intermolecular dispersion potential, intramolecular potential 71% success in identifying experimental binding mode in 100 protein complexes

Tabu Search methods Impose restrictions preventing searches from repeating already explored conformations New conformation is compared to the previous ones based on RMSD values which determine acceptance PRO-LEADS

Systematic Search methods Attempt to explore all degrees of freedom in a molecule Can be divided into three types: conformational search methods, fragmentation methods, and database methods

Conformational Search methods Brute force or shotgun methods of docking All rotatable bonds in ligand rotated through 360 till in fixed increments till all possible combinations generated and evaluated Number of structures generated increases exponentially with number of rotatable bondscombinatorial explosion

Fragmentation Search methods Incrementally grow ligand into the active site, by docking several fragments into the active site followed by covalent-linking to recreate the initial ligand Rigid core-fragment of the ligand is docked first followed by addition of flexible regions DOCK, FlexX, LUDI, ADAM, Hammerhead

DOCK Methodology

FlexX Base fragment is picked up and docked using pose-clustering algorithm Clustering algorithm is implemented to merge similar ligand transformations into active site Flexible fragments are added incrementally using MIMUMBA and evaluated using overlap function, followed by energy calculations till the ligand is completely built Final evaluation through Böhm s scoring function that includes H-bonds, ionic, aromatic and lipophilic terms

Database methods Tackle combinatorial explosion by using libraries of pregenerated conformations to deal with ligand flexibility FLOG generates and docks conformational libraries called Flexibases using distance geometry EUDOC uses conformational searches of ligands to generate different structures, which are placed into receptor active-site followed by energy evaluation

Scoring Essential to rank the ligand conformations determined by the search algorithms Scoring function must be able to distinguish between true binding modes and others Speed and accuracy are most desirable Three major classes: force-field based; empirical; knowledge-based

GoldScore, G-SCORE, D-SCORE, AMBER, CHARRM, GROMOS Force-field based Scoring Quantify sum of two energies-interaction energy between receptor-ligand; internal energy of the ligand Consist of van der Waals (Lennard-Jones potential) + electrostatic energy terms (Coulombic function) Do not include solvation and entropic terms

Empirical Scoring Designed to reproduce experimental data; binding energy can be approximated by sum of individual uncorrelated terms Experimentally determined binding energies used to quantify individual terms Easy computation, but non-versatile due to dependence on experimental datasets ChemScore, Böhm s scoring function, F-Score, X-Score

Knowledge-based Scoring Statistically derived principles that aim to replicate experimentally determined structures Employ simple interactions to screen large databases Dependent on information available in preexisting datasets DrugScore, SMoG score, Potential of Mean force (PMF)

Consensus Scoring Combines information from different scoring schemes to compensate for individual limitations Correlation of individual scoring systems may be a problem X-SCORE combines functions from PMF, ChemScore, PMF with FlexX

Protein-protein Docking Prediction of protein complex structure given individual components structures Huge number of degrees of freedom; docking largely performed as rigid body docking Z-DOCK, a Fast Fourier Transform-based rigid body docking program, is one of the most accurate programs as rated in Critical Assessment of Predicted Interactions (CAPRI)

Docking- strengths and limitations Most available softwares are able to predict known protein-bound conformations with an accuracy of 1.5-2 Å; 70-80% success rate Scoring function- major limitation factor due to simplifications and assumptions Solvation effects, quality of crystallographic data

Comparing Docking softwares in difficult Several studies compare docking programs but conclusions of general applicability are not evident Minor differences in methodology can have significant impact on success rates of various docking programs Cole et al., 2005 PROTEINS 60, 325-332 provide a list of recommendations in assessing docking programs

Docking softwares representations in citations

Docking Softwares- Citations per year

Challenges Predicting structures of multi-domain, multisubunit protein complexes Prediction and specificity in protein-nucleic acid interactions Protein-docking with backbone flexibility