User Guide for LeDock

Similar documents
Kd = koff/kon = [R][L]/[RL]

DOCKING TUTORIAL. A. The docking Workflow

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

Portal. User Guide Version 1.0. Contributors

Docking. GBCB 5874: Problem Solving in GBCB

Using AutoDock for Virtual Screening

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Using AutoDock 4 with ADT: A Tutorial

Build_model v User Guide

The PhilOEsophy. There are only two fundamental molecular descriptors

Analyzing Molecular Conformations Using the Cambridge Structural Database. Jason Cole Cambridge Crystallographic Data Centre

Generating Small Molecule Conformations from Structural Data

Ligand Scout Tutorials

Flexibility and Constraints in GOLD

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner

GC and CELPP: Workflows and Insights

Docking with Water in the Binding Site using GOLD

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Structural Bioinformatics (C3210) Molecular Docking

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Conformational Searching using MacroModel and ConfGen. John Shelley Schrödinger Fellow

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft

What is Protein-Ligand Docking?

BioSolveIT. A Combinatorial Docking Approach for Dealing with Protonation and Tautomer Ambiguities

5.1. Hardwares, Softwares and Web server used in Molecular modeling

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

Introduction to Structure Preparation and Visualization

April, The energy functions include:

Fondamenti di Chimica Farmaceutica. Computer Chemistry in Drug Research: Introduction

Supporting Information

Fragment based drug discovery in teams of medicinal and computational chemists. Carsten Detering

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

High Throughput In-Silico Screening Against Flexible Protein Receptors

Identifying Interaction Hot Spots with SuperStar

The Conformation Search Problem

OpenDiscovery: Automated Docking of Ligands to Proteins and Molecular Simulation

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Towards Physics-based Models for ADME/Tox. Tyler Day

Structure based drug design and LIE models for GPCRs

Toward an Understanding of GPCR-ligand Interactions. Alexander Heifetz

Molecular modeling with InsightII

Practical QSAR and Library Design: Advanced tools for research teams

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

Tutorial. Getting started. Sample to Insight. March 31, 2016

Electronic Supplementary Information Effective lead optimization targeted for displacing bridging water molecule

FlexPepDock In a nutshell

Conformational Sampling of Druglike Molecules with MOE and Catalyst: Implications for Pharmacophore Modeling and Virtual Screening

Force Fields for Classical Molecular Dynamics simulations of Biomolecules. Emad Tajkhorshid

Introduction to FBDD Fragment screening methods and library design

Performing a Pharmacophore Search using CSD-CrossMiner

Binding Response: A Descriptor for Selecting Ligand Binding Site on Protein Surfaces

MM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors

Protein-Ligand Docking Evaluations

The Schrödinger KNIME extensions

CSD Conformer Generator User Guide

Medicinal Chemistry/ CHEM 458/658 Chapter 4- Computer-Aided Drug Design

Molecular Mechanics, Dynamics & Docking

Dock Ligands from a 2D Molecule Sketch

CSD Conformer Generator User Guide

CSD. Unlock value from crystal structure information in the CSD

Life Science Webinar Series

Hydrogen Bonding & Molecular Design Peter

Part 6. 3D Pharmacophore Modeling

Different conformations of the drugs within the virtual library of FDA approved drugs will be generated.

Preparing a PDB File

BCB410 Protein-Ligand Docking Exercise Set Shirin Shahsavand December 11, 2011

The reuse of structural data for fragment binding site prediction

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

Pose Prediction with GOLD

CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields

Schrodinger ebootcamp #3, Summer EXPLORING METHODS FOR CONFORMER SEARCHING Jas Bhachoo, Senior Applications Scientist

Protein Structure Prediction and Protein-Ligand Docking

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

Virtual screening for drug discovery. Markus Lill Purdue University

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Structure-based maximal affinity model predicts small-molecule druggability

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification

Supporting Online Material for

Introduction to Spark

KNIME-based scoring functions in Muse 3.0. KNIME User Group Meeting 2013 Fabian Bös

ATP GTP Problem 2 mm.py

The Long and Rocky Road from a PDB File to a Protein Ligand Docking Score. Protein Structures: The Starting Point for New Drugs 2

Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans

Medicinal Chemist s Relationship with Additivity: Are we Taking the Fundamentals for Granted?

A Brief Guide to All Atom/Coarse Grain Simulations 1.1

Catalytic Mechanism of the Glycyl Radical Enzyme 4-Hydroxyphenylacetate Decarboxylase from Continuum Electrostatic and QC/MM Calculations

1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

fconv Tutorial Part 2

Ligand-receptor interactions

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface

Structural Bioinformatics (C3210) Molecular Mechanics

Protein-Ligand Docking Methods

CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields

Hit Finding and Optimization Using BLAZE & FORGE

Tutorial: Molecular docking

Transcription:

User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved.

Introduction LeDock is flexible small-molecule docking software, which performs an exhaustive search of position, orientation and conformation of a ligand in the active site of a protein. Citation Zhang, N.; Zhao, H. Enriching screening libraries with bioactive fragment space. Bioorg. Med. Chem. Lett. 2016, 26, 3594-7. Theory Scoring Function. From the perspective of molecular recognition, a ligand binds to its receptor via shape complementarity using hydrogen bonds as anchors. The binding affinity G bind is determined by (Eq. 1). q ΔG bind = α (E vdw i + E hb i )Θ(E co E vdw i E hb i ) + β(r) i q j + γe lig i lig i lig j pro r i j strain (1) The first term is the sum of vdw interaction E vdw (Eq. 2) and hydrogen bonding energy E hb (Eq. 3), where Θ is the Heaviside step function and E co is the cutoff energy to allow for soft docking. The second term is the electrostatic interaction energy with a distance function β(r) accounting for both electrostatic screening and desolvation effects, where q is the partial atomic charge and r is the distance between a pair of atoms. The last term is the ligand conformational strain upon binding, and is contributed by the intra-molecular clash and/or torsion strain. Coefficients α, β and γ were empirically determined by referring to the previous scoring function. 1 12 min r E vdw i = ε ij ij r 2 r 6 min ij ij r ij (2) j pro E hb i = w ij (r ij r co )Θ(r co r ij ) (3) j pro where ε and r min are LJ parameters. E hb runs over hydrogen bonding pairs where r co is the cutoff distance of forming a H-bond, and w is the H-bonding weight to account for a variety of H-bonding strength. Although H-bonding weights may be derived in accordance with H-bond basicity, 2, 3 they are taken from those previously used to calculate H-bonding penalty upon ligand binding. 1 2

Search Algorithm. The evolutionary algorithm is adopted in combination with simulated annealing search, which is used to generate the first generation of docking poses. At the beginning of each simulated annealing search, the input ligand conformation, including its position, orientation and torsions, is randomly changed, so that each search starts with a different ligand conformation. Implementation. LeDock was coded in C++ language, with a simplified input interface: definition of the binding site, number of runs, root mean square deviation (RMSD) cutoff for pose clustering, and a ligand list for sequential docking of a library. LeDock reads small molecules in a Sybyl Mol2 format, where both Sybyl atom types and bond order shall be correctly assigned. Performance Docking performance on the Astex diversity set and kinase-focused set f RMSD 2.0Å a (%) f RMSD 2.0Å b (%) Time c (minutes) Astex diversity set 4 94.1 ± 2.0 d 82.4 ± 0.0 d 1.2 kinase-focused set 1 97 91 1.4 a By the closest pose among 20 runs. b By the best-scored pose among 20 runs. c Average time per 20 runs. d Standard deviation based on three replicates each with 20 runs. 100 Cumulative percentage (%) 80 60 40 20 Astex diversity set Kinase-focused set 0 0.0 0.5 1.0 1.5 2.0 RMSD (Å) Cumulative distribution of RMSD values within the cutoff of 2 Å by the closest poses relative to the crystallographic binding modes for the Astex diversity set (black) and kinase-focused set (red). 3

Usage $LeDock Get Help information. $LeDock [input file] Dock small molecules into the protein binding site one by one. The output for each small molecule is written into a file named [Small-molecule-name].dok. $LeDock spli [name].dok Split the docking poses of a docking output into individual PDB coordinates. 4

Input // An input file contains the following information: Receptor pro.pdb RMSD 1.0 Binding pocket x min y min z min x max y max z max // Number of binding poses 20 Ligands list ligands.list Receptor, Binding pocket, Number of binding poses, and Ligands list are keywords, which shall not be changed.! Receptor Explicit hydrogen atoms should be included in the PDB file. The atom will be read in only when a line starts with ATOM, so if water molecules, metal ions, cofactors, substrates need to be kept during docking, please ensure the line starting with ATOM instead of HETATM. The residue name and atom name of a protein shall follow the nomenclature of CHARMM. The original heavy atom name and residue name of a protein in a PDB file directly downloaded from Protein Data Bank shall also be fine. However, please use residue name HOH for water molecules, instead of TIP3. Section 4 will introduce the program LePro, which can be used to automatically process a PDB file downloaded from Protein Data Bank by removing all water molecules and cofactors, and adding hydrogen atoms to the protein. LePro will also generate a docking input file automatically. Please take a look at the generated protein file and get a flavor of PDB format required for docking. 5

! RMSD When multiple docking runs are carried out, one might find some docking poses are very similar. In order to reduce redundancy, a clustering by RMSD can be performed. The recommended value is 1.0 Å.! Binding pocket The binding pocket is set as a rectangular box. The size of the binding pocket is recommended to include any protein atom within 4 to 6 Å of any heavy atom of a drug-like molecule.! Number of binding poses 20 by default.! Ligands list A file such as ligands.list contains all small molecules to be docked. Please prepare your small molecules in SYBYL Mol2 format. A Mol2 file of a small molecule can be directly downloaded from ZINC database (http://zinc.docking.org/). One can use the program LeFrag to split a batch of Mol2 ligands into separate Mol2 files: $lefrag spli batch.mol2 Then to generate a ligand list, one can simply use the following command: $ls *mol2 > ligands.list Using the program LePro to automatically process the protein and generate an input file for docking Please download your target protein from Protein Data Bank, and then use the following command: $LePro [PDB code].pdb Two files will be generated. One is pro.pdb which can be used for docking. All water molecules, cofactors, and ligands are removed. Since the hydrogen atom of OH group in residues such Tyr, Ser and Thr may have different orientations, please check if the orientation of Hydrogen is correct and if not make changes accordingly. 6

The other is dock.in. The size of the binding site is determined as to include any protein atom within 4 Å of any heavy atom of the first ligand. However, this is not necessarily the binding site one might expect. Please ensure a correct binding site. For more information about LePro, please refer to its manual. Graphic interface Tutorial Please refer to the tutorial, which includes test cases with detailed explanation and troubleshooting. References 1. Zhao, H.; Huang, D. Hydrogen bonding penalty upon ligand binding. PLoS One 2011, 6, e19923. 2. Laurence, C.; Brameld, K. A.; Graton, J.; Le Questel, J. Y.; Renault, E. The pk(bhx) database: toward a better understanding of hydrogen-bond basicity for medicinal chemists. J. Med. Chem. 2009, 52, 4073-86. 3. Green, A. J.; Popelier, P. L. Theoretical prediction of hydrogen-bond basicity pkbhx using quantum chemical topology descriptors. J. Chem. Inf. Model. 2014, 54, 553-61. 4. Hartshorn, M. J.; Verdonk, M. L.; Chessari, G.; Brewerton, S. C.; Mooij, W. T.; Mortenson, P. N.; Murray, C. W. Diverse, high-quality test set for the validation of protein-ligand docking performance. J Med Chem 2007, 50, 726-41. 7