MD Simulation in Pose Refinement and Scoring Using AMBER Workflows

Similar documents
GC and CELPP: Workflows and Insights

MM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors

The Schrödinger KNIME extensions

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Pose and affinity prediction by ICM in D3R GC3. Max Totrov Molsoft

Supplementary Information

The Schrödinger KNIME extensions

D3RGC2: free energy scoring by alchemical free energy implementation in SOMD

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

est Drive K20 GPUs! Experience The Acceleration Run Computational Chemistry Codes on Tesla K20 GPU today

Kd = koff/kon = [R][L]/[RL]

MM-PBSA Validation Study. Trent E. Balius Department of Applied Mathematics and Statistics AMS

Advanced in silico drug design

Docking. GBCB 5874: Problem Solving in GBCB

New approaches to scoring function design for protein-ligand binding affinities. Richard A. Friesner Columbia University

The PhilOEsophy. There are only two fundamental molecular descriptors

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

Docking with Water in the Binding Site using GOLD

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Towards Physics-based Models for ADME/Tox. Tyler Day

Towards fast and accurate binding affinity. prediction with pmemdgti: an efficient. implementation of GPU-accelerated. Thermodynamic Integration

Progress of Compound Library Design Using In-silico Approach for Collaborative Drug Discovery

Supplementary Methods

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Chemical Shift Restraints Tools and Methods. Andrea Cavalli

Type II Kinase Inhibitors Show an Unexpected Inhibition Mode against Parkinson s Disease-Linked LRRK2 Mutant G2019S.

Thermodynamic Integration with Enhanced Sampling (TIES)

Computational chemical biology to address non-traditional drug targets. John Karanicolas

User Guide for LeDock

Protein-Ligand Docking Evaluations

Machine-learning scoring functions for docking

DOCKING TUTORIAL. A. The docking Workflow

Implementation of novel tools to facilitate fragment-based drug discovery by NMR:

Alchemical free energy calculations in OpenMM

Conformational sampling of macrocycles in solution and in the solid state

Advanced Molecular Dynamics

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

High Throughput In-Silico Screening Against Flexible Protein Receptors

Structure Investigation of Fam20C, a Golgi Casein Kinase

Protein Structure Determination

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

5.1. Hardwares, Softwares and Web server used in Molecular modeling

Improving Protein Function Prediction with Molecular Dynamics Simulations. Dariya Glazer Russ Altman

Protein Structures. 11/19/2002 Lecture 24 1

Large scale free energy calculations for blind predictions of protein ligand binding: the D3R Grand Challenge 2015

Statistical Mechanics for Proteins

Merck Virtual Library (MVL): Deployment, Application, and Future Enhancement

Scoring functions for of protein-ligand docking: New routes towards old goals

STRUCTURAL BIOINFORMATICS II. Spring 2018

Supplementary Figures

Large Scale FEP on Congeneric Ligand Series Have Practical Free Energy Calculations arrived at Last?

ONETEP PB/SA: Application to G-Quadruplex DNA Stability. Danny Cole

Molecular simulation and structure prediction using CHARMM and the MMTSB Tool Set Free Energy Methods

Human and Server CAPRI Protein Docking Prediction Using LZerD with Combined Scoring Functions. Daisuke Kihara

Introduction to Structure Preparation and Visualization

Mechanistic insight into inhibition of two-component system signaling

BioSolveIT. A Combinatorial Approach for Handling of Protonation and Tautomer Ambiguities in Docking Experiments

Binding Response: A Descriptor for Selecting Ligand Binding Site on Protein Surfaces

A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

Virtual screening in drug discovery

Supporting Information

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

β1 Structure Prediction and Validation

Generating Small Molecule Conformations from Structural Data

BioSolveIT. A Combinatorial Docking Approach for Dealing with Protonation and Tautomer Ambiguities

Supporting Online Material for

Modeling Biological Systems Opportunities for Computer Scientists

Prediction and refinement of NMR structures from sparse experimental data

Towards Ligand Docking Including Explicit Interface Water Molecules

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Cheminformatics platform for drug discovery application

Life Science Webinar Series

T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y. jgp

Computational Analysis of Small Proteins using the WebAmber Interface. By Eddie Gorr Gustavus Adolphus College

Detection of Protein Binding Sites II

A Brief Guide to All Atom/Coarse Grain Simulations 1.1

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification

CHAPTER-5 MOLECULAR DOCKING STUDIES

Limitations of temperature replica exchange (T-REMD) for protein folding simulations

Bridging the Dimensions:

Visualization of Macromolecular Structures

Protein Structure Analysis

A Rational Approach to Receptor-Flexible Docking: Method and Validation

Retrieving hits through in silico screening and expert assessment M. N. Drwal a,b and R. Griffith a

Molecular Modelling for Medicinal Chemistry (F13MMM) Room A36

Fondamenti di Chimica Farmaceutica. Computer Chemistry in Drug Research: Introduction

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Structural Perspectives on Drug Resistance

Supplementary information for cloud computing approaches for prediction of ligand binding poses and pathways

5323 Harry Hines Blvd., Dallas, TX Department of Chemistry and Biochemistry, University of California at San Diego,

Virtual Screening: How Are We Doing?

Predictive Molecular Simulation for Drug Discovery

How is molecular dynamics being used in life sciences? Davide Branduardi

The Long and Rocky Road from a PDB File to a Protein Ligand Docking Score. Protein Structures: The Starting Point for New Drugs 2

Transcription:

MD Simulation in Pose Refinement and Scoring Using AMBER Workflows Yuan Hu (On behalf of Merck D3R Team) D3R Grand Challenge 2 Webinar Department of Chemistry, Modeling & Informatics Merck Research Laboratories, Merck & Co., Inc. March 27, 2017

Outline Overview of the D3R Predictions from Merck Today just the Stage 1 results Merck Team & Result Ranking Overall Pose Prediction and Scoring Approach Structure Preparation & Docking AMBER Workflows for Pose Refinement & Ranking Case Study & Lessons Learned Ranking with Ensemble-based MD-MMGBSA Pose Exploration with MD 2

Acknowledgement Merck D3R Team Modeling Ying-Duo Gao Xavier Fradera Yuan Hu Andreas Verras Deping Wang Hongwu Wang James Fells Kira Armacost Alejandro Crespo Brad Sherborne Informatics Huijun Wang Zhengwei Peng Robert Sheridan Motivation To refine and share our best practices To be engaged in the scientific community To better understand the capability of sciences in industry & academia 3

Overall Approach for Stage 1 Step 1: preparation as a team 102 D3R dataset Literature search Data mining from PDB and CHEMBL Structure Preparation Schrodinger s ligprep/prepwizard (Protomer/Tautomer, fill missing loop) 4

Overall Approach for Stage 1 Step 1: preparation as a team 102 D3R dataset Literature search Data mining from PDB and CHEMBL Structure Preparation Schrodinger s ligprep/prepwizard (Protomer/Tautomer, fill missing loop) Step 2: docking experiments Glide Ensemble Docking Submission A Colony Entropy Docking Submission B Glide+Gold Docking Submission C Visual Inspection (A+C) Submission E 5

Overall Approach for Stage 1 Step 1: preparation as a team 102 D3R dataset Literature search Data mining from PDB and CHEMBL Structure Preparation Schrodinger s ligprep/prepwizard (Protomer/Tautomer, fill missing loop) Step 2: docking experiments Glide Ensemble Docking Submission A Colony Entropy Docking Submission B Glide+Gold Docking Submission C Visual Inspection (A+C) Submission E Step 3: MD-MMGBSA MD simulations using multiple replicates MD pose refinement Submission D Ensemble-based MMGBSA ranking 6

Overall Approach for Stage 1 Step 1: preparation as a team 102 D3R dataset Literature search Data mining from PDB and CHEMBL Structure Preparation Schrodinger s ligprep/prepwizard (Protomer/Tautomer, fill missing loop) Step 2: docking experiments Glide Ensemble Docking Submission A Colony Entropy Docking Submission B Glide+Gold Docking Submission C Visual Inspection (A+C) Submission E Step 3: MD-MMGBSA MD simulations using multiple replicates Can MD be used to refine docking pose? How does MD-MMGBSA perform in ranking? MD pose refinement Submission D Ensemble-based MMGBSA ranking 7

Overall conclusion: MD improves pose prediction and ranking Mean RMSD to Crystal (Å) 14 12 10 8 6 4 2 0 Mean RMSD of Top Scoring Poses A: Ensemble Docking C: Glide+Gold E: Visual Selection B: Colony Entropy Docking D: MD refinement 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 0.50 0.40 0.30 0.20 0.10 0.00-0.10-0.20-0.30-0.40 D: MD-MMGBSA Kendalls Tau (Stage 1-Ligand Scoring) A: Ensemble Docking 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 * n=36 C: Glide+Gold 8

Overall conclusion: MD improves pose prediction and ranking Mean RMSD to Crystal (Å) 14 12 10 8 6 4 2 0 Mean RMSD of Top Scoring Poses A: Ensemble Docking C: Glide+Gold E: Visual Selection B: Colony Entropy Docking D: MD refinement 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 0.50 0.40 0.30 0.20 0.10 0.00-0.10-0.20-0.30-0.40 D: MD-MMGBSA Kendalls Tau (Stage 1-Ligand Scoring) A: Ensemble Docking 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 * n=36 C: Glide+Gold 9

Glide Ensemble Docking & AMBER workflows Completely automated Ensemble Docking AMBER MD-MMGBSA workflows Ligand/Protein Structure Preparation Glide XP Ensemble Docking 36 ligands x40 templates =1,440 Pose selection Rank by score rmsd >= 1.0 A to previously selected pose to ensure diversity Top Scoring Pose *Workflow was validated before on 40 native ligands *Workflow was validated in large data sets. Y. Hu, B. Sherborne, Z. Guo, et al.. How to Obtain Reliable and Reproducible MMGBSA Results? manuscript in preparation. 10

Glide Ensemble Docking & AMBER workflows Completely automated Ensemble Docking AMBER MD-MMGBSA workflows Ligand/Protein Structure Preparation Glide XP Ensemble Docking 36 ligands x40 templates =1,440 Pose selection Rank by score rmsd >= 1.0 A to previously selected pose to ensure diversity CRAY: 7h GPU: 3h 10-15cmpds/day Top Scoring Pose *Workflow was validated before on 40 native ligands *Workflow was validated in large data sets. Y. Hu, B. Sherborne, Z. Guo. How to Obtain Reliable and Reproducible MMGBSA Results? manuscript in preparation. 11

Glide Ensemble Docking & AMBER workflows Completely automated Ensemble Docking AMBER MD-MMGBSA workflows Ligand/Protein Structure Preparation Pose Selection Glide XP Ensemble Docking 36 ligands x40 templates =1,440 Pose selection Rank by score rmsd >= 1.0 A to previously selected pose to ensure diversity CRAY: 7h GPU: 3h 10-15cmpds/day Top Scoring Pose *Workflow was validated before on 40 native ligands *Workflow was validated in large data sets. Y. Hu, B. Sherborne, Z. Guo. How to Obtain Reliable and Reproducible MMGBSA Results? manuscript in preparation. 12

Good template is important for docking Similarity matters Docking Pose RMSD to X-ray Crystal (Å) Similarity>0.5 More templates More active Size by activity Red: Activity > 1 μm Green: Activity <= 1 μm 2D Similarity of closest template ligand in ensemble of complexes (descriptor: ECFP6) 13

Good template is important for docking Similarity matters Docking Pose RMSD to X-ray Crystal (Å) Similarity<=0.5 More templates More active Similarity>0.5 More templates More active Size by activity Red: Activity > 1 μm Green: Activity <= 1 μm 2D Similarity of closest template ligand in ensemble of complexes (descriptor: ECFP6) 14

Good template is important for docking Similarity matters Docking Pose RMSD to X-ray Crystal (Å) Similarity<0.5 Less templates More inactive Similarity<=0.5 More templates More active Similarity>0.5 More templates More active Size by activity Red: Activity > 1 μm Green: Activity <= 1 μm 2D Similarity of closest template ligand in ensemble of complexes (descriptor: ECFP6) 15

Question 1 Can MD be used to refine docking pose? 16

Pose refinement with MD Pose from Docking Pose from MD Pose no help ΔRMSD > 0.5 Å ΔRMSD < - 0.5 Å -0.5 <ΔRMSD < 0.5 Å RMSD to X-ray Crystal (Å) Pose maintained 0 Pose improved Docking Pose RMSD to X-ray Crystal (Å) 17

Case Study-success: FXR_3 Ligand interactions optimized through MD Ligand moves to create stable H-bond Crystal Structure Docking Pose MD refinement RMSD 2.00 Å RMSD 0.92 Å 18

Case Study-success: FXR_3 Ligand interactions optimized through MD 19

Case Study-success: FXR_10 Protein optimized through MD MD reorients the Asn to make H-bond contact Crystal Structure Docking Pose MD refinement RMSD 2.26 Å RMSD 1.41 Å 20

Case Study-success: FXR_10 Protein optimized through MD MD reorients the Asn to make H-bond contact Crystal Structure Docking Pose MD refinement RMSD 2.26 Å RMSD 1.41 Å 21

Case Study-success: FXR_10 Protein optimized through MD 22

Case Study-failure: FXR_15 MD workflow can t rescue reversed pose The docked pose has a reverse direction comparing to x-ray Hard to refine the structure with MD Crystal Structure Docking Pose MD refinement RMSD 8.83 Å RMSD 9.37 Å 23

Case Study-failure: FXR_15 MD workflow can t rescue reversed pose 24

Case Study-failure: FXR_18 MD workflow can t address very large protein movements Protein conformation change Ligand RMSD to X-ray Docking: 9.36 Å MD: 10.32 Å Chain opened up in template Crystal Structure Docking Template Protein RMSD 2.59 Å MD refinement 25

Case Study-failure: FXR_18 MD workflow can t address very large protein movements 26

Lesson learned from previous D3R challenge: Explicit solvent free energy methods have not yet outperformed faster scoring methods in blinded protein-ligand affinity predictions. -D3R website (Posted October 11, 2016) https://drugdesigndata.org/about/what-we-have-learned Question 2 How does MD-MMGBSA perform in ligand scoring? 27

MD-MMGBSA improves on docking scores D: MD-MMGBSA Docking Methods ΔG pred (kcal/mol) A:Glide Ensemble Docking C:Xscore Docking IC50 (um) Gold ChemScore Prime MMGBSA Score binding pocket is deep inside challenge data set for implicit model (MMGBSA) Stage1: 36cmpds 28

Summary Lessons o o o o Ensemble docking performs well in pose prediction & ranking, if there are suitably close protein complexes available MD maintains or refines docking poses (protein and ligand) and extends the limit of prediction MD can t refine challenging poses Ensemble-based MD-MMGBSA ranking improves on docking scores Teamwork broke the challenge down to manageable portions Next o o AMBER MD-MMGBSA to AMBER TI CPU to GPU Towards accurate binding affinity prediction with pmemdgt: an efficient implementation of GPU-accelerated Thermodynamics Integration. T. Lee, Y. Hu, B. Sherborne, Z. Guo, D.M. York. submitted. 29

Thank You! 30

Pose exploration with MD Multiple studies show that the low energy snapshots correlate with best pose prediction 650ps 10 frames Avg *average energy of the 10 lowest energy snapshots for each trajectory of 22 MD simulations differed by the initial position. Onufriev, A.; Bashford, D.; Case, D. Proteins, 2004, 55, 383 394. Hou, et al. J Comput Chem. 2011 Apr 15;32(5):866-77 31