Alchemical free energy calculations in OpenMM

Similar documents
Free energy calculations with alchemlyb

Free energy simulations

YANK: Free energy calculations made useful.

Free energy calculations

Large Scale FEP on Congeneric Ligand Series Have Practical Free Energy Calculations arrived at Last?

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Multi-Ensemble Markov Models and TRAM. Fabian Paul 21-Feb-2018

ONETEP PB/SA: Application to G-Quadruplex DNA Stability. Danny Cole

Biomolecular modeling. Theoretical Chemistry, TU Braunschweig (Dated: December 10, 2010)

Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: FEP and Related Methods

Statistical Mechanics for Proteins

An introduction to Molecular Dynamics. EMBO, June 2016

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Supporting Information Quantitative characterization of the binding and unbinding of millimolar drug fragments with molecular dynamics simulations

Supplementary information for cloud computing approaches for prediction of ligand binding poses and pathways

Limitations of temperature replica exchange (T-REMD) for protein folding simulations

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

Using Bayesian Statistics to Predict Water Affinity and Behavior in Protein Binding Sites. J. Andrew Surface

Using the Fast Fourier Transform in Binding Free Energy Calculations

Characterizing Structural Transitions of Membrane Transport Proteins at Atomic Detail Mahmoud Moradi

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Structural Bioinformatics (C3210) Molecular Mechanics

MM-GBSA for Calculating Binding Affinity A rank-ordering study for the lead optimization of Fxa and COX-2 inhibitors

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Lecture 27. Transition States and Enzyme Catalysis

MD Simulations of Proteins: Practical Hints and Pitfalls to Avoid

MD Simulation in Pose Refinement and Scoring Using AMBER Workflows

Basic Ingredients of Free Energy Calculations: A Review

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

10/26/2010. An Example of a Polar Reaction: Addition of H 2 O to Ethylene. to Ethylene

Exercise 2: Solvating the Structure Before you continue, follow these steps: Setting up Periodic Boundary Conditions

MM-PBSA Validation Study. Trent E. Balius Department of Applied Mathematics and Statistics AMS

Structure Investigation of Fam20C, a Golgi Casein Kinase

Free energy calculations

From Friday s material

Large scale free energy calculations for blind predictions of protein ligand binding: the D3R Grand Challenge 2015

Type II Kinase Inhibitors Show an Unexpected Inhibition Mode against Parkinson s Disease-Linked LRRK2 Mutant G2019S.

Problems from Previous Class

Towards fast and accurate binding affinity. prediction with pmemdgti: an efficient. implementation of GPU-accelerated. Thermodynamic Integration

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Improving Protein Function Prediction with Molecular Dynamics Simulations. Dariya Glazer Russ Altman

Estimating Atomic Contributions to Hydration and Binding Using Free Energy Perturbation

Conformational Searching using MacroModel and ConfGen. John Shelley Schrödinger Fellow

Molecular Modeling Lecture 11 side chain modeling rotamers rotamer explorer buried cavities.

The Molecular Dynamics Method

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Structure based drug design and LIE models for GPCRs

Thermodynamic Integration with Enhanced Sampling (TIES)

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

Applications of Molecular Dynamics

Specific Ion Solvtion in Ethylene Carbonate and Propylene Carbonate

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

4 Examples of enzymes

2013 W. H. Freeman and Company. 6 Enzymes

High Throughput In-Silico Screening Against Flexible Protein Receptors

How is molecular dynamics being used in life sciences? Davide Branduardi

MMGBSA: Thermodynamics of Biomolecular Systems

Structure-based maximal affinity model predicts small-molecule druggability

Hamiltonian Replica Exchange Molecular Dynamics Using Soft-Core Interactions to Enhance Conformational Sampling

All-atom Molecular Mechanics. Trent E. Balius AMS 535 / CHE /27/2010

Free energy calculations and the potential of mean force

Biomolecules are dynamic no single structure is a perfect model

Variational Implicit Solvation of Biomolecules: From Theory to Numerical Computations

Advanced Molecular Dynamics

Computational Study of Gleevec and G6G Reveals Molecular Determinants of Kinase Inhibitor Selectivity

BIOC : Homework 1 Due 10/10

k θ (θ θ 0 ) 2 angles r i j r i j

An Informal AMBER Small Molecule Force Field :

C a h p a t p e t r e r 6 E z n y z m y e m s

Objectives INTRODUCTION TO METABOLISM. Metabolism. Catabolic Pathways. Anabolic Pathways 3/6/2011. How to Read a Chemical Equation

schematic diagram; EGF binding, dimerization, phosphorylation, Grb2 binding, etc.

Computational Modeling of Protein Kinase A and Comparison with Nuclear Magnetic Resonance Data

Binding Free Energy Calculations for Lead Optimization: Assessment of Their Accuracy in an Industrial Drug Design Context

CE 530 Molecular Simulation

MARTINI simulation details

Computational Studies of the Photoreceptor Rhodopsin. Scott E. Feller Wabash College

Gromacs Workshop Spring CSC

Supplementary Information

Sampling the free energy surfaces of collective variables

Structural Bioinformatics (C3210) Molecular Docking

Improved Solvation Models using Boundary Integral Equations


Conformational sampling of macrocycles in solution and in the solid state

An Introduction to Metabolism

Other Cells. Hormones. Viruses. Toxins. Cell. Bacteria

Applications of the Spring. Force: molecules

2054, Chap. 8, page 1

Interpreting and evaluating biological NMR in the literature. Worksheet 1

Protein-Ligand Docking Methods

Supporting Information

Molecular Driving Forces

Principles of Physical Biochemistry

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

New approaches to scoring function design for protein-ligand binding affinities. Richard A. Friesner Columbia University

Molecular Modeling Study of Some Anthelmintic 2-phenyl Benzimidazole-1- Acetamides as β-tubulin Inhibitor

Computing Relative Free Energies of Solvation Using Single Reference Thermodynamic Integration Augmented with Hamiltonian Replica Exchange

A Brief Guide to All Atom/Coarse Grain Simulations 1.1

Advanced in silico drug design

Transcription:

Alchemical free energy calculations in OpenMM Lee-Ping Wang Stanford Department of Chemistry OpenMM Workshop, Stanford University September 7, 2012 Special thanks to: John Chodera, Morgan Lawrenz

Outline Introduction to free energy calculations Applications: Solvation and binding free energies Methods: Reaction coordinate vs. alchemical methods Accuracy and benchmarking Part I: Running the alchemical simulations Alchemical intermediates Soft-core nonbonded force Restraints Part II: Analyzing the data Multistate Bennett acceptance ratio Part III: Hydration free energy exercise

Introduction to free energy calculations Free energy differences are central to many questions in biological and pharmaceutical chemistry. How does compound X partition between different environments? e.g. How well does Lipitor partition between octanol and water? How tightly does compound X bind protein Y? How selectively? e.g. How well does clomifene bind/discriminate ERα/ERβ? How do modifications in compound X modulate affinity and selectivity? e.g. Why does selectivity of imatinib differ from that of other kinase inhibitors? How do target mutations affect binding affinities? e.g. How do clinically isolated β-lactamase resistance mutations restore ability to hydrolyze β-lactam antibiotics?

Calculation of free energies Without specialized simulation methods, free energy calculations would be impractically slow. Observing binding events from a normal MD simulation: P + L PL unbound bound time For typical drug off-rates (10-4 s -1 ), trajectories would need to be impractically long (hours), requiring ~10 9 CPU-years to simulate.

Calculation of free energies Free energy differences are calculated efficiently by simulating a path between reactant and product. Potential of Mean Force: Track the free energy along a specified reaction coordinate Alchemical: Simulate along an unphysical pathway to get free energy differences between two physical endpoints

Accuracy of free energy calculations The accuracy of a free energy calculation depends on the complexity of the problem.... hydration free energies of small neutral molecules 1.23±0.01 kcal/mol [502] (Mobley et al., in preparation) 1.33±0.05 kcal/mol [17] (Nicholls and Mobley et al., J Med Chem) solvent only small, neutral molecules fixed protonation states small apolar ligands T4 lysozyme L99A 1.89±0.04 kcal/mol [13] (Mobley and Graves et al., JMB 2007) 0.6±0.2 kcal/mol [3] (Mobley and Graves et al., JMB 2007) small, rigid protein small, neutral ligands fixed protonation states multiple sidechain orientations multiple ligand binding modes polar ligands FKBP12 1.42 kcal/mol [9] 0.94 kcal/mol [7] (Shirts et al., in preparation) small, rigid protein fixed protonation states larger drug-like ligands rotatable bond in ligands JNK3 kinase 6.3 kcal/mol [44] (Haque, Chodera, Shirts, Mobley, Pande) large protein multiple conformations large drug-like ligands, rotatable bonds in ligands multiple protonation states phosphorylation and activation peptide substrate? MgCl2 salt effects? retrospective RMS error [sample size] prospective RMS error [sample size] easy hard

Outline Introduction to free energy calculations Applications: Solvation and binding free energies Methods: Reaction coordinate vs. alchemical methods Accuracy and benchmarking Part I: Running the alchemical simulations Alchemical intermediates Soft-core nonbonded force Restraints Part II: Analyzing the data Multistate Bennett acceptance ratio Part III: Hydration free energy exercise

Alchemical free energy calculations Alchemical simulations progress through a series of intermediates between initial and final states. 8 simulations 16 simulations 5 simulations 8 simulations 16 simulations 1. Atomic charges on ligand are switched off 2. VdW interaction between ligand and water is switched off Harmonic restraints are switched on 3. VdW interaction between ligand and protein is switched on 4. Atomic charges on ligand are switched on 5. Harmonic restraints are removed Graphics from David Mobley

Alchemical free energy calculations The accuracy of alchemical simulations relies on the phase space overlap of neighboring intermediates. p(x) x Intermediate states are introduced to ensure a contiguous chain of good overlap. D. Wu and D. A. Kofke. "Phase-space overlap measures. I. Fail-safe bias detection...", J. Chem. Phys. 123: 054103 (2005).

Soft-core Lennard Jones potentials The Lennard-Jones repulsion is switched off gently, greatly improving the overlap between intermediates. Example: Using thermodynamic integration, the free energy difference is the area under the curve. Lennard-Jones is off at λ=1. Standard (linear) decoupling Soft core decoupling diverges near λ=1! T. A. Steinbrecher, D. L. Mobley, and D. A. Case. JCP 127:214108 (2007) (More efficient forms known, but all based on same idea.)

Restraint potentials Restraint potentials eliminate convergence issues related to translational / rotational motion. Alchemical intermediates with weakened interactions may encounter convergence issues from the ligand diffusing out of the binding pocket or otherwise moving around. Harmonic restraint potentials keep the ligand in place; the free energy of restrained degrees of freedom are added back in analytically. Absolute B inding Free E nergies: A Quantitative Approach for Their Calculation Boresch, S.; Tettinger, F.; Leitgeb, M.; Karplus, M. J. Phys. Chem. B.; (Article); 2003; 107(35); 9535-9551. DOI: 10.1021/jp0217839

Demonstration of alchemical intermediates Using enhanced sampling methods, multiple binding events and reorientation of the ligand are observed. [closeup] p-xylene bound to T4 lysozyme L99A

Outline Introduction to free energy calculations Applications: Solvation and binding free energies Methods: Reaction coordinate vs. alchemical methods Accuracy and benchmarking Part I: Running the alchemical simulations Alchemical intermediates Soft-core nonbonded force Restraints Part II: Analyzing the data Multistate Bennett acceptance ratio Part III: Hydration free energy exercise

Estimating free energy differences from data Not all estimators are created equal (in terms of bias and statistical efficiency). Thermodynamic integration quadrature error (bias) difficult to quantify Free energy perturbation / exponential reweighting Zwanzig RW. JCP 22:1420, 1954. Shirts MR and Pande VS. JCP 122:144107, 2005. suffers from large bias and variance Bennett acceptance ratio Bennett CH. J Comput Phys 22:245, 1976. Shirts MR, Bair E, Hooker G, and Pande VS. PRL 91:140601, 2003. only applicable to two states - can t we use all the data?

Multistate Bennett acceptance ratio (MBAR) The MBAR method provides statistically optimal estimates of free energy differences. Statistically optimal analysis of samples from multiple equilibrium states Michael R. Shirts (Department of Chemistry, Columbia University), John D. Chodera (Department of Chemistry, Stanford University) http://arxiv.org/abs/0801.1426 http://simtk.org/home/pymbar

Outline Introduction to free energy calculations Applications: Solvation and binding free energies Methods: Reaction coordinate vs. alchemical methods Accuracy and benchmarking Part I: Running the alchemical simulations Alchemical intermediates Soft-core nonbonded force Restraints Part II: Analyzing the data Multistate Bennett acceptance ratio Part III: Hydration free energy exercise

Hydration free energy exercise Hydration free energies are relatively simple to compute compared to binding free energies. gas Benefits: Test the accuracy of half of binding reaction (withdrawal from water) A simple, tractable system for studying accuracy of calculations Note: Run simulate-water.py now, it will take a few minutes water D. L. Mobley, E. Dumont, J. D. Chodera and K. A. Dill. J. Phys. Chem. B. 111:2242-2254 (2007) (DOI)

Some hydration free energy results Hydration free energies are often used for force field benchmarking. Slide by: David Rinaldo, Applications Scientist at Schrodinger

Choice of intermediates annihilation (remove interactions within solute) decoupling (keep interactions within solute) electrostatics potentially large G short correlation times requires vacuum recharging calculation smaller G longer correlation times due to bare charges no vacuum recharging calculation required 3 PME evaluations per timestep, tricky to use Lennard-Jones requires simulation in vacuum no vacuum simulation required eliminates potentially unphysical conformations Note that the literature is full of extremely confusing terminology, like double decoupling, which is actually annihilation... annihilation vs decoupling discussed in: M. R. Shirts, D. L. Mobley, and J. D. Chodera. "Alchemical free energy calculations: Ready for prime time?",annual Reports in Computational Chemistry 3:41-59 (2007).

Example calculation Perform a simple solvation free energy simulation. Example calculation 1: Hydration free energy of a water molecule Run simulate-water.py. What the script does: 1. Set up the OpenMM simulation, minimize the energy 2. Create several alchemical intermediates 3. Run a simulation for each alchemical intermediate 4. Post-process the simulation trajectories, evaluating the energy of each alchemical state over each trajectory 5. Save the data to disk Run analyze.py --ncfile output.nc 1. Run MBAR analysis on output data. 2. Get hydration free energy! ########################################################################### ### At 298.00 K, the solvation free energy is: -5.594 +- 0.514 kcal/mol ### ###########################################################################

Example calculation How about a more interesting molecule? Example calculation 2: Hydration free energy of ethanol Copy simulate-water.py to ethanol.py 1. Change the number of alchemical atoms. 2. Load the ethanol force field and PDB files. 3. Use the Modeller class to create a solvated box (or use provided solvated box). Run analyze.py --ncfile output.nc 1. Run MBAR analysis on output data. 2. Get hydration free energy! Finished? Try these: 1. Experiment with the number of alchemical intermediates, solvent box size and the simulation length. How to optimize accuracy vs. efficiency? 2. Set up your own molecule of interest using AMBER tools.