Free energy calculations

Similar documents
Free energy simulations

Biomolecular modeling. Theoretical Chemistry, TU Braunschweig (Dated: December 10, 2010)

Free energy calculations

Free energy calculations using molecular dynamics simulations. Anna Johansson

Alchemical free energy calculations in OpenMM

Free energy calculations with alchemlyb

Free energy calculations and the potential of mean force

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

Protein Structure Analysis

MCB100A/Chem130 MidTerm Exam 2 April 4, 2013

Introduction to molecular dynamics

Lecture 19: Free Energies in Modern Computational Statistical Thermodynamics: FEP and Related Methods

Free Energy Simulation Methods

Advanced sampling. fluids of strongly orientation-dependent interactions (e.g., dipoles, hydrogen bonds)

Force fields, thermo- and barostats. Berk Hess

The Molecular Dynamics Method

22 Path Optimisation Methods

MCB100A/Chem130 MidTerm Exam 2 April 4, 2013

Molecular Modeling -- Lecture 15 Surfaces and electrostatics

Free Energy Estimation in Simulations

Aspects of nonautonomous molecular dynamics

Energy Barriers and Rates - Transition State Theory for Physicists

Computing Relative Free Energies of Solvation Using Single Reference Thermodynamic Integration Augmented with Hamiltonian Replica Exchange

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Protein Structure Analysis

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Basic Ingredients of Free Energy Calculations: A Review

Entropy and Free Energy in Biology

CHEM-UA 652: Thermodynamics and Kinetics

Nonequilibrium thermodynamics at the microscale

Computer simulation methods (1) Dr. Vania Calandrini

Multi-Ensemble Markov Models and TRAM. Fabian Paul 21-Feb-2018

MD Thermodynamics. Lecture 12 3/26/18. Harvard SEAS AP 275 Atomistic Modeling of Materials Boris Kozinsky

Basics of Statistical Mechanics

Advanced in silico drug design

Statistical Mechanics in a Nutshell

Hyeyoung Shin a, Tod A. Pascal ab, William A. Goddard III abc*, and Hyungjun Kim a* Korea

Variational Implicit Solvation of Biomolecules: From Theory to Numerical Computations

Replica exchange methodology. John Karanicolas June 2003

BIOC : Homework 1 Due 10/10

7/19/2011. Models of Solution. State of Equilibrium. State of Equilibrium Chemical Reaction

Statistical Mechanics for Proteins

7 Planar systems of linear ODE

3. Solutions W = N!/(N A!N B!) (3.1) Using Stirling s approximation ln(n!) = NlnN N: ΔS mix = k (N A lnn + N B lnn N A lnn A N B lnn B ) (3.

Supporting Information. for. Influence of Cononsolvency on the. Aggregation of Tertiary Butyl Alcohol in. Methanol-Water Mixtures

Assignment 2 Atomic-Level Molecular Modeling

Advanced Molecular Molecular Dynamics

Computer simulation methods (2) Dr. Vania Calandrini

Introduction to Fluctuation Theorems

MODEL 1: Molecule Shapes Simulation ( shapes)

Entropy and Free Energy in Biology

Beyond the Second Law of Thermodynamics

Chapter 4: Going from microcanonical to canonical ensemble, from energy to temperature.

NWChem: Molecular Dynamics and QM/MM

Limitations of temperature replica exchange (T-REMD) for protein folding simulations

Chapter 6 Chemical Reactivity and Mechanisms

Secondary structure stability, beta-sheet formation & stability

Measuring The Binding Energy Of Glucose To The Glucose/Galactose Binding Protein Computationally

When does TMAO fold a polymer chain and urea unfold it?

Isothermal experiments characterize time-dependent aggregation and unfolding

Supporting Information

Structural Bioinformatics (C3210) Molecular Mechanics

Chemical reaction networks and diffusion

Optimum protocol for fast-switching free-energy calculations

Energy functions and their relationship to molecular conformation. CS/CME/BioE/Biophys/BMI 279 Oct. 3 and 5, 2017 Ron Dror

CE 530 Molecular Simulation

Statistical thermodynamics for MD and MC simulations

Biomolecular modeling I

Hands-on : Model Potential Molecular Dynamics

Phase Equilibria and Molecular Solutions Jan G. Korvink and Evgenii Rudnyi IMTEK Albert Ludwig University Freiburg, Germany

Protein Folding experiments and theory

3.20 Exam 1 Fall 2003 SOLUTIONS

k θ (θ θ 0 ) 2 angles r i j r i j

Microscopic analysis of protein oxidative damage: effect of. carbonylation on structure, dynamics and aggregability of.

I690/B680 Structural Bioinformatics Spring Protein Structure Determination by NMR Spectroscopy

Molecular Dynamics Simulations. Dr. Noelia Faginas Lago Dipartimento di Chimica,Biologia e Biotecnologie Università di Perugia

Molecular Dynamics. A very brief introduction

Introduction to Computer Simulations of Soft Matter Methodologies and Applications Boulder July, 19-20, 2012

Molecular simulation and structure prediction using CHARMM and the MMTSB Tool Set Free Energy Methods

Why study protein dynamics?

Lots of thanks for many reasons!

Chemical Kinetics. Goal. Objectives

Computational Studies of the Photoreceptor Rhodopsin. Scott E. Feller Wabash College

Basics of Statistical Mechanics

Reaction Dynamics (2) Can we predict the rate of reactions?

So far in talking about thermodynamics, we ve mostly limited ourselves to

Orthogonal Space Sampling of Slow Environment Responses

Lecture 20. Chemical Potential

Biomolecules are dynamic no single structure is a perfect model

(a) Write down the total Hamiltonian of this system, including the spin degree of freedom of the electron, but neglecting spin-orbit interactions.

Lecture 2: Receptor-ligand binding and cooperativity

Disorder and Entropy. Disorder and Entropy

Computational Chemistry - MD Simulations

Exercise 2: Solvating the Structure Before you continue, follow these steps: Setting up Periodic Boundary Conditions

Winmostar tutorial Gromacs Interfacial Tension V X-Ability Co., Ltd. 2018/01/15

Free energy, electrostatics, and the hydrophobic effect

Thermodynamics and Kinetics

Gromacs Workshop Spring CSC

Introduction Statistical Thermodynamics. Monday, January 6, 14

Journal of Physics: Conference Series PAPER. To cite this article: Jiramate Kitjanon et al 2017 J. Phys.: Conf. Ser

Transcription:

Free energy calculations Berk Hess May 5, 2017

Why do free energy calculations? The free energy G gives the population of states: ( ) P 1 G = exp, G = G 2 G 1 P 2 k B T Since we mostly simulate in the NPT ensemble we will use the Gibbs free energy G (not the NVT Helmholtz free energy F ) A free energy difference can be split into two terms: G = H T S, H = U + P V G is less costly to calculate than U

Example: solvation free energy Measuring the probabilities by calculating distributions directly can be very inefficient: path between two states infrequently sampled barrier high narrow connection probability of the two states can differ very much (large free energy difference)

Example: solvation free energy focus on the transition process

The coupling parameter approach We add a coupling parameter λ to the Hamiltonian or potential: V = V (r, λ) V (r, 0) = V A (r), V (r, 1) = V B (r) The free energy difference between state A and B is then given by: 1 H G B (p, T ) G A (p, T ) = dλ 0 λ NPT ;λ

Free energy perturbation At a given value of λ: F (λ) = k B T log [ c ] e βv (r,λ) dr Usually impossible to calculate from simulations. But possible to calculate as a perturbation from an ensemble average: exp[ βv (r, λ + λ)]dr F (λ + λ) F (λ) = k B T log exp[ βv (r, λ)]dr = k B T log e β[v (r,λ+ λ) V (r,λ)] λ

Free energy integration Thermodynamic integration (TI) At a given value of λ: F (λ) = k B T log [ c ] e βv (r,λ) dr Usually impossible to calculate from simulations. Also possible to compute da/dλ from an ensemble average: df V dλ = λ exp[ βv (r, λ)]dr = exp[ βv (r, λ)]dr V λ λ Averages are taken over an equilibrium path using V (λ) Usually intermediate points are used. About 5-20 points required to integrate V / λ properly

Slow growth Integrate V while changing λ at every MD step λ λ Issue: no ensemble average is taken: hysteresis is likely Only use this for a simple, rough initial estimate, use free energy integration for quantitative results

Free energy of solvation solute solvent solvated solute G of solvation is often used to parametrize force fields Partitioning properties: similarly one can determine the free energy of transfer from a polar (water) to an apolar solvent (octane, cyclohexane) This is important for protein folding and peptide-membrane interactions

Partitioning free-energies water water protein membrane water

Absolute binding free energies I E G binding E I I E ΔG binding E I ΔG I-E ΔG I-solvent I ΔG transfer=0 I E E G binding = G I E G I solvent

Relative binding free energies I G3 I I G3 I E E E E G1 G2 G1 G2 G4 E E A I G4 I B G = G 1 G 2 = G 3 G 4

G between similar states When the states of interest are very similar, e.g. small changes in charge or LJ parameters, only the potential energy difference matters and one can do single step perturbation In Gromacs this can be done in two ways: or Run a simulation for state A Do mdrun -rerun traj.trr with B-state parameters Determine the potential energy difference Run a simulation for state A with also a B-state topology and get the potential energy difference from the free energy code

G between different states With frequent transitions between states and small G: simulate the system count the populations With infrequent transitions between states or large G: We can modify the Hamiltonian to gradually move the system from state A to state B The free energy difference is then given by the total work associated with changing the Hamiltonian

The coupling parameter approach The λ dependence of V (rmλ) can be chosen freely, as long as the end points match the states The simplest approach is linear interpolation: V (r, λ) = (1 λ)v A (r) + λv B (r) This works fine, except when potentials with singularities are affected (LJ, Coulomb)

The coupling parameter approach solute solvent solvated solute State A: solute fully coupled to the solvent State B: solute fully decoupled from the solvent An example approach: Run simulations at: λ = 0, 0.1,..., 0.9, 1.0

Soft-core interactions Instead of linear interpolation we use: V sc (r) = (1 λ)v A (r A ) + λv B (r B ) r A = ( ασ 6 A λp + r 6) 1 6, r B = ( ασ 6 B (1 λ)p + r 6) 1 6 V sc 6 4 2 LJ, α=0 LJ, α=1.5 LJ, α=2 3/r, α=0 3/r, α=1.5 3/r, α=2 0 2 0 1 2 3 r

Solvation free energy in practice Make a topology where: the B-state atom types of the solute have LJ parameters zero the B-state charges of the solute are zero 1) Simulate the system with solvent at several λ-values 0,...,1 2) Simulate the system in vacuum at several λ-values 0,...,1 The solvation free energy is given by: 1 V G solv = dλ 0 λ 2) 1 0 V dλ λ 1)

Solvation free energy in practice In the.mdp file set: 1) couple-moltype to the molecule you want to couple/decouple. 2) couple-lambda0 to none 3) couple-lambda1 to vdw-q The solvation free energy is given by: 1 V G solv = dλ 0 λ 2) 1 0 V dλ λ 1)

Turning off non-bonded interactions In many cases it is more efficient to also turn off all non-bonded intramolecular interactions (separate calculation of the intramolecular contribution required)

Solvation of ethanol in water 600 in H 2 O, p=1, α=0.6 in H 2 O, p=2, α=1.5 vacuum, p=1, α=0.6 <dh/dλ> (kj/mol) 400 200 0 0 0.2 0.4 0.6 0.8 1 λ

mdp file parameters free-energy = yes couple-moltype = ethanol ; these soft-core parameters make sure we never get overlapping ; particles as lambda goes to 0 sc-power = 1 sc-sigma = 0.3 sc-alpha = 1.0 ; we still want the molecule to interact with itself at lambda=0 couple-intramol = no couple-lambda1 = vdwq couple-lambda0 = none ; The list of lambda points: fep-lambdas = 0.0 0.2 0.4 0.6 0.8 0.9 1.0 ; init-lambda-state is the index of the lambda to simulate init-lambda-state = 0

Potential of mean force A potential of mean force (PMF) is the free energy along one or more degrees of freedom The name comes from a common way to derive a PMF: by integrating the mean force working on a certain degree of freedom One can also obtain a potential of mean force by Boltzmann inverting a pair correlation function g(r): PMF(r) = k B T log(g(r)) + C

Entropic effects r The phase-space volume available to the system is: 4πr 2 Thus the entropic distribution is: T S = k B T log(4π r 2 ) = 2k B T log(r) + C This result can also be obtained by integrating the centrifugal force F c (r) = 2k BT r

PMF with a coupling parameter Since constraints are part of the Hamiltonian, they can also be coupled with lambda d( λ) The mean force is given by the mean constraint force: H λ With this approach one can one make PMF for distances between atoms with a pacakge supporing bond constraints (not yet between centers of mass)

PMF between molecules Look at a distance between centers of mass of molecules With a constraint: PMF between whole molecules A harmonic spring potential can be used instead of a constraint: umbrella sampling (not discussed yet)

Special case: molecules entering membranes c dc dc

The pull code in GROMACS In GROMACS any kind of center of masses constraints or pulling using potentials can be easily set using a few options in the mdp parameter file

Reversibility?

Reversibility Apply restraints to keep the decoupled molecule in place. Artificial restraints on the protein.

Reversibility Apply restraints to keep the decoupled molecule in place. Bond Angle Torsion

Reversibility check The forward and backward path should give the same answer So one can check: G making molecule dissappear = - G make molecule appear or G pulling molecule out of a protein = - G pushing molecule into a protein

Free energy from non-equilibrium processes If you change the Hamiltonian too fast, the ensemble will lag behind the change in the Hamiltonian: work partly irreversible Total work W done on the system exceeds the reversible part F : W F In 1997 Jarzynski published the relation between the work and free energy difference for an irreversible process from λ=0 to 1: F 1 F 0 = k B T log e βw λ=0 averaged over ensemble of initial points for λ = 0 Remarkable as W is very much process dependent

All methods up till now Jarzynski s relation: F 1 F 0 = k B T log e βw λ=0 Free energy perturbation is Jarzynski in one step Free energy integration is the slow limit of Jarzynski s equation Slow growth is incorrect, as it should use Jarzynski s relation Jarzynski s relation often not of practical use, as exponential averaging is inconvenient; only works well for σ(w ) < 2 k B T

Free energy integration error For highly curved V / λ, as is often the case, the discretized integral will give a systematic error Solution: use Bennett acceptance ratio method 4 2 δv/δλ 0 Instead of V / λ, calculate: V (λ i+1 ) V (λ i ) λi and V (λ i+1 ) V (λ i ) λi+1 and from that determine F (λ i+1 ) F (λ i ) using a maximum likelyhood estimate 2 4 0 0.2 0.4 0.6 0.8 1 λ This is the method of choice

Bennett acceptance ratio In 1967 Bennett showed that for any constant C: e F C = f (β(u B U A C)) A f (β(u A U B C)) B for f (x)/f ( x) e x Optimal choices: f (x) 1 1 + e x and C F This has to be solved iteratively. A maximum likelyhood, lowest variance estimator (M. Shirts et al. PRL (2003) 91, 140601)

Other non-equilibrium methods Crooks method using many transitions between λ=0 and λ=1 from two (or more) equilibrium simulations at λ=0 and λ=1

Conclusions Free-energy methods to use: Use direct counting of states when feasible Use perturbation when statistically feasible (not often the case) Bennett acceptance ratio method is the usual method of choice Crooks can be useful in certain cases. Possibility of unphysical pathways and states give you a lot of freedom to determine free energies in efficient ways But, unphysical states might give distributions far off from the final, physical ones, which can make sampling difficult If applicable, always think about / check for reversibility

Reading Daan & Frenkel Part III.7

Exercise Run simulations of methane in water. Decouple methane stepwise. Calculate solvation free energies of methane in water using BAR. Compare output from BAR with Thermodynamic Integratation.