Energy Landscapes and Accelerated Molecular- Dynamical Techniques for the Study of Protein Folding

Similar documents
PHYSICAL REVIEW LETTERS

Limitations of temperature replica exchange (T-REMD) for protein folding simulations

Free-Energy Calculations in Protein Folding by Generalized-Ensemble Algorithms

Advanced sampling. fluids of strongly orientation-dependent interactions (e.g., dipoles, hydrogen bonds)

Multi-Ensemble Markov Models and TRAM. Fabian Paul 21-Feb-2018

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Smart walking: A new method for Boltzmann sampling of protein conformations

Javier Junquera. Statistical mechanics

Generalized-Ensemble Algorithms for Molecular Simulations of Biopolymers

Convergence of replica exchange molecular dynamics

MD Thermodynamics. Lecture 12 3/26/18. Harvard SEAS AP 275 Atomistic Modeling of Materials Boris Kozinsky

What is Classical Molecular Dynamics?

Lecture 14: Advanced Conformational Sampling

Parallel Tempering Algorithm for. Conformational Studies of Biological Molecules

Analysis of MD Results Using Statistical Mechanics Methods. Molecular Modeling

Can a continuum solvent model reproduce the free energy landscape of a β-hairpin folding in water?

Statistical thermodynamics (mechanics)

Molecular Dynamics and Accelerated Molecular Dynamics

Choosing weights for simulated tempering

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Replica Exchange with Solute Scaling: A More Efficient Version of Replica Exchange with Solute Tempering (REST2)

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key

Effective stochastic dynamics on a protein folding energy landscape

Molecular Structure Prediction by Global Optimization

Free energy simulations

Recovering Kinetics from a Simplified Protein Folding Model Using Replica Exchange Simulations: A Kinetic Network and Effective Stochastic Dynamics

Computer simulation methods (1) Dr. Vania Calandrini

Dominant Paths in Protein Folding

Calculation of point-to-point short time and rare. trajectories with boundary value formulation

Free Energy Landscape of Protein Folding in Water: Explicit vs. Implicit Solvent

Equilibrium Molecular Thermodynamics from Kirkwood Sampling

w REXAMD: A Hamiltonian Replica Exchange Approach to Improve Free Energy Calculations for Systems with Kinetically Trapped Conformations

Equilibrium Molecular Thermodynamics from Kirkwood Sampling

Evolutionary design of energy functions for protein structure prediction

Essential dynamics sampling of proteins. Tuorial 6 Neva Bešker

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

The Molecular Dynamics Method

Hydrophobic Aided Replica Exchange: an Efficient Algorithm for Protein Folding in Explicit Solvent

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics

Basics of Statistical Mechanics

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover

Efficient Parallelization of Molecular Dynamics Simulations on Hybrid CPU/GPU Supercoputers

Protein Structure Analysis

Accelerated Quantum Molecular Dynamics

Simulations with MM Force Fields. Monte Carlo (MC) and Molecular Dynamics (MD) Video II.vi

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 6 Aug 2003

Density Functional Theory: from theory to Applications

Supplementary Information Intrinsic Localized Modes in Proteins

9.3: Separable Equations

Milestoning: Extending time scales of molecular simulations

Optimization Methods via Simulation

Multicanonical parallel tempering

Lecture 8: Computer Simulations of Generalized Ensembles

Protein Folding. I. Characteristics of proteins. C α

Energetics and Thermodynamics

The computational Core of Molecular Modeling. (The What? Why? And How? )

Serial Replica Exchange

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Biological Thermodynamics

Application of Computational Modelling to Protein Folding and Aggregation Studies

Parallel Tempering Algorithm in Monte Carlo Simulation

Monte Carlo simulation of proteins through a random walk in energy space

4/18/2011. Titus Beu University Babes-Bolyai Department of Theoretical and Computational Physics Cluj-Napoca, Romania

Using Spectral Clustering to Sample Molecular States and Pathways

Application of the Markov State Model to Molecular Dynamics of Biological Molecules. Speaker: Xun Sang-Ni Supervisor: Prof. Wu Dr.

Statistical Mechanics in a Nutshell

arxiv: v1 [cond-mat.soft] 22 Oct 2007

Molecular Dynamics Simulations

Molecular Dynamics in Principal Component Space

Ab initio molecular dynamics. Simone Piccinin CNR-IOM DEMOCRITOS Trieste, Italy. Bangalore, 04 September 2014

Replica exchange methodology. John Karanicolas June 2003

Protein folding is a fundamental problem in modern structural

Temperature and Pressure Controls

Classical Monte-Carlo simulations

The Molecular Dynamics Method

Exploring the Changes in the Structure of α-helical Peptides Adsorbed onto Carbon and Boron Nitride based Nanomaterials

Lecture 11: Protein Folding & Stability

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Optimized statistical ensembles for slowly equilibrating classical and quantum systems

Estimating Accuracy in Classical Molecular Simulation

Introduction to Simulation - Lectures 17, 18. Molecular Dynamics. Nicolas Hadjiconstantinou

8.333: Statistical Mechanics I Problem Set # 3 Due: 10/18/13. Kinetic Theory

Methods of Computer Simulation. Molecular Dynamics and Monte Carlo

Monte Carlo (MC) Simulation Methods. Elisa Fadda

CHE210D Principles of Modern Molecular Simulation Methods

Organization of NAMD Tutorial Files

Sampling the conformation space of complex systems, such as

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Protein Folding In Vitro*

Rare Events. Transition state theory Bennett-Chandler Approach 16.2 Transition path sampling16.4

Monte Carlo. Lecture 15 4/9/18. Harvard SEAS AP 275 Atomistic Modeling of Materials Boris Kozinsky

Lecture 21 (11/3/17) Protein Stability, Folding, and Dynamics Hydrophobic effect drives protein folding

Exercise 2: Solvating the Structure Before you continue, follow these steps: Setting up Periodic Boundary Conditions

Protein Folding experiments and theory

Isothermal experiments characterize time-dependent aggregation and unfolding

Computer simulations of protein folding with a small number of distance restraints

Exchange frequency in replica exchange molecular dynamics

Appendix C extra - Concepts of statistical physics Cextra1-1

Transcription:

Energy Landscapes and Accelerated Molecular- Dynamical Techniques for the Study of Protein Folding John K. Prentice Boulder, CO BioMed Seminar University of New Mexico Physics and Astronomy Department November 3, 2004 1

What is a Protein? See http://www.accessexcellence.org/rc/vl/gg/prot_struct.html 2

Peptide Units Combine to Form Proteins Space Filling Model Stick Model Ribbon Model See http://www.dl.ac.uk/srs/px/special/special.html 3

The Native Structure of Proteins is Unique and Complex Bovine F1 ATP Synthase See http://www.dl.ac.uk/srs/px/special/special.html 4

So How Do Proteins Fold? Denatured state Folded (native) state? Thanks to John Chodera, Dill Biophysics Group, UCSF 5

The Protein Folding Problem The holy grail : Compute structure from sequence ab initio This involves exploring the conformational space of the protein using first-principle physics methods in order to identify the native structure. 6

The Energy Funnel José Onuchic, Workshop IIII Structural Proteomics, IPAM-UCLA, May 2004 7

A Realistic Energy Funnel Landscape Denatured state Folding Funnel Native state 8

How to Theoretically Explore Folding? The Molecular Dynamics Approach The AMBER forcefield: Thanks to John Chodera, Dill Biophysics Group, UCSF 9

Molecular Dynamics Numerically solve Newton s 2 nd law over small time increments to compute the position and momentum of the atoms in the protein over time. m i N dvi = dt = j 1 j i F i, j q = p = p H; H q 10

All-Atom MD Calculations of Folding These calculations include all atoms in the protein, implicit or explicit solvent, and account for all forces between the various atoms. Each MD timestep is no more than 2 fs (10-15 s). A 4 µs calculation would require 2 x 10 9 timesteps to simulate observed real life folding times. At 10 ms per timestep, such a simulation would require 231 single CPU days to compute a single trajectory on a modern high-end workstation. Brute force, single trajectory full-folding folding calculations are thus in general not practical at this time. 11

Statistical Molecular Dynamics Consider a protein consisting of N atoms whose instantaneous positions and momenta at time t are given by the 3N dimensional vectors r t and p ( t). The time average of a property A is defined as: 1 A dt A r t p t t τ τ = lim (), () Numerically, we calculate this as: τ 0 ( ) M 1 A A r t p t t ( ), ( ) m m M = m 1 12

Statistical Molecular Dynamics (2) Note that the ergodic hypothesis says that A t = where the ensemble average of property A is given by A A dp dr A r, p ρ r, p e = e ( ) ( ) and the probability density in the canonical (PVT) ensemble is given by E ( r, p) ρ ( r, p) = exp Q kt B 13

Folding Times are Often Dominated by the Time to Escape Kinetic Traps Low energy trajectories tend to get trapped in metastable local minima. Higher energy (temperature) trajectories avoid kinetic traps. By varying temperatures to kick a trajectory out of a kinetic trap, we can accelerate MD folding simulations. 14

The Replica Exchange Method Simulations are conducted in parallel at temperatures T, T,, T, T 1 2 Λ 1 Λ T 1 T 2 P = { ( ) } min 1,exp T Λ 1 T Λ Because each replica can undergo cycles of heating and cooling, conformation space is broadly sampled. Thanks to John Chodera, Dill Biophysics Group, UCSF 15

Details of the REM (1) ( ) ( () ) [ i Let (1) ] [ i ( Λ ) ] [] 1 [ Λ X = x ] 1,, xλ = x,, x λ 1 λ( Λ) stand for a state in this generalized ensemble. The superscript and subscript label the replica and temperature respectively. Each state X is specified by Λ sets of coordinates and momenta of N atoms in replica i(λ) at temperature T λ. We assume each replica is in the canonical (PVT) ensemble. i( λ) i( λ) i( λ) ( ), λ x r p λ 16

Details of the REM (2) The replicas are non-interacting. Therefore, the weight factor for the state X in this generalized ensemble is given by the product of the Boltzmann factors for each replica (each at a single temperature): where Λ WREM ( X) = exp β () H r, p λ i i= 1 ( [] i [] i ) Λ = exp βλ H r, λ= 1 1 β = kt B ( i( λ) i( λ) p ) 17

Details of the REM (3) Now consider exchanging a pair of replicas in the generalized ensemble. Specifically, exchange the temperatures between replicas i and j. Thus: ( ) ( [] i [ j] ) [ j] [ i] λ γ λ γ X =, x,, x, X =, x,, x, where ( ) [] [] [] x r p x r p [ ] [ ] [ ] x r p x r p ( ) [] [] [], λ, i i i i i i λ λ ( ) ( ) [ ] [ ] [ ], γ, j j j j j j γ γ γ λ 18

Details of the REM (4) The post-exchange momenta are defined by: p p [] i γ [] i [ j] [ j] This specific scaling of momenta correctly rescales the kinetic energy to be consistent with the equipartition of energy theorem K p ( ) where K is the kinetic energy. T T T λ T T λ γ p p N 2 pk 3 = = Nk T B 2m 2 k = 1 k T 19

Details of the REM (5) In order for this exchange process to converge toward an equilibrium distribution in the canonical ensemble (for each temperature), it is sufficient to impose the detailed balance condition on the transition probability w( X X ): W X w X X = W X w X X REM ( ) ( ) ( ) ( ) REM This can be satisfied by the usual Metropolis criterion: [] [ ] ( ) ( ) i j 1 for 0 w X X w xλ xγ = exp( ) otherwise where U is the potential energy and = βγ β λ ( ( []) ( [ ])) i j U r U r 20

Details of the REM (6) The canonical expectation value of a quantity A at temperature T λ λ = 1,, Λ can then be calculated by ( ) 1 M f m m= 1 ( ( λ, ) ( )) λ m A = A x t Tλ M Expectations values for intermediate temperatures can be obtained using the multiple-histogram reweighting technique (WHAM). 21

The Replica Exchange Method Simulations are conducted in parallel at temperatures T, T,, T, T 1 2 Λ 1 Λ T 1 T 2 P = { ( ) } min 1,exp T Λ 1 T Λ Because each replica can undergo cycles of heating and cooling, conformation space is broadly sampled. Thanks to John Chodera, Dill Biophysics Group, UCSF 22

Advantages of REM Exchanging temperatures kicks replicas out of kinetic traps and accelerates the simulated folding process. Each replica calculation can be done on a different processor on a parallel computer. There is little interprocessor communication required, so the problem is virtually embarrassingly parallel. Thus, on a parallel computer with sufficient processors, there is no wall clock penalty for the additional work required in REM while the simulation time to reach the native state is reduced because the sampling of phase space is improved. 23

Results for Met-enkephallin Time series of temperature exchange and total potential energy for one replica in an 8 replica REM calculation of Met-enkephalin (Tyr-Gly-Gly-Phe-Met). Yuji Sugita and Yuko Okamoto, Chemical Physics Letters 314 (1999) 141-151 24

Results for Met-enkephallin Yuji Sugita and Yuko Okamoto, Chemical Physics Letters 314 (1999) 141-151 25

MREM Method Multiplexed REM (MREM) is closely related to REM, but uses multiple replicas at the same temperature in addition to replicas at different temperatures. This improves sampling and further reduces folding times. Rhee and Pande performed MREM calculations for the 23 amino acid model protein BBA5. The calculations used 20 temperatures and 200 replicas per temperature. The MREM calculations reached the native state approximately 10 times faster than a constant temperature molecular dynamics calculation. Rhee and Pande, Biophysical Journal 84 (2003) 775-786 26

Results for BBA5 Using MREM Rhee and Pande, Biophysical Journal 84 (2003) 775-786 27

Results for BBA5 Using MREM Rhee and Pande, Biophysical Journal 84 (2003) 775-786 28

Results for BBA5 Using MREM Rhee and Pande, Biophysical Journal 84 (2003) 775-786 29

Another Way to Accelerate Folding t 1/ k Note that transition rates merely reflect the average time required for a transition. Some trajectories will transition faster than others and some slower. The parallel replica method exploits this as another method for accelerating simulations. 30

The Parallel Replica Method Pande, et. al., Biopolymers, vol 68, 2003. Consider N identical conformation replicas at the same temperature, but with randomized velocities. The probability that a particular simulation has crossed the barrier is: P t k kt () 1 = exp( ) For M simulations, the probability of the first stimulation crossing the barrier is t PM t = MP t 1 dτp = Mk exp Mkt () () ( ) 1 1 0 ( ) τ M 1 31

Folding@Home Parallel replica requires infrequent exchange of information between replicas. It is therefore nearly embarrassingly parallel. See http://folding.stanford.edu Folding@Home distributes the computation of each replica to individual PCs across the World Wide Web. To date, over 1 million computers have participated in folding@home. This has made possible the calculation of realistic complete folds for moderate sized proteins out to the equivalent of milliseconds. 32

Conclusions Folding rates are dominated by time spent in kinetic traps. In silico brute-force folding of proteins of biological interest via a single molecular dynamics trajectory is not in general feasible with current computers. The replica exchange and parallel replica techniques avoid this problem by artificially minimizing the time spent in kinetic traps (at the expense of destroying information about the kinetics). Speedups of 10x in calculation of native structures have been realized for MREM compared to constant temperature MD. Speedups of several orders of magnitude for calculation of native structures have been realized using parallel replica. 33

Acknowledgements I would like to acknowledge the invaluable assistance in preparing this talk of John Chodera and Ken Dill of the Dill Biophysics Group, UCSF. 34