Hierarchical Metastable States and Kinetic Transition Networks: Trajectory Mapping and Clustering

Similar documents
Application of the Markov State Model to Molecular Dynamics of Biological Molecules. Speaker: Xun Sang-Ni Supervisor: Prof. Wu Dr.

Supplementary Figures:

Limitations of temperature replica exchange (T-REMD) for protein folding simulations

Using Spectral Clustering to Sample Molecular States and Pathways

The Molecular Dynamics Method

Exploring the Free Energy Surface of Short Peptides by Using Metadynamics

Advanced sampling. fluids of strongly orientation-dependent interactions (e.g., dipoles, hydrogen bonds)

Free energy simulations

METAGUI A VMD EXTENSION TO ANALYZE AND VISUALIZE METADYNAMICS SIMULATIONS

Coarse-Grained Peptide Modeling Using a Systematic Multiscale Approach

Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials

Coarse-grained Models for Oligomer-grafted Silica Nanoparticles

Free energy calculations and the potential of mean force

Equilibrium Molecular Thermodynamics from Kirkwood Sampling

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Equilibrium Molecular Thermodynamics from Kirkwood Sampling

Milestoning: Extending time scales of molecular simulations

Slow Folding of Cross-Linked r-helical Peptides Due to Steric Hindrance

(# = %(& )(* +,(- Closed system, well-defined energy (or e.g. E± E/2): Microcanonical ensemble

Classical Monte-Carlo simulations

Computing Reaction Rates Using Macro-states

Conformational Free-Energy Differences by Confinement Simulations

Computing free energy: Replica exchange

The Effect of Motional Averaging on the Calculation of NMR-Derived Structural Properties

IV. Classical Statistical Mechanics

Contents. 1 Introduction and guide for this text 1. 2 Equilibrium and entropy 6. 3 Energy and how the microscopic world works 21

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Identification and Analysis of Transition and Metastable Markov States

Methods of Computer Simulation. Molecular Dynamics and Monte Carlo

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

Statistical Thermodynamics and Monte-Carlo Evgenii B. Rudnyi and Jan G. Korvink IMTEK Albert Ludwig University Freiburg, Germany

Transition Paths of Met-Enkephalin from Markov State Modeling of a Molecular Dynamics Trajectory

Monte Carlo. Lecture 15 4/9/18. Harvard SEAS AP 275 Atomistic Modeling of Materials Boris Kozinsky

Unfolding CspB by means of biased molecular dynamics

Finding slow modes and accessing very long timescales in molecular dynamics. Frank Noé (FU Berlin)

Exploring Substitution

Multi-Ensemble Markov Models and TRAM. Fabian Paul 21-Feb-2018

Computational Studies of the Photoreceptor Rhodopsin. Scott E. Feller Wabash College

A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring

Columbia University Department of Physics QUALIFYING EXAMINATION

Collective behavior, from particles to fields

Coarse-graining with the relative entropy. M. Scott Shell * Department of Chemical Engineering, University of California Santa Barbara

Basics of Statistical Mechanics

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

1. Thermodynamics 1.1. A macroscopic view of matter

Statistical Mechanics of Granular Systems

Methods for searching and sampling configurational space


MONTE CARLO METHOD. Reference1: Smit Frenkel, Understanding molecular simulation, second edition, Academic press, 2002.

Importance Sampling in Monte Carlo Simulation of Rare Transition Events

CHEM-UA 652: Thermodynamics and Kinetics

Molecular simulation and structure prediction using CHARMM and the MMTSB Tool Set Free Energy Methods

Lecture 11: Potential Energy Functions

Supporting Information How does Darunavir prevent HIV-1 protease dimerization?

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key

EXTENDING THE BOUNDARIES OF STRUCTURAL MODELING EPIGENETIC EFFECTS ON NUCLEOSOME POSITIONING

ICCP Project 2 - Advanced Monte Carlo Methods Choose one of the three options below

Convergence of replica exchange molecular dynamics

Variational Tensor Approach for Approximating the Rare-Event Kinetics of Macromolecular Systems

Long-Range Correlated Dynamics in Intrinsically Disordered Proteins

Villa et al. (2005) Structural dynamics of the lac repressor-dna complex revealed by a multiscale simulation. PNAS 102:

Some Accelerated/Enhanced Sampling Techniques

SUPPLEMENTAL MATERIAL

Insights into the Biotransformation of 2,4,6- Trinitrotoluene by the Old Yellow Enzyme Family of Flavoproteins. A Computational Study

Cecilia Clementi s research group.

On the Asymptotic Convergence. of the Transient and Steady State Fluctuation Theorems. Gary Ayton and Denis J. Evans. Research School Of Chemistry

Coarse-Grained Models Using Local-Density Potentials Optimized. with the Relative Entropy: Application to Implicit Solvation

Basics of Statistical Mechanics

Optimization Methods via Simulation

Computing free energy: Thermodynamic perturbation and beyond

Thermodynamics at Small Scales. Signe Kjelstrup, Sondre Kvalvåg Schnell, Jean-Marc Simon, Thijs Vlugt, Dick Bedeaux

SANDRO BOTTARO, PAVEL BANÁŠ, JIŘÍ ŠPONER, AND GIOVANNI BUSSI

Statistical Mechanics and Thermodynamics of Small Systems

3.320: Lecture 19 (4/14/05) Free Energies and physical Coarse-graining. ,T) + < σ > dµ

Classical Mechanics and Statistical/Thermodynamics. August 2018

Equilibrium Distribution from Distributed Computing (Simulations of Protein Folding)

Virtual Molecular Dynamics. Indiana University Bloomington, IN Merck and Co., Inc., West Point, PA 19486

Numerical methods in molecular dynamics and multiscale problems

Modeling the Free Energy Landscape for Janus Particle Self-Assembly in the Gas Phase. Andy Long Kridsanaphong Limtragool

arxiv: v1 [physics.bio-ph] 26 Nov 2007

Exercise 2: Solvating the Structure Before you continue, follow these steps: Setting up Periodic Boundary Conditions

Statistical Mechanics for Proteins

Phase Transition in a Bond. Fluctuating Lattice Polymer

Folding of small proteins using a single continuous potential

Coupling Atomistic and Continuum Hydrodynamics

Why study protein dynamics?

9.1 System in contact with a heat reservoir

3.4 The Chain Rule. F (x) = f (g(x))g (x) Alternate way of thinking about it: If y = f(u) and u = g(x) where both are differentiable functions, then

Hierarchical Modeling of Polystyrene: From Atomistic to Coarse-Grained Simulations

Copyright 2001 University of Cambridge. Not to be quoted or copied without permission.

Molecular Dynamics Simulations

Protein Folding Prof. Eugene Shakhnovich

Anatoly B. Kolomeisky Department of Chemistry Center for Theoretical Biological Physics How to Understand Molecular Transport through Channels: The

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

Relative binding enthalpies from molecular dynamics simulations. using a direct method

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

7. Kinetics controlled by fluctuations: Kramers theory of activated processes

Protein Folding by Robotics

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

Transcription:

Hierarchical Metastable States and Kinetic Transition Networks: Trajectory Mapping and Clustering Xin Zhou xzhou@gucas.ac.cn Graduate University of Chinese Academy of Sciences, Beijing 2012.6.5 KITP, Santa Barbara 1

Multiscale modeling and simulation: Understanding systems from simulation data (Date Mining) coarse-graining enhance sampling and accelerating dynamics

Understand MD Simulation data High dimension Large number of configurations Complicate structure

Project to low dimension: states and transitions End-End distance, Radius of gyration, Root mean squared distance (RMSD) Principle Components (PC) Reaction coordinates are usually hard to know a priori (Biased) states and kinetics 4

Cluster Analysis 1. Get the distance matrix 2. clustering Hundred kinds of clustering algorithms Not very stable, have some adjusted parameters 5

Configuration Cluster Analysis (Markov Chain Model) Divide MD configurations into lots of microstates Estimate transition rate matrix among these microstates Clustering these microstates to a few macro-states Bottom-up 6 F. Noe, S. Fischer, Cur. Opin. Struc. Biol. (2008); D. Gfeller, P. DeLosRios, A. Caflisch, PNAS (2007); D. Prada-Gracia et al, PLoS Comp. Biol. (2009); F. Noe et al. PNAS (2009)

Trajectory Mapping Top-down Clustering MD trajectories to form metastable states Decrease size of data set in clustering Depress intrastate fluctuations very much but keep the interstate fluctuations Take into account the similarity on dynamics 7

Trajectory Mapping Mapping each MD trajectory to a high-dimensional vector to represent the configuration distribution in the trajectory q(t [0, ]) r v ( A 1 (q),..., A n (q) ) A (q) 1 dta (q(t)) dqa (q)p (q) 0 {A i (q)} is a set of functions of configuration (basis functions) 8

Trajectory Mapping Shape and size of metastable region may be complicate MD Trajectory reaches local equilibrium distribution inside metastable region the trajectory-mapped vectors are almost same (the center limit theorem)

A tau-length trajectory ˆ A locates inside a tau-scale meta-stable state and reach local equilibrium or transition among a few states and reach local equilibrium in each of these meta-stable states 0 1 ˆ A All trajectories inside two states are mapped in a line Trajectory-mapped vectors have much simpler geometry

Trajectory Mapping 74 atoms, charged terminals; Implicit solvent simulation: Generalized Born; 172 basis functions from torsion angles Alanine-dodeca-peptide Each 1ns-length MD trajectory is mapped to a 172d vector Clustering Four nanosecond-order metastable states are found Principle component analyse

Orthonormalized basis functions P ( x ) P ref ( x ) 1 ˆ A ( x ) P ref ( x ) A ˆ ( x ) A ˆ ( x )... 0 P ( x ) A ˆ ( x ) A ˆ ( x ) P ref ( x ) A trajectory is mapped to a n-dimensional vector q(t) P (q) r v ( ˆ A 1 (q),..., ˆ A n (q) ) Inner product between trajectories is related to their overlapping v P i o P v j dx P i ( x )P j ( x ) P ref ( x ) 1 ˆ A ( x ) P i ( x ) ˆ A ( x ) P j ( x ) 1 v i v j

f (t) P ( x ) 2 dx P ( x ) ( x x (t)) P ref ( x ) 1, x (t) S 0, otherwise P ref ( x ) c P ( x ) Calculating inner product of single configuration with meta-stable states f(t) is state-indicator curve of trajectory Transition kinetics among states

Transition kinetics L. Gong and XZ, JPCB (2010)

Five 4-us trajectories overlap partially but not completely (microsecond-order simulation is not sufficient to reach equilibrium) t2 t3 t1 Alanine-dodeca-peptide t 4 t5 v P i o P v j dx P i ( x )P j ( x ) P ref ( x ) 1 ˆ A ( x ) P i ( x ) ˆ A ( x ) P j ( x ) 1 v i v j

Folding network of Ala 12 Analysis Three levels: 200ns, 20ns, 2ns; 28 states found, accounting for more than 90% simulation data. Get meta-stable states from long to short time scales

Metastable state network of Ala 12 中间态 ( 非平衡 ) 早期部分折叠态 ( 非平衡 ) 折叠态 ( 平衡 )

Transition trajectory

Reaction Detail Derived from Folding Network (Main Chain Dihedral Angle Difference)

Summary Trajectory mapping and clustering identify metastable states in high-dimensional configuration space Metastable states are dependent on dynamics and time scale t eq / t life 1 Transition kinetics among metastable states may be achieved or be focused on (e.g. transition path sampling, string method, flux method, etc.) 22

x: Collective Variable {q 1,..., q M } p( x ) r v ( ˆ A 1 ( x),..., ˆ A n ( x) ) Difference between samples d ( v P i, v P j ) v i v j 2 dx P i ( x ) P j ( x ) 2 P ref ( x ) P ( x ) P ref ( x ) 1 ˆ A ( x ) P ref ( x ) A ˆ ( x ) A ˆ ( x ) ˆ A ( x ) 1 M A ˆ ( x ) A ˆ ( x )... 0 P ref ( x ) P ( x ) A ˆ ( x i ) Coarse-graining

Coarse-graining 1. Map CG dof : x=x(r) 2. Select effective potential formula U ( x; u ) u f ( x ) 3. Optimize parameters of U(x) by minimizing difference between U(x) and F(x) F ( x ) k B T ln dre V ( r ) ( x x (r))

Coarse-graining: Match probability density P d 2 cg ( x ) P aa ( x ) (F,U ) P ref ( x ) (g 1 ) a a Correlation among of thermodynamics variables should be removed 2 ref a A cg A aa A ( x ) A ( x ) g Pref ( x ) Covariance matrix P ref ( x ) P ( x ) P ( x) cg aa 2 d 2 (U, F ) ( a ˆ ) 2

Relationship of different CG Matching d 2 (U, F ) ( a ˆ ) 2 Corrected thermodynamics matching e U ( x ) ~ e F ( x ) U ( x) ~ F ( x) Free energy surface (or probability) matching Gradual of free energy surface matching U ( x) ~ V (r) x Force matching (with linear x a r CG transformation)

d 2 (U, F ) ( a ˆ ) 2 1,... m In principle, m must be infinite In practice, m << M, the size of sample (the improvement using more basis functions is not helpful due to larger statistical error) The upper limit of relative deviation of any <A(x)> in the coarse-graining is d(u,f) A ( x ) cg A ( x ) aa ref ( A )d (U, F )

d 2 (U, F ) ( a ˆ ) 2 one-site CG water model

Pressures in CG and AA are not the ensemble means of the same configuration function P cg ) P cg ( x ) cg P aa P ) aa (r) aa ) P cg ( x (r)) P ) aa (r) Matching probability density in extended space: d (P cg ( v x n ), P aa ( v x n )) P cg ( v s n ;V ) ~ P aa ( v s n ;V ) v s x v / L

Question: how to construct a more transferable coarsegraining model? Summary The mean of any CG-configuration function A(x) can be reconstructed in probability density matching CG Matching probability density in an individual canonical ensemble is not sufficient for reconstruction of pressure (and some another physical variables, such as E, chemical potential) The matching method is actually a reverse MC. Its transferability was not be guaranteed.

http://kitpc.itp.ac.cn/program.jsp?id=pa20130610 31 Trajectory space sampling Weighted ensemble sampling Coarse-graining methods Free energy calculations Molecular simulation of self-assembly Welcome

Thank you for your Attention! xzhou@gucas.ac.cn 32