DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Similar documents
PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

Potential Energy (hyper)surface

Supporting Online Material for

The Molecular Dynamics Method

Docking. GBCB 5874: Problem Solving in GBCB

Kd = koff/kon = [R][L]/[RL]

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Supplementary Methods

An introduction to Molecular Dynamics. EMBO, June 2016

Toward an Understanding of GPCR-ligand Interactions. Alexander Heifetz

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

Protein sidechain conformer prediction: a test of the energy function Robert J Petrella 1, Themis Lazaridis 1 and Martin Karplus 1,2

The Molecular Dynamics Method

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

Protein Folding by Robotics

Multi-Scale Hierarchical Structure Prediction of Helical Transmembrane Proteins

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

User Guide for LeDock

Structural Bioinformatics (C3210) Molecular Mechanics

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

FlexPepDock In a nutshell

Simulating Folding of Helical Proteins with Coarse Grained Models

Molecular Dynamics, Monte Carlo and Docking. Lecture 21. Introduction to Bioinformatics MNW2

Molecular modeling with InsightII

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Fragment Hotspot Maps: A CSD-derived Method for Hotspot identification

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Can a continuum solvent model reproduce the free energy landscape of a β-hairpin folding in water?

Why study protein dynamics?

Tutorial: Structural Analysis of a Protein-Protein Complex

Molecular Mechanics, Dynamics & Docking

Supplementary Information

4 th Advanced in silico Drug Design KFC/ADD Molecular Modelling Intro. Karel Berka, Ph.D.

Other Cells. Hormones. Viruses. Toxins. Cell. Bacteria

Multiscale Materials Modeling

Assignment 2 Atomic-Level Molecular Modeling

Structural Insights from Molecular Dynamics. Simulations of Tryptophan 7-Halogenase and

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Unfolding CspB by means of biased molecular dynamics

β 2 -microglobulin fibrillation?

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Structural Bioinformatics (C3210) Molecular Docking

The Molecular Dynamics Simulation Process

Protein Folding experiments and theory

Introduction to Comparative Protein Modeling. Chapter 4 Part I

The Molecular Dynamics Method

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

Protein Dynamics, Allostery and Function

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

The Computational Microscope

Computational Chemistry in Drug Design. Xavier Fradera Barcelona, 17/4/2007

Flexibility and Constraints in GOLD

Structure Investigation of Fam20C, a Golgi Casein Kinase

Life Science Webinar Series

Ligand Scout Tutorials

Advanced Molecular Dynamics

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Molecular dynamics simulation of Aquaporin-1. 4 nm

Autodock tutorial VINA with UCSF Chimera

Protein Dynamics, Allostery and Function

Introduction to Structure Preparation and Visualization

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

MULTIDOCK V1.0. Richard M. Jackson Biomolecular Modelling Laboratory Imperial Cancer Research Fund 44 Lincoln s Inn Fields London WC2A 3PX

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Molecular Dynamics Simulation of HIV-1 Reverse. Transcriptase

DOCKING TUTORIAL. A. The docking Workflow

Lec.1 Chemistry Of Water

Portal. User Guide Version 1.0. Contributors

Using Phase for Pharmacophore Modelling. 5th European Life Science Bootcamp March, 2017

Multiscale simulations of complex fluid rheology

Solutions and Non-Covalent Binding Forces

Analysis of the simulation

Gromacs Workshop Spring CSC

Ab-initio protein structure prediction

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Exercise 2: Solvating the Structure Before you continue, follow these steps: Setting up Periodic Boundary Conditions

schematic diagram; EGF binding, dimerization, phosphorylation, Grb2 binding, etc.

Modeling Biological Systems Opportunities for Computer Scientists

FDS: Flexible Ligand and Receptor Docking with a Continuum Solvent Model and Soft-Core Energy Function

HIV-1 TAR RNA Spontaneously Undergoes Relevant Apo-to-Holo Conformational Transitions in Molecular Dynamics and Constrained Geometrical Simulations

The protein folding problem consists of two parts:

SCREENED CHARGE ELECTROSTATIC MODEL IN PROTEIN-PROTEIN DOCKING SIMULATIONS

SUPPLEMENTARY INFORMATION An Empirical IR Frequency Map for Ester C=O Stretching Vibrations

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Citation for published version (APA): Feenstra, K. A. (2002). Long term dynamics of proteins and peptides. Groningen: s.n.

Molecular Dynamics. Coherency of Molecular Motions. Local Minima & Global Minimum. Theoretical Basis of Molecular Dynamic Calculations

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

MD Simulation in Pose Refinement and Scoring Using AMBER Workflows

Coarse-Grained Peptide Modeling Using a Systematic Multiscale Approach

Homework Problem Set 4 Solutions

Transcription:

DISCRETE TUTORIAL Agustí Emperador Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING: STRUCTURAL REFINEMENT OF DOCKING CONFORMATIONS Emperador et al., J. Chem. Theory Comput. 9, 1 (013) Files at mmb.pcb.ub.es/~agusti/scalalife/cecam http://www.scalalife.eu/content/discrete-1

Inputs for DISCRETE: 1- Topology file: chemical structure of the molecules (bonds, angles, dihedrals) - Coordinates file These are generated by the DMDsetup program, whose inputs are: 1- PDB file of the protein structures - Library of aminoacids (dmd_reslibrary.dat) 3- Interaction potential parameters (dmd_potentials.dat) Two structures: receptor (same coordinates for all docking conformations) and ligand (choose one). Inputs of DMDsetup to prepare the simulation.

THE PROTEIN-PROTEIN DOCKING PROBLEM Where is the interface? To have a fast prediction method: consider proteins as rigid bodies, generate many docking conformations and score them. Sampling in a 6D space (receptor fixed, translation (3) + rotation (3) of the ligand). Each rigid docking pose minimized Rigid protein-protein docking poses ranked with scoring function: 1Z0K.pydock Sources of error: 1- Insufficient sampling of rigid docking conformations (rugged free energy landscape) - Inaccuracies in the scoring function (that predicts the free energy, not the potential energy) 3- Ignoring protein flexibility (proteins are not rigid and can undergo conformational changes upon binding)

FOCUSING ON PROTEIN FLEXIBILITY: STRUCTURAL REFINEMENT Receptor and ligand structures may change upon binding. This should affect the binding energy. Structural changes in the interface during the simulation are expected to approach the experimental interface and improve the binding energy of the near-native poses We run a discrete molecular dynamics (DMD) simulation for each rigid docking pose. As starting point for the DMD simulations, we choose the 100 top-ranked configurations (out of 10000 generated with FTDOCK) Ranking of docking poses with pydock, a highly optimized scoring function. (Cheng et al., Proteins 68 503 (007)) We use a multiscale representation of the protein: all the atoms in the interface, only CA elsewhere. Only the atoms at the interface contribute to the scoring. Less particles makes the simulation much faster.

MULTISCALE MODEL OF THE COMPLEX Only interactions in the interface region have to be considered, Because we compute the binding energy of receptor and ligand Potential energy function: 1-Van der Waals interaction -Solvation term (EEF1 potential, Lazaridis and Karplus, Proteins 35, 133 (1999)) 3-Electrostatic potential between atoms in acceptor and donor sidechains Interface: residues with some atom of the other protein within 8 A Layer surrounding the interface: residues with some atom of the other protein within 1 A. These atoms are frozen, but interact with the atoms at the interface. They reconstruct the environment of the interface in the full atomistic model. Made by DMDSetup

Standard molecular dynamics (MD) Integrate the equations of motion at each timestep ( t = fs) F = dv = ma dr a= dv dt v= dr dt 500.000.000 integrations needed to generate 1 µs of trajectory! V = Ebonded + Enon bonded Discrete molecular dynamics (DMD) Particles move with constant velocity ri (t + tc ) = ri (t ) + vi (t )tc until a collision occurs Transfer of linear momentum upon a collision mi vi = mi vi '+ p m j v j + p = m j v j ' The velocities of particles i, j change No need to integrate equations of motion

DISCRETE MOLECULAR DYNAMICS FUNDAMENTALS No forces should be calculated: very fast, because there is no need to integrate the equations of motion Coarse-grained representation of the interaction potentials: In the limit of many potential steps, same results a a standard MD simulation Same accuracy as standard MD when the interaction potentials are not well determined (coarse-grained models of proteins) Timestep: time between an event and the next one. Much smaller than the timestep in a conventional MD (collisions very frequent). For a higher number of particles, the collision frequency is higher and the timestep if shorter. It is very important to keep low the number of particles per processor The collision frequency increases with the number of potential steps. It is convenient to use a minimum number of steps Numerical effort spent in calculating the collision times between all the particles. Speed multiplied with efficient algorithms Lifetime of DISCRETE: 4 years. Developed exclusively at the IRB (Barcelona)

Discrete molecular dynamics Conservation of linear momentum Conservation of energy when entering a region with different potential energy (Emperador et al, Proteins 78 83 (010)) More steps = more events mi v i +m j v j =m i v i '+m j v j ' 1 1 1 1 mi v i + m j v j = mi v i ' + m j v j ' +ΔV Transferred linear momentum Δp= mi m j mi +m j { m i +m j ( v j v i ) m m ΔV ( v j v i ) i j If V>0, the particles can overcome the potential step as long as ΔV< } m1 m ( v v i ) ( m 1 +m ) j Otherwise, the particles keep inside the well and (not trajectories) Δp= m i m j m i +m j ( v i v j )

MOLECULAR STRUCTURE IN DMD We use an united atom representation (only heavy atoms) In DMD all the potentials depend on particle-particle distances (two-body potentials) It is not possible to define forces depending on angles of dihedrals. Pseudobonds have to be used to fix bond angles and dihedrals Hydrogen bonds (HB) are dipole-dipole interactions, therefore dependent on the angle. Pseudobonds are used to keep the correct angle. LYS-ALA

INTERACTION POTENTIALS Solvent considered implicitly via a solvation term Interactions considered (nonbonded potential energy terms) 1- Van der Waals - Electrostatic 3- Solvation. EEF1 potential energy function. Parametrized for the united atom representation Lazaridis and Karplus, Proteins 35, 133 (1999) Secondary structure is fixed (otherwise would be lost due to the simplicity of the two-step potentials; minimum number of steps to increase speed)

Nonbonded part: the Lazaridis-Karplus EEF1 effective energy function V r

DMD representation of the interaction potential terms We want to use the minimum number of steps to reduce the number of events (faster simulation) Hydrophobic interaction Hydrophilic interaction Electrostatic interaction (opposite charge) Electrostatic interaction (same charge)

ENERGY OF THE DOCKING CONFIGURATIONS We run molecular dynamics with the DMD method for each docking configuration. The binding energy is obtained from the DMD interaction potential V = VVdW +VSolv +VCoul The energy varies along the simulation, and reaches a stationary average value after ns of trajectory. We compute the average of each potential energy term over the last 0.5 ns of simulation, and use these averages to score each conformation. Computing time: 0.5 h for 1000 particles

The efficiency of a docking method is tested with a benchmark of known complexes. We have chosen the Weng s benchark 4.0 (Proteins 78, 3311 (010)) We have studied 61 complexes of this benchmark for which pydock gives a near native solution (interface RMSD < 4 A respect to experimental complex) between the first 100 top-ranked configurations. Ranking made with pydock Rigid docking DMD refinement Plots of score vs RMSD for each docking pose, obtained in rigid doking (left) with DMD (right) In the right panel, binding energies before and after DMD relaxation Ideal result: funnel-like distribution, near native conformations with the best score (lower energy)

PERFORMANCE OF THE DMD REFINEMENT The performance of the refinement method over the whole bencharkis evaluated with the success rate: For how many complexes of the benchmark is found a near native solution between the N top ranked docking poses? pydock (rigid structures) DMD potential (relaxed structures) DMD potential (rigid structures) The flexibility included via DMD has an effect equivalent to the optimization of the scoring function (improvement from green to red curve).

DEPENDENCE WITH THE DEFORMATION UPON BINDING DEFORMATION < 1 A DEFORMATION > 1 A Complexes with low deformation upon binding: rigid docking scored with pydock and DMD show the same performance because deformation does not affect seriously the interaction energy Complexes with high deformation upon binding: backbone movements are relevant and modify the binding energy between receptor and ligand, improving the success rates Flexible protein-protein docking with DMD. Agustí Emperador

RIGID DOCKING vs FLEXIBLE DOCKING At TOP10, the performance of the DMD method (right panel) is virtually the same for 'rigid' and 'flexible' complexes Unlike rigid docking based methods, the performance of flexible docking is not seriously affected by conformational changes upon binding. This is due to the amplitude of the structural changes during the DMD trajectory. Flexible protein-protein docking with DMD. Agustí Emperador