Protein Structure Analysis

Similar documents
Protein Structure Analysis

The Molecular Dynamics Method

Potential Energy (hyper)surface

Molecular dynamics simulation of Aquaporin-1. 4 nm

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Energetics and Thermodynamics

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

Modeling Biological Systems Opportunities for Computer Scientists

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

The Molecular Dynamics Method

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Atomistic Modeling of Small-Angle Scattering Data Using SASSIE-web

Lecture 11: Protein Folding & Stability

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Molecular Dynamics Simulations. Dr. Noelia Faginas Lago Dipartimento di Chimica,Biologia e Biotecnologie Università di Perugia

Biomolecules are dynamic no single structure is a perfect model

Course Introduction. SASSIE CCP-SAS Workshop. January 23-25, ISIS Neutron and Muon Source Rutherford Appleton Laboratory UK

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

MD Thermodynamics. Lecture 12 3/26/18. Harvard SEAS AP 275 Atomistic Modeling of Materials Boris Kozinsky

Methods of Computer Simulation. Molecular Dynamics and Monte Carlo

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

CAP 5510 Lecture 3 Protein Structures

= (-22) = +2kJ /mol

Simulation of molecular systems by molecular dynamics

Introduction to Computational Structural Biology

An introduction to Molecular Dynamics. EMBO, June 2016

Protein Folding. I. Characteristics of proteins. C α

Protein Structures. 11/19/2002 Lecture 24 1

Protein folding. Today s Outline

Dominant Paths in Protein Folding

Paul Sigler et al, 1998.

Modeling Background; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 8

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Analysis of the simulation

Hyeyoung Shin a, Tod A. Pascal ab, William A. Goddard III abc*, and Hyungjun Kim a* Korea

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Principles and Applications of Molecular Dynamics Simulations with NAMD

Advanced in silico drug design

Multiscale Materials Modeling

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila

Don t forget to bring your MD tutorial. Potential Energy (hyper)surface

Exercise 2: Solvating the Structure Before you continue, follow these steps: Setting up Periodic Boundary Conditions


arxiv: v1 [cond-mat.soft] 22 Oct 2007

Molecular Mechanics. I. Quantum mechanical treatment of molecular systems

Introduction to" Protein Structure

Investigation of physiochemical interactions in

Free energy calculations

The protein folding problem consists of two parts:

Molecular Dynamics, Monte Carlo and Docking. Lecture 21. Introduction to Bioinformatics MNW2

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

SUPPLEMENTARY INFORMATION

Villa et al. (2005) Structural dynamics of the lac repressor-dna complex revealed by a multiscale simulation. PNAS 102:

Unfolding CspB by means of biased molecular dynamics

Computational Chemistry - MD Simulations

CHEM-UA 652: Thermodynamics and Kinetics

A Brief Guide to All Atom/Coarse Grain Simulations 1.1

Supporting Information. The hinge region strengthens the nonspecific interaction. between lac-repressor and DNA: a computer simulation.

BIOCHEMISTRY Unit 2 Part 4 ACTIVITY #6 (Chapter 5) PROTEINS

Molecular dynamics simulations of a single stranded (ss) DNA

Molecular dynamics simulation. CS/CME/BioE/Biophys/BMI 279 Oct. 5 and 10, 2017 Ron Dror

Introduction to Comparative Protein Modeling. Chapter 4 Part I

The Molecular Dynamics Method

What can be learnt from MD

What is Classical Molecular Dynamics?

Routine access to millisecond timescale events with accelerated molecular dynamics

Why study protein dynamics?

A Molecular Dynamics Simulation of a Homogeneous Organic-Inorganic Hybrid Silica Membrane

Introduction to molecular dynamics

Essential dynamics sampling of proteins. Tuorial 6 Neva Bešker

Analysis of MD trajectories in GROMACS David van der Spoel

Introduction to Classical Molecular Dynamics. Giovanni Chillemi HPC department, CINECA

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein Modeling Methods. Knowledge. Protein Modeling Methods. Fold Recognition. Knowledge-based methods. Introduction to Bioinformatics

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation

Supplementary Figures:

Water Dynamics in Protein Hydration Shells: The Molecular Origins of the Dynamical Perturbation

2008 Biowerkzeug Ltd.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Multimedia : Fibronectin and Titin unfolding simulation movies.

DYNAMICS OF PROTEINS: ELEMENTS AND FUNCTION

How is molecular dynamics being used in life sciences? Davide Branduardi

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Organization of NAMD Tutorial Files

Molecular Dynamics Simulation of a Nanoconfined Water Film

Advanced Molecular Dynamics

Principles of Physical Biochemistry

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Chemistry 431. Lecture 27 The Ensemble Partition Function Statistical Thermodynamics. NC State University

Energy Barriers and Rates - Transition State Theory for Physicists

Energy Landscapes and Accelerated Molecular- Dynamical Techniques for the Study of Protein Folding

Entropy and Free Energy in Biology

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Mechanical Proteins. Stretching imunoglobulin and fibronectin. domains of the muscle protein titin. Adhesion Proteins of the Immune System

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Gerd Krause, Structural bioinformatics and protein design Leibniz-Institute of molecular Pharmacology

Physiochemical Properties of Residues

Simulation environment for Life Sciences

Transcription:

BINF 731 Protein Modeling Methods Protein Structure Analysis Iosif Vaisman Ab initio methods: solution of a protein folding problem search in conformational space Energy-based methods: energy minimization molecular simulation Knowledge-based methods: homology modeling fold recogniion 2015 HP Lattice Models Free energy surface in protein simulation Folding pathways Chan & Dill, 1998 Energy Minimazation Energy Minimazation

Molecular dynamics Molecular dynamics Twenty energy minimized structures taken at 5-ps intervals along a 100-ps molecular dynamics trajectory. YSNSG cyclopeptide as observed along the 20 ns molecular dynamics trajectory (Thevenard et al., 2006) Bubb et al., 1999 Molecular Dynamics Potential Energy Function and Force Field Model system Initial conditions Boundary conditions Integration algorithm Constraints Ensemble Results Force Field Development and Parametrization Molecular Dynamics F i =m i a i a i = dv i / dt v i = dr i / dt - de / dr i = F i - de / dr i = m i d 2 r i / dt 2 L.-P. Wang et al., 2014

Periodic Boundary Conditions Periodic Boundary Conditions J. Jensen, animation Adopted from D.van der Spoel et al. (2005) MD cycle and integration algorithm Calculate potential energy Calculate force at each atom Calculate acceleration of each atom Calculate velocity of each atom Calculate new atomic coordinates Characteristic Time Scales for Protein Motions event spatial extent (nm) amplitude (nm) time (s) appropriate simulations bond-length vibration 0.2 0.5 0.001 0.01 10-14 10-13 QM methods elastic vibration of globular domain 1.0 2.0 0.005 0.05 10-12 10-11 conventional MD rotation of solvent-exposed side chains 0.5 1.0 0.5 1.0 10-11 10-10 conventional MD torsional libration of buried groups 0.5 1.0 0.05 10-11 10-9 conventional MD hinge bending (relative motion of globular domains) 1.0 2.0 0.1 0.5 10-11 10-7 Langevin dynamics, enhanced sampling MD methods? rotation of buried side chains 0.5 0.5 10-4 1 enhanced sampling MD methods? allosteric transitions 0.5 4.0 0.1 0.5 10-5 1 enhanced sampling MD methods? local denaturation 0.5 1.0 0.5 1.0 10-5 10 1 enhanced sampling MD methods? loop motions 1.0 5.0 1.0 5.0 10-9 10-5 Brownian dynamics? rigid-body (helix) motions 1.0 5.0 10-9 10-6 enhanced sampling MD methods? helix coil transitions >5.0 10-7 10 4 enhanced sampling MD methods? protein association 1.0 Brownian dynamics S. A. Adcock and J. A. McCammon, 2006 MD Ensemble Microcanonical ensemble (NVE) : The thermodynamic state characterized by a fixed number of atoms, N, a fixed volume, V, and a fixed energy, E. This corresponds to an isolated system. Canonical Ensemble (NVT): This is a collection of all systems whose thermodynamic state is characterized by a fixed number of atoms, N, a fixed volume, V, and a fixed temperature, T. Isobaric-Isothermal Ensemble (NPT): This ensemble is characterized by a fixed number of atoms, N, a fixed pressure, P, and a fixed temperature, T. Grand canonical Ensemble (mvt): The thermodynamic state for this ensemble is characterized by a fixed chemical potential, m, a fixed volume, V, and a fixed temperature, T. Temperature in molecular dynamics U = m v NkT kin 1 3 2 = i i 2 2 N - number of atoms k - Boltzmann constant T - absolute temperatur e

MD of proteins: Solvent model MD of proteins: radial distribution functions MD simulation of ubiquitin (400 ps) g(r) = N(r) N i (r) = N(r) (r x 4p r 2 dr) Adopted from V.Daggett (1999) MD of proteins: radial distribution functions MD of proteins: mobile regions O N C Water O-O rdf Adopted from V.Daggett (1999) Protein-water rdf Snapshots of V H domain simulation at 300 and 340 K Adopted from W.F.VanGunsteren (2001) MD of proteins: Conformational change MD of proteins: scale of simulation A set of structures on the reaction path showing the behavior of a single subunit (apical domain, green; intermediate domain, yellow; equatorial domain, red; and ATP, blue). Structures 1 3 correspond to the first stages associated with ATP binding; 4 6 correspond to the second stage involving GroES binding. The early downward motion of the intermediate domain (compare [1,t] with [2] and [6,r'']) is the trigger for the overall transition. 1997 2005 estrogen receptor DNA-binding domain 36,000 atoms 100 ps LacI-DNA complex 314,000 atoms 10 ns Adopted from J.C.Phillips et al. (2005)

2014 1999 MD of proteins: long runs MD of proteins: long runs 1 microsecond simulation of villin Adopted from I.D.Kuntz and P.Kollman (2001) Adopted from J.C.Phillips et al. (2005) MD of proteins: performance MD: Reversible folding of peptides Simulation setup Performance, ps/day System FF virth Water Coulomb LJ ia32 x86-64 ppc Vil G no TIP3P cutoff 0.8 cutoff 0.8 9744 9574 14,385 3 6 Vil G yes TIP3P cutoff 0.8 cutoff 0.8 16,900 16,895 23,681 Vil G yes TIP3P RF 1.0 cutoff 1.0 10,308 9719 12,934 System (PDB ID) Number of atoms *DHFR (5DFR) 23,558 17.4 Approximate performance (ms/machine-day) 6 7 asfp (1SFP) 48,423 11.7 FtsZ (1FSZ) 98,236 5.7 T7Lig (1A01) 116,650 5.5 8 10 bilap (1BPM) 132,362 4.8 Adopted from W.F.VanGunsteren (2001) MD: Reversible folding of small proteins MD: Reversible folding of small proteins Formation of topology, native contacts, and secondary structure during protein folding. K. Lindorff-Larsen et al., 2011 (A) The three panels show the accumulation of native secondary structure, nonlocal native contacts, and native topology during a single folding event for α3d (B) The 24 transitions of α3d in a scatter plot are represented, with each of the black points corresponding to the time series integral for a single folding event (C) Each point shows the average value over all folding and unfolding events observed for one protein. Each point is labeled with the PDB code of the relevant protein. Most proteins fall below the diagonal in these plots, showing that topology and secondary structure develop earlier than the full set of native contacts. K. Lindorff-Larsen et al., 2011