Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation

Similar documents
Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Can a continuum solvent model reproduce the free energy landscape of a β-hairpin folding in water?

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Aggregation of the Amyloid-β Protein: Monte Carlo Optimization Study

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function. The importance of proteins

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

The protein folding problem consists of two parts:

From Amino Acids to Proteins - in 4 Easy Steps

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

BCMP 201 Protein biochemistry

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

Free Energy Landscape of Protein Folding in Water: Explicit vs. Implicit Solvent

Scuola di Chimica Computazionale

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

Presenter: She Zhang

Limitations of temperature replica exchange (T-REMD) for protein folding simulations

Introduction The gramicidin A (ga) channel forms by head-to-head association of two monomers at their amino termini, one from each bilayer leaflet. Th

Free energy, electrostatics, and the hydrophobic effect

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Supporting Online Material for

Protein Folding experiments and theory

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Central Dogma. modifications genome transcriptome proteome

Potential Energy (hyper)surface

Molecular Mechanics. I. Quantum mechanical treatment of molecular systems

Introduction to" Protein Structure

Close agreement between the orientation dependence of hydrogen bonds observed in protein structures and quantum mechanical calculations

Peptide folding in non-aqueous environments investigated with molecular dynamics simulations Soto Becerra, Patricia

Computational Biology & Computational Medicine

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

BIBC 100. Structural Biochemistry

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Biochemistry 530: Introduction to Structural Biology. Autumn Quarter 2014 BIOC 530

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

The Molecular Dynamics Method

Lecture 11: Potential Energy Functions

Why study protein dynamics?

BIOC : Homework 1 Due 10/10

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

arxiv: v1 [cond-mat.soft] 22 Oct 2007

Water models in classical simulations

Ab-initio protein structure prediction

Physiochemical Properties of Residues

Lecture 11: Protein Folding & Stability

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Protein Structure Basics

Dana Alsulaibi. Jaleel G.Sweis. Mamoon Ahram

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Free Radical-Initiated Unfolding of Peptide Secondary Structure Elements

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Basics of protein structure

Multi-scale approaches in description and design of enzymes

Applications of Molecular Dynamics

All-atom Molecular Mechanics. Trent E. Balius AMS 535 / CHE /27/2010

Assignment 2 Atomic-Level Molecular Modeling

Biomolecules: lecture 10

BIOC 530 Fall, 2011 BIOC 530

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Hyeyoung Shin a, Tod A. Pascal ab, William A. Goddard III abc*, and Hyungjun Kim a* Korea

Lecture 26: Polymers: DNA Packing and Protein folding 26.1 Problem Set 4 due today. Reading for Lectures 22 24: PKT Chapter 8 [ ].

Modeling Biological Systems Opportunities for Computer Scientists

Structural Bioinformatics (C3210) Molecular Mechanics

Protein structure and folding

Essential Forces in Protein Folding

Nanobiotechnology. Place: IOP 1 st Meeting Room Time: 9:30-12:00. Reference: Review Papers. Grade: 40% midterm, 60% final report (oral + written)

Solutions and Non-Covalent Binding Forces

k θ (θ θ 0 ) 2 angles r i j r i j

Protein Structure Bioinformatics Introduction

Introduction to Computational Structural Biology

RESEARCH ARTICLES. Effective Energy Function for Proteins in Solution

Discrimination of Near-Native Protein Structures From Misfolded Models by Empirical Free Energy Functions

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

2008 Biowerkzeug Ltd.

PROTEIN STRUCTURE AMINO ACIDS H R. Zwitterion (dipolar ion) CO 2 H. PEPTIDES Formal reactions showing formation of peptide bond by dehydration:

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

Biochemistry - I SPRING Mondays and Wednesdays 9:30-10:45 AM (MR-1307) Lectures 3-4. Based on Profs. Kevin Gardner & Reza Khayat

Announcements. Primary (1 ) Structure. Lecture 7 & 8: PROTEIN ARCHITECTURE IV: Tertiary and Quaternary Structure

ALL LECTURES IN SB Introduction

Unfolding CspB by means of biased molecular dynamics

Protein Folding by Robotics

Modeling the Free Energy of Polypeptides in Different Environments

Section Week 3. Junaid Malek, M.D.

Conformational sampling of macrocycles in solution and in the solid state

CAP 5510 Lecture 3 Protein Structures

arxiv: v1 [physics.bio-ph] 26 Nov 2007

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Biochemistry: Concepts and Connections

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds

Energy landscapes of model polyalanines

Charged amino acids (side-chains)

Flexibility of Protein Structure

Molecular Simulation II. Classical Mechanical Treatment

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan

Optimization and Frustration:

Molecular dynamics simulation of Aquaporin-1. 4 nm

Denaturation and renaturation of proteins

Biophysics II. Hydrophobic Bio-molecules. Key points to be covered. Molecular Interactions in Bio-molecular Structures - van der Waals Interaction

Transcription:

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation Jakob P. Ulmschneider and William L. Jorgensen J.A.C.S. 2004, 126, 1849-1857 Presented by Laura L. Thomas and Andrew Kohlway BBSI Journal Club Meeting June 3, 2004

Introduction Protein Folding Protein structure is intimately related to function Protein misfolding is associated with diseases such as Alzheimer s and Cystic Fibrosis Computational studies can give additional insight into experimental studies Biological relevance of crystal structures Novel protein structures may serve roles in nanotech applications

Protein Structure Amino Acids Non-polar Aromatic Polar + Charged - Charged Interactions protein protein & protein - solvent Covalent Electrostatic Hydrophobic Van der Waals Structure Primary (Sequence) Secondary (Local 3D) Tertiary (Net 3D) Quaternary (Aggregates)

In This Study The folding of three polypeptides which adopt β- hairpin and α-helical secondary structures were simulated using Monte Carlo sampling An implicit solvent model (GBSA) was implemented, while the proteins were represented with an all atom model (OPLS-AA) A concerted rotation algorithm was applied in order to increase sampling efficiency

Methods Systems OPLS-AA Force Field GBSA Continuum Solvent Model Monte Carlo Sampling Concerted Rotation Algorithm Simulation Protocol

Peptide Systems Under Investigation U(1-17)T9D MQIFVKTLDGKTITLEV pdb 1e0q Derived from ubiquitin β-hairpin α 1 ELLKKLLEELKG pdb 1al1 De novo α-helical trpzip2 SWTWENGKWTWK pdb 1le1 De novo β-hairpin

OPLS-AA Force Field Optimized Potentials for Liquid Simulations (All Atom) Intramolecular terms Bond Stretching Angle Bending Torsional Angle Non-bonded terms Coulombic Lennard-Jones 1,4-interactions Etot = Ebond + Eangle + Edih + Enb ( ) 2 E = k r r E E bond b, i i 0, i i angle θ, i i 0, i i dih nb ( ) E = k θ θ 2 ( 1+ cosϕ ) ( 1 cos 2ϕ ) ( 1+ cos3ϕ ) V V V = + + i 2 2 2 1, i i 2, i i 3, i i = + 12 6 2 qqe i j σ ij σ ij 4ε ij i j> i rij r ij r ij

GBSA Continuum Solvation Implicit model where solvent is treated as a statistical continuum Solvation free energy consists of a solventsolvent cavity term, a solute-solvent van der Waals term, and a solute-solvent electrostatic polarization term G sol is linearly related to SASA for saturated hydrocarbons in water Gsol = Gcav + GvdW + Gpol Gcav + GvdW = σ ksak G pol (The Generalized Born Equation) is derived from Coulomb s Law in a dielectric and the Born Equation G pol 1 = 166 1 qq n n i j ε i= 1 j= 1 fgb f GB is a function of the internuclear separation and atomic radii

Monte Carlo Sampling Classical statistical mechanics method based on the Metropolis Algorithm Random configurations of a molecule are generated x = x + 2r 1 δ r new old n ( ) max The energy of the system is computed If E new < E old Accept new configuration If E new > E old Compute the Boltzmann Factor If BF > r n Accept new configuration If BF < r n Reject new configuration ( ( N )/ b ) e V r k T

Concerted Rotation with Bond Angles Benefits of MC over MD Energy derivatives are not needed Exploits implicit solvent by enabling large conformational changes to cross over energy barriers However MC requires nontrivial move sets Variation in backbone degrees of freedom leads to global conformational changes CRA Change consecutive backbone torsion angles while leaving others unchanged Smaller number of non-bonded terms to evaluate Includes flexible angles, an improvement over earlier models Includes a Gaussian bias in order to increase sampling efficiency

Simulation Set-up Protocol Modified MCPRO package Temperatures 30 50 ºC No cutoffs for the non-bonded interactions Electrostatic energy was recomputed for every MC configuration The non-polar contribution to the solvation free energy was considered to be proportional to the SASA, which was updated every 30 MC configurations Backbone moves were made every 4 MC configurations, with the remainder being simple side chain moves The simulated structures were compared to NMR structures

Results β-hairpin U(1-17)T9D Eight MC runs at 30 ºC Started from completely extended conformations All runs stable β-hairpin conformations within 20 80 million MC configurations Rapid relaxation into compact state with twist which lasts from a few to 70 million configurations Turn formed (2-10M) Side chain reorientation and formation of interstrand H-bonds

β-hairpin U(1-17)T9D Lowest energy system most closely resembles the NMR structure The other structures were sampling local minima Formation of turn critical All simulations yielded a turn in the center of the chain No helical structures formed Transient salt bridge between amino group of K11 and carboxyl group of D9 7 non-native structures all very similar with a L-D-G-K turn caused all sidechain interactions to be out of sync (G should be in 4 th position) Hydrophobic core vs. H-bond centric modes of folding

α-helical Polypeptide α 1 Four MC runs at 40 ºC (350 800 M MC configurations) Started from completely extended conformations Forms a random coil at low concentrations Analysis of helical content Helical = Φ and Ψ within a range of (30º, 25º) of ideal (-57º, -47º) H-bonds = D-H-A distance of less than 2.8 Å and a D-H- A angle between 120º-180º All simulations showed that transient helical structures formed numerous times and were stable for 10 30 MC configurations

α-helical Polypeptide α 1 Cluster analysis used to assess secondary structural motifs encountered in simulations Structures were sampled every millionth MC step Main chain least squares superposition with a 1.5 Å similarity cutoff 606 clusters identified 200 analyzed 43% of the structures were helical A majority of these have only one helical turn Two turn helices make up 15% Complete helices make up 1% Indicates that fully helical α 1 is not stable on its own at this temperature 21% of structures were β-like 36% were random coils This is all in agreement with the NMR studies

Tryptophan Zipper trpzip2 Eight MC runs at 50 ºC (up to 8M configurations) Started from completely extended conformations Two of the simulations reached the native state Immediate relaxation Frequent transitions between folded states (compact coil or β- hairpin) The system then unfolds again and repeats this process

Tryptophan Zipper trpzip2 Comparison to the larger β-hairpin U(1-17)T9D where all runs led to stable folded structures Trpzip2 runs were done at higher temperature and were 5-10 times longer This criteria was sufficient to unfold local structures Only the native state had a lifetime longer than the simulation length The 6 non-native trpzip2 runs were not trapped in local minima, but rather did not yet sample their native basin Random search for native state without necessary intermediates All H-bonds formed simultaneously-no zipping (similar for both)

Native States How to distinguish between native and non-native states? Intervals of.1 M configurations from all runs used to create plot Native structures group in the bottom left corner (low energy, low RMSD)

Native States U(1-17)T9D Peptide native structures seen below -820 kcal/mol α 1 Peptide native much harder to discern: structures 1) β-turns above 6 Å 2) Partial Helical Structures in Middle 3) Full Helical Structures below 2 Angstroms (High in Energy: Stability obtained from interaction with other monomer units) Tryptophan Zipper more like U(1-17)T9D with native structures seen below -630 kcal/ml

Conclusions Successful folding of peptides obtained using Monte Carlo sampling with concerted backbone rotations, OPLS-AA force field, GBSA solvation model The β-hairpins both assumed native structures The α-helical structure assumed random coil conformations in agreement with low concentration NMR studies Higher temperatures were not necessary in contrast to similar MD studies (0-50 C) CRA method allowed for relatively larger conformational changes while avoiding intramolecular steric clashes

References Ulmschneider, J. P.; Jorgensen. J. Am. Chem. Soc. 2004, 126, 1849-1857. Still, W.C.; Tempczyk, A.; Hawley, R. C.; Hendrickson, T. J. Am. Chem. Soc. 1990, 112, 6127-6129. Ulmschneider, J. P.; Jorgensen. W.L. J. Chem. Phys. 2003, 118, 4261-4271. Paul von Rague Schleyer. Encyclopedia of Computational Chemistry; John Wiley & Sons Ltd, 1998.

Ubiquitin Ubiquitin is an evolutionarily highly conserved 76 amino acid polypeptide that is abundant in all eukaryotic cells. Polyubiquitination of substrates targets them for degradation by the 26S proteasome, a multiprotein complex conserved from archaebacteria to humans.