The protein folding problem consists of two parts:

Similar documents
Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Lecture 11: Protein Folding & Stability

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

From Amino Acids to Proteins - in 4 Easy Steps

= (-22) = +2kJ /mol

Protein Folding. I. Characteristics of proteins. C α

Section Week 3. Junaid Malek, M.D.

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Dana Alsulaibi. Jaleel G.Sweis. Mamoon Ahram

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein structure and folding

Quiz 2 Morphology of Complex Materials

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

Protein Folding experiments and theory

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Free energy, electrostatics, and the hydrophobic effect

Energetics and Thermodynamics

Biomolecules: lecture 10

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Biochemistry: Concepts and Connections

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Lecture 34 Protein Unfolding Thermodynamics

Flexibility of Protein Structure

Modeling Background; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 8

Protein Structure Basics

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Introduction to" Protein Structure

4 Proteins: Structure, Function, Folding W. H. Freeman and Company

Paul Sigler et al, 1998.

THE UNIVERSITY OF MANITOBA. PAPER NO: 409 LOCATION: Fr. Kennedy Gold Gym PAGE NO: 1 of 6 DEPARTMENT & COURSE NO: CHEM 4630 TIME: 3 HOURS

Proteins polymer molecules, folded in complex structures. Konstantin Popov Department of Biochemistry and Biophysics

MCB100A/Chem130 MidTerm Exam 2 April 4, 2013

MBLG lecture 5. The EGG! Visualising Molecules. Dr. Dale Hancock Lab 715

ALL LECTURES IN SB Introduction

Protein Structure Bioinformatics Introduction

Protein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror

schematic diagram; EGF binding, dimerization, phosphorylation, Grb2 binding, etc.

BCH 4053 Spring 2003 Chapter 6 Lecture Notes

Denaturation and renaturation of proteins

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

BCMP 201 Protein biochemistry

7.88J Protein Folding Problem Fall 2007

Protein folding. Today s Outline

High Specificity and Reversibility

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Lecture 21 (11/3/17) Protein Stability, Folding, and Dynamics Hydrophobic effect drives protein folding

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation

BCHS 6229 Protein Structure and Function. Lecture 3 (October 18, 2011) Protein Folding: Forces, Mechanisms & Characterization

arxiv: v1 [cond-mat.soft] 22 Oct 2007

StructuralBiology. October 18, 2018

Charged amino acids (side-chains)

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function. The importance of proteins

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key

Protein Structures. 11/19/2002 Lecture 24 1

Problem Set 1

arxiv:chem-ph/ v1 11 Nov 1994

BIBC 100. Structural Biochemistry

Protein structure forces, and folding

1. (5) Draw a diagram of an isomeric molecule to demonstrate a structural, geometric, and an enantiomer organization.

Enzyme Catalysis & Biotechnology

Announcements & Lecture Points

Introduction to Protein Folding

MCB100A/Chem130 MidTerm Exam 2 April 4, 2013

Student Questions and Answers October 8, 2002

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Basics of protein structure

Introduction to Computational Structural Biology

Details of Protein Structure

Supersecondary Structures (structural motifs)

Florida International University. Bernard S. Gerstman Department of Physics. University Park Miami, FL

Biophysical Model Building

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

Molecular Mechanics, Dynamics & Docking

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

Biotechnology of Proteins. The Source of Stability in Proteins (III) Fall 2015

Homework Problem Set 4 Solutions

Supplementary Information

Chapter 6- An Introduction to Metabolism*

MCB65. Physical Biochemistry: Understanding Macromolecular Machines. Rachelle Gaudet Martin Samuels MCB 65 1/25/16 1

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1

SCIENCE CHINA Life Sciences. Physicochemical bases for protein folding, dynamics, and protein-ligand binding

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

BME Engineering Molecular Cell Biology. Structure and Dynamics of Cellular Molecules. Basics of Cell Biology Literature Reading

Principles of Physical Biochemistry

Archives of Biochemistry and Biophysics

Protein Folding In Vitro*

1. What is an ångstrom unit, and why is it used to describe molecular structures?

Unfolding CspB by means of biased molecular dynamics

Lecture 26: Polymers: DNA Packing and Protein folding 26.1 Problem Set 4 due today. Reading for Lectures 22 24: PKT Chapter 8 [ ].

Conformational Geometry of Peptides and Proteins:

BIOC : Homework 1 Due 10/10

CAP 5510 Lecture 3 Protein Structures

Biochemistry - I SPRING Mondays and Wednesdays 9:30-10:45 AM (MR-1307) Lectures 3-4. Based on Profs. Kevin Gardner & Reza Khayat

(6 pts) List three general characteristics shared by stable secondary structures.

Transcription:

Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures. 2) Finding a way to get to that structure 1

Why start a lecture on protein folding with a picture of DNA. The reason is that protein Structures are very different from the structure of DNA and by understanding what is so different between the two will help us better understand the essence of the protein folding problem. 2

The structure of myoglobin was a complete shock for people who had hoped that the structure, once revealed, would immediately explain how this molecule worked. The structure was startling in two ways 1) The structure was not regular at all. Yes there were helices, but they were arranged in what appeared to be random orientations. 2) Eventhough the structure was completely irregular, the protein had a defined structure. This was even weirder. How can a structure be so random but at the same time be so well defined. Obviously understanding how proteins worked would be a lot harder than understanding How DNA worked. 3

Comparison to other self-assembling systems? In this respect proteins are different from other polymers like DNA. These polymers also adopt a defined structure by finding the global energy minimum but this global energy minimum is achieved by finding local minima for parts of the structure. In other words we perform multiple independent searches. Dividing up the search for the global minimum global energy minimum by finding the local conformational minimum for each part 4

The search for the conformational energy minimum is global not a combination of multiple independent local searches Traveling salesman problem a classic example of a global search problem The conformational search problem for proteins are different from other polymers like DNA. These polymers also adopt a defined structure by finding the global energy minimum but this global energy minimum is achieved by finding local minima for parts of the structure. In other words we perform multiple independent searches. Dividing up the search for the global minimum global energy minimum by finding the local 5

The conformational energy landscape describes the relative energy of all possible conformational states of a molecule Two-dimensional energy landscape The conformational energy landscape of a protein describes the relative energy of all possible conformational states. Above is a two dimensional energy landscape. The z (vertical) axis give the relative energy. The x and y axis represent two parameters that describe the structure. This could be torsion angles for example. So if we had a molecule that had only two torsion angles then the energy landscape may look like the one in the picture. The location of the sphere represents the current conformational state of the molecule. The energy landscape will have little divots, local minima and the molecule could become trapped in these minima. However, there is thermal energy that constantly jostles the molecule around, so the molecule changes conformations I.e. the sphere moves around and can explore the energy landscape. To describe the conformation of a protein we obviously need a lot more parameters than two. So the protein energy landscape has many different dimensions. The backbone alone contributes two dimensions (phi,psi) for every peptide unit. Add to that bond stretching, bending of angles etc. and the energy landscape of a 150 AA protein can easily have hundreds or thousands of dimensions. We obviously cannot draw such an energy landscape, but the concept of energy landscapes is very useful when thinking about protein dynamics and so researchers will often draw a one- or two-dimensional energy landscape to make a point. 6

Frustrated systems 7

The thermodynamic hypothesis Christian Anfinsen The native structure of a protein is determined solely by the sequence of amino acids in its poly-peptide chain and represents the state of the lowest conformational energy under native conditions. Nobel Prize in Chemistry 1972 There are two important points to this statement: 1) The sequence determines the structure. Yes, chaperons help, but they do not alter the structure. Instead of shaping the structure, these chaperons appear to help the protein s get to that structure. 2) The native structure is the state of the lowest conformational energy. Because that energy minimum is unique, the structure is unique. Most polymers, including random polypeptide chains do not follow the thermodynamic hypothesis. Instead the structure of those polymers is usually determined by kinetic factors. The polymer gets stuck in some local conformational state. 8

Solving the energetic problem Make more stabilizing interactions than you break 5-15 kcal Hydrophobic effect Conformational entropy Hydrogen bonds Electrostatic interactions + Van der Waals interactions very small Number The main driving force towards a folded structure is the hydrophobic effect. The main driving force opposing folding is the loss of conformational entropy. Lets estimate the size of some of these individual interactions. 9

With increasing size the ratio of surface to buried residues decreases 5 hydrophilic 0 hydrophobic 11 hydrophilic 4 hydrophobic 16 hydrophilic 14 hydrophobic Another physical restriction on domain structure is protein size. One of the dominant energetic trade offs between the folded and unfolded state is between conformational entropy in the unfolded form and hydrophobic stabilization in the folded form. Remember both of these effects are entropic in nature. The per-residue entropic cost of folding is independent of the proteins size. The total entropic cost of folding a protein is simply the sum of the entropic cost of restricting the conformational degree of freedom for each amino acid. By contrast the amount of entropy we gain through the hydrophobic effect depends on the proteins size. The larger a protein gets, the larger the fraction of its sidechains that it can bury in a hydrophobic core. This is simply due to the fact that the volume of the protein increases with the cube of the radius while the surface area increases with the square of the radius. Does the combination of a size-independent entropic cost with a size-dependent entropic gain for the folding reaction impose any restrictions on domain size? Yes, because if we make our domain smaller and smaller, we have fewer and fewer hydrophobic interactions to pay for the cost of constraining the conformational freedom of our sidechains. So what fraction of a protein s sidechains has to be in the hydrophobic core to give the folded state a net stability? 10

Entropic cost per amino acid 0.6kcal/mol*ln10=1.3 kcal/mol Average number of methyl groups per hydrophobic amino acid 5 Hydrophobic effect per buried amino acid = 5*0.8 kcal/mol=4 kcal/mol ---> ~ 1/3 of amino acids need to be buried V=4/3 π r 3 4/3 π x 3 = hydrophobic 4/3 π y 3 total x=2 ; y=4 n= 64 ratio = 8 x=3 ; y=5 n=125 ratio = 4.6 x=4 ; y=6 n=216 ratio = 3.4 Lets do a very simple minded calculation. Lets say that we restrict a sidechain s conformational freedom by a factor of 10 when we go from the unfolded form to the folded form. So our entropic price per residue is ~1.3 kcal/mol. The average hydrophobic sidechain has 5 carbons and the average energy of burying a methyl group is 0.8 kcals/mol to burying a side chain should give us ~4 kcal/mol. So in order to get net stability from folding we need to bury approximately 1 out of every three amino acid sidechains in a protein s hydrophobic core. So the volume of the hydrophobic core needs to be greater than 1/3 the volume of the total protein. It turns out that we get a ratio of ~3 for domains on the size of 125-200 amino acids, which is close to the lower size limit for domain size. There clearly are some domains that are smaller than this size limit, but there are very few domains consisting of less than ~80 AA. Another way of thinking about this is if we have a cylindrical protein shape and we take the middle figure of the previous slide as a cross section we need at the very least a double layer of hydrophobic sidechains in the interior of the protein. To get close to this 1/3 ratio. 11

Here is an examples of the helical bundle fold in which we have this double layer of hydrophobic residues in the interior of the protein. 12

Here is the example of the ribonuclease inhibitor which again shows this double layer of hydrophobic residues. 13

There are some protein domains that are much smaller than our size limit of ~80 amino acids, but these domains are not stabilized by hydrophobic interactions alone. Instead these domains require some other source of stabilizing energy such as the binding of a metal ion or cystein bridges. To stabilize the structure Protein engineers have actually been able to generate zinc finger proteins like the one shown on the left above, that were stabilized by hydrophobic interactions alone. However in such a domain each interaction is absolutely essential to keep the structure intact. Such a domain would have very little chance of being retained by an organism over the course of evolution. One deleterious mutation would lead to a complete loss of the domain s structure and function and it is very unlikely the gene would acquire a compensatory mutation before the gene is lost, or receives a second deleterious mutation. In other words, protein domains that are very small, probably do not show the degree of robustness that is needed to retain protein structure in the face of random mutations. 14

15

Using the hydrophobic effect to evade immune Detection: Neisseria gonorrhoeae pilin Parge et al. 1995 Nature 16

Using the hydrophobic effect to evade immune Detection: Neisseria gonorrhoeae pilin 17

18

Levinthal s paradox Proteins have an incredible number of possible conformational states, yet they are able to fold very quickly. Cyrus Levinthal 19

Levinthal s paradox 150 AA domain 10 different conformations per side chain -------> 10 150 possible conformations. Atomic vibrations occur on fsec. (10-15 sec) time scale Time to search all possible conformations >> age of the universe Number of particles in the universe ~10 85 20

Levinthal s paradox Random protein sequences typically do not fold, neither do most other polymers. There must be a code / mechanism That allows proteins to fold within a reasonable Period of time. Two models: Frame work Global collapse 21

Framework model Secondary structure elements form first Packing of secondary structure elements leads to molten globule Repacking of the core in the molten globule leads to native structure 22

Global collapse model - Protein collapses due to hydrophobic effect - Hydrophobic environment drives formation of secondary structure to form molten globule - Repacking of molten globule leads to native structure 23

24

25

Phi value analysis Alan Fersht Phi value =1 interaction already exists in transition state Phi value =0 interaction is not part of transition state 26

27

Current state of debate A detailed understanding of protein folding remains illusive because we still lack experimental information on many of the states along the folding trajectory The transition state of a two-state folder tends to be very compact. Proteins with similar folds tend to fold following a similar mechanism. 28