Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures

Size: px
Start display at page:

Download "Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures"

Transcription

1 Proc. Natl. Acad. Sci. USA Vol. 96, pp , September 1999 Biophysics Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures E. ALM AND D. BAKER* Department of Biochemistry, University of Washington, Seattle, WA Edited by Peter G. Wolynes, University of Illinois at Urbana Champaign, Urbana, IL, and approved July 12, 1999 (received for review May 6, 1999) ABSTRACT Guided by recent experimental results suggesting that protein-folding rates and mechanisms are determined largely by native-state topology, we develop a simple model for protein folding free-energy landscapes based on native-state structures. The configurations considered by the model contain one or two contiguous stretches of residues ordered as in the native structure with all other residues completely disordered; the free energy of each configuration is the difference between the entropic cost of ordering the residues, which depends on the total number of residues ordered and the length of the loop between the two ordered segments, and the favorable attractive interactions, which are taken to be proportional to the total surface area buried by the ordered residues in the native structure. Folding kinetics are modeled by allowing only one residue to become ordered/ disordered at a time, and a rigorous and exact method is used to identify free-energy maxima on the lowest free-energy paths connecting the fully disordered and fully ordered configurations. The distribution of structure in these free-energy maxima, which comprise the transition-state ensemble in the model, are reasonably consistent with experimental data on the folding transition state for five of seven proteins studied. Thus, the model appears to capture, at least in part, the basic physics underlying protein folding and the aspects of nativestate topology that determine protein-folding mechanisms. Recent experimental results suggest that protein-folding rates and mechanisms are largely determined by native-state topology (1). First, dramatic changes in sequence have been found to have little effect on protein-folding rates. Laboratory generated variants of the SH3 domain and IgG binding domain of protein L with large numbers of sequence changes were found to fold at rates similar to that of the naturally occurring proteins, suggesting that the sequences of small proteins are not extensively optimized for rapid folding (2, 3). Furthermore, naturally occurring proteins with similar folds but very different sequences generally have similar folding rates (4). Second, for two distinct protein folds, the SH3 domain and CspB, there is evidence that folding mechanisms and transition-state structures are conserved among homologs despite differences in amino acid sequence and stability (4 6). Third, the folding rates of small proteins were found to be strongly correlated with a property of the native-state topology: the average sequence separation between residues that make contacts in the three-dimensional structure (the contact order) (7). Recent theoretical work is also consistent with the idea that topology is an important determinant of protein-folding mechanisms (8). Taken together, these results suggest that nativestate topology is a key determinant of protein-folding mechanisms and of the distribution of structure in the transitionstate ensemble. The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C solely to indicate this fact. PNAS is available online at These findings imply that a model of protein folding need not take into account the complex details of the interactions between residues or consider nonnative interactions in order to reproduce the general features of the folding landscape. Rather, a simple treatment of the interactions in the native protein may suffice to account for most of the experimental data available on the folding of small protein domains. We describe a simple model based on surface area burial and chain conformational entropy that shows promise in reproducing some of the general features of the protein-folding landscapes of real proteins. Basic Assumptions. The goal of our approach is to explicitly map out the folding free-energy landscape using information from the native protein structure. From the free-energy landscape, all of the observable properties of the folding process can be inferred, including folding rates and transition-state structure. Two key approximations make mapping out the free-energy landscape possible. The first is that only native interactions contribute significantly to the folding process; configurations including nonnative interactions are ignored. The second is that in any given configuration, each residue is either fully ordered as in the native state or completely disordered. Because each residue can be in one of two states, ordered or disordered, the folding free energy landscape that follows from these assumptions consists of 2 n possible configurations, where n is the number of residues in the protein considered. Enumerating Configurations. Although the second approximation reduces the continuum of possible configurations to a finite number, for most proteins there are still too many to evaluate. To reduce further the number of configurations that must be considered, all ordered residues can be required to occur within a single contiguous stretch in the linear protein sequence [the single sequence approximation of helix coil theory (9, 10)]. A more general approach is to allow multiple ordered segments that are distant in linear sequence to interact. Increasing the number of allowed segments increases both the complexity of the energy landscape and the total number of configurations that must be considered. We have adopted a compromise between the generality of the multiple-segment model and the simplicity of smaller models, which we call the sequential binary collision model: only two contiguous segments of the chain are allowed to be ordered in any given configuration; however, if all of the intervening residues between two ordered segments become ordered, a new segment may be added, because the two initial segments can be described by a single larger segment. When one ordered segment has a length of zero, this model is identical to the contiguous sequence model. METHODS Free Energy Function. Once a suitable set of configurations has been chosen to serve as the basis set for a folding This paper was submitted directly (Track II) to the Proceedings office. *To whom reprint requests should be addressed. dabaker@ u.washington.edu

2 11306 Biophysics: Alm and Baker Proc. Natl. Acad. Sci. USA 96 (1999) landscape, the free energy of each configuration must be calculated. We use a simple free-energy function, F ASA kt n ln L/L 0, [1] where the first term represents attractive native interactions, the second term, the entropic cost of ordering the residues in the ordered segments, and the third term is the entropic cost of closing the loop between the two ordered segments. The free-energy function for the contiguous sequence model contains only the first two terms, because only one ordered segment is allowed. A related free-energy function was developed by Finkelstein and Badretdinov (11). The attractive native interactions [first term in Eq. 1] were taken to be proportional to the difference between the surface area buried by the ordered residues in the native state and the surface area buried by these residues in the denatured state. Total surface area burial (both hydrophobic and polar) was used; because only native interactions are considered, all burial is assumed to be favorable. The surface area buried by the ordered residues was computed from the positions of these residues in the native structures [1fmk (SH3), 1rnb (barnase), 3ci2 (CI-2), 2chf (Che Y,) residues 6-92 of chain 3 of 1lmb ( repressor), residues of 2ptl (protein L), and residues 10A through 81A from 1aye (procarboxypeptidase activation domain)] using the method of LeGrand et al. (12). The surface area buried in the unfolded state was taken to be the sum of the surface areas for each residue in its corresponding native tripeptide. in Eq. 1 was taken to be 16 cal/mol/å 2 (13). Specific hydrogen bonding patterns, van der Waals energy, and details of sidechain packing are not taken into account in this simple model. Such terms may not be relevant until late in the folding process when the protein structure is well determined, and thus may be more relevant to the analysis of protein unfolding and stability. The conformational entropy lost on ordering a segment of consecutive residues was taken to be proportional to the number (n) of residues ordered (second term in Eq. 1). The model was tested on three of the proteins studied (the SH3 domain, barnase, and CI-2) over a range of values of (corresponding to free energies of 1.6, 1.65, 1.7, 1.75, and 1.8 kcal/mol per residue at 298 K), in order to find the value that best characterized the experimentally observed features of the folding landscape. A fixed value equivalent to 1.75 kcal/mol per residue at 298 K was chosen on the basis of the position of the transition state ensemble on the reaction coordinate (compared to m f /m values) and on the performance of the model in reproducing experimentally observed -values for these three proteins. This is very close to the value of 1.8 kcal/mol per residue at 298 K estimated by Freire from calorimetric studies (14). The decrease in entropy on loop closure in configurations in which there is more than one contiguous segment of ordered residues was estimated from off-lattice studies of loop closure frequencies in polypeptide chains (E.A. and D.B., unpublished results). The entropy of loop closure derived from these studies, S(loop closure)/k, was found to be well represented by ln(l/l 0 ), where L is the length of the loop, 1.8 and L This expression is consistent with estimates from polymer theory and recent experimental results (11, 15 19). Identifying Folding Pathways and Transition-State Configurations. Folding kinetics are treated in the model by allowing only one residue to change state (from ordered to disordered or vice versa) at a time. Thus, a configuration with two ordered segments is kinetically connected only to configurations in which one of the two segments is unchanged and the other is one residue longer or shorter. With this constraint, folding pathways connecting the fully unfolded (all residues disordered) and folded (all residues ordered) configurations can be readily identified by using a standard recursive depth firstsearch algorithm. The kinetically relevant pathways are those that involve surmounting the lowest free-energy barriers; the lowest free-energy transition-state configurations are defined to be the highest free-energy configurations on the lowest free-energy folding pathways. A simple algorithm was used to identify these pathways and transition-state configurations: 0. Generate a list of all allowed configurations and their free energies. 1. Remove from the list the highest free-energy configuration 2. Determine whether there is still a path from the fully unfolded configuration to the fully folded configuration. If there is a path, go to step 1. If there is not a path, the configuration removed in step 1 is the lowest free-energy transition-state configuration. The 100 lowest free-energy transition-state configurations were identified by repeatedly carrying out steps 0 2 and each time removing from the list the lowest free-energy transitionstate configuration. The computational time needed to identify transition-state configurations was considerably reduced by replacing the linear search in step 1 with a bisection technique and by using a branch-and-bound strategy to determine if there is a path in step 2. Calculation of -Values. To compare with experimental data, -values were obtained by dividing the change in transition-state free energy accompanying a sidechain truncation by the change in native-state free energy. Because there were typically a collection of transition-state species of similar free energy, -values were computed for the 100 lowest free-energy transition-state configurations and were averaged. Graphics. All molecular structures shown were produced by using the RASMOL and MOLSCRIPT software packages (20, 21). S-PLUS (Mathsoft, Seattle, WA) was used to generate images of the folding landscape for the two models described. RESULTS AND DISCUSSION Contiguous Sequence Model. The major advantage of the contiguous sequence approximation is that the resulting folding landscapes can be represented directly on a twodimensional plot. Fig. 1, column I, shows the contiguous sequence folding landscape for five proteins studied [several of these plots were shown for illustration in a recent review (1)]. The order parameter, N f, used as the x axis in this and all subsequent plots, is the fraction of residues that are ordered; N f 0(Left) corresponds to the fully unfolded state and N f 1(Right) to the folded state (22). The y axis indicates the position of the ordered fragment, and the colors indicate free energy (the highest free-energy configurations are shown in yellow and the lowest, in blue). The folding landscapes for the SH3 domain, barnase, and Che Y each contain a distinct low free-energy pathway from the unfolded state to the native state. For the SH3 domain, this pathway is determined by a bottleneck on the folding landscape when about half of the residues in the protein are ordered: the 30-residue region centered at residue 35 is much lower in free energy than any other region of comparable size. This segment roughly corresponds to the region that has been shown experimentally to be important in stabilizing the folding transition state (5, 6). For Che Y, the stretch of low free-energy (blue) configurations that connects the unfolded and folded states along the N- terminal (Bottom) side of the landscape suggests that there is a low free-energy pathway that corresponds to ordering the protein from the N terminus to the C terminus. This is consistent with the experimental observation that the N terminus is ordered at the transition state (23). For barnase, the residues shown experimentally to be involved in the transition state are split between the C-terminal -sheet and an N- terminal helix (24). Although, the contiguous sequence model correctly identifies one of these regions (the C-terminal beta

3 Biophysics: Alm and Baker Proc. Natl. Acad. Sci. USA 96 (1999) FIG. 1. Column I (free-energy landscape in the contiguous sequence model). Free-energy landscapes in the contiguous sequence model for the SH3 domain (row 1), CI-2 (row 2), barnase (row 3), Che Y (row 4), and repressor (row 5). In the contiguous sequence model, each configuration consists of a single stretch of ordered residues. Thus, each of the configurations on the free-energy landscape can be uniquely identified by two parameters, namely the length of the stretch of ordered residues or, equivalently, the fraction of residues ordered, N f (x axis) and the location of the center of the ordered segment (y axis). Colors indicate free energy and are computed from Eq. 1 (without the loop entropy term); black indicates 0 kcal/mol, and white indicates 30 kcal/mol for SH3, 25 kcal/mol for CI-2 and repressor, 35 kcal/mol for Che Y, and 40 kcal/mol for barnase. The color scheme used to represent free energies between 0 kcal/mol and the upper bounds is black blue (0 25% of upper bound), blue magenta (25 50%), magenta red (50 75%), red yellow (75 88%) and yellow white (88 100%). Column II (hierarchy of structure formation). The frequencies with which individual residues were ordered in a Boltzmann weighted ensemble of the configurations available under the sequential binary collision model are shown for each value of N f (see text); the y axis is position along the sequence. Colors indicate the frequencies with which residues were ordered (black blue, ; blue magenta, ; magenta red, ; red yellow, ; and yellow white, ). Column III (structure in the transition-state ensemble). The number of times each residue was ordered in the top 100 transition states is shown as a function of N f ; the color scheme (with counts taking the place of frequencies) and axes are as in Column II. Column IV (free energy profile). The free energy, as a function of N f, was computed from the partition function for each value of N f as described in the text. The x axis is the reaction coordinate, N f, and the y axis is the free energy in kcal/mol. sheet is the lowest free-energy structure with 40 residues ordered), the constraints of the model do not allow both regions to be identified. Thus, a major limitation of this simple model is the severe constraint that all configurations have only one stretch of ordered residues. CI-2 is another example of the limits of this simple model. The model correctly identifies the alpha helix (25) (the segment about 1/3 the length of the protein centered at residue 15) as a low free-energy configuration, but the helix cannot nucleate folding because of the large free-energy barrier to ordering the loop adjacent to the helix. In a more realistic model, the beta sheet could condense on the helix without fully ordering each intervening residue. Free-Energy Profile of the Sequential Binary Collision Model. The sequential binary collision model overcomes some of the limitations of the contiguous sequence model by allowing two segments of the protein distant in the linear sequence to come together in three dimensions. The full free-energy landscape for the sequential binary collision model cannot be represented easily in a two-dimensional graph as is the case for the contiguous sequence model. A free-energy profile, however, can be generated for this model with the caveat that the profile may include contributions from states that are not kinetically accessible. The free energy as a function of N f, A N f ktln Q N f kt ln i e F i kt is shown in Fig. 1, column IV, where F i is the free energy of configuration i (Eq. 1), and the sum is over all configurations with a particular N f. For all of the proteins studied, the free energy of the folded state was found to be higher than that of the unfolded state by 5 15 kcal/mol. This lack of agreement with experimentally determined stabilities was not unexpected, because only contributions from surface area burial were included in our simple energy function (a more sophisticated energy function may not be appropriate because partially ordered conformations are probably not well ordered enough to receive the full stabilizing energies of most hydrogen bonds and van der Waals interactions). Of particular interest in Fig. 1, column IV, is the ruggedness of the free-energy profile for barnase and Che Y. Because of the limitations of our energy function in correctly estimating the stability of

4 11308 Biophysics: Alm and Baker Proc. Natl. Acad. Sci. USA 96 (1999) completely ordered proteins, the large dips in the profiles for these proteins may actually be lower than indicated if they involve the condensation of a fragment of the protein into a well structured subdomain; these local energy minima may correspond to the kinetic intermediates observed in the folding of these two proteins. Hierarchy of Structure Formation. To exhibit the hierarchy of structure formation in the sequential binary collision model, Fig. 1, column II, displays how often each residue would be ordered in the thermodynamic limit where each configuration is populated according to its Boltzmann weight, and kinetic barriers are ignored. All configurations were enumerated, and the Boltzmann weighted average frequency with which each residue was ordered f res, N f i Ord res, i e Fi/kT Q N f is shown as a function of N f, where Ord(res, i) 1 if residue res is ordered in the ith configuration and Ord(res, i) 0 otherwise, and the sum is over all configurations at the specified value of N f. For SH3, it is clear that the first regions of structure to emerge (at N f 0.2) are the distal loop (the blue/red region centered near residue 40) and the RT loop (the blue red region centered near residue 10); however, when about half of the residues in the protein are ordered (at N f 0.5), there is little structure in the RT loop, suggesting that the structure observed there early in the folding pathway does not serve to nucleate folding; in this view of the folding landscape, the early structure seen in the RT loop can be described as offpathway. In contrast, the regions structured at N f 0.5 remain structured throughout the rest of the folding pathway. Interestingly, the regions of the SH3 domain experimentally determined to be structured in the folding transition state are nearly identical to those structured in this representation of the folding landscape at N f 0.5. In the CI-2 free energy landscape, the alpha helix (centered at residue 15) exhibits some structure early but does not become completely ordered until residues in the C-terminal -strands (near residue 50) are also ordered at N f 0.6. The free-energy landscape for the repressor fragment does not suggest a well defined pathway in the thermodynamic limit; when about half the residues in the protein are ordered (N f 0.5), there is a roughly uniform distribution of structure throughout the protein that gradually becomes more ordered until the protein reaches its native configuration (at N f 1). By contrast, the free-energy landscape for Che Y suggests an unambiguous folding mechanism in which the N-terminal region of the protein forms structure first and the C-terminal region becomes structured last. In the barnase folding landscape, the beta sheet becomes ordered first, then the protein condenses to a low free-energy configuration (based on the one-dimensional free-energy profile in Fig. 1, column V) that includes some interactions between the first alpha-helix (centered at residue 10) and the C-terminal beta sheet. Interestingly, the distribution of structure in the configurations that correspond to the first local minimum on the free-energy profile is in qualitative agreement with the distribution of experimental -values for the barnase folding intermediate (26). Distribution of Structure in the Transition-State Ensemble. Although Fig. 1, column II, gives a rough idea of which regions of a protein might be ordered at a given point in the folding reaction, it is not completely rigorous because some low free-energy configurations may not be kinetically accessible. To overcome this limitation, individual transition-state configurations (defined as the highest free-energy configurations on low free-energy paths from the unfolded to folded states) were identified (see Methods). Fig. 1, column III, shows the occupancy by residue of the 100 lowest free-energy transitionstate configurations. These configurations are distributed over a range of values of the reaction coordinate (N f ). For the SH3 domain and Che Y, all of the top transition states have the same general distribution of structure. In contrast, barnase appears to have two structurally distinct transition-state ensembles: one that corresponds to forming the -sheet alone, and one that corresponds to bringing the N-terminal helix onto the sheet. The placement of these transition states on the reaction coordinate, N f, agrees roughly with the position of the corresponding local maxima on the free-energy profile (Fig. 1, column IV) for barnase. The CI-2 and -repressor transitionstate ensembles exhibit more structural diversity than those of SH3 and Che Y, but are not easily separable into distinct ensembles as is the case for barnase. An important issue in computational studies of folding is whether purely thermodynamic approaches, which search for the maximum in the free energy along some suitably chosen reaction coordinate, can accurately identify the folding barrier (27, 28). Although there is a general correspondence between the peak in the free-energy profile and the location of the transition-state configurations, there are some transition-state configurations with values of N f significantly different from that of the maximum in the free-energy profile, suggesting that in this system N f is a good but not perfect reaction coordinate. For minimally frustrated systems, such as small proteins, with funnel-like free-energy landscapes, any order parameter that reflects the amount of native structure (the number of native contacts, the number of residues ordered, etc.) is likely to be a reasonable reaction coordinate (29). The distributions of transition-state configurations (Fig. 2, column III) show that the breadth of the transition-state ensemble (30, 31) varies considerably among the five proteins shown. A complementary estimate of the diversity among the configurations sampled during folding is provided by Y(N f ), which measures the inverse number of thermally accessible states available to the system at a given value of N f (32) (Y(N f ) i (P i2 ), where the sum is over all configurations at a particular value of N f, and P i is the Boltzmann weighted probability of observing the system in configuration i). Interestingly, for all of the proteins except repressor, a sharp FIG. 2. Distribution of thermally accessible states. Y(N f ) i (P i 2 ), which represents the inverse number of thermally accessible states available to the system, was computed for all values of N f, and the result for SH3 and repressor is shown. The sharp peak near N f 0.8 for the SH3 domain (solid line) indicates a drastic reduction in the number of accessible states late in folding that is also seen for CI-2, barnase, and Che Y. By contrast, -repressor (dotted line) does not exhibit such a drastic reduction in accessible states, suggesting a more plastic folding mechanism.

5 Biophysics: Alm and Baker Proc. Natl. Acad. Sci. USA 96 (1999) FIG. 3. Comparison of computed and experimental values. (Left) Protein structures are shown colored by -values computed directly from the sequential binary collision model. Red indicates residues that are important in stabilizing the folding transition state (high -values), and blue indicates residues that do not stabilize the transition state. (Right) Structures colored by experimentally observed -values; residues with a phi-value of 0 are colored blue, residues with a -value of 1 are colored red; purple shades indicate residues with intermediate -values, and white regions indicate residues for which there are no experimental data. reduction in the number of thermally accessible states was observed just after the transition barrier [the peak in Y(N f ) for the SH3 domain is evident near N f 0.8 in Fig. 2 (solid line)]. For repressor (Fig. 2, dashed line), the peak in Y(N f ) was absent, and the average value of Y(N f ) was lower than for the other proteins. This result is intriguing given the relatively large variation in the position of the transition state in repressor mutants (33); the position of the transition state may be less rigidly constrained in repressor because the large number of thermally accessible states provides a number of alternative folding trajectories that can become populated in response to mutations [such behavior was observed in a recent diffusion-collision model of repressor folding (34)]. Comparison with Experiment. There are two sets of experimental observables that can be used to test the sequential binary collision model: protein folding rates and -values. Folding rates can be approximated directly from the height of the free-energy barriers involved in folding. For the small data set studied under the computationally expensive sequential binary collision model, there is little correlation between folding rates and barrier height. A larger set of protein kinetic data was compared with barrier height under the contiguous sequence model, and although there was a correlation between barrier height and folding rates, the relative contact order (a simple measure of the balance between local and nonlocal contacts) was a better predictor of folding rates (7). Given the very large numbers being subtracted in each model to give the net free energy for individual configurations, it is not surprising that there is some error in calculating barrier height precisely. -Values suggest which residues are important in stabilizing the folding transition state and provide a means to test whether the transition-state ensemble identified using the model involves the correct region of the protein. A -value of 0 indicates the residue probed does not contribute to the transition-state stability, whereas a -value of 1 indicates that the residue stabilizes the transition state to the same extent that it stabilizes the native state (35). Folding -values were computed directly from the top 100 transition-state structures (shown in Fig. 1, column III) and were averaged. Results did not change significantly when only 10 transition-state structures were used but did become less accurate when only the top transition state was used to compute -values. On a residueby-residue basis, computed -values correlated reasonably well with those determined experimentally for five of the seven proteins we studied [the SH3 domain (r.50), CI-2 (r.58), barnase (r.61), Che Y (r.87), and -repressor (r.71), where r is the correlation coefficient between the predicted and experimental -values; (Fig. 3)]. Of these, Che Y and repressor, which were not used to optimize the conformational entropy parameter (see Methods), demonstrated the best agreement between computed and experimental -values. The disagreement between computed and experimental -values for the SH3 domain is primarily because of the fact that the model predicts a mostly ordered transition-state configuration including the distal loop, diverging turn, and half of the RT loop, while experimental data indicate only the distal loop and diverging turn are ordered. This is precisely the region of the protein ordered at N f 0.5 in the model (Fig. 2, column II). The predictions for two of the seven proteins, the procarboxypeptidase activation domain and the IgG binding domain of protein L, did not agree with the experimentally determined transition-state structures. For protein L, the failure of the model probably results from the neglect of local sequencestructure propensities. Protein L possesses a high degree of structural symmetry: an N-terminal hairpin is followed by a single helix and a C-terminal hairpin. The two -hairpins bury very similar amounts of surface area within themselves and against the helix, and thus are computed to have similar free energies of ordering in the model, yet only the N-terminal

6 11310 Biophysics: Alm and Baker Proc. Natl. Acad. Sci. USA 96 (1999) half is experimentally observed to be ordered in the folding transition-state ensemble (36) (D. Kim and D.B., unpublished observations). A constant value of conformational entropy per residue ignores sequence biases for specific local structures that may be responsible for the preferential formation of the first hairpin (recent data suggest that the first hairpin is already populated to some extent in the denatured state; Q. Yi and D.B., unpublished observations). A more sophisticated approach might use database statistics to estimate the freeenergy cost associated with formation of specific local structures. CONCLUSIONS The remarkable result of this study is that a very simple model for the folding free-energy landscape combined with a crude energy function is able to reproduce the qualitative features of the transition-state ensemble for five of the seven proteins studied. The success of the simple model combined with recent experimental results supports the idea that protein-folding landscapes are shaped primarily by gross topological features rather than by the details of interresidue interactions. While completing this work, we became aware of very complementary studies being carried out by Munoz and Eaton at the National Institutes of Health and Galzitskaya and Finkelstein in Moscow (Russian Academy of Sciences). Munoz and Eaton (38) used the single-sequence approximation and a secondary structure-dependent free-energy cost for chain ordering and were successful in predicting protein-folding rates after setting the strength of interresidue interactions to reproduce native-state stabilities. Finkelstein and coworkers used an elegant dynamic programming method to identify transition states for multisegment models and succeeded in predicting -values distributions for several proteins (37). The common theme of all three approaches is a native-state centric description of the folding free-energy landscape (nonnative interactions are completely neglected); the success achieved by these and other simple models (8) suggests that the essential physics underlying the folding of small proteins may be simple indeed. We thank Alexei Finkelstein for communicating results before publication and Peter Wolynes and members of the Baker group for helpful comments on the manuscript. This work was supported by a grant from the National Institutes of Health (NIH), and young investigator awards to D.B. from the National Science Foundation and the Packard Foundation, and a Molecular Biophysics predoctoral fellowship from the NIH to E.A. 1. Alm, E. & Baker, D. (1999) Curr. Opin. Struct. Biol. 9, Kim, D. E. & Baker, D. (1998) Proc. Natl. Acad. Sci. USA 95, Riddle, D. S., Santiago, J. V., Bray-Hall, S. T., Doshi, N., Grantcharova, V. P., Yi, Q. & Baker, D. (1997) Nat. Struct. Biol. 4, Perl, D., Welker, C., Schindler, T., Schroder, K., Marahiel, M. A., Jaenicke, R. & Schmid, F. X. (1998) Nat. Struct. Biol. 5, Grantcharova, V. P., Riddle, D. S., Santiago, J. V. & Baker, D. (1998) Nat. Struct. Biol. 5, Martinez, J. C., Pisabarro, M. T. & Serrano, L. (1998) Nat. Struct. Biol. 5, Plaxco, K. W., Simons, K. T. & Baker, D. (1998) J. Mol. Biol. 277, Portman, J. J., Takada, S. & Wolynes, P. G. (1998) Phys. Rev. Lett. 81, Schellman, J. A. (1958) J. Phys. Chem. 62, Munoz, V., Thompson, P. A., Hofrichter, J. & Eaton, W. A. (1997) Nature (London) 390, Finkelstein, A. V. & Badretdinov, A. Y. (1997) Fold. Des. (2) Le Grand, S. M. & Merz, K. M., Jr. (1993) J. Comput. Chem. 14, Eisenberg, D. & McLachlan, A. D. (1986) Nature (London) 319, D Aquino, J. A., Gomez, J., Hilser, V. J., Lee, K. H., Amzel, L. M. & Freire, E. (1996) Proteins 25, Jacobson, H. & Stockmayer, W. H. (1950) J. Chem. Phys. 18, Chan, H. S. & Dill, K. A. (1990) J. Chem. Phys. 92, Nagi, A. D. & Regan, L. (1997) Fold. Des. 2, Ladurner, A. G. & Fersht, A. R. (1997) J. Mol. Biol. 273, Viguera, A. R. & Serrano, L. (1997) Nat. Struct. Biol. 4, Sayle, R. A. & Milner-White, E. J. (1995) Trends Biochem. Sci. 20, Kraulis, P. J. (1991) J. Appl. Crystallogr. 24, Wolynes, P. G. (1997) Proc. Natl. Acad. Sci. USA 94, López-Hernández, E. & Serrano, L. (1996) Fold. Des. 1, Serrano, L., Matouschek, A. & Fersht, A. R. (1992) J. Mol. Biol. 224, Itzhaki, L. S., Otzen, D. E. & Fersht, A. R. (1995) J. Mol. Biol. 254, Matouschek, A., Serrano, L. & Fersht, A. R. (1992) J. Mol. Biol. 224, Socci, N. D., Onuchic, J. N. & Wolynes, P. G. (1996) J. Chem. Phys. 104, Pande, V. S., Grosberg, A. Yu., Tanaka, T. & Rokhsar, D. S. (1998) Curr. Opin. Struct. Biol. 8, Bryngelson, J. D., Onuchic, J. N., Socci, N. D. & Wolynes, P. G. (1995) Proteins Struct. Funct. Genet. 21, Thurmalai, D. & Klimov, D. K. (1998) Fold. Des. 13, R112 R Shakhnovich, E. I. (1998) Fold. Des. 13, R108 R Onuchic, J. N., Wolynes, P. G., Luthey-Schulten, Z. Socci, N. D. (1995) Proc. Natl. Acad. Sci. USA 92, Burton, R. E., Huang, G. S., Daugherty, M. A., Calderone, T. L. & Oas, T. G. (1997) Nat. Struct. Biol. 4, Burton, R. E., Myers, J. K. & Oas, T. G. (1998) Biochemistry 37, Fersht, A. R. (1994) Curr. Opin. Struct. Biol. 5, Gu, H., Kim, D. & Baker, D. (1997) J. Mol. Biol. 274, Galzitskaya, O. V. & Finkelstein, A. V. (1999) Proc. Natl. Acad. Sci. USA 96, Munoz, V. & Eaton, W. A. (1999) Proc. Natl. Acad. Sci. USA 96,

To understand pathways of protein folding, experimentalists

To understand pathways of protein folding, experimentalists Transition-state structure as a unifying basis in protein-folding mechanisms: Contact order, chain topology, stability, and the extended nucleus mechanism Alan R. Fersht* Cambridge University Chemical

More information

A simple model for calculating the kinetics of protein folding from three-dimensional structures

A simple model for calculating the kinetics of protein folding from three-dimensional structures Proc. Natl. Acad. Sci. USA Vol. 96, pp. 11311 11316, September 1999 Biophysics, Chemistry A simple model for calculating the kinetics of protein folding from three-dimensional structures VICTOR MUÑOZ*

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001 Modeling two-state cooperativity in protein folding Ke Fan, Jun Wang, and Wei Wang arxiv:cond-mat/0103385v1 [cond-mat.soft] 19 Mar 2001 National Laboratory of Solid State Microstructure and Department

More information

The kinetics of protein folding is often remarkably simple. For

The kinetics of protein folding is often remarkably simple. For Fast protein folding kinetics Jack Schonbrun* and Ken A. Dill *Graduate Group in Biophysics and Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94118 Communicated by

More information

Master equation approach to finding the rate-limiting steps in biopolymer folding

Master equation approach to finding the rate-limiting steps in biopolymer folding JOURNAL OF CHEMICAL PHYSICS VOLUME 118, NUMBER 7 15 FEBRUARY 2003 Master equation approach to finding the rate-limiting steps in biopolymer folding Wenbing Zhang and Shi-Jie Chen a) Department of Physics

More information

CHRIS J. BOND*, KAM-BO WONG*, JANE CLARKE, ALAN R. FERSHT, AND VALERIE DAGGETT* METHODS

CHRIS J. BOND*, KAM-BO WONG*, JANE CLARKE, ALAN R. FERSHT, AND VALERIE DAGGETT* METHODS Proc. Natl. Acad. Sci. USA Vol. 94, pp. 13409 13413, December 1997 Biochemistry Characterization of residual structure in the thermally denatured state of barnase by simulation and experiment: Description

More information

Protein Folding Prof. Eugene Shakhnovich

Protein Folding Prof. Eugene Shakhnovich Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)

More information

Temperature dependence of reactions with multiple pathways

Temperature dependence of reactions with multiple pathways PCCP Temperature dependence of reactions with multiple pathways Muhammad H. Zaman, ac Tobin R. Sosnick bc and R. Stephen Berry* ad a Department of Chemistry, The University of Chicago, Chicago, IL 60637,

More information

Clustering of low-energy conformations near the native structures of small proteins

Clustering of low-energy conformations near the native structures of small proteins Proc. Natl. Acad. Sci. USA Vol. 95, pp. 11158 11162, September 1998 Biophysics Clustering of low-energy conformations near the native structures of small proteins DAVID SHORTLE*, KIM T. SIMONS, AND DAVID

More information

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed. Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate

More information

Stretching lattice models of protein folding

Stretching lattice models of protein folding Proc. Natl. Acad. Sci. USA Vol. 96, pp. 2031 2035, March 1999 Biophysics Stretching lattice models of protein folding NICHOLAS D. SOCCI,JOSÉ NELSON ONUCHIC**, AND PETER G. WOLYNES Bell Laboratories, Lucent

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Folding of small proteins using a single continuous potential

Folding of small proteins using a single continuous potential JOURNAL OF CHEMICAL PHYSICS VOLUME 120, NUMBER 17 1 MAY 2004 Folding of small proteins using a single continuous potential Seung-Yeon Kim School of Computational Sciences, Korea Institute for Advanced

More information

Protein Folding In Vitro*

Protein Folding In Vitro* Protein Folding In Vitro* Biochemistry 412 February 29, 2008 [*Note: includes computational (in silico) studies] Fersht & Daggett (2002) Cell 108, 573. Some folding-related facts about proteins: Many small,

More information

Identifying the Protein Folding Nucleus Using Molecular Dynamics

Identifying the Protein Folding Nucleus Using Molecular Dynamics doi:10.1006/jmbi.1999.3534 available online at http://www.idealibrary.com on J. Mol. Biol. (2000) 296, 1183±1188 COMMUNICATION Identifying the Protein Folding Nucleus Using Molecular Dynamics Nikolay V.

More information

Scattered Hammond plots reveal second level of site-specific information in protein folding: ( )

Scattered Hammond plots reveal second level of site-specific information in protein folding: ( ) Scattered Hammond plots reveal second level of site-specific information in protein folding: ( ) Linda Hedberg and Mikael Oliveberg* Department of Biochemistry, Umeå University, S-901 87 Umeå, Sweden Edited

More information

Local Interactions Dominate Folding in a Simple Protein Model

Local Interactions Dominate Folding in a Simple Protein Model J. Mol. Biol. (1996) 259, 988 994 Local Interactions Dominate Folding in a Simple Protein Model Ron Unger 1,2 * and John Moult 2 1 Department of Life Sciences Bar-Ilan University Ramat-Gan, 52900, Israel

More information

Simulating Folding of Helical Proteins with Coarse Grained Models

Simulating Folding of Helical Proteins with Coarse Grained Models 366 Progress of Theoretical Physics Supplement No. 138, 2000 Simulating Folding of Helical Proteins with Coarse Grained Models Shoji Takada Department of Chemistry, Kobe University, Kobe 657-8501, Japan

More information

Protein Folding. I. Characteristics of proteins. C α

Protein Folding. I. Characteristics of proteins. C α I. Characteristics of proteins Protein Folding 1. Proteins are one of the most important molecules of life. They perform numerous functions, from storing oxygen in tissues or transporting it in a blood

More information

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding? The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation By Jun Shimada and Eugine Shaknovich Bill Hawse Dr. Bahar Elisa Sandvik and Mehrdad Safavian Outline Background on protein

More information

1 of 31. Nucleation and the transition state of the SH3 domain. Isaac A. Hubner, Katherine A. Edmonds, and Eugene I. Shakhnovich *

1 of 31. Nucleation and the transition state of the SH3 domain. Isaac A. Hubner, Katherine A. Edmonds, and Eugene I. Shakhnovich * 1 of 31 Nucleation and the transition state of the SH3 domain. Isaac A. Hubner, Katherine A. Edmonds, and Eugene I. Shakhnovich * Department of Chemistry and Chemical Biology Harvard University 12 Oxford

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 16 Nov 2002

arxiv:cond-mat/ v1 [cond-mat.soft] 16 Nov 2002 Dependence of folding rates on protein length Mai Suan Li 1, D. K. Klimov 2 and D. Thirumalai 2 1 Institute of Physics, Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland 2 Institute

More information

It is not yet possible to simulate the formation of proteins

It is not yet possible to simulate the formation of proteins Three-helix-bundle protein in a Ramachandran model Anders Irbäck*, Fredrik Sjunnesson, and Stefan Wallin Complex Systems Division, Department of Theoretical Physics, Lund University, Sölvegatan 14A, S-223

More information

It is now well established that proteins are minimally frustrated

It is now well established that proteins are minimally frustrated Domain swapping is a consequence of minimal frustration Sichun Yang*, Samuel S. Cho*, Yaakov Levy*, Margaret S. Cheung*, Herbert Levine*, Peter G. Wolynes*, and José N. Onuchic* *Center for Theoretical

More information

Elucidation of the RNA-folding mechanism at the level of both

Elucidation of the RNA-folding mechanism at the level of both RNA hairpin-folding kinetics Wenbing Zhang and Shi-Jie Chen* Department of Physics and Astronomy and Department of Biochemistry, University of Missouri, Columbia, MO 65211 Edited by Peter G. Wolynes, University

More information

Nucleation and the Transition State of the SH3 Domain

Nucleation and the Transition State of the SH3 Domain doi:10.1016/j.jmb.2005.03.050 J. Mol. Biol. (2005) 349, 424 434 Nucleation and the Transition State of the SH3 Domain Isaac A. Hubner 1, Katherine A. Edmonds 2 and Eugene I. Shakhnovich 1 * 1 Department

More information

PROTEIN FOLDING THEORY: From Lattice to All-Atom Models

PROTEIN FOLDING THEORY: From Lattice to All-Atom Models Annu. Rev. Biophys. Biomol. Struct. 2001. 30:361 96 Copyright c 2001 by Annual Reviews. All rights reserved PROTEIN FOLDING THEORY: From Lattice to All-Atom Models Leonid Mirny and Eugene Shakhnovich Department

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

Simulating disorder order transitions in molecular recognition of unstructured proteins: Where folding meets binding

Simulating disorder order transitions in molecular recognition of unstructured proteins: Where folding meets binding Simulating disorder order transitions in molecular recognition of unstructured proteins: Where folding meets binding Gennady M. Verkhivker*, Djamal Bouzida, Daniel K. Gehlhaar, Paul A. Rejto, Stephan T.

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2004 How do we go from an unfolded polypeptide chain to a compact folded protein? (Folding of thioredoxin, F. Richards) Structure - Function

More information

Solvation in protein folding analysis: Combination of theoretical and experimental approaches

Solvation in protein folding analysis: Combination of theoretical and experimental approaches Solvation in protein folding analysis: Combination of theoretical and experimental approaches A. M. Fernández-Escamilla*, M. S. Cheung, M. C. Vega, M. Wilmanns, J. N. Onuchic, and L. Serrano* *European

More information

Lecture 11: Protein Folding & Stability

Lecture 11: Protein Folding & Stability Structure - Function Protein Folding: What we know Lecture 11: Protein Folding & Stability 1). Amino acid sequence dictates structure. 2). The native structure represents the lowest energy state for a

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2003 Structure - Function Protein Folding: What we know 1). Amino acid sequence dictates structure. 2). The native structure represents

More information

Unfolding CspB by means of biased molecular dynamics

Unfolding CspB by means of biased molecular dynamics Chapter 4 Unfolding CspB by means of biased molecular dynamics 4.1 Introduction Understanding the mechanism of protein folding has been a major challenge for the last twenty years, as pointed out in the

More information

Protein folding. Today s Outline

Protein folding. Today s Outline Protein folding Today s Outline Review of previous sessions Thermodynamics of folding and unfolding Determinants of folding Techniques for measuring folding The folding process The folding problem: Prediction

More information

Predicting free energy landscapes for complexes of double-stranded chain molecules

Predicting free energy landscapes for complexes of double-stranded chain molecules JOURNAL OF CHEMICAL PHYSICS VOLUME 114, NUMBER 9 1 MARCH 2001 Predicting free energy landscapes for complexes of double-stranded chain molecules Wenbing Zhang and Shi-Jie Chen a) Department of Physics

More information

Exploring protein-folding ensembles: A variable-barrier model for the analysis of equilibrium unfolding experiments

Exploring protein-folding ensembles: A variable-barrier model for the analysis of equilibrium unfolding experiments Exploring protein-folding ensembles: A variable-barrier model for the analysis of equilibrium unfolding experiments Victor Muñoz* and Jose M. Sanchez-Ruiz *Department of Chemistry and Biochemistry and

More information

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday. Short Announcements 1 st Quiz today: 15 minutes Homework 3: Due next Wednesday. Next Lecture, on Visualizing Molecular Dynamics (VMD) by Klaus Schulten Today s Lecture: Protein Folding, Misfolding, Aggregation

More information

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model JOURNAL OF CHEMICAL PHYSICS VOLUME 111, NUMBER 8 22 AUGUST 1999 Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model Benjamin Vekhter and R. Stephen

More information

= (-22) = +2kJ /mol

= (-22) = +2kJ /mol Lecture 8: Thermodynamics & Protein Stability Assigned reading in Campbell: Chapter 4.4-4.6 Key Terms: DG = -RT lnk eq = DH - TDS Transition Curve, Melting Curve, Tm DH calculation DS calculation van der

More information

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate

More information

Determination of Barrier Heights and Prefactors from Protein Folding Rate Data

Determination of Barrier Heights and Prefactors from Protein Folding Rate Data 3762 Biophysical Journal Volume 88 June 2005 3762 3769 Determination of Barrier Heights and Prefactors from Protein Folding Rate Data S. S. Plotkin Department of Physics and Astronomy, University of British

More information

Lattice protein models

Lattice protein models Lattice protein models Marc R. Roussel epartment of Chemistry and Biochemistry University of Lethbridge March 5, 2009 1 Model and assumptions The ideas developed in the last few lectures can be applied

More information

Visualizing folding of proteins (1 3) and RNA (2) in terms of

Visualizing folding of proteins (1 3) and RNA (2) in terms of Can energy landscape roughness of proteins and RNA be measured by using mechanical unfolding experiments? Changbong Hyeon and D. Thirumalai Chemical Physics Program, Institute for Physical Science and

More information

arxiv:cond-mat/ v1 2 Feb 94

arxiv:cond-mat/ v1 2 Feb 94 cond-mat/9402010 Properties and Origins of Protein Secondary Structure Nicholas D. Socci (1), William S. Bialek (2), and José Nelson Onuchic (1) (1) Department of Physics, University of California at San

More information

The sequences of naturally occurring proteins are defined by

The sequences of naturally occurring proteins are defined by Protein topology and stability define the space of allowed sequences Patrice Koehl* and Michael Levitt Department of Structural Biology, Fairchild Building, D109, Stanford University, Stanford, CA 94305

More information

Supplementary Information

Supplementary Information 1 Supplementary Information Figure S1 The V=0.5 Harker section of an anomalous difference Patterson map calculated using diffraction data from the NNQQNY crystal at 1.3 Å resolution. The position of the

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 5 May 1998

arxiv:cond-mat/ v1 [cond-mat.soft] 5 May 1998 Linking Rates of Folding in Lattice Models of Proteins with Underlying Thermodynamic Characteristics arxiv:cond-mat/9805061v1 [cond-mat.soft] 5 May 1998 D.K.Klimov and D.Thirumalai Institute for Physical

More information

Effective stochastic dynamics on a protein folding energy landscape

Effective stochastic dynamics on a protein folding energy landscape THE JOURNAL OF CHEMICAL PHYSICS 125, 054910 2006 Effective stochastic dynamics on a protein folding energy landscape Sichun Yang, a José N. Onuchic, b and Herbert Levine c Center for Theoretical Biological

More information

A Path Planning-Based Study of Protein Folding with a Case Study of Hairpin Formation in Protein G and L

A Path Planning-Based Study of Protein Folding with a Case Study of Hairpin Formation in Protein G and L A Path Planning-Based Study of Protein Folding with a Case Study of Hairpin Formation in Protein G and L G. Song, S. Thomas, K.A. Dill, J.M. Scholtz, N.M. Amato Pacific Symposium on Biocomputing 8:240-251(2003)

More information

2.4 DNA structure. S(l) bl + c log l + d, with c 1.8k B. (2.72)

2.4 DNA structure. S(l) bl + c log l + d, with c 1.8k B. (2.72) 2.4 DNA structure DNA molecules come in a wide range of length scales, from roughly 50,000 monomers in a λ-phage, 6 0 9 for human, to 9 0 0 nucleotides in the lily. The latter would be around thirty meters

More information

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key Biology 5357 Chemistry & Physics of Biomolecules Examination #1 Proteins Module September 29, 2017 Answer Key Question 1 (A) (5 points) Structure (b) is more common, as it contains the shorter connection

More information

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions Van der Waals Interactions

More information

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small

More information

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Biophysical Journal, Volume 98 Supporting Material Molecular dynamics simulations of anti-aggregation effect of ibuprofen Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Supplemental

More information

Sparsely populated folding intermediates of the Fyn SH3 domain: Matching native-centric essential dynamics and experiment

Sparsely populated folding intermediates of the Fyn SH3 domain: Matching native-centric essential dynamics and experiment Sparsely populated folding intermediates of the Fyn SH3 domain: Matching native-centric essential dynamics and experiment Jason E. Ollerenshaw*, Hüseyin Kaya, Hue Sun Chan, and Lewis E. Kay* Departments

More information

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila 1 Guessing the upper bound free-energy difference between native-like structures Jorge A. Vila IMASL-CONICET, Universidad Nacional de San Luis, Ejército de Los Andes 950, 5700- San Luis, Argentina Use

More information

Simple Models of Protein Folding

Simple Models of Protein Folding Simple Models of Protein Folding Andrew Blanchard May 11, 2012 Abstract Lattice models with simplified interaction potentials have been used to analyze the ability of certain amino acid sequences to adopt

More information

PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE

PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE O.B. PTITSYN National Cancer Institute, NIH, Laboratory of Experimental & Computational Biology, Molecular

More information

Evolution of functionality in lattice proteins

Evolution of functionality in lattice proteins Evolution of functionality in lattice proteins Paul D. Williams,* David D. Pollock, and Richard A. Goldstein* *Department of Chemistry, University of Michigan, Ann Arbor, MI, USA Department of Biological

More information

Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of -Strand Motifs

Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of -Strand Motifs Biophysical Journal Volume 83 August 2002 819 835 819 Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of -Strand Motifs Hyunbum Jang,* Carol K. Hall,* and Yaoqi Zhou *Department of

More information

Modeling protein folding: the beauty and power of simplicity Eugene I Shakhnovich

Modeling protein folding: the beauty and power of simplicity Eugene I Shakhnovich R50 Review Modeling protein folding: the beauty and power of simplicity Eugene I Shakhnovich It is argued that simplified models capture key features of protein stability and folding, whereas more detailed

More information

S(l) bl + c log l + d, with c 1.8k B. (2.71)

S(l) bl + c log l + d, with c 1.8k B. (2.71) 2.4 DNA structure DNA molecules come in a wide range of length scales, from roughly 50,000 monomers in a λ-phage, 6 0 9 for human, to 9 0 0 nucleotides in the lily. The latter would be around thirty meters

More information

Lecture 34 Protein Unfolding Thermodynamics

Lecture 34 Protein Unfolding Thermodynamics Physical Principles in Biology Biology 3550 Fall 2018 Lecture 34 Protein Unfolding Thermodynamics Wednesday, 21 November c David P. Goldenberg University of Utah goldenberg@biology.utah.edu Clicker Question

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

arxiv:q-bio/ v1 [q-bio.bm] 30 May 2006

arxiv:q-bio/ v1 [q-bio.bm] 30 May 2006 Transition States in Protein Folding Kinetics: The Structural Interpretation of Φ-values arxiv:q-bio/0605048v1 [q-bio.bm] 30 May 2006 Abstract Thomas R. Weikl 1 and Ken A. Dill 2 1 Max Planck Institute

More information

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University Thermodynamics Entropy and its Applications Lecture 11 NC State University System and surroundings Up to this point we have considered the system, but we have not concerned ourselves with the relationship

More information

FOCUS: HYDROGEN EXCHANGE AND COVALENT MODIFICATION

FOCUS: HYDROGEN EXCHANGE AND COVALENT MODIFICATION FOCUS: HYDROGEN EXCHANGE AND COVALENT MODIFICATION Accuracy of SUPREX (Stability of Unpurified Proteins from Rates of H/D Exchange) and MALDI Mass Spectrometry-Derived Protein Unfolding Free Energies Determined

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

BCHS 6229 Protein Structure and Function. Lecture 3 (October 18, 2011) Protein Folding: Forces, Mechanisms & Characterization

BCHS 6229 Protein Structure and Function. Lecture 3 (October 18, 2011) Protein Folding: Forces, Mechanisms & Characterization BCHS 6229 Protein Structure and Function Lecture 3 (October 18, 2011) Protein Folding: Forces, Mechanisms & Characterization 1 The folding problem One of the greatest unsolved problems of Science The folding

More information

Hidden symmetries in primary sequences of small α proteins

Hidden symmetries in primary sequences of small α proteins Hidden symmetries in primary sequences of small α proteins Ruizhen Xu, Yanzhao Huang, Mingfen Li, Hanlin Chen, and Yi Xiao * Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University

More information

Intermediates and the folding of proteins L and G

Intermediates and the folding of proteins L and G Intermediates and the folding of proteins L and G SCOTT BROWN 1 AND TERESA HEAD-GORDON Department of Bioengineering, University of California (UC), Berkeley, Berkeley, California 94720-1762, USA (RECEIVED

More information

Targeted Molecular Dynamics Simulations of Protein Unfolding

Targeted Molecular Dynamics Simulations of Protein Unfolding J. Phys. Chem. B 2000, 104, 4511-4518 4511 Targeted Molecular Dynamics Simulations of Protein Unfolding Philippe Ferrara, Joannis Apostolakis, and Amedeo Caflisch* Department of Biochemistry, UniVersity

More information

A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics

A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics JOURNAL OF CHEMICAL PHYSICS VOLUME 115, NUMBER 3 15 JULY 2001 A new combination of replica exchange Monte Carlo and histogram analysis for protein folding and thermodynamics Dominik Gront Department of

More information

Relationship between the Native-State Hydrogen Exchange and Folding Pathways of a Four-Helix Bundle Protein

Relationship between the Native-State Hydrogen Exchange and Folding Pathways of a Four-Helix Bundle Protein 7998 Biochemistry 2002, 41, 7998-8003 Relationship between the Native-State Hydrogen Exchange and Folding Pathways of a Four-Helix Bundle Protein Ruiai Chu, Wuhong Pei, Jiro Takei, and Yawen Bai* Laboratory

More information

letters Transition states and the meaning of Φ-values in protein folding kinetics S. Banu Ozkan 1,2, Ivet Bahar 3 and Ken A.

letters Transition states and the meaning of Φ-values in protein folding kinetics S. Banu Ozkan 1,2, Ivet Bahar 3 and Ken A. formed protein DNA complex (3 µl) was mixed with 3 µl of reservoir solution containing 15% (w/v) PEG 4000, 100 mm 2-(N- Morpholino)ethanesulfonic acid (MES), ph 6.0, 100 mm NH 4 H 2 PO 4 and 15% (v/v)

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 23 Mar 2007

arxiv:cond-mat/ v1 [cond-mat.soft] 23 Mar 2007 The structure of the free energy surface of coarse-grained off-lattice protein models arxiv:cond-mat/0703606v1 [cond-mat.soft] 23 Mar 2007 Ethem Aktürk and Handan Arkin Hacettepe University, Department

More information

Packing of Secondary Structures

Packing of Secondary Structures 7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary

More information

Random heteropolymer adsorption on disordered multifunctional surfaces: Effect of specific intersegment interactions

Random heteropolymer adsorption on disordered multifunctional surfaces: Effect of specific intersegment interactions JOURNAL OF CHEMICAL PHYSICS VOLUME 109, NUMBER 15 15 OCTOBER 1998 Random heteropolymer adsorption on disordered multifunctional surfaces: Effect of specific intersegment interactions Simcha Srebnik Department

More information

A molecular dynamics investigation of the kinetic bottlenecks of the hpin1 WW domain. II: simulations with the Go model

A molecular dynamics investigation of the kinetic bottlenecks of the hpin1 WW domain. II: simulations with the Go model Proceedings of the 2006 WSEAS International Conference on Mathematical Biology and Ecology, Miami, Florida, USA, January 18-20, 2006 (pp31-36) A molecular dynamics investigation of the kinetic bottlenecks

More information

PHYSICAL REVIEW LETTERS

PHYSICAL REVIEW LETTERS PHYSICAL REVIEW LETTERS VOLUME 86 28 MAY 21 NUMBER 22 Mathematical Analysis of Coupled Parallel Simulations Michael R. Shirts and Vijay S. Pande Department of Chemistry, Stanford University, Stanford,

More information

Supersecondary Structures (structural motifs)

Supersecondary Structures (structural motifs) Supersecondary Structures (structural motifs) Various Sources Slide 1 Supersecondary Structures (Motifs) Supersecondary Structures (Motifs): : Combinations of secondary structures in specific geometric

More information

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Taking the advice of Lord Kelvin, the Father of Thermodynamics, I describe the protein molecule and other

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Nature of the transition state ensemble for protein folding

Nature of the transition state ensemble for protein folding Nature of the transition state ensemble for protein folding N. H. Putnam, V.S. Pande, D.S. Rokhsar he ability of a protein to fold rapidly to its unique native state from any of its possible unfolded conformations

More information

How protein thermodynamics and folding mechanisms are altered by the chaperonin cage: Molecular simulations

How protein thermodynamics and folding mechanisms are altered by the chaperonin cage: Molecular simulations How protein thermodynamics and folding mechanisms are altered by the chaperonin cage: Molecular simulations Fumiko Takagi*, Nobuyasu Koga, and Shoji Takada* *PRESTO, Japan Science and Technology Corporation,

More information

Paul Sigler et al, 1998.

Paul Sigler et al, 1998. Biological systems are necessarily metastable. They are created, modulated, and destroyed according to a temporal plan that meets the survival needs of the cell, organism, and species...clearly, no biological

More information

arxiv: v1 [cond-mat.soft] 22 Oct 2007

arxiv: v1 [cond-mat.soft] 22 Oct 2007 Conformational Transitions of Heteropolymers arxiv:0710.4095v1 [cond-mat.soft] 22 Oct 2007 Michael Bachmann and Wolfhard Janke Institut für Theoretische Physik, Universität Leipzig, Augustusplatz 10/11,

More information

Protein folding can be described by using a free energy

Protein folding can be described by using a free energy The folding energy landscape of apoflavodoxin is rugged: Hydrogen exchange reveals nonproductive misfolded intermediates Yves J. M. Bollen*, Monique B. Kamphuis, and Carlo P. M. van Mierlo *Department

More information

Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials

Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials THE JOURNAL OF CHEMICAL PHYSICS 122, 024904 2005 Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials Alan E. van Giessen and John E. Straub Department

More information

The starting point for folding proteins on a computer is to

The starting point for folding proteins on a computer is to A statistical mechanical method to optimize energy functions for protein folding Ugo Bastolla*, Michele Vendruscolo, and Ernst-Walter Knapp* *Freie Universität Berlin, Department of Biology, Chemistry

More information

Biotechnology of Proteins. The Source of Stability in Proteins (III) Fall 2015

Biotechnology of Proteins. The Source of Stability in Proteins (III) Fall 2015 Biotechnology of Proteins The Source of Stability in Proteins (III) Fall 2015 Conformational Entropy of Unfolding It is The factor that makes the greatest contribution to stabilization of the unfolded

More information

The topomer search model: A simple, quantitative theory of two-state protein folding kinetics

The topomer search model: A simple, quantitative theory of two-state protein folding kinetics REVIEW The topomer search model: A simple, quantitative theory of two-state protein folding kinetics DMITRII E. MAKAROV 1 AND KEVIN W. PLAXCO 2 1 Department of Chemistry and Biochemistry and Institute

More information

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur Lecture - 06 Protein Structure IV We complete our discussion on Protein Structures today. And just to recap

More information

BME Engineering Molecular Cell Biology. Structure and Dynamics of Cellular Molecules. Basics of Cell Biology Literature Reading

BME Engineering Molecular Cell Biology. Structure and Dynamics of Cellular Molecules. Basics of Cell Biology Literature Reading BME 42-620 Engineering Molecular Cell Biology Lecture 05: Structure and Dynamics of Cellular Molecules Basics of Cell Biology Literature Reading BME42-620 Lecture 05, September 13, 2011 1 Outline Review:

More information

Coil to Globule Transition: This follows Giant Molecules by Alexander Yu. Grosberg and Alexei R. Khokhlov (1997).

Coil to Globule Transition: This follows Giant Molecules by Alexander Yu. Grosberg and Alexei R. Khokhlov (1997). Coil to Globule Transition: This follows Giant Molecules by Alexander Yu. Grosberg and Alexei R. Khokhlov (1997). The Flory Krigbaum expression for the free energy of a self-avoiding chain is given by,

More information

Convergence of replica exchange molecular dynamics

Convergence of replica exchange molecular dynamics THE JOURNAL OF CHEMICAL PHYSICS 123, 154105 2005 Convergence of replica exchange molecular dynamics Wei Zhang and Chun Wu Department of Chemistry and Biochemistry, University of Delaware, Newark, Delaware

More information

Secondary structure stability, beta-sheet formation & stability

Secondary structure stability, beta-sheet formation & stability Protein Physics 2016 Lecture 6, February 5 Secondary structure stability, beta-sheet formation & stability Magnus Andersson magnus.andersson@scilifelab.se Theoretical & Computational Biophysics Recap of

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information