The topomer search model: A simple, quantitative theory of two-state protein folding kinetics

Size: px
Start display at page:

Download "The topomer search model: A simple, quantitative theory of two-state protein folding kinetics"

Transcription

1 REVIEW The topomer search model: A simple, quantitative theory of two-state protein folding kinetics DMITRII E. MAKAROV 1 AND KEVIN W. PLAXCO 2 1 Department of Chemistry and Biochemistry and Institute for Theoretical Chemistry, University of Texas at Austin, Austin, Texas 78712, USA 2 Department of Chemistry and Biochemistry and Interdepartmental Program in Biomolecular Science and Engineering, University of California, Santa Barbara, California 93106, USA (RECEIVED June 20, 2002; FINAL REVISION September 24, 2002; ACCEPTED October 3, 2002) Abstract Most small, single-domain proteins fold with the uncomplicated, single-exponential kinetics expected for diffusion on a smooth energy landscape. Despite this energetic smoothness, the folding rates of these two-state proteins span a remarkable million-fold range. Here, we review the evidence in favor of a simple, mechanistic description, the topomer search model, which quantitatively accounts for the broad scope of observed two-state folding rates. The model, which stipulates that the search for those unfolded conformations with a grossly correct topology is the rate-limiting step in folding, fits observed rates with a correlation coefficient of 0.9 using just two free parameters. The fitted values of these parameters, the pre-exponential attempt frequency and a measure of the difficulty of ordering an unfolded chain, are consistent with previously reported experimental constraints. These results suggest that the topomer search process may dominate the relative barrier heights of two-state protein-folding reactions. Keywords: Contact order; diffusion-collision; nucleation The folding kinetics of most simple, single-domain proteins is well fitted as a single-exponential, two-state process (Jackson and Fersht 1991; Guijarro et al. 1998; Jackson 1998; Plaxco et al. 1999), even at the lowest temperatures accessible to experiment (Gillespie and Plaxco 2000). This observation confirms that the rapid, biologically relevant folding rates that distinguish naturally occurring proteins are associated with a smooth energy landscape lacking both significant discrete traps (well-populated intermediates and misfolded states) and fine-scale heterogeneous roughness (Bryngelson and Wolynes 1987; Bryngelson et al. 1995; Onuchic et al. 1997). In the absence of these complications, folding is free to progress unimpeded to the native state with the greatest possible speed (Dill and Chan 1997; Dobson et Reprint requests to: Kevin W. Plaxco, Department of Chemistry and Biochemistry and Interdepartmental Program in Biomolecular Science and Engineering, University of California, Santa Barbara, CA 93106, USA; kwp@chem.ucsb.edu; fax: (805) Article and publication are at /ps al. 1998; Dinner et al. 2000). But this observation begs the question, if the folding energy landscapes of two-state proteins are generally smooth, why do some simple proteins fold a million times more rapidly than others (van Nuland et al. 1998; Wittung-Stafshede et al. 1999)? Here, we describe a simple, near-first principles model that quantitatively accounts for this well-established experimental observation. Although evolution can presumably smooth the energy landscape arbitrarily, there is one aspect of protein chemistry that selective pressures cannot optimize, namely, a polypeptide is a covalent chain that cannot cross through itself. An unavoidable consequence of this connectivity is that the rate with which unfolded polypeptides diffuse between distinct topologies is limited, and thus, even imaginary proteins with perfectly smooth energy landscapes will exhibit varying folding rates due to topological frustration (Clementi et al. 2000) and the difficulty of diffusing into the correct, native topology. Consistent with this hypothesis, by the mid- to late nineties, numerous authors had suggested the search for the correct gross topology may be an important contributor to the folding barrier (Sosnick et al. 1994; Gross Protein Science (2003), 12: Published by Cold Spring Harbor Laboratory Press. Copyright 2003 The Protein Society 17

2 Makarov and Plaxco 1996; Sosnick et al. 1996; Guo et al. 1997; Kolinski et al. 1998; Sheinerman and Brooks 1998; Socci et al. 1998; Bergasa-Caceres et al. 1999; Debe et al. 1999; Shea et al. 1999). In 1998, we serendipitously discovered that a simple, empirical measure of topological complexity is highly correlated with the experimentally observed folding rates of two-state proteins (Plaxco et al. 1998, 2000). The measure of topology in question, termed relative contact order, is simply the average sequence separation between all pairs of residues in contact in the native structure relative to the total length of the protein. The surprising strength of this correlation [r 0.9; all correlation coefficients in this review are for linearized equations. This correlation coefficent, r, thus reflects the significance of the linear relationship between log (k f ) and contact order. The square of the correlation coefficient, r 2, is a measure of the fraction of all of the variance in the data set that is captured by the model.] demonstrates that, against the background of the smooth energy landscapes of two-state proteins, this perhaps naïve measure captures in excess of 3/4 of the variance in reported (log) folding rates. Whereas the contact order-rate relationship hints at the mechanistic underpinnings of the folding reaction, the measure has not lent itself to any simple, quantitative reconciliation with first principles models of the process. For example, because contact order is related to the sequence separation between contacting residues, it has been suggested that it relates to the entropic cost of the loop closures required to surmount the rate-limiting step in folding (Plaxco et al. 1998; Alm and Baker 1999a; Galzitskaya and Finkelstein 1999; Fersht 2000). Unfortunately, however, loop closure entropy is proportional to the logarithm of loop length rather than loop length per se (Jacobson and Stockmayer 1950) and the average log (separation) between contacting residues is more poorly correlated with rates than is contact order as originally defined (K.W. Plaxco, unpubl.). Similarly, relative contact order (the average contact separation in terms of fraction of total peptide length) predicts rates significantly more accurately than absolute measures of the average sequence separation of contacting residues (Grantcharova et al. 2001; Ivankov et al. 2002). This produces the counterintuitive result that, of two proteins with the same average contact separation, the longer protein folds faster. Observations such as these lead inevitably to the possibility that contact order predicts rates, not because it is directly related to the underlying mechanism of folding, but because it is a proxy for some other, physically more reasonable parameter. Consistent with this suggestion, a number of additional, empirical measures of topology correlate approximately equally well with folding rates. These include the number of sequence-distant contacts per residue (Gromiha and Selvaraj 2001), the fraction of contacts that are sequence distant (Mirny and Shakhnovich 2001), and the total contact distance (Zhou and Zhou 2002). Motivated by the quantitative dependence of kinetics on topology, several groups have attempted to define mechanistic models of folding that predict rates with accuracy equal to or surpassing that of these empirical relationships. One approach is based on calculating the loop-entropy cost of sequentially creating the stabilizing interactions that define the native state (Alm and Baker 1999b; Muñoz and Eaton 1999; Grantcharova et al. 2001; Ivankov and Finkelstein 2001). Whereas these models have achieved real success in predicting folding kinetics, their relative complexity and slightly poorer correlation with experiment again emphasizes the question of whether the entropic cost of specific loop closures really underlies the perhaps deceptively simple relationship between a protein s topology and the rate with which it folds. The topomer search model The topomer search model provides a simple, alternative explanation for the topology-rate relationship. This model postulates that relative barrier heights are dominated by the diffusive search for the set of unfolded conformations that share a common, global topology with the native state (i.e., are in the native topomer; Debe et al. 1999), and that once this is achieved, the rate-limiting step has been surmounted and specific native contacts rapidly zipper to form the fully folded protein (Fig. 1). The model implies that the various empirical, topological metrics correlate with rates because they correlate with the probability of the unfolded chain diffusing into this native topomer. Here, we review the simple, quantitative arguments in support of the topomer search model of two-state folding. Why might the topomer search process be the dominant contributor to the folding barrier? A potential answer to this question arises as a consequence of two experimental observations. The first is that the formation of helices, hairpins, loops, and other local structures is orders of magnitude more rapid than the rate-limiting step in folding (Hagen et al. 1996; Muñoz et al. 1997; Thompson et al. 1997; Bieri et al. 1999; Eaton et al. 2000; Lapidus et al. 2000). The second is that the folding free energy of such isolated structural elements and of almost all of the partially folded and misfolded states of single domain proteins, are near or above zero (for review, see Flanagan et al. 1992; Ladurner et al. 1997; Camarero et al. 2001). Because local zippering is rapid, all of the conformers in the denatured ensemble will rapidly sample sequence-local elements of native structure (Fig. 1; transition B to A). For the vast majority of unfolded conformations, however, this zippering will stall when it reaches a point at which the formation of additional native structure would require massive, potentially slow rearrangement of the polypeptide chain (Fig. 1, state A). Because this partially folded state is unstable, it ruptures at least as rapidly as the rate with which it formed (Fig. 1, transition A to 18 Protein Science, vol. 12

3 The topomer search model Figure 1. The essence of the topomer search model is that the rate with which an unfolded polymer diffuses between distinct topologies is much slower than the rate with which local structural elements zipper (and, critically, unzip). It is well established that the formation of helices, loops, and other sequence-local structural elements (B to A transition) is significantly faster than the rate-limiting step in folding. Because the free energy of these partially folded states (A) is almost invariably 0 for two-state proteins, their disruption (A to B transition) is more rapid still. Given these constraints, the rate-limiting step in folding might be the slow, large-scale diffusion to find the set of conformations (C) close enough to the native topology that they can zipper deeply into the stable, native well (E) without requiring slow, large-scale topological rearrangements. Central to this argument is the suggestion that, whereas the formation of specific, native-like interactions may be necessary in order to surmount the rate-limiting step (D), they are neither sufficient (A) nor the dominant determinant of relative barrier heights. Here, we review the quantitative, experimental evidence in support of this model of two-state folding. B), liberating the chain to diffuse into a new topology (Fig. 1, transition B to C). Only if the new topology is grossly similar to the native topology (i.e., is in the native topomer; Fig. 1, state C), can the zippering of local structure proceed deep into the native well and the chain become trapped (Fig. 1, transition C to E). The observation of strong correlation between topology and rates suggests that the slow, diffusive search for unfolded conformations that can zipper directly into the native well (i.e., that are in the native topomer) is the dominant contributor to the folding barrier. More precisely, we have suspected that contact order predicts rates, because it correlates with the probability of a random, diffusive search finding the native topomer (Gillespie and Plaxco 2000; Millet et al. 2002). The demonstration that contact order or any of the many related topological parameters correlates with the probability of finding the native topomer would provide critical support for this hypothesis. On first inspection, however, one might think that determining the probability of a chain diffusing into a given topomer is an overwhelmingly complex exercise in conditional probabilities (Fig. 2A); the probability of bringing a given residue pair into proximity may depend acutely on which other pairs are already ordered (Chan and Dill 1990). The critical question is whether a simple mathematical description exists that reasonably approximates this complex set of conditional probabilities and accurately predicts the probability of achieving the native topomer. To date, two groups have attempted the search for a simple description of the probability of finding the native topomer. The first, Debe and Goddard, used rather detailed simulations to estimate numerically the number of distinct topologies available to an unfolded polypeptide chain (Debe et al. 1999), and arrived at a first principles model that accurately predicts (r ), the folding rates of the non-helical two-state proteins (Debe and Goddard 1999). However, this model fails to predict the folding rates of predominantly helical, two-state proteins. More recently, we have described a rather simpler and still more general version of the topomer search model that accurately predicts 19

4 Makarov and Plaxco the folding rates of all classes of two-state proteins (Makarov et al. 2002). Our model stems from simulations of the properties of inert, Gaussian chains. These simulations demonstrate that, due to two simplifying effects, a straightforward approximation describes the probability of a random-coil polymer adopting a given gross topology. The first simplifying effect is that, because the probability of sequence-neighboring residues being in proximity is high (their locations are highly correlated) the probability of forming the native topomer is dominated by pairs of residues that are distant in the sequence. Thus, sequence-local interactions contribute little to the probability of being in the native topomer. The second is that the probability of ordering the chain is well described by a mean-field approximation. That is, once a sufficient number of sequence-distant pairs of residues are brought into proximity, the remaining ordering events become independent of the precise nature of the pre-existing order (i.e., become independent of one another), and the probability of each of these orderings becomes approximately constant. The nature of this approximation can be understood in qualitative terms by considering the ordering of a native pair in a chain with a significant amount of pre-existing, native-like order (Fig. 2B). In such a situation, it is plausible that the entropic cost of bringing such an additional native pair into proximity to form a bundle of residues in the native topomer could be described in terms of the bulk characteristics of this bundle instead of its precise structure (Flory 1956; Gutin and Shakhnovich 1994; Plotkin et al. 1996; Shoemaker and Wolynes 1999). The simplest approximation of the probability of forming the native topomer would then be to replace the unique probability of ordering each specific pair by the average probability of ordering all pairs. Numerical simulations of the exactly solvable Gaussian chain model provide quantitative support for this qualitative argument (Makarov and Metiu 2002; Makarov et al. 2002). If, as suggested by Gaussian chain simulations, the probability of bringing each additional sequence-distant pair into proximity in the unfolded state is constant, then the probability that the unfolded polypeptide is in a given topomer, P(Q D ), is given by Figure 2. (A) The probability of bringing a sequence-distant native pair into proximity depends on the set of other pairs that are already in proximity. (B) Gaussian chain simulations indicate, however, that once a few ( 3) sequence-distant pairs are in proximity, the probability of forming any additional pair is well approximated as a constant (probability CD probability EF). This, in turn, suggests that the probability of achieving a given topomer is approximately exponentially related to Q D, the number of sequence-distant pairs of residues that must be brought into proximity in order to define it. P Q D = K Q D (1) in which Q D is the number of sequence-distant pairs whose proximity defines the topomer, <K> is the average equilibrium constant for residue pairs being in proximity (and is less than unity) and is a proportionality constant. We note that this probability is proportional to <K> Q D rather than equal to it; this is because, as suggested by the Gaussian chain studies, the additional entropic cost associated with the formation of the first few ordered pairs results in a prefactor that is less than unity (Makarov et al. 2002). Because of this, Equation 1 is only a valid approximation when Q D is sufficiently large (in practice greater than 3). We also note that <K> may depend generally on the length of the chain; that is, whereas P(Q D ) has approximately an exponential dependence on the number of sequence-distant pairs that must be brought into proximity, this dependence may be different for different chain lengths. For the present, we will ignore the length dependence of <K>, and will return to these considerations later in the review. The topomer search model predicts that the rate-limiting step in two-state folding is the formation of a conformation in which every residue is roughly in proximity to the residues that it contacts in the native state. We thus have the prediction that, by analogy to transition state theory, folding rates (k f ) should scale approximately as k f Q D K Q D (2) in which Q D is the attempt frequency (proportional to Q D due to the Q D possible pairs of native residues that can be ordered), and <K> Q D exp( G /k B T) is the equilibrium constant for the formation of the native topomer. This relationship is reminiscent of the contact order-rate relationship (k f exponentially related to contact order). Nevertheless, the physical meaning of Q D the number of sequence-distant native pairings that define the native topomer differs fundamentally from that of the earlier, entirely empirical measure. Testing the topomer search model The prediction that folding rates relate to Q D provides a means of testing the topomer search model. To perform this 20 Protein Science, vol. 12

5 The topomer search model test, however, we must define Q D in terms of experimental observables. This is readily performed if we assume that any pair of sequence-distant residues (separated by more than l c residues) that are in contact in the native state (i.e., within a cutoff distance, r c ) must be in proximity to form the native topomer. The precise values of r c and l c, however, are not well constrained by the model. Typical choices are for r c to reflect pairs of C atoms the model is independent of specific chemical interactions and thus ignores side chains that approach to within 6 Å 8 Å in the native state and for l c in the range of 4 12 residues [i.e., times reported persistence lengths (Schwalbe et al. 1997; Penkett et al. 1998)]. Fortunately, the topomer search model is rather insensitive to the precise details of how these native pairs (and thus how Q D ) are defined; the range of Q D that correspond to this wide range of parameters are all strongly correlated with one another, and critically, with experimentally observed folding rates. For example, if l c 12 residues and r c 6 Å, we obtain the statistically significant (r 0.88), predictive correlation illustrated in Figure 3. Thus, this simple model captures in excess of three-fourths of the variance in our kinetic data set using only two fitted parameters (<K> and the product ). In addition to successfully predicting folding rates, the topomer search model also successfully predicts when it will fail to predict folding rates. For example, Equation 2 is only a valid approximation if more than approximately three sequence-distant native pairs have been brought into proximity (Fig. 2); if fewer than four native pairs are ordered, the mean-field approximation breaks down. If a protein adopts a very simple native topology (i.e., a sequence-local topology for which Q D < 4), the behavior of Gaussian chains suggests that it should fold more rapidly than would be expected based on the naive application of this approximation. Three such proteins the engrailed homeodomain, protein A, and the villin headpiece (Mayor et al. 2000; Myers and Oas 2001; D. Raleigh, pers. comm.) have been characterized, and consistent with this prediction, all fold at least an order of magnitude more rapidly than simple, topology-based calculations would suggest (for review, see Islam et al. 2002). This limitation, however, affects only two-state proteins for which Q D 3. Within the remaining set of two-state proteins, Equation 2 is successful in quantitatively predicting folding rates. The model parameters The fitted parameters in the topomer search model are physically reasonable. The relationship between Q D and folding rates stems from first principles arguments that allow us to assign meaning to the slope and intercept of the relationship and to test their validity experimentally. The value of these fitted parameters depends only weakly on how Q D is defined, and the range of these parameters suggested by the model are consistent with a number of experimental and simulations-based studies of folding and the denatured ensemble. Despite the potentially significant approximation that all two-state folding reactions exhibit the same irrespective of, for example, chain length (Portman et al. 2001; Kaya and Chan 2002), the pre-exponential produced by the model is physically reasonable. We base this assertion on the following first-principles argument. The attempt frequency, Q D, is the rate of moving residue pairs into or out of proximity (Makarov et al. 2002). Assuming this is a purely entropic event, is the rate with which sequence-distant pairs diffuse apart and is given by (Szabo et al. 1980) = 3D d 2 (3) Figure 3. The topomer search model is highly correlated with observed folding rates (k f ) and the number of sequence-distant native pairs (Q D ). The observed correlation [across a previously established database of simple, single domain proteins (Plaxco et al. 2000)] is excellent, r 0.88, and indicates that this simple model captures > 3 of the variance in (log) 4 observed two-state folding rates. Illustrated is the fit to a linear version of Equation 2, log (k f /Q D ) log ( ) +Q D logk with fit parameters 3800 s 1 and <K> Q D is determined as described in the text. in which D cm 2 /s is the loop-closure diffusion coefficient (Hagen et al. 1997) and d is the characteristic distance at which a residue pair is no longer in sufficient proximity to rapidly zipper. Whereas the precise value of d is unclear, it must lie between 6 Å and 24 Å (respectively, the typical distance between residues in physical contact and the typical dimensions of a single domain protein). Across this range of d, 10 8 s 1, and as the fitted value of is 3800 s 1 (Fig. 3), Because arises due to the extra entropy associated with the first few order- 21

6 Makarov and Plaxco ing events, this suggests an additional Rln 85 J/mole.K can be assigned to this step in the topomer search process. Consistent with the arguments presented above (that the mean-field approximation becomes valid after 3 sequencedistant pairs have been ordered), this is comparable with approximately three times the entropic cost of closing a typical residue loop (Poland and Scheraga 1965). It is more difficult to ascertain whether the value of <K> is reasonable. The value obtained from fitting experimental folding rates is , depending on the choice of l c and r c. These values imply that, once more than approximately three sequence distant native pairs have been brought into proximity (and Equation 2 becomes a valid approximation), any remaining native pairs have an 45% chance (corresponding to an equilibrium constant of ) of being in proximity in the unfolded molecule. Whereas this may suggest that the unfolded state is relatively well-ordered, an important consideration is that the model defines proximity as any orientation in which elements can collide to form native contacts more rapidly than the rate-limiting step in folding. As the rate-limiting step in folding is orders of magnitude slower than the rate of loop closure, proximity need not imply that two residues are particularly close in space. Indeed, this is precisely how the topomer search model solves Levinthal s paradox; whereas the number of conformations in the native topomer is small relative to the total number of conformations available in the unfolded ensemble, it is enormously larger than unity. Because of this, the entropic cost of finding the native topomer may be reasonable even in the absence of native-like interactions that may favor this set of conformations. That said, recent experimental (Hodsdon and Frieden 2001; Plaxco and Gross 2001; Shortle and Ackerman 2001; Baldwin 2002; Klein- Seetharaman et al. 2002) and simulations-based (Choy and Forman-Kay 2000; Zagrovic et al. 2002) reports of residual long-range order in the equilibrium denatured state are consistent with the seemingly high value of <K>. If, as suggested by these studies, the denatured state adopts a nativelike topology, then it is perhaps not surprising that any given sequence distant native pair has, on average, a 45% chance of being in proximity. With these considerations, we now have all of the elements required to draw a complete picture of the topomer search model of two-state protein folding. The fitted value of <K> argues that, once the first few sequence-distant native pairs are in proximity, about half of the remaining sequence-distant pairs are likely to be in the correct topomeric state. That is, they are in sufficient proximity that they rapidly sample (and because of the relative instability of partially folded states, unsample ) their native interactions. The pre-exponential suggests that these correctly oriented elements will be rapidly fluctuating out of and incorrectly oriented elements back into the correct topomeric state. The rate-limiting step in folding is then the set of random fluctuations that simultaneously brings every element in the chain into the native topomer. Once this is achieved, the rate-limiting step is surmounted and specific native contacts rapidly and productively zipper to form the fully folded protein. The relationship between rates and contact order The relationship between rates and contact order thus appears to arise indirectly. That is, the behavior of Gaussian chains suggests that Q D defines the probability of achieving the native topomer and thus defines folding rates and that contact order predicts rates not because it is related to the folding mechanism per se but because it is a proxy for Q D. Although a strong correlation between contact order and Q D for most proteins renders it difficult to prove this hypothesis directly, recent counterexamples provide significant evidence in support of it. For example, circular permutation of the S6 domain allows us to distinguish between the two parameters, whereas permutation significantly alters the protein s contact order, it does not significantly alter Q D (Miller et al. 2002). Consistent with the predictions of the topomer search model, it has been reported recently that these permutations do not significantly alter folding rates (Lindberg et al. 2001). Similarly, the covalent circularization of a protein should significantly alter its contact order (presumably, one counts the shortest covalent path between contacting residues), leading to orders of magnitude rate accelerations. The topomer search model, in contrast, predicts relatively small rate accelerations, circularization preorders only one sequence-distant native pair. This will reduce the entropic cost of the first few ordering events by, at most, about one-third, increasing and thus folding rates by no more than a factor of 10. Consistent with this prediction, the relevant, reported circularizations produce only three- to sevenfold rate enhancements (Otzen and Fersht 1998; Grantcharova and Baker 2001; Camarero et al. 2001). Native interactions and the topomer search The topomer search model ignores the contributions of native-like interactions to the rate-limiting step in folding, obviously a potentially significant omission. For example, the strong, perfectly exponential denaturant dependencies of folding rates demonstrate that the folding transition state contains interactions similar to those that stabilize the native state (Plaxco et al. 2000). This suggestion is further supported by reports that native-state stability is an important determinant of the relative folding rates of topologically similar proteins (Guijarro et al. 1998; Clarke et al. 1999). Moreover, exhaustive mutagenasis studies (termed -value analysis) have firmly established that many side chains are in near-native environments during the rate-limiting step in folding (for review, see Fersht 1997). It is thus abundantly 22 Protein Science, vol. 12

7 The topomer search model clear that, in addition to the topomer search process, the formation of specific, native interactions also contributes to the relative free energy of the folding transition state. However, despite its studied lack of specific, nucleating interactions, it ignores all chemistry the topomer search model captures three-fourths of the variance in the log of relative two-state folding rates. This suggests that, although specific, native-like interactions are an obligatory feature of the folding transition state (Fig. 1, C to D transition), these interactions are neither sufficient to ensure folding nor the dominant determinant of relative barrier heights. Of course, the topomer search model need not completely ignore the energetically favorable interactions that may exist in the folding transition state; they are spun into the factor <K> (Makarov et al. 2002). That is, any stabilizing interactions that bias the chain toward the native geometry will increase the average probability of a native-like orientation of structural elements. As noted above, however, the observed value of <K> may be reasonable even in the absence of significant stabilizing interactions simply because proximity only implies close enough to collide more rapidly than the rate-limiting step. As the rate-limiting step in folding is slow (relative to loop closure rates), close enough may, in reality, be rather distant, and thus, energetically favorable interactions are not necessarily required to generate <K> and the rapid folding rates this produces. Relationship to previous folding models The topomer search model unifies several previous models of protein-folding kinetics. For example, the topomer search model is grounded in the energy landscape picture of protein folding (Bryngelson and Wolynes 1987; Bryngelson et al. 1995; Dill and Chan 1997; Onuchic et al. 1997; Dobson et al. 1998; Dinner et al. 2000); it is precisely because the energy landscapes of two-state proteins are exceedingly smooth that the topomer search, rather than diffusion over a rough landscape or escape from discrete traps, defines the folding barrier (Sosnick et al. 1994; Debe et al. 1999; Gillespie and Plaxco 2000; Millet et al. 2002). Notably, the energy landscape of the topomer search process itself is smooth; recent studies of the rate with which sequencedistant residue pairs are brought into proximity in unfolded cytochrome c demonstrate that inter-residue interactions (i.e., energetic roughness) do not control large-scale conformational diffusion even under native conditions (Hagen et al. 2001). The topomer search model can also be considered a limiting (albeit simple, general, and easily quantified) case of the hierarchical folding models (Rose 1979; Baldwin and Rose 1999). The diffusion-collision model, for example, stipulates that protein folding occurs via the diffusive, hierarchical assembly of more-or-less preformed elements of secondary structure (Karplus and Weaver 1979; Zhou and Karplus 1999; Myers and Oas 2001). The topomer search model, in contrast, stipulates that, except for those few, rapidly folding proteins for which Q D < 4 (see Islam et al. 2002), the sampling of local structure is orders of magnitude more rapid than the sampling of topomers. For most twostate proteins, the barrier is thus largely defined by the latter, with the sampling of local structural elements playing a much lesser role in determining relative folding rates. How can the topomer search model be improved? The strong correlation between Equation 2 and observed two-state folding rates suggests that, despite its seemingly excessive simplicity, the topomer search model captures the dominant contributor to relative barrier heights. There is, nevertheless, clearly room to improve the model s accuracy and generalizability. Here, we discuss likely future efforts in these directions. Chain-length dependence Numerous theoretical studies suggest that both the pre-exponential (via the diffusion coefficient) and the activation barrier (due to the entropic cost of the search) of folding are strong functions of chain length, N (Thirumalai 1995; Gutin et al. 1996; Zhdanov 1998; Debe et al. 1999). Most models predict that folding rates scale exponentially with N with a large, negative exponent (i.e., longer chains fold more slowly). No statistically significant length dependence is evident, however, in the experimentally observed folding rates of simple, single-domain proteins (Plaxco et al. 1998, 2000), perhaps because the effects of differing topologies overwhelm the more subtle, length-rate relationship. The topomer search model provides a convenient opportunity to account for the effects of topological variations and thus investigate the length dependence of folding independently of topology. When this is performed, a statistically significant relationship between rates and N arises, but in the counter-intuitive direction; longer proteins tend to fold more rapidly than predicted. This leads to a small, but statistically significant improvement in the relationship between log (k f ) and Q D N versus Q D alone (Fig. 4) via the equation k f = Q D J Q DN (4) in which is a negative number in the range of 0.5 to 1.0 (r over this range), J is a constant of magnitude < 1, and is a constant analogous to. Because J and are interdependent variables, it is impossible to pinpoint the value of more precisely. It is clear, however, that is negative, leading to the counterintuitive result that, all other parameters being equal, longer proteins fold more rapidly. 23

8 Makarov and Plaxco Figure 4. The empirical addition of length dependence to the topomer search model produces a small but statistically significant improvement in its predictive value. The addition also produces the counterintuitive prediction that a longer protein generally folds more rapidly than a shorter one with an equivalent number of sequence-distant native pairs. Shown here is the fit to a linearized version of Equation 4, log (k f /Q D ) log ( ) + Q D N logj, with set to 1. The correlation is relatively insensitive to the precise value of, producing a correlation coefficient of for all over the range 0.5 to 1. It is not hard to rationalize this length dependence in the context of the topomer search model. It is consistent with the generalization of the model in which the mean equilibrium constant for the ordering of native pairs is dependent on chain length log K N N (5) with an exponent, that is negative. A possible origin of this relationship (and the counterintuitive length dependence it gives rise to) is crowding effects. That is, if a sequence-distant interaction occurs, on average, once every 5 residues along the chain steric and geometric constraints may render the native topomer more difficult to achieve than if, on average, sequence-distant interactions occur only every 10 residues. Critically, the Gaussian chain model is unlikely to capture crowding correctly, as it rather poorly mimics the stiffness of an unfolded polypeptide and entirely ignores excluded volume interactions. This suggests that simulations of more realistic chains are in order if we are to verify the validity of this currently empirical correction. The mean-field approximation A second concern is that the mean-field approximation is simply that, an approximation. It is certain that the inclusion of additional parameters (beyond simply counting the number of sequence-distant native pairs) will be required in order to define the probability of achieving a given topomer more accurately. The equilibrium constant for bringing sequence-distant native pairs into proximity, for example, is at least a weak function of the chain length separating the pair from itself and from other, preordered pairs. This effect may be illustrated by studies in which the extension of solventexposed loops slows folding rates; such extension does not significantly alter Q D, but does change the accuracy of the approximation that <K> is a constant. That said, the effect of extending a loop by less than l c residues is relatively subtle; extensions of residues reduce rates by less than a factor of 4 (Ladurner and Fersht 1997; Viguera and Serrano 1997; Grantcharova et al. 2000). Only the longest reported loop-extensions (e.g., a 59-residues loop inserted in an artificially engineered, monomeric arc repressor) produce significant changes in two-state folding rates (Robinson and Sauer 1998). Native interactions A potentially more serious omission is that the topomer search model ignores all of the detailed chemical interactions that define the native state. As noted above, it is abundantly clear that the topomer search is only part of the folding barrier and the inclusion of specific, stabilizing interactions is clearly critical if we are to develop a more predictive model of folding kinetics. Recent experimental results, however, suggest that the native-like interactions occurring in the folding transition state are rather plastic (i.e., can be altered significantly without significantly altering folding rates), and thus, their effect on folding kinetics may prove difficult to model accurately (Grantcharova and Baker 2001; Nauli et al. 2001). Nevertheless, progress has already been reported on this front for the folding of the topologically simple proteins (Q D < 4), for which nativelike interactions play the greatest role in defining relative rates (Myers and Oas 2001; Islam et al. 2002). Non-two-state folding Further generalization of the model to fit non-two-state proteins may also prove difficult. The topomer search model is rooted in the observation that the folding energy landscape of two-state proteins is extremely smooth and, in the absence of energetic roughness, the connectivity-induced difficulty of the topomer search dominates relative barrier heights. In contrast, non-two-state folding necessarily implies that well-populated intermediates dominate the folding landscape, leading to deviations from single-exponential kinetics. Under these circumstances, folding kinetics could be defined by the rate of escape from these intermediate states rather than by the rate of topomer sampling (Sosnick et al. 24 Protein Science, vol. 12

9 The topomer search model 1994; Debe et al. 1999; Millet et al. 2002). As the free energy of these states are defined by specific chemical interactions, predicting the kinetics with which they are escaped will probably not prove as simple as describing the kinetics of the topomer search. Conclusions The topomer search model stipulates that the random, diffusive process by which an unfolded polypeptide achieves its native topomer dominates the relative folding rates of two-state proteins. This native topomer is defined as the set of conformations in which every pair of residues in contact in the native state are in sufficient proximity that they can collide and form native interactions more rapidly than the relatively slow rate-limiting step in folding. Simulations of the diffusion of an inert, Gaussian chain indicate that the probability of such an occurrence relates simply to the number of sequence-distant residue pairs required to define the native topomer. Consistent with this result, the experimentally observed folding rates of two-state proteins correlate strongly with Q D, the number of sequence-distant residue pairs in contact in the native state. The predictive value of this result supports the argument that the topomer search process is the dominant contributor to the relative barrier heights of two-state protein folding reactions. Acknowledgments The quantitative topomer search model was originally developed in collaboration with our colleagues Horia Metiu and Craig Keller and was motivated in part by the pioneering work of Derek Debe and William Goddard. The authors would also like to acknowledge numerous informative discussions with David Baker, Buzz Baldwin, Hue Sun Chan, Ken Dill, Chris Dobson, Carl Frieden, Blake Gillespie, Michael Gross, Jim Hu, Bob Matthews, Vijay Pande, Rohit Pappu, George Rose, David Shortle, and Tobin Sosnick. References Alm, E. and Baker, D. 1999a. Matching theory and experiment in protein folding. Curr. Opin. Struct. Biol. 9: b. Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures. Proc. Natl. Acad. Sci. 96: Baldwin, R.E Protein folding Making a network of hydrophobic clusters. Science 295: Baldwin, R.L. and Rose, G.D Is protein folding hierarchic? II. Folding intermediates and transition states. Trends. Biochem. Sci. 24: Bergasa-Caceres, F., Ronneberg, T.A., and Rabitz, H.A Sequential collapse model for protein folding pathways. J. Phys. Chem. B 103: Bieri, O., Wirz, J., Hellrung, B., Schutkowski, M., Drewello, M., and Kiefhaber, T The speed limit for protein folding measured by triplet triplet energy transfer. Proc. Natl. Acad. Sci. 96: Bryngelson, J.D. and Wolynes, P.G Spin-glasses and the statisticalmechanics of protein folding. Proc. Natl. Acad. Sci. 84: Bryngelson, J.D., Onuchic, J.N., Socci, N.D., and Wolynes, P.G Funnels, pathways, and the energy landscape of protein-folding a synthesis. Prot. Sruct. Func. Gen. 21: Camarero, J.A., Fushman, D., Sato, S., Giriat, I., Cowburn, D., Raleigh, D.P., and Muir, T.W Rescuing a destabilized protein fold through backbone cyclization. J. Mol. Biol. 308: Chan, H.S. and Dill, K.A The effects of internal constraints on the configurations of chain molecules. J. Chem. Phys. 92: Choy, W.Y. and Forman-Kay, J Calculation of ensembles of structures representing the unfolded state of an SH3 domain. J. Mol. Biol. 308: Clarke, J., Cota, E., Fowler, S.B., and Hamill, S.J Folding studies of immunoglobulin-like -sandwich proteins suggest that they share a common folding pathway. Structure 7: Clementi, C., Nymeyer, H., and Onuchic, J.N Topological and energetic factors: What determines the structural details of the transition state ensemble and en-route intermediates for protein folding? An investigation for small globular proteins. J. Mol. Biol. 298: Debe, D.A. and Goddard, W.A First principles prediction of protein folding rates. J. Mol. Biol. 294: Debe, D.A., Carlson, M.J., and Goddard, W.A The topomer-sampling model of protein folding. Proc. Natl. Acad. Sci. 96: Dill, K.A. and Chan, H.S From Levinthal to pathways to funnels. Nat. Struc. Biol. 4: Dinner, A.R., Sali, A., Smith, L.J., Dobson, C.M., and Karplus, M Understanding protein folding via free-energy surfaces from theory and experiment. Trend. Bioch. Sci. 25: Dobson, C.M., Sali, A., and Karplus, M Protein folding: A perspective from theory and experiment. Ang. Chem. Int. Ed. 37: Eaton, W.A., Munoz, V., Hagen, S.J., Jas, G.S., Lapidus, L.J., Henry, E.R., and Hofrichter, J Fast kinetics and mechanisms in protein folding. Annu. Rev. Biomol. Struct. 29: Fersht, A.R Nucleation mechanisms in protein folding. Curr. Opin. Struct. Biol. 7: Transition-state structure as a unifying basis in protein-folding mechanisms: Contact order, chain topology, stability, and the extended nucleus mechanism. Proc. Natl. Acad. Sci. 97: Flanagan, J.M., Kataoka, M., Shortle, D., and Engelman, D.M Truncated staphylococcal nuclease is compact but disordered. Proc. Natl. Acad. Sci. 89: Flory, P.J Theory of elastic mechanisms in fibrous proteins. J. Am. Chem. Soc. 78: Galzitskaya, O.V. and Finkelstein, A.V A theoretical search for folding/ unfolding nuclei in three-dimensional protein structures. Proc. Natl. Acad. Sci. 96: Gillespie, B. and Plaxco, K.W Non-glassy kinetics in the folding of a simple, single domain protein. Proc. Natl. Acad. Sci. 97: Grantcharova, V.P. and Baker, D Circularization changes the folding transition state of the src SH3 domain. J. Mol. Biol. 306: Grantcharova, V.P., Riddle, D.S., and Baker, D Long-range order in the src SH3 folding transition state. Proc. Natl. Acad. Sci. 97: Grantcharova, V.P., Alm. E.J., Baker, D., and Horowitz, A.L Mechanisms of protein folding. Curr. Opin. Struct. Biol. 11: Gromiha, M.M. and Selvaraj, S Comparison between long-range interactions and contact order in determining the folding rate of two-state proteins: Application of long-range order to folding rate prediction. J. Mol. Biol. 310: Gross, M Linguistic analysis of protein folding. FEBS Lett. 390: Guijarro, J.I., Morton, C.J., Plaxco, K.W., Campbell, I.D., and Dobson, C.M Folding kinetics of the SH3 domain of PI3 by real-time NMR and optical techniques. J. Mol. Biol. 275: Guo, Z.Y., Brooks, C.L., and Boczko, E.M Exploring the folding free energy surface of a three-helix bundle protein. Proc. Natl. Acad. Sci. 94: Gutin, A.M. and Shakhnovich, E.I Statistical mechanics of polymers with distance constraints. J. Chem. Phys. 100: Gutin, A.M., Abkevich, V.I., and Shakhnovich, E.I Chain length scaling of protein folding time. Phys. Rev. Lett. 77: Hagen, S.J., Hofrichter, J., Szabo, A., and Eaton, W.A Diffusion-limited contact formation in unfolded cytochrome c: Estimating the maximum rate of protein folding. Proc. Natl. Acad. Sci. 93: Hagen, S.J., Hofrichter, J., and Eaton, W.A Rate of intrachain diffusion of unfolded cytochrome c. J. Phys. Chem. B 101: Hagen, S.J., Carswell, C.W., and Sjolander, E.M Rate of intrachain contact formation in an unfolded protein: Temperature and denaturant effects. J. Mol. Biol. 305: Hodsdon, M.E. and Frieden, C Intestinal fatty acid binding protein: The folding mechanism as determined by NMR studies. Biochemistry 40:

10 Makarov and Plaxco Islam, S.A., Karplus, M., and Weaver, D.L Application of the diffusioncollision model to the folding of three-helix bundle proteins. J. Mol. Biol. 318: Ivankov, D.N. and Finkelstein, A.V Theoretical study of a landscape of protein folding-unfolding pathways. Folding rates at midtransition. Biochemistry 40: Jackson, S.E How do small single domain proteins fold? Fold. Des. 3: R81 R91. Jackson, S.E. and Fersht, A.R The folding of chymotrypsin inhibitor Evidence for a two-state transition. Biochemistry 30: Jacobson, H. and Stockmayer, W.H Intramolecular reaction in polycondensations. I. The theory of linear systems. J. Chem. Phys. 18: Karplus, M. and Weaver, D.L Diffusion-collision model or protein folding. Biopolymers 18: Kaya, H. and Chan, H.S Towards a consistent modeling of protein thermodynamic and kinetic cooperativity: How applicable is the transition state picture to folding and unfolding? J. Mol. Biol. 315: Klein-Seetharaman, J., Oikawa, M., Grimshaw, S.B., Wirmer, J., Duchardt, E., Ueda, T., Imoto, T., Smith, L.J., Dobson, C.M., and Schwalbe, H Long-range interactions within a nonnative protein. Science 295: Kolinski, A., Galazka, W., and Skolnick, J Monte Carlo studies of the thermodynamics and kinetics of reduced protein models: Application to small helical,, and / proteins. J. Chem. Phys. 108: Ladurner, A.G. and Fersht, A.R Glutamine, alanine, or glycine repeats inserted into the loop of a protein have minimal effects on stability and folding rates. J. Mol. Biol. 273: Ladurner, A.G., Itzhaki, L.S., Gay, G.D., and Fersht, A.R Complementation of peptide fragments of the single domain protein chymotrypsin inhibitor 2. J. Mol. Biol. 273: Lapidus, L.J., Eaton, W.A., and Hofrichter, J Measuring the rate of intramolecular contact formation in polypeptides. Proc. Natl. Acad. Sci. 97: Lindberg, M.O., Tangrot, J., Otzen, D.E., Dolgikh, D.A., Finkelstein, A.V., and Oliveberg, M Folding of circular permutants with decreased contact order: General trend balanced by protein stability. J. Mol. Biol. 314: Makarov, D.E. and Metiu, H A model for the kinetics of protein folding: Kinetic Monte Carlo simulations and analytical results. J. Chem. Phys. 116: Makarov, D.E., Keller, C.A., Plaxco, K.W., and Metiu, H How the folding rate constant of simple-single domain proteins depends on number of native contacts. Proc. Natl. Acad. Sci. 99: Mayor, U., Johnson, C.M., Daggett, V., and Fersht, A.R Protein folding and unfolding in microseconds to nanoseconds by experiment and simulation. Proc. Natl. Acad. Sci. 97: Miller, E.J., Fischer, K.F., and Marqusee, S Experimental evaluation of topological parameters determining protein-folding rates. Proc. Natl. Acad. Sci. 99: Millet, I.S., Townsley, L., Chiti, F., Doniach, S., and Plaxco, K.W Equilibrium collapse and the kinetic foldability of proteins. Biochemistry 41: Mirny, L. and Shakhnovich, E Protein folding theory: From lattice to all-atom models. Annu. Rev. Biophys. Biomol. Struc. 30: Muñoz, V. and Eaton, W.A A simple model for calculating the kinetics of protein folding from three-dimensional structures. Proc. Natl. Acad. Sci. 96: Muñoz, V., Thompson, P.A., Hofrichter, J., and Eaton, W.A Folding dynamics and mechanism of -hairpin formation. Nature 390: Myers, J.K. and Oas, T.G Preorganized secondary structure as an important determinant of fast protein folding. Nat. Struct. Biol. 8: Nauli, S., Kuhlman, B., and Baker, D Computer-based redesign of a protein folding pathway. Nat. Struct. Biol. 8: Onuchic, J.N., Luthey-Schulten, Z., and Wolynes, P.G Theory of protein folding: The energy landscape perspective. Annu. Rev. Phys. Chem. 48: Otzen, D.E. and Fersht, A.R Folding of circular and permuted chymotrypsin inhibitor 2: Retention of the folding nucleus. Biochemistry 37: Penkett, C.J., Redfield, C., Jones, J.A., Dodd, I., Hubbard, J., Smith, R.A.G., Smith, L.J., and Dobson, C.M Structural and dynamical characterization of a biologically active unfolded fibronectin-binding protein from Staphylococus aureus. Biochemistry 37: Plaxco, K.W. and Gross, M Unfolded, yes, but random? Never! Nat. Struct. Biol. 8: Plaxco, K.W., Simons, K.T., and Baker, D Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 277: Plaxco, K.W., Millett, I.S., Segel, D.J., Doniach, S., and Baker, D Polypeptide chain collapse can occur concomitantly with the rate limiting stepin protein folding. Nat. Struct. Biol. 6: Plaxco, K.W., Simons, K.T., Ruczinski, I., and Baker, D Topology, stability, sequence, and length: defining the determinants of two-state protein folding kinetics. Biochemistry 37: Plotkin, S.S., Wang, J. and Wolynes, P.G Correlated energy landscape model for finite, random heteropolymers. Phys. Rev. E 53: Poland, D.C. and Scheraga, H.A Statistical mechanics of noncovalent bonds in polyamino acids. 8. Covalent loops in proteins. Biopolymers 3: Portman, J.J., Takada, S., and Wolynes, P.G Microscopic theory of protein folding rates. II. Local reaction coordinates and chain dynamics. J. Chem. Phys. 114: Robinson, C.R. and Sauer, R.T Optimizing the stability of single-chain proteins by linker length and composition mutagenasis. Proc. Natl. Acad. Sci. 95: Rose, G.D Hierarchic organization of domains in globular-proteins. J. Mol. Biol. 134: Schwalbe, H., Fiebig, J.M., Buck, M., Jones, J.A., Grimshaw, S.B., Spencer, A., Glaser, S.J., Smith, L.J., and Dobson, C.M Structural and dynamical properties of a denatured protein. Heteronuclear 3D NMR experiments and theoretical simulations of lysozyme in 8 M urea. Biochemistry 36: Shea, J.E., Onuchic, J.N., and Brooks, C.L Exploring the origins of topological frustration: Design of a minimally frustrated model of fragment B of protein A. Proc. Natl. Acad. Sci. 96: Sheinerman, F.B. and Brooks, C.L Molecular picture of folding of a small / protein. Proc. Natl. Acad. Sci. 95: Shoemaker, B.A. and Wolynes, P.G Exploring structures in protein folding funnels with free energy functionals: The denatured ensemble. J. Mol. Biol. 287: Shortle, D. and Ackerman, M.S Persistence of native-like topology in a denatured protein in 8 M urea. Science 293: Socci, N.D., Onuchic, J.N., and Wolynes, P.G Protein folding mechanisms and the multidimensional folding funnel. Prot. Struc. Func. Gen. 32: Sosnick, T.R., Mayne, L., Hiller, R., and Englander, S.W The barriers in protein folding. Nat. Struct. Biol. 1: Sosnick, T.R., Mayne, L., and Englander, S.W Molecular collapse: the rate-limiting step in two-state cytochrome c folding. Proteins 24: Szabo, A., Schulten, K., and Schulten, Z st passage time approach to diffusion controlled reactions. J. Chem. Physics 72: Thirumalai, D From minimal models to real proteins: Time scales for protein folding. J. Physique I 5: Thompson, P.A., Eaton, W.A., and Hofrichter, J Laser temperature jump study of the helix reversible arrow coil kinetics of an alanine peptide interpreted with a kinetic zipper model. Biochemistry 36: Van Nuland, N.A.J., Chiti, F., Taddei, N., Raugei, G., Ramponi, G., and Dobson, C.M Slow folding of muscle acylphosphatase in the absence of intermediates. J. Mol. Biol. 283: Viguera, A.R. and Serrano, L Loop length, intramolecular diffusion and protein folding. Nat. Struct. Biol. 4: Wittung-Stafshede, P., Lee, J.C., Winkler, J.R., and Gray, H.B Cytochrome b562 folding triggered by electron transfer: Approaching the speed limit for formation of a four-helix-bundle protein. Proc. Natl. Acad. Sci. 96: Zagrovic, B., Snow, C., Khaliq, S., Shirts, M., and Pande, V Native-like mean structure in the unfolded ensemble of small proteins. J. Mol. Biol. (in press) Zhdanaov, V.P Folding time of ideal -sheets vs. chain length. Europhys. Lett. 42: Zhou, H.Y. and Zhou, Y.Q Folding rate prediction using total contact distance. Biophys. J. 82: Zhou, Y.Q. and Karplus, M Interpreting the folding kinetics of helical proteins. Nature 401: Protein Science, vol. 12

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001 Modeling two-state cooperativity in protein folding Ke Fan, Jun Wang, and Wei Wang arxiv:cond-mat/0103385v1 [cond-mat.soft] 19 Mar 2001 National Laboratory of Solid State Microstructure and Department

More information

To understand pathways of protein folding, experimentalists

To understand pathways of protein folding, experimentalists Transition-state structure as a unifying basis in protein-folding mechanisms: Contact order, chain topology, stability, and the extended nucleus mechanism Alan R. Fersht* Cambridge University Chemical

More information

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed. Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 16 Nov 2002

arxiv:cond-mat/ v1 [cond-mat.soft] 16 Nov 2002 Dependence of folding rates on protein length Mai Suan Li 1, D. K. Klimov 2 and D. Thirumalai 2 1 Institute of Physics, Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland 2 Institute

More information

Identifying the Protein Folding Nucleus Using Molecular Dynamics

Identifying the Protein Folding Nucleus Using Molecular Dynamics doi:10.1006/jmbi.1999.3534 available online at http://www.idealibrary.com on J. Mol. Biol. (2000) 296, 1183±1188 COMMUNICATION Identifying the Protein Folding Nucleus Using Molecular Dynamics Nikolay V.

More information

Folding of small proteins using a single continuous potential

Folding of small proteins using a single continuous potential JOURNAL OF CHEMICAL PHYSICS VOLUME 120, NUMBER 17 1 MAY 2004 Folding of small proteins using a single continuous potential Seung-Yeon Kim School of Computational Sciences, Korea Institute for Advanced

More information

Protein Folding. I. Characteristics of proteins. C α

Protein Folding. I. Characteristics of proteins. C α I. Characteristics of proteins Protein Folding 1. Proteins are one of the most important molecules of life. They perform numerous functions, from storing oxygen in tissues or transporting it in a blood

More information

A critical assessment of the topomer search model of protein folding using a continuum explicit-chain model with extensive conformational sampling

A critical assessment of the topomer search model of protein folding using a continuum explicit-chain model with extensive conformational sampling A critical assessment of the topomer search model of protein folding using a continuum explicit-chain model with extensive conformational sampling STEFAN WALLIN AND HUE SUN CHAN Department of Biochemistry

More information

Protein Folding In Vitro*

Protein Folding In Vitro* Protein Folding In Vitro* Biochemistry 412 February 29, 2008 [*Note: includes computational (in silico) studies] Fersht & Daggett (2002) Cell 108, 573. Some folding-related facts about proteins: Many small,

More information

Master equation approach to finding the rate-limiting steps in biopolymer folding

Master equation approach to finding the rate-limiting steps in biopolymer folding JOURNAL OF CHEMICAL PHYSICS VOLUME 118, NUMBER 7 15 FEBRUARY 2003 Master equation approach to finding the rate-limiting steps in biopolymer folding Wenbing Zhang and Shi-Jie Chen a) Department of Physics

More information

The kinetics of protein folding is often remarkably simple. For

The kinetics of protein folding is often remarkably simple. For Fast protein folding kinetics Jack Schonbrun* and Ken A. Dill *Graduate Group in Biophysics and Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94118 Communicated by

More information

A simple model for calculating the kinetics of protein folding from three-dimensional structures

A simple model for calculating the kinetics of protein folding from three-dimensional structures Proc. Natl. Acad. Sci. USA Vol. 96, pp. 11311 11316, September 1999 Biophysics, Chemistry A simple model for calculating the kinetics of protein folding from three-dimensional structures VICTOR MUÑOZ*

More information

PROTEIN FOLDING THEORY: From Lattice to All-Atom Models

PROTEIN FOLDING THEORY: From Lattice to All-Atom Models Annu. Rev. Biophys. Biomol. Struct. 2001. 30:361 96 Copyright c 2001 by Annual Reviews. All rights reserved PROTEIN FOLDING THEORY: From Lattice to All-Atom Models Leonid Mirny and Eugene Shakhnovich Department

More information

Modeling protein folding: the beauty and power of simplicity Eugene I Shakhnovich

Modeling protein folding: the beauty and power of simplicity Eugene I Shakhnovich R50 Review Modeling protein folding: the beauty and power of simplicity Eugene I Shakhnovich It is argued that simplified models capture key features of protein stability and folding, whereas more detailed

More information

Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures

Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures Proc. Natl. Acad. Sci. USA Vol. 96, pp. 11305 11310, September 1999 Biophysics Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures E. ALM AND D. BAKER* Department

More information

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday. Short Announcements 1 st Quiz today: 15 minutes Homework 3: Due next Wednesday. Next Lecture, on Visualizing Molecular Dynamics (VMD) by Klaus Schulten Today s Lecture: Protein Folding, Misfolding, Aggregation

More information

Temperature dependence of reactions with multiple pathways

Temperature dependence of reactions with multiple pathways PCCP Temperature dependence of reactions with multiple pathways Muhammad H. Zaman, ac Tobin R. Sosnick bc and R. Stephen Berry* ad a Department of Chemistry, The University of Chicago, Chicago, IL 60637,

More information

Scattered Hammond plots reveal second level of site-specific information in protein folding: ( )

Scattered Hammond plots reveal second level of site-specific information in protein folding: ( ) Scattered Hammond plots reveal second level of site-specific information in protein folding: ( ) Linda Hedberg and Mikael Oliveberg* Department of Biochemistry, Umeå University, S-901 87 Umeå, Sweden Edited

More information

Effective stochastic dynamics on a protein folding energy landscape

Effective stochastic dynamics on a protein folding energy landscape THE JOURNAL OF CHEMICAL PHYSICS 125, 054910 2006 Effective stochastic dynamics on a protein folding energy landscape Sichun Yang, a José N. Onuchic, b and Herbert Levine c Center for Theoretical Biological

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 5 May 1998

arxiv:cond-mat/ v1 [cond-mat.soft] 5 May 1998 Linking Rates of Folding in Lattice Models of Proteins with Underlying Thermodynamic Characteristics arxiv:cond-mat/9805061v1 [cond-mat.soft] 5 May 1998 D.K.Klimov and D.Thirumalai Institute for Physical

More information

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding? The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation By Jun Shimada and Eugine Shaknovich Bill Hawse Dr. Bahar Elisa Sandvik and Mehrdad Safavian Outline Background on protein

More information

Effect of Sequences on the Shape of Protein Energy Landscapes Yue Li Department of Computer Science Florida State University Tallahassee, FL 32306

Effect of Sequences on the Shape of Protein Energy Landscapes Yue Li Department of Computer Science Florida State University Tallahassee, FL 32306 Effect of Sequences on the Shape of Protein Energy Landscapes Yue Li Department of Computer Science Florida State University Tallahassee, FL 32306 yli@cs.fsu.edu Gary Tyson Department of Computer Science

More information

Protein Folding Prof. Eugene Shakhnovich

Protein Folding Prof. Eugene Shakhnovich Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)

More information

Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of -Strand Motifs

Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of -Strand Motifs Biophysical Journal Volume 83 August 2002 819 835 819 Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of -Strand Motifs Hyunbum Jang,* Carol K. Hall,* and Yaoqi Zhou *Department of

More information

Stretching lattice models of protein folding

Stretching lattice models of protein folding Proc. Natl. Acad. Sci. USA Vol. 96, pp. 2031 2035, March 1999 Biophysics Stretching lattice models of protein folding NICHOLAS D. SOCCI,JOSÉ NELSON ONUCHIC**, AND PETER G. WOLYNES Bell Laboratories, Lucent

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

Local Interactions Dominate Folding in a Simple Protein Model

Local Interactions Dominate Folding in a Simple Protein Model J. Mol. Biol. (1996) 259, 988 994 Local Interactions Dominate Folding in a Simple Protein Model Ron Unger 1,2 * and John Moult 2 1 Department of Life Sciences Bar-Ilan University Ramat-Gan, 52900, Israel

More information

Two-State Folding over a Weak Free-Energy Barrier

Two-State Folding over a Weak Free-Energy Barrier 1 arxiv:q-bio/0312046v1 [q-bio.bm] 30 Dec 2003 Two-State Folding over a Weak Free-Energy Barrier LU TP 03-07 April 28, 2003 Giorgio Favrin, Anders Irbäck, Björn Samuelsson and Stefan Wallin Complex Systems

More information

Clustering of low-energy conformations near the native structures of small proteins

Clustering of low-energy conformations near the native structures of small proteins Proc. Natl. Acad. Sci. USA Vol. 95, pp. 11158 11162, September 1998 Biophysics Clustering of low-energy conformations near the native structures of small proteins DAVID SHORTLE*, KIM T. SIMONS, AND DAVID

More information

Pathways for protein folding: is a new view needed?

Pathways for protein folding: is a new view needed? Pathways for protein folding: is a new view needed? Vijay S Pande 1, Alexander Yu Grosberg 2, Toyoichi Tanaka 2, and Daniel S Rokhsar 1;3 Theoretical studies using simplified models for proteins have shed

More information

Folding pathway of a lattice model for protein folding

Folding pathway of a lattice model for protein folding Folding pathway of a lattice model for protein folding Vijay S. Pande 1 and Daniel S. Rokhsar 1;2 The folding of a protein-like heteropolymer is studied by direct simulation of a lattice model that folds

More information

It is not yet possible to simulate the formation of proteins

It is not yet possible to simulate the formation of proteins Three-helix-bundle protein in a Ramachandran model Anders Irbäck*, Fredrik Sjunnesson, and Stefan Wallin Complex Systems Division, Department of Theoretical Physics, Lund University, Sölvegatan 14A, S-223

More information

Energetics and Thermodynamics

Energetics and Thermodynamics DNA/Protein structure function analysis and prediction Protein Folding and energetics: Introduction to folding Folding and flexibility (Ch. 6) Energetics and Thermodynamics 1 Active protein conformation

More information

1 of 31. Nucleation and the transition state of the SH3 domain. Isaac A. Hubner, Katherine A. Edmonds, and Eugene I. Shakhnovich *

1 of 31. Nucleation and the transition state of the SH3 domain. Isaac A. Hubner, Katherine A. Edmonds, and Eugene I. Shakhnovich * 1 of 31 Nucleation and the transition state of the SH3 domain. Isaac A. Hubner, Katherine A. Edmonds, and Eugene I. Shakhnovich * Department of Chemistry and Chemical Biology Harvard University 12 Oxford

More information

Intermediates and the folding of proteins L and G

Intermediates and the folding of proteins L and G Intermediates and the folding of proteins L and G SCOTT BROWN 1 AND TERESA HEAD-GORDON Department of Bioengineering, University of California (UC), Berkeley, Berkeley, California 94720-1762, USA (RECEIVED

More information

Elucidation of the RNA-folding mechanism at the level of both

Elucidation of the RNA-folding mechanism at the level of both RNA hairpin-folding kinetics Wenbing Zhang and Shi-Jie Chen* Department of Physics and Astronomy and Department of Biochemistry, University of Missouri, Columbia, MO 65211 Edited by Peter G. Wolynes, University

More information

PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE

PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE O.B. PTITSYN National Cancer Institute, NIH, Laboratory of Experimental & Computational Biology, Molecular

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2004 How do we go from an unfolded polypeptide chain to a compact folded protein? (Folding of thioredoxin, F. Richards) Structure - Function

More information

arxiv:chem-ph/ v1 11 Nov 1994

arxiv:chem-ph/ v1 11 Nov 1994 chem-ph/9411008 Funnels, Pathways and the Energy Landscape of Protein Folding: A Synthesis arxiv:chem-ph/9411008v1 11 Nov 1994 Joseph D. Bryngelson, Physical Sciences Laboratory, Division of Computer Research

More information

Lecture 11: Protein Folding & Stability

Lecture 11: Protein Folding & Stability Structure - Function Protein Folding: What we know Lecture 11: Protein Folding & Stability 1). Amino acid sequence dictates structure. 2). The native structure represents the lowest energy state for a

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2003 Structure - Function Protein Folding: What we know 1). Amino acid sequence dictates structure. 2). The native structure represents

More information

arxiv: v1 [cond-mat.soft] 22 Oct 2007

arxiv: v1 [cond-mat.soft] 22 Oct 2007 Conformational Transitions of Heteropolymers arxiv:0710.4095v1 [cond-mat.soft] 22 Oct 2007 Michael Bachmann and Wolfhard Janke Institut für Theoretische Physik, Universität Leipzig, Augustusplatz 10/11,

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

Determination of Barrier Heights and Prefactors from Protein Folding Rate Data

Determination of Barrier Heights and Prefactors from Protein Folding Rate Data 3762 Biophysical Journal Volume 88 June 2005 3762 3769 Determination of Barrier Heights and Prefactors from Protein Folding Rate Data S. S. Plotkin Department of Physics and Astronomy, University of British

More information

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model JOURNAL OF CHEMICAL PHYSICS VOLUME 111, NUMBER 8 22 AUGUST 1999 Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model Benjamin Vekhter and R. Stephen

More information

The role of secondary structure in protein structure selection

The role of secondary structure in protein structure selection Eur. Phys. J. E 32, 103 107 (2010) DOI 10.1140/epje/i2010-10591-5 Regular Article THE EUROPEAN PHYSICAL JOURNAL E The role of secondary structure in protein structure selection Yong-Yun Ji 1,a and You-Quan

More information

Nucleation and the Transition State of the SH3 Domain

Nucleation and the Transition State of the SH3 Domain doi:10.1016/j.jmb.2005.03.050 J. Mol. Biol. (2005) 349, 424 434 Nucleation and the Transition State of the SH3 Domain Isaac A. Hubner 1, Katherine A. Edmonds 2 and Eugene I. Shakhnovich 1 * 1 Department

More information

Toward an outline of the topography of a realistic proteinfolding funnel

Toward an outline of the topography of a realistic proteinfolding funnel Proc. Natl. Acad. Sci. USA Vol. 92, pp. 3626-3630, April 1995 Biophysics Toward an outline of the topography of a realistic proteinfolding funnel J. N. ONUCHIC*, P. G. WOLYNESt, Z. LUTHEY-SCHULTENt, AND

More information

Residual Charge Interactions in Unfolded Staphylococcal Nuclease Can Be Explained by the Gaussian-Chain Model

Residual Charge Interactions in Unfolded Staphylococcal Nuclease Can Be Explained by the Gaussian-Chain Model Biophysical Journal Volume 83 December 2002 2981 2986 2981 Residual Charge Interactions in Unfolded Staphylococcal Nuclease Can Be Explained by the Gaussian-Chain Model Huan-Xiang Zhou Department of Physics,

More information

arxiv:cond-mat/ v1 2 Feb 94

arxiv:cond-mat/ v1 2 Feb 94 cond-mat/9402010 Properties and Origins of Protein Secondary Structure Nicholas D. Socci (1), William S. Bialek (2), and José Nelson Onuchic (1) (1) Department of Physics, University of California at San

More information

Nature of the transition state ensemble for protein folding

Nature of the transition state ensemble for protein folding Nature of the transition state ensemble for protein folding N. H. Putnam, V.S. Pande, D.S. Rokhsar he ability of a protein to fold rapidly to its unique native state from any of its possible unfolded conformations

More information

Long Range Moves for High Density Polymer Simulations

Long Range Moves for High Density Polymer Simulations arxiv:cond-mat/9610116v1 [cond-mat.soft] 15 Oct 1996 Long Range Moves for High Density Polymer Simulations J.M.Deutsch University of California, Santa Cruz, U.S.A. Abstract Monte Carlo simulations of proteins

More information

The folding mechanism of larger model proteins: Role of native structure

The folding mechanism of larger model proteins: Role of native structure Proc. Natl. Acad. Sci. USA Vol. 93, pp. 8356-8361, August 1996 Biophysics The folding mechanism of larger model proteins: Role of native structure (protein folding/lattice model/monte Carlo/secondary structure/folding

More information

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila 1 Guessing the upper bound free-energy difference between native-like structures Jorge A. Vila IMASL-CONICET, Universidad Nacional de San Luis, Ejército de Los Andes 950, 5700- San Luis, Argentina Use

More information

Lecture 34 Protein Unfolding Thermodynamics

Lecture 34 Protein Unfolding Thermodynamics Physical Principles in Biology Biology 3550 Fall 2018 Lecture 34 Protein Unfolding Thermodynamics Wednesday, 21 November c David P. Goldenberg University of Utah goldenberg@biology.utah.edu Clicker Question

More information

arxiv:q-bio/ v1 [q-bio.bm] 30 May 2006

arxiv:q-bio/ v1 [q-bio.bm] 30 May 2006 Transition States in Protein Folding Kinetics: The Structural Interpretation of Φ-values arxiv:q-bio/0605048v1 [q-bio.bm] 30 May 2006 Abstract Thomas R. Weikl 1 and Ken A. Dill 2 1 Max Planck Institute

More information

Visualizing folding of proteins (1 3) and RNA (2) in terms of

Visualizing folding of proteins (1 3) and RNA (2) in terms of Can energy landscape roughness of proteins and RNA be measured by using mechanical unfolding experiments? Changbong Hyeon and D. Thirumalai Chemical Physics Program, Institute for Physical Science and

More information

Universality and diversity of folding mechanics for three-helix bundle proteins

Universality and diversity of folding mechanics for three-helix bundle proteins Classification: Biological Sciences: Biophysics Universality and diversity of folding mechanics for three-helix bundle proteins Jae Shick Yang*, Stefan Wallin*, and Eugene I. Shakhnovich Department of

More information

Relationship between the Native-State Hydrogen Exchange and Folding Pathways of a Four-Helix Bundle Protein

Relationship between the Native-State Hydrogen Exchange and Folding Pathways of a Four-Helix Bundle Protein 7998 Biochemistry 2002, 41, 7998-8003 Relationship between the Native-State Hydrogen Exchange and Folding Pathways of a Four-Helix Bundle Protein Ruiai Chu, Wuhong Pei, Jiro Takei, and Yawen Bai* Laboratory

More information

Simulating disorder order transitions in molecular recognition of unstructured proteins: Where folding meets binding

Simulating disorder order transitions in molecular recognition of unstructured proteins: Where folding meets binding Simulating disorder order transitions in molecular recognition of unstructured proteins: Where folding meets binding Gennady M. Verkhivker*, Djamal Bouzida, Daniel K. Gehlhaar, Paul A. Rejto, Stephan T.

More information

Does Native State Topology Determine the RNA Folding Mechanism?

Does Native State Topology Determine the RNA Folding Mechanism? doi:10.1016/j.jmb.2004.02.024 J. Mol. Biol. (2004) 337, 789 797 Does Native State Topology Determine the RNA Folding Mechanism? Eric J. Sorin 1, Bradley J. Nakatani 1, Young Min Rhee 1 Guha Jayachandran

More information

CHRIS J. BOND*, KAM-BO WONG*, JANE CLARKE, ALAN R. FERSHT, AND VALERIE DAGGETT* METHODS

CHRIS J. BOND*, KAM-BO WONG*, JANE CLARKE, ALAN R. FERSHT, AND VALERIE DAGGETT* METHODS Proc. Natl. Acad. Sci. USA Vol. 94, pp. 13409 13413, December 1997 Biochemistry Characterization of residual structure in the thermally denatured state of barnase by simulation and experiment: Description

More information

PHYSICAL REVIEW LETTERS

PHYSICAL REVIEW LETTERS PHYSICAL REVIEW LETTERS VOLUME 86 28 MAY 21 NUMBER 22 Mathematical Analysis of Coupled Parallel Simulations Michael R. Shirts and Vijay S. Pande Department of Chemistry, Stanford University, Stanford,

More information

Quiz 2 Morphology of Complex Materials

Quiz 2 Morphology of Complex Materials 071003 Quiz 2 Morphology of Complex Materials 1) Explain the following terms: (for states comment on biological activity and relative size of the structure) a) Native State b) Unfolded State c) Denatured

More information

Computer simulations of protein folding with a small number of distance restraints

Computer simulations of protein folding with a small number of distance restraints Vol. 49 No. 3/2002 683 692 QUARTERLY Computer simulations of protein folding with a small number of distance restraints Andrzej Sikorski 1, Andrzej Kolinski 1,2 and Jeffrey Skolnick 2 1 Department of Chemistry,

More information

Cecilia Clementi s research group.

Cecilia Clementi s research group. Cecilia Clementi s research group http://leonardo.rice.edu/~cecilia/research/ Proteins don t have a folding problem it s we humans that do! Cartoons by Larry Gonick In principle, the laws of physics completely

More information

It is now well established that proteins are minimally frustrated

It is now well established that proteins are minimally frustrated Domain swapping is a consequence of minimal frustration Sichun Yang*, Samuel S. Cho*, Yaakov Levy*, Margaret S. Cheung*, Herbert Levine*, Peter G. Wolynes*, and José N. Onuchic* *Center for Theoretical

More information

Research Paper 577. Correspondence: Nikolay V Dokholyan Key words: Go model, molecular dynamics, protein folding

Research Paper 577. Correspondence: Nikolay V Dokholyan   Key words: Go model, molecular dynamics, protein folding Research Paper 577 Discrete molecular dynamics studies of the folding of a protein-like model ikolay V Dokholyan, Sergey V Buldyrev, H Eugene Stanley and Eugene I Shakhnovich Background: Many attempts

More information

Computer simulation of polypeptides in a confinement

Computer simulation of polypeptides in a confinement J Mol Model (27) 13:327 333 DOI 1.17/s894-6-147-6 ORIGINAL PAPER Computer simulation of polypeptides in a confinement Andrzej Sikorski & Piotr Romiszowski Received: 3 November 25 / Accepted: 27 June 26

More information

arxiv:q-bio/ v1 [q-bio.bm] 11 Jan 2004

arxiv:q-bio/ v1 [q-bio.bm] 11 Jan 2004 Cooperativity and Contact Order in Protein Folding Marek Cieplak Institute of Physics, Polish Academy of Sciences, Al. Lotników 32/46, 02-668 Warsaw, Poland arxiv:q-bio/0401017v1 [q-bio.bm] 11 Jan 2004

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University

Thermodynamics. Entropy and its Applications. Lecture 11. NC State University Thermodynamics Entropy and its Applications Lecture 11 NC State University System and surroundings Up to this point we have considered the system, but we have not concerned ourselves with the relationship

More information

Reliable Protein Folding on Complex Energy Landscapes: The Free Energy Reaction Path

Reliable Protein Folding on Complex Energy Landscapes: The Free Energy Reaction Path 2692 Biophysical Journal Volume 95 September 2008 2692 2701 Reliable Protein Folding on Complex Energy Landscapes: The Free Energy Reaction Path Gregg Lois, Jerzy Blawzdziewicz, and Corey S. O Hern Department

More information

Protein folding is a fundamental problem in modern structural

Protein folding is a fundamental problem in modern structural Protein folding pathways from replica exchange simulations and a kinetic network model Michael Andrec, Anthony K. Felts, Emilio Gallicchio, and Ronald M. Levy* Department of Chemistry and Chemical Biology

More information

Chemistry 431. Lecture 27 The Ensemble Partition Function Statistical Thermodynamics. NC State University

Chemistry 431. Lecture 27 The Ensemble Partition Function Statistical Thermodynamics. NC State University Chemistry 431 Lecture 27 The Ensemble Partition Function Statistical Thermodynamics NC State University Representation of an Ensemble N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T N,V,T

More information

Quantitative Stability/Flexibility Relationships; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 12

Quantitative Stability/Flexibility Relationships; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 12 Quantitative Stability/Flexibility Relationships; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 12 The figure shows that the DCM when applied to the helix-coil transition, and solved

More information

Paul Sigler et al, 1998.

Paul Sigler et al, 1998. Biological systems are necessarily metastable. They are created, modulated, and destroyed according to a temporal plan that meets the survival needs of the cell, organism, and species...clearly, no biological

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 23 Mar 2007

arxiv:cond-mat/ v1 [cond-mat.soft] 23 Mar 2007 The structure of the free energy surface of coarse-grained off-lattice protein models arxiv:cond-mat/0703606v1 [cond-mat.soft] 23 Mar 2007 Ethem Aktürk and Handan Arkin Hacettepe University, Department

More information

Solvation in protein folding analysis: Combination of theoretical and experimental approaches

Solvation in protein folding analysis: Combination of theoretical and experimental approaches Solvation in protein folding analysis: Combination of theoretical and experimental approaches A. M. Fernández-Escamilla*, M. S. Cheung, M. C. Vega, M. Wilmanns, J. N. Onuchic, and L. Serrano* *European

More information

Simulating Folding of Helical Proteins with Coarse Grained Models

Simulating Folding of Helical Proteins with Coarse Grained Models 366 Progress of Theoretical Physics Supplement No. 138, 2000 Simulating Folding of Helical Proteins with Coarse Grained Models Shoji Takada Department of Chemistry, Kobe University, Kobe 657-8501, Japan

More information

Universal correlation between energy gap and foldability for the random energy model and lattice proteins

Universal correlation between energy gap and foldability for the random energy model and lattice proteins JOURNAL OF CHEMICAL PHYSICS VOLUME 111, NUMBER 14 8 OCTOBER 1999 Universal correlation between energy gap and foldability for the random energy model and lattice proteins Nicolas E. G. Buchler Biophysics

More information

Collection of Biostatistics Research Archive

Collection of Biostatistics Research Archive Collection of Biostatistics Research Archive COBRA Preprint Series Year 2009 Paper 53 A Novel Topology for Representing Protein Folds Mark R. Segal University of California, San Francisco, mark@biostat.ucsf.edu

More information

Research Paper 1. Correspondence: Devarajan Thirumalai

Research Paper 1. Correspondence: Devarajan Thirumalai Research Paper Protein folding kinetics: timescales, pathways and energy landscapes in terms of sequence-dependent properties Thomas Veitshans, Dmitri Klimov and Devarajan Thirumalai Background: Recent

More information

arxiv:cond-mat/ v1 7 Jul 2000

arxiv:cond-mat/ v1 7 Jul 2000 A protein model exhibiting three folding transitions Audun Bakk Department of Physics, Norwegian University of Science and Technology, NTNU, N-7491 Trondheim, Norway arxiv:cond-mat/0007130v1 7 Jul 2000

More information

Relationships Between Amino Acid Sequence and Backbone Torsion Angle Preferences

Relationships Between Amino Acid Sequence and Backbone Torsion Angle Preferences PROTEINS: Structure, Function, and Bioinformatics 55:992 998 (24) Relationships Between Amino Acid Sequence and Backbone Torsion Angle Preferences O. Keskin, D. Yuret, A. Gursoy, M. Turkay, and B. Erman*

More information

Protein folding. Today s Outline

Protein folding. Today s Outline Protein folding Today s Outline Review of previous sessions Thermodynamics of folding and unfolding Determinants of folding Techniques for measuring folding The folding process The folding problem: Prediction

More information

Protein Folding experiments and theory

Protein Folding experiments and theory Protein Folding experiments and theory 1, 2,and 3 Protein Structure Fig. 3-16 from Lehninger Biochemistry, 4 th ed. The 3D structure is not encoded at the single aa level Hydrogen Bonding Shared H atom

More information

The effort to understand how proteins fold has consumed the

The effort to understand how proteins fold has consumed the An amino acid code for protein folding Jon Rumbley, Linh Hoang, Leland Mayne, and S. Walter Englander* Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania

More information

Folding of a Small Helical Protein Using Hydrogen Bonds and Hydrophobicity Forces

Folding of a Small Helical Protein Using Hydrogen Bonds and Hydrophobicity Forces 1 arxiv:cond-mat/0111291v1 [cond-mat.soft] 15 Nov 2001 LU TP 01-24 November 5, 2001 Folding of a Small Helical Protein Using Hydrogen Bonds and Hydrophobicity Forces Giorgio Favrin, Anders Irbäck and Stefan

More information

FREQUENCY selected sequences Z. No.

FREQUENCY selected sequences Z. No. COMPUTER SIMULATIONS OF PREBIOTIC EVOLUTION V.I. ABKEVICH, A.M. GUTIN, and E.I. SHAKHNOVICH Harvard University, Department of Chemistry 12 Oxford Street, Cambridge MA 038 This paper is a review of our

More information

Folding Pathway of the B1 Domain of Protein G Explored by Multiscale Modeling

Folding Pathway of the B1 Domain of Protein G Explored by Multiscale Modeling 726 Biophysical Journal Volume 94 February 2008 726 736 Folding Pathway of the B1 Domain of Protein G Explored by Multiscale Modeling Sebastian Kmiecik and Andrzej Kolinski Faculty of Chemistry, University

More information

Submolecular cooperativity produces multi-state protein unfolding and refolding

Submolecular cooperativity produces multi-state protein unfolding and refolding Biophysical Chemistry 101 102 (2002) 57 65 Submolecular cooperativity produces multi-state protein unfolding and refolding S. Walter Englander*, Leland Mayne, Jon N. Rumbley Johnson Research Foundation,

More information

Archives of Biochemistry and Biophysics

Archives of Biochemistry and Biophysics Archives of Biochemistry and Biophysics 531 (2013) 24 33 Contents lists available at SciVerse ScienceDirect Archives of Biochemistry and Biophysics journal homepage: www.elsevier.com/locate/yabbi Review

More information

S(l) bl + c log l + d, with c 1.8k B. (2.71)

S(l) bl + c log l + d, with c 1.8k B. (2.71) 2.4 DNA structure DNA molecules come in a wide range of length scales, from roughly 50,000 monomers in a λ-phage, 6 0 9 for human, to 9 0 0 nucleotides in the lily. The latter would be around thirty meters

More information

Folding with Downhill Behavior and Low Cooperativity of Proteins

Folding with Downhill Behavior and Low Cooperativity of Proteins PROTEINS: Structure, Function, and Bioinformatics 63:165 173 (2006) Folding with Downhill Behavior and Low Cooperativity of Proteins Guanghong Zuo, 1 Jun Wang, 1 and Wei Wang 1,2, * 1 National Laboratory

More information

Protein folding can be described by using a free energy

Protein folding can be described by using a free energy The folding energy landscape of apoflavodoxin is rugged: Hydrogen exchange reveals nonproductive misfolded intermediates Yves J. M. Bollen*, Monique B. Kamphuis, and Carlo P. M. van Mierlo *Department

More information

Asymmetric folding pathways and transient misfolding in a coarse-grained model of proteins

Asymmetric folding pathways and transient misfolding in a coarse-grained model of proteins May 2011 EPL, 94 (2011) 48005 doi: 10.1209/0295-5075/94/48005 www.epljournal.org Asymmetric folding pathways and transient misfolding in a coarse-grained model of proteins K. Wolff 1(a), M. Vendruscolo

More information

A molecular dynamics investigation of the kinetic bottlenecks of the hpin1 WW domain. II: simulations with the Go model

A molecular dynamics investigation of the kinetic bottlenecks of the hpin1 WW domain. II: simulations with the Go model Proceedings of the 2006 WSEAS International Conference on Mathematical Biology and Ecology, Miami, Florida, USA, January 18-20, 2006 (pp31-36) A molecular dynamics investigation of the kinetic bottlenecks

More information

letters Transition states and the meaning of Φ-values in protein folding kinetics S. Banu Ozkan 1,2, Ivet Bahar 3 and Ken A.

letters Transition states and the meaning of Φ-values in protein folding kinetics S. Banu Ozkan 1,2, Ivet Bahar 3 and Ken A. formed protein DNA complex (3 µl) was mixed with 3 µl of reservoir solution containing 15% (w/v) PEG 4000, 100 mm 2-(N- Morpholino)ethanesulfonic acid (MES), ph 6.0, 100 mm NH 4 H 2 PO 4 and 15% (v/v)

More information

Contact pair dynamics during folding of two small proteins: Chicken villin head piece and the Alzheimer protein -amyloid

Contact pair dynamics during folding of two small proteins: Chicken villin head piece and the Alzheimer protein -amyloid Contact pair dynamics during folding of two small proteins: Chicken villin head piece and the Alzheimer protein -amyloid Arnab Mukherjee and Biman Bagchi a) Solid State and Structural Chemistry Unit, Indian

More information