Calculation of protein conformation as an assembly of stable overlapping segments: Application to bovine pancreatic trypsin inhibitor

Size: px
Start display at page:

Download "Calculation of protein conformation as an assembly of stable overlapping segments: Application to bovine pancreatic trypsin inhibitor"

Transcription

1 Proc. Natl. Acad. Sci. USA Vol. 88, pp , May 1991 Biophysics Calculation of protein conformation as an assembly of stable overlapping segments: Application to bovine pancreatic trypsin inhibitor (conformational energy calculations/short-range interactions/build-up procedure/"conformon") ISTVAN SIMON*, LESLIE GLASSERt, AND HAROLD A. SCHERAGAt Baker Laboratory of Chemistry, Cornell University, Ithaca, NY Contributed by Harold A. Scheraga, January 4, 1991 ABSTRACT Conformations of bovine pancreatic trypsin inhibitor were calculated by assuming that the final structure as well as properly chosen overlapping segments thereof are simultaneously in low-energy (not necessarily the lowestenergy) conformational states. Therefore, the whole chain can be built up from building blocks whose conformations are determined primarily by short-range interactions. Our earlier buildup procedure was modified by taking account of a statistical analysis ofknown amino acid sequences that indicates that there is nonrandom pairing of amino acid residues in short segments along the chain, and by carrying out energy minimization on only these segments and on the whole chain [without minimizing the energies of intermediate-size segments (0-30 residues long)]. Results of this statistical analysis were used to determine the variable sizes of the overlapping oligopeptide building blocks used in the calculations; these varied from tripeptides to octapeptides, depending on the amino acid sequence. Successive stages of approximations were used to combine the low-energy conformations of these building blocks in order to keep the number of variables in the computations to a manageable size. The calculations led to a limited number of conformations of the protein (only two different groups, with very similar structure within each group), most residues of which were in the same conformational state as in the native structure. To overcome the large entropy difference between the unique native state and the ensemble of conformations constituting the unfolded states, the native conformation of a protein must correspond to a deep minimum in its conformational energy hypersurface. The whole conformational space available to the protein cannot possibly be explored within any reasonable amount of time, so that the success of refolding experiments implies that the minimum in the Gibbs free energy function which corresponds to the native state must be significantly deeper than any other minima attainable during the course of refolding. When the whole of conformational space is considered, however, there are so many minima (1) that there is no hope of finding the one minimum corresponding to the native conformation by attempting to examine all of them (). Therefore, the native conformation must be identified in a computationally feasible way that does not require comparison of the energies ofall the minimum-energy conformations. In this paper, we present a procedure that uses information only about the amino acid sequence and locations of the disulfide bonds of the protein and that results in a very limited number of conformations, which are close enough to the native one for application offurther refinement techniques The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C solely to indicate this fact. (1). This procedure has been applied to bovine pancreatic trypsin inhibitor (), a 58-residue protein. Designation of the Native Conformation We assume that the native conformation is not only the one of highest stability but also the one in which all properly chosen segments of the polypeptide chain are simultaneously in low-energy (not necessarily the lowest-energy) conformations (3). This implies that short-range interactions play a dominant role in determining the conformations of these segments (4). The number of low-energy conformations of an oligopeptide is usually several orders of magnitude smaller than the number of combinations constructed by combining the lowenergy conformations of the individual residues of which the oligopeptide is constituted. Since the common part of two overlapping segments must be in the same conformation in both of the overlapping oligopeptides (5), the number of conformations of the whole polypeptide chain in which most of the overlapping segments are simultaneously in a lowenergy conformation is rather limited (3). Such whole polypeptide conformations are designated as "conformons" [formerly called X-conformations (3)], and it has been suggested that, for properly defined segment sizes and energy ranges, the native conformation is the only conformon of the whole polypeptide chain (3); for short-chain proteins, this unique conformon would be the native conformation, but for long chains it would be the native conformation only of an independently folding domain. Thus, we search the conformational space for the conformon of the whole chain (defined by short-range interactions) and assume that it corresponds to the native conformation. On the other hand, we must keep in mind that, whereas short-range interactions dominate in determining the minimum-energy conformation corresponding to the conformon of the whole chain, long-range and protein-solvent interactions affect the stabilization free energy and contribute in determining the exact conformation of the protein; these additional interactions are incorporated by minimizing the energy of the whole structure (with inclusion of solvent and disulfide bonds) in the final stage of our procedure. Low-energy conformations of oligopeptides can be obtained by using a buildup procedure (6). Vdsquez and Scheraga (7, 8) built up the whole conformation from low-energy fragments by using a limited number of distance Abbreviation:, bovine pancreatic trypsin inhibitor. *On leave from: Institute of Enzymology, B.R.C., Hungarian Academy of Sciences, H-1518 Budapest, P.O. Box 7, Hungary ton leave from: Department of Chemistry, University of the Witwatersrand, Wits 050, South Africa tto whom reprint requests should be addressed. 3661

2 366 Biophysics: Simon et al. constraints from simulated NMR spectra. In that approach, the low-energy conformations of all the constituent tetrapeptides were calculated independently of one another and then overlapped. Since the number of conformations of larger fragments built up from various combinations of these tetrapeptide conformations was exceedingly large, about 00 distance constraints (from simulated NMR spectra) were required to reduce the magnitude of the computational problem to select the final, native structure. In this paper, we modify the buildup procedure by introducing statistical information expressing the correlation between the types of residues that can exist at various positions along the chain; this information dictates the sizes of the fragments (which depend on the amino acid sequence) that should be used in the buildup procedure and obviates the previous need (7, 8) to introduce distance constraints. At the present stage of development of this modified procedure, however, the final results are not yet as good as those of refs. 7 and 8. Initial Screening of the Conformational Space We first considered the influence of neighboring residues on each other's conformations, without introducing correlations among the types of amino residues at this stage. For this purpose, we began with tripeptides as the initial building blocks. When the conformational energies of tripeptides (and, later, of larger oligopeptides) were calculated, their N and C termini were blocked with acetyl and methylamide groups, respectively. These blocking groups simulated the adjacent peptide groups within a protein. The total energy was calculated with the ECEPP/ potential (Empirical Conformational Energy Program for Peptides; refs. 9-11) together with the SUMSL minimizer (Secant Unconstrained Minimization Solver; ref. 1). All backbone and side-chain dihedral angles were varied in the minimization. At the oligopeptide stage, no solvent effects were included in the calculation because, at this stage, it was not known which residues were exposed to solvent. In the final stage, when the conformational energy of the whole molecule was calculated and minimized, the solvation free energy of its constituent groups, proportional to the water-accessible surface area, was included in the total energy (13) with an average penalty of 0.05 kcal/(mol A) for both polar and nonpolar groups (14). Unless noted otherwise, two conformations of a residue were considered to be the same if their backbone dihedral angles fell in the same region of the map, as defined by the conformational code of Zimmerman et al. (15) [which divides the map into 16 regions], and the first side-chain dihedral angles, X1, were in the same rotational isomeric state, g+, g-, or t. The conformational energies were minimized for sequential tripeptides, starting at the N terminus of, as follows. All of the low-energy conformations of the terminally blocked tripeptide RPD were calculated first, by combining all lowenergy conformations (16) of the single terminally blocked amino acid residues R, P, and D that fell within a 3.0-kcal/mol range of the global minimum; all single residues differing only in X' (j> 1) were replaced by the lowest-energy one in each set. All tripeptide conformations with minimized energies less than 8 kcal/mol above the global minimum were retained. These energy ranges were chosen in order to retain a manageable number of conformations. To minimize the low-energy conformations of the next overlapping tripeptide, PDF, not as an isolated one but as one whose conformation reflects the presence of the preceding residue R, the full list of low-energy conformations of RPD was reduced by selecting only the lowest-energy conformations (within the 8-kcal/mol range) of RPD for which the PD Proc. Natl. Acad. Sci. USA 88 (1991) dipeptide conformations differed from each other (i.e., in 4, q1, XI), irrespective of the conformation of residue R. These selected PD dipeptide conformations were combined with all of the low-energy conformations of the next residue, F. With this overlapping new tripeptide, the whole of the above procedure was repeated. The process was then repeated for each successive overlapping tripeptide of the whole polypeptide chain. An 8-kcal/mol cutoff was used for tripeptides that did not contain half-cystine, C, whereas a higher, 10-kcal/mol cutoff was used for those that did contain C, because of the extra constraining covalent bond that can compensate for higher conformational energies. It is important to note that, when the conformations of the second tripeptide, PDF, were calculated, the lowest-energy conformation was not necessarily the known globalminimum one of the isolated tripeptide PDF. This is because we used only those initial PD conformations which appeared among the low-energy conformations of the first tripeptide, RPD, and this ensemble of PD conformations reflects the influence of the residue R. Some of the conformations of isolated PDF might have lower energies than those computed here but, when the PD portion is combined with R, they would have higher energies than the 8-kcal/mol cutoff for RPD. The cutoff level was measured with respect to the minimum of this modified set of PDF conformations (influenced by residue R), rather than from the global minimum of isolated PDF, and a range of energies (8-10 kcal/mol), smaller than that used previously (7, 8), was enough to retain the native conformations of all tripeptides. By using this overlapping procedure, all successive tripeptides, up to the C-terminal one, feel the influence of all of the preceding residues. When all of the tripeptides of the polypeptide chain had been generated in this manner-i.e., when the C-terminal tripeptide, GGA, was reached-we had retained about 10 to 10,000 conformations with an average of about 1000 for each tripeptide. At this point, without carrying out any further energy calculations, we introduced the following screening process in the reverse direction to reduce the number of conformations even further. From the low-energy conformations of the C-terminal tripeptide, GGA, the different conformations of the dipeptide GG (reflecting the influence of the preceding residues and of the C-terminal residue A) were selected by the same criterion as that used for PD in the forward chain generation. Moving backward from the C terminus to the N terminus, we retained only those conformations of the preceding tripeptide, CGG, for which the conformation of the GG dipeptide portion appeared in the set of low-energy conformations of the GGA tripeptide, irrespective of the conformation of residue A. With the new, reduced list for tripeptide CGG, the backward selection (based on overlapping tripeptides) was continued until the N-terminal tripeptide, RPD, was reached. At the end of this backward selection, an average of about 300 conformations of each tripeptide remained, with the number varying from about 10 to 3000; the native conformation of each tripeptide [in terms of the code of Zimmerman et al. (15)] still appeared in each ensemble. Introduction of Statistical Information: Identification of Low-Energy Building Blocks In order to build up the conformon of the whole protein from low-energy conformations of smaller segments, we must first determine the sizes of the low-energy segments (to be built from the foregoing tripeptides) that will serve as the building blocks. For this purpose, we next introduced the use of statistical information about regularities in amino acid sequences.

3 Biophysics: Simon et A It has been shown that short-range regularities (nonrandomness) exist in the amino acid sequences of proteins, and that these regularities lead to certain structural features (17-19). Recently, the range of these regularities, measured by the nonrandomness of amino acid pairing (or the correlation between the types of residues) as a function of the separation between two residues in an amino acid sequence, has been determined (0). If it is assumed that the range of nonrandom pairing reflects the separation within which the residues influence each other's conformations significantly, the lengths of the overlapping segments (building blocks) that simultaneously must be in low-energy conformations are found to vary from tripeptides to octapeptides, depending on the amino acid sequences of the relevant segments (0). To obtain these building blocks, we combined the foregoing reduced set of tripeptide conformations into larger segments. The sizes chosen for these building blocks reflect the separation distances for nonrandom pairing of the residues, as found from a large protein sequence data base (0). For example, the N-terminal sequence of is RPDF- CLEPPY... Starting with R at position i, there are nonrandom correlations of R with P at position i + 1, with D at position i +, with F at position i + 3, and with C at position i + 4, but the correlation becomes random with residue L at position i + 5; therefore, the first building block is RPDFC. The remaining building blocks, selected by this criterion, are each shifted along the chain by at least one residue and are ilhimctretaa crhermntieqilv in Pip 1 To IIemrnntrate how the Proc. Natd. Acad. Sci. USA 88 (1991) 3663 tripeptide conformations of the previous reduced set for RPD, PDF, and DFC were combined (with no further screening at this point) to form the pentapeptide; the energies of all of these conformations of RPDFC were then minimized with respect to all backbone and side-chain dihedral angles [the starting values of Xi, forj > 1, were the lowest-energy ones for single residues (16) for the given backbone conformation]. It was observed that, after the earlier screening at the tripeptide level, it was possible to retain the native conformations of the building blocks with a lower energy cutoffthan 8-10 kcal/mol. Thus, a 5-kcal/mol cutoff was chosen in the energy minimization for each building block that did not include half-cystine, while a 7-kcal/mol cutoffwas chosen for those having half-cystine. Only structures having energies below the cutoffs were retained as building blocks for further buildup. When the C terminus was reached, with an accumulation of a few hundred to hundreds of thousands of oligopeptide conformations, the same backward selection (with no energy minimization) as was described for the tripeptides was applied; in this backward screening, the degree of overlap shown in Fig. 1 was used. This screening resulted in to 10,000 conformations per oligopeptide, with the retention of the native conformation of each building block. Buildup of Chain with Low-Energy Building Blocks: Use of Disulfide Loops second building block was obtained, we note that, for P taken To proceed further, it was necessary to invoke additional as residue i, there is a random correlation with the next three chemical information to limit the number of conformations residues, D, F, and C (0); likewise for D with respect to F retained during the construction of the whole molecule; this C, and L; however, the same logic, applied to F as residue i information was the locations of the disulfide-bonded loops. yields the hexapeptide FCLEPP. Therefore, these two oli- We began the build-up with the N terminus of the smallest gopeptides (RPDFC and FCLEPP), rather than the nonapep- disulfide-bonded loop, Cys30-Cys51 (actually from Gly8 to tide RPDFCLEPP, were considered as building blocks be- Arg53 to avoid splitting a building block, and then truncating cause R is correlated up to C but not beyond. back to Cys3-Cys51). Further reduction of the ensembles of The building blocks were assembled by again starting at t building blocks was achieved by considering the nondegen- N terminus ofthe protein. Forexample, forrpdfc, all ofthe erate minima-i.e., by selecting only one (lowest-energy) side-chain conformation for the given backbone conformal tion (1) [according to the code of Zimmerman et al. (15)] for each residue in the building blocks. 1+1 l In order to combine the ensembles of building blocks in all 1.4+' combinations [based on the conformational letter code (15)] according to the overlaps in Fig. 1 to form larger segments, 1-H-H the dihedral angles of the overlapping residues in the two Ill,, building blocks were averaged (since the conformations being averaged were similar, no change occurred in the conforma- Hl l H tional letter code). For values close to 1800, however, the Hll l llldeviations 4 from 1800 were averaged. This procedure of come+1-91 l bination (without energy minimization) resulted in 3000 dif- F-H-I ferent backbone conformations (including the native) for the segment between residues 30 and 51. HlH Of these 3000 conformations, only 47 had a separation FH-Ibetween the two sulfur atoms of <10 A. The choice of 10 A l+hl lllwas a compromise between 5 A, for which no conformations were found, and 13 A, for which too many remained, espelhl llcially when combined with later fragments. With the io-a cutoff, the native backbone conformation of the Cys30-Cys51 loop was lost, but several backbone conformations (very similar to the corresponding part of the native protein) were l-hl lllretained. Energy minimization with a disulfide loop-closing H potential [without constraints on the (q5, f) dihedral angles] H was not applied at this stage because it might have shifted the conformational codes (15), thereby destroying the building FIG. 1. The sizes of the segments that were used as stable blocks and preventing further implementation ofthe essential building blocks in the buildup procedure, and their distribution along feature of this modified buildup procedure (the simultaneous the polypeptide chain of. The residues of the C-terminal existence of all the building blocks of the whole chain in tripeptide, G, G and A, are in random pairing with the preceding low-energy conformations). In the final stage of the proceresidues (0); therefore, this figure shows segments only up to dure, however, after the whole structure was built, energy residue 55. minimization (with a disulfide loop-closing potential) was

4 3664 Biophysics: Simon et al. applied, and the conformational codes were allowed to change. Among these 47 conformations, there were only different conformations for the nonapeptide from residues 30 to 38, which is the C-terminal portion of a second loop, the one involving Cys'4 and Cys38. These two nonapeptide conformations were combined, successively, with the building blocks of Fig. 1, moving toward the N terminus of Cys'4 to form the Cys14-Cys38 segment. Of the resulting =500 conformations of the segment, only 49 had a separation distance between the sulfur atoms of <10 A. Combination of the 47 conformations of segment and the 49 conformations of segment resulted in 136 backbone conformations for the portion of the polypeptide. These conformations were extended in both directions with the building blocks of Fig. 1, resulting in >4000 conformations of the 5-55 portion of the chain; only 8 of these, which had a separation distance between the sulfur atoms of Cys5 and Cys55 within the 10-A limit, were retained. Consideration of the Side Chains of the Building Blocks In each of the 8 backbone structures of the 5-55 segment, the side chains were attached in all possible combinations (with respect to X1); however, we retained only those sidechain conformations (i.e., values of X1) that were found earlier for the residues in the respective building blocks [the values of Xi (j > 1) were taken from the lowest-energy conformations (for the given X1) of each single residue (16)]. This selection procedure resulted in >300 conformations, but most of them were very similar in the backbone codes of Zimmerman et al. (15) and in the rotational isomeric states of X1 of their side chains. These conformations essentially fell into only two different groups, reflecting the two different conformations for residues We selected 0 structures of segment 5-55 randomly (10 from each group), and combined them with the N-terminal building blocks of Fig. 1 and the original C-terminal tripeptides (the C-terminal residues, G, G, and A being noncorrelated with the preceding residues). The terminal oligopeptides, especially the C-terminal tripeptide GGA, exhibited great flexibility-i.e., hundreds of conformations could be fitted to the rest of the molecule. Therefore, we combined 10 randomly selected conformations of each of the N- and C-terminal oligopeptides with the 0 randomly selected conformations of segment 5-55 to produce 000 conformations of the whole molecule; 60 of these 000 conformations, chosen randomly from the two groups (more from the group with lower energy), were selected for further consideration. Energy Minimization of the Whole Molecule Once the energies of the building blocks of Fig. 1 had been minimized, no further energy minimization had been applied up to this point; the combination of the building blocks, thus far, involved only selective screening. Now, the energies [including a pseudopotential (9-11) to form the disulfide bonds] of the 60 selected conformations of the whole molecule were minimized without solvation; only 4 of these 60 conformations had energies <104 kcal/mol (this high cutoff reflected high-energy local minima). Only 8 of these 4 had different conformations of the whole molecule, the remaining differences being only in the conformations of the N- and C-terminal blocking groups. The energies of these 8 conformations were minimized again, but now including the solvation free energy (13, 14); this reminimization required about 30 times more computational time but resulted only in some reordering of the conformational energies with very little conformational change. Proc. Natl. Acad. Sci. USA 88 (1991) The whole procedure resulted in two different families of conformations. Within each of these two conformational families, the structures differed only in the conformations of the terminal parts or in the side-chain conformations, but their backbones were very similar. Results and Discussion The lowest-energy structures of each of the two different conformational families were compared with the x-ray structure of native (), using the coordinate set 5PTI from the Brookhaven Data Bank (3, 4). Both calculated conformations are much less compact than the x-ray structure. The two loops, from Cys'4 to Cys38 and from Cys30 to Cys51, appear to be similar to those in the native structure, although the Cys14-Cys38 loop is slightly less twisted and the Cys30- Cys5' loop is more planar in the calculated conformations. The relative placements of the segments have been less accurately established. For example, the overlapping part of the two loops, the nonapeptide 30-38, was one of the central elements of our calculation procedure. When the nonapeptide from one of the families of the whole molecule is optimally superimposed on the corresponding portion of the x-ray structure, the rms deviation of the backbone atoms is <1 A (the nonapeptide from the other family is not as good); however, when the whole molecule is superimposed on the x-ray structure, these central nonapeptides lie far from one another, with a rms distance of 1oA. This demonstrates that when two structures are not sufficiently similar, the rms deviation can be a very misleading measure of agreement because a single parameter cannot distinguish between, on the one hand, two completely different structures and, on the other, two structures consisting of very similar segments that are poorly placed with respect to each other. Therefore, at the present stage of the development of our procedure (where our results are mainly of conceptual interest), we use only the conformational codes of Zimmerman et al. (15), rather than the rms deviation or a more detailed differential geometric comparison (5), for comparing two conformations. This type of comparison not only illustrates the conformational differences but also indicates whether the differences are distributed evenly or are confined to a few positions along the 14 F AC la BE F F IEC E ICI F 'Ft ' F' F F E F*: B F C A C F C LAj IEj F F DJLI E C F 15 9 D F D C C E E E C A A A A A* G D F D C C E E E C C A A A A* E D FD C C E E E C A A A A A* G I EiiDii Ei C CIrA ia*e G G G A C D IF IB' E A C D F: IEi FiE IA*I L..J C'C'L A E A* F A C D [m] FCI A C A A A A A A A* C* D G LIJ B: E AA A A A A A WrB B F*? A LAJ A J [AJ J JA J LJ IAI C* E G FIG.. Backbone conformational codes (15) of the residues. First row, structure 1; second row, standard-geometry version (7, 8) of the x-ray structure; and third row, structure. The boxes with full lines enclose identical conformational codes, whereas the boxes with broken lines enclose immediately adjacent conformational codes (15). The question marks arise because dihedral angle 4 of residue 1 and dihedral angle 4i of residue 58 are not defined in the x-ray structure.

5 polypeptide chain. Such a comparison is given in Fig.. For both groups of calculated conformations, about two-thirds of the residues are in the same conformational code region as in the standard-geometry version (7, 8) of the x-ray structure. Moreover, in structure no., half of the residues with incorrect backbone conformations are located at the two ends of the chain, which are more flexible than the interior of the chain. While such good agreement of conformational status (15) is encouraging, it should be recalled that a perfectprediction model [albeit only a 5-state one (6), compared to the 16-state one of Zimmerman et al. (15)] led to a poor structure (6); however, further refinements by procedures outlined in ref. I can improve the structure. Conclusions Biophysics: Simon et A Polypeptides with special, naturally selected amino acid sequences adopt unique, native conformations that have lower free energy than does the unfolded state. Therefore, when we attempt to solve the protein-folding problem, we must consider only those polypeptides having these special, naturally selected sequences. It therefore makes sense to incorporate the special features of naturally selected sequences in calculations ofprotein conformation. This was the basis for modifying the buildup procedure by selection of the sizes of the overlapping building blocks based on nonrandom correlations of amino acid sequences, from which the whole molecule was built. The whole procedure is based on the premise that shortrange interactions determine which of the minimum-energy structures correspond to the conformon, which is assumed to be the native structure. It is believed that chain-folding initiation sites (CFISs) are formed in various parts of the unfolded polypeptide chain in response to short-range interactions (7-9), and that further folding around these CFISs is also dominated by short-range interactions. In fact, CFISs have been shown to exist in short peptide fragments (9). It is further assumed that long-range and protein-solvent interactions only enhance the stability of the folded structure. The use of energy minimization (with inclusion of solvent and disulfide bonds) in the final stage of the procedure conforms to the collapse phenomenon discussed by Chan and Dill (30). Polypeptide sequences that do not lead to a single structure whose free energy is significantly lower than that ofany other accessible conformation may be presumed to be unsuitable from a biological point of view, so that natural selection will not retain such amino acid sequences. The procedure presented in this paper has taken advantage of this presumed natural selection to obtain a unique or, perhaps, a limited number of conformations for a polypeptide with selected amino acid sequences in which the overlapping building blocks are simultaneously in low-energy conformations. We thank K. D. Gibson, A. Nayeem, G. Ndmethy, M. R. Pincus, S. Rackovsky, and M. VAsquez for helpful comments on the manuscript. This work was supported by research grants from the National Institute of General Medical Sciences of the National Institutes of Health (GM1431) and from the National Science Foundation (DMB ). Support was also received from the National Foundation for Cancer Research and from the U.S.-Hungary International Cooperative Science Program (NSF-INT88-75). Support from the Hungarian Academy of Sciences (OTKA 318) is also Proc. Natl. Acad. Sci. USA 88 (1991) 3665 acknowledged. The computations were carried out at the Cornell National Supercomputer Facility, a resource of the Cornell Center for Theory and Simulation in Science and Engineering, which receives major funding from the National Science Foundation and the IBM Corporation, with additional support from New York State and members of the Corporate Research Institute. L.G. acknowledges additional support for sabbatical leave from both the University of the Witwatersrand and the Foundation for Research Development. 1. Scheraga, H. A. (1989) Chem. Scr. 9A, Levinthal, C. (1968) J. Chim. Phys. Phys.-Chim. Biol. 65, Simon, I. (1985) J. Theor. Biol. 113, Scheraga, H. A. (1973) Pure Appl. Chem. 36, Scheraga, H. A. (1983) Biopolymers, Simon, I., Ndmethy, G. & Scheraga, H. A. (1978) Macromolecules 11, VAsquez, M. & Scheraga, H. A. (1988) J. Biomol. Struct. Dyn. 5, Vasquez, M. & Scheraga, H. A. (1988) J. Biomol. Struct. Dyn. 5, Momany, F. A., McGuire, R. F., Burgess, A. W. & Scheraga, H. A. (1975) J. Phys. Chem. 79, NMmethy, G., Pottle, M. S. & Scheraga, H. A. (1983) J. Phys. Chem. 87, Sippl, M. J., Ndmethy, G. & Scheraga, H. A. (1984) J. Phys. Chem. 88, Gay, D. M. (1983) ACM Trans. Math. Software 9, Vila, J., Williams, R. L., VAsquez, M. & Scheraga, H. A. (1990) Proteins Struct. Funct. Genet., in press. 14. Chothia, C. (1974) Nature (London) 48, Zimmerman, S. S., Pottle, M. S., Ndmethy, G. & Scheraga, H. A. (1977) Macromolecules 10, VAsquez, M., Ndmethy, G. & Scheraga, H. A. (1983) Macromolecules 16, Vonderviszt, F., Matrai, G. & Simon, I. (1986) Int. J. Pept. Protein Res. 7, Vonderviszt, F. & Simon, I. (1986) Biochem. Biophys. Res. Commun. 139, Tudos, E., Cserzo, M. & Simon, I. (1990) Int. J. Pept. Protein Res. 36, Cserzo, M. & Simon, I. (1989) Int. J. Pept. Protein Res. 34, Pincus, M. R., Klausner, R. D. & Scheraga, H. A. (198) Proc. Natd. Acad. Sci. USA 79, Wlodawer, A., Deisenhofer, J. & Huber, R. (1987) J. Mol. Biol. 193, Bernstein, F. C., Koetzle, T. F., Williams, G. J. B., Meyer, E. F., Jr., Brice, M. D., Rodgers, J. R., Kennard, O., Shimanouchi, T. & Tasumi, M. (1977) J. Mol. Biol. 11, Abola, E. E., Bernstein, F. C., Bryant, S. H., Koetzle, T. F. & Weng, J. (1987) in Crystallographic Databases: Information Content, Software Systems, Scientific Applications, eds. Allen, F. H., Bergerhoff, G. & Sievers, R. (Data Commission of the Int. Union of Crystallography, Bonn), pp Rackovsky, S. & Scheraga, H. A. (1980) Macromolecules 13, Burgess, A. W. & Scheraga, H. A. (1975) Proc. Natl. Acad. Sci. USA 7, Matheson, R. R. & Scheraga, H. A. (1978) Macromolecules 11, Scheraga, H. A. (1980) in Protein Folding, ed. Jaenicke, R. (Elsevier, Amsterdam), pp Montelione, G. T. & Scheraga, H. A. (1989) Acc. Chem. Res., Chan, H. S. & Dill, K. A. (1991) Annu. Rev. Biophys. Biophys. Chem. 0,

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle. Project I Chemistry 8021, Spring 2005/2/23 This document was turned in by a student as a homework paper. 1. Methods First, the cartesian coordinates of 5021 and 8021 molecules (Fig. 1) are generated, in

More information

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed. Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate

More information

New Optimization Method for Conformational Energy Calculations on Polypeptides: Conformational Space Annealing

New Optimization Method for Conformational Energy Calculations on Polypeptides: Conformational Space Annealing New Optimization Method for Conformational Energy Calculations on Polypeptides: Conformational Space Annealing JOOYOUNG LEE, 1,2 HAROLD A. SCHERAGA, 2 S. RACKOVSKY 1 1 Department of Biomathematical Sciences,

More information

JOOYOUNG LEE*, ADAM LIWO*, AND HAROLD A. SCHERAGA*

JOOYOUNG LEE*, ADAM LIWO*, AND HAROLD A. SCHERAGA* Proc. Natl. Acad. Sci. USA Vol. 96, pp. 2025 2030, March 1999 Biophysics Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: Application

More information

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2004 How do we go from an unfolded polypeptide chain to a compact folded protein? (Folding of thioredoxin, F. Richards) Structure - Function

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Folding of Polypeptide Chains in Proteins: A Proposed Mechanism for Folding

Folding of Polypeptide Chains in Proteins: A Proposed Mechanism for Folding Proc. Nat. Acad. Sci. USA Vol. 68, No. 9, pp. 2293-2297, September 1971 Folding of Polypeptide Chains in Proteins: A Proposed Mechanism for Folding PETER N. LEWS, FRANK A. MOMANY, AND HAROLD A. SCHERAGA*

More information

Lecture 34 Protein Unfolding Thermodynamics

Lecture 34 Protein Unfolding Thermodynamics Physical Principles in Biology Biology 3550 Fall 2018 Lecture 34 Protein Unfolding Thermodynamics Wednesday, 21 November c David P. Goldenberg University of Utah goldenberg@biology.utah.edu Clicker Question

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

Lecture 11: Protein Folding & Stability

Lecture 11: Protein Folding & Stability Structure - Function Protein Folding: What we know Lecture 11: Protein Folding & Stability 1). Amino acid sequence dictates structure. 2). The native structure represents the lowest energy state for a

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2003 Structure - Function Protein Folding: What we know 1). Amino acid sequence dictates structure. 2). The native structure represents

More information

Section Week 3. Junaid Malek, M.D.

Section Week 3. Junaid Malek, M.D. Section Week 3 Junaid Malek, M.D. Biological Polymers DA 4 monomers (building blocks), limited structure (double-helix) RA 4 monomers, greater flexibility, multiple structures Proteins 20 Amino Acids,

More information

An approach to the multiple-minima problem

An approach to the multiple-minima problem Proc. Natl. Acad. Sci. USA Vol. 83, pp. 2782-2786, May 986 Chemistry An approach to the multiple-minima problem by relaxing dimensionality (distance geometry/cayley-menger determinants/energy ENRICO 0.

More information

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Tomoyuki Hiroyasu, Mitsunori Miki, Shinya Ogura, Keiko Aoi, Takeshi Yoshida, Yuko Okamoto Jack Dongarra

More information

ON THE DOMINANCE OF SHORT-RANGE INTERACTIONS IN POLYPEPTIDES AND PROTEINS

ON THE DOMINANCE OF SHORT-RANGE INTERACTIONS IN POLYPEPTIDES AND PROTEINS ON THE DOMINANCE OF SHORT-RANGE INTERACTIONS IN POLYPEPTIDES AND PROTEINS HAROLD A. SCHERAGA Department of Chemistry, Cornell University, Ithaca, New York 14850, USA ABSTRACT The development of the concept

More information

Protein Folding. I. Characteristics of proteins. C α

Protein Folding. I. Characteristics of proteins. C α I. Characteristics of proteins Protein Folding 1. Proteins are one of the most important molecules of life. They perform numerous functions, from storing oxygen in tissues or transporting it in a blood

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Homework Problem Set 4 Solutions

Homework Problem Set 4 Solutions Chemistry 380.37 Dr. Jean M. Standard omework Problem Set 4 Solutions 1. A conformation search is carried out on a system and four low energy stable conformers are obtained. Using the MMFF force field,

More information

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted

More information

Proteins polymer molecules, folded in complex structures. Konstantin Popov Department of Biochemistry and Biophysics

Proteins polymer molecules, folded in complex structures. Konstantin Popov Department of Biochemistry and Biophysics Proteins polymer molecules, folded in complex structures Konstantin Popov Department of Biochemistry and Biophysics Outline General aspects of polymer theory Size and persistent length of ideal linear

More information

Algorithm for Rapid Reconstruction of Protein Backbone from Alpha Carbon Coordinates

Algorithm for Rapid Reconstruction of Protein Backbone from Alpha Carbon Coordinates Algorithm for Rapid Reconstruction of Protein Backbone from Alpha Carbon Coordinates MARIUSZ MILIK, 1 *, ANDRZEJ KOLINSKI, 1, 2 and JEFFREY SKOLNICK 1 1 The Scripps Research Institute, Department of Molecular

More information

Prof. Jason Kahn Your Signature: Exams written in pencil or erasable ink will not be re-graded under any circumstances.

Prof. Jason Kahn Your Signature: Exams written in pencil or erasable ink will not be re-graded under any circumstances. 1 Biochemistry 461 February 16, 1995 Exam #1 Prof. Jason Kahn Your Printed Name: Your SS#: Your Signature: You have 75 minutes for this exam. Exams written in pencil or erasable ink will not be re-graded

More information

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

UNIT TWELVE. a, I _,o ' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I -oh UNT TWELVE PROTENS : PEPTDE BONDNG AND POLYPEPTDES 12 CONCEPTS Many proteins are important in biological structure-for example, the keratin of hair, collagen of skin and leather, and fibroin of silk. Other

More information

Protein Structure Basics

Protein Structure Basics Protein Structure Basics Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera Importance of Proteins Muscle structure depends on protein-protein interactions Transport across membranes

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001 Modeling two-state cooperativity in protein folding Ke Fan, Jun Wang, and Wei Wang arxiv:cond-mat/0103385v1 [cond-mat.soft] 19 Mar 2001 National Laboratory of Solid State Microstructure and Department

More information

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Biophysical Journal, Volume 98 Supporting Material Molecular dynamics simulations of anti-aggregation effect of ibuprofen Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov Supplemental

More information

Powerful Simulated-Annealing Algorithm Locates Global Minimum of Protein-Folding Potentials from Multiple Starting Conformations

Powerful Simulated-Annealing Algorithm Locates Global Minimum of Protein-Folding Potentials from Multiple Starting Conformations Powerful Simulated-Annealing Algorithm Locates Global Minimum of Protein-Folding Potentials from Multiple Starting Conformations Mark E. Snow University of Michigan, Scientific Computation Croup, 535 W.

More information

Oxidative folding is the composite process by which a protein

Oxidative folding is the composite process by which a protein Structural determinants of oxidative folding in proteins Ervin Welker*, Mahesh Narayan*, William J. Wedemeyer, and Harold A. Scheraga Baker Laboratory of Chemistry and Chemical Biology, Cornell University,

More information

Folding of small proteins using a single continuous potential

Folding of small proteins using a single continuous potential JOURNAL OF CHEMICAL PHYSICS VOLUME 120, NUMBER 17 1 MAY 2004 Folding of small proteins using a single continuous potential Seung-Yeon Kim School of Computational Sciences, Korea Institute for Advanced

More information

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model JOURNAL OF CHEMICAL PHYSICS VOLUME 111, NUMBER 8 22 AUGUST 1999 Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model Benjamin Vekhter and R. Stephen

More information

Thermodynamic parameters for the helix-coil transition of oligopeptides: Molecular dynamics simulation with the peptide growth method

Thermodynamic parameters for the helix-coil transition of oligopeptides: Molecular dynamics simulation with the peptide growth method Proc. Natl. Acad. Sci. USA Vol. 92, pp. 10924-10928, November 1995 Biophysics Thermodynamic parameters for the helix-coil transition of oligopeptides: Molecular dynamics simulation with the peptide growth

More information

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015, Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015, Course,Informa5on, BIOC%530% GraduateAlevel,discussion,of,the,structure,,func5on,,and,chemistry,of,proteins,and, nucleic,acids,,control,of,enzyma5c,reac5ons.,please,see,the,course,syllabus,and,

More information

Molecular Structure Prediction by Global Optimization

Molecular Structure Prediction by Global Optimization Molecular Structure Prediction by Global Optimization K.A. DILL Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, CA 94118 A.T. PHILLIPS Computer Science

More information

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years.

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years. Structure Determination and Sequence Analysis The vast majority of the experimentally determined three-dimensional protein structures have been solved by one of two methods: X-ray diffraction and Nuclear

More information

Conformational Analysis of Macromolecules in Ground and Excited States

Conformational Analysis of Macromolecules in Ground and Excited States BIOPOLYMERS VOL. 7, PP. 207-213 (1969) Quantum Mechanical Approach to the Conformational Analysis of Macromolecules in Ground and Excited States ROALD HOFFRIIANN and AKIRA IT\IAR/IURA,* Department of Chemistry,

More information

Enkephalin: Conformational analysis by means of empirical energy

Enkephalin: Conformational analysis by means of empirical energy Proc. Nati. Acad. Sci. USA Vol. 74, No. 2, pp. 414-418, February 1977 Chemistry Enkephalin: Conformational analysis by means of empirical energy calculations (peptide with opiate action/peptide conformation/conformational

More information

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor Note: Adequate space is given for each answer. Questions that require a brief explanation should

More information

arxiv:q-bio/ v1 [q-bio.bm] 11 Jan 2004

arxiv:q-bio/ v1 [q-bio.bm] 11 Jan 2004 Cooperativity and Contact Order in Protein Folding Marek Cieplak Institute of Physics, Polish Academy of Sciences, Al. Lotników 32/46, 02-668 Warsaw, Poland arxiv:q-bio/0401017v1 [q-bio.bm] 11 Jan 2004

More information

Protein Structure Determination from Pseudocontact Shifts Using ROSETTA

Protein Structure Determination from Pseudocontact Shifts Using ROSETTA Supporting Information Protein Structure Determination from Pseudocontact Shifts Using ROSETTA Christophe Schmitz, Robert Vernon, Gottfried Otting, David Baker and Thomas Huber Table S0. Biological Magnetic

More information

Energetics of the structure of the four-a-helix bundle in proteins

Energetics of the structure of the four-a-helix bundle in proteins Proc. Natl. Acad. Sci. USA Vol. 85, pp. 4295-4299, June 1988 Biophysics Energetics of the structure of the four-a-helix bundle in proteins (conformational energy computation/packing of helices/protein

More information

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination Lecture 9 M230 Feigon Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination Reading resources v Roberts NMR of Macromolecules, Chap 4 by Christina Redfield

More information

Protein secondary structure prediction with a neural network

Protein secondary structure prediction with a neural network Proc. Nati. Acad. Sci. USA Vol. 86, pp. 152-156, January 1989 Biophysics Protein secondary structure prediction with a neural network L. HOWARD HOLLEY AND MARTIN KARPLUS Department of Chemistry, Harvard

More information

A lattice model for protein structure prediction at low resolution

A lattice model for protein structure prediction at low resolution Proc~. Natl. Acad. Sci. USA Vol. 89, pp. 2536-2540, April 1992 Biophysics A lattice model for protein structure prediction at low resolution (protein folding/tertiary structure/conformational search) D.

More information

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila

Guessing the upper bound free-energy difference between native-like structures. Jorge A. Vila 1 Guessing the upper bound free-energy difference between native-like structures Jorge A. Vila IMASL-CONICET, Universidad Nacional de San Luis, Ejército de Los Andes 950, 5700- San Luis, Argentina Use

More information

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig part of Bioinformatik von RNA- und Proteinstrukturen Computational EvoDevo University Leipzig Leipzig, SS 2011 Protein Structure levels or organization Primary structure: sequence of amino acids (from

More information

Can a continuum solvent model reproduce the free energy landscape of a β-hairpin folding in water?

Can a continuum solvent model reproduce the free energy landscape of a β-hairpin folding in water? Can a continuum solvent model reproduce the free energy landscape of a β-hairpin folding in water? Ruhong Zhou 1 and Bruce J. Berne 2 1 IBM Thomas J. Watson Research Center; and 2 Department of Chemistry,

More information

Clustering of low-energy conformations near the native structures of small proteins

Clustering of low-energy conformations near the native structures of small proteins Proc. Natl. Acad. Sci. USA Vol. 95, pp. 11158 11162, September 1998 Biophysics Clustering of low-energy conformations near the native structures of small proteins DAVID SHORTLE*, KIM T. SIMONS, AND DAVID

More information

Simulations of the folding pathway of triose phosphate isomerase-type a/,8 barrel proteins

Simulations of the folding pathway of triose phosphate isomerase-type a/,8 barrel proteins Proc. Natl. Acad. Sci. USA Vol. 89, pp. 2629-2633, April 1992 Biochemistry Simulations of the folding pathway of triose phosphate isomerase-type a/,8 barrel proteins ADAM GODZIK, JEFFREY SKOLNICK*, AND

More information

From Amino Acids to Proteins - in 4 Easy Steps

From Amino Acids to Proteins - in 4 Easy Steps From Amino Acids to Proteins - in 4 Easy Steps Although protein structure appears to be overwhelmingly complex, you can provide your students with a basic understanding of how proteins fold by focusing

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Associate Professor Computer Science Department Informatics Institute University of Missouri, Columbia 2013 Protein Energy Landscape & Free Sampling

More information

Design of a Protein Potential Energy Landscape by Parameter Optimization

Design of a Protein Potential Energy Landscape by Parameter Optimization J. Phys. Chem. B 2004, 108, 4525-4534 4525 Design of a Protein Potential Energy Landscape by Parameter Optimization Julian Lee,,, Seung-Yeon Kim, and Jooyoung Lee*, Department of Bioinformatics and Life

More information

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain. Biochemistry Quiz Review 1I A general note: Short answer questions are just that, short. Writing a paragraph filled with every term you can remember from class won t improve your answer just answer clearly,

More information

The sequences of naturally occurring proteins are defined by

The sequences of naturally occurring proteins are defined by Protein topology and stability define the space of allowed sequences Patrice Koehl* and Michael Levitt Department of Structural Biology, Fairchild Building, D109, Stanford University, Stanford, CA 94305

More information

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease University of Wollongong Research Online Faculty of Science - Papers (Archive) Faculty of Science, Medicine and Health 2009 NMR study of complexes between low molecular mass inhibitors and the West Nile

More information

Ab-initio protein structure prediction

Ab-initio protein structure prediction Ab-initio protein structure prediction Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center, Cornell University Ithaca, NY USA Methods for predicting protein structure 1. Homology

More information

Complex flexibility of the transforming growth factor

Complex flexibility of the transforming growth factor Proc. Natl. Acad. Sci. USA Vol. 92, pp. 5406-5410, June 1995 Biophysics Complex flexibility of the transforming growth factor f3 superfamily (conformational analysis/protomer assembly) GANESH VENKATARAMAN*,

More information

Protein Structures. 11/19/2002 Lecture 24 1

Protein Structures. 11/19/2002 Lecture 24 1 Protein Structures 11/19/2002 Lecture 24 1 All 3 figures are cartoons of an amino acid residue. 11/19/2002 Lecture 24 2 Peptide bonds in chains of residues 11/19/2002 Lecture 24 3 Angles φ and ψ in the

More information

Protein sequences as random fractals

Protein sequences as random fractals J. Biosci., Vol. 18, Number 2, June 1993, pp 213-220. Printed in India. Protein sequences as random fractals CHANCHAL K MITRA* and MEETA RANI School of Life Sciences, University of Hyderabad, Hyderabad

More information

Universal Similarity Measure for Comparing Protein Structures

Universal Similarity Measure for Comparing Protein Structures Marcos R. Betancourt Jeffrey Skolnick Laboratory of Computational Genomics, The Donald Danforth Plant Science Center, 893. Warson Rd., Creve Coeur, MO 63141 Universal Similarity Measure for Comparing Protein

More information

NMR parameters intensity chemical shift coupling constants 1D 1 H spectra of nucleic acids and proteins

NMR parameters intensity chemical shift coupling constants 1D 1 H spectra of nucleic acids and proteins Lecture #2 M230 NMR parameters intensity chemical shift coupling constants Juli Feigon 1D 1 H spectra of nucleic acids and proteins NMR Parameters A. Intensity (area) 1D NMR spectrum: integrated intensity

More information

Useful background reading

Useful background reading Overview of lecture * General comment on peptide bond * Discussion of backbone dihedral angles * Discussion of Ramachandran plots * Description of helix types. * Description of structures * NMR patterns

More information

NMR, X-ray Diffraction, Protein Structure, and RasMol

NMR, X-ray Diffraction, Protein Structure, and RasMol NMR, X-ray Diffraction, Protein Structure, and RasMol Introduction So far we have been mostly concerned with the proteins themselves. The techniques (NMR or X-ray diffraction) used to determine a structure

More information

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007 Molecular Modeling Prediction of Protein 3D Structure from Sequence Vimalkumar Velayudhan Jain Institute of Vocational and Advanced Studies May 21, 2007 Vimalkumar Velayudhan Molecular Modeling 1/23 Outline

More information

THE UNIVERSITY OF MANITOBA. PAPER NO: 409 LOCATION: Fr. Kennedy Gold Gym PAGE NO: 1 of 6 DEPARTMENT & COURSE NO: CHEM 4630 TIME: 3 HOURS

THE UNIVERSITY OF MANITOBA. PAPER NO: 409 LOCATION: Fr. Kennedy Gold Gym PAGE NO: 1 of 6 DEPARTMENT & COURSE NO: CHEM 4630 TIME: 3 HOURS PAPER NO: 409 LOCATION: Fr. Kennedy Gold Gym PAGE NO: 1 of 6 DEPARTMENT & COURSE NO: CHEM 4630 TIME: 3 HOURS EXAMINATION: Biochemistry of Proteins EXAMINER: J. O'Neil Section 1: You must answer all of

More information

Short peptides, too short to form any structure that is stabilized

Short peptides, too short to form any structure that is stabilized Origin of the neighboring residue effect on peptide backbone conformation Franc Avbelj and Robert L. Baldwin National Institute of Chemistry, Hajdrihova 19, Ljubljana Sl 1115, Slovenia; and Department

More information

Exam I Answer Key: Summer 2006, Semester C

Exam I Answer Key: Summer 2006, Semester C 1. Which of the following tripeptides would migrate most rapidly towards the negative electrode if electrophoresis is carried out at ph 3.0? a. gly-gly-gly b. glu-glu-asp c. lys-glu-lys d. val-asn-lys

More information

BIOINFORMATICS. Limited conformational space for early-stage protein folding simulation

BIOINFORMATICS. Limited conformational space for early-stage protein folding simulation BIOINFORMATICS Vol. 20 no. 2 2004, pages 199 205 DOI: 10.1093/bioinformatics/btg391 Limited conformational space for early-stage protein folding simulation M. Bryliński 1,3, W. Jurkowski 1,3, L. Konieczny

More information

Molecular Modeling lecture 2

Molecular Modeling lecture 2 Molecular Modeling 2018 -- lecture 2 Topics 1. Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction 4. Where do protein structures come from? X-ray crystallography

More information

The typical end scenario for those who try to predict protein

The typical end scenario for those who try to predict protein A method for evaluating the structural quality of protein models by using higher-order pairs scoring Gregory E. Sims and Sung-Hou Kim Berkeley Structural Genomics Center, Lawrence Berkeley National Laboratory,

More information

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small

More information

arxiv: v1 [physics.bio-ph] 26 Nov 2007

arxiv: v1 [physics.bio-ph] 26 Nov 2007 On the Helix-coil Transition in Alanine-based Polypeptides in Gas Phase Yanjie Wei arxiv:0711.4087v1 [physics.bio-ph] 26 Nov 2007 Department of Physics, Michigan Technological University, Houghton, MI

More information

arxiv:cond-mat/ v1 2 Feb 94

arxiv:cond-mat/ v1 2 Feb 94 cond-mat/9402010 Properties and Origins of Protein Secondary Structure Nicholas D. Socci (1), William S. Bialek (2), and José Nelson Onuchic (1) (1) Department of Physics, University of California at San

More information

Problem Set 1

Problem Set 1 2006 7.012 Problem Set 1 Due before 5 PM on FRIDAY, September 15, 2006. Turn answers in to the box outside of 68-120. PLEASE WRITE YOUR ANSWERS ON THIS PRINTOUT. 1. For each of the following parts, pick

More information

Helix Folding Simulations with Various Initial Conformations

Helix Folding Simulations with Various Initial Conformations 1 796 Biophysical Journal Volume 66 June 1994 1796-1803 Helix Folding Simulations with Various Initial Conformations Shen-Shu Sung Research Institute, The Cleveland Clinic Foundation, Cleveland Ohio 44195

More information

P. SEETHARAMULU* and Gordon M. CRIPPEN ~ College of Pharmacy, University of Michigan, Ann Arbor, MI 48109, USA

P. SEETHARAMULU* and Gordon M. CRIPPEN ~ College of Pharmacy, University of Michigan, Ann Arbor, MI 48109, USA Journal of Mathematical Chemistry 6(1991)91-110 91 A POTENTIAL FUNCTION FOR PROTEIN FOLDING P. SEETHARAMULU* and Gordon M. CRIPPEN ~ College of Pharmacy, University of Michigan, Ann Arbor, MI 48109, USA

More information

Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans

Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans From: ISMB-97 Proceedings. Copyright 1997, AAAI (www.aaai.org). All rights reserved. ANOLEA: A www Server to Assess Protein Structures Francisco Melo, Damien Devos, Eric Depiereux and Ernest Feytmans Facultés

More information

Effect of variations in peptide parameters and charges on the unperturbed dimensions of polypeptide chains

Effect of variations in peptide parameters and charges on the unperturbed dimensions of polypeptide chains Pramitaa, Vol. 4, No. 2, 1975, pp 95-103, ~ Printed in India. Effect of variations in peptide parameters and charges on the unperturbed dimensions of polypeptide chains 1. Introduction A R SRINIVASAN*

More information

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition Sequence identity Structural similarity Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Fold recognition Sommersemester 2009 Peter Güntert Structural similarity X Sequence identity Non-uniform

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Professor Department of EECS Informatics Institute University of Missouri, Columbia 2018 Protein Energy Landscape & Free Sampling http://pubs.acs.org/subscribe/archive/mdd/v03/i09/html/willis.html

More information

Calculation of protein backbone geometry from a-carbon coordinates based on peptide-group dipole alignment

Calculation of protein backbone geometry from a-carbon coordinates based on peptide-group dipole alignment Protein science (1993), 2, 1697-1714. Cambridge University Press. Printed in the USA. Copyright 0 1993 The Protein Society Calculation of protein backbone geometry from a-carbon coordinates based on peptide-group

More information

Master equation approach to finding the rate-limiting steps in biopolymer folding

Master equation approach to finding the rate-limiting steps in biopolymer folding JOURNAL OF CHEMICAL PHYSICS VOLUME 118, NUMBER 7 15 FEBRUARY 2003 Master equation approach to finding the rate-limiting steps in biopolymer folding Wenbing Zhang and Shi-Jie Chen a) Department of Physics

More information

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Programme 8.00-8.20 Last week s quiz results + Summary 8.20-9.00 Fold recognition 9.00-9.15 Break 9.15-11.20 Exercise: Modelling remote homologues 11.20-11.40 Summary & discussion 11.40-12.00 Quiz 1 Feedback

More information

Peptides And Proteins

Peptides And Proteins Kevin Burgess, May 3, 2017 1 Peptides And Proteins from chapter(s) in the recommended text A. Introduction B. omenclature And Conventions by amide bonds. on the left, right. 2 -terminal C-terminal triglycine

More information

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Department of Chemical Engineering Program of Applied and

More information

Physical reasons for the unusual -helix stabilization afforded by charged or neutral polar residues in alanine-rich peptides

Physical reasons for the unusual -helix stabilization afforded by charged or neutral polar residues in alanine-rich peptides Physical reasons for the unusual -helix stabilization afforded by charged or neutral polar residues in alanine-rich peptides Jorge A. Vila*, Daniel R. Ripoll, and H. A. Scheraga *Universidad Nacional de

More information

BIOCHEMISTRY Course Outline (Fall, 2011)

BIOCHEMISTRY Course Outline (Fall, 2011) BIOCHEMISTRY 402 - Course Outline (Fall, 2011) Number OVERVIEW OF LECTURE TOPICS: of Lectures INSTRUCTOR 1. Structural Components of Proteins G. Brayer (a) Amino Acids and the Polypeptide Chain Backbone...2

More information

BMB/Bi/Ch 173 Winter 2018

BMB/Bi/Ch 173 Winter 2018 BMB/Bi/Ch 173 Winter 2018 Homework Set 8.1 (100 Points) Assigned 2-27-18, due 3-6-18 by 10:30 a.m. TA: Rachael Kuintzle. Office hours: SFL 220, Friday 3/2 4:00-5:00pm and SFL 229, Monday 3/5 4:00-5:30pm.

More information

Structural energetics of serine protease inhibition*

Structural energetics of serine protease inhibition* Pure Appl. Chem., Vol. 71, No. 7, pp. 1207 1213, 1999. Printed in Great Britain. 1999 IUPAC Structural energetics of serine protease inhibition* Kenneth P. Murphy, Brian M. Baker, Stephen P. Edgcomb and

More information

Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials

Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials THE JOURNAL OF CHEMICAL PHYSICS 122, 024904 2005 Monte Carlo simulations of polyalanine using a reduced model and statistics-based interaction potentials Alan E. van Giessen and John E. Straub Department

More information

AMINO ACIDS ARE JOINED TOGETHER BY BONDS. FILE

AMINO ACIDS ARE JOINED TOGETHER BY BONDS. FILE 04 February, 2018 AMINO ACIDS ARE JOINED TOGETHER BY BONDS. FILE Document Filetype: PDF 401.79 KB 0 AMINO ACIDS ARE JOINED TOGETHER BY BONDS. FILE Understanding the chemistry and properties of the peptide

More information

Protein structure forces, and folding

Protein structure forces, and folding Harvard-MIT Division of Health Sciences and Technology HST.508: Quantitative Genomics, Fall 2005 Instructors: Leonid Mirny, Robert Berwick, Alvin Kho, Isaac Kohane Protein structure forces, and folding

More information

NMR Characterization of Partially Folded and Unfolded Conformational Ensembles of Proteins

NMR Characterization of Partially Folded and Unfolded Conformational Ensembles of Proteins Elisar Barbar Department of Chemistry and Biochemistry, Ohio University, Athens, OH 45701 NMR Characterization of Partially Folded and Unfolded Conformational Ensembles of Proteins Abstract: Studies of

More information

The role of secondary structure in protein structure selection

The role of secondary structure in protein structure selection Eur. Phys. J. E 32, 103 107 (2010) DOI 10.1140/epje/i2010-10591-5 Regular Article THE EUROPEAN PHYSICAL JOURNAL E The role of secondary structure in protein structure selection Yong-Yun Ji 1,a and You-Quan

More information

Building 3D models of proteins

Building 3D models of proteins Building 3D models of proteins Why make a structural model for your protein? The structure can provide clues to the function through structural similarity with other proteins With a structure it is easier

More information

Computer Modeling of Protein Folding: Conformational and Energetic Analysis of Reduced and Detailed Protein Models

Computer Modeling of Protein Folding: Conformational and Energetic Analysis of Reduced and Detailed Protein Models Cust. Ref. No. PEW 62/94 [SGML] J. Mol. Biol. (1995) 247, 995 1012 : Conformational and Energetic Analysis of Reduced and Detailed Protein Models Alessandro Monge, Elizabeth J. P. Lathrop, John R. Gunn

More information

Ж У Р Н А Л С Т Р У К Т У Р Н О Й Х И М И И Том 50, 5 Сентябрь октябрь С

Ж У Р Н А Л С Т Р У К Т У Р Н О Й Х И М И И Том 50, 5 Сентябрь октябрь С Ж У Р Н А Л С Т Р У К Т У Р Н О Й Х И М И И 2009. Том 50, 5 Сентябрь октябрь С. 873 877 UDK 539.27 STRUCTURAL STUDIES OF L-SERYL-L-HISTIDINE DIPEPTIDE BY MEANS OF MOLECULAR MODELING, DFT AND 1 H NMR SPECTROSCOPY

More information

Protein Simulations in Confined Environments

Protein Simulations in Confined Environments Critical Review Lecture Protein Simulations in Confined Environments Murat Cetinkaya 1, Jorge Sofo 2, Melik C. Demirel 1 1. 2. College of Engineering, Pennsylvania State University, University Park, 16802,

More information