Folding of Polypeptide Chains in Proteins: A Proposed Mechanism for Folding

Similar documents
ON THE DOMINANCE OF SHORT-RANGE INTERACTIONS IN POLYPEPTIDES AND PROTEINS

Packing of Secondary Structures

Physiochemical Properties of Residues

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Section Week 3. Junaid Malek, M.D.

Peptides And Proteins

Central Dogma. modifications genome transcriptome proteome

Proteins: Characteristics and Properties of Amino Acids

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

Any protein that can be labelled by both procedures must be a transmembrane protein.

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

1. What is an ångstrom unit, and why is it used to describe molecular structures?

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Exam I Answer Key: Summer 2006, Semester C

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

THE UNIVERSITY OF MANITOBA. PAPER NO: _1_ LOCATION: 173 Robert Schultz Theatre PAGE NO: 1 of 5 DEPARTMENT & COURSE NO: CHEM / MBIO 2770 TIME: 1 HOUR

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

Secondary and sidechain structures

Model Mélange. Physical Models of Peptides and Proteins

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

Course Notes: Topics in Computational. Structural Biology.

Protein Structures: Experiments and Modeling. Patrice Koehl

Computer simulations of protein folding with a small number of distance restraints

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

Protein Fragment Search Program ver Overview: Contents:

Molecular Structure Prediction by Global Optimization

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

Chemistry Chapter 22

Solutions In each case, the chirality center has the R configuration

Supersecondary Structures (structural motifs)

Protein Structure Bioinformatics Introduction

Properties of amino acids in proteins

Computational Protein Design

April, The energy functions include:

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

PROTEIN EVOLUTION AND PROTEIN FOLDING: NON-FUNCTIONAL CONSERVED RESIDUES AND THEIR PROBABLE ROLE

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover

Translation. A ribosome, mrna, and trna.

26.7 Laboratory Synthesis of Peptides

Problem Set 1

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE

Viewing and Analyzing Proteins, Ligands and their Complexes 2

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

Chapter 4: Amino Acids

Protein Structure Marianne Øksnes Dalheim, PhD candidate Biopolymers, TBT4135, Autumn 2013

7.012 Problem Set 1 Solutions

Diastereomeric resolution directed towards chirality. determination focussing on gas-phase energetics of coordinated. sodium dissociation

Supporting Information

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

CHEM J-9 June 2014

What binds to Hb in addition to O 2?

Resonance assignments in proteins. Christina Redfield

Dominant Paths in Protein Folding

Computer design of idealized -motifs

Overview. The peptide bond. Page 1

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

Unraveling the degradation of artificial amide bonds in Nylon oligomer hydrolase: From induced-fit to acylation processes

Details of Protein Structure

Biological Macromolecules

Geometrical Concept-reduction in conformational space.and his Φ-ψ Map. G. N. Ramachandran

HSQC spectra for three proteins

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Rotamers in the CHARMM19 Force Field

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

Prof. Jason Kahn Your Signature: Exams written in pencil or erasable ink will not be re-graded under any circumstances.

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry

Tamer Barakat. Razi Kittaneh. Mohammed Bio. Diala Abu-Hassan

BCH 4053 Exam I Review Spring 2017

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

The Anatomy & Taxonomy of Protein Structure

Complex flexibility of the transforming growth factor

NMR parameters intensity chemical shift coupling constants 1D 1 H spectra of nucleic acids and proteins

Nuclear Magnetic Resonance Spectrum of Deamino-Lysine-Vasopressin in Aqueous Solution and Its Structural Implications

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

FW 1 CDR 1 FW 2 CDR 2

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

STRUCTURES OF PROTEINS AND TO IDENTIFY

Background BIOCHEMISTRY LAB CHE-554. Experiment #1 Spectrophotometry

Catalytic power of enzymes

4 Proteins: Structure, Function, Folding W. H. Freeman and Company

Bioinformatics Practical for Biochemists

M.O. Dayhoff, R.M. Schwartz, and B. C, Orcutt

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Useful background reading

The Structure and Functions of Proteins

Transcription:

Proc. Nat. Acad. Sci. USA Vol. 68, No. 9, pp. 2293-2297, September 1971 Folding of Polypeptide Chains in Proteins: A Proposed Mechanism for Folding PETER N. LEWS, FRANK A. MOMANY, AND HAROLD A. SCHERAGA* Department of Chemistry, Cornell University, thaca, N.Y. 1485 Contributed by Harold A. Scheraga, July 14, 1971 ABSTRACT A mechanism is proposed for the folding of protein chains. On the basis of short-range interactions, certain aminoacid suences have a high propensity to be, say, a-helical. However, these short helical (or other ordered) regions can be stabilized only by long-range interactions arising from the proximity of two such ordered regions. These regions are brought near each other by the directing influence of certain other aminoacid suences that have a high probability of forming,8-bends or variants thereof, also on the basis of short-range interactions. An analysis is made of the tendency of various amino acids to occur in,-bends, and it is possible to predict the regions of a chain in which a (-bend will occur with a high degree of reliability. n this series of papers, we will present a specific mechanism for the folding of a polypeptide chain into the native structure of a globular protein. n this presentation, we will attempt to demonstrate that specific backbone conformations such as the right-handed a-helix (ar), the (3-structure, and the (-bend, found to varying extents in the native structures of most globular proteins, are not only essential for the structural integrity of the protein but also are remnants of structures that play a key role in the folding process. n this initial paper, we give a general description of the proposed mechanism, as well as some illustrative correlations between the aminoacid suence and native structure of a protein that provide support for this mechanism. n subsuent papers in this series, we will discuss the energetics of the folding process. PROPOSED MECHANSM The protein molecule, under sufficiently denaturing conditions (or even, perhaps, directly after synthesis), behaves essentially as a random coil. Since the number of states accessible to the polypeptide chain in the random-coil condition is immense, it is reasonable to assume that (a) the folding of the chain into its most stable (native) conformation is not the result of a random event, and (b) a specific pathway exists for the folding process. t was previously suggested (1) that one of the initial steps (which might be considered a nucleation step) during the folding process is the fortuitous meeting of two distant sections of the protein chain to form a stabilized pair of a- helices (or, for that matter, any other ordered structure), around which the rest of the polypeptide chain could fold. This idea developed from the demonstration (1) that, for most proteins, those portions of the chain that have a high helical probability in the denatured condition are found to be in the ar conformation in the native structure. Further, it was shown (2) that, for the cytochrome c proteins of 27 species, the regions of high helical probability were, for the most * To whom ruests for reprints should be addressed. 2293 part, conserved from species to species; this result is consistent not only with the proposed invariance of the native conformation of these proteins (3), but also with our proposal that these regions of high helical probability aid in directing the folding to the native structure. While the above conclusion about one of the initial steps of the folding process seems warranted, it does not seem reasonable for the distant a-helical (or other ordered) conformations to rely on a random encounter to achieve a mutual stabilization (by means of long-range interactions) of the specific structures that have a propensity to be a-helical (because of short-range interactions) (1). nstead, it appears much more likely that two such distant helix-tending regions of the polypeptide chain are directed toward each other. The assumption of such a directing influence naturally introduces the proposition that certain regions of the chain function as "directing" sections. The role of these "directing" sections would be to provide the proper mutual orientation of distant (or near) ordered segments of the polypeptide chain so that the latter could interact with each other and, at the same time, serve as a substrate for interaction with still other chain segments. As an example, a (-bend or (-turn (defined in the next section) would "direct" the formation of an antiparallel (-structure which, in turn, might provide a surface for interaction (e.g., by means of hydrophobic bonds) with, and stabilization of, an ac helix. Our proposed mechanism for protein folding can be described as follows. A certain "directing" section promotes (in the manner described above) the formation of some small Jr FG. 1. Type (-bend (), with any i-residue at positions (i + 1) and (i + 2), and Type (3-bend (), with only glycine (5) being possible at position (i + 2). Adapted from Fig. 7 of ref. 3.

2294 Chemistry: Lewis et al. Proc. Nat. Acad. Sci. USA 68 (1971) 'o C.3 Hen Egg-White Lysozyme U a@.2 c a, n._o 2 a.. 2 4 6 8 l 12 Residue Number FG. 2. Probability that a tetrapeptide bend begins at site j of the hen egg-white lysozyme chain. The horizontal line is an arbitrary cut-off probability. The horizontal bars indicate the positions of the observed bends (D. C. Phillips, personal communication), starting at j = i. initial structure (e.g., two interacting ar helical or other ordered structures). This ordered collection of aminoacid residues (backbone and side-chain groups) then serves as a substrate to direct other sections of the chain to either stabilize still other ordered structures (depending on the propensity of these additional segments to form such ordered structures) or simply to wrap around the original substratet. f this mechanism is valid, then many of the distinctive features of the native structure (i.e., ar helix or (3-structure) would have been intimately involved in the folding pathway. n the next section, the U-bend is shown to satisfy the conditions ruired for a "directing" section. t will be demonstrated that an analysis of the fruency of occurrence of particular aminoacid residues at the various loci of these bends in a sample of three proteins of known structure (namely, lysozyme, ribonuclease S, and a-chymotrypsin) provides sufficient information to enable us to predict, with a reliability of around 8%, the positions of bends that occur in the native conformations of other proteins. THE,8-BEND n the previous section, it was argued that certain segments of the polypeptide chain are responsible for bringing distant portions of the chain into close proximity during the folding process. The simplest examples of such segments are the so-called (3-bends, involving residues i to (i + 3), as shown in Fig. 1. Venkatachalam (5) has considered the steric constraints in these bends for the formation of a hydrogen bond between the CO group of residue i and the NH group of residue (i + 3); he showed that glycine must occur at position (i + 2) for the bend in Fig. 1 (which is designated as a type- bend) to involve such a hydrogen bond. However, for L-residues, there is no such restriction in the other bend shown in Fig. 1 (which is designated as a type- bend). Both types of t t should be pointed out here that the notion that some initial structure (substrate) participates in the folding of a protein is not new and has been proposed by others (see, for example, ref. 4), although the emphasis placed in this paper on a "directing" section, such as a 8-bend, has not to our knowledge been envisaged as an initial structure. bends fulfill the ruirements for being "directing" sections very nicely by (a) providing a possible 18 reversal of the chain direction, (b) having a high probability of occurrence, compared to a larger loop structure ruiring long-range interactions, because the formation of the bend [and the i to (i + 3) hydrogen bond] depends on only two pairs of dihedral angles (short-range interactions), and (c) involving only a small number (namely, four) of residues, thereby providing much conformational information in a small region. Actually, the native structures of many proteins contain an abundance of (3-bends, or (3-like bends [distorted (3-bends, which are similar to (-bends but lack the i to (i + 3) hydrogen bond]. The distorted (-bend might arise by relaxation to satisfy the newly created interactions, once it has fulfilled its directing function during folding. n light of our earlier discussion of the possible importance of these bends for protein folding, it is of interest to consider the distribution of aminoacid residues at positions i, (i + 1), (i + 2), and (i + 3) (see Fig. 1) in the actual bends found in the native conformations of some proteins. Toward this end, the coordinates of hen egg-white lysozymet, bovine ribonuclease S, (6) and the B and C chains of bovine a-chymotrypsin (7) were analyzed for bends. A bend was considered to exist if (a) the calculated Ca(i) to Ca(i + 3) distance was less than 7 i (.7 nm) and (b) the (i + 1) or (i + 2) residue was not in an ar helix. The bends determined by the above criteria are given in Table 11. D. C. Phillips, personal communication. Criteria (a) and (b) in some cases are not sufficient to define the exact location of a bend. Molecular models of bovine ribonuclease S and bovine cz-chymotrypsin, built in this laboratory, provided additional information concerning the existence and location of bends in these two proteins. The C-terminal (16-129) section of hen egg-white lysozyme is particularly difficult to analyze for bends by criteria (a) and (b), because of the presence of some helix in that part of the protein chain; therefore, for this region, no bends are given in Table 1, although it may be that some exist. t should be emphasized that the list of bends given in Table 1 may well be subject to revision and is presented here only to illustrate the possible role of "directing" sections. Further work is being performed, in this laboratory, to better characterize these bends.

Proc. Nat. Acad. Sci. USA 68 (1971) Folding of Polypeptide Chains in Proteins 2295 TABLE 1. a-bends and variants found in the native structures of hen egg-white a-chymotrypsin (B and C chains) lysozyme, bovine ribonuclease S, and bovine Hen egg-white lysozyme Bovine ribonuclease S Bovine a-chymotrypsin Number Suence Number Suence Number Suencc 2-23 Tyr-Arg-Gly-Tyr 16-19 Ser-Thr-Ser-Ala 23-26 Val-Pro-Gly-Ser 36-39 Ser-Asn-Phe-Asn 36-39 Thr-Lys-Asp-Arg 27-3 Trp-Pro-Trp-Gln 39-42 Asn-Thr-Gln-Ala 65-68 Cys-Lys-Asn-Gy 35-38 Asp-Lys-Thr-Gly 47-5* Thr-Asp-Gly-Ser 75-78 Ser-Tyr-Ser-Thr 48-51 Asn-Glu-Asn-Trp 54-57 Gly-le-Leu-Gln 87-9 Thr-Gly-Ser-Ser 56-59 Ala-His-Cys-Gly 6-63 Ser-Arg-Trp-Trp 91-94 Lys-Tyr-Pro-Asn 61-64 Thr-Thr-Ser-Asp 66-69 Asp-Gly-Arg-Thr 112-115 Gly-Asn-Pro-Tyr 72-75 Asp-Gln-Gly-Ser 69-72 Thr-Pro-Gly-Ser 91-94 Asn-Ser-Lys-Tyr 74-77 Asn-Leu-Cys-Asn 96-99 Ser-Leu-Thr-le 85-88 Ser-Ser-Asp-le 99-12 le-asn-asn-asp 1-13* Ser-Asp-Gly-Asp 18-111 Leu-Ser-Thr-Ala 13-16 Asp-Gly-Met-Asn 115-118 Ser-Gln-Thr-Val 125-128 Ser-Ala-Ser-Asp 131-134 Ala-Ala-Gly-Thr 152-155 Pro-Asp-Arg-Leu 172-175 Trp-Gly-Thr-Lys 177-18 Lys-Asp-Ala-Met 185-188 Ala-Ser-Gly-Val 191-194 Cys-Met-Gly-Asp 194-197 Asp-Ser-Gly-Gly 23-26 Lys-Asn-Gly-Ala 217-22 Ser-Ser-Thr-Cys 221-224 Ser-Thr-Ser-Thr * The Ca(i) to C"(i + 3) distances for these two bends exceeded 7 A by.1 A and.4 i for residues 47-5 and 1-13, respectively. Nevertheless, these bends were counted because, in both cases, each was the region of a significant chain reversal. See ref. 4 for stereo drawings of hen egg-white lysozyme. TABLE 2. Fruency of occurrence of amino acid residues in 3-bends and variants found in hen egg-white lysozyme, bovine ribonuclease S, and bovine a-chymotrypsin (B and C chains) Amino Total i- (i + 3) i (i + 3) total acid occurrence* i i + 1 i + 2 i + 3 (totalt) (total occurrence) Ala 45 3 2 1 4 1.22 Asp 22 5 4 2 5 16.73 Cys 25 2 2 1 5.2 Glu 12 1 1.8 Phe 12 1 1.8 Gly 36 2 4 11 4 21.58 His 7 1 1.14 le 18 1 1 2 4.22 Lys 3 3 3 1 1 8.27 Leu 27 1 2 1 1 5.19 Met 8 1 1 1 3.38 Asn 36 4 4 3 4 15.42 Pro 13 1 4 1 6.46 Gln 19 2 1 2 5.26 Arg 18 2 2 1 5.28 Ser 51 11 6 6 5 28.55 Thr 39 5 4 6 4 19.49 Val 36 1 2 3.8 Trp 14 2 2 2 6.43 Tyr 13 1 2 1 2 6.46 * The numbers in this column represent the total occurrence of each residue in the three-protein sample. t The i - (i + 3) totals for Asn, Thr, and le are each larger by 1 than their actual occurrence, because, for example, Thr simultaneously occupies positions i and (i + 3) in bends 69-72 and 66-69, respectively, in lysozyme (see Table 1). Similarly, the total for Asp is larger by 2 than its actual occurrence,

2296 Chemistry: Lewis et al. Proc. Nat. Acad. Sci. USA 68 (1971) * Go SO *4: *O 4 9 Go "'d *E 2 D..1 a 4) co -qcq 3:4 co to 3 q O t- Ob&; at) CSN -4,-t.. c - e a Cq Ce,W " a _4 to. p). to eoo t- e.q 4,4-. - " co c 4 '" _ - Ox t.. t- " 8 8 8 V T e es Ut es CD3 N _ U13 N co es C- N 4 No C o CD en "s v 4 e- t ooe ro F q. e -o~~ e O e- - o CD so -rc.. - e _ 3 _ 4 - N 44 N.4o P-t.g S 1t.. ;s e a e to 1V _ 4 C_ e q e4qcc CD Cq e- es - cq ee e N e e 4t 4a -t S lb e he N N] go C)o (ND e co X4 O - *- 6. Cs 6.. o._ o- 4 a; Ea * 6 9.g E1 8 *.5o be The distribution of amino acid residues located at positions i, (i + 1), (i + 2), and (i + 3) in the bends given in Table 1 is shown in Table 2, together with the overall number of each aminoacid residue in the three-protein sample. t is interesting to note that, in 11 of the 42 bends shown in Table 1, glycine is located at position (i + 2). Presumably, most of these 11 bends are of type. For the bends given in Table 1, no distinction is made between types and bends. The data of Table 2 enable us to evaluate an a priori probability that a certain type of residue is located at the jth site in a,8-bend (where j = i, i + 1, i + 2, or i + 3), irrespective of its neighbors, by simply dividing the number of times the residue in question occurred in the jth site by the overall fruency of occurrence of that residue in the total protein sample; e.g., the a priori probability for Ala in site (i + 3) is 4/45. From the foregoing, and assuming that the residues are independent of each other, it follows that the probability of occurrence of a U-bend is simply the product of the four individual a priori probabilities for each amino acid residue type in each site j. The validity of the assumption that the residues in the bends behave independently (i.e., that sidechain to backbone interactions dominate in bend stabilization) is based on the observation (see below) that tetrapeptide bends of highest probability (calculated in thismanner) correlate very well with the appearance of these bends in many native proteins. For illustrative purposes, the probability of occurrence of a U-bend starting at chain site j (computed by the procedure described above) is plotted in Figs. 2 and 3 for hen egg-white lysozyme and horse ferricytochrome c, respectively. There is a good correlation between the positions of those peaks lying above an arbitrary cut-off probability line (determined by observation from Fig. 2) at 1-4 and the observed positions of the,8-turns in Figs. 2 and 3. This same procedure and criterion were applied to several other proteins, and the results for the start (the ith position) of the tetrapeptide (3-bend are shownw in Table 3. f we allow an error of t1 residue in the location of a bend [there being approximately N/3 (i.e. 1-4, 4-7, 7-1,...) tetrapeptides that will include all possible bends], then the overall per cent of bends predicted correctly (not including the original set of three proteins, on which the a priori probabilities were based) is 8%. This high degree of predictability suggests that the role assigned here to residues in (-nbends may be correct. The data of Table 2 indicate that no particular aminoacid residue is associated exclusively with bends. Since most of the observed bends appear to be composed mainly of polar residues (see the high values for the a priori probability of occurrence in a #-bend of polar residues such as Asp, Asn, Ser, Thr, and Tyr in the last column of Table 2), it is not surprising that nearly all the bends are located at the surface of the native globular structures, presumably to solvate the polar side chains. f these bends are as important to the native conformation as suggested here, then mutations that lead to changes in the residues in the bend regions should provide further informa- s L. q qe o) )V 6~~~ C.)U ~~~~~A C ii axn too hxe go ' ~~.~~~~~ -~ c4 U) V. = v: cc o) 4 4S The observed positions of the bends listed in Table 3 for the proteins other than the original set of three were taken from the stereo drawings in refs. 4 and 8. Since the coordinates of these proteins are not now available to us, the positions listed for the bends are tentative, and may have to be revised when the coordinates become available.

Proc. Nat. Acad. Sci. USA 68 (1971) Folding of Polypeptide Chains in Proteins 2297 Residue Number FG. 3. Probability that a tetrapeptide bend begins at site j of the horse ferricytachrome c chain. The horizontal line is an arbitrary cut-off probability. The horizontal bars indicate the positions of the observed bends (3), starting at j = i. A bend (not shown here) occurs (3) between residues 42-46; this is not a #-bend (since it contains more than four residues) even though it results in a reversal of the chain. tion as to the "directing" role of the bends in folding. This question is currently under investigation. t should be pointed out that the a-bends discussed in this paper are thought to exist in specific suences of amino acids. However, other #-bends (not considered here as necessarily likely to occur in globular proteins) can exist in any suence of amino acids if the chain is short and constrained to form a ring, as in gramicidin S and oxytocin. CONCLUSONS t is proposed that certain sections of a protein chain must play a role in bringing distant parts of the chain together to enable long-range interactions to stabilize those structures (i.e., ar helix, #-structure, etc.) that have a propensity to form because of short-range interactions. The j-bend and its variants were shown to fulfill the ruirements of a "directing" section. Further, it was shown that, to a good approximation, the distributions of aminoacid types in these bends are independent of each other; hence, the locations of a high percentage of the bends in proteins not included in the initial set of three can be predicted. Since these bends in proteins are very localized, it would be interesting to determine (e.g., by NMR measurements) whether they occur in smaller structures, i.e., in isolated noncyclic tetra- or larger oligopeptides (with appropriate end groups). n subsuent papers, we will consider the energetics of the 3-bends and their variants. P. N. L. was a National Research Council of Canada Postgraduate Fellow, 1971-1972. F. A. M. was a Special Fellow of the National nstitute of General Medical Sciences, National nstitutes of Health, 1968-1969. This work was supported by research grants from the National Science Foundation (GB- 28469X and GB-17388), from the National nstitute of General Medical Sciences of the National nstitutes of Health, U.S. Public Health Service (GM-14312), from the Eli Lilly, Hoffmann- LaRoche, and Smith Kline and French Grants Committees, and from Walter and George Todd. 1. Lewis, P. N., N. G6, M. G6, D. Kotelchuck, and H. A. Scheraga, Proc. Nat. Acad. Sci., USA, 65, 81 (197). 2. Lewis, P. N., and H. A. Scheraga, Arch. Biochem. Biophys., 144, 576 (1971). 3. Dickerson, R. E., T. Takano, D. Eisenberg,. Kallai, L. Samson, A. Cooper, and E. Margoliash, J. Biol. Chem., 246, 1511 (1971). 4. Dickerson, R. E., and. Geis, The Structure and Action of Proteins (Harper and Row, New York, 1969), p. 73. 5. Venkatachalam, C. M., Biopolymers, 6, 1425 (1968). 6. Wyckoff, H. W., D. Tsernoglou, A. W. Hanson, J. R. Knox, B. Lee, and F. M. Richards, J. Biol. Chem., 245, 35 (197). 7. Birktoft, J. J., B. W. Matthews, and D. M. Blow, Biochem. Biophys. Res. Commun., 36, 131 (1969). 8. Arnone, A., C. J. Bier, F. A. Cotton, V. W. Day, E. E. Hazen, Jr., D. C. Richardson, J. S. Richardson, and, in part, A. Yonath, J. Biol. Chem., 246, 232 (1971).