Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Size: px
Start display at page:

Download "Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution"

Transcription

1 Supplemental Materials for Structural Diversity of Protein Segments Follows a Power-law Distribution Yoshito SAWADA and Shinya HONDA* National Institute of Advanced Industrial Science and Technology (AIST), Central 6, Tsukuba , Japan *Correspondence: s.honda@aist.go.jp. Contents SUPPORTING DESCRIPTIONS Summary of Classified Structural Motifs Methodological Advantages Reproducibility of Results in the Single-Pass Clustering Method Validity of Overlapping Segment Sampling Objective Function in Fitting Calculations Goodness of Fit Evaluated by Parametric Bootstrapping Analysis Assessment of the Structural Dissimilarity Threshold, D th REFERENCE TABLE S1 Sensitivity of the single-pass clustering method to the order of sampling TABLE S2 Amino acid compositions of the sets of 9-residue segments with and without overlap TABLE S3 Confidence intervals of fit coefficients and goodness-of-fit statistics determined by parametric bootstrapping analysis (1)

2 FIGURE S1 Structural and sequential summary of clusters FIGURE S2 Flow chart of the single-pass clustering method used in the present study FIGURE S3 Reproducibility of results in the single-pass clustering method FIGURE S4 Sensitivity of the parameters to the order of sampling FIGURE S5 Suitability of objective functions to minimize in fitting calculations FIGURE S6 Histograms showing the structural dissimilarity, D, of 9-residue segments FIGURE S7 Structural differences of 9-residue segments from the center of the cluster Summary of Classified Structural Motifs An in-depth analysis of each cluster is beyond the scope of this paper. Here we show several clusters that were obtained under a typical condition (L=9, D th =30º, Culled PDB), in order to illustrate that our clustering method succeeded in the extraction of distinct structural motifs, including known canonical ones. Structural and sequential summary of these clusters are listed in Fig. S1 in Supplemental Materials. Methodological Advantages To compare protein structures, algorithms based on the Cartesian coordinates of Cα atoms are the most common in use (1-6). While these algorithms are effective in comparing the topology of whole proteins, they have a disadvantage in calculation time because they require some steps for the transformation of the system of coordinates to obtain a best superimposition between two proteins. Hence, in order to classify an enormous number of short segments effectively, we developed new algorithm defining the structural dissimilarity based on dihedral angles (φ, ψ and ω), which contain enough information to reproduce the backbone structure of proteins. In contrast to the methods by de Brevern et al. (7), our definition contains all three dihedral angles (see Methods). Although one (2)

3 may think the analysis including ω-angle causes confusion, we confirmed that the local structures containing a cis peptide bond were completely discriminated from the clusters of the segments having an all trans configuration (data not shown). Thus, this algorithm does not need the preprocessing to eliminate the structures containing cis peptide bonds. The single-pass clustering method (8), used in the present study, does not require us to presume a parameter for the total number of clusters before clustering (Fig. S2 in Supplemental Materials). Time-consuming iterative calculations are also unnecessary in the single-pass method. Therefore, the method is applicable to the problem in which the number of clusters is unknown, and it can process large-scale calculations at higher speed than other non-hierarchical clustering methods such as k-means and self-organizing map (SOM). The single-pass method also has an intrinsic advantage in the classification of samples that have a quite unbalanced distribution (In fact, there is more than a thousand-fold difference in frequency in Fig. 1a), while the k-means and SOM are rather preferred to the strict analysis of uniformity distributed samples. Reproducibility of Results in the Single-Pass Clustering Method A disadvantage of the single-pass method is that clustering results might be altered by the order of sampling. Before proceeding with in-depth analyses, we therefore checked the reproducibility of the calculations. Fig. S3a in Supplemental Materials shows ten distribution curves of 9-residue segments which were independently obtained when the order of sampling was changed randomly. Their almost identical shapes indicate that the structural distribution of protein segments is so robust that the results in the single-pass clustering method are not influenced significantly by the order of sampling. We also evaluated the statistical deviations of the various parameters introduced in the present study by analysis of independent 100 (or 1000) sets of results for both 9-residue and 21- residue segments that were obtained by changing the order of sampling randomly (Fig. S4 and Table S1 in Supplemental Materials). The resultant standard deviations of all parameters are considerably small. This implies that the conclusion concerning the structural diversity of protein segments presented in the present study is never affected by the order of sampling. For instance, the standard deviations in log(n est )/L and S est /L are less than 1% of the averaged parameters. Consequently, if one (3)

4 draws the standard deviations in Fig. 3 and Fig. 4, the size of error bars will become smaller than the size of circles of the data points. Furthermore, we checked the adequateness of the single-pass method by comparing with other iterative clustering methods. As shown in Fig. S3b in Supplemental Materials, the difference between the result of the single-pass clustering and the result of an iterative calculation (100 times) based on k-means algorithm using the former result as an initial condition was not significant. Thus, we considered an iterative and time-consuming calculation is not necessary for carrying out our purpose. Validity of Overlapping Segment Sampling To clear out a concern that the analyses using a set of overlapping segments might give some serious biases in the statistical results, we checked amino acid compositions of the sets of segments with and without the presence of overlap. Table S2 in Supplemental Materials shows the compositions calculated from the nine sets of 9-residue segments, where each set has no overlap. Comparing to the composition of the set of overlapping segments, i.e. the set of all 9-residue segments, there is no significant deviation among the compositions. This indicates the validity of the statistical analyses using a set of overlapping segments performed in the present study. Objective Function in Fitting Calculations To begin with the fitting calculations of the modified Mandelbrot formula (Eq.2) to the empirical distributions, we tried several equations as an objective function to minimize, because an ordinary equation, i.e. the sum of squared errors, seems to be inappropriate to obtain a good fitting result in a double-logarithmic scale plot. In fact, large upper shifts were found in low-ranked clusters when Eq.s1 was used (Fig S5a in Supplemental Materials). In case of Eq.s2, high-ranked clusters were underestimated. Among several equations we found Eq.s3 and Eq.s4 are available. Then, we further examined the goodness in fitting calculations using Kolmogorov-Smirnov (KS) parameter. As judged by the KS values, the fitting calculation using Eq.s4 appears to show good performance in many cases (Fig. S5b in Supplemental Materials). Accordingly, we chose Eq.s4 as an objective function to minimize and used it in further analyses. The equation can be interpreted on the (4)

5 assumption that the expected error in f(r) should be proportional to the square root of f(r). g 1 Ncls β [ cls ] 1 2 β (s1) N ( a, b, ) = f a( r + b) cls r = 2 Ncls β [ cls { }] 1 g2 log N g g 3 4 ( a, b, ) = log( f ) a( r + b) ( a, b, ) ( a b, ) β (s2) cls r = 2 cls r = 2 β [ log( f ) log{ a( r + b) }] Ncls 1 cls β = (s3) N r β [ fcls a( r + b) ] a( r + b) Ncls, = β Ncls r = β (s4) 2 2 Goodness of Fit Evaluated by Parametric Bootstrapping Analysis To test the goodness of fit and provide the confidence intervals of fit coefficients in fitting calculations of the modified Mandelbrot formula (Eq.2) to the empirical distributions, a parametric bootstrapping analysis was performed. Assuming the expected error ε is proportional to the square root of f(r) as described above, ε = β [ fcls a( r + b) ] β a( r + b) the model to be evaluate is designated as 2 = f cls a a( r + b) ( r + b) β β f cls ( r > 1) = a β β ( r + b) + a( r + b) ε (s5) Here, the error is assumed to represent a normal distribution; ε ~ N(0, σ 2 ε ). The variance σ 2 ε was determined by a maximum likelihood method using the residuals of f cls (r) from the best fitted curve. We generated sets of artificial data that follow Eq.s5 by setting the fit coefficients obtained from the original data. By repeating the same fitting calculations against the artificial data, we obtained bootstrap fit coefficients (a*, b*, and β*) as well as two kinds of goodness-of-fit statistics (χ 2 * and G 2 *) χ β ( ) 2 ( X E) fcls( r a( r + b) = M E a( r + b) 2 ) = β 2 (s6) (5)

6 G 2 X X ln E 2M f ( r)ln a ( r) 2 cls = = cls β f ( r + b) Confidence intervals of bootstrap parameters were determined using a percentile method. The results are summarized in TABLE S3 in Supplemental Materials. In many cases, the fit coefficients and the goodness-of-fit statistics calculated from the original data reside within the range of 99% confidence intervals of bootstrap parameters. Especially, it is appreciable in relatively long segments compared to relatively short segments. This tendency is consistent with the results in Fig. 2 where the model appears to be well fitted to the empirical data of relatively long segments. Through the analysis, the null hypothesis that the empirical data can be represented by the modified Mandelbrot formula is not rejected especially as for relatively long segments. (s7) Assessment of the Structural Dissimilarity Threshold, D th In the one-path clustering method used in the present study, the distribution of the local structures in principle depends on one parameter, a structural dissimilarity threshold, D th, the value of which was assigned arbitrarily before clustering (see Methods and Fig. S2 in Supplemental Materials). The effect of the value of D th on the shape of the distribution functions has been already described in Result. Here, we show some characteristics of the structural dissimilarity D to help the readers to catch the technical meaning of this threshold. Fig. S6 in Supplemental Materials shows histograms of 9-regidue segments, in which the x-axis indicates the value of D of these segments against certain typical secondary structures. Since the unit of D corresponds to an angle, the minimum and the maximum values of D are 0 and 180º, respectively, in principle. However, the actual data show most segments distribute within the range from 0 to 110º. In these three histograms, the shapes in the right half area, corresponding to the distributions of relatively dissimilar segments, are similar to each other. In contrast, the distributions in a left half area are different in their shape, which may indicate the individuality of the clusters composed of relatively similar segments. Our condition in D th is 20, 30, or 40º, which seems reasonable to discriminate relatively similar segments from relatively dissimilar segments. Fig. S7 in Supplemental Materials illustrates the structural difference of 9-residue segments from the center of the cluster which the segments were classified to. The x- and y-axes indicate the differences in D and in backbone RMS deviation (bbrmsd), (6)

7 respectively. Although the data are widely dispersed, a rough correlation between D and bbrmsd is identified. In case of segments having similar structures, smaller distance in angles indicates smaller distance in coordinates. From this correlation, we can say that 10, 20, and 30º in D th correspond to approximately 0.5, 1.0, and 2.0 Å in bbrmsd. REFERENCE 1. Holm, L. and C. Sander Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233: Madej, T., J. F. Gibrat, and S. H. Bryant Threading a database of protein cores. Proteins 23: Shindyalov, I. N. and P. E. Bourne Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11: Rooman, M. J., J. Rodriguez, and S. J. Wodak Automatic definition of recurrent local structure motifs in proteins. J. Mol. Biol. 213: Fetrow, J. S., M. J. Palumbo, and G. Berg Patterns, structures, and amino acid frequencies in structural building blocks, a protein secondary structure classification scheme. Proteins 27: Hunter, C. G. and S. Subramaniam Protein fragment clustering and canonical local shapes. Proteins 50: de Brevern, A. G., C. Etchebest, and S. Hazout Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Proteins 41: Richards, J. A. and X. Jia Remote sensing digital image analysis. New York: Springer- Verlag. (7)

8 TABLE S1 Sensitivity of the single-pass clustering method to the order of sampling. Averaged values and standard deviations of various parameters obtained from 100 or 1000 times clustering calculations in different order of sampling are listed. L f cls (r=1) N N cls a b β N est log(n est )/ L S est /L remark PDB Select ±0.002 ±49 ±27 ±0.033 ±2.6 ±0.02 ± ±0.004 ±0.004 A ± ±70 ±32 ±0.002 ±1.3 ±0.01 ± ±0.003 ±0.004 A Culled PDB ±0.007 ±29 ±22 ±0.022 ±1.4 ±0.01 ± ±0.001 ±0.004 A ±0.006 ±29 ±22 ±0.026 ±1.6 ±0.01 ± ±0.002 ±0.004 B ± ±32 ±25 ±0.002 ±1.0 ±0.01 ± ±0.002 ±0.002 A A Clustering conditions: D th =30º, number of runs=100. B Clustering conditions: D th =30º, number of runs=1000. Refer to Methods for the meaning of symbols. (8)

9 TABLE S2 Amino acid compositions of the sets of 9-residue segments with and without overlap without overlap with set of segment overla p no. of segments ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL The value with an underline corresponds to either minimum or maximum number among the sets of segments having no overlap. These data were calculated from the Culled PDB. (9)

10 TABLE S3 Confidence intervals of fit coefficients and goodness-of-fit statistics determined by parametric bootstrapping analysis. L a a* b b* [ , ] [ 15.41, ] [ , ] [ 9.76, ] [ , ] 6.53 [ 5.90, 6.55 ] [ , ] 6.58 [ 6.16, 6.66 ] [ , ] 4.79 [ 4.51, 4.88 ] [ , ] 7.82 [ 7.47, 7.96 ] [ , ] [ 11.80, ] [ , ] [ 12.90, ] [ , ] [ 10.97, ] L β β* N est N est * [ 1.099, ] 5683 [ 4100, 4783 ] [ 1.029, ] [ 9764, ] [ 0.948, ] [ 21214, ] [ 0.905, ] [ 41528, ] [ 0.861, ] [ 69715, ] [ 0.871, ] [ , ] [ 0.886, ] [ , ] [ 0.879, ] [ , ] [ 0.854, ] [ , ] L S est S est * Z/α (Z/α) * [ 9.17, 9.28 ] [ 3.282, ] [ 10.14, ] [ 2.775, ] [ 11.29, ] [ 2.474, ] [ 12.42, ] [ 2.266, ] [ 13.37, ] [ 2.103, ] [ 14.39, ] [ 2.016, ] [ 15.40, ] [ 1.958, ] [ 16.26, ] [ 1.892, ] [ 20.49, ] [ 1.708, ] L χ 2 χ 2 * G 2 G 2 * [ 7782, 8850 ] -154 [ 718, 1279 ] [ 6379, 7138 ] 32 [ 476, 977 ] [ 3200, 3555 ] 148 [ 21, 382 ] [ 1524, 1683 ] 116 [ -96, 191 ] [ 961, 1061 ] -49 [ -95, 151 ] [ 636, 704 ] -92 [ -90, 112 ] [ 598, 666 ] -42 [ -75, 102 ] [ 331, 370 ] -62 [ -61, 71 ] [ 89, 105 ] 1 [ -25, 28 ] Values in bracket indicate the 99% confidence intervals of bootstrap parameters obtained from the parametric bootstrapping analysis (10000 times) with a percentile method. Clustering conditions: D th =30º, PDB Select. Refer to Methods for the meaning of symbols. (10)

11 FIGURE S1 Structural and sequential summary of clusters. Only representative clusters are shown. Rank:1 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 1 The number of assigned segments Normalized frequency RMS deviation (Å) 0.36 Kullback Leibler entropy (bit) 0.09 The number of hydorgen bonds 7.7 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (11)

12 FIGURE S1 (continued) Rank:2 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 2 The number of assigned segments 1577 Normalized frequency RMS deviation (Å) 0.53 Kullback Leibler entropy (bit) 0.23 The number of hydorgen bonds 6.6 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (12)

13 FIGURE S1 (continued) Rank:3 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 3 The number of assigned segments 1030 Normalized frequency RMS deviation (Å) 1.48 Kullback Leibler entropy (bit) 0.10 The number of hydorgen bonds 6.0 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (13)

14 FIGURE S1 (continued) Rank:4 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 4 The number of assigned segments 972 Normalized frequency RMS deviation (Å) 0.68 Kullback Leibler entropy (bit) 0.27 The number of hydorgen bonds 6.0 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (14)

15 FIGURE S1 (continued) Rank:5 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 5 The number of assigned segments 871 Normalized frequency RMS deviation (Å) 0.42 Kullback Leibler entropy (bit) 0.43 The number of hydorgen bonds 6.8 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (15)

16 FIGURE S1 (continued) Rank:6 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 6 The number of assigned segments 813 Normalized frequency RMS deviation (Å) 0.62 Kullback Leibler entropy (bit) 0.14 The number of hydorgen bonds 6.3 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (16)

17 FIGURE S1 (continued) Rank:7 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 7 The number of assigned segments 685 Normalized frequency RMS deviation (Å) 0.58 Kullback Leibler entropy (bit) 0.47 The number of hydorgen bonds 6.3 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (17)

18 FIGURE S1 (continued) Rank:9 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 9 The number of assigned segments 487 Normalized frequency RMS deviation (Å) 0.78 Kullback Leibler entropy (bit) 0.50 The number of hydorgen bonds 5.8 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (18)

19 FIGURE S1 (continued) Rank:21 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 21 The number of assigned segments 295 Normalized frequency RMS deviation (Å) 0.92 Kullback Leibler entropy (bit) 0.58 The number of hydorgen bonds 5.4 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (19)

20 FIGURE S1 (continued) Rank:23 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 23 The number of assigned segments 288 Normalized frequency RMS deviation (Å) 0.99 Kullback Leibler entropy (bit) 0.56 The number of hydorgen bonds 5.2 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (20)

21 FIGURE S1 (continued) Rank:79 Clustering conditions: L=9, Dth=30, culled PDB Cluster ranking 79 The number of assigned segments 122 Normalized frequency RMS deviation (Å) 0.89 Kullback Leibler entropy (bit) 0.69 The number of hydorgen bonds 5.2 The centroid of the cluster Pos. Phi Psi Omega The Position Specific Scoring Matrix < N terminal C terminal > ALA ARG ASN ASP CYS GLN GLU GLY HIS ILE LEU LYS MET PHE PRO SER THR TRP TYR VAL (21)

22 FIGURE S2 Flow chart of the single-pass clustering method used in the present study START Initializing: All segments are unassigned to any cluster. Unique SID is given to each segment. N <= 0 Does an unassigned segment exist? Yes An unassigned segment is chosen randomly. j<= SID of the chosen segment d(j) <= dihedral angle vector of the segment j No Ranking Clusters: Clusters are ranked in the decreasing order of MCID. EXIT N > 0? No Yes Finding the nearest cluster: Finding a cluster whose center is the closest to the segment j. k <= CID of the nearest cluster Dmin <= the dissimilarity between c(k) and d(j) Dmin < Dth? No Yes Updating cluster: Mk <= Mk + 1 c(k) <= {(Mk-1) c(k) + d(j)}/mk The segment j is assigned to the nearest cluster k. Creating new cluster: N <= N + 1 CID of new cluster is numbered with N. k <= CID of the new cluster Mk <= 1 c(k) <= d(j) The segment j is assigned to the new cluster k. SID CID N M CID j k c(cid) d(sid) D th D min : numerical identification of each segment : numerical identification of each cluster : total number of clusters : number of segments in a cluster : SID of the chosen segment : CID of the nearest or created cluster : averaged dihedral angle vector of segments in a cluster (the cluster centroid) : dihedral angle vector of a segment : threshold parameter for creating a new cluster : dissimilarity of a segment from the nearest cluster (22)

23 FIGURE S3 Reproducibility of results in the single-pass clustering method. a, Ten distribution curves which were independently obtained when the order of sampling was changed randomly. Clustering conditions: L=9, D th =30º, Culled PDB. b, Comparison of the single-pass clustering method with an iterative clustering method. The iterative calculation based on k-means algorithm was carried out 100 times using the result of the single-pass clustering method as an initial condition. Clustering conditions: L=9, D th =30º, Culled PDB. a fcls b r fcls r (23)

24 FIGURE S4 Sensitivity of the parameters to the order of sampling. Each histogram corresponds to the parameters which were determined from the 1000 sets of clustering results that were independently performed by changing the order of sampling randomly. Clustering conditions: L=9, D th =30º, Culled PDB. Refer to Methods for the meaning of symbols. f cls (r=1) N N cls a b β N est log(n est )/L S est /L (24)

25 FIGURE S5 Suitability of objective functions to minimize in fitting calculations. a, Best fitted curves obtained in the fitting calculations of the same model to the same data when the different objective functions were used. Refer to Supporting Descriptions in Supplemental Materials for the equations Eq.S1, S2, S3, and S4. Clustering conditions: L=9, D th =30º, Culled PDB. b, Kolmogorov- Smirnov (KS) parameters for evaluating the goodness of fit in order to chose an appropriate objective function. KS parameters were determined from two cumulative distribution functions which were respectively computed from the fitted curve and the empirical distribution of the clusters containing at least five segments. Clustering conditions: D th =30º. a fcls or fest r b Eq.s1 Eq.s2 Eq.s3 Eq.s KS PDB Select L Culled PDB (25)

26 FIGURE S6 Histograms showing the structural dissimilarity, D, of 9-residue segments. The segments were generated from the Culled PDB and classified into several classes depending on the value of D against α helix (red), β strand (green), or β-hairpin (blue). A class interval is 5. Number of segments Structural dissimilarity D (º) (26)

27 FIGURE S7 Structural differences of 9-residue segments from the center of the cluster. The x- and y-axes indicate the differences in D and in backbone RMS deviation, respectively. Clustering conditions: Culled PDB, L=9, D th =30º. bbrmsd(å) D (º) (27)

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Peptides And Proteins

Peptides And Proteins Kevin Burgess, May 3, 2017 1 Peptides And Proteins from chapter(s) in the recommended text A. Introduction B. omenclature And Conventions by amide bonds. on the left, right. 2 -terminal C-terminal triglycine

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University Clustering and Model Integration under the Wasserstein Metric Jia Li Department of Statistics Penn State University Clustering Data represented by vectors or pairwise distances. Methods Top- down approaches

More information

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary Electronic Supplementary Material (ESI) for Journal of Materials Chemistry B. This journal is The Royal Society of Chemistry 21 What makes a good graphene-binding peptide? Adsorption of amino acids and

More information

Packing of Secondary Structures

Packing of Secondary Structures 7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary

More information

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics B Ramachandran Plot ~b b 135 b ~b ~l l Psi (degrees) 5-5 a A ~a L - -135 SER HIS (F) 59 (G) SER (B) ~b b LYS ASP ASP 315 13 13 (A) (F) (B) LYS ALA ALA 315 173 (E) 173 (E)(A) ~p p ~b - -135 - -5 5 135 (degrees)

More information

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1 Supporting information to: Time-resolved observation of protein allosteric communication Sebastian Buchenberg, Florian Sittel and Gerhard Stock Biomolecular Dynamics, Institute of Physics, Albert Ludwigs

More information

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination Lecture 9 M230 Feigon Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination Reading resources v Roberts NMR of Macromolecules, Chap 4 by Christina Redfield

More information

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted

More information

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27 Acta Cryst. (2014). D70, doi:10.1107/s1399004714021695 Supporting information Volume 70 (2014) Supporting information for article: Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase

More information

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two Supplementary Figure 1. Biopanningg and clone enrichment of Alphabody binders against human IL 23. Positive clones in i phage ELISA with optical density (OD) 3 times higher than background are shown for

More information

HSQC spectra for three proteins

HSQC spectra for three proteins HSQC spectra for three proteins SH3 domain from Abp1p Kinase domain from EphB2 apo Calmodulin What do the spectra tell you about the three proteins? HSQC spectra for three proteins Small protein Big protein

More information

Computational Protein Design

Computational Protein Design 11 Computational Protein Design This chapter introduces the automated protein design and experimental validation of a novel designed sequence, as described in Dahiyat and Mayo [1]. 11.1 Introduction Given

More information

Proteins: Characteristics and Properties of Amino Acids

Proteins: Characteristics and Properties of Amino Acids SBI4U:Biochemistry Macromolecules Eachaminoacidhasatleastoneamineandoneacidfunctionalgroupasthe nameimplies.thedifferentpropertiesresultfromvariationsinthestructuresof differentrgroups.thergroupisoftenreferredtoastheaminoacidsidechain.

More information

Properties of amino acids in proteins

Properties of amino acids in proteins Properties of amino acids in proteins one of the primary roles of DNA (but not the only one!) is to code for proteins A typical bacterium builds thousands types of proteins, all from ~20 amino acids repeated

More information

Protein Fragment Search Program ver Overview: Contents:

Protein Fragment Search Program ver Overview: Contents: Protein Fragment Search Program ver 1.1.1 Developed by: BioPhysics Laboratory, Faculty of Life and Environmental Science, Shimane University 1060 Nishikawatsu-cho, Matsue-shi, Shimane, 690-8504, Japan

More information

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4. Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4. Field Desorption 5. MS MS techniques Matrix assisted

More information

Structural Alignment of Proteins

Structural Alignment of Proteins Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE

More information

Course Notes: Topics in Computational. Structural Biology.

Course Notes: Topics in Computational. Structural Biology. Course Notes: Topics in Computational Structural Biology. Bruce R. Donald June, 2010 Copyright c 2012 Contents 11 Computational Protein Design 1 11.1 Introduction.........................................

More information

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov 2018 Biochemistry 110 California Institute of Technology Lecture 2: Principles of Protein Structure Linus Pauling (1901-1994) began his studies at Caltech in 1922 and was directed by Arthur Amos oyes to

More information

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods Cell communication channel Bioinformatics Methods Iosif Vaisman Email: ivaisman@gmu.edu SEQUENCE STRUCTURE DNA Sequence Protein Sequence Protein Structure Protein structure ATGAAATTTGGAAACTTCCTTCTCACTTATCAGCCACCT...

More information

April, The energy functions include:

April, The energy functions include: REDUX A collection of Python scripts for torsion angle Monte Carlo protein molecular simulations and analysis The program is based on unified residue peptide model and is designed for more efficient exploration

More information

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 9 Protein tertiary structure Sources for this chapter, which are all recommended reading: D.W. Mount. Bioinformatics: Sequences and Genome

More information

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells? ame 7.01 Problem Set 1 Section Question 1 a) What are the four major types of biological molecules discussed in lecture? Give one important function of each type of biological molecule in the cell? b)

More information

Supporting Information

Supporting Information Supporting Information Micelle-Triggered b-hairpin to a-helix Transition in a 14-Residue Peptide from a Choline-Binding Repeat of the Pneumococcal Autolysin LytA HØctor Zamora-Carreras, [a] Beatriz Maestro,

More information

Computer simulations of protein folding with a small number of distance restraints

Computer simulations of protein folding with a small number of distance restraints Vol. 49 No. 3/2002 683 692 QUARTERLY Computer simulations of protein folding with a small number of distance restraints Andrzej Sikorski 1, Andrzej Kolinski 1,2 and Jeffrey Skolnick 2 1 Department of Chemistry,

More information

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A. hbcse 33 rd International Page 101 hemistry lympiad Preparatory 05/02/01 Problems d. In the hydrolysis of the glycosidic bond, the glycosidic bridge oxygen goes with 4 of the sugar B. n cleavage, 18 from

More information

Resonance assignments in proteins. Christina Redfield

Resonance assignments in proteins. Christina Redfield Resonance assignments in proteins Christina Redfield 1. Introduction The assignment of resonances in the complex NMR spectrum of a protein is the first step in any study of protein structure, function

More information

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Tomoyuki Hiroyasu, Mitsunori Miki, Shinya Ogura, Keiko Aoi, Takeshi Yoshida, Yuko Okamoto Jack Dongarra

More information

Protein Structures: Experiments and Modeling. Patrice Koehl

Protein Structures: Experiments and Modeling. Patrice Koehl Protein Structures: Experiments and Modeling Patrice Koehl Structural Bioinformatics: Proteins Proteins: Sources of Structure Information Proteins: Homology Modeling Proteins: Ab initio prediction Proteins:

More information

Central Dogma. modifications genome transcriptome proteome

Central Dogma. modifications genome transcriptome proteome entral Dogma DA ma protein post-translational modifications genome transcriptome proteome 83 ierarchy of Protein Structure 20 Amino Acids There are 20 n possible sequences for a protein of n residues!

More information

NMR Assignments using NMRView II: Sequential Assignments

NMR Assignments using NMRView II: Sequential Assignments NMR Assignments using NMRView II: Sequential Assignments DO THE FOLLOWING, IF YOU HAVE NOT ALREADY DONE SO: For Mac OS X, you should have a subdirectory nmrview. At UGA this is /Users/bcmb8190/nmrview.

More information

Unraveling the degradation of artificial amide bonds in Nylon oligomer hydrolase: From induced-fit to acylation processes

Unraveling the degradation of artificial amide bonds in Nylon oligomer hydrolase: From induced-fit to acylation processes Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is the Owner Societies 2015 Supporting Information for Unraveling the degradation of artificial amide bonds

More information

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE C H E M I S T R Y 2 0 0 0 A T I A L Q U A L I F Y I G E X A M I A T I SLUTIS GUIDE Answers are a guide only and do not represent a preferred method of solving problems. Section A 1B, 2A, 3C, 4C, 5D, 6D,

More information

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description Version 3.30 Document Published by the wwpdb This format complies with the PDB Exchange Dictionary (PDBx) http://mmcif.pdb.org/dictionaries/mmcif_pdbx.dic/index/index.html.

More information

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate Stef Vanhaecht, Gregory Absillis, Tatjana N. Parac-Vogt* Department

More information

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Viewing and Analyzing Proteins, Ligands and their Complexes 2 2 Viewing and Analyzing Proteins, Ligands and their Complexes 2 Overview Viewing the accessible surface Analyzing the properties of proteins containing thousands of atoms is best accomplished by representing

More information

Model Mélange. Physical Models of Peptides and Proteins

Model Mélange. Physical Models of Peptides and Proteins Model Mélange Physical Models of Peptides and Proteins In the Model Mélange activity, you will visit four different stations each featuring a variety of different physical models of peptides or proteins.

More information

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle. Project I Chemistry 8021, Spring 2005/2/23 This document was turned in by a student as a homework paper. 1. Methods First, the cartesian coordinates of 5021 and 8021 molecules (Fig. 1) are generated, in

More information

Molecular Structure Prediction by Global Optimization

Molecular Structure Prediction by Global Optimization Molecular Structure Prediction by Global Optimization K.A. DILL Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, CA 94118 A.T. PHILLIPS Computer Science

More information

Section Week 3. Junaid Malek, M.D.

Section Week 3. Junaid Malek, M.D. Section Week 3 Junaid Malek, M.D. Biological Polymers DA 4 monomers (building blocks), limited structure (double-helix) RA 4 monomers, greater flexibility, multiple structures Proteins 20 Amino Acids,

More information

NMR parameters intensity chemical shift coupling constants 1D 1 H spectra of nucleic acids and proteins

NMR parameters intensity chemical shift coupling constants 1D 1 H spectra of nucleic acids and proteins Lecture #2 M230 NMR parameters intensity chemical shift coupling constants Juli Feigon 1D 1 H spectra of nucleic acids and proteins NMR Parameters A. Intensity (area) 1D NMR spectrum: integrated intensity

More information

Solutions In each case, the chirality center has the R configuration

Solutions In each case, the chirality center has the R configuration CAPTER 25 669 Solutions 25.1. In each case, the chirality center has the R configuration. C C 2 2 C 3 C(C 3 ) 2 D-Alanine D-Valine 25.2. 2 2 S 2 d) 2 25.3. Pro,, Trp, Tyr, and is, Trp, Tyr, and is Arg,

More information

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS Int. J. LifeSc. Bt & Pharm. Res. 2012 Kaladhar, 2012 Research Paper ISSN 2250-3137 www.ijlbpr.com Vol.1, Issue. 1, January 2012 2012 IJLBPR. All Rights Reserved PROTEIN SECONDARY STRUCTURE PREDICTION:

More information

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Protein Structure Prediction I

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Protein Structure Prediction I BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer 2013 9. Protein Structure Prediction I Structure Prediction Overview Overview of problem variants Secondary structure prediction

More information

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Mathematics and Biochemistry University of Wisconsin - Madison 0 There Are Many Kinds Of Proteins The word protein comes

More information

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine Supplementary figure 1. Comparison of unbound and as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine GM-CSF (slate) with bound GM-CSF in the GIF:GM-CSF complex (GIF: green,

More information

Peptide Syntheses. Illustrative Protection: BOC/ t Bu. A. Introduction. do not acid

Peptide Syntheses. Illustrative Protection: BOC/ t Bu. A. Introduction. do not acid Kevin Burgess, May 3, 2017 1 Peptide yntheses from chapter(s) in the recommended text A. Introduction do not acid -Met-e- -Met-Met- -e-e- -e-met- dipeptide dipeptide dipeptide dipeptide diketopiperazine

More information

CSE 549: Computational Biology. Substitution Matrices

CSE 549: Computational Biology. Substitution Matrices CSE 9: Computational Biology Substitution Matrices How should we score alignments So far, we ve looked at arbitrary schemes for scoring mutations. How can we assign scores in a more meaningful way? Are

More information

Diastereomeric resolution directed towards chirality. determination focussing on gas-phase energetics of coordinated. sodium dissociation

Diastereomeric resolution directed towards chirality. determination focussing on gas-phase energetics of coordinated. sodium dissociation Supplementary Information Diastereomeric resolution directed towards chirality determination focussing on gas-phase energetics of coordinated sodium dissociation Authors: samu Kanie*, Yuki Shioiri, Koji

More information

The translation machinery of the cell works with triples of types of RNA bases. Any triple of RNA bases is known as a codon. The set of codons is

The translation machinery of the cell works with triples of types of RNA bases. Any triple of RNA bases is known as a codon. The set of codons is Relations Supplement to Chapter 2 of Steinhart, E. (2009) More Precisely: The Math You Need to Do Philosophy. Broadview Press. Copyright (C) 2009 Eric Steinhart. Non-commercial educational use encouraged!

More information

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Major Types of Association of Proteins with Cell Membranes. From Alberts et al Major Types of Association of Proteins with Cell Membranes From Alberts et al Proteins Are Polymers of Amino Acids Peptide Bond Formation Amino Acid central carbon atom to which are attached amino group

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

UNIT TWELVE. a, I _,o ' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I -oh UNT TWELVE PROTENS : PEPTDE BONDNG AND POLYPEPTDES 12 CONCEPTS Many proteins are important in biological structure-for example, the keratin of hair, collagen of skin and leather, and fibroin of silk. Other

More information

Supplementary Information Intrinsic Localized Modes in Proteins

Supplementary Information Intrinsic Localized Modes in Proteins Supplementary Information Intrinsic Localized Modes in Proteins Adrien Nicolaï 1,, Patrice Delarue and Patrick Senet, 1 Department of Physics, Applied Physics and Astronomy, Rensselaer Polytechnic Institute,

More information

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease University of Wollongong Research Online Faculty of Science - Papers (Archive) Faculty of Science, Medicine and Health 2009 NMR study of complexes between low molecular mass inhibitors and the West Nile

More information

Protein structure analysis. Risto Laakso 10th January 2005

Protein structure analysis. Risto Laakso 10th January 2005 Protein structure analysis Risto Laakso risto.laakso@hut.fi 10th January 2005 1 1 Summary Various methods of protein structure analysis were examined. Two proteins, 1HLB (Sea cucumber hemoglobin) and 1HLM

More information

Sequence Alignments. Dynamic programming approaches, scoring, and significance. Lucy Skrabanek ICB, WMC January 31, 2013

Sequence Alignments. Dynamic programming approaches, scoring, and significance. Lucy Skrabanek ICB, WMC January 31, 2013 Sequence Alignments Dynamic programming approaches, scoring, and significance Lucy Skrabanek ICB, WMC January 31, 213 Sequence alignment Compare two (or more) sequences to: Find regions of conservation

More information

Conformational Geometry of Peptides and Proteins:

Conformational Geometry of Peptides and Proteins: Conformational Geometry of Peptides and Proteins: Before discussing secondary structure, it is important to appreciate the conformational plasticity of proteins. Each residue in a polypeptide has three

More information

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Exam III. Please read through each question carefully, and make sure you provide all of the requested information. 09-107 onors Chemistry ame Exam III Please read through each question carefully, and make sure you provide all of the requested information. 1. A series of octahedral metal compounds are made from 1 mol

More information

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1 Supplementary Information Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1 Philip D. Mosier 1, Meng-Jung Chiang 2, Zhengshi Lin 2, Yamei Gao 2, Bashayer Althufairi

More information

Protein Structure Bioinformatics Introduction

Protein Structure Bioinformatics Introduction 1 Swiss Institute of Bioinformatics Protein Structure Bioinformatics Introduction Basel, 27. September 2004 Torsten Schwede Biozentrum - Universität Basel Swiss Institute of Bioinformatics Klingelbergstr

More information

Similarity or Identity? When are molecules similar?

Similarity or Identity? When are molecules similar? Similarity or Identity? When are molecules similar? Mapping Identity A -> A T -> T G -> G C -> C or Leu -> Leu Pro -> Pro Arg -> Arg Phe -> Phe etc If we map similarity using identity, how similar are

More information

Bioinformatics Practical for Biochemists

Bioinformatics Practical for Biochemists Bioinformatics Practical for Biochemists Andrei Lupas, Birte Höcker, Steffen Schmidt WS 2013/14 03. Sequence Features Targeting proteins signal peptide targets proteins to the secretory pathway N-terminal

More information

Oxygen Binding in Hemocyanin

Oxygen Binding in Hemocyanin Supporting Information for Quantum Mechanics/Molecular Mechanics Study of Oxygen Binding in Hemocyanin Toru Saito and Walter Thiel* Max-Planck-Institut für Kohlenforschung, Kaiser-Wilhelm-Platz 1, D-45470

More information

Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information

Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information Grzegorz Nawrocki and Marek Cieplak Institute of Physics, Polish Academy of Sciences,

More information

Automated Identification of Protein Structural Features

Automated Identification of Protein Structural Features Automated Identification of Protein Structural Features Chandrasekhar Mamidipally 1, Santosh B. Noronha 1, Sumantra Dutta Roy 2 1 Dept. of Chemical Engg., IIT Bombay, Powai, Mumbai - 400 076, INDIA. chandra

More information

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Programme 8.00-8.20 Last week s quiz results + Summary 8.20-9.00 Fold recognition 9.00-9.15 Break 9.15-11.20 Exercise: Modelling remote homologues 11.20-11.40 Summary & discussion 11.40-12.00 Quiz 1 Feedback

More information

Protein Secondary Structure Prediction using Feed-Forward Neural Network

Protein Secondary Structure Prediction using Feed-Forward Neural Network COPYRIGHT 2010 JCIT, ISSN 2078-5828 (PRINT), ISSN 2218-5224 (ONLINE), VOLUME 01, ISSUE 01, MANUSCRIPT CODE: 100713 Protein Secondary Structure Prediction using Feed-Forward Neural Network M. A. Mottalib,

More information

Protein Struktur (optional, flexible)

Protein Struktur (optional, flexible) Protein Struktur (optional, flexible) 22/10/2009 [ 1 ] Andrew Torda, Wintersemester 2009 / 2010, AST nur für Informatiker, Mathematiker,.. 26 kt, 3 ov 2009 Proteins - who cares? 22/10/2009 [ 2 ] Most important

More information

Exam I Answer Key: Summer 2006, Semester C

Exam I Answer Key: Summer 2006, Semester C 1. Which of the following tripeptides would migrate most rapidly towards the negative electrode if electrophoresis is carried out at ph 3.0? a. gly-gly-gly b. glu-glu-asp c. lys-glu-lys d. val-asn-lys

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D. CHMI 2227 EL Biochemistry I Test 1 26 January 2007 Prof : Eric R. Gauthier, Ph.D. Guidelines: 1) Duration: 55 min 2) 14 questions, on 7 pages. For 70 marks (5 marks per question). Worth 15 % of the final

More information

Electronic Supplementary Information

Electronic Supplementary Information Electronic Supplementary Information A Sensitive Phosphorescent Thiol Chemosensor Based on an Iridium(III) Complex with α,β-unsaturated Ketone Functionalized 2,2 -Bipyridyl Ligand Na Zhao, a Yu-Hui Wu,

More information

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013 Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA

More information

Any protein that can be labelled by both procedures must be a transmembrane protein.

Any protein that can be labelled by both procedures must be a transmembrane protein. 1. What kind of experimental evidence would indicate that a protein crosses from one side of the membrane to the other? Regions of polypeptide part exposed on the outside of the membrane can be probed

More information

Automated Identification of Protein Structural Features

Automated Identification of Protein Structural Features Automated Identification of Protein Structural Features Chandrasekhar Mamidipally 1, Santosh B. Noronha 1, and Sumantra Dutta Roy 2 1 Dept. of Chemical Engg., IIT Bombay, Powai, Mumbai - 400 076, India

More information

Overview. The peptide bond. Page 1

Overview. The peptide bond. Page 1 Overview Secondary structure: the conformation of the peptide backbone The peptide bond, steric implications Steric hindrance and sterically allowed conformations. Ramachandran diagrams Side chain conformations

More information

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of Questions with Answers- Amino Acids & Peptides A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of 5.0-6.5 (Questions 1-4) 1. Which

More information

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry SUPPORTING INFORMATION Desorption/Ionization Efficiency of Common Amino Acids in Surface-assisted Laser Desorption/ionization Mass Spectrometry (SALDI-MS) with Nanostructured Platinum Syuhei Nitta, Hideya

More information

Bacterial protease uses distinct thermodynamic signatures for substrate recognition

Bacterial protease uses distinct thermodynamic signatures for substrate recognition Bacterial protease uses distinct thermodynamic signatures for substrate recognition Gustavo Arruda Bezerra, Yuko Ohara-Nemoto, Irina Cornaciu, Sofiya Fedosyuk, Guillaume Hoffmann, Adam Round, José A. Márquez,

More information

Basic Principles of Protein Structures

Basic Principles of Protein Structures Basic Principles of Protein Structures Proteins Proteins: The Molecule of Life Proteins: Building Blocks Proteins: Secondary Structures Proteins: Tertiary and Quartenary Structure Proteins: Geometry Proteins

More information

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i Full wwpdb X-ray Structure Validation Report i Mar 14, 2018 02:00 pm GMT PDB ID : 3RRQ Title : Crystal structure of the extracellular domain of human PD-1 Authors : Lazar-Molnar, E.; Ramagopal, U.A.; Nathenson,

More information

Bayesian Probabilistic Approach for Predicting Backbone Structures in Terms of Protein Blocks

Bayesian Probabilistic Approach for Predicting Backbone Structures in Terms of Protein Blocks PROTEINS: Structure, Function, and Genetics 41:271 287 (2000) Bayesian Probabilistic Approach for Predicting Backbone Structures in Terms of Protein Blocks A. G. de Brevern, 1 * C. Etchebest, 1,2 and S.

More information

Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39

Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39 Protein Struktur Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep wake up at slide 39 Andrew Torda, Wintersemester 2016/ 2017 Andrew Torda 17.10.2016 [ 1 ] Proteins - who cares?

More information

Secondary and sidechain structures

Secondary and sidechain structures Lecture 2 Secondary and sidechain structures James Chou BCMP201 Spring 2008 Images from Petsko & Ringe, Protein Structure and Function. Branden & Tooze, Introduction to Protein Structure. Richardson, J.

More information

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions BIRKBECK COLLEGE (University of London) Advanced Certificate in Principles in Protein Structure MSc Structural Molecular Biology Date: Thursday, 1st September 2011 Time: 3 hours You will be given a start

More information

Lecture 15: Realities of Genome Assembly Protein Sequencing

Lecture 15: Realities of Genome Assembly Protein Sequencing Lecture 15: Realities of Genome Assembly Protein Sequencing Study Chapter 8.10-8.15 1 Euler s Theorems A graph is balanced if for every vertex the number of incoming edges equals to the number of outgoing

More information

Supporting information

Supporting information Electronic Supplementary Material (ESI) for New Journal of Chemistry. This journal is The Royal Society of Chemistry and the Centre National de la Recherche Scientifique 2015 Supporting information Influence

More information

Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction

Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction Name page 1 of 6 Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction Part I. The ion Ca 2+ can function as a 2 nd messenger. Pick a specific signal transduction pathway

More information

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets Supporting information Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets Wan-Na Chen, Christoph Nitsche, Kala Bharath Pilla, Bim Graham, Thomas

More information

BIRKBECK COLLEGE (University of London)

BIRKBECK COLLEGE (University of London) BIRKBECK COLLEGE (University of London) SCHOOL OF BIOLOGICAL SCIENCES M.Sc. EXAMINATION FOR INTERNAL STUDENTS ON: Postgraduate Certificate in Principles of Protein Structure MSc Structural Molecular Biology

More information

Heteropolymer. Mostly in regular secondary structure

Heteropolymer. Mostly in regular secondary structure Heteropolymer - + + - Mostly in regular secondary structure 1 2 3 4 C >N trace how you go around the helix C >N C2 >N6 C1 >N5 What s the pattern? Ci>Ni+? 5 6 move around not quite 120 "#$%&'!()*(+2!3/'!4#5'!1/,#64!#6!,6!

More information

Details of Protein Structure

Details of Protein Structure Details of Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Anne Mølgaard, Kemisk Institut, Københavns Universitet Learning Objectives

More information

Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes Ora Schueler-Furman 1,2, Ron Elber 2 and Hanah Margalit 1

Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes Ora Schueler-Furman 1,2, Ron Elber 2 and Hanah Margalit 1 Research Paper 549 Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes Ora Schueler-Furman 1,2, Ron Elber 2 and Hanah Margalit 1 Background: The binding of T-cell

More information

ENZYME MECHANISMS, PROTEASES, STRUCTURAL BIOLOGY

ENZYME MECHANISMS, PROTEASES, STRUCTURAL BIOLOGY Supplementary Information SUBJECT AREAS: ENZYME MECHANISMS, PROTEASES, STRUCTURAL BIOLOGY Correspondence and requests for materials should be addressed to N.T. (ntanaka@pharm.showa-u.ac.jp) or W.O. (owataru@vos.nagaokaut.ac.jp)

More information

CHEM J-9 June 2014

CHEM J-9 June 2014 CEM1611 2014-J-9 June 2014 Alanine (ala) and lysine (lys) are two amino acids with the structures given below as Fischer projections. The pk a values of the conjugate acid forms of the different functional

More information

Chapter 4: Amino Acids

Chapter 4: Amino Acids Chapter 4: Amino Acids All peptides and polypeptides are polymers of alpha-amino acids. lipid polysaccharide enzyme 1940s 1980s. Lipids membrane 1960s. Polysaccharide Are energy metabolites and many of

More information

Problem Set 1

Problem Set 1 2006 7.012 Problem Set 1 Due before 5 PM on FRIDAY, September 15, 2006. Turn answers in to the box outside of 68-120. PLEASE WRITE YOUR ANSWERS ON THIS PRINTOUT. 1. For each of the following parts, pick

More information