Truncating Spy s unstructured tails does not dramatically decrease chaperone activity in vivo or in vitro.

Size: px
Start display at page:

Download "Truncating Spy s unstructured tails does not dramatically decrease chaperone activity in vivo or in vitro."

Transcription

1 Supplementary Figure 1 Truncating Spy s unstructured tails does not dramatically decrease chaperone activity in vivo or in vitro. (a) Overexpression of Spy increases the steady state level of the unstable Im7 mutant (Im7 L53A I54A) detected via SDS-PAGE and Coomassie stain (above) or by anti-his tag antibody (below). (b) Truncating Spy s unstructured tails does not dramatically alter chaperone activity in vitro, as determined by the aldolase refolding assay. In vitro, Spy is able to provide minor aid to aldolase refolding capacity, compared to the major folding aid provided to Im7 in vivo. Error bars plotted are s.e.m. using 3-5 replicates from separate experiments.

2 Supplementary Figure 2 Detection of substrates in Spy crystals. Substrates are Im (a), casein (b), and Im7 L18A L19A L37A (c), respectively. Lanes W1-W5: washes of crystals in crystallization solution; lane D: dissolved crystals. The lack of protein in (b) for W2 relative to W3 for casein and in (c) for Im7 L18A L19A L37A for W2 relative to W1 was due to some crystal breakage in the latter wash drops. WT Im7 crystals were too fragile to survive the washing process.

3 Supplementary Figure 3 Im structural properties. (a) Comparison of disordered Im helical propensity in simulation and experiment. Per-residue helicity profiles obtained from coarse-grained simulations (FF1-FF3) and NMR chemical shifts (Loïc Salmon, Logan S. Ahlstrom, Scott Horowitz, Alex Dickson, Charles L. Brooks III, and James C. A. Bardwell, unpublished data: Exp, black curve). The unmodified Im force field (FF1, red curve) yields an ensemble that overestimates the helicity. Applying an adjustment to the pseudo-dihedral potential (FF2, cyan curve) improves the agreement of the helicity profile with experiment. Rescaling the non-bonded interactions in addition to this adjustment results in a helicity distribution that closely matches the experimental profile (FF3, orange curve). (b) Native structure of WT Im7. Residues 6-45 are highlighted in orange.

4 Supplementary Figure 4 Pseudocrystal environments for Spy-Im simulations. (a) and (b) The two pseudo-crystal environments for Spy Im simulations. Use of two environments allows all multi-spy interactions Im would encounter within the crystal. Both environments comprise four crystallographic Spy dimers (blue, red, yellow, and magenta) and a single peptide (orange).

5 Supplementary Figure 5 Simulated data validation test of READ selection protocol. (a) Flowchart of simulated data validation test. (b) Three target ensembles (cyan) overlaid with selected ensembles (magenta). RMSDs of target to selected ensembles: 5.7 Å (top left), 6.5 Å (middle right), and 3.7 Å (bottom left).

6 Supplementary Figure 6 Flowchart of pool corruption validation test for READ selection protocol.

7 Bootstrapping validation test of READ selection protocol. Supplementary Figure 7 37% of data replaced with repeat data in 200 separate selections was used to calculate mean and standard error contact maps. Note the change in intensity scale for the standard error map of the selections. The contact profile of the ensembles from the bootstrapped selections has low error and displays a different binding profile

8 than the initial MD pool.

9 Supplementary Figure 8 Additional binding and competition assays. (a) Binding of Spy to Im7 L18A L19A L37A H40W measured via ITC. This Im7 mutant was subsequently used for competition assays with Im and Spy (b) Casein competes with Im for Spy binding. Due to binding between Im and casein , as identified by analytical ultracentrifugation experiments of the two proteins (c), we were unable to accurately quantitate competition between Im and casein (b). The dissociation constant of casein to Spy was determined by fluorescence (d) with a 1:1 stoichiometry of casein to Spy (e). Also, see Fig. 5c for competition of Im with Im7 L18A L19A L37A H40W. Error bars depict s.d. of n=3 technical replicates.

10 Supplementary Table 1 Summary of sigma-scaled anomalous peak heights from methionine sulfur atoms and assigned iodine signals after refinement Spy:Im7 E12I (crystal 1) Spy:Im7 E12I (crystal 2) Spy:Im7 A13I Median Methionine S 3.9 Median Methionine S 3.35 Median Methionine S 4 Iodine(s) 4.8 Iodine(s) 3.8 Iodine(s) Spy:Im7 E14I Spy:Im7 Q17I Spy:Im7 L19I (crystal 1) Spy:Im7 L19I (crystal 2) Median Methionine S 3.5 Median Methionine S 3.15 Median Methionine S 3.5 Median Methionine S 3.1 Iodine(s) 3.3 Iodine(s) 3.5 Iodine(s) 3.3 Iodine(s) Spy:Im7 E21I (crystal 1) Spy:Im7 E21I (crystal 2) Spy:Im7 V27I Spy:Im7 A28I Median Methionine S 3.95 Median Methionine S 3.6 Median Methionine S 3.7 Median Methionine S 3.5 Iodine(s) 4.4 Iodine(s) 3.9 Iodine(s) 4.4 Iodine(s)

11 Supplementary Table 2 Additional crystallography statistics Spy:Im7 A28I Spy:Im7 A28I Spy:Im7 V27I Spy:Im7 V27I Data collection Wavelength (Å) Space group P4 122 P4 122 P4 122 P4 122 Cell dimensions a, b, c (Å) 43.08, 43,08, , 43.2, , 42.6, , 42.6, α,β,γ ( ) 90, 90, 90 90, 90, 90 90, 90, 90 90, 90, 90 Resolution (Å) ( ) ( ) ( ) ( ) R merge (%) 16.8(29.7) 11.8(23.1) 5.1(17.3) 5.0(14) I/σ(I) 10.0(8.7) 14.7(9.0) 15.9(6.8) 22.6(10.5) Completeness (%) 100(100) 100(100) 97.7(94.9) 95.2(87.2) Redundancy 11.7(11.8) 13.4(13.2) 4.9(4.7) 7.4(7.7) Spy:Im7 E12I (crystal 2) Spy:Im7 E12I (crystal 2) Data collection Wavelength (Å) Space group P4 122 P4 122 Cell dimensions a, b, c (Å) 42.8, , 42.9, α,β,γ ( ) 90, 90, 90 90, 90, 90 Resolution (Å) ( ) ( ) R merge (%) 6.1(17) 5.5(19.6) I/σ(I) 20.0(10.4) 17.8(6.9) Completeness (%) 99.8(99.6) 99.7(99.3) Redundancy 8.2(8.1) 6.0(5.5) Spy:Im7 A13I Spy:Im7 A13I Spy:Im7 E14I Spy:Im7 E14I Data collection Wavelength (Å) Space group P4 122 P4 122 P4 122 P4 122 Cell dimensions a, b, c (Å) 43.1, 43.1, , 43.1, , 42.8, , 42.9, α,β,γ ( ) 90, 90, 90 90, 90, 90 90, 90, 90 90, 90, 90 Resolution (Å) ( ) ( ) ( ) ( ) R merge (%) 5.3(11.5) 5.1(11.2) 6.4(26.8) 7.0(40.1) I/σ(I) 30.6(18.5) 30.6(18.7) 18.7(6.6) 16.1(4.6) Completeness (%) 99.9(100) 99.5(98.9) 95.2(89.6) 94.8(88.7) Redundancy 12.6(12.6) 11.9(11.8) 8.0(8.4) 7.7(8.0) Spy:Im7 E12I (crystal 1) Spy:Im7 E12I (crystal 1) Spy:Im7 Q17I Spy:Im7 Q17I Data collection Wavelength (Å) Space group P4 122 P4 122 P4 122 P4 122 Cell dimensions a, b, c (Å) 42.7, 42.7, , 42.7, , 42.7, , 42.7, α,β,γ ( ) 90, 90, 90 90, 90, 90 90, 90, 90 90, 90, 90 Resolution (Å) ( ) ( ) ( ) ( ) R merge (%) 5.1(11.3) 6.2(16.5) 6.7(22.3) 5.7(38.4) I/σ(I) 24.2(14.8) 18.8(10.2) 21.0(8.3) 14.4(3.6) Completeness (%) 90.0(92.7) 88.7(90.6) 94.7(87.7) 93.4(87.0) Redundancy 7.3(6.8) 7.7(7.1) 8.0(8.6) 5.3(5.4) Spy:Im7 L19I (crystal 1) Spy:Im7 L19I (crystal 1) Spy:Im7 L19I (crystal 2) Spy:Im7 L19I (crystal 2) Data collection Wavelength (Å) Space group P4 122 P4 122 P4 122 P4 122 Cell dimensions a, b, c (Å) 42.9, 42.9, , 42.9, , 42.9, , 42.9, α,β,γ ( ) 90, 90, 90 90, 90, 90 90, 90, 90 90, 90, 90 Resolution (Å) ( ) ( ) ( ) ( ) R merge (%) 7.0(16.9) 7.4(31.0) 7.7(26.1) 10.0(71.8) I/σ(I) 19.1(10.2) 17.1(6.2) 16.7(7.1) 12.3(2.7) Completeness (%) 100(100) 95.6(96.8) 100(100) 99.9(100) Redundancy 8.5(8.5) 9.2(8.8) 8.4(8.6) 8.0(8.2)

12 Spy:Im7 E21I (crystal 1) Spy:Im7 E21I (crystal 1) Spy:Im7 E21I (crystal 2) Spy:Im7 E21I (crystal 2) Data collection Wavelength (Å) Space group P4 122 P4 122 P4 122 P4 122 Cell dimensions a, b, c (Å) 42.7, 42.7, , 42.7, , 42.8, , 42.8, α,β,γ ( ) 90, 90, 90 90, 90, 90 90, 90, 90 90, 90, 90 Resolution (Å) ( ) ( ) ( ) ( ) R merge (%) 4.8(7.9) 4.7(15.2) 6.3(13.6) 6.6(15.9) I/σ(I) 28.9(19.6) 25.6(12.6) 20.3(11.0) 19.4(10.1) Completeness (%) 100(100) 99.9(100) 99.8(99.3) 99.9(100) Redundancy 8.6(8.7) 8.8(8.9) 8.1(7.9) 8.6(8.5)

13 Supplementary Table 3 Strains and plasmids Strain Genotype or relevant characteristics Source SQ765 SQ1805 SQ1809 MG1655 (F λ ilvg rfb-50 rph-1), ΔhsdR SQ765, pcdftrc-ssim7l53ai54a, ptrc-spy SQ765, pcdftrc-ssim7l53ai54a SQ2049 SQ765, pcdftrc-ssim7l53ai54a, ptrc-spy This study SQ2050 SQ765, pcdftrc-ssim7l53ai54a, ptrc-spy This study SQ2051 SQ765, pcdftrc-ssim7l53ai54a, ptrc-spy This study SQ2052 SQ765, pcdftrc-ssim7l53ai54a, ptrc-spy This study SQ2053 SQ765, pcdftrc-ssim7l53ai54a, ptrc-spy This study Plasmid Relevant characteristics Source ptrc-spy ptrc-spy ptrc-spy ptrc-spy ptrc-spy ptrc-spy Spy cloned into ptrc99a derived vector psstrx Spy fragment (33-130) with native signal sequence cloned into psstrx Spy fragment (30-130) with native signal sequence cloned into psstrx Spy fragment (27-130) with native signal sequence cloned into psstrx Spy fragment (24-130) with native signal sequence cloned into psstrx Spy fragment (21-130) with native signal sequence cloned into psstrx 19 This study This study This study This study This study

14 Supplemental Note Supplemental Results Ensemble size determination and cross-validation tests To determine the most appropriate Im ensemble size, we performed the selection with sizes ranging from one to twenty. Overall, increasing the ensemble size improved the fit. The improvement declined, however, at larger ensemble sizes, particularly for the iodine data. With six members in the ensemble, the vast majority of the iodine signals were accounted for. We assessed the validity of the ensemble and ensured that it has predictive power by using cross-validation tests. Cross validations are commonly used to test whether a model adequately reproduces the experimental data while avoiding over-interpretation 1,2. We randomly excluded 10% of the residual electron density and iodine signals from the selection process for use as a test dataset. We then averaged over 10 independent cross-validations to generate a 10-fold crossvalidation test, similar to cross-validation test strategies in NMR spectroscopy 3. This test demonstrated that selected ensembles of at least six members back-predicts the test dataset better than ensembles randomly chosen from the MD pool (Online Methods). Passing this crossvalidation test demonstrates that the obtained ensemble has predictive capability for data held back from the fit and provides a meaningful description of the substrate conformational ensemble. Importantly, the cross-validation also examined if the underlying data points are incoherent 4,5 by testing whether iodine signals could be predicted using data arising from other pi-phe substitutions. If the different pi-phe substitutions in Im significantly impacted the binding mode or conformations of Im7 6-45, then the cross-validation test would have failed. The success of our cross-validation test, however, suggests that our labeling strategy did not induce major distortion in the binding mode or conformations of Im Minor differences might be introduced by the pi-phe substitutions, but these changes remain below the resolution of the READ approach. To further investigate the capability of our ensembles to recapitulate the electron density of the Spy:Im complex, we subjected the ensembles to crystallographic refinement. For ensemble sizes up to six, addition of the ensemble to the structural model of Spy produced a small improvement in R Free, decreasing it from 24% to 23%. This result demonstrated that including the six-member Im ensemble enables a better fit to the experimental electron density. At ensemble sizes larger than six members, this decrease in R Free was no longer evident. Theχ 2 cost function used in the selection prioritizes fitting areas of high density over low 1 Clore, G.M. & Garrett, D.S. R-factor, free R, and complete cross-validation for dipolar coupling refinement of NMR structures. Journal of the American Chemical Society 121, (1999). 2 Kleywegt, G.J. & Brunger, A.T. Checking your imagination: applications of the free R value. Structure 4, (1996). 3 Brunger, A.T., Clore, G.M., Gronenborn, A.M., Saffrich, R. & Nilges, M. Assessing the quality of solution nuclear magnetic resonance structures by complete cross-validation. Science 261, (1993). 4 ibid. 5 Zhang, Q., Stelzer, A.C., Fisher, C.K. & Al-Hashimi, H.M. Visualizing spatially correlated dynamics that directs RNA conformational transitions. Nature 450, (2007).

15 density. Thus, we pruned our model to only contain those atoms that were either directly selected in the READ procedure or that could be justified based on agreement with the residual electron density. Our final ensemble contains one out of every 2.5 residues in Im This six-membered ensemble was the largest ensemble that could simultaneously decrease the R Free and pass the 10-fold cross-validation test. We therefore used this ensemble as a model of the conformations that the substrate Im adopts while bound to its chaperone (Fig. 3 and Supplementary Movie 1). Additional validations of the selection and ensemble accuracy To test whether our selection strategy was indeed successful in providing useful models to fit the READ data, we investigated its ability to reproduce important structural features of chaperonebound Im using simulated data. For this set of validations, we performed our selections with noise-corrupted simulated data back-calculated from three pre-chosen target ensembles (Online Methods and Supplementary Fig. 5a). These target ensembles varied in the level of substrate structure and location with respect to Spy. Our goal was to test whether running selections with these back-calculated target data would recapitulate the target ensembles. In each case, we found that the selected ensembles were similar to the target ensembles (Supplementary Fig. 5b). These results confirmed that our selection indeed reaches convergence and is able to accurately determine ensembles based on experimental data. Next, we tested if the final ensemble accurately reflects the READ data as opposed to being overly influenced by the initial conformational pool. To distinguish between these two possibilities, we performed a test in which we significantly altered the MD pool used for the selection to make it energetically unrealistic (Supplementary Fig. 6). To do this, we first extracted ~1000 conformers from the energetically realistic pool that best recapitulated the conformational space observed in our READ selections. This extraction was accomplished by performing READ selections with ensemble sizes ranging from 1 to 20 conformers, and pooling these selected conformers. We then created an alternate energetically unrealistic MD pool that encompassed ~9,000 conformers generated from an MD simulation in which all intramolecular interactions within the Im substrate were turned off. The conformational features of this alternate pool were notably different and in poorer agreement with solution data reporting on Im than those of the original MD pool. We combined the alternate pool with the structures from our READ selections and thereby generated a mixed ensemble of ~10,000 members for a new round of selection (Supplementary Fig. 6). We reasoned that if the experimental data indeed guided the selection, the resulting ensembles should be enriched in structures that had previously been selected from the energetically realistic pool. In contrast, if the experimental data were not driving the selection, then the new ensembles would be heavily polluted by the alternate pool. Performing the selection using this mixed pool resulted in ensembles that were highly enriched in structures from the original pool of structures derived through READ (Supplementary Fig. 6). On average, two-thirds of the newly selected structures came from the original energetically realistic, READ-selected conformers, despite these conformers only forming one tenth of the combined pool. This enrichment indicated that the selection is significantly guided by the READ data and does not simply reflect the mostly populated members of the pool. Finally, to investigate the precision of the observed Spy-Im interactions from the READ ensemble, we performed a bootstrapping error estimation. This classical resampling procedure (Supplemental Methods) was used to estimate uncertainties in the contact map of the resulting ensemble. The small standard error of contact maps from 200 bootstrapped selections

16 (Supplementary Fig. 7) suggested a high degree of precision in our selection. Moreover, our finding that selected ensembles were enriched in regions of low population in the initial MD pool demonstrated that the READ data is indeed guiding the selection. Combined, these validation tests confirmed that the selection procedure and selected six-member ensemble recapitulate the experimental data. Supplemental Methods Simulations of Spy-substrate interaction. Force field. We performed MD simulations of Spy-substrate binding in CHARMM 6 by building on the Karanicolas-Brooks Gō-like model 7. In this model, each residue was represented by a single bead located at the Cα position and with the mass of the underlying amino acid. An additive potential describes the bonded and non-bonded interactions between beads: V = V #$%& + V (%)*+ + V &,-+&.(* + V %$%/#$%& (1) Virtual bonds and angles were specified by harmonic potentials with reference values determined by the Cα coordinates in the experimental structure. The dihedral energy term reflects the backbone dihedral angle probability distributions in the Protein Data Bank (PDB) for the 400 possible amino acid pairs and is thus independent of the experimental dihedral angles. To account for α-helical bias in this model arising from the high incidence of helical residues in the PDB, we added a correction to the dihedral potential 8. For the set of intramolecular non-bonded interactions, residue pairs separated in sequence by three or more bonds and located in close proximity in the experimental structure ( native contacts ) interact through the following potential, which leads to increased cooperativity in protein folding compared to a standard Lennard-Jones 12-6 interaction 9 : V,0 = ε :; :> where r,0 is the distance between residues i and j, r,0 C,% is the distance between the two residues at which V ij is minimum, and ε ij is the strength of the interaction at r,0 C,%. For a given native contact, r,0 C,% was set to the Cα-Cα distance between the residue pair in the experimental model,. (2) 6 Brooks, B.R. et al. CHARMM: The Biomolecular Simulation Program. Journal of Computational Chemistry 30, (2009). 7 Karanicolas, J. & Brooks, C.L., III. The origins of asymmetry in the folding transition states of protein L and protein G. Protein Science 11, (2002). 8 De Sancho, D. & Best, R.B. Modulation of an IDP binding mechanism and rates by helix propensity and non-native interactions: association of HIF1alpha with CBP. Molecular Biosystems 8, (2012). 9 Karanicolas, J. & Brooks, C.L. Improved Go-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. Journal of Molecular Biology 334, (2003).

17 and ε ij was scaled in proportion to the Miyazawa-Jernigan statistical contact energies 109. A slight repulsive interaction proportional to the first term in equation (2) is applied to residue pairs not in contact. To describe interactions between Spy and substrate, we add a generic inter-protein potential on top of the Gō-like model. This intermolecular potential takes the form of a Lennard-Jones term: V,0 = ε, :; where r ij and ε ij are defined as in equation (2). r,0 C,% was obtained from the mean Cα-Cα distance for residue pairs that form intermolecular contacts in the PDB 11. System setup. An initial model of Im was obtained from residues 6-45 of the full-length Im7 crystal structure (PDB ID: 1CEI) 12. We renormalized the strength of intramolecular Im interactions such that the per-residue helical propensities during simulation matched those obtained from NMR chemical shifts (Supplementary Fig. 4). Initial coordinates for Spy were taken from the co-crystal structure in the current study, while rebuilding the disordered linker and terminal residues by choosing the lowest energy configurations generated by MODELLER To account for the crystal environment in generating Spy:Im peptide poses for the subsequent selection procedure, we performed simulations of two different pseudo crystals that together encompass the possible contacts of Im with Spy within the crystal (Supplementary Fig. 4 and Fig. 2). Dynamics. Six independent pseudo-crystal MD simulations (three in each crystal environment) were seeded from different unfolded Im conformations randomly oriented about the Spy molecules chosen from the Im alone simulations (see above). In each simulation, Langevin dynamics were propagated at 300 K with a friction coefficient of 0.1 ps 1 for 10 8 steps with a 15 fs time step and different random seeds for assigning velocities. To preserve the pseudo-crystal arrangement, we maintained the non-loop and non-terminal atoms of the Spy molecules about their initial coordinates using harmonic restraints with a 10 kcal mol 1 Å 2 force constant. These simulations used the same NMR-parameterized intramolecular Im interaction strengths as described in step II (above). In total, we obtained a diverse pool of 600,000 configurations for analysis. Selection Pool Preparation. For each configuration from the pseudo-crystal MD simulations, we determined the Spy-Im pair with the greatest number of intermolecular residue-residue contacts. We then analyzed these binary complexes for steric clashes within the full crystal 10 Miyazawa, S. & Jernigan, R.L. Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. Ibid.256, (1996). 11 Cheng, S., Zhang, Y. & Brooks, C.L., III. PCalign: a method to quantify physicochemical similarity of protein-protein interfaces. BMC Bioinformatics 16, 33 (2015). 12 Chak, K.F., Safo, M.K., Ku, W.Y., Hsieh, S.Y. & Yuan, H.S. The crystal structure of the immunity protein of colicin E7 suggests a possible colicin-interacting surface. Proceedings of the National Academy of Sciences 93, (1996). 13 Sali, A. & Blundell, T.L. Comparative protein modelling by satisfaction of spatial restraints. Journal of Molecular Biology 234, (1993).. (3)

18 environment. We selected a total of ~10,000 non-clashing binding poses at an even extraction rate using a Cα-Cα cutoff of 4.0 Å, 0.5 Å less than the shortest r,0 C,% in our course-grained force field. For each pose, we recombined the Spy loop and termini and Im Cα coordinates with the Spy co-crystal structure and refined XYZ coordinates, B-factors, and occupancies using Phenix, with harmonic restraints set to limit the motion of Im This refinement of each individual conformer was necessary to establish 2mF o -DF c map values for each conformer using refined B-factors that would be directly used in the selection. By performing this refinement for each conformer individually, the bulk-solvent scaling would be as accurate as possible at this stage. The sets of Spy:Im complexes and resulting electron density maps served as the starting point for selection. The final binding poses from the MD pool were checked against both the residual electron density and iodine anomalous signals to ensure that the pool sufficiently sampled regions proximal to the data. Small changes in the Spy model (e.g. changes that caused R Free to increase up to 26%) were found to not make significant changes in the residual electron density. Binning the residual Im7 electron density. As the selection algorithm (see Ensemble selection, below) requires many cycled repeating comparisons between the residual electron density and test ensembles, the following procedure was applied to provide fast model to map comparisons. The output 2mF o -DF c maps from refining the MD simulation were averaged to create a 2mF o - DF c map averaged over the entire simulation. Then, we extracted map values from the averaged 2mF o -DF c map at the position of each Cα of each frame of the ~10,0000 refined MD configurations. A 4 Å 3D-grid was then used to average those values within bins for the selection. All bins with a density lower than a defined bottom threshold ranging from e/å 3 to e/å 3 were discarded, with e/å 3 providing the best results as determined by holdout cross-validation tests (see below). All values above 0.2 e/å 3 were truncated to 0.2 e/å 3 to avoid any contaminating electron density arising from well-ordered solvent or Spy that would bias the selection. Finally, each of the 4 Å bins sampled by less than 100 Cα over the course of the MD trajectory were discarded due to insufficient sampling that could lead to incomplete electron density averaging. This procedure led to an experimental histogram of 90 points, a computationally feasible number for incorporation into the selection procedure described below. Then, we applied these 90 bins to each of the individual MD frames, leading to individual histograms for each of the separate test conformers. Ensemble selection. A genetic algorithm was used to select a sub-ensemble of N conformers from the MD pool of ~10,000 configurations 15 (Fig. 2). The algorithm is initialized by generating a set of X=100 ensembles of N conformers generated randomly. Then, 50,000 steps of evolution are iteratively performed in which 4X new ensembles are generated using internal and external mutations, reproduction, or by generating new random ensembles. Mutations consist of modifying a given ensemble by substituting one of its conformers by another conformer, either 14 Afonine, P.V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallographica Section D-Biological Crystallography D68, (2012). 15 Nodet, G. et al. Quantitative Description of Backbone Conformational Sampling of Unfolded Proteins at Amino Acid Resolution from NMR Residual Dipolar Couplings. Journal of the American Chemical Society 131, (2009).

19 from the conformations not yet selected within the ensembles (external mutations) or from the conformations already selected (internal mutations). Reproduction consists in paring existing ensembles and mixing the conformers of these existing ensembles. The obtained 5X ensembles are pooled in T=1 100 different tournaments. The number of tournaments starts high (100) to provide a low selection pressure and favor conformational diversity. Reducing the number of tournaments through the iterations progressively increases the selection pressure as the χ 2 decreases to focus the selection towards an optimal ensemble. The ensembles providing the best data reproduction in each tournament are kept to obtain a new set of X ensembles. The data ; reproduction, at each step, is assessed by using a fitness function χ G$G reporting on both the iodine anomalous signal and the binned residual density of Im and the flexible linker regions of Spy. No helicity constraints were used in the selection procedure. For the residual electron density selection, the reproduction of the experimental histogram (see above) was assessed using a classical χ 2 function: ; χ +*+H =, ρ, H(*H ρ, +JK δ +*+H where i runs over the different bins of the experimental histogram, ρ, +JK is the value of the experimental histogram in that bin, and ρ, H(*H is the averaged value of the individual histograms for the conformations in the test ensemble (see above) and δ +*+H, a constant weight fixed to 0.05 to estimate of the noise in the residual electron density map. The choice of χ 2 biases the selection towards fitting areas of high electron density preferentially over areas of low electron density. As such, analysis of the χ 2 selections showed that bins in the top 50% as ranked by electron density level were twice as likely to be fit by protein than bins in the lower 50% as ranked by electron density level. For comparison, a correlation coefficient selection was also performed, in which the χ 2 method was tested against the scale-independent correlation coefficient. Theχ 2 selection performed better in the hold-out cross-validation tests (see below) and was therefore used in further selections. Minimizing χ 2 has the advantage of preferentially fitting regions of high electron density while de-prioritizing areas of low electron density. For the iodine anomalous signal, a χ 2 function modified for better handling of outliers was used 16,17 : ; χ,$&$ = c ; 1 exp D, H(*H +JK D,, D i calc ; cδ,$&$ where i runs over the different iodine signal, is the closest distance to = 6.5 Å between the iodine signal and the corresponding Cα of any of the conformers in the ensemble, δ,$&$ is a weight estimated using the atomic fluctuation of the particular iodine as defined above, and c is fixed at its canonical value This target function was selected to handle the possibility of iodine substitutions altering the conformational sampling of Im Bouvignies, G., Meier, S., Grzesiek, S. & Blackledge, M. Ultrahigh-resolution backbone structure of perdeuterated protein GB1 using residual dipolar couplings from two alignment media. Angewandte Chemie-International Edition 45, (2006). 17 Press, W.H. Numerical recipes : the art of scientific computing, xxi, 1235 p. (Cambridge University Press, Cambridge, UK ; New York, 2007). ; D i exp

20 The total fitness function, χ ; G$G, used to select the best ensembles in each tournament is then computed as: ; ; ; χ G$G = χ +*+H + λχ,$&$ where λ= 2.9 is a scaling factor introduced to ensure that the selection receives equal guidance from residual electron density and anomalous signals. The value of λ was computed as the ratio of the density and the iodine data points. Validation tests. 10-Fold cross-validation tests. To test the significance of the selection, we removed 10% of both electron density and iodine anomalous signal positions from the selection. The selection was then performed using the 90% remaining, and the data reproduction was assessed by ; computing the final χ G$G at the end of the selection using only the 10% of data unused in the fit. The procedure was repeated using 10 different sets of randomly removed data, and averaged (Fig. 1c,d). Alternatively, each iodine probe was removed one after the other and the active selection was carried out using the remaining iodine probes and the electron density. Again, the data reproduction was assessed using only the iodine signals that were not included in the selection procedure. Data reproduction using this alternative iodine removal procedure was roughly equivalent to the former case (data not shown). Boostrapping. Boostrapping was performed by resampling the experimental data distribution and randomly replacing a fraction of the measured points with repeat data. In the procedure, 200 new datasets were generated in which the canonical value of 37% of the data points 18 (both iodine anomalous signals and residual electron density bins) were removed and replaced with randomly chosen repeat data. For each of these datasets, the selection procedure was as described above (Ensemble selection). The 200 final ensembles were used to estimate the precision of the Spy:Im contact map. Simulated data. Three separate six-member target ensembles were chosen from the MD pool to provide target ensembles with varying levels of structure and placement relative to Spy. Simulated iodine signals and electron density were back calculated from these structures and noise corrupted. For iodine back-prediction, the same number of anomalous signals was used as in experiment. Simulated iodine anomalous signals were inserted by choosing random points 6.5 Å away from the Cα of the pi-phe-substituted residue. The position of this simulated iodine signal was then noise corrupted using a Gaussian function, with a standard deviation of 0.5 Å. To generate a target electron density map, simulated 2mF o -DF c maps were back-calculated from the target ensembles using Phenix. Map values were extracted for each Cα of these ensembles. To noise-corrupt the map, we took the positional coordinates of each Cα in the MD pool and replaced the 2mF o -DF c map values for each Cα in the MD pool with noise. Gaussian electron density noise for the map values was set to an average amplitude of e/å 3 and standard deviation of e/å 3. The map values and Cα positions from the target data were then combined with the noise map values and positions. Binning the electron density then proceeded as described above (see Binning the residual Im7 electron density). This noise-corrupted data was then used to perform READ selections. The difference between the target and initial ensembles were analyzed by pairwise RMSD. In this analysis, the pairwise RMSDs between the 18 Press, W.H. Numerical recipes : the art of scientific computing, xxi, 1235 p. (Cambridge University Press, Cambridge, UK ; New York, 2007).

21 target and selected ensembles were rank-ordered by lowest to highest RMSD. Then, the pairs between the target and selected ensembles were chosen to obtain a 1:1 mapping of the members from the selected pool with their lowest-rmsd analogs in the target ensemble. Then, the average RMSD of these pairs was calculated. Pool corruption. READ selections of ensemble of sizes from 1 to 20 were pooled together and supplemented with structures in their direct vicinity to provide a ~1000 conformer pool, sampling the conformational space in agreement with experimental data. A second pool of ~9,000 members were generated by performing analogous pseudo-crystalline simulations with all intra-molecular contacts within Im turned off. Thus, the substrate can still interact with the Spy lattice, but will adopt conformations that are unstructured. These two pools were combined into a single pool that was used to carry out selections as described above (Ensemble Selection). The similarity between the ensemble obtained using this pool and the ensemble obtained from the initial MD pool was estimated by counting the number of members from the initial MD pool in the final ensembles. R Free. All Im atoms that were actively selected were included in the final six-member model. The remaining atoms from the selected conformers were included based on refinement in Phenix 13 in which each of the remaining atoms was added back into the structure one at a time. Contact map generation. For the selected ensemble, we computed intermolecular contact maps between Spy and Im We assigned a contact to a given residue pair if the Cα-Cα distance between the two residues was less than or equal to λ r,0 C,% from the intermolecular potential presented in equation (3), where λ = 1.2. For each intermolecular residue pair, we reported the contact probability averaged over all possible peptide and Spy loop conformations. To project the contact maps onto the structures of Spy and Im on the same scale, the contact frequency for each residue pair was averaged over all residues.

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2 Table 1. Crystallographic data collection, phasing and refinement statistics Native Hg soaked Mn soaked 1 Mn soaked 2 Data collection Space group P2 1 2 1 2 1 P2 1 2 1 2 1 P2 1 2 1 2 1 P2 1 2 1 2 1 Cell

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary materials Figure S1 Fusion protein of Sulfolobus solfataricus SRP54 and a signal peptide. a, Expression vector for the fusion protein. The signal peptide of yeast dipeptidyl aminopeptidase

More information

Introduction to" Protein Structure

Introduction to Protein Structure Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the

More information

Supplementary Figures. Measuring Similarity Between Dynamic Ensembles of Biomolecules

Supplementary Figures. Measuring Similarity Between Dynamic Ensembles of Biomolecules Supplementary Figures Measuring Similarity Between Dynamic Ensembles of Biomolecules Shan Yang, Loïc Salmon 2, and Hashim M. Al-Hashimi 3*. Department of Chemistry, University of Michigan, Ann Arbor, MI,

More information

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature11524 Supplementary discussion Functional analysis of the sugar porter family (SP) signature motifs. As seen in Fig. 5c, single point mutation of the conserved

More information

Supplementary Figure 1. Biochemical and sequence alignment analyses the

Supplementary Figure 1. Biochemical and sequence alignment analyses the Supplementary Figure 1. Biochemical and sequence alignment analyses the interaction of OPTN and TBK1. (a) Analytical gel filtration chromatography analysis of the interaction between TBK1 CTD and OPTN(1-119).

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature11085 Supplementary Tables: Supplementary Table 1. Summary of crystallographic and structure refinement data Structure BRIL-NOP receptor Data collection Number of crystals 23 Space group

More information

IgE binds asymmetrically to its B cell receptor CD23

IgE binds asymmetrically to its B cell receptor CD23 Supplementary Information IgE binds asymmetrically to its B cell receptor CD23 Balvinder Dhaliwal 1*, Marie O. Y. Pang 2, Anthony H. Keeble 2,3, Louisa K. James 2,4, Hannah J. Gould 2, James M. McDonnell

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. Title Author(s) Citation Coarse-Grained Molecular Modeling of the Solution Structure Ensemble of Dengue Virus

More information

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1.

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1. Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1. PDZK1 constru cts Amino acids MW [kda] KD [μm] PEPT2-CT- FITC KD [μm] NHE3-CT- FITC KD [μm] PDZK1-CT-

More information

Acta Cryst. (2017). D73, doi: /s

Acta Cryst. (2017). D73, doi: /s Acta Cryst. (2017). D73, doi:10.1107/s2059798317010932 Supporting information Volume 73 (2017) Supporting information for article: Designing better diffracting crystals of biotin carboxyl carrier protein

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature11054 Supplementary Fig. 1 Sequence alignment of Na v Rh with NaChBac, Na v Ab, and eukaryotic Na v and Ca v homologs. Secondary structural elements of Na v Rh are indicated above the

More information

Table S1. Primers used for the constructions of recombinant GAL1 and λ5 mutants. GAL1-E74A ccgagcagcgggcggctgtctttcc ggaaagacagccgcccgctgctcgg

Table S1. Primers used for the constructions of recombinant GAL1 and λ5 mutants. GAL1-E74A ccgagcagcgggcggctgtctttcc ggaaagacagccgcccgctgctcgg SUPPLEMENTAL DATA Table S1. Primers used for the constructions of recombinant GAL1 and λ5 mutants Sense primer (5 to 3 ) Anti-sense primer (5 to 3 ) GAL1 mutants GAL1-E74A ccgagcagcgggcggctgtctttcc ggaaagacagccgcccgctgctcgg

More information

Protein Structure Determination from Pseudocontact Shifts Using ROSETTA

Protein Structure Determination from Pseudocontact Shifts Using ROSETTA Supporting Information Protein Structure Determination from Pseudocontact Shifts Using ROSETTA Christophe Schmitz, Robert Vernon, Gottfried Otting, David Baker and Thomas Huber Table S0. Biological Magnetic

More information

Introduction to Computational Structural Biology

Introduction to Computational Structural Biology Introduction to Computational Structural Biology Part I 1. Introduction The disciplinary character of Computational Structural Biology The mathematical background required and the topics covered Bibliography

More information

Nature Structural and Molecular Biology: doi: /nsmb.2938

Nature Structural and Molecular Biology: doi: /nsmb.2938 Supplementary Figure 1 Characterization of designed leucine-rich-repeat proteins. (a) Water-mediate hydrogen-bond network is frequently visible in the convex region of LRR crystal structures. Examples

More information

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small

More information

Nature Structural & Molecular Biology: doi: /nsmb.3194

Nature Structural & Molecular Biology: doi: /nsmb.3194 Supplementary Figure 1 Mass spectrometry and solution NMR data for -syn samples used in this study. (a) Matrix-assisted laser-desorption and ionization time-of-flight (MALDI-TOF) mass spectrum of uniformly-

More information

THE CRYSTAL STRUCTURE OF THE SGT1-SKP1 COMPLEX: THE LINK BETWEEN

THE CRYSTAL STRUCTURE OF THE SGT1-SKP1 COMPLEX: THE LINK BETWEEN THE CRYSTAL STRUCTURE OF THE SGT1-SKP1 COMPLEX: THE LINK BETWEEN HSP90 AND BOTH SCF E3 UBIQUITIN LIGASES AND KINETOCHORES Oliver Willhoft, Richard Kerr, Dipali Patel, Wenjuan Zhang, Caezar Al-Jassar, Tina

More information

Supporting Information

Supporting Information Supporting Information Micelle-Triggered b-hairpin to a-helix Transition in a 14-Residue Peptide from a Choline-Binding Repeat of the Pneumococcal Autolysin LytA HØctor Zamora-Carreras, [a] Beatriz Maestro,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary Table 1: Amplitudes of three current levels. Level 0 (pa) Level 1 (pa) Level 2 (pa) TrkA- TrkH WT 200 K 0.01 ± 0.01 9.5 ± 0.01 18.7 ± 0.03 200 Na * 0.001 ± 0.01 3.9 ± 0.01 12.5 ± 0.03 200

More information

Protein Structures. 11/19/2002 Lecture 24 1

Protein Structures. 11/19/2002 Lecture 24 1 Protein Structures 11/19/2002 Lecture 24 1 All 3 figures are cartoons of an amino acid residue. 11/19/2002 Lecture 24 2 Peptide bonds in chains of residues 11/19/2002 Lecture 24 3 Angles φ and ψ in the

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION Structure of human carbamoyl phosphate synthetase: deciphering the on/off switch of human ureagenesis Sergio de Cima, Luis M. Polo, Carmen Díez-Fernández, Ana I. Martínez, Javier

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

Acta Crystallographica Section D

Acta Crystallographica Section D Supporting information Acta Crystallographica Section D Volume 70 (2014) Supporting information for article: Structural characterization of the virulence factor Nuclease A from Streptococcus agalactiae

More information

Structural basis of PROTAC cooperative recognition for selective protein degradation

Structural basis of PROTAC cooperative recognition for selective protein degradation SUPPLEMENTARY INFORMATION Structural basis of PROTAC cooperative recognition for selective protein degradation Morgan S. Gadd 1, Andrea Testa 1, Xavier Lucas 1, Kwok-Ho Chan, Wenzhang Chen, Douglas J.

More information

Unfolding CspB by means of biased molecular dynamics

Unfolding CspB by means of biased molecular dynamics Chapter 4 Unfolding CspB by means of biased molecular dynamics 4.1 Introduction Understanding the mechanism of protein folding has been a major challenge for the last twenty years, as pointed out in the

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

Structure and RNA-binding properties. of the Not1 Not2 Not5 module of the yeast Ccr4 Not complex

Structure and RNA-binding properties. of the Not1 Not2 Not5 module of the yeast Ccr4 Not complex Structure and RNA-binding properties of the Not1 Not2 Not5 module of the yeast Ccr4 Not complex Varun Bhaskar 1, Vladimir Roudko 2,3, Jerome Basquin 1, Kundan Sharma 4, Henning Urlaub 4, Bertrand Seraphin

More information

Chapter 6. The interaction of Src SH2 with the focal adhesion kinase catalytic domain studied by NMR

Chapter 6. The interaction of Src SH2 with the focal adhesion kinase catalytic domain studied by NMR The interaction of Src SH2 with the focal adhesion kinase catalytic domain studied by NMR 103 Abstract The interaction of the Src SH2 domain with the catalytic domain of FAK, including the Y397 SH2 domain

More information

Supplementary Information. The protease GtgE from Salmonella exclusively targets. inactive Rab GTPases

Supplementary Information. The protease GtgE from Salmonella exclusively targets. inactive Rab GTPases Supplementary Information The protease GtgE from Salmonella exclusively targets inactive Rab GTPases Table of Contents Supplementary Figures... 2 Supplementary Figure 1... 2 Supplementary Figure 2... 3

More information

Supplemental Information

Supplemental Information Supplemental Information Combinatorial Readout of Unmodified H3R2 and Acetylated H3K14 by the Tandem PHD Finger of MOZ Reveals a Regulatory Mechanism for HOXA9 Transcription Yu Qiu 1, Lei Liu 1, Chen Zhao

More information

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC The precise definition of a dihedral or torsion angle can be found in spatial geometry Angle between to planes Dihedral

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary Results DNA binding property of the SRA domain was examined by an electrophoresis mobility shift assay (EMSA) using synthesized 12-bp oligonucleotide duplexes containing unmodified, hemi-methylated,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature10955 Supplementary Figures Supplementary Figure 1. Electron-density maps and crystallographic dimer structures of the motor domain. (a f) Stereo views of the final electron-density maps

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Supplementary Figures:

Supplementary Figures: Supplementary Figures: Supplementary Figure 1: The two strings converge to two qualitatively different pathways. A) Models of active (red) and inactive (blue) states used as end points for the string calculations

More information

Supplemental Information. Molecular Basis of Spectral Diversity. in Near-Infrared Phytochrome-Based. Fluorescent Proteins

Supplemental Information. Molecular Basis of Spectral Diversity. in Near-Infrared Phytochrome-Based. Fluorescent Proteins Chemistry & Biology, Volume 22 Supplemental Information Molecular Basis of Spectral Diversity in Near-Infrared Phytochrome-Based Fluorescent Proteins Daria M. Shcherbakova, Mikhail Baloban, Sergei Pletnev,

More information

Quantitative Stability/Flexibility Relationships; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 12

Quantitative Stability/Flexibility Relationships; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 12 Quantitative Stability/Flexibility Relationships; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 12 The figure shows that the DCM when applied to the helix-coil transition, and solved

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Identification of the ScDcp2 minimal region interacting with both ScDcp1 and the ScEdc3 LSm domain. Pull-down experiment of untagged ScEdc3 LSm with various ScDcp1-Dcp2-His 6 fragments.

More information

Computational engineering of cellulase Cel9A-68 functional motions through mutations in its linker region. WT 1TF4 (crystal) -90 ERRAT PROVE VERIFY3D

Computational engineering of cellulase Cel9A-68 functional motions through mutations in its linker region. WT 1TF4 (crystal) -90 ERRAT PROVE VERIFY3D Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is the Owner Societies 218 Supplementary Material: Computational engineering of cellulase Cel9-68 functional

More information

type GroEL-GroES complex. Crystals were grown in buffer D (100 mm HEPES, ph 7.5,

type GroEL-GroES complex. Crystals were grown in buffer D (100 mm HEPES, ph 7.5, Supplementary Material Supplementary Materials and Methods Structure Determination of SR1-GroES-ADP AlF x SR1-GroES-ADP AlF x was purified as described in Materials and Methods for the wild type GroEL-GroES

More information

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Department of Chemical Engineering Program of Applied and

More information

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros Ensemble refinement of protein crystal structures in PHENIX Tom Burnley Piet Gros Incomplete modelling of disorder contributes to R factor gap Only ~5% of residues in the PDB are modelled with more than

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION www.nature.com/nature 1 Figure S1 Sequence alignment. a Structure based alignment of the plgic of E. chrysanthemi (ELIC), the acetylcholine binding protein from the snail Lymnea stagnalis (AchBP, PDB code

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Fig. 1 Influences of crystal lattice contacts on Pol η structures. a. The dominant lattice contact between two hpol η molecules (silver and gold) in the type 1 crystals. b. A close-up view of the hydrophobic

More information

Dipolar Couplings in Partially Aligned Macromolecules - New Directions in. Structure Determination using Solution State NMR.

Dipolar Couplings in Partially Aligned Macromolecules - New Directions in. Structure Determination using Solution State NMR. Dipolar Couplings in Partially Aligned Macromolecules - New Directions in Structure Determination using Solution State NMR. Recently developed methods for the partial alignment of macromolecules in dilute

More information

Lecture 11: Potential Energy Functions

Lecture 11: Potential Energy Functions Lecture 11: Potential Energy Functions Dr. Ronald M. Levy ronlevy@temple.edu Originally contributed by Lauren Wickstrom (2011) Microscopic/Macroscopic Connection The connection between microscopic interactions

More information

Supplementary information for:

Supplementary information for: SUPPLEMETARY IFRMATI Supplementary information for: Structure of a β 1 -adrenergic G protein-coupled receptor Tony Warne, Maria J. Serrano-Vega, Jillian G. Baker#, Rouslan Moukhametzianov, Patricia C.

More information

Supplementary Figures

Supplementary Figures 1 Supplementary Figures Supplementary Figure 1 Type I FGFR1 inhibitors (a) Chemical structures of a pyrazolylaminopyrimidine inhibitor (henceforth referred to as PAPI; PDB-code of the FGFR1-PAPI complex:

More information

T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y. jgp

T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y. jgp S u p p l e m e n ta l m at e r i a l jgp Lee et al., http://www.jgp.org/cgi/content/full/jgp.201411219/dc1 T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y S u p p l e m e n ta l D I S C U S

More information

NB-DNJ/GCase-pH 7.4 NB-DNJ+/GCase-pH 7.4 NB-DNJ+/GCase-pH 4.5

NB-DNJ/GCase-pH 7.4 NB-DNJ+/GCase-pH 7.4 NB-DNJ+/GCase-pH 4.5 SUPPLEMENTARY TABLES Suppl. Table 1. Protonation states at ph 7.4 and 4.5. Protonation states of titratable residues in GCase at ph 7.4 and 4.5. Histidine: HID, H at δ-nitrogen; HIE, H at ε-nitrogen; HIP,

More information

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l Phase problem: Determining an initial phase angle α hkl for each recorded reflection 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l Methods: Heavy atom methods (isomorphous replacement Hg, Pt)

More information

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007

Softwares for Molecular Docking. Lokesh P. Tripathi NCBS 17 December 2007 Softwares for Molecular Docking Lokesh P. Tripathi NCBS 17 December 2007 Molecular Docking Attempt to predict structures of an intermolecular complex between two or more molecules Receptor-ligand (or drug)

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION In the format provided by the authors and unedited. SUPPLEMENTARY INFORMATION DOI: 10.1038/NCHEM.2720 1 2 3 Tuning underwater adhesion with cation-π interactions Matthew A. Gebbie, Wei Wei, Alex M. Schrader,

More information

Molecular Modeling lecture 2

Molecular Modeling lecture 2 Molecular Modeling 2018 -- lecture 2 Topics 1. Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction 4. Where do protein structures come from? X-ray crystallography

More information

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding? The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation By Jun Shimada and Eugine Shaknovich Bill Hawse Dr. Bahar Elisa Sandvik and Mehrdad Safavian Outline Background on protein

More information

Prediction and refinement of NMR structures from sparse experimental data

Prediction and refinement of NMR structures from sparse experimental data Prediction and refinement of NMR structures from sparse experimental data Jeff Skolnick Director Center for the Study of Systems Biology School of Biology Georgia Institute of Technology Overview of talk

More information

Supplementary Information. The Solution Structural Ensembles of RNA Kink-turn Motifs and Their Protein Complexes

Supplementary Information. The Solution Structural Ensembles of RNA Kink-turn Motifs and Their Protein Complexes Supplementary Information The Solution Structural Ensembles of RNA Kink-turn Motifs and Their Protein Complexes Xuesong Shi, a Lin Huang, b David M. J. Lilley, b Pehr B. Harbury a,c and Daniel Herschlag

More information

Structural characterization of NiV N 0 P in solution and in crystal.

Structural characterization of NiV N 0 P in solution and in crystal. Supplementary Figure 1 Structural characterization of NiV N 0 P in solution and in crystal. (a) SAXS analysis of the N 32-383 0 -P 50 complex. The Guinier plot for complex concentrations of 0.55, 1.1,

More information

Better Bond Angles in the Protein Data Bank

Better Bond Angles in the Protein Data Bank Better Bond Angles in the Protein Data Bank C.J. Robinson and D.B. Skillicorn School of Computing Queen s University {robinson,skill}@cs.queensu.ca Abstract The Protein Data Bank (PDB) contains, at least

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:1.138/nature1737 Supplementary Table 1 variant Description FSEC - 2B12 a FSEC - 6A1 a K d (leucine) c Leucine uptake e K (wild-type like) K (Y18F) K (TS) K (TSY) K288A mutant, lipid facing side chain

More information

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate

More information

The Effect of Motional Averaging on the Calculation of NMR-Derived Structural Properties

The Effect of Motional Averaging on the Calculation of NMR-Derived Structural Properties PROTEINS: Structure, Function, and Genetics 6:542 555 (1999) The Effect of Motional Averaging on the Calculation of NMR-Derived Structural Properties Xavier Daura, 1 Iris Antes, 1,2 Wilfred F. van Gunsteren,

More information

Stabilizing the CH2 domain of an Antibody by Engineering in an Enhanced Aromatic Sequon

Stabilizing the CH2 domain of an Antibody by Engineering in an Enhanced Aromatic Sequon Stabilizing the CH2 domain of an Antibody by Engineering in an Enhanced Aromatic Sequon Wentao Chen,, Leopold Kong, Stephen Connelly, Julia M. Dendle,, Yu Liu,, Ian A. Wilson,#, Evan T. Powers, *, Jeffery

More information

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years.

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years. Structure Determination and Sequence Analysis The vast majority of the experimentally determined three-dimensional protein structures have been solved by one of two methods: X-ray diffraction and Nuclear

More information

RNA Polymerase I Contains a TFIIF-Related DNA-Binding Subcomplex

RNA Polymerase I Contains a TFIIF-Related DNA-Binding Subcomplex Molecular Cell, Volume 39 Supplemental Information RNA Polymerase I Contains a TFIIFRelated DNABinding Subcomplex Sebastian R. Geiger, Kristina Lorenzen, Amelie Schreieck, Patrizia Hanecker, Dirk Kostrewa,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature17991 Supplementary Discussion Structural comparison with E. coli EmrE The DMT superfamily includes a wide variety of transporters with 4-10 TM segments 1. Since the subfamilies of the

More information

Introduction solution NMR

Introduction solution NMR 2 NMR journey Introduction solution NMR Alexandre Bonvin Bijvoet Center for Biomolecular Research with thanks to Dr. Klaartje Houben EMBO Global Exchange course, IHEP, Beijing April 28 - May 5, 20 3 Topics

More information

Supplemental Data. Structure of the Rb C-Terminal Domain. Bound to E2F1-DP1: A Mechanism. for Phosphorylation-Induced E2F Release

Supplemental Data. Structure of the Rb C-Terminal Domain. Bound to E2F1-DP1: A Mechanism. for Phosphorylation-Induced E2F Release Supplemental Data Structure of the Rb C-Terminal Domain Bound to E2F1-DP1: A Mechanism for Phosphorylation-Induced E2F Release Seth M. Rubin, Anne-Laure Gall, Ning Zheng, and Nikola P. Pavletich Section

More information

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key Biology 5357 Chemistry & Physics of Biomolecules Examination #1 Proteins Module September 29, 2017 Answer Key Question 1 (A) (5 points) Structure (b) is more common, as it contains the shorter connection

More information

Solving the three-dimensional solution structures of larger

Solving the three-dimensional solution structures of larger Accurate and rapid docking of protein protein complexes on the basis of intermolecular nuclear Overhauser enhancement data and dipolar couplings by rigid body minimization G. Marius Clore* Laboratory of

More information

Computational Modeling of Protein Kinase A and Comparison with Nuclear Magnetic Resonance Data

Computational Modeling of Protein Kinase A and Comparison with Nuclear Magnetic Resonance Data Computational Modeling of Protein Kinase A and Comparison with Nuclear Magnetic Resonance Data ABSTRACT Keyword Lei Shi 1 Advisor: Gianluigi Veglia 1,2 Department of Chemistry 1, & Biochemistry, Molecular

More information

Supplementary Materials for

Supplementary Materials for advances.sciencemag.org/cgi/content/full/4/1/eaau413/dc1 Supplementary Materials for Structure and dynamics conspire in the evolution of affinity between intrinsically disordered proteins Per Jemth*, Elin

More information

Electro-Mechanical Conductance Modulation of a Nanopore Using a Removable Gate

Electro-Mechanical Conductance Modulation of a Nanopore Using a Removable Gate Electro-Mechanical Conductance Modulation of a Nanopore Using a Removable Gate Shidi Zhao a, Laura Restrepo-Pérez b, Misha Soskine c, Giovanni Maglia c, Chirlmin Joo b, Cees Dekker b and Aleksei Aksimentiev

More information

SUPPLEMENTARY MATERIAL FOR

SUPPLEMENTARY MATERIAL FOR SUPPLEMENTARY MATERIAL FOR THE LIPID-BINDING DOMAIN OF WILD TYPE AND MUTANT ALPHA- SYNUCLEIN: COMPACTNESS AND INTERCONVERSION BETWEEN THE BROKEN- AND EXTENDED-HELIX FORMS. Elka R. Georgieva 1, Trudy F.

More information

Molecular Biology Course 2006 Protein Crystallography Part II

Molecular Biology Course 2006 Protein Crystallography Part II Molecular Biology Course 2006 Protein Crystallography Part II Tim Grüne University of Göttingen Dept. of Structural Chemistry December 2006 http://shelx.uni-ac.gwdg.de tg@shelx.uni-ac.gwdg.de Overview

More information

Measuring quaternary structure similarity using global versus local measures.

Measuring quaternary structure similarity using global versus local measures. Supplementary Figure 1 Measuring quaternary structure similarity using global versus local measures. (a) Structural similarity of two protein complexes can be inferred from a global superposition, which

More information

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015 Overview & Applications T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 4 June, 215 Simulations still take time Bakan et al. Bioinformatics 211. Coarse-grained Elastic

More information

Analysis of the simulation

Analysis of the simulation Analysis of the simulation Marcus Elstner and Tomáš Kubař January 7, 2014 Thermodynamic properties time averages of thermodynamic quantites correspond to ensemble averages (ergodic theorem) some quantities

More information

Modeling for 3D structure prediction

Modeling for 3D structure prediction Modeling for 3D structure prediction What is a predicted structure? A structure that is constructed using as the sole source of information data obtained from computer based data-mining. However, mixing

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary Table 1: Data collection, phasing and refinement statistics ChbC/Ta 6 Br 12 Native ChbC Data collection Space group P4 3 2 1 2 P4 3 2 1 2 Cell dimensions a, c (Å) 132.75, 453.57 132.81, 452.95

More information

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190 Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190 Programs for NMR Based Structure Determination CNS - Brunger, A. T.; Adams, P. D.; Clore, G. M.; DeLano, W. L.; Gros, P.; Grosse-Kunstleve,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Figure S1. Secondary structure of CAP (in the camp 2 -bound state) 10. α-helices are shown as cylinders and β- strands as arrows. Labeling of secondary structure is indicated. CDB, DBD and the hinge are

More information

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS TASKQUARTERLYvol.20,No4,2016,pp.353 360 PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS MARTIN ZACHARIAS Physics Department T38, Technical University of Munich James-Franck-Str.

More information

Experimental and Computational Mutagenesis to Investigate the. Positioning of a General Base within an Enzyme Active Site

Experimental and Computational Mutagenesis to Investigate the. Positioning of a General Base within an Enzyme Active Site Experimental and Computational Mutagenesis to Investigate the Positioning of a General Base within an Enzyme Active Site Jason P. Schwans, Philip Hanoian, Benjamin J. Lengerich, Fanny Sunden, Ana Gonzalez

More information

Homologous proteins have similar structures and structural superposition means to rotate and translate the structures so that corresponding atoms are

Homologous proteins have similar structures and structural superposition means to rotate and translate the structures so that corresponding atoms are 1 Homologous proteins have similar structures and structural superposition means to rotate and translate the structures so that corresponding atoms are as close to each other as possible. Structural similarity

More information

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Taking the advice of Lord Kelvin, the Father of Thermodynamics, I describe the protein molecule and other

More information

Orientational degeneracy in the presence of one alignment tensor.

Orientational degeneracy in the presence of one alignment tensor. Orientational degeneracy in the presence of one alignment tensor. Rotation about the x, y and z axes can be performed in the aligned mode of the program to examine the four degenerate orientations of two

More information

Isothermal experiments characterize time-dependent aggregation and unfolding

Isothermal experiments characterize time-dependent aggregation and unfolding 1 Energy Isothermal experiments characterize time-dependent aggregation and unfolding Technical ote Introduction Kinetic measurements have, for decades, given protein scientists insight into the mechanisms

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Focus on PNA Flexibility and RNA Binding using Molecular Dynamics and Metadynamics

Focus on PNA Flexibility and RNA Binding using Molecular Dynamics and Metadynamics SUPPLEMENTARY INFORMATION Focus on PNA Flexibility and RNA Binding using Molecular Dynamics and Metadynamics Massimiliano Donato Verona 1, Vincenzo Verdolino 2,3,*, Ferruccio Palazzesi 2,3, and Roberto

More information

Full-length GlpG sequence was generated by PCR from E. coli genomic DNA. (with two sequence variations, D51E/L52V, from the gene bank entry aac28166),

Full-length GlpG sequence was generated by PCR from E. coli genomic DNA. (with two sequence variations, D51E/L52V, from the gene bank entry aac28166), Supplementary Methods Protein expression and purification Full-length GlpG sequence was generated by PCR from E. coli genomic DNA (with two sequence variations, D51E/L52V, from the gene bank entry aac28166),

More information

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP) Joana Pereira Lamzin Group EMBL Hamburg, Germany Small molecules How to identify and build them (with ARP/wARP) The task at hand To find ligand density and build it! Fitting a ligand We have: electron

More information

Supplementary figure 1 Application of tmfret in LeuT. (a) To assess the feasibility of using tmfret for distance-dependent measurements in LeuT, a

Supplementary figure 1 Application of tmfret in LeuT. (a) To assess the feasibility of using tmfret for distance-dependent measurements in LeuT, a Supplementary figure 1 Application of tmfret in LeuT. (a) To assess the feasibility of using tmfret for distance-dependent measurements in LeuT, a series of tmfret-pairs comprised of single cysteine mutants

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Resonance assignment and NMR spectra for hairpin and duplex A 6 constructs. (a) 2D HSQC spectra of hairpin construct (hp-a 6 -RNA) with labeled assignments. (b) 2D HSQC or SOFAST-HMQC

More information