Supplementary Information Intrinsic Localized Modes in Proteins

Similar documents
Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Proteins: Characteristics and Properties of Amino Acids

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Supporting Information

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Oxygen Binding in Hemocyanin

Unraveling the degradation of artificial amide bonds in Nylon oligomer hydrolase: From induced-fit to acylation processes

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

Biophysical Journal, Volume 96. Supporting Material

April, The energy functions include:

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Electronic Supplementary Information

Supplementary Information

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

Packing of Secondary Structures

NMR Assignments using NMRView II: Sequential Assignments

Physiochemical Properties of Residues

Protein Fragment Search Program ver Overview: Contents:

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Geometrical Concept-reduction in conformational space.and his Φ-ψ Map. G. N. Ramachandran

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Supporting Information

Computer simulations of protein folding with a small number of distance restraints

----- Ver October 24, 2014 Bug about reading MOPAC2012 Ver.14 calculations of 1 atom and 2 atoms molecule was fixed.

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Peptides And Proteins

FW 1 CDR 1 FW 2 CDR 2

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

Chapter 4: Amino Acids

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

The translation machinery of the cell works with triples of types of RNA bases. Any triple of RNA bases is known as a codon. The set of codons is

C CH 3 N C COOH. Write the structural formulas of all of the dipeptides that they could form with each other.

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

Supplementary Figure S1. Urea-mediated buffering mechanism of H. pylori. Gastric urea is funneled to a cytoplasmic urease that is presumably attached

Exam I Answer Key: Summer 2006, Semester C

Supplement information

Peptide Syntheses. Illustrative Protection: BOC/ t Bu. A. Introduction. do not acid

Amino acids: Optimization in underivatized formula and fragment patterns in LC/ESI-MS analysis. Yoshinori Takano (JAMSTEC)

Full wwpdb X-ray Structure Validation Report i

Dominant Paths in Protein Folding

β1 Structure Prediction and Validation

Supporting information

HSQC spectra for three proteins

X-ray crystallography NMR Cryoelectron microscopy

Bacterial protease uses distinct thermodynamic signatures for substrate recognition

Full wwpdb X-ray Structure Validation Report i

Model Mélange. Physical Models of Peptides and Proteins

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Diastereomeric resolution directed towards chirality. determination focussing on gas-phase energetics of coordinated. sodium dissociation

Supporting Online Material for

Amino Acids and Peptides

7.012 Problem Set 1 Solutions

Supplementary Materials for

Properties of amino acids in proteins

Deciphering the Rules for Amino Acid Co- Assembly Based on Interlayer Distances

Rho1 binding site PtdIns(4,5)P2 binding site Both sites

Molecular Structure Prediction by Global Optimization

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

Section Week 3. Junaid Malek, M.D.

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

L L. Figure by MIT OCW.

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

Computational Protein Design

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Journal of Pharmacology and Experimental Therapy-JPET#172536

Basic Principles of Protein Structures

Structure Investigation of Fam20C, a Golgi Casein Kinase

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

Protein Structure and Visualisation. Introduction to PDB and PyMOL

Electronic Supplementary Information

Lecture 15: Realities of Genome Assembly Protein Sequencing

Any protein that can be labelled by both procedures must be a transmembrane protein.

Protein Structure Bioinformatics Introduction

Protein Structures: Experiments and Modeling. Patrice Koehl

Translation. A ribosome, mrna, and trna.

A Plausible Model Correlates Prebiotic Peptide Synthesis with. Primordial Genetic Code

Background BIOCHEMISTRY LAB CHE-554. Experiment #1 Spectrophotometry

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

Course Notes: Topics in Computational. Structural Biology.

Structural insights into energy regulation of light-harvesting complex from spinach CP29

Chemistry Chapter 22

Frequency Response of a Protein to Local Conformational Perturbations

Solutions In each case, the chirality center has the R configuration

Sequence Based Bioinformatics

Supplemental Information for: Characterizing the Membrane-Bound State of Cytochrome P450 3A4: Structure, Depth of Insertion and Orientation

Full wwpdb X-ray Structure Validation Report i

Transcription:

Supplementary Information Intrinsic Localized Modes in Proteins Adrien Nicolaï 1,, Patrice Delarue and Patrick Senet, 1 Department of Physics, Applied Physics and Astronomy, Rensselaer Polytechnic Institute, Troy, 118 New York, United States. Laboratoire Interdisciplinaire Carnot de Bourgogne, UMR 633 CNRS-Univ. Bourgogne Franche-Comté, 9 Av. A. Savary, BP 47 87, F-178 Dijon Cedex, France. Corresponding author: psenet@u-bourgogne.fr. List of Supplementary Figures: - Figure 1 - Figure - Figure 3 - Figure 4 - Figure 5 - Figure 6 - Figure 7 List of Supplementary Tables: - Table 1 - Table - Table 3 - Table 4 - Table 5 List of Supplementary Movies: - Movie 1 1

i i a b u u 4. 3.. 1. 4. 3.. 1. ILM # 3 4 5 6 7 8 9 1 11 1 13 14 15 16 17 18 vector index i ILM #18 3 4 5 6 7 8 9 1 11 1 13 14 15 16 17 18 vector index i Supplementary Figure 1: Amplitudes u i of ILMs i = and 18 as a function of vector index.

3 4 5 6 7 8 9 1 11 1 13 14 15 16 17 θ 18 18 1 o 6 18-18 18 γ 18 o 8 kt 6 4 Supplementary Figure : Effective free-energy map V (γ, θ) n for n = to 18 computed from MD run 1 of Trp-cage at T = 3 K. Black lines represent free-energy isolines V V min = 3k B T. 3

Supplementary Figure 3: Effective free-energy map V (γ, θ)n for n = to 18 computed from MD run 1 of Trp-cage at T = 38 K. Black lines represent free-energy isolines V Vmin = 3kB T computed from the effective free-energy maps V (γ, θ)n at T = 3 K (see Supplementary Figure ). Gray diamond and triangle represent (γ, θ) values of typical native and non-native conformations of Trp-cage, respectively, as shown in Figure 4 (c). 4

3 4 5 6 7 8 9 1 11 1 13 14 15 16 17 18 θ 18 1 6 18-18 18 γ 18 Supplementary Figure 4: (γ, θ) n for n = to 18 computed from NMR-derived structures provided in the PDB file 1LY. Black lines represent free-energy isolines V V min = 3k B T computed from the effective free-energy maps V (γ, θ) n at T = 3 K (see Supplementary Figure ). 5

a Trp-cage run 38 K 1 native NC [%] 75. non-native 5 5. 4. 1 3 5 simulation time [ns] b t = 54337 ps t = 45116 ps RMSD = 1.11 nm NC = 5.9% RMSD = 1.11 nm NC = 17.6% RMSD [nm] 4 Δui 1.4 1. 1..8.6.4. t = 31 ps 1 3 4 5 simulation time [ns] t = 9776 ps RMSD = 7 nm NC = 8.4% RMSD = 6 nm NC = 94.1% Supplementary Figure 5: (a) Native character (N C) (in %) as a function of simulation time for MD run of the Trp-cage protein at T = 38 K. Color line using the BGR palette (from blue, which corresponds to N C = 1%, green, which corresponds to N C = 5%, to red, which corresponds to N C = %) represents N C computed every 1 ps and the black line represents the mobile average of the N C computed on a time window of 1 ns. Gray impulses shown on the time axis of the lower graph represent the occurrence of an ILM (S <.5) along the MD trajectory. (b) Root mean-square deviation (RM SD in nm) as a function of simulation time for MD run of Trp-cage at T = 38 K. All-atom structures of Trp-cage for different snapshots are shown in a cartoon plus ball and stick representation. N C for each snapshot is also indicated. 6

a Trp-cage run 3 38 K 1 native NC [%] 75. non-native 5 5. 1 b 3 simulation time [ns] 4 HP-36 run 1 38 K 1 native 75. NC [%] 4. 5 Δui non-native 5 5. 4. 1 c 3 simulation time [ns] 4 Δui 5 HP-36 run 38 K 1 native NC [%] 75. 5 5. non-native 4. 5 1 d 3 simulation time [ns] 4 VA3 run 1 38 K 1 native 75. NC [%] Δui 5 5. non-native 4. 5 1 3 4 Δui simulation time [ns] Supplementary Figure 6: Native character (N C) (in %) as a function of simulation time for MD run 3 of Trp-cage (panel a), MD runs 1 and of the HP-36 (panels b and c) and MD run 1 of VA3 (panel d) at T = 38 K. Color line using the BGR palette (from blue, which corresponds to N C = 1%, green, which corresponds to N C = 5%, to red, which corresponds to N C = %) represents N C computed every 1 ps and the black line represents the mobile average of the N C computed on a time window of 1 ns. Gray impulses shown on the time axis of the lower graph represent the occurrence of an ILM (S <.5) along the MD trajectory. 7

a b ρ(δu, Δu Smin ) i i i Δu 1..9.8.7.6 ρmin.3.4.5.6.5. 1.5 1..5 Smin ρmin Smin Entropy S(t) 3 4 5 6 7 8 9 1 11 1 13 14 15 16 17 18 vector index i Supplementary Figure 7: Correlation analysis between ILMs. (a) Correlation ρ between u i profiles along the sequence with S.6 and u i Smin profile along the sequence characterized by S minimum (S =.4). (b) u i profiles along the sequence characterized by S min (blue line) and ρ min (red line), as shown in panel a..5..15.1 5 8

Supplementary Table 1: Number N ILM of ILMs of soliton type detected in MD simulations of the HP-36 protein at T = 38 K and their localization along the amino acid sequence (residues involved are listed between parenthesis using the three letters code). T = 38 K T = 3 K run 1 run run 1 N ILM 88 65 35 N ILM /N t (in %) 176 13 7 i = 4 (MET-LEU-SER-ASP) 1 1 3 i = 51 (VAL-PHE-GLY-MET) 3 3 i = 73 (GLU-LYS-GLY-LEU) 81 59 9 i = 74 (LYS-GLY-LEU-PHE) 3 1 9

Supplementary Table : Number N ILM of LMs of soliton type detected in MD simulations of VA3 at T = 38 K and their localization along the amino acid sequence (residues involved are listed between parenthesis using the three letters code). T = 38 K T = 3 K run 1 run 1 N ILM 34 33 N ILM /N t (in %) 68 66 i = 19 (LEU-THR-GLY-ALA) 1 - i = 36 (ILE-SER-GLY-SER) 3 33 i = 37 (SER-GLY-SER-THR) 1-1

Supplementary Table 3: Number N ILM of ILMs of the soliton type detected in the MD simulations of Trp-cage and their localization along the amino acid sequence (residues involved are listed between parenthesis using the three letters code). For each mutant, each residue replaced by ALA is marked by a symbol. All MD runs were of a 5 ns duration except run, at T = 3 K ( ns, see the Methods section). T = 38 K T = 3 K run 1 run run 3 mutant + mutant* run 1 run N ILM 51 3 199 31 58 71 17 N ILM /N t (in %) 5 6 398 64 116 14 85 i = (ASN-LEU-TYR-ILE) 4 3 3-6 - - i = 7 (TRP-LEU-LYS-ASP) - 1 - - - - - i = 8 (LEU-LYS-ASP-GLY) - - - 1 - - - i = 11 (GLY*-GLY*-PRO-SER) - 1-4 1-1 i = 1 (GLY*-PRO-SER-SER) - 1 - - 1 - - i = 13 (PRO-SER-SER-GLY*) - - - 1 1 - - i = 15 (SER-GLY +, *-ARG-PRO) 1 6 3-3 - i = 16 (GLY*-ARG-PRO-PRO) - - 1 1 - - i = 17 (ARG-PRO-PRO-PRO) 1-1 6 4-11

Supplementary Table 4: Number N ILM of ILMs of soliton type detected in MD simulations of the Trpcage protein using CHARMM7 and AMBER99SB*-ILDN-q force-fields and their localization along the amino acid sequence (residues involved are listed between parenthesis using the three letters code). T = 38 K T = 3 K run CHARMM run AMBER run CHARMM run AMBER N ILM 54 96 18 74 N ILM /N t (in %) 58 59 48 148 i = (ASN-LEU-TYR-ILE) 3 1 1 - i = 7 (TRP-LEU-LYS-ASP) - 1 - - i = 9 (LYS-ASP-GLY-GLY) 18 66 4 - i = 1 (ASP-GLY-GLY-PRO) 57 41 - i = 11 (GLY-GLY-PRO-SER) 9 1 5 - i = 1 (GLY-PRO-SER-SER) - - - - i = 13 (PRO-SER-SER-GLY) 7 - - i = 14 (SER-SER-GLY-ARG) 15 136 - - i = 15 (SER-GLY-ARG-PRO) 3 3 1 8 i = 16 (GLY-ARG-PRO-PRO) 8-4 - i = 17 (ARG-PRO-PRO-PRO) 17 5 13 7 i = 18 (PRO-PRO-PRO-SER) 117 4 98 59 Solitons i = 9,1 and 14 89 43 6-1

Supplementary Table 5: Comparative test of the number of ILMs as a function of the S cutoff value for the run 1 of Trp-cage at 38 K. S cutoff N ILM i = 9 i = 1 i = 14 i = 18 N n N nn P nn /P n.45 41 15 16 3 6-34 -.5 51 95 93 33 4 1 7.8.55 995 355 376 15 94 1 871 9.3.6 3177 997 1138 345 314 77 736 4.6 13

Supplementary Movie 1: Typical time evolution of an ILM of soliton type in MD trajectories of Trpcage of 1 ns duration for which the coordinates were recorded every fs. The time t = corresponds to the detection of soliton 3 in the MD trajectory shown in Figure 5 (static dashed line in the video). The variation of u i as function of time between t = 1 ps and t =.5 ps is represented every fs by crosses along the amino acid sequence. The crosses are joined by full lines to emphasize the soliton profile. The time window is identical to the one shown in Figure 6 b and Figure 6 c. 14