Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

Similar documents
What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Geometrical Concept-reduction in conformational space.and his Φ-ψ Map. G. N. Ramachandran

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supplementary Information

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Packing of Secondary Structures

Supporting information for: Mechanism of lignin inhibition of enzymatic. biomass deconstruction

Computer simulations of protein folding with a small number of distance restraints

Coupling Protein Dynamics with Enzyme Catalysis in Human Carbonic Anhydrase II

Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information

Computational engineering of cellulase Cel9A-68 functional motions through mutations in its linker region. WT 1TF4 (crystal) -90 ERRAT PROVE VERIFY3D

StructuralBiology. October 18, 2018

SUPPLEMENTARY INFORMATION

Journal of Pharmacology and Experimental Therapy-JPET#172536

Modeling Biological Systems Opportunities for Computer Scientists

Mechanical Proteins. Stretching imunoglobulin and fibronectin. domains of the muscle protein titin. Adhesion Proteins of the Immune System

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y. jgp

Central Dogma. modifications genome transcriptome proteome

Catalytic Mechanism of the Glycyl Radical Enzyme 4-Hydroxyphenylacetate Decarboxylase from Continuum Electrostatic and QC/MM Calculations

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Bacterial protease uses distinct thermodynamic signatures for substrate recognition

Supersecondary Structures (structural motifs)

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

6 Similarities between principal components of protein dynamics and random diffusion

Supplementary Figures:

----- Ver October 24, 2014 Bug about reading MOPAC2012 Ver.14 calculations of 1 atom and 2 atoms molecule was fixed.

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Unfolding CspB by means of biased molecular dynamics

Relative binding enthalpies from molecular dynamics simulations. using a direct method

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Citation for published version (APA): Feenstra, K. A. (2002). Long term dynamics of proteins and peptides. Groningen: s.n.

Esser et al. Crystal Structures of R. sphaeroides bc 1

Dominant Paths in Protein Folding

Section Week 3. Junaid Malek, M.D.

Hyeyoung Shin a, Tod A. Pascal ab, William A. Goddard III abc*, and Hyungjun Kim a* Korea

Supplementary Materials for

Analysis of the simulation

Properties of amino acids in proteins

Supporting Information

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Peptides And Proteins

Protein Structures: Experiments and Modeling. Patrice Koehl

Electronic Supplementary Information

What binds to Hb in addition to O 2?

Supplementary Information Intrinsic Localized Modes in Proteins

Oxygen Binding in Hemocyanin

A prevalent intraresidue hydrogen bond stabilizes proteins

Insights into the Biotransformation of 2,4,6- Trinitrotoluene by the Old Yellow Enzyme Family of Flavoproteins. A Computational Study

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Essential dynamics sampling of proteins. Tuorial 6 Neva Bešker

Biomolecules: lecture 10

Physiochemical Properties of Residues

Structural Perspectives on Drug Resistance

Supplementary Figure S1. Urea-mediated buffering mechanism of H. pylori. Gastric urea is funneled to a cytoplasmic urease that is presumably attached

Structural basis for catalytically restrictive dynamics of a high-energy enzyme state

proteins Domain motions of hyaluronan lyase underlying processive hyaluronan translocation

Multi-scale approaches in description and design of enzymes

Unraveling the degradation of artificial amide bonds in Nylon oligomer hydrolase: From induced-fit to acylation processes

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

It s the amino acids!

1. What is an ångstrom unit, and why is it used to describe molecular structures?

Model Mélange. Physical Models of Peptides and Proteins

Problems from Previous Class

SUPPLEMENTARY FIGURES. Figure S1

Comparison of Full-atomic and Coarse-grained Models to Examine the Molecular Fluctuations of c-amp Dependent Protein Kinase

SUPPLEMENTARY INFORMATION

Supplementary Information

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Structural Insights from Molecular Dynamics. Simulations of Tryptophan 7-Halogenase and

SUPPLEMENTARY INFORMATION

Polymorphism of the Epidermal Growth Factor Receptor Extracellular Ligand Binding Domain: The Dimer Interface Depends on Domain Stabilization

Molecular Dynamics Simulations: an Useful Tool to Investigate the Metal Release Mechanism from Serum Transferrin Protein

Protein Structure Bioinformatics Introduction

SUPPLEMENTARY INFORMATION

Molecular Dynamics Simulation of HIV-1 Reverse. Transcriptase

Supporting Information

Supporting Information

Supporting Information. A Comprehensive Computational Study of the Interaction between Human Serum Albumin and Fullerenes

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate

Biological Macromolecules

Final Chem 4511/6501 Spring 2011 May 5, 2011 b Name

Voltage Dependence of Conformational Dynamics and Subconducting

Routine access to millisecond timescale events with accelerated molecular dynamics

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Low Frequency Motion in Proteins

GROMACS Implementation of Metadynamics in Essential Coordinates

Molecular Dynamics Studies of Human β-glucuronidase

NB-DNJ/GCase-pH 7.4 NB-DNJ+/GCase-pH 7.4 NB-DNJ+/GCase-pH 4.5

Molecular Dynamics in Principal Component Space

Supplement information

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

Biotechnology of Proteins. The Source of Stability in Proteins (III) Fall 2015

Secondary and sidechain structures

Transcription:

Electronic Supplementary Material (ESI) for Physical Chemistry Chemical Physics. This journal is the Owner Societies 215 Structural and mechanistic insight into the substrate binding from the conformational dynamics in apo and substrate-bound DapE enzyme Debodyuti Dutta, and Sabyashachi Mishra Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur, India E-mail: mishra@chem.iitkgp.ernet.in To whom correspondence should be addressed 1

Overall stabilization of the apo and enzyme-substrate complex 3 2.8 Zn1-E163 2.8 Zn1-H67 Distance (Å) 2.6 2.4 2.2 2 1.8 Zn2-E135 2.4 2 Zn2-H349 1.6 1.6 3 3 Zn1-D1 Distance (Å) 2.5 2 Zn2-D1 1.5 2 4 6 8 1 Time (ns) 2.5 2 Zn1-SDAP Zn2-SDAP 1.5 2 4 6 8 1 Time (ns) Figure S1: Distance of the active site residues from Zn atoms during the trajectory. Horizontal lines indicate the value of the same in the crystal structure of DapE. DapE DapE-SDAP # Residues 26 24 Helix # Residues 22 2 18 88 Sheet # H-bonds 84 8 76 Backbone H-bond 2 4 6 8 1 Time (ns) Figure S2: The stability of the enzyme native structure in the DapE and DapE-SDAP simulations shown by the number of residues that form helices (upper panel) and extended β-strands (middle panel) and number of backbone hydrogen bonds (lower panel) in both the systems. The horizontal lines indicate the corresponding values in the crystal structure. 2

Theoretical Background of Principal Component Analysis The principal component analysis, also known as quasiharmonic analysis or essential dynamics method, is a well established technique to express the conformational dynamics in high dimensional complex systems in terms of a few principal modes or principal components of motion. 1,2 The first step of calculation of the principal components involves the calculation of the covariance matrix, C, of the positional deviations and the diagonalization of this matrix. The 3N dimensional covariance matrix C is calculated based on an ensemble of protein structures, and the elements of C are defined as 3 C i j = (x i x i )(x j x j ), (S1) where x 1,...,x 3N are the Cartesian coordinates of an N-particle system and the brackets denote ensemble average. The diagonalization of the symmetric 3N 3N matrix C is equivalent to solving the eigenvalue problem 4,5 R T CR = λ, (S2) where λ 1 λ 2... λ 3N are the eigenvalues, and R T is the transpose of R. The columns of R are the eigenvectors or the principal modes. The trajectory can be projected onto the eigenvectors to obtain the principal components 6 q i (i),i = 1,2,...,3N, q = R T (x(t) x ). (S3) The eigenvalues λ i represent the mean-square fluctuation along the direction of the ith principal mode. The largest eigenvalue captures the largest fraction of the root-mean-square fluctuation (RMSF) of the overall dynamics followed by the smaller eigenvalues. The first few principal components typically describe the collective global motions of the system, while the principal components associated with smaller eigenvalues represent local motions largely dominated by random fluctuations. 3

To distinguish the actual protein motions from random fluctuations described by the dynamics of the first few principal components, the cosine contents (c i ) of these principal components were evaluated, 7 c i = 2 T ( T cos ( ) 2 iπt qi (t)dt) T T q i 2 (t)dt. (S4) Based on the fact that the principal components of random diffusion are cosines with the number of periods equal to half the principal component index, the cosine content can take values between zero (no cosine) and 1 (a perfect cosine), representing the limits of relevant protein dynamics and random fluctuations, respectively. 8 The eigenvectors corresponding to the larger eigenvalues, which can be used to describe the essential motions, tend to converge to a stable set in the nanosecond timescale, suggesting that MD simulations of nanosecond timescale can provide a reasonable definition of the essential subspace, valid well beyond the nanosecond range. 9 11 The convergence of the principal component analysis is further verified by determining the overlap of the sampling between the first and second half of the MD simulation of apo enzyme and enzyme-substrate complex. Since the elements of the covariance matrix are proportional to the square of the displacement, the extent of sampling can be accessed from the square root of the covariance matrix, which can be calculated from the eigenvalues λ i and the eigenvectors (columns of the matrix R). 6 The overlap s between two parts of the trajectory is defined as, 6 s(a,b) = 1 d(a,b) tra + trb, (S5) where tr is the trace of a matrix and the difference d between covariance matrices A and B, associated with the two parts of the trajectory, are given by, 6 d(a, B) = = tr ( A 1/2 B 1/2) ( N i (λ A i + λ B i ) 2 N i N i (S6) λ A i λ B j (RA i R B j ) 2) 1/2, (S7) 4

with λ A i and λ B i as the ith eigenvalue of covariance matrices A and B, corresponding to the eigenvector columns R A i and R B j, respectively. Convergence Tests of the Principal Component Analysis Due to the large size of the biomolecular system together with the large scale conformational dynamics observed during 1 ns of MD simulation, verification of the convergence of the principal component analysis is very important. With increasing length of MD simulation, the conformational dynamics of the system is expected to converge. For the present study, we have tested this by evaluating principal components for increasing length of simulations. Table S1 shows that the eigenvalues of the first three principal components converge as the simulation length approaches 1 ns. For further verification of the convergence of the results, the cosine contents (Equation S4) of the principal components, which help distinguish the essential protein motion from the random fluctuations, are evaluated for increasing length of simulation (Figure S4). While the principal component 2 requires only a few nanoseconds of simulations to acquire a cosine content close to zero, the principal component 1 requires much longer time to do so due to its higher rate of fluctuation. For biological systems as large as the present one, the limit of convergence of the cosine contents are around.5, 8 which is achieved for the principal component 1 by the end of 1 ns of MD simulations in both apo enzyme as well as enzyme-substrate complex. Table S1: The relative contribution of the first three principal components to the overall protein dynamics for increasing length of MD trajectory of DapE and DapE-SDAP complex. Time DaPE DaPE-SDAP PC 1 PC 2 PC 3 PC 1 PC 2 PC 3-2 ns.41.2.12.62.13.8-4 ns.53.16.11.57.12.1-6 ns.49.2.13.46.19.1-8 ns.43.27.12.52.18.9-1 ns.4.29.11.54.18.8 5

6 Eigenvalues (Å 2 ) 5 4 3 2 1 DapE DapE-SDAP Cumulative eigenvalues 1.8.6.4 DapE 2 4 6 8 1 Eigenvector index DapE-SDAP 2 4 6 8 1 Eigenvector index Figure S3: The eigenvalues of the covariance matrices obtained from the positional deviations of the enzyme backbone atoms in apo DapE and DapE-SDAP complex (upper panels) and the normalized cumulative eigenvalues of the two systems (lower panels). Only the first 1 eigenvalues are shown for clarity. To evaluate the convergence of the phase-space sampling by the MD simulations, the overlap factors (Equation S5) are calculated from the two halves of the trajectory with gradually increasing the length of the MD trajectory. We see that when the first 2 ns of the trajectory is divided into two, the covariance matrices show least overlap (Figure S4c,d). The overlap gradually increases and is the highest (around.75) when the entire 1 ns MD trajectory is divided into two halves, indicating the degree of convergence of the phase space sampling during MD simulations. The distribution of the displacement of the principal components gives a measure of randomness of the underlying dynamics. 12,13 Figure S5 shows the normalized distribution of the principal component 1 and principal component 2 and compares it with a relatively small amplitude component (principal component 5, chosen randomly). It is clear that the principal 6

Cosine content 1.8.6.4.2 DapE PC 1 PC 2 1.8.6.4.2 DapE-SDAP Overlap coefficient 2ns 4ns 6ns 8ns 1ns 1.8.6.4.2 2ns 4ns 6ns 8ns1ns 2ns 4ns 6ns 8ns 1ns 1.8.6.4.2 2ns 4ns 6ns 8ns1ns Figure S4: Cosine content analysis (Equation S4) of the principal components 1 and 2 in DapE and DapE-SDAP complex for increasing length of the MD trajectory (upper panels). The overlap coefficient (Equation S5) between two equal halves of the MD trajectory for increasing length of MD trajectory of DapE and DapE-SDAP complex (lower panels). component 1 and principal component 2 for both apo enzyme and the enzyme-substrate complex show non-gaussian distribution, emphasizing the fact that the dynamics of the systems along these eigenmodes sample a large number of configurations, which is a characteristic of global motion (Figure S5). The higher eigenmodes corresponding to smaller eigenvalues, on the other hand, exhibit Gaussian distributions signifying independent local fluctuations as seen for principal component 5 (Figure S5). In addition to the dominating first two principal components, the higher principal components have also been analyzed. We found the principal components 3 and 4 in both apo enzyme and the enzyme-substrate complex show essential motion exhibiting relative motion of the catalytic domains and dimerization domains. However, owing to their small amplitudes obtained from the diagonalization of the covariance matrix, their contributions to the overall enzyme 7

Normalized probability distribution DapE DapE-SDAP PC 1 PC 1-6-4-2 2 4 6-6-4-2 2 4 6 PC 2 PC 2-4 -2 2 4-4 -2 2 4 PC 5 PC 5-4 -2 2 4 Displacement of modes (Å) -4-2 2 4 Displacement of modes (Å) Figure S5: Normalized probability distribution of the principal components. While the distribution profiles for principal components 1 and 2 are non-gaussian, the principal component 5 shows a Gaussian distribution signifying the random fluctuations. conformational dynamics are expected to be marginal. 8

Characterization of the principal components Characterization of the principal component 1 The time evolution of the distance between the centers of mass of the catalytic domain and dimerization domain in both chains of the apo enzyme and enzyme-substrate complex obtained from 1 ns of MD simulations are shown in the upper panels in Figure S6, while the middle panels show the distance between centers of mass of the catalytic domains and dimerization domains of the trajectory filtered along mode 1. The two data sets are strongly correlated, in particular, for the DapE-SDAP system with correlation coefficient of.73 for chain A and.66 for chain B, as compared to.56 and.43 for the chains A and B in the apo enzyme, respectively. The high values of correlations further underlines the predominant importance of mode 1 in the conformational dynamics of the enzyme-substrate complex. The striking similarities between the time evolution of the distance between the centers of mass of the catalytic domain and dimerization domain (middle panel in Figure S6) with the time series of the displacement of the principal component 1 (lower panel in Figure S6) reflects the fact that the choice of the internal coordinates (i.e., the distance between centers of mass of the catalytic domain and dimerization domain) indeed provides an accurate description of the dynamics along the principal component 1. The time evolution of the distance between centers of mass of the catalytic domain and dimerization domain along mode 1, shows a decreasing distance between the catalytic and dimerization domains in the DapE-SDAP complex system, while the same is not true for the apo enzyme (middle panel in Figure S6). This differential correlation can be clearly seen from the cross-correlation matrix of C α atomic fluctuations along principal component 1 (Figure S7). Highly positive regions (blue) are indicative of strong correlation in the movement of specific residues, whereas negative regions (red) are associated with strong anticorrelated motion of the residues. Along the diagonal, both the systems show highly correlated motions because of the correlation of a residue with itself. In Figure S7 only the extreme values of correlation or anticorrelation ( correlation >.9) are shown for clarity. In the enzyme-substrate complex system, 9

PC 1 (Å) CoM Dist. (Å) CoM Dist. (Å) 42 4 38 36 34 41 4 39 38 37 36 35 6 4 2-2 -4-6 Chain A, DapE Chain B, DapE 42 2 4 6 8 1 Time (ns) 4 38 36 34 Chain A, DapE-SDAP Chain B, DapE-SDAP 41 Mode 1 4 Mode 1 39 38 37 36 35 6 4 2-2 -4-6 2 4 6 8 1 Time (ns) Figure S6: Distance between the centers of mass (CoM) of the catalytic domains and the dimerization domains in chain A and chain B of DapE and DapE-SDAP complex along the actual MD trajectory (upper panels), the trajectory filtered along principal component 1 (middle panels), and the displacement of the principal component 1 (lower panels). the motion of the two catalytic domains exhibit more anticorrelation compared to that in the case of apo enzyme. This indicates the fact that in the DapE-SDAP systems, the two catalytic domains exhibit motions in opposite directions. Together with the observations from Figure 3 in the manuscript, the essential dynamics along the principal component 1 in the enzyme-substrate complex can be inferred to involve a folding of the catalytic domains onto the dimerization domains, which is further supported from the radius of gyration of the entire protein along the principal component 1 in of both the systems (Figure 5a of the manuscript). The decreasing radius of gyration in the DapE-SDAP complex along principal component 1 indicates the increase 1

of the compactness of the enzyme-substrate complex, compared to the apo enzyme. Figure S7: The C α -C α cross-correlation map of the trajectory filtered along principal component 1 for DapE (lower diagonal) and DapE-SDAP complex (upper diagonal). Characterization of the principal component 2 The time evolution of the dihedral angle between the four centers of mass consisting of the catalytic domain of chain A, the dimerization domain of chain A, the dimerization domain of chain B and the catalytic domain of chain B of the apo enzyme and enzyme-substrate complex obtained from 1 ns of MD simulations are shown in the upper panels in Figure S8, while the middle panels show the same dihedral angle along principal component 2. The fluctuations in the dihedral angle are higher in case of the apo enzyme as compared to the enzyme-substrate complex, where principal component 1 is highly dominating in accounting for the enzyme conformational dynamics. Moreover, the correlation coefficients for the change in dihedral for the original trajectory and the trajectory filtered along mode 2 are.72 and.14 for DapE and DapE-SDAP, respectively. The displace- 11

ment along the principal component 2 in both apo enzyme and enzyme-substrate complex show excellent agreement with the time evolution of the dihedral angle along the principal component 2, suggesting the accurate description of the principal component 2 by the chosen internal coordinate. Dihedral angle (deg.) PC 2 14 13 12 11 1 14 13 12 11 4 2-2 -4 DapE Mode 2 2 4 6 8 1 Time (ns) 14 13 12 11 1 114 113 112 111 11 4 2-2 -4 DapE-SDAP Mode 2 2 4 6 8 1 Time (ns) Figure S8: Dihedral angle between the centers of mass of catalytic domain (chain A), dimerization domain (chain A), dimerization domain (chain B) and catalytic domain (chain B) along the actual MD trajectory (upper panels), the trajectory filtered along the principal component 2 (middle panels), and the displacement of the principal component 2 (lower panels) for DapE and DapE-SDAP complex. In addition to the dominating first two principal components, the higher principal components have also been analyzed. We found the principal components 3 and 4 in both apo enzyme and the enzyme-substrate complex show essential motion exhibiting relative motion of the catalytic domains and dimerization domains. However, owing to their small amplitudes obtained 12

from the diagonalization of the covariance matrix, their contributions to the overall enzyme conformational dynamics are expected to be marginal. The distribution of the eigenvalues and the description of the first two principal components are found to be similar when the covariance matrix is constructed from the backbone atoms of the protein (data not shown). This underlines the fact that the covariance analysis generally gives an idea about the global motion of the system, where the backbone atoms play primary role. Although, the side-chains of some of the residues are important in substrate binding, many side chains undergo Gaussian type fluctuations with very small amplitude and therefore contribute marginally to the large time scale overall protein dynamics. projection on eigenvector 2 (Å) 4 2-2 DapE DapE-SDAP -4-6 -4-2 2 projection on eigenvector 1 (Å) 4 6 Figure S9: Distribution of the planes spanned by the first two modes for DapE and DapE-SDAP systems. The combined displacements along the first two principal components provide information about the various conformational states the systems visit. The corresponding two-dimensional scatter plot distribution along the first two principal components (Figure S9) represents the tra13

jectories projected on two planes, defined by two eigenvectors for apo and complex systems. While it may be difficult to isolate the representative structures, it can be readily seen that during the 1 ns simulation, both the DapE and DapE-SDAP systems visit a wide range of conformations. Along eigenvector 1, DapE-SDAP spans a wider range of conformations compared to DapE, whereas along eigenvector 2, the conformational dynamics in DapE is wider than the DapE-SDAP complex. 14

Normalized occupancy of waters within 8 Å of Zn atoms DapE DapE-SDAP MD (NTP) 3 4 5 6 7 8 Equilibration (NVT) 3 4 5 6 7 8 # of waters within 8 Å of Zn Figure S1: The number of water molecules within 8 Å of Zn atoms in apo enzyme and enzymesubstrate complex during the MD trajectory (upper panel) and equilibration (lower panel). The space occupied by the substrate is not considered in this figure. In this figure, 8 Å radius around the metal ions in both DapE and DapE-SDAP systems is considered to calculate the number of water molecules near the active site. In the complex, the substrate SDAP falls within this radius. However, the volume occupied by the substrate can account for at the most a total of six water molecules (even if we assume perfect packing of water molecules). Additionally, since SDAP is present within the protein matrix, it is unlikely that the region vacated by the substrate will be entirely occupied by water molecules. Hence the difference in the number of water molecules must be the direct consequence of the types of motion of the catalytic domains that are described in the manuscript. This is further verified, when one compares the results during the equilibration (lower panel), where the number of water molecules with 8 Å of the metal center remains nearly same for the apo and complex system. 15

Table S2: Residue-wise decomposition of the binding free energy (kcal/mol) when the ligand is SDAP, or Zn1, or Zn2. During SDAP binding, most of the active site residues (connected to metal centers) exhibit positive G, while all the hydrogen-bonding partners of SDAP and the few dimeric-domain residues that have been mentioned in the manuscript, exhibit negative G. The positive G of the active site residues results from the electrostatic repulsion of the negatively charged ends of the active site residues (which are connected to Zn) and that of SDAP, which arises due to the fact that in MM-PBSA an infinite cut-off of electrostatics has been used. The active site residues, on the other hand, strongly stabilize the binding of metal centers (columns 3 and 4) which in turn stabilize the binding of substrate SDAP (column 2). Residue ligand SDAP ligand Zn1 ligand Zn2 HSD 67.82-38.3-4.93 ASP 1 4.48-94.26-89.43 GLU 135 3.15-29.57-13.19 GLU 163 2.45-115.73-21.29 HSD 349 -.3-3.51-33.41 GLU 134 -.86-13.12-5.99 LYS 175 -.74.59.4 ARG 178 -.89.63.48 GLY 322 -.9.6.2 GLY 323 -.31.54.2 GLY 324-1.26.83.38 THR 325-3.3 1..35 ASP 327 -.45-1.38 -.93 ARG 329-1.32 2.7 1.23 ASN 345 -.16.34.15 LYS 35 -.42 -.18 -.3 ARG 258 -.52.28.31 THR 261 -.6 -. -. SER 29 -.76 -.1 -.1 SER 181 -.7.1.1 PRO 293 -.8.4.2 TYR 259.8 -.2 -.2 Zn1-4.47 33.8 86.83 Zn2-5.15 83.43 36.63 SDAP -58.988-18.66-11.82 G total -66-282 -238 16

Table S3: Hydrogen bonds summary between SDAP and DapE in chain A and chain B during 1 ns MD. The numbers in the parentheses represent the hydrogen bonds summary along the principal component 1. H-bond summary of LL-SDAP Residues %Occupancy lifetime (ps) #events Glu134 38.7 (33.6) 15.28 (14.86) 2532 (224) Lys175 11.2 (8.9) 3.95 (3.9) 2837 (2282) Arg178 73.6 (61.3) 91.4 (9.1) 85 (68) Thr325 78.6 (71.9) 23.46 (22.91) 335 (3139) Asp327 74.6 (48.3) 18.36(17.2) 43 (2837) Arg329 19.8 (15.8) 22.76 (21.94) 87 (72) Asn345 42.5 (38.) 16.25 (14.1) 4 (365) Lys35 5.8 () 6.1 95 17

per-residue dimerization energy (kcal/mol) 2-2 -4 Chain A Chain B DapE DapE-SDAP Cat A Dim B Cat A Cat B Dim B Cat B 1 2 3 4 5 6 7 # Residues Figure S11: Residue-wise decomposition of the total dimerization free energy (the free energy of formation of the dimer DapE) obtained from MM-PBSA analysis of apo-dape MD trajectory (black) and DapE-SDAP MD trajectory (red). The two chains (monomers) are separated by the vertical blue line and the domains within a monomer are shown as horizontal arrows. 18

DapE DapE-SDAP Population 5 1 15 2 25 3 # of H-bonds between monomers A and B Figure S12: Distribution of the hydrogen bonds between the two chains of DapE during apo- DapE and DapE-SDAP MD simulations, shown in black and red, respectively. 19

Figure S13: The contact between the two monomers of the dimeric DapE is primarily between the α5 helices and β8 strands of both the monomers. The catalytic domains are shown in gray, the dimerization domains are shown in dark gray, and the α5 helices and beta8 strands are shown in blue and red for the chain A and chain B, respectively. 2

Dimerization free energy (kcal/mol) -4-5 -6-7 -4-5 -6-7 -4-5 -6-7 PC1 PC2 DapE DapE-SDAP 2 4 6 8 1 Figure S14: The free energy of formation of the dimer DapE (the dimerization free energy) during the 1 ns MD simulation (upper panel). The dimerization free energy from the trajectories filtered along the principal components 1 and 2, middle and lower panels, respectively. The average dimerization energy for the MD trajectory, PC 1, and PC 2, for the apo-dape are estimated as -63, -61, -61 kcal/mol, respectively and the same for the SDAP-DapE complex are obtained as -6, -57, -57 kcal/mol, respectively. 21

References (1) Kitao, A.; Hirata, F.; Go, N. Chem. Phys. 1991, 158, 447 472. (2) Garcia, A. Phys. Rev. Lett. 1992, 68, 2696 2699. (3) Levy, R. M.; Srinivasan, A.; Olson, W.; McCammon, J. Biopolymers 1984, 23, 199 1122. (4) Hayward, S.; Go, N. Annu. Rev. Phys. Chem. 1995, 46, 223 25. (5) Tournier, A. L.; Smith, J. C. Phys. Rev. Lett. 23, 91, 28161 28164. (6) Hess, B. Phys. Rev. E 22, 65, 31911 31911. (7) Hess, B. Phys. Rev. E 2, 62, 4838 8448. (8) Maisuradze, G. G.; Liwo, A.; Scherga, H. A. J. Mol. Biol. 29, 385(1), 312 329. (9) Van Aalten, D. M. F.; Groot, B. L. D.; Findlay, J. B. C.; Berendsen, H. J. C.; Amadei, A. J. Comput. Chem. 1997, 18, 169 181. (1) Arcangeli, C.; Bizzarri, A. R.; Cannistraro, S. Biophys. Chem. 21, 9, 45 56. (11) Kubitzki, M. B.; de Groot., B. L. Biophys. J. 27, 92, 4262 427. (12) Amadei, A.; Linssen, A. B. M.; Berendsen, H. J. C. Proteins: Structure, Function, and Bioinformatics 1993, 17, 412. (13) Laberge, M.; Yonetani, T. Biophys. J. 28, 94, 2737 2751. 22