Supplementary information for cloud computing approaches for prediction of ligand binding poses and pathways

Similar documents
SUPPLEMENTARY INFORMATION

Drug binding and subtype selectivity in G-protein-coupled receptors

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

T H E J O U R N A L O F G E N E R A L P H Y S I O L O G Y. jgp

SUPPLEMENTARY INFORMATION

Application of the Markov State Model to Molecular Dynamics of Biological Molecules. Speaker: Xun Sang-Ni Supervisor: Prof. Wu Dr.

L718Q mutant EGFR escapes covalent inhibition by stabilizing. a non-reactive conformation of the lung cancer drug. osimertinib

Experimental and Computational Mutagenesis to Investigate the. Positioning of a General Base within an Enzyme Active Site

Supporting Information for. Models for the Metal Transfer Complex of the N-terminal Region of CusB and. CusF

Comparing crystal structure of M.HhaI with and without DNA1, 2 (PDBID:1hmy and PDBID:2hmy),

SUPPLEMENTARY FIGURES

Improving Protein Function Prediction with Molecular Dynamics Simulations. Dariya Glazer Russ Altman

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

NB-DNJ/GCase-pH 7.4 NB-DNJ+/GCase-pH 7.4 NB-DNJ+/GCase-pH 4.5

Supplementary Information

Computational engineering of cellulase Cel9A-68 functional motions through mutations in its linker region. WT 1TF4 (crystal) -90 ERRAT PROVE VERIFY3D

Structure Investigation of Fam20C, a Golgi Casein Kinase

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Supplementary Figure 1. Biochemical and sequence alignment analyses the

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

Alchemical free energy calculations in OpenMM

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1.

Bioengineering & Bioinformatics Summer Institute, Dept. Computational Biology, University of Pittsburgh, PGH, PA

Supplementary Figure S1. Urea-mediated buffering mechanism of H. pylori. Gastric urea is funneled to a cytoplasmic urease that is presumably attached

Molecular dynamics simulations of anti-aggregation effect of ibuprofen. Wenling E. Chang, Takako Takeda, E. Prabhu Raman, and Dmitri Klimov

Supporting Information How does Darunavir prevent HIV-1 protease dimerization?

SUPPLEMENTARY INFORMATION

Supporting Information. Influence of Vapor Deposition on Structural. and Charge Transport Properties of. Ethylbenzene Films

Chapter 4. Glutamic Acid in Solution - Correlations

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

CHEMpossible. 261 Exam 1 Review

Electronic Supplementary Information Effective lead optimization targeted for displacing bridging water molecule

FW 1 CDR 1 FW 2 CDR 2

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Lecture 21 (11/3/17) Protein Stability, Folding, and Dynamics Hydrophobic effect drives protein folding

SUPPLEMENTARY INFORMATION

Structural insights into Aspergillus fumigatus lectin specificity - AFL binding sites are functionally non-equivalent

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

Supplementary Figure 1 Crystal contacts in COP apo structure (PDB code 3S0R)

Supplementary Information

Macromolecular X-ray Crystallography

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

CHAPTER 2: RESONANCE THEORY

MM-PBSA Validation Study. Trent E. Balius Department of Applied Mathematics and Statistics AMS

Problem Set 1

Laying down deep roots: Molecular models of plant hormone signaling towards a detailed understanding of plant biology

1. What is an ångstrom unit, and why is it used to describe molecular structures?

Timescales of Protein Dynamics

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supplementary Figures

Calculation of the distribution of eigenvalues and eigenvectors in Markovian state models for molecular dynamics

SUPPLEMENTARY INFORMATION

Protein Dynamics, Allostery and Function

Insights on Antibiotic Translocation : Molecular Dynamics Simulations

Relative binding enthalpies from molecular dynamics simulations. using a direct method

ψ j φ j+1 Motif position

Timescales of Protein Dynamics

* Author to whom correspondence should be addressed; Tel.: ; Fax:

Catalytic Mechanism of the Glycyl Radical Enzyme 4-Hydroxyphenylacetate Decarboxylase from Continuum Electrostatic and QC/MM Calculations

Structural Insights from Molecular Dynamics. Simulations of Tryptophan 7-Halogenase and

Electron Density at various resolutions, and fitting a model as accurately as possible.

Nature Structural and Molecular Biology: doi: /nsmb Supplementary Figure 1. Experimental approach for enhancement of unbiased Fo Fc maps.

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

CAP 5510 Lecture 3 Protein Structures

SUPPLEMENTARY ONLINE DATA

Problem Set 5 Question 1

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Full wwpdb X-ray Structure Validation Report i

Protein Dynamics, Allostery and Function

Computational Solvent Mapping Reveals the Importance of Local Conformational Changes for Broad Substrate Specificity in Mammalian Cytochromes P450

Molecular Modeling lecture 2

Destruction of Amyloid Fibrils by Graphene through Penetration and Extraction of Peptides

Frequency Response of a Protein to Local Conformational Perturbations

Supplementary Figures:

Supporting Information

Supplementary Figure 1 Change of the Tunnelling Transmission Coefficient from the Bulk to the Surface as a result of dopant ionization Colour-map of

Identifying Interaction Hot Spots with SuperStar

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Supporting information for: Redox Potential Tuning Through Differential. Quinone Binding in the Photosynthetic Reaction

Current address: Department of Chemistry, Hong Kong Baptist University, Kowloon Tong, Hong Kong,

Supplementary Materials for

NGF - twenty years a-growing

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor

Docking. GBCB 5874: Problem Solving in GBCB

schematic diagram; EGF binding, dimerization, phosphorylation, Grb2 binding, etc.

Modeling Biological Systems Opportunities for Computer Scientists

SUPPLEMENTARY INFORMATION

Insights into the Biotransformation of 2,4,6- Trinitrotoluene by the Old Yellow Enzyme Family of Flavoproteins. A Computational Study

Effects of Chemical Exchange on NMR Spectra

From Amino Acids to Proteins - in 4 Easy Steps

Detailed description of overall and active site architecture of PPDC- 3dThDP, PPDC-2HE3dThDP, PPDC-3dThDP-PPA and PPDC- 3dThDP-POVA

SUPPLEMENTARY INFORMATION

Flexibility and Constraints in GOLD

Transcription:

Supplementary information for cloud computing approaches for prediction of ligand binding poses and pathways Morgan Lawrenz 1, Diwakar Shukla 1,2 & Vijay S. Pande 1,2 1 Department of Chemistry, Stanford University, Stanford, CA 94305 2 SIMBIOS NIH Center for Biomedical Computation, Stanford University, Stanford, CA 94305 Supplementary Table 1. Adaptive sampling scheme for FKBP12 ligand binding simulations. Round 1 (µs) Round 2 Round 3 Total (µs) Bind events Unbind events L2 3.7 107.2 69.8 180.7 883 a 222 b L3 3.3 126.9 70.1 200.3 891 273 L6 3.0 115.4 71.1 189.5 560 185 L9 3.1 128.2 73.2 204.5 494 141 AND 73.4 76.4-149.6 411 120 a Bind criteria: In trajectory, W59 sidechain - ligand nitrogen atom (oxygen for Androstan) distance transitions to > 1.5 Åwhen cumulative average is < 6 Å. b Unbind criteria is reverse of bind. 1

Supplementary Table 2. Final ranking of high probability MSM states for FKBP12 ligands. Four states with highest equilibrium probabilities from the final MSM are listed with the state rmsd to the reference bound state. Only L9 has a deposited crystal structure; other ligand poses are derived from the FK506 structure 1YAT as described in the main text. The Androstan reference structure is the final predicted pose. pose AND L2 L3 L6 L9 π a rmsd b π rmsd π rmsd π rmsd π rmsd 0 0.2 0.0 0.12 1.3 0.23 1.2 0.04 2.6 0.11 0.7 1 0.15 6.0 0.06 6.2 0.05 5.3 0.03 7.7 0.10 6.6 2 0.06 1.3 0.04 5.9 0.03 2.0 0.03 6.8 0.09 4.8 3 0.04 6.1 0.03 5.6 0.03 4.9 0.02 4.2 0.05 5.3 a Normalized equilibrium population from the MSM. b Root mean square deviation (Å) from reference (experimental or predicted) pose. 2

Supplementary Table 3. Comparison of computed and experimental FKBP ligand association rates. MSM-computed association rates for the ligands in this study (Holt, et al.) are listed, but as no experimental association rates are available for these particular ligands, we compare to related FK506-derived ligands (Tassa, et al.) with a similar binding affinity range and association rates from surface plasmon resonance (SPR). The strongest binders from both ligand series have rates that compare well at 20 10 6 M 1 s 1. Tassa, et al. Ligands (exp) Holt, et al. Ligands (calc) K on 0.019-230 19-227 (10 6 M 1 s 1 ) 3

Supplementary Table 4. Top flux pathway descriptions for FKBP12 ligands. For each pathway, a flux for each ligand is shown as a percentage out of the total flux represented by the analyzed top flux pathways. A pathway was analyzed if it met the criterion of having > 50% flux of the maximum flux pathway. The encounter complex is defined as the first observed on-pathway metastable state with formed ligand-protein interactions and is described with a summary of the interactions formed: near bound, within 5 Å of the predicted bound state; flipped intermediate, within the active site but flipped, as the example described in Supplementary Fig. 5; 80 s loop interaction, ligand binding either on the C-terminal or N-terminal ends of the loop formed by residues 84-91; nonspecific site, ligand interactions with sites A, B, or C as labeled in Supplementary Fig. 4. encounter complex AND L2 L3 L6 L9 near-bound 5 13 32 0 0 flipped intermediate 0 53 15 5 36 80 s Loop interaction 25 24 0 71 36 nonspecific site 69 10 53 24 28 4

Supplementary Table 5. List of lag times used for all MSMs in convergence plots. MSMs at listed aggregate simulation times were built using the lag time based on the implied timescale convergence. Models that did not demonstrate converged time scales were discarded and are not listed. L2 L3 L6 L9 AND time a lag b time lag time lag time lag time lag 0.6 0.6 0.5 0.4 1.0 0.4 0.5 0.4 5.6 0.4 6.0 6.0 3.3 2.0 6.6 1.0 0.7 0.4 21.5 0.4 11.3 6.0 22.9 1.0 16. 1.0 1.4 0.6 41.7 0.4 16.6 6.0 41.6 1.0 31. 3.0 6.1 3.0 51.5 0.4 20.7 1.0 58.3 2.0 47.2 1.0 23.2 0.8 69.3 0.4 38.3 1.0 73.4 3.0 60.1 2.0 33.4 0.8 77.1 0.4 54.2 1.0 93.3 4.0 71.1 2.0 42.9 0.8 90.1 2.0 68.4 2.0 116.6 6.0 84.6 12.0 52.0 0.8 105.0 2.0 80.7 2.0 127.0 6.0 84.6 12.0 60.5 2.0 113.0 2.0 92.0 4.0 136.5 2.0 96.3 5.0 76.2 1.5 130.9 2.0 102.4 2.0 153.2 6.0 107.2 10.0 103.4 6.0 142.3 2.0 116.4 4.0 166.9 10.0 113.9 10.0 120.4 6.0 149.6 2.0 128.7 10.0 178.0 12.0 125.0 10.0 130.7 3.0 139.5 6.0 184.4 10.0 151.4 12.0 140.2 3.0 151.4 6.0 196.0 8.0 163.5 10.0 156.9 2.0 163.0 6.0 200.3 8.0 174.5 8.0 176.7 5.0 173.1 6.0 200.3 8.0 189.5 10.0 204.5 10.0 180.7 10.0 a Aggregate simulation time in µs. b Lag time used for model in ns. 5

Supplementary Figure 1. Convergence of lowest free energy ligand MSM state. The lowest free energy state was selected from MSMs built at 10 µs intervals as aggregate sampling time is increased. We plot the state RMSD to the reference poses derived from crystallography (described in the main text), or in the case of Androstan the final predicted pose, as the rolling mean with standard deviations over 2 data points ( 20 µs). Data shown for ligands L2 (a), L3 (b), L6 (c), L9 (d), Androstan (e). Adaptive sampling times as described in Supplementary Table 1 are marked with red dashed lines. Nonspecific sites A, B, and C are labeled for L2 as non-converged binding sites and described in Supplementary Fig. 4. 6

Supplementary Figure 2. Predicted FKBP ligand binding poses from MSM free energies. (a) Overlap of the MSM-predicted binding poses of FK506-derived ligands (shown in stick representation) with the electron density derived for FK506 (PDBID 1YAT). For clarity, the densities shown are restricted to FK506 atoms which correspond to the derived ligand. (b) Free energy isosurfaces at 1.0 kcal/mol of the 3-D MSM-weighted free energy map are shown for all ligands. 7

Supplementary Figure 3. Low free energy surfaces can predict druggable regions of the binding site. (a) The MSM free energy isosurface at 1.5 kcal/mol is shown for L2 (in cyan stick representation), which extends down the N terminal region of the 80s loop (green square) (b) L6 (shown in cyan stick representation), as well as L9, and FK506 (not shown) make contacts in this same region, which contribute to an approximately two orders of magnitude increase in binding affinity. 8

Supplementary Figure 4. Predicted nonspecific binding sites. The MSM-weighted free energy map at the isosurface of 3.0 kcal/mol reveals high free energy binding sites on the protein that are common between all ligands. The primarily hydrophobic interactions formed with the ligand in the three sites, labeled A, B, and C, are also depicted 9

Supplementary Figure 5. Hydrogen bond shuttling mechanism in common FK506 ligand intermediate. The equilibrium population for these states are L2: 0.8%, L3: 0.8%, LG6: 0.5%, LG9: 0.9%. (a) The structure of the intermediate is shown for L6 and L2, with a comparison to the predicted crystal binding mode for L2. In the intermediate, the ligand isoleucine-mimetic or ring sidechain (represented by distances to the central atom c3) forms hydrophobic interactions with residues F99 and W59, but switches to interactions with F36 and I90. This switching is shown in the distance distributions in (b). Shown in (c) and (d), the ligand carbonyl groups, labeled o1, o2, and o3, initiate hydrogen bonds in the intermediate with both the Y82 hydroxyl and I56 backbone (red distributions). Conversion to the bound state stabilizes the o2 interaction with Y82 and allows o3 to find its hydrogen bond with I56. 10

Supplementary Figure 6. Implied timescales for the final MSM for all ligands. Converging timescales, or the log of the eigenvalues µ computed from the transition matrix P ij for models constructed at the final aggregate sampling time. Various lag times τ are shown with separate colors for each timescale for L2 (a), L3 (b), L6 (c), L9 (d), Androstan (e). The lag time of 10 ns was chosen for final analysis, shown for all plots as a dashed black line in the initial plateau region. The appropriate lag time for an MSM in practice requires a balance of demonstrated Markovian behavior (see Methods in the main text) with maintenance of good statistics that contribute to the matrix Cij. Here the smallest lag time was chosen in regions where convergence behavior is first demonstrated. 11