LineShapeKin NMR Line Shape Analysis Software for Studies of Protein-Ligand Interaction Kinetics

Similar documents
Supplementary Information. Overlap between folding and functional energy landscapes for. adenylate kinase conformational change

Effects of Chemical Exchange on NMR Spectra

Effects of Chemical Exchange on NMR Spectra

K ex. Conformational equilibrium. equilibrium K B

TITAN: Two-dimensional lineshape analysis

I690/B680 Structural Bioinformatics Spring Protein Structure Determination by NMR Spectroscopy

T 1, T 2, NOE (reminder)

Protein dynamics from NMR Relaxation data

3 Chemical exchange and the McConnell Equations

Spin Relaxation and NOEs BCMB/CHEM 8190

NMR in Medicine and Biology

Chemical Exchange and Ligand Binding

Inverse Detection in Multinuclear NMR

Protein Structure Determination using NMR Spectroscopy. Cesar Trinidad

NMR Resonance Assignment Assisted by Mass Spectrometry

Spin Dynamics Basics of Nuclear Magnetic Resonance. Malcolm H. Levitt

Non Uniform Sampling in Routine 2D Correlation Experiments

Introduction to Relaxation Theory James Keeler

A Hands on Introduction to NMR Lecture #1 Nuclear Spin and Magnetic Resonance

Supporting information for. Towards automatic protein backbone assignment using proton-detected 4D solid-state NMR data

Band-Selective Homonuclear 2D Correlation Experiments

PROTEIN NMR SPECTROSCOPY

Nuclear Magnetic Resonance Imaging

Pulsar. Delivering NMR to your benchtop

Biochemistry 530 NMR Theory and Practice

Lecture #6 Chemical Exchange

Algebraic solution for time course of enzyme assays. Enzyme kinetics in the real world. Progress curvature at low initial [substrate]

Chem 460 Laboratory Fall 2008 Experiment 3: Investigating Fumarase: ph Profile, Stereospecificity and Thermodynamics of Reaction

StepNOESY Overcoming Spectral Overlap in NOESY1D

protein interaction analysis bulletin 6300

Longitudinal-relaxation enhanced fast-pulsing techniques: New tools for biomolecular NMR spectroscopy

NMR in Structural Biology

I N N O V A T I O N L E C T U R E S (I N N O l E C)

Chemical Kinetics I: The Dry Lab. Up until this point in our study of physical chemistry we have been interested in

Physikalische Chemie IV (Magnetische Resonanz) HS Solution Set 2. Hand out: Hand in:

Supplementary Information

Implementation of the King-Altman method in DynaFit

High-Resolutio n NMR Techniques i n Organic Chemistry TIMOTHY D W CLARIDGE

Biochemistry 530 NMR Theory and Practice. Gabriele Varani Department of Biochemistry and Department of Chemistry University of Washington

10.4 Continuous Wave NMR Instrumentation

Structural basis for catalytically restrictive dynamics of a high-energy enzyme state

Everyday NMR. Innovation with Integrity. Why infer when you can be sure? NMR

Nuclear Magnetic Resonance (NMR)

Biophysical Chemistry: NMR Spectroscopy

APPENDIX 1 Version of 8/24/05 11:01 AM Simulation of H-NMR without a structure*:

Supporting Protocol This protocol describes the construction and the force-field parameters of the non-standard residue for the Ag + -site using CNS

Solid-state NMR and proteins : basic concepts (a pictorial introduction) Barth van Rossum,

ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE. Chester F. Carlson Center for Imaging Science

Operation of the Bruker 400 JB Stothers NMR Facility Department of Chemistry Western University

Introduction solution NMR

COSY type experiments exploring through-bond homonuclear correlations

Biophysical Chemistry: NMR Spectroscopy

NMR Spectroscopy of Polymers

We have seen that the total magnetic moment or magnetization, M, of a sample of nuclear spins is the sum of the nuclear moments and is given by:

General NMR basics. Solid State NMR workshop 2011: An introduction to Solid State NMR spectroscopy. # nuclei

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS. HINDERED ROTATION IN N-METHYLFORMAMIDE. A PEPTIDE-BOND MODEL SYSTEM.*t

Filtered/edited NOESY spectra

Natural abundance solid-state 95 Mo MAS NMR of MoS 2 reveals precise 95 Mo anisotropic parameters from its central and satellite transitions

Mnova Software Tools for Fragment-Based Drug Discovery

Biochemistry 530 NMR Theory and Practice

Name: BCMB/CHEM 8190, BIOMOLECULAR NMR FINAL EXAM-5/5/10

Spectroscopy of Polymers

Statistics and nonlinear fits

NMR Data workup using NUTS

NMR-spectroscopy of proteins in solution. Peter Schmieder

You are advised to spend an equal amount of time on each question.

Supporting Information

NMR BMB 173 Lecture 16, February

NMR Predictor. Introduction

Tools for Structure Elucidation

Protein NMR. Bin Huang

Chemical Exchange. Spin-interactions External interactions Magnetic field Bo, RF field B1

Chem 325 NMR Intro. The Electromagnetic Spectrum. Physical properties, chemical properties, formulas Shedding real light on molecular structure:

webzyme A Web-Based Enzyme Simulation System for Education and Research

Accelerating Biomolecular Nuclear Magnetic Resonance Assignment with A*

La RMN quantitative appliquée aux petites molécules

André Schleife Department of Materials Science and Engineering

14. Coherence Flow Networks

CHEMISTRY. CHEM 0100 PREPARATION FOR GENERAL CHEMISTRY 3 cr. CHEM 0110 GENERAL CHEMISTRY 1 4 cr. CHEM 0120 GENERAL CHEMISTRY 2 4 cr.

Nuclear Physics. Chapter 43. PowerPoint Lectures for University Physics, Thirteenth Edition Hugh D. Young and Roger A. Freedman

Using MnovaScreen to Process, Analyze and Report Ligand-Protein Binding Spectra for Fragment-based Lead Design

NMR Spectroscopy: A Quantum Phenomena

Nuclear magnetic resonance spectroscopy II. 13 C NMR. Reading: Pavia Chapter , 6.7, 6.11, 6.13

NMR: PRACTICAL ASPECTS

Technical University of Denmark

A Brief Introduction to Medical Imaging. Outline

Substrate-dependent switching of the allosteric binding mechanism of a dimeric enzyme

Chemistry Department

Unilateral NMR of Activated Carbon

NMR Dynamics and Relaxation

Skin Damage Visualizer TiVi60 User Manual

Protein NMR. Part III. (let s start by reviewing some of the things we have learned already)

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

BMB/Bi/Ch 173 Winter 2018

Two Dimensional (2D) NMR Spectroscopy

For most practical applications, computing the

EXPERIMENT 8. NMR STUDY OF A KETO-ENOL EQUILIBRIUM CONSTANT

(PP) rrap3ht/c 61 PCBM (Fig. 2e) MEHPPV/C 61 PCBM (Fig. 2f) Supplementary Table (1) device/figure a HF (mt) J (mt) (CTE) 4 2 >1 0.

Basic principles of multidimensional NMR in solution

Spectral Broadening Mechanisms

Transcription:

LineShapeKin NMR Line Shape Analysis Software for Studies of Protein-Ligand Interaction Kinetics http://lineshapekin.net Spectral intensity Evgenii L. Kovrigin Department of Biochemistry, Medical College of Wisconsin 871 Watertown Plank Rd., Milwaukee, WI 53226 email: ekovrig@mcw.edu Frequency

Abstract 1 Interactions of proteins with their binding partners are characterized by the specific rates of association and dissociation (k on and k off ). Binding kinetics is traditionally studied using rapid mixing techniques, which require highly specialized equipment not generally available. Nuclear Magnetic Resonance spectroscopy is uniquely suited for extraction of the kinetic information under equilibrium physiological conditions using standard NMR probes and hardware. Recording a series of HSQC spectra while titrating protein with the ligand is routinely done to follow binding interactions with the site-specific resolution. Equilibrium binding constant, K d, is readily computed from the chemical shift changes of affected nuclei. Analysis of NMR line shapes, in turn, produces the kinetic constants, k on and k off. However, despite frequent use of HSQC titrations in protein-ligand interaction studies, interaction kinetic constants are rarely computed. Combination of equilibrium and kinetic information extracted from HSQC spectra is essential for understanding of the mechanisms of protein-ligand interactions. A user-friendly software package LineShapeKin has been developed to assist analysis of the peak line shapes in HSQC spectra and extract constants of association/dissociation kinetics. LineShapeKin software package integrates functionality of SPARKY NMR assignment program, Python programming language and MATLAB technical computing environment. Major features of the LineShapeKin package include: 1. Sparky extension that allows interactive extraction of the line shape data from a series of 2D spectra. 2. Matlab notebooks that perform data fitting and graphic presentation of the results. LineShapeKin allows fitting of the titration data on both per-residue basis (independent binding events) as well as globally (all resonances responding to the single binding event). Any subset of resonances may be fit as a group representing a binding sub-site. Akaike s Information Criterion is used to assess the likelihood of the alternative hypotheses. LineShapeKin is freely available to academic users from http://lineshapekin.net

LineShapeKin goals and capabilities 2 Goals Make NMR line shape analysis easily applicable to routine HSQC titration data: 1. Utilize popular spectral analysis software to do extraction of line shapes from 2D HSQC spectra (Sparky and NMRView). 2. Utilize high-level programming environment (MATLAB) to keep the code easy to understand and modify by the end user. Current capabilities LineShapeKin performs: 1. Interactive extraction of peak slices from 2D HSQC spectra. 2. Fitting of a titration series to obtain site-specific off-rate constants, k off and equilibrium constants, K d. 3. Fitting of the end-state peak position and ligand concentration (optional). 4. Statistical testing of models with different variable parameters. 5. Global fitting of a group of residues affected by the same binding event. 6. Testing of a likelihood of the individual vs. global binding models using Akaike s Information Criterion. Current site (computer) requirements 1. Linux/OS X/UNIX/Windows operating system. 2. Python, http://www.python.org 3. Sparky - NMR Assignment and Integration Software, http://www.cgl.ucsf.edu/home/sparky/ 4. MATLAB, Mathworks, http://www.mathworks.com/

Workflow overview 3 HSQC data collection Protein samples with increasing ligand concentration (C protein =const). Data processing (FFT, etc.) 2D planes C ligand /C protein 1.1 2.2...... N 2. SPARKY (LineShapeKin Python extension) Slice 2D peak for chosen residue to produce a series of 1D spectra corresponding to consecutive titration points. Export 1D line shapes to plain-text format files. Re-do slicing. MATLAB (LineShapeKin M-notebooks) Display a series of 1D spectra to access quality of the extracted line shapes and obtain initial estimates. Is the line shape looking good? NO YES

MATLAB (using LineShapeKin M-notebooks) 4.Re-do slicing. Adjust initial line width. or Fit the line shape data to McConnel s equations. Is the fit for this residue acceptable? NO YES Continue to the next peak. All done. Fit the line shapes of a group of residues to a global model using a single k off value as the fitting parameter. Analyze distribution of the off-rate constants among the residues of the protein. Are rate constants similar enough to propose that a group of residues responds to single binding event? Are the fits for individual residues acceptable? YES NO YES NO Discard global hypothesis. Test which of the two models better represents the data: (1) the model of independent binding events at individual sites versus (2) the model of a single binding event reported by multiple sites. Map the off-rate kinetic constants on the protein structure and correlate with known information on binding kinetics and thermodynamics. Publish your Nature paper.

Considerations on the dissociation constant, K d 5 LineShapeKin allows for fitting of K d values. However, this is the least accurate parameter obtained from NMR titrations. If possible, one should determine K d by an independent technique. For a simple binding equilibrium P + L PL, our ability to accurately measure dissociation constant depends upon both protein concentration in NMR tube and the value of dissociation constant. Conventional criterion, c-value is calculated as: c = C protein *n/k d, where n is reaction stoichiometry. Accurate determination of K d is possible when 1< c < 1. NMR titration experiments usually employ high protein concentrations. Therefore optimality of titration conditions depend on K d as (assume n=1) : K d c (1mM protein sample) c (.1 mm) 1 mm.1.1 1 mm 1.1 1 µm 1 1 1 µm 1 1 - measurable K d range shaded 1 µm 1 1 1 nm 1, 1 Essentially, low c-values require very high C ligand /C protein ratios to achieve complete conversion to PL form. On the other end of scale, high c-values require very small additions of ligand near equivalence point and very precise chemical shift measurements. D121A RNase A + 3 -CMP (red) K d = 75 +/- 1 µm WT D121A RNase A + 3 -CMP (black) K d = 2 +/- 3 µm D121A RNase A + ptppap K d = 1.4 +/- 1 µm The simulated derivative of a titration curve for different c-values illustrates these important issues: d[p]/dc L C ligand /C protein

Extraction of the line shapes from spectral data 6 1. Process HSQC datasets with exponential window function (to preserve natural lorentzian lineshape, EM in nmrpipe). Zero-fill sufficiently to have ca 15-2 data points per peak envelope. 2. Invoke custom Sparky extension for extraction of 1D slices (ee) and assign consecutive indices to titration points: 3. Slices are defined by dragging a rectangle around the peak. Then one of the dimensions of the rectangle is integrated (summed) resulting in a 1D slice of the peak. The program goes through all loaded spectra and performs the same integration. SPARKY MATLAB Integration of w 2 dimension Top portion of the MATLAB figure corresponds to the raw spectral data (blue circles). The program normalizes peak areas to unity (lower portion, green lines) and estimates starting parameters. Black and red traces show fitting of the first and the last titration points with the estimates for A and B state peak positions and their line widths. 2.5 x S123N H 1 : Raw data (blue) 16 xw 2 1.5 1.5.8.6.4.2 679 68 681 682 683 684 685 686 687 Frequency, Hz Normalized by integration; Parameter estimates: A (black), B (red) 679 68 681 682 683 684 685 686 687 Estimates: w = 6846.619 Hz R = 3.665 s 1 w =6811.52 Hz R = 28.853 s 1 A 2,A B 2,B Integration of w 1 dimension 2.5 x S123N H 2 : Raw data (blue) 16 xw 2 1.5 1.5 484 486 488 49 492 494 496 498 5 Frequency, Hz.4 Normalized by integration; Parameter estimates: A (black), B (red).3.2.1 484 486 488 49 492 494 496 498 5 Estimates: w A = 489.894 Hz R 2,A = 67.92 s 1 w B =4954.372 Hz R 2,B = 83.38 s 1

NMR Line Shape Fitting 7 Line shapes extracted via Sparky LineShapeKin extension are fit using LineShapeKin M-notebooks. Step 1. Individual fitting of different dimensions of the same peak. Here we set K d =1.4 µm and obtain: S123(ω 1 ) K off = 134 +- 1 s -1, and S123(ω 2 ) K off = 18 +- 2 s -1. slice along ω 1 2.5 x S123N H 1 : Raw data (blue) 16 xw 2 1.5 1.8.7.6 S123N H xw 1 single : exp. data (blue), fit results (red) Norm: 3.45e 3 1.3361e+2 +/ 8.9e+ 2.22614e+ +/ 2.7e 2.5.5 679 68 681 682 683 684 685 686 687 Frequency, Hz.4.8 Normalized by integration; Parameter estimates: A (black), B (red).3.6.2.4.2.1 679 68 681 682 683 684 685 686 687 Estimates: w A = 6846.619 Hz R 2,A = 3.665 s 1 w B =6811.52 Hz R 2,B = 28.853 s 1 679 68 681 682 683 684 685 686 687 2.5 x S123N H 2 : Raw data (blue) 16 xw 2 1.5 1.15 S123N H xw 2 single : exp. data (blue), fit results (red) Norm: 1.42e 4 1.77298e+2 +/ 1.6e+1 1.176e+ +/ 1.4e 2.5.1 slice along ω 2.4 Normalized by integration; Parameter estimates: A (black), B (red) 484 486 488 49 492 494 496 498 5 Frequency, Hz.3.5.2.1 484 486 488 49 492 494 496 498 5 Estimates: w A = 489.894 Hz R 2,A = 67.92 s 1 w B =4954.372 Hz R 2,B = 83.38 s 1 484 486 488 49 492 494 496 498 5 Step 2. It is likely that the amide nitrogen and proton of S123 experience the same chemical exchange process caused by ligand binding. We may fit them together and obtain the off-rate constant with better accuracy: K off = 138 +- 7 s -1 S123N H xw 1 global : exp. data (blue), fit results (red) S123N H xw 2 global : exp. data (blue), fit results (red).35.3.25 Norm: 8.5e 4 1.37688e+2 +/ 6.9e+ 9.99938e 1 +/ 8.8e 3.15 Norm: 8.5e 4 1.37688e+2 +/ 6.9e+ 9.99938e 1 +/ 8.8e 3.1.2.15.5.1.5 679 68 681 682 683 684 685 686 687 484 486 488 49 492 494 496 498 5

Global fitting mode 8 High resolution of HSQC spectra allows for calculation of off-rate constants for every individual site in a protein. Various molecular scenarios are possible ranging from many individual binding interactions to multiple nuclei reporting on the same binding events. LineShapeKin allows analysis of independent binding events as well as global binding processes. Step 1. We select a number of residues, which we propose report on the same binding event. First we fit them individually to establish variation in the individually fit constants. Individual fitting of these residues results in: 87 ± 8 s-1 78 ± 13 s-1 16 H12N Hxw2 indiv : exp. data (blue), fit results (red) 3 x 1 14 12 Y92N Hxw1 indiv : exp. data (blue), fit results (red).3 Norm: 1.99e 4 8.7279e+1 +/ 7.9e+.25 1.183e+ +/ 1.5e 2 1 134 ± 1 s-1 Norm: 6.71e 4 7.85256e+1 +/ 1.3e+1 1.18e+ +/ 1.3e 2.2 S123N H 1 indiv : exp. data (blue), fit results (red) xw.35.3.25 135 ± 23 s-1 T7N H 1 indiv : exp. data (blue), fit results (red) xw.35 Norm: 7.49e 4 1.34887e+2 +/.3 9.4e+ 1.8e+ +/.25 1.3e 2 Norm: 2.35e 3 1.3532e+2 +/ 2.2e+1 1.58e+ +/ 2.e 2.2.2 8.15.15 6.15 4.1.1.1.5 2.5.5 2 455 46 465 47 475 48 8 81 82 83 84 85 86 87 68 681 682 683 684 685 686.5 688 659 687 66 661 662 663 664 665 666 667 668 669 Step 2. Now we will treat them as a group that responds to the same binding event and fit them globally to a single Koff. Result is 127 ± 8 s-1: 16 3 x 1 H12N Hxw2 global : exp. data (blue), fit results (red) Norm: 4.4e 3 1.25956e+2 +/ 7.2e+ 9.98613e 1 +/ 7.6e 3 14 12 1 Y92N Hxw1 global : exp. data (blue), fit results (red).3 Norm: 4.4e 3 1.25956e+2 +/ 7.2e+ 9.98613e 1 +/ 7.6e 3.25.2 S123N Hxw1 global : exp. data (blue), fit results (red).35 Norm: 4.4e 3 1.25956e+2 +/ 7.2e+ 9.98613e 1 +/ 7.6e 3.3.25 T7N Hxw1 global : exp. data (blue), fit results (red).35 Norm: 4.4e 3 1.25956e+2 +/ 7.2e+ 9.98613e 1 +/ 7.6e 3.3.25.2.2 8.15.15 6.15 4.1.1.1.5 2.5.5 2 455 46 465 47 475 48 8 81 82 83 84 85 86 87 68 681 682 683 684 685 686 687 688.5 659 66 661 662 663 664 665 666 667 668 Key issue: we need to judge whether the global binding model or a model involving individual different binding interactions is more correctly describing this data? 669

Hypothesis testing using Akaike Information Criterion LineShapeKin uses Akaike s Information Criterion (AIC) to determine which model is more likely to be correct and how much more likely. Unlike the F test, Akaike's method does not require models be 'nested', they may be totally unrelated. Similarly to Fisher hypothesis testing, here we assess whether using a model with a larger number of fitting parameters brings us more than proportional reduction in deviation of fitted curve from experimental values. Akaike's Information Criterion is calculated as: 9 AIC =Nln SS N + 2K (1) where N is a number of data points, SS is sum of the squares of the residuals from fitting and K is a number of fitting parameters plus one. N must be significantly greater than K. With smaller data sizes, when N is not much greater than K (but, at least, 2 greater), a secondorder (corrected) Akaike's information criterion is more accurate: AIC c = AIC+ 2K(K +1) N K 1 (2) AICSc-s themselves are not informative, it is the absolute difference between the two AICc that defines how much more likely one model is correct relatively to another. We use evidence ratio calculated as Evidence ratio = probability that model 1 is correct probability that model 2 is correct = 1 e.5( AICc), (3) where AICc= AIC 2 AIC 1 Applying AICc to the previously shown data yields: 1. Individual fitting of the proton and nitrogen dimensions of S123 versus simultaneous fitting to a single off-rate constant: 1st Model (global fitting), AICc=-194 2nd Model (individual fitting), AICc=-187 Evidence ratio = 26 Hypothesis that proton and nitrogen of S123 report on the same binding event is 26x more likely correct than an alternative one that states they report on separate exchange processes. 2. Four residues in RNase responding to a binding of the specific inhibitor ptppap: 1st Model (global fitting), AICc=-361 2nd Model (individual fitting), AICc=-344 Evidence ratio = 41 In accord with near-perfect fitting of data with the global model, these residues report on the same binding process (global hypothesis is 41x more likely to be correct).

Summary 1 1. LineShapeKin is an easy-to-use NMR line shape analysis tool for studies of binding equilibria in proteins. The custom extension for SPARKY spectral analysis software allows interactive extraction of line shapes from HSQC titration series. MATLAB-based notebooks perform fitting, display and statistical analysis of the data on reaction kinetics. The package is fully modifiable and easily extendible to the more complicated binding models. 2. LineShapeKin is interfaced with the popular spectral analysis software. Current version of the package utilizes SPARKY for spectral data handling and display. Future versions of LineShapeKin will interface to NMRView with the goal to make kinetic information via NMR line shape analysis easily accessible for biomolecular NMR community. 3. Spectral overlap remains a significant hurdle in NMR data analysis. When analyzing titration series this problem is made worse by simultaneous shifting of a number of neighboring peaks. LineShapeKin allows to define individual extraction regions for HSQC spectra corresponding to different titration points thus increasing number of resonances amenable to line shape analysis. 4. One of the most important issues in science is availability of free academic software. Currently, LineShapeKin relies on efficient MATLAB computing environment to develop a full featured code while keeping its scientific content not obscured by details of programming implementation. However, using MATLAB to run LineShapeKin requires licensing from MathWorks, Inc. It is possible to develop a compiled version of LineShapeKin to relieve end user from necessity of obtaining and licensing MATLAB. Acknowledgements Evgenii Kovrigin acknowledges Dr. Roger Cole, Dr. James Kempf and Dr. Patrick Loria for contributions to the code at an early stage of the package development, Dr. Linda Nicholson and Soumya De for testing the package and helpful feedback.

Brief theoretical background on the NMR line shape analysis Consider the elementary reaction: P + L The equation of motion for the transverse magnetization in the rotating frame will be: dm + (t) = (i" # R + K)M + (t), dt where M is transverse magnetization vector for the protein spins in free and bound forms, " is a diagonal matrix of their Larmor frequencies in the rotating frame, R - diagonal matrix of their transverse relaxation rates, and K is the rate matrix for the reaction. The rate matrix K is # K = "k 1 k 2 & % (, $ k 1 "k 2 ' where k 2 = k off, while k 1 is a pseudo-first order rate constant k on! incorporating equilibrium concentration of a ligand: k 1 = k on [ L] Intensity of the frequency domain spectrum is proportional to a total magnetization and may be obtained for the steady-state reaction conditions from: & & p I(") #Re [ 1,1 ](i($ %") % R + K) %1 P ) ( ( +, ' ' p PL * where p P and p PL are equilibrium mole fractions of free and complexed protein. One can readily compute their values for every ligand concentration from knowledge of the equilibrium binding constant. For more details, refer to [1-3]. 1. Binsch, G., Top. Stereochem., 1968. 3: p. 97-191. 2. McConnell, H., J. Chem. Phys., 1958. 28: p. 43-431. 3. Cavanaugh J, F.W., Palmer III AG, Skelton NJ, Protein NMR Spectroscopy: Principles and Practice. 26: Academic Press. 587. k off PL