GUARDD: user-friendly MATLAB software for rigorous analysis of CPMG RD NMR data

Kleckner, Ian R.; Foster, Mark P.

doi:10.1007/s10858-011-9589-y

GUARDD: user-friendly MATLAB software for rigorous analysis of CPMG RD NMR data

Article
Published: 11 December 2011

Volume 52, pages 11–22, (2012)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Journal of Biomolecular NMR Aims and scope Submit manuscript

GUARDD: user-friendly MATLAB software for rigorous analysis of CPMG RD NMR data

Download PDF

Ian R. Kleckner¹ &
Mark P. Foster²

1284 Accesses
37 Citations
6 Altmetric
Explore all metrics

Abstract

Molecular dynamics are essential for life, and nuclear magnetic resonance (NMR) spectroscopy has been used extensively to characterize these phenomena since the 1950s. For the past 15 years, the Carr-Purcell Meiboom-Gill relaxation dispersion (CPMG RD) NMR experiment has afforded advanced NMR labs access to kinetic, thermodynamic, and structural details of protein and RNA dynamics in the crucial μs-ms time window. However, analysis of RD data is challenging because datasets are often large and require many non-linear fitting parameters, thereby confounding assessment of accuracy. Moreover, novice CPMG experimentalists face an additional barrier because current software options lack an intuitive user interface and extensive documentation. Hence, we present the open-source software package GUARDD (Graphical User-friendly Analysis of Relaxation Dispersion Data), which is designed to organize, automate, and enhance the analytical procedures which operate on CPMG RD data (http://code.google.com/p/guardd/). This MATLAB-based program includes a graphical user interface, permits global fitting to multi-field, multi-temperature, multi-coherence data, and implements χ ²-mapping procedures, via grid-search and Monte Carlo methods, to enhance and assess fitting accuracy. The presentation features allow users to seamlessly traverse the large amount of results, and the RD Simulator feature can help design future experiments as well as serve as a teaching tool for those unfamiliar with RD phenomena. Based on these innovative features, we expect that GUARDD will fill a well-defined gap in service of the RD NMR community.

A suite of ¹⁹F based relaxation dispersion experiments to assess biomolecular motions

Article Open access 30 September 2020

NMR Relaxation Dispersion Methods for the Structural and Dynamic Analysis of Quickly Interconverting, Low-Populated Conformational Substates

Protein Dynamics Revealed by CPMG Dispersion

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Macromolecular dynamics are essential for life, and nuclear magnetic resonance (NMR) spectroscopy has been used extensively to characterize these phenomena since the 1950s (Boehr et al. 2006; Kleckner and Foster 2011; McConnell 1958). For the past 15 years, the Carr-Purcell Meiboom-Gill relaxation dispersion (CPMG RD) NMR experiment has afforded advanced NMR labs access to kinetic, thermodynamic, and structural details of molecular dynamics in the crucial μs-ms time window for proteins (Kempf and Loria 2003; Kleckner and Foster 2011; Loria et al. 2008; Palmer et al. 2005), and more recently for RNA (Johnson and Hoogstraten 2008; Kloiber et al. 2011; Ren and Ghose 2011). CPMG RD enables researchers to extract information on certain dynamic processes by measuring the effective transverse relaxation rate of the NMR signal, $ R_{2}^{Eff} $, in response to the application of refocusing pulses applied at frequency ν _CPMG. Typically, the dynamic process is interpreted using the simplest description of exchange, the two-state A ↔ B model, which describes the RD curve using 4–5 parameters (Fig. 1).

Analysis of CPMG RD data is challenged by (1) the non-linear nature of its mathematical formulation, (2) the multitude of parameters required for fitting compared to its relatively featureless appearance, (3) the reality that an RD curve is often under-sampled with noisy data (to spare expensive NMR time), and (4) the fact that fitting accuracy is not validated by mere visual inspection of the fit to the data, nor by merely minimizing the fitted residuals, because some curves are fit equally well with divergent combinations of parameters (Hansen et al. 2008b; Ishima and Torchia 2005, 2006; Kovrigin et al. 2006; Myint and Ishima 2009; Neudecker et al. 2006). To address these limitations, experimentalists often acquire large datasets encompassing multiple magnetic field strengths (Kovrigin et al. 2006) and/or temperatures (Boehr et al. 2010; Mulder et al. 2001; Vallurupalli and Kay 2006) and/or pressures (Korzhnev et al. 2006) and/or quantum coherences (QCs) (Korzhnev et al. 2005), and produce custom software to implement a global analysis assuming two-state exchange (Beach et al. 2005; Boehr et al. 2010; Grey et al. 2003; Hansen et al. 2008a, b; Henzler-Wildman et al. 2007; Jee et al. 2008; Kovrigin, June 30, 2011; Mulder et al. 2002; Namanja et al. 2010) or, less often, three-state exchange (Korzhnev et al. 2007, 2004b; Neudecker et al. 2006). Although these software tools are generally available from their respective authors, their complexity and specificity may overwhelm a novice spectroscopist, and more intuitive software has only begun to emerge (Bieri and Gooley 2011).

Hence, we present the software package GUARDD (Graphical User-friendly Analysis of Relaxation Dispersion Data), which is designed to facilitate scientific investigations by organizing, automating, and enhancing the analytical processes related to CPMG RD data, especially for novice spectroscopists. In addition to its user-friendly interface, GUARDD contains powerful features that engage the user in the fitting process and thus clarify assessment of fitting accuracy. Advanced labs can benefit from using GUARDD to easily manage data and display results, to plan experiments using the comprehensive RD Simulator, and to extend customized functionality, permitted via an open-source code license. Importantly, we have reduced the activation barrier for its use by establishing tutorials, extensive documentation, and an online community of support (http://code.google.com/p/guardd/).

GUARDD software overview

GUARDD is designed for ease of use by implementing a graphical interface for key functions like managing data, performing and assessing fitting tasks, organizing and displaying results, and simulating RD data for experimental design and/or instruction. In the interest of space, only certain features are discussed, whereas comprehensive documentation and a user-friendly tutorial can be obtained via the GUARDD website (http://code.google.com/p/guardd/).

Data: hierarchical data objects are easily managed via graphical interface

Because analyzing RD data in aggregate is necessary for an accurate description of molecular dynamics, GUARDD implements a hierarchical data management system which accommodates a variety of hypothetical two-state exchange models (Fig. 2). The lowest tier is an RD Curve, which describes a series of relaxation rates and uncertainties, each paired with a value of ν _CPMG, for a unique NMR signal/dynamic probe (i.e., $ R_{2}^{Eff} (v_{CPMG} ) $ and $ \sigma \left( {R_{2}^{Eff} (v_{CPMG} )} \right) $, where σ designates uncertainty, for one or more values of ν _CPMG). Each Curve is typically loaded with Curves from other signals as a Dataset, which specifies the experimental conditions: magnetic field strength B ₀, temperature T, total time of the CPMG period T _CPMG, and the nucleus ^AX and QC probed during that time (SQ = Single Quantum, MQ = Multiple Quantum). For convenience, Curves are loaded using a plain-text file that can easily be prepared using a spreadsheet program. For analysis, each Curve is designated its own relaxation rate, $ R_{2}^{0} $. At the middle tier, one or more Curves are aggregated by signal assignment into a Curveset, which designates the chemical shift differences, |Δω _H| and |Δω _X|, for that NMR probe. For each Curve, the QC determines the effect of the Curveset’s chemical shift differences (Korzhnev et al. 2005), such that MQ Curves are sensitive to |Δω _H| + |Δω _X| whereas SQ Curves are only sensitive to |Δω _X| and therefore fix Δω _H to zero. At the highest tier, one or more Curvesets are aggregated into a Group, which designates signals that share the kinetic parameters P _A and k _ex at each temperature. Because each Curveset (and therefore Curve) within a Group is constrained to exhibit the same exchange kinetics, the highest tier is useful for segregating independent modes of exchange in the molecule (e.g., domains). Importantly, although GUARDD can auto-generate Curvesets and Groups, the user has complete control over how the data are organized, and is encouraged to create many combinations of Curvesets and Groups to test the validity of different descriptions of molecular dynamics. These essential tasks are easily accomplished using the GUARDD Data Manager (Fig. 3).

Analysis: sharing fit parameters via two-state exchange at multiple temperatures

In order to fit a wide range of RD data, GUARDD uses the MQ Carver-Richards-Jones all-timescales dispersion equation (Korzhnev et al. 2004a). To accommodate multi-coherence data, MQ Curves allow both |Δω _H| and |Δω _X| to be optimized during fitting, whereas |Δω _H| is fixed to zero for SQ Curves. To accommodate multi-field data, the Curve-specific rad/s value of Δω is obtained from its parent Curveset using $ \Updelta \omega_{Curve}^{{({\text{rad/s}})}} = 2\pi \gamma_{X} B_{0} \Updelta \omega_{Curveset}^{{({\text{ppm}})}} $, where γ _X is the gyromagnetic ratio of nucleus X. To accommodate multi-temperature data, there are two fitting options. (A) Designate a Group-specific k _ex and P _A for each of the N _T temperatures, and optimize these 2N _T parameters during fitting. After fitting, additional details of the dynamic process can be obtained from activation energies and exchange enthalpies (Boehr et al. 2010; Vallurupalli and Kay 2006) resulting from Arrhenius and van’t Hoff analyses, respectively (Kleckner and Foster 2011; Winzor and Jackson 2006). Briefly, Arrhenius analysis requires both P _A and k _ex to quantify the temperature-dependence of exchange rate and van’t Hoff analysis requires only P _A to quantify temperature-dependence of exchange populations. (B) The second fitting option uses the Arrhenius and van’t Hoff relations as part of the fitting process by optimizing four Group-specific parameters: k _ex and P _A at one temperature as well as activation energy E _AB = E _B − E _A and exchange enthalpy ΔΗ = H _B − H _A, which subsequently determine k _ex and P _A at any temperature. These options are useful because when data are available at more than two temperatures, option (B) requires fewer parameters than option (A) (4 vs. 2N _T), and therefore may be more appropriate for interpreting the RD dataset.

Analysis: grid search and Monte Carlo χ² maps enhance and assess fitting accuracy

The goal of the fitting procedure is to obtain a set of parameters that accurately describes the trends in the data, which is typically pursued by minimizing the difference between the observed data and that calculated from the fit (Bevington and Robinson 2003; Motulsky and Christopoulos 2003). To accomplish this task, GUARDD invokes the MATLAB fmincon function with the interior-point algorithm (Byrd et al. 2000; though other options are available) to iteratively alter the fitting parameters $ {\vec{p}} $ to minimize the target function $ {\chi^{2} = \sum\limits_{All \, Curves} {\left( {\sum\limits_{{All \, \nu_{CPMG} }} {\left( {\frac{{R_{2,Eff}^{Obs} (\nu_{CPMG} ) - R_{2,Eff}^{Calc} (\nu_{CPMG} ,\vec{p})}}{{\sigma \left( {R_{2,Eff}^{Obs} (\nu_{CPMG} )} \right)}}} \right)^{2} } } \right)} } $, where $ {R_{2,Eff}^{Obs} (\nu_{CPMG} )} $ designates an observed data point with error $ {\sigma \left( {R_{2,Eff}^{Obs} (\nu_{CPMG} )} \right)} $ on a single Curve acquired with conditions B ₀, T, QC, ^AX, and T _CPMG; $ {R_{2,Eff}^{Calc} (\nu_{CPMG} ,\vec{p})} $ designates the calculated point using the Curve conditions and the independent fitting parameters $ {\vec{p}} $ for the Group: P _A and k _ex for each temperature, |Δω _H| and |Δω _X| for each Curveset, and $ {R_{2}^{0} } $ for each Curve. Unfortunately, the nonlinear nature of RD phenomena makes the relationship between χ ² and $ {\vec{p}} $ difficult to predict, and therefore optimization algorithms often “fail” by finding a local minimum of χ ², which is sensitive to initial fitting conditions, instead of the intended global minimum (Fig. 4a). Hence, to assess the sensitivity of the final fit to initial conditions, GUARDD implements a grid search over a six-dimensional (6D) parameter space encompassing the most sensitive parameters: |Δω _H|, |Δω _X|, P _A, k _ex, E _AB, and ΔΗ. Each point on this 6D grid is used to set the initial conditions for subsequent optimization. Using GUARDD’s Fit RD window (Fig. 5), the user can specify the limits and number of steps for each dimension of the grid, the selection of which depends Group size, knowledge of the fitting solution, and use of the Arrhenius constraint which enables the E _AB and ΔH dimensions (see section "Analysis: sharing fit parameters", option (B)). The completed grid of N _grid points contains N _grid sets of optimized parameter values, with each set yielding one value of χ ². Because the grid for each Group is typically large (10–500 points in each of 5–50 dimensions), GUARDD’s Chi2 Map window is designed to make the otherwise-cumbersome navigation relatively easy (Fig. 4, Bottom).

When seeking the best fit for the Group, one must often select one χ ² minimum out of several candidates (Fig. 4c). This issue is exacerbated by noisy RD data and/or small R _ex resulting from exchange near a detection limit of the CPMG RD method. To this end, GUARDD is designed to accommodate three general strategies at the discretion of the user. (1) Select one of the fits, but mark the ill-defined parameters as “Not OK,” thus preventing their mis-interpretation in subsequent analyses. For example, in the fast-exchange regime (k _ex ≫ Δν), k _ex is often valid despite that P _A and Δω are not independently defined, but are instead convoluted into the quantity Φ _ex = P _A P _BΔω ² (Ishima and Torchia 1999; Luz and Meiboom 1963). (2) Alter the Group and re-fit. The user may remove noisy Curves and/or add additional Curvesets to help constrain the values of k _ex and P _A. (3) Acquire more data and re-fit the new Group. The RD Simulator can help determine optimal conditions of temperature, magnetic field strength, and/or QC for efficient use of spectrometer time (see section “Optimization and education”). Importantly, NMR probes that sample more than two states may not fit well to the two-state exchange model used by GUARDD, and hence relatively poor fits (large χ ²) may reflect this limitation.

As mentioned previously, RD data are often noisy, and this can further widen the variety of fitting solutions surrounding the global minimum of χ ². Hence, GUARDD implements a well-established Monte Carlo (MC) procedure to bootstrap from the residuals (Bevington and Robinson 2003; Motulsky and Christopoulos 2003). For a given Group, the MC algorithm generates many synthetic Groups, each randomly differing in $ {R_{2}^{Eff} } $ values from the original Group, yet still within their noise, and fits them individually to assess the sensitivity of the final fit to noise in the data. Much like the results of the grid search, the MC results are often large (100 s of fits with 5–50 parameters each), but can be easily navigated using the Chi2 Map window.

The computational time required for these fitting tasks depends on the number of Curves in the Group, where 1, 3, or 10 Curves requires about 5, 15, or 60 s per step using a desktop PC with a 2.6 GHz CPU and 3 Gb of RAM running MATLAB R2011a. Because these tasks are repeated hundreds of times for tens of Groups, GUARDD includes a Batch Task function which allows lengthy computations to be queued for consecutive processing, while incremental progress is automatically saved to the hard drive.

Optimization and education: RD Simulator explores the nature of RD phenomena

GUARDD includes an interactive RD Simulator that models multi-temperature, multi-coherence, multi-field RD data (Fig. 6). The related Kinetic Simulator models the temperature-dependence of population P _A and rates k _ex, k _A, and k _B for two-state exchange, provided enthalpy change ΔH and activation energy E _AB. These features are designed to serve three important functions.

First, they are helpful for selecting appropriate conditions for future experiments, thus optimizing the use of expensive NMR time. For example, if RD data are already available at two temperatures, the simulator can determine what third temperature yields the largest dispersions, which often yield the most accurate and informative fits used to describe the molecular dynamics.

Second, provided conditions for a future experiment, the simulator can identify the quantity and quality of new data required for an informative fit. This is important because if noisy Curves are added to a Group, the fit quality can be compromised. This assessment is performed by creating Groups of original data plus simulated Curves, each with different ν _CPMG points and signal to noise ratios (e.g., 10 points at 10% noise, 15 points at 10% noise, 15 points at 5% noise). By comparing fits of these new Groups, the investigator can ensure that a sufficient number spectra and scans are acquired, thus avoiding the “noisy new data” pitfall.

Finally, these features are excellent teaching tools for learning basic principles of RD. Simply by editing fields in a table, the user can explore how the Curve reports on Δω, P _A, k _ex, and how it is influenced by B ₀. Similarly, two-state exchange kinetics can be monitored in response to temperature, ΔH, and E _AB. These tools can be used to create custom figures as teaching aids, or can engage a student directly with a “hands-on” approach.

Presentation: massive datasets are easy to visualize for analysis and dissemination

Because analysis of RD data typically yields many results, organization is challenging using a spreadsheet, for example. Hence, GUARDD is designed with versatile presentation features that allow the user to easily explore their results. In the Display RD window (Fig. 7, Top), the user can view the Curves from a Group with any of its candidate fits and residuals. Next, a table of results can be dynamically generated based on the user’s selection criteria from an arbitrary set of RD parameters and Curves (e.g., display only P _A and k _ex at one temperature, instead of all RD parameters for all Curves for all Groups) (Fig. 7, Bottom). This customized table can be exported to plain-text for publication or external analysis. Finally, the Group Display window allows the user to seamlessly prepare scatter plots and histograms of arbitrary RD parameters (Fig. 8). This is helpful for identifying outlying results, comparing candidate fits, and producing publication-quality figures. Moreover, any figures prepared by GUARDD can easily be altered using MATLAB’s figure editor (e.g., adding and moving text, specifying symbols, colors, and font sizes).

Limitations and comparison to alternative software

Although GUARDD inherits desirable features from the MATLAB framework, this dependency imposes certain limitations. First, because MATLAB is an interpreted language, and because its graphical interface uses Java, GUARDD is slower than if it were coded using C or Python, for example. Although this lag is not prohibitive, it can be experienced while drawing the display, while reading or writing large session files, or while fitting data. Interested users may partially alleviate this drawback by using the MATLAB compiler, which generates a single executable that is not subject to runtime interpretation, and/or by programming consecutive fitting tasks to run in parallel. Second, malfunctions in MATLAB may hinder functionality of GUARDD (however, enhancements to MATLAB may imbue enhancements to GUARDD). Third, although MATLAB is a convenient cross-platform solution for dissemination of software, the user must have access to MATLAB and the MATLAB Optimization Toolbox (i.e., GUARDD is not a standalone program).

There are also limitations to the current features of GUARDD. First, the selection of exchange model is restricted to two-state using the all-timescales MQ Carver-Richards-Jones formulation (Korzhnev et al. 2004a). Although these equations are very general, certain cases may benefit from simplifications assuming skewed populations (P _B < P _A; Ishima and Torchia 1999) or assuming fast-exchange (k _ex > Δν; Kempf and Loria 2003; Kleckner and Foster 2011; Luz and Meiboom 1963). Also, more advanced labs may feel limited by the lack of analytical options like three-state exchange models (Neudecker et al. 2006), ZQ or DQ coherences (Korzhnev et al. 2005), pressure-dependence (Korzhnev et al. 2006), Anti-TROSY/TROSY analysis (Hansen et al. 2008a), or temperature-dependence using transition state theory (Boehr et al. 2010; Korzhnev et al. 2004b).

Regarding fitting accuracy, we expect GUARDD will match softwares that implement a grid search (Grey et al. 2003; Ishima and Torchia 2005; Kovrigin et al. 2006; Namanja et al. 2010), but will surpass ones that do not (Bieri and Gooley 2011), due to the ruggedness of RD χ ² maps (Ishima and Torchia 2005; Kovrigin et al. 2006) and the effectiveness of the grid search procedure (Bevington and Robinson 2003; Motulsky and Christopoulos 2003). Regarding error estimation, the MC bootstrap procedure used by GUARDD is very effective, but not always the most computationally efficient option. For example, depending on the size of the dataset, the jackknife approach (Choy et al. 2005; Mosteller and Tukey 1968, 1977) or the covariance matrix method (Korzhnev et al. 2004b) may be superior, but GUARDD does not currently implement those options.

As detailed in prior sections, GUARDD is distinguished from other RD software for its user-friendly graphical interface, versatile data management and presentation features, and comprehensive simulator. These qualities are important because they fill a well-defined void in the domain of RD analysis software. Furthermore, because our goal is to enhance the analysis of CPMG RD data, limitations to GUARDD can be addressed in future updates.

Discussion and conclusions

Because CPMG RD is widely used, and because current analytical software options are limited, we expect GUARDD will be of broad interest to the NMR community. Especially for novice spectroscopists, the user-friendly interface, tutorials, organizational features, and RD Simulator will enhance and accelerate both knowledge of RD phenomena and analysis of their data. For experienced spectroscopists, the innovative display features will allow them to navigate results with ease, and the RD Simulator will help them plan future experiments with quantitative precision. Although GUARDD can accomplish important analytical procedures from start to finish, we encourage users to provide feedback, to request additional functionality, and to edit and share the source code, as we continue to serve the needs of RD experimentalists.

Acknowledgments

This work is funded by National Institutes of Health grant R01GM077234. The authors are thankful to investigators who shared their RD data, including Z. Wu, P. Vallurupali, F. Mulder, and D. Korzhnev. The authors are thankful for developmental feedback from all Foster Lab members, as well as E. Kovrigin, and D. Kern. Special thanks to A. Simmons for careful review of this manuscript.

Materials and methods

Alternative software for fitting CPMG RD NMR data

Although many labs will supply CPMG RD analysis software upon request, some is already publicly accessible: CPMG Fit (Palmer Lab), http://biochemistry.hs.columbia.edu/labs/palmer/software/cpmgfit.html; CATIA from F. Hansen (Kay Lab), http://pound.med.utoronto.ca/~flemming/catia/; NESSY from M. Bieri (Gooley Lab), http://home.gna.org/nessy/.

Extracting RD data from signal intensity

GUARDD can receive RD Curve data as NMR signal intensities, which is used to calculate relaxation rates $ {R_{2}^{Eff} = - \ln (I(\nu_{CPMG} )/I_{0} )/T_{CPMG} } $, where I(ν _CPMG) is the signal intensity in the 2D spectrum acquired with refocusing frequency ν _CPMG, I ₀ is the reference signal intensity obtained in the spectrum with no refocusing block, and T _CPMG is the duration of the refocusing block. Errors in intensities $ {\sigma (R_{2}^{Eff} )} $are estimated via standard deviation from repeat measures of I(ν _CPMG).

Modeling MQ relaxation dispersion

The fitting equations for MQ dispersions, which simplify to SQ dispersions provided Δω _H = 0, are as follows using δ = 1/(4ν _CPMG) and n = T _CPMG ν _CPMG (Korzhnev et al. 2004a):

$$ {R_{2}^{Eff} \left( {\frac{1}{2\delta }} \right) = \Re (\lambda_{1} ) - \frac{\ln (Q)}{4n\delta }} $$

(1a)

$$ {\lambda_{1} = R_{2}^{0} + \frac{1}{2}\left( {k_{ex} - \left( {\frac{1}{2\delta }} \right)\cosh^{ - 1} (D_{ + } \cosh (\eta_{ + } ) - D_{ - } \cos (\eta_{ - } ))} \right)} $$

(1b)

$$ {D_{ \pm } = \frac{1}{2}\left( {\frac{{\Uppsi + 2\Updelta \omega_{X}^{2} }}{{\left( {\Uppsi^{2} + \zeta^{2} } \right)^{1/2} }} \pm 1} \right)} $$

(1c)

$$ {\eta_{ \pm } = \sqrt 2 \delta ((\Uppsi^{2} + \zeta^{2} )^{1/2} \pm \Uppsi )^{1/2} } $$

(1d)

$$ {\Uppsi = (i\Updelta \omega_{H} + (P_{A} - P_{B} )k_{ex} )^{2} - \Updelta \omega_{C}^{2} + 4P_{A} P_{B} k_{ex}^{2} } $$

(1e)

$$ {\zeta = - 2\Updelta \omega_{C} (i\Updelta \omega_{H} + (P_{A}- P_{B} )k_{ex} )} $$

(1f)

$$ {Q = \Re \left( {1 - m_{D}^{2} + m_{D} m_{Z} - m_{Z}^{2} + \left( {\frac{{m_{D} + m_{Z} }}{2}} \right)\left( {\frac{{P_{B} }}{{P_{A} }}} \right)^{1/2} } \right)} $$

(1g)

$$ {m_{D} = \frac{{ik_{ex} (P_{A} P_{B} )^{1/2} }}{{d_{ + } z_{ +} }}\left( {z_{ + } + 2\Updelta \omega_{X} \left( {\frac{{\sin(z_{ + } \delta )}}{{\sin ((d_{ + } + z_{ + } )\delta )}}}\right)} \right)} $$

(1h)

$$ {m_{Z} = \frac{{ik_{ex} (P_{A} P_{B} )^{1/2} }}{{d_{ - } z_{ -} }}\left( {d_{ - } - 2\Updelta \omega_{X} \left( {\frac{{\sin(d_{ - } \delta )}}{{\sin ((d_{ - } + z_{ - } )\delta )}}}\right)} \right)} $$

(1i)

$$ {d_{ \pm } = (\Updelta \omega_{H} + \Updelta \omega_{X} ) \pm ik_{ex} } $$

(1j)

$$ {z_{ \pm } = (\Updelta \omega_{H} - \Updelta \omega_{X} ) \pm ik_{ex} } $$

(1k)

Exchange broadening

The exchange broadening R _ex quantifies the height of the dispersion curve and is estimated from the fit as $ {R_{ex} \approx R_{2,Eff}^{Fit} (\nu_{CPMG} \approx 0{\text{Hz}}) - R_{2,Eff}^{Fit} (\nu_{CPMG} \approx 10^{4} {\text{Hz}})} $.

Scaling factor for chemical exchange timescale

The scaling factor α quantifies the timescale of chemical exchange, and is calculated α = d(ln(R _ex))/d(ln(Δω)) (Millet et al. 2000).

Temperature-dependence of exchange kinetics

Provided reasonable fitted exchange rates at multiple temperatures, Arrhenius and van’t Hoff analyses are performed (Winzor and Jackson 2006). Arrhenius analysis requires both P _A and k _ex to quantify temperature dependence of exchange rate via $ {\ln (k) = \ln (P) + \left( {\frac{ - E}{R}} \right)\left( \frac{1}{T} \right)} $ where k = k _A = (1 − P _A)k _ex (or k _B = P _A k _ex) = kinetic rate of exchange from A → B (or B → A), P = P _AB (or P = P _BA) pre-exponential rate, the exchange rate from A → B (or B → A) at infinite temperature, E = E _AB (or E = E _BA) = activation energy (≈enthalpy) required to exchange from A → B (or B → A), R = gas constant, and T = absolute temperature. van’t Hoff analysis only requires P _A to quantify temperature dependence of exchange populations via $ {\ln (K) = \ln \left( {\frac{{1 - P_{A} }}{{P_{A} }}} \right) = \frac{\Updelta S}{R} + \left( {\frac{ - \Updelta H}{R}} \right)\left( \frac{1}{T} \right)} $where K = (1 − P _A)/P _A = k _A/k _B = equilibrium constant for exchange, ΔS = system entropy change from A → B, ΔH = system enthalpy change from A → B. Errors in Arrhenius and van’t Hoff analyses are estimated from errors from MATLAB’s fit routine (provided data at more than two temperatures), or from propagation of relative error from the fitting variables (when limited to data at only two temperatures).

Kinetic simulator

The two-state kinetic simulator itemizes all kinetic parameters of interest: ΔH, ΔS, E _AB, P _AB, E _BA, P _BA, k _ex(T), P _A(T), k _A(T), and k _B(T) where T is an arbitrary temperature, provided ΔH, E _AB, $ {k_{ex}^{0} = k_{ex} (T_{0} )} $, $ {P_{A}^{0} = P_{A} (T_{0} )} $, and T ₀. By extension of the Arrhenius and van’t Hoff relations above, this is accomplished as follows:

Using ΔH and P _A(T ₀), the van’t Hoff relation yields ΔS

$$ {\Updelta S = R\ln \left( {\frac{{1 - P_{A}^{0} }}{{P_{A}^{0} }}} \right) + \frac{\Updelta H}{{T_{0} }}} $$

which, with ΔH, determines P _A at any temperature via van’t Hoff

$$ {P_{A} (T) = \left( {1 + \exp \left( {\frac{\Updelta S}{R} - \frac{\Updelta H}{RT}} \right)} \right)^{ - 1} } .$$

Next, using P _A and k _ex at T ₀ determines k _A and k _B at T ₀

$$ {k_{A}^{0} = k_{A} (T_{0} ) = (1 - P_{A}^{0} )k_{ex}^{0} } $$

$$ {k_{B}^{0} = k_{B} (T_{0} ) = P_{A}^{0} k_{ex}^{0} } $$

and using E _AB and k _A at T ₀, the Arrhenius relation yields P _AB

$$ {P_{AB} = k_{A}^{0} \exp \left( {\frac{{E_{AB} }}{{RT_{0} }}} \right)} $$

which, with E _AB, yields k _A at any temperature via Arrhenius

$$ {k_{A} (T) = P_{AB} \exp \left( {\frac{{ - E_{AB} }}{RT}} \right)} .$$

Next, knowledge of P _A and k _A at any temperature yields k _ex at any temperature

$$ {k_{ex} (T) = \frac{{k_{A} \left( T \right)}}{{1 - P_{A} \left( T \right)}}}$$

and therefore k _B at any temperature$ {k_{B} (T) = k_{ex} (T) - k_{A} (T)} $. Knowledge of k _B at any temperature yields E _BA via the Arrhenius relation and selection of any two temperatures T ₁ and T ₂ (e.g., 280 and 320 K)

$$ {E_{BA} = \frac{{R\ln (k_{B} (T_{1} )/k_{B} (T_{2} ))}}{{1/T_{2} - 1/T_{1} }}} .$$

Finally, using k _B(T ₀) and E _BA, the Arrhenius relation yields P _BA

$$ {P_{BA} = k_{B}^{0} \exp \left( {\frac{{E_{BA} }}{{RT_{0} }}} \right)} .$$

Monte Carlo bootstrapping from residuals (Motulsky and Christopoulos 2003)

Residuals are calculated for each observed ν _CPMG value in a given dispersion Curve as $ {\varepsilon (\nu_{CPMG} ) = R_{2,Eff}^{Obs} (\nu_{CPMG} ) - R_{2,Eff}^{Calc} (\nu_{CPMG} )} $. The set of residuals is used to create a normal distribution $ {\mathbf{N}}(mean(\vec{\varepsilon }),\text{var} (\vec{\varepsilon })) $ for the Curve. Next, a synthetic dispersion Curve is created using the best fit at each observed ν _CPMG value plus a random sample from this normal distribution: $ R_{2,Eff}^{Synth} (\nu_{CPMG} ) = R_{2,Eff}^{Calc} (\nu_{CPMG} ) + {\mathbf{N}}(mean(\vec{\varepsilon }),\text{var} (\vec{\varepsilon })) $. This is repeated for each Curve in the Group such that a synthetic Group is produced. The synthetic Group is fit using initial conditions from the best fit of the actual data. This is repeated 100 times to create 100 synthetic Groups and 100 sets of optimized fit parameters. The error in a given parameter is estimated as the standard deviation of the optimized fit parameter from its 100 element distribution. Errors in subsequent quantities (e.g., k _A, k _B, ln(k _A), etc.) are estimated using propagation of error assuming all parameters are uncorrelated (zero covariance).

Abbreviations

NMR:: Nuclear magnetic resonance
RD:: Relaxation dispersion
QC:: Quantum coherence
SQ:: Single quantum
MQ:: Multiple quantum
ZQ:: Zero quantum
DQ:: Double quantum
s:: Second
ms:: Millisecond (10⁻³ s)
μs:: Microsecond (10⁻⁶ s)
R _ex :: NMR exchange broadening
$ R_{2}^{Eff} $ :: Effective transverse relaxation rate, as measured by RD NMR
Δω :: Chemical shift difference upon exchange from A state to B state (rad/s)
Δν :: Chemical shift difference upon exchange from A state to B state (Hz)
P _A :: Population of A state
k _ex :: Total exchange rate between A and B states
k _A :: Exchange rate from A to B
k _B :: Exchange rate from B to A
ΔH :: Enthalpy change from A to B
E _AB :: Activation energy upon exchange from A to B
6D:: Six-dimensional
MC:: Monte Carlo

References

Beach H, Cole R, Gill ML, Loria JP (2005) Conservation of mus-ms enzyme motions in the apo- and substrate-mimicked state. J Am Chem Soc 127:9167–9176. http://view.ncbi.nlm.nih.gov/pubmed/15969595
Google Scholar
Bevington P, Robinson D (2003) Data reduction and error analysis for the physical sciences. McGraw-Hill, Boston, MA
Google Scholar
Bieri M, Gooley P (2011) Automated NMR relaxation dispersion data analysis using NESSY. BMC Bioinform 12:421
Google Scholar
Boehr DD, Dyson HJ, Wright PE (2006) An NMR perspective on enzyme dynamics. Chem Rev 106:3055–3079. http://view.ncbi.nlm.nih.gov/pubmed/16895318
Google Scholar
Boehr DD, McElheny D, Dyson HJ, Wright PE (2010) Millisecond timescale fluctuations in dihydrofolate reductase are exquisitely sensitive to the bound ligands. Proc Natl Acad Sci USA 107:1373–1378. http://view.ncbi.nlm.nih.gov/pubmed/20080605
Google Scholar
Byrd RH, Gilbert JC, Nocedal J (2000) A trust region method based on interior point techniques for nonlinear programming. Math Program 89:149–185
Article MathSciNet MATH Google Scholar
Choy W, Zhou Z, Bai Y, Kay LE (2005) An 15N NMR spin relaxation dispersion study of the folding of a pair of engineered mutants of apocytochrome b562. J Am Chem Soc 127:5066–5072. http://view.ncbi.nlm.nih.gov/pubmed/15810841
Google Scholar
Grey MJ, Wang C, Palmer AG 3rd (2003) Disulfide bond isomerization in basic pancreatic trypsin inhibitor: multisite chemical exchange quantified by CPMG relaxation dispersion and chemical shift modeling. J Am Chem Soc 125:14324–14335. http://view.ncbi.nlm.nih.gov/pubmed/14624581
Google Scholar
Hansen DF, Vallurupalli P, Kay LE (2008a) An improved 15N relaxation dispersion experiment for the measurement of millisecond time-scale dynamics in proteins. J Phys Chem B 112:5898–5904. http://view.ncbi.nlm.nih.gov/pubmed/18001083
Google Scholar
Hansen DF, Vallurupalli P, Lundström P, Neudecker P, Kay LE (2008b) Probing chemical shifts of invisible states of proteins with relaxation dispersion NMR spectroscopy: how well can we do? J Am Chem Soc 130:2667–2675. http://view.ncbi.nlm.nih.gov/pubmed/18237174
Google Scholar
Henzler-Wildman KA, Thai V, Lei M, Ott M, Wolf-Watz M, Fenn T, Pozharski E, Wilson MA, Petsko GA, Karplus M, Hübner CG, Kern D (2007) Intrinsic motions along an enzymatic reaction trajectory. Nature 450:838–844. http://view.ncbi.nlm.nih.gov/pubmed/18026086
Google Scholar
Ishima R, Torchia DA (1999) Estimating the time scale of chemical exchange of proteins from measurements of transverse relaxation rates in solution. J Biomol NMR 14:369–372. http://view.ncbi.nlm.nih.gov/pubmed/10526408
Google Scholar
Ishima R, Torchia DA (2005) Error estimation and global fitting in transverse-relaxation dispersion experiments to determine chemical-exchange parameters. J Biomol NMR 32:41–54. http://view.ncbi.nlm.nih.gov/pubmed/16041482
Ishima R, Torchia DA (2006) Accuracy of optimized chemical-exchange parameters derived by fitting CPMG R2 dispersion profiles when R2(0a) not = R2(0b). J Biomol NMR 34:209–219. http://view.ncbi.nlm.nih.gov/pubmed/16645811
Jee J, Ishima R, Gronenborn AM (2008) Characterization of specific protein association by 15N CPMG relaxation dispersion NMR: the GB1A34F monomer-dimer equilibrium. J Phys Chem B 112:6008–6012. http://pubs.acs.org/doi/abs/10.1021/jp076094h
Google Scholar
Johnson JE Jr, Hoogstraten CG (2008) Extensive backbone dynamics in the GCAA RNA tetraloop analyzed using 13C NMR spin relaxation and specific isotope labeling. J Am Chem Soc 130:16757–16769. http://view.ncbi.nlm.nih.gov/pubmed/19049467
Google Scholar
Kempf JG, Loria JP (2003) Protein dynamics from solution NMR: theory and applications. Cell Biochem Biophys 37:187–211. http://view.ncbi.nlm.nih.gov/pubmed/12625627
Google Scholar
Kleckner IR, Foster MP (2011) An introduction to NMR-based approaches for measuring protein dynamics. Biochim Biophys Acta 1814:942–968. http://view.ncbi.nlm.nih.gov/pubmed/21059410
Kloiber K, Spitzer R, Tollinger M, Konrat R, Kreutz C (2011) Probing RNA dynamics via longitudinal exchange and CPMG relaxation dispersion NMR spectroscopy using a sensitive 13C-methyl label. Nucleic Acids Res 39:4340–4351. http://view.ncbi.nlm.nih.gov/pubmed/21252295
Korzhnev DM, Kloiber K, Kay LE (2004a) Multiple-quantum relaxation dispersion NMR spectroscopy probing millisecond time-scale dynamics in proteins: theory and application. J Am Chem Soc 126:7320–7329. http://view.ncbi.nlm.nih.gov/pubmed/15186169
Google Scholar
Korzhnev DM, Salvatella X, Vendruscolo M, Di Nardo AA, Davidson AR, Dobson CM, Kay LE (2004b) Low-populated folding intermediates of Fyn SH3 characterized by relaxation dispersion NMR. Nature 430:586–590. http://view.ncbi.nlm.nih.gov/pubmed/15282609
Google Scholar
Korzhnev DM, Neudecker P, Mittermaier A, Orekhov VY, Kay LE (2005) Multiple-site exchange in proteins studied with a suite of six NMR relaxation dispersion experiments: an application to the folding of a Fyn SH3 domain mutant. J Am Chem Soc 127:15602–15611. http://view.ncbi.nlm.nih.gov/pubmed/16262426
Google Scholar
Korzhnev DM, Bezsonova I, Evanics F, Taulier N, Zhou Z, Bai Y, Chalikian TV, Prosser RS, Kay LE (2006) Probing the transition state ensemble of a protein folding reaction by pressure-dependent NMR relaxation dispersion. J Am Chem Soc 128:5262–5269. http://view.ncbi.nlm.nih.gov/pubmed/16608362
Google Scholar
Korzhnev DM, Religa TL, Lundström P, Fersht AR, Kay LE (2007) The folding pathway of an FF domain: characterization of an on-pathway intermediate state under folding conditions by (15)N, (13)C(alpha) and (13)C-methyl relaxation dispersion and (1)H/(2)H-exchange NMR spectroscopy. J Mol Biol 372:497–512. http://view.ncbi.nlm.nih.gov/pubmed/17689561
Google Scholar
Kovrigin E (2011) BiophysicsLab. http://biophysicslab.net. Accessed 30 June 2011
Kovrigin EL, Kempf JG, Grey MJ, Loria JP (2006) Faithful estimation of dynamics parameters from CPMG relaxation dispersion measurements. J Magn Reson (San Diego, Calif.: 1997) 180:93–104. http://view.ncbi.nlm.nih.gov/pubmed/16458551
Loria JP, Berlow RB, Watt ED (2008) Characterization of enzyme motions by solution NMR relaxation dispersion. Acc Chem Res 41:214–221. http://view.ncbi.nlm.nih.gov/pubmed/18281945
Google Scholar
Luz Z, Meiboom S (1963) Nuclear magnetic resonance study of protolysis of trimethylammonium ion in aqueous solution—order of reaction with respect to solvent. J Chem Phys 39:366–370
Article ADS Google Scholar
McConnell HM (1958) Reaction rates by nuclear magnetic resonance. J Chem Phys 28:430–431
Article ADS Google Scholar
Millet O, Loria JP, Kroenke CD, Pons M, Palmer AG (2000) The static magnetic field dependence of chemical exchange linebroadening defines the NMR chemical shift time scale. J Am Chem Soc 122:2867–2877
Article Google Scholar
Mosteller F, Tukey J (1968) Data analysis, including statistics. In: Lindzey G, Aronson E (eds) Handbook of social psychology, vol 2, 2nd edn. Addison-Wesley, Reading, pp 80–203
Google Scholar
Mosteller F, Tukey JW (1977) Data analysis and regression: a second course in statistics. Addison Wesley, Reading
Motulsky HJ, Christopoulos A (2003) Fitting models to biological data using linear and nonlinear regression. A practical guide to curve fitting. Graphpad Software Inc, San Diego
Google Scholar
Mulder FA, Mittermaier A, Hon B, Dahlquist FW, Kay LE (2001) Studying excited states of proteins by NMR spectroscopy. Nat Struct Biol 8:932–935. http://view.ncbi.nlm.nih.gov/pubmed/11685237
Google Scholar
Mulder FAA, Hon B, Mittermaier A, Dahlquist FW, Kay LE (2002) Slow internal dynamics in proteins: application of NMR relaxation dispersion spectroscopy to methyl groups in a cavity mutant of T4 lysozyme. J Am Chem Soc 124:1443–1451. http://view.ncbi.nlm.nih.gov/pubmed/11841314
Google Scholar
Myint W, Ishima R (2009) Chemical exchange effects during refocusing pulses in constant-time CPMG relaxation dispersion experiments. J Biomol NMR 45:207–216. http://view.ncbi.nlm.nih.gov/pubmed/19618276
Google Scholar
Namanja AT, Wang XJ, Xu B, Mercedes-Camacho AY, Wilson BD, Wilson KA, Etzkorn FA, Peng JW (2010) Toward flexibility-activity relationships by NMR spectroscopy: dynamics of Pin1 ligands. J Am Chem Soc 132:5607–5609. http://view.ncbi.nlm.nih.gov/pubmed/20356313
Google Scholar
Neudecker P, Korzhnev DM, Kay LE (2006) Assessment of the effects of increased relaxation dispersion data on the extraction of 3-site exchange parameters characterizing the unfolding of an SH3 domain. J Biomol NMR 34:129–135. http://view.ncbi.nlm.nih.gov/pubmed/16604422
Palmer AG 3rd, Grey MJ, Wang C (2005) Solution NMR spin relaxation methods for characterizing chemical exchange in high-molecular-weight systems. Methods Enzymol 394:430–465. http://view.ncbi.nlm.nih.gov/pubmed/15808232
Google Scholar
Ren Z, Ghose R (2011) Slow conformational dynamics in the cystoviral RNA-directed RNA polymerase P2: influence of substrate nucleotides and template RNA. Biochemistry 50:1875–1884. http://view.ncbi.nlm.nih.gov/pubmed/21244027
Google Scholar
Vallurupalli P, Kay LE (2006) Complementarity of ensemble and single-molecule measures of protein motion: a relaxation dispersion NMR study of an enzyme complex. Proc Natl Acad Sci USA 103:11910–11915. http://view.ncbi.nlm.nih.gov/pubmed/16880391
Winzor DJ, Jackson CM (2006) Interpretation of the temperature dependence of equilibrium and rate constants. J Mol Recognit JMR 19:389–407. http://view.ncbi.nlm.nih.gov/pubmed/16897812
Google Scholar

Download references

Author information

Authors and Affiliations

Biophysics Program, The Ohio State University, 484 West 12th Ave Room 776, Columbus, OH, 43210, USA
Ian R. Kleckner
Biochemistry Department, The Ohio State University, 484 West 12th Ave Room 776, Columbus, OH, 43210, USA
Mark P. Foster

Authors

Ian R. Kleckner
View author publications
You can also search for this author in PubMed Google Scholar
Mark P. Foster
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ian R. Kleckner or Mark P. Foster.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kleckner, I.R., Foster, M.P. GUARDD: user-friendly MATLAB software for rigorous analysis of CPMG RD NMR data. J Biomol NMR 52, 11–22 (2012). https://doi.org/10.1007/s10858-011-9589-y

Download citation

Received: 29 June 2011
Accepted: 25 September 2011
Published: 11 December 2011
Issue Date: January 2012
DOI: https://doi.org/10.1007/s10858-011-9589-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

GUARDD: user-friendly MATLAB software for rigorous analysis of CPMG RD NMR data

Abstract

Similar content being viewed by others

A suite of ¹⁹F based relaxation dispersion experiments to assess biomolecular motions

NMR Relaxation Dispersion Methods for the Structural and Dynamic Analysis of Quickly Interconverting, Low-Populated Conformational Substates

Protein Dynamics Revealed by CPMG Dispersion

Introduction

GUARDD software overview

Data: hierarchical data objects are easily managed via graphical interface

Analysis: sharing fit parameters via two-state exchange at multiple temperatures

Analysis: grid search and Monte Carlo χ² maps enhance and assess fitting accuracy

Optimization and education: RD Simulator explores the nature of RD phenomena

Presentation: massive datasets are easy to visualize for analysis and dissemination

Limitations and comparison to alternative software

Discussion and conclusions

Acknowledgments

Materials and methods

Alternative software for fitting CPMG RD NMR data

Extracting RD data from signal intensity

Modeling MQ relaxation dispersion

Exchange broadening

Scaling factor for chemical exchange timescale

Temperature-dependence of exchange kinetics

Kinetic simulator

Monte Carlo bootstrapping from residuals (Motulsky and Christopoulos 2003)

Abbreviations

References

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Keywords

Navigation

GUARDD: user-friendly MATLAB software for rigorous analysis of CPMG RD NMR data

Abstract

Similar content being viewed by others

A suite of 19F based relaxation dispersion experiments to assess biomolecular motions

NMR Relaxation Dispersion Methods for the Structural and Dynamic Analysis of Quickly Interconverting, Low-Populated Conformational Substates

Protein Dynamics Revealed by CPMG Dispersion

Introduction

GUARDD software overview

Data: hierarchical data objects are easily managed via graphical interface

Analysis: sharing fit parameters via two-state exchange at multiple temperatures

Analysis: grid search and Monte Carlo χ2 maps enhance and assess fitting accuracy

Optimization and education: RD Simulator explores the nature of RD phenomena

Presentation: massive datasets are easy to visualize for analysis and dissemination

Limitations and comparison to alternative software

Discussion and conclusions

Acknowledgments

Materials and methods

Alternative software for fitting CPMG RD NMR data

Extracting RD data from signal intensity

Modeling MQ relaxation dispersion

Exchange broadening

Scaling factor for chemical exchange timescale

Temperature-dependence of exchange kinetics

Kinetic simulator

Monte Carlo bootstrapping from residuals (Motulsky and Christopoulos 2003)

Abbreviations

References

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

A suite of ¹⁹F based relaxation dispersion experiments to assess biomolecular motions

Analysis: grid search and Monte Carlo χ² maps enhance and assess fitting accuracy