A Computational Model for Protein Ionization by Electrospray Based on Gas-Phase Basicity

Marchese, Roberto; Grandori, Rita; Carloni, Paolo; Raugei, Simone

doi:10.1007/s13361-012-0449-0

A Computational Model for Protein Ionization by Electrospray Based on Gas-Phase Basicity

Research Article
Published: 20 September 2012

Volume 23, pages 1903–1910, (2012)
Cite this article

Download PDF

Journal of The American Society for Mass Spectrometry

A Computational Model for Protein Ionization by Electrospray Based on Gas-Phase Basicity

Download PDF

Roberto Marchese¹,
Rita Grandori²,
Paolo Carloni³ &
…
Simone Raugei^3,4

1787 Accesses
40 Citations
Explore all metrics

Abstract

Identifying the key factor(s) governing the overall protein charge is crucial for the interpretation of electrospray-ionization mass spectrometry data. Current hypotheses invoke different principles for folded and unfolded proteins. Here, first we investigate the gas-phase structure and energetics of several proteins of variable size and different folds. The conformer and protomer space of these proteins ions is explored exhaustively by hybrid Monte-Carlo/molecular dynamics calculations, allowing for zwitterionic states. From these calculations, the apparent gas-phase basicity of desolvated protein ions turns out to be the unifying trait dictating protein ionization by electrospray. Next, we develop a simple, general, adjustable-parameter-free model for the potential energy function of proteins. The model is capable to predict with remarkable accuracy the experimental charge of folded proteins and its well-known correlation with the square root of protein mass.

Charging of Proteins in Native Mass Spectrometry

Article 12 October 2016

Probabilistic calculations of biomolecule charge states that generate mass spectra of multiply charged ions

Article 01 September 2015

Decomposition of multidimensional charge state distributions of ions produced by electrospray ionization of bioorganic compounds. Part 1: Basic formalism and implementation of the method

Article 12 December 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The interpretation of the data from electrospray-ionization mass spectrometry (ESI-MS) greatly benefits from uncovering the role of factors controlling the degree of protein ionization, which lead to the observed charge-state distributions (CSDs) [1, 2]. The current view of protein electrospray invokes different mechanisms to explain the ionization behavior of unfolded proteins, and folded globular proteins under non-denaturing conditions (the former may be either proteins under denaturing conditions or intrinsically disordered proteins). The degree of ionization of unfolded proteins is considered to be controlled by the apparent gas-phase basicity of protein ions (GB_app) relative to that of the solvent [3–5]. GB_app measures the propensity of the ionizable groups of desolvated protein ions to acquire a proton. It approaches the GB of the solvent for unfolded proteins in their highest observable charge state [4].

For folded proteins, the interpretation is less straightforward. The extent of protein ionization has been interpreted in terms of GB also in this case [6–9]. In particular, the GB_app of folded cytochrome c has been calculated from the crystallographic structure, accounting for Coulomb repulsions [9]. The derived value for the 11+ ion was little below the value for water, suggesting that the GB model could be extended to folded proteins. However, the most accredited hypothesis considers the charge of the precursor ESI droplet as the key factor determining the extent of protein ionization [10, 11]. In turn, the droplet charge is assumed to be close to the Rayleigh limit. In support to this hypothesis, plots of average protein charge as a function of protein mass can be fitted well by the Rayleigh equation using the surface tension of water and a droplet radius equal to that of the globular protein structure [10]. This hypothesis would imply that the charge of the protein depends on solvent surface tension following the Rayleigh equation [12]. Such a dependence could not be reproduced by experiments on either folded or unfolded proteins [1, 2, 12–15], although solvent surface tension certainly plays an important role in the ESI process [1, 2], and has also been suggested to explain some effects of supercharging agents [11, 16].

Here, we use computational methods to investigate the relevance of GB_app for folded proteins under electrospray conditions, with no assumption on a role of the charge of the precursor ESI droplet. Our test systems are nine structurally diverse and well characterized proteins, spanning a range of molecular weight from 4.0 to 29.2 kDa and a wide range isoelectric point (i.e., from basic to acidic). We first calculate GB_app by introducing a Monte-Carlo/molecular-dynamics (MC/MD) scheme, which takes into account the ionization of basic (Arg, Lys, His, and N-terminus) and acidic (Asp, Glu, and C-terminus) groups. This procedure explicitly considers the influence of protein structure on the intrinsic gas phase basicity of each ionizable group, allowing for a combined exploration of the conformer and protomer space. This method leads to the identification of a set of lowest-energy protomers for each value of protein net charge. In most cases, self-solvation networks lead to maintenance of zwitterionic states. Next, we propose a simple mathematical model based on the results of these atomistic investigations. This model has no adjustable parameters and it reproduces the well-known correlations between the protein net charge, q, and both mass [10] and solvent-exposed surface [17]. Our model hints to GB_app as a key factor for the ionization of folded proteins, suggesting that protein ionization depends on intrinsic properties of the protein structure and on the GB of the solvent [4].

2 Computational Details

2.1 Proteins

Nine globular proteins of diverse size and fold were considered (Table 1): ragweed pollen allergen from Ambrosia trifida (40 residues, pdb entry 1BBG), bovine pancreatic trypsin inhibitor (56 residues, pdb entry 1UUA), C-terminal domain of the ribosomal protein L7/L12 from E. coli (68 residues, pdb entry 1CTF), α-amylase inhibitor tendamistat (74 residues, pdb entry 1OK0), human ubiquitin (76 residues, pdb entry: 1V80), Ribonuclease SA (96 residues, pdb entry 1C54), E60Q mutant of human FK506 binding protein-12 (107 residues, pdb entry 2PPP), hen-egg white lysozyme (129 residues, pdb entry 1LZT), human carbonic anhydrase II (260 residues, pdb entry 2CBA). The 1LZT, 1BBG, 1UUA, and 1OK0 proteins feature 4, 4, 3, and 2 disulphide bridges, respectively. Breaking of these bonds was not considered.

Table 1 Proteins Studied in this Work: Ragweed Pollen Allergen (1BBG), Bovine Pancreatic Trypsin Inhibitor (1UUA), C-Terminal Domain of the Ribosomal Protein L7/L12 from E. coli (1CTF), α-Amylase Inhibitor Tendamistat (1OK0), Human Ubiquitin (1V80), Ribonuclease SA (1C54) E60Q Mutant of Human FK506 Binding Protein-12 (2PPP), Hen-Egg White Lysozyme (1LZT), and Human Carbonic Anhydrase II (2CBA). For Each Protein, the Table Lists the Number of Residues, the Mass (kDa), the Fold, the Number of Basic (Arg, Lys, His, and N-terminus) and Acidic Residues (Asp, Glu, and C-terminus) Considered in the Present Work and the Main Charge State Observed Experimentally for Ions Originated from Water along with the Charge Predicted in the Present Study

Full size table

Available evidence suggests that, under mild ESI conditions, protons are mostly exchanged among few sites (i.e., mainly Arg, Lys, His, Glu, and Asp side chain and the N- and C-termini [18]. Thus, in order to keep the sampling problem tractable, only protonation and deprotonation of these residues was considered.

2.2 Protomer Space Exploration

Predicting the distribution of protonated sites within a protein is not trivial, since the number of possible protomers can be prohibitively large to be explored exhaustively by any theoretical approach, even for a small protein. To cope with this problem, several Monte-Carlo (MC) protocols have been proposed [19] (and references therein). These schemes suppose that the protein structure does not change with the protonation state and they usually assume that the (average) protein structure in aqueous solutions or the crystallographic structure is a good approximation of the gas-phase structure. This may be generally true for the protein backbone. However, it might not necessarily hold for side chains. To tackle this issue, the present study employs a combined MC/MD sampling scheme using the OPLS/AA force field with GB corrections. Indeed, standard force fields for biomolecular simulations are unable to describe bond breaking and forming. This poses a serious limitation to the exploration of the protomer space using molecular-mechanics schemes. Here we propose to augment the standard force field energies, E _FF, with additional energy terms associated to the GB of ionizable residues, introducing the following corrected potential function:

$$ {{E}_{\text{corr}}} = - \sum\limits_i^{\prime } \mathop{{\left( {\text{GB}} \right)}}\nolimits_i {{\delta }_i} + {{E}_{\text{FF}}} $$

where the summation runs over all of the ionizable residues and δ _i is 1 if the i-th residue is ionized and 0 otherwise.

We chose the OPLS/AA force field [20], because it offers the most complete set of base/conjugate acid pairs. The adopted correction was validated against density functional theory (DFT) calculations on the small ragweed pollen allergen protein. DFT calculations were performed using the Becke exchange [21] and Lee-Yang-Parr [22] correlation functionals (BLYP) within a hybrid Gaussian. A plane wave approach was adopted [23], along with norm-conserving pseudopotentials [24], to describe the core electrons. The TZV2P Gaussian basis set was used for valence electrons of all atoms, while the auxiliary electron density was expanded in plane waves up to a cutoff of 280 Ry. Interaction between periodic images in the reciprocal space was removed according to the decoupling scheme presented in Ref. [25]. Dispersion energy was included according to Ref. [26]. We will refer to the dispersion-corrected DFT energy as DFT + D. As previously described [27], the adopted DFT scheme has been validated against more accurate quantum-chemical calculations (DFT/B3LYP and MP2). The wave function has been optimized according to Ref. [23]. The calculations were carried out with the CP2K code [23].

The comparison between DFT + D and corrected force field calculations was performed over 35 selected protomers (over a total of about 460 possible protomers) of the ragweed pollen allergen protein at q = 1+. For each protomer, conformational sampling was carried out according to the previously described protocol [27]. Several sets of GB values taken from the literature [9, 28, 29] were tested. The best agreement between DFT + D and corrected force field was obtained for the GBs calculated previously [27] for amino acids in an extended conformation, where the ionized groups do not make any short-range interaction such as hydrogen bonds or salt bridges. Indeed, this type of interaction is reasonably well described by the force field and there is no need to include it in the calculation of the amino-acid reference GB.

DFT energies do not correlate with non-corrected force-field energies (Figure SI-1A). The addition of the correction introduces a marked linear correlation between the two different energy evaluations (R ² = 0.93, Figure SI-1B). The data dispersion indicates that the corrected force field allows one to discriminate between high and low energy protomers but not to identify small energy differences and, thus, to identify the single lowest-energy protomer. The standard error of the estimate using the linear correlation of Figure SI-1B is σ = 35 kJ/mol. If we assume a normal distrubution of the DFT + D energies around the estimate obtained from the corrected force field, there is a confidence of 68.3 % and 99.7 % that the DFT + D energy is within σ and 3σ (about 100 kJ/mol) from the estimate, respectively. Indeed, all of the conformers located within 10 kJ/mol from the DFT + D minimum fall within 100 kJ/mol above the OPLS/AA. Similar discrepancies are found using the Amber [30] and GROMOS [31] force fields (data not shown). Hence, we carry out a statistical analysis of the protomer properties within a given energy cut off, which yields a high confidence to include the minimum-energy protomer. We will refer to these protomers as the most probable protomers. The discussion presented in this work is based on a 100 kJ/mol cut off. Only few structures are within this cut off (about 10 out of thousands). Different choices of the cut-off energy (from σ to 3σ) give comparable results (data not shown).

Using the OPLS/AA force field with GB corrections, protonation sites were randomly permuted and the total energy was calculated. At each MC step, a proton exchange is accepted or rejected according to the Metropolis criterion. The structure of each considered protomer was relaxed with the following simulated annealing-like procedure. First, a 400-ps, high-temperature (400 K) MD simulation was performed. This temperature was selected after several careful tests and it allows for an exhaustive sampling of side-chain conformations without disrupting, in the relatively short simulation time, the protein secondary structure. The resulting trajectory was split in 60 equally spaced time windows. In each of these windows, the geometry of the lowest-energy conformation was optimized. The optimized structure was then employed in the MC procedure. For each value of net charge, about 10³ lowest-energy configurations were sampled. This procedure converges in a relatively small number of MC steps. Indeed, MC searches starting from different protomers substantially yielded the same final charge configurations, differing at the most in the position of one or two protons.

The initial structure for the MC procedure was extracted from a 2-ns MD simulation at ambient conditions of the protein in aqueous solution (with counter-ions added). These preparatory simulations were long enough to stabilize the protein dynamics, as deduced from the root mean square displacement (RMSD) of the backbone heavy-atoms. In all cases, the structure closest to the average one was taken. The initial charge configuration for the MC process, instead, was randomly generated allowing for positively charged basic residues and negatively charged acidic residues.

The time evolution of any lowest-energy protomer obtained from the MC/MD protocol was followed at 300 K for 20 ns (Table 2) and, in the case of the q = 8+ ubiquitin ion, for 1 μs (Figure SI-9). All the calculations were carried out using the GROMACS [32] MD package. A time step of 1.5 fs for the integration of equations of motion was used in all of the simulations.

2.3 Calculation of the Apparent Gas-Phase Basicity

The GB_app is an extension of the concept of GB [4], and quantifies the ability of a protein, in a given conformation and charge state, to increase its charge state through the addition of a proton. The GB_app of a protein corresponds to the GB of the residue (embedded in the protein environment) with the highest gas-phase basicity. The GB_app,i of the i-th residue in a protein with total charge q is defined as [4]

$$ \mathop{\text{GB}}\nolimits_{{{\text{app}},i}} = \mathop{\text{GB}}\nolimits_i - \left( {E_{\text{FF}}^{{\left( {i,q} \right)}} - E_{\text{FF}}^{{\left( {i,q - 1} \right)}}} \right), $$

where GB_i is the GB of the i-th amino acid in the gas phase and $ E_{\text{FF}}^{{\left( {i,q} \right)}} $ (or $ E_{\text{FF}}^{{\left( {i,q - 1} \right)}} $) is the energy of the protein with that residue protonated (or non protonated). In contrast to the original formulation, developed for a coarse-grained representation of an unfolded protein [4], we include in the GB_app calculation all of the classical energy terms considered by a force field. No vibrational correction was taken into account. The justification for this choice has been discussed previously in the literature [27, 33].

We stress that, in the present study, we do not report the GB_app of the lowest-energy protomer of a protein in a given charge state, which might be ill-defined as discussed in the previous section. Rather, we extrapolate trends of the average GB_app for a large set of proteins and charge states, and try to relate these trends to the experimentally observed CSDs and the GB of the solvent from which the ions have been originated.

3 Results and Discussion

3.1 Protein Structure in Vacuo

We have analyzed the conformational and protomer space of nine proteins featuring different size, fold, and pI (Table 1). Significant conformational rearrangements take place upon desolvation. However, the most probable protomers identified by the MC/MD procedure conserve their secondary and tertiary structure upon passing from the aqueous solution to the gas phase at room temperature (Table 2 and Figure SI-2). The gyration radius (R _g), calculated over the nanosecond time-scale, decreases in a similar way for all of the charge states considered here. This contraction involves the solvent-exposed side chains, which fold onto the protein surface and, to a lesser extent, also the backbone. These rearrangements lead to the formation of new intramolecular hydrogen bonds (HB_pp). The total surface area (A _tot) also decreases, whereas the hydrophobic portion of the total surface area (A _phob) increases, as already reported [34–36]. The proteins turn out to be more rigid in the gas phase, as indicated by the RMSD of backbone atoms around their average positions. The most dramatic change is observed for the small pollen allergen, which has relatively flexible parts, pointing to the solvent in the native structure. Moreover, the protein structural rearrangements on passing from solution to gas phase lead to the formation of salt-bridges, as also reported in Ref. [34]. Further structural features (Table 2) are in close agreement to those found in previous simulation studies [34–36] and, hence, their detailed description is omitted.

3.2 Gas-phase Basicity and Protein Ionization

The number of ionized residues, n _IR, and the number of hydrogen bonds formed by ionized residues, n _iHB, per unit of protein surface is constant among our protein data set (Figure SI-3), with about 0.16 ionized residues and 0.43 hydrogen bonds per nm², for the most probable protomers of the proteins considered here, in their predicted most populated charge state: n _IR = 0.148S + 1.459 (R ² = 0.976) and n _iHB = 0.434S − 1.438 (R ² = 0.989), with S in nm². Both positive and negative ionized residues tend to form the maximum number of hydrogen bonds, compatible with the geometry of the gas-phase structures (Table 3). In general, protonated amino groups donate three hydrogen bonds (one for each N–H bond) and carboxylates receive four hydrogen bonds. The average number of hydrogen bonds per residue type is roughly constant across the proteins investigated. The differences are within the standard deviation (Table 3).

Table 3 Average Number of Hydrogen Bonds Formed by each Type of Ionized Residue in the Most Probable Protomers (Protomers Under 100 kJ/mol from the Energy Minimum) for the Proteins of Table 1. The Total Number of Ionized Hydrogen Bonds is Reported in the Last Column. The Standard Deviation from the Average is Given in Brackets

Full size table

Zwitterionic states are mostly retained, especially for low charge states, as can be seen by plotting the number of ionized residues as a function of the protein total charge (Figure 1 and Figure SI-6). Indeed, intramolecular hydrogen bonds, salt-bridges, π-charge interactions, and long-range electrostatic interactions can compensate for the thermodynamic penalty of charge separation in vacuo, providing internal solvation [27]. This finding is consistent with recent experimental evidence [6, 37, 38]. It also supports the hypothesis that a higher propensity for zwitterionic states of folded versus unfolded proteins can lower the net charge of the former, contributing to conformational effects in ESI-MS [39]. In contrast, there is no clear dependence of the number of salt bridges on the protein size (Figure SI-4). This result suggests that the formation of salt bridges depends on peculiar features of the protein structure.

The GB_app values decreases linearly as the protein net charge increases (Table SI-1, R ² ≥ 0.99 for all of the nine proteins) [4, 9, 40]. As expected, the slope of the line depends on the specific protein. Most notably, the intersection of the GB_app fitted line with the line of solvent GB corresponds with remarkable agreement to the experimental main (most abundant) charge state under mild ESI conditions [7, 10] (Figure 2 and Figure SI-7). Instead, GB_app turns out to be systematically underestimated when the calculation is performed constraining the non-hydrogen atoms in the position of the NMR or X-ray structures (Figure 2). By reproducing the experimental charge for proteins of different size, our calculations are also consistent with the well-known charge-to-mass empirical relation $ q \propto \sqrt {M} $ [10, 41].

It has been previously shown that such an empiric charge-to-mass relation reflects a linear log-log charge-to-surface relation, which has been proven to hold both for folded [41, 42] and unfolded [43] proteins. The relation holds when comparing the surface of hydrated proteins or protein complexes with different surface-to-mass relation [41]. Our results also support a linear log–log correlation between charge and desolvated protein surface. Indeed, a linear correlation seems to be respected when the area calculated from the gas-phase structures is employed (Figure SI-5). However, more datapoints will be needed to explore larger molecular weights.

Table 2 Average Structural Properties at 300 K in Water and in Vacuo for the Proteins of Table 1. From Left to Right: Radius of Gyration in Water (R _{g, wat} in Å); Radius of Gyration of the Backbone in Water (R _{g, bb wat} in Å) Radius of Gyration in Gas-Phase (R _{g, gp} in Å); Radius of Gyration of the Backbone in Gas-Phase (R _{g, bb gp} in Å); Protein–Protein Hydrogen Bonds in Water (HB_wat); Protein–Protein Hydrogen Bonds in Gas-Phase (HB_gp); Total Surface Area in Water (S _{tot, wat} in Å²); Total Surface Area in Gas-Phase (S _{tot, gp} in Å²; Hydrophobic Surface in Water (S _{pho, wat} in Å²); Hydrophobic Surface in Gas-Phase (S _{pho, gp} in Å²); Root Mean Square Displacement of the Backbone in Water (RMSD_wat in Å); Root Mean Square Displacement of the Backbone in Gas-Phase (RMSD_gp in Å). The Proteins in Vacuo are in the Charge State Predicted in the Present Study, whereas the Proteins in Water have all of the Acidic and Basic Residues Ionized but Histidines, which are Assumed Neutral. Standard Deviations are Reported in Parenthesis

Full size table

3.3 A Simple Model for Proteins in the Gas Phase

We now extend the correlation between charge and mass to any folded protein, by introducing a simple and general model energy function, without adjustable parameters. This model is based on a plausible assumption: the experimentally observed charge-to-mass relation can be interpreted as a linear combination of energetic contributions due to electrostatic repulsion and internal solvation. The latter is considered to be proportional to the protein surface and it is based on the above observation that the number of ionized residues and the number of hydrogen bonds they present per surface unit is constant.

We start by modeling a protein in the gas phase as a sphere of radius R and density ρ, with a net charge q uniformly distributed on its surface. The electrostatic energy of the protein can be expressed as:

$$ U(q) = \frac{1}{2}\frac{1}{{4{{\epsilon }_0}\pi }}\frac{{{{q}^2}}}{R} - 4\pi \xi {{R}^2}. $$

(1)

The quadratic dependence of the electrostatic energy on the total protein charge (first term) accounts for the linear change in GB_app described above (see also Figure SI-8). The second term takes into account the stabilization by intermolecular interactions. It is proportional to the surface area 4πR ² via a parameter to be determined by fitting our computational results (ξ ≈ 0.994 N/m with ρ from Ref. [52]). The protein is unstable when U(q) ≥ 0. Hence, the maximum charge attainable is

$$ q = 4\pi \sqrt {{2{{\epsilon }_0}\xi {{R}^3}}} $$

(2)

or

$$ q = 2\sqrt {{\frac{{6\pi {{\epsilon }_0}\xi }}{\rho }}} \sqrt {M}, $$

(3)

being M = 4/3πρR ³.

A numerical model based on a charged ellipsoid shows that, also in such a case, $ q \propto \sqrt {M} $ (see Supplementary Information). Thus, the $ q \propto \sqrt {M} $ relation is rather independent on the specific shape being used. More generally, it is easy to see from a simple dimensional analysis that any stabilizing contribution (−ξS) that depends on the surface area (S) yields a $ \sqrt {M} $ dependence of the maximum charge possible. Indeed, the electrostatic potential is dimensionally proportional to charge² × length^-1, whereas the stabilizing contribution is proportional to length². Thus, the square of maximum charge is proportional to a volume, and consequently the charge must be proportional to the square root of the mass.

Substituting our fitted ξ value in Equation (3) yields, with remarkable accuracy, not only the main charge state displayed by the proteins considered in this study, but also the experimental q versus M curve for a large variety of other proteins (Figure 3). Notice that we compare our predicted maximum charges q with the reported experimental main charge. This approximation is justified by the fact that experimental main and maximum charges are very similar for folded proteins [10] and they are related to each other.

Our simple model Equation (3) yields formally a result similar to that obtained with the Rayleigh-charge hypothesis. Both predict a $ \sqrt {M} $ dependence of the protein charge. However, in contrast to the Rayleigh-charge hypothesis, the present model does not involve adjustable parameters and does not introduce any dependence on the solvent surface tension [12]. The only parameter entering in Equation (3) is the stabilization term due to internal solvation and it is obtained from the quantities calculated from the set of proteins considered here. This model eliminates the explicit dependence on surface tension that derives from the application of the Rayleigh equation to the prediction of protein ionization by electrospray. At the same time, the simplifying assumptions of the model do not allow to capture the indirect role that the solvent surface tension might play on protein ionization via its effect on the electrospray mechanism. Surface tension is the limiting factor for droplet charging during electrospray, as indicated by the Rayleigh equation, and conditions might be found in which it becomes the limiting factor also for protein ionization [1, 2].

It should also be underscored that the results of this study do not help discriminating between the ion-evaporation model and the charged residue model, concerning the mechanism of production of gas-phase ions during ESI [1, 2]. The present model is not meant to test those hypotheses and is compatible with both mechanisms.

4 Conclusions

The present computational study establishes the relevance of GB_app for folded proteins under electrospray conditions. The calculations do not assume a role for the charge of the precursor ESI droplet. Our proposed model reproduces the well-known relationship between observed charge and protein mass based only on intrinsic protein features and solvent nature. Hence, the liquid medium of the precursor droplet provides or accepts protons according to its GB relative to the GB_app of the protein. Along with previous studies [2, 4], these results support a model in which the same principle (i.e., the GB_app of the protein relative to the GB of the solvent) is applied to folded and unfolded proteins, in order to explain the experimentally observed charge values.

References

Verkerk, U.H., Kebarle, P.: Ion–ion and ion–molecule reactions at the surface of proteins produced by nanospray. Information on the number of acidic residues and control of the number of ionized acidic and basic residues. J. Am. Soc. Mass Spectrom. 16, 1325–1341 (2005)
Article CAS Google Scholar
Kebarle, P., Verkerk, U.H.: Electrospray: From ions in solution to ions in the gas phase, what we know now. Mass Spectrom. Rev. 28, 898–917 (2009)
Article CAS Google Scholar
Konermann, L.: A minimalist model for exploring conformational effects on the electrospray charge state distribution of proteins. J. Phys. Chem. B 111, 6534–6543 (2007)
Article CAS Google Scholar
Schnier, P.D., Gross, D.S., Williams, E.R.: On the maximum charge-state and proton-transfer reactivity of peptide and protein ions formed by electrospray-ionization. J. Am. Soc. Mass Spectrom. 6, 1086–1097 (1995)
Article CAS Google Scholar
Gronert, S.: Coulomb repulsion in multiply charged ions: a computational study of the effective dielectric constants of organic spacer groups. Int. J. Mass Spectrom. 187, 351–357 (1999)
Article Google Scholar
Peschke, M., Verkerk, U.H., Kebarle, P.: Prediction of the charge states of folded proteins in electrospray ionization. Eur. J. Mass Spectrom. 10, 993–1002 (2004)
Article CAS Google Scholar
Isabel Catalina, M., van den Heuvel, R.H.H., van Duijn, E., Heck, A.J.R.: Decharging of globular proteins and protein complexes in electrospray. Chem.-Eur. J. 11, 960–968 (2005)
Article Google Scholar
Touboul, D., Jecklin, M.C., Zenobi, R.: Investigation of deprotonation reactions on globular and denatured proteins at atmospheric pressure by ESSI-MS. J. Am. Soc. Mass Spectrom. 19, 455–466 (2008)
Article CAS Google Scholar
Schnier, P.D., Gross, D.S., Williams, E.R.: Electrostatic forces and dielectric polarizability of multiply protonated gas-phase cytochrome c ions probed by ion/molecule chemistry. J. Am. Chem. Soc. 117, 6747–6757 (1995)
Article CAS Google Scholar
Fernandez de la Mora, J.: Electrospray ionization of large multiply charged species proceeds via dole’s charged residue mechanism. Anal. Chim. Acta 406, 93–104 (2000)
Article CAS Google Scholar
Iavarone, A.T., Williams, E.R.: Mechanism of charging and supercharging molecules in electrospray ionization. J. Am. Chem. Soc. 125, 2319–2327 (2003)
Article CAS Google Scholar
Samalikova, M., Grandori, R.: Testing the role of solvent surface tension in protein ionization by electrospray. J. Mass Spectrom. 40, 503–510 (2005)
Article CAS Google Scholar
Samalikova, M., Grandori, R.: Protein charge-state distributions in electrospray-ionization mass spectrometry do not appear to be limited by the surface tension of the solvent. J. Am. Chem. Soc. 125, 13352–13353 (2003)
Article CAS Google Scholar
Lomeli, S.H., Yin, S., Ogorzalek Loo, R.R., Loo, J.A.: Increasing charge while preserving noncovalent protein complexes for esi-ms. J. Am. Soc. Mass Spectrom. 20, 593–596 (2009)
Article CAS Google Scholar
Lomeli, S.H., Peng, I.X., Yin, S., Ogorzalek Loo, R.R., Loo, J.A.: New reagents for increasing ESI multiple charging of proteins and protein complexes. J. Am. Soc. Mass Spectrom. 21, 127–131 (2010)
Article CAS Google Scholar
Sterling, H.J., Cassou, C.A., Trnka, M.J., Burlingame, A.L., Krantz, B.A., Williams, E.R.: The role of conformational flexibility on protein supercharging in native electrospray ionization. Phys. Chem., Chem. Phys. 13, 18288–18296 (2011)
Article CAS Google Scholar
Prakash, H., Mazumdar, S.: Direct correlation of the crystal structure of proteins with the maximum positive and negative charge states of gaseous protein ions produced by electrospray ionization. J. Am. Soc. Mass Spectrom. 16, 1409–1421 (2005)
Article CAS Google Scholar
Jarrold, M.F.: Peptides and proteins in the vapor phase. Annu. Rev. Phys. Chem. 51, 179–207 (2000)
Article CAS Google Scholar
Patriksson, A., Adams, C.M., Kjeldsen, F., Zubarev, R.A., van der Spoel, D.: A direct comparison of protein structure in the gas and solution phase: The trp-cage. J. Phys. Chem. B 111, 13147–13150 (2007)
Article CAS Google Scholar
Jorgensen, W.L., Tirado-Rives, J.: Potential energy functions for atomic-level simulations of water and organic and biomolecular systems. Proc. Natl. Acad. Sci. U.S.A. 102, 6665–6670 (2005)
Article CAS Google Scholar
Becke, A.D.: Density-functional exchange-energy approximation with correct asymptotic-behavior. Phys. Rev. A 38, 3098–3100 (1988)
Article CAS Google Scholar
Lee, C.T., Yang, W.T., Parr, R.G.: Development of the Colle-Salvetti correlation-energy formula into a functional of the electron-density. Phys. Rev. B. 37, 785–789 (1988)
Article CAS Google Scholar
VandeVondele, J., Krack, M., Mohamed, F., Parrinello, M., Chassaing, T., Hutter, J.: Quickstep: Fast and accurate density functional calculations using a mixed Gaussian and plane waves approach. Comp. Phys. Comun. 167, 103–128 (2005)
Article CAS Google Scholar
Hartwigsen, C., Goedecker, S., Hutter, J.: Relativistic separable dual-space Gaussian pseudopotentials from h to rn. Phys. Rev. B 58, 3641–3662 (1998)
Article CAS Google Scholar
Martyna, G.J., Tuckerman, M.E.: A reciprocal space based method for treating long range interactions in ab initio and force-field-based calculations in clusters. J. Chem. Phys. 110, 2810–2821 (1999)
Article CAS Google Scholar
Grimme, S.: Semiempirical gga-type density functional constructed with a long-range dispersion correction. J. Comput. Chem. 27, 1787–1799 (2006)
Article CAS Google Scholar
Marchese, R., Grandori, R., Carloni, P., Raugei, S.: On the zwitterionic nature of gas-phase peptides and protein ions. PLOS Comput. Biol. 6, 1–11 (2010)
Article Google Scholar
Li, Z., Matus, M.H., Velazquez, H.A.: D. A Dixon, C. J. Cassady. Gas-phase acidities of aspartic acid, glutamic acid, and their amino acid amides. Int. J. Mass Spectrom. 265, 213–223 (2007)
Article CAS Google Scholar
Harrison, A.G.: The gas-phase basicities and proton affinities of amino acids and peptides. Mass Spectrom. Rev. 16, 201–217 (1997)
Article CAS Google Scholar
Case, D.A., Cheatham, T., Darden, T., Gohlke, H., Luo, L., Merz, K.M., Onufriev, A., Simmerling, C., Wang, B., Woods, R.: J. Comp. Chem. 26, 1668–1688 (2005)
Article CAS Google Scholar
Oostenbrink, C., Villa, A., Mark, A.E., van Gunsteren, W.F.: A biomolecular force field based on the free enthalpy of hydration and solvation: The GROMOS force-field parameter sets 53a5 and 53a6. J. Comput. Chem. 25, 1656–1676 (2004)
Article CAS Google Scholar
Lindahl, E., Hess, B., van der Spoel, D.: Gromacs 3.0: a package for molecular simulation and trajectory analysis. J. Mol. Model. 7, 306–317 (2001)
CAS Google Scholar
Berka, K., Laskowski, R., Riley, K.E., Hobza, P., Vondrekásek, J.: Representative amino acid side chain interactions in proteins. a comparison of highly accurate correlated ab initio quantum chemical and empirical potential procedures. J. Chem. Theory Comput. 5, 982–992 (2009)
Article CAS Google Scholar
Patriksson, A., Marklund, E., van der Spoel, D.: Protein structures under electrospray conditions. Biochemistry 46, 933–945 (2007)
Article CAS Google Scholar
Meyer, T., de la Cruz, X., Orozco, M.: An atomistic view to the gas phase proteome. Structure 17, 88–95 (2009)
Article CAS Google Scholar
D’Abramo, M., Meyer, T., Bernado, P., Pons, C., Fernandez-Recio, J., Orozco, M.: On the use of low-resolution data to improve structure prediction of proteins and protein complexes. J. Chem. Theory Comp. 5, 3129–3137 (2009)
Article Google Scholar
Heck, A.J.R., van den Heuvel, R.H.H.: Investigation of intact protein complexes by mass spectrometry. Mass Spectrom. Rev. 23, 368–389 (2004)
Article CAS Google Scholar
Krusemark, C.J., Frey, B.L., Belshaw, P.J., Smith, L.M.: Modifying the charge state distribution of proteins in electrospray ionization mass spectrometry by chemical derivatization. J. Am. Soc. Mass Spectrom. 20, 1617–1625 (2009)
Article CAS Google Scholar
Grandori, R.: Origin of the conformation dependence of protein charge-state distributions in electrospray ionization mass spectrometry. J. Mass Spectrom. 38, 11–15 (2003)
Article CAS Google Scholar
Gross, D.S., Schnier, P.D., Rodriguez-Cruz, S.E., Fagerquist, C.K., Williams, E.R.: Conformations and folding of lysozyme ions in vacuo. Proc. Natl. Acad. Sci. USA 93, 3143–3148 (1996)
Article CAS Google Scholar
Kaltashov, I.A., Mohimen, A.: Estimates of protein surface areas in solution by electrospray ionization mass spectrometry. Anal. Chem. 77, 5370–5379 (2005)
Article CAS Google Scholar
Halgand, F., Leprévote, O.: Mean charge state and charge state distribution of proteins as structural probes. an electrospray ionization mass spectrometry study of lysozyme and ribonuclease A. Eur. J. Mass Spectrom. 7, 433–439 (2001)
Article CAS Google Scholar
Testa, L., Brocca, S., Grandori, R.: Charge-surface correlation in electrospray ionization of folded and unfolded proteins. Anal. Chem. 83, 6459–6463 (2011)
Article CAS Google Scholar
Fenaille, F., Nony, E., Chabre, H., Lautrette, A., Couret, M.-N., Batard, T., Moingeon, P., Ezan, E.: Mass spectrometric investigation of molecular variability of grass pollen group 1 allergens. J. Proteome Res. 8, 4014–4027 (2009)
Article CAS Google Scholar
Konermann, L., Douglas, D.J.: Equilibrium unfolding of proteins monitored by electrospray ionization mass spectrometry: Distinguishing two-state from multi-state transitions. Rapid Commun. Mass Spectrom. 12, 435–442 (1998)
Article CAS Google Scholar
Benjamin, D.R., Robinson, V.C., Hendrick, J.P., Hartl, F.U., Dobson, C.M.: Mass spectrometry of ribosomes and ribosomal subunits. Proc. Natl. Acc. Sci. U.S.A. 95, 7391–7395 (1998)
Article CAS Google Scholar
Douglas, D.J., Collings, B.A., Numao, S., Nesatyy, V.J.: Detection of noncovalent complex between a-amylase and its microbial inhibitor tendamistat by electrospray ionization mass spectrometry. Rapid Commun. Mass Spectrom. 15, 89–96 (2001)
Article CAS Google Scholar
Hoerner, J.K., Xiao, H., Kaltashov, I.A.: Structural and dynamic characteristics of a partially folded state of ubiquitin revealed by hydrogen exchange mass spectrometry. Biochemistry 44, 11286–11294 (2005)
Article CAS Google Scholar
Smith, R.D.: Evolution of ESI–mass spectrometry and Fourier transform ion cyclotron resonance for proteomics and other biological applications Int. J. Mass Spectrom. 200, 509–544 (2000)
Article CAS Google Scholar
Samalikova, M., Grandori, R.: Role of opposite charges in protein electrospray ionization mass spectrometry. J. Mass Spectrom. 38, 941–947 (2003)
Article CAS Google Scholar
Wang, H., Zhang, X., Xiao, J., Yang, S., Nie, A., Zhao, L., Li, S.: ESI-MS study on noncovalent bond complex of rhfkbp12 and new neurogrowth promoter. Sci. China Ser. C 46, 286–292 (2003)
CAS Google Scholar
Fischer, H., Polikarpov, I., Craievich, A.F.: Average protein density is a molecular-weight-dependent function. Protein Sci. 13, 2825–2828 (2004)
Article CAS Google Scholar

Download references

Acknowledgment

The authors gratefully acknowledge CNR/INFM and Juelich Super Computing for a large allocation of computer time, respectively, at the CINECA and JUGENE supercomputing centers.

Author information

Authors and Affiliations

SISSA and INFM-DEMOCRITOS center, Trieste, Italy
Roberto Marchese
Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
Rita Grandori
Computational Biophysics, German Research School for Simulation Sciences, D-52425 Jülich, Germany and Institute for Advanced Simulation IAS-5, Computational Biomedicine, Forschungszentrum Jülich, D-52425, Jülich, Germany
Paolo Carloni & Simone Raugei
Fundamental and Computational Sciences Directorate, Pacific Northwest National Laboratory, 902 Battelle Boulevard, Richland, WA, 99352, USA
Simone Raugei

Authors

Roberto Marchese
View author publications
You can also search for this author in PubMed Google Scholar
Rita Grandori
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Carloni
View author publications
You can also search for this author in PubMed Google Scholar
Simone Raugei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Rita Grandori or Simone Raugei.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

Supplementary information. (PDF 1.34 mb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Marchese, R., Grandori, R., Carloni, P. et al. A Computational Model for Protein Ionization by Electrospray Based on Gas-Phase Basicity. J. Am. Soc. Mass Spectrom. 23, 1903–1910 (2012). https://doi.org/10.1007/s13361-012-0449-0

Download citation

Received: 29 September 2011
Revised: 13 July 2012
Accepted: 14 July 2012
Published: 20 September 2012
Issue Date: November 2012
DOI: https://doi.org/10.1007/s13361-012-0449-0

Key words

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Computational Model for Protein Ionization by Electrospray Based on Gas-Phase Basicity

Abstract

Similar content being viewed by others

Charging of Proteins in Native Mass Spectrometry

Probabilistic calculations of biomolecule charge states that generate mass spectra of multiply charged ions

Decomposition of multidimensional charge state distributions of ions produced by electrospray ionization of bioorganic compounds. Part 1: Basic formalism and implementation of the method

1 Introduction