Introduction

Evaluating physicochemical properties of new chemical entities (NCEs) at an early stage of drug development is mandatory to reduce attrition rates due to poor biopharmaceutical properties. Among these properties, ionisation constants (pK a) are needed to predict ADME (absorption, distribution, metabolism and excretion) behaviour, in particular to understand pH-related permeation mechanisms and solubility characteristics. Traditionally, potentiometric and spectrophotometric methods have been used to determine pK a values [1]. In the last 15 years, capillary zone electrophoresis (CZE) with UV detection confirmed its potential for measuring pK a values in a simple automated way [26]. Moreover, this technique consumes minor amounts of sample, is compatible with samples of low purity and is cost effective.

Within recent years, in addition to UV detection, conductivity and particularly capacitively coupled contactless conductivity detection (C4D) have become frequently used with CE [79]. The miniaturized dimensions of this detector and its simplicity have induced great interest, and devices are now commercially available permitting one to achieve good sensitivity and robustness. CE-C4D is suitable for measuring small inorganic and organic ions [10], and is particularly well adapted for chromophore- and fluorophore-free compounds. Generally, indirect UV detection strategies are used for non-UV absorbing analytes, but they present a lack of sensitivity. Furthermore, the strong UV-absorbing agent added to the background electrolytes (BGEs) must possess an electrophoretic mobility similar to the analytes to avoid important electrodispersion phenomena. In contrast, C4D is a reliable detection whatever the type of BGEs or analytes.

Thereby, it emerges as a near-universal detector, particularly well adapted to CE for determining physicochemical parameters, since such methods involve the use of specific conditions in regards to ionic strength and pH. Implementation of C4D over other detection techniques is not skill-demanding in the case of pure solute analysis, and commercially available instruments are easily hyphenated to CE. Features include on-capillary detection, great flexibility in capillary handling, capillary material independence and contactless detection. The last of these features allows the application of high separation voltage without interferences with the detection signal. Moreover, the detector can be freely positioned along the capillary and may therefore be placed where the apparent selectivity is optimum. Other advantages include providing a convenient second point of detection in addition to photometric detection, positioning several C4D cells along the capillary simultaneously and implementing in microchip and array technologies [11].

CZE is well suited for the separation of amino acids and peptides [12]. The pK a values of the 20 most common amino acids which do not possess an ionisable function on their lateral chain were determined with a conventional CZE method but required long analysis times especially at low pH (more than 30 min per analysis) [13]. More recently, Gaš and co-workers determined pK a values of the carboxylic groups of these amino acids with conductivity detection using two conductivity cells which enabled one to reduce the effective length at low pH and thus the analysis time (less than 15 min per analysis) [14, 15].

In order to reduce the analysis time for pK a determination, different generic strategies have been reported, such as pressure-assisted capillary electrophoresis [1619], short-end injection [20, 21], multiplexed capillary electrophoresis [22] or dynamic coated capillaries [23]. It can be noted that short-end injection and especially pressure-assisted strategies reduce apparent peak efficiency, which can decrease precision of migration time determination. All these approaches were generally used separately, whereas some of them may be combined for further time saving.

The aim of the present work consisted in developing a rapid and near-universal CZE method using short-end injection combined with a dynamic coating procedure for determining pK a values of ionisable functions of 20 natural amino acids. C4D was used in combination with UV to detect poorly UV absorbing compounds in the same run without instrument modification.

Theory

The ith dissociation of a fully protonated polyprotic solute H n Xz is described as follow:

(1)

where n is the total number of ionisable groups and z the charge of the fully protonated species. For measurements performed in non-ideal solutions, the thermodynamic dissociation constant of the ith dissociation step is defined as:

$$K_{{{\text{a}}i}} {\left( {{\text{thermodynamic}}} \right)}{\text{ = }}\frac{{{\left[ {{\text{H}}_{{n - i}} {\text{X}}^{{z - i}} } \right]}}}{{{\left[ {{\text{H}}_{{n - i{\text{ + 1}}}} {\text{X}}^{{z - i + {\text{1}}}} } \right]}}}{\text{ }} \cdot {\text{ }}\frac{{\gamma ^{{z - i}} }}{{\gamma ^{{z - i + {\text{1}}}} }}{\text{ }} \cdot {\text{ }}a_{{{\text{H}}^{{\text{ + }}} }} $$
(2)

where \(a_{{\text{H}}^{\text{ + }} }\) is the proton activity, terms in brackets represent molar concentrations of the electrical species of the solute and γ terms are the activity coefficients of charged species other than proton. The measurements giving apparent dissociation constants directly related to experimental conditions, the determination of thermodynamic constants is only indirectly possible using the estimation of γ terms by Eq. (3). Indeed, activity coefficients can be estimated by the Davies equation [24] extended from the Debye–Hückel model to take into account the effect of ionic strength (I) upon dissociation equilibria. According to IUPAC guidelines, the semi-empirical Davies equation (Eq. (3)) can be used for univalent and multivalent ions up to I = 0.2 M [25]:

$${\text{log }}\gamma = - A{\text{ }} \cdot {\text{ }}z^{\text{2}} \left( {\frac{{\sqrt {{\text{ }}I{\text{ }}} }}{{{\text{1 + }}\sqrt {{\text{ }}I{\text{ }}} }} - C{\text{ }} \cdot {\text{ }}I} \right)$$
(3)

where A = 0.509 and C = 0.3 in aqueous solutions at 25 °C. The Davies equation can also be used to compare measurements performed under different experimental conditions, for example at different ionic strengths. For simplification reasons, the notation K a used in this paper will refer to apparent dissociation constants measured under different conditions.

The effective mobility (μ eff) of a polyprotic compound H n Xz, coexisting in different ionised states at a given pH, depends on the molar fraction (χ j ) and on the effective mobility of each individual species :

$$\mu _{{\text{eff}}} = \sum\limits_{i = 0}^n {\chi _{{\text{H}}_{n - i} {\text{X}}^{z - i} } \cdot \mu _{{\text{H}}_{n - i} {\text{X}}^{z - i} } }$$
(4)

Therefore,

$$\mu _{{\text{eff}}} = \sum\limits_{i = 0}^n {\frac{{\left[ {{\text{H}}_{n - i} {\text{X}}^{z - i} } \right]}}{{\sum\limits_{i = 0}^n {\left[ {{\text{H}}_{n - i} {\text{X}}^{z - i} } \right]} }}} \cdot {\text{ }}\mu _{{\text{H}}_{n - i} {\text{X}}^{z - i} } $$
(5)

Using Eq. (2), the previous expression can be rewritten as:

$$\mu _{{\text{eff}}} = \sum\limits_{i = 0}^n {\frac{{\left[ {\prod\limits_{j = 1}^i {K_{{\text{a}}j} } } \right] \cdot a_{{\text{H}}^{\text{ + }} }^{n - i} }}{{\sum\limits_{i = 0}^n {\left[ {\prod\limits_{j = 1}^i {K_{{\text{a}}j} } } \right] \cdot a_{{\text{H}}^{\text{ + }} }^{n - i} } }}} \cdot {\text{ }}\mu _{{\text{H}}_{n - i} {\text{X}}^{z - i} }$$
(6)

Considering \(K_{{\text{a}}j} = 10^{ - {\text{p}}K_{{\text{a}}j} } \) and \(a_{{\text{H}}^{\text{ + }} }^n = 10^{ - n{\text{pH}}} \) Eq. (6) can be rearranged as a function of pH and pK aj :

$$\mu _{{\text{eff}}} = \sum\limits_{i = 0}^n {\frac{{\left[ {\prod\limits_{j = 1}^i {10^{ - {\text{p}}K_{{\text{a}}j} } } } \right] \cdot 10^{\left( {i - n} \right){\text{pH}}} }}{{\sum\limits_{i = 0}^n {\left[ {\prod\limits_{j = 1}^i {10^{ - {\text{p}}K_{{\text{a}}j} } } } \right] \cdot 10^{\left( {i - n} \right){\text{pH}}} } }}} \cdot {\text{ }}\mu _{{\text{H}}_{n - i} {\text{X}}^{z - i} }$$
(7)

This equation allows the determination of pK a values from a plot of μ eff as a function of pH. It can be applied to different ionisable compounds as shown in the Appendix.

Practically, μ eff of an analyte can be measured as:

$$\mu _{{\text{eff}}} = \frac{{L_{{\text{eff}}} \cdot L_{{\text{tot}}} }}{U} \cdot \left( {\frac{1}{{t_{\text{m}} }} - \frac{{\text{1}}}{{t_{{\text{EOF}}} }}} \right)$$
(8)

where t m and t EOF are the migration times (s) of the analyte and the neutral marker, respectively, U is the applied voltage (V), L tot the total capillary length (cm) and L eff the effective capillary length (cm).

Materials and methods

Chemicals

L-Alanine, L-asparagine, L-aspartic acid, L-cysteine, L-glutamic acid, L-glutamine, L-methionine, L-proline, L-serine, L-threonine, L-tryptophan, L-valine, L-arginine, L-histidine, L-isoleucine, L-leucine, L-lysine, L-phenylalanine and L-tyrosine were purchased from Fluka (Buchs, Switzerland). L-Glycine was obtained from Merck (Dietikon, Switzerland).

Phosphoric acid (H3PO4), formic acid (HCOOH), acetic acid (CH3COOH), MES (2-(N-morpholino)ethanesulfonic acid), MOPS (3-morpholinopropanesulfonic acid), TRICINE (N-(tris(hydroxymethyl)-methyl)glycine) and CHES (2-(cyclohexylamino)ethanesulfonic acid) were purchased from Fluka (Buchs, Switzerland). HPLC grade methanol was supplied by Romil (Kölliken, Switzerland), analytical reagent grade acetone by Acros Organics (Basel, Switzerland), hydrochloric acid by Riedel-de-Haën (Buchs, Switzerland) and 1 M Titrinorm sodium hydroxide by VWR (Dietikon, Switzerland). The Ceofix® pH 2.5 solutions were obtained from Analis (Namur, Belgium). All solutions were prepared using ultra-pure water supplied by a Milli-Q Waters Purification System from Millipore (Bedford, MA, USA).

Buffers and samples preparation

Twenty seven buffers covering a pH range from 1.5 to 12.0 were prepared. They were set at a constant ionic strength of 50 mM, and are listed exhaustively in Table 1. The pH values were measured with a Mettler-Toledo SevenMulti pH meter (Schwerzenbach, Switzerland), calibrated daily with four aqueous solutions at pH 2.00, 4.00, 7.00 and 10.00 from Riedel-de-Haën (Buchs, Switzerland). Stock solutions of each amino acid were prepared at a concentration of 1 mg mL−1. Samples were set at a concentration of 0.1 mg mL−1 in BGE, water and acetone (EOF neutral marker) 85:10:5 (V:V:V).

Table 1 Preparation of buffers at 50 mM ionic strength and applied voltages

Instrumentation

CE experiments were performed with an HP3DCE system (Agilent, Waldbronn, Germany) equipped with an on-capillary diode array detector (8.5 cm from the anode), an on-capillary capacitively coupled contactless conductivity detector (13.65 cm from the anode), an autosampler and a power supply able to deliver up to 30 kV. UV detection was carried out at 195 nm for amino acids and 260 nm for acetone. C4D was performed with a TraceDec detector (Innovative Sensor Technologies GmbH, Strasshof, Austria). The conductivity sensor consisted of two electrodes separated by a detection gap of 1 mm, housed inside an aluminium case and positioned along the capillary by sliding it into the desired position. CE ChemStation (Agilent) was used for CE and UV control, data acquisition and data handling. C4D was set at a frequency of 300 kHz and an amplitude of 100 V for the AC signal. C4D Tracemon (Istech), version 0.07a, was used for conductivity detector control. Analyses were performed in uncoated fused silica (FS) capillaries (BGB Analytik AG, Böckten, Switzerland) with 50-μm inner diameter (ID) and 32.5-cm total length. Samples were kept at room temperature in the autosampler and introduced by short-end injection, i.e. the polarity was reversed and samples were injected at the detector side. An injection equivalent to 0.4 % of the capillary’s total length was performed by applying a pressure of 48 mbar s (12 mbar for 4 s). During analysis, a voltage of 3.5 to 8 kV (depending on the running buffer) was applied to avoid the detrimental effect of Joule heating. The capillary was thermostated at 25 °C by a high velocity air stream. Before its first use, the capillary was sequentially rinsed (1 bar) with methanol, 1 M HCl, water, 1 M NaOH, 0.1 M NaOH and water (5 min each). Between different pH buffers, several washing steps (1 bar) were employed, namely water (1 min), Ceofix® initiator (0.5 min), Ceofix® accelerator (1 min), BGE (5 min) and pre-electrophoresis (3.5–8 kV, 5 min). Between analyses, the capillary was rinsed (1 bar) with BGE for 1 min. When not in use, the capillary was rinsed with water and dry stored.

Procedure

Each compound was injected once at each pH. The effective mobility was calculated from analyte and neutral marker migration times (Eq. (8)). Calculated effective mobilities were reported as a function of pH, giving rise to sigmoidal curves. With a GraphPad Prism 4.02 software (GraphPad Software, San Diego, CA, USA), non-linear regressions were performed to determine apparent pK a values at I = 50 mM, using the appropriate equation described in the Appendix.

Results and discussion

Previously, pK a values of more than ten drugs were determined by CZE-UV with a dynamic coating of the capillary to ensure high EOF, whatever the buffer pH [23]. In the present work, this procedure was combined with short-end injection to further increase the throughput and C4D was used as an additional detector for poorly UV absorbing compounds.

Analytical development

Twenty-two buffers reported in Table 1 were selected for pK a determination, which covered a pH range from 1.5 to 12.0 with an increment of 0.5 pH units between each. A constant ionic strength was used to avoid μ eff variations [2628] and keep activity coefficients constant (Eq. (3)). The ionic strength was set at 50 mM as a compromise between sufficient buffer capacity and low Joule heating.

The nature of the buffers was carefully selected to insure the best buffer capacity as well as minimal interactions with analytes. As shown in Fig. 1, μ eff values of arginine and lysine were lower with phosphate rather than MES or MOPS buffers from pH 6.0 to 8.0. At these pH values, these amino acids were mainly present in their monoanionic dicationic form and may thus be associated by electrostatic interactions with monohydrogenophosphate ions. As expected from theory, this phenomenon increased with pH values, since the concentration of monohydrogenophosphate ions also increased. These interactions were more important with amino acids possessing a positively charged lateral chain, such as arginine and lysine. As a consequence, MES and MOPS buffers were preferred to phosphate buffers in this pH range. Moreover, those buffers are more appropriate to CZE-C4D, since amphoteric electrolytes with a low equivalent conductivity at considerably high concentrations can limit peak dispersion effects.

Fig. 1
figure 1

Comparison of the relationship between μ eff and pH for arginine (a) and lysine (b) by CZE-UV: phosphate buffers (*) vs. optimized buffers (•)

The applied voltage was adjusted to the buffer conductivity to prevent excessive Joule heating. For this purpose, the voltage was adapted to remain in the linearity domain of Ohm’s law. As a result, voltages between 3.5 and 8.0 kV were applied, depending on the buffer (Table 1). The lowest voltages were applied with extremely acidic or basic buffers, because of their high conductivities.

Samples were diluted in 85% running BGE to avoid seriously distorted triangular peak shapes which induced inaccurate determination of migration times. Such distorted peaks are due to electrodispersion phenomena which are quite pronounced for amino acids possessing very high diffusion coefficients. Low differences in conductivity and pH between the running buffer and the sample zone limited electrodispersion [29, 30]. Therefore, peak shapes were symmetrical, allowing the precise determination of migration times. However, this procedure was at the expense of C4D sensitivity, since maximal sensitivity should be obtained when the highest possible conductivity difference between the analyte zone and BGE is achieved. Furthermore, matching sample and BGE global conductivity led to poor sample stacking, which was also detrimental to sensitivity [9, 30].

Curves of μ eff as a function of pH were determined with UV and C4D detections. Both detection methods complemented each other since some amino acids were not determined by UV because of their lack of chromophore, but were easily detected by C4D. For instance, glycine, lysine and glutamic acid were hardly achieved by UV, while C4D provided large peaks as shown in Fig. 2. Likewise, other amino acids were better detected by C4D than UV: alanine, aspartic acid, isoleucine, leucine, proline, serine, threonine, valine. In contrast, some amino acids were better detected by UV, but scarcely by C4D due to similar conductivity with the BGE. Although some amino acids were poorly detected by one of the two used detectors, it is noteworthy that in all cases enough points were achieved to determine pK a values by UV as well as C4D. Therefore, the combination of both detectors proved to be very useful, and allowed the analysis of the 20 amino acids in the 22 used buffers.

Fig. 2
figure 2

a CE-C4D analysis of glycine at pH 2.5, 5.5 and 10.0. b CE-C4D analysis of glycine at pH 2.5, 5.5, 10.0 and 11.0. c CE-C4D analysis of glutamic acid at pH 2.5, 5.5 and 10.0. BGE and separation voltage: as described in Table 1, capillary dimensions 50-μm ID x 32.5 cm, effective length 13.65 cm, injection 48 mbar s, temperature 25 °C

Since the main objective of this study was to reduce the analysis time, a dynamic coating procedure was combined with a short-end injection. The dynamic coating procedure generated a large EOF (ca. 5 × 10−4 cm2 V−1 s−1) whatever the buffer with pH values as low as 1.5. Moreover, it prevented intrusive interactions between compounds and capillary walls [12]. Concerning the short-end injection, this approach reduced the analysis time drastically without enhancing Joule heating by shortening the capillary total length. As a result, the total analysis time (i.e. in the 22 buffers) took less than 2 h per compound, including the washing steps, injection, vial permutation and separation times.

pK a determination

Plots of μ eff as a function of pH are shown in Fig. 3 for selected proteogenic amino acids. At any pH, μ eff of a given compound depends on the molar fraction of each ionised form (Eq.(4)). For example, asparagine (Fig. 3a) is a typical amino acid possessing two pK a values, showing a first inflection point between pH 1.5 and 3.5 (pK a = 2.21) and a second inflection point between pH 7.5 and 10.5 (pK a = 8.79). Twelve other amino acids also possess two pK a values, and present the same relationship between μ eff and pH. The curves of the seven other amino acids exhibited three inflection points corresponding to three dissociation constants as is the case for the monoacidic dibasic amino acid histidine (Fig. 3b) and the diacidic monobasic amino acid glutamic acid (Fig. 3c). In a few cases, dissociation steps partially overlapped due to close pK a values.

Fig. 3
figure 3

Relationship between μ eff and pH for a monoacidic monobasic amino acid asparagine, b monoacidic dibasic amino acid histidine, c diacidic monobasic amino acid glutamic acid and d tyrosine

The pK a values of the 20 investigated amino acids were determined using the appropriate fitting model (Appendix). pK a values of carboxylic and amino groups as well as pK a values of lateral chains were obtained for the first time by CE. The ionisation constant was affected by the ionic strength as demonstrated by Eq. (3). It was then necessary to report the ionic strength as well as temperature, since ionisation constants are also temperature-dependent. Since most of the results reported in the literature are determined at a physiological ionic strength of 150 mM [31], the obtained apparent pK a values at 50 mM ionic strength were converted to 150 mM ionic strength using Eq. (3) and data are listed in Table 2.

Table 2 Apparent pK a values at 150 mM ionic strength and 25 °C

With this procedure, 95% confidence intervals were typically lower than 0.20 units. They were higher at the limits of the investigated pH range (1.5 to 12.0). For instance, the pK a of arginine lateral chain (>12) was outside the pH range used in this study and therefore was not determined. The 95% confidence intervals were also high for overlapping pK a values as for example pK a2 and pK a3 of tyrosine (Fig. 3d), lysine, aspartic acid and cysteine. pK a values obtained by UV detection and C4D were similar; by taking into account the 95% confidence interval, they were considered identical.

As a comparison, pK a values of the 20 amino acids were plotted versus literature pK a values obtained at 25 °C (Fig. 4) [32]. The CZE-measured pK a values are correlated with those reported in the literature by using Passing–Bablok regression [33, 34]. Considering the 95% confidence interval, the slope of the linear regression was equal to 1 and the y-intercept 0. The CZE-measured pK a values could thus be considered similar to the literature values.

Fig. 4
figure 4

Plot of experimental pK a values obtained by CE-UV with 95% confidence interval (Table 2) vs. pK a values from literature [31]. Slope (CI) 0.988 (0.976 to 1.003), intercept (CI) 0.069 (−0.004 to 0.133)

Conclusion

A rapid and universal CZE-UV-C4D method was developed, using 22 optimal background electrolytes, to determine pK a values of all ionisable functions of 20 amino acids. Two generic strategies, namely short-end injection and dynamic coated capillaries, allowed the successful determination of all pK a values in less than 2 h per compound. The combined use of both detectors extended the method to poorly UV absorbing amino acids without supplementary analyses. This powerful combination allows the application of this method to the determination of pK a values for almost any kind of solute.

Importantly, this methodology can be transferred to array systems in order to perform several analyses simultaneously as C4D might be compatible with such instruments. To further increase throughput, a mass spectrometer could also be hyphenated to CE which would allow a greater pooling of compounds per analysis owing to the additional selectivity offered by MS [35].