Introduction

The human chorionic gonadotropin (hCG) is a hormone essential for the preservation of the pregnancy and the development of the placenta and fetus [1]. It is a heterodimeric glycoprotein, consisting of an α (hCGα) and a β (hCGβ) subunit. The α-subunit has an identical amino acid sequence to the pituitary gonadotropin hormones such as the follicle-stimulating hormone, the thyroid-stimulating hormone, and the luteinizing hormone, containing two N-glycosylation sites [2, 3], while the β-subunit represents the specific part of hCG and has 2 N- and 4 O-glycosylation sites [4]. Nowadays, it is well-known that changes in the glycosylation pattern of a protein affect its biological activity and half-life. Therefore, the detection and identification of the numerous hCG glycoforms are essential to be able to use one or several of them as biomarkers [5].

With 8 potential glycosylation sites, hCG is a highly heterogeneous protein and the characterization of all its glycoforms constitutes an important analytical challenge. The analysis of the protein at the intact level by liquid chromatography (LC) or capillary electrophoresis (CE) hyphenated with mass spectrometry (MS) is a fast and easy approach that allows the determination of the glycosylation profile [6]. Recently, we optimized for the first time a CE method hyphenated with MS for the separation of hCG glycoforms [7]. The MS analyzer was a triple quadrupole that was useful for comparing glycosylation profiles but prevented the glycoform identification, which requires a high-resolution (HR) MS. Then, we developed for the first time a LC separation in reversed-phase (RP) mode coupled with quadrupole/time-of-flight (qTOF) MS for the analysis of hCG at the intact level [8]. This method led to the separation of some α-subunit glycoforms, but did not allow the detection of the hCGβ glycoforms, perhaps due to a higher heterogeneity of hCGβ and/or a less efficient ionization with its potential 2 N- and 4 O-glycans. With regard to selectivity in RPLC, a correlation between the retention time of the hCGα glycoforms and the number of their sialic acids present at the terminal position on the N-glycan antennas was observed. Surprisingly, the retention time of hCGα glycoforms increases with the number of their sialic acids whereas this elution order would be more expected with hydrophilic interaction liquid chromatography (HILIC), which is complementary and orthogonal to RPLC [9, 10]. Indeed, the retention mechanism in HILIC is partially based on partitioning of the compounds between a water-enriched layer formed at the surface of the polar stationary phase and the hydro-organic mobile phase when containing a high content of organic solvent. Additional interactions can then occur between the compounds and the stationary phase including hydrogen bonding, ionic, and dipole-dipole interactions [11, 12]. The highly organic mobile phase is particularly well adapted to MS detection, leading to an improvement in sensitivity for a large variety of compounds [13]. Therefore, the HILIC mode is nowadays largely used for the analysis of small polar compounds, glycopeptides, or N- and O-glycans released from glycoproteins [14,15,16] and could help to separate the glycoforms of a protein according to their glycan moieties.

For the analysis of glycoproteins at intact level in HILIC, only few studies reported its use [17,18,19,20,21,22,23,24]. Its potential was demonstrated for the analysis of intact neo-glycoproteins obtained after a glycosylation procedure from ribonuclease A, and two antigenic proteins, namely TB10.4 and Ag85B [17,18,19]. These studies allowed the monitoring of the glycosylation process and glycoprotein stability. More recently, Domínguez-Vega et al. [20] developed, for the first time, a HILIC-HRMS method for assessing the glycosylation pattern of one highly glycosylated protein, erythropoietin having 3 N- and 1 O-glycosylation sites [20]. Although a partial separation was obtained, the qTOF analyzer allowed the identification of 51 glycoforms, in addition to glycoforms with other post-translational modifications such as succinimide, oxidation, and N-terminal methionine-loss products [20].

With regard to hCG, we optimized a HILIC-UV method to separate its glycoforms [25]. Two hCG-based drugs were successfully analyzed using an amide-based column and a mobile phase containing ACN and water with 0.1% of trifluoroacetic acid (TFA). The resolution obtained made it possible to distinguish the two drugs thanks to their different glycoforms in number and nature. However, the hyphenation with HRMS is mandatory to significantly improve the level of information about the nature of these hCG glycoforms. Therefore, the potential of HILIC-(qTOF) MS for the hCG characterization is here evaluated and compared with the results obtained with RPLC-(qTOF) MS in terms of selectivity, sensitivity, orthogonality, and complementary.

Materials and methods

Reagents and analytes

HPLC-grade acetonitrile (ACN) and formic acid (FA) were supplied by Carlo Erba (Val de Reuil, France). TFA was purchased from Sigma-Aldrich (Saint Quentin Fallavier, France). The ultra-pure water was obtained with a Direct-Q3 UV system (Millipore, Molsheim, France). Ovitrelle® (Organon, Oss, The Netherlands) was presented as a solution containing 500 mg L−1 of hCG (recombinant mammalian cell, r-hCG). Pregnyl® (Serono Europe Ltd., London, UK) is a lyophilized powder containing 5000 IU of hCG isolated from the urine of pregnant women (u-hCG) (1 IU is equivalent to a concentration of 0.092 μg L−1 [26, 27]).

The Ovitrelle® solution was reconstituted in 0.5 mL of water to obtain a stock solution containing 500 μg mL−1 of r-hCG. For Pregnyl®, the lyophilized content of each ampoule was dissolved in 1 mL of water to obtain a concentration of 460 μg mL−1. All the solutions were aliquoted and stored at − 20 °C. Before injection into LC-MS, the hCG aliquots were washed and pre-concentrated to 1 mg mL−1 by centrifugation with ultra-centrifugal units (Merck, Darmstadt, Germany) having a molecular weight cutoff value of 10 kDa.

Instrumentations and columns

HILIC-(qTOF) MS

The HILIC-(qTOF) MS experiments were performed on a biocompatible 1260 Infinity system composed of a quaternary pump, an auto-sampler, a column oven, and a diode array UV detector (Agilent Technologies, Les Ulis, France) coupled to a micrOTOF-Q II Mass Spectrometer (Bruker Daltonics, Wissembourg, France) with an electrospray ionization (ESI) source. An Accucore Amide column (150 × 2.1 mm, 2.6 μm, 150 Å) purchased from Thermo Fisher (Le Pecq, France) was used. The mobile phase was composed of acetonitrile and water both containing 0.1% TFA. The mobile phase gradient was from 85 to 40% ACN in 30 min, a plateau for 5 min, and a return to the initial composition within 1 min and an equilibrium step for 15 min. The column temperature was maintained at 60 °C. The injection volume was fixed at 2 μL, the hCG sample was at 1 mg mL−1 in water, and the flow rate at 0.4 mL min−1. The MS experiments were operated in the positive mode and the mass detection range was set to m/z 2200–2500. The parameters of the ESI source were as follows: drying gas (N2) flow rate, 9.0 L min−1; drying gas temperature, 200 °C; nebulizing gas pressure, 30 psi; capillary voltage, 4500 V. The system was controlled by the micrOTOF Control software (Version 3.4, Bruker Daltonics), and the HyStar (Version 3.2, Bruker Daltonics) software was used to interface the HPLC and MS systems. To ensure optimal conditions for the detection, the calibration of the qTOF was performed once a week using cesium clusters (2 g L−1 in water-isopropanol (1:1, v/v)) allowing a mass accuracy of about 10 ppm.

RPLC-(qTOF) MS

The RPLC-(qTOF) MS method was previously optimized [8]. It was performed on a 1100 LC system (Agilent Technologies) hyphenated to the same (micrOTOF-Q II) MS detector. The RPLC separation was achieved using an Aeris WIDEPORE XB-C18 column (150 × 2.1 mm, 200 Å) packed with 3.6-μm core-shell silica-based particles (Phenomenex, Le Pecq, France). The mobile phase was composed of 0.1% FA in water and ACN. The mobile phase gradient was an increase from 4.5 to 31.5% of ACN in 50 min, a plateau for 5 min, and a return to the initial composition within 1 min and an equilibrium step for 10 min. The column temperature was maintained at 65 °C, the injection volume was 5 μL, the hCG sample was at 0.1 mg mL−1 in water, and the flow rate was set at 0.4 mL min−1. The MS detection was carried out with the same parameters as those used for the HILIC experiments, except for the mass detection range set to m/z 1000–2500. Table S1 (see Electronic Supplementary Material, ESM) lists MS parameters recommended by the manufacturer (Bruker Daltonics) for the analysis of macromolecules (with m/z > 1000) that were used.

Results and discussions

HILIC-UV vs HILIC-HRMS

The separation of hCG glycoforms in HILIC-UV was previously optimized, the reliability of the method in terms of retention time and peak area was demonstrated, and its application to two hCG-based drugs showed its potential for fingerprinting approach (see ESM Fig. S1) [25]. The content and nature of the acidic additive (FA and TFA) and the addition of a volatile salt (ammonium formate) in the mobile phase on the retention and the resolution were studied. It was observed that TFA, an ion-pairing agent, was mandatory to obtain a high number of peaks and led to a nice separation. It is worthwhile to notice that only reagents compatible with MS were investigated since, for identification purposes, the hyphenation with HRMS detection is required and this is the objective of this work. The best HILIC separation was obtained with an amide-based stationary phase and a mobile phase composed of water/ACN containing 0.1% of TFA. However, no peaks were observed in MS using these conditions and a m/z range of 1000–2500, as we did it in RPLC-MS [8]. This may be due to the presence of TFA in the mobile phase, which induced a dramatic decrease in the MS signal, but our previous study showed that TFA was essential to achieve a good HILIC separation of hCG glycoforms. In addition, TFA has a stabilizing effect on protein species in highly organic solutions and is considered a good solubilizing agent at low concentrations (e.g., ∼ 0.1%) [28]. Therefore, a compromise between LC resolution/protein solubilization and MS sensitivity was done.

To overcome this detection problem, two modifications were considered. First, the injected amount of hCG was increased by increasing the injection volume, because the hCG concentration was already high (1 mg mL−1). In our previous study with UV detection [25], the hCG-based drug samples were dissolved in water and an injection volume of 1 μL was used. Even if a water-based sample is not favorable in HILIC, as a partial precipitation of hCG was observed with an ACN content higher than 30%, a fully aqueous sample was maintained and the injection volume was increased to 2 μL. Secondly, the detection mass range was reduced in order to increase the MS signal intensity, even if only a single charge state of the glycoforms was observed instead of three generally. Therefore, different small m/z ranges of 300 instead of 1500 were tested and it was observed that the m/z range between 2200 and 2500 gave the best signal intensities.

With these conditions, the resulting base peak chromatogram (BPC) is presented in Fig. 1a analyzing the recombinant hCG-based drug sample (r-hCG). The presence of 6 peaks between 19.8 and 21.8 min is observed. It is much less than what was obtained in UV with the same conditions of separation, where at least 10 peaks along 3 elution zones (19.8–21.8, 21.8–23.4, and 23.4–25.2 min) were observed [25]. In HILIC-MS, the first zone corresponds to the same first elution zone observed in HILIC-UV, but no peaks were detected in the last two elution zones. Two explanations can be proposed. First, the detection mass range used (m/z 2200–2500) may not be suitable for the detection of glycoforms present in the second and third elution zones. Therefore, different mass ranges were then tested (m/z 100–800, m/z 800–1500 and, m/z 1500–2200), but no peaks were detected. Secondly, these glycoforms are difficult to ionize, which may be due to a high content of glycans and/or the presence of TFA in the mobile phase. Regarding the detected peaks in MS, the average spectrum between 19.8 and 21.8 min on the BPC (see arrow in Fig. 1a) is presented in Fig. 1b. As expected, the mass range used (m/z 2200–2500) enabled to detect m/z values corresponding to hCG glycoforms with a charge state (z) equal to 6 for all peaks. However, the observation of a single charge state does not allow the application of a conventional deconvolution method for the determination of glycoform mass values requiring 3 charge states.

Fig. 1
figure 1

a BPC and b average MS spectrum between 19.8 and 21.8 min (see arrow) obtained by HILIC-HRMS for r-hCG. Mobile phase 0.1% TFA in H2O/ACN. Gradient 85–40% of ACN with a slope of 1.5% min−1. Mass range 2200–2500 m/z, full-scan positive mode. Temperature 60 °C. r-hCG samples 1.0 mg mL−1 prepared in water. Injected volume 2 μL. Flow 0.4 mL min1

In conclusion, the applicability of the HILIC-MS method for the analysis of intact hCG is not trivial. Indeed, the presence of TFA, essential to separate the hCG glycoforms as previously demonstrated [25], leads to a very low ionization of glycoforms in MS and thus to a dramatic loss of information on glycoforms present in the last two elution zones which are detected in UV. Nevertheless, the HILIC-HRMS method gave some m/z values corresponding to several hCG glycoforms in the first elution zone. In order to evaluate the potential of HILIC for hCG characterization, the number of detected glycoforms and the sensitivity of the analysis were then compared with the values obtained by RPLC-HRMS [8].

Comparison of RPLC and HILIC

The RPLC-HRMS analysis of r-hCG led to about 12 chromatographic peaks (see ESM Fig. S2A) [8]. At least three charge states were observed (z = 9, z = 8, and z = 7) with m/z values between 1300 and 2200 on the average MS spectrum (ESM Fig. S2B) obtained between 28.5 and 32.5 min on the BPC (see arrow, ESM Fig. S2A). After deconvolution, these peaks were assigned to glycoforms of the α-subunit and no glycoforms of the β-subunit were detected [8].

The sensitivity between HILIC- and RPLC-HRMS was first compared, and it is worthwhile to notice that the qTOF system, r-hCG provider, and LC columns dimensions were the same. In addition, based on the hypothesis that 1 nmol corresponds to 35 μg of hCG (MW ≈ 35 kDa), a quantity of 56.8 pmol was injected in HILIC-HRMS (2 μL at 1 mg mL−1), instead of 14.2 pmol in RPLC-HRMS method (5 μL at 100 μg mL−1). With these conditions, the signal intensity observed on the BPC for the HILIC separation (Fig. 1a, approx. 3.0E103) is 10 times less intense than for the RPLC mode (ESM Fig. S2A, approx. 3.0E104). Therefore, the RPLC-HRMS method appears to be 40 times more sensitive than the HILIC-HRMS method, despite a larger mass range of detection (m/z 1000–2500 in RPLC-HRMS instead of m/z 2200–2500 in HILIC-HRMS), which clearly demonstrates a dramatic decrease in the sensitivity of the HILIC-HRMS method compared with RPLC-HRMS. As mentioned above, this high sensitivity difference is mainly due to the presence of TFA in HILIC that was required to obtain a high-resolution LC separation.

The complementarity of RPLC and HILIC modes was next studied. All the peaks observed on the spectrum obtained in HILIC-HRMS between m/z 2200 and 2500 (Fig. 1b) were selected and their extracted ion chromatograms (XICs) were plotted (Fig. 2a). The absolute mass values, calculated according to a charge state of z = 6, are reported. The XIC chromatograms were also plotted for all the peaks observed on the spectrum obtained in RPLC between m/z 1700 and 1900 (z = 8) (ESM Fig. S2B). They are presented in Fig. 2b. First, it is worthwhile to notice the presence of several not resolved peaks associated with a given mass in RPLC (e.g., 14,316.0 Da; Fig. 2b), while there is only one peak for a given mass in HILIC (Fig. 2a). Although it is difficult to conclude at this stage, two hypotheses will be discussed later.

Fig. 2
figure 2

XICs of hCGα glycoforms with N-glycan structures obtained with the analysis of r-hCG by a HILIC-HRMS or b RPLC-HRMS. Conditions: see the “Materials and methods” section

When comparing Fig. 2a and b, it is interesting to note that 5 glycoforms with masses of 13,440.8, 13,733.7, 14,024.8, 14,316.0, and 14,972.0 Da were detected with both methods. These hCGα glycoforms were previously identified during the RPLC-HRMS study [8], and their glycan structures are represented in Fig. 2a and b. These hCGα glycoforms contain sialylated N-glycans with two antennas, except for one form (14,972.0 Da) having an N-glycan with three antennas. Table 1 shows all the hCGα glycoforms detected during the analysis of r-hCG by HILIC- and RPLC-HRMS. In HILIC, even if one glycoform (13,566.4 Da) was not observed, 4 were detected in addition to RPLC (13,716.6, 14,006.4, 14,389.2, and 14,682.6 Da) and 2 out of 4 were identified (an x means that no peak was detected corresponding to its isoform and unidentified that a peak was present on the spectrum but the N-glycan structures were not identified). They contain 2 N-glycans with 2 and 3 antennas. A mass difference corresponding to an N-acetylneuraminic acid was observed between some of them (291 Da between 13,716.6 and 14,006.4 and between 14,389.2 and 14,682.6). It can be noticed that a higher number of glycoforms composed of N-glycans with 3 antennas are detected in HILIC- than in RPLC-HRMS (3 in HILIC vs 1 in RPLC). These results demonstrate the complementarity of both separation modes.

Table 1 List of absolute mass values and N-glycan structures of the α-subunit glycoforms of r-hCG analyzed by HILIC- or RPLC-HRMS. x, no peaks corresponding to its isoform. Unidentified: a peak is present on the spectrum but the N-glycan structures were not identified. Green circle: mannose; yellow circle: galactose; pink diamond: N-acetylneuraminic acid; blue square: N-acetylglucosamine

To confirm this complementarity, a second hCG-based drug prepared from urines of pregnant women, u-hCG, was then analyzed by HILIC-HRMS, as it was already carried out by RPLC-HRMS [8]. The resulting BPCs showed in both cases at least 10 peaks with retention times between 19.1 and 20.8 min in HILIC (Fig. 3a) and between 28.5 and 32.5 min in RPLC (Fig. 3b). As for r-hCG, a decrease in sensitivity was observed in HILIC-HRMS compared with RPLC-HRMS. For all the peaks detected on the mass spectrum, the XICs were plotted for each absolute mass value and the glycan structures are presented in Fig. 3c and d for HILIC and RPLC, respectively. Table 2 presents the glycoforms detected with both methods. Seven glycoforms of hCGα were detected in both modes. They are composed of sialylated N-glycans with 1 or/and 2 antennas except for 1 glycoform (13,010.4 Da) which was not identified. Furthermore, 3 additional hCGα glycoforms were detected in HILIC: 12,960.0, 13,659.0, and 13,666.8 Da. Regarding the glycoform with a mass value of 13,659.0 Da, the N-glycan structures can be proposed thanks to a mass difference of 291 Da with another already identified glycoform (13,949.9 Da). Unfortunately, the two other hCGα glycoforms were not identified. In RPLC, 2 glycoforms were detected in addition to HILIC: one oxidized form (13,310.4 Da) and one with 2 sialylated N-glycans with 2 antennas (Table 2). These results confirm again the complementary of both modes for the identification of hCGα glycoforms.

Fig. 3
figure 3

BPCs of u-hCG analyzed by a HILIC-HRMS or b RPLC-HRMS and XICs of the hCGα glycoforms with their N-glycan structures of u-hCG analyzed by c HILIC-HRMS or d RPLC-HRMS. Conditions: see the “Materials and methods” section

Table 2 List of absolute mass values and N-glycan structures of the α-subunit glycoforms of u-hCG analyzed by HILIC- or RPLC-HRMS. Ox, oxidized isoforms; x, no peaks corresponding to its isoform. Unidentified: a peak is present on the spectrum but the N-glycans structures were not identified. Green circle: mannose; yellow circle: galactose; pink diamond: N-acetylneuraminic acid; blue square: N-acetylglucosamine

In Tables 1 and 2, it appears that a higher number of hCGα glycoforms were detected in HILIC compared with RPLC for both drugs: 10 and 10 vs 6 and 9 for r- and u-hCG, respectively. As it was already concluded after analyzing both drugs in RPLC [8], despite the additional glycoforms detected in HILIC, none is common to both drugs. As an example, no hCGα glycoforms with 3 antennas were observed for u-hCG but some with a very large number of hybrid glycans. This confirms the strong difference in the glycosylation patterns of the 2 drugs previously mentioned [8].

Finally, the orthogonality of the HILIC and RPLC separation modes was investigated. Figure 4a presents the retention time in HILIC as a function of the retention time in RPLC for the most intense peak of the 4 identified hCGα glycoforms of r-hCG having N-glycans each with 2 antennas. It appears that the number of terminal sialic acids present on this type of glycoform contributes significantly to an increase in its retention time in both modes, which could indicate a poor orthogonality of these methods for the r-hCG characterization. A similar representation was done with the glycoforms identified during the analysis of u-hCG with both methods (Fig. 4b). According to Tables 1 and 2, the structures of the corresponding N-glycans are different for u-hCG since at least one out of the four antennas is based on mannose. It appears here that the elution order in HILIC does not seem to be related to the number of sialic acids of the glycoforms. In this case, some glycoforms vary only in the number of mannose sugars they contain, such as 13,293.7, 13,455.7, and 13,618.4 Da having 6, 7, and 8 mannose sugars, respectively. They co-elute in RPLC but not in HILIC. Looking at glycoforms with similar N-glycan structures but where a mannose has been replaced by an antenna (one N-acetylglucosamine plus one galactose plus one N-acetylneuraminic acid) on one of the two N-glycans, such as the glycoforms of 13,455.7 and 13,949.9 Da, respectively, they co-elute in HILIC but not in RPLC. This clearly demonstrates that HILIC and RPLC are orthogonal separation modes for the separation of u-hCG glycoforms, which have a more heterogeneous composition than r-hCG glycoforms.

Fig. 4
figure 4

Retention times in HILIC as a function of retention times in RPLC for the common hCGα glycoforms of r-hCG (a, c) and u-hCG (b, d). Only the retention time of the most intense peak for each mass value was selected (a, b). The retention times of all the peaks of a given mass value were selected (c, d). The absolute mass of each isoform and its number of sialic acid (SA) are given. In d, a red triangle corresponds to the most intense peak for a given mass value

As seen previously, it is also important to notice the presence of several peaks associated with a given mass in RPLC for both drugs (see Fig. 2b and 3d), while there is only one peak for a given mass in HILIC (Figs. 2a and 3c). This difference between both modes clearly appears also in Fig. 4c and d, where all peaks for one mass were plotted for r- and u-hCG, respectively. Two explanations can be proposed. First, the different peaks could correspond to different isomers varying by the localization of the sialic acids on the ends of glycans. However, this hypothesis cannot be retained because several peaks were also observed for non-sialylated glycoforms, having a mass value of 13,440.8 Da, as shown in Fig. 2b.

Secondly, the main differences between RPLC and HILIC are the presence of 0.1% TFA in the HILIC mobile phase instead of formic acid in RPLC and the nature of the stationary phase. Indeed, the injection solvent (H2O), the flow rate (0.4 mL min−1), and the H2O/ACN-based mobile phase are the same. Even the column temperature was almost identical (65 °C instead of 60 °C). It was already observed that TFA can induce the dissociation of the two subunits of hCG, since they are not covalently linked [4]. Therefore, it could be assumed that the TFA present in the HILIC mobile phase leads to the dissociation of the two subunits during their analysis, giving different retention times for the α- and β-subunits. This is in agreement with experimental results. Indeed, as observed above, the elution zone observed in HILIC-HRMS corresponds to glycoforms of the α-subunit and the two other elution zones observed in our previous experiments in HILIC-UV that are not detected in HILIC-HRMS could correspond to glycoforms of the β-subunit [8]. Unfortunately, it was impossible to confirm this hypothesis with UV detection and the too low MS signal intensity, due to the presence of TFA and the difficulty to ionize highly glycosylated glycoforms of the β-subunit that have potentially 2 N- and 4 O-glycans and should be therefore also more heterogeneous. In RPLC-HRMS, the absence of TFA in the mobile phase and the observation of different chromatographic peaks for one single mass of hCGα could be explained by the fact that the dimeric form of hCG is preserved during the separation. One single hCGα glycoform, which can be more easily ionized with only two N-glycosylation sites, could be associated with different hCGβ glycoforms, which are not well ionized and consequently not detected in MS [8]. However, it is still not possible to conclude at this stage of the hCG study.

Conclusion and perspectives

For the first time, a HILIC-HRMS method for the characterization of the highly glycosylated hCG protein at the intact level was developed. Despite the presence of TFA in the mobile phase, which affects the performance of the MS detection while favoring the separation resolution, the HILIC-HRMS method allowed the identification of different hCGα glycoforms in 2 hCG-based drugs. HILIC allowed the detection of a slightly higher number of hCGα isoforms than RPLC. However, both are complementary because some isoforms were only detected in RPLC or HILIC and, even more so, they are orthogonal. Several perspectives can be considered, such as improving sensitivity in HILIC-HRMS or developing two-dimensional approaches coupling HILIC in the first dimension and RPLC in the second.