Introduction

Glycosylation is one of the most common post-translational modifications that occur in more than 50% eukaryotic proteins and modulate in various cellular processes [1]. Above all, it has been found that abnormal glycosylation is closely associated with numerous diseases containing cancer [2], and it has been reported that glycopeptides and glycoproteins may act as biomarkers of particular types of diseases [3, 4]. Consequently, for profound investigation of the connection among diseases and glycosylation, it is critical to know the protein glycosylation procedure, data of glycosylation sites, and important sequence of peptide in medicinal finding [5]. Although high throughput, resolution, and speed of mass spectrometry facilitate the glycosylation, the glycopeptide poor ionization and very low abundance make it a challenging task [6]. So, developing fast and high efficiency enrichment strategies to enrich selectively the N-linked glycopeptides from bio-samples prior to MS would make contributions to glycoproteomics research [7]. Different enrichment strategies which have been developed to enrich the N-linked glycopeptides include hydrazide chemistry [8], lectin affinity chromatography [9], BA (boronic acid chemistry), and hydrophilic interaction chromatography (HILIC) [10, 11]. Among all, this hydrophilic interaction chromatography (HILIC) is widely accepted due to mild enrichment conditions, unbiasness toward glycans composition, high enrichment efficacy, and compatibility to MS. Many substrates containing the hydrophilic functional groups (-COOH, -NH2, -SO3H, -OH, etc.) on their surface for N-linked glycopeptides enrichment attained great attention [12].

New nanocomposite-based HILIC sorbent for the selective enrichment of glycopeptides is in demand. Different enrichment efficacy was observed in various substrates with hydrophilic groups like mesoporous silica [13], polymeric monolithic tip [14], fibrous cellulose [15], magnetic nanoparticles [16], sepharose [17], and MOFs (metal-organic frameworks) [18, 19]. Among those, metal-organic frameworks stand out due to their ultrahigh surface specific area [20], tailorable chemistry, and high acid resistant stability and tunable nanostructure cavities [21]. Therefore, they have been applied as stationary phase in separation science. Moreover, metal-organic frameworks have been widely applied in heterogeneous catalysis [22], surface enhanced Raman spectroscopy (SERS) analysis [23], hydrogen storage [24], and so on.

So, development of high surface area and porosity polymer-MOF composite is achieved in the present study. In this context, we developed zeolitic imidazolate framework (ZIF-8) on the surface of poly(styrene-co-divinylbenzene-co-methacrylic acid) polymer. Terpolymer@ZIF-8 is further functionalized with boronic acid using 3-isocyanatopropyltriethoxysilane as precursor via N-(3 aminopropyl)imidazole, merging the qualities of the polymer with the high surface to volume area of ZIF-8 and extending the selective nature by using BA. The hybrid terpolymer@ZIF-8@BA was applied to tryptic digest of IgG as mono-glycosylated and HRP as multi-glycosylated peptide sample to determine separation capacity, reproducibility, reusability, and selectivity toward glycoproteins.

Experimental

Chemicals and reagents

The chemicals and reagents to carryout research are given in supporting information.

Fabrication of terpolymer@ZIF-8@BA

The fabrication route adopted for polymer/MOF composite involves (a) synthesis of terpolymer, (b) terpolymer/MOF composite, and (c) functionalization with boronic acid.

Synthesis of terpolymer

Detailed procedure for the terpolymer synthesis is provided in supporting information.

Terpolymer/MOF composite

Terpolymer was first incubated with 1.0 mol/L N-(3-aminopropyl)-imidazole methanol solution at 60 °C for 3 h, followed by rinsing the terpolymer with methanol until the pH value is neutral. In the next step, material was incubated with Zn (NO3)2∙6H2O at 60 °C for 3 h. After that, material was treated with 2-methylimidazole methanol solution using the same reaction condition for 1 h. Step 2 and step 3 were repeated for 1 h in 5 cycles for layer by layer synthesis. Methanol was used to rinse the column between each step. The terpolymer@MOF composite was collected.

Terpolymer@ZIF-8 was functionalized with aminophenylboronic acid to perform selective enrichment of N-linked glycopeptides. At first, 160 mg of 3-aminophenylboronic acid monohydrate was dissolved in THF. In the above solution, 3-isocyanatopropyltriethoxysilane (240 mL) was added under constant stirring for 24 h at room temperature. The obtained phenylboronic acid functionalized triethoxysilane reagent (APBA) was acidified using acetate buffer (0.1 mol L−1, pH 5.2, 40 mL) and 40 mg of terpolymer@ZIF-8 nanoparticles were dispersed homogeneously. The reaction was allowed to proceed at room temperature for 12 h. The final product (terpolymer@ZIF-8@BA) was collected and dried at 50 °C overnight.

Digestion of standard proteins

The protocol for tryptic digestion of IgG, HRP, and BSA is given in the supporting information.

Enrichment of N-linked glycopeptides

Enrichment of N-linked glycopeptides from tryptic digest of HRP and IgG is carried out using SPE batch extraction method. Terpolymer@ZIF-8@BA (3 mg) was equilibrated with 100 mL of loading buffer (50 mM NH4HCO3). The conditioned affinity material was incubated with 100 μL of digested protein sample at 37 °C for 45 min. The affinity material was collected and washed with the loading buffer to remove non-specifically bound peptides. The captured N-linked glycopeptides were eluted by 10 μL of acidic buffer (50%ACN/49%H2O/1%TFA) at 37 °C for 30 min. The eluted peptides were kept for MS analysis. In HILIC, normally during the enrichment process, loading and washing buffer composition is 50 mM ABC (ammonium bicarbonate) which is mild as compared to hydrazine and boronic acid affinities, but the elution buffer contains 1% TFA along with the 50% ACN that is also compatible with MS.

Regeneration of MOF material

To determine the regeneration ability of terpolymer@ZIF-8@BA, the composite was washed with 250 μL of 30% ACN in 0.1%TFA three times followed by de-ionized water. This removes the memory affect from previous enrichment. Prior to using regenerated material for second cycle of enrichment, it was equilibrated with loading buffer. This regenerates the surface chemistry of material. This cycle of regeneration was repeated up to 4 times.

Human serum digestion

In solution digestion of serum sample, 1.0 μL of serum was diluted with 17 μL of 25 mM ABC (ammonium carbonate) (pH = 7.8–8.0). At centrifugation at 12000g for 2 min, supernatant of diluted serum was collected. Then reduction was done by 10 m M dithiothreitol at 37 °C for 30 min, followed by alkylation using 20 mM of iodoacetamide at 37 °C for 60 min. Finally, for digestion, trypsin was added 1:30 (trypsin:protein) for 16 h at 37 °C. Trypsin was deactivated by adding 1 μL of 1% TFA and stored kept at − 20 °C.

MALDI-TOF/TOF MS analysis

One microliter of eluted fraction containing enriched N-linked glycopeptides was deposited onto 600 μm Anchor Chip Target with 1 μL of matrix solution. The matrix solution was prepared by dissolving 20 mg mL−1 of DHB in TA-30 solution (30% ACN in 0.1% TFA). The eluted peptides were directly analyzed MS as no desalting is required for this strategy. For external calibration, peptide Calibrant II Mono was also spotted. MALDI-TOF/TOF analysis was carried out on Ultraflex Extreme mass spectrometer (Bruker Daltons, Bremen, Germany). All the spectra were acquired in positive reflection mode, 1000 laser shots per spectrum. Resolution was kept 15,000–20,000. All the spectra with mass range 1000–5500 were processed using Flex Analysis Version 3.4 supplied by Bruker Daltonics.

Results and discussion

Fabrication of terpolymer@ZIF-8@BA

ZIF-8 is a member of a subfamily of MOFs and shows good mechanical stability, high porosity, and outer-surface properties [25]. These special properties make it a valuable candidate in separation and enrichment. Terpolymer offers support material to generate desired MOF on surface. The polymeric nature of terpolymer holds strong mechanical properties which endure the resilence during chemical modification. The inert nature reduces the non-specificity which is indirectly significant for selective enrichment. It further reduces agglomeration which is a common limitation in carbon-based or magnetic support materials. These properties of terploymer contribute to its selection for fabrication of terpolymer-ZIF8 composite. In this procedure, ZIF-8 is accumulated on the surface of terpolymer making its surface more available and uniting the benefits of polymer and ZIF-8. The 3-aminophenylboronic acid and triethoxysilane reagent were combined to achieve the reagent that generated a recognition site on the surface of composite. The boronic acid ligand is used to selectively recognize N-linked glycoproteins and N-linked glycopeptides. The preparation of terpolymer@ZIF-8@BA is demonstrated in Fig. 1.

Fig. 1
figure 1

Schematic representation of fabrication procedure of terpolymer@ZIF-8 composits@BA

Batch extraction method is adopted for enrichment as the incubation step provides enough time for selective binding of glycosylated peptides with boronic acid. Among reported affinities, boronic acid is chosen due to the reversible nature of binding [26]. The regeneration of material is easy and to consider the cost-effectiveness of research method, boronic acid is the best choice.

Characterization of terpolymer@ZIF-8@BA

Scanning electron microscopy (SEM), transmission electron microscopy (TEM), energy-dispersive X-ray (EDX), and Fourier transform infrared spectra (FTIR) are measurements of terpolymer@ZIF-8@BA measured and are shown in Fig. 2. SEM images (Fig. 2a) of the particles indicate that functionalized MOF particles are crystalline in nature with regular hexagonal morphology. TEM indicates that the single phase of ZIF-8 is obtained high crystallinity. Calculated particle size of the particles is around 55 nm. Usually the ZIF particles are properly dispersed and every crystal is separated from the other, but in this case, particles are closely attached to each other due to the presence of polymeric network to which ZIF crystals are adhered. The agglomerate sizes are approximately 1–2 times the size of particle. The loss and dispersible nature of agglomerates is also observed by TEM (Fig. 2b). Energy-dispersive X-ray spectroscopy analysis reveals that all the representative elements of terpolymer@ZIF-8@BA are present with carbon being the highest percentage (38%). This may be due to the presence of polymeric network attached to ZIF-8. Nitrogen, oxygen, and zinc are also present in the EDS spectrum (Fig. 2c). FTIR analysis shows representative peaks of OH at 3500 cm−1, CH stretching at 3050 cm−1, carbonyl groups at 1710 cm−1, and B-O at 1230 and 950 cm−1. These bands indicate the successful formation of terpolymer@ZIF-8@BA (Fig. 2d).

Fig. 2
figure 2

Characterization of terpolymer@ZIF-8@BA by a scanning electron microscopy, b transmission electron microscopy, c energy-dispersive X-ray spectroscopy, and d Fourier transform infrared spectroscopy

Porosity and surface area play significant role in enrichment as steric hindrance is reduced among bulky molecules which help to avoid saturation of affinity sites. Thus, material selective nature at low concentration is not compromised due to non-specific background. Layer by layer synthesis of MOF indicates its improved surface area during the fabrication. This is confirmed by using nitrogen adsorption porosimetry to calculate the surface area and pore size of terpolymer@ZIF-8@BA synthesized after first cycle and fifth cycle. Fig. S2A (ESM) shows the adsorption-desorption isotherm of terpolymer@ZIF-8@BA prepared by single cycle. Calculated surface of the material is 184.76 (m2 g−1), and BJH adsorption-desorption pore volume is 2.71 nm. After running 5 cycles, surface area of terpolymer@ZIF-8@BA is increased to 412 m2 g−1 with pore volume 0.075328 cm3 g−1. This sufficient increase in surface area will improve the enrichment ability of material. XRD analysis of terpolymer@ZIF-8@BA shows the peaks at 4.5°, 11°, 13°, 16.5°, 20°, 24°, 28°, 30°, 38.7°, 42°, 44.3°, 46°, and 55.4° which corresponds to carbon and zinc present in the structure Fig. S2B (Electronic supplementary material, ESM).

Enrichment of immunoglobulin G-mono-glycosylation

Human IgG analysis is important as the glycosylation is influenced by genetic makeup and health state. Variation in IgG glycosylation leads to development and progression of many diseases [27]. IgG is mono-glycosylated with conserved N-glycosylation site located at Asn180. The heterogeneity of IgG corresponds to complex type di-antennary N-linked glycans carrying 0–2 galactoses with a major fucose core and a minor having GlcNAc and 1–2 sialic acids. Therefore, the enrichment of N-linked glycopeptides generated as a result of IgG digestion will be helpful to study the variation in glycans attached to the peptide. A simple MS spectrum recorded for digested IgG shows high intensity non-specific background (Fig. 3a). The digest is applied to terpolymer@ZIF-8@BA and after elution, non-specific background is almost negligible with high intensity peaks of characteristics IgG N-linked glycopeptides (Fig. 3b). Using the peptide mass fingerprinting and BioTools, the mass peaks are identified with the attached glycans. Using terpolymer@ZIF-8@BA, 20 N-linked glycopeptides are detected with similar peptide sequence of EEQFN#STFR. The mass shift is due to the attached glycans. This indicates that the surface modification of terpolymer@ZIF-8 via boronic acid facilitate the selective enrichment of N-linked glycopeptides. The improvement in performance of designed composite relies on the enhanced hydrophilicity. The detailed information of the detected N-linked glycopeptides from IgG digest by MALDI-TOF MS analysis is listed in Table S1 (ESM).

Fig. 3
figure 3

MALDI-TOF mass spectra for the N-linked glycopeptides from IgG tryptic digest a before enrichment and b after enrichment by terpolymer@ZIF-8@BA. N-linked glycopeptides are marked in red stars

Enrichment of HRP multi-glycosylation

Horseradish peroxidase (HRP) has broad applications in biomedical research as diagnostic enzyme. It is used as a model glycoprotein with xylosylated core and fucosylated N-glycans. The attached glycans respond to environmental factors and thus act as antigenic plant allergens. HRP contains eight glycosylation sites at Asn 43, 87, 188, 216, 228, 244, 285, and 298. Therefore, tryptic digest of HRP mimic a complex sample suitable to evaluate the enrichment efficiency of fabricated material. The digest is subjected to MS analysis and it is clear that no significant detection is observed above 3000 Da (Fig. 4a). Only four N-linked glycopeptides are detected which is insufficient to relate HRP glycosylation with respective change. Terpolymer@ZIF-8@BA is used to reduce the non-specific ground and after enrichment 17 N-linked glycopeptides can be identified above S/N ratio (Fig. 4b). The information regarding the peptide m/z, its sequence, and associated glycan is given in Table S2. It is important for the designed affinity to enrich the N-linked glycopeptides with specific glycosylation site reported for the protein under study. This is helpful to determine any modification in normal and affected protein structure and thus can be traced to malfunctioning of said protein. Sequence coverage analysis is performed using the SwissProt database and identified N-linked glycopeptides are sequenced to check the percentage coverage. It is interesting that the data obtained provides 31.42% sequence coverage with complete glycosylation site detection. All the eight glycosylation sites are recovered as shown in the sequence coverage table (Table S5). Thus, terpolymer@ZIF-8@BA is the potential affinity material that can be used to assist the HRP-based proteomic analysis.

Fig. 4
figure 4

MALDI-TOF mass spectra for the N-linked glycopeptides from HRP tryptic digest a before enrichment and b after enrichment by terpolymer@ZIF-8@BA. N-linked glycopeptides are marked in red stars

Determination of selectivity

Spiked sample of target species in the non-specific background is used to measure the selectivity of designed material. Generally, biological fluid comprises of target molecule in low concentration with complex background. The selective nature of affinity material can be compromised in the presence of species which do have the tendency to bind non-specifically. Therefore, selectivity of terpolymer@ZIF-8@BA for N-linked glycopeptides is measured using tryptic digest of HRP and BSA digest in mass ratio of 1:1, 1:50, 1:100, 1:150, and 1:200. The enrichment protocol is adopted and MALDI-MS analysis is performed for each sample. In 1:1, all characteristic N-linked glycopeptides are observed showing negligible interference from BSA-derived non-glycopeptides (Fig. S3a, ESM). As the ratio of BSA is increased to 1:50 and 1:100, loss of one N-linked glycopeptide is observed at m/z 2361 (Fig. S3b) and 2613 (Fig. S4c). The possible loss is due to low abundance of these N-linked glycopeptides which is further lowered in the presence of BSA peptides. Increasing BSA concentration to 1:150, still HRP N-linked glycopeptides are detected above S/N ratio with loss of N-linked glycopeptides above 4500 Da (Fig. S4d). The possible reasons can be the bulky nature of N-linked glycopeptides with enhanced steric hindrance due by BSA peptides. At 1:200, the number of N-linked glycopeptides is similar to 1:150 but non-glycosylated peptides become visible below 2300 Da (Fig. S5, ESM). Thus, the selectivity experiment reflects that terpolymer@ZIF-8@BA possesses good selective recognition ability toward N-linked glycopeptides in the presence of complex surroundings. This supports the fact that MOF-based composite is more appropriate media to cultivate boronic acid affinity due to improved hydrophilic interactions.

Sensitivity experiment

The common hurdle in targeted proteome analysis is the sensitivity which limits the application of affinity material or developed method for particular biological molecule. High selectivity of terpolymer@ZIF-8@BA will be more favorable if improved sensitivity is achieved at femto molar concentrations of sample. To perform the task, four solutions of different concentrations as 1 pmol μL−1, 100 fmol μL−1, 10 fmol μL−1, and 1 fmol μL−1 are prepared using HRP digest in de-ionized water. After enrichment with terpolymer@ZIF-8@BA, HRP-derived N-linked glycopeptides are detected in MS spectra obtained. At 1 picomolar solution, there is no loss of N-linked glycopeptides Fig. S6a (Electronic supplementary material, ESM), whereas the signal intensity decreased to half when the concentration is lowered to 100 fmol μL−1 (Fig. S6b). At lower concentration down to 10 fmol μL−1, loss of two N-linked glycopeptides is observed (Fig. S7c). It must be noted that mass signals of these N-linked glycopeptides were already of low intensity and when dilution is increased, these relatively less abundant N-linked glycopeptides are not observed above S/N ratio. At 1 fmol μL−1, abundant N-linked glycopeptide peaks are detected easily (Fig. S7d). In short, the sensitivity experiment depicts that terpolymer@ZIF-8@BA has detection limit down to 1 fmol μL−1 of solution. Better sensitivity can be attributed to the hydrophilic porous structure and high surface area of the terpolymer@ZIF-8@BA as compared to reported MOF-based materials in glycoproteomics (Table 1).

Table 1 Literature survey table for functionalized materials used for enrichment of low-abundance glycopeptides/proteins

Reproducibility

Batch to batch reproducibility is necessary element to determine the feasibility of designed material. Three batches of terpolymer@ZIF-8@BA are used to calculate reproducibility. The MS analysis is performed after enrichment and m/z values of seven N-linked glycopeptides derived from HRP are selected. The data is subjected to statistical analysis and relative standard deviation (RSD) is calculated (Table 2). Similar peak patterns at same intensity highlight the reproducible results obtained from MS analysis (Fig. S8a-c).

Table 2 Statistically represented batch to batch reproducibility data after enrichment with terpolymer@ZIF-8@BA. The mass to charge ratio is used to calculate the relative standard deviation

Small RSD values show data clustering around mean and thus confirm data precision as shown in Table 2.

Reusability

Cost-effective nature increases the worth of given affinity material as quality analysis requires routine enrichment experiments of target molecules. The reusability of terplymer@ZIF-8@BA is tested by applying HRP digest. After first enrichment, composite is washed with TA-30 to remove any bound peptide and nullify the memory affect. It is followed by washing with de-ionized water to neutralize the applied acidic environment. The regenerated material is equilibrated with loading buffer and subjected to second enrichment cycle. The MS analysis is performed for eluted fraction at the end of each cycle. MS spectrum of forth enrichment cycle shows that terpolymer@ZIF-8@BA enrich N-linked glycopeptides (Fig. S9) without any loss. Thus, the regeneration ability of terpolymer@ZIF-8@BA strengthen its potential application in proteomics.

N-linked glycopeptides enrichment from serum

For the glycoproteome analysis of tryptic digest of real biological sample like serum, high abundance of non-glycosylated peptide complicate the very low-abundance glycosylated peptide during the enrichment process. Based on the high sensitivity and selectivity of terpolymer@ZIF-8@BA HILIC sorbent, we employed this novel HILIC sorbent to capture the glycopeptides from tiny volume (1 μL) of digested serum. Enriched glycopeptides were deglycosylated by PNGase-F before going to nano-LC-MS analysis. A total number of 318 glycopeptides from serum glycoproteins as serotransferrin, apolipoprotein A-I, apolipoprotein B-100, protein AMBP, complement C3, haptoglobin, lactotransferrin, and trypsin-1 are identified using bottom-up approach (Table S3). Role of these enriches glycoproteins in various cancer and their glycosylation has been reported as potential highlight of cancer related to these glycoproteins (Table S4). In hepatocellular carcinomas, serotransferrin glycosylation pattern indicates the increase in highly branched fucosylated glycans [34]. Similarly, Apo-B 100 and compliment C13 has been reported for breast cancer. Elevated levels of Apo-B 100 are a risk factor for intraocular metastasis in breast cancer patients [35]. Lactotransferin, iron binding human milk glycoprotein, is reported to exhibit key role in the protection of breast-fed infants against gastrointestinal tract infections. The importance of these glycoproteins and their potential role in cancer emphasize the need to selective enrichment platform. Therefore, the terplymer@ZIF-8@BA demonstrated remarkable enrichment capability for glycopeptides present in a complex biological matrix. The fabricated terpolymer ZIF8 holds no significant disadvantage that can affect its performance as selective affinity materials toward glycopeptides.

Conclusion

Terpolymer@ZIF-8 provides hydrophilicity which is suitable for desired surface modification. The present study provides synthetic route for hydrophilic terpolymer@ZIF-8 MOF-based composite with flexible surface chemistry. High BET surface area further supports the idea of modification of MOF composite. To exemplify its hydrophilicity, the MOF composite is functionalized with boronic acid applied to enrich N-linked glycopeptides from different standard protein samples. Mono-glycosylated IgG and multi-glycosylated HRP are successfully enriched under optimized enrichment conditions. Terpolymer@ZIF-8@BA exhibits enhanced enrichment performance for glycoproteins. Its batch to batch reproducibility and regenerative nature further complies its worth for desired target less abundant molecules particularly in proteomics.