Introduction

The genus Gardenia (Rubiaceae) comprises approximately 250 species in the world[1], most of which are used as ornamental plants, but Gardenia jasminoides Ellis, is not just for viewing, but also used as a kind of herbal medicine and edible food in China, Japan, and South Korea. Its dried and ripe fruit is firstly recorded in Shen Nong's Herbal Classic (A.D. 25-220) and has been used to clear away heat and toxic materials for thousands of years in Traditional Chinese Medicine (TCM) [2]. The fruit is traditionally called “Zhizi” or “Shanzhizi” in China and is recorded in a different version of China Pharmacopoeia, as well as Japanese Pharmacopoeia and Korean Pharmacopoeia [2,3,4]. Zhizi (the fruits of G. jasminoides Ellis) and Shuizhizi (the fruits of G. jasminoides var. radicans) are the two most cultivated species. However, by the theory of TCM, Zhizi is used for herb medicine, Shuizhizi is never for medicinal use and can only be used for dyeing. Until now, Shuizhizi is regarded as the counterfeit of Zhizhi [5,6,7]. Traditionally, Zhizi and Shuizhizi have distinct differences in appearance, and the former has a small and round appearance and is yellowish red but the latter is longer and redder [7, 8]. The China Pharmacopoeia (ChP) stipulates that the length of Zhizi is 1.5–3.5 cm [9], and the length of Shuizhizi is usually between 3.0 and 7.0 cm [10]. Due to the increasing demand for Zhizi, cultivars of G. jasminoides were selected by growers with large fruits and high productivity for higher economic benefits. In commercial Zhizi products, there are two mainstream cultivars in the markets (Fig. 1). One is small, round, and thin-skinned, more in line with the traditional description (ZZC1). The other is larger, redder as well as high-yield than the former (ZZC2) and it is intermediate in size and color between Zhizi and Shuizhizi [11, 12].

Fig. 1
figure 1

A A variety of G. jasminoides, Shuizhizi (the fruit of G. jasminoides var.radians). B ZZC2, C ZZC1 Two kinds of cultivars of G. jasminoides in China.

Previous phytochemical investigations of the fruit of G. jasminoides showed the presence of iridoid glycosides [13, 14], pigments [15], organic acids [16], and flavonoids [17], among which iridoids and pigments are considered as the main active and characteristic components. Geniposide is the representative iridoid component [18] and the percentage of geniposide in the fruits of G. jasminoides should be more than 1.8% by ChP. The current mandatory standard cannot distinguish Zhizi from Shuizhizi, while the literature reports that the content of geniposide in Zhizi was even higher than that in Shuizhizi [19, 20]. A multitude of classical analytical methods has been developed in the past to discriminate Zhizi and Shuizhizi [19,20,21,22,23]. The main analytical method was the combination of HPLC/UPLC fingerprint, multi-component determination, and multivariate statistical analysis. 22 common peaks of Zhizi and Shuizhizi were calibrated at most, crocin I, genipin-1-β-d-gentiobioside, and other unknown compositions were identified as differential marker compositions. Such approaches, however, have failed to address the problem that the content of chemical components may be affected by post-harvest processing [24, 25]. Compounds calibrated in fingerprints were limited and the lack of reference cannot be identified. Comparative analysis of holistic phytochemicals profiling in traditional herbal medicine has a high potential for the determination of quality control of active ingredients as well as their different medicinal values. Thus, it is necessary here to comprehensively obtain the phytochemicals difference information, to distinguish and identify these three fruits, namely two kinds of cultivars of Zhizi widely distributed in China and Shuizhizi (the fruits of G. jasminoides var. radicans).

To fill this gap, samples of the fruits of G. jasminoides were collected from different districts in China with the same post-harvest processing at first. Secondly, the integrated strategy for comprehensive profiling of phytochemicals of these samples on basis of UHPLC/ESI-QTOFMS was developed. In addition, a comprehensive collection of chromatographic and mass spectral data for 80 metabolites was assembled in the course of this study. Finally, the strategy was feasibly applied to compare profiles of phytochemicals of two cultivars and one variety of G. jasminoides using non-targeted and targeted data analysis approaches.

Materials and methods

Samples used in this study

21 batches of different landraces of G. jasminoides were collected in different provinces of China from October to December 2020. 18 batches of the samples were identified as Gardenia jasminoides Ellis and three of them were identified as G. jasminoides var. radicans by Prof. Shilin Hu from Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, where the voucher specimen was deposited. The detailed samples information are shown in Table 1.

Table 1 Sample list of G. jasminoides collected in different provinces of China

Chemicals and reagents

MS grade acetonitrile and methanol were obtained from Fisher Scientific (Fair Lawn, NJ, USA); Formic acid (HPLC grade) and water were purchased from Tedia (Fairfield, OH, USA) and Watsons group (Guangzhou, China), respectively.

20 standards were purchased from different manufacturers. Crocin-I and crocin-II were purchased from Shanghai Yuanye Bio-Technology Co., Ltd. (Shanghai, China); crocin III, geniposide gentiobioside, gardenoside, deacetyl asperulosidic acid methyl ester, geniposidic acid, and scandoside methyl ester were purchased from Sichuan Weiqi Biotechnology Co., Ltd. (Chengdu, China); shanzhiside, gardoside, deacetyl asperulosidic acid, neochlorogenic acid, cryptochlorogenic acid, isochlorogenic acid A, isochlorogenic acid B, and cryptochlorogenic acid C were purchased from Chengdu Chroma-Biotech Co., Ltd. (Chengdu, China); rutin was purchased from Chengdu Must Biotechnology Co., Ltd. (Chengdu, China); crocin VI was purchased from Chengdu prifa Technology Development Co., Ltd. (Chengdu, China); geniposide and chlorogenic acid were purchased from Chengdu Herbpurty Co., Ltd. (Chengdu, China). The relative contents in the percentage of all the reference substances were over 98% calculated with area normalization method by HPLC.

Method for metabolite analyses of G. jasminoides

Preparation of sample solutions

The 21 batches of samples were pulverized into powder and sieved through a 50-mesh sieve before extraction. Then 200 mg of powder of each sample were weighed accurately and extracted with 20 mL of 70% methanol (v/v) for 30 min in an ultrasonic water bath (40 kHz, 250 W). After extraction, the solution was filtered through a 0.22 µm membrane filter and stored at 4 °C before use.

The quality control (QC) samples were obtained by mixing 21 batches of sample solutions and injected once before (after blank), in the middle, and at last to monitor the relative stability of the instrument.

Metabolite's profiling of G. jasminoides using UPLC-QTOF-MS/MS

Metabolite's profile was performed on an ACQUITY UPLC I-Class/Xevo G2-S QTOF system (Waters, Milford, MA, USA). A BEH C18 column (2.1 × 100 mm, 1.7 μm) equipped with an online filter at 25 °C was used to separate the compounds and eluted by a binary mobile phase of 0.1% formic acid in H2O v/v (A) and acetonitrile (B) at a flow rate of 0.4 mL/min with a gradient elution: 0–4 min, 5%-12% (B); 4–10 min, 12%-21% (B); 10–14 min, 25%–33% (B); 17–19 min, 33% (B); 19–22 min, 33%–55% (B); 22–26 min, 90% (B). A 2–μL aliquot was injected for analysis.

Mass spectral data acquisition was ESI source in negative ion mode and the parameters were set as follows: capillary voltage, 3.0 kV; sampling cone voltage, 20 V; source offset voltage, 80 V; source temperature, 100 °C; desolvation temperature, 400 °C; cone gas flow, 50 L·h−1; and desolvation gas flow, 700 L·h−1. The mass analyzer scanned over a mass range of m/z 100–1600 under low energy of 6 V, and a high energy ramp of 20–100 V was set to acquire the MS2 data. Data calibration was performed using an external reference (LockSprayTM) by constant infusion of 1.0 μg/mL of leucine enkephalin at a flow rate of 10 μL/min and by reference to the ion m/z 554.2615. Data acquisition was controlled by MassLynx V4.1 software (Waters Corporation, Milford, USA).

Data pretreatment and multivariate statistical analysis

The MS raw data of samples and QC samples were collected by the UPLC-QTOF-MS/MS and imported into the Progenesis QI software (Waters, Milford, USA). Adduct ions with the different responses were selected by the patterns of [M–H], [M–H + HCOOH], [M–H2O-H], [2 M–H], and [2 M–H + HCOOH]. After deconvolution and normalization, the Progenesis QI software can produce a data matrix composed of retention time, normalized peak area, and other information. The data including peak number, sample name, and normalized peak areas were exported to the SIMCA-P 14.0 software (Umetrics, Umea, Sweden) for multivariate analysis. Principle component analysis (PCA) and partial least-squares-discriminant analysis (PLS-DA) were carried out to obtain the tendency of different samples to group, after mean centering and unit variance scaling. According to the variable influence on the projection (VIP > 1) and the significance threshold of the two-tailed t test (P < 0.05), the difference of different metabolites was determined.

Results and discussion

Optimization of chromatographic and mass spectrometric conditions

To acquire phytochemicals information completely, chromatographic separation was optimized including chromatographic column, mobile phase ratio, and column temperature. By adjusting the proportion of the mobile phase, comparatively, the BEH C18 column showed a more desirable resolution than CSH C18 and HSS T3 column. The column temperature at 25 °C was more suitable than 30 °C, 35 °C and the flow rate at 0.4 ml/min were showed a better separation effect of iridoids and pigments.

Both positive and negative ionization modes were investigated. However, in the negative mode, mass spectrum peaks showed good response and the quasi-molecular and production ions were stable and reproducible. Therefore, the negative ionization mode was chosen for subsequent experiments.

Analysis of metabolomic profiling

After being processed by Progenesis QI software, as a result, a total of 8187 features were characterized from samples in negative ionization mode. The data matrix of the metabolite features was simply processed and exported into the SIMCA-P 14.0 software for chemometric analysis (principal component analysis, PCA and partial least square discriminant analysis, PLS-DA). As can be seen from Fig. 2A, different colors represented different geographical origins of 21 batches of the sample (taking origin as Class ID). The distribution of QC samples was concentrated, indicating the relative stability of the instrument during the test sample. The PCA analysis results suggested the distribution dispersion of samples in different producing areas was large. That is, the correlation between the sample and its origin was not significant. Then, the data of different varieties and cultivars of G. jasminoides were also analyzed by PCA and PLS-DA. As can be seen in Fig. 2B, three defined groups have a certain tendency of separation indicating there was some correlation between the chemical profile and its varieties or cultivars. Samples of two cultivars of Zhizi were divided into two groups segregated from each other, while there was a certain intersection between ZZC1 and Shuizhizi. However, when it comes to PLS-DA analysis, it can be seen in Fig. 2C that the three groups of G. jasminoides can be significantly separated.

Fig. 2
figure 2

Non-targeted analysis of metabolite profiles (8187 molecular features detected in negtive ion mode). A PCA score scatter plot of data from different geographical origins of G. jasminoides (PC1:18.2% PC2:15.5%). B PCA score scatter plot of data from different varieties or cultivars of G. jasminoides (PC:18.2% PC2:15.5%). C PLS-DA score scatter plot for 21 batches of G. jasminoides (R2X = 0.534 R2Y = 0.988 Q2(cum) = 0.79). B, C group 1 representing ZZC1, group 2 representing ZZC2, group 3 representing Shuizhizi

Metabolites profiling by UPLC-QTOF-MS/MS

The base peak chromatogram of three groups of G. jasminoides was shown in Fig. 3. Data acquisition was controlled by Mass Lynx V4.1 software (Waters Corporation, Milford, USA). Compared with the accurate high-resolution mass measurements of adductions and characteristic fragmentation ions of the authentic substances or the literature data, the molecular structures were identified or tentatively deduced referring to MS splitting decomposition law. A total of 80 compounds were identified or tentatively characterized. Among them, 20 compounds were identified unambiguously using reference standards and the other 60 compounds were tentatively deduced by analyzing their MS/MS spectra.

Fig. 3
figure 3

The base peak chromatogram (BPC) in negative ion mode of three groups G. jasminoides. A ZZC1, B ZZC2, C Shuizhizi

Iridoids

Iridoids were regarded as the main active constituent in G. jasminoides, with the skeleton of a kind of monoterpenoids based on a cyclopentane-[C]-pyran. A total of 28 iridoids were identified or tentatively deduced in this study (Fig. 4).

Fig. 4
figure 4

The structure of iridoids has been identified or tentatively deduced

I11 was a representative iridoid in G. jasminoides, displayed an [M–H + HCOOH] and [M–H] ion at m/z 433.1348 (C18H25O12) and m/z 387.1298 (C17H24O10). The base peak at m/z 225.0762 and 207.0659 was generated by the cleavage of glycosidic bond and further losing H2O (18 Da). Then, the fragment ions of m/z 147.0440, m/z 123.0445, and m/z 101.0238 were obtained by the losses of carboxymethyl moiety, Retro-Diels Alder (RDA) reaction, and the cleavage of C4-5 and C1-O2 bond, respectively, coming from the A1, A2, and A3 segments, as can be seen in Fig. 5A. By comparing with the reference standard, I11 was identified as geniposide. Besides, in contrast to the reference standard, I5 and I8 were identified as deacetyl asperulosidic acid methyl ester and scandoside methyl ester. The differences between the structures and geniposide were that the C7 connected with α-OH, β-OH, respectively. Thus, I5 and I8 exhibited an [M–H + HCOOH] and [M–H] ion at m/z 449.1298 and m/z 403.1240 in the full mass spectrum. Similarly, as I11, the fragment ions at m/z 241.0712 and m/z 223.0602 were also due to the cleavage of glycosidic bond and further losing H2O (18 Da). The fragment ions at m/z 165.0555, m/z 139.0413, and m/z 101.0239 were generated from the A1, A2, and A3 segments. Different from I11, fragment ion displayed at m/z 193.0492 of I5 and I8 was triggered by the further loss of hydroxymethy after losing glucosyl residues and H2O.

Fig. 5
figure 5

A The MS2 spectra and plausible fragmentation pathway of geniposide (I11). B The MS2 spectra and plausible fragmentation pathways of gardoside (I2). C The MS2 spectra and plausible fragmentation pathway of chlorogenic acid (Q2). D The MS2 spectra and plausible fragmentation pathways of rutin (F2)

I4 was confirmed by matching with the reference substance and identified as geniposide acid. The difference between I4 and I11 (geniposide) was the carboxyl which was connected in the C4 position. Therefore, I4 showed an [M–H] ion at m/z 373.1135. In the MS2 spectra, fragment ion m/z 211.0604 was owing to the cleavage of the glucosidic bond. And after breaking the glycoside bond, it did not continue to break C1 hydroxyl groups as I11, but further lost the hydroxyl group and formed an ion at m/z 149.0599. Surely, it can also generate a fragment ion at m/z 123.0442 by the A3 segment. By comparison with the reference substance, I10 was identified as genipin-1-β-gentiobioside whose structure is one more glucosyl residues than geniposide. Thus, there were ions at m/z 595.1882 and m/z 549.1824 attributed to [M–H + HCOOH] and [M–H], respectively. The fragment ion at m/z 225.0767 was caused by the losses of two molecules of glucose (324 Da). Other characteristic fragment ions m/z 123.0446 and m/z 101.0238 were got in the same way as I11.

I25, I26, I28 showed the [M–H] ions at m/z 593.1884 (C28H33O14), m/z 533.1660 (C26H29O12), and m/z 593.1859 (C28H33O14) in the full mass spectra, respectively. I25 and I26 produced the diagnosed fragment ions at m/z 225.0756, m/z 123.0443, and m/z 101.0237 which indicated the existence of the genipin moiety. In the MS2 spectra, it has shown the fragment ions at m/z 223.0603 and m/z 205.0500 which were corresponded to characteristic ions of sinapoyl (224 Da) formed by [sinapoyl-H] and [sinapoyl-H2O-H]. The fragment ions at m/z 163.0390 and m/z 145.0283 were coumaroyl (164 Da) group and presented the patterns of [coumaroyl-H] and [coumaroyl-H2O-H]. Additionally, the fragment ions at m/z 367.1034 and m/z 307.0828 were caused by the cleavage of glycosidic bonds and generated the fragment ion formed by glycoside connected with sinapoyl or coumaroy. Thus, I25 and I26 were tentatively assigned as 6′-O-trans-sinapoylgeniposide [26] and 6′-O-trans-p-coumaroylgeniposide [27]. The characteristic fragment ions of I28 and I25 were almost the same but I28 showed another fragment ion at m/z 413.1238 which was generated by breaking the glycoside bond first. According to the literature reported, I28 was tentatively deduced to be 10-O-trans-sinapoylgeniposide [28].

The diagnostic ions of genipin at m/z 225.0756, m/z 123.0445, and m/z 101.0241 were also found in the MS/MS data of I17, I21, I22, I23, I24, and I27. Besides, the characteristic fragment ions of caffeoyl (180 Da), coumaroyl (164 Da), feruloyl (194 Da), sinapoyl (224 Da), cinnamoyl (148 Da) groups (expressed in X) generated in the patterns of [X–H] and [X-H2O-H] were found in the MS/MS analysis, respectively. Moreover, I17, I21, I22, I23, I24, and I27 have all produced a diagnostic ion [X + gentiobiosyl-H2O-H] which was caused by the cleavage of the glycosidic bonds. Bearing in mind of all the iridoids isolated from G. jasminoides, I17, I23, I24, and I27 were tentatively deduced as 6ʺ-O-trans-caffeoylgenipin gentiobioside [29], 6ʺ-O-trans-sinapoylgenipin gentiobioside [30], 6ʺ-O-trans-feruloylgenipin gentiobioside [21], and 6ʺ-O-trans-cinnamoylgenipin gentiobioside [30]. Because of the same neutral loss to result in the same fragment ions, I21 and I22 were tentatively deduced as 6ʺ-O-trans-p-coumaroylgenpin gentiobioside or 4ʺ-O-trans-p-coumaroyl gentiobiosylgenipin [27, 30].

I2 was identified as gardoside by comparing with the reference standard, which was illustrated briefly in Fig. 5B. Different from the geninpside, the C8 position of I2 was linked with a double bond. The fragment ion at m/z 211.0613, m/z 193.0496, m/z 123.0439, m/z 167.0702, and m/z 149.0596 were found on the MS2 spectra of gardoside and they were the characteristic fragment ion of gardoside derivatives. The MS2 spectra of compounds I13, I15, I15, and I20 contain all the above-mentioned fragment ions, so they share the same skeleton of gardoside. Besides, the characteristic fragment ions of m/z 179.0350, m/z 161.0244, m/z 163.0401, m/z 145.0290, m/z 193.0506, m/z 175.0406, m/z 223.0612, and m/z 205.0506 produced in the patterns of [X–H] and [X-H2O-H] of caffeoyl (180 Da), coumaroyl (164 Da), feruloyl (194 Da), sinapoyl (224 Da) groups appeared in the spectra of I13, I15, I18, and I20, respectively. Furthermore, a specific fragment ion at m/z 307.0809 occurred in I13, indicating the presence of a glucose-bearing coumaroyl. Thus, I13 was tentatively assigned as 2′-O-trans-p-coumaroyl gardoside or 2′-O-cis-p-coumaroyl gardoside [31], and I15, I18, and I20 were deduced as 6′-O-trans-sinapoyl gardoside [27], 2′-O-trans-feruloyl gardoside, and 2′-O-trans-caffeoyl gardoside [31], respectively. Finally, the MS fragmentation of other iridoid compounds can be seen in Table 2.

Table 2 MS data for characterization of compounds by UPLC-Q-TOF/MS in the G. jasminoides

Pigments

G. jasminoides are an excellent source of natural pigments. These pigments are another important active ingredient both in saffron (Crocus sativus L.) and G. jasminoides and the structures of the pigments are in all-trans- and 13-cis-crocetin esterified with one or two glucose, gentibiose, or neapolitanose sugar moieties et al. Owing to the existence of cis and trans isomers, there were semblable MS data. By combining the literature data and the splitting rules of four reference standards, the structures of 19 pigments in G. jasminoides were identified or tentatively deduced (Fig. 6).

Fig. 6
figure 6

The structure of crocins has been identified or tentatively deduced

C1 was identified as crocetin-digentiobioside ester (crocin-1) by matching with the reference substance. Its parental molecular ions [M–H-HCOOH] and [M–H] were displayed at m/z 1021.3787 (C45H65O26) and m/z 975.3720 (C44H63O24). Due to the structure being composed of crocin and two molecules of gentiobioside, in the MS/MS spectrum, the characteristic fragment ions at m/z 651.2665 and m/z 327.1604 were produced by the losses of gentiobiose via cleavage by each side of the glycosidic bond. Then, ions at m/z 283.1704 and m/z 239.1802 were also obtained by the losses of CO2 (44 Da) on each side. Since another compound C10 which displayed semblable MS data with C1, it was tentatively assigned as 13-Z-Crocetin-digentiobioside esters [32]. Apart from the same molecular ions at m/z 1021.3793 and m/z 975.3726 and the characteristic fragment ions at m/z 651.2653, m/z 327.1597, and m/z 283.1694, compounds C4 and C6 showed another diagnostic ion at m/z 813.3183, which suggested the loss of a glucose moiety (162 Da) from m/z 975.3716 [M–H]. Therefore, in combination with the chromatographic behavior [32], C4 and C6 have tentatively been deduced as trans-crocetin (β-d-neapolitanosyl)-(β-d-glucosyl) ester and cis-crocetin (β-d-neapolitanosyl)-(β-d-glucosyl) ester.

C2 generated [M–H-HCOOH] and [M–H] ions at m/z 859.3254 (C39H55O21) and m/z 813.3189 (C38H39O20) in the full mass spectrum. The fragment ions of C2 at m/z 651.2665, m/z 489.2111, and m/z 327.1594 were generated by the cleavage of the glycosidic bond on both sides resulting in the loss of the glucose (162 Da) and gentibiose (324 Da), respectively. The diagnosed ions at m/z 283.1700 and m/z 239.1799 were similar to the C1 which were the characteristic fragment ions of crocin derivatives. Consequently, C2 was identified as crocin-2 (Crocetin gentiobioside monoglucoside ester), which was confirmed by comparing it with the reference substance. C17 and C18 exhibited the same MS data with C2, according to the literature data and the chromatographic behaviors [32] they were annotated to be 13-cis-crocetin-8-O-β-d-gentiobiosyl-8′-O-β-d-glucopyranoside (C17) and 13-cis-crocetin-8-O-β-d-glucopyranosyl-8′-O-β-d-gentiobioside (C18). C9, C14, C15, and C16 were showed the same parental molecular ions [M–H] at m/z 651.2669 under negative ionization mode. C14 to be crocin-3 (Crocetin-monogentiobiosdie ester) was confirmed by matching retention time and MS2 spectral data with an authentic standard of crocin-3 (Crocetin-monogentiobioside ester). Its characteristic fragment ions at m/z 327.1589, m/z 283.1693, and m/z 239.1794 were generated by the loss of gentibiose (324 Da) and further losses of CO2 (44 Da) on each side. With the same MS2 spectral data of C14, C15 and C16 were the isomers of C14 and they were tentatively assigned to be 13-cis-crocetin-8-O-β-d-gentiobioside and 13′-cis-crocetin-8-O-β-d-gentiobioside based on the chromatographic behavior [15]. Besides, the MS2 spectral data of C9 were similar to that of C15 and C16, except for a fragment ion at m/z 489.2111[M–H-162], as suggested that there was the cleavage of a glycosidic bond to lose glucose (162 Da) in the structure of C9. Thus, C9 [33] were deduced as crocetin-diglucoside ester.

C7, C8, and C13 exhibited the same molecular ion at m/z 987.3515 [M–H]. According to the literature, three crocin compounds isolated from G. jasminoides named neocrocin B–D all showed the molecular ion at m/z 987.3498. The structures of these compounds were characterized by the gentibiose and caffeoyl + C7H11O5 (Fig. 6B) substituent on each side of the parent nucleus crocetin as shown in Fig. 6. Therefore, the fragment ions at m/z 825.3206 and m/z 651.2662 were caused by the loss of glucose or the loss of B moiety from the molecular ion. The fragment ion at m/z 501.2128 was caused by the loss of gentiobiose and caffeoyl at the same time. Other fragment ions at m/z 327.1595, m/z 283.1701, and m/z 239.1805 were the diagnostic ions of crocetin. Among these three compounds, the MS2 spectral data of C8 and C13 were the same. Therefore, C8 and C13 were deduced as cis- and trans-isomers. Simultaneously, considering the appearance time at the C18 column in the literature [15], C7, C8, and C13 were tentatively deduced as neocrocin C, neocrocin B, and neocrocin D, respectively.

C5 exhibited an [M–H] ion at m/z 1181.4347 in the full mass spectrum. In the MS/MS analysis, there were characteristic fragment ions at m/z 223.0603 and m/z 205.0499 which suggested the substituent group of sinapoyl. Besides, the fragment ions at m/z 857.3232 and m/z 529.1556 were shown by the loss of gentiobiose and the gentiobiose with sinapoyl [gentiobiosyl + sinapoyl-H]. Fragment ions at m/z 651.2656 and m/z 327.1593 were the common fragment ions of crocin compounds. Referring to the crocin compounds isolated from G. jasminoides previously, C5 was deduced as Neocrocin G. The fragment ion at m/z 651.2656 was supposed to be caused by the substituent group of D in Fig. 6. C12 produced an [M–H] ion at m/z 857.3237 and generated the same characteristic fragment ions m/z 651.2654, m/z 327.1595, m/z 529.1556, and m/z 223.0603 with C5 in the MS/MS spectrum. Going through the pigments isolated from G. jasminoides previously [15], C12 was annotated as Neocrocin F.

C19 found an [M–H + HCOOH] and [M–H] ions at m/z 833.3961 and m/z 787.3958 in the full mass spectrum. The fragment ions at m/z 463.2833 and m/z 419.2945 were suggested by the loss of one gentiobiose and one CO2 (44 Da) successively. In the MS/MS analysis, there have not characteristic ions similar to other pigments compounds. So, referring to the literature reported previously [34], C19 was tentatively assigned as gardecin.

Organic acids

The organic acids mostly characterized in G. jasminoides were quinic acids. Quinic acids are commonly characteristic with 1-hydroxy-hexahydro-gallic acid as the basic parent nucleus, and its C3, C4, C5 positions are often connected with caffeoyl, sinapoyl, and so on. A total of 13 organic acids were identified or tentatively characterized from G. jasminoides, 6 of them were identified by comparing with the reference standards.

Compounds Q1 and Q2 were the isomeric neochlorogenic acid and chlorogenic acid, whereas Q3 was assigned as cryptochlorogenic acid. All of these compounds showed the expected spectral data and matched those obtained for reference standards. The structures of these compounds belong to the same type of mono-caffeoylquinic (MCQA) with the only difference of the connected position of caffeoyl connected with quinic acid. Q1, Q2, and Q3 showed the same parental molecular ion [M–H] at m/z 353.0870 (C16H17O9). The characteristic fragment ions at m/z 191.0554 and m/z 173.0446 were generated by the losses of the caffeoyl residue and H2O continually. Simultaneously, fragment ions at the m/z 179.0342 and m/z 135.0439 were caused by the caffeoyl residue ionized by the patterns of [caffeic acid-H] and [caffeic acid-H-CO2], represented in Fig. 5C below.

Compound Q4, Q5, and Q6 showed the same [M–H] ion at m/z 515.1186 (C25H23O12). By comparing with the reference standards, these compounds were identified as isochlorogenic acid B (3,4-dicaffeoylquinic acid), isochlorogenic acid A (3,5-dicaffeoyl-quinic acid), and isochlorogenic acid C (4,5-dicaffeoylquinic acid). Due to the two molecules of caffeoyl attached to two positions of quinic acid, Q4, Q5, and Q6 belonged to dicaffeoylquinic acids (DCQA). Thus, there was a diagnostic fragment ion at m/z 353 which was produced by the loss of a caffeoyl residue, and other characteristic ions at m/z 191.0554, m/z 179.0343, m/z 173.0446, and m/z 135.0448 were produced in the same way as that in Q1.

Compound Q8 and Q9 were tentatively deduced as isomers which gave the same [M–H]- ion at m/z 559.1450 (C27H27O13). In their MS2 spectra, there were characteristic fragment ions for caffeoyl residue and sinapoyl residue at m/z 179.0343, m/z 161.0232, m/z 223.0612, and m/z 205.0506. Besides, there was a fragment ion at m/z 173.0447 so that Q8 and Q9 can be assigned to quinic acid derivatives. Referring to the reported quinic acid derivatives [28, 35] isolated from G. jasminoides, Q8 and Q9 were tentatively deduced as 4-O-sinapoyl-5-O-caffeoylquinic acid or 3-O-caffeoyl-4-sinapoylquinic acid. Consequently, the fragment ion at m/z 397.1126 was caused by the loss of caffeoyl residue from the molecule. Besides, there was a strong diagnostic fragment ion at m/z 335.0776 only can found in Q8 which suggested the base peak was produced by losing sinapoyl residue directly from the molecule. Finally, Q8 and Q9 were deduced as 4-O-sinapoyl-5-O-caffeoylquinic acid and 3-O-caffeoyl-4-sinapoylquinic acid, respectively.

Q7 exhibited an [M–H] ion at m/z 659.1613 (C31H31O16) in the full mass spectrum. In the MS/MS spectrum, there were characteristic fragment ions at m/z 191.0560, m/z 179.0345, m/z 173.0453, and m/z 135.0442 which indicated the structure of quinic acid derivatives. The fragment ions at m/z 497.1302 and m/z 335.0800 were created by the losses of two molecules of caffeoyl residue successively from the molecule. According to the reported quinic acid derivatives which isolated from G. jasminoides, Q7 was tentatively assigned as 3,4-dicaffeoyl-5-(3-hydroxy-3-methylglutaroyl) quinic acid [36] or 3,5-di-O-caffeoyl-4-O-(3-hydroxy-3-methyl) glutaroylquinic acid [37].

Flavonoids

Flavonoids were mostly reported from the flowers and leaves of G. jasminoides. In this paper, combined with literature data and the mass spectrometric data of reference standards, a total of five flavonoids were identified or tentatively deduced. Among them, one structure was confirmed by the comparison to the MS data of the reference substance.

F2 was identified as rutin by comparison with the retention time and MS data of reference standard. In negative ion mode, there was a parental molecular ion at m/z 609.1463 ([M–H]). A series of fragment ions appeared in the MS/MS spectrum and were illustrated briefly in Fig. 5D. F1 and F2 were isomers displaying the same parental ion. Based on the same characteristic fragment ions with F2, combining with the literature, compound F1 was deduced as neoisorutin [38], which has not been reported in G. jasminoides.

F4 and F5 both showed an [M–H] ion at m/z 593.1508 in the full mass spectrum. There were diagnostic fragment ions at m/z 285.0392 (m/z 284.0316) and m/z 255.0397 in their MS2 spectra which suggested the compounds were kaempferol derivatives. Additionally, referring to the molecular weight, these compounds were supposed to be flavonoid O-diglycosides. However, F4 displayed a fragment ion at m/z 284.0316 with a higher abundance than m/z 285.0392, and F5 exhibited a strong signal at m/z 285.0387. According to the literature previously [39], for flavonoid O-diglycosides, the relative intensity and specific fragmentation patterns were determined by the sequence of saccharidic parts and the type of sugar residue. In these compounds, compared with the 1 → 6 linkage of saccharidic parts, 1 → 2 linkages of saccharidic parts are beneficial to the formation of radical aglycone ion [Y0-H]· (aglycone expressed in Y0). Thus, F4 and F5 were tentatively deduced as nicotiflorin [40] and kaempferol 3-O-[2-O-(β-d-glucosyl)-α-L-rhamnoside, respectively, by comparing the reported flavonoids isolated from G. jasminoides.

F3 displayed an [M–H] ion at m/z 463.0887. The signals at m/z 300.0276, m/z 271.0247, m/z 255.0305, and m/z 151.0024 in the MS2 spectra were attributable to the quercetin aglycone. The characteristic fragment ion at m/z 300.0276 was suggested to the patterns of [M-Glc-H]−·. With the literature reported previously [39], for flavonoid mono-O-glycosides, the glycosylation position of flavanol 3-O-glycosides was more likely to generate the radical aglycone ion (300.0276 [M-Glc-H]−·) than flavanol 7-O-glycosides. Therefore, F3 was tentatively annotated as isoquercetin (quercetin3-O-β-D-glucopyranoside) [26].

Others

In addition to the above-mentioned compounds, there reported monoterpenes, triterpenes, glycosides, and lignans in G. jasminoides. By similar approaches, miscellaneous compounds were tentatively deduced by the fragment ions shown in Table 2.

Analysis of metabolomic profiling based on the identified or tentatively deduced compounds

To further understand the material differences between the three groups of G. jasminoides, 80 compounds were identified or tentatively to avoid the repeatable identities from the software mechanically and can ensure the structure of the biomarkers. The data of corresponding compounds were imported into the SIMCA14.0 from the Progenesis QI software for partial least square discriminant analysis (PLS-DA). As shown in Fig. 7A, different color spots were represented different varieties and cultivars of G. jasminoides. When the PLS-DA analysis was carried out, it can be seen that the three groups have a certain degree of dispersion and can be significantly separated, which was consistent with the results of variable analysis of 8227 metabolite features characterized by QI software. Therefore, it can be explained that the metabolites in grouping G. jasminoides were different, and the differences showed a tight relation with the botanical classification and morphological characters.

Fig. 7
figure 7

Targeted analysis of metabolite profiles (metabolite profiles were obtained from 80 compounds in negative ion modes). A PLS-DA score scatter plot for G. jasminoides samples (R2X = 0.461 R2Y = 0.773 Q2(cum) = 0.626). B VIP plot. C The relative content of 19 different components. Group 1 represents ZZC1, group 2 represents ZZC2, group 3 represents Shuizhizi. I1: Deacetyl asperulosidic acid, I2: Gardoside, I3: Shanzhiside, I9: Mussaenosidic acid, I11: 7-Deoxygardoside, I15: 6′-O-trans-sinapoyl gardoside, I16: 6′-O-trans-p-coumaroylgeniposidic acid, I20: 2-O-trans-3-caffeoylgardoside, I25: 6′-O-trans-sinapoylgeniposide;C10: Crocetin-diglucoside ester, Q2: Chlorogenic acid, 1: Jasminoside F, 6: Jasminoside R, 7: 6′-O-trans-sinapoyljasminoside B/L, 8: Jasminoside T, 9: 3-Methyl-l-erythritol-4-O-(6-O-trans-sinapoyl)-β-d-glucopyranoside, 10: 2-Methyl-l-erythritol-1-O-(6-O-trans-sinapoyl)-β-d-glucopyranoside, 11: 2,6-Di-O-E-sinapoyl-glucopyranose, 13: Lyoniresinol-9-O-β-d-glucopyranoside

For the purpose of finding the most contributing variables of the three groups G. jasminoides, PLS-DA, and the VIP plot were served. Based on the VIP value graph (Fig. 7B) obtained by PLS-DA analysis, those variables with VIP > 1.0 are considered as relevant to group discrimination and found 36 metabolites in this range. Additionally, aiming at avoiding false-positive results, parameters were set with max fold change > 1.5 and Anova P < 0.05.

As a result, 19 potential biomarkers were discovered suitable for simultaneous differentiation of the 3 groups of G. jasminoides, including 9 iridoids compounds, 1 pigment, 1 organic acid, 4 monoterpenoids, 3 glycosides and,1 lignan component. The contents of these 19 metabolites were expressed by the normalized peak area and can be seen in Fig. 7C, displaying the specific differences between 3 groups.

Conclusions

No matter as an edible food providing numerous nutrients or as a medical plant exerting special function, it is inseparable from the plant's metabolites. Moreover, as an herbal medicine, phytochemical composition was also found to be significantly associated with the post-harvest processing. According to the Chinese Pharmacopoeia, the post-harvest processing of Gardenia Frutucs is dry after boiling, dry after steaming and direct dry in the sun. However, the time of specific methods are not clear. Literature have shown that the active constituents in Gardenia Frutucs can be better retained after boiling or steaming and then drying due to the mechanism of killing enzyme and protecting glycoside [41,42,43,44]. It has been reported the content of geniposide in Gardenia Frutucs was 2.38% and 5.82%, respectively, after direct drying in the sun and dry after boiling for 15 min [42]. As the time of different post-harvest processing changes, the composition was generally increased and then decreased. Researchers reported changes in the content of geniposide and crocin-I by drying after boiled within 15 min. The content of geniposide was increased from 2.69% at 1 min to 3.86% at 13 min and finally decreased to 3.80% at 15 min. The content of crocin I was increased from 0.52% at 1 min to 0.87% at 8 min and then decreased to 0.70% at 13 min [43]. Studies also found that the low temperature drying has little effect on the content of active the constituents [41]. Thus, to avoid the influence of post-harvest processing on chemical composition, it is necessary to use the same post-harvest processing to compare their profiling of phytochemicals in the fruits of G. jasminoides.

Based on reversed-phase UHPLC/ESI-QTOFMS, an integrated analytical strategy for comprehensive profiling of phytochemicals from fruits of G. jasminoides was proposed. An accurate data pretreatment was provided by Progenesis QI, so that the metabolites features was thoroughly obtained conveniently. In addition, based on literature searches and analyses of reference compounds, the main secondary metabolites such as iridoids, pigments, organic acids and flavonoids were analyzed as primary target of secondary metabolites. A total of 80 metabolites were identified or tentatively deduced in this study, including 28 iridoids, 19 pigments, 11 organic acids, 5 flavonoids and 17 other compounds. The 3 kinds of fruits all contained these 80 compounds, but their contents and proportional relation were different in different populations. PCA and PLS-DA analysis of the holistic metabolites features and the 80 components showed that there was no close relationship between these components and the producing place, but they grouped according to their cultivar or variety. Further analysis results in 19 components were discovered suitable for simultaneous differentiation of the 3 populations of fruits of G. jasminoides.

Combined with non-targeted and targeted data analysis strategies, the proposed analytical strategy was effectively applied to discern differences in the profiles of secondary metabolites in the fruits of two cultivars and one variety of G. jasminoides. Unlike the common analytical methods which focus only on limited components, this attempt offers a thoroughly comprehensive view on the complex metabolic profile of G. jasminoides fruits and describes a straightforward protocol to catalogue their metabolic diversity contingent on genetic and environmental factors.