Introduction

Traditional Chinese medicine (TCM) is widely used in China and other Asian countries, for the prevention, diagnosis, treatment of disease or to regulate human function, and has gradually become an effective supplement therapy in the West. TCM is a unique medical system with the significant characteristic of pursuing an overall therapeutic effect with a multi-target treatment. It consists of multiple major or minor constituents that can regulate the balance and homeostasis of the body in a holistic fashion. Quantitative analysis with characteristic constituents is recognized as the prevalent method for quality control of TCM. Generally, single or myriad available constituents contained in the TCM were selected as quality control markers (QC-markers) for quantitative determination, which were responsible for overall pharmacological efficacy. However, these markers may not represent the characteristic or bioactivity of the specific TCM. Therefore, an effective screening strategy for QC-markers that can truly reflect the intrinsic properties of TCM has run into the bottleneck of TCM’s modernization, and has become an urgent issue. Since TCM is based on “holism” philosophy, it is philosophically conceivable that metabolomics techniques can provide important information relevant to TCM [1]. Metabolomics has employed global profiling methods for the comprehensive analysis of altered metabolites, providing insights into the global state of the entire metabolic process, which coincides well with the integrity and systemic features of TCMs.

In the present research, a pilot study was performed using four closely related Citrus TCMs (e.g., Zhishi, Zhiqiao, Qingpi, and Chenpi) for a performance evaluation as well as a rational screening strategy for QC-markers of other TCMs. Citrus plants, which are herbs of homology of medicine and food, are of great interest because they contain large amounts of dietary phenoloids possessing important therapeutic properties for human health, and Citrus fruits and their juices are consumed in large quantities around the world [2, 3]. The contents of bioactive phenoloid constituents and profiles in Citrus juices vary greatly with the type of fruit. The four Citrus TCMs mentioned above are used as drugs for regulating Qi flow and for strengthening the spleen and stomach since ancient times, while their actual clinical applications in China are varied but their QC-markers are rough and similar. Thus, it is important to screen for the representative QC-markers of Citrus TCMs both characteristically and bioactively in vitro and in vivo to interpret the material basis for their pharmacological effects. Here, a reliable screening strategy for QC-markers was proposed with the following steps: (a) find chemotaxonomic markers characteristic of the specific TCM using a global plant metabolomics method (completed in our previous research); (b) targeted prototype drug metabolomics applied to the improved quantitative plasma pharmacochemistry analysis; (c) untargeted metabolomics applied to unknown metabolites’ semi-quantification and identification (b and c are the pseudotargeted metabolomics methodology); (d) integrating the pseudotargeted metabolomics data and text mining to elucidate the rational metabolic pathway and uncover effective QC-markers.

Pseudotargeted metabolomics is a novel strategy integrating the advantages of both targeted and untargeted methods. For targeted metabolite quantification, multiple reaction monitoring (MRM) performed on a triple quadrupole tandem mass spectrometry (QqQ-MS) is recognized as the gold standard, attributed to its wide linear dynamic range, high sensitivity, and high repeatability, and has been widely applied in targeted metabolomics [4, 5]. In the realm of plasma pharmacochemistry, only drugs being absorbed into the blood are considered and will probably become bioactive constituents. We put forward an improved quantitative plasma pharmacochemistry methodology using a QqQ instrument to determine and compare targeted prototype constituents, which have been validated as characteristic to specific TCMs. However, targeted metabolomics alone are insufficient to discover the in vivo process and biotransformation of complex TCMs, because natural products are often pro-drugs which must undergo metabolic conversion before being active. It is worth noting that the comprehensive identification of unknown metabolites in complex biological matrices is the vital obstacle. Untargeted metabolomics aims to simultaneously measure as many metabolites as possible in a biological specimen. Often, the chemical identities of the MS-resolved peaks are not known a priori. The ultra-high-performance liquid chromatography Q Exactive hybrid quadrupole-orbitrap high-resolution accurate MS (UHPLC-QE Orbitrap HRMS) instrument has remarkable specificity and selectivity in terms of accurate mass identification and has a profound impact on the identification of characteristic metabolites from complex biological matrices. UHPLC-QE Orbitrap HRMS is of particular importance for in vivo metabolomics studies regarding the identification of untargeted metabolites in complex matrices, due to its capacity in distinguishing chemical-specific metabolites by determining the composition of precursor ions and fragment ions elements, which is capable of analyzing a broad range of secondary metabolites [6]. This method constituted a great development for the application of Q Exactive MS in the comprehensive identification and semi-quantification of untargeted metabolites in vivo. In a word, the combined pseudotargeted metabolomics including targeted and untargeted analysis provided a global insight into the complex process of TCMs in the blood.

Therefore, in order to describe the process of these four Citrus TCMs in vivo and to clarify the material basis of the therapeutic effects of different Citrus TCMs, the medicative ingredients used as the potential QC-markers in Citrus TCMs are screened, and the potential metabolic pathways are elucidated and profiled in rat plasma after oral administration. As a result, 32 absorbed prototype constituents were quantitatively determined in rat plasma after oral administration with standard vehicles of four labeled Citrus TCMs. The plasma migrating performance and potential correlations were analyzed between in vitro and in vivo concentrations of prototype constituents. This comparison indicated that the absorption of prototypic constituents exhibited obvious structure dependence. Sixty-six drug metabolites were detected and identified through deconvolution and normalization of complex spectra collected from the QE Orbitrap HRMS instrument. Based on the integrated pseudotargeted metabolomics data, a large number of metabolites from general metabolic pathways depended largely on the interaction with drug metabolizing enzymes, the constituents’ inherent property, I or II reactions, and the biotransformation of gut microbiota. These results contribute to an effective screening strategy for QC-markers and a better understanding of the in vivo exposure of complex TCMs to support further drug development and clinical application.

Materials and methods

Chemicals and reagents

HPLC-grade methanol and acetonitrile were obtained from Fisher Scientific (Fair Lawn, NJ, USA). The deionized water was redistilled. Formic acid (HPLC grade, Lot. 095224) was obtained from MREDA Technology Inc. (USA). The reference standards of bergapten and narirutin were obtained from Chengdu Must Bio-Technology Co., Ltd. Hesperidin, naringin, kaempferitrin, diosmin, diosmetin-7-O-glucoside, and sinensetin were obtained from Shanghai Source Leaf Biological Technology Co., Ltd. Eriocitrin, neohesperidin, naringenin, hesperetin, luteolin, rutin, isopimpinellin, and tangeretin were obtained from Tianjin Mark Biological Technology Co., Ltd. 5-Demethylnobiletin, nobiletin, and auraptene were obtained from Nanjing JingZhu Biological Technology Co., Ltd. Rhoifolin, apigenin, apigenin-7-O-glucoside, scoparone, and diosmetin were purchased from the National Institute for Control of Biological and Pharmaceutical Products of China. Poncirin, limonin, nomilin, eriodictyol, xanthotoxol, acacetin, isosakuranetin, and imperatorin were purchased from Beijing fufan Biological Technology Co., Ltd. The purity of the standards was relatively high (i.e., higher than 98%). Deionized water was prepared from distilled water using a Milli-Q water purification system (Millipore, Bedford, MA, USA). All other reagents and chemicals used were of analytical grade.

Preparation of Citrus TCM extract

Each Citrus TCM was soaked with 1 L ethanol for 30 min after being smashed into powder. Then, the mixture was boiled for 1 h, and this procedure was repeated three times. The combined aqueous extracts were concentrated in a vacuum rotary evaporator to 1 g/mL (crude herbal dose). The aqueous extracts (1 mL) were diluted in 99 mL of 50% methanol and filtered through a 0.22-mm microporous membrane before use.

Animal and drug administration

Male Sprague-Dawley rats (250 ± 20 g) were purchased from Beijing Vital River Laboratory Animal Technology Co., Ltd. Animals are housed in a separate cage in a room with automatic control of lighting (12 h light/dark cycles, lights on from 8:00 to 20:00), under controlled temperature (22–25 °C) and relative humidity into the food and water. Experiments, which were conducted in a specific pathogen-free (SPF) grade laboratory, were carried out in accordance with the Guide for the Care and Use of Laboratory Animals as adopted and promulgated by the National Health Ministry of China. Protocols for animal experiments had been approved by the Animal Center of the Institute of Basic Theory, China Academy of Chinese Medical Sciences.

Forty male Sprague-Dawley (SD) rats were randomly separated into four groups and administered a dosage of extracts via gastric gavage once per day, the detailed information about oral daily dose of Citrus TCMs (Zhishi, Zhiqiao, Qingpi, and Chenpi) is 3.6 g crude drug (20 times of Chinese Pharmacopoeia dose) per rat.

Plasma sample collection and pretreatment

Approximately 10 mL blood samples were collected from the abdominal aorta in heparinized tubes at 60 min after dosing. The plasma was separated from blood samples by centrifuging at 10,000 rpm for 15 min and was stored at −80 °C for later analysis. Each 1 mL aliquot of plasma sample was mixed with 3 mL of acetonitrile and vortexes for protein precipitation. After centrifugation at 10,000 rpm for 10 min, the supernatant was transferred and concentrated to dryness at 35 °C. The dried residue was then re-dissolved in 1 mL of methanol by ultrasound. After centrifugation, again at 10,000 rpm for 10 min, the supernatant was transferred to an autosampler vial. Injection volumes were 5 and 1 μL for RRLC-QqQ-MS and UHPLC-QE Orbitrap HRMS analysis, respectively.

RRLC-QqQ-MS instrumentation and conditions

The RRLC-QqQ-MS system consisted of an RRLC system (Agilent Technologies, Palo Alto, CA, USA) including a G1311A binary pump, a G1311A vacuum degasser and G1311A autosampler, and triple quadrupole mass spectrometer equipped with an electrospray source (Series 6410, Agilent Technologies).

The analytes were separated on a Poroshell 120 SB-C18 column (100 mm × 4.6 mm; 2.7 μm) and the column temperature was maintained at 25 °C. From 0 to 7 min, the A (water containing 0.1% (v/v) formic acid)/B (acetonitrile containing 0.1% (v/v) formic acid) was maintained at 25:75. From 7 to 17 min, the A was increased to 100%. From 17 to 20 min, the A was maintained at 100%. The total run time was 20 min. The mobile phase was returned to the initial conditions and reequilibration was conducted for 10 min. The flow rate was 0.5 mL/min. The sample injection volume was 5 μL.

The positive ion modes were performed with MRM for quantitative analysis by ESI. The MRM quantitative ions were then selected from the MS/MS data (see Electronic Supplementary Material (ESM) Table S1). The MS ionization source conditions were as follows: capillary voltage of 4.0 kV, drying gas temperature of 300 °C, drying gas flow rate of 11 L/min, corona current of 10 nA, and nebulizer pressure of 45 psi. The sheath gas temperature was 250 °C and sheath gas flow rate was 7 L/min.

UHPLC-QE Orbitrap HRMS instrumentation and conditions

The metabolites analysis was performed using an UltiMate 3000 Hyperbaric LC system coupled to a Q Exactive MS. Chromatographic separation was performed using an Acquity UPLC CSH C18 column (1.7 μm 100 × 2.1 mm). Mobile phase A (water contained 2 mmol/L ammonium formate, v/v, 0.1% formic acid) and mobile phase B (methanol) were utilized at a flow rate of 0.4 mL/min. From 0 to 6 min, the A was increased to 100% from 25%. From 6 to 20 min, the A was maintained at 100%. Mobile phase gradient was as follows: the temperatures of the column oven and autosampler were set at 40 and 4 °C, respectively. The injection volume was 1 μL.

The ESI source was operated in the positive mode with the following parameters: capillary temperature, 350 °C; source voltage and spray voltage, 3.7 kV; sheath gas (nitrogen) flow, 28 arb; and aux gas flow, 8 arb. Data were acquired using full MS scan (resolution, 70,000; AGC target, 1 × 106; maximum IT, 120 ms; scan range, m/z 100–1500) and collision-induced dissociation (CID)-based data dependent on MS/MS (DDMS2) (resolution, 17,500; AGC target, 1 × 105; maximum IT, 120 ms; loop count, 5; TopN = 5; isolation window, m/z 1.0; scan range, m/z 200–2000; NCE/stepped NCE: 30, 40, 50; underfill ratio, 1.0%; intensity threshold, 1.0 × 105; apex trigger, 2–6 s; dynamic exclusion, 10 s).

Preparation of calibration standards and quality control samples

The stock solutions of bergapten, narirutin, hesperidin, naringin, kaempferitrin, diosmin, diosmetin-7-O-glucoside, sinensetin, eriocitrin, neohesperidin, naringenin, hesperetin, luteolin, rutin, isopimpinellin, tangeretin, 5-demethylnobiletin, nobiletin, auraptene, rhoifolin, apigenin, apigenin-7-O-glucoside, scoparone, diosmetin, poncirin, limonin, nomilin, eriodictyol, xanthotoxol, acacetin, isosakuranetin, and imperatorin were separately dissolved in methanol to prepare a solution. A series of working standard solutions were prepared by diluting the stock solution with 50% acetonitrile to obtain the following concentrations: 1, 2, 5, 10, 16, 80, 160, 320 ng/mL for acacetin; 2, 10, 20, 40, 60, 100, 400, 800 ng/mL for apigenin; 2, 6, 20, 50, 100, 500 1000, 4000 ng/mL for apigenin-7-O-glucoside; 1, 4, 10, 16, 40, 400, 1000, 2000 ng/mL for auraptene; 4, 8, 12, 24, 50, 100, 400, 800 ng/mL for bergapten; 5, 10, 30, 200, 2000, 20,000, 100,000, 500,000 ng/mL for 5-demethylnobiletin; 10, 20, 50, 100, 200, 1000, 10,000, 30,000 ng/mL for diosmetin; 20, 40, 100, 500, 1000, 5000, 10,000, 20,000 ng/mL for diosmetin-7-O-glucoside; 30, 90, 150, 300, 500, 30,000, 100,000, 300,000 ng/mL for diosmin; 20, 40, 60, 80, 400, 2000, 20,000, 300,000 ng/mL for eriocitrin; 10, 20, 50, 100, 200, 400, 800, 1000 ng/mL for eriodictyol; 200, 400, 600, 800, 1000, 10,000, 50,000, 100,000 ng/mL for hesperetin; 200, 400, 800, 2000, 10,000, 200,000, 4,000,000 ng/mL for hesperidin; 2, 4, 8, 10, 20, 40, 200, 400 ng/mL for imperatorin; 10, 20, 30, 60, 100, 400, 1000 ng/mL for isopimpinellin; 1, 2, 4, 8, 16, 160, 300, 600 ng/mL for isosakuranetin; 2, 4, 8, 16, 32, 500, 1000, 2000 ng/mL for kaempferitrin; 20, 40, 80, 100, 800, 4000, 10,000, 30,000 ng/mL for limonin; 20, 80, 200, 400, 800, 10,000, 80,000, 400,000 ng/mL for luteolin; 20, 40, 100, 200, 500, 5000, 20,000, 120,000 ng/mL for naringenin; 200, 400, 1000, 3000, 7000, 700,000, 7,000,000, 10,000,000 ng/mL for naringin; 50, 100, 500, 1000, 5000, 10,000, 100,000, 350,000 ng/mL for narirutin; 500, 1000, 5000, 10,000, 80,000, 800,000, 8,000,000, 16,000,000 ng/mL for neohesperidin; 20, 80, 200, 1000, 5000, 20,000, 200,000 2,500,000 ng/mL for nobiletin; 20, 40, 80, 100, 150, 1500, 5000, 15,000 ng/mL for nomilin; 6, 30, 60, 100, 800, 1600, 16,000, 160,000 ng/mL for poncirin; 10, 30, 60, 120, 2000, 20,000, 100,000, 500,000 ng/mL for rhoifolin; 6, 20, 40, 50, 500, 5000, 50,000, 500,000 ng/mL for rutin; 9, 20 40, 80, 100, 500, 1000, 6000 ng/mL for scoparone; 10, 20, 60, 100, 200, 1000, 50,000, 5,000,000 ng/mL for sinensetin; 15, 30, 60, 100, 1000, 10,000, 100,000, 1,000,000 ng/mL for tangeretin; 15, 30, 50, 60, 80, 160, 1600, 8000, 16,000 ng/mL for xanthotoxol. Quality control samples are at 1, 5, 10 ng/mL for acacetin; 2, 20, 40 ng/mL for apigenin; 2, 6, 20 ng/mL for apigenin-7-O-glucoside; 1, 10, 40 ng/mL for auraptene; 8, 12, 24 ng/mL for bergapten; 5, 10, 30 ng/mL for 5-demethylnobiletin; 20, 50, 200 ng/mL for diosmetin; 20, 100, 1000 ng/mL for diosmetin-7-O-glucoside; 30, 150, 500 ng/mL for diosmin; 20, 40, 80 ng/mL for eriocitrin; 10, 20, 50 ng/mL for eriodictyol; 200, 400, 800 ng/mL for hesperetin; 200, 800, 2000 ng/mL for hesperidin; 2, 4, 8 ng/mL for imperatorin; 10, 20, 30 ng/mL for isopimpinellin; 2, 8, 16 ng/mL for isosakuranetin; 4, 8, 16 ng/mL for kaempferitrin; 20, 100, 800 ng/mL for limonin; 20, 200, 800 ng/mL for luteolin; 20, 100, 500 ng/mL for naringenin; 200, 1000, 7000 ng/mL for naringin; 50, 500, 1000 ng/mL for narirutin; 500, 5000, 10,000 ng/mL for neohesperidin; 20, 200, 1000 ng/mL for nobiletin; 20, 80, 150 ng/mL for nomilin; 6, 30,100 ng/mL for poncirin; 10, 60, 120 ng/mL for rhoifolin; 6, 20, 50 ng/mL for rutin; 9, 40, 80 ng/mL for scoparone; 10, 60, 200 ng/mL for sinensetin; 15, 60, 100 ng/mL for tangeretin; and 15, 50, 80 ng/mL for xanthotoxol. All solutions were stored at −20 °C before analysis.

Method validation

Selectivity

The selectivity of the method was evaluated by comparing the chromatograms of a blank plasma sample, a blank plasma sample spiked with the working solutions, the water extract of Citrus TCM, and a rat plasma sample after an oral administration of the water extract of Citrus TCM.

Linearity and lower limit of quantification

The calibration curves of the tested compounds were determined by least-squares linear regression of the peak area versus the concentrations. Lower limit of quantification (LLOQ) is defined as the lowest level of the analytes that can be reliably quantified based on the signal-to-noise ratio of 10:1 and reproduced with a precision of less than 20% and an accuracy of 80–120%.

Accuracy and precision

By analyzing six sets of quality control samples at three levels of concentration, the accuracy and accuracy (inter-day and intra-day) of the day were measured on 6 days with six concentrations on the same day or for three consecutive days. The accuracy was determined as RE (%) within the nominal value of 85–115%, the precision as RSD (%) within ±15%, in addition to LLOQ, its accuracy should be within 80–120%, and cannot exceed 20% of precision.

Recovery and matrix effect

Extraction recovery, evaluated in three replicates of QC sample, was determined by comparing the peak areas of extracted plasma (pre-spiked) standard QC samples to those of post-spiked standards at equivalent concentrations. The matrix effect was determined by comparing the peak response of the analytes in plasma samples with those of the pure standards prepared in the mobile phase. Six replicates of each concentration of quality control samples were prepared.

Stability

The stability of the samples was evaluated by measuring the quality control samples’ analysis data with freshly prepared mass control samples at room temperature, frozen and freeze-thaw storage conditions. The short-term (25 °C for 4 h) and long-term (80 °C for 30 days) stabilities of the analytes were evaluated using the quality control samples at three concentrations. To assess the stability of the sample in the autosampler, the quality control sample was placed in the autosampler at room temperature for 24 h and then analyzed. The stability experiments were measured by analyzing replicates (n = 5) during the sample storing and processing procedures.

Statistical analysis

Quantitative data consisting of values taken from the RRLC-QqQ-MS analyses of the extract samples and plasma samples were used for the multivariate analysis. Multivariate statistical analyses, including analysis of variance through the formula:

\( \mathrm{F}={S}_{\mathrm{high}}^2/{S}_{\mathrm{low}}^2 \) and S 2 =  ∑ (X −  ‾ X)2/(n − 1) (calculate the F value and then compare with the F value table)

and Student’s t test through the formula:

$$ t=\frac{\overline{X_1}-\overline{X_2}}{\sqrt{\frac{\left({n}_1-1\right){S}_1^2+\left({n}_2-1\right){S}_2^2}{n_1+{n}_2-2}\left(\frac{1}{n_1}+\frac{1}{n_2}\right)}} $$

were performed using SPSS 13.0.

Results

Method development.

Optimization of RRLC and QqQ-MS conditions

The plasma samples were precipitated with acetonitrile before RRLC-QqQ-MS analysis. The Poroshell 120 SB-C18 column has been proven to be suitable for simultaneous quantification of these analytes. The concentration of aqueous and organic 0.1% (v/v) formic acid solution was proved to give a better response than other aqueous phases and organic phases (e.g., for 0.2% formic acid). Gradient elution changed linearly from 25:75 (v/v) water containing 0.1% (v/v) formic acid-acetonitrile containing 0.1% (v/v) formic acid to 100:0 at a flow rate of 0.5 mL/min. The results showed that the best peak shape and resolution was obtained from a mixture of acetonitrile and water each containing 0.1% formic acid solution. The flow rate of 0.5 mL/min and column temperature of 25 °C were demonstrated to be the most suitable. The established chromatographic conditions are listed in “RRLC-QqQ-MS instrumentation and conditions” section.

MRM patterns were superior to other models in targeted analysis, and MRM patterns were used to quantify the analytes in rat plasma. The capillary temperature, the evaporator temperature, the flow rate of the analyte, and the fragmentation energy were optimized to obtain the protonated molecules and achieve the maximum response of the compound fragment ion peaks, and finding that the cationic mode provides better sensitivity for the target compound. Since the generated product ions are the most abundant, the use of MRM is used to simultaneously quantify 32 prototype analytes and to select quantitative ions.

Optimization of UHPLC and QE Orbitrap HRMS conditions

Different mobile phase compositions were screened to obtain LC chromatograms with better peak shape and separation. The water contained 2 mmol/L ammonium formate and 0.1% formic acid (v/v) in aqueous solution was ultimately selected as the mobile phase to obtain sufficiently good performance for these metabolites with good peak symmetry. To acquire better sensitivity for the base ions of most compounds in the QE Orbitrap HRMS spectra, the ionization parameters including capillary temperature, source voltage, spray voltage, sheath gas flow, and aux gas flow were optimized. The optimum conditions for QE Orbitrap HRMS were decided as follows: capillary temperature, 350 °C; source voltage and spray voltage, 3.7 kV; sheath gas (nitrogen) flow, 28 arb; and aux gas flow, 8 arb.

Targeted prototype drug metabolomics using RRLC-QqQ-MS analysis

Method validation

Selectivity

Figure S1 in ESM shows the representative MRM chromatograms of the blank rat plasma (A), blank rat plasma spiked with 32 analytes (B), and an in vivo plasma sample obtained 1 h after oral administration (C). No endogenous or extraneous peaks interfering with the analytes at the retention time were observed, indicating the specificity of the method was acceptable.

Linearity and lower limit of quantification

Representative calibration curves were as follows: acacetin: y = 0.0476x − 0.958 (r = 0.999), apigenin: y = 0.161x − 1.24 (r = 0.998), apigenin-7-O-glucoside: y = 0.155x − 8.51 (r = 0.992), auraptene: y = 0.0588x + 0.285 (r = 0.998), bergapten: y = 0.28x − 10.2 (r = 0.991), 5-demethylnobiletin: y = 0.0609x − 5.36 (r = 0.990), diosmetin: y = 0.132x − 3.63 (r = 0.995), diosmetin-7-O-glucoside: y = 0.908x − 9.68 (r = 0.999), diosmin: y = 1.04x + 0.528 (r = 0.998), eriocitrin: y = 0.813x − 3.64 (r = 0.996), eriodictyol: y = 0.565x + 2.42 (r = 0.999), hesperetin: y = 0.871x − 40.4 (r = 0.999), hesperidin: y = 0.565x − 2.61 (r = 0.998), imperatorin: y = 0.104x − 0.711 (r = 0.991), isopimpinellin: y = 0.0741x − 14.4 (r = 0.999), isosakuranetin: y = 0.214x − 9.05 (r = 0.998), kaempferitrin: y = 0.281x − 15.8 (r = 0.998), limonin: y = 1.56x + 1.05 (r = 0.999), luteolin: y = 0.703x − 12 (r = 0.996), naringenin: y = 0.374x − 24.5 (r = 0.990), naringin: y = 1.92x − 15.7 (r = 0.995), narirutin: y = 0.584x − 31.2 (r = 0.999), neohesperidin: y = 3.3x − 19.9 (r = 0.999), nobiletin: y = 0.0187x − 20.6 (r = 0.993), nomilin: y = 0.601x + 0.54 (r = 0.992), poncirin: y = 0.106x + 1.37 (r = 0.998), rhoifolin: y = 0.153x − 24.9 (r = 0.998), rutin: y = 0.432x + 2.38 (r = 0.999), scoparone: y = 0.177x − 20.3 (r = 0.996), sinensetin: y = 0.0396x − 2.63 (r = 0.991), tangeretin: y = 0.023x − 13.2 (r = 0.997), xanthotoxol: y = 0.163x − 0.365 (r = 0.996). The lowest concentrations with RSD <20% were taken as LLOQs and were found to be 0.8 ng/mL for acacetin, 1.3 ng/mL for apignin, 1.6 ng/mL for apigeinin-7-O-glucoside, 0.7 ng/mL for auraptene, 2.2 ng/mL for bergapten, 2.5 ng/mL for 5-demethlnobiletin, 5.3 ng/mL for diosmetin, 6.2 ng/mL for diosmetin-7-O-glucoside, 4.7 ng/mL for diosmin, 3.8 ng/mL for eriocitrin, 4.5 ng/mL for eriodictyol, 6.4 ng/mL for hesperetin, 10.6 ng/mL for hesperidin, 1.3 ng/mL for imperatorin, 3.3 ng/mL for isopimpinellin, 1.2 ng/mL for isosakuranetin, 2.1 ng/mL for kaempferitrin, 8.8 ng/mL for limonin, 7.6 ng/mL for luteolin, 6.8 ng/mL for naringenin, 12.6 ng/mL for naringin, 11.1 ng/mL for narirutin, 63.1 ng/mL for neohesperidin, 7.5 ng/mL for nobiletin, 7.1 ng/mL for nomilin, 2.7 ng/mL for poncirin, 3.4 ng/mL for rhoifolin, 3.8 ng/mL for rutin, 5.2 ng/mL for scoparone, 4.9 ng/mL for sinensetin, 7.9 ng/mL for tangeretin, and 4.8 ng/mL for xanthotoxol, respectively, which were sufficient for these studies.

Accuracy and precision

Accuracy and intra- and inter-day precision of LLOQ and three quality control samples for the RRLC-QqQ-MS analysis are presented in Table 1. The accuracy data in the present study ranged from 85.7 to 114.5% (RE), and the intra- and inter-day precision were 5.1 to 14.6% and 3.0 to 14.8% (RSD). This indicated that the precision and accuracy of this method were acceptable.

Table 1 The accuracy (intra- and inter-day), precision (intra- and inter-day), recovery, matrix effects, and stability for 32 compounds in rat plasma (n = 5)

Extraction recovery and matrix effect

The mean recovery and matrix effect were analyzed by analyzing the three concentrations of the quality control sample five times for RRLC-QqQ-MS analysis. As detailed in Table 1, the mean recoveries of the analytes were between 86.9 and 110.9% (RSD <15%) and the corresponding matrix effect ranges from 87.1 to 111.5% (RSD <15%). Thus, it is shown that methanol is a viable and suitable medium for the extraction of analytes and that there is no measurable matrix effect on the ionization of the analyte.

Sample stability

The stability of the quality control samples under different conditions at three concentrations was evaluated using the newly prepared mass control samples’ peak area. The results are presented in Table 1. The results indicated that these analytes were all stable in rat plasma with accuracy in the range from 95.1 to 102.0% after 24 h in the room temperature (20 °C), storage at −80 °C for 1 month, and three freeze-thaw cycles.

Quantification of plasma pharmacochemistry targeted to 32 prototypical constituents

It is well-known that the effects of TCMs are produced by their chemical constituents, and it is also accepted that only compounds absorbed into the blood have the opportunity to show bioactivity according to the opinion of plasma pharmacochemistry. In the present study, a developed quantitative plasma pharmacochemistry method was established to assist in the selection of reasonable QC-markers based on the absorption of prototypical compounds. Plant metabolomics is the profiling of the complete set of small-molecule metabolites, such as metabolic intermediates, hormones, and other signaling molecules, as well as secondary metabolites [7]. Plant metabolomics is an efficient analytical method that can aid in the understanding of the mechanisms/modes of the biosynthesis of secondary metabolites and in the identification of potential chemotaxonomic markers of different TCMs. In our recent study, a total of 73 active secondary metabolites were identified as potential classification markers among four Citrus TCMs using a plant metabolomics method. Among them, 32 chemotaxonomic markers were quantitatively determined and validated to be characteristic of the specific TCMs.

In this study, the targeted 32 prototypical compounds, including 11 glycosides, 9 flavonoids, 6 coumarins, 4 flavanones, and 2 limonoids, which are characteristically [8] and bioactively [2, 9,10,11] present in the four Citrus TCMs, were determined in these 4 standard vehicles and 28 biological samples after oral administration (eight samples each). Representative MRM chromatograms are shown in Fig. 1. The calibration curves were used for the quantitative determination of the 32 compounds. The contents of the 32 prototypical compounds in four Citrus standard vehicles and biological samples were then measured and are shown in ESM Table S2.

Fig. 1
figure 1

MRM figure for 32 compounds in four Citrus TCMs

Comparison of the relative prototypical absorption among different vehicle matrices

Based on the results depicted in Fig. 2, the changes in absorption of the prototypical constituents correspond with the method of quantitative plasma pharmacochemistry, which are presented in the box-plot with different colors. The potential correlations were analyzed between the in vitro standard vehicle matrix and in vivo blood samples of prototype constituents. The statistical analyses that are used to calculate correlations between treatment groups and Citrus standard vehicles, such as Student’s t test and Fisher ratios (common approaches for identifying putative variant prototype constituents), are reliant upon contents of prototypical compounds. Analysis by Student’s t test was employed to determine significant differences between each of the four Citrus standard vehicles and biological samples. Subsequently, the between-group vs. within-group variance of each compound was assessed in the biological samples where the compounds with higher Fisher ratios were regarded as potential candidate markers.

Fig. 2
figure 2

Combined box-and-whisker and dot plot of quantitative content of 32 prototypical compounds in Zhiqiao, Zhishi, Qingpi, and Chengpi groups (A1, B1, C1, and D1 for the Citrus TCMs extracts; A2, B2, C2, and D2 for plasma sample). The order of the prototypical compounds is arranged in descending order according to the content of the compound in the Citrus TCMs extracts, and the thumbnail (A1, B1, C1, and D1) is the same order as the A2, B2, C2, and D2. The potential candidate markers of relative higher absorption were marked by red circle, and the markers of relative lower absorption were marked by green circle. The statistical significance of differences between the four groups was marked. *p < 0.05, **p < 0.01 for Student’s t test compared to other groups; #Sig < 0.05 for Fisher ratios, compared with between-group and within-group

The result showed that 14 varied prototypical constituents were completely different from the normal level after oral administration with Zhiqiao extracts. Some of these showed relatively higher absorption, including apigenin, eriodictyol, nomilin, diosmetin-7-O-glucoside, hesperetin, naringenin, diosmin, and diosmetin. Certainly, some exhibited relatively lower absorption, including rhoifolin, apigenin-7-O-glucoside, nobiletin, tangeretin, 5-demethylnobiletin, and sinensetin. The treatment of the Zhishi extracts group showed relatively higher absorption in eriodictyol, xanthotoxol, diosmetin-7-O-glucoside, hesperetin, naringenin, and luteolin and relatively lower absorption in nobiletin, tangeretin, 5-demethylnobiletin, and sinensetin. The obvious changes in the group with Qingpi treatment included significantly higher absorption of scoparone, nomlin, limonin, diosmetin-7-O-glucoside, hesperetin, and naringenin, accompanied by significantly lower absorption of naringin nobiletin, tangeretin, 5-demethylnobiletin, and sinensetin. Meanwhile, significant differences were observed with the group given oral medication Chenpi extracts, and the absorption of eriodictyol, nomilin, diosimin, limion, and hesperetin was relatively higher and the absorption of nobiletin, tangeretin, 5-demethylnobiletin, and sinensetin was relatively lower.

Metabolite identification and biotransformation

Metabolites of prototypical compounds in four Citrus TCMs were identified using Progenesis QI software which contains multiple MS identification tools, including MS fragmentation scan, spectra matching, fragment ion annotation, and elemental composition prediction. To identify every screened metabolite rapidly, effectively, and accurately, great efforts were made for metabolite identification, which included isotopic characteristic fragment ion recognition (CFIR), constant neutral loss filtering (CNLF), pattern recognition (IPR), and mass defect filtering (MDF). Most of the prototypical compounds in the four Citrus TCMs and their metabolites contain benzene rings; thus, the CNLF method can indirectly predict biotransformation that occurs on the ring. It was found that phase I metabolites had characteristic neutral loss on the benzene ring, especially for the loss of H2O/H2O2 (for hydroxylated metabolites, e.g., M9-M11, M14-M16, M20, M21, M24-M26, M37, M48-M50, and M65) and CH3OH/CH2O (for the methoxylated metabolites, e.g., M29, M33, M34, and M38). The screened phase II metabolites included conjugates of glucuronide (M22, M39, M44, M51, and M52) and sulfate (M4, M12, M23, M30, M40, M43, and M53), predominantly presenting their specific fragments related to neutral loss of Glu moiety (176.0321 Da, <4 ppm, M22, M39, M44, M51, and M52), SO3 moiety (79.9563 Da, <4 ppm, M4, M12, M23, M30, M40, M43, and M53), and sulfate and glucuronide moiety (255.9889 Da, <4 ppm, M19 and M54), respectively. These results were consistent with previous studies [12, 13]. In the MS ionization process, because of the high stability of benzene ring, CFIR can be used to predict the corresponding metabolites, which include preliminary identification of readily identifiable metabolites to enrich the annotation of the corresponding fragment ions.

Hydroxylated and dehydroxylated metabolites

Metabolites M1 (t R 8.873 min), M3 (t R 8.898 min), M6 (t R 8.651 min), M31 (t R 8.667 min), M36 (t R 8.752 min), M45 (t R 8.805 min), and M56 (t R 8.674 min) had the formula C15H12O7, C16H12O7, C15H10O6, C15H10O6, C11H6O5, C21H22O9, and C20H20O9 for the [M + H]+ ions shown in Table 2, which were approximately 16 Da (+ O) or 32 Da (+ 2O) higher than eriodictyol, diosmetin, apigenin, xanthotoxol, nobiletin, and 5-demethylnobiletin, suggesting that the metabolite may be formed by the hydroxylate of prototypical compounds. For M2, M7, M8, M17, M18, M27, M28, M32, M35, M41, M42, M59, M61, M63, and M64, as shown in Table 2, had ions m/z 369.1805, 285.1595, 257.0834, 269.1756, 253.1807, 261.1132, 261.1131, 256.1035, 256.1206, 255.0848, 187.1449, 191.0409, 175.0485, 378.9265, 563.1384, 617.4423, and 601.3859 at MS2 which were approximately 16 Da (− O), 32 Da (− 2O) or 48 Da (− 3O) lower than corresponding ions of eriodictyol, diosmetin, hesperetin, luteolin, apigenin, xanthotoxol, scoparone, sinensetin, kaempferitrin, and rutin. Therefore, they were identified tentatively as hydroxylates and dehydroxylates of prototypical compounds and isomers.

Table 2 Mass spectral data and biotransformation of prototypical compounds screening and identification

Methoxylated and demethoxylated metabolites

As shown in Table 2, M29, M33, M34, and M38 exhibited the [M + H]+ ions at m/z 317.0453, 300.8961, 300.8854, and 233.0639 with formula C16H12O7, C16H12O6, and C12H8O5 and were eluted at 9.863, 8.537, 8.626, and 9.161 min, respectively, which was approximately 30 Da more than luteolin, apigenin, and xanthotoxol; they were speculated as products of methoxylated metabolites (+ OCH2) compared with these prototypical compounds. Metabolites M06, M55, M58, and M60, with [M + H]+ ions at m/z 271.1385, 359.1934, 364.9185, and 334.838, were eluted at 8.725, 8.993, 10.798, and 8.782 min, respectively. They were 30 Da or 60 Da less than diosmetin, 5-demethylnobiletin, and sinensetin with formula C15H10O5, C19H18O7, and C19H18O6, suggesting the loss of one or two methoxyl moiety.

Methylated and demethylated metabolites

Metabolites M13, M46, M47, and M57 were detected at 8.908, 8.76, 8.647, and 8.845 min. They generated the protonated molecule ion at m/z 289.1879 (C15H12O6), 396.7909 (C19H18O8), 368.8728 (C17H14O8), and 380.7709 (C19H18O7), which were 14 Da (− CH2), 28 Da (− C2H4), or 56 Da (− C4H8) less than that of hesperetin, nobiletin, and sinensetin, suggesting that they were demethylated products of these prototypical compounds. Metabolites M09-M11, M14-M16, M21, M22, M24-M26, M37, M48-M50, and M65 were speculated to be C17H14O6, C18H16O6, C18H18O6, C17H16O5, C16H12O6, C17H14O6, C12H8O4, C22H24O8, C23H26O8, and C29H34O16 according to their HRMS spectral data. They were 14 Da (+ CH2) or 28 Da (+ C2H4) higher than that of diosmetin, hesperetin, naringenin, luteolin, xanthotoxol, nobiletin, and rutin and were deduced as methoxylated metabolites.

Conjugated metabolites

With their conjugation on the hydroxy group of the benzene ring, we identified sulfate conjugates that were approximately 80 or 96 Da higher than prototypical compounds, such as M04 (eriodictyol + SO3), M12 (diosmetin + SO3), M23 (naringenin + O + SO3), M30 (luteolin + SO3), M40 (xanthotoxol + O + SO3), M43 (scoparone + SO3) and M53 (nobiletin + O + SO3); glucuronide conjugates, such as M22 (naringenin + Glu), M39 (xanthotoxol + O + Glu), M44 (scoparone + O + Glu), M51, and M52 (nobiletin + Glu), were detected as 176 or 192 Da more than that of prototypical compounds; and the sulfate and glucuronide conjugates, such as M19 (hesperetin + SO3 + Glu) and M54 (nobiletin + SO3 + Glu), were approximately 256 Da more than that of prototypical compounds.

Discussion

TCM is a complex system that exerts its therapeutic effect through the synergistic activity of multiple characteristic constituents. For the constituents absorbed into the blood, which have the opportunity to show bioactivities, a novel strategy based on pseudotargeted metabolomics for screening of QC-markers was developed. The pseudotargeted method which integrates the advantages of both targeted and untargeted methods is demonstrated to be a comprehensive strategy with high-quality and information-abundant data. Researchers, however, often ignore the importance of the prototype constituents with lower absorption, which might act on a converted type. Therefore, it is vital to pay much attention to the design of the pseudotargeted metabolomics strategy developed for screening QC-markers, as any prototype or converted metabolite in the blood will greatly reflect the overall efficiency of TCMs. Due to the structure diversities of the characteristic constituents in TCMs, the bioactivity and absorption of the prototypic constituents exhibit obvious structure-dependence. These may be attributed to drug interaction with intestinal microbiota, CYP-mediated drug metabolism, and their intrinsic chemical properties. To our knowledge, the host biotransformation systems include phase I and phase II reactions. The phase I metabolism, including methoxylation, methylation, hydroxylation, and hydrolysis, is mainly mediated by CYP enzymes in the liver, gut, and other tissues. The phase II metabolism is the conjugation reaction, including glucuronidation and sulfonation.14 Accordingly, a total of 66 metabolites were convincingly identified, including 52 phase I metabolites and 14 phase II metabolites in rat plasma. The global absorption and in vivo biotransformation process after orally administrated with standard extracts of Citrus TCMs was depicted in a systematic manner, as shown in Fig. 3.

Fig. 3
figure 3

Prototypical compounds and metabolites inside the boxes represent the metabolism of in rats’ stomach, liver, and gut. The compounds connected by the red arrow may cause the relative higher absorption, the green arrow may cause the relative lower absorption

The gastrointestinal microbiota can be considered as a site with huge biotransformational capacity of drugs, including hydrolysis and glycosylation. Hydrolysis should directly lead to a decrease in the absorption of glycosides [14]. Because of the hydrolysis of the gastrointestinal microbiota, glycosides as rutin and kaempferitrin exhibited lower absorption, while contributing to the content of their aglycone with the quercetin and kaempferol detected in the blood in “Metabolite identification and biotransformation” section. In addition, polyphenols, such as luteolin, scoparone, and eriodictyol which show relatively high absorption in this study, may affect the binding of microorganisms and other microorganisms by capturing metal ions (such as iron and cobalt) of metalloenzymes in intestinal microbial cells, resulting in the property of chelating with metal ions to form insoluble complexes to inhibit glycosylation in the intestines [15]. For glycosylation in the intestines, the toxicity or activity of the drug is reduced, and the polarity is increased to accelerate the excretion directly, with the flavonoid aglycones, such as acacetin and isosakuranetin, showing relatively weaker absorption in the results of our study. The prototype compounds, including eriodictyol, diosmetin, scoparone, xanthotoxol, hesperetin, naringenin, luteolin, and nobiletin, are conjugated with endogenous molecules (glucuronosyl and sulfo) by host enzyme transfer systems to increase their excretion. In our study, when administered orally, flavonoid glycosides demonstrate lower absorption and total plasma levels compared to aglycones. Thus, the interaction between the gut microbiota and orally administrated drugs may modify the structure and function of chemicals and be important in drug investigation.

CYP in the liver and other tissues are a major source of variability in drug pharmacokinetics and response. Functional CYPs belonging to the CYP1, 2, and 3 families are responsible for the biotransformation of most foreign substances, including a large scale of structure diversities in clinical use. The CYP enzymes have indispensable functions in drug metabolism; also, flavonoids are known to have effects on CYP enzymatic activity. Expression of CYP1A2 in the CYP1 family was relatively high in the liver, the main drug metabolic organ, which plays a significant role in the metabolism of several clinically important drugs by catalyzing biotransformations of deethylation and demethylation [16]. Flavanones, including hesperetin and naringenin, have enhanced absorption, which might be attributed to the inhibition of CYP1A2 [17]. CYP2C9 is the highest expressed member being expressed in the CYP2 family, while CYP2C8 and CYP2C19 are expressed at ~2-fold and 10-fold lower levels, catalyze biotransformations, including deethylation, oxidation, and hydroxylation [16]. Diosmetin inhibits CYP2C8 and CYP2C9 at concentrations similar to those observed after in vivo administration (in the low micromolar range) and may be of potential clinical relevance, since it may cause the higher absorption in this study with drugs metabolized by this CYP [18, 19]. CYP3A4 is in the majority of individuals, is abundantly expressed in the liver, and plays a major role in the metabolism of ~30% of clinically used drugs. The active site of CYP3A4 is large and flexible and can accommodate and metabolize many preferentially lipophilic compounds with the typical pathway of hydroxylation, demethylation, and demethoxylation [16]. In this instance, the polyhydroxylated flavones luteolin and naringenin may inhibit the metabolism of the drug which is catalyzed by CYP3A4, causing the stronger relative absorption [20, 21]. However, polymethoxyflavones, including nobiletin, tangeretin, 5-demethylnobiletin, and sinensetin, exhibited relatively lower absorption in our study, which is consistent with a previous report that this kind of constituent showed low oral bioavailability. The induction of CYP1A1, CYP1A2, and CYP2B enzymatic activity by the polymethoxyflavones might enhance the metabolism of parent constituents. In addition, hydroxylated polymethoxyflavones have been shown to have a broad spectrum of biological activities, such as anti-inflammatory, anti-cancer, and anti-atherogenic properties, indicated polymethoxyflavones may act on a metabolic type in vivo [22, 23]. This finding indicated that any ignored blood metabolites converted by in vivo biotransformation will greatly influence the overall effect of the specific TCM, which will be based on incomplete and skewed information.

In the present study, targeted metabolomics employing RRLC-QqQ-MS operated in the MRM mode is applied for determination and quantification of 32 prototype metabolites absorbed in the blood. Untargeted metabolomics employing UHPLC-QE Orbitrap HRMS can invariably determine the downstream metabolites’ effects on lower blood concentrations of the parent drug in the complex TCMs. Integrating these two methods, pseudotargeted metabolomics methodology was proposed to provide a global insight into the complex process of TCMs in the blood and to characterize the rational screening of potential QC-markers of TCMs.