Introduction

Tripterygium wilfordii Hook.f. is a herb used in Traditional Chinese Medicine which has a diverse range of bioactivities and thus, has been used in the treatment of an array of ailments including inflammation, systemic lupus erythematosus and rheumatoid arthritis for hundreds of years [1,2,3]. About 415 secondary metabolites were isolated from T. wilfordii, mainly terpenoids and alkaloids [4]. Triptolide, a main diterpenoid epoxide, has been reported to have immunosuppressive and anticancer activities [5,6,7]. Sesquiterpene pyridine alkaloids (such as wilforgine, wilforine, wilfortrine and wilfordine) have immunosuppressive, antitumor-promoting and cytotoxic, antiviral and anti-inflammatory properties [8,9,10,11,12]. Traditionally, the secondary metabolites such as terpenoids and alkaloids are obtained by extraction from the root of T. wilfordii. However, T. wilfordii is not widely cultivated and the content of secondary metabolites in natural plants is far below demand. Therefore, it is necessary to understand the secondary metabolites biosynthetic pathway for producing the compound through synthetic biology strategies.

The selection of target genes for the genetic manipulation of plants towards the production of bioactive metabolites often relies on quantitative transcription reverse polymerase chain reaction (qRT-PCR) analysis [13]. The qRT-PCR data of target genes is normalized against that of reference genes that, in turn, must be characterized by high levels of stability of gene expression, across different developmental stage and different experimental settings. This data normalization procedure mitigates many sources of experimental errors associated with the qRT-PCR method [14, 15]. Ideal reference genes should be stable and reliable throughout the diverse experimental conditions. However, no single reference genes have been shown to be universally constant during development and diverse experimental conditions. Furthermore, many studies have shown that traditionally used popular housekeeping genes used to normalize gene expression vary considerably under different conditions [16,17,18,19,20,21].

To elucidate biosynthesis pathway of bioactive metabolites, accurate quantification of key genes expression is needed which will help us to understand gene function in secondary metabolite biosynthesis. qRT-PCR has become the preferred method due to its high sensitivity, specificity, and accuracy in detecting the target gene expression. In previous studies, many reference genes were used to study the secondary metabolites biosynthesis for T. wilfordii, including elongation factor 1-α (EF-1α) [22,23,24,25], ACTIN7 [26], β-Actin [27,28,29,30,31], polypyrimidine tract-binding protein (PTB) [32].

Until now, no systematic validation of reference genes has been performed in T. wilfordii. Therefore, we evaluated the expression stability of ten genes (PP2A, TBP, TUB, 26S, SAND, GAPDH, TIP41, eIF-4α, EF-1α, and ACTINT7) among the tissues of T. wilfordii and under a range of abiotic stresses, such as drought, salinity, and methyl jasmonate (MeJA) and salicylic acid (SA) in T. wilfordii hairy roots by qRT-PCR. To validate the reliability of the reference genes, the expression levels of the TwHMGR and TwDXR, which encode for enzymes that participate in the isoprenoid biosynthesis pathways in T. wilfordii, were used to assess the performance of the newly selected reference genes to confirm their suitability for qRT-PCR analysis in T. wilfordii.

Materials and methods

Plant materials and experimental treatments

Tripterygium wilfordii plants were grown in the greenhouse of Northwest A&F University which transferred from a natural growing in Fujian Province of China. 0.8–1 g of roots, stems and leaves were collected randomly each from six cottage plantlets with the same growth and development status, each tissue part was repeated three times. The abiotic stress treatments were performed on hairy roots (HRs) of T. wilfordii. The culture of HRs was carried out as described previously [33]. Stock solutions of MeJA and SA were prepared and added to the final media (0.1 mM) [34]. The samples were collected from plants exposed to different stress treatments after 0, 3, 6, 9, 12, and 24 h for follow-up analysis. Salinity stress was induced by the addition of NaCl to the media to the final concentration of 150 mM. Drought stress was induced by the addition of PEG 8000 (20% w/v) to the media [35]. The different plant tissues were collected, immediately frozen in liquid nitrogen, and kept at − 80 °C for further analysis. All the treatments were gathered in three biological replications.

RNA extraction for cDNA synthesis

RNA isolation was performed according to Miao et al. with minor modifications [36]. The extraction buffer contained the following: 2 M NaCl, 200 mM Tris (pH 8.0), 25 mM EDTA, 2% CTAB (w/v), 2% PVPP (w/v) as well as 5% β-mercaptoethanol (v/v). RNA concentration and purity were measured using an M200 Pro-multimode reader (Tecan, Switzerland). RNA integrity was further assessed by electrophoresis using 1% agarose gels. For each sample, about 300 ng of total RNA was treated with gDNA Eraser (Takara, Japan) and used to make cDNA with PrimeScript™ RT (Takara, Japan).

qRT-PCR conditions

The qRT-PCR analysis was performed in an iCycler iQ™ Real-Time PCR Detection System (BioRad, USA). Each reaction contained nuclease-free water (9.5 μL, Takara, Japan), SYBR Premix Ex Taq™ II (12.5 μL), forward primer (1 μL, 10 μM) and reverse primer (1 μL, 10 μM), and cDNA (1 μL, unquantified). The cycling condition was: 95 °C for 3 min, followed by 40 cycles of 95 °C for 10 s, 52 °C for 30 s and 72 °C for 30 s followed by 95 °Cfor 10 s. Controls for reverse transcription were included in the PCR reactions to reveal any DNA contamination. Each PCR analysis was performed with three biological replicates and three technical replicates and the mean values were used for the analysis. Amplification efficiency (E) and correlation coefficient (R2) were of primers were calculated by standard curve method, and all the samples were diluted five times. The E value is calculated by formula (% Efficiency = (10[− 1/slope] − 1) × 100%) [37].

Selection of candidate reference genes and primer design

Ten reference genes, frequently used in Arabidopsis studies, were investigated in this study: protein phosphatase 2A (PP2A), TATA-box binding protein (TBP), TUB beta-7 (TUB), ribosomal RNA 26S (26S), SAND family protein (SAND), Glyceraldehyde-3-phosphate Dehydrogenase (GAPDH), Tap42-interacting protein (TIP41), eukaryotic translation initiation factor 4a − 1 (eIF-), elongation factor − 1α (EF-) and Actin 7 (ACTINT7) study (Table 1) [38, 39]. The sequences of these Arabidopsis genes were blasted against the root transcriptome (the accession number SRX472292) of T. wilfordii. qRT-PCR primers were designed based on the identified orthologous sequences for the ten putative reference genes using Primer Premier 5 software (amplicon length: 97–148 bp, and melting temperature (Tm): 51–59 °C).

Table 1 Candidate reference genes and target genes description and primer sequences

Analysis of gene expression stability

Ten selected reference genes were compared regarding the stability of their gene expression with four algorithms: GeNorm, NormFinder, BestKeeper, and the ΔCt. Finally, RefFinder (http://www.fulxie.0fees.us), a comprehensive evaluation platform which integrates the four algorithms mentioned above, was used to produce the final ranks regarding their overall stability.

GeNorm method

The GeNorm algorithm assumes that two suitable reference genes must have a constant gene expression ratio across samples, and therefore, is a measure of variation between a single candidate reference gene and each of the target gene analyzed [40]. Suitable qRT-PCR reference genes must have expression stability values of M < 1.5. This algorithm is one of the most commonly used to assess the performance of putative reference genes.

NormFinder method

The NormFinder method measures stability value (SV) of gene expression for reference genes by measuring the variance in gene expression within and between groups [41]. According to the GeNorm method, genes with the lowest SV are less variable in terms of gene expression and thus, better reference genes.

Delta-Ct (ΔCt) method

The Delta-Ct (ΔCt) method [42] is somewhat similar to the GeNorm method because it compares gene expression (ΔCt values) within a group to that of other groups. The reference gene that produces the lowest standard deviation (SD) across all samples is considered the most stable.

BestKeeper method

The BestKeeper method estimates Cq values for all candidate reference genes by measuring the coefficient of variation (CV) and the standard deviation (SD) for each reference gene. The most stable reference gene will exhibit the lowest Cq values [43].

RefFinder method

RefFinder analyzes the rankings provided by the GeNorm, NormFinder, BestKeeper and ΔCt algorithms and calculates a geometric mean for each gene. The best reference genes are the ones with the lowest geometric mean values [44].

Validation of reference gene analysis

TwHMGR (KU246037.1/GAVZ01053357.1) and TwDXR (KJ174341.1/GAVZ01038138.1) encode for enzymes that play a role in the biosynthesis of isoprenoids in T. wilfordii [26, 33, 45]. Variation in the transcript levels of these genes were used to validate the performance of the previously identified best reference genes. To evaluate the validity of the selection of reference genes, the expression levels of the TwHMGR and TwDXR genes were analyzed in different tissues and MeJA-induced abiotic stress treatments. For each experimental condition, the expression of TwHMGR and TwDXR were normalized using the two best reference genes (ACTIN7 and TBP), and a relatively unstable gene (TUB) recommended by RefFinder. RT-qPCR was performed as described above. The average Ct value was calculated from three biological and technical replicates and was used for relative expression analyses. The relative expression of the TwHMGR and TwDXR gene was subsequently calculated according the 2−∆∆CT method [46]. The expression levels of target genes were assessed using one-way ANOVA using SPSS v.17.0 with a signifcance level set at P < 0.05.

Results

Primer specificity and PCR efficiency

The nucleotide sequences of the ten candidate genes were obtained from the Arabidopsis genome database and blasted against a T. wilfordii transcriptome database (accession number SRX472292, http://www.ncbi.nlm.nih.gov/sra/SRX472292) in order to identify their corresponding homologs to be assessed as putative reference genes for T. wilfordii. (Table 1). PCR amplification produced a single band for each primer pair (each gene) as assessed by 2% agarose gel electrophoresis (Fig. 1a). The qRT-PCR melting-curve also showed a single peak corresponding to the expected size of the amplicon (Fig. 1b). These results strongly suggest that each primer pair was able to produce specific PCR amplified products of the target genes. Table 1 summarizes the primer sequences, amplicon length, amplicon temperature, correlation coefficient, and PCR amplification efficiency values. qRT-PCR amplification efficiency varied from 91.8% for EF- to 107.6% for 26S, whereas the correlation coefficients varied from 0.990 for TBP to 0.998 for SAND.

Fig. 1
figure 1

Amplification products of the 10 genes. a Lanes 1–10 are PP2A, TBP, TUB, 26S, SAND, GADPH, TIP41, eIF-4α, EF-1α and ACTIN7, respectively.. M: Marker DL2000. b Melt curves of the ten reference genes

Determination of quantification cycle (Cq) values

Quantification cycle values (Cq values) were obtained for the ten candidate reference genes, by performing qRT-PCR amplification reactions across the different T. wilfordii Hook.f. tissues and different abiotic stresses in root samples. The mean Cq values ranged from 16.25 to 32.98 across all samples for all the ten candidate reference genes. The 26S showed the lowest Cq (mean Cq of 18.08), suggesting that this gene had the highest expression level of all compared putative reference genes. EF- and ACTIN7 had slightly higher Cq values of 23.75 and 24.21 respectively, suggesting that these genes were also abundant transcripts. E1F4α, TUB, TBP, SAND, and PP2A had average Cq values between 24.25 and 28.35 and thus, were only moderately expressed. TIP41 and GADPH had the highest Cq (mean Cq of 29.71 and 30.92) suggesting that these genes transcripts were the lowest in abundance. The mean and median Cq values were similar for each gene, especially in the case of GAPDH and TBP which suggests that the Cq values were evenly distributed. The Cq values of EF- and TIP41 were distributed more centrally suggesting less variation across samples (Fig. 2).

Fig. 2
figure 2

Box and whisker plot displaying the Cp values for the candidate reference genes in all samples. The lower and upper edges of the box represent the 25th and the 75th percentiles, respectively; the line inside the box indicates the median value; the whiskers represent the maximum and minimum values

Determination of the optimal number of reference genes

The GeNorm algorithm was used to estimate the number of reference genes necessary for accurate data normalization using pairwise variation analysis by measuring the ratio (Vn/Vn + 1) between two sequential normalization factors (NFn and NFn + 1). Figure 3 shows pairwise variation (V) values for the reference genes in which the value is lower than 0.15 as to be optimized. The results show that a combination of two candidate reference genes efficiently normalizes the qRT-PCR data across T. wilfordii tissues and different abiotic treatments. According to the GeNorm algorithm, the best combinations were ACTINT7 + TBP across all samples, TIP41 + ACTINT7 across all tissues and across MeJA abiotic stress samples, 26S +ACTINT7 across SA abiotic stress samples, TBP + eIF-4α across drought stress samples and TBP + EF- across salinity stress samples.

Fig. 3
figure 3

Determination of the optimal number of reference genes for normalization by pairwise variation (V) using GeNorm

Gene expression stability analysis of candidate reference genes

The gene expression stability, in different tissue samples and in different stress treatments, was determined for all putative reference genes and analyzed using GeNorm, NormFinder, ΔCt, and BestKeeper. Finally, RefFinder summarized the results of the previously listed algorithms.

GeNorm analysis

Figure 4 summarizes the stability scores for the ten putative genes across all samples as estimated by GeNorm. ACTINT7 and TBP had the highest level of stability, whereas eIF- was the least stable gene across samples. Across tissues, TIP41 and ACTINT7 had the highest stability levels, while EF- showed the least stable expression across tissues. The most stable genes across samples submitted to the MeJA abiotic stress were TIP41 and ACTINT7, whereas TUB was the least stable. Under SA abiotic stress treatment, ACTINT7 and 26S showed the highest stability, whereas eIF- showed the least stable expression. TBP and eIF- had the highest stability in the drought stress group, whereas TBP and EF- showed low stability under salinity stress. TUB showed the lowest level of gene stability under both drought and salinity abiotic stresses.

Fig. 4
figure 4

Expression stability of ten candidate genes as calculated by GeNorm

NormFinder analysis

The NormFinder analysis differs from that of GeNorm as it uses ANOVA analysis to estimate intra- and inter-group variation in gene expression stability for each reference genes. According to this algorithm, ACTIN7 and TBP showed the highest level of stability in gene expression across all tissues and different abiotic stress treatments. TIP41 and ACTINT7 showed the highest stability across tissue samples. EF- and TIP41 were the most stably expressed genes in samples submitted to MeJA-induced abiotic stress whereas in samples submitted to SA-induced abiotic stress ACTIN7 and TBP had the highest stability. Finally, TBP and eIF- showed high levels of stability under drought stress, whereas TBP and EF- were the best reference genes under salinity stress (Table 2).

Table 2 Ranking of ten candidate reference genes under different experimental conditions in order of their expression stability calculated by NormFinder

Delta-Ct (ΔCt) analysis

In the ΔCt analysis, the reference gene that produces the lowest gene pair standard deviation (SD) across all samples is most stable and therefore the most suitable reference gene. The ten candidate reference genes were ranked from the highest to the lowest levels of stability of gene expression according to the ΔCt algorithm (Table 3). Overall, ACTINT7 had the highest level of stability in gene expression across all tissues, and within the SA-induced abiotic stress group. In the MeJA-induced abiotic stress group as well as across different tissues TIP41 was the most stably expressed, whereas TBP was the best reference gene under the drought and salinity stresses.

Table 3 Ranking of ten candidate reference genes under different experimental conditions in order of their expression stability calculated by ∆Ct

BestKeeper analysis

BestKeeper estimates expression stability by measuring the standard deviation (SD) across samples. A gene with SD > 1 is considered unacceptable for a reference gene. The SD values of all genes as estimated by BestKeeper logarithm are shown in Table 4. Across all samples, TIP41 (0.51) had the lowest SD value, followed by SAND (0.55), while EF- and eIF- had SD values were higher than 1. Across tissue samples, TIP41 was considered the most suitable reference genes (0.37), followed by ACTINT7 (0.60) and TUB (0.67). Among the MeJA treated samples, all reference genes had SD values lower than 1.0, the lowest ones obtained for the TIP41, ACTINT7 and TBP (0.31, 0.33 and 0.4, respectively). Similarly, under the SA stress treatment, all reference genes had low SD values (SD < 0.52), with the TBP (0.21), PP2A (0.3) and GADPH (0.32) showing the highest levels of stability. ACTIN7 showed the lowest SD value of 0.09 in the drought treatment group. Under salinity stress, GADPH (0.53) and TIP41 (0.67) were the two most stable genes.

Table 4 Ranking of ten candidate reference genes under different experimental conditions in order of their expression stability calculated by BestKeeper

RefFinder analysis

The results from the four previously described algorithms were integrated by the RefFinder algorithm and the results (Table 5). The stability rank of the ten putative reference genes was listed from the highest to the lowest as assessed by RefFinder (Fig. 5). The expression of TBP was the most stable under the drought and salinity stress. The most stable gene under MeJA-induced abiotic stress was TIP41. The most stable gene under the SA-induced stress and across different tissue samples was ACTIN7. Overall, the level of stability of the expression pattern of ACTIN7 revealed that this gene is the most reliable regarding the normalization of qRT-PCR data in T. wilfordii, across all samples.

Table 5 Expression stability ranking of the ten candidate reference genes as calculated by RefFinder
Fig. 5
figure 5

Expression stability of ten candidate reference genes as calculated by RefFinder

Expression analysis of TwHMGR and TwDXR genes for reference gene validation

TwHMGR and TwDXR encode for enzymes directly involved in the biosynthesis of isoprenoids in T. wilfordii [26, 47]. The relative expression of TwHMGR and TwDXR were used to validate the suitability of the reference genes across different tissues as well as across root samples in the MeJA-induced abiotic stress group (Fig. 6). We used the RefFinder stability rankings to select the most stable genes (ACTIN7 and TBP) and the least stable gene (TUB) for the normalization of qRT-PCR data obtained for TwHMGR and TwDXR.

Fig. 6
figure 6

Relative quantification of TwHMGR and TwDXR expression using validated reference genes including the most or the least stable reference genes for normalization under different tissues and MeJA treatment experimental sets. aTwHMGR expression of hairy roots under MeJA treatment after 0 h, 3 h, 6 h, 9 h, 12 h and 24 h; bTwHMGR expression of different tissues; cTwDXR expression of hairy roots under MeJA treatment after 0 h, 3 h, 6 h, 9 h, 12 h and 24 h; dTwDXR expression of different tissues. Bars represent the mean ± SE of three biological replicates

For MeJA treatment, TwHMGR expression was induced significantly at 3 h and kept to increase until 6 h and then decreased at 24 h, which was normalized by the two most stable reference genes (ACTIN7 and TBP). Furthermore, the expression of TwHMGR which was normalized by the two most stable reference gene (ACTIN7 and TBP) was markedly different from the expression level which was calculated using the least suitable reference gene (TUB) (P < 0.05) (Fig. 6a). Across different tissues, transcript level of TwHMGR was the lowest in leaf while it was the highest in root that was normalized by the two most stable reference gene (ACTIN7 and TBP). However, the expression level of TwHMGR which was normalized by the two most stable reference gene (ACTIN7 and TBP) was significantly different from the expression level which was calculated using the least suitable reference gene (TUB) among different tissues (P < 0.05) (Fig. 6b).

The expression level of TwDXR began to increase at 3 h, increased until 6 h and then dropped at 24 h that was normalized by the two most stable reference genes (ACTIN7 and TBP). However, the expression level of TwDXR normalized by the two most stable reference genes (ACTIN7 and TBP) was significantly different from the expression level calculated using the least suitable reference gene (TUB) (P < 0.05) (Fig. 6c). Across different tissues, significant difference of TwDXR transcript levels was observed. The expression level of TwDXR normalized by the two most stable reference genes (ACTIN7 and TBP) was significantly different from the expression level calculated using the least suitable reference gene (TUB) (P < 0.05) (Fig. 6d).

Discussion

The qRT-PCR analysis is a commonly used method to quantify gene differential expression levels in gene function studies [48]. However, the accuracy of this method depends on the availability of suitable reference genes, which is used to normalize the expression of the target genes. We made an orthologue search analysis using the sequences of ten reference genes of the Arabidopsis genome against the T. wilfordii root transcriptome database (accession number SRX472292, http://www.ncbi.nlm.nih.gov/sra/SRX472292) (Table 1) to identify putative reference genes in T. wilfordii.

The images of the agarose gels of each putative reference gene PCR products as well as the dissociation curves of the qRT-PCR amplification reactions showed that the primers designed for this analysis had good specificity. At the same time, the high value of the R2 of the standard curve equation as well as the high amplification efficiency values obtained during the qRT-PCR analyses confirmed the high degree of specificity of the primers designed to amplify T. wilfordii reference genes.

qRT-PCR data was obtained across different tissues of T. wilfordii as well as across samples exposed to different abiotic stress treatments. The stability of gene expression of ten putative reference genes was assessed with five statistical algorithms (GeNorm, NormFinder, BestKeeper, ΔCt, and RefFinder). In general, GeNorm, NormFinder, ΔCt, and RefFinder were able to produce consistent stability rank for the putative reference gene (Table 5). The BestKeeper algorithm is based on a different principle as compared to the other algorithms, which might explain why the BestKeeper results differed the most from those of the other algorithms [49]. Finally, we selected the most suitable reference gene(s) as determined by GeNorm, NormFinder, ΔCt, and the RefFinder algorithms.

For validation purposes, we chose TwHMGR and TwDXR as target genes, which encode for enzymes directly involved in the biosynthesis of isoprenoids in plants. The HMGR reductase catalyzes the irreversible conversion of 3-hydroxy-3-methylglutaryl coenzyme A (HMG-CoA) into mevalonic acid. This catalysis is a key regulatory step in isoprenoids biosynthesis leading to the biosynthesis of IPP and DMAPP [50, 51]. The (DXR) reductoisomerase catalyzes the conversion of DXP; a key protein activated upstream of the MEP pathway. Moreover, the addition of MeJA and SA to the media mimics the endogenous signals that occur during the plant response to biotic and abiotic stresses [52]. Moreover, MeJA and SA have also been used as elicitors to enhance the accumulation of secondary metabolites in adventitious roots and hairy roots of T. wilfordii Hook. f. [33, 36, 53]. Hence, we chose MeJA to induce an abiotic stress response in the T. wilfordii hairy roots in order to evaluate the stability of gene expression. Ideally, the stability of gene expression of a reference gene is not affected by tissue types and experimental conditions. As shown in Fig. 6a, c, TwHMGR and TwDXR are gradually upregulated by MeJA over the time course of the experiment until the 6 h. This pattern of gene expression was also observed when data was normalized against ACTIN7 and TBP, which had been identified as the most stable reference genes in previous studies [54,55,56]. Figure 6b and 6d show that the expression trends of TwHMGR and TwDXR in different tissues were identical when ACTIN7 and TBP were used for qRT-PCR data normalization. By contrast, large discrepancies in expression patterns appeared when the least stable reference gene (TUB) was used for data normalization.

This study is the first, to our knowledge, to evaluate and validate reference genes for the normalization of qRT-PCR data from different plant tissues and different abiotic stress conditions in T. wilfordii. The optimization of the qRT-PCR analysis depends on the identification and validation of reference genes in different tissues, different development stages, and under different experimental conditions. Our study successfully identified the most suitable reference genes for future studies based on qRT-PCR analysis of T. wilfordii.