Introduction

The identification of body fluids in the forensic context can be important, which helps establish a link between crime scenes and criminal acts. Traditionally, enzymatic, immunological, or chemical detection tests are routinely performed to identify the biological origin of body fluids. However, most conventional tests have disadvantages, such as a lack of specificity and an inability to discriminate mixed body fluids [1]. Therefore, various kinds of biomarkers were introduced to enhance the identification of body fluids, such as tissue-specific messenger RNAs (mRNAs) [2, 3] and microRNAs [4, 5], methylation markers of genomic DNA [6, 7], and microbial markers [8, 9]. As a promising method, mRNA profiling benefits from the simultaneous DNA/RNA coextraction and the detection of a set of biomarkers in one polymerase chain reaction (PCR) [10, 11]. In recent years, tissue-specific mRNA markers have been intensively investigated and multiplex assays have been developed [12, 13], which advances the identification of body fluids.

Casework samples are usually of limited quantity or have been subjected to degradation, which represents a significant challenge in forensic body fluid identification. To this end, sensitive and stable biomarkers have been investigated, such as microRNAs. MicroRNAs are thought to be less prone to degradation because of their small size and the stabilizing effect of Argonaute proteins [5, 14, 15]. In mRNA profiling, technical improvements and the utilization of stable regions of transcripts as targets can facilitate the detection of biomarkers, which helps improve the body fluid identification of trace and degraded samples [16,17,18]. Technical improvements, such as the increased recovery of transcripts and the purification of PCR products, indeed do not increase intrinsic transcripts of biomarkers [16, 17]. Although the use of degraded RNA transcript stable regions (StaRs) as targets was recently shown to enhance the body fluid identification of degraded samples, a small number of biomarkers used in forensic body fluid identification were identified to harbor StaRs [18]. Therefore, the improvement in the sensitivity and stability of biomarkers still requires further investigation.

Recently, circular RNAs (circRNAs), a class of noncoding RNA molecules, were discovered to be abundantly expressed in human cells [19,20,21,22]. These circular transcripts share exons with their linear counterparts and are generally formed by alternative backsplicing of pre-mRNA, in which a downstream 3′ end of an exon is covalently linked with an upstream 5′ end of an exon [23,24,25]. The closed circular structure of circRNAs confers them remarkable stability and resistance to the treatment of RNase R, an exonuclease that degrades linear RNA molecules [19, 21]. The expression of circRNAs is shown to be tissue and cell-type specific although the regulation of circRNA expression remains to be further clarified [19, 26]. The compelling features of circRNAs including high abundance, remarkable stability, and tissue-specific expression enable them to be promising targets in the detection of biomarkers.

Since pre-mRNAs from a single gene can be alternatively spliced into linear and circular transcripts, we postulated that the inclusion of circRNAs in mRNA profiling could enhance the detection sensitivity and stability of biomarkers in forensic body fluid identification. In this study, we investigated the circular transcripts of peripheral and menstrual blood-specific biomarkers and evaluated the sensitivity and stability of biomarkers with the inclusion of circRNAs in mRNA profiling.

Materials and methods

Preparation of samples

All fresh body fluid samples were collected from healthy volunteers. Peripheral blood samples were collected by venipuncture into tubes containing an anticoagulant. Each bloodstain was prepared by pipetting 100 μl of peripheral blood on a piece of sterile gauze. For environmental challenge, bloodstains were placed under a clump of bushes which provided a humid condition and the protection from loss, for 1, 4, 8, 13, and 18 days, respectively, and three samples were collected every period. The average ambient temperature was approximately 10 with 17 °C for the highest and 2 °C for the lowest. Freshly ejaculated semen in sterile tubes was used to prepare dried stains on sterile filter papers at room temperature. Aged peripheral bloodstains from different donors on sterile filter papers were stored at room temperature in a dark, dust-free, and non-humid place for approximately 6 to 7 years. Menstrual blood samples on sanitary napkins were collected from the first to the third day of the menstrual cycle and dried at room temperature followed by preparation as 1 × 1 cm2 pieces. For artificial thermal treatment to eliminate RNA, menstrual bloodstains in sterile tubes were placed at 37 °C for 1, 4, 8, 13, and 18 days, respectively, and three samples were collected every period. After collection, entire stains were cut into pieces followed by RNA extraction or storage at − 80 °C before use. In this study, none of the samples was treated with RNA stabilization reagents. All procedures were approved by the ethics committee of Shanghai Medical College, Fudan University, and all donors volunteered for this study based on informed consent.

RNA preparation

Total RNAs were isolated in a final volume of 40 μl of elution buffer using an RNeasy® Mini Kit (Qiagen, Germany) according to the manufacturer’s instructions and were quantified using a NanoDrop ND-2000 spectrophotometer (Thermo Scientific, USA). The removal of potential genomic DNA was performed with RNase-Free DNase Set (Qiagen, Germany) according to the manufacturer’s instructions. The removal of linear transcripts was carried out with the treatment of RNase R (Epicentre, USA) at 37 °C for 4 h with a dose of 3 U per 1 μg of RNA [21]. In the sensitivity test, the indicated dilution series of input total RNA were prepared followed by reverse transcription. The mixture of total RNA from peripheral or menstrual blood with that from semen stains was prepared at a concentration ratio of 1:100 followed by reverse transcription.

Reverse transcription

Random hexamers were used to synthesize cDNA in a final volume of 20 μL using a Transcriptor First Stand cDNA Synthesis Kit (Roche, USA) according to the manufacturer’s instructions and each reaction contained 1 ng, the indicated amounts, or the indicated volumes of total RNA. Reverse transcription (RT) minus controls without reverse transcriptase were used to rule out potential contamination of genomic DNA.

Primer design

The mRNA markers for peripheral and menstrual blood were acquired from previous reports [27, 28]. The circRNAs of δ-aminolevulinate synthase 2 (ALAS2) and hemoglobin alpha (HBA) were obtained by the analysis of previous sequencing data with TopHat-Fusion and CIRCexplorer [23, 26]. Primer sets were designed using Primer Premier v5.0 (Premier Biosoft, USA) [29]. Outward-facing primer sets, of which the binding sites of forward primers in the coding sequence are located downstream from the binding sites of reverse primers (Fig. S1A), were developed for detecting circular transcripts of ALAS2, HBA, matrix metalloproteinase 7 (MMP7), and matrix metalloproteinase 11 (MMP11). Conventional primer sets, of which the binding sites of forward primers in the coding sequence are located upstream from the binding sites of reverse primers, were developed based on exons shared by linear and circular transcripts (LC-primers, Fig. S1B) for simultaneously detecting linear and circular transcripts of ALAS2 or MMP7. Reported primer sets for the detection of ALAS2 and MMP7 were acquired from previous reports [28, 30]. Reported primer sets appeared to mainly amplify linear transcripts after the investigation of circRNAs of ALAS2 and MMP7 and thus were designed as L-primers in this study. For obtaining products of full-length coding sequences of ALAS2 and MMP7, primer sets, designed as FL-primers, were developed. FAM-labeled or unlabeled primer sets were synthesized in Sangon Biotech., Shanghai. The primer sequences and concentrations used in this study as well as product sizes are listed in Table S1.

PCR amplification

In singleplex PCR, 25 μl of reaction mix contained 1 μl of cDNA or equal amount of sterile water as the non-template control, 2 μl of dNTPs (Takara, Japan), 2.5 μl of 10× PCR buffer, and 1 U AmpliTaq Gold DNA polymerase (Thermofisher, USA). After an initial denaturation at 94 °C for 2 min in a Mastercycler® nexus GSX1 (Eppendorf, USA), amplification was carried out at the following conditions: denaturation at 94 °C for 30 s, annealing at the temperature indicated in Table S1, extension at 72 °C for 1 min, and a final extension at 72 °C for 10 min. The outward-facing primer sets and 34 cycles of PCR amplification were used for the detection of circular transcripts. FL-primers and 31 cycles of PCR amplification were used for obtaining products of full-length coding sequences. LC-primers or L-primers as well as 28 cycles of PCR amplification were used for characterizing the mRNA profiling or the evaluation of primer sets by capillary electrophoresis (CE). PCR products of circRNAs were separated by agarose gel electrophoresis (AGE) followed by purification and the junctions of head-to-tail products were determined by Sanger sequencing. PCR products of full-length coding sequences were separated by AGE followed by purification and quantification.

Capillary electrophoresis and profile analysis

PCR products obtained using LC-primers and L-primers were detected by CE using POP-7 on an ABI PRISM 3130xL Genetic Analyzer (Thermofisher, USA). Briefly, 1 μl of PCR products, 9 μL of HiDi formamide (Thermofisher, USA), and 1 μL of CC5 Internal Lane Standard 500 (Promega, USA) were mixed. Then, samples were denatured at 95 °C for 5 min followed by snap cooling on ice for 5 min. The electrophoresis conditions included a 10-s injection time, 2-kV injection voltage, 15-kV run voltage, and 30-min run time at 60 °C. Profiles were analyzed using GeneMapper ID software v3.2 and signals equal to or above 100 RFU were interpreted as positive results. Negative controls did not show amplification signals. For samples with negative results in the sensitivity and stability analysis, PCR amplification using LC-primers and L-primers were replicated followed by CE. Biomarkers were considered to have positive expression if signals were equal to or above 100 RFU.

Evaluation of primer sets

The copy numbers of full-length ALAS2 and MMP7 per microliter of elution buffer were calculated based on the concentration quantified using a NanoDrop ND-2000 spectrophotometer and the molecular weights (325 Da as the average molecular weight of dNMP), followed by the preparation of the dilution series of full-length ALAS2 and MMP7 DNA. The efficiency of LC-primers and L-primers in PCR amplification was compared by PCR-CE assay and quantitative PCR (qPCR). qPCR was performed using QuantiNova™ SYBR® Green PCR Kit (Qiagen, Germany) in a final reaction volume of 10 μl. The reaction mix with LC-primers or L-primers was run on an ABI Prism 7500 fluorescence quantitative PCR instrument (Thermofisher, USA) according to the conditions consisting of an initial denaturation at 95 °C for 2 min, followed by 40 cycles of 5 s at 95 °C, and 34 s at 60 °C. The Ct values were calculated using SDS software with an automatic baseline and a threshold of 0.2. Then, standard curves were constructed.

Results

Identification of circRNAs of peripheral and menstrual blood markers

Analysis of previous sequencing data showed that ALAS2 and HBA, two peripheral blood-specific biomarkers, had circular transcripts (Fig. 1a and Fig. S2A). Electrophoresis of PCR products from amplification using outward-facing primer sets of ALAS2 and HBA showed clear bands in all four samples (Fig. S2B and C), which validates the expression of circular transcripts of ALAS2 and HBA. The covalent junction in circular transcripts of ALAS2 and HBA was further revealed by sequencing (Fig. 1b, Fig. S2D and E).

Fig. 1
figure 1

Identification of ALAS2 and MMP7 circRNAs. a Three putative circRNAs of ALAS2 revealed by bioinformatics analysis are indicated by the triangles. The arrows indicate the binding sites of primers and the asterisk indicates the transcript identified in (b). b The junction sequence of the head-to-tail ALAS2 circRNA detected in Fig. S2B. The arrow indicates the junction site. c Two identified circRNAs of MMP7 by sequencing PCR products in this study. The arrows indicate the binding sites of primers. The asterisk indicates the transcript identified in (d) and the cross indicates the circRNA with a low expression level. d The junction sequence of the head-to-tail MMP7 circRNA detected in Fig. S2F. The arrow indicates the junction site

The expression of circRNAs from two menstrual blood-specific biomarkers, including MMP7 and MMP11, was subsequently investigated using outward-facing primer sets. As shown in Fig. S2F, PCR products from circular transcripts of MMP7 could be clearly observed in all four samples. The covalent junction in the circular transcript of MMP7 was further revealed by sequencing (Fig. 1c, d). Additionally, other outward-facing primer sets based on the combination of exons were used to further determine potential circular transcripts of MMP7. Results from endpoint PCR and sequencing revealed another circular transcript of MMP7 that might have a low expression level (Fig. 1c and data not shown). In contrast, we detected no products from the backsplicing of MMP11 pre-mRNA using a set of outward-facing primer sets based on the combination of exons (data not shown).

Evaluation of primer sets in the detection of biomarkers

ALAS2 and MMP7 were used for the next investigation in this study and HBA was excluded due to its high level of expression. LC-primers were used for simultaneous detection of linear and circular transcripts of ALAS2 and MMP7 (Fig. 2a, b). The investigation of circRNAs of ALAS2 and MMP7 revealed that L-primers from previous reports mainly amplify linear transcripts (Fig. 2a, b). The PCR amplification using LC-primers and L-primers was carried out on the dilution series of full-length double-stranded DNA of ALAS2 and MMP7. Although the amplification using L-primers of ALAS2 seemed to be lower than that with LC-primers of ALAS2, the amplification using L-primers of ALAS2 on one copy could obtain significantly detectable signals in CE (Fig. 2c). In fact, results from the dissociation curve of L-primers of ALAS2 might imply the generation of primer dimers in qPCR (Fig. S3A and B). Compared with amplification using L-primers of MMP7, the lower amplification signals using LC-primers of MMP7 might be due to longer PCR products in endpoint PCR (Fig. 2d). In qPCR, the amplification efficiencies of L-primers and LC-primers of MMP7 were comparable (Fig. S3B).

Fig. 2
figure 2

Evaluation of primer sets in the detection of biomarkers. a The horizontal lines cover the regions L-primers (lower) and LC-primers (upper) of ALAS2 amplify. b The horizontal lines cover the regions L-primers (lower) and LC-primers (upper) of MMP7 amplify. c Detection of ALAS2 using L-primers and LC-primers of ALAS2 by PCR-CE on the indicated copy number of double-stranded DNA. Data represent mean ± SD from four replicate samples. d Detection of MMP7 using L-primers and LC-primers of MMP7 by PCR-CE on the indicated copy number of double-stranded DNA. Data represent mean ± SD from four replicate samples

The detection sensitivity of biomarkers with the inclusion of circRNAs

Sensitivity testing of ALAS2 and MMP7 was assessed in a quantitative approach (input total RNA). A dilution series of manually extracted total RNA from different individuals (0.2–0.003 ng) was reversely transcribed. PCR amplification was performed using L-primers and LC-primers, respectively, and results are summarized in Table 1. The detection limits of ALAS2 and MMP7 determined using L-primers were up to 0.012 and 0.025 ng of input total RNA, respectively (Table 1). In contrast, ALAS2 and MMP7 could be detectable with as little as 0.003 ng of input total RNA using LC-primers in all tested samples (Table 1). Additionally, in contrast to L-primers, the amplification using LC-primers on mixtures of total RNA from peripheral or menstrual blood with that from semen at a concentration ratio of 1:100 produced considerably higher peaks (Fig. 3a, b; Fig. S4A and B). This is probably correlated with a higher number of target molecules by the inclusion of circRNAs.

Table 1 Detection sensitivity of ALAS2 and MMP7 in the serial dilution analysis using L-primers and LC-primers. The reverse transcription was performed using the indicated amounts of total RNA and 1 μl of RT products was used for 28 cycles of PCR amplification followed by CE. The number of samples in which the biomarker was detected out of the total number of samples tested is indicated
Fig. 3
figure 3

The detection sensitivity of ALAS2 and MMP7 with the inclusion of circRNAs. (A, B) L-primers and LC-primers of ALAS2 (a) and MMP7 (b) were used, respectively, for PCR amplification on the mixtures of total RNA from peripheral (a) or menstrual (b) blood with that from semen at a ratio of 1:100. The electrophoretogram represents one out of triplicate samples. c One representative electrophoretogram from Table 2 and the arrow indicates the low amplification signal of MMP7 using L-primers

Previous studies have shown that the mRNA expression level of MMP7 fluctuates over the days of menstruation in individuals [31]. Results from amplification using L-primers showed that the mRNA expression of MMP7 was negative in some samples (Table 2 and Fig. 3c). Although the simultaneous detection of linear and circular transcripts showed a low expression of MMP7 in linear mRNA-negative samples, the detection of MMP7 could be considered as positive results in mRNA profiling (Table 2 and Fig. 3c), which might result from the increase of targets by the inclusion of circular transcripts.

Table 2 Detection of MMP7 expression in menstrual bloodstains using L-primers and LC-primers

The stability of biomarkers with the inclusion of circRNAs

To determine the stability of biomarkers with the inclusion of circRNAs in the detection of artificially degraded samples, total RNA from fresh stains were treated with RNase R followed by reverse transcription. PCR amplification using RT products of intact total RNA as templates showed significantly detectable amplification signals of ALAS2 and MMP7 (Fig. 4a, b). However, treatment with RNase R resulted in undetectable amplicons of ALAS2 and MMP7 when amplification was performed using L-primers (Fig. 4c, d). In contrast, amplification using LC-primers demonstrated detectable amplicons of ALAS2 and MMP7 in RNase R-treated samples (Fig. 4c, d).

Fig. 4
figure 4

The stability of ALAS2 and MMP7 with the inclusion of circRNAs. a, b The detection of ALAS2 in peripheral bloodstains (a) and MMP7 in menstrual bloodstains (b) using L-primers or LC-primers. c, d The detection of ALAS2 in RNase R-treated total RNA from peripheral bloodstains (c) and MMP7 in RNase R-treated total RNA from menstrual bloodstains using L-primers or LC-primers (d). Reverse transcription was performed using 2 ng of total RNA treated with RNase R and 1 μl of RT products was used for 28 cycles of PCR amplification. e, g The detection of ALAS2 in 13-day-old (e) and 18-day-old (f) peripheral bloodstains using L-primers or LC-primers. Reverse transcription was performed using 4 μl of total RNA and 1 μl of RT products was used for 28 cycles of PCR amplification. f, h The detection of MMP7 in 13-day-old (f) and 18-day-old (h) menstrual bloodstains using L-primers or LC-primers. Reverse transcription was performed using 4 μl of total RNA and 1 μl of RT products was used for 28 cycles of PCR amplification. Each electrophoretogram represents one out of three samples. The font of scales was enlarged due to the low resolution of original figures

Peripheral bloodstains were exposed to environmental conditions and menstrual bloodstains were subjected to artificial thermal treatment for different periods. In 1- to 8-day-old samples, amplification using L-primers and LC-primers could obtain detectable amplicons of ALAS2 and MMP7 (data not shown). In 13- and 18-day-old samples, weak or no amplification signals of ALAS2 (Fig. 4e, g; Fig. S5A and B) and MMP7 (Fig. 4f, h; Fig. S5C and D) were observed when performing amplification using L-primers. In contrast, there were detectable amplicons of ALAS2 (Fig. 4e, g; Fig. S5A and B) and MMP7 (Fig. 4f, h; Fig. S5C and D) in 13- and 18-day-old samples when performing amplification using LC-primers.

Aged dried bloodstains stored at room temperature were used to compare the detection of ALAS2 using L-primers and LC-primers and results are summarized in Table 3. There were no detectable amplification signals in mRNA profiling when performing amplification using L-primers. However, amplicons could be observed by CE when performing amplification using LC-primers. These results suggest that the inclusion of circRNAs could facilitate the detection of ALAS2 in aged samples.

Table 3 Detection of ALAS2 in aged casework bloodstains using L-primers and LC-primers. The reverse transcription was performed using 4 μL of total RNA and 1 μl of RT products was used for 28 cycles of PCR amplification followed by CE

Discussion

The limited quantity and degradation of casework samples are challenging the current methods for forensically relevant body fluid identification. The development of sensitive and stable biomarkers as well as technical improvements in the detection of biomarkers can facilitate the identification of body fluids. As circRNAs and mRNAs are products of pre-mRNA splicing, the backsplicing of pre-mRNAs indeed decreases the expression level of mRNAs in cells. Although not all biomarkers include the expression of circRNAs, the discard of stable transcripts in the detection of biomarkers with the expression of circRNAs might underestimate the sensitivity and stability of biomarkers. Published primer sets do not appear to amplify the major circRNAs of ALAS2 and MMP7 identified in this study [27, 28, 30, 32,33,34,35]. Therefore, the selection of regions as targets requires more attention for improving the detection of biomarkers. In this study, we performed the mRNA profiling with the inclusion of circRNAs in the detection of biomarkers. Results from our investigation on the detection sensitivity and stability of biomarkers using L-primers and LC-primers show that the utility of regions shared by linear and circular transcripts as targets could improve the detection of biomarkers in mRNA profiling.

Pre-mRNA splicing is essential for eukaryotic gene expression, which can be regulated to produce multiple mRNA molecules from a single gene. A number of alternative splicing events are shown to be tissue-specific, which plays an important role in cellular differentiation and organismal development. Similarly, alternative backsplicing of pre-mRNA was recently revealed to generate multiple circRNAs from a single gene within cells [19, 20]. Since the mechanisms of backsplicing remain elusive, it is difficult to predict the combination of exons circRNAs employ, which is crucial to the purposeful design of primer sets in the detection of circRNAs. Currently, the expression profiles of circRNAs in forensically relevant body fluids are not available, but biochemical approaches can be applied to determine circRNAs from genes of interest [29]. In this study, circRNAs of ALAS2 and MMP7 were revealed using outward-facing primer sets for amplification and the head-to-tail junctions of circRNAs were further determined by sequencing. Interestingly, some circRNAs consisted of incomplete exons (Fig. 1d and Fig. S2D). It is possible that the rule of backsplicing is different from that of alterative splicing.

The expression of genes is tightly regulated in cells. As a consequence, most of mRNA molecules have a tissue-specific expression pattern. Previous studies show that the expression of circRNAs is abundant and cell-type specific in thousands of human genes. In some cases, the expression of circRNAs even exceeds the abundance of traditional linear transcripts [20]. However, the expression levels of different circRNAs from a single gene might be regulated in cells since results from endpoint PCR demonstrated one major circular transcript of ALAS2 and MMP7, respectively (Fig. S2B and C). In fact, different expression levels of circRNAs from a single gene were observed in previous reports [19, 23]. Furthermore, the backsplicing of circRNAs is shown to compete with pre-mRNA splicing [36]. Therefore, simultaneous detection of linear and circular transcripts indeed fully utilizes transcripts of biomarkers in mRNA profiling. Apart from an increased level of targets by the inclusion of circRNAs in mRNA profiling, the detection sensitivity of biomarkers is susceptible to the amplification efficiency of primer sets. As the amplification efficiency of primer sets has a substantial impact on the evaluation of the Ct value in qPCR, it is difficult to conduct a comparison in the detection of biomarkers by qPCR using L-primers and LC-primers. Furthermore, the △Ct value is theoretically small even though circRNAs account for approximately 40% of all transcripts from a single gene, which might result in an inaccurate evaluation. In this study, since the PCR amplification using L-primers and LC-primers on one copy of double-stranded DNA obtained significantly detectable signals in CE, the improvement in the detection sensitivity of biomarkers might be mainly attributed to the increased level of targets. Therefore, the inclusion of circRNAs in mRNA profiling could help in the detection of biomarkers. However, it should be noted that some RNA markers in potential non-target tissues might have quite a weak expression and the inclusion of circRNAs in the detection of these markers might result in a positive interpretation in non-target tissues, which will lower the specificity of biomarkers. Therefore, if circRNAs are included in forensic body fluid identification, the specificity of biomarkers with the expression of circRNAs, such as ALAS2 and MMP7, still requires to be further verified.

Forensic casework samples are often subjected to environmental insults, such as UV, humidity, heat, and enzymes. Apart from the expression level of biomarkers, the stability of transcripts is crucial to the identification of degraded or aged biological stains in mRNA profiling. The structure of RNA molecules and RNA-binding partners might have important roles in the maintenance of their stability in cells [37]. CircRNAs have been shown to have a longer decay half-life than mRNAs in cells, which might be due to the lack of free ends. In this study, our results showed that the inclusion of circRNAs in mRNA profiling could improve the detection of ALAS2 and MMP7 in samples with artificial treatment and environmental insults. Although we did not quantify the remnants of mRNAs and circRNAs in the degraded and aged samples, PCR-CE technology can reveal a difference in the copy numbers of targets since the singleplex PCR amplification using L-primers and LC-primers is similarly efficient. The unsuccessful detection of ALAS2 and MMP7 using L-primers might result from targets with a copy number that is less than the detection limit of mRNA profiling. The improvement in the stability of biomarkers might result from the increased level of targets and the inclusion of stable transcripts.

In conclusion, the inclusion of circular transcripts in mRNA profiling might be an efficient strategy for improving the detection sensitivity and stability of biomarkers in body fluid identification. Subsequent studies should determine the expression profile of circRNAs in all of forensically relevant body fluids and characterize a set of biomarkers with the expression of circRNAs, which might help with the development of more robust multiplex assays for body fluid identification.