Introduction

As the third most common cancer in men and the second one in women, colorectal cancer (CRC) accounts for approximately 8 % of all the cancer-related deaths [1]. The overall incidence of CRC is 5 % in the general population, and the 5-year survival rate ranges from 40 to 60 % [2]. In spite of advances in the CRC treatment, early diagnosis is still really crucial to improve the survival rate of cancer patients. Early detection of CRC depends on traditional screening technology, but with several defects. For instance, although colonoscopy is regarded as gold-standard test for CRC, it has a bad experience [3]. The diagnostic method of fecal occult blood test (FOBT) is accepted widely, but has a low sensitivity and specificity. For the method of fecal immunochemical tests (FIT), it is proven to be more sensitive in CRC, but the sensitivity is not high enough to detect CRC [4]. The fecal DNA and messenger RNA (mRNA) testing are non-invasive techniques, but they need more optimization to be applied in the clinical examination [5]. Therefore, it is necessary to explore new efficient and non-invasive methods for CRC detection.

MicroRNAs (miRNAs) are small non-coding RNAs (18–25 nucleotides long), and regulate the gene expressions by targeting the mRNA post-transcriptionally and binding mRNA in the 3’ untranslated regions (UTR) [6]. Further studies demonstrate that miRNAs play crucial roles in cell proliferation, differentiation, and apoptosis [7]. Increasing evidences show the different expression profile of miRNAs in blood was observed in the patients who have prostate cancer, lymphoma, ovarian cancer, lung cancer, gastric cancer, or CRC. In addition, miRNAs in feces have also been tested as an indicator to detect the CRC non-invasively, and a number of miRNAs have been reported to be closely associated with CRC, such as miR-106a, miR-143, miR-145, and miR-9 [3]. Therefore, the miRNAs in blood or feces have great potential to be biomarker for CRC diagnosis.

There are several researches that evaluate the feasibility of miRNA as biomarkers for CRC detection. An overexpression of miR-221 is found in cancer patients’ blood, which implies that miR-221 may play an important role in tumorigenesis. Therefore, it is possible to use miR-221 as a biomarker to diagnose CRC. An increasing number of miRNAs, like miR-21 miR-129-3p, miR-767-3p miR-877, miR-60, miR-760, and miR-92a, etc., are reported to have similar effect like miR-221. However, most of the researches were performed on a relatively small sample. Thus, we conduct a comprehensive meta-analysis of miRNAs for CRC diagnosis based on the former publications.

Materials and methods

Literature search strategy

We performed a meta-analysis according to diagnostic meta-analysis guidelines. Eligible studies were retrieved from several online public databases, including PubMed, Embase, Google Scholar, the Cochrane Library, and the Chinese National Knowledge Infrastructure (CNKI), up to July 13, 2014. The search terms include: (“microRNAs” or “miRNA” or “miR”) and (“colorectal neoplasms” or “colorectal cancer” or “colorectal malignancy”) and (“diagnosis” or “sensitivity” or “specificity” or “ROC curve”). In addition, the reference lists of relevant reviews were manually searched to obtain additional articles.

Publications inclusion and exclusion criteria

Studies qualified to be included have to fulfill the following inclusion criteria: (1) studies related to diagnostic value of miRNAs for CRC; (2) studies using the gold standard to confirm the diagnosis of CRC patients; and (3) studies providing sufficient data to construct two-by-two tables, including true positive (TP), false positive (FP), true negative (TN), and false negative (FN). Exclusion criteria were (1) studies not related to the diagnostic values of miRNAs for CRC; (2) studies with duplicate data from other studies; and (3) studies in the form of letters, editorials, case reports, or reviews.

Data extraction

The following data were extracted from all eligible studies using a standardized form: (1) basic characteristics of publications, name of the first author, year of publication, country of publication, ethnicity, sample sizes, age, gender ratio, and specimen types included; (2) diagnostic outcomes, including sensitivity, specificity, TP, FP, FN, and TN, respectively.

Quality assessment

Each article was evaluated systematically by reviewers according to updated Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) criteria [8]. This score system is comprised of four basic ingredients: patient selection, the index test, the reference standard, and flow and timing. There are seven questions required to be answered with “yes”, “no”, or “unclear”. The answer “yes” gets one score, while the answer “no” or “unclear” answers are given zero score. All questions have equal weight that the maximum score is 7.

Statistic analysis

All analyses were performed using the STATA 12.0 software. We calculated the pooled sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR) of all included publications with bivariate meta-analysis model. Since the area under the SROC curve (AUC) was a pooled evaluation index of the diagnosis, it was calculated by applying the sensitivity and specificity of each publication to plot the summary receiver operator characteristic (SROC) curve [9]. Heterogeneity among included studies was evaluated by the Q test and I 2 statistic [10]. If the P value was smaller than 0.05 for the Q test or the I 2 value ≥ 50 %, it means that there is significant heterogeneity, and then the random-effects model should be applied. In addition, we also conducted subgroup and meta-regression analyses to explore potential sources of between-study heterogeneity. The Fagan’s nomogram was used to estimate the post-diagnostic effect after pooled analysis [11]. In addition, we performed Deek’s funnel plot asymmetry test to evaluate the publication bias, P < 0.10 imply significant publication bias [12].

Results

Literature search

The results of our literature search were listed in Fig. 1. The initial database search includes 397 potential relevant articles, of which 46 duplicates were excluded. Then, 351 study articles are left for further examination. After reviewing abstract and keywords, 138 letters, reviews, and meta-analysis were further excluded, the study subjects of 5 articles were not human, and 136 of them were not related to our research. Then, 72 articles remain for the full-text review. After the full-texts review, another 43 studies were excluded, including 14 studies irrelevant to diagnosis, 19 studies without sufficient data, 5 studies about metastatic disease, and other 5 studies with duplicate data. Finally, 29 eligible articles were included in our meta-analysis, including 19 articles about blood-based miRNAs assays [1331], and 10 articles about feces-based miRNAs assays [3, 5, 3239].

Fig. 1
figure 1

Diagram of publications selection process

Basic characteristics of included studies

The characteristics of the included studies are shown in Table 1. Finally, 80 trials from 29 articles were included in the present meta-analysis, including 55 single-miRNA assays, and 25 multiple-miRNA assays. The participants of the selected articles are Asian and Caucasian. The number of total CRC patients was 2626, and the number of all healthy controls was 2017. The specimens used in these 29 articles including serum, plasma, blood, and feces, which can be categorized into two general class of specimens, “blood” and “feces”. The expression levels of miRNA are measured by the real-time reverse transcription polymerase chain reaction (RT-PCR) in most of the selected articles. The qualities of these articles were assessed with QUADAS-2 system, and the assessment results, which were shown in Fig. 2, turned out to be moderate to high.

Table 1 Main characteristic of the included articles in the meta-analysis
Fig. 2
figure 2

Quality assessments of included publications

Diagnostic accuracy of miRNA for CRC

There is significant heterogeneity among the sensitivity and specificity results in the 80 studies of the 29 articles. For the sensitivity, the I 2 was 81.96 % (95 % CI, 78.14–85.78), and Q value was 376.54 (P < 0.01); for the specificity, the I 2 was 83.21 (95 % CI, 79.72–86.69), and Q value was 404.91 (P < 0.01). So, the random-effects model was selected in the next steps. The pooled estimates results are displayed in Table 2. The assessment results for overall studies are presented as follows: sensitivity, 0.74 (95 % CI, 0.70–0.77); specificity, 0.81 (95 % CI, 0.77–0.84); PLR, 3.9 (95 % CI, 3.2–4.7); NLR, 0.32 (95 % CI, 0.28–0.37); DOR, 12 (95 % CI, 9–16); and AUC, 0.84 (95 % CI, 0.81–0.87), suggesting a moderate diagnostic accuracy. The likelihood ratio was moderate as well as post-test probabilities, shown in the Fagan’s plot (Fig. 3). Likelihood ratios (LRs) provide a pooled effect by combining the sensitivity and specificity. In detail, for any people with a pre-test probability of 25 % to have CRC, if a positive test was given by miRNA assay with PLR value at 3, the post-test probability to have NDs would rise to 53 %; by contrast, for the same people, a negative result of miRNA assay would lower the post-test probability to 10 %. Hence, miRNA assay may play an important role as initial screening method for CRC.

Table 2 Summary estimates of diagnostic criteria for miRNAs profiling in CRC detection
Fig. 3
figure 3

SROC curve with pooled assessments of sensitivity, specificity, and AUC (a SROC curve of Caucasian-based ethnicity; b SROC curve of Asian-based ethnicity; c SROC curve of single-miRNA profiling; d SROC curve of multiple-miRNA profiling)

Subgroup analysis

The pooled results for diagnostic accuracy in different subgroups were displayed in Table 2. There are three subgroups, and the basic criteria for clarification were ethnicity, miRNA profiling (single- or multiple-miRNA), and sample types, respectively. For the ethnicity subgroups, the sensitivity, specificity, PLR, NLR, and DOR of Caucasian-based miRNA assays were 0.66 (95 % CI, 0.59–0.73), 0.74 (95 % CI, 0.69–0.79), 2.6 (95 % CI, 2.0–3.3), 0.45 (95 % CI, 0.35–0.59), and 6 (95 % CI, 3–9), respectively, with AUC of 0.77 (95 % CI, 0.73–0.80) (Fig. 4a). The diagnostic sensitivity, specificity, PLR, NLR, and DOR of Asian-based miRNA assays were 0.77 (95 % CI, 0.73–0.81), 0.84 (95 % CI, 0.79–0.88), 4.8 (95 % CI, 3.7–6.1), 0.28 (95 % CI, 0.24–0.33), and 17 (95 % CI, 13–24), respectively, with AUC of 0.87 (95 % CI, 0.84–0.89) (Fig. 4b). It seemed to be that the diagnostic accuracy of Asian-based miRNA assays might be higher than Caucasian-based miRNA assays.

Fig. 4
figure 4

Forest plots of multivariable meta-regression analyses for sensitivity and specificity

For the diagnostic accuracy of single-miRNA assays, the sensitivity was 0.69 (95 % CI, 0.64–0.74), the specificity was 0.80 (95 % CI, 0.75–0.84), the PLR was 3.5 (95 % CI, 2.7–4.4), the NLR was 0.38 (95 % CI, 0.33–0.45), the DOR was 9 (95 % CI, 6–13), and the AUC was 0.77 (95 % CI, 0.73–0.80) (Fig. 4c). For multiple-miRNA assays, the sensitivity was 0.82 (95 % CI, 0.79–0.85), the specificity was 0.83 (95 % CI, 0.78–0.86), the PLR was 4.7 (95 % CI, 3.7–6.0), the NLR was 0.21 (95 % CI, 0.18–0.26), the DOR was 22 (95 % CI, 15–31), and the AUC was 0.89 (95 % CI, 0.86–0.92) (Fig. 4d). The data shows multiple-miRNA assays were more accurate than single-miRNA assays for CRC diagnosis.

Similarly, for the subgroups of specimens, the blood-based miRNA assays exhibit a better performance to feces-based miRNA assays, with the AUCs of 0.84 versus 0.82. However, in terms of DOR, the feces-based miRNA assays of 15 (95 % CI, 8–31) had a better performance in CRC diagnosis than blood-based assays of 11 (95 % CI, 8–16). In addition, blood-based assays had a higher diagnostic sensitivity of 0.76 (95 % CI, 0.71–0.80) than feces-based assays of 0.68 (95 % CI, 0.60–0.75); on the opposite, feces-based assays had a higher diagnostic specificity of 0.88 (95 % CI, 0.78–0.93) than blood-based assays of 0.78 (95 % CI, 0.75–0.81). In conclusion, blood-based assays, with a lower specificity, were more sensitive than feces-based assays in CRC diagnosis.

Meta-regression and robustness analyses

Meta-regression analysis was performed to further explore potential sources of the heterogeneity and to testify the results of subgroup analysis (Fig. 4). For the sensitivity, Asian (P < 0.01), multiple-miRNA assay (P < 0.001), and blood-based specimen (P < 0.001) were found as the main factors that contributed to the heterogeneity. For diagnostic specificity, Asian (P < 0.001), multiple-miRNA assay (P < 0.001), and blood-based specimen (P < 0.001) were also essential elements to affect the heterogeneity. The conclusion can be drawn that ethnicity, miRNA profiling, and types of specimen were significant factors affecting the heterogeneity of both the diagnostic sensitivity and specificity. Goodness of fit and bivariate normality analyses indicated that the random-effects bivariate model was robust for the pooled analyses. Eleven deviated studies were found in influence analysis, and eight outliers were discovered in outlier detection. Some duplicate outliers were found in the two different methods. In summary, there were 11 deviated studies that may affect the robustness of the meta-analysis. After excluding the deviated studies or outliers, the pooled diagnostic sensitivity, specificity, PLR, NLR, DOR, and AUC were 0.73 (95 % CI, 0.70–0.76), 0.78 (95 % CI, 0.75–0.81), 3.4 (95 % CI, 2.9–3.9),0.34 (95 % CI, 0.30–0.39), 10 (95 % CI, 8–13), and 0.82 (95 % CI, 0.79–0.86), respectively, without significant differences relative to overall analysis with outliers, indicating that our meta-analysis was robust.

Discussion

CRC is a kind of malignant disease, with increasing incidence worldwide. The low accuracy of early diagnosis and high prevalence of late diagnosis lead to high mortality rate. So, it is crucial to find an effective early-stage diagnostic method for CRC patients to prevent the cancer deterioration, thus the patients will have a greater chance to survive [40]. Since miRNAs are widely reported to be dysregulated in cancer patients, it is proposed to be a potential biomarker for CRC detection without any invasive procedures involved [41]. MiR-223 and miR-451, measured in the feces samples, appear to have an aberrant expression profile between the CRC patients and healthy people [39]. The significant difference of expression level was also discovered in miR-21, let-7g, miR-31, miR-92a, miR-181b, and miR-203. Thus, miRNAs could be an ideal biomarker for CRC detection with less invasion and more accuracy, as well as lower costs, compared with the other diagnostic methods [29].

However, most of these studies just focused on the diagnostic value of only one single miRNA or a combination of miRNAs. Also, the different specimens such as blood, serum, plasma, or feces are analyzed among the selected specimens. Therefore, this meta-analysis was necessary to summarize and evaluate the accuracy of miRNA as biomarker based the included studies. The pooled sensitivity was 0.74 (95 % CI, 0.70–0.77), specificity was 0.81 (95 % CI, 0.77–0.84), and AUC was 0.84 (95 % CI, 0.81–0.87). Therefore, miRNAs are proven to be highly sensitive and specific diagnostic biomarkers. What is more, the DOR was 12 (95 % CI, 9–16), indicating that miRNA test-positive patients have 12-times higher chance of CRC than those healthy persons.

Our meta-analysis also has several interesting results. Firstly, it is more accurate to use miRNAs as CRC diagnostic biomarkers in the Asian group than in the Caucasian group. What is more, multiple-miRNA assay as biomarker is more accurate than single-miRNA assay. Therefore, multiple-miRNA assays tend to be given a more reliable diagnostic result, thus it is important to test multiple useful miRNAs to increase the credibility of results in clinical examination. According to the results in the individual study, panels of miR-203, −146b-5p, −215, −192, and -30e-3p (100 % sensitivity and 93 % specificity) in the Sheinerman el al. study and miR-21,-31, −92a, −181b, −203, and let-7 g (96.4 % sensitivity and 88.1 % specificity) in the Wang et al. study are the most predictive miRNA among all. Thirdly, CRC diagnosis with blood-based specimen is more sensitive but less specific than feces-based specimen.

To some extent, heterogeneity may undermine the reliability of this meta-analysis. Thus, in the beginning, we have tested the heterogeneity by a forest analysis (Fig. 3). According to the I 2 and Q value, a random-effects model was selected. Then, we performed subgroup analysis to detect the main factors that contributed to the heterogeneity. As a result, ethnicity, miRNA profiling (single or multiple miRNA), and specimen types are verified to be the main elements leading to heterogeneity. Also, eleven deviated studies were found that may contribute to heterogeneity. Then, we adopted these eleven studies and made a SROC curve again. However, the result without deviated studies was not significantly different from that with these deviated studies. It means that deviated studies had few impacts on our overall CRC meta-analysis.

Although there was existing meta-analysis that investigate the diagnostic value of miRNAs as biomarker to detect CRC, this present meta-analysis is the most comprehensive one. Only 13 studies were included in the previous meta-analysis by Zhou et al. (2013), with the pooled sensitivity of 0.81 (95 % CI, 0.79–0.84), specificity of 0.78 (95 % CI, 0.75–0.82), and AUC of 0.89 [42]. And their subgroup analysis was conducted based on the specimen only. However, a relatively more comprehensive analysis with 16 publications was performed in our meta-analysis. Apart from the specimen, the subgroup analysis of our study was also conducted based on the ethnicity and miRNA profiling. A conflict of result was found between the meta-analysis of Zhou et al. (2013) and ours. Zhou et al. (2013) had a conclusion that blood-based miRNA tests were more accurate than feces-based tests. However, our result was implied that blood-based CRC diagnosis had a higher sensitivity but lower specificity than feces-based diagnosis. In addition, in our present meta-regression, three main factors were found to contribute to the heterogeneity, including ethnicity, miRNA profiling, and specimen.

Despite our great efforts to perform an accurate analysis, there were also several limitations that should be noticed: (1) some useful publications may be missed in the process of retrieving and screening; (2) there was statistical heterogeneity in our analysis attributed to ethnicity, miRNA profiling, and specimen types; (3) no articles were found to report data relevant to African populations; and (4) the cut-off values were different or deficient in retrieved publications so that we did not extract the cut-off values.

In conclusion, we comprehensively assessed the value of miRNAs as biomarkers to detect CRC, and supported miRNAs as biomarkers in our meta-analysis. Moreover, the miRNA diagnostic analysis of CRC was more accurate in Asian populations than Caucasian populations. In addition, multiple-miRNA assay shows a higher accuracy than single-miRNA array.