Introduction

Ovarian cancer (OC) is the seventh most common cancer worldwide in the female with the highest mortality rate among gynecologic malignant tumors affecting the female reproductive system [1, 2]. According to available statistics, more than 180 thousand women die of ovarian cancer every year worldwide [3]. Histogenetically, OC is classified into three major subtypes, including epithelial, stromal, or germ cell tumors [4]. Approximately, 90% of ovarian cancers have been found to be epithelial ovarian cancer (EOC) subtypes [5]. Surgical resection and chemotherapy are the standard treatment options [6]. The International Federation of Gynecology and Obstetrics (FIGO) or Union for International Cancer Control (UICC) TNM classifications are referred to as the staging standard of ovarian cancer [7, 8]. Nevertheless, due to the vague and nonspecific symptoms and alike gastrointestinal, genitourinary, and gynecological findings, OCs are difficult to diagnose at early stage, and are often metastatic at the time of presentation, and are involved in a high likelihood of recurrence and poor prognosis [4, 9]. With regard to this, accurate preoperative evaluation, namely the differentiation of benign or malignant diseases, or the detection of nodal, peritoneal or distant diseases, is indispensable to achieve an optimized treatment schedule [10].

A biopsy of tumor tissue from surgery or imaged-guided needle aspiration, to date, is the reference standard to confirm the disease [11]. Nevertheless, this procedure is invasive and carries potential risks of unwanted and unpredictable complications [12, 13]. Transvaginal ultrasonography (TVUS) or abdominal contrast-enhanced computed tomography (CT) is the first imaging modality for the detection of OC [9, 14, 15]. TVUS is currently used for screening of OC and may be associated with discomfort and risk to the vagina [16]. In clinical setting, CT is the most commonly employed imaging method before the staging laparotomy [9]. However, the diagnostic value of CT is limited to depict tumor implants that are 1 cm or smaller the sensitivities of 25–50% in peritoneal metastases [17, 18]. Other modalities are being increasingly used in the management of ovarian cancer including magnetic resonance imaging (MRI), positron emission tomography/computed tomography (PET/CT), or PET alone [9]. MRI provides superior soft-tissue contrast resolution, and it can identify indeterminate lesions seen on CT without exposure to radiation [15, 19]. PET, using 2-Deoxy-2-[18F]fluoro-d-glucose (18F-FDG) as radiotracer, if especially combined with CT, is beneficial for the staging gynecological malignant tumors, particularly lymph node metastasis or recurrent tumors, as both functional and anatomical information are available [20,21,22,23]. Furthermore, 18F-FDG PET/CT is known to have a high sensitivity for detection of OC relapse with a reported pooled sensitivity of 89% when multidisciplinary standard of reference including histology, clinical and imaging follow-up have been utilized [24]. Moreover, reports demonstrated that the combination of 18F-FDG PET and MRI also provided both high anatomical and functional resolution, and it had shown acceptably superior diagnostic performance than 18F-FDG PET/CT in gynecologic malignancies [25, 26].

For decades, a great number of studies on the diagnostic performance of noninvasive imaging modalities (CT, MRI, PET, PET/CT, and PET/MRI) aforementioned have been performed. The corresponding results varied on account of the study design, sample size, baseline characteristics, type of disease, and etc. The aim of this meta-analysis was to provide a broad overview of the diagnostic value of noninvasive imaging methods in the detection of ovarian cancer, and to come up with more evidenced-based findings for decision and strategy making on a clinical basis.

Materials and methods

This study was performed and reported in accordance with the Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies (The PRISMA-DTA Statement) [27].

Database search and study selection

Three electronic databases including PubMed, Embase, and Ovid were systematically searched from the date of inception to 31st, March, 2022. Only records in English language were considered for potential inclusion. Moreover, the bibliographies of reviews and studies included were manually screened in order to retrieve additional studies that met the inclusion criterion of the current study. The following medical subject headings and search terms were used for the database search: “magnetic resonance imaging”, “positron emission tomography”, “computed tomography”, “MRI”, “CT”, “PET”, “PET/CT”, “PET/MRI” and “Ovarian cancer”. The records searched from the aforementioned databases were altogether exported into Endnote software (Version 9.3.3; Thomson Corporation, Stanford, USA) for duplicates removing and further screening. The processes with regard to the literature search and study screening were performed by two independent investigators. Any disagreement was assessed by a third reviewer until the final consensus was achieved.

Studies were included if they met all the following criteria: (1) studies used MRI, CT, PET, PET/CT, or PET/MRI as detection modalities for the presence of ovarian cancer, regardless of tumor types; (2) reference standard was explicitly documented in the study; (3) absolute numbers of patients with true positive (TP), false positive (FP), true negative (TN), false negative (FN) outcomes, or they can be derived based on the presented data. Animal studies, case reports, reviews, comments, conference abstracts or records without extractable data were excluded in this study.

Data extraction and quality assessment

The following information in each study enrolled were identified and extracted in a predesigned table: name of the first author, year of publication, country or district where the study was conducted, absolute number of participants, age, gender, prevalence of OC, patient selection (suspected or confirmed OC), type of OC (primary, relapsed, or metastatic), reference standard, modality used, variables including TP, FP, FN, TN, and analytical level (patient-based or lesion based). Two reviewers independently performed the data extraction, and discrepancies were addressed through discussion with a third reviewer.

The revised tool for the quality assessment of diagnostic accuracy studies 2 (QUADAS-2) was utilized to evaluate the methodological quality of each included study [28]. This 14-item tool is comprised of 2 components, namely risk of bias and applicability concerns. As for the assessment of risk of bias, 4 aspects including patient selection, index test, reference standard, and flow and timing were considered. Patient selection, index test, and reference standard were assessed, respectively, in the evaluation of applicability concerns. Each item was rated as ‘yes’, ‘no’, or ‘unclear’. Two independent authors scored the included studies using the Review Manager software (Version 5.3; Cochrane Collaboration, Oxford, UK). Disagreements were resolved by discussion with a third reviewer.

Statistical analysis

Based on the data extracted, pooled estimates for diagnostic indicators, such as sensitivity, specificity, positive likelihood ratio (+ LR), negative likelihood ratio (- LR), diagnostic odds ratio (DOR), and area under the curve (AUC) of summary receiver operating characteristic (SROC) with corresponding 95% confidence intervals (CIs) were synthesized using the random effects models. The I2 static tests were used to test heterogeneity between the studies, the value of I2 > 50% and p value < 0.05 were considered to be statistically significant heterogeneity [29, 30]. Furthermore, subgroup meta-analysis and meta-regression were performed to investigate the underlying source of heterogeneity [31,32,33]. We employed Deeks’ asymmetry test and funnel plot to detect potential publication bias in included studies [34]. The Stata software (Version 16.0; StataCorp, College Station, TX, USA) and R (Version 4.1.2; Comprehensive R Archive Network) were used for data.

Results

Baseline information on included studies

A total of 2406 citations were yielded through original literature search. Altogether 368 duplicates were removed using both Endnote software and manual identification. After title and abstract screening of the remaining 2038 records, 1943 articles were excluded. Ninety-five citations were reviewed in full text. Finally, 61 records (83 studies) including 4284 patients were deemed eligible for this study. The flow of literature search is displayed in Fig. 1. Enrolled citations included 18 prospective and 39 retrospective studies, 4 studies did not clarify their study design. The numbers of studies with regard to imaging modalities including CT, MRI, PET, PET/CT, and PET/MRI were 22, 11, 5, 43, and 2, respectively. The radiotracer used in PET imaging was 18F-FDG. Detailed information on the characteristics in provided in Supplementary Table 1. Most studies were rated as low risk of bias and low applicability concerns. High risk of bias or applicability concerns was not notified in included studies (Fig. 2 and Supplementary Fig. 1).

Fig. 1
figure 1

Flow of literature search

Fig. 2
figure 2

Methodological quality graph

Diagnostic performance of CT

Pooled estimates of sensitivity, specificity, + LR, -LR, DOR, and AUC of SROC with respective 95% CIs of CT on patient level were 0.83 (0.73, 0.90), 0.69 (0.54, 0.81), 2.7 (1.8, 4.0), 0.25 (0.17, 0.37), 11 (6, 19), and 0.84 (0.80, 0.87) with substantial heterogeneity among studies (I2 = 91.87% for sensitivity, I2 = 86.66% for specificity) (Table 1, Supplementary Fig. 2). On lesion level, the pooled sensitivity, specificity, + LR, -LR, DOR, and AUC of SROC with corresponding 95% CIs of CT were 0.69 (0.51, 0.83), 0.88 (0.73, 0.95), 5.8 (2.7, 12.2), 0.35 (0.22, 0.55), 17 (8, 36), and 0.86 (0.82, 0.89), respectively. Heterogeneity was detected (I2 = 93.73% for sensitivity, I2 = 96.84% for specificity) (Table 1, Supplementary Fig. 3). Results of meta-regression and subgroup analysis manifested insignificant affection of baseline characteristics on the pooled results (Table 2).

Table 1 Diagnostic performance of CT, MRI, and PET (PET/CT)
Table 2 Subgroup analysis of diagnostic performance of CT

Diagnostic performance of MRI

The overall sensitivity, specificity, + LR, -LR, DOR, and AUC of SROC with respective 95% CIs of MRI were 0.95 (0.91, 0.97), 0.81 (0.76, 0.85), 4.9 (3.8, 6.3), 0.07 (0.04, 0.12), 72 (36, 147), and 0.90 (0.87, 0.92) on patient level (I2 = 62.82% for sensitivity, I2 = 45.13% for specificity) (Table 1, Supplementary Fig. 4). The pooled estimates on lesion-based MRI were not performed due to limited number of studies. Five included studies used DWI examination, 3 used conventional MRI, and 2 studies utilized either conventional or contrast-enhanced MRI. Results of meta-regression showed significant impact of characteristics including study design, type of examination, Tesla level, age, and number of patients on the overall outcomes (Table 3).

Table 3 Subgroup analysis of diagnostic performance of MRI

Diagnostic performance of PET, PET/CT, PET/MRI

Results of studies investigating the diagnostic performance of PET/MRI were not pooled due to the limited number of studies included. The reported sensitivities and specificities were 91–97% and 86–87% in these studies. The overall sensitivity, specificity, SROC value of PET on patient level were 0.81 (0.71, 0.88), 0.81 (0.58, 0.93), and 0.82 (0.78, 0.85) (Table 4, Supplementary Fig. 5). Pooled estimates of sensitivity, specificity, and SROC value of PET/CT on patient level were 0.92 (0.88, 0.94), 0.88 (0.83, 0.92), and 0.96 (0.94, 0.97) (Table 4, Supplementary Fig. 6). The lesion-based overall sensitivity, specificity, and SROC value of PET/CT were 0.82 (0.71, 0.89), 0.94 (0.88, 0.97), and 0.95 (0.92, 0.96) (Table 4, Supplementary Fig. 7). Meta regression resulted in significant affection of study design, year of publication, and type of analysis on the pooled estimates of parameters.

Table 4 Subgroup analysis of diagnostic performance of PET (PET/CT)

Publication bias

Deeks’ funnel plot asymmetry tests yielded p values of 0.83, 0.77, 0.31, and 0.53 for CT, MRI, PET, and PET/CT studies (Fig. 3).

Fig. 3
figure 3

Deeks’ funnel plot asymmetry tests of CT, MRI, PET, and PET/CT studies. A Deeks’ funnel plot asymmetry tests of CT studies. B Deeks’ funnel plot asymmetry tests of MRI studies. C Deeks’ funnel plot asymmetry tests of PET studies. D Deeks’ funnel plot asymmetry tests of PET/CT studies

Discussion

Ovarian cancer remains one of the leading causes of mortality of gynecologic malignancies [35]. Metastases are often presented at the time of OC diagnosis, and high rate of relapse and poor prognosis are involved regardless of optimized management [36]. Timely and accurate detection of OC plays a significant role in treatment improvement and prognosis appraisal [21]. Imaging is critical for ovarian cancer management.

In the current study, we evaluated the most commonly utilized noninvasive imaging modalities in the detection of OC via the conduction of a meta-analysis. The diagnostic performance of CT, MRI, PET, along with hybrid imaging modalities, including PET/CT and PET/MRI were investigated. Results revealed that MRI manifested the highest overall sensitivity (0.95) and PET/CT showed the highest pooled specificity (0.88) on patient level. In general, PET/CT demonstrated the most superior diagnostic performance with an SROC value of 0.96 on a patient basis. Of note, only 2 PET/MRI studies were included in this study, the corresponding results were not meta-analyzed, sensitivities and specificities in these studies ranged from 91 to 97% and 86% to 87%, respectively. The combination of PET with CT or MRI can provide hybrid anatomical and functional imaging information so as to improve the detection accuracy [35, 37, 38]. Furthermore, results of meta-regression showed insignificant affection of baseline characteristics on the heterogeneity among CT studies. For MRI studies, study design, type of examination, Tesla level, age, and number of patients were detected to be potential sources of heterogeneity by meta-regression. Study design, year of publication, and the detection of metastatic OC were deemed as causes of heterogeneity among PET (PET/CT) studies. Due to limited number of covariates extracted from each enrolled study and insufficient numbers of studies in certain subgroups, meta-regression was performed based on available parameters, other possible sources of heterogeneity among studies should be further explored in the future.

Subgroup analyses were carried out to investigate the diagnostic value of these imaging modalities in specific subgroups. CT revealed the highest sensitivity in the detection of primary OC and showed superior specificity in metastases. High-field MRI produces better image quality compared with the lower field strength 1.5 T magnetic resonance MRI machines because it has a better signal-to-noise ratio (SNR) [39]. However, MRI performed on 1.5 T showed better diagnostic performance than on 3.0 T in this study. The probable reasons may be the differences on characteristics of study population, prevalence of OC, and the subjectivity of imaging reading between included studies. More well-designed relevant studies are warranted to further explore the impact of field strength on the diagnostic ability of OC. DWI examination demonstrated superior diagnostic ability than conventional or contrast-enhanced MRI. This combination of high contrast, millimeter level spatial resolution, robust performance, and short imaging time enables DWI to depict small tumor deposits in large volumes in a clinically timely manner [40]. PET/CT showed superior specificity but inferior sensitivity in the detection of metastasis compared to qualitative diagnosis. The possible reason may be the limited number of enrolled studies with regard to the detection of metastasis, more related studies are needed to verify the diagnostic performance of PET/CT in detecting metastasis. In addition, two reports enrolled in this study revealed that 18F-FDG PET/MRI yielded better diagnostic accuracy for the detection of metastatic ovarian cancer but did not offer significant additional information for the diagnosis primary OC [10, 41].

The current meta-analysis focused on the assessment of diagnostic performance of noninvasive imaging modalities on OC detection despite the cancer stages. It also provided considerable updates to previous meta-analyses on individual OC stage or imaging technique [11, 24, 42,43,44]. First, electronic database was systematically searched using relevant keywords and comprehensive search strategies; second, the processes of the literature screening, quality assessment, and data extraction were performed by two independent reviewers to minimize objective bias. Finally, statistical approaches were employed to detect potential heterogeneity and publication bias in included records. Publication bias of included studies was not indicated by the Deeks’ funnel plot asymmetry test. Nevertheless, as similar to any meta-analysis, heterogeneity is inevitable and the source of this heterogeneity is not sorted out thoroughly. The results should be interpreted with caution.

Conclusion

This meta-analysis revealed that noninvasive imaging modalities including CT, MRI, PET (PET/CT, PET/MRI) yielded favorable diagnostic performance in the detection of OC. A combination of different tools (PET/MRI) was deemed to be more accurate for identifying metastatic OC. Professional and economic issues should be considered by practitioners in the real-world clinical setting.