Introduction

Ovarian cancer is the second most common gynecologic malignancy and the most common cause of gynecologic cancer death in resource-rich countries with a lifetime risk of 1 in 75 for people with ovaries [1, 2]. Epithelial ovarian cancers (consisting of serous, endometrioid, clear cell, and mucinous subtypes) account for more than 90% of ovarian malignancies, followed by other cell types, including germ cell and sex cord-stromal tumors [2]. Only 15% of ovarian cancers are detected early (stage 1) and the overall 5-year survival for these patients is less than 50% as recently as 2015 [2].

Accurate staging of ovarian cancer remains critically important for treatment planning with stage 3 disease often requiring a combination of neoadjuvant therapy and/or surgery [3]. Current North American guidelines recommend contrast-enhanced CT as the primary work-up for staging and contrast-enhanced CT, MRI, or PET/CT as the post-primary treatment surveillance for stage 2–4 disease. However, supplementary PET/CT may improve accuracy compared to CT alone for detecting advanced ovarian cancer [4]. Further, CT alone may not be reliable for determining stage 3b disease [5]. To our knowledge, the diagnostic accuracy evidence of CT alone versus PET/CT for characterizing Stage 3 disease has not yet been systematically evaluated. The purpose of this systematic review is to compare the diagnostic accuracy of contrast-enhanced CT versus PET/CT for detecting stage 3 ovarian cancer.

Methods

This systematic review and meta-analysis were performed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analysis—Diagnostic Test Accuracy (PRISMA-DTA) guidelines [6, 7]. The study proposal was submitted to the PROSPERO database prior to initiation of the review (CRD42022371668). Institutional ethics approval was not required as it included only previously published information.

Literature search

A complete database search of MEDLINE, EMBASE, Scopus, and the Cochrane Library (including the Cochrane Central Register of Controlled Trials, the Cochrane Central Register of Protocols, and the Cochrane Database of Systematic Reviews) from inception through to October 30, 2022 was performed to identify studies evaluating the diagnostic accuracy of CT and/or PET/CT to detect stage 3 ovarian carcinoma as compared to a surgical reference standard ± clinical follow-up of at least 12 months. Different title/abstract/keywords and medical subject heading terms were searched with variations of (“computed tomography” OR “positron emission tomography”) AND “ovary” AND “cancer” were searched on an individualized basis in each database. The search strategies applied for each database are shown in Supplementary Information. No language or publication date restrictions were applied. The search was performed according to best practices for electronic search strategies [8]. Articles from each database were subsequently combined and duplicate articles were removed. Title and abstracts were screened for relevance of the study objective. Subsequently, full-text review was performed for potentially relevant studies performed independently by two reviewers with 3 and 6 years of imaging experience (BB and SS). Additionally, gray literature consisting of conference presentations from the Radiological Society of North America (RSNA) between 2020 and 2022, the European Congress of Radiology (ECR) between 2022 and 2023, and the International Society for Gynecologic Cancer (ICGS) from 2022 was performed by one author with 8 years of imaging experience (MPW). If a conference abstract provided sufficient information to meet inclusion criteria, these abstracts were included in the review and authors were contacted for additional information when necessary. Finally, reference lists for key studies were checked and forward searching was performed in Google Scholar.

Selection criteria

Original articles evaluating the diagnostic accuracy of CT and/or PET/CT for detecting stage 3 ovarian malignancy compared to a surgical reference standard ± clinical follow-up of at least 12 months were evaluated in full-text review. For inclusion, studies further required a minimum of 5 patients and sufficient information to reconstruct a 2 × 2 contingency table on a per-patient basis. If studies met all criteria but the latter, the authors were contacted via email for additional information and would be included if sufficient information was supplied. Studies evaluating the accuracy of CT and/or PET/CT by dividing individual patients into multiple regions of the abdomen (a “per-region basis”) were included in the systematic review but not included in the meta-analysis if sufficient data to construct a 2 × 2 contingency table on a per-patient basis could not be obtained. Studies were excluded when (1) CT or PET/CT were not the index test(s); (2) ovarian malignancy was not the target pathology; or (3) surgery was not the principal reference standard. Malignancy was defined by a pathological gold standard within each individual study. Non-original articles including review articles, guidelines, consensus statements, letters, and editorials were excluded.

Data extraction

The relevant data from included studies were extracted independently by two reviewers (BB AND SS). Study, patient, and imaging characteristics were recorded. Study demographics included first author, year of publication, country in which the study was performed, use of a prospective or retrospective study design, data acquisition from a single hospital or multi-hospital (multi-center) setting, reference standard used, number of readers for imaging interpretation, if consensus reading was used, and type of specialist of the reader. Acquired patient information included the total number of patients, mean age of included patients, modality used in that study, and tumor subtypes included in the study. Included imaging characteristics for CT: brand, number of detector rows, slice thickness, presence of contrast, and size threshold used for positivity. Included imaging characteristics for PET/CT: brand, PET-positive criteria, number of detector rows, and slice thickness.

True-positive (TP), false-negative (FN), false-positive (FP), and false-negative (FN) results for the presence of malignancy by size threshold were recorded. If data for different size thresholds was provided within an individual study, these were each recorded separately. If a study did not report these results directly but reported the sensitivity, specificity, sample size, and prevalence, then contingency table results were calculated. For studies with multiple index test readers, a single result was recorded as an average of all reviewers [9]. Following the completion of data acquisition independently by two authors, the contingency tables were reviewed in entirety by three authors (BB, SS, MPW) with re-review of articles where data were inconsistent and consensus agreement was used for the final data file. The third reviewer has 8 years of imaging experience (MPW). Data were collected using Microsoft Excel v15.14 (Microsoft, Redmond, Washington, USA).

Risk of bias assessment

The methodological quality of each study was evaluated using the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool [10]. Studies with a high-risk evaluation for any single signaling question in a particular category were considered high risk for that respective domain.

Data analysis

Meta-analysis was performed on studies that reported the results per patient, not per radiographic region. A bivariate meta-analysis model that addresses the correlation between sensitivity and specificity was used to evaluate the accuracy of CT and PET/CT, each compared to a reference standard. A generalized linear mixed model for the binomial family with a logit link was used as implemented in package [metadata]. Analysis was performed using STATA software (StataCorp, 2019, Stata Statistical Software: Release 17, College Station, Tx: StataCorp LP). Studies evaluating the accuracy of CT and/or PET/CT on a per-region basis were summarized using descriptive statistics (median and range). Variability was assumed based on current best-practice recommendations by the PRISMA-DTA group for bivariate models and a funnel plot and associated tests were not performed to explore small study effects [6]. A subgroup analysis of both CT and PET/CT groups was planned to evaluate for causes of presumed variability which may include one or more features of study, imaging, and/or patient-specific characteristics.

Results

Literature search

The literature search PRISMA flow diagram is shown in Fig. 1. A total of 3701 titles and abstracts were assessed, with 43 articles reviewed in full text. Twenty-eight studies were excluded flowing full-text review for reasons outlined in Fig. 1. A total of 15 studies with 918 patients were included in the systematic review [4, 11,12,13,14,15,16,17,18,19,20,21,22,23,24] with six studies evaluating the accuracy of CT on a per-patient basis [4, 11, 13, 16, 17, 24] and six studies evaluating the accuracy of CT on a per-region basis throughout the abdomen [12, 15, 18, 19, 21, 23]. Eleven studies evaluated PET/CT [4, 11, 13,14,15, 17, 18, 20,21,22, 24] with seven studies evaluating the accuracy of PET/CT on a per-patient basis [4, 11, 13, 17, 20, 22, 24] and four studies evaluating the accuracy of PET/CT on a per-region basis throughout the abdomen [14, 15, 18, 21]. For studies evaluating contrast-enhanced CT and PET/CT within the same study, readers were blinded to the separate modality interpretation during the evaluation of their given modality.

Fig. 1
figure 1

PRISMA flow diagram showing screening and selection of studies included in systematic review

Study, patient, and imaging characteristics

Study features and patient characteristics are shown in Table 1. Most studies used a prospective study design (9/15) and included single-center data (12/15). The studies were conducted in countries in North America, Europe, and Asia. All but one reporting studies used consensus review for image interpretation with most readers being either a radiologist and/or a nuclear medicine physician. Most studies included between 40 and 120 patients with two studies including less than 20 patients. The mean age of included patients for reporting studies ranged from 51 to 65 years. Most studies included only subtypes of epithelial ovarian cancers.

Table 1 Patient and study characteristics

Tables 2 and 3 summarize the CT characteristics and PET/CT characteristics by study, respectively. Most reporting studies used a lymph node size threshold of 1 cm for positive results in contrast-enhanced CT reporting. PET/CT-positive criteria was variably described with positive results predominantly involving abnormal FDG uptake in lymph nodes, although only two studies described their diagnostic SUV criteria [11, 14].

Table 2 CT imaging characteristics by study
Table 3 PET/CT imaging characteristics by study

Diagnostic accuracy of contrast-enhanced CT and PET/CT

Individual study performance reported for patient-based and region-based CT and PET/CT is shown in Table 4. Forest plots for sensitivity and specificity for patient-based CT and PET/CT analysis are shown in Fig. 2. Receiver operating characteristic (ROC) curves for CT and PET/CT are shown in Fig. 3. The sensitivity and specificity for contrast-enhanced CT on a per-patient basis is 82% (67–91%, 95% CI) and 72% (59–82%), respectively. The median and range sensitivity and specificity for contrast-enhanced CT on a per-region basis is 72% (35–96%) and 93% (69–98%), respectively.

Table 4 Contingency tables by study reported for CT and PET/CT by patients and regions
Fig. 2
figure 2

Pooled and weighted sensitivity and specificity of studies evaluating contrast-enhanced CT versus PET/CT for detecting stage 3 ovarian cancer

Fig. 3
figure 3

ROC curve of contrast-enhanced CT versus PET/CT for detecting stage 3 ovarian cancer

The sensitivity and specificity for PET/CT on a per-patient basis is 87% (75–94%) and 90% (82–95%), respectively. The median and range sensitivity and specificity for PET/CT on a per-region basis are 73% (62–95%) and 83% (49–97%), respectively. On a per-patient basis, there is no significant difference in sensitivity between contrast-enhanced CT versus PET/CT (p = 0.29), but PET/CT is significantly more specific (p < 0.01). There were an insufficient number of studies reporting both contrast-enhanced CT and PET/CT on a per-patient basis to perform a subgroup analysis in either group.

Risk of bias assessment

Risk of bias by study domain is shown in Supplementary Information. Risk of bias was low for almost all studies with only one study demonstrating a high risk of bias for patient selection on account of non-consecutive patient selection (excluding patients who received therapy at outside institutions) [18] and one study demonstrating a high risk of bias for index test due to limited clarity for diagnostic PET-positive criteria [13]. One study had an unclear risk for patient selection [15] and two studies had an unclear risk for reference standard [12, 23]. Applicability concerns were low for all included studies.

Discussion

This meta-analysis shows no statistical difference in sensitivity between contrast-enhanced CT alone and PET/CT for detecting stage 3 disease of ovarian cancer but does show statistical improvement in specificity and a trend toward improved overall accuracy when using PET/CT on a per-patient basis. The study supports existing North American guidelines which indicate that contrast-enhanced CT alone is a suitable method for initially screening patients with ovarian cancer. However, supplemental PET/CT in these patients may play an important role for improving detection of abdominal metastasis given the improved specificity and trend toward improved overall accuracy when compared to CT alone.

The standard supplementary use of PET/CT in the initial staging of patients with ovarian cancer may be limited by modality access relative to CT alone. In one study comparing contrast-enhanced CT, MRI, and FDG PET/CT for detecting peritoneal carcinomatosis in patients with ovarian cancer, PET/CT showed the highest specificity, while MRI showed the highest sensitivity, although no statistical differences in overall accuracy were identified between the modalities and multidetector CT was recommended as the initial staging modality as it was the “fastest, most economical, and most widely available modality” [21]. Even in this study, however, the additional potential value of PET/CT was emphasized. In fact, four of five studies comparing contrast-enhanced CT alone to PET/CT on a per-patient basis recommend the use of PET/CT, particularly for the utility of PET/CT in detecting lymph node metastases [4, 11, 13, 17]. A further two of the three studies comparing contrast-enhanced CT alone to PET/CT on a per-region basis also came to the same conclusion [15, 18]. The supplementary use of PET/CT may be particularly important for patients undergoing alternative or systemic therapies prior to or even in lieu of a diagnostic laparoscopy as the existing surgical gold standard has limitations in performance despite having near perfect specificity in the regions that are visualized and evaluated [25].

While serving as the first known systematic review evaluating the accuracy of contrast-enhanced CT alone versus PET/CT for ovarian cancer staging, this systematic review and meta-analysis is subject to several limitations. One principal limitation may be the use of consensus reading in nearly all reporting studies for both CT and PET/CT evaluation. This may not reflect a “real-world” scenario where the initial study is more often read by a single reader at most institutions in North America, particularly for CT, and the ability to detect small lesions throughout the abdomen may be lessened than when read by consensus. Further complicating this, is that a surgical gold standard is inherently imperfect, particularly with respect to sensitivity, given the potential for non-visualization of small lesions and particular regions (such as the retroperitoneum) with diagnostic laparoscopy [25]. In our opinion, this would likely have a more detrimental effect on the sensitivity of contrast-enhanced CT compared to PET/CT, supported by a recent prospective cohort study showing that PET/MRI has an improved sensitivity compared to CT for detecting peritoneal carcinomatosis of general primary abdominopelvic malignancies, despite the current meta-analysis not identifying statistically improved sensitivity with PET/CT [26]. Conversely, PET/CT has been known to have false positives in primary mucinous carcinoma subtypes on account of increased mucin content and decreased tumor cellularity, although the potential effect of cellularity on disseminated abdominal metastases for this subtype is not clearly defined [27]. Several other sources of variability both within and between modality assessments may also have an unrecognized effect on the results of our study including individual study, patient-, and/or imaging-specific parameters, although there were an insufficient number of studies evaluating each modality on a per-patient basis to undertake a planned subgroup analysis. By way of example, the nuanced differences in the acquisition of the CT portion of a PET/CT (such as slice thickness and contrast enhancement) may have an important but undefined difference in diagnostic accuracy within the PET/CT modality.

Given the limitations in the existing literature and recognized potential benefit for supplementary imaging beyond CT alone for staging patients with ovarian cancer, this study outlines several avenues for future research. First, further comparative analysis of contrast-enhanced CT versus PET/CT reporting accuracy on a per-patient basis would help improve the precision of this comparative analysis. Future systematic reviews should also include MRI or even PET/MRI studies in their comparative analysis. For patients evaluated with FDG PET/CT, a subgroup analysis of threshold values for positivity may help clarify the optimal SUV values for diagnosis given limited clarity and some variability for positive thresholds in the current existing publications. Finally, comparing new and emerging modalities for staging in these patients to CT and/or FDG PET/CT may provide alternative approaches to imaging these patients. As an example, in one recent publication, Gallium-68-labeled fibroblast activation protein inhibitor ([68Ga]-FAPI-04 PET/CT) was found to have a higher sensitivity than FDG PET/CT for detecting lymph node and peritoneal metastasis of epithelial ovarian cancers [28]. Future national and international guidelines should be informed by this and future analyses as the literature further evolves both within the existing modalities and expands on emerging modalities.

In summary, this meta-analysis shows no statistical difference in sensitivity between contrast-enhanced CT and PET/CT for detecting stage 3 ovarian cancer, supporting current guidelines indicating contrast-enhanced CT as an initial staging test for these patients. However, PET/CT is significantly more specific, potentially more accurate, and may serve an alternative and/or supplementary role to CT alone in these patients prior to and/or in lieu of diagnostic laparoscopy. Future research comparing SUV thresholds for FDG PET/CT, including MRI and/or PET/MRI, and evaluating the utility of emerging PET radiotracers may help advance the overall accuracy of PET imaging for the diagnosis of stage 3 ovarian cancer. Future revisions to existing guidelines should be informed by this study and emerging literature to further refine the pretherapeutic diagnostic pathway on an individualized basis.