Introduction

Cervical cancer is the fourth most common cancer in women worldwide, with estimated new cases and deaths of 13,240 and 4,170 respectively in the United States in 2018 [1]. The standard treatment for locally advanced cervical cancer [International Federation of Gynaecology and Obstetrics (FIGO) stages IB2 to IVA] is concurrent chemoradiotherapy (CCRT) [2]. The 5-year survival rate is about 58-80% for stage IB-II cancer, but is only 15-35% for the stage III-IV cervical cancer [1]. Biomarkers identifying patients at higher risk of recurrence is urgently needed, to provide a better personalisation of cancer care in treatment decisions or follow-up strategies. Still, they may not be satisfactory for identifying before treatment. Clinicopathological factors for cervical cancer including FIGO stage have shown predictive values in prognosis [3]. The most important causative factor for cervical cancer, human papillomavirus (HPV), also provides valuable prognostic information [4], with the absence of HPV infections or presence of alpha-7 species resulting in a poorer prognosis, compared with alpha-9 species or multiple HPV infections [4].

Magnetic resonance (MR) imaging is useful in the staging of cervical carcinomas of FIGO stage IB1 or greater, supported by its accuracy for evaluating tumour size, detecting parametrial invasion, and high negative predictive values for excluding bladder and rectal invasion [3]. Diffusion-weighted (DW) MR imaging provides a quantitative readout—apparent diffusion coefficient (ADC) values, representing tumour cellularity [3]. Preliminary reports have shown the prognostic values of ADC derived from a single section [5,6,7,8,9,10,11,12], few of them applying the three-dimensional whole-tumour ADC values [13]. With the development of high-throughput extraction of quantitative features that result in the conversion of images into mineable data, [14], DW MR imaging demonstrated potentials in integrating ADC values with the established predictive markers such as HPV genotyping [15] and clinical FIGO stage [3], to improve the predictive accuracy.

The purpose of this study was to develop and evaluate the values of a prognostic model based on an integration of pretreatment MR imaging, whole-tumour ADC values and HPV genotyping, in the prediction of survival and recurrence for women with stage IB-IV cervical cancer following CCRT.

Materials and methods

Patients

This study complied with the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) Statement. We retrospectively analysed three consecutive prospectively collected cohorts from February 2007 to December 2014 (IRB 95-1243B, IRB 97-2366B and IRB 102-0620A3), enrolling all the available and evaluable cases during the study period. There was no sample size estimation at the initial stage. Our institutional review board approved this study and informed consents were obtained. The study setting was conducted in a tertiary referral centre with a dedicated interdisciplinary gynaecological oncology team. The inclusion criteria for this study were: biopsy proved newly diagnosed cervical cancer clinical stage FIGO IB-IVA and willing to undergo CCRT treatment. The exclusion criteria were: patients who are judged to be non-compliant to treatment or not accessible for follow-up, patients having contraindications to MR scanning, or patients unable to cooperate for MR study due to mental status. The patient cohort is detailed in Fig. 1. We recorded the relevant clinical information, including age, histology, grade, HPV status and serum levels of squamous cell carcinoma antigen (SCC-Ag) and carcinoembryonic antigen (CEA). Serum SCC-Ag level is elevated in 28-88% of patients with squamous cell cervical cancer, and has shown prognostic significance for patients undergoing surgical treatment or radiation therapy [16]. Data collection was planned before the MR imaging acquisition. A total of 134 evaluable subjects entered final analysis in this study; age ranged from 24 to 92 years (median, 54 years).

Fig. 1
figure 1

Flow chart of the study cohort

MR methods

MR studies were conducted with a 3-T MR imaging unit (Tim Trio; Siemens Healthineers, Erlangen, Germany) before treatment. Patients were placed in the supine position with bladder emptied before the examination. The lower nine elements of the integrated spine coil and the lower six elements of the body-phased array coil were used to cover the entire pelvis. MR images of the pelvis were acquired in the sagittal and axial oblique planes perpendicular to the cervical canal (DW: single-shot echo-planar technique with fat suppression; TR/TE, 3,300/79; average, 4; b value = 0 and 1,000 s/mm2, section thickness, 4 mm; gap, 1 mm; matrix, 128 × 128; FOV, 20 cm; T2-weighted: 5,630/87; average, 3; matrix, 256 × 320; FOV, 20 cm). This study was performed during free-breathing at a minimal level. No premedication was administered [17].

Imaging analysis

ADC maps were generated using a monoexponential decay model with b value of 0 and 1,000 s/mm2 (VB17a, Trio TIM; Siemens Healthineers). Anonymous data were exported offline to a personal computer. By using an in-house developed software based on Matlab (MathWorks, Natick, MA, USA), the first reader (Y.T.H, a gynaecological radiologist with 8 years of experience) drew regions of interest (ROIs) around the tumour on each slice on the ADC maps with reference to the high b value DW and T2-weighted images to delineate the whole tumour volume to minimise the slice selection bias. The second reader independently verified the ROIs (G.L., a gynaecological radiologist with 10 years of experience). Both readers were blinded to clinical outcome. Care was taken to avoid non-pathological structures in the ROIs, such as normal cervical stroma, vascular structures, fluid or Nabothian cysts in the cervix. The mean, minimum, 10th, 25th, 50th, 75th and 90th percentiles and maximal pixel ADC values (ADCmean, ADCmin, ADC10, ADC25, ADC50, ADC75, ADC90 and ADCmax, respectively) were generated, and the skewness and kurtosis of the histogram were recorded (Fig. 2). The tumour volume was calculated by multiplying the voxel size of DW image by the number of voxels in the ROI, where the voxel size = in-plane pixel size (1.5625 mm2) × (4 mm slice thickness + 1 mm gap) = 7.8125 mm3.

Fig. 2
figure 2

Representative volumetric apparent diffusion coefficient (ADC) measurement. a Coloured scale bar, × 10-3 mm2/s. b Histogram analysis. A 47-year-old woman with moderately differentiated squamous cell carcinoma of uterine cervix, clinical stage T2b N0 M0, HPV genotype 18, maximal tumour diameter 7.3 cm, ADC10 0.68 × 10-3 mm2/s, ADC50 0.82 × 10-3 mm2/s (green), ADC90 1.00 × 10-3 mm2/s (red). This patient had persistent disease following concurrent chemoradiation therapy. Overall survival time, 8.1 months

Histopathology

Before treatment, histopathological types and tumour grades were evaluated in the consensus of a general pathologist and a specialised gynaecological pathologist (R.C.W.). If the primary site of a tumour was uncertain, an additional immunohistochemical study was performed to exclude the possibility of endometrial cancer and confirm the diagnosis of cervical cancer.

HPV

Poor prognostic HPV genotypes were defined as alpha-7 species (HPV18, 39, 45) and absence of HPV infection in this study, as opposed to favourable prognostic alpha-9 species (HPV16, 31, 33, 52, 58) and multiple HPV genotypes [4, 18]. The procedure for the HPV genotyping has been reported previously [19]. Briefly, DNA was extracted from paraffin-embedded specimens and amplified by polymerase chain reaction (PCR) for the L1 open reading frame with biotinylated GP6+ and SPF1 consensus primers, then hybridised with the Easychip® HPV Blot membranes (King Car, I-Lan, Taiwan), to determine the final HPV genotypes. Type-specific PCR reactions were performed to validate multiple types and HPV-negativity on HPV Blot. In cases of discordance between results, a repeat of HPV Blot, SPF1/GP6+ PCR, and direct sequencing was performed to resolve the discrepancy.

Treatment and outcome

All patients underwent standard external beam radiation therapy (daily fraction of 1.8–2 Gy in five fractions weekly; total dose, 40–45 Gy) and cisplatin-based CCRT, six cycles of weekly intravenous cisplatin infusions (40-50 mg/m2 body surface) during the radiation therapy course [4]. Post-therapy surveillance was the same as that in our previous report [4]. The primary outcome was overall survival (OS), defined as the time from the first fraction of radiation therapy to the date of death. Disease-free survival (DFS) was determined as the time from the first fraction of radiation therapy to the development of any recurrence (local, regional or distant). Evidence of recurrence was based on the pathology documentations. When the histological/cytological tissues were unable to be obtained due to anatomical limitations or patient conditions, we used clinical proof of recurrence confirmed by at least two imaging modalities—18F-fluorodeoxyglucose (18F-FDG) positron emission tomography/computed tomography (PET/CT), MR or CT. Cases lost to follow-up, non-cancer-related death and those alive at the end of the follow-up period were considered censored observations. Patients with persistent diseases were regarded as having relapsed on the first day when completing radiation therapy.

Statistical analysis

The data were analysed using the Statistical Package for Social Sciences, version 11 (SPSS; IBM, Armonk, New York, USA), MedCalc for Windows, Version 9.2.0.0 (MedCalc Software, Ostend, Belgium), or R Package for Statistical Computing (www.r-project.org). Survival curves were generated using the Kaplan-Meier method, and comparisons were made using the log-rank test. Pearson's chi-squared test was used to evaluate the association between the variables. Continuous variables were analysed with Student’s t-test (two-group comparisons). The dataset was randomly split into training (n = 88) and testing (n = 46) datasets. Based on the training set, univariate Cox proportional hazard regressions were used to assess associations of ADC parameters, tumour volume and relevant clinical variables (i.e. age, histological type, differentiation, TNM stage, HPV risk, serum levels of SCC-Ag and CEA) with OS and DFS. Variables significant in the univariate analysis or by prognostic interest were subsequently entered into a multivariate analysis using the Cox proportional hazards ratio model. Their hazard ratios (HRs) and 95% confidence intervals (CIs) were calculated using the Wald test. Continuous variables were determined by the recursive partitioning method for optimal cut-point analysis. By using the selected independent prognostic variables based on the training dataset, outcome-predicting models were formulated. The testing data, uniformly resampled from the testing dataset under the bootstrapping procedure to duplicate the testing sample size, was applied for independent validation of the models. We used likelihood ratio tests to compare the goodness-of-fit between models. Missing data was excluded from the analysis case-wise. All tests were two-sided, and p < 0.05 was considered statistically significant.

Results

Patient characteristics and clinical outcome

Table 1 lists the clinical and demographic features of the study population. The median follow-up was 69 months (range, 9–126 months) for surviving patients. Of the 134 women analysed, 87 (65%) were free of disease at last follow-up. Twenty-two patients had persistent cervical cancer and were never free of disease. Of the 25 patients with recurrence, 12 had distant failure alone (one bone, two lungs, lymph nodes at the supraclavicular fossa, mediastinum, inguina, three para-aortic lymph nodes, one having both para-aortic and cardiophrenic lymph nodes, and two having multiple sites including lung and multiple lymph nodes at the supraclavicular fossa and mediastinum), 1 had regional lymph node recurrence alone and 5 were local failure alone (four cervix and one vaginal cuff). The remaining seven patients had more than one site of failure: one local-regional-distant (vaginal stump, pelvic lymph node and lung), two local-distant (one cervix and lung, another supra-vesicle mass and para-aortic lymph node), four regional-distant (one pelvic and para-aortic lymph nodes, one pelvis, para-aortic and mediastinal lymph nodes, two pelvis, para-aortic and supraclavicular fossa lymph node, respectively). Recurrences were confirmed by at least two imaging modalities in all cases, and in 16 patients also proven by biopsy. The time to recurrence ranged from 7 to 61 months after primary treatment. The overall 2-year OS rate was 81.2% (95% CI, 73.5–86.9%), and the 2-year DFS rate was 83.5% (95% CI, 76.0–88.8%). Forty-one patients died of persistent or recurrent disease.

Table 1 Patient’s characteristics (n = 134)

Whole-tumour ADC10 is an independent prognostic factor for survival

We developed the prognostic factors based on the training dataset. Our two readers agreed in 747 of the total 825 image slices, with an agreement rate of 0.905. All the ADC parameters (ADCmean, ADCmin, ADC10, ADC25, ADC50, ADC75, ADC90 and ADCmax) correlated one another significantly (p < 0.0001). Univariate Cox regression analyses for OS and DFS are shown in Tables 2 and 3. Based on stepwise selection procedure, it showed that adenocarcinoma or adenosquamous carcinoma cell type, ADC10, ADCmax, T stage 3-4, distant metastasis (M1) and high-risk HPV status were independent predictors for both poor OS and DFS (Table 2 and 3). In subgroup analysis, adenocarcinoma or adenosquamous carcinoma cell type, ADC10, T stage 3-4 and high-risk HPV status were also independent predictors for both poor OS and DFS in the patients without distant metastasis (M0). ADC10 values showed no significant correlations with the tumour volume, serum CEA or SCC-Ag levels (Supplementary Table 1), and showed no significant differences between tumour cell type (adenocarcinoma or adenosquamous carcinoma vs squamous cell carcinoma), T (T1-2 vs T3-4), N and M stages, as well as HPV risk (Supplementary Table 2).

Table 2 Univariate and multivariate Cox regression analysis of factors associated with overall survival on the training dataset
Table 3 Univariate and multivariate Cox regression analysis of factors associated with disease-free survival on the training dataset

Outcome-predicting models

Outcome-predicting models were constructed using the five covariates, including non-squamous cell type, ADC10 <0.77 × 10-3 mm2/s, T3-4, M1 stage and high-risk HPV status. The prognostic score was low, intermediate and high in presence of 0-1, 2-3 and 4-5 risk covariates respectively. The probability of dying was lowest in the low-risk group (HR 0.01), followed by the intermediate-risk group (HR 0.02) when compared with the high-risk group (HR 1, p = 0.001; Table 4). Two-year OS rates of the low-, intermediate- and high-risk groups were 89.6%, 74.4% and 0%, respectively. The probability of cancer recurrence was lowest in the low-risk group (HR 0.03), followed by the intermediate-risk group (HR 0.12) when compared with the high-risk group (HR 1, p < 0.0001; Table 4). We also found significant differences between the low and intermediate prognostic groups in predicting the OS (p = 0.002) and DFS (p = 0.002). For both OS and DFS, the prognostic model improved the prediction compared with outcome classification using FIGO stage and was consistent and stable on the testing dataset (Figs. 3 and 4). The prognostic model has significant improvement compared with the FIGO staging system, based on the likelihood ratio tests to compare the goodness-of-fit (p < 0.0001). Adding the ADC10 to the model significantly improve the predictive value in OS (p = 0.035) and DFS (p = 0.038) in the testing dataset.

Table 4 Hazard ratios of overall survival and disease-free survival according to prognostic score
Fig. 3
figure 3

Kaplan-Meier curves for overall survival according to outcome prediction model (upper) and FIGO stage (lower) in cervical cancer patients undergoing CCRT, for training, testing and all datasets. The prognostic model has significant improvement in the training, test and overall data, compared with FIGO stage, based on the likelihood ratio tests to compare the goodness-of-fit (p < 0.0001 for all)

Fig. 4
figure 4

Kaplan-Meier curves for disease-free survival according to outcome prediction model (upper) and FIGO stage (lower) in cervical cancer patients undergoing CCRT, for training, testing and all datasets. The prognostic model has significant improvement in the training, test and overall data, compared with FIGO stage, based on the likelihood ratio tests to compare the goodness-of-fit (p < 0.0001 for all)

Discussion

In the present study, we integrated clinical and imaging data—non-squamous cell type, lower tumour ADC10 value, T3-4, M1 stage and high-risk HPV status—to generate a prognostic model which outperformed the current FIGO stage to predict OS and DFS for patients with cervical cancer following CCRT. We constructed independent validation bootstrap samples to estimate prediction error through randomly selecting datasets (training and testing), and extending the size of the testing set by bootstrap replication, in order to address the sample size limitation issue of independent validation [20]. This approach would lend stronger confidence in the data and a higher chance to have actionable models. Our principal findings based on ADC histogram analysis on the entire tumour volume showed that ADC10 is an independent prognostic indicator for OS and DFS. As lower ADC values imply an increase in cellularity [3], the lower ADC10 in this study might represent a more aggressive subpopulation in the cervical tumours. The low ADCmin of the primary tumour before CCRT has been reported as an independent prognostic factor for both OS and DFS based on single-slice ADC analysis [21]. Pretreatment ADCmean for patients with complete remission were significantly lower than those of partial remission following CCRT [11]. Furthermore, a lower ADCmean, along with FIGO stage, LN metastases and metabolic tumour volume (MTV) on 18F-FDG PET/CT were adverse predictors of both OS and DFS, by univariate Cox analysis [9]. In the only study analysing volumetric whole-tumour DW MR imaging for outcome prediction, Gladwish et al [13] showed a lower volumetric ADC90 or ADC95 might predict DFS in patients with locally advanced cervical cancer treated with CCRT, for both squamous cell carcinoma and adenocarcinoma. Our results indicate that the prognostic model could help oncologists accentuate the follow-up for patients with a high risk of local recurrence to assess for recurrence [22], or potentially tailor the prospective trial designs, as a reduced-dose intensity-modulated radiation therapy with concurrent chemotherapy might be an alternative in favourable-risk patients with HPV-associated oropharyngeal cancer [23].

ADC parameters were significantly lower than those of normal cervical tissues [24,25,26,27,28], which make it easy to contour the volumetric whole-tumour ADC histogram. Compared with adenocarcinoma, the tumour ADCmean [24,25,26,27,28,29] or ADC50 [26] is significantly lower for squamous cell carcinoma, possibly attributed to the glandular content of adenocarcinoma [30]. However, we found ADC10 values are independent of tumour volume, serum CEA or SCC-Ag levels, and show no significant differences in tumour cell types. In line with our study, there is no significant difference between the ADC histogram of cervical cancer when divided by FIGO stage [5], lymphovascular space invasion [25, 29], depth of cervical infiltration metastases [29] or lymph node metastases [25, 29], although a lower tumour ADCmean had been reported to be an independent predictor of pathological parametrial invasion [31]. Hence the ADC10 might represent tumour microenvironmental changes in cellularity, necrosis or interstitial fluid pressure [13]. Tumour subvolumes with greater cell density by DW MR imaging also have increased metabolic activity on FDG-PET [32]. An inverse correlation was reported between ADCmin and SUVmax [6, 33], with their combination being an independent prognostic factor for both DFS and OS [6]. The changes of ADCmean and SUVmax before and after 4 weeks of CCRT treatment were significantly correlated [34]. A recent small scaled study argued that the pre-treatment volume-based quantitative parameters of FDG PET may have better potential than ADC histogram for predicting treatment response and event-free survival in patients with locally advanced cervical cancer [35]. To this end, we postulated that the ADC10 might represent the more aggressive subpopulation in the target tumour, suggesting that the combination PET-MR might play a complementary role for the clinical assessment of tumour heterogeneity.

There were some limitations to our study. The primary technical limitation of this study is that, to reduce the adverse effects of medication and patient discomfort, we did not administrate anti-peristaltic agents in the prospective studies. Nevertheless, no remarkable bowel motion artefacts affected the quality of DW MR images in our study cohort. Second, although we excluded tumour volume less than 1 cm3 to prevent partial volume effects, we may still have included extreme ADCs resulting from DW imaging and ADC map misregistration artefacts. We can presume that the ADC10 or ADC90 obtained from histograms were less affected by these artefacts that ADCmin or ADCmax, and therefore, may be more reliable for histographical analysis. We gave equal weight to each predictor that had a different HR to facilitate clinical use. Despite a relatively short median follow-up of 2 years, the majority of treatment failures in cervical cancer occur within the first 2 years. Although our data are based on a single-institutional research, the ADC parameters have been proven to be robust across 1.5-T and 3-T scanners from different vendors using clinically available protocols, and might be suitable for use in multi-centre clinical studies and trials [36]. Nevertheless, our data are based on well-standardised prospective studies with internal validation and an independent testing dataset.

In conclusion, the prognostic model based on integrated clinical and imaging data—non-squamous cell type, lower tumour ADC10 value, T3-4, M1 stage and high-risk HPV status—may be a useful clinical biomarker to predict survival and recurrence in patients with stage IB-IV cervical cancer treated with CCRT. The prognostic models could be useful for stratifying study patients in future clinical trials.