Introduction

Nerve-sparing strategies during radical prostatectomy (RP) for localized prostate cancer (PCa) are associated with preservation of erectile function and urinary continence. Neurovascular bundles (NVB) are localized all-around of the prostate and seminal vesicles especially for cavernous nerves [1]. Nerve-sparing strategy has to be planned depending on the knowledge of the presence and localization of extra-prostatic extension (EPE) and seminal vesicle invasion (SVI) to reduce probability of positive surgical margins associated with a twofold increased hazard of biochemical relapse [2].

Prediction of EPE by digital rectal exam (DRE) or prostate biopsy is known to have low accuracy [3]. Nomograms like “Partin tables” using commonly available preoperative data—serum PSA level, clinical stage and biopsy Gleason score—to predict pathological stage at RP can be used [4].

Moreover, debate persists regarding whether MRI should be used routinely for preoperative evaluation of PCa. The millimetre accuracy provided by pathologist is impossible to obtain by MRI, and its performance to investigate the EPE remains notably lower below 0.75 mm or extension at the apex [5]. Recently, Rud et al. [6] have found in a randomized controlled trial that MRI alone prior to RP does not reduce risk for positive surgical margin [8]. Radiologists reached a sensitivity of 33–84% and specificity of 71–92% for T2/T3 differentiation depending on their experience: specialist uroradiologist in academic centre reflects the best diagnostic performance of MRI [7, 8]. However, in general radiologist centres, where the majority of prostate MRIs are conducted, this performance may be lower.

Our objective was to evaluate the place of the “everyday” (or current) prostate MRI in relation to other data in the preoperative staging of EPE and/or SVI and to establish a predictive scoring system combining SVI and EPE.

Methods

Patients

From January 2000 to December 2013, 1743 consecutive patients had RP in an academic centre for localized PCa. All of these have had a pretreatment MRI. For each patient, preoperative data were noted: clinical stage (T1–T3, N0 and M0), biopsy before surgery for evaluation of Gleason score, serum PSA level, MRI as preoperative staging tool of prostatic cancer.

Pathologic specimens

RP specimens were step sectioned and prospectively evaluated by experienced uropathologists of our centre according to Stanford protocol. The T category as defined using the American Joint Committee on Cancer categorization and the Gleason score original or on International Society of Urological Pathology (ISUP) 2005 Gleason score, and ISUP 2010 for specimens after these years was noted.

MRI

Prostate MRI was performed without any restriction of technique (it depended on the MRI technology available at the time of examination). Patient with MRI which is technically insufficient was not included. All MRI imaging were prospectively evaluated preoperatively for the presence of EPE by four urologists experienced in prostate cancer evaluation and radical prostatectomy. Presence of EPE on MRI was concluded based on personal training and knowledge of the radiologist’s report. Results were categorized using a scale for suspicion of EPE at MRI: iT2 if there was no EPE, iT3A if EPE was suspected or iT3B in case of seminal vesicle invasion (SVI).

Statistical analysis

The population of 1743 patients was randomly divided in two groups: (1) the training group of 1054 patients and (2) the validation group of 689 patients.

Characteristics of patients included in the two groups of analysis are presented as numbers and percentages, mean ± 1 standard deviation or median [interquartile range], depending of the distribution of the variable. Serum PSA was transformed in ordinal categorical variable with the following cut-offs: ≤5, ]5–10], ]10–15], >15 ng/ml. Gleason score was considered ≤6 or ≥7. Prostatic weight was divided as ≤50 or >50 g.

Factors associated with EPE were tested in the training group by means of Chi-squared test or Fisher’s exact test, where appropriate. All variables significant in univariate analysis at a p ≤ 0.05 level were tested in a stepwise logistic regression model that estimated the β coefficients, the odds ratios (ORs) with their 95% confidence intervals of the variables independently associated with EPE at a p ≤ 0.05 value. The goodness-of-fit of the model was assessed by the Hosmer–Lemeshow test.

A score of risk of EPE was calculated for each patient as the sum β i x i , where β i designed the β coefficient of each class of a variable, and corresponded to the variables with the value 0 for the category of reference and 1 otherwise. Performance of the score was assessed by the area under curve (AUC). Several cut-offs derived from the ROC curve were tested according to a better value of negative (NPV) or positive predictive value (PPV).

A simplified, user-friendly version of the model has been built. It was derived from the original model by linear transformation of the β coefficient. This simplified model was compared to the original one in terms of performance. The score was then applied to the validation population.

Results

Patients’ characteristics are shown in Table 1. Clinical, radiological and biological aspects were not different between the two groups. Overall 549 patients (31.5%) had EPE in their RP specimen (95% CI 29.3–33.7). SVI was reported in 182 patients (10.5%; 95% CI 9.0–11.8).

Table 1 Patients’ characteristics N = 1743; random sample: test population n = 1054; validation population n = 689

Population test analysis

Univariate analysis

In the population test, 329 patients had EPE in their RP specimen, accounting for an overall prevalence of EPE of 31.2%. SVI was reported in 119 patients accounting for an overall 11.3% prevalence of SVI. As shown in Table S1, in univariate analysis PSA, clinical stage (DRE), biopsy Gleason score, prostate weight and stage at MRI were significantly associated with EPE at RP (p < 0.05).

The crude results of MRI in that population were a sensibility of 34.8%, a specificity of 85.7%, a negative predictive value (NPV) of 63.6% and a positive predictive value (PPV) of 64.7%. For clinical exam, the DRE has a sensibility of 4.0%, a specificity of 99.5%, a NPV of 57.9% and a PPV of 85.7%.

Multivariate analysis and EPE score system development

All significant variables in univariate analysis were tested in a logistic regression model. As shown in Table 2Aa, multivariate analysis identified PSA > 5 ng/ml, clinical stage DRE T2 or T3, biopsy Gleason score ≥7, prostate weigh >50 g and stage at MRI iT3A or iT3B as factors independently associated with EPE at RP (p < 0.05).

Table 2 (A) Multivariate analysis of preoperative parameters predicting extra-prostatic extension (EPE) and/or seminal vesicle invasion (SVI): (a) with MRI (b) without MRI; (B) extra-prostatic extension (EPE) and/or seminal vesicle invasion (SVI) preoperative system score with or without MRI

Fitness of the model was assessed by the Hosmer–Lemeshow goodness-of-fit statistic showing that there was good concordance between observed and predicted EPE with a p = 0.77.

On the other hand, we have built a model without MRI (Table 2Ab). Addition of MRI with clinical parameters improved this fitness from 0.49 to 0.77.

As shown in Table 2B, the EPE scoring system was then produced by multiplying the coefficients β by 3 and rounding them to the nearest integer value. The score ranges from −1.5 to 13 points and −1.5 to 11.5 for the model without MRI.

The ROC curves are shown in Fig. 1. The AUC is 0.74 (IC 95% [0.71–0.77]; SE 0.0156), for the score with MRI. For the model without MRI, the AUC is 0.72 (IC 95% [0.69–0.75]; SE 0.0160).

Fig. 1
figure 1

Validation data set ROC curve for extra-prostatic extension (EPE) with preoperative data using the EPE evaluation scoring system. Comparison of ROC curves and AUC with or without MRI

There is a statistical difference between models with or without MRI (χ 2; p = 0.0039).

Its prognostic value was tested using several threshold values (2, 2.5 and 3). Although several cut-off values were significantly associated with EPE, the value of 3 was selected as optimal. The cut-off value of 3 separated the population into two subgroups: less probability of EPE (≤3) and high probability of EPE (>3).

Sensitivity, specificity, PPV and NPV of the EPE scoring system with MRI and a threshold of three were 67.6, 69.5, 62.7 and 73.9%, respectively.

Validation population

In the validation population of 689 patients, 220 patients (31.9%) had EPE in their RP specimen. SVI was reported in 63 patients (9.1%). Informational indices sensitivity, specificity, PPV and NPV at the threshold of three were, respectively, 65.4, 71.1, 61.4 and 74.5%.

Discussion

It has been well established that the prevalence of EPE is influenced by several established parameters such as PSA, clinical stage and biopsy Gleason score [912]. Because of the high incidence of PCa and the high cost of MRI, debate persists regarding whether MRI should be used routinely for preoperative evaluation of PCa. European Association of Urology (EAU) and French Association of Urology (AFU) have established recommendations for the evaluation of tumour extension before RP [13, 14]. For the EAU, the extent of PCa is evaluated by DRE and PSA and may be supplemented with bone scanning and computed tomography (CT) or multiparametric MRI (mp-MRI). For the AFU, the prostate MRI is optional and easily proposed to evaluate the indication of nerve preservation during RP.

Various nomograms have been developed to predict pathological characteristics [15, 16] (Table S2). Without MRI, the AUC of preoperative clinical parameter to predict EPE varied from 0.702 to 0.84 [4, 1719].

Some authors have evaluated the MRI to predict risk and location of EPE. MRI with other associated parameters has little result.

  • For Billing et al. [20] (106 patients), the accuracy of diagnosing EPE was 72.2%, with an overall sensitivity and specificity of 30.0 and 93.3%, respectively. The negative predictive value was 72.7%.

  • For Wang et al. [21] (612 patients), in the combined endorectal MR imaging–MR spectroscopic imaging group, the areas under the ROC curves were 0.81 for the staging nomograms and 0.90 for the staging nomograms plus MR findings.

  • Pak et al. [22] (944 patients) have developed a scoring system for the prediction of posterolateral EPE with an AUC of 0.810 for discrimination ability of the scoring system. The Hosmer–Lemeshow goodness-of-fit test is low (p = 0.396).

  • Feng et al. [23] with a limited population of 112 patients have shown that mp-MRI improved accuracy of existing clinical nomograms with an AUC of 0.93 with Partin table plus MRI and 0.94 with MSK nomogram plus MRI.

Our study has developed a scoring system including PSA, DRE, biopsy Gleason score, prostate weight and MRI for the prediction of EPE having an AUC of 0.74 for discrimination ability of the scoring system for EPE. With a Hosmer–Lemeshow goodness-of-fit test of 0.77, the preoperative scoring system has a good accuracy and the best one of the literature. A model with MRI had a better AUC and accuracy than a model without MRI justifying the contribution of MRI in the preoperative plan.

Clinical stage T2 was shown to be associated with presence of EPE (p = 0.027 in the multivariate analysis). This is unexpected and underlined the fact that DRE with low sensibility but high positive predictive value needs to be taken in consideration before RP. Prostate weight >50 g is a protective factor (OR 0.64 [0.48–0.86]). This is described in the literature [2426].

Larger glands may produce more PSA due to the presence of benign prostatic hyperplasia, causing a lead time bias or diagnosis of prostate cancer at an earlier point in the progression of disease. But, this has to be confirmed in a prospective study.

Compared to our cohort, some studies have better AUC even without MRI but our score has the best accuracy with the largest sample and an internal validation.

If we resume our study and others publications (Table S3), predictive models are more performing with MRI (mean of performance index values = 0.793) versus without MRI (mean of performance index values = 0.734). We have to take in consideration that MRI not only predicts EPE but also provides an anatomical view of the localization of the PCa [27].

Limitations of our study are that we did not standardize the patients who underwent MRI that introduce bias in evaluation. Additionally, because of “real life” condition, we were unable to obtain the training and experience profiles of the radiologists who conducted the MRIs. The MRI evaluation was heterogeneous by nonspecialized radiologists providing of multiple radiology centres. The performance of specialized radiologist MRI reading for EPE prediction is an incremental benefit [28]. MRI radiologist report homogenization was approached by evaluation of four urologists experienced in prostate cancer evaluation and radical prostatectomy. It is possible that our findings underestimate the true diagnostic accuracy of MRI and its added value.

Other limitation of our study is that the inclusion of preoperative parameters of our cohort lasted 13 years; this long period includes the development of MRI techniques. A recent meta-analysis by De Rooij et al. [29] analysed the accuracy of MRI with different technical features to detect EPE. Their conclusion is that functional imaging in addition to T2-weighted imaging improved sensitivity for EPE, while the sensitivity was not improved by endorectal coil use. There is a lack of sector-based comparison between MRI and histopathology. Neither the side nor the level of EPE was consistently reported in the prospectively recorded data.

This study has several strengths. Patient populations influence staging accuracy, as a high rate of organ-confined disease may obscure a poor detection of locally advanced disease, and small study populations may result in higher staging performance [30]. Our study population was large (n = 1743) and consisted of 41.9% with stage pT3 disease.

Our study “everyday” prostate MRI has a good calibration and is internally validated. So, this user-friendly score could be used in clinical practice.

Conclusions

Analysis of MRI associated with conventional preoperative data allows good staging prior to radical prostatectomy. A scoring system including PSA, DRE, biopsy Gleason score, prostate weight and MRI allows to exclude an EPE and/or VSI in 74% of localized PCa accurately.