Introduction

Digital rectal examination (DRE) and transrectal ultrasound (TRUS) are traditionally used for clinical staging of prostate cancer (PCa), but both are lacking in sensitivity and specificity, and TRUS often underestimates the size and stage of the tumour [1]. Thus, prediction of extracapsular tumour extension (ECE) by DRE and TRUS has low accuracy [2, 3].

Radical prostatectomy (RP) provides great disease control for patients with localised PCa (cT1-T2), while RP for locally advanced disease (cT3) remains controversial [1, 4]. Recovery of erectile function and continence after RP is related to surgical technique and preservation of the neurovascular bundles (NVB). Accurate preoperative knowledge of tumour stage and possible ECE are crucial in achieving the best surgical, oncological, and functional result with total tumour resection, while trying to preserve both potency and continence.

Recent findings support the rapidly growing use of mp-MRI as the most sensitive and specific imaging tool for PCa staging [5]. However, the diagnostic accuracy of mp-MRI staging and prediction of ECE differs among studies [610], which has led to a debate regarding mp-MRI’s readiness for routine use [11]. Recently published clinical guidelines [12] from prostate MRI experts have therefore included a structured uniform reporting and scoring system (PIRADS) [11] to standardise prostatic MRI readings. Previous studies have validated the PIRADS classification for PCa detection and localisation using both targeted biopsies [1315] and RP specimens [16] as standard reference. In addition, the guidelines also recommend using a risk score of possible ECE. Based on these recommendations, we carried out this prospective study of a patient cohort with clinically localised PCa based on DRE and TRUS findings who were scheduled for RP. The aim was to investigate the diagnostic accuracy of preoperative mp-MRI with ECE risk scoring in the assessment of tumour stage (T-stage) and prediction of ECE at final pathology.

Materials & Methods

This prospective single-institution study was approved by the Local Committee for Health Research Ethics (No. H-1-2011-066) and the Danish Data Protection Agency, and was registered at ClinicalTrials.gov (No.NCT01640262).

Patients

All patients provided written informed consent, and were prospectively enrolled between September 2011 and September 2013. Inclusion criteria required that all patients were diagnosed with clinically localised PCa determined by DRE and TRUS and scheduled for RP, based on clinical assessment of age, comorbidity, PSA, and biopsy Gleason score. No patients had prior treatment for PCa. The exclusion criteria were patients with contraindication to mp-MRI (pacemaker, magnetic implants, severe claustrophobia, previous moderate or severe reaction to gadolinium-based contrast-media, or impaired renal function with GFR <30 ml/min). No patients were excluded from RP due to preoperative mp-MRI findings.

Multiparametric MRI

Prior to RP, all patients underwent mp-MRI using a 3.0 T MRI scanner (Achieva (n = 71 patients) and Ingenia (n = 16 patients), Philips Healthcare, Best, the Netherlands) with a pelvic phased-array coil (Philips Healthcare, Best, the Netherlands) positioned over the pelvis. We did not use an endorectal coil (ERC), due to lack of availability. If tolerated, 1 mg intramuscular Glucagon (Glucagen®, Novo Nordisk, Bagsvaerd, Denmark) injection combined with 1 mg hyoscine butylbromid (Buscopan®, Boehringer Ingelheim, Ingelheim am Rhein, Germany) intravenous injection was administered to the patient to reduce peristaltic motion. Triplanar T2W images from below the prostatic apex to above the seminal vesicles were obtained. In addition, axial diffusion-weighted images (DWI) including four b-values (b0, b100, b800, and b1400), along with reconstruction of the corresponding apparent diffusion coefficient (ADC) map (b-values 100 and 800), together with dynamic contrast-enhanced (DCE) images before, during, and after intravenous administration of 15 ml gadoterate meglumine (Dotarem 279.3 mg/ml, Guerbet, Roissy CDG, France), were performed. The contrast agent was administered using a power injector (MedRad, Warrendale, PA, USA), followed by a 20 ml saline flush injection at a flow rate of 2.5 ml/s. For imaging parameters, see Table 1.

Table 1 Sequence parameters for 3.0 Tesla Achieva (n = 71)/Ingenia (n = 16) multiparametric MRI with PPA-coil

Image analysis

All tumour-suspicious lesions were evaluated according to the PIRADS classification from ESUR [12] giving a sum of scores (range 3–15). Lesions with PIRADS summation score ≥10, equivalent to a moderate or high-risk group [15], or PIRADS overall score ≥4 [17] were considered to be possible malignant lesions and were used to predict a cT-stage based on mp-MRI (cTMRI ) according to the TNM classification [18]. Pre-contrast T1W images were used to identify post-biopsy haemorrhage as an area with high signal intensity to rule out false-positive findings on T2W. The lesion most suspicious of ECE was evaluated using the ESUR MR prostate guidelines scoring of extraprostatic disease [12], focusing on the ECE criteria (Table 2). The ECE tumour characteristics were first assessed according to the following findings: a) no sign of ECE, b) capsular abutment, c) capsular irregularity, retraction, or thickening, d) neurovascular bundle thickening, e) capsular signal loss or bulging, and f) direct sign of tumour tissue in the extraprostatic tissue. The findings were subsequently associated with the ECE risk score ranging from 0 to 5 (Fig. 1). Suspicion of possible SVI was based on the following findings: a) low T2W signal in the lumen, b) filling in of angle, and/or c) enhancement/impeded diffusion of the seminal vesicles. The ECE tumour characteristics only evaluate T2W findings, and assigning a preoperative cTMRI stage requires a definitive decision of possible ECE and/or SVI. Therefore, the suspicion of ECE was dichotomised into either organ-confined (OC) disease or ECE based on the ECE tumour characteristics and personal opinion, while incorporating functional imaging (DWI and DCE) findings. Equivocal cases with strong suspicion of ECE on mp-MRI were read as positive, and equivocal cases with low suspicion of ECE were read as negative. All mp-MRI images from each patient were analysed by a dedicated MRI prostate physician (reader A) with two years of experience in prostate mp-MRI interpretation. All imaging modalities (T2W, DWI, and DCE imaging) were interpreted simultaneously to predict cTMRI and dichotomising ECE suspicion, and only T2W imaging were used to assess ECE tumour characteristics. Qualitative visual assessment was used for DCE MRI analysis, as described in the guidelines [12]. In addition, all images were reassessed independently from reader A by a second reader B with extensive experience in general abdominal MRI, but only limited experience in prostate mp-MRI interpretation. Both readers had access to preoperative PSA and knew that the patients had biopsy-proven clinically localised PCa, but were blinded to any histopathological findings.

Table 2 ESUR MR prostate guidelines risk scoring of extracapsular extension [11]. The probability of ECE is scored on a five-point scale, providing an ordinal risk score scale, with higher scores corresponding to higher risk of ECE
Fig. 1
figure 1

Prostate cancer (white arrows) tumour characteristics corresponding to different ECE risk score groups: a) ECE risk score 1 – tumour with capsular abutment b) ECE risk score 3 – tumour with capsular thickening c) ECE risk score 4 – tumour with capsular bulging, and d) ECE risk score 5 – tumour with direct sign of ECE

Histopathological evaluation

All patients underwent RP. The surgeon was blinded to any mp-MRI findings. The specimens were coated with ink and fixed in formalin. The bases, including the bases of the seminal vesicles and the apex, were cut in sagittal sections, whereas the remaining prostate was cut into 4–5 mm sections in a plane perpendicular to the rectal surface corresponding to the plane used for axial mp-MRI. The remainder of the seminal vesicles were cut longitudinally. The slices were then further cut into microscopic sections and stained with haematoxylin and eosin. All cancerous areas, including the presence and location of any extraprostatic extension (EPE), defined as either ECE and/or SVI, were outlined based on the histopathological results by an experienced pathologist. ECE was defined as tumour cell growth into the extraprostatic tissue, and SVI was defined as tumour infiltration of the seminal vesicles. The pathological T-stage (pT) was defined using the TNM classification [18].

Statistical analysis

The patients’ clinical characteristics were calculated and compared in two groups (organ-confined and extraprostatic disease) based on the histopathological results. Continuous variables including age, PSA, and percentage positive biopsy cores were compared using the Wilcoxon rank-sum test or a Student’s t test. A Fisher’s exact test or a chi-square test were used to compare the T-stage determined by DRE (cTDRE) and TRUS (cTTRUS), the cTMRI stage, the biopsy GS, and the D’Amico risk group.

The cTMRI was compared to pT for accuracy using weighted kappa statistics and a spearman rank order calculation. A receiver operating characteristic curve (ROC-curve) with an area under the curve (AUC) value was generated to analyse the predictive accuracy of the ECE risk scoring scale in detecting ECE at pathology. An optimal risk score cutoff level, representing the best trade-off between sensitivity and specificity, was determined for the more experienced reader A. In addition, the overall diagnostic accuracy, sensitivity, specificity, positive predicted value (PPV), and negative predictive value (NPV) of mp-MRI in predicting ECE by combining ECE tumour characteristics with personal opinion and in determining SVI were calculated for both readers. Inter-reader reliability was calculated using kappa statistics [19]. A p value below 0.05 was considered significant. Statistical analysis was performed using SPSS 20.0 software (IBM Corporation, U.S.).

Results

Ninety-three patients were prospectively enrolled. However, six patients were excluded due to mp-MRI technical problems or claustrophobia. The final study population of 87 patients with a median age of 65 (range 47–74) and a median PSA of 11 (range 4.6–45) underwent mp-MRI before RP. Clinical and demographic data are presented in Table 3. There was no significant difference in PSA, cTDRE, cTTRUS, or D’Amico risk group between patients with OC or EPE disease. There was a significant difference in age, percentage of positive biopsy cores, biopsy GS, and cTMRI between the groups.

Table 3 Comparison of the demographic data of the study cohort stratified by histopathological results into organ-confined (OC) and extraprostatic tumour extension (EPE)

Mp-MRI identified a tumour in all 87 patients. The correlation between cTMRI and pT showed spearman rho correlations of 0.658 (p < 0.001) and 0.306 (p = 0.004), with a weighted kappa of 0.585 [CI 0.44; 0.73] and 0.22 [CI 0.09; 0.35] for reader A and reader B, respectively. The inter-reader agreement for the readers in determining cTMRI was kappa = 0.44 [CI 0.2; 0.66]. The prevalence of EPE after RP was 32/87 (37 %), including 31/87 (36 %) with ECE and 5/87 (6 %) with SVI. One patient had SVI without concomitant ECE at pathology. Each ECE tumour characteristic (Fig. 2) was stratified into ECE risk score groups (Fig. 3). Mp-MRI ECE risk scoring for the more experienced reader A showed an AUC of 0.86 on the ROC curve (Fig. 4), with sensitivity of 81 % [CI 63; 93], specificity of 78 % [CI 66; 88], and diagnostic accuracy of 79 % at the optimal risk score cutoff level ≥4 (Table 4). Using this cutoff level, 6/31 patients with ECE were missed and 12/56 patients had a false-positive mp-MRI (Fig. 3; Table 4). When mp-MRI findings were dichotomised into either OC or ECE influenced by personal opinion, the false-positive rate dropped to 7/56 patients, at the expense of increased false-negative readings, where 8/31 patients with ECE were missed, producing diagnostic accuracy of 83 %, with sensitivity, specificity, PPV, and NVP of 74 % [CI 55; 88], 88 % [CI 76; 95], 77 % [CI 56; 90], and 86 % [CI 74; 94] for prediction of ECE, respectively. For the less experienced reader B, ECE risk scoring showed an AUC of 0.65 (Fig. 4) and sensitivity, specificity, and diagnostic accuracy of 51 % [CI 33; 70], 69 % [CI 52; 78], and 61 % at the risk score cutoff level ≥4 determined by reader A. The sensitivity, specificity, and diagnostic accuracy changed to 61 % [CI 0.42; 0.78], 77 % [CI 0.64; 0.87], and 71 % when ECE risk scoring was influenced by personal opinion. The inter-reader agreement kappa value between reader A and B was 0.40 [CI 0.19; 0.60] in distinguishing OC from ECE disease and 0.45 [CI 0.21; 0.68] in agreement of ECE risk scores. Reader A detected 4/5 (80 %) patients with SVI, and one patient with pT3a had evidence of SVI (T3b) on mp-MRI due to low T2W signal intensity caused by intraluminal SV infiltration with amyloid.

Fig. 2
figure 2

Tumour characteristics correlated to the histopathology in organ-confined (OC) or extracapsular tumour extension (ECE) clusters for readers A and B

Fig. 3
figure 3

ECE risk groups correlated to the histopathology in organ-confined (OC) or extracapsular tumour extension (ECE) clusters for readers A and B

Fig. 4
figure 4

Receiver operating characteristic curves (ROC) for extracapsular tumour extension (ECE) risk scoring for readers A and B

Table 4 Diagnostic performance of extracapsular tumour extension (ECE) risk scoring, depending on cutoff level and inclusion of personal opinion

Discussion

The prevalence of EPE at histopathology was 37 % in patients with clinically localised PCa, confirming the fact that DRE and TRUS often underestimate the tumour extension and stage.

The prognosis of PCa is highly related to the tumour stage. We found a significant correlation between the cTMRI stage and pT for the more experienced reader A using mp-MRI for clinical staging instead of DRE and TRUS. Complete agreement between cTMRI and pT is difficult to obtain, as mp-MRI does not identify all of the small dispersed insignificant tumour foci that frequently are present within the prostate and are incorporated into the pT assessment. This can easily cause a discrepancy between a T2a/T2b stage identified on mp-MRI and a T2c stage reported at pathology for OC tumours. However, the exact stage differentiation among patients with OC tumours is of less importance, as potentially curative surgery often can be offered regardless of T2-stage. In contrast, the treatment selection of PCa strongly relies on the distinction between OC (T2) and ECE (T3) disease. ROC curve analysis of ECE risk scoring showed a high AUC (0.86), indicating high clinical value of the scoring system when interpreted by a dedicated reader, who reached sensitivity and specificity of 81 % and 78 %, respectively, at best cutoff level ≥4. A cutoff at this level includes both direct sign of tumour growth through the capsule (risk score 5 - ECE highly likely to be present) and also indirect signs such as tumour bulging, loss of capsular signal, and neurovascular bundle thickening (risk score 4 - ECE likely to be present). If the equivocal cases with cutoff level ≥ risk score 3 (capsular retraction, irregularity, or thickening) are included, the sensitivity increases to 94 %, with a decrease in specificity to 68 %, but more interestingly, the NPV increases to 95 %, indicating high clinical value to almost definitively rule out ECE at this level. However, the inter-reader agreement kappa value (0.45) signifies only moderate consistency between the readers and indicates that there are differences in the image interpretation of the individual ECE tumour characteristics.

The purpose of the ESUR PIRADS classification is to introduce a structured uniform scoring system with less subjectivity in order to standardise prostatic mp-MRI readings and facilitate consistency between readers. This approach is well suited for PCa lesion detection and localisation, as each MRI modality in each suspicious lesion is scored independently, providing a summation of all individual scores. In addition, an overall final score (range 1-5) according to the probability of clinically significant PCa being present can be assigned. Similarly, the guidelines also recommend that the probability of extraprostatic disease should be scored on a five-point risk scale, and provide individual tumour characteristics with associated risk scores for ECE, SVI, distal sphincter, and bladder neck involvement. We only evaluated the ECE criteria in this study, as we considered the a priori probability of patients having SVI, distal sphincter, or bladder neck involvement in our population with clinically localised PCa to be too low to validate a five-point risk score. The five-point risk scale is considered a continuum of risk, with higher scores corresponding to higher probability of ECE. However, ECE risk scoring does not include a risk score = 2 or findings on functional imaging. The assessment of ECE tumour characteristics is based only on T2W imaging. Previous studies have shown that functional imaging may improve detection of ECE [2022], especially for less experienced readers [23]. Therefore, the interpretation and overall impression of possible ECE using the risk score scale may be influenced by personal opinion when incorporating functional imaging findings. This applies to our study, as the diagnostic accuracy (71 %–83 %) increased for both readers when incorporating personal opinion and functional imaging into the evaluation of ECE, and changed sensitivity and specificity to 61–74 % and 77–88 %, respectively. This overall diagnostic performance is in accordance with previous findings. A meta-analysis by Engelbrecht et al. [24] reported a combined sensitivity and specificity of 71 % in distinguishing between T2 and T3 disease on 1.5 T MRI systems. However, more recent studies at 3 T ERC MRI report higher rates, with sensitivity and specificity of 80–88 % and 95–100 %, respectively [25, 26].

The mp-MRI interpretation can be affected by the way the physician analyses the images when incorporating personal opinion, especially when deciding on possible ECE in equivocal cases. Until recently, RP was restricted to patients with localised PCa, while patients with high suspicion of ECE and/or SVI were referred for radiation therapy. This might influence the physician to value high specificity with a low number of false-positive readings, so no patients with equivocal mp-MRI findings would be ruled out of possible curative surgery. There has been an increasing interest in performing RP in selected patients with locally advanced disease, as some studies have shown promising results [2730]. If this tendency continues, the value of high-sensitivity readings increases, such that the surgeon is directed to the site of possible ECE to avoid positive surgical margins. Therefore, the ECE risk score cutoff level in Table 4 might be altered to either value high sensitive or high specific readings, depending on the clinical situation. Mp-MRI findings can then be combined with clinical findings and nomograms, and increase the overall pre-therapeutic diagnostic staging accuracy [31, 32].

We evaluated the reader performance of two readers with different experience in mp-MRI prostate interpretation. Overall, the more experienced reader A had significantly higher performance than reader B in the assessment of ECE, using both ECE risk scoring and personal opinion, and was more accurate in predicting the pathological stage. Furthermore, our results indicate that the overall interpretation of possible ECE in our hands should not rely only on ECE risk scoring in its current form, but whenever possible, should also incorporate functional imaging, especially for less experienced readers. It is evident that mp-MRI interpretation has a considerable learning curve, and is a specialised task that requires substantial dedication and experience to achieve acceptable diagnostic results [7, 33]. A dedicated reader education program on prostate mp-MRI interpretation is associated with a statistically significant increase in diagnostic accuracy [34].

This study was designed so as not to exclude any patients from RP based on any preoperative mp-MRI findings, in order to have the prostatectomy specimen from all patients as a standard reference for correlation. Further studies are now needed to address the clinical therapeutic consequences of performing staging of mp-MRI.

This study has some limitations. We included only patients with clinically localised disease at the time of surgery. This might have caused a selection bias, as all patients with clinically locally advanced disease had already been excluded from surgery – and therefore this study – which could cause an overestimation of mp-MRI specificity and an underestimation of sensitivity. Moreover, the readers knew that the patients had clinically localised disease and were not blinded to PSA during the mp-MRI readings; however, we find this more reflective of everyday clinical practice. We used only a pelvic-phased-array coil for staging purposes, and despite promising results at 3.0 T MRI, the use of an ERC might have improved image quality and staging accuracy [26, 35], and is recommended by the ESUR MR prostate guidelines [12] as part of optimal requirements.

Conclusions

Multiparametric MRI with ECE risk scoring by a dedicated reader is an accurate diagnostic technique for determining prostate cancer tumour stage and ECE at final pathology. However, further studies must investigate whether functional imaging should be included in the ECE risk scoring scale, and if so, how to weigh the individual findings in order to increase the diagnostic accuracy of the scoring system.