Introduction

According to the current guidelines of the European Association of Urology 2013, MRI is generally the method of choice for local staging of prostate cancer (Pca) [1].

Currently, one of the most important aims is to increase time- and thus cost-effectiveness of the examination in order to provide multiparametric (mp)-MRI on a larger scale, potentially allowing for a broad implementation as a screening tool [2].

To date, a lot of experience has been needed to accurately detect and characterise lesions owing to complex mp-MRI protocols. To use mp-MRI broadly, standardisation of diagnostic quality is crucial [3]. Currently, there are increasing efforts to standardise reports on mp-MRI. In this context, the PI-RADS criteria were established in 2012 [4]. Since then, the literature has confirmed the T2-weighted sequence (T2w) to be the most valuable technique for characterising transitional zone lesions, whereas diffusion-weighted imaging (DWI) appears to be most suitable for lesion characterisation in the peripheral zone. As a result of these observations, the PI-RADS criteria were revised in 2015 and PI-RADSv2 was established [5, 6]. In consequence, dynamic contrast-enhanced imaging (DCE) was downgraded to only differentiate between PI-RADS 3 and 4 in the peripheral zone. Targeted biopsy is usually recommended for both scores, so the additional value of this sequence and its associated contrast agent administration for detection purposes is questionable, especially considering current contrast agent safety concerns [7,8,9].

Low b-value imaging lacks accuracy due to ‘T2-shine-through’ effects, whereas ultra-high b-values (>1,200 s/mm²) almost exclusively display tissue properties (e.g. high cell density) and have been confirmed to be beneficial for lesion detection [10, 11]. In the past, it was not feasible to measure ultra-high b-values with an acceptable image quality. Long echo times (ETs) were required, which caused a significant loss of signal-to-noise ratio (SNR). To reduce these effects, several b-values with an acceptable SNR were generally acquired and virtual high b-values were extrapolated mathematically [12]. New MRI systems with the option of two-channel excitation allow for the use of zoomed single-shot echo-planar imaging (EPI) executed with a reduced field of view (FOV) in the phase-encoding direction. This technical innovation has led to significant improvements in image quality and robustness of DWI in various organs [13, 14], including the prostate, due to substantial reduction of off-resonance and image distortion artefacts [15, 16]. Moreover, ultra-high b-values (e.g. parallel transmit zoomed b = 2000 s/mm² sequence (b2000)) may be measured with good image quality and sufficient SNR within acceptable scan times.
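The extrapolation of virtual high b-values mentioned above can be illustrated with a minimal sketch. It assumes a mono-exponential signal model, S(b) = S0 · exp(−b · ADC), which is a common simplification and not necessarily the model used in the cited work. The function names and the voxel intensities are hypothetical and serve only as an illustration.

```python
import math


def estimate_adc(signal_low, b_low, signal_high, b_high):
    """Mono-exponential ADC (mm^2/s) from signals at two b-values (s/mm^2)."""
    if signal_low <= 0 or signal_high <= 0 or b_high <= b_low:
        raise ValueError("signals must be positive and b_high must exceed b_low")
    return math.log(signal_low / signal_high) / (b_high - b_low)


def computed_signal(signal_low, b_low, adc, b_target):
    """Extrapolate the signal at b_target from a measured signal at b_low."""
    return signal_low * math.exp(-(b_target - b_low) * adc)


# Hypothetical voxel: signal intensities measured at b = 50 and b = 800 s/mm^2.
s50, s800 = 1000.0, 472.4
adc = estimate_adc(s50, 50, s800, 800)
s2000 = computed_signal(s50, 50, adc, 2000)
print(f"ADC = {adc:.2e} mm^2/s, computed b2000 signal = {s2000:.1f}")
```

Because the extrapolated signal inherits any error in the ADC estimate, noise in the acquired low b-value images propagates into the computed image. At ultra-high b-values real tissue also tends to deviate from the mono-exponential assumption.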

In addition to standardisation of lesion characterisation in mp-MRI, the precise description of lesion locations has become particularly important, because MR-images are now increasingly used for MR-guided biopsies and ablative procedures such as MR-guided focused ultrasound (HIFU) [17, 18]. A 15-segment model has been proposed to improve accuracy of lesion documentation [19], which has been implemented in the prostate-imaging reporting and data system version 2 (PI-RADS v2) recommendations. This model was also used in this study to guide recommended biopsies of detected lesions.

The purpose of this study was to evaluate the diagnostic performance of a comprehensive, non-contrast mp-MRI approach consisting of T2w and b2000 for lesion detection among readers with different degrees of experience in mp-MRI of the prostate with a histopathological gold standard.

Methods

Study population

Between March 2014 and March 2016, 93 consecutive patients with a mean age of 67 ± 8.53 years (range 45–88 years) and a mean prostate-specific antigen (PSA) of 14.2 ± 18.37 ng/ml (range 0–108 ng/ml) who underwent mp-MRI at our institution were enrolled in this study. All examinations were clinically indicated. Patients were referred due to elevated PSA levels and/or suspected Pca. Data collection was carried out after Institutional Review Board approval was obtained.

Definition of clinically significant cancer

Significant cancer was defined as having a Gleason score (GS) >3 + 4 (7a) according to previous publications [20, 21].

MR imaging

A state-of-the-art mp-MRI was performed on a whole-body 3 T scanner with two-channel fully dynamic parallel transmit capability (Magnetom Skyra with TimTX TrueShape, Siemens, Erlangen, Germany). The acquisition protocol consisted of high-resolution T1- and T2-weighted sequences, DCE and a zoomed DWI according to a study by Attenberger et al. [15].

b2000 was measured as a separate sequence with a total scan time of 2 min and 45 s. A two-dimensional spatially-selective RF pulse using an echo-planar transmit trajectory identical to that described by Attenberger et al. was used, and the FOV was reduced to one-third. Table 1 details the imaging parameters.

Table 1 The most relevant imaging parameters of b2000

Image analysis

Evaluation of the anonymised images was carried out using OsiriX DICOM viewer (OsiriX 3.9.4; The OsiriX Foundation, Geneva, Switzerland). All data were interpreted by three radiologists (reader A: 7 years’ experience in abdominal MRI; reader B: 4 years’ experience in abdominal MRI; reader C: <1 year’s experience in abdominal MRI) in a blinded, randomised fashion.

Lesion detection and diagnostic confidence

First, all three readers independently evaluated a black-and-white inverted b2000 in all datasets regarding presence of suspicious lesions. Second, T2w and b2000 images were fused to accurately describe lesion location according to an established 15-segment model. Morphological criteria such as ‘erased charcoal sign’ in the transitional zone were not used to characterise lesions. Only lesions that displayed a higher signal (approximately >20%) than healthy background on b2000 were recorded for statistical evaluation. The threshold of 20% was estimated empirically in clinical routine. Lesions that showed a lower contrast were not clearly distinguishable from the background due to noise. Moreover, the overall diagnostic confidence of all three readers was recorded (1 = poor; 2 = moderate; 3 = excellent) and compared.

Signal intensity (SI) lesion/background

SI was measured in lesions and healthy background in all histopathologically proven (true positive) Pca cases by drawing regions of interest (ROIs) of the same size into lesions and morphologically healthy prostate tissue without increased signal on b2000. The size of ROIs varied depending on tumour size. Evaluation was performed solely by Reader A.

Signal-to-noise ratio (SNR)

Quantification of SNR was performed using multiple pseudo replicas by adding synthetic noise with the same properties as in the original data set. For one patient, the raw data of b2000, as well as a noise scan, were exported from the scanner for SNR quantification. SNR maps of the source images acquired with a b-value of 2000 s/mm² were generated in Matlab (The MathWorks Inc., Natick, MA, USA).
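The pseudo-replica idea can be sketched in a strongly simplified form. The toy example below is not the Matlab implementation used in the study: it assumes a single receive channel, a one-dimensional profile, white Gaussian noise of known standard deviation and a plain inverse Fourier transform as the reconstruction. All names and values are hypothetical.

```python
import cmath
import math
import random
import statistics


def dft(samples, inverse=False):
    """Unitary discrete Fourier transform of a 1D sequence (O(N^2), toy sizes only)."""
    n = len(samples)
    sign = 1 if inverse else -1
    scale = 1 / math.sqrt(n)
    return [
        scale * sum(x * cmath.exp(sign * 2j * math.pi * k * m / n)
                    for m, x in enumerate(samples))
        for k in range(n)
    ]


def pseudo_replica_snr(kspace, noise_sigma, n_replicas=200, seed=0):
    """Per-pixel SNR from repeated reconstructions with added synthetic noise."""
    rng = random.Random(seed)
    reference = dft(kspace, inverse=True)  # reconstruction of the unmodified data
    replicas = []
    for _ in range(n_replicas):
        # Add independent complex Gaussian noise to every k-space sample.
        noisy = [s + complex(rng.gauss(0, noise_sigma), rng.gauss(0, noise_sigma))
                 for s in kspace]
        replicas.append(dft(noisy, inverse=True))
    snr = []
    for pixel in range(len(kspace)):
        # Noise level = standard deviation of the real part across replicas.
        noise_sd = statistics.stdev(rep[pixel].real for rep in replicas)
        snr.append(abs(reference[pixel]) / noise_sd)
    return snr


# Hypothetical uniform profile with signal 10 and unit noise: SNR should be near 10.
profile = [10.0] * 16
snr_profile = pseudo_replica_snr(dft(profile), noise_sigma=1.0)
print(f"median SNR = {statistics.median(snr_profile):.1f}")
```

In a real multi-channel acquisition the synthetic noise would need to reproduce the noise covariance measured in the noise scan, and the full reconstruction pipeline would be applied to each replica.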

Real-time MR/ultrasound (MR/US) fusion and systematic biopsy

All men underwent real-time transrectal MR/US fusion biopsy of mp-MRI suspicious lesions first and then a systematic 12-core biopsy. The biopsy was performed on the ultrasound platform HiVisionPreirus (Hitachi Medical Systems, Tokyo, Japan) under general anaesthesia. The system uses a sensor-based registration, tracking the movement of the transrectal ultrasound (TRUS) probe through a low magnetic field (0.1 Tesla) and a sensor applied to the probe. After the identification of anatomical landmarks, the identical MR and the real-time TRUS image planes are fused [22]. The suspected lesions on mp-MRI were marked in the T2-weighted axial sequence jointly by a urologist and a radiologist prior to the procedure. The number of MR/US fusion biopsies was adjusted to the size of the target lesion [1,2,3]. The systematic 12-core TRUS-guided biopsy was performed without guidance of the mp-MRI. Histopathology analyses were performed by experienced uro-pathologists according to the 2005 International Society of Urological Pathology (ISUP) Consensus [23]. The GS of MR/US fusion biopsy served as a gold standard for the comparison with SI ratios.

Statistical analysis

Statistical evaluation was carried out on a patient-by-patient basis using SAS (SAS Version 9.3). Means and standard deviations of lesion and background SI are reported. Cohen’s Kappa was utilised to assess inter-reader agreement of reader pairs on lesion detection and diagnostic confidence. Sensitivity (SS), specificity (SP), positive predictive value (PPV) and negative predictive value (NPV) of all readers were recorded for all cancers (independent of GS). Moreover, detection rates for significant cancers (GS >7a) are reported.
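As an illustration of the agreement statistic, the following minimal sketch computes unweighted Cohen’s Kappa for one reader pair. The analysis in this study was performed in SAS; the function and the ten ratings below are hypothetical and are not study data.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's Kappa for two raters scoring the same cases."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must score the same non-empty set of cases")
    n = len(ratings_a)
    # Observed agreement: fraction of cases with identical ratings.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the raters' marginal frequencies per category.
    counts_a, counts_b = Counter(ratings_a), Counter(ratings_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n ** 2
    if expected == 1:
        raise ValueError("Kappa is undefined when both raters use one identical category")
    return (observed - expected) / (1 - expected)


# Hypothetical per-patient ratings (1 = suspicious lesion, 0 = no lesion).
reader_a = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
reader_b = [1, 1, 1, 1, 1, 0, 0, 0, 0, 1]
kappa_ab = cohens_kappa(reader_a, reader_b)
print(f"kappa = {kappa_ab:.2f}")
```

The same function applies to the three-level diagnostic confidence score, since the unweighted statistic treats every category as nominal.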

Results

Distribution of GS

In 62 patients, Pca was confirmed histopathologically. Among those, 32 (52%) had a GS of 6 (3 + 3); 25 (40%) had a GS of 7 (13 (21%) 3 + 4; 12 (19%) 4 + 3); two (3%) had a GS of 8 (one (2%) 3 + 5; one (2%) 4 + 4); and three (5%) had a GS of 9 (5 + 4).

Lesion detection

In 11 cases, Pca was missed by one or more readers (Reader A: 6; Reader B: 8; Reader C: 9). Among 17 significant cancers (GS >7a), only one Pca (GS 4 + 3 (7b)) was missed, and only by Reader C. False-positive ratings occurred in nine (Reader A), five (Reader B) and eight (Reader C) cases, respectively (Table 2a and b) (Figs. 1, 2 and 3).
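The per-reader accuracy measures follow from these counts. The sketch below is an illustration only; it assumes a per-patient analysis in which the 93 enrolled patients comprise 62 with and 31 without histopathologically confirmed Pca, and in which each missed cancer counts as one false negative. The function name is hypothetical.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, PPV and NPV from a 2x2 table of counts."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


N_PATIENTS = 93   # enrolled patients
N_CANCER = 62     # histopathologically confirmed Pca
# (missed cancers, false-positive ratings) per reader, as stated in the text.
READER_COUNTS = {"A": (6, 9), "B": (8, 5), "C": (9, 8)}

metrics_by_reader = {}
for reader, (missed, false_pos) in READER_COUNTS.items():
    tp = N_CANCER - missed
    tn = (N_PATIENTS - N_CANCER) - false_pos  # assumes all remaining patients were biopsy-negative
    metrics_by_reader[reader] = diagnostic_metrics(tp, false_pos, missed, tn)
    summary = ", ".join(f"{name} {value:.1%}"
                        for name, value in metrics_by_reader[reader].items())
    print(f"Reader {reader}: {summary}")
```

Under these assumptions the values, after rounding to whole percentages, fall within the ranges quoted in the Discussion (SS 85–90%, SP 71–84%, PPV 86–92%, NPV 72–79%).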

Table 2 Lesion detection. Overall SS, SP, PPV and NPV of the T2/b2000 approach (a) was good. All three readers achieved excellent detection rates (b). One significant Pca (Gleason score (GS) 7b (4 + 3)) was missed by Reader C only
Fig. 1

Images of a 72-year-old patient with a suspicious lesion in the peripheral zone of the left apex (10p). Diagnosis of prostate cancer (Gleason score 6 (3 + 3)) was confirmed by MR-guided biopsy. Notice the impressive background suppression of black-and-white inverted b2000 (lower right) and the perfect delineation of the lesion after image fusion with a morphological, high-resolution T2w sequence

Fig. 2

Images of a 75-year-old patient with an initial prostate-specific antigen (PSA) of 19.6 ng/ml. Only the index lesion in segment 2p was confirmed histopathologically (GS 7b (4 + 3)) and rated positive on b2000 evaluation. Retrospectively, b2000 (right) depicted all three lesions. Lesions were confirmed by an additional Ga68-PSMA PET/CT. Notice the good correspondence between Ga68-PSMA PET/CT (left) and black-and-white inverted b2000 (right) image

Fig. 3

Images of a 79-year-old patient with an initial prostate-specific antigen (PSA) of 14.6 ng/ml (May 2013). PSA levels continuously increased over time (15.7 ng/ml in June 2014 and 16.7 ng/ml in August 2014). The lesion in segment 4a/p was rated as PI-RADS 5 on MR images acquired in November 2014 due to restricted diffusion with focally reduced ADC values and increased signal on b2000, low T2 signal and pathological DCE with increased blood flow (PF) and wash-out curve. Negative MR/US fusion biopsy results were obtained in January 2015. The referring physician was informed immediately after these results came to our attention. A PSA value of 21.1 ng/ml was confirmed in March 2016 and re-biopsy was scheduled

In five cases, no readers detected histopathologically proven Pca. Among those, four had a GS of 6 (3 + 3) and one a GS of 7 (3 + 4). One GS 6 (3 + 3) was missed by Readers A and C and another GS 6 (3 + 3) by Reader B and C. Two GS 7 (3 + 4) were not detected by Reader B only. Reader C did not detect two additional GS 7 cases (3 + 4 and 4 + 3).

Inter-reader agreement and diagnostic confidence

Inter-reader agreement for detection of lesions was good (Table 3). Moreover, we observed a generally high diagnostic confidence in all three readers with a moderate-to-good agreement of all readers (Table 4).

Table 3 Inter-reader agreement, which was good. For comparison of reader pairs Cohen’s Kappa was used
Table 4 Diagnostic confidence of all readers, which was generally high with a score of >2 in the majority of cases. For comparison of reader pairs Cohen’s Kappa was used

Signal intensity (SI) lesion/background

SI of lesion and background was measured in 56 histopathologically proven cases by Reader A only. SI was higher in lesions (14.2 ± 4.0) than in healthy background (8.9 ± 2.4), although a clear overlap of values was observed (Fig. 4).
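To relate these group means to the empirical 20% detection threshold described in the Methods, the relative signal excess can be computed as follows. This is a minimal illustration with a hypothetical function name; it uses the ratio of the reported group means and not per-lesion ratios, which were not reported here and may differ.

```python
def relative_contrast(lesion_si, background_si):
    """Fractional signal excess of a lesion over background (0.2 means 20% higher)."""
    if background_si <= 0:
        raise ValueError("background signal must be positive")
    return (lesion_si - background_si) / background_si


# Group means reported above (arbitrary SI units).
mean_lesion_si, mean_background_si = 14.2, 8.9
contrast = relative_contrast(mean_lesion_si, mean_background_si)
exceeds_threshold = contrast > 0.20  # empirical 20% threshold from the Methods
print(f"relative contrast = {contrast:.0%}, above threshold: {exceeds_threshold}")
```

On average the lesions were therefore well above the threshold, although the overlap of individual values means that single lesions can lie much closer to the background signal.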

Fig. 4

Lesion signal intensity (SI) (left) was higher than background SI (right). Severe overlap of values with SI (background) was observed

Signal-to-noise ratio (SNR)

We observed a relatively poor SNR distribution on the generated SNR map. Nevertheless, prostate and lesions were well delineated in most cases, which made lesion depiction easy independent of the degree of experience (Fig. 5a and b).

Fig. 5

(a and b) Signal-to-noise ratio (SNR) maps show the expected severe noise with low SNR. However, the prostate is clearly delineated and small, diffusion-restricted lesions can be readily distinguished from surrounding healthy tissue. Nevertheless, due to prominent noise, b2000 should be measured as a separate sequence and not be used to calculate apparent diffusion coefficient (ADC)

Discussion

Multiple studies have confirmed the value of DWI of the prostate to detect Pca-typical cell density increases and to quantify the extent of restricted diffusion by the apparent diffusion coefficient (ADC) [24,25,26,27,28]. Many of these studies demonstrated a better performance compared with T2w images for lesion detection, through the use of DWI either as a single measurement or in combination with standard morphological sequences. Turkbey has determined that the combination of T2w and DWI achieved a SS in the range of 45–89% and a SP in the range of 61–97%, compared to 74–85% and 57–95% for DWI as a single sequence or 25–87% and 57–92% for T2w [28]. A recent Lancet study by Ahmed comparing the detection of clinically significant cancer (GS >3 + 4) by means of mp-MRI and TRUS-biopsy determined a SS/SP/PPV/NPV of 93/41/51/89% for mp-MRI compared to 48/96/90/74% for TRUS-biopsy in a large patient population with 230 significant cancers [20].

The results of this study reveal a very similar performance of a shortened protocol consisting of T2w and b2000, independent of the experience level of the readers (SS 85–90%/SP 71–84%/PPV 86–92%/NPV 72–79%). Detection rates for significant cancers (GS >3 + 4) were excellent (Readers A and B: 100%; Reader C: 94%).

In addition to high detection rates for prostate cancer, especially in the peripheral zone, DWI is particularly useful for evaluation of tumour aggressiveness (GS) and thus to identify clinically significant tumours for which therapy is necessary. The histopathological evaluation of prostate cancer aggressiveness is one of the most significant prognostic aspects used in predicting patient outcomes and disease-free survival. Underestimation of the final pathologically proven GS by means of TRUS-guided biopsy is a well-known problem. A 2001 study confirmed that GS was underestimated in 46% and overestimated in 18% of cases [26]. Well-differentiated tumours maintain their tubular architecture whereas more cellular components dominate aggressive cancers. High cellular density leads to a restriction of the random motion of water molecules, an attribute that can be quantified with ADC values [27]. A number of recently published articles have demonstrated an inverse correlation between the ADC and the final GS after prostatectomy [29,30,31]. Therefore, additional measurement of low b-values for the calculation of the ADC value still seems advisable. This should be done in a separate sequence, as prominent noise in b2000 (Fig. 5a and b) can lead to calculation of falsely low ADC values. An accurate estimation of ADC values is particularly crucial in active surveillance regimes for early detection of malignant transformation of formerly low-grade tumours.

If one would like to offer mp-MRI of the prostate on a larger scale as a screening tool, the examination would have to be short and thus cost efficient; on the other hand, the mp-MRI protocol should be safe and the accuracy of the reports generally high among readers with different degrees of experience. Hoeks concluded in a review published in 2009 that mp-MRI was not suitable as a primary screening test due to its high costs and limited availability [32]. A protocol without application of contrast media would be significantly cheaper. A comprehensive protocol consisting of T2w, DWI (e.g. b = 0, 50, 400 and 800 s/mm²) for ADC-calculation and a separate b2000 could reduce the scan time to less than 15 min. Computation of b2000 from low b-value datasets may yield similar performance to that of acquired DWI for lesion detection as well as differentiation between intermediate–high- and low-risk prostate cancers, and may facilitate further decreasing examination costs [33, 34].

In addition, there is currently an extensive discussion on cerebral contrast agent deposits, especially in the dentate nucleus and the globus pallidus, of uncertain pathological significance [8]. The macrocyclic contrast agents that we use as standard in our institute seem to be less affected than linear contrast agents [9]. Nevertheless, taking into account the current debate, administration of contrast agent should presumably only be performed when truly necessary.

One of the specific aims of PI-RADSv2 was to “educate radiologists on prostate MRI reporting and reduce variability in imaging interpretations.” In our study, we were able to achieve an excellent agreement and high confidence of readers with different levels of experience. Even a beginner missed only one clinically significant Pca (GS 7b (4 + 3)). The overall performance of the beginner was similar to that of the two experienced readers. Moreover, b2000 provides transparent results for both patients and referring physicians, owing to simplified image interpretation compared to more advanced mp-MRI protocols.

Our study has some limitations. First, the technique requires a 3 Tesla scanner with the possibility of parallel radiofrequency excitation. Devices of this type are, so far, mainly available in large academic centres. Second, the histopathological gold standard itself is limited: MR/US fusion-guided biopsy and standard 12-core biopsy, performed after fusion of T2w and ultrasound images, may yield false-negative histopathological results.

In summary, b2000 is a promising technique for standardising the diagnostic quality of mp-MRI among readers with different levels of experience. This technique, in combination with a T2w and a standard DWI for ADC-calculation, could be sufficient as a screening tool to reliably exclude clinically significant Pca in a cost-efficient, safe, native protocol independent of the degree of experience of readers. Computation of b2000 or even higher b-values may help to further decrease examination costs. Supplementary advanced mp-MRI protocols, including DCE, could then be performed only in patients with detected lesions to further evaluate the disease burden (particularly for evaluation of extracapsular extension [35, 36]).