Introduction

Multiparametric MRI enriches prostate cancer diagnostics, providing visibility of suspicious tumor lesions and paving the way for TB and focal treatment. MRI/ultrasound fusion-guided prostate biopsy leads to increasing detection rates of clinically significant prostate cancer when compared to systematic biopsy (SB) [1, 2]. Application of the Prostate Imaging Reporting and Data System (PI-RADS) in 2012 and its updated version in 2015 allowed standardized mpMRI interpretation among different radiologists [3, 4]. Thus, specificity and sensitivity of mpMRI are reported to be up to 88 and 89%, respectively [5, 6]. Fusion biopsy is recommended by current guidelines for repeat biopsies in primary diagnostics and active surveillance protocols [7, 8]. Usage in biopsy-naïve patients is still under debate, but reveals promising results in recent randomized trials [7,8,9]. The increasing demand for fusion biopsies and the feasibility to perform them either with cognitive or software-based image fusion warrants MRI/ultrasound-guided biopsies be distributed evenly across healthcare providers rather than limited to a few experts in high-volume centers.

However, prostate fusion biopsy is more complex than regular SB, involving multiple steps that affect accuracy, such as image registration, precise ultrasound navigation, and biopsy sampling. Large series comparing TB and SB demonstrated that only the combination of both techniques yields the best diagnostic accuracy [2, 10]. There is a substantial risk of missing significant cancers by TB, demonstrated by the negative predictive values ranging from 0.65 to 0.94 [11].

When evaluating optimization of fusion biopsy, most studies focus on adequate acquisition of mpMRI, image interpretation, mpMRI/ultrasound fusion techniques, and the accuracy of biopsy sampling [1, 12,13,14,15]. In this study, we focus on a key step that is crucial for precise sampling of the region of interest (ROI) and is of utmost importance for the workflow in clinical and outpatient settings: reporting on MRI results varies greatly between radiologists as findings are provided either in a prose text, illustrated approximately on a sector map or delineated in the MRI sequence itself. It is common that urologists have to localize the target lesion in mpMRI only based on the description in written form without any illustration. This requires the urologist to be highly experienced in mpMRI interpretation. Therefore, we compared mpMRI target definitions by radiologists and urologists prior to MRI/TRUS fusion biopsy and analyzed whether differences affect tumor detection.

Materials and methods

Patients

Reasons for MRI/TRUS fusion biopsy comprised elevated PSA levels, abnormal DRE findings or confirmed cancer, either undergoing active surveillance or before focal HIFU treatment. Inclusion criteria were at least one suspicious lesion on mpMRI and the presence of a mpMRI report in a written form. We excluded 7 patients without a suspicious lesion on control mpMRI after focal HIFU therapy.

Multiparametric MRI and MRI/TRUS fusion biopsy

The whole-body 3.0 Tesla scanners Magnetom Skyra and Trio (Siemens Healthineers, Erlangen, Germany) were used for mpMRI in clinical routine. The acquisition protocol included high resolution T2-weighted sequences in transverse, coronal, and sagittal planes, diffusion-weighted imaging (b values of 50,400,800 s/mm2, additional b value of 2000 s/mm2 for Skyra) and dynamic contrast-enhanced (DCE) T1-weighted perfusion sequences.

All initial mpMRI analyses were routinely performed by uroradiologists and supervised by senior radiologists of at least 4 years of expertise in prostate MRI reading (n = 21 radiologists, this group was defined as “radiologist A”). Regions of interest (ROI) were classified according to PI-RADS classification v1 or v2 or as “suspicious”, depending on the time of acquisition. The reports were submitted to urologists in written form without marking the ROIs in MRI images. A standard written report included information on the ROI localization in T2w transversal sequence according to the PI-RADS sector map or by an anatomical description, results of diffusion-weighted imaging and DCE, and lymph node or bone metastasis.

The biopsy platform Artemis™ (Eigen, California, USA) was used for MRI/TRUS fusion-guided prostate biopsies. After import of the mpMRI raw data, surgeons used the proprietary ProFuse™ software (version 3.0.7.1) to mark the prostate contours in various slices. An automatic interpolation completed the contouring before a manual redefining was possible. Subsequently, all regions of interest were marked by an exact contouring in the T2w sequence (Fig. 1). Afterwards, TB followed by 12-core SB was performed as previously described [16, 17].

Fig. 1
figure 1

Contouring the ROI in the ProFuse Software (Eigen, USA) prior to image fusion. a Delineation of the prostate and ROI (red) in transverse T2w sequences. b 3D model of the prostate to visualize correct delineation and ROI definition

Image processing

One expert uroradiologist with more than 4 years training in prostate mpMRI reading (J.B., defined as “radiologist B”) resurveyed all 202 mpMRI images and contoured the lesion in T2w sequences based on the previous MRI report and all available sequences of the respective study using the Osirix software (Pixmeo SARL, Swiss). Radiologist B was blinded to the biopsy results. A lesion was always marked in all the transverse T2w slices where it was identifiable.

We compared the area of the ROIs marked by the radiologist and urologist by a software algorithm that recognized each contour. ROI volumes were calculated by addition of the voxel volume of each slice. Subsequently, we compared the ROI contouring and calculated the matching ROI volume of both observers (defined as common ROI volume = cROI volume) (Fig. 2).

Fig. 2
figure 2

T2w sequences with ROI marked by the radiologist (a) and urologist (b). Automated extraction of ROI contours (c) was followed by merging of the MRI sequences (d). The turquoise overlapping part of both ROIs represents the common fraction

In addition, we evaluated whether contouring a ROI in the wrong MRI slice (i.e., shifted in the cranio-caudal axis) impaired the cROI volume. Therefore, we merged all transverse MRI slices irrespective of slice order where radiologist B and urologist marked the ROI. We thus created a projection of the marked ROIs which ignored any cranio-caudal error in ROI contouring. The resulting cROI volumes were calculated (Fig. 3).

Fig. 3
figure 3

Schematic description of the merging process for patients without any cROI volume: When ROIs were defined in different T2w slices, all MRI images were merged independently of the slice. Light oval: prostate, red oval: ROI defined by radiologist, green oval: ROI defined by urologist

Statistical analysis

Cohen’s Kappa statistic was used for calculation of interobserver variability between radiologist A, radiologist B, and urologists. Wilcoxon rank sum test was used for statistical comparison of the median ROI volumes. We calculated intraclass correlation coefficient (ICC) to evaluate variance of the volume measurements. Potential predictors for a cancer-positive TB were evaluated by bivariate analysis. For approximately normally distributed data, two sample t tests have been used to compare the mean values of two groups. For skewed variables, Mann–Whitney U tests were performed instead. For the comparison of qualitative parameters, Chi square test or Fisher’s exact test have been applied. Significant variables were included for a multivariate logistic regression model to determine their independent prognostic value. Analyses were performed using SAS® 9.3 software and statistical significance was set at p < 0.05.

Results

Patient characteristics

In total, 202 patients received an mpMRI of the prostate and a consecutive MRI/TRUS fusion biopsy between December 2014 and December 2016. Of those, 46.5% received re-biopsy after a prior negative biopsy, 24.3% had a prior confirmed cancer, and 29.2% were biopsy-naïve. Patient characteristics are provided in Online Resource 1. Five patients with a suspicious control mpMRI 12 month after HIFU therapy were also included. For the comparison of ROI volumes, the 165 patients with an equal number of ROIs per evaluation were included. In 157 of the 165 patients, the ROIs or part of them were marked in the same prostate lobe and zone by both observers. In 7 patients, the prostate zone and in one patient the lobe were different. Additionally, for bi- and multivariate analysis, another 11 patients were excluded because their suspicious lesion was not classified according to PI-RADS (6 patients with proven cancer in TB and 5 patients without).

Multiparametric MRI results and definition of ROIs

The mpMRI analysis by radiologist A revealed 239 lesions in 202 patients, with a median of one lesion per patient (range 1–3). In more than a third of all patients (34.2%), the index lesion was classified as PI-RADS 3 (Table 1). Index lesions had a median diameter of 12 mm (5–45 mm). We compared if mpMRI interpretation differed between all observers. Urologists contoured the highest number of ROIs compared to radiologist A and radiologist B (261 vs. 239 vs. 223). This resulted in a substantial interobserver variability (κ = 0.41–0.65). In contrast, the localization of ROIs according to the PI-RADS sector maps differed less between the three observers (κ = 0.76–0.84) (Table 2) [4].

Table 1 Results of mpMRI and MRI/TRUS fusion biopsy (n = 202 patients)
Table 2 Agreement of ROI count and localization between three observers using Cohen’s kappa coefficient (κ)

Urologists marked significantly larger median ROI volumes than radiologists in the 165 patients with an equal number of ROIs per evaluation (p = 0.045). The median cROI volume of urologist and radiologist B was 0.24 cm3 (0–11.44 cm3), which represents a median fraction of 43.86% of the radiologists’ ROI volume (Fig. 4). Intraclass correlation coefficient was 0.84 for variances of volume measurements.

Fig. 4
figure 4

Median ROI volumes by radiologist and urologist were significantly different. A median fraction of 43.86% of the radiologists’ ROI volume matched with the urologists’ volume (cROI volume)

Biopsy results

In the 165 patients, the overall cancer detection of combined TB (a median of 2 TB per ROI; range 1–8) and 12-core SB was 68.5%. TB was positive in 52.0 and 51.9% in a per patient- and a per lesion-based analysis, respectively (Table 1).

In the subgroup analysis of 154 patients with either a positive or negative TB and classification by PI-RADS (Table 3), the cROI volume was higher in the group of positive TB. Bivariate analysis revealed ROI volume of the radiologist, cROI volume, prostate volume and PI-RADS score as significant predictive variables for a cancer-positive biopsy. In a multivariate logistic regression model, a smaller prostate volume and higher PI-RADS score remained independent factors (Table 4).

Table 3 Patient characteristics separated by target biopsy result
Table 4 Bivariate and multivariate analysis for cancer-positive targeted biopsy

SB revealed cancer in 63.9%. In 67.8% of the 90 cancer-positive ROIs, at least one SB sample was positive in the corresponding prostate sextant. The systematically obtained cancer-positive biopsy core within this sextant was a significant predictor of a positive TB in bi- and multivariate analysis (p ≤ 0.001) (Online Resource 2).

Relevance of MRI image slice

Target definition differed entirely between the radiologist and urologist in 38 patients (23%) and no cROI volume was measurable. As selecting the wrong T2w slice for target localization could be a potential bias in these patients, we merged all slices where radiologist and urologist marked the ROI. This resulted in an increased cROI volume fraction of 67.1% and reduced the number of patients without any cROI fraction to 10 patients (6.1%). TB was positive in 8 of 38 patients (21.1%), whereas SB was positive in 19/38 patients (50%). Of those, four cancers were missed by TB but found by at least one SB core in the corresponding prostate sextant (Online Resource 3).

Discussion

The fundamental prerequisite prior to MRI/TRUS fusion and targeted biopsy is the exact localization of the MRI visible lesion. ESUR guidelines recommend “preferably using a structured reporting scheme”, that includes PI-RADS score, localization and extraprostatic extent, as well as incidental findings [3, 4]. However, there is uncertainty on how to transmit MRI reports to urologists and whether radiologists or urologists should mark the target in MRI sequences [18]. Therefore, this study compared ROI definitions by radiologists and urologists and determined the effect of different observers on prostate cancer detection.

We observed that interobserver variability between the first reading radiologists, the resurveying radiologist and the urologists was higher for the total number of marked ROIs than for localization according to the sector map. Urologists marked more ROIs compared to the MRI report and to radiologist B, which is likely due to their interpretation of additional areas that appear suspicious but do not fulfill all PI-RADS criteria. In addition, correspondence with various radiologists immediately before biopsy to confirm correct ROI localization could have led to different reading results. It was shown that MRI interpretation and biopsy sampling underlie a learning curve [19, 20]. This is a further explanation for both variances between urologists and radiologists, and between radiologist A and B.

Target definition not only differed in terms of number and localization, but also in volume. Urologists tend to mark larger volumes. It is conceivable that this is justified by a safety strategy to avoid undersampling of the suspicious region. Underestimation of tumor size in mpMRI images was previously demonstrated, particularly in the context of focal therapies, leading to incomplete tumor ablation [21]. But, marking an oversized ROI around a small tumor might result in missing the tumor by inaccurate biopsy sampling. Since ROI volume data was not normally distributed, we used bar charts for comparison instead of a Bland–Altman plot. ICC (0.84) revealed that measurement variance certainly was based on observer disagreement.

When comparing cROI volume between the different observers with regard to biopsy result, the subgroup of patients with a TB positive for cancer had a higher conformity between radiologist and urologist. Vice versa, discrepancies in target definitions appear to impact cancer detection. Bivariate analysis confirmed the cROI volume as a statistically significant predictive variable for a positive biopsy result (p < 0.001). However, in our cohort, only 43.9% of the urologists’ ROI volumes matched with those of the radiologists. The most likely reason is a misjudgment of morphological variances or distinct signal intensities appearing like suspicious ROIs but lacking malignancy criteria. Hence, it is likely that in inconclusive MRI results, neither a written nor oral description of the ROI would adequately enable an unversed urologist to define the “true” ROI. We further hypothesized that MRI reports frequently lack information on MRI series and slices for ROI localization. Subsequently, if the urologist marks the ROI in an incorrect slice, this could affect hitting the target. We merged MRI images independently of the T2w slice and found that the cROI volume could be considerably increased. Thus, ROIs had been marked similarly in transverse planes but were located differently along the cranio-caudal axis. In the patients without any cROI volume, this could have resulted in cancers detected by SB in the same prostate sextant that were missed by TB in four cases.

We performed a multivariate logistic regression analysis in which, in contrast to bivariate analysis, large prostate volumes, and a lower PI-RADS score were the only independent predictors of positive biopsy results. To further investigate why cROI volumes remained statistically insignificant, we evaluated whether tumor extents were higher than expected in mpMRI and thereby more likely to be detected, and assumed a positive SB in the same prostate region as a surrogate marker. In more than two-thirds of all ROIs where TB was positive, at least one SB core in the same region was positive. We concluded that, in these patients, precise ROI definition and TB played a minor part, since large tumor volumes allowed detection by SB. A different explanation is that large prostate volumes and lower PI-RADS scores complicate target localization and selection for both radiologists and urologists. Previous studies demonstrated increasing detection rates in lower prostate volumes and for lesions with higher PI-RADS scores [10, 22].

These results are limited due to the lack of histopathological alignment as a reference for appropriate ROI definition. In prostatectomy specimens, the exact tumor localization could be compared to the assumed ones in mpMRI. However, we performed an extensive image analysis and correlation to clinical results, proving the need for optimization of MRI reporting in the future. In addition, results from both the MRI reports and contouring of the targets, originate from different observers. Their variety of expertise may lead to a bias, but this study intended to represent the clinical routine where typically MRI reporting and biopsy sampling are not restricted to the hands of single experts.

Conclusion

ROI definition by urologists based on a prose MRI report is common prior to MRI/ultrasound fusion biopsy. A comparison of ROI definition in the original MRI report by a reviewing uroradiologist and by an urologist revealed remarkable discrepancy in MRI contouring. The impact of an accurate target definition on cancer detection via biopsy was demonstrated by a higher conformity between the observers in targets positive for cancer. Thus, MRI reporting in written form might lack detailed information for urologists without expert knowledge. Accuracy of targeted biopsies would profit from ROI contouring in MRI sequences by the radiologist. If not feasible, improved MRI reporting should include specific information on ROI localization in sequence and slice accompanied by an illustration. However, large prostate volumes and lower PI-RADS scores have to be considered as most limiting variables affecting biopsy results.