Introduction

Focal treatments of prostate cancer (PCa) are expected to become a standard option for selected patients [1], trying to limit the side effects of radical treatment such as surgery or external beam radiation therapy (ERBT). Among them, high-intensity focused ultrasound (HIFU) ablation has been the most evaluated since the early 1990s [2]. HIFU is a therapeutic option for patients of advanced age, at low-to-intermediate risk, with a low rate of serious side effects [3]. However, one of the principal concerns in focal therapy is the lack of a validated, noninvasive test for monitoring oncological outcome. Although prostate-specific antigen (PSA) is an established monitoring tool after radical treatment [4], its value after focal treatment is uncertain because of the contribution of the residual prostate. Because biopsy is an invasive procedure with significant morbidity [5], exposed to inaccurate sampling and undergrading [6], the MRI-transrectal ultrasound (TRUS) fusion system with target biopsies is now considered best for the follow-up of focal therapy [7]. MpMRI has the capability to monitor the whole prostate (treated and untreated) and to provide information on changes in characteristics that might suggest residual or recurrent disease. Few studies have evaluated mpMRI for HIFU monitoring [8,9,10,11]. DCE has shown encouraging results for the detection of local recurrence after HIFU [8]. DWI has not been evaluated in this situation to the best of our knowledge. Based on continuous improvement of DWI [12], and recent concerns about repeated injection of gadolinium chelates [13,14,15], our aim was to evaluate the added value of DCE to the combination of T2w and DWI for the detection of PCa recurrences after HIFU, using prostate biopsy as a reference standard.

Patients and methods

Institutional Review Board

The creation of the database comprising mpMRI examinations and corresponding pathological data was declared to the appropriate administrative authority. At our institution, all patients undergoing prostate mpMRI with subsequent biopsy give written consent for the use of their MR and pathological data for research purposes. Every patient signed an informed consent form agreeing to HIFU therapy and follow-up.

Patients

Between 2012 and 2017, all patients who had MRI and consecutive prostate biopsies for long-term post-HIFU surveillance were selected from our prospective database. A search was performed to identify patients who fulfilled the inclusion criteria: (1) histologically proven PCa treated with HIFU (focal or partial HIFU), (2) available post-treatment mpMRI of the prostate, including T2w, DWI (b-value up to 1400 or 2000 s/mm2) and DCE sequences, (3) available post-treatment TRUS-guided biopsy following the mpMRI within 3 months. HIFU treatment was offered to patients with clinically localised PCa who either were assessed to be unsuitable for surgery (e.g. because of advanced age or comorbidity) or declined radical treatment after informed consent. The patients could be classified as low or intermediate risk according to D’Amico’s 2003 [16] risk group categories: T1c, T2a, PSA less than 10 ng/ml, and with a Gleason grade of 6 (3+3) or 7 (3+4, or 4 +3). For the patients with Gleason 6 PCa, treatment was decided in accordance with the patient’s willingness to be treated after receiving information on the procedure and outcomes [17, 18]. These patients had undergone no prior treatment for PCa.

HIFU procedure

All patients underwent HIFU focal or hemi-ablation under transrectal ultrasound guidance using the Ablatherm ® Integrated Imaging™ (Vaulx-en-Velin, France) device. If necessary, a trans-urethral resection was performed before ablation.

MRI procedure

All images were acquired with a 1.5- (n = 27) or 3-T (n = 18) MR imaging system (Aera 1.5T MRI and Skyra 3T MRI, Siemens Healthcare, Erlangen, Germany) using a pelvis phased-array coil with 18 channels. All examinations included non-enhanced thin turbo spin-echo T2-weighted (T2w) anatomical images acquired in two planes (axial and sagittal) on the 1.5-T MRI and a three-dimensional single sequence on the 3-T MRI. Axial DWIs were performed using b-values of 50, 400 and 1400 s/mm2 on the 1.5-T MRI and 50, 400 and 2000 s/mm2 on the 3-T MRI with an ADC map. DCE was obtained using a fat-saturated T1-weighted fast-field gradient echo sequence. After acquisition of the T1 relaxation data, consecutive dynamic sequences were acquired after an intravenous bolus injection of 20 ml of gadoterate meglumine (Dotarem, Guerbet, Roissy Charles de Gaulle, France), with a temporal resolution of 13-16 s. T2w, DWI and DCE were available for all patients. The description of mpMRI technical parameters is summarised in Table 1.

Table 1. ESUR recommendation-compliant description of multiparametric MRI protocol

Image analysis

MRIs were independently read by two radiologists: a senior dedicated radiologist with > 10 years of experience in prostate MRI (R.R.P.) [19, 20] and a junior radiologist with 6 months of experience in prostate imaging (R.L.). Both readers analysed two different imaging data sets with a 4-week interval. The first set (a) was image analysis based on the combination of T2w, DWI and DCE. Four weeks later, set (b) was analysed based on T2w and DWI. Readers were blinded to clinical, laboratory and histological findings. The prostate was divided into six sextants: the right and left base, mid-gland and apex. For each sextant, in each data set, readers evaluated the peripheral zone and the transition zone of the prostate.

The largest lesion was defined as the “index target lesion” [21] and was assigned a Likert score [22] ranging from 1 to 5 regarding the likelihood of the presence of PCa (1, definitely absent; 2, probably absent; 3, indeterminate; 4, probably present; 5, definitely present). A recurrent tumour was defined as a focal nodular area with (1) low signal intensity on T2w images, (2) restricted diffusion on DWI and ADC maps and high signal intensity at a b value ≥ 1400 s/mm2 on DWI and (3) rapid asymmetric enhancement on DCE images. Perfusion data were visually assessed with perfusion maps.

Morphological data were collected from a third consensual reading: prostatic volume calculated using the ellipsoid formula [23], size of the index lesion, involvement of treated edges (inferior/anterior/lateral), existence of a suspicious contralateral lesion, extra-prostatic extension, pelvic lymph nodes and bone lesions.

Standard of Reference

All patients underwent TRUS-guided biopsy under local anaesthesia. The biopsy procedure followed mpMRI, which was initially read by the senior radiologist (first set T2w, DWI and DCE) in clinical routine practice. As the biopsy was not performed by a single operator, different types of approaches were used: (A) standard biopsies (STB) with two systematic samples by sextant, one medial and one lateral; (B) addition of STB and targeted biopsies (TB) using the MRI TRUS fusion system (Koelis®, Grenoble, France); (C) TB without STB. The UroStation™ implements elastic registration to fuse the MRI and 3D TRUS images and allow for guiding and recording core localisations on the 3DTRUS and MRI images [24].

All biopsies were examined by a senior pathologist (E.C.). Biopsies were not assigned a Gleason score because of HIFU tissue modification [25]. The cores were individually inked with different colours to mark the sites from which they were collected. The cassettes were soaked in vials of Bouin solution for 1 s to fix the colours and then preserved in pots with 10% formalin. For each biopsy region, whether systematic or targeted, localisation per lobe and per sextant, involvement of the treated or the untreated lobe, total number/length of cores, the number/length of malignant cores and the maximal length of the malignant core were recorded. A cancer was considered as a clinically significant cancer (CSC) when the core length was > 3 mm [26,27,28].

Statistical analysis

Qualitative variables were described by numbers, percentage and quantitative variables by their medians and interquartile range (IQR). Performances of each method of MRI were assessed by providing Se and Sp for each Likert score threshold with their 95% bootstrap confidence interval (CI). ROCs were plotted and AUROCs were calculated with their 95% bootstrap confidence interval. The gold standard was the biopsy result. AUROCs were compared with the paired bootstrap test. Se and Sp were compared with the exact McNemar test. Percentages of discordance (with bootstrap CI) between two readers were evaluated. A p < 0.05 was considered significant. Statistical analysis was performed with R version 3.4.0.

Results

Population and flow chart

Forty-five males who fulfilled the inclusion criteria were included in the present study (Fig. 1). The median (interquartile range [IQR]) patient age was 72 (68-77) years; the PSA level at inclusion was 4.4 (3.1-8) ng/ml. The median (IQR) time from HIFU therapy to mpMRI was 599 (438-911) days; the interval between mpMRI and biopsy was 47 (29-84) days.

Fig. 1.
figure 1

Flow chart. Exclusion criteria were: post-HIFU procedure early evaluation because of local inflammatory changes (n = 11), no biopsy performed (n = 24) or with a delay > 2 months following the MRI (n = 18) The final study population includes 45 patients.

Description of mpMRI target

The median (IQR) prostate volume was 32 (14-35) ml. Targets were located in the peripheral zone (69%), mid-gland (51%) and apex (42%). For 32 patients (71%) targets were located on the treated edge, mostly in the anterior (51%) and inferior part (38%). A contralateral target for the treated area was identified for seven patients (16%). All descriptive mpMRI data are presented in Table 2.

Table 2. MpMRI descriptive data

Biopsy findings

Recurrent PCa was identified at biopsy in n = 37 (82%) of 45 patients, n = 24 (53%) with CSC. The number and localisation of positive biopsies for the final population are summarised in Table 3. Biopsy findings (total and positive respective number/length of cores) are presented for each biopsy procedure group respectively in Table 4. When no lesion was described on MRI, a median of 12 cores was obtained (two systematic samples by sextant, one medial and one lateral) (group A). When a focal lesion was described on MRI, a median of 4.5 targeted cores was obtained (group C) and a median of 14 cores in the group in which both random and targeted biopsies were performed (group B).

Table 3. Description of biopsy results
Table 4. Respective STB and TB biopsy findings

The median number of positive cores was 2.5 for a median total of 12 cores (A), 3 for a median total of 14 cores (B) and 1 for a median total of 4.5 cores (C). The median maximal length of positive core was 4 mm for a median total of 111 mm sampled core (A), 20 mm for a median total of 127 mm sampled core (B) and 10 mm for a median total of 45 mm sampled core (C). Among the 33 patients who had STB (A) and STB + TB (B), n = 21 patients had positive biopsies in the treated lobe (n = 6 patients with non CSC), 3 on the contralateral (n = 2 patients with non CSC) and n = 3 had bilateral positive biopsies.

Per-patient analysis

Using a Likert score ≥ of 3/5 for the diagnosis of prostate cancer, sensitivity was not significantly different when evaluating T2w + DWI and T2w + DWI + DCE for diagnosis and was respectively 0.97 [0.91; 1] − 0.95 [0.86; 1] [95 % CI bootstrap] for the senior reader and 0.95 [0.86; 1] − 0.97 [0.92; 1] for the junior reader. For the senior reader the specificity was not significantly different between T2w + DWI and T2w + DWI + DCE and was respectively 0.38 [0; 0.75] − 0.38 [0; 0.75]. For the junior reader the specificity was higher for T2w + DWI: 0.38 [0; 0.73] vs. 0.12 [0; 0.41] for T2w + DW I+ DCE.

For both the senior and junior readers, the AUCs of T2 + DW analysis and T2w + DWI + DCE analysis were not significantly different (p values 0.66 for the senior and 0.72 for the junior reader) (Fig. 2).

Fig. 2.
figure 2

ROC curves: per-patient analysis (a) and per lobe analysis (b) AUCs were not significantly different between the T2w + DWI and T2w + DWI + DCE combination at (a) the patient level (p values 0.66 for the senior and 0.72 for the junior reader) and (b) the lobe level (p values 0.18 for the senior and 0.78 for the junior reader). Either the junior or senior reader did not improve his diagnostic performance by the addition of DCE

Fig. 3.
figure 3

Case of a 78-year-old patient. Suspicious target on left mid-gland peripheral zone appearing hyperintense on high b-value Dw imaging (a), with low ADC (b), T2w hypointense (c) and ill-defined early enhancement on DCE imaging (d). Among the 12 STB and 2 TB biopsies performed, the presence of PCa was found on two STB cores of the left mid-gland and apical sectors (respectively 4- and-13 mm malignant length) and one left mid-gland TB core (7-mm malignant length)

Per-lobe analysis

Using a Likert score ≥ 3/5 for the diagnosis of PCa sensitivity was not significantly different for both readers. For T2w + DWI analysis and T2w + DWI + DCE analysis, the sensitivity was respectively 0.97 [0.91; 1] − 0.94 [0.86; 1] [95 CI % bootstrap] for the senior reader and 0.94 [0.84; 1] − 0.97 [0.9; 1] for the junior reader. Specificity for a Likert score ≥ 3 was not different for the senior reader (0.27 [0; 0.57] − 0.27 [0; 0.57]) but was higher for T2w +DWI for the junior reader (0.27 [0; 0.6] − 0.09 [0; 0.29]). Using a Likert score ≥ 4 sensitivity was higher with T2w + DWI + DCE than T2w + DWI for both readers (0. 71 [0.56; 0.86] vs. 0.62 [0.44; 0.79]) for the senior reader, [95 CI % bootstrap] and 0.82 [0.68; 0.94] vs. 0.65 [0.49; 0.79] for the junior reader. Specificity for a Likert score ≥ 4 was higher with T2w + DWI + DCE for the senior reader (0.45 [0.15; 0.78] vs. 0.27 [0; 0.57]) but not for the junior reader (0.36 [0.08; 0.67] vs. 0.55 [0.25; 0.83] − 0.36 [0.08; 0.67]). Results for each Likert group (Likert ≥ 2; Likert ≥ 3; Likert score ≥ 4; Likert = 5) are detailed in Table 5.

Table 5. Sensitivity and specificity for Likert ≥ 3 scored mpMRI target [CI 95% bootstrap] for per-patient analysis

For both the senior and junior reader, AUCs of T2 + DW analysis and T2 + DW + DCE analysis were not significantly different (p values 0.18 for the senior and 0.78 for the junior reader) (Fig. 2).

Inter-observer discordance percentage

Inter-observer agreement was not improved by addition of DCE; it was moderate for T2w + DWI analysis (k = 0.54) as well as T2w + DWI + DCE (k = 0.49). For T2 + DW analysis, the discordance percentage between the two readers was 6.7% [0%; 15,6%], with three patients classified in same Likert category (Likert 1 and 2 versus Likert 3, 4 and 5). For T2 + DW + DCE analysis, the discordance percentage between the two readers was 11.1% [2.2%; 22.2%], with five patients classified in same Likert category (Likert 1 and 2 versus Likert 3, 4 and 5) (see details in the supplementary materials).

Discussion

Since the first clinical application of the technique as a single treatment module for locally confined prostate cancer in 1996 [29], high-intensity focused ultrasonic ablation has been widely used in Europe, starting with whole gland ablation and then including hemi- or partial ablation. Detection of local tumour progression after HIFU is important because it can affect decisions about second-line treatment. The significance of PSA is decreasing [3], although the percentage decrease of PSA from before and after focal therapy may have a role in predicting successful ablation of the index lesion [4]. The role of mpMRI in focal therapy for prostate cancer has increased recently but few studies had evaluated MRI techniques for the prediction of local tumour progression in this context [8,9,10,11]. To the best of our knowledge, no prior studies have compared the added value of DCE to the combination of T2w + DWI in patients with suspicious local recurrence after focal treatment with HIFU.

In our study we made three important methodological choices. First we evaluated a homogeneous population: patients treated only with focal or hemi-HIFU but not global HIFU, with low-grade disease, advanced age and no prior PCa treatment. All of the patients were referred for long-term monitoring after focal or hemi-HIFU. We excluded immediate post-HIFU evaluation of necrosis and early evaluation usually performed at 3 to 6 month after the treatment. The population was therefore a selected population of patients with available prostatic biopsies following their mpMRI. Our findings can only be considered for those in long-term follow-up after HIFU.

Second, we used a standardised imaging (all patients had an mpMRI with T2w, DWI and DCE imaging) and compared one imaging modality with and without additional information (A vs. A+B). We chose an independent reading design [30, 31]. The reading was split into two different sets with a wash-out delay of 4 weeks at least. Indeed, the interpretation of imaging involves a global cognitive approach and each sequence influences the interpretation of the others. Independent readings are more likely to represent the real situation of having only T2w and Dw for the final mpMRI conclusion. The added value of each modality was tested independently for two different readers, a dedicated senior and junior uro-radiologist, to possibly expand findings to experienced and less-experienced radiologists.

Third, we used a Likert scale with different thresholds instead of the PIRADS score, which was designed for naïve prostate glands [22]. The Likert score has been proven to be a significant predictor of malignancy in untreated and treated patients [20, 30, 32].

Like previously published series evaluating radio-recurrent prostate cancer, we found that AUC values obtained with T2w + DWI and T2w + DWI + DCE imaging were not significantly different [30, 33]. However, AUC values only reflect the global performance of the test and sensitivity and specificity trade-offs must also be compared between the two techniques. For diagnosis of PCa recurrence after HIFU, we found that mpMRI showed higher sensitivity for both readers, with a poor specificity. Our concern was the potential impact of omitting DCE imaging. Sensitivity and specificity were not significantly modified by addition of DCE imaging. Using a Likert threshold ≥ 3/5, the performance of the junior reader was similar to that of the senior reader and DCE did not appear to be more beneficial to him [34]. We found a slight improvement of sensitivity with DCE imaging for both readers with a Likert threshold ≥ 4/5. Since post-HIFU mpMRI monitors conservative treatment with the possibility of other additional treatment options (surgery, ERBT, repeated HIFU), reliability in the detection of recurrence with a high sensitivity is needed. Kim et al. [9] found that, for prediction of the local tumour progression of prostate cancer after HIFU, DCE-MRI was more sensitive but less specific than the combination of T2w and DWI. However, the accuracy rates of DCE-MRI and T2-weighted MRI with DWI were similar.

Inter-observer agreement was moderate for both combinations, but was comparable to that of other studies evaluating MRI for the prediction of local prostate cancer recurrence after treatment. There was also no improvement in inter reader-agreement when DCE was used. Kim et al. [9] found kappa values for DCE MRI and T2-weighted MRI respectively of 0.59 and 0.46 for local recurrence after HIFU. For local recurrence after prostate cancer radiation therapy, Donati et al. [35] found 0.55 and 0.49 for T2w + DW and T2w +DW+ DCE imaging respectively. The assessment of tumour detection and localisation and the differentiation of viable tumour from treated tumour are notoriously difficult because of volume reduction, loss of zonal anatomy, and shape and signal changes caused by local treatment [36, 37]. Moreover, unlike for naïve prostates, it could reflect the lack of standardisation of reliable criteria for MRI prediction of local prostate cancer recurrence after treatment.

The majority of the mpMRI targets were located on the middle and apex (93%) (Table 2), as previously described by Rischmann [38] (67% rate of apical location) and all localised on the lateral, anterior and inferior edge of the treated zone. This result might be due to the apical safety margin defined in the treatment protocol. The residual disease at the anterior part of the treated area could also be explained by the technical limitations of with a penetration depth < 30 mm.

In a previous study published by Rouvière et al. [8] the detection rate of targeted biopsies was better than for random biopsies in a population for the detection of prostate recurrence after HIFU. In our study, targeted biopsies using TRUS-MR fusion were significantly more likely to contain cancer than STB. TBs were able to diagnose more men with a significant PCa and found more CSCs than non-targeted biopsies. Moreover, PCa undiagnosed by targeted biopsies was usually smaller and less significant than those undiagnosed by random biopsies. This strongly supports that prostate MRI may enable a reduction in the number of cores performed during follow-up biopsies by targeting positive areas, or a reduction in the need for post-treatment biopsy if negative, although this latter application remains speculative.

Our study has some limitations. First, it was a retrospective study and our finding needs to be validated on a larger population. Second, the number of patients was small. HIFU is not widespread in many institutions. Biopsy was not available for all the patients who underwent HIFU at our institution, mostly because of refusal to perform biopsy when MRI did not detect signs of recurrence. Third, salvage prostatectomy is technically difficult and carries a high morbidity rate in this population; therefore we could not use prostatectomy specimens as the reference standard. Finally, images were acquired with 1.5- and 3-T systems, which could affect the performance of this post-treatment context so that we may be underestimating the potential performance of modern MRI with very high b values.

These are encouraging preliminary results, showing the possibility to shorten and simplify everyday practice with a T2w + DWI mpMRI protocol, limiting DCE and so the risk and cost of gadolinium chelate injection.