Introduction

Prostate cancer is the most common cancer in Western men; among malignancies its incidence is the highest and its mortality is third highest [1]. As the range of prostate cancer varies from indolent to highly aggressive, accurate stratification of aggressiveness is essential. Treatment selection is based on serum prostate-specific antigen (PSA) level, digital rectal examination (DRE), and histopathological findings of transrectal ultrasonography (TRUS)-guided biopsy. On the basis of the likelihood of tumor spread and recurrence, localized prostate cancer is stratified as low, intermediate, and high-risk [2]. The Gleason score (GS) is the most commonly accepted and widely used value for evaluating the aggressiveness of prostate cancer. ESUR prostate MR guidelines 2012 [2] classify GS = 6 as low-risk and GS > 6 as intermediate-to-high risk prostate cancer. To avoid serious side effects, low-risk prostate cancer must be diagnosed correctly to prevent unnecessary biopsies or such whole-gland treatments as radical prostatectomy and radiation therapy [3].

Diffusion-weighted imaging (DWI) is a functional imaging technique that yields qualitative and quantitative information on water molecules in tissues. The apparent diffusion coefficient (ADC) is lower for malignancies with greater cell density and space. Prostate MRI studies have shown that the ADC correlates negatively with the GS [4, 5]. However, comparison of ADC values obtained in different studies is difficult, because they vary with the equipment used, the scanning procedures, the b values, and the age and body temperature of patients [613]. The normalized ADC (nADC), calculated by use of the equation ADC (lesion)/ADC (reference), can be used to overcome potential limitations. Determination of the nADC may be necessary because normalization reduces ADC variability attributable to different patient and/or technical factors; it has been useful in evaluation of the brain, liver, pancreas, and bone [1417]. With regard to the pelvic lesion, it has been reported that the nADC is superior to the ADC for estimating the histological grade of bladder carcinoma [18]. The nADC can therefore be useful for reducing the variability of ADC for estimating prostate cancer aggressiveness.

As far as we are aware, however, the clinical applicability of the nADC to prostate cancer has not been sufficiently investigated. The objective of this study was to compare the efficacy of the ADC and the nADC in differentiating between GS = 6 and GS > 6 prostate cancer and to identify the optimum reference for nADC calculations.

Materials and methods

This prospective study received institutional review board approval; informed consent was obtained from all patients prior to their participation.

Patients

Our study included 70 consecutive patients who underwent radical prostatectomy for pathologically confirmed prostate cancer at our institution between January 2013 and February 2014; all had undergone 3T prostate MRI (including DWI) before prostatectomy and all fulfilled our inclusion criteria. We excluded 12 patients who had received hormone treatment before prostatectomy. Consequently, 58 patients (ranging in age from 58 to 81 years, mean 70.7 ± 6.4 years) were included. All patients underwent TRUS-guided biopsy before prostatectomy and 48 underwent biopsy after MRI. The other 10 patients underwent TRUS-guided biopsy before MRI, however the interval between biopsy and MRI was at least 6 weeks to prevent biopsy-related hemorrhage [19]. The interval between MRI study and radical prostatectomy was 45.9 ± 35.0 days (range 9–215 days, median 35 days). The preoperative PSA level ranged from 4.4 to 61.1 ng/ml (mean 10.9 ± 10.5 ng/ml, median 7.6 ng/ml).

MRI

For the MRI studies on our 3T scanner (Ingenia; Philips Medical Systems) we used a 32-channel cardiac SENSE coil. Although patients were not fasting they reduced their diet to approximately 1/3 of normal. If the images were poor because of peristalsis we intravenously injected 20 mg hyoscinebutylbromide (Buscopan; Boehringer, Ingelheim, Germany) and re-started the examination immediately. Axial T2-weighted images, coronal T2-weighted images with fat suppression (FS), axial DWI with two different b values (0 and 1000 s/mm2), and ADC maps using the two b values were obtained. The imaging settings for each sequence are shown in Table 1. We affixed a plastic bottle containing 20 ml saline to each patient’s groin for ADC interpretation. In the examination room we kept the saline bottle at 23 °C to minimize temperature changes because the ADC depends on body temperature [20]. To prevent contamination, disposable bottles were used.

Table 1 Imaging settings used

Pathology

All patients underwent radical prostatectomy after MRI. Each prostatectomy specimen was step-sectioned into 4-mm slices. In routine pathology studies cancer foci were outlined in ink on step-section slices of the prostate. The pathology results were reported by use of the GS system. For radiology–pathology correlation studies, a region was recorded as positive if it contained a tumor with a cross-sectional area greater than 4 mm in diameter (equivalent to a 0.13 cm2 area) [2123]. Markers used for alignment of MR images with histologic slices included the morphologic features of the peripheral zone, transitional zone, apex, midgland, and base of the prostate, and cysts, calcifications, the verumontanum, and the urethra [24, 25]. Each patient’s GS was recorded by a board-certified pathologist with 34 years of experience in interpreting prostate pathology.

Image interpretation

A board-certified radiologist with seven years of experience interpreting prostate MR images matched the step-sectioned pathologic slices with the corresponding T2-weighted images and ADC maps. Regions of interest (ROIs), as large as possible, were drawn manually in the center of the visible pathologically-positive area with the lowest signal. If a pathologic map and a corresponding ADC map contained more than one tumor, the dominant lesion corresponding each patient’s GS with the lowest mean ADC value was used. In the selection of the ROIs on the ADC maps, the tumor margin, prostate capsule, neurovascular bundles, and urethra were excluded. The mean ADC values were automatically calculated on a picture archiving and communication systems (PACS) workstation (EV Insight 2.10.8.81; PSP, Tokyo, Japan): the actual ROI area ranged from 18 to 251 mm2. To select the optimum reference for the nADC calculations, ADC values were also obtained from the obturator internus muscle, urine in the bladder lumen, and the 20-ml saline bottle placed on the groin. Reference ROIs were at least 20 mm2; they were placed in areas as homogenous as possible and visible artifacts were carefully excluded (Fig. 1). The nADC was calculated by use of the equation ADC (prostate cancer)/ADC (reference). The nADC from the references, i.e. the obturator internus muscle, urine in the bladder, and the saline bottle were identified as nADCm, nADCu, and nADCs, respectively. The best cutoff ADC and nADC for identification of prostate cancer with a GS = 6 from GS > 6 were calculated. To investigate the effect of cancer location, ROC analysis was also performed for limited cases with peripheral zone prostate cancer.

Fig. 1
figure 1

ADC measurement of prostate cancer: axial T2WI (a) and axial ADC map at the same slice level (b), at the level of the bladder (c), and at the level of the groin (d). The white arrow indicates prostate cancer in the left peripheral zone (b). ROIs (yellow circles) were placed on the obturator internus muscles (b), urine in the bladder (c), and the 20-ml saline bottle placed on the groin (d)

Statistical analysis

Student’s t test was used to compare the ADC and nADC values for GS = 6 and GS > 6 prostate cancer. The ADC and the nADC values obtained by use of the three references were compared with the GS of the radical prostatectomy specimens; receiver operating characteristic (ROC) analysis was performed to differentiate between GS = 6 and GS > 6 prostate cancer. The ROC curves were drawn by use of computer software (MedCalca, version 9.3.1; MedCalc Software, Mariakerke, Belgium) and the areas under the ROC curves (AUC) were compared. Statistical significance was defined as p < 0.05.

Results

On the basis of our pathology findings for the total of 58 prostatectomies, 7 patients (12 %) had GS = 6 (3 + 3) prostate cancer, and the other 51 (88 %) had GS > 6 prostate cancer: 3 + 4, n = 10; 4 + 3, n = 31; 4 + 4, n = 6; 4 + 5, n = 4. For the dominant lesion of prostate cancer, 36 were peripheral zone and 21 were transitional zone. T stages for all patients were: T2a, n = 17; T2b, n = 2; T2c, n = 34; T3b, n = 5. ADC and nADC values for GS = 6 cancers were higher than those for GS > 6 cancers. Differences among ADC, nADCu, and nADCs values were significant (Table 2).

Table 2 Comparison of ADC and nADC values for prostate cancer

ROC analysis showed that the AUC for ADC, nADCm, nADCu, and nADCs were 0.711, 0.703, 0.762 and 0.846, respectively (Fig. 2). The AUC for nADCs was significantly larger than for ADC for differentiation between GS = 6 and GS > 6 prostate cancer (p = 0.024). The best cutoff nADCs for identification of prostate cancer with GS = 6 was 0.37, with 74.5 % sensitivity and 85.7 % specificity, whereas the best cutoff ADC was 1.1 with 82.4 % sensitivity and 57.1 % specificity. The scatter diagrams for ADC and nADC are shown in Fig. 3.

Fig. 2
figure 2

Results from ROC analysis for differentiation between GS = 6 and GS > 6 prostate cancer

Fig. 3
figure 3

Scatter diagrams for differentiation between GS = 6 and GS > 6 prostate cancer; a ADC, b nADCm, c nADCu, and d nADCs

With regard to ROC analysis for peripheral zone prostate cancer (n = 36 including six GS = 6 prostate cancer), AUC for ADC, nADCm, nADCu, and nADCs were 0.775, 0.678, 0.778 and 0.856, respectively. The AUC excluding nADCm were better than for the ROC analysis including all 58 cases; there were, however, no significant differences between ADC and each nADC for differentiating between GS = 6 and GS > 6 prostate cancer.

Discussion

Although ADC values correlate negatively with tumor aggressiveness [4, 5], their low comparability among studies is a serious problem when the ADC value is used to evaluate cancer aggressiveness. The lack of standardized DWI procedures reduces the reliability of ADC comparisons among studies because the equipment, vendors, coils, pulse sequences, scanner settings, and magnetic susceptibility artifacts are different. These differences can result in inter-study ADC differences of 5–15 % [6, 13, 2628]. We report that the AUC for nADCu and nADCs was higher than for ADC and that the AUC for nADCs was significantly higher than that for ADC for differentiation between GS = 6 and GSA > 6 prostate cancer. This observation suggests that the nADC is superior to the ADC for estimating the aggressiveness of prostate cancer and that it can be useful for the selection of appropriate treatments.

We attribute our finding that use of saline as reference yielded the best nADC to the greater homogeneity of saline over urine. Wang et al. [18] demonstrated that the nADC using urine in the bladder lumen as a reference is best for estimating the histological grade of vesical urothelial carcinoma. Although they used different materials and methods, their results were consistent with ours with regard to the superiority of the nADC. We suggest that using reference materials similar to water can aid identification of the aggressiveness of pelvic cancers. In contrast, the AUC of the nADCm was equivalent to that of the ADC and the AUC of the nADCu and nADCs was better than of the ADC, possibly because of individual variations in the amount of muscle. Although the results in a previous report [18] were different from ours with regard to the utility of the nADC with muscles as reference, they were consistent with our finding that the nADC using urine in the bladder yielded a better AUC. Nonetheless, we consider urine or saline to be a better reference for the nADC than muscles, because inter-patient variations may be smaller and we believe it is the most important factor in the greater ability observed for nADC for discrimination between the aggressiveness of prostate cancer. Although saline was the best reference for the nADC in our study, in the clinical setting, urine in the bladder may be an alternative reference.

Interpretation of ADC for patients with prostate cancer undergoing MRI has been discussed. Litjens et al. [29] reported significant inter-patient variation (1.2–2.0 × 10−3 mm2/s) of peripheral zone ADCs which affected predictability of the aggressiveness of prostate cancer. According to Jacobs et al. [20], in-vivo (37 °C) ADC measurements in the same prostate region of the same individual were significantly (36–48 %) higher than in-vitro (23 °C) measurements. To minimize such ADC variations, all of our patients were scanned on the same 3T instrument using identical coils, pulse sequences, and scanner settings, and the temperature of the saline bottle was maintained at 23 °C. In previous studies with a 3T scanner that used ADC of b = 0 and b = 1000 s/mm2, ADC values for GS = 6 prostate cancer varied from 1.19 to 1.25 × 10−3 mm2/s [6, 30, 31]. Our values were slightly lower (1.14 × 10−3 mm2/s), suggesting that there are variations in the ADC of prostate cancer even when the same MRI settings and b values are used. Lim et al. [32] reported that adding an ADC map to T2-weighted images can improve the diagnostic performance of MRI in the detection of prostate cancer. They emphasized that, as in brain imaging, the nADC should be used to reduce ADC variability on prostate MRI scans. We support their suggestion and our findings provide a clue for evaluation of prostate cancer aggressiveness using the nADC. Nonetheless the clinical application of nADC has not yet been sufficiently investigated; further multi-center prospective studies should therefore be conducted to determine whether nADC really can equalize inter-study ADC variability.

Our study has some limitations:

  • First, there is the issue of selection bias, because all of our patients underwent radical prostatectomy, thereby excluding patients with more aggressive prostate cancer.

  • Second, the temperature of the saline bottle placed on the patients’ groin may rise, because of their body temperature, during the MRI study. Nevertheless, because saline is more homogenous than urine in the bladder, its use as a reference yields more reliable results.

  • Third, we did not investigate cancer detection performance in this study. However, as previous report [33] showed good performance of cancer detection using T2WI and DWI at 3T (AUC = 0.78–0.79). Moreover the authors demonstrated that DWI correlated significantly with prostate cancer aggressiveness. Because our results are in agreement with theirs, nADC may provide better clinical information about tumor aggressiveness and cancer detection.

  • Fourth, there is an inherent limitation in correlating imaging and histologic findings on a section-by-section basis, because the angle of the histologic slices and the MRI scans may be different and prostate tissue tends to shrinks during fixing.

In conclusion, the nADC is superior to the ADC for estimating the aggressiveness of prostate cancer and is a noninvasive technique that aids in the selection of appropriate treatments for patients with prostate cancer.