Introduction

Prostate cancer is the most common newly diagnosed cancer among men in the western world and the second-most common disease worldwide [1]. Not only does multiparametric (mp)-MRI prior to biopsy may help in avoiding unnecessary biopsies in 30% of patients, but it may also reduce the rate of over diagnosis and overtreatment due to a detection bias of MRI toward significant disease [2]. Diffusion-weighted MRI (DW-MRI) has emerged as a key MRI technique for the detection of significant prostate cancer [3,4,5]. High b value images are particularly useful for diffusion weighting as they provide better tissue contrast and less T2 shine-through effect [6, 7] and thereby help differentiate benign from malignant prostate lesions. However, their acquisition contributes significantly to the total scan time in mp-MRI [8,9,10], and they also suffer from the decreased signal-to-noise ratio (SNR), susceptibility effects, and distortions [6, 11, 12]. Extrapolated calculated high b value images is a strategy to avoid these effects and to reduce the total scan time [8, 13,14,15,16,17].

There are new technical advances to address the challenges of high b value DWI, and complex-averaging is one of these. Due to inherent low SNR in particular at high b values, diffusion images with same contrast are acquired multiple times and then averaged. As DWI is also very sensitive to phase variations, this is conventionally done by taking the magnitude first. This approach is very robust, but also changes the noise distribution which is no longer centric. Consequently, the average has a noise bias that appears as a haze-like background in low signal regions. This can be circumvented by an adaptive combination of complex-valued images (complex-averaging) [18]. With this approach, local phase variations between different images are implicitly aligned, and therefore, the combined image no longer has a noise bias. This improves image appearance and SNR in low signal regions. Some of the prior studies [8, 13,14,15,16,17] have evaluated the performance of acquired and calculated high b value images with variable results, but complex-averaging is a relatively new method which can be used with acquired and calculated datasets and has not been previously tested. The purpose of this study was to assess the impact of complex-averaging on image quality and diagnostic accuracy of acquired and calculated high b value (aHBV, cHBV) images in patients with prostate cancer.

Materials and methods

Patient cohort and study design

Institutional Review Board (IRB) and Health Insurance Portability and Accountability Act (HIPAA) approval were obtained, and the requirement for informed consent was waived by our hospital IRB. In this retrospective study, we included eighty-four consecutive patients who underwent multiparametric prostate MRI on a MAGNETOM Skyra 3T system (Siemens Healthcare, Erlangen, Germany) using 18-channel body matrix and 32-channel spine matrix coils between January 2017 and December 2017. These patients underwent prostate MRI because of elevated prostate-specific antigen level and/or suspicion of prostate cancer at digital rectal examination (before biopsy).

Multiparametric MR protocol and parameters

At our institution, mp-MRI of the prostate typically includes T2-weighted images in 3 orientations, axial diffusion-weighted imaging (DWI) with multiple b values of b = 50 s/mm2 and 900 s/mm2 from which an ADC map and a calculated high b value image is derived, a separately acquired, axial high b value image with b = 2000 s/mm2 and axial dynamic contrast-enhanced (DCE) imaging acquired over 2 min. DWI was performed using a prototype sequence that essentially differs from the product sequence by the outlined scanner integrated postprocessing. Sequence parameters of the diffusion protocols can be found in Table 1. All scans are acquired with a slice thickness of 3 mm.

Table 1 Diffusion sequence parameters

Calculated diffusion-weighted images

As discussed above, the high b value can be derived/calculated from the acquired lower b value images in order to improve SNR and avoid susceptibility effects and distortions [6, 8, 11,12,13]. Fitting the acquired b value images to a mono-exponential signal model for determining the T2-weighted magnetization M0 and the apparent diffusion coefficient (ADC) allows extrapolating to diffusion-weighted images at arbitrary b values. Of interest are very high b value images and regions with restricted diffusion, such as tumors, that will stand out and allow an easy visual detection. For prostate cancer, the contrast of b = 2000 s/mm2 is often chosen, and here derived two lower b values (b = 50, 900 s/mm2) using the above-mentioned mono-exponential model.

The approach of cHBV images works well in regions with well determined M0, but artificially enhances noise in regions with low M0 and low ADC values. This may be irrelevant for the target region but leads to irritating background in the extrapolated high b value images. To address this issue in a manner that does not modify the extrapolated b values in actual region of interest, but still addresses the issue of artificial signal enhancement, the following steps were performed: (1) A noise level was determined based on all pixels with a determined ADC smaller than 300 µm2/s. (2) For pixel with a T2-weighted magnetization below the determined noise level, the ADC value used for extrapolating diffusion-weighted images above the highest acquired b value was set to a minimal ADC value that gradually increased from 0 to 1000 µm2/s for M0 decreasing from the noise level to zero.

Complex-averaging of diffusion-weighted images

With the phase of complex-valued diffusion-weighted images being very sensitive to confounding factors such as bulk motion, it is established to perform pixelwise averaging of repeated measurements with identical diffusion encoding based on magnitude values. The approach is robust but comes at the cost that noise no longer cancels out completely in the averaged images and leads to a haze-like background and resultant low SNR. The issue can be circumvented by an adaptive combination of complex-valued images, like the approach taken for the combination of received coil images [18]. Here, a combination of images in local patches is performed without explicitly performing a phase correction of the single images. Referring to Ref. [18] for more technical details, the method is suited for the combination of images that are expected to have the same pixelwise magnitude but are subject to individual signal variations that are of lower resolution and that can be complex-valued. In case of phased-array coils as discussed in [18], these variations correspond to coil sensitivity maps. In the present work, these are the phase variations of the individual averages, e.g., due to bulk motion. The approach assumes that the individual phase variations can be assumed as constant in local patches of the images. In this work, the patch size was chosen to be 7 × 7 pixels. An important advantage of this approach compared to conventional phase correction is that the relative weight of each image in the combination is additionally determined locally for each patch.

Image evaluation

Qualitative evaluation of the calculated and acquired high b value DWI images datasets for both complex-averaged and regular datasets was performed by two radiologists (8 and 10 years of experience) on a PACS workstation (AGFA Impax; AGFA Technical Imaging Systems, Ridgefield Park, NJ) in separate reading sessions with intervals of 3 weeks. The radiologists assessed the image quality (IQ), and level of confidence (LOC). A 3-point scale was used for image quality parameters including: noise, artifact, and resolution (3-good quality for diagnostic interpretation, 2-moderate quality for diagnostic interpretation, 1-poor quality for diagnostic interpretation). Level of confidence was rated on a scale of 1 to 5 (1 ≤ 25%, 2 = 25–50%, 3 = 50–75%, 4 = 75–90%, 5 ≥ 90%) [19]. They also performed PIRADS score in consensus for detected lesions in each session while they were blinded to the clinical details. DWI sequences were of key importance for the final PI-RADS score if the tumor was in the peripheral zone and T2W images in the transitional zone [20, 21]. For all detected lesions, ADC measurement (free-hand ROI drawn over the entire lesion) was performed by a single observer from both magnitude-averaged and complex-averaged datasets in two separate reading sessions but on similar workstations.

For the reading session, MR scans were organized in four sets of images: Image set 1 to set 4.

Image set 1 consisted of (a) axial T1-weighted and axial, sagittal, and coronal T2-weighted images; (b) axial acquired magnitude-averaged high b value images (including the corresponding ADC maps).

Image set 2 consisted of (a) axial T1-weighted and axial, sagittal, and coronal T2-weighted images; (b) axial acquired complex-averaged high b value images (including the corresponding ADC maps).

Image set 3 consisted of (a) axial T1-weighted and axial, sagittal, and coronal T2-weighted images; (b) axial calculated magnitude-averaged high b value images (including the corresponding ADC maps).

Image set 4 consisted of (a) axial T1-weighted and axial, sagittal, and coronal T2-weighted images; (b) axial calculated complex-averaged high b value images (including the corresponding ADC maps).

The electronic medical records of every patient were then reviewed to document patient demographics, PSA level, biopsy result, and clinical diagnosis. For lesions with histological confirmation (n = 46), the correlation was performed between consensus PIRADS and Gleason scores. A Gleason score ≥ 7 were considered as the presence of significant prostate cancer. Diagnostic performances regarding sensitivity, specificity, positive predictive value, negative predictive value, and accuracy were calculated for all the image datasets. In detail, regarding the accuracy for the detection of significant prostate cancer, PI-RADS v2 scores of 1–2–3 were considered negative, and PI-RADS v2 scores of 4–5 were considered positive. To make sure that the histopathology reflected the reported lesion on MRI, the lesion site and biopsy site descriptors (right vs left; base vs. mid-gland vs. apex) were matched.

Statistical analysis

Microsoft Excel 2007 (Microsoft Corporation) and MedCalc software (Version 18.2.1, Ostend, Belgium) were used for statistical analyses. Group statistics were presented as mean ± SD, numbers, and percentages. The comparisons were made using Wilcoxon test and McNemar’s test. P values less than 0.05 were considered as statistically significant.

Results

Patient cohort

The final patient cohort included 84 patients between January 2017 and December 2017. The average age in this patient cohort was 64 ± 8 years (range 45–84 year). The mean PSA was 10.35 ± 15.53 ng/ml. Prostate biopsy report or prostatectomy operation note was available for 46 patients. In 11 cases, prostatectomy operation note was used as the reference, and in 35 cases, we used standard 12-core biopsy/guided biopsy report to determine the Gleason score. The mean scan time for the mp-MRIs was 28 min 31 s (± 4 min 26 s) and for b = 2000 s/mm2 sequence was 6-min 30 s (± 1 min 16 s). The aHBV protocol accounted for approximately 25% of total scan time.

Image evaluation

Qualitative analysis

Complex-averaging improved Image Quality of the acquired and calculated high b value DWI for both readers (P < 0.0001) (Figs. 1, 2). Mean image quality of the complex-averaged acquired high b value DWI for reader A and B was 2.67 and 2.69, respectively, which was significantly higher than conventional magnitude-averaged acquired high b value DWI for both readers (1.81 and 2.10, respectively) (P < 0.0001). Mean image quality of the Complex-averaged calculated high b value DWI for reader A and B was 2.70 and 2.64, respectively which was significantly higher than conventional magnitude-averaged calculated high b value DWI for both readers (1.92 and 2.25, respectively) (P < 0.0001) (Fig. 1; Table 2). Complex-averaging improved LOC of the aHBV for both readers (P < 0.0001, P < 0.05). Mean LOC of the complex-averaged acquired high b value DWI for readers A and B was 4.10 and 4.37, respectively which was significantly higher than conventional magnitude-averaged acquired high b value DWI for both readers (3.01 [P < 0.0001] and 4.06 [P < 0.05], respectively). Mean LOC of the complex-averaged calculated high b value DWI for reader A was significantly higher than conventional magnitude-averaged calculated high b value DWI (4.24 vs. 3.32, [P < 0.0001]). Mean LOC of the complex-averaged calculated high b value DWI for reader B was higher but not statistically significant than conventional magnitude-averaged calculated high b value DWI (4.35 vs. 4.10, [P = 0.058]) (Table 2).

Fig. 1
figure 1

A 74-year-old man with clinically significant prostate cancer (Gleason score 7 from prostatectomy, PSA = 9.83 ng/ml). Acquired magnitude-averaged b = 2000 s/mm2 images a are prone to susceptibility effects and distortion (from rectal gas in this case; long arrow). Calculated b = 2000 s/mm2 images b show lesser image distortion and susceptibility effects. The combination of the calculated high b value and complex-averaging c further improves the image quality

Fig. 2
figure 2

A 60-year-old man with Gleason score 6 prostate lesion from prostatectomy, and PSA = 3.36 ng/ml. Magnitude-averaged b = 2000 s/mm2 images a demonstrate a background signal build up resulting in a haze and lower signal-to-noise ratio. This can be resolved using complex-averaged images b, which have relatively higher signal-to-noise ratio. The resolution of the background haze also results in better conspicuity of focal lesions with diffusion restriction (b)

Table 2 Comparing image quality and level of confidence between magnitude-averaged and complex-averaged high b value images

The IQ values of the cHBV were not significantly different than aHBV images for both readers (Fig. 3). The LOC of the conventional magnitude-averaged cHBV was significantly higher than conventional magnitude-averaged aHBV images for reader A (3.32 vs. 3.01, P = 0.02) but was not significant for reader B (4.1 vs. 4.06, P = 0.38). The LOC of the complex-averaged cHBV images were not significantly different than Complex-averaged aHBV images for both readers (P = 0.2 and 0.43) (Table 3).

Fig. 3
figure 3

A 81-year-old man with clinically significant prostate cancer (Gleason score 7 from biopsy, PSA = 17.85 ng/ml). Axial ADC (a) and T2-weighted image (b) demonstrate a focal lesion of low T2 signal and restricted diffusion. There is high signal within this lesion on acquired magnitude-averaged b = 2000 s/mm2 image (c). The calculated b = 2000 s/mm2 image (d) demonstrates comparable image quality for diagnostic evaluation. In addition, it provides better anatomic definition of the normal peripheral zone (seen as markedly hypointense signal). Better anatomic definition and improved lesion to background signal ratio might increase the diagnostic confidence for lesion detection

Table 3 Comparing image qualities and levels of confidence between acquired and calculated high b value images

Complex-averaging did not impact ADC values of the prostate lesions compared to magnitude-averaging (P = 0.08). Mean of the ADC values of the lesions was 934 ± 241 × 10−6 mm2/s (range 531–1433 × 10−6 mm2/s) from complex-averaged ADC maps, which was not significantly different from the mean of the ADC values of the lesions from conventional magnitude-averaged ADC maps (920 ± 241 × 10−6 mm2/s; range 501–1457 × 10−6 mm2/s) (P = 0.08).

Lesion visibility and PIRADS score

The combination of complex-averaging and cHBV provided the highest diagnostic accuracy for detection of the clinically significant lesion (Gleason score ≥ 7) (Table 4). The specificity of calculated HBV was higher than the acquired HBV in detecting the clinically significant prostate lesion (88% vs. 80%; for both magnitude- and complex-averaged). The sensitivity of complex-averaged DW images was higher than that of the conventional magnitude-averaged DW images in both the acquired and calculated DWI (68.4% vs. 63.1% in calculated DWI and 73.6% vs. 63.1% in acquired DWI). The accuracy of complex-averaged DW images was higher than the conventional magnitude-averaged DW images in both acquired and calculated DWI (79.5% vs. 77.2% in calculated DWI and 77.2% vs. 72.7% in acquired DWI) (Table 4).

Table 4 Sensitivity, specificity, and accuracy for different datasets

Discussion

We are presenting results with the use of a novel complex-averaging method and its impact on image quality and diagnostic accuracy of high b value images in diffusion-weighted MRI of prostate cancer. Our findings suggest that complex-averaging improves the image quality and results in higher level of confidence without having any effect on the quantitative information in the ADC maps. In addition, complex-averaging provides comparable diagnostic accuracy of DWI for the clinically significant prostate cancer. We also assessed the performance and quality of calculated high b value (cHBV) images and found that these images are of noninferior image quality with comparable accuracy to acquired high b value images (aHBV) in detecting clinically significant prostate cancer. cHBV also provides comparable PIRADS score for histologically confirmed prostate cancer and may help in reducing the imaging time for prostate MRI (approx. 25% reduction in scan time).

Several prior studies have evaluated the cHBV images in prostate cancer with consistent results in terms of image quality and mixed results for the diagnostic accuracy [15, 17, 22]. Rosenkrantz et al. reported a superior image quality of cHBV compared to aHBV images with less image distortion but did not find any difference in the diagnostic accuracy between the two techniques [13]. Blackledge et al. reported that computed DW MR imaging in the body allows higher-b value images to be obtained with a good SNR and may improve disease detection [15]. Bittencourt et al. reported that a combination of a high b value extrapolation and sFOV may lead to increased diagnostic accuracy of DWI without an increase of acquisition time [22]. Grant et al., however, report that extrapolation using the IVIM model has equivalent lesion detection and similar image quality to acquired DWI at b = 1000 s/mm2 but is slightly inferior to acquired DWI at b = 2000 s/mm2.

Our observations with the use of complex-averaging in DWI are unique and have never been studied before. As discussed in the Methods section, the magnitude-averaged DWI have a bias toward higher signal particularly in the image center, resulting in a haze-like background signal build-up. In adaptive combination of complex-valued images [18], a combination of images in local patches is performed without explicitly performing a phase correction of the single images. Thus, the averaged images are obtained that do not show a noise bias in the form of a background haze and which appear sharper. This may have resulted in the improvements of the image quality of aHBV images and potentially the increase in the diagnostic accuracy of these images in this study. We used complex-averaging in combination with not only actual but also calculated HBV DW images. Both complex-averaging and calculated HBV images independently contributed in improvement in image quality and combination of the complex-averaging and calculated high b value resulted in highest image quality (Fig. 4) and diagnostic accuracy. Complex-averaging and cHBV images address different caveats of b = 2000 s/mm2 images, such as background haze inherent to averaging and distortions and susceptibility effects. This might have resulted in synergistic effects that lead to highest quality and accuracy of complex-averaged cHBV images.

Fig. 4
figure 4

Axial T2-weighted (a) and ADC images (b) demonstrate a focus of carcinoma with restricted diffusion. Acquired magnitude-averaged b = 2000 s/mm2 image (c) demonstrates corresponding hyperintense signal. The application of complex-averaging results in improved signal-to-noise ratio and lesion conspicuity (d). Similarly, calculated high b value image results in better signal-to-noise ratio and lesion visibility (e), but, the combination of complex-averaging with calculated high b value (f) results in best image quality out of all 4 image sets. The PIRADS score assigned to this case was 3 based on the magnitude-averaged image (c) as it is mildly hyperintense. The score changed to 4 on complex-averaged calculated high b value image (f) where it appears markedly hyperintense. A 59-year-old man with clinically significant prostate cancer (Gleason score 7 from prostatectomy, PSA = 6.51 ng/ml)

Limitations

This retrospective, single-center study has several limitations. Biopsy was the most common reference standard instead of prostatectomy which is susceptible to sampling errors. At our institution, 12 core biopsies are performed, and we subsequently matched the histopathology to the reported lesion on MRI through site descriptors (right vs left; base vs. mid-gland vs. apex). However, this does not completely eliminate bias. Also, decision to biopsy was made based on the results of subjective mp-MRI reads, and affecting actual diagnostic accuracy, which may be better than those reported.

Conclusion

The complex-averaging provides better IQ and LOC for DW images in prostate cancer without significant impact on ADC values. The cHBV images have the potential to replace aHBV and help in reducing the imaging time for prostate MRI. The complex-averaging when used in combination with cHBV may increase the accuracy for detection of the prostate lesions.