Introduction

The incidence of incidentally discovered localized renal masses has risen partly due to increased utilization of ultrasound and cross-sectional imaging [1]. Benign lesions are found at partial or total nephrectomy in up to 11% to 17.9% of cases misdiagnosed as renal cancer on preoperative imaging according to the surgical literature [2,3,4,5]. In light of the potential for iatrogenic chronic kidney disease, partial nephrectomy rather than radical nephrectomy is recommended for managing localized renal masses where feasible [6]. Recently, active surveillance has emerged as an initial management option to elderly and comorbid patients with incidentally detected small renal masses [7]. Differences in prognosis and evolving treatment paradigms based on tumor aggressiveness emphasize the need for noninvasive tools to accurately differentiate benign from malignant renal tumors.

Contrast-enhanced computed tomography (CT) and conventional magnetic resonance imaging (MRI) are routinely used in the evaluation of renal lesions; however, they still have some limitations such as exposure to ionizing radiation and potential risk of renal function impairment. Diffusion-weighted imaging (DWI), free of ionizing radiation and extraneous contrast material, could obtain information about the microscopic motion of water protons, which was indicative of changes at the cellular level [8]. DWI has been widely used to differentiate benign from malignant renal masses, grading and subtyping of RCCs [9, 10]. However, there is always concern that the apparent diffusion coefficient (ADC) obtained from conventional monoexponential DWI could not accurately reflect the real diffusivity, because of the influence of microcirculation [11, 12]. Some previous researchers have suggested that intravoxel incoherent motion (IVIM) and diffusion kurtosis imaging (DKI) might provide more accurate information about water diffusion [13, 14]. The biexponential IVIM model, proposed by Le Bihan et al [13], expresses the diffusion-weighted signal as the sum of a pure diffusion fraction and a perfusion-dominated “pseudo-diffusion” fraction. DKI is a non-Gaussian diffusion-weighted model first described by Jensen et al [15] in 2005, which is believed to better reflect the microstructural complexity of biologic tissue. Since DWI with different models and DKI may demonstrate different aspects of tissue properties, it should be valuable to explore and compare their roles in the differentiation of benign and malignant renal tumors. A number of investigations have compared the diagnostic value of IVIM or DKI over standard DWI for assessing renal tumors [16,17,18,19,20,21]. However, most of these studies have generally used small sample sizes. There are a growing number of studies using a hybrid IVIM-DKI model for tumor evaluation in recent years [22,23,24,25]. To our knowledge, no comparison of these different diffusion parameters in the discrimination of benign and malignant renal tumors in a large cohort has been investigated so far. The purpose of this study was to quantitatively compare the potential of various diffusion parameters obtained from DWI, IVIM, and DKI in the differentiation of benign and malignant renal tumors.

Materials and methods

Patients

Our institutional review board approved this study and waived the requirement to obtain informed consent. Between December 2014 and July 2018, 305 consecutive patients who were known to have, or were suspected of having renal tumors based on previous CT or ultrasonography examinations, underwent a preoperative MR imaging examination including both IVIM and DKI sequences in Zhongshan Hospital, Fudan University. The inclusion criteria were as follows: (a) primary renal lesions without prior treatment and (b) nephrectomy within 1 month after the MR imaging examination. Exclusion criteria included the following: (a) lesions without histopathological confirmation (n = 52), (b) lesions with antiangiogenic therapy (n = 5), (c) small renal tumors less than 1.0 cm in diameter (n = 18) or multiple lesions in one kidney (n = 4), and (d) MR images with severe susceptibility artifacts or respiratory motion artifacts (n = 10). Transitional cell carcinomas (n = 14), cystic renal tumors (n = 4), and renal cysts (n = 8) were also excluded, because the purpose of this study was to differentiate solid benign and malignant tumors originated from renal parenchyma. Ten cases of classical angiomyolipoma were excluded because they are very easy to be differentiated from malignant renal tumors. The 1.0-cm lower threshold was selected to avoid potential confounding from partial volume averaging in smaller renal lesions. Finally, 125 patients were excluded, and a total of 180 patients (119 males and 61 females; mean age, 55.3 years; age range, 20–83 years) were included in the final cohort.

Pathological assessment revealed 166 cases of malignant renal tumors, which were divided into clear cell renal cell carcinoma (ccRCC) (n = 124) and non-ccRCC (n = 42) groups. The non-ccRCC malignant renal tumors were composed of papillary RCC (n = 11), chromophobe RCC (n = 16), clear cell papillary RCC (n = 5), MiT family translocation RCC (n = 2), non-Hodgkin lymphoma (n = 2), unclassified RCC (n = 3), and RCC with sarcomatous transformation (n = 3). Benign renal tumors (n = 14) included 12 fat poor angiomyolipomas and two renal oncocytomas. The mean interval between MR imaging and nephrectomy was 2.2 ± 2.5 days (range, 0–14 days).

Image data acquisition

All patients were examined with a 1.5-T MRI system (Magnetom Aera; Siemens Healthineers) equipped with an 18-channel body array coil. A routine multiparametric MR protocol included axial three-dimensional in- and out-of-phase T1-weighted interpolated breath-hold examination, T2-weighted turbo spin-echo pulse sequence with fat suppression, and pre- and postcontrast dynamic three-dimensional T1-weighted volumetric interpolated breath-hold examination at corticomedullary phase (20–25 s), nephrographic phase (80–90 s), and excretory phase (180 s) after injection of 0.1 mmol/kg gadopentetate dimeglumine (Magnevist; Bayer Schering Pharma AG) at a rate of 2 mL/s.

A multiple b value DWI scan was acquired using a work-in-progress echo-planar imaging sequence (WIP NO. 870) provided by the manufacturer (Siemens Healthineers). This WIP package supports a motion-insensitive PAT reference scan that is suitable for body diffusion measurements in free-breathing mode without respiratory gating in the acquisition. Parallel imaging was used, and fat was suppressed using spectral presaturation inversion recovery (SPIR). Eight b values (0, 25, 50, 80, 150, 300, 500, and 800 s/mm2) were used, TR/TE 5100/70 ms; section thickness, 5 mm; gap, 1 mm; field of view, 380 mm × 308 mm; matrix, 128 × 128; scan time, 5 min 42 s. Then, DKI was acquired separately using a single-shot echo-planar imaging pulse sequence in free-breathing mode. Six b values that ranged from 0 to 2000 s/mm2 (0, 200, 500, 1000, 1500, and 2000 s/mm2) with three orthogonal directions for every b value. TR/TE was 7300/70 ms, section thickness, 5 mm; gap, 1 mm; 380 mm × 308 mm; and acquisition time, 2 min 32 s. Acquisition parameters of the axial multiple b value DWI and DKI sequences are listed in Table 1.

Table 1 Scanning parameters of diffusion sequences

Image data analysis and processing

Multiple b value DWI data were used to compute mono- and biexponential models. Standard ADC map (b = 0 and 800 s/mm2) was generated with the software integrated into the MR system (Syngo; Siemens Healthineers) according to the following equation: S(b) = S0·exp. (−b·ADC). IVIM parameter maps (D, D*, f) were generated offline by using the postprocessing program provided with WIP NO.870 and fitting the following biexponential model [26]: S(b)/S0 = (1−f)·exp. (−bD) + f·exp. [−b(D* + D)], where S(b) was the signal intensity at a given b value, S0 was the signal intensity for b = 0 s/mm2, D was the diffusion parameter representing pure molecular diffusion, D* was the pseudo-diffusion coefficient representing incoherent microcirculation within the voxel, and f was the perfusion fraction of the diffusion linked to microcirculation. DKI parameter maps (K and D) were calculated offline by using prototype software (Body Diffusion Toolbox; Siemens Healthineers). The software performed a voxel-by-voxel analysis and the six b value data were fitted to the following equation [27]: S(b) = S0·exp. (−b·D + b2·D2·K /6), where K is a unitless parameter that represents the deviation of water motion from Gaussian diffusion, and D is the corrected ADC without Gaussian bias. Least-squares fit was used for linear fitting with the monoexponential and DKI models, and IVIM data were fitted with the Levenberg-Marquardt nonlinear fitting algorithm.

ADC and IVIM parameters were evaluated on the Syngo workstation, while mean kurtosis (MK) and mean diffusivity (MD) were measured on the offline software (Body Diffusion Toolbox). Two observers (Y.Q.D, with 7 years of experience and J.J.Z, with 30 years of experience in urinary system imaging diagnosis) who were blinded to pathologic diagnosis analyzed in consensus all the parameter maps in conjunction with T1- and T2-weighted images. Free-hand regions of interest (ROIs) with a mean size of 14.99 ± 15.57 cm2 (range from 0.93 to 104.10 cm2) were outlined around the tumor on D and MD maps at three representative slices and were simultaneously copied to D*, f, ADC, and MK maps, respectively. In order to minimize the selection bias, the largest possible ROIs to include entire tumors were drawn. When the lesion showed isointense with surrounding renal parenchyma on the D or MD maps, the ROIs were selected by referring to T1- and T2-weighted images. These ROIs were selected on the basis of visual inspection of the parametric maps and the averaged values of the three slices were used for final analysis. Thirty cases were randomly selected from ccRCC, non-ccRCC, and benign renal tumor groups (10 cases in each group) and measured by two observers independently to assess the interobserver agreement of diffusion parameters.

Statistical analysis

Data were presented as mean ± standard deviation (SD). ADC, IVIM, and DKI parameters between malignant and benign renal tumor groups were compared using the Mann-Whitney U test or one-way ANOVA. Receiver operating characteristic (ROC) curve analysis was performed to evaluate the diagnostic performances and determine the optimal cutoff value of each parameter in predicting malignancy. Areas under the curves (AUCs) were compared using the DeLong method [28]. Interobserver agreement of diffusion parameter measurements was assessed by using the intraclass correlation coefficient (ICC) with 95% confidence interval (CI) (≦ 0.20, slight; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, substantial; and 0.81–1.00, perfect) [29]. Statistical analyses were performed with SPSS (version 22.0) and MedCalc software (version 15.8.0). p < 0.05 was considered statistically significant.

Results

Population characteristics

A total of 124 patients (89 men and 35 women; mean age, 56.7 years; age range, 26–83 years) were confirmed with pathological examination to have ccRCCs (10 Fuhrman I, 92 Fuhrman II, 22 Fuhrman III, 0 Fuhrman IV). There were 42 patients (26 men and 16 women; mean age, 54.2 years; age range, 20–79 years) in the non-ccRCC group, and 14 patients (4 men and 10 women; mean age, 46.3 years; age range, 29–67 years) in the benign renal tumor group. The age of patients in benign renal tumor group was younger than those of the ccRCC group (p = 0.013). The average diameters of the ccRCC (4.7 ± 2.1 cm), non-ccRCC (4.6 ± 2.6 cm), and benign renal tumor (4.4 ± 2.7 cm) group were not statistically different (p = 0.553).

Diffusion metrics

Figures 1, 2, and 3 show the representative MR appearance on multiple diffusion images of pathologically confirmed ccRCC, non-ccRCC, and benign renal tumor, respectively. ADC, IVIM, and DKI parameters of ccRCC, non-ccRCC, and benign renal tumors are shown in Fig. 4 and Table 2. The ADC, D, and MD values of ccRCCs were higher, while D*, f, and MK values were lower than those of benign renal tumors (all p < 0.025). The D* and f values of non-ccRCCs were lower than those of benign renal tumors (p = 0.002 and p < 0.001, respectively). However, the difference of ADC, D, MD, and MK values between non-ccRCCs and benign renal tumors was not statistically significant (p > 0.05). The ADC, D, MD, and f values of ccRCCs were higher, while MK values were lower than those of non-ccRCCs (all p < 0.001). However, the difference of D* values between ccRCCs and non-ccRCCs was not statistically significant (p = 0.127). The agreements of diffusion parameters of the 30 randomly selected cases between two observers were perfect for ADC, D, MK, and MD with ICC (95%) of 0.974 (0.964–0.982), 0.976 (0.966–0.983), 0.904 (0.867–0.930), and 0.937 (0.913–0.955), respectively. The agreements of D* and f were substantial between two observers with ICC (95%CI) of 0.785 (0.711–0.841) and 0.750 (0.666–0.815), respectively.

Fig. 1
figure 1

A 67-year-old man with pathologically confirmed clear cell renal cell carcinoma in the right kidney. The lesion shows high signal intensity (SI) on T2-weighted image (a) and remarkable enhancement in corticomedullary phase (b). The lesion (ROI) demonstrates high SI on the ADC map (c), D map (d), D* map (e), and mean diffusivity map (h) and low SI on the f map (f) and iso-SI on the mean kurtosis map (g), with values of 2.05 × 10−3 mm2/s, 1.73 × 10−3 mm2/s, 51.5 × 10−3 mm2/s, 2.78 × 10−3 mm2/s, 14.45%, and 0.60, respectively

Fig. 2
figure 2

A 65-year-old woman with pathologically confirmed chromophobe renal cell carcinoma in the left kidney. The lesion shows low signal intensity (SI) on T2-weighted image (a) and slight enhancement in corticomedullary phase (b). The lesion (ROI) demonstrates very low SI on the ADC map (c), D map (d), f map (f), and mean diffusivity map (h) and high SI on the D* map (e) and mean kurtosis map (g), with values of 1.22 × 10−3 mm2/s, 1.09 × 10−3 mm2/s, 12.11%, 1.51 × 10−3 mm2/s, 28.47 × 10−3 mm2/s, and 0.93, respectively

Fig. 3
figure 3

A 55-year-old woman with pathologically confirmed epithelioid angiomyolipoma in the left kidney. The lesion shows very low signal intensity (SI) on T2-weighted image (a) and moderate enhancement in corticomedullary phase (b). The lesion (ROI) demonstrates very low SI on the ADC map (c), D map (d), f map (f), and mean diffusivity map (h) and high SI on the D* map (e) and mean kurtosis map (g), with values of 1.29 × 10−3 mm2/s, 0.96 × 10−3 mm2/s, 22.48%, 1.63 × 10−3 mm2/s, 54.25 × 10−3 mm2/s, and 0.87, respectively

Fig. 4
figure 4

Box-and-whisker plots show distributions of ADC (a), D (b), D* (c), f (d), mean kurtosis (e), and mean diffusivity (f) of ccRCC, non-ccRCC, and benign renal tumors. *p < 0.025, **p < 0.001. Dots mean outliers

Table 2 Diffusion parameters of malignant and benign renal tumors

Diagnostic performance of multiple parameters

ROC curves of multiple diffusion parameters for predicting malignant renal tumors were plotted in Fig. 5, and the corresponding diagnostic test characteristics are shown in Table 3. The AUC values of ADC, D, D*, f, MK, and MD were 0.849, 0.891, 0.708, 0.656, 0.862, and 0.838 for differentiating ccRCC from benign renal tumors, respectively. For pairwise comparisons of ROC curves, ADC was worse than D (p = 0.048) and not significantly different from D* (p = 0.170), f (p = 0.088), MK (p = 0.750), and MD (p = 0.811). The difference in diagnostic efficacy among D, MK, and MD was not statistically significant (all p > 0.05). The AUC values of D* and f were 0.772 and 0.866 for discrimination between non-ccRCC and benign renal tumors, respectively. For pairwise comparisons of ROC curves, the difference in diagnostic efficacy among D* and f was not statistically significant (p = 0.149).

Fig. 5
figure 5

Receiver operating characteristic curves of the diagnostic performance of ADC, D, D*, f, mean kurtosis, and mean diffusivity for differentiating between ccRCC and benign renal tumors (a) as well as D* and f for differentiating between non-ccRCC and benign renal tumors (b). Numbers are areas under the curves

Table 3 Diagnostic test characteristics of diffusion parameters in differentiating malignant from benign renal tumors

Discussion

In this study, we compared the diagnostic value of DWI, IVIM, and DKI in differentiating between benign and malignant renal tumors in a large cohort. Our results showed that DWI, IVIM, and DKI parameters were clinically feasible and of great value in differentiating between ccRCCs and benign renal tumors (p < 0.025). However, only IVIM parameters (D* and f) were able to discriminate non-ccRCCs from benign renal tumors (p < 0.01).

The benign renal tumor group in our study was composed of fat poor angiomyolipoma (n = 12) and oncocytoma (n = 2), which are the most common solid benign renal tumors misdiagnosed as renal cancer on preoperative imaging according to the surgical literature [2, 3, 5]. We found that ccRCCs had higher diffusion parameters (ADC, D, MD) than those of benign renal tumors (all p < 0.025). This might be partly explained by their histopathological characteristics. The alveolar and acinar structures of ccRCCs may dilate, producing microcystic and macrocystic patterns, so that the movement of water molecules is relatively unrestricted [30]. On the contrary, fat poor angiomyolipoma is composed of variable proportions of spindle cells or epithelioid smooth muscle cells and abnormal thick-walled blood vessels [30]. As a result, water molecules of fat poor angiomyolipoma in the extracellular interstitial space are obviously hindered because of higher cellularity and an increased viscosity of the tumor tissue [31]. Our findings are in line with previous studies [32,33,34]. The better accuracy of D values compared with ADC in differentiating ccRCCs from benign renal tumors in our study can be explained by the fact that ADC is a nonspecific measurement containing both the perfusion/microcirculation (f) and tissue cellularity (D) information [35], which may have opposite effects on the measurement of ADC in the evaluation of renal tumors, resulting in decreased sensitivity and specificity.

In our study, the perfusion-related IVIM parameters (D* and f) showed greater diagnostic accuracy than those of ADC and DKI parameters in distinguishing non-ccRCCs from benign renal tumors. This is not difficult to understand, since non-ccRCCs were mostly composed of renal tumors such as papillary RCC and chromophobe RCC with lesser enhancement than oncocytoma and fat poor angiomyolipoma in previous contrast-enhanced CT or MRI studies [36,37,38]. Previous study has reported a good correlation between f and perfusion-related parameters using gadolinium-based contrast agents for the discrimination of enhancing from non-enhancing renal lesions [35]. We noticed that D* and f values of benign renal tumors were even higher than those of ccRCCs in our study (p < 0.025). This may partly be related to the fact that fat poor angiomyolipoma contains large and malformed blood vessels, whereas ccRCC has abundant capillaries [30]. Higher D* values of fat poor angiomyolipoma in comparison with ccRCC have been reported in previous studies [19, 20], which is in agreement with our study. However, there is yet no consensus on the number of b values that should be used for extracting perfusion-sensitive information and the magnitude of b values that should be applied. Our study selected eight b values (0, 25, 50, 80, 150, 300, 500, and 800 s/mm2) according to the recommendations of Koh et al [39]. In a previous study [40], intraobserver reproducibility of renal tumors was excellent for ADC (coefficient of variation (CV), 3.45% to 3.82%; Bland-Altman limits of agreements (BA-LA), − 14% to 18%) and D (CV, 3.65% to 4.3%; BA-LA, − 14% to 16%), good for f (CV, 11.96% to 12.41%; BA-LA, − 34% to 30%), and poor for D* (CV, 25.0% to 33.65%; BA-LA, − 111% to 101%). However, the agreements of diffusion parameters between two observers were perfect for ADC and D and substantial for D* and f in our study, with ICC (95%) of 0.974 (0.964–0.982), 0.976 (0.966–0.983), 0.785 (0.711–0.841), and 0.750 (0.666–0.815), respectively. The possible reason might be that only the solid component of the tumor was included in previous study, while the entire tumor was included in our study. Although there were slight differences in IVIM parameter values of renal tumors in previous literature, better diagnostic performance of IVIM over DWI has been reported in grading of ccRCCs [18], differentiating enhancing from non-enhancing renal tumors [35] and differentiating benign from malignant renal tumors, as well as RCC subtypes [16, 17, 20].

Non-Gaussian diffusion, which has the potential to extract more microstructural information beyond ADC, has been found to be useful in the differentiation between low-grade and high-grade ccRCCs [21]. Furthermore, negative correlations were found between MK and MD (r = − 0.56, p < 0.001) and between MD and nuclear-to-cytoplasm ratio (r = − 0.36, p < 0.005), whereas MK and the nuclear-to-cytoplasm ratio were positively correlated (r = 0.45, p = 0.003) [21]. Unfortunately, no added value of the DKI parameters was found compared with ADC in differentiating malignant from benign renal tumors in our study. Theoretically, DKI model potentially better reflects water diffusivity in tissues at ultrahigh b values; it provides an additional parameter MK that contains specific information on the non-Gaussian diffusion behavior, and MD is presumed to be the more accurate measure of tissue diffusion, given its correction for non-Gaussian diffusion behavior [15]. However, obtaining sufficient signal-to-noise ratio at high b values in order for the calculated DKI metrics to be accurate is particularly challenging in body imaging. Rosenkrantz et al [14] suggested that at least three b values and three directions are needed for the application of the DKI sequence. We selected b values of 0, 200, 500, 1000, 1500, and 2000 s/mm2 and three diffusion directions to reduce potential perfusion effects when b value is less than 200 s/mm2 in our study [14]. The agreements of MK and MD between two observers were perfect in our study, with ICC (95%) of 0.904 (0.867–0.930) and 0.937 (0.913–0.955), respectively. Although most of the published papers consider that DKI is superior to DWI in tumor diagnosis and prognosis evaluation, no added values of DKI over DWI have also been reported by Li et al [41] in ovarian tumors, Roethke et al [42] and Tamura et al [43] in prostate cancer, Wan Q et al [24] and Das et al [44] in pulmonary nodules, and Yang L et al [45] in liver fibrosis. The controversial results have implications for ongoing efforts to optimize clinical DKI protocols, and large-scale prospective studies are undoubtedly needed to further validate our findings of DKI in renal tumors.

This study had several limitations. First, due to the limited number of non-ccRCCs and benign tumors of each histologic subtype, we simply divided renal tumors into three groups (ccRCCs, non-ccRCCs, and benign renal neoplasms). In reality, some histologic subtypes within those broad classes have different combinations of microvascular/microstructural features which may limit their distinction as an amalgamated class. Second, the regions of interest were selected in three representative slices instead of the entire renal tumors in this study, which might lead to some selection bias owing to tumor heterogeneity. Third, other advanced DW imaging models were not evaluated, and histogram metrics of the entire tumor were not assessed. In the future, the combination of different diffusion models with the detailed histologic subtypes should be investigated further in a larger study.

In conclusion, our results suggest that the D value is the best parameter for differentiating ccRCC from benign renal tumors, and the f value is the best parameter for differentiating non-ccRCC from benign renal tumors. However, our observations do not show clear added value for DKI compared with conventional DWI for discrimination between malignant and benign renal tumors.