Renal cancer accounts for more than 2% of cancers in humans worldwide [1]. In the United States, the annual incidence of renal cancer increased yearly by 1.6% over the past decade, with over 63,000 new cases in 2014 [2]. A majority of renal tumors are incidentally diagnosed on medical imaging, being often asymptomatic, small in size, and early stage [3, 4].

The importance of the work presented here is in the context of active surveillance (AS) [4]. If a lower grade tumor can be radiologically diagnosed with confidence, the impact on clinical management decision-making in these patients will be significant. It may substitute the need for a percutaneous renal biopsy, if considered, but will also provide more information that will be useful if active surveillance is considered as a management option [5], particularly for patients with small renal masses (<4 cm) and elderly patients or patients with significant comorbidities. As AS requires strong patient commitment, as well as imaging follow-up, having this information while management options are discussed with patients is very helpful. Additionally, it will also help in providing prognostic information and therefore help in building a personalized cancer model for the patient.

There are distinct quantifiable features characterizing renal cell carcinomas (RCC) on contrast-enhanced CT (CECT) examinations, such as peak tumor enhancement, tumor heterogeneity, and percent contrast washout [613]. Qualitative impressions based on visual inspections of the images are frequently sufficient for making major clinical management decisions.

Prior studies have demonstrated the degree of CT enhancement as a potentially valuable parameter in differentiating between RCC subtypes [6]. The purpose of this study is to focus on a specific subtype, clear cell RCC, and assess whether quantitative attenuation, enhancement, and washout metrics may be used to predict the Fuhrman grade of clear cell RCC.

The Fuhrman grading system is widely used for histologic grade stratification of RCC on the basis of nuclear size and shape, as well as the prominence of nucleoli [14]. The original grading system had 4 grades: grade 1 with small, round, uniform nuclei, inconspicuous nucleoli; grade 2 (40% of tumors) with slightly irregular nuclei, nuclear diameter of 15 microns and open chromatin; grade 3 (30%–40% of tumors) with very irregular nuclei, nuclear diameter of 20 microns and open chromatin; and grade 4 (15% of tumors) with mitoses, bizarre, multilobulated, pleomorphic cells, and macronucleoli, in addition to grade 3 features.

Several studies [1520] have shown that the original moderate intra- and interobserver agreement among pathologists is improved to substantial agreement when the Fuhrman grading system is collapsed to two categories where Fuhrman grades 1 and 2 are considered together as low grade and Fuhrman grades 3 and 4 are considered together as high grade.

More recently, with an aim to improve interobserver variation, International Society of Urological Pathology (ISUP) grading system of RCCs has been developed, based on tumor cell nucleolar prominence [2123]. ISUP also has four grades: grade 1 with nucleoli inconspicuous or absent, grade2 with nucleoli not prominent but clearly visible at high-power magnification, grade 3 with prominent nucleoli, easily visualized at low-power magnification, and grade 4 with tumor giant cells present and/or marked nuclear pleomorphism [2123].

Materials and methods

Patients

In this institutional review board-approved, Health Insurance Portability and Accountability Act-compliant study, we retrospectively queried our IRB approved and prospectively maintained surgical database for post-nephrectomy patients who had pathology proven ccRCC and who had preoperative multiphase CECT of the abdomen between June 2009 and June 2011. A total of 65 multiphase renal CT examinations with clear cell RCCs were utilized. 48 of the patients were male, and 17 were female. The mean age of the patients was 61.5 ± 12.1 years. There were 44 tumors with Fuhrman grades 1 or 2 (considered together) as well as 21 tumors with grades 3 or 4 (considered together). There were two primary reasons for considering grades 1 and 2 together, and grades 3 and 4 together: first, as discussed above, several studies have shown significant improvement in intra- and interobserver agreement among pathologists when the grading system is thus collapsed, and secondly, the distribution of cases in our sample was such that there were very few grade 1 and grade 4 cases.

CT examination

All CT examinations were performed with a 64-detector row helical CT scanner (Brilliance, Philips Healthcare, CT). The CT scans were obtained during patient breath-holding with the following parameters: 120 kVp, variable tube current, slice thickness of 0.5 mm with reconstruction interval of 2 mm. Non-contrast, arterial, nephrographic, and excretory phase images of the abdomen were obtained. Approximately 100–150 mL of non-ionic intravenous contrast material (Isovue 350; Bracco Imaging) dosed to weight was administered with a power injector at a rate of 5 mL/sec. Time delay to scanning for arterial phase images, nephrographic phase images, and excretory phase images were 25 s, 90 s, and 5 min, respectively. Arterial phase images were obtained rather than corticomedullary phase images to create a true arterial map for surgical planning. The arterial phase used in our study is essentially similar, however, a bit earlier than a corticomedullary phase. In our routine diagnostic workflow, we did not detect a difference in diagnostic quality with studies performed with arterial phase vs. corticomedullary phase for the patients who had both as comparisons.

Data processing

Multiphase CT acquisitions were transferred to a dedicated Synapse 3D workstation (Fujifilm Medical Systems U.S.A., Stamford, CT). Renal tumor voxels were segmented manually, slice by slice, from surrounding voxels using Synapse 3D as 3D ROIs (Fig. 1), where segmentation refers to defining the area of interest voxel by voxel so that quantitative analysis is possible. Nephrographic phase of imaging was used for segmentation as it provided the clearest tumor demarcation. In only a few cases where the nephrographic phase did not provide adequate demarcation between the tumor and the normal parenchyma, other phases as well as the embedded edge detection software available in Synapse 3D were used. Additionally, the majority of tumors were at least partially exophytic, a feature which was also used to obtain the optimal segmentation. The affected kidney and tumor were segmented out in all phases to facilitate co-registration. The entire 3D tumor volume was chosen as the ROI—as opposed to subjectively selecting smaller subareas of the tumor in a 2D plane. There were several reasons for this approach, with easier automated imaging processing workflow, accounting for tumor heterogeneity, simpler ROI definition, and improved reproducibility being the most important ones.

Fig. 1
figure 1

The whole renal tumor is selected as a ROI. Reconstructions in axial, sagittal, and coronal planes show that the complete tumor is used for quantification. The rendered image display provides a visual reference and also references the total volume of the tumor analyzed

The DICOM-formatted CT images were converted into NIfTI (Neuroimaging Informatics Technology Initiative) volumes. NIfTI (http://nifti.nimh.nih.gov/) is a NIH-sponsored imaging initiative initially created to speed the development and enhance the utility of informatics tools related to neuroimaging. NIfTI is a file format which contains the entire imaged volume as well as a transformation which orients the voxels in three-dimensional scanner space. Given that it is just a file format representing voxels with no inherent knowledge about what the voxels contain, it is not constrained to a particular body part and can be used to convert any multi-file DICOM grayscale image into a single volumetric file.

DICOM keeps the images as individual slices, while NIfTI treats images as multidimensional volumes. The series of images were then co-registered using a Normalized Mutual Information (NMI) cost function implemented in Statistical Parametric Mapping SPM8 software package (The Wellcome Trust Centre for Neuroimaging at UCL, London, UK). NMI is a commonly used image similarity measure in image co-registration. A visual inspection of co-registered images was used to spot check for gross mis-registration. Custom MATLAB (MathWorks, Natick, MA) code was used to extract voxel data corresponding to the ROI.

Subsequently, histogram analysis of voxel data distribution was performed. Histogram mean, median, skewness, kurtosis, standard deviation, and interquartile range were extracted, using custom MATLAB analysis framework, for comparison between the 2 tumor grade groups for the following parameters: attenuation in each phase, absolute enhancement (arterial—pre-contrast), two washout parameters (arterial—nephrographic, arterial—excretory), and residual enhancement in nephrographic phase (nephrographic—pre-contrast) (Tables 1, 2).

Table 1 Histogram analysis of attenuation values by phase of imaging, comparison between low-grade tumors and high-grade tumors
Table 2 Histogram analysis of enhancement and washout parameters, comparison between low-grade tumors and high-grade tumors

While mean and median of parameter values are easily understood, the other histogram distribution parameters need further explanation. They are primarily ways to evaluate variance within a dataset. A more heterogeneous tumor would be expected to have greater variance, in other words, higher standard deviation and interquartile range, and lower kurtosis. Kurtosis is a measure of “peakedness” of histogram distribution, skewness is a measure of asymmetry of histogram distribution, interquartile range is a measure of variability which is not as sensitive to outliers as standard deviation, and lastly standard deviation is a measure of degree of dispersion of attenuation values in a dataset.

Statistical analysis

In this work, Fuhrman grades 1 and 2 were considered together as low grade and Fuhrman grades 3 and 4 were considered together as high grade.

Statistical analysis was performed using SAS 9.4 (SAS Institute Inc., Cary, NC).

Data distribution was examined using D’Agostino-Pearson test, Anderson–Darling test, and histogram-based visual inspection.

Independent t test was used for normally distributed parameters; otherwise, Wilcoxon rank sum test was used. p values less than 0.05 were considered to indicate statistical significance.

Results

Tables 1 and 2 show the results of the histogram analysis of the parameter values discussed above for the lower grade tumors vs. the higher grade tumors.

Statistically significant difference was observed in four measured imaging parameters between grades 1–2 and grades 3–4: interquartile range of nephrographic phase attenuation, standard deviation of absolute enhancement (arterial—pre-contrast), as well as interquartile range and standard deviation of residual nephrographic enhancement (nephrographic—pre-contrast).

Standard deviation of absolute enhancement was 41.26 HU for grades 1–2 and 34.66 HU for grades 3–4 (p value 0.03) (Fig. 2).

Fig. 2
figure 2

Statistically significant difference was observed in standard deviation of absolute enhancement (arterial—pre-contrast). Left graph an example grade 2 RCC, SD 45 HU, right graph an example grade 3 RCC, SD 24

Interquartile range of nephrographic phase attenuation was 292.86 HU for grades 1–2 and 241.19 HU for grades 3–4 (p value 0.02) (Fig. 3).

Fig. 3
figure 3

Statistically significant difference was observed in interquartile range for nephrographic phase attenuation. Left graph an example grade 2 RCC, nephrographic phase interquartile range 549, right graph an example grade 3 RCC, nephrographic phase interquartile range 219

Interquartile range was 297.12 HU for residual nephrographic enhancement for grades 1–2 and 235.57 HU for grades 3–4 (p value 0.02), and standard deviation of the same was 42.45 HU for grades 1–2 and 37.11 HU for grades 3–4 (p value 0.04).

Discussion

Determining the Fuhrman nucleolar grade of a clear cell RCC has important management implications, especially in the context of active surveillance. Determining the Fuhrman grade and its evolutionary counterpart, the ISUP nucleolar grade, on pathologic evaluation, is the most commonly used method to prognosticate renal masses. The development of a non-invasive surrogate would add an additional tool in the management of patients with RCC. However, the development and validation of these metrics to classify the nucleolar grade is complicated given the interobserver variability of pathologic evaluations. In addition, tumors may be heterogenous, and therefore, a single tumor may have loci of more than one tumor grade. Our results suggest that absolute enhancement is more heterogeneous for lower grade tumors and that residual enhancement in nephrographic phase is more heterogeneous for lower grade tumors.

Zhu et al. [24] retrospectively evaluated 255 patients with clear cell RCC and concluded that there was an inverse association between tumor enhancement and nuclear grade of RCC, with low tumor enhancement in the corticomedullary phase an independent predictor of high tumor grade. Our results demonstrated the same trend; however, it was not statistically significant. This may be partially due to sample size, but Zhu’s study differs from ours in that they used subjectively selected smaller ROIs from 2D images, while in our study, the entire 3D tumor volume was defined as ROIs. As discussed above, our technique is subject to less interobserver variability and also accounts for heterogenous tumors. Other minor differences were that they used slower rate of contrast injection (3 vs. 5 mL/sec), and they used corticomedullary phase (30 s delay) vs. true arterial (25 s delay) as used in our study.

The negative correlation found between tumor grade and CT enhancement for clear cell RCC [24, 25] as well as for other tumors such as intrahepatic cholangiocarcinoma [26] has been reported to correlate with microvascular density of the tumors [24, 26]. Wang et al. established in their study of 24 cases of RCC that areas of higher microvascular density resulted in higher CT enhancement [27]. Combining the results of Wang et al. and Zhu et al., a possible but unproven explanation is that lower grade tumors may have higher microvascular density. It is conceivable this higher microvascular density of lower grade tumors then results in more heterogeneous absolute enhancement as well as more heterogeneous residual enhancement in nephrographic phase, as suggested by our results. These possible explanations, however, need further validation. Given the importance of developing quantitative models to predict Fuhrman grade of RCC, this is an important first step in devising a predictive model. Quantifiable physical parameters including a shape and volume analysis as well as texture analysis could provide more diagnostic information.

There are several limitations to this study. This was a single-center retrospective study, and the results ideally need to be validated in a prospective multicenter trial. However, the straightforward definition of the ROI used in this study, viz. the entire 3D tumor volume, should make reproduction of the results at another center possible. Another limitation of this study was that we did not have a quantitative algorithm available to account for the degree or amount of necrosis; we are in fact evaluating such a feature currently. It is known that the amount or degree or necrosis does correlate with a higher grade of tumors and worse prognosis [28, 29]. Lastly, the sample size of our study was relatively small.

Conclusions

This study demonstrated statistically significant difference in four of the parameters characterizing contrast enhancement and washout between lower grade and higher grade tumors, viz. interquartile range of nephrographic phase attenuation, standard deviation of absolute enhancement, as well as interquartile range and standard deviation of residual nephrographic phase enhancement.

Our results suggest that absolute enhancement and residual enhancement in nephrographic phase are both more heterogeneous for lower grade tumors.

Given the importance of developing quantitative model to predict the nucleolar grade, this is an important step in coming up with a non-invasive model.