Introduction

Angiomyolipoma (AML) is the most common renal mesenchymal neoplasm [1]. Most AMLs contain macroscopic adipose tissues that can be easily identified via computed tomography (CT) or magnetic resonance imaging [2]. According to the 2016 World Health Organization (WHO) classification, renal epithelioid angiomyolipoma (EAML) is a rare variant of AML that consists predominantly of epithelioid cells (> 80%) [3]. Unlike typical AMLs, EAMLs contain minimal or no fat that can be recognized, possibly leading to a misdiagnosis of renal cell carcinomas (RCCs) in histologic and imaging evaluations [4]. When we encounter enhancing renal mass with no or minimal fat, differentiating EAML from RCC or fat-poor AML is challenging. However, because of the rarity of this disease, imaging findings of EAML have been less well reported [5,6,7,8,9]. Additionally, to the best of our knowledge, there are no reliable image findings for distinguishing EAMLs from RCCs in CT. Although EAML has been regarded as a potentially malignant mesenchymal neoplasm and treated with surgical resection, the optimal treatment and prognosis of EAML remain controversial and different from those of RCC [10]. Thus, the ability to distinguish EAML from RCC is clinically important.

Radiomics analysis offers quantitative image information that cannot be achieved by radiologists’ visual interpretation [11]. It has been widely used to differentiate different types of renal tumors, including benign versus malignant tumors and RCC subtype classifications in cross-sectional images [12, 13]. Furthermore, recent studies reported accurate differentiation of fat-poor AML from RCC using radiomics-based model [14, 15]. In conjunction with subjective image features, a radiomics-based model may help to quantitatively distinguish EAMLs from RCCs.

In this study, we subjectively analyzed CT image findings of 28 EAMLs. Furthermore, we constructed a radiomics-based model combined with CT imaging characteristics to differentiate EAML from clear cell RCC and compared the diagnostic performance with that of radiologists. We selected clear cell RCC as the comparison tumor because clear cell RCC tend to be more hypervascular and heterogeneous appearance than other common subtypes including papillary and chromophobe RCC–the features that could overlap with EAML [16, 17].

Materials and methods

Study cohort

This retrospective case–control study was approved by the institutional review board, and the requirement for written informed consent was waived. We searched the pathology database of two tertiary care institutions for surgical specimens diagnosed with renal EAMLs between January 2006 and June 2021: We identified 31 patients (16 in center A and 15 in center B). We excluded three patients who did not have three-phase kidney protocol CT, including precontrast, corticomedullary, and excretory phases. Finally, a total of 28 patients (15 men and 13 women, mean age 45.6 ± 13.7 years) were included (Fig. 1).

Fig. 1
figure 1

Flowchart of patient selection in the study. EAML epithelioid angiomyolipoma, RCC renal cell carcinoma

For the comparison group, we collected data on 56 patients with a diagnosis of clear cell RCC (38 men and 18 women, mean age 61.2 ± 11.7 years) during the same study period in center A. We matched the average maximal diameter of the masses between the EAML and clear cell RCC groups to eliminate the influence of the size on discrimination of the tumors. Patients who underwent CT scans before June 2017 were assigned to a training set (20 EAMLs and 40 clear cell RCCs), and the remaining patients were assigned to a test set (eight EAMLs and 16 clear cell RCCs).

Subjective image analysis

Two fellowship-trained radiologists with 7 years of experience in genitourinary imaging, who were blinded to the patients’ information and histopathologic results, independently reviewed the CT scans of EAMLs and clear cell RCCs. Disagreement was resolved by a third radiologist with 17 years of experience in genitourinary imaging. We evaluated the following characteristics: maximal diameter of the mass; tumor laterality (right, left); tumor heterogeneity (homogeneous, heterogeneous); snowman or ice cream cone shape of the tumor; presence of calcification and fat; grade of tumor necrosis (< 33%, 34–66%, ≥ 67%); tumor margin (well-defined, infiltrative); high attenuation on precontrast scan; enhancement pattern (wash-in and wash-out, persistent enhancement); tumor-to-cortex (TOC) enhancement ratio in the corticomedullary phase; presence of intratumoral or peritumoral vessels; renal vein invasion; lymph node metastasis; and distant metastasis.

Tumors were defined as homogeneous when > 90% of the area was occupied by the similar attenuation value via visual inspection; otherwise, they were considered heterogeneous. The snowman shape was defined as a bilobulated, dumbbell-like appearance. The ice cream cone shape was considered present when the exophytic portion had a uniform rounded interface, whereas the endophytic portion had a tapering, pyramidal interface with a definable apex in the parenchyma, which was known to useful morphologic contour to predict AML [18]. In all three phases, we placed regions of interest (ROIs) as large as possible in the solid portion of the tumor, avoiding fat, necrotic, or hemorrhagic components. Additionally, we drew ROIs in the renal cortex in the precontrast and corticomedullary phases. The high attenuation on precontrast was defined as present when the attenuation of the mass was ≥ 10 Hounsfield units (HU) than that of the renal cortex. Wash-in and wash-out were considered present when the attenuation of the solid portion of the tumor decreased by ≥ 20 HU in the excretory phase as compared with the corticomedullary phase [19]; otherwise, the enhancement pattern was considered as a persistent enhancement. We calculated the TOC ratio by dividing the HU of the solid portion of the tumor by the HU of the renal cortex measured in the corticomedullary phase.

Tumor segmentation and feature extraction

We imported axial CT images of the precontrast and corticomedullary phases into commercially available software (MEDIP, medical IP). Lesions were semiautomatically segmented using intensity-based thresholding and the region-growing function in the corticomedullary phases. Subsequently, two radiologists manually and independently edited the margin of the tumors. The masks were copied and pasted to the precontrast image, and subsequent manual refinement was conducted.

We extracted the radiomics features by using syngo.via Frontier software (Siemens Healthineers), which was developed based on the PyRadiomics library (https://github.com/Radiomics/pyradiomics) and scikit-learn machine learning library (https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html). Images were resampled at a spatial resolution of 1 × 1 × 1 mm3 using linear interpolation and discretized with a bin width of 25 HU. Additionally, to minimize CT intensity changes and obtain more stable radiomics features, we normalized the intensity of the image using the following formula (where x represents the original intensity; f(x) represents the normalized intensity; μ indicates the average value; σ refers to variance; and s is an optional scaling ratio, which has been set to 1 by default) [20].

$$f\left( x \right) = \frac{{s \left( {x - {\upmu }_{x} } \right)}}{{{\upsigma }_{x} }}$$

A total of 110 radiomics features, including 18 first-order statistics, 17 shape features, 24 Gy-level co-occurrence matrices, 16 Gy-level run length matrices (GLRLMs), 16 Gy-level size zone matrices (GLSZMs), five neighboring gray tone difference matrices, and 14 Gy-level dependence matrices, were extracted. We extracted the radiomics features from both precontrast and corticomedullary phase images (220 features) and included 16 CT features obtained from the subjective image analysis. Excretory phase images were not used for radiomics analysis because of the different scan timing between two centers. Finally, we used a total of 236 features.

Feature selection and radiomics model development

Because radiomics features were vulnerable to segmented volumes of interest, we evaluated the agreement of the features between the volumes of interest. We excluded the radiomics features with an intra-class correlation coefficient (ICC) of < 0.75 in the interobserver study. The combined radiomics and subjective features were reduced to 10 features in the training set by classic minimum redundancy maximum relevance using the R2 difference. This algorithm guarantees the selection of features that are highly relevant to actual classes while reducing redundancy among selected features [21]. By using the 10 selected features, a random forest model was constructed. We conducted tenfold cross-validation using the repetition of feature selection and model development for each subset. The average area under the receiver-operating characteristic curve (AUC) and average sensitivity, specificity, and accuracy with a threshold probability of 0.5 were provided as performance metrics for the training set. The final trained model was applied to the test set. Two fellowship-trained radiologists (readers 1 and 2) and a third-year resident (reader 3) in training independently reviewed the test set. They were blinded to the clinical information and were encouraged to decide whether each tumor was either an EAML or clear cell RCC.

Statistical analysis

In the group comparisons, we used Fisher’s exact test for categorical variables and the Mann–Whitney U test for continuous variables, as appropriate. Subgroup analysis was conducted according to the size of the mass: < 4 cm, 4–7 cm, and ≥ 7 cm. To determine the independent risk factor for identifying EAMLs, we conducted multivariate logistic regression on the statistically significant parameters (P < 0.05) in the univariate analysis in the training set. Interobserver agreement was evaluated using the Cohen kappa coefficient for categorical variables and ICC for continuous variables. The kappa values were defined as follows: κ < 0, less than chance agreement; 0.01–0.20, slight agreement; 0.21–0.40, fair agreement; 0.41–0.60, moderate agreement; 0.61–0.80, substantial agreement; and 0.81–0.99, almost perfect agreement [22]. In terms of ICC, values ≥ 0.75 were considered reproducible. To evaluate the performance of the prediction model on the training set and test set, we conducted a receiver-operating characteristic analysis. The AUC of the radiomics model was compared with that of the three readers. All statistical analyses were conducted using SPSS (version 25; IBM), and a P < 0.05 was considered statistically significant.

Results

Baseline characteristics of EAMLs

Table 1 summarizes the information on the baseline characteristics of patients with EAMLs. The mean size of the tumors was 6.2 ± 5.0 cm (range, 0.9–17.6 cm). Three patients (10.7%) had a history of tuberous sclerosis. One patient showed bone metastasis on diagnosis. Two patients were presented with lung metastases at 20 and 61 months after surgery, respectively. The remaining 25 patients had no evidence of tumor recurrence or distant metastasis during the median follow-up of 12.5 months (range, 1–137 months).

Table 1 Baseline and radiological characteristics of the patients

Comparison of CT findings between EAMLs and clear cell RCCs

The patients with EAML were significantly younger than those with clear cell RCC (45.6 ± 13.7 vs. 61.2 ± 11.7 years, P < 0.001). The mean mass size was 6.2 ± 5.5 cm in the EAML group and 5.5 ± 3.4 cm in the clear cell RCC group, and did not show significant difference (P = 0.43). In subjective image analysis, the EAMLs showed less heterogeneous appearance than the clear cell RCCs did (78.6% vs. 98.2%, P = 0.002) (Fig. 2). The EAMLs showed more frequent snowman or ice cream cone shape than the clear cell RCCs did (39.3% vs. 5.4%, P = 0.002) (Fig. 3). Four of the 28 EAMLs had a fat component, whereas only one in the clear cell RCC group did (14.3% vs. 1.8%, P = 0.022) (Fig. 4). The EAMLs tended to have smaller necrotic portions (P < 0.001) and well-defined margins (92.9% vs. 71.4%, P = 0.024) than clear cell RCCs did. Additionally, the EAMLs showed more frequent high attenuation on precontrast images (71.4% vs. 17.9%, P < 0.001), persistent enhancement (32.1% vs. 1.8%, P < 0.001), and lower TOC ratio in the corticomedullary phase (0.55 ± 0.18 vs. 0.91 ± 0.17, P < 0.001). We described the result of subgroup analysis according to mass size in Table 2 as well as supplementary content.

Fig. 2
figure 2

ac A 49-year-old man with epithelioid angiomyolipoma. Unenhanced CT image showed a 2.1 cm tumor of high attenuation (48 HU) without gross fat in the right kidney upper pole (a). The tumor demonstrated mild enhancement in the corticomedullary phase (b) and homogeneous, persistent enhancement in the excretory phase (c). d A 75-year old man with clear cell renal cell carcinoma. The tumor showed almost similar enhancement with renal cortex in the corticomedullary phase, with heterogeneous appearance even in the small size

Fig. 3
figure 3

ac A 37-year-old woman with epithelioid angiomyolipoma. Unenhanced CT image showed a 4.6 cm tumor of mildly high attenuation (40 HU) without gross fat in the right kidney (a). The tumor was protruding to the renal sinus with bilobulated shape (arrows) and so-called snowman appearance in coronal image (b). In the excretory phase, the tumor showed delayed wash-out, with no delineable necrotic area (c). d A 61-year-old man with clear cell renal cell carcinoma. In contrast to the aforementioned case, the tumor protruded to renal sinus without any disturbance, destroying the normal renal architecture (arrows)

Fig. 4
figure 4

A 77-year-old man with epithelioid angiomyolipoma. a A fat-containing solid mass was found in the right kidney (arrow). Initially, the lesion was interpreted as a classic angiomyolipoma. bd After 3 years of follow-up, the size of the tumor increased to 7.2 cm. Unenhanced CT image (b) showed a focal fat component at the medial side of the tumor (arrowhead). The tumor showed heterogeneous contrast enhancement, areas of hemorrhage, and necrotic portion (c). In the coronal plane, the endophytic portion had a tapering interface (arrowheads), which showed an ice cream cone shape (d)

Table 2 Comparison of radiologic characteristics between EAML and clear cell RCC according to the size

Univariate and multivariate analyses of CT parameters in identifying EAMLs

Table 3 presents detailed information regarding the univariate and multivariate analyses for identifying EAMLs in the training set. In the univariate analysis, homogeneous enhancement, snowman or ice cream cone tumor shape, lower grade of necrosis (< 33%), well-defined margin, high attenuation on precontrast images, persistent enhancement, and lower TOC ratio in the corticomedullary phase (< 0.7) were significant factors for identifying EAMLs (P < 0.05 for all). In the multivariate logistic regression, snowman or ice cream cone tumor shape (odds ratio [OR] 16.3; 95% confidence interval [CI] 1.7–156.9, P = 0.02) and lower TOC ratio in the corticomedullary phase (OR 33.4; 95% CI 5.7–197, P < 0.001) were significant independent factors in terms of identifying EAMLs.

Table 3 Univariate and multivariate analyses of CT parameters in identifying epithelioid angiomyolipoma

Radiomics feature extraction and development of the radiomics prediction model

Table 4 presents patient characteristics of the training and test sets. Among the 220 radiomics features extracted in the precontrast and corticomedullary phases, we excluded 28 with inacceptable interobserver agreement on image segmentation (ICC < 0.75). Finally, we used 208 features (192 radiomics features and 16 subjective image features) to develop the radiomics prediction model. The 10 most relevant features selected from the training set were three subjective image features (snowman or ice cream cone tumor shape, high attenuation on precontrast images, and TOC ratio), five radiomics features from the corticomedullary phases (two first orders, one GLRLM, one GLSZM, and one shape feature), and two radiomics features from the precontrast phases (GLRLM and GLSZM). Figure 5 presents a heat map of the selected radiomics features between EAMLs and clear cell RCCs.

Table 4 Patient characteristics in the training and test sets
Fig. 5
figure 5

Heat map of the 10 most relevant radiomics features for differentiating between EAMLs and clear cell RCCs. AML angiomyolipoma, RCC renal cell carcinoma, CMP corticomedullary phase, TOC tumor-to-cortex, GLRLM gray-level run length matrix, PRE precontrast phase, GLSZM gray-level size zone matrix

From the tenfold cross-validation, the average AUC of the radiomics model was 0.84, with a sensitivity of 90%, a specificity of 70%, and an accuracy of 77% in the training set. In the test set, the average AUC of the model was 0.89, with a sensitivity of 62.5%, a specificity of 87.5%, and an accuracy of 79.2%.

Comparison between the radiomics model and radiologists

Table 5 and Fig. 6 present information on the diagnostic performance of the radiomics model and the three readers. The AUCs of the three readers were 0.81 (95% CI 0.60–0.94), 0.78 (0.57–0.92), and 0.63 (0.41–0.81) for readers 1, 2, and 3, respectively. The sensitivity and specificity of each reader were 87.5% and 75% for reader 1, 62.5% and 93.8% for reader 2, and 75.0% and 50.0% for reader 3, respectively. There were no significant differences between the AUCs of the radiomics prediction model and those of the reader 1 and 2 (P = 0.55 and 0.41, respectively). The AUC of the radiomics model was significantly higher than that of the reader 3 (P = 0.04).

Table 5 Diagnostic performance of the radiomics prediction model and the three readers in the test set
Fig. 6
figure 6

Receiver-operating characteristic curve (ROC) of the radiomics model and those of the three readers for differentiating epithelioid angiomyolipoma and clear cell renal cell carcinoma in the test set. The area under the ROC curve of the radiomics model was comparable with those of two genitourinary radiologists (reader 1 and 2, P = 0.55 and 0.41, respectively), and higher than that of a third-year resident (reader 3, P = 0.04)

Interobserver agreement

Table 6 summarizes the information of the interobserver agreements. Two readers showed moderate to almost perfect interobserver agreement for the categorical variables (κ: 0.60–1.0) and a reliable ICC (0.96) for TOC ratio measurement.

Table 6 Interobserver agreements of CT parameters in subjective image analysis in the study population between two readers

Discussion

Unlike other subtypes of AML, EAML is a potentially malignant tumor and can be locally aggressive or metastasize, thus surgical resection could be justified [17]. However, there might be significant differences in outcome for clear cell RCC and EAML. Clear cell RCCs have the lowest 5-year survival rate and the worst prognosis among the RCC histologic subtypes, while EAML can exhibit various clinical course from benign to malignant behavior [23, 24]. Furthermore, clear cell RCC and EAML indicate different responses to the molecularly targeted therapies, especially in unresectable cases [25,26,27,28]. Hence, it is important to accurately differentiate EAML from clear cell RCC in imaging, which would be helpful for planning treatment strategies and precision medicine. We analyzed the CT findings of EAMLs and developed a radiomics-based model to differentiate EAMLs from clear cell RCCs. A radiomics-based model was developed from the radiomics features and the subjective image findings, which could be achieved from the CT images. By using this non-invasive information, the radiomics-based model showed good performance in distinguishing EAMLs and clear cell RCCs (AUC 0.89), which was similar to that of genitourinary radiologists (AUC 0.81 and 0.78) and superior to that of a third-year resident (AUC 0.62). To our knowledge, this is the first study to compare the CT findings between the EAML and clear cell RCC, which are difficult to differentiate.

EAML is predominantly composed of epithelioid cells, and according to the 2016 WHO classification, the tumor contains almost no fat. The results from several previous reports also suggest that the EAML has no or a very small volume of fat component in imaging [6, 8, 9]. Similar to these results, fat was detected in only four of the 28 cases in our study. The remaining 24 lesions, which had no definite fat in the precontrast image, were mostly misdiagnosed as RCCs or renal sarcomas in our radiologic reports. These results demonstrate the importance of knowing the different CT manifestations between EAML and RCC.

We found some useful CT findings for distinguishing EAML from clear cell RCC. The snowman or ice cream cone shape of the tumor was one of the independent factors for identifying EAML. The ice cream cone shape or so-called angular interface has been known to be accurate for characterizing benign renal tumors, especially in the exophytic location [18, 29]. The snowman shape also suggests that the tumor was compressed by the adjacent renal parenchyma in the waist of the mass, which is a more favorable finding of a benign tumor. In our study, five EAMLs with exophytic location exhibited an ice cream cone shape, and five with endophytic location showed a snowman shape. Additionally, this finding was especially common in 4–7 cm EAMLs (87.5%). The EAMLs had more frequent snowman or ice cream cone shape than the clear cell RCCs only in the lesions with 4–7 cm, but not in < 4 cm or ≥ 7 cm. Thus, this distinctive shape could be used intuitively and usefully to distinguish EAML and clear cell RCC in moderately sized renal masses. Additionally, our result is inconsistent with the previous study by Kim et al., that ice cream cone shape showed significant difference between RCC and fat-poor AML in small size (< 3 cm) [18]. Although fat-poor AML does not contain enough fat to be detected with images, it can have up to 25% fat cells per high-power field [30]. However, EAMLs predominantly consist of epithelioid component with few or no fat cells. These different histologic compositions between EAML and fat-poor AML could result the discrepancy in tumor shape.

Several imaging findings consistently showed a difference between EAML and clear cell RCC regardless of tumor size. The hyperattenuation on unenhanced CT of EAML has been reported in previous studies, and this might be due to the significant amount of epithelial muscle component [8, 10]. Our study was also consistent with previous results, and the findings on imaging were useful for distinguishing EAMLs from clear cell RCCs (71.4% vs. 17.9%). Both enhancement pattern and degree have been known to reflect the microvessel densities of the tumors, and they were also different between EAML and clear cell RCC [31]. The clear cell RCC group typically showed almost the same enhancement as the cortex in the corticomedullary phase (TOC ratio 0.91) and a frequent wash-out in the excretory phase (98.2%), the so-called “fast in and fast out” [30]. By contrast, the EAML group showed a lower enhancement ratio (TOC ratio 0.55) and less frequent wash-out (67.9%) than the RCC group did, which was similar to the results reported by Luo et al. (TOC ratio 0.64, slow wash-out 68%) [32].

Tumor necrosis is known to be common in EAML as a result of active tumor cell proliferation and insufficient internal blood supply [9, 32]. Nevertheless, the clear cell RCC group tended to demonstrate a higher grade of necrosis than the EAML group did. Additionally, EAMLs < 4 cm in size tended to be homogeneous (50%), whereas most of the clear cell RCCs < 4 cm were heterogeneous (95.8%), even in the small size. These findings might have contributed to the difference in the growth pattern of tumor cells and the degree of nuclear atypia between definite malignant tumors and tumors with malignant potential [33].

Among the clear cell RCCs ≥ 7 cm in this study, 81.2% (13/16) were surgically proven to have perirenal fat invasion and 43.8% (7/16) had renal vein involvement. These pathologic results might be associated with the CT findings of infiltrative margin and gross renal vein invasion of the tumors. Because an RCC > 7 cm has higher risk of perirenal fat invasion or renal vein involvement, a large renal mass with a well-defined margin and absence of renal vein thrombosis would more likely suggest EAML than clear cell RCC [34].

Interestingly, the results of the radiomics features finally selected for the radiomics-based model confirm the results of the visual interpretation. The first-order features from the corticomedullary phase (90th percentile and root mean squared) that were decreased in the EAML group would be attributed to a relatively lesser degree of enhancement of the EAMLs than that of clear cell RCCs. The TOC ratio, which was obtained by a radiologist, was also included among the final 10 features, confirming previous findings. The three-dimension shape features from both radiomics (flatness, ranging from 1 [non-flat, sphere-like] to 0 [flat, non-sphere-like]) and from visual interpretation (snowman or ice cream cone shape) were included among the final 10 features. In concordance with the subjective assessment, flatness was significantly lower in the EAML group, suggesting the non-sphere-like shape of the EAMLs. Furthermore, the GLRLM and GLSZM features that were included in the radiomics model might reflect the difference of heterogeneity and necrotic portion between the EAML and clear cell RCC.

The present study had several limitations. First, the retrospective design might have introduced a selection bias. Additionally, although the study was conducted in two centers, the sample size was small. However, this was avoidable because of the rare nature of EAML. Second, CT images were acquired from different CT scanners with heterogeneous CT parameters, which could have influenced the results of the obtained radiomics parameters [35]. However, we tried to minimize this limitation using several processes, including voxel size resampling, CT number normalization, and exclusion of excretory phase images acquired at different times in two centers [20, 36]. Consequently, we obtained a radiomics-based model with acceptable accuracy in the test set. Third, we did not evaluate the percentage of the epithelioid component. The percentage of epithelioid component affects the CT manifestation between EAML and AML, with a recent study reporting a radiologic–pathologic correlation in the epithelioid component [32]. Finally, EAMLs can be also misdiagnosed as other renal tumors, such as fat-poor AML or non-clear cell RCC, but we did not include this analysis. Further comparison with renal tumors other than clear cell RCC is warranted.

In conclusion, our study found that EAML has characteristic CT signs that can be distinguished from clear cell RCC. Furthermore, our radiomics-based model has the potential to differentiate EAML and clear cell RCC with better diagnostic performance compared with that of an inexperienced radiologist and could be useful when choosing treatment strategies in patients with indeterminate renal masses.