Introduction

Renal cell carcinoma (RCC) is the seventh most common malignant tumour, killing more than 170,000 people worldwide in 2018 [1]. Clear cell RCC (ccRCC) is the primary histological subtype, accounting for approximately 90% of RCC [2]. CcRCC is an extremely aggressive urologic neoplasm. Approximately 30% of patients have distant metastases at the initial diagnosis (synchronous distant metastasis, SDM) [3], whereas approximately 20% of patients without distant metastases at initial nephrectomy will develop distant metastases during follow-up (metachronous distant metastasis) [4]. Among them, ccRCC with SDM has a distinct pathologic and molecular phenotype and results in a poor prognosis [5]. Early identification of SDM can ensure the reasonable, and effective treatments were timely offered or performed and eventually improve patient survival [6,7,8,9].

In general, urologists often evaluate the risk of SDM based on experience, such as the tumour size and clinical symptoms. The tumour size was indeed a valuable predictor for SDM (odds ratio (OR), 1.22 to 1.25) [10, 11]. However, small ccRCCs can also present SDM [12], and the tumours with similar size can demonstrate the different risks in SDM, so the overreliance on the tumour size can lead to the missed diagnosis of SDM. Furthermore, the clinical signs of SDM can be slight or even absent, which could lead to the ignorance of further investigation on SDM by the doctors.

Nowadays, many advanced imaging modalities can contribute to the detection or confirmation of SDM. However, it is of a massive economic and physical burden to use multiple imaging methods to check all potential metastatic sites for each patient with ccRCC; after all, RCC can metastasise to virtually any part of the human body [13]. Besides, sometimes even enough imaging examinations were performed, the risk of a missed diagnosis and misdiagnosis still exists due to the small size of suspicious metastases or an atypical radiographic appearance [14,15,16]. So, besides the tumour size, how we can precisely evaluate the personalised risk of the SDM with increased confidence raised our concern.

The SDM was closely related to the primary tumour. The nature of the primary tumour, such as histological grade, T staging, gene expression and functional characteristics, may be associated with distant metastases [3, 17, 18]. Medical imaging can noninvasively visualise the strong phenotypic difference that primary tumours exhibited [19]. Magnetic resonance imaging (MRI), as the preferred examination method for renal tumours, can provide abundant data related to anatomy, morphology and potential functionality based on its superior soft-tissue contrast, spatial resolution and multiple imaging sequences. Nevertheless, we cannot identify some potential features on a visual basis. As an emerging technique used to extract additional information from diagnostic clinical imaging, radiomics can extract high-throughput quantitative data from medical images and indirectly reflect tumour heterogeneity which could not be recognised by the naked eye [20]. The correlation between quantitative data and pathological phenotypes allows the noninvasive detection of tumour bio-behaviours [21]. Some studies have confirmed that radiomics features could be used as prognostic or predictive indicators [20, 21]. At present, radiomics for kidney cancer has demonstrated the ability in the prediction of histological subtype, pathologic grade and therapeutic effects [20, 22,23,24,25].

In this study, we will evaluate whether a multiparametric MRI radiomics-based nomogram can make individualised prediction of SDM in patients with ccRCC and facilitate the more precise selection of ccRCC patients for further investigation on SDM in clinical practice.

Materials and methods

Patients’ information

Ethical approval was obtained for this retrospective analysis, and informed consent was waived by the institutional review board of our hospital. In this study, one training cohort, an independent internal validation cohort and an external validation cohort were established, respectively. The training cohort and the internal validation cohort comprised a retrospective collection of patients with histologically confirmed ccRCC who visited our centre between January 2013 and October 2018 and from November 2018 to June 2019, respectively. The external validation cohort was established by collecting the patients in the other three medical centres between January 2015 and February 2019. CcRCC patients with SDM were consecutively collected, while patients without SDM were randomly selected according to the distribution of tumour size of ccRCC in the SDM group (4 cm/4–7 cm/7–10 cm/10 cm). The sample size ratio between the SDM group and the non-SDM group was 1:2, which was approximately in line with the incidence of SDM in ccRCC (1/4–1/3) [3]. SDMs were confirmed by biopsy or surgical pathology or follow-up (the suspicious metastases existing before nephrectomy gradually increased in size or number or decreased after molecularly targeted therapy between 6 months and 5 years of follow-up); the patients without SDMs were confirmed by no signs of distant metastases through whole-body imaging examinations before nephrectomy or no suspicious metastasis during follow-up. The enrolment of patients is displayed in Fig. 1, and the inclusion and exclusion criteria are listed in the Supplementary Material.

Fig. 1
figure 1

The flowchart of patient recruitment

MRI acquisition

MR images were acquired with 3.0 T scanners (Signa HDX and Discovery MR750, GE Healthcare) and consisted of six sequences including axial fat-suppressed T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI), T1-weighted imaging (T1WI) and contrast-enhanced T1WI (corticomedullary phase, nephrographic phase and excretory phase (EP)). The detailed parameters of MRI sequences are described in Supplementary Table 1.

Clinicoradiologic model construction

Demographic and clinical data were collected by a urologist (Q.B.H.) from the medical records of the four institutions, which are defined in Table 1. Two radiologists (W.X. and X.B., with a 9-year experience and 5-year experience in abdominal MRI diagnosis, respectively) who were blinded to the clinical data and pathology results simultaneously evaluated the MR images at the picture archiving and communication system (PACS) workstation in consensus.

Table 1 Clinicoradiologic risk factors for SDM in ccRCC patients

The differences of clinicoradiologic characteristic between ccRCC patients with and without SDM in the training cohort were analysed, then the clinicoradiologic characteristics with significant differences were selected to construct a clinicoradiologic model by multivariable logistic regression analysis. The calibration of the model was evaluated by the calibration curve and the Hosmer-Lemeshow test.

MR image segmentation and radiomics feature extraction and selection

Based on kidney MRI data from the PACS, the images from FS-T2WI, DWI, T1WI and contrast-enhanced T1WI were exported and uploaded to the RadCloud (Huiying Medical Technology Co., Ltd.) which was an available platform reliably used in previous studies [26, 27]. Then, except for the first and last slices, the outline of all contiguous slices of the entire tumour were delineated [28] and 1029 radiomics features were extracted from multiparametric MR images, respectively, including first-order features, shape features, second-order features and higher-order features derived from the robust transformation of first-order and second-order features. The transformation methods included exponential, square, square root, logarithmic and wavelet (wavelet-LHL, wavelet-LHH, wavelet-HLL, wavelet-LLH, wavelet-HLH, wavelet-HHH, wavelet-HHL and wavelet-LLL) transformations [29].

In the training cohort, before identifying the optimal features, the resampling strategy was adopted to improve the robustness of the selected features; namely, 90% of samples were randomly extracted 20 times, and feature selection was carried out each time to identify the features that were selected more than 5 times for subsequent analysis.

Based on 90% of the training samples extracted each time and all training samples used for the final determination of the optimal features, three methods were used for feature selection. First, a variance threshold method was applied to remove the undifferentiated variables, and features with variance < 0.8 were excluded from further analysis. Then, associations between the radiomics features and the metastatic status of ccRCC were analysed using SelectKBest and the features with nonsignificant difference (p ≥ 0.05) were further removed. Third, the least absolute shrinkage and selection operator (LASSO) regression with 10-fold cross-validation was used to identify the features with the best correlation and reproducibility [30, 31]. The intra- and inter-observer reproducibility was performed by reader 1 (X.B.) and reader 2 (M.F.W., with a 3-year experience), which was described in detail in the Supplementary Material.

The radiomics scores (Rad-scores) were obtained by calculating all the selected features via a linear combination for each lesion. The potential association of the Rad-score with SDM was assessed using a Mann-Whitney U test and verified by stratified analyses.

Radiomics-based nomogram construction

Combined clinicoradiologic characteristics and the Rad-score, a radiomics-based model was established based on multivariable logistic regression, which presented as a nomogram to predict the individual risk of SDM in ccRCC patients. The calibration curve was plotted to access the calibration of the nomogram, accompanied by the Hosmer-Lemeshow test.

Evaluation, verification and comparison of the two models’ performance

The prediction performance of the radiomics-based nomogram and the clinicoradiologic model was assessed by receiver operating characteristic (ROC) curve analysis. In the internal and external validation cohorts, logistic regression was applied to all patients to test the efficiency and reproducibility of the two models. Decision curve analysis (DCA) was implemented to evaluate the net benefits of the models at different threshold probabilities [32]. Moreover, the prediction performance of the nomogram in arbitrary subgroups of different sizes was also validated based on all ccRCC patients in the three cohorts.

Statistical analysis

All statistical analyses were performed with Stata/MP 14.0 (StataCorp) for Windows. The Mann-Whitney U test was used for the analysis of continuous variables, and the chi-square test was used for the analysis of categorical variables. A p value < 0.05 was considered statistically significant. All estimated p values were two-sided. Inter- and intra-class correlation coefficients (ICCs) greater than 0.80 were considered to have a good agreement. The discrimination ability of the models was evaluated and compared by the area under the ROC curve (AUC) (DeLong’s test).

Results

Patients information

In the training cohort, 126 patients with a solitary tumour were included (97 males and 29 females; mean age, 56.4 ± 9.8 years; 42 ccRCC patients with SDM and 84 ccRCC patients without SDM). In the internal validation cohort, 39 consecutive patients (24 males and 15 females; mean age, 54.5 ± 10.3 years; 13 with SDM and 26 without SDM) were included. In the external validation cohort, 36 patients (27 males and 9 females; mean age, 56.4 ± 9.8 years; 12 with SDM and 24 without SDM) were collected anonymously. The confirmation approaches and sites of metastases are shown in Table 2. The entire recruitment process is shown in the shape of a flowchart (Fig. 1). The clinicoradiologic and pathological characteristics of each patient in the training and validation cohorts are presented in Table 3 and Supplementary Table 2.

Table 2 Confirmation approaches and sites of SDM in ccRCC
Table 3 Clinicoradiologic characteristics associated with SDM in the training and validation cohorts

Clinicoradiologic model construction

According to multivariable logistic regression, three clinicoradiologic indicators (age, regional lymph node (RLN) and pseudocapsule) were incorporated into the model (the clinicoradiologic model) (Table 4). The model demonstrated no apparent departure from a perfect fit on the calibration curve (Fig. 2a) and yielded a nonsignificant statistic in the Hosmer-Lemeshow test (p = 0.794).

Table 4 Risk factors in the predictive models for SDM in ccRCC patients
Fig. 2
figure 2

Calibration curves of the two models in the training cohort. a Calibration curve of the clinicoradiologic model and (b) the radiomics-based nomogram. The calibration curves describe the calibration of each model by comparing the SDM prediction risk with the SDM observations. The y-axis represents the observed SDM rate. The x-axis represents the predicted SDM risk. The diagonal represents a perfect prediction of an ideal model. The blue line segment near the x-axis visually shows the distribution of ccRCCs with and without SDM as the predicted probability increases. SDM, synchronous distant metastasis

Radiomics feature extraction and selection

Among the 126 patients in the training cohort, 64 features were identified by resampling and finally reduced to 9 potential predictive parameters according to the LASSO regression (Fig. 3a–d). The skewness, interquartile range (IQR), large area low grey level emphasis (LALGLE) and dependence variance (DV) of T2WI; skewness and DV of T1WI; kurtosis of DWI; and DV and grey level variance (GLV) of EP had satisfactory intra-observer (ICCs, 0.806–0.962) and inter-observer (ICCs, 0.865–0.994) reproducibility (see details in the Supplementary Material).

Fig. 3
figure 3

Radiomics analysis and model construction workflow. a Volumes of interest on the primary lesions of ccRCCs with and without SDM. The yellow regions indicate the renal lesions manually delineated on T2WI, DWI, T1WI and CE-T1WI. b According to a variance threshold of 0.8, sixty-four features were reduced to sixty-two potential predictors. c Fourteen radiomics features were further selected by univariate analysis (p < 0.05). d Nine optimal radiomics features were finally selected according to LASSO algorithm (10-fold cross-validation, alpha [the optimal value of the LASSO tuning parameter] = 1.7). e Distributions of the radiomics scores for each patient. CcRCC patients with SDM (red line) were more likely to obtain higher radiomics scores, while those without SDM (blue line) had the opposite. f Flowchart of radiomics analysis and model construction. CP, corticomedullary phase; NP, nephrographic phase; EP, excretory phase; SDM, synchronous distant metastasis; LASSO, the least absolute shrinkage and selection operator

The Rad-score was obtained by calculating the 9 optimal features via a linear combination for each lesion (Supplementary Table 3). The difference in the Rad-scores between ccRCC patients with and without SDM was significant in the three cohorts (Table 3). Stratified analysis showed that the significant correlation between the Rad-score and SDM was not affected by sex, tumour size, WHO/ISUP grade, MR scanner, etc. (Table 5). The Rad-score distributions are illustrated in Fig. 3e.

Table 5 Stratified analysis of the association between the Rad-score and SDM

Radiomics-based nomogram construction

After logistic regression analysis, age, RLN, pseudocapsule and the Rad-score were included as independent predictors in the radiomics-based model (Table 4), which was presented as a nomogram (Fig. 4). Good calibration was observed for the prediction probability of SDM (Fig. 2b), and the Hosmer-Lemeshow test yielded a nonsignificant statistic (p = 0.811). The complete flowchart of radiomics analysis and model construction is presented in Fig. 3f.

Fig. 4
figure 4

Radiomics-based nomogram for the prediction of SDM in ccRCC patients. The radiomics-based nomogram was established by incorporating the pseudocapsule, radiomics score, the short axis of regional lymph node and age. The sum of the scores on the scoring ruler corresponding to the four predictors is the total score, and the scale values of the upper ruler corresponding to the total score are the risk of SDM. SDM, synchronous distant metastasis

Evaluation, verification and comparison of the two models’ performance

Using ROC curve analysis, the clinicoradiologic model yielded an AUC of 0.836 (95% CI, 0.761 to 0.910) in the training cohort, 0.731 (95% CI, 0.566 to 0.896) in the internal validation cohort and 0.677 (95% CI, 0.473 to 0.881) in the external validation cohort, respectively. The radiomics-based nomogram reached an AUC of 0.914 (95% CI, 0.864 to 0.963) in the training cohort, which was also validated in the internal cohort (AUC, 0.854 [95% CI, 0.736 to 0.971]) and the external cohort (AUC, 0.816 [95% CI, 0.661 to 0.971]) (Fig. 5a–c).

Fig. 5
figure 5

ROC curves of the radiomics-based nomogram and the clinicoradiologic model for the prediction of SDM in the three cohorts and decision curve analysis for the two models. Compared to the clinicoradiologic model, the radiomics-based nomogram obtained higher AUC values in the training cohort (a), the internal validation cohort (b) and the external validation cohort (c), showing higher predictive power. d Decision curve analysis for the two models. The net benefit versus the threshold probability was plotted. The threshold probability is the probability of SDM in this study, with values ranging from 0 to 100. The sensitivity and specificity of the model are calculated at each threshold to determine the net benefit. The green and black lines represent the hypothesis that all ccRCC patients and no ccRCC patients had SDM (which is used for reference to determine subsequent clinical management), respectively. The net benefits of the radiomics-based nomogram (red line) were superior to the benefits of the clinicoradiologic model (blue line) across most ranges of threshold probability. ROC, receiver operating characteristic curve; AUC, area under ROC curve; SE, sensitivity; SP, specificity

The comparison between the two models showed that the radiomics-based nomogram yielded higher AUC than the clinicoradiologic model in the training cohort (0.914 vs. 0.836, p = 0.004), internal validation cohort (0.854 vs. 0.731, p = 0.046) and external validation cohort (0.816 vs. 0.677, p = 0.011). DCA for the radiomics-based nomogram suggested that using the nomogram to predict SDM added more net benefit than the clinicoradiologic model (Fig. 5d).

Based on all ccRCC patients in the three cohorts, the AUCs of the radiomics-based nomogram among all the subgroups of different sizes of ccRCC are shown as follows: size ≤ 4 cm, 0.875 (95% CI, 0.728–1.000); size 4–7 cm, 0.891 (95% CI, 0.822–0.960); size 7–10 cm, 0.908 (95% CI, 0.833–0.983); and size > 10 cm, 0.881 (95% CI, 0.763–0.999) (Fig. 6).

Fig. 6
figure 6

ROC curves of the radiomics-based nomogram for the prediction of SDM in ccRCC subgroups of different sizes. The radiomics-based nomogram yielded good performance in all the subgroups of ccRCCs with primary tumour size ≤ 4 cm (a), size 4–7 cm (b), size 7–10 cm (c) and size > 10 cm (d). ROC, receiver operating characteristic curve; AUC, area under ROC curve; SE, sensitivity; SP, specificity

Discussion

The occurrence of SDM means more active treatment options for ccRCC [9] and poor prognosis for patients with ccRCC [3]. Early and accurate recognition of the status of SDM was vital for the management of patients with ccRCC. In this study, we established a radiomics-based model for predicting SDM in ccRCC patients by incorporating MRI-derived Rad-score, age, pseudocapsule and RLN. This model, visually displayed on nomogram in the form of scale [33], demonstrated impressive predictive efficiency (AUC, 0.816–914) in the three cohorts. Hence, this nomogram could provide evidence for the individualised identification of patients who need further examination on SDM.

Primary tumour size is an independent predictor of SDM in ccRCC patients [10, 11]. Nevertheless, the risk of SDM varies for tumours with the same size, and it is controversial whether the size of primary tumours in small renal masses (SRMs) is significantly correlated with the risk of SDM [34]. Zastrow et al [35] analysed the prognostic value of the primary tumour size based on their RCC database and found that there was no significant correlation between the size and the risk of SDM in T1a RCC. Their model by combining the primary tumour size and clinical indicators shows good predictive power for SDM of RCC (C-index, 0.82), but there is no evidence that this model can be used to predict the SDM of T1a RCC and available for other independent databases. In this study, we matched the dimensions of primary tumours between ccRCC patients with and without SDM, thus ensuring that the variability of candidate predictors was derived from the different metastatic status of ccRCC rather than the different tumour sizes. Compared with the model built by Zastrow et al [35], our radiomics-based nomogram has convincing prediction power for SDM in ccRCC subgroups of different sizes in both the training cohort and the independent validation cohorts including SRMs (AUC, 0.875–0.908), which indicated that intra-tumour imaging features other than tumour size could also be the predictors for SDM.

Besides the tumour size, some clinicoradiologic indicators associated with the malignant behaviour of ccRCC have been reported by previous works of literature, which also were confirmed in our study. Previously, Wei et al [36] demonstrated that the lack of a pseudocapsule had a remarkably adverse impact on the prognosis of ccRCC patients, and Karlo et al [37] showed that well-defined tumour margins were more common among ccRCC patients with a loss of von-Hippel-Lindau function, which indicates low tumour aggressiveness. Also, tumour thrombus and size of RLN were associated with metastasis in RCC [38, 39]. Similarly, our study revealed significant differences in these aspects between ccRCC patients with and without SDM. In this study, age, pseudocapsule and RLN were eventually used as independent predictors to construct the clinicoradiologic model. Compared with the clinicoradiologic model that only relied on clinical and imaging indicators, the final radiomics-based nomogram achieved higher prediction performance for SDM. Based on the training cohort data, DCA analysis revealed that using the radiomics-based nomogram to predict SDM in ccRCC patients offers more significant benefits than merely relying on clinical and imaging information.

Through the LASSO algorithm, a valid tool to reduce dimensionality and remove the redundancy of radiomics features [31], we ultimately selected nine features derived from six original features (skewness, kurtosis, IQR, DV, LALGLE and GLV) based on T2WI, T1WI, DWI and EP as independent features with the best reproducibility and significant relation to metastasis. In previous literature, kurtosis, skewness, IQR and LALGLE have been proven to be prognostic markers for patients with malignant tumours in ccRCC and other types of tumours [40,41,42,43,44]; for example, the kurtosis and skewness of apparent diffusion coefficient (ADC) map intensities and the IQR of contrast-enhanced CT attenuation values can be used to distinguish between low- (I–II) and high-stage (III–IV) ccRCCs [40, 41]. In our study, skewness, IQR, LALGLE of T2WI, skewness of T1WI and kurtosis of DWI could predict the status of SDM, which may suggest that these types of parameters were beneficial in predicting bio-behaviour of malignant tumours. Meanwhile, we discovered that the DV of T2WI and T1WI and the DV and GLV of EP were associated with the metastatic status of ccRCC, which have not been proposed before.

Our study had several limitations. First, as a retrospective study, the selection bias in patients’ enrolment cannot be completely avoided. Second, some SDMs were confirmed by radiological changes instead of histopathological results, although acceptable in clinical practice, which might lead to the omission of some metastases and weaken the substantiality of this study. Third, this MRI radiomics-based nomogram was only developed and validated on the 3.0 T MR scanners, so its generalisation needs to be further explored. Fourth, due to relatively small sample size and uneven distribution of metastatic sites of SDM, this study only involved in predicting whether the SDM exists instead of the more exciting topic—prediction of the exact location of SDM. However, we believe this study can lay a foundation for the application of radiomics methods to predict the site of SDM accurately. Finally, due to the administration of the gadolinium-based contrast agent for contrast-enhanced imaging, this MRI radiomics-based nomogram may be inapplicable to individuals with severe renal insufficiency.

In conclusion, the MRI radiomics-based nomogram established by combining the Rad-score, age, pseudocapsule and RLN can serve as a potential tool to facilitate the individualised prediction of SDM preoperatively and provide guidance for imaging tests and treatment decision-making.