Abstract
Purpose
This study aims to investigate the predictive value of CT-based radiomics in determining the success of extracorporeal shock wave lithotripsy (SWL) treatment for ureteral stones larger than 10mm in adult patients.
Materials and Methods
A total of 301 eligible patients (165/136 successful/unsuccessful) who underwent SWL were retrospectively evaluated and divided into a training cohort (n = 241) and a test cohort (n = 60) following an 8:2 ratio. Univariate analysis was performed to assess clinical characteristics for constructing a nomogram. Radiomics and conventional radiological characteristics of stones were evaluated. Following feature selection, radiomics and radiological models were constructed using logistic regression (LR), support vector machine (SVM), random forest (RF), K nearest neighbor (KNN), and XGBoost. The models’ performance was compared using metrics such as the area under the receiver operating characteristic curve (AUC), precision, recall, accuracy, and F1 score. Finally, a nomogram was created incorporating the best image model signature and clinical predictors.
Results
The SVM-based radiomics model showed superior predictive performance in both training and test cohorts (AUC: 0.956, 0.891, respectively). The nomogram, which combined SVM-based radiomics signature with proximal ureter diameter (PUD), demonstrated further improved predictive performance in the test cohort (AUC: 0.891 vs. 0.939, P = 0.166).
Conclusions
Integration of CT-derived radiomics and PUD showed excellent ability to predict SWL treatment success in patients with ureteral stones larger than 10mm, providing a promising approach for clinical decision-making.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Urolithiasis is a common urological condition with an increasing incidence and prevalence worldwide [1, 2]. Although current European Association of Urology guidelines recommend URS as the first-line therapy for ureteral stones larger than 10 mm, SWL can also be used in clinical practice. Hence, accurately estimating the probability of successful SWL treatment for large stones would maximize patient benefits [3].
Previous studies investigated various factors from a standard non-contrast CT scan, which may influence the success of SWL treatment of ureteral stones, including stone size, density, texture, etc. [4,5,6,7,8]. The usefulness of CT texture analysis in assessing internal structural heterogeneity and its potential role in enhancing the accuracy of SWL success prediction remains contentious. Overall, AUCs of prediction models based on these factors remained unsatisfactory [9,10,11].
Recent research indicates that morphology can differentiate between COM and calcium oxalate dihydrate (COD) stones [12]. Radiomics extracts quantitative imaging features from radiological images, creating a high-dimensional dataset that enhances diagnostic, prognostic, and predictive accuracy in clinical decision support [13, 14]. Therefore, it is plausible to hypothesize that radiomics features can predict SWL success more effectively. Within the field of urolithiasis, radiomics has been used to identify stone composition [15] and predict stone-free rate of flexible ureteroscopy [16]. However, no published reports have investigated whether radiomics signature may enhance the accuracy of predicting SWL treatment outcomes for ureter stones.
The study aimed to investigate the predictive value of radiomics using non-contrast CT images for SWL treatment success in patients with ureteral stones over 10 mm. We also compared radiomics’ prediction performance with conventional radiological features and developed a combined model incorporating independent clinical predictors and superior imaging signatures.
Patients and methods
Patients
A retrospective study analyzed patients with ureteral stones treated with SWL at our hospital from January 2018 to January 2023. SWL were performed using the Dornier Compact Delta II lithotripter (Dornier MedTech, Wessling, Germany). Stone-free status was assessed two weeks after each SWL using kidney, ureter, and bladder film and ultrasound. Retreatment was carried out for inadequate fragmentation and incomplete clearance of the stone. If the stone remained intact after a maximum of three sessions, the case was considered a failed treatment. Treatment success was defined as achieving complete stone absence.
The inclusion criteria for this study were as follows: (a) availability of pre-treatment non-contrast CT images, (b) complete follow-up during or after treatment, and (c) the maximum diameter of stone larger than 1 cm. The exclusion criteria were as follows: (a) presence of multiple ureteral stones on the ipsilateral side, (b) previous open ureteral surgery or a history of ureteral stricture, and (c) any genitourinary tract anomaly.
A total of 301 patients were included in this study based on predefined inclusion and exclusion criteria. Subsequently, these patients were randomly assigned to a training cohort and a test cohort at a ratio of 8:2.
Baseline clinical data of the patients, including age, gender, body mass index (BMI), presence of diabetes mellitus (DM) or hypertension (HTN), stone laterality, location, hydronephrosis grade (HG) [17], proximal ureter diameter (PUD), skin-to-stone distance (SSD) were obtained from the medical records.
Radiological analysis
Two experienced radiologists reviewed and evaluated pretreatment CT images of stone on our hospital’s picture archiving and communication systems.
Radiographic parameters of stone included maximum craniocaudal diameter (MCD), maximum transverse diameter (MATD), minimum transverse diameter (MITD), mean stone density (MSD), standard deviation of stone density (SDSD) and variation coefficient of stone density [(VCSD, (SDSD)/(MSD)*100(%)] [18].
Radiomics feature extraction and selection
The workflow for radiomics analysis is shown in Supplementary Fig. 1. Image segmentation was performed using ITK-SNAP software (version 3.8.0, http://www.itksnap.org/). The volume of interest (VOI) of the entire stone was semi-automatically segmented.
Radiomics feature extraction was performed using PyRadiomics package, an open source tool (https://pyradiomics.readthedocs.io; version 3.0.1), implemented in Python. Then, radiomics features were extracted from each patient’s VOI, which included geometry, intensity, and texture features. Here, texture features were calculated using various techniques, including gray level cooccurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM), and neighborhood gray tone difference matrix (NGTDM) methods.
To ensure comparability and eliminate any potential bias, all extracted feature data were normalized by z-score analysis. The radiomic feature selection and dimension reduction process in the training cohort involved the following steps: First, a univariate analysis was performed to identify statistically significant features. Then, Spearman’s rank correlation coefficient was applied to reject features with high correlations, and only one of the two was retained with correlation coefficients greater than 0.8. Finally, the least absolute shrinkage and selection operator (LASSO) regression adjusted the penalty parameters through 10-fold cross validation with minimum criteria, and then applied to select the most reliable predictive features with non-zero coefficients for signature construction.
Development of different models and performance evaluation
Following Lasso analysis, we employed five machine learning (ML) algorithms to train models with retained radiomics features to identify the most effective classifier, including logistic regression (LR), support vector machine (SVM), random forest (RF), K nearest neighbor (KNN), and XGBoost.
The construction process of radiological signature was similar to that of radiomics signature. First, features used to construct radiological signatures were selected based on univariate analysis of radiological characteristics, using a significant p-value < 0.05. Similar to the radiomics signature, the same ML model was used in the building process of the radiomic signature.
The prediction performance of radiomics and radiological signature models was assessed by the area under the receiver operating characteristic curve (AUC), accuracy and precision. The DeLong test was used to compare the AUCs of two models. In addition, the prediction models developed using the training sets were validated with independent test sets.
The identified superior imaging signature, along with clinical candidate variables, were used to develop a nomogram. The predictive efficacy of the nomogram was tested in both the training and test cohort using ROC curves. Calibration curves were generated to evaluate the calibration of the nomogram, accompanied by the Hosmer-Lemeshow goodness-of-fit test. Decision curve analysis was conducted to assess the clinical usefulness of the nomogram by calculating the net benefit at various threshold probabilities.
Statistical analysis
We used R software (version 4.1.2; https://www.r-project.org) and Python software (version 3.9.7; http://www.python.org) for statistical analysis. Continuous variables were analyzed using either the one-way ANOVA test or Mann-Whitney U test, while categorical variables were compared using the chi-squared test or Fisher exact test. Statistical significance was determined by a two-sided P value of less than 0.05.
Results
Clinical and radiological characteristics
Of the 301 patients enrolled, 165 were identified as successful in SWL treatment, and 136 were unsuccessful. The training cohort included 132 successful and 109 unsuccessful patients, while the test cohort included 33 successful and 27 unsuccessful patients.
Clinical and radiological characteristics of the training and test cohorts are summarized in Table 1. There were no significant differences in clinical and radiological characteristics between the training and test cohorts. Univariate analysis showed that only PUD differed significantly between the successful and unsuccessful groups in both training and test cohorts (Table 1).
Radiological signature building and performance evaluation
Univariate analysis identified four significant radiological predictors: MCD, MATD, MITD, and MSD. Therefore, the radiological signature was constructed using these four predictors.
Table 2 shows the prediction performance of radiological signature models developed using five ML algorithms in training and test cohorts.
Radiomics signature building and performance evaluation
A total of 1834 radiomics features were extracted from each patient’s CT images. Supplementary Fig. 2 displays LASSO analysis identifying 25 features from the training cohort. Table 2 shows the prediction performance of radiomics signature models constructed using different ML algorithms, both in training and test cohorts.
Comparison of prediction performance between radiomics and radiological signature models
The Delong test demonstrated that radiomics signature models, based on LR and SVM, showed superior predictive performance compared to radiological signature models using the same ML algorithms, in both the training cohort and the test cohort (Table 2).
In addition, there were no significant differences between training and test cohorts within LR- and SVM-based radiomics models, as well as LR- and SVM-based radiological models. This indicates a high generalization capability in these models.
Although the performance in the test cohort was comparable between LR- and SVM-based radiomics models (P = 0.520), we considered SVM-based radiomics to be superior as it yielded better results than LR-based model in the training cohort (P < 0.001). Detailed performance results of SVM-based radiomics are shown in Supplementary Fig. 3.
Combined model construction and validation
A radiomics nomogram model was developed by combining clinical predictor (PUD) and radiomics signature. The nomogram, shown in Fig. 1a, did not improve prediction ability in the training cohort (AUC: 0.955). However, it showed improved prediction performance in the test cohort compared to using only SVM-based radiomics signature, but the differences were not statistically significant (AUC: 0.939, P = 0.166) (Fig. 1b).
Calibration plots showed good calibration of the nomogram in both training and test cohorts (Fig. 1c). The Hosmer-Lemeshow test yielded nonsignificant statistics in both the training (P = 0.825) and test cohort (P = 0321), indicating that there was no deviation from the perfect fit. DCA of the nomogram in the test cohort is shown in Fig. 1d.
Discussion
This study found that radiomics signature derived from non-contrast CT scans outperformed conventional radiological models in predicting the success of SWL treatment for ureteral stones larger than 10 mm, especially when using an SVM-based ML classifier. In addition, incorporating SVM-based radiomics signature and PUD into the nomogram enhanced prediction performance in the test cohort compared to radiomics signature alone, providing important information for proper treatment selection.
In vitro studies suggest that stone heterogeneity in CT scans can indicate susceptibility to SWL for cystine and calcium oxalate stones [8, 19], but not for brushite stones [20]. However, naked-eye observation of stone heterogeneity may not be a reliable predictor of SWL outcomes. Previous studies have used various methods to describe stone heterogeneity [18, 21, 22], but SDSD and VCSD did not show a significant correlation with SWL success in our study, consistent with previous findings [22]. While some studies have suggested that CT texture analysis may offer novel parameters to predict SWL response, its effectiveness remains controversial [9, 11, 22, 23].
Some studies [9, 22] found that CT texture parameters did not improve SWL outcome prediction, with an AUC ranging from 0.64 to 0.7.
The morphology of urinary stones can predict their fragility, as their visual characteristics are closely related to their chemical composition [12]. An automated approach to quantify stone surface morphology, distinguishing between COM and COD stones on CT images, demonstrated excellent discrimination ability with AUC values of 0.93 and 0.90, respectively [12]. This shows potential for clinical use as COD stones are highly responsive to SWL, while COM stones are more resistant. Additionally, conventional CT technology, such as HU values, cannot reliable distinguish COM and COD stones due to their comparable chemical compositions [8]. Radiomics provide a more comprehensive assessment of stone characteristics, including shape features derived from morphology and texture features based on higher-order gray scale matrices [13]. As expected, in our study, the proposed radiomics signature consisting of 25 robust features was identified as an independent factor for predicting SWL success in patients with ureteral stones larger than 10 mm and demonstrated excellent predictive performance in both training and test cohorts.
Based on five different ML models, radiomics signature models outperformed traditional radiological models in the test cohort. Nonlinear SVM showed more stable predictive efficiency and better generalization capability than the other four models, possibly due to our sample size being insufficient for complex models like RF, our training data being nonlinear or linearly inseparable, and SVM achieving comparable performance with a smaller training set. Despite few studies using ML algorithms to predict SWL outcomes, a decision tree model [5] yielded an excellent AUC of 0.95 for predicting SWL outcomes using various clinical and stone characteristics. However, it was not validated in an independent set. Meanwhile, the RF algorithm was used to predict successful SWL, but the limited sample size of 51 patients may affect the reliability of the results [24].
Our study had several limitations. First, the proposed nomogram was developed using data from a single center. To validate the reliability and generalizability of our prediction model, prospective multicenter studies with large datasets are required. Second, other factors that may affect SWL’s success were not included, such as the lithotripter effectiveness, operator skills [25], etc. Third, subsequent treatments were not followed-up and evaluated after SWL failure.
In conclusion, this study indicated that the CT-based radiomics model showed superior predictive ability in predicting the success of SWL treatment in patients with ureteral stones larger than 10 mm than traditional radiological models, especially when using an SVM-based classifier. Therefore, incorporating radiomics and PUD may provide a reliable prognosis to distinguish patients with a high probability of successful treatment outcomes from those with a low probability before making an SWL decision. Based on the nomogram we developed, SWL is suitable for eligible patients with ureteral stones larger than 10 mm.
References
Lang J, Narendrula A, El-Zawahry A et al (2022) Global trends in Incidence and Burden of Urolithiasis from 1990 to 2019: an analysis of global burden of Disease Study Data. Eur Urol Open Sci 35:37–46
Na L, Li J, Pan C et al (2023) Development and validation of a predictive model for major complications after extracorporeal shockwave lithotripsy in patients with ureteral stones: based on a large prospective cohort. Urolithiasis 51:42
Skolarikos A, Neisius HJA, Petřík A, Tailly BST (2023) Gambaro G EAU Guidelines on Urolithiasis
Wiesenthal JD, Ghiculete D, Ray AA et al (2011) A clinical nomogram to predict the successful shock wave lithotripsy of renal and ureteral calculi. J Urol 186:556–562
Choo MS, Uhmn S, Kim JK et al (2018) A prediction model using machine learning algorithm for assessing Stone-Free Status after single Session Shock Wave lithotripsy to treat Ureteral stones. J Urol 200:1371–1377
Guler Y, Erbin A, Kafkasli A et al (2021) Factors affecting success in the treatment of proximal ureteral stones larger than 1 cm with extracorporeal shockwave lithotripsy in adult patients. Urolithiasis 49:51–56
Ozgor F, Tosun M, Kayali Y et al (2017) External validation and evaluation of reliability and validity of the Triple D score to Predict Stone-Free Status after extracorporeal shockwave lithotripsy. J Endourol 31:169–173
Zarse CA, Hameed TA, Jackson ME et al (2007) CT visible internal stone structure, but not Hounsfield unit value, of calcium oxalate monohydrate (COM) calculi predicts lithotripsy fragility in vitro. Urol Res 35:201–206
Cui HW, Silva MD, Mills AW et al (2019) Predicting shockwave lithotripsy outcome for urolithiasis using clinical and stone computed tomography texture analysis variables. Sci Rep 9:14674
Mannil M, von Spiczak J, Hermanns T et al (2018) Prediction of successful shock wave lithotripsy with CT: a phantom study using texture analysis. Abdom Radiol (NY) 43:1432–1438
Xun Y, Li J, Geng Y et al (2018) Single extracorporeal shock-wave lithotripsy for proximal ureter stones: can CT texture analysis technique help predict the therapeutic effect? Eur J Radiol 107:84–89
Duan X, Qu M, Wang J et al (2013) Differentiation of calcium oxalate monohydrate and calcium oxalate dihydrate stones using quantitative morphological information from micro-computerized and clinical computerized tomography. J Urol 189:2350–2356
Lambin P, Leijenaar RTH, Deist TM et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749–762
Tang L, Li W, Zeng X et al (2021) Value of artificial intelligence model based on unenhanced computed tomography of urinary tract for preoperative prediction of calcium oxalate monohydrate stones in vivo. Ann Transl Med 9:1129
Zheng J, Yu H, Batur J et al (2021) A multicenter study to develop a non-invasive radiomic model to identify urinary infection stone in vivo using machine-learning. Kidney Int 100:870–880
Xun Y, Chen M, Liang P et al (2020) A Novel Clinical-Radiomics Model Pre-operatively predicted the Stone-Free rate of flexible Ureteroscopy strategy in kidney stone patients. Front Med (Lausanne) 7:576925
Onen A (2020) Grading of hydronephrosis: an Ongoing Challenge. Front Pead 8
Yamashita S, Kohjimoto Y, Iguchi T et al (2017) Variation coefficient of Stone Density: a novel predictor of the outcome of extracorporeal shockwave lithotripsy. J Endourol 31:384–390
Kim SC, Burns EK, Lingeman JE et al (2007) Cystine Calculi: correlation of CT-visible structure, CT number, and stone morphology with fragmentation by shock wave lithotripsy. Urol Res 35:319–324
Williams JC Jr., Hameed T, Jackson ME et al (2012) Fragility of brushite stones in shock wave lithotripsy: absence of correlation with computerized tomography visible structure. J Urol 188:996–1001
Tailly T, Larish Y, Nadeau B et al (2016) Combining Mean and Standard Deviation of Hounsfield Unit Measurements from preoperative CT allows more accurate prediction of urinary Stone Composition Than Mean Hounsfield units alone. J Endourol 30:453–459
Langenauer J, Betschart P, Hechelhammer L et al (2018) Advanced non-contrasted computed tomography post-processing by CT-Calculometry (CT-CM) outperforms established predictors for the outcome of shock wave lithotripsy. World J Urol 36:2073–2080
Ferrero A, Montoya JC, Vaughan LE et al (2016) Quantitative prediction of Stone Fragility from Routine Dual Energy CT: ex vivo proof of feasibility. Acad Radiol 23:1545–1552
Mannil M, von Spiczak J, Hermanns T et al (2018) Three-dimensional texture analysis with machine learning provides incremental predictive information for successful shock Wave lithotripsy in patients with kidney stones. J Urol 200:829–836
Rassweiler JJ, Knoll T, Kohrmann KU et al (2011) Shock wave technology and application: an update. Eur Urol 59:784–796
Author information
Authors and Affiliations
Contributions
Tang Lijun: Protocol/project development. Liu Bo: Data collection or management. Xu Huixin: Data analysis. Xu Huixin: Manuscript writing/editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Research involving human participants
Study was done in concordance with declaration of Helsinki, 1964. Study proceeded according to approval of local ethical committee.
Informed consent
Study was not done except after well-informed consent obtained from all patients. Patient had the choice to refuse or participate in the study and offered alternative suitable modality of therapy in case of refusal.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xu, H., Liu, B. & Tang, L. CT-based radiomics for predicting success of shock wave lithotripsy in ureteral stones larger than 1 cm. World J Urol 42, 397 (2024). https://doi.org/10.1007/s00345-024-05111-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00345-024-05111-0