Introduction

International guidelines for the surgical management of urolithiasis are based on the location and size of the stone, which are used to decide on the surgical modality [1]. The stone burden is mostly evaluated by the measurement of the maximum diameter (MD) [2, 3]. However, the real stone burden, i.e., stone volume (SV), could be different from that estimated by the MD due to various geometrical and non-standard shapes [4, 5]. For example, a cubic stone of 8 mm, a paving stone of 20 × 5 × 5 mm and a spherical stone of 10 mm present nearly similar SVs (500–523 mm3) but different MDs. These MDs could lead to different surgical options according to the international guidelines [1, 6]. Several formulas, such as Ackerman’s (0.6 × π × radius2), spherical (4/3 × π × radius3), half-cubic, surface area and cumulative diameter formulas, have been evaluated to estimate the stone burden, but none of them can assess the exact SV [4, 7].

However, several imaging techniques are now available to estimate the exact stone burden. As one of them, “Kidney Stone Calculator” (KSC), has been developed as a free three-dimensional (3D) software for the surgical planning of kidney stone endoscopic procedures [8]. It is an extension of 3DSlicer, a free software platform for medical image computing and 3D visualization [9]. KSC uses segmentation on a DICOM series of nonenhanced computed tomography (NECT) images. Consequently, KSC determines the stone burden using a volume assessment to help plan the procedure, giving an estimation of the lithotripsy duration based on the following parameters: laser fiber diameter, laser source [Holmium:Yttrium–Aluminium–Garnet (Ho:YAG) or Thulium Fiber Laser (Tm-Fiber)], pulse energy (J) and pulse rate (Hz), multiple stone estimation, and 3D-view of the stone. These parameters are based on in vitro data obtained by estimating the ablation volumes of both low power Ho:YAG and TmFiber and implemented in KSC for lithotripsy duration (DL) estimation [10]. After the development of KSC, we first aimed to evaluate its accuracy in estimating the stone burden based on stone volume measurements by segmentation. Then, a clinical evaluation was conducted to endorse its ability to predict DL based on a comparison with an estimated DL.

Patients and methods

Kidney stone calculator

KSC is a python-scripted free open-source add-on extension of 3DSlicer [11]. It measures the SV independently of its shape and the number of stones by segmentation of the NECT-DICOM series and provides an expected duration of lithotripsy (ExDL). A dedicated tutorial is available to realize SV measurement: https://www.youtube.com/watch?v=pZLXHdfJtP0&t=5s.

Stone volume measurement

To evaluate the accuracy of KSC in measuring the SV, we performed a preclinical evaluation consisting of the SV measurement of a panel of 17 NECT in the bone window. This test cohort included ex vivo human stones of various compositions [calcium-oxalate monohydrate (COM), cystine and uric acid (UA)], 1 cm3 cubes of artificial hard and soft begostones (Bego©, Germany) and two clinical cases. This panel presents various shapes, known and unknown volumes, in vitro and in vivo cases, and human and artificial stones.

After a ten-minute formation of KSC, three participants, i.e., a urology resident, a senior endourologist and a research engineer without prior experience in medical image computing or urology, were asked to assess the volumes of each sample of the panel. The evaluation was conducted in full-blind mode.

Lithotripsy duration estimation

To evaluate the accuracy of the KSC in estimating DL, a prospective observational study was conducted in three centres from January 2020 to March 2020. Approval from the Ethics Committee of the French Association of Urology (CERU-AFU) was obtained (reference: CERU_2020/003). An official declaration to the National Commission of Informatics and Freedoms (CNIL) was also made (reference: 2216615V0-MR-004).

The included patients were aged above 18 years, with single or multiple renal calculi, observed on preoperative NECT and planned for flexible ureterorenoscopy (fURS) with endocorporeal lithotripsy (EL) using a low-power Ho:YAG laser generator and 272 µm laser fibers. For DL estimation, a stone maximum density (SMD) over 1000UH was retained as “hard” and under 1000UH as “soft” stones. The exclusion criteria were patients with ureteric stones, the presence of an incrusted double-J stent, standard or mini-percutaneous nephrolithotomy (PCNL), monobloc extraction, the absence of preoperative NECT, and a known history of struvite stones.

Written informed consent was obtained from all patients after explaining the study protocol and providing an information sheet.

fURS with EL was conducted following an observational design. The collected data included demographic characteristics, personal urolithiasis history and actual characteristics of the stones [side, localization, number, MD (mm), SV (mm3), and SMD (UH)]. Peri-operative data were also collected: surgeon’s expertise [junior/senior (< 100 fURS per year) or expert (> 100 fURS per year)], endourology devices used, laser settings (dusting: 0.5–0.8 J/15–30 Hz; fragmentation: 1–1.5 J/10–15 Hz), operative duration, effective DL (EfDL), delivered total energy (J), volumetric energy (J/mm3), ablation rate (mm3/s), and presence of residual fragments. The EfDL was defined as the duration between the initiation of lasering and its termination. In the case of separate lasering times (basketing between two lithotripsy periods), several durations were recorded and added to obtain the overall EfDL. The postoperative data concerned the stone-free rate level defined by the absence of residual fragments or residual fragments lower than 3 mm MD on postoperative NECT or the absence of endoscopically residual stones at the end of the procedure. Complications were consigned according to the Clavien–Dindo classification. The SV was calculated postoperatively based on the preoperative NECT using KSC. Consecutively, the ExDL was defined according to the chosen laser settings. The investigator was not aware of any data other than the laser settings at the moment of SV and ExDL measurement. Clinical and imaging data were anonymized at inclusion.

The primary endpoint of this study was the difference between ExDL and EfDL. The secondary endpoints were five a priori influencing factors of the DL: presence or absence of relocation of a lower-pole stone, absence of possible relocation of a lower-pole stone, 10–12CH or 12–14CH ureteral access sheath diameters, lithotripsy mode(Dusting/Fragmentation), and surgeon’s expertise(> 100/ < 100 fURS per year).

Statistical analysis

The measured volumes were compared using a two-tailed paired Student’s t-test. The correlation strength was studied using Pearson correlation and Kendall concordance coefficients. In the clinical study, qualitative and quantitative data are presented as percentages and medians with interquartile ranges (Q1–Q3), respectively. Primary and secondary endpoints were compared using a two-tailed paired Student’s t test or chi-square test and Bonferroni’s correction. All statistics used Rstudio and GraphPad Prism. p values less than 0.05 were regarded as statistically significant.

Results

Stone volume measurement

Table 1 presents the 3D views of each sample obtained with KSC and the volumes measured by the three practitioners. Various shapes, sizes and compositions were segmented, showing the ability of KSC to provide a 3D reconstruction of renal calculi, independent of the NECT protocol. No significant difference among the operators was noted (Op1–Op2: p = 0.35, Op1–Op3: p = 0.69, Op2–Op3: p = 0.29, Table 2). In the subgroup analysis, there was no difference in the volume assessment for human stones, artificial stones and clinical cases. A mean difference of 0.28% to 1%, including all cases, was noticed. When measuring the known 1 cm3 cubes, a 1% maximum variation occurred. The maximum difference among operators was observed with a clinical case (15%). The Pearson’s correlation coefficient was r = 0.99 among the three operators, and Kendall’s concordance coefficient value was 0.98. Furthermore, there was no systematic over- or underestimation from any operator.

Table 1 Stone volume measurements from the NECT DICOM database
Table 2 Stone volume measurements from the NECT DICOM database: comparison between operators’ volume assessments (mm3)

Clinical evaluation

From January 2020 to March 2020, 26 patients were included in the clinical evaluation of KSC. Table 3 presents the demographic and renal calculi characteristics. Our cohort had a median age and BMI of 55.5 (38–69.5) years, 28.7 (24.4–32.1) kg/m2, respectively. Sixty-six percent of patients presented a single renal stone and were localized in the lower pole calyx in 42% of cases. Eighty-five percent of patients presented a MSD over 1000UH, with a median MD and SV of 10 (8–12.5) mm and 479 (268–4517) mm3, respectively. A total of 42%, 31% and 27% of procedures were conducted by an expert in endourology (> 100 fURS/year) and senior or junior urologists, respectively (Tables 4, 5, 6, 7). The median difference between ExDL and EfDL was 14% (5.4–24.8); p = 0.36. A median volumetric energy of 17.6 (13.6–24.7) J/mm3 and ablation rate of 0.4 (0.324–0.56) mm3/s were noted. The stone-free rate was 81%, considering both endoscopic and postoperative NECT methods. We reported two postoperative complications: one Grade II event of urosepsis that was medically treated and one Grade IV event of cardiac arrest at postoperative day 1 related to rhythmic dysfunction.

Table 3 Demographic and renal calculi characteristics
Table 4 Peri-operative outcomes: Surgeon expertise and intra-operative devices
Table 5 Peri-operative outcomes: Peri-operative data
Table 6 Peri-operative outcomes: peri-operative complications
Table 7 Peri-operative outcomes: influencing factors of the ExDL

Among the five supposed influencing factors, only the absence of relocation for a lower-pole stone significantly increased the difference between ExDL and EfDL (p = 0.008) after Bonferroni’s correction (Table 6).

Discussion

Volume measurement for kidney stone surgical procedures

Kidney Stone Calculator provides a 3D measurement of the SV by a segmentation process [8]. Although volume is instinctively better than the diameter for the stone burden estimation, previous studies have failed to show that the SV can be used as the new standard [12]. Other devices have been developed for this goal, but none of them aimed to predict the lithotripsy duration [2, 13, 14]. Thus, the main interest of 3D evaluation pertains to its surgical application. The surgeon would be able to manage his laser settings and equipment and estimate the procedure’s duration and the possible iterative sessions of fURS. He could also pre- and postoperatively evaluate himself based on the ExDL and the EfDL. Moreover, as an education programme, KSC could be used to evaluate junior urologists who are working to master the learning curve.

The segmentation process inherent to KSC does not calculate a volume based on two-dimensional measurements and a mathematical formula [4, 5, 15]; rather, it refers to the sum of voxels collected by a manual density scale in a region of interest (ROI) [8, 9]. Segmentation has been shown to be effective in quantifying vascular calcifications or muscular mass as predictive factors in carcinologic or noncarcinologic diseases [16,17,18,19]. Consequently, this concept was also used for in vitro evaluation of ablation rates (mm3/s) for low power Ho:YAG and Tm-Fiber devices, which are secondarily implemented in KSC for ExDL calculation [10]. To date, only volumetric data have been generated, satisfying new concepts of lithotripsy efficiency, namely, ablation rates and volumetric energy [20].

Measurement accuracy

The preclinical evaluation revealed that KSC was reproducible, independent of the shape, size or composition of the stone and showed little difference between operators, with a Pearson’s correlation coefficient of 0.99. Moreover, the Kendall concordance coefficient (0.98) confirmed the excellent reproducibility of KSC. We found no difference between operators or cases in the subgroup analysis. A previous study showed similar results (0.97 interobserver-correlation) with experienced radiologists [19]. In our protocol, rapid training was necessary to provide familiarity with the basics of 3DSlicer, but no prior personal experience seems to be required. However, we have to acknowledge some limits. First, the maximum interobserver variation occurred in a clinical case (15%), which was not the complex staghorn case. A further evaluation of high-burden cases would be necessary to confirm our findings. Moreover, we present a comparison of measured volumes, without reference volumes in most cases. An alternative experiment comparing referenced stone volumes to segmented ones could support the validation of KSC, as performed by Wilhelm et al. [15]. Finally, special attention must be paid to the NECT protocol, which has been shown to influence outcomes. Insufficient slice thickness (3–5 mm) and low-dose irradiation protocols may alter the detection or precision of the segmented volume, especially in obese patients, as well as the calibration window [21, 22]. In the present work, the window was manually adjusted, and the NECT panel included various settings, but changes in these settings did not alter the correlation between operators. Further improvements in KSC would include automatic detection using deep learning, as used for kidney segmentation [22].

Clinical evaluation

We reported a 14% median difference between ExDL and EfDL, which increased in the absence of relocation of a lower-pole stone (Tables 4, 5, 6). This study first aimed to estimate the EL duration with a small number of participants. Considering this specific aspect, we interrupted patient enrolment because of the COVID-19 pandemic crisis, which led to the postponement of kidney stone surgical procedures in our country for 3 months [24]. Among the 26 included patients, we found similar demographic characteristics to those in the available data [25, 26]. We also present a complication rate similar to that reported in the available literature [27]. Whereas two-thirds of our population presented with a single stone, multiple calculi were found in 9 patients, possibly increasing the difference in ExDL-EfDL for two reasons: First, with “stop-and-start” laser emission, multiple stone cases and irrigation flow have increased. Second, due to the increasing probability of producing significant residual fragments, basketing is required. These arguments could consequently explain the relatively low volumetric energy (17 J/mm3) and ablation rate (0.4 mm3/s) compared to the available in vitro or clinical data [10, 20, 28]. Ventimiglia et al. reported 19 J/mm3 and 0.7(0.4–0.9) mm3/s ablation outcomes in their cohort, which included 30 patients with a high stone burden [1599 (630–3502) mm3] treated in a urolithiasis expert centre [20]. Consequently, their dusting ability could have been superior to ours in the present study. Regarding the in vitro data, we acknowledge the ideal conditions of lasering: minimizing the burnback effect and optimizing the ablation rate with a single shot or robotic displacement of the laser fiber, with a new fiber tip used for each procedure. The present study concerned in vivo conditions with irrigation flow, stone retropulsion, deterioration of the fiber tip, and various surgeons’ dusting ability. These lasering parameters could not be explored without possible logistic regression analysis because of the small number of patients. We a priori defined potential influencing factors and analysed the p values with Bonferroni’s correction. We acknowledge that other factors could have been selected. The 81% stone-free rate, defined endoscopically and by the absence of 3 mm fragments, also limits our findings. “Dust” has been recently defined as fragments floating under irrigation [29]. Whereas Keller et al. demonstrated in vitro that fragments smaller than 250 µm, except for struvite (125 µm), met this definition; the SFR level is still under discussion [30, 31]. Consequently, we left the SFR assessment to the surgeon’s preference. Moreover, we studied only the Ho:YAG EL during the clinical evaluation using a low-power(LP) laser generator. High-power (HP) Ho:YAG has been suggested to reduce the lasering time, but the data herein concerned only LP-Ho:YAG and Tm-Fiber. Consequently, we cannot draw conclusions on the ability of KSC to predict DL for HP-Ho:YAG devices or Moses’ technology. The present work consisted of a pilot study, and more patients are needed to investigate the viability of KSC application, including Tm-Fiber EL.

Conclusion

Kidney stone calculator is a free, reproducible, safe and accurate software for stone burden and lithotripsy duration estimations. Further studies with a larger population and endocorporeal laser lithotripsy procedures using Tm-Fiber are needed to confirm our preliminary results.