Introduction

The first-line surgical treatment for renal stones has shifted to endourological procedures, such as ureteroscopy (URS) and percutaneous nephrolithotomy (PNL); consequently, extracorporeal shockwave lithotripsy (SWL) has lost its place as a paramount therapeutic modality despite its proven efficacy [1, 2]. However, SWL remains a primary therapy for solitary renal stones sized < 20 mm according to recent guidelines [1, 2].

Several parameters affecting the stone-free rates (SFRs) after SWL have been determined; these include stone size and location [1, 3, 4], SWL-resistant stone composition (calcium oxalate monohydrate, brushite, or cystine) [5, 6], stone attenuation values on computed tomography (CT) [7], skin-to-stone distance (SSD) [8, 9], spatial pelvicalyceal and lower pole anatomy of the kidney [1, 4], patients’ body mass index (BMI) and obesity [10], and shockwave delivery frequency [11]. Although some combined parameters are useful for clinically predicting SWL outcomes [3, 12,13,14,15], no consensus exists as to the best prediction model probably because of the complexity in modelling for clinical practice and/or heterogeneous recommendations in practice guidelines [16, 17]. Inconsistency among guidelines would lead to clinical confusion among urologists in determining treatment modalities for renal stones, especially those 10–20 mm in diameter [16, 17].

Recently, Tran et al. [14] reported a novel and simple nomogram (Triple D scoring system), which constitutes three CT-based parameters [SSD, stone density, and stone volume (SV)] to screen for the most appropriate patients for SWL. Its clinical usefulness has been externally validated in different retrospective studies [18, 19]. These reports described a high area under the curve (AUC) of 0.751–0.845 [14, 18, 19] for the Triple D score (TrD-S) in predicting successful outcomes of SWL therapy for renal stones. However, they included 4–10-mm kidney stones [18, 19]. For ≤ 10-mm kidney stones, SWL generally achieves SFRs of ~ 50–90% [4]. Therefore, the European Association of Urology guideline on urolithiasis recommends SWL as the preferred first-line therapy for all kidney stones smaller than 10 mm, with URS as an alternative for selected cases and PNL reserved for when SWL and URS have failed [1, 4]. In evaluating the clinical relevance of the TrD-S in routine practice to date, no attention has been paid to 10–20-mm renal stones, the sizes of which relate to overlapping indications for SWL and endourological surgery [14, 18, 19].

Herein, we investigated the clinical efficacy of the TrD-S on SFR prediction following SWL for 10–20-mm renal stones and presented a prediction model modified from this score.

Patients and methods

Patient data collection

We retrospectively reviewed the medical archives of 2063 consecutive patients who underwent the first SWL session for upper urinary stones at South Miyagi Medical Center (n = 797, from August 1, 2002 to May 31, 2015), Yamagata City Hospital Saiseikan (n = 951, from August 1, 2008 to March 31, 2016), and Nihonkai General Hospital (n = 315, from January 1, 2009 to January 31, 2016). The inclusion criterion was 10–20-mm renal stones (n = 375). The exclusion criteria were as follows: (1) partial staghorn calculi (n = 9), (2) calyceal diverticular stone (n = 1), (3) horse-shoe kidney (n = 3), (4) ureteral stricture (n = 3), (5) bed-ridden status (n = 2), (6) unavailable CT images before SWL (n = 85), (7) endourology prior to SWL (n = 9), (8) incomplete treatment owing to mechanical disorders of shockwave lithotripter during sessions (n = 4), (9) follow-up loss 3 months after the final SWL (n = 27), and (10) no stone status evaluation 3 months after the final SWL owing to inadequate medical follow-up timing (n = 6). Finally, 226 patients were eligible for the present study.

The study protocol was approved by the Ethical Committees of Yamagata University School of Medicine (No. 535; March 2, 2018), South Miyagi Medical Center (No. 30-6; October 3, 2018), Yamagata City Hospital Saiseikan (No. 430-014, September 14, 2018), and Nihonkai General Hospital (No. 5; September 25, 2018).

Preoperative evaluation of renal stones targeted with SWL

Radiopaque kidney stones were evaluated before SWL using plain abdominal X-ray imaging of the kidney, ureter, and bladder (KUB) and CT. Stone diameters were measured as the maximum longitudinal diameters. SV was calculated using the following formula: SV = π/6 × (anteroposterior × transverse × cranio-caudal diameters) [14, 18, 19]. Stone density was presented in Hounsfield unit (HU), and SSD was calculated as the average distance from the body surface to a targeted stone at 0°, 45°, and 90° on CT [8].

The TrD-S was calculated as the sum of the numbers of components matching the cutoffs of < 150 mm3 for SV, < 600 HU for stone density, and < 12 cm for SSD as described by Tran et al. [14]. We defined the Quadruple D score as the TrD-S combined with the stone location (i.e., distribution). The location was allocated 0 and 1 point if a certain stone was placed at the lower calyces and other sites, respectively. Thus, the Triple and Quadruple D scores could range from 0 (worst) to 3 (best) points [14] and 0 (worst) to 4 (best) points, respectively.

SWL and postoperative stone status evaluation

The lithotripters used were electromagnetic shockwave ones, the Storz Modulith SLX-MX (South Miyagi Medical Center) and Siemens Lithoskop (Yamagata City Hospital Saiseikan and Nihonkai General Hospital). SWL was performed with a gradual ramping up of shockwave energy at a rate of 60, 90, or 120 shocks per minute, according to the therapist’s preference and manufacturers’ instructions. Treatment efficacy was evaluated on KUB X-ray after each SWL session. Postoperative CT and/or intravenous urography were performed when small calcification shadows on KUB X-ray were not definitely determined as stone residuals. Repeated sessions were conducted when a single session was unsuccessful. Stone-free status was defined as complete absence of stone remnants 3 months after the final SWL session.

Statistics

Continuous variables were compared using Student’s t test or Mann–Whitney U-test; their correlations were assessed using Pearson’s correlation analysis. Cross charts between two categories were analyzed using Fisher’s exact test, Chi-square test, or Cochran–Armitage trend test. Variables that may be differential and predictive in the univariate analyses were further investigated in multivariate logistic regression analyses. P-values of < 0.2 in the univariate analysis were set as the threshold for variable entering, and a stepwise regression method was used with the significance level set at 0.05 for exclusion of variables in the multivariate analysis. All p-values were based on two-sided statistical analyses. P-values of < 0.05 were considered statistically significant. All analyses were performed using the R statistical software version 3.4.1 (http://cran.rproject.org/, accessed on July 28, 2017). Two receiver-operating characteristic (ROC) curves were compared using the pROC package version 1.12.1 (https://cran.rproject.org/web/packages/pROC/index.html, accessed on July 10, 2018).

Results

The patient demographics are presented in Table 1. The patients were classified into two groups according to stone status 3 months after the final SWL sessions: stone-free (n = 124) and residual (n = 102) groups. The residual group had significantly older age, larger stones, higher stone density on CT attenuation than the stone-free group. Stone location was significantly different between the groups, with a higher lower-pole stone incidence in the residual group (27.5% vs. 10.5%, p = 0.002, Fisher’s exact test). No differences in the BMI, the number of SWL sessions, total shockwave energy delivered per patient, shockwave frequency, stone composition, and SSD were observed between the groups. There was a moderately positive correlation between the BMI and SSD in the entire cohort (Pearson’s correlation coefficient; r = 0.534, p < 0.001). In the residual group, 19 (18.6%) patients were completely cleared of stone fragments and thus attained the stone-free status. The TrD-S was significantly lower in the residual group than in the stone-free group.

Table 1 Demographics of the study patients 3 months after the final sessions of SWL

Table 2 shows the results of the multivariate logistic regression model for predicting the stone-free status; age, stone location (non-lower vs. lower-pole stones), TrD-S (0, 1, 2, or 3 points), drainage with ureteral stents (yes vs. no), and number of SWL sessions were initially incorporated to the model because they had p-values of < 0.2 (Table 1). Age, TrD-S, and non-lower-pole stones were independent predictors of the stone-free status in the multivariate analysis, yielding a sufficient AUC of 0.736 [95% confidence interval (CI), 0.670–0.803] in the multivariate logistic regression model.

Table 2 Multivariate logistic regression model predicting the stone-free status 3 months after the final SWL sessions

The TrD-Ss of 0, 1, 2, and 3 points showed SFRs of 40.0%, 51.9%, 73.0%, and 100.0%, respectively (Cochran–Armitage test, p = 0.001; Fig. 1a, left). Conversely, the Quadruple D scores of 0, 1, 2, 3, and 4 points showed SFRs of 0.0%, 37.9%, 54.5%, 84.4%, and 100.0%, respectively (Cochran–Armitage test, p < 0.001; Fig. 1a, right). The AUC for the Quadruple D score was significantly higher than that for the TrD-S (AUC, 0.596 vs. 0.651; 95% CI 0.539–0.654 vs. 0.590–0.712; p = 0.01; Fig. 1b).

Fig. 1
figure 1

a SFRs based upon the TrD-S (left) and Quadruple D score (right). The Quadruple D score was defined as the sum of the TrD-S and intrarenal location of a targeted stone (0 points: lower-pole stone or 1 point: non-lower-pole stone). b ROC curves for the TrD-S and Quadruple D score. AUC area under the curve

Discussion

In this study, we demonstrated that the TrD-S, lower pole location, and age were independent predictors of the SFR after SWL for 10–20-mm renal stones. The SFRs significantly improved as the number of positive components consisting of the Triple and Quadruple D scores increased. These findings support the successful validation of the TrD-S for use in Japanese patients with 10–20-mm renal stones treated with SWL; these also indicate that the Quadruple D score may be more relevant than the TrD-S in clinical decision-making of SWL for medium-sized renal stones.

The ROC curve analysis revealed a low AUC (0.596) of the TrD-S for SFR prediction. This was because the SSD, a component of the TrD-S, was not a significant factor for discriminating stone-free or residual outcomes after SWL. Moderately correlated with the SSD (r = 0.534), the BMI was not different between the groups in the present study. The SSD and BMI, which are clinical indicators of obesity, have been reported as significant predictors of SWL outcomes in univariate analyses [13, 20]. In multivariate analyses, either the SSD or BMI is often excluded from the final models for outcome prediction [8, 13, 20, 21] probably owing to the correlation between them [22]. However, neither the SSD nor BMI was related to the SWL outcomes in the present study. It may be partially because most patients were not obese (BMI, 24.6 ± 3.8 kg/m2), reflecting racial backgrounds discrete from those in previous studies [8, 13, 20, 21]. Generally, the BMI varies among races, and the prevalence of obesity, defined by the World Health Organization as a BMI of ≥ 30 kg/m2, is no more than 2–3% in the Japanese population, in contrast to the 10–20% in Europe and the USA [23]. Based upon the increased incidence of obesity-related morbidities, obesity is specified as a BMI of ≥ 25 kg/m2 in Japan, where the prevalence and degree of obesity remain mild [23]. Moreover, it may be because of the sampling bias resulting from the study design in which the patients with renal stones had similar anthropometric characteristics. In the present study, we investigated patients with 10–20-mm renal stones.

A lower pole location was a significant factor relating to poor SFRs after SWL, consistent with previous reports [1, 2, 4]. A steep infundibular-pelvic angle, long lower-pole calyx (> 10 mm), and narrow infundibulum (< 5 mm) are depicted as unfavorable factors for SWL [1]; however, we did not incorporate these specific conditions in the Quadruple D score, which is the sum of the lower pole location (distribution) and TrD-S, in pursuit of sufficient ease of use in clinical practice. Despite such a simplification, the Quadruple D score significantly improved SFR prediction after SWL for renal stones compared with the TrD-S. Ozgor et al. [19] revealed that the stone location and TrD-S were independent factors affecting SWL success in their multivariate analysis. Larger stone burdens located in lower pole calyces, increasing SSD, and unfavorable lower pole anatomy all decrease the success rates of SWL and URS but have limited influence on PNL outcomes [4]. Thus, for 10–20-mm renal calculi, stone and anatomical factors must be carefully considered when weighing the relative outcomes and invasiveness of each procedure [4].

For renal stones, age is reported as an independent predictor of SWL outcomes in multivariate analyses [13, 24], which is consistent with the present result. In a prospective study [20], the effects of age on SWL outcomes for kidney stones reached a significant marginal level (p = 0.06) in the univariate analysis; it was not confirmed as a significant predictor in the multivariate analysis. Thus, considering that the relationship between age and SWL outcomes remains controversial, age was not considered as an additional component to the TrD-S in the present study. We previously reported that age had no significant effects on the SFR after SWL for ureteral stones [25], which is consistent with other reports [13, 26, 27]. Renal stones planned for surgical treatment are usually larger than ureteral stones [1, 2, 4]. As a general principle, the efficacy of SWL decreases, while the need for ancillary procedures and re-treatment increases as the stone burden enlarges [4]. Interestingly, Ikegaya et al. [28] demonstrated that renal stones were more difficult to be disintegrated with SWL in older patients than in younger patients. The probability of renal hematoma after SWL for kidney stones increased significantly with age, indicating the dose limitation of shockwaves in older patients [29]. Taken together, age might have negative impacts on the SFR (renal stones) owing to resistance to fragmentation rather than stone clearance, unless the kidneys have unfavorable anatomical factors for SWL, such as lower pole configuration.

Many researchers have reported nomograms predicting successful outcomes after SWL for upper urinary stones [3, 12,13,14,15, 24]. Although some nomograms present with excellent outcome prediction accuracy, they are often too complex to calculate in the clinical setting, e.g., because of exponential functions [12, 13, 24]. Based on stone length, location, and number, Kanao et al. [3] reported a simple prediction nomogram of the SFR after a single SWL session; the SFRs were ~ 56.8% (11–15 mm) and 35.1% (16–20 mm) for calyceal stones and 64.4% (11–15 mm) and 42.7% (16–20 mm) for renal pelvic stones. However, this nomogram [3] does not include CT attenuation and the SSD already proven to affect SWL outcomes [1, 2]. Recently, Kim et al. [15] constructed nomograms to predict the SFR after SWL, which are characterized by manual scoring of four or six clinical variables on graphical charts in a CT-independent or -dependent manner. Besides the four variables, sex; stone location, number, and maximal diameter; hydronephrosis grade; and stone CT attenuation are included in the CT-dependent nomogram. Their nomograms and the TrD-S seem to be very practical and easy to use in clinical practice and remain to be externally validated.

There are limitations in the present study. The lower-pole stone morphology and hydronephrosis grade were not assessed [15]. Other limitations include the retrospective design of the study, relatively small number of patients, and lack of a validation dataset. It is unclear whether the Quadruple D score could be extrapolated to ureteral stones. Further studies are needed to confirm the validity of the present findings.

In conclusion, the TrD-S was successfully validated for use in Japanese patients treated with SWL for 10–20-mm renal stones, showing a parallel increase in the SFR with the number of positive components consisting of the TrD-S. Simple addition of the stone location (lower-pole or non-lower-pole stones) to the TrD-S could reinforce SFR prediction after SWL, without losing its simplicity and ease of use for urologists.