Introduction

SWL, RIRS, and PCNL are the standard options for active treatment of nephrolithiasis. According to the European Association of Urology guidelines, all three modalities can be used selectively depending on stone size and location [1].

Successful active stone treatment is mainly defined either by the complete absence of residual stones or by the presence of clinically insignificant residual fragments. SFRs and SCRs quantify the above states, respectively. At the same time, the treatment should be characterized by minimal hospitalization and complication rates (CRs). By definition, SWL offers an outpatient, noninvasive, treatment with the drawback of limited SFRs, often requiring further treatment [2]. A recent meta-analysis by De et al. [3] corroborates the high SFRs but also the higher CRs and longer hospital stay reported for PCNL. Finally, RIRS has been also greatly evolving, being utilized in the management of even larger or multiple renal stones throughout the entire pelvicalyceal system [4, 5]. The main drawbacks of retrograde access include a potential violation of the ureter [6], as well as the limited visualization [7].

There has been a great scientific effort to elucidate the SFRs and SCRs of the different techniques and implement the correct indications for treating different types of stones according to stone size and localization. Undeniably, NCCT is the most sensitive diagnostic tool, for detecting residual fragments as it offers higher sensitivity regardless of stone size, compared to plain kidney–ureter–bladder X-ray and ultrasonography [814]. Additionally, NCCT has a higher capability of predicting occurrence of stone-related events [15]. Different NCCT modalities include regular-dose NCCT and low-dose NCCT, which reduces the radiation risk by offering a sensitivity of 96.6 % and specificity of 94.9 % [16, 17]. At the same time, accuracy of stone detection increases by decreasing the NCCT slice thickness, which nowadays can be as low as 0.625 mm [18, 19]. Yet, most of the studies have not utilized NCCT during in their postoperative follow-up. The aim of this review is to compare the SFRs and SCRs of the three modalities according to NCCT findings.

Materials and methods

Literature search and article selection (fig)

A systematic literature review was performed up to November 2015 using PubMed in order to identify relevant original studies (“Appendix 1, 2”). Different keywords included “extracorporeal shock wave lithotripsy,” “retrograde intrarenal surgery,” “RIRS,” “flexible ureteroscopy,” “percutaneous nephrolithotomy,” “mini PCNL,” “micro PCNL.” Six hundred and twenty six records were identified through database searching. After duplicates were removed, 465 records remained. After record screening, 136 records, which were not written in English or were in vitro and pediatric studies, were excluded. From the 330 full-text articles assessed, complex lithiasis studies, papers not reporting SCRs and SFRs and studies that did not use NCCT in their follow-up, or did not define the method used, were also excluded. Cited references from the selected articles retrieved in the search were also assessed for significant papers. Two independent reviewers (TT, MH) performed the literature search, and twenty-three studies were judged as eligible by their consensus. The quality of the studies was graded according to the Grading of Recommendations, Assessment, Development, and Evaluation system [20]. Levels of evidence (LEs) and grade of guideline recommendations were rated in accordance with the European Association of Urology guidelines [1].

Data extraction

Two independent reviewers (TT, MH) extracted the data of the selected studies. More specifically, six SWL studies (Table 1), five RIRS studies (Table 2), eight PCNL studies (Table 3), and four comparative studies (Tables 13) were reviewed. The data reviewed from the different studies included the number of procedures, number of stones (mean, range), the stone size and location (mean, range), the duration of the procedures (mean, range), the slice thickness of CT images (collimation), the CT quality parameters (in mA and kV), the timing of post-treatment CT performance, and the stone size thresholds used to name residuals as clinically insignificant. The CRs, duration of hospitalization, and need for ancillary procedures were also recorded. The primary endpoint of the review is the SFRs and SCRs of the different procedures (Fig. 1).

Table 1 SWL studies
Table 2 RIRS Studies
Table 3 PCNL studies
Fig. 1
figure 1

Flow diagram indicating the records identified through database searching, the records excluded and the total studies included in the review. SWL extracorporeal shock-wave lithotripsy, RIRS retrograde intrarenal surgery, PCNL percutaneous nephrolitholapaxy

Outcomes

SWL outcomes (Table 1)

A total of 10 studies with 933 cases were identified [2, 2129]. Six of them were descriptive non-comparative studies (LE 4) [2126], one was a cohort study (LE 3) [2], and three were randomized controlled trials (LE 2) [2729]. The range of maximal stone diameter was 5–25 mm. The NCCTs were mainly performed at 3 months [2, 21, 22, 2429]. The SFRs reported were 35–61.3 % [21, 23, 24, 2729] and the SCRs with residual stone size ≤4 mm 43.2–92.9 % [2, 2226, 28]. The rates for further treatment need ranged 16–67.7 % [2, 21, 23, 2729].

RIRS outcomes (Table 2)

Overall, a total of seven RIRS studies with 563 cases were selected as eligible [2, 28, 3034]. Five of them were descriptive non-comparative studies (LE 4) [3034], one was a cohort study (LE 3) [2], and one was a randomized controlled trial (LE 2) [28]. The stone size range was 5–50 mm. The NCCTs were mainly performed 1–3 months postoperatively [2, 28, 3032, 34], except in the study by Takazawa in which they were performed earlier than 2 weeks after the procedures [33]. The SFRs reported were 34.8–59.7 % [28, 30, 31, 34]. In particular, for stones with diameter greater than 2 cm, the SFRs were 59.7 % [34]. The SCRs for residual stone size ≤4 mm were 48–96.7 % [2, 28, 3033]. The need for further active treatment was presented in four studies and was 3.7–35 % [2, 28, 30, 33].

PCNL outcomes (Table 3)

Eight PCNL single-arm (LE: 4) studies [9, 11, 12, 14, 3538], one cohort study (LE 3) [2], and two randomized controlled trials (LE 2) with a total of 998 cases were selected in this review. The stone size could not be clearly assessed. In all studies, procedures were performed with standard PCNL instruments. Interestingly, most NCCTs were performed mainly either direct postoperatively [35] or during the first two postoperative weeks [9, 12, 14, 3638]. Nevertheless, the three comparative [2, 27, 29] and two single-arm studies [11, 37] reported outcomes after 1–12 months. The SFRs reported are 20.8–100 % [9, 11, 12, 14, 27, 29, 36, 37]. The SCRs for residual stone size ≤4 mm were 41.5–91.4 % [2, 9, 11, 14, 35, 37, 38]. The necessity for ancillary procedures reported was 0.00–30.18 % [2, 9, 11, 12, 14, 27, 29, 37].

Discussion

Selecting the optimal renal stone treatment can be challenging, as each technique is characterized by unique benefits and drawbacks. Furthermore, the absence of high-quality comparative trials or reviews and meta-analyses comparing the three modalities is more than apparent. Only recently, a meta-analysis by De et al. [3] compared RIRS with PCNL and minimally invasive PCNL (MIP). The authors concluded that PCNL offers higher SFRs than MIP and RIRS. They also purported that RIRS provides better SFRs than MIP and should be the standard of care for renal stones with diameter <20 mm. However, most of the studies included did not utilize NCCT to detect stone residuals. We tried to assess the SFRs and SCRs of all treatment modalities according to postoperative CT follow-up.

By examining the single-arm studies, the outcomes presented are less favorable than the ones reported by using other means to estimate residual fragments [3945]. Interestingly, by examining the results of the four comparative studies, the results are significantly different in favor of PCNL. More specifically, the SFRs reported were 17–61.3 % for SWL, 50 % for RIRS, and 95–100 % for PCNL [2, 2729]. Pearle et al. [28] reported SFRs of 35 % and 50 % for SWL and RIRS, respectively (p = 0.92). The SFRs presented by Yuruk and associates [29] were 96.7 and 32.2 % for PCNL and SWL (p < 0.001). Deem et al. [27] presented SFRs at 3 months 85 and 33 % (p = 0.006) for PCNL and SWL, respectively. Finally, Resorlu et al. [2] presented SCRs at 1–2 months for stone residual diameter <4 mm of 91.4, 87.0 and 66.5 % for PCNL, RIRS and SWL, respectively (p < 0.001). These outcomes remain less favorable than the already reported for SWL [3941] and RIRS [4245], but are comparable for PCNL [4245].

Albeit our efforts to reduce heterogeneity of studies dealing with active stone treatment, we should be cognizant that a relative deficiency and heterogeneity of data still exists. First, the quality of the different studies varies, with most of them being descriptive, single-arm, retrospective cohorts (LE 4) and only three of them being randomized prospective trials [2729]. Moreover, the sample size difference between the three different treatment modalities is undeniable as nine SWL studies with 863 cases, seven SIRS studies with 563 cases, and 11 PCNL studies with 998 patients were included.

The stone characteristics are not clearly defined in all studies. Regarding stone size, some cohorts use the maximum diameter in mm, while others, mainly PCNL, studies the stone surface in mm2 [11, 37, 38]. Additionally, active stone treatment differs according to stone location and size, making a cumulative presentation of outcomes a great challenge. Three of the SWL groups treated single calculi [2224], while another two reported results by also treating multiple stones [21, 25]. Two studies included only lower pole stones [21, 24], while the rest also reported non-lower and pelvic stones [22, 23, 25]. The stone number range reported in the RIRS studies was 1–3 and all cohorts included lower pole as well as non-lower pole stones. However, the stone number and location in the PCNL groups could not be clearly assessed, possibly because of the high presence of multiple as well as staghorn calculi. Resorlu et al. [2] admit that the size of the stones treated by PCNL was significantly greater than the mean stone sizes of RIRS and SWL. Additionally, especially for SWL, the stone type is an independent predictor for treatment success. However, stone composition is only reported in two studies [2, 29]. The rest of the studies mainly reports stone maximal density in Hounsfield Units [2225, 27, 28].

Furthermore, no SWL study provides information regarding the possible use of adjuvant expulsive therapy, which can facilitate passage of stones and improve SFRs. Additionally, many SWL studies present their cumulative outcomes and not the SFRs and SCRs after the first session [2, 21, 2629]. However, many patients required more than one session in order to achieve the desirable result [2, 21, 26, 27, 29]. Stent placement after RIRS as well as the timing of stent removal in relation to performing NCCT could also add important information. Only two RIRS groups placed stents in all patients [30, 33], the rest leaving stent placement in surgeon discretion [2, 28, 31, 32, 34]. Two groups reported the times of stent removal ranging from 1 to 2 weeks postoperatively [2, 30]. Finally, stone dusting during RIRS could present misleading results, as a possible dust accumulation postoperatively does not necessarily mean the presence of significant residual fragments. The use of CT-magnified bone windows can overcome this impediment [46].

Other important intraoperative and postoperative parameters that could influence the results, like surgeon experience, procedure time, CRs, and length of hospitalization constitute possible drawbacks that reflect the greater invasiveness of PCNL [2, 3, 28]. The mean duration of RIRS and PCNL procedures ranged from 43 to 114 min [2, 2830, 33, 34] and 46.7–149.44 min [2, 28, 29, 38], respectively. However, the authors did not define if the PCNL duration refers to the total duration or the time starting from the kidney puncture and finishing at the end of the procedure. In one study, RIRS had more intraoperative (20 vs. 3 %) but comparable postoperative complications (21.3 vs. 23 %) with SWL [28], and in another, PCNL presented equal complication rates with SWL (6.4 %) [29]. Finally, in the study by Resorlu, the complication rates were significantly higher (p < 0.001) for PCNL (22.1 %) in comparison to RIRS (10.9 %) and SWL (7.6 %) [2]. Nevertheless, only one PCNL study used the Clavien classification system to classify complications [38]. The duration of hospitalization was evident only in two comparative studies ranging 0.06–1.3 days for RIRS and 2.6 days for PCNL [2, 28]. Additionally, as success of PNL is highly surgeon dependent, data on the surgeon experience are of paramount importance for outcome assessment. Unfortunately, this information is unreported in many studies. Finally, although mini-, micro-, and minimally invasive PCNL technology is recently introduced, presenting favorable outcomes and minimizing CRs, no such study presents outcomes according to NCCT follow-up.

The post-treatment imaging utilized to detect stone residuals, but also the best time to perform it, are additional but critical parameters that should always be precisely reported. Currently, many studies substantiate the superiority of NCCT over kidney–ureter–bladder (KUB) X-ray, intravenous pyelography, or ultrasonography in detection and evaluation of residual stones after active treatment, as well as prediction of residual stone-related events and detection of complications of treatment [812, 14]. Furthermore, in many occasions, trying to detect residual fragments with plain KUB is like throwing a coin. Park et al. [11] showed that 45.5 % of patients who were stone free with KUB had actually residual stones larger than 4 mm in diameter (mean size: 7.4 mm) detected by CT. In that way, NCCT secures a better treatment plan and the avoidance of unnecessary ancillary treatment. There is also evidence that CT-magnified bone windows are the most accurate method of stone measurement [46]. To our knowledge, this is the first systematic review trying to estimate the treatment outcomes based exclusively on CT findings. Nevertheless, it is of utmost importance to fully describe the technique and equipment used, and the CT exposure data, as well as the length of the sections taken, a factor that also greatly varied in our study. Imaging at the end of the first month after treatment is considered optimal as it gives enough time for stone debris and small insignificant residual fragments to be excreted [11, 47]. Surprisingly, in almost all the PCNL single-arm studies, NCCT was performed before this time period or even direct postoperatively. As a consequence, most probably their presented SFRs and SCRs do not reflect the reality [9, 12, 14, 3538].

Finally, the most important parameter is the proper definition of the significant residual stone size, as small residual fragments may pass spontaneously without creating stone-related events. In our study, all SWL studies define a cut-off value of 4 mm [2125] as clinically insignificant. Nevertheless, even smaller residual fragments could cause symptoms and require active intervention [37, 48, 49]. Hence, in the CT-era, and preferably utilizing low-dose CT, trying to achieve a stone-free status should be the absolute target of every endourologist, especially in infected stones. Finally, the need for additional treatment or, in other words, the ancillary procedure rates, are not always reported. Nevertheless, as presented by De et al. [3], the need for further treatment appears to be less in PCNL.

Summarizing the apparent shortcomings of the present review, the selected studies often lack important information regarding stone characteristics like size, number, location, and composition. In particular, for SWL studies, the exact number of sessions is not always reported. Additionally, postoperative CT timing differs between different articles and CT characteristics like quality and collimation are often missing. Finally, heterogeneity of SCRs due to variable definitions of significant residual lithiasis greatly impedes a precise outcome evaluation.

Conclusion

The current data for active stone treatment characteristics and outcomes remain incomplete and greatly heterogeneous. By using NCCT for residual stone size evaluation as a filter, it appears that PCNL offers better SFRs than SWL and RIRS. In light of these results, future studies should adapt to the suggested standardizations and postoperative NCCT control in order to improve the quality of presented data.