Introduction

Radical cystectomy (RC) with pelvic lymph node dissection and urinary diversion is the standard treatment for muscle-invasive (MIBC) and very high-risk non-muscle invasive bladder cancer (NMIBC) [1, 2]. Advances in technology have facilitated the adoption of minimally invasive surgery such as robot-assisted radical cystectomy (RARC). With its first description in 2003 [3], it has found increasing adoption worldwide for the treatment of advanced bladder cancer. For example, in the United States, the rate of RARC has increased from 0.6% of all RCs in 2004 to 18.5% in 2012 [4, 5].

Somewhere between 30 and 50% of patients treated with radical cystectomy experience local and/or distant recurrence despite adequate surgery [6, 7]. Various quality criteria have been identified to reflect the oncologic effectiveness of RC such as soft tissue surgical margins (STSM), numbers of lymph nodes removed and early disease recurrence [8,9,10].

Despite several randomized-controlled trials (RCTs) and non-randomized studies (NRCTs) reporting on the short-term as well as long-term oncologic outcomes of RARC and ORC, there is still no consensus on the differential comparative oncologic effectiveness of RARC versus ORC [11,12,13,14,15]. We, therefore, conducted an up-to-date systematic review and meta-analysis of the literature comparing oncologic outcomes of patients treated with RARC to those treated with ORC.

Materials and methods

The protocol was registered in the International Prospective Register of Systematic Reviews database (PROSPERO: CRD42018109437).

Literature search

This systematic review and meta-analysis was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) statement [16]. A completed PRISMA-P 2015 checklist and Meta-analyses Of Observational Studies in Epidemiology (MOOSE) checklist are shown to describe the methodology of our study regarding RCTs and NRCTs, respectively (Supplementary Tables 1, 2). We searched the electronic databases (MEDLINE, Web of Science, Cochrane Library and Scopus) on September 10th 2018 for studies comparing oncologic outcomes between RARC and ORC. After a first screening based on study title and abstract, all full text papers were assessed and excluded with reasons. Two reviewers carried out this process independently. All disagreements were resolved by a consensus or arbitration by third investigators. The following string terms were used in our search strategy: (robotic radical cystectomy OR robot-assisted radical cystectomy OR RARC) AND bladder cancer AND (surgical margin OR lymph node OR oncologic outcome).

Inclusion/exclusion criteria

Studies were included if they compared RARC to ORC and reported surgical margin status, lymph node yield and/or survival outcomes between both arms in RCTs or NRCTs such as prospective and retrospective observational (or cohort) studies. We excluded review articles, editorials, comments, meeting abstracts and not in English. In case of the similar patient cohort publications, either the higher quality or the most recent publication was selected. We manually searched the reference lists of eligible studies to detect any potentially relevant articles.

Data extraction

Two reviewers independently extracted and summarized the following data from the included studies: general study characteristics, patient demographics and oncologic outcomes. The outcomes of interest were PSM (overall, soft tissue and ureteral/urethral PSM) rates, mean lymph node yield (standard or extended lymph node dissection (LND)), disease recurrence (local and distant), recurrence-free survival (RFS) rate, cancer-specific survival (CSS) rate and overall survival (OS) rate. We defined the extent of LND as following based on the guideline [17] if there was no description of its definition in the article; standard LND was defined as the removal of nodal tissue cranially up to the common iliac with the ureter being the medial border, obturator, internal iliac, external iliac and presacral nodes. Extended LND was defined as a proximally up to the aortic bifurcation as well as the area described for standard LND. Any disagreement was resolved by the senior author.

Statistical analysis

The relative risk (RR) and weighted mean difference (WMD) were used as the summary statistic for dichotomous and continuous variables, respectively. All results were reported with 95% confidence intervals (CIs). For studies that presented continuous data as median and range or interquartile range, the means and standard deviations were calculated using the technique described by Hozo et al. [18]. We analyzed the data from RCTs and NRCTs separately to reduce bias. Statistical heterogeneity between studies was assessed by the Chi-square test with p < 0.10 and the I2 test with I2 < 50% used for statistical significance. A random-effect model was used for outcomes in cases of significant heterogeneity; otherwise, the fixed-effect model was used. Statistical analyses were performed using Review Manager Version 5.3 (RevMan-Computer program, Version 5.3 Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014).

Risk of bias

Two reviewers independently assessed the risk of bias (RoB) of each individual study. An evaluation of RoB of the included studies was performed according to the Cochrane handbook [19]. For RoB assessment, selection, performance, detection, attrition, reporting bias and other potential sources of bias were assessed as “yes”, “no”, or “unclear” in each of the included RCTs (Supplementary Table 3). The RoB assessment of NRCTs was evaluated according to The Risk of bias in non-randomized studies of interventions. This tool is based on seven domains that included bias due to confounding, participant selection, classification of interventions, deviations from intended intervention, missing data, measurement of outcomes and selection of the reported result (Supplementary Table 4).

Result

Quantity of evidence identified and characteristics of included studies

A total of 543 articles were identified by the initial search (Fig. 1). After removal of duplicates, 270 remained for the screening of titles and abstracts. We excluded 161 articles based on our inclusion and/or exclusion criteria. Then, we assessed the full texts of the remaining selection leaving 33 studies for the qualitative and quantitative analyses. The general characteristics of the included studies are summarized in Table 1. Five studies were RCTs [11, 12, 20,21,22] comprising 501 patients and 28 studies were NRCTs [13,14,15, 23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47] comprising 25,991 patients. These studies were published between 2006 and 2018 with 19 being from North America, eight from Europe and five from Asia. Twenty-three studies reported neoadjuvant chemotherapy (NAC) rates (RARC: 2.0–100%, ORC: 0–100%). Pooled rates of NAC were 26.8% for RARC and 36.5% for ORC in RCTs and 21.9% for RARC and 19.1% for ORC in NRCTs, respectively. In one study [14], patients who received NAC were excluded. Thirteen studies reported adjuvant chemotherapy rates (RARC: 4.5–36.9%, ORC: 1.5–46.3%). Pathological outcomes are summarized in Table 2. PSM rates and lymph node yield were reported in 33 and 31 studies, respectively. Standard and extended LND were performed in 13 and nine studies, respectively. Other studies included mix cohorts with standard/extended LND or did not report the extent of LND. Oncologic outcomes are summarized in Table 3. The duration of follow-up in RARC and ORC arms varied from 8 to 58.8 months and from 12 to 59.1 months, respectively. Eight studies reported local and/or distant recurrence. Estimated RFS, CSS and OS rates were reported in seven, five and seven studies, respectively. It was not possible to perform meta-analysis of survival outcomes in eligible studies, due to the lack of data availability.

Fig. 1
figure 1

Flow chart for article selection process to analyze oncologic outcomes in patients treated with robot-assisted radical cystectomy (RARC) compared to those treated with open radical cystectomy (ORC)

Table 1 Characteristics of the included studies
Table 2 Pathological outcomes in RARC and ORC
Table 3 Oncologic outcomes after RARC and ORC

Meta-analysis

Comparison of PSM rates between RARC and ORC

Four RCTs including 500 patients and 27 NRCTs including 25,881 patients reported differences in PSM rates between RARC and ORC. The forest plot (Fig. 2a) showed that there was no significant difference in PSM rates (RR: 1.16, 95% CI 0.56–2.37, p = 0.69) between RARC and ORC in RCTs. Conversely, in NRCTs, RARC was associated with lower PSM rates (RR: 0.84, 95% CI 0.77–0.91, p < 0.0001) compared to ORC.

Fig. 2
figure 2

Forest plots showing the comparison of overall positive surgical margin between RARC and ORC. CI confidence interval, M–H Mantel–Haenszel test, ORC open radical cystectomy, RARC robot-assisted radical cystectomy, SD standard deviation

We analyzed separately PSM rates in patients with pathological T1–2 and T3–4 tumors. One RCT including 208 patients and four NRCTs including 6082 patients reported PSM rates in patients with pathological T1–2 tumors between RARC and ORC. Three RCTs including 148 patients and ten NRCTs including 6462 patients reported PSM rates in patients with pathological T3–4 tumors between RARC and ORC. The forest plots (Supplementary Fig. 1a, b) showed that there was no significant difference in PSM rates of patients with pathological T1–2 and T3–4 tumors between RARC and ORC in both RCTs (RR: 1.00, 95% CI 0.14–6.97, p = 1.00 and RR: 1.15, 95% CI 0.50–2.66, p = 0.75, respectively) and NRCTs (RR: 1.11, 95% CI 0.81–1.52, p = 0.52 and RR: 0.90, 95% CI 0.79–1.02, p = 0.09, respectively). The Chi-square and I2 test did not show any heterogeneity in any of the pooled analyses (Supplementary Fig. 1a, b).

Additionally, we analyzed separately the soft tissue positive surgical margin (STSM) and ureteral/urethral PSM rates. Three RCTs including 460 patients and eight NRCTs including 1280 patients reported STSM rates between RARC and ORC. One RCT including 302 patients and five NRCTs including 517 patients reported ureteral/urethral PSM between RARC and ORC. The forest plots (Fig. 3a, b) showed that there was no significant difference in STSM and ureteral/urethral PSM rates between RARC and ORC in both RCTs (RR: 1.00, 95% CI 0.40–2.47, p = 1.00 and RR: 0.76, 95% CI 0.17–3.34, p = 0.72, respectively) and NRCTs (RR: 0.68, 95% CI 0.43–1.07, p = 0.09 and RR: 1.30, 95% CI 0.68–2.50, p = 0.42, respectively). The Chi-square and I2 test did not show any heterogeneity in any of the pooled analyses (Fig. 3a, b).

Fig. 3
figure 3

Forest plots showing the comparison of (a) soft tissue positive surgical margin and (b) ureteral/urethral positive surgical margin between RARC and ORC. CI confidence interval, M–H Mantel–Haenszel test, ORC open radical cystectomy, RARC robot-assisted radical cystectomy, SD standard deviation

Comparison of lymph node yield between RARC and ORC

Two RCTs including 83 patients and eight NRCTs including 1404 patients reported standard LND data comparing RARC to ORC. The forest plot (Supplementary Fig. 2a) showed that there was no significant difference in lymph node yield between RARC and ORC in both RCTs (WMD: 4.81, 95% CI − 4.74–14.36, p = 0.32) and NRCTs (WMD: − 2.08, 95% CI − 5.84–1.67, p = 0.28). The Chi-square and I2 test showed significant heterogeneity in all pooled analyses (Supplementary Fig. 2a). Two RCTs including 116 patients and five NRCTs including 864 patients reported extended LND data comparing RARC to ORC. The forest plot (Supplementary Fig. 2b) showed that there was no significant difference in lymph node yield between RARC and ORC in both RCTs (WMD: − 1.21, 95% CI − 3.91–1.49, p = 0.38) and NRCTs (WMD: − 1.56, 95% CI − 4.31–1.18, p = 0.26). The Chi-square and I2 test showed significant heterogeneity in NRCTs pooled analyses (Supplementary Fig. 2a, b).

Comparison of disease recurrence between RARC and ORC

Two RCTs including 340 patients and five NRCTs including 748 patients reported local recurrence data. Three RCTs including 458 patients and five NRCTs including 767 patients reported distant recurrence data. The forest plots (Supplementary Fig. 3a, b) showed that there was no significant difference in local and distant recurrence rates between RARC and ORC in both RCTs (RR: 1.19, 95% CI 0.39–3.65, p = 0.75 and RR: 0.94, 95% CI 0.69–1.30, p = 0.73, respectively) and NRCTs (RR: 0.72, 95% CI 0.46–1.12, p = 0.15 and RR: 0.79, 95% CI 0.58–1.06, p = 0.12, respectively). The Chi-square and I2 test did not show any heterogeneity in any of the pooled analyses (Supplementary Fig. 3a, b).

Comparison of survival between RARC and ORC

Recurrence free survival

Two RCTs reported disease recurrence rates between RARC and ORC. Bochner et al. [11]. reported that 5-year risk of recurrence rates of 118 patients were 36% for RARC and 41% for ORC after a median follow-up of 58.8 months. Parekh et al. [12]. reported that 2-year progression-free survival rates of 302 patients were 72.3% for RARC and 71.6% for ORC (p = 0.90) with a minimum follow-up of 24 months. Five NRCTs reported RFS rates. 2-year RFS rates ranged from 67 to 87.8% for RARC and 58 to 84.4% for ORC [15, 39, 47]. 5-year RFS rates ranged from 52.1 to 54.2% for RARC and 32.2 to 57.1% for ORC [28, 38]. All studies concluded that there was no significant difference regarding RFS rates between RARC and ORC.

Cancer-specific survival

Five NRCTs reported CSS rates between RARC and ORC. Two-year CSS rates ranged from 75–90% for RARC compared to 63–88.3% for ORC [13, 15, 39, 47]. Gandaglia et al. [28], in the largest NRCT, showed that there was no difference in 5-year CSS with a median follow-up of 40.0 and 59.1 months, respectively [i.e. 73.5% for RARC and 61.9% for ORC (p ≥ 0.1)]. All studies concluded that there was no significant difference regarding CSS rates between RARC and ORC.

Overall survival

Seven NRCTs reported OS between RARC and ORC. Two-year OS rates ranged from 68–85.2% for RARC and 62.5–86% for ORC [13,14,15, 39, 47]. Five-year OS rates ranged from 48.4–59.2% for RARC and 44.3–58.4% for ORC [28, 38]. All studies concluded that there was no significant difference regarding OS rates between RARC and ORC.

Discussion

The present systematic review and meta-analysis of five RCTs comprising 541 patients and 28 NRCTs comprising 25,991 patients aimed to compare differences in oncologic outcomes of patients treated with RARC to those treated with ORC.

Surgical margin status is pathologically diagnosed by the absence or presence of tumor in the margin of soft tissue, ureter or urethra. Positive surgical margin (PSM) was reported as one of independent predictors of disease recurrence [48]. PSM rates in open radical cystectomy (ORC) were reported in around 2% of patients with pathological T1-2 tumors with higher rates in patients with pathological T3–4 tumors [48, 49]. Wide dissection of the peri-vesical tissue would theoretically minimize PSM rates especially in patients with clinical non-organ confined disease [50]. We analyzed RCTs and NRCTs separately to decondition several potential biases that are inherent to NRCTs. We failed to find any difference in PSM rates between RARC and ORC in RCTs. Interestingly, NRCTs found a higher rate of PSM for ORC compared to RARC. For example, Tan et al. [47]. reported that RARC was superior to ORC regarding PSM rates (8.2% vs 19.3%). However, NRCTs are limited in validity due to their design issue leading to selection, reporting and detection bias among others [51]. For example, patients with higher pathological stage were more likely to receive ORC [24, 29, 32, 40, 47]. Therefore, RCTs are necessary to ensure equality or superiority of one method over another. Our meta-analysis of RCTs showed no difference in PSM rates between RARC and ORC. However, the type of PSM confers a different prognosis based on location. Soft tissue surgical margin (STSM) is associated with an almost unanimous locoregional recurrence, distant metastasis and eventual death within 24 months after surgery [48, 49, 52].

We found no difference in STSM rates between RARC and ORC in both RCTs and NRCTs. The pooled rates of STSM in NRCTs were 3.6% for RARC and 8.3% for ORC, which is comparable to historic rates of STSM in expert centers [48, 49]. In our meta-analysis of STSM, all included studies were performed in experienced centers with ≥ 10RCs/year. In RCTs, there was also no difference in STSM rates (3.6% vs 3.6%). Therefore, one could conclude that RARC does not lead to higher rates of STSM compared to ORC performed at experienced centers. Such experience is what patients, regulators and insurances require more and more through centralization of complex surgery such as RC [53].

We found no difference in lymph node yields between RARC and ORC in both RCTs and NRCTs. The extent of LND is a quality criteria for RC. There has been discussion on the optimal LND and whether it is a surrogate for the quality of surgery and care delivered in a complex disease such as MIBC [52]. The EAU guidelines do not establish the extent of LND and number of lymph nodes needed to be removed [1]. Bochner et al. [54]. found that if lymph nodes are given to the pathologist in packets rather than en bloc, the number of lymph nodes removed will be higher. While the number of lymph nodes and template of LND are critical for RC, RFS and eventually CSS and OS are the more important endpoints for the patients and their caretakers.

We found no difference in RFS, CSS and OS between RARC and ORC in both RCTs and NRCTs. However, the median follow-up duration for most studies is too short to allow a conclusive statement. Indeed, most recurrences happen in patients treated with RC within the first 2 years [55]. In the current study, there were only three NRCTs with follow-up duration of more than 2 years [15, 28, 38]. These studies demonstrated that RARC was associated with a higher RFS compared to ORC without significant difference. The explanations of these results might be that the RARC arm has a shorter follow-up duration and includes lower stage patients. Indeed, in RCTs, there is no difference in RFS between RARC and ORC. Nguyen et al. [40]. reported that RARC had higher extrapelvic and peritoneal carcinomatosis compared to ORC. The issue of potential peritoneal tumor spread as a result of the effect of pneumoperitoneum warrants further research.

The major limitation of this meta-analysis is the quality of included studies; All NRCTs were observational and retrospective studies with high risks of bias and confounding. In particular, selection bias may have affected oncologic outcomes. Additionally, heterogeneity in definition of PSM, lymph node template, pathologic review may affect oncologic outcomes. For example, some articles reported PSM as soft tissue status, other articles defined PSM as soft tissue and ureteral/urethral status or did not report. This discrepancy is an ascertainment bias which is related to selection bias. Additionally, the lack of longer follow-up RCTs limit to allow a conclusive statement regarding the long-term oncologic outcomes. In addition, neoadjuvant chemotherapy has been shown an 8% improvement in 5-year survival outcomes in muscle-invasive bladder cancer [56]. In this systematic reviews, pooled NAC rates were 21.9% (2.0–100%) in RARC and 19.1% (0–100%) in ORC. Given the impact of NAC on oncologic outcomes, this might limit the generalizability of our results. Taken together, we highlight that ORC is still the gold standard treatment for MIBC and very high risk NMIBC; RARC might be an alternative by an experienced surgeon. A further long-term follow-up studies are necessary to evaluate further survival outcomes and to differentially assess the quality of life and complications.

Conclusions

Based on the current evidence, there is no difference in the rate of PSMs, lymph node yield, location of recurrence and survival outcomes between RARC and ORC in RCTs. In NRCTs, only PSM rates were better for RARC compared to ORC, but this discrepancy could be likely due to biases in selection and reporting. Furthermore, well-designed studies regarding survival outcomes with long-term data between RARC and ORC are needed to evaluate the oncologic outcomes of these two surgical approaches and to differentially assess the quality of life and complications.