Introduction

Retropubic midurethral slings (RP-MUS) and transobturator midurethral slings (TO-MUS) (inside-out, TVT-O, and outside-in, TOT) are widely used procedures for the surgical treatment of female stress urinary incontinence (SUI), yielding high cure rates [1]. However, there are concerns about the safety of both approaches: RP-MUS have been related to bladder perforations and hemorrhagic complications, and TO-MUS seem to be linked to persistent and/or chronic groin/thigh pain [2]. Concerns about the safety and effectiveness of vaginal meshes led the FDA to release a notification on the subject of complications following their use (http://www.fda.gov/medicaldevices/safety/alertsandnotices/publichealthnotifications/ucm061976.htm). Even though the notification concluded that serious adverse events following the use of vaginal meshes for pelvic organ prolapse are not rare, slings were deemed to be relatively safe. Notwithstanding this document, MUS continue to be an issue, and some regulatory bodies are planning to limit their use until new evidence shows their safety (http://www.sehd.scot.nhs.uk/cmo/CMO(2014)15.pdf). Adding to the debate is the fact that their long-term efficacy has also been questioned by these same regulatory bodies.

A number of meta-analyses evaluating MUS have shown both their superiority in comparison with other techniques (colposuspension, autologous fascia slings, single-incision slings) and their efficacy and safety in different patient populations [1, 311]. All these meta-analyses included mainly short-term studies with only a few medium-term studies, and no long-term studies. In the case of prosthetic material, it has been suggested that only studies evaluating results 5 years after the original procedure can be defined long-term follow-up studies, with medium-term follow-up studies limited to those evaluating results 3 years after the original procedure [12].

The aim of this systematic review and meta-analysis of the literature was to evaluate the long-term effectiveness and safety of RP-MUS and the medium-term outcomes of TO-MUS procedures for SUI reported in randomized controlled trials (RCT) and nonrandomized studies and to compare those outcomes with those associated with different types of MUS.

Sources

An updated meta-analysis was performed following the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) Statement guidance [13]. All studies investigating the medium-term and long-term effectiveness and safety of MUS in women affected by SUI were included in this review. They were identified by searching the MEDLINE, EMBASE, National Library for Health, ClinicalTrials.gov, and Google Scholar databases (up to June 2014). The abstracts from International Conferences were not included in this systematic review. The search terms included simple text or subject subheadings with the language limited to English. They included: “stress urinary incontinence”, “midurethral sling”, “tension-free tape”, “transobturator tape”, and ”follow-up”. A hand search of the bibliographies and citation lists of all relevant reviews and primary studies was performed to identify articles not captured by the electronic searches with restriction to the English language. No ethical approval was requested because the study was a systematic review and meta-analysis.

Study selection

Retrospective, cohort, prospective nonrandomized studies and RCTs of women who had undergone RP-MUS or TO-MUS (including TVT-O and TOT) as the primary procedure for SUI with a mean or median follow-up of at least 36 months for TO-MUS and at least 60 months for RP-MUS were included. RCTs comparing RP-MUS and TO-MUS required a minimum follow-up of 36 months to be included in the analysis. No limitations concerning inclusion criteria of each study were set and only studies comparing RP-MUS or TO-MUS and another synthetic sling were included in this review. Studies including exclusively devices no longer available on the market at the date of the review were excluded. Studies including both RP-MUS or TO-MUS on the market and a device not on the market were included.

Three reviewers (G.A.T., C.F. and A.F.) selected the studies independently on the basis of the inclusion criteria and clarifications were sought from the individual trial lists if required. Disagreements among reviewers as to the studies to include were solved by discussion and, if necessary, by a majority decision of the reviewers. In cases of duplication, the study with the most recent data was included. In case of cohort studies with multiple publications, the last dataset on efficacy was used, while safety data were extracted from all published articles.

Three researchers (G.A.T., C.D.C. and C.F.) extracted data for quality and results independently. Data included the characteristics of patients (number, lost to follow-up and age), intervention, comparison for RCTs, follow-up length, exclusion criteria, preoperative and outcome assessment methodology, and results of the studies. The Jadad score was used to assess the quality of RCTs and the Newcastle-Ottawa scale was used for nonrandomized studies. The risk of bias across studies was assessed according to the Cochrane Handbook for Systematic Reviews.

Statistical analyses were performed by one author (G.A.T.). Statistical analyses of the RCTs were performed using RevMan 5.3 software (The Nordic Cochrane Center, Copenhagen, Denmark). For homogeneity, the per-protocol outcomes were used in the meta-analysis. For dichotomous data, the results of each study were expressed as odds ratios (OR) with 95 % confidence intervals (CI) and combined for meta-analysis using the Mantel-Haenszel method (fixed effects model). For nonrandomized studies, regression rates from individual studies were meta-analyzed using a fixed effects model. Statistical analyses were performed using CMA software (Biostat, Englewood, NJ). Heterogeneity among studies was analyzed using I 2. Differences in the proportions of complications were evaluated using the chi-squared test and are expressed as OR with 95 % CI. Statistical significance was set at p < 0.05.

Results

A flow chart of the study selection process is shown in Fig. 1. We identified 78 potentially relevant studies and all underwent detailed review. Of these studies, 29 were excluded due to duplication of data reported in included studies, follow-up not meeting the inclusion criteria, outcome not reported or not completely specified, unclear study design, device not available on the market, or nonstandard device or procedure used. Thus, a total of 49 studies were included in the review.

Fig. 1
figure 1

Selection process for the systematic review

There were 11 RCTs [1424] and 38 nonrandomized studies, including prospective, retrospective, and cohort studies [2562] with a total of 6,406 patients (1,200 in RCTs and 5,206 in nonrandomized studies) aged 19 – 89 years. The sample size ranged from 15 to 563 patients evaluated at the last follow-up. The characteristics of the included studies are shown in Tables 1 and 2. The total numbers of patients evaluated according to the device used are shown in Table 3.

Table 1 RCTs included in the systematic review
Table 2 Prospective, retrospective and cohort studies included in the systematic review
Table 3 Number of patients treated and evaluated in the medium-term and long-term per type of device

All RCTs showed a Jadad score of ≥3, with the exception of two studies [20, 24] (Table in Electronic supplementary material). A power calculation was reported in six studies (54.5 %). Among the non-RCT studies, SUI was clinically diagnosed in eight (no urodynamic studies performed) [26, 34, 36, 41, 48, 51, 53, 60] and a blinded assessment was performed in only three [40, 50, 51] (Figure in Electronic supplementary material). Outcome was assessed as a composite measure of cure (objective and subjective definition of cure met) [20, 24, 30, 31, 35, 59], subjective [14, 19, 21, 26, 34, 36, 38, 39, 42, 44, 48], objective outcome [15, 16, 22, 50] or both [17, 18, 23, 25, 2729, 32, 33, 37, 40, 41, 43, 4547, 49, 5158, 6062]. The follow-up was described in each study with a minimum of 36 months and a maximum of 17 years. Subjective parameters and objective outcome measures used in the studies included are reported in Electronic supplementary material.

Cure and success rates

On meta-analysis of RCTs, the ORs for objective cure with RP-MUS and TO-MUS were similar (OR 1.15, 95 % CI 0.75 – 1.76; p = 0.72; Fig. 2a). The two different transobturator approaches were not significantly different from RP-MUS. The subjective cure rate with RP-MUS was significantly higher than with TO-MUS (OR 1.76, 95 % CI 1.08 – 2.86; p = 0.02; Fig. 2b). Separate analysis of TVT-O and TOT showed that subjective cure rate with TOT was lower than with RP-MUS (OR 3.80, 95 % CI 1.74 – 8.33; p = 0.0008), while there was no difference between RP-MUS and TVT-O but based on only one study [16] (OR 0.92, 95 % CI 0.47 – 1.80; p = 0.81; Fig. 2b).

Fig. 2
figure 2

Meta-analysis of the efficacy of retropubic and transobturator TVT (RCTs). a Objective cure rates; b Subjective cure rates

There were no RCTs comparing TVT-O with TOT in which objective outcomes were reported, while a single study [14] showed similar odds of subjective cure (OR 1.08, 95 % CI 0.61 – 1.91; p = 0.80). Similarly, the only RCT comparing the composite cure rates of TVT-O and TOT [20] showed similar outcomes (OR 0.92, 95 % CI 0.25 – 3.32; p = 0.80). Objective and subjective cure rates in RCTs were similar when comparing TO-MUS and Tissue Fixation System (TFS), TVT SECUR and TVT ABBREVO (objective cure OR 1.06, 95 % CI 0.59 – 1.91; subjective cure OR 1.26, 95 % CI 0.68 – 2.33; Fig. 3).

Fig. 3
figure 3

Meta-analysis of the efficacy of transobturator tapes and TFS, TVT SECUR, and TVT ABBREVO (RCTs). a Objective cure rates; b Subjective cure rates

Considering the non-RCTs, RP-MUS showed cumulative subjective, objective and composite cure rates of 72.7 % (95 % CI 70.9 – 74.5 %), 83.2 % (95 % CI 80.8 – 88.7 %), and 84.1 % (95 % CI 81.7 – 86.1 %), respectively (Fig. 4).TO-MUS showed cumulative subjective and objective cure rates of 82.1 % (95 % CI 79.9 – 84 %) and 83.5 % (95 % CI 81.1 – 85.7 %; Fig. 5).

Fig. 4
figure 4

Meta-analysis of the efficacy of retropubic TVT (non-RCTs). a Cumulative subjective cure rates; b Cumulative objective rates; c Composite cure rates

Fig. 5
figure 5

Meta-analysis of the efficacy of transobturator TVT (non-RCTs). a Cumulative subjective cure rates; b Cumulative objective rates

Complications

Overall, there were 766 complications reported with RP-MUS (complication rate 19.3 %; 72 in RCTs, 20.4 %. and 694 in non-RCTs, 19.2 %) and 579 with TO-MUS (complication rate 23.8 %; 198 in RCTs, 23.4 %, and 381 in non-RCTs, 24.0 %). De novo overactive bladder (OAB) symptoms (10 %) and urinary tract infections (UTI, 9.3 %) were the most common complications of RP-MUS, while de novo OAB symptoms (10.3 %) and postoperative and chronic pain (5.9 %) were the most frequent complications of TO-MUS (Fig. 6). Not all studies clearly defined postoperative, persistent and chronic pain. Defining as persistent pain all pain reported beyond the perioperative period (>7 days after procedure), no differences were observed between RP-MUS and TO-MUS (2.2 % vs. 1.9 %). UTIs (10.1 % vs. 3.6 %, OR 3.29, 95 % CI 2.07 – 5.28; p < 0.001), and bladder/urethral perforations (2.5 % vs. 0.4 %, OR 7.01, 95 % CI 2.94 – 17.90; p < 0.001) were more common with RP-MUS than with TO-MUS, while vaginal injuries (3.3 % vs. 0.4 %, OR 7.96, 95 % CI 1.15 – 157.9; p = 0.02) and pain (5.9 % vs. 1.8 %, OR 3.47, 95 % CI 1.93 – 6.35; p < 0.001) were more frequent with TO-MUS.

Fig. 6
figure 6

Complications found in the studies evaluating long-term and medium-term outcomes following retropubic and transobturator TVT. *p < 0.001 , **p = 0.02, vs. transobturator TVT

In RCTs comparing RP-MUS and TO-MUS, 72 (17.2 %) and 77 (18.2 %) complications were reported, respectively. This outcome was not significantly different between the two approaches (OR 0.94, 95 % CI 0.68 – 1.29; p = 0.68; Fig. 7a–b). Bladder injuries were more frequent with RP-MUS (OR 6.80, 95 % CI 1.77 – 26.07; p = 0.005) than with TO-MUS. Pain was not different between RP-MUS and TO-MUS (OR 0.78, 95 % CI 0.19 – 3.20).No significant difference was observed in complication rates between TVT-O and TOT (OR 0.80, 95 % CI 0.52 – 1.21; Fig. 7c). Vaginal injuries were more common with TOT than with TVT-O (OR 0.16, 95 % CI 0.05 – 0.57). Complications were more common with TO-MUS than with the newer minimally invasive tapes (OR 1.99, 95 % CI 1.05 – 3.75; p = 0.03; Fig. 7d). This result was due exclusively to pain-related complications, which were more common with TO-MUS (OR 8.75, 95 % CI 2.02 – 37.90; p = 0.004), based on three studies [2123]. Excluding this type of complication, no significant differences were observed (OR 0.99, 95 % CI 0.45 – 2.19; p = 0.98).

Fig. 7
figure 7figure 7figure 7figure 7

Meta-analysis of complications in RCTs. a-b Retropubic vs. transobturator TVT; c Inside-out vs. outside-in TVT; d Transobturator TVT vs. TFS, TVT SECUR, and TVT ABBREVO

Tape-related long-term complications

Overall, vaginal erosions were not different between RP-MUS and TO-MUS (Fig. 6), being 2.1 and 2.7 %, respectively. Persistent or chronic pain (i.e. pain persisting beyond the perioperative period or reported at the last follow-up visit) was reported by 13 patients for RP-MUS and 30 patients for TO-MUS. Persistent or severe voiding problems (defined either by difficulties persisting beyond the perioperative period or needing a tape release) were observed in 55 patients for RP-MUS and 39 patients for TO-MUS.

In RCTs vaginal erosions were less frequent with RP-MUS than with TO-MUS (OR 0.24, 95 % CI 0.07 – 0.84; p = 0.03; Fig. 7a), while no differences were observed between TOT and TVT-O or between TO-MUS and minimally invasive slings (Fig. 7b, c). Neither pain (OR 0.78, 95 % CI 0.19 – 3.20) nor persistent voiding dysfunction (OR 1.23, 95 % CI 0.58 – 2.59) was significantly different between RP-MUS and TO-MUS (Fig. 7a–b). Similarly, no differences in these complications were observed between TOT and TVT-O (Fig. 7c).

Risk of bias

The risk of bias was assessed using a risk-of-bias graph (see Electronic supplementary material). Most RCTs had good sequence generation and allocation concealment; however, reporting of blinding methods in most RCTs was generally poor.

Heterogeneity

No studies were excluded on the basis of methodological heterogeneity. In RCTs, estimated statistical heterogeneity in complication rate (as measured by I 2) between RP-MUS and TO-MUS was moderate (between 25 % and 75 %) in the comparison. There was a high degree (>75 %) of statistical heterogeneity in subjective cure rate TOT and TVT-O. All other comparisons showed a low statistical heterogeneity (<25 %). In non-RCT studies, there was a high degree of statistical heterogeneity in efficacy outcomes for RP-MUS and moderate heterogeneity in efficacy outcomes for TO-MUS.

Discussion

This meta-analysis indicated that both RP-MUS and TO-MUS are associated with high objective and subjective cure rates in the long- and medium-term. RCTs showed a significantly higher subjective cure rate with RP-MUS than with TO-MUS, due to one study defining subjective success as the absence of reoperation for recurrent SUI on request by the patients [21], which may have altered the results. On the other hand, equivalence of RP-MUS and TVT-O relies only on one study [18]. Our results are similar to those of Latthe et al. [3], Zhu et al. [8], and Schimpf et al. [11] for short term follow-up, but differ from those of Novara et al. [1] and Ogah et al. [5], who found a higher objective cure rate for retropubic TVT. However, on sensitive analysis, no differences were observed, so that a substantial similarity in the efficacy of these two approaches can be hypothesized, both in the short- and long-term. Non-RCT studies showed a somewhat lower subjective cure rate than objective cure rates for RP-MUS (72.7 % vs. 83.2 %). This outcome may be explained differently. Indeed, the tests used to objectively evaluate cure (mainly clinical stress test) may not reflect normal daily activities and thus underestimate the incidence of recurrent SUI. On the other hand, since in these studies women were followed up for a long time, a recall bias, including the onset of de novo urgency or urge incontinence unrelated to the original procedure, may have led to overestimation of the incidence of recurrent SUI by the patients. Finally, the heterogeneity in the tools used to subjectively evaluate patients across studies may also account for this outcome.

The number of RCTs comparing TVT-O and TOT in the medium-term are limited and objective cure rates were not reported in any of the studies. No differences were seen both in subjective cure rate and composite cure rates. This is in accordance with meta-analyses evaluating studies with shorter follow-up [4, 9], indicating that different transobturator tapes may yield similar results and that these results are maintained in the medium-term. There was no difference between objective and subjective cure rates of TO-MUS in non-RCT studies. This different finding compared with RP-MUS may reflect similar objective and subjective outcomes for transobturator tapes but may also be due either to the shorter follow-up period (lower recall bias) or to the fact that TO-MUS may be perceived as less invasive and dangerous.

No differences were observed between TO-MUS and new minimally invasive tapes. This last outcome is in accordance with a recent meta-analysis [6], which showed that traditional slings have similar efficacy when TVT SECUR is omitted from the analysis. However, other meta-analyses [11, 63] seem to indicate that single-incision slings have lower efficacy than traditional MUS. It is possible that, due to poor results, studies comparing single-incision slings and MUS were stopped earlier than after 36 months of follow-up, so that medium-term studies are those reporting good results for single-incision slings. Moreover, the data used for this meta-analysis are too limited to draw any conclusion and a modified transobturator tape was arbitrarily included in the group of minimally invasive slings (TVT ABBREVO) for its low invasiveness. The two approaches also proved to be relatively safe, with only a limited number of serious complications. The most frequent complication was de novo OAB symptoms for both approaches which, although possibly disturbing, can be treated either with tape division or anticholinergic drugs. The second most frequent complication with TO-MUS was pain, with an incidence of 6 %. All reported episodes, i.e. postoperative, persistent and chronic pain, are included in this analysis and most were postoperative or persistent pain that resolved within weeks or months, with no long-term sequelae. The rate of chronic pain in the groin and/or the thigh is far lower. It must be underlined that not all studies clearly defined postoperative, persistent and chronic pain, so that it was difficult to correctly identify the cases that remained unresolved.

RP-MUS showed a higher rate of bladder perforations, in accordance with the findings of other meta-analyses [1, 3, 5, 8]. The higher incidence of vaginal extrusion with TO-MUS than with RP-MUS was not found in those studies evaluating short-term follow-up. A possible explanation may be that, in the medium-term, RP-MUS are more at risk of developing a vaginal extrusion than RP-MUS. Interestingly, pain did not differ between the two groups, contrary to the findings of other meta-analyses [4, 8]. This finding may be due to the fact that in a number of studies only on chronic pain was reported, which may not be different between the two devices.

As reported by others [9], TVT-O was associated with a significantly lower incidence of vaginal injuries than TOT, but other complications did not differ between the two approaches. There was a trend toward lower vaginal extrusion with TVT-O (OR 0.37), not reaching statistical significance. Overall, these results seem to indicate that both TOT approaches yield similar medium-term safety outcomes.

To our knowledge, only one meta-analysis evaluating long-term outcomes of MUS has been done [10], but the authors included studies with a minimum follow-up of 12 months. As suggested by Hilton [12], medical devices cannot be compared to drugs, so that a follow-up of 12 months cannot be regarded as long-term. Considering that synthetic material is left in the body of the patient, with continuous interplay with the patient’s own tissues, for a study to be considered long-term, it should have a follow-up of at least 5 years. Tan et al. [10] found similar objective and subjective cure rates between RP-MUS and TVT-O, thus suggesting that these outcomes are either maintained over the years or that the efficacy of these devices decreases similarly.

The limitations of this analysis were the exclusion of congress abstracts, the inclusion of unselected patients and the inclusion of nonrandomized trials. Congress abstracts often reports preliminary studies that are subsequently published as peer-reviewed articles, so that very little information is lost by excluding these data. The inclusion of an unselected population (patients with mixed urinary incontinence, and patients with intrinsic sphincter deficiency), as well as the inclusion of nonrandomized, prospective and retrospective studies, are representative of situations we normally observe during everyday practice. It is almost impossible to reproduce the scientific setting in which randomized trials are implemented in our offices and operating rooms, and RCTs do not always reproduce “real-life” situations.

In conclusion, this meta-analysis showed that RP-MUS and TO-MUS have similar objective cure rates in the long-term and medium-term but TOTs have a lower subjective cure rate than TVT. This efficacy is backed by a high safety profile, and by a limited number of complications which were seldom severe. More randomized trials comparing TVT-O and TOT investigating objective cure rates and with a longer follow-up are needed, and further data are needed regarding minimally invasive slings to be able to draw more accurate conclusions.