Introduction

Urinary incontinence (UI) is an important health problem resulting in psychological, social, and hygienic impairment, thus affecting the lives of the patients as well as their families. The prevalence of UI increases significantly with age. Three common subtypes of UI are stress urinary incontinence (SUI), urge urinary incontinence (UUI), and mixed urinary incontinence (MUI). SUI is the most common type of UI, affecting an estimated half of all incontinent women, with a mean prevalence of 4.7 % [1, 2]. A systematic review estimated that the prevalence of SUI during pregnancy was up to 41 % and increased with gestational age [3].

SUI is associated with quality of life (QoL), and women experiencing SUI have signs of a lower QoL. Treatments for women with SUI are designed to improve symptoms and incontinence-related QoL. Clinical trials are designed to evaluate treatments, and it is therefore important to measure treatment-related change, not only in symptoms but also in QoL. Therefore, in clinical trials of such treatments, the inclusion of a measure of QoL is particularly important.

In the past several decades, a variety of questionnaires for measuring the impact of SUI on QoL have been developed and tested. Subjective QoL results of SUI using condition-specific QoL questionnaires might differ because there are a plethora of measurement instruments that vary in terms of their scope and content. The use of a standardized outcome measurement for incontinence-specific QoL should be strongly encouraged in order to combine and compare results of trials. The standardization committee of the International Continence Society (ICS) suggested that reliable and sensitive QoL questionnaires should be used in evaluating treatments for SUI. However, they did not recommend the use of specific QoL measures, nor did they give specific guidance on the best way to select measures [4].

Trials on SUI and QoL are scattered and inconsistent and vary widely in the QoL measurement, which limits comparing and combining data from studies with different measures. Therefore, despite the large amount of research, a meta-analysis of the trials on QoL studies in women with SUI is still lacking. Furthermore, several midurethral sling (MUS) procedures, such as tension-free vaginal tape (TVT), TVT obturator (TVT-O), tension-free vaginal tape SECUR (TVT-S), and pubovaginal sling (PVS), have been used for the treatment of female SUI; however, which method is best for improving patient QoL is not known. The first aim of the present study was to collect all clinical randomized controlled trials (RCTs) conducted in women with SUI and to critically discuss the measurement and evaluation of QoL in women with UI. Therefore, we can provide information that is likely to be helpful in both choosing the appropriate QoL instruments for clinical trial research and combining and comparing published studies. The second aim of study was to compare different midurethral sling procedures on QoL improvement over short-term and long-term follow-ups using measurement instruments that vary in terms of their scope and content.

Materials and methods

Search strategy and selection criteria

We searched MEDLINE (January 1966 to March 2015), EMBASE (January 1988 to March 2015), and the Cochrane Incontinence Group Specialised Register (March 2015), which contains trials identified from the Cochrane Central Register. Search terms consisted of the following key words: “stress urinary incontinence”, “SUI”, “quality of life”, “QoL”, “health-related quality of life”, and “HRQoL”. The reference lists of relevant articles were searched for other possible relevant studies. We adapted our search strategy to suit each database.

Only RCTs that reported the improvement in QoL of women with SUI were eligible in this meta-analysis. When a study reported the results from different subpopulations, we treated them independently. Studies involving women with urethral hypermobility, intrinsic sphincter deficiency, and mixed incontinence with a predominantly stress component were included. Quasi-randomized studies, prospective observational studies, and retrospective reviews were excluded. Studies that did not provide useable data, had duplicate results or overlapping data, or had fewer than 50 participants were excluded. When there were multiple publications from the same population, only the one with largest sample size or the most complete follow-up data was included. Two of the authors independently performed the literature search and screen. Any disagreements were resolved by discussion or involvement of a third author. Figure 1 shows the flow diagram for selecting studies for this meta-analysis.

Fig. 1
figure 1

PRISMA 2009 flow diagram for literature search and study selection. PRISMA diagram showing the different steps of systematic review, starting from literature search to study selection and exclusion. At each step, the reasons for exclusion are indicated. doi:10.1371/journal.pone.0052562.g001

Data collection and methodological quality assessment

Two of the authors examined titles, abstracts, and articles independently with identical case definitions, data abstraction forms, and selection criteria. Disagreements were resolved by discussion or involvement of a third author. In addition, attempts were made to contact the authors for any clarification or missing data. The following data were extracted from the eligible studies: characteristics of studies (setting, location, and study design), patient characteristics, inclusion criteria, medical treatment, follow-up procedures, and QoL measures before and after medical treatment.

The authors assessed the methodological quality of the trials’ focuses on allocation concealment, blinding, incomplete outcome data, and selective outcome reporting with the Cochrane Collaboration’s risk-of-bias method [5]. The results of the quality assessment analysis of the trials are presented in Fig. 2.

Fig. 2
figure 2

Risk-of-bias graph

Types of slings procedures

Sling procedures included single-incision slings and standard midurethral slings, retropubic tension-free vaginal tape (RP-TVT), and transobturator tension-free vaginal tape (TO-TVT). Single-incision slings included TVT-Secur (Gynecare, Bridgewater, NJ, USA), U-type single-incision slings, and H-type single-incision slings. TO-TVT was further subdivided by type of transobturator sling into inside-out (TVT-O), outside-in (TOT), modified TVT-O (scheduled to undergo the same surgical procedure using the modified less invasive technique), biological material TOT, and synthetic material TOT.

Primary outcome

The primary outcomes were the features and frequency in clinical trials of a range of incontinence-specific QoL questionnaires and QoL improvement measured and evaluated using these QoL questionnaires. These questionnaires included multidimensional questionnaires and single questions or single items, which covered different scope and content including distress caused by incontinence symptoms, sexual function in women with SUI, post-operative pain, severity, and improvement of symptoms.

Statistical analysis

We calculated risk ratios (RRs) for binary outcomes and weighted mean differences (WMD) for continuous outcomes. We used the Chi-square test and I 2 scores (I 2 > 50 % was regarded as substantial heterogeneity) to assess the degree of statistical heterogeneity [6]. A random effects model was used to aggregate individual effect sizes in order to take into account the heterogeneity of the risk estimates and to provide more conservative estimates compared with the fixed effects model [7]. We investigated potential sources of heterogeneity with subgroup analyses according to follow-up duration (months), disease severity (with urethral hypermobility, with intrinsic sphincter deficiency, and with a mix of urethral hypermobility and intrinsic sphincter deficiency), disease type (with and without mixed incontinence), and mean age of participants in clinical trials.

To establish the robustness of the outcome by sensitivity analyses, we applied a fixed effects model, used the trim-and-fill method, and excluded studies with fewer participants. All analyses were performed using the software STATA version 12.0 (StataCorp LP, College Station, TX, USA) and Review Manage 5.1. All P values were two-sided. A P value <0.05 was considered to be statistically significant.

Results

Characteristics of eligible studies

Figure 1 illustrates the flow of literature through the search and assessment process. We retrieved 962 candidate articles from all searches. A total of 31 RCTs met the inclusion criteria and were included in the meta-analysis [838]. Table 1 shows the characteristics of the trials. The means (SD) age of the participants was 55.74 (5.449) years old, and the mean (SD) longest duration of follow-up was 19.34 (15.90) months.

Table 1 Characteristics of included studies

The risk of bias was assessed using a risk-of-bias graph (Fig. 2). Most RCTs had good sequence generation, allocation concealment, and incomplete outcome data; however, reporting of blinding methods and selective reporting in most RCTs were generally poor.

Condition-specific instruments to assess QoL in women with SUI

A total of 13 different condition-specific instruments were identified in this meta-analysis. In our study, the Urogenital Distress Inventory (UDI), Incontinence Impact Questionnaire (IIQ), and Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire (PISQ-12) were the most frequently used in measuring QoL among women with SUI. The most frequently used combination of incontinence-specific QoL measures was the IIQ with the UDI (reported in 13 trials). The details of these reported condition-specific questionnaires are presented in Table 2.

Table 2 Incontinence-specific questionnaires reported (among the papers which reported using such a measure, n = 31)

Summary of main results

Five RCTs, which used the same questionnaire (PISQ-12), reported the comparison of single-incision slings and TO-TVT on sexual function; improvement in the PISQ-12 score was significantly higher in the single-incision slings group than in the TO-TVT group (WMD 1.06; 95 % CI 0.58–1.54) over a mean follow-up of 12 months (Fig. 3a). The mean pain visual analogue scale (VAS) score during the first three post-operative days in the single-incision slings group was significantly lower than that in TO-TVT group (WMD −1.59; 95 % CI −2.28 to −0.89) (Fig. 3a). Five RCTs compared single-incision slings and TO-TVT with a patient-reported success rate defined as “very much improved/much improved” on the PGI-I, and we did not detect a difference between the two groups over a mean follow-up of 12 months (Fig. 3a). Only one study compared single-incision slings and TO-TVT with the KHQ; the improvement in total KHQ score was significantly lower in the single-incision slings group than in the TO-TVT group (WMD −3.09; 95 % CI −5.58 to −0.60) (Fig. 3a). We did not detect differences in total score improvement with the UDI, IIQ, Incontinence Questionnaire Short Form (ICIQ-SF), or the International Consultation of Incontinence Questionnaire–Frequency of Lower Urinary Tract Symptoms (ICIQ-FLUTS) between the single-incision slings group and the TO-TVT group (Fig. 3a).After excluding RCTs that evaluated TVT-Secur, single-incision mini-slings (SIMS) still had significantly improved PISQ-12 sexual function scores (WMD 0.95; 95 % CI 0.45–146).

Fig. 3
figure 3figure 3figure 3

Meta-analysis results across all continuous outcomes (WMD). TVT tension-free vaginal tape, TVT-Secur single-incision TVT, TO-TVT transobturator TVT, RP-TVT retropubic TVT, TVT-0 inside-out TO-TVT, TOT outside-in TO-TVT, UDI Urogenital Distress Inventory, IIQ the Incontinence Impact Questionnaire. VAS visual analogue scale, KHQ King’s Health Questionnaire, PISQ-12 Pelvic Organ Prolapse/Incontinence Sexual Function Questionnaire, ICIQ-SF Incontinence Questionnaire Short Form

Six RCTs compared TVT and TO-TVT with the UDI; our results showed that TO-TVT-treated patients had significantly greater reductions in UDI (WMD 2.28; 95 % CI 1.77–2.80) than TVT-treated patients over a mean follow-up of 12 months (Fig. 3b). Six RCTs compared TVT and TO-TVT with the IIQ; our results showed that TO-TVT-treated patients had significantly greater reductions in IIQ (WMD 0.89; 95 % CI 0.26–1.52) than TVT-treated patients over a mean follow-up of 12 months (Fig. 3b). Three RCTs compared TVT and TO-TVT with the pain VAS score; we did not detect a difference between the two groups (Fig. 3b).

Four RCTs compared RP-TVT and TO-TVT with the UDI; the reduction in the total UDI score was significantly higher in the RP-TVT group than in the TO-TVT group (WMD −1.00; 95 % CI −1.65 to −0.35) (Fig. 3c). Four RCTs compared RP-TVT and TO-TVT with the IIQ, and no difference was detected between the two groups. Only two RCTs compared TVT and TO-TVT with the pain VAS score, and there was no difference between the two groups (Fig. 3c).

The number of RCTs that compared TVT-O and TOT was small. Of this small number, only two RCTs reported results with the KHQ score, two reported results with the PISQ-12, two reported results with the PGI-I, and one reported results with the VAS. No differences in the improvement of QoL were detected between these groups (Fig. 3d).

Only one RCT compared biological TOT and synthetic TOT, modified TVT-O and TVT-O, U-type single-incision slings and H-type single-incision slings, and there were no differences in improvement of QoL detected between these groups (Fig. 3e).

The meta-analysis results across all binary outcomes, for example having 18-point improvement in KHQ total score, success defined as very much/improved by PGI-I, having 20-point increase in I-QOL total score, having eight-point increase in VAS total score, were presented in Fig. 4, and there were no difference between TVT-Secur and TO-TVT, TVT-O vs TOV, TVT-Secur and RP-TVT on these binary outcomes.

Fig. 4
figure 4

Meta-analysis results across all binary outcomes (RR). TVT tension-free vaginal tape, TVT-Secur single-incision TVT, TO-TVT transobturator TVT, RP-TVT retropubic TVT, TVT-0 inside-out TO-TVT, TOT outside-in TO-TVT, UDI Urogenital Distress Inventory, I-QoL the Incontinence Quality of Life questionnaire, PGI-I Global Impression of Improvement questionnaires, VAS visual analogue scale, KHQ King’s Health Questionnaire

Subgroup analysis

We performed a subgroup analysis of all RCTS that used the UDI by follow-up duration (months) due to a high level of heterogeneity (I 2 = 83.9 %, P < 0.001). Analysis of total UDI scores showed a significantly higher improvement in TO-TVT-treated patients than in TVT-treated patients after a long-term follow-up (>30 months) (WMD 2.80; 95 % CI 1.56–4.04), but after a short-term follow-up (12–15 and 6 months), no significant differences were detected between the two groups (Fig. 5).

Fig. 5
figure 5

Subgroup analysis of the decrease in UDI score compared TVT with TO-TVT according to follow-up duration (months)

Subgroup analysis by disease severity and disease type showed that the improvement in sexual function score (as assessed by the PISQ-12) was significantly higher in the single-incision sling group than in the TO-TVT group for patients with urethral hypermobility (WMD 2.10; 95 % CI 0.52–3.68), with mixed incontinence (WMD 1.00; 95 % CI 0.44–1.56), and without mixed incontinence (WMD 1.23; 95 % CI 0.28–2.18); however, no trial reported the results among patients with intrinsic sphincter deficiency. The post-operative pain VAS scores in the single-incision sling group was significantly lower than that in the TO-TVT group among both patients with mixed incontinence (WMD −1.59; 95 % CI −2.28 to −0.89) and without mixed incontinence (WMD −0.66; 95 % CI −1.01 to −0.32). Only one trial reported the results among patients with mixed urethral hypermobility and intrinsic sphincter deficiency, and there was no evidence of a significant difference in post-operative pain VAS scores between both groups (Table 3).

Table 3 Subgroup analysis results across all outcomes according to disease severity and disease type

TO-TVT-treated patients had significantly greater reductions in total UDI scores than TVT-treated patients in the subgroup of urethral hypermobility (WMD 1.95; 95 % CI 0.59–3.32); in the other subgroups, there was no significant difference in the improvement of total UDI scores between both groups (Table 3).

Evaluation of publication bias

We assessed funnel plot asymmetry using Egger’s linear regression test. The results showed that there was no publication bias in all main outcomes and subgroup analyses (P > 0.05).

Discussion

Health-related QoL in women with SUI is increasingly considered an essential outcome for clinical trials and patient management. During the past several decades, a variety of incontinence-specific questionnaires for measuring the impact of SUI on health-related QoL have been developed and tested. Subjective QoL of women with SUI using incontinence-specific questionnaires might differ because there are excessive measurement instruments that vary in terms of their scope and content. In this meta-analysis, we identified 13 different condition-specific instruments from 31 RCTs of women with SUI. Our results revealed that the UDI and IIQ were the most frequently used tools for measuring QoL among women with SUI, and the two instruments were commonly combined, as in 13 of the trials. The UDI is used to evaluate the distress or “bother” caused by incontinence symptoms. The IIQ uses approximately 30 activities to evaluate the impact that UI has on these activities [39]. The two instruments both had good clinical face validity and measurement properties and were the most appropriate for use in measuring the QoL among women with SUI [40]. The PISQ-12 is often used to complement these QoL measures by evaluating sexual function in women with pelvic organ prolapse and/or SUI in clinical and research settings [41]. The short forms of the IIQ (IIQ-7) [42] and UDI (UDI-6) [42] are becoming more common in order to reduce the burden of questionnaires on patients; in this study, only 2 RCTs (25, 37) reported the results of the long forms of the IIQ (14.29 %) and UDI (13.33 %). Therefore, we recommend that researchers consider using the short forms of the IIQ (IIQ-7) and UDI (UDI-6) with or without the PISQ-12 as their first choice of QoL measurement in trials of incontinence treatments.

The Incontinence Quality of Life questionnaire (I-QoL) evaluates the effects of UI in the three domains of avoidance and limiting behaviour, social embarrassment, and psychosocial impact, and it was endorsed by the International Consultation on Incontinence [43]. The I-QoL is available in a number of languages and has been shown to be a reliable and valid measure of QoL suitable for use in a variety of international settings [44]. Although only 5 of 31 RCTs in this study reported the results of the I-QoL, we still recommend the I-QoL as an important continence-specific measure of QoL in a clinical trial setting. Despite the availability of QoL questionnaires, new questionnaires continue to appear; for example, the Incontinence Outcome Questionnaire (IOQ) [45] and International Consultation on Incontinence Questionnaire Short Form (ICIQ-SF) [46] were also reported in this meta-analysis. The reasons for developing additional questionnaires are not clear but could be related to the limited range of languages available in previously developed questionnaires. However, we recommend translation of the commonly used instruments, for example, the UDI-6, IIQ-7, I-QoL, into different languages as the best choice.

Midurethral slings have been shown to be effective in treating female SUI. However, associated adverse events include bladder and bowel injury, groin pain, and haematoma formation. In an effort to maintain efficacy while eliminating some of the side effects, a new generation of slings has been developed, called “single-incision slings” or “mini-slings”. Our meta-analysis revealed that single-incision slings were associated with significantly higher improvement in sexual function and lower post-operative pain than standard MUS. A meta-analysis performed by Mostafa [47] was inconsistent with our results, which did not detect significant differences in total PISQ-12 scores between SIMS and standard MUS. Two recently published RCTs were added into our analysis, so our results might be more reliable [9, 10]. Furthermore, after excluding RCTs evaluating TVT-Secur (one type of single-incision sling) [10], SIMS still had significant improvement in PISQ-12 sexual function scores. The result that single-incision slings are associated with significantly lower post-operative pain was identified by another meta-analysis [48]. Meanwhile, distress caused by incontinence symptoms (evaluated by the UDI), the impact of SUI on physical, social, and emotional activities (evaluated by the IIQ), the frequency and severity of symptoms (evaluated by the ICIQ-SF), and the subjective success rate (evaluated by the PGI-I) were all not significantly different between the two groups. Only one study [19] reported that midurethral slings were associated with higher improvement on the KHQ total scores than single-incision slings, but this result should be interpreted with caution due to the insufficient research data.

TVT was the first synthetic MUS introduced, and prospective studies have shown long-term cure rates >77 % with TVT for SUI [49]. TO-TVT was introduced to minimize the complications of the retropubic slings, which include injury to the bladder, major vessels, and bowel. TO-TVT has shown similar safety and efficacy to TVT in a meta-analysis [50]. In our meta-analysis, TO-TVT had significantly greater reductions in total UDI scores and total IIQ scores than TVT, but no difference was detected on the post-operative VAS pain score. Considering the high level of heterogeneity among the trials, we performed a subgroup analysis by follow-up duration. Interestingly, the long-term efficacy of the TO-TVT procedure was superior to that of TVT in terms of reducing the distress caused by incontinence symptoms, but the short-term efficacy was not significantly different between the two groups. When compared with RP-TVT, the TO-TVT procedure was significantly associated with a higher reduction in the distress caused by incontinence symptoms, but there were no significant differences in other incontinence-specific QoL measures (IIQ, VAS, UISS). Other comparisons were also performed in this meta-analysis, including biological TOT versus synthetic TOT, modified TVT-O versus TVT-O, and U-type single-incision slings versus H-type single-incision slings, and we detected no differences in improvement of QoL between these groups. However, these results should be interpreted with caution due to the small number of trials.

This meta-analysis had a number of strengths. RCTs are the gold standard in the assessment of surgical interventions. Systematic reviews based on RCTs are the cornerstone of evidence-based medicine because they can reduce bias and random errors and provide the highest quality of evidence. Furthermore, we included only RCTs with more than 50 participants, which improves the efficiency of the work without an appreciable loss of power.

This review also had several limitations. First, some RCTs included in this meta-analysis reported results using the median and range, and we estimated standard deviations (variances) using the formula range/4 for moderately sized samples (15 < n ≤ 70) and the formula range/6 for larger samples (n > 70), which may affect the pooled results. Second, lack of blinding in the RCTs can be a source of bias. The assessors’ blinding methods were poorly documented, and only four RCTs reported detailed blinding procedures. Blinding of participants was also poor (8 of 31 RCTs).

Conclusions

Our meta-analysis indicated that consistent use of the short forms of the IIQ (IIQ-7) and UDI (UDI-6) with or without the PISQ-12 might promote options for comparisons between trials. We can conclude that the single-incision slings procedure is superior to TO-TVT in terms of improving sexual function and lowering post-operative pain scores. The RP-TVT procedure is superior to TO-TVT in terms of reducing distress caused by incontinence symptoms. The long-term efficacy of the TO-TVT procedure was superior to that of TVT in terms of reducing the distress caused by incontinence symptoms, but the short-term efficacy was not significantly different between the two groups.