Introduction

Vesicovaginal fistula is a preventable condition affecting millions of women worldwide, mostly in developing nations, a result of lack of access to emergency obstetric care. Surgery is the primary treatment with reported success rates ranging from 65–100% [15]. Various obstetric fistula (OF) classification systems exist, with none compared in head-to-head analysis of utility in predicting successful surgical repair. Ideally, classification systems provide clinicians and patients with important predictive information and facilitate triage, effective communication between surgeons, and meaningful comparison of treatment techniques.

Currently, there are at least 12 OF classification systems [6]. Most evaluate size, degree of vaginal scarring, location, and organs involved (bladder, rectum, ureter, uterus). Few, if any, of the systems are supported by outcome-based studies. The two most commonly used systems are Waaldijk’s and Goh’s classifications [7, 8]. The Waaldijk’s system categorizes vesicovaginal fistula by involvement of the urethra, its closing mechanism, and fistula size [7]. Goh’s system classifies based on distance from the external urethral meatus, fistula size, and severity of vaginal fibrosis [8] (Fig. 1). This study compares the Waaldijk’s and Goh’s obstetric fistula classification systems abilities to predict successful surgical fistula closure.

Fig. 1
figure 1

Waaldijk’s and Goh’s classification systems

Methods

This is a prospective collection of data on 202 patients evaluated for obstetric fistula at the General Referral Hospital of Panzi, Bukavu, DRC from April 2009 through December 2009. Approval for the study was obtained from the director of the fistula program, as there was no ethics committee at the hospital at the commencement of the study.

Preoperative evaluation of patients used a standardized form in English and French, completed by one of each of the authors, that included demographic and fistula classification using both systems. Staging was completed by one of each of the authors. Classification occurred at initial outpatient examination and confirmed in the operating room during exam under anesthesia. Fistula repair included vaginal or abdominal techniques at the discretion of the surgeon. Postoperative fistula ward care occurred per established hospital protocol, including transurethral catheter for a minimum of 14 days. On postoperative day 14, the catheters were removed on morning rounds followed by spontaneous voiding trial for an additional 2 days to ensure proper bladder emptying. A closed fistula was considered a successful repair. Total length of stay at the hospital grounds varied according to transportation availability. Patients reporting urinary leakage during the postoperative period were reevaluated by the same surgeon in the clinic or operating room to assess for persistent/recurrent fistula or residual urinary incontinence.

Statistical analyses were performed using SAS/STAT Version 9.2 (SAS Institute Inc., Cary, NC). Multiple logistic regression models were used to estimate odds ratios (OR) and 95% confidence intervals (CI) comparing odds of successful fistula closure among groups of patients with various demographic, preoperative, and surgical characteristics. Receiver operating characteristics curves (ROC) were generated from these logistic regression models, and areas under the ROC curves were estimated and compared using PROC LOGISTIC. Specifically, areas under the ROC curves were used to compare the abilities of the Waaldijk’s and Goh’s classification systems to predict successful fistula closure. All hypothesis testing was conducted at the 0.05 level of significance.

Results

Of the 215 patients presenting for evaluation, 102 (52.0%) patients reported fistula after vaginal delivery, 77 (39.3%) after a Cesarean section, and 13 (6.6%) after a vacuum-assisted vaginal delivery (VAVD), all subsequent to prolonged, severely obstructed labor. Thirteen patients with fistula after gynecologic procedure were excluded from analysis. Of those included, 180 had vesicovaginal fistula, 7 had ureterovaginal fistula, 12 had uterovaginal fistula, and 16 had rectovaginal fistula, of which 6 were isolated rectovaginal that were not included in classification analysis. Mean (+SD) age at presentation was 31.14 ± 13.22 years, and mean age at first delivery was 19.0 years. Patient characteristics including length of labor, marital status, and surgical variables are shown in Table 1. In addition, 19 patients reported a history of sexual violence, none of which directly caused the vaginal fistula. Median follow-up time after repair was 14 days (range 7–43 days). At the time of longest follow-up, 176 patients (89.8%) had a successful fistula closure, 23 (11.7%) failed repair, and 12 had missing data. Thirty-two (15.8%) patients demonstrated residual urinary incontinence.

Table 1 Preoperative and surgical characteristics

In multivariate analysis, individual demographic, preoperative, and surgical characteristics were evaluated followed by analysis and comparison of the two classification systems. In covariate analysis, neither mode of delivery nor surgical approach (abdominal vs. vaginal approach) was associated with successful closure. However, higher parity and first and second repair were both significant predictors of successful closure (Tables 2 and 3). Primary and secondary repairs were more likely to be successful (OR = 3.54, 95% CI 1.12–11.21, p = 0.0314), and parous women were more likely to have successful closure (OR = 3.19, 95% CI 1.28–7.96, p = 0.0131).

Table 2 Effect of mode of delivery and parity on outcome
Table 3 Surgical characteristics

Predicted probabilities were determined for each combination of outcomes for both classification systems (Tables 4 and 5). In addition, modeling of both systems including all covariates and classification components was done separately for each system and then compared to each other. For Goh’s system, a designation of type 1 or 2 (distance from the external urethra meatus) is significantly more predictive of a successful closure than a designation of type 4. In Waaldijk’s systems, there is no individual parameter which demonstrated prediction of successful closure. In multivariate analysis, both parity and number of previous repairs significantly improved the predictive ability of successful fistula repair only for the Waaldijk’s system. None of the covariates improved the predictive ability of the Goh’s system. In the final model, the Goh’s classification system demonstrated significantly better prediction of successful closure than the Waaldijk’s system, p = 0.0421 (Table 6).

Table 4 Goh’s prognostic probabilities
Table 5 Waaldijk’s prognostic probabilities
Table 6 Goh vs. Waaldijk

Discussion

From the earliest days of fistula repair, surgeons have classified fistulae, mostly including anatomic description and size. Originally, Sims suggested a simple anatomic classification of four types [9]. This was expanded upon by Mahfouz, who described ureteric fistula and Moir who included circumferential fistulae [10]. These initial descriptive papers provided no correlation to treatment outcome. One of the first papers to describe both a classification system and outcome, as defined by closure and continence, was by McConnachie [11]. In this system, fistulae were given a “grade” based on scarring, surgical access, and sphincter involvement as well as a “type” defined on size. Results demonstrated a lower cure rate with increasing grade or greater tissue scarring and sphincter involvement. McConnachie recommended that this system be used to select surgical technique, anticipate prognosis, and compare between different centers. Subsequent publication by Bird utilized Moir’s system to report outcomes, reporting an association between urethral involvement and failed closure as well as persistent incontinence in patients successfully closed, without fistula recurrence [12, 13].

In 1968 and 1969, two additional classification systems were published by Lawson and Hamlin and Nicholson, respectively, both introduced new terminology [14, 15]. Lawson’s system is based on four types, including “juxta-urethral” and “juxta-cervical,” reporting a higher rate of closer among juxta-cervical compared to juxta-urethra. Hamlin’s system described six types of fistula, introducing “simple” and “difficult” vocabulary to infer complexity of the surgery required to treat the fistula. These new terms were not clearly defined and therefore subjective, as a simple fistula to one surgeon may be difficult to another. In addition, a large fistula may involve both the urethra and the cervix, making a designation of “juxta-urethral” or juxta-cervical” an individual assessment.

In response to the lack of standardized terminology and a system that reliably demonstrates outcome prediction, both Waaldijk and Goh proposed classification systems that incorporated these important parameters. Waaldijk’s system is based on a retrospective analysis of 775 patients evaluated and repaired by the author [7]. The classification system is organized according to anatomic location and surgical approach/results (Table 1). Closure rates and residual incontinence rates decrease as the fistula type increases from type I through type IIBb. Goh’s system utilizes fixed reference points (type 1–4), size of fistula (a–c), and other influential parts such as vaginal length, scarring, and previous attempt at repair (i–iii) (Table 1). In a prospective study of 987 women, Goh et al. demonstrated the ability of the classification system to both predict successful closure and subsequent continence, with type 1, sizes b and i, more likey to be closed as well as continent after treatment [8]. Given the comparative strength of these two systems in contrast to previous examples, they are the two most widely used by contemporary fistula surgeons.

Thus far, the obstetric fistula literature lacks an objective comparison of classification systems. Recent articles by Arrowsmith and Goh et al. highlight the need for a standardized outcome-based classification system with well-defined terminology [6, 16]. This study is the first to address to prospectively compare the two most utilized OF classification systems with regard to treatment outcome. Analysis includes additional potential confounding points not included in either system that carried the potential to confound the results of either system as a whole.

Of the covariates, higher parity was significantly associated with successful closure. These findings are consistent with a recent publication by Muleta et al. of 14,928 women in which multiparous patients had smaller fistulas, less involvement of the urethra or bladder neck, less scarring, more successful repairs, and less residual incontinence after repair [17]. Several other publications echo these findings [6, 1820]. We also found a significant difference in the rate of successful closure between primary and secondary repairs compared to three or more, consistent with prior studies verifying that initial repair is the optimal setting for greatest likelihood of successful OF closure [5, 1824].

This data set further reports the predictive probability of closure for each combination of components for the two classification systems compared. In Goh’s original paper, the predictive probability was reported for each component separately. In contrast, our model includes the weight of each component, demonstrating probabilities that are slightly lower. This is likely due to inclusion of other components, reflecting a more accurate percentage and/or due to the smaller number of patients in this data set. Similar findings occurred with Waaldijk’s system, with probabilities slightly lower. The predictive probability table provides a practical clinical tool to predict successful closure using either of these two systems (Tables 4 and 5).

When analyzing Waaldijk and Goh separately, findings in this data set are consistent with previous reports, wherein Goh’s system type 4 predicted failed closure [8]. These data also support previous observations that size and extent of tissue induration and fibrosis influence repair success. However, in the Waaldijk’s system, these data did not demonstrate an independent variable for predicting repair success or failure.

This study is the first to directly compare two classification systems in the same OF patient group. In this data set, the Goh’s system demonstrated significantly better prediction of successful OF closure than did the Waaldijk’s system. This difference may be due to the focus on urethral involvement and closure function in the Waaldijk’s system whereas the Goh’s system includes size, vaginal induration, fibrosis and scarring, and bladder capacity.

There are several limitations to our study. First is the relatively small sample number of patients and short length of follow-up. Additionally, the statistical difference may not correlate to a clinically relevant difference when the systems are used in the field. Finally, this study used fistula closure as the outcome measure for systems comparison and not residual incontinence, an issue of equal relevance to OF patients and surgeons. Among OF patients successfully closed, 15.8% suffered persistent urinary incontinence. This data set was too small to accurately include residual incontinence in closed fistula as a secondary outcome measure. In order for one system to be universally adopted, a larger study including multiple OF treatment centers with longer follow-up is needed.

In conclusion, Waaldijk and Goh are the two most commonly used obstetrical fistula classification systems. Independently, the variable of primiparity is correlated with less successful fistula repair. When compared, Goh’s classification system has a significantly better ability to predict OF closure than the Waaldijk’s system. Ideally, this information will be followed by a larger study in multiple centers with longer follow-up, in order to unify fistula surgeons with a single classification system with the best prognostic function, thereby facilitating standardization of terminology, accurate comparison of surgical techniques and OF centers, improved triage, and optimize patient counsel and consent.