Introduction

Anastomotic leakage (AL) remains the most challenging complication following colorectal resection, with considerable associated morbidity and/or mortality. Despite the introduction of improved stapler devices and minimally invasive techniques, the symptomatic AL rate is 5–19%, depending on the site and type of anastomosis [1].

As a result, surgeons have developed several intraoperative anastomotic leak tests (IALT) to try to detect anastomotic defects and reduce the incidence of AL. However, there are little data or consensus on which techniques are best. Lazorthes and Chiotassol highlighted the potential benefit that intraoperative air leak testing (ALT) could have in reducing AL following colorectal resection in the 1980s [2].

Since then, ALT has remained the most popular intraoperative technique for identifying AL. In recent decades, other tests have been proposed including direct anastomotic assessment with intraoperative colonoscopy (IOC) or the use of indocyanine green (ICG) to assess integrity of anastomotic blood supply [3]. Both ALT and IOC methods are based on the macroscopic evaluation of the integrity of the anastomosis, while ICG allows real-time angiography during surgery to evaluate the perfusion of the anastomosis. Specifically, this technique is based on direct visualization of the fluorescence emitted by ICG under near-infrared light that can be identified using dedicated cameras and scopes [4].

Recently, several studies have reported promising results with ICG and reduced AL rates [5]; however, there remains little consensus on its routine use. The aim of this network meta-analysis was to determine which IALT was most effective in reducing AL.

Materials and methods

Search strategy

A systematic review was performed according to the guidelines from the preferred reporting items for systematic reviews and meta-analyses checklist (PRISMA-NMA) [6]. Institutional review board approval was not required. According to the recommendations made by Goossen et al, we conducted an electronic systematic search using MEDLINE, CENTRAL and Web of Science [7]. We searched for papers published in English between January 1, 2000 and, October 1, 2019. We used the following search headings: IALT, anastomotic leak, colorectal resection, colorectal surgery, intraoperative colonoscopy, indocyanine green fluorescence, laser fluorescence angiography. All titles were initially screened and suitable abstracts were reviewed. Finally, each of the eligible publication reference section was also checked for other potential papers (Fig. 1).

Fig. 1
figure 1

Network geometry for studies reporting symptomatic AL in Control, ALT, IOC, and ICG groups. Solid lines indicate direct comparisons and dotted lines indirect comparisons

Study inclusion criteria

According to Stroup et al. recommendations only randomized controlled trials (RCTs) or non-controlled clinical trials (non-RCTs) were included in this study [8]. To be included, the articles had to meet the following criteria: (a) patients have had colorectal surgery with an anastomosis; (b) studies have compared one or more IALTs; (c) and studies have clear research methodology.

Data extraction

The following data were retrieved from the selected publications and entered into a data set independently by two investigators (LT and ER). Recorded details included author information, study details; year, country, design, and patients details; gender, age, body mass index (BMI). In addition, primary diagnosis, type of surgery, type of IALTs, and AL rate were also recorded. Data entered were only compared at the end of the reviewing process to reduce the selection bias. A third author (MK) reviewed the database, with duplicates being erased and discrepancies clarified.

Study quality assessment

Assessment of risk of bias of included studies was performed using the Risk of Bias In Non-randomized Studies—of Interventions (ROBINS-I) tool [9], which categorises risk of bias as low, moderate, serious, critical and unclear, with the risk of bias category for each study being reported. If a study’s risk of bias was categorised as serious, critical or unclear, the effect of removing this study was tested and the relevant outcome(s) reported.

The methodological quality of the selected RCTs was appraised using the Cochrane risk-of-bias tool [10]. Trials were graded as follows: L = low risk, H = high risk, U = unclear risk. The quality of all included studies is depicted in Table 1.

Table 1 Quality assessment of the included studies

Clinical leak

According to Rahbari et al., AL was defined as a defect of the intestinal wall at the anastomotic site leading to a communication between the intra- and extraluminal compartments. Severity of AL was graded as: (a) no changes in patients’ management required, (b) requiring active intervention, but not relaparotomy, and (c) requiring relaparotomy [11]. Eight studies [12,13,14,15,16,17,18,19] included in this network meta-analysis defined ALs on both clinical and radiological bases, two studies [20, 21] did not specifically report it, and one study [22] defined it only on clinical terms.

Outcomes of interest

Outcomes of interest were AL rate in the control group (no IALT), versus AL in patients who had ALT, IOC, and ICG testing.

Statistical analysis

We performed fully Bayesian arm based random effect network meta-analysis, in particular mixed treatment comparison. Briefly, the network meta-analysis simultaneously synthesizes data from all available trials within a consistent network and combines direct evidence (comparison of treatments within head-to-head trials) with indirect evidence (comparison of treatments across trials against a common comparator) [23].

We preferred the Bayesian approach because that takes into account all sources of variation and reflects these variations in the pooled result. Furthermore, the Bayesian approach can provide more accurate estimates for small samples. An ordinary consistency model was adopted with the binomial/log model as likelihood was used. Non-informative priors distribution included in this analysis were Normal (0, 1000) for log of Relative Risk (RR) and relative effects, Gamma (0.001, 0.001) distribution for random effect precision. Pairwise comparison were performed using unrelated mean effects model [24]. To provide valid indirect inferences we considered the transitivity assumption (i.e. studies comparing different sets of interventions needed to be sufficiently similar). To assess transitivity, we generated descriptive statistics and we compared the distributions of baseline participant characteristics across studies and treatment comparisons. We assumed a common heterogeneity parameter across the various treatment comparisons. To evaluate statistical heterogeneity, we calculated between-trial variances and I2–index, assuming a common estimate for the heterogeneity variance across the different comparisons. I2–index value of 25% was defined as low heterogeneity, 50% as moderate heterogeneity, and 75% as high heterogeneity [25]. To assess local inconsistencies we used the nodesplitting method [26].

The inference was performed using mean and relative 95% credible intervals (Crl), based on draws from marginal posterior distribution in Monte Carlo Markov chain, simulating 350,000 iterations after a burn-in period of 30,000 iterations. We consider the estimated parameter significance when its 95% Crl encompass null hypothesis value. Sensitivity analysis regarding the choice of prior distribution of random effect precision was considered.

Model convergence was assessed by analysing history, running means density and Brooks-Gelman-Rubin diagnostic plots In addition, autocorrelation plots were assessed to detect the presence of auto-correlation in the chains [27]. We plotted rank probabilities against the possible ranks for all competing treatments. All statistical analyses were carried out using Jags [27] and R-Cran [28].

Results

Literature search and study characteristics

One thousand three hundred and sixty-two publications were found using the search criteria. After removing duplicates, 172 publications were further examined. Further screening revealed that only 11 articles [12,13,14,15,16,17,18,19,20,21,22] met the predefined inclusion criteria. Six studies were prospective [12, 14,15,16, 20, 21], 4 were retrospective [17,18,19, 22], and 1 study was a randomized control trial [13] (Fig. 2).

Fig. 2
figure 2

Flowchart of studies retrieved from literature search

Patient characteristics and follow-up

Three thousand eight hundred and forty-four patients were included in this analysis. One thousand seven hundred and sixty-eight patients (46%) were in the control group (no IALT), 1253 patients (32.5%) had ATL, 555 patients (14.5%) had ICG assessment, and 268 patients (7%) had IOC assessment. The gender of 3231 patients was specified; 1989 (61.5%) were male and 1242 (38.5%) were female. The median age was 61.4 years (range 50–69 years) and the median body mass index 25.7 kg/m2 (range 23.2–27 kg/m2). One thousand four hundred sixty-six patients (62%) had surgery for neoplasia. All patients had either colonic or rectal resection (Tables 2, 3).

Table 2 Demographic and clinical characteristics of patients in the four groups
Table 3 Main findings of the included studies

Anastomotic leak

The point estimation showed that the AL risk in the control group was significantly higher when compared to the ICG group (RR 0.44; Crl 0.14–0.87) and higher but not significant when compared to ALT (RR 0.53; Crl 0.21–1.30) and IOC (RR 0.49; Crl 0.10–1.80). Moreover, the indirect comparison showed that AL risk in patients who had ICG was lower, though not significantly so, when compared with both ALT (RR 0.44; Crl 0.14–0.87) and IOC (RR 0.44; Crl 0.14–0.87) (Fig. 3a). Table 4 is a league table showing all pairwise comparisons in the network meta-analysis.

Fig. 3
figure 3

a Forest plots of network meta-analysis estimates the RR for symptomatic AL. b Rank plot created using the rankogram function from the gemtc R package applied to the four groups illustrating empirical probabilities that each treatment is ranked first through fourth (left to right) for symptomatic AL

Table 4 League table

A rank plot illustrating empirical probabilities for AL with each assessment ranked first through fourth (left to right) is provided in Fig. 3b. The global heterogeneity was low (I2 = 13.0%). The sensitivity analysis regarding the choice of prior distribution for τ showed robustness of findings and the leverage plot did not show a potential outlier.

Discussion

This network meta-analysis observed that among all the IALT currently available, assessment with ICG offers the greatest reduction in symptomatic AL risk following colorectal surgery.

AL still remains the most feared complication following colorectal surgery. Its incidence is affected by several elements including preoperative factors (age, neoadjuvant chemotherapy, nutrition, and type of neoplasm), intraoperative factors (fluid replacement, hypothermia, type and length of surgery and surgical technique) and postoperative factors (medications and postoperative care) [1].

Ultimately, AL is multifactorial, and due to modifiable and non-modifiable factors. Historically, the role of IALT was to assess the structural integrity of the colorectal anastomosis, and out-rule a technical error. The use of ICG has been advocated in recent years to assess the actual perfusion of the new anastomosis and assess for vascular compromise. Concerns over ICG significantly increasing operative times have been refuted in recent studies [15]. However, there is a lack of case-matched RCT evidence to adequately assess if ICG assessment does impact on AL and associated morbidity and mortality. Only with this, can consensus be reached on whether or not ICG assessment should be a standardized step for testing blood supply to all colorectal anastomosis. Without it, the cost of acquiring the necessary equipment to perform ICG will always remain an issue for institutions, given the increased costs of healthcare and the healthcare budget constraints.

We acknowledge that this review has some limitations. The lack of large-volume RCT evidence to date is a significant issue, plus there is some heterogeneity in study populations, selection bias and surgical techniques in the included studies. However, this is the first meta-analysis of current IALT techniques.

The confidence interval crosses null value or includes values favouring either treatment. Including observational studies may be considered a limitation; however, the a priori exclusion of these studies in systematic reviews would be inappropriate and internally inconsistent with an evidence-based approach [29].

The confidence in the estimates was assessed using CINeMA and was found to range from moderate to very low primarily due to the quality of the included studies [30]. Regarding the interpretation of the treatment ranking, caution is needed regarding the confidence level. In fact, the treatment ranking does not consider the magnitude of differences in effects between treatments and therefore chance may explain any apparent difference between treatments. Thus, that our results suggest surgeons should choose the IALT technique, which best suits their expertise and the equipment available at their institution.

Conclusions

Surgical technology has been rapidly evolving, and this review highlights the change in thinking regarding prevention of AL. Historically, assessment of anastomotic integrity was based on structural evaluation and ensuring there were no technical issues with the anastomosis. ICG represents a change in assessment, and surgical philosophy. However, there is a need for better RCTs, to strengthen our results, especially regarding the impact on AL and subsequent morbidity. In addition, cost analysis should be assessed to support its universal acceptance amongst colorectal surgeons.