Introduction

Worldwide, more than 20 million patients undergo groin hernia repair every year [1]. Primary unilateral inguinal hernias account for 75% of abdominal wall hernias, in which there is a lifetime risk of 27% for men and 3% for women [2]. Operative techniques have continuously evolved over the past decades to provide the best management for inguinal hernias [3]. The Open tension-free Lichtenstein technique is the most commonly worldwide performed procedure with optimal outcomes and low recurrence rate [1]. After the introduction of laparoscopy, new techniques such as the transabdominal preperitoneal (TAPP) repair and the totally extraperitoneal repair (TEP) gained progressive acceptance with promising results and comparable outcomes to the Open approach [4]. Afterward, the introduction of robotic platforms opened new horizons and the robotic TAPP (rTAPP) arouses progressive enthusiasm with growing popularity and presumed advantages [5]. The previous meta-analysis analysed these techniques in a pairwise comparison (Open vs. TAPP or TAPP vs. TEP or Open vs. TEP), but a comprehensive review is lacking.

The aim of the present network meta-analysis was to globally compare short-term and recovery outcomes within these four major surgical approaches for primary unilateral inguinal hernia repair.

Materials and methods

A systematic review was performed according to the guidelines from the Preferred Reporting Items for Systematic Reviews and network meta-analyses’ checklist (PRISMA-NMA) [6]. Institutional review board approval was not required.

PubMed, EMBASE, and Web of Science databases were used for systematic search. The last date of search was the February 28st, 2019. We searched for papers using the following search strategy: “primary unilateral inguinal hernia”, “open inguinal hernia repair”, “transabdominal preperitoneal repair”, “totally extraperitoneal endoscopic repair”, and “robotic hernia repair”. All titles were initially evaluated and suitable abstracts extracted. The reference list of included articles was also screened. The study protocol was registered at PROSPERO (International prospective register of systematic reviews) (Registration number: CRD42019130852).

Inclusion and exclusion criteria

Eligible studies for the present meta-analysis included RCT and observational retrospective and prospective clinical studies. To be included in the analysis: (a) studies comparing surgical outcomes in primary unilateral inguinal hernia repair for either Open, TAPP, TEP, and rTAPP; (b) articles written in English; (c) articles with the longest follow-up or the largest sample size when two or more papers were published by the same institution, study group, or used the same data set; (d) published after the year 2000; (e) in case of duplicate studies with accumulating numbers of patients, only the most complete reports were included for quantitative assessment. Studies were excluded if (a) they were not written in English; (b) the methodology was not clearly reported; (c) the surgical technique was not clearly reported; (d) the inguinal hernia repair was performed without mesh or plug; (e) studies that did not contain a comparative group report primary unilateral inguinal hernias.

Data extraction

The following data were retrieved: author, year of publication, country, study design, number of patients, sex, age, body mass index (BMI), surgical approach, and early postoperative outcomes. All data were entered independently by three investigators (FL, AA, and AM) and compared only at the end of the reviewing process. A fourth author (GC) eventually reviewed the database. Discrepancies were clarified.

Quality assessment

Two authors (AA and FL) independently assessed the methodologic quality of the selected trials using the Cochrane risk of bias tool [7]. This tool evaluates the following criteria: (1) method of randomization; (2) allocation concealment; (3) baseline comparability of study groups; and (4) blinding and completeness of follow-up. Trials were graded as follows: A adequate, B unclear, and C inadequate on each criterion. Thus, each RCT was graded as having low, moderate, or high risk of bias. Two authors (FL and AA) independently assessed the quality of observational studies using the ROBINS-I tool [8]. The following domains were considered: confounding bias, selection bias, classification bias, intervention bias, missing data bias, outcomes measurement bias, and reporting bias. Each domain is evaluated with one of the following: “yes”, “probably yes”, “probably no”, and “no”. The categories of judgement for each study are low, moderate, serious, and critical risk of bias. Disagreements were solved by discussion.

Outcomes of interest

Primary outcomes: haematoma, seroma, post-operative chronic pain, and recurrence. Secondary outcomes: surgical site infection (SSI), urinary retention, operative time (minutes), and postoperative hospital length of stay (HLOS).

Statistical analysis

We performed fully Bayesian arm‐based random effect network meta‐analysis, in particular, mixed treatment comparison (MTC). Briefly, the MTC simultaneously synthesizes data from all available trials within a consistent network and combines direct evidence (comparison of treatments within head‐to‐head trials) with indirect evidence (comparison of treatments across trials against a common comparator) [9]. Compared with the frequentist meta‐analysis, the Bayesian approach takes into account all sources of variations, reflects these variations in the pooled result, can provide accurate estimates for small samples, and allows computation of predictive distribution [10]. Furthermore, Bayesian posterior analysis should yield exact coverages, independent of sample size. An ordinary consistency model was adopted with the binomial/log model [11]. We used risk ratio (RR) as a pooled effect size measure. For RR on log scale, we applied two different prior distributions to the model: a “sceptical” and “vague” prior distribution. In particular, we assigned a Normal with zero mean and scale 0.4 as sceptical distribution (10% of the distribution is contained within the clinically unimportant null interval); we assigned Normal with zero mean and scale 100 as vague prior distribution, and we used it as prior distribution into prior sensitivity analysis. For the between‐study variability (τ), we used an informative half‐normal prior with zero mean and scale 0.5 [12] assuming a common heterogeneity parameter across the various treatment comparisons. Sensitivity analysis regarding the choice of prior distribution for τ was also considered [13]. For continuous outcomes, mean difference was adopted using Normal prior distribution with mean 0 and scale 10, and uniform distribution (0.5) as τ prior. To evaluate statistical heterogeneity, we calculated I2 index: value of 25% was defined as low heterogeneity, 50% as moderate heterogeneity, and 75% as high heterogeneity [14]. The inference was performed using mean and relative 95% credible intervals (CrI), based on draws from marginal posterior distribution in Monte Carlo Markov chain (MCMC), simulating 300,000 iterations after a burn‐in period of 30,000 iterations. We consider the estimated parameter statistical significant when its 95% CrI encompasses null‐hypothesis value. The accuracy of the inference was assessed by convergence of MCMC algorithm, checked using graphical inspection of running means, MCMC error, and diagnostic statistics included into R/Boa package [15]. The plot of Leverage values vs. the square root of the residual deviance (mean per data point for each study) was used to identify potentially study outlier. The transitivity assumption (i.e., studies comparing different sets of interventions needed to be sufficiently similar) was considered to provide valid indirect inferences. To assess transitivity, we generated descriptive statistics and we compared the distributions of baseline participant characteristics across studies and treatment comparisons. To assess local inconsistencies, we used the node‐splitting method [16], which was not possible to conduct a formal assessment of the consistency of the direct and indirect evidence where the evidence network included open loops. We plotted rank probabilities against the possible ranks for all competing treatments. The confidence in estimates of the outcome was assessed using Confidence in Network Meta-Analysis (CINeMA) [17]. In accordance with Cochrane guidelines, we did not investigate publication bias as our search consider less than ten studies for each data comparison [18]. Statistical analyses were carried out using JAGS and R‐Cran 3.4.3 (Distributed Statistical Computing; Vienna, Austria) [19, 20].

Review of network geometry

We appraised the geometry of the networks for each outcome separately, and provided network graphs for primary outcomes with nodes reflecting the surgical approaches and two nodes linked together by an edge, if at least one study compared the two corresponding surgical techniques. The connection between surgical approaches was analysed (Fig. 1).

Fig. 1
figure 1

Network geometry for studies reporting: a hematoma, b seroma, c postoperative chronic pain, and d hernia recurrence

Results

Systematic review

Sixteen studies met the inclusion criteria. The flowchart selection process is reported in Fig. 2. Twelve articles were RCT, three were prospective clinical studies, and one was retrospective clinical study. Overall, 51.037 patients were included in the analysis. Of these, 18.135 (35.5%) underwent Open, 17.112 (33.5%) TAPP, 15.687 (30.7%) TEP, and 103 (0.3%) rTAPP repair (Table 1). The age range from 18 to 65 years and the majority were males (81.5%). Body mass index (BMI) ranged from 24 to 52 kg/m2 (Table 1). The hernia site was reported in six studies, the type of hernia, according to the Nyhus or European Hernia Society (EHS) classification, was reported in seven studies, and the hernia size was reported in three studies. The American Society of Anesthesiologists (ASA) classification was reported in seven studies. Three studies reported the cost analysis. All studies reported the postoperative follow-up that ranged from 1 to 60 months. Quality of life according to the Short-Form 36 (SF-36) questionnaire was accomplished in two studies. We tried out to ensure transitivity by applying narrow inclusion criteria and making populations as similar as possible within and across treatment comparisons. The quality of all included studies is depicted in Fig. 3 and Table 2.

Fig. 2
figure 2

The Preferred Reporting Items for Systematic Reviews and network meta-analyses checklist (PRISMA-NMA) diagram

Table 1 Demographic and clinical characteristics of patients undergoing Open Lichtenstein repair (Open), Laparoscopic Trans-Abdominal PrePeritoneal (TAPP), Totally Extra Peritoneal (TEP), and robotic TAPP (rTAPP)
Fig. 3
figure 3

Risk of bias for randomized-controlled trials was assessed with the Cochrane risk-of-bias tool

Table 2 Quality assessment of the included observational studies (ROBINS-I tool)

Meta-analysis

Haematoma

Nine studies for a total of 54.044 patients reported the incidence of postoperative haematoma. Pooled network meta-analysis shows similar RR when comparing TAPP vs. Open (RR 0.68; 95% CrI 0.40–1.30), TEP vs. Open (RR 0.67; 95% CrI 0.43–1.20), and TEP vs. TAPP (RR 1.01; 95% CrI 0.51–1.80) (Fig. 4a). Node splitting revealed no statistical difference between the direct and indirect comparison (TAPP vs. Open p = 0.242; TEP vs. Open p = 0.202; TEP vs. TAPP p = 0.232). The prior sensitivity analysis yields robust results for TAPP vs. Open (RR 0.68; 95% CrI 0.40–1.30) and TEP vs. Open (RR 0.67; 95% CrI 0.42–1.21). The global heterogeneity is low (I2 = 24%). The SUCRA ranking was 91% for Open, 31% for TEP, and 27% for TAPP.

Fig. 4
figure 4

Forest plots of network meta-analysis estimates the RR for a hematoma, b seroma, c postoperative chronic pain, and d hernia recurrence

Seroma

Nine studies for a total of 54.138 patients reported the incidence of postoperative seroma. Pooled network meta-analysis shows similar RR when comparing TAPP vs. Open (RR 0.91; 95% CrI 0.50–1.62), TEP vs. Open (RR 0.64; 95% CrI 0.32–1.33), TEP vs. TAPP (RR 0.70; 95% CrI 0.39–1.31), and rTAPP vs. Open (RR 0.98; 95% CrI 0.37–2.51) (Fig. 4b). Node splitting revealed no statistical difference between the direct and indirect comparison (TAPP vs. Open p = 0.885; TEP vs. Open p = 0.899; TEP vs. TAPP p = 0.917). The prior sensitivity analysis yields robust results for TAPP vs. Open (RR 0.91; 95% CrI 0.49–1.60), TEP vs. Open (RR 0.63; 95% CrI 0.32–1.31), and rTAPP vs. Open (RR 0.98; 95% CrI 0.37–2.61). The global heterogeneity is moderate (I2 = 39%). The SUCRA ranking was 68% for Open, 62% for rTAPP, 55% for TAPP, and 14% for TEP.

Postoperative chronic pain

Six studies for a total of 36.724 patients reported the incidence of postoperative chronic pain. Pooled network meta-analysis shows similar RR when comparing TAPP vs. Open (RR 0.53; 95% CrI 0.27–1.20), TEP vs. Open (RR 0.86; 95% CrI 0.48–1.16), and TEP vs. TAPP (RR 1.70; 95% CrI 0.63–3.20) (Fig. 4c). Node splitting revealed no statistical difference between the direct and indirect comparison (TAPP vs. Open p = 0.872; TEP vs. Open p = 0.894; TEP vs. TAPP p = 0.974). The prior sensitivity analysis yields robust results for TAPP vs. Open (RR 0.54; 95% CrI 0.28–1.12) and TEP vs. Open (RR 0.86; 95% CrI 0.48–1.70). The global heterogeneity is moderate (I2 = 63%). The SUCRA ranking was 81% for Open, 59% for TEP, and 10% for TAPP.

Recurrence

Nine studies for a total of 111.197 patients reported the postoperative early recurrence. Pooled network meta-analysis shows similar RR when comparing TAPP vs. Open (RR 0.96; 95% CrI 0.57–1.51), TEP vs. Open (RR 1.0; 95% CrI 0.65–1.61), TEP vs. TAPP (RR 1.10; 95% CrI 0.63–2.10), and rTAPP vs. Open (RR 0.98; 95% CrI 0.45–2.10) (Fig. 4d). Node splitting revealed no statistical difference between the direct and indirect comparison (TAPP vs. Open p = 0.81; TEP vs. Open p = 0.707; TEP vs. TAPP p = 0.675). The prior sensitivity analysis yields robust results for TAPP vs. Open (RR 0.96; 95% CrI 0.58–1.50), TEP vs. Open (RR 1.0; 95% CrI 0.65–1.62), and rTAPP vs. Open (RR 0.98; 95% CrI 0.45–2.10). The global heterogeneity is zero (I2 = 0.0%). The SUCRA ranking was TEP 56%, 52% for Open, 49% for rTAPP, and 44% for TAPP. The League table for primary outcomes is depicted in Table 3.

Table 3 League table

The subgroup analysis including 12 RCT studies showed comparable non-statistically significant results in term of postoperative hematoma, seroma, chronic pain, and recurrence comparing Open, TAPP, and TEP.

Secondary outcomes

Pooled network meta-analysis does not show statistically significant RR comparing TAPP vs. Open, TEP vs. Open, TEP vs. TAPP, and rTAPP vs. Open in term of SSI (RR 0.65; 95% CrI 0.34–1.43, RR 0.59; 95% CrI 0.31–1.34, RR 0.90; 95% CrI 0.39–2.21, RR 1.10; 95% CrI 0.53–2.46, respectively). The global heterogeneity is zero (I2 = 0.0%). Similarly, the pooled network meta-analysis does not show statistically significant RR comparing TAPP vs. Open, TEP vs. Open, TEP vs. TAPP, and rTAPP vs. Open in term of urinary retention (RR 0.92; 95% CrI 0.49–1.78, RR 1.0; 95% CrI 0.55–1.91, RR 1.10; 95% CrI 0.49–2.57, RR 0.96; 95% CrI 0.45–2.14, respectively). The global heterogeneity is zero (I2 = 0.0%). Again, the pooled network meta-analysis does not show statistically significant difference comparing TAPP vs. Open, TEP vs. Open, and TEP vs. TAPP in term of operative time (smd = 2.20; 95% CrI − 1.60 to 5.90, smd = − 1.40; 95% CrI − 5.10 to 2.30, smd = − 3.60; 95% CrI − 7.70 to 0.58, respectively). By contrast, rTAPP was associated with a statistically significant longer operative time compared to Open (smd = 30.0; 95% CrI 21.01–39.02). The global heterogeneity is high (I2 = 89.4%). The pooled network meta-analysis does not show statistically significant differences comparing TAPP vs. Open, TEP vs. Open, and TEP vs. TAPP in term of postoperative length of stay (smd = − 0.73; 95% CrI − 1.70 to 0.20, smd = − 0.47; 95% CrI − 1.70 to 0.69, smd = 0.27; 95% CrI − 0.84 to 1.30, respectively).

The Leverage plots do not show the evidence of study outliers into this network meta-analysis. For all outcomes, there was no evidence of non-MCMC convergence using the diagnostic tools described in the Statistical analysis section. The assessments of confidence in the estimates using CINeMA show moderate-to-very low confidence, essentially due to study limitation, imprecision, and inconsistence.

Discussion

The aim of the present network meta-analysis was to globally compare short-term outcome of the main surgical approaches for primary unilateral inguinal hernia repair. No significant differences were found in term of postoperative hematoma, seroma, chronic pain, and early recurrence. Similarly, no differences were found in term of postoperative surgical site infections, urinary retention, and postoperative hospital length of stay. rTAPP was found to have a statistically significant longer operative time compared to the Open approach.

While the Open approach is considered the gold standard treatment for non-complicated primary inguinal hernia repair, different minimally invasive techniques have been progressively proposed with the introduction of advanced technologies [21]. In the European Hernia Society’s guidelines, the Lichtenstein procedure and the minimally invasive techniques are recommended as the best evidence-based options in experts’ hands [1]. The pros and cons have been largely discussed, but, to date, results are contrasting and a robust evidence favouring one treatment over another is lacking. The previous observational, RCT studies, and pairwise meta-analyses have been published with discordant results. These were mainly related to the heterogeneity of the included populations, surgical techniques, and indications [22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37]. We performed a fully Bayesian network meta-analysis to globally compare outcomes of the main surgical approaches to inguinal hernia. To reduce background heterogeneity, we mainly focused on mesh-reinforced tension-free techniques for primary non-complicated unilateral inguinal hernia.

Postoperative seroma and hematoma represent the most common surgery-related complications [38]. The method for mesh fixations, large hernia size, and medial hernia represent independent risk factors for seroma formation [39]. Kockerling et al. in a recently published data set-based retrospective study reported a significantly lower incidence of overall postoperative complications and seroma in patients that underwent TEP compared to Open and TAPP procedures [37]. Our study showed that Open, TAPP, TEP, and rTAPP seem associated with similar results in term of postoperative hematoma and seroma. The related-global heterogeneity was moderate and low (39% and 21%, respectively). The meta-regression analysis was not possible, because data were reported as aggregated and individual-patient comorbidity was lacking. These results are in accordance with the study by Wu et al. showing no differences comparing TAPP vs. Open [3]. This may be the result of a rigorous surgical technique and careful anatomical planes dissection performed in the experienced hands [40]. It should be noted, however, that despite the lack of statistically significance the Open approach was ranked in the SUCRA as the treatment having a higher likelihood of postoperative hematoma and seroma.

Postoperative chronic pain has been shown to be associated with perioperative nerves injury or entrapment [38]. Nerves can also be trapped in a shrinked mesh or by peri-prosthesis inflammatory processes. In addition, the type of the mesh, the weight of the mesh (g/m2), and fixation method may be contributing factors for the development of postoperative chronic pain [41]. In the present analysis, none of the studies evaluating the robotic approach reported this outcome, and the Open, TAPP, and TEP approaches seem comparable in term of postoperative chronic pain. The global heterogeneity was moderate (I2 = 63%) probably reflecting the lack of a globally accepted and standardized definition of postoperative pain. Notably, the Open approach was classified in the SUCRA ranking as the treatment with major probability for postoperative chronic pain. In the previous studies, the comparison of Lichtenstein vs. TAPP showed a significantly lower chronic pain for TAPP, whereas there was insufficient evidence to determine the best treatment comparing Open vs. TEP and TAPP vs. TEP [42]. Wu et al. revealed that, compared to the Open approach, TAPP was associated with a lower rate of paresthesia with similar chronic pain [3]. In the Open technique, the prophylactic ilioinguinal neurectomy has been proposed to avoid nerve entrapment, thus, reducing postoperative pain [43]. However, it seems that the intraoperative identification and preservation of all three inguinal nerves during open surgery reduces significantly the incidence of postoperative chronic incapacitating groin pain (< 1%) [38, 44, 45].

Age > 50 years, smoking history, BMI > 30, and medial vs. lateral hernia are independent risk factors for hernia recurrence [46, 47]. The pooled network meta-analysis showed that the four surgical approaches seem comparable in term of hernia recurrence. Therefore, this result may not be generalized and the variability in operative technique should be taken into account. Notably, the global heterogeneity was zero (I2 = 0.0%) indicating a low degree of inconsistence across studies and ultimately giving robustness to the result. No significant differences were found in term of surgical site infections, urinary retention, and postoperative length of stay.

After the introduction of robot-assisted minimally invasive surgery, the number of rTAPP has progressively increased worldwide [48, 47]. The robot-assisted approach offers increased range of instruments motion and improved surgeon ergonomics [5]. However, some concerns exist including operative time, surgeon learning curve, real patient-effectiveness, and global costs [49, 50]. In the present review, the rTAPP was associated with an estimated longer operative time compared to the Open approach, probably attributable to instruments positioning and surgeons’ learning curve. The cost analysis was reported in only three studies with a trend toward higher expenses in minimally invasive rTAPP. The pooled analysis was not performed, because the heterogeneity in cost reporting, but this seems imputable to instrument maintenance and equipment costs.

We acknowledge some limitations to the present meta-analysis, because the publication bias related to the heterogeneity of the included studies. Both RCT and observational design studies were considered. The intrinsic bias of observational studies could be considered a study limitation; however, the a priori exclusion of observational studies in systematic reviews is inappropriate and inconsistent with a comprehensive evidence-based approach [51]. Furthermore, the quality and quantity of RCT for surgical techniques comparison is acknowledged to be limited [52]. The imprecision must be considered for some of the outcomes, because the credible interval crosses null value or includes values favouring either treatment. The treatment ranking should be cautiously interpreted, because the treatment ranking does not consider the magnitude of differences in effects, and therefore, chance may explain any apparent difference between treatments. Therefore, surgeons should choose the most suitable surgical approach appraising the treatment ranking, costs, and personal expertise. Finally, the postoperative follow-up was heterogeneous across studies, thus, adding further background bias and the rTAPP was performed in a limited number of patients.

To our knowledge, this is the first network meta-analysis that globally compare outcomes within the major surgical approaches for primary unilateral inguinal hernia. Using network meta-analytical techniques, we were able to globally synthesize data from numerous studies and, therefore, rank the treatments according to our studied outcomes. The study was planned in agreements with the PRISMA guidelines, and followed a sound methodology that was a priori stated in the PROSPERO protocol. This included comprehensive outcome measures and the evaluation of quality at study level (risk of bias) and confidence in results at outcome level (CINeMA). The selection criteria led to a homogenous population for some of the primary outcomes, as confirmed by low heterogeneity. This makes us confident that the results of this study are robust. Finally, we conducted the study accounting for single-study effect performing the prior sensitivity analysis.

In conclusion, this network meta-analysis suggests that Open, TAPP, TEP, and rTAPP seem comparable in term of postoperative hematoma, seroma, postoperative chronic pain, recurrence, infectious complications, urinary retention, and hospital length of stay. The surgical management of inguinal hernia is evolving and the effect of the adoption of innovative minimally invasive techniques should be further investigated in the long term. Ultimately, the choice of the most suitable treatment should be based on individual surgeon expertise and tailored on each patient.