Introduction

Esophageal cancer is the sixth most common cancer and the eighth most common cause of cancer-related death worldwide [1]. The prognosis is poor while related 5-year overall survival ranges from 15 to 20% [2]. Esophagectomy, lymphadenectomy, and restoration of the gastrointestinal continuity via gastric conduit reconstruction represent the gold standard treatment [3]. The esophagogastric anastomosis is the most delicate and trickiest part of the operation while related complications are feared problems [4]. Anastomotic leak (AL) may occur up to 10% of patients. It has been reported to be associated with a 3-fold increase in mortality, prolonged hospital stay, delayed oral feeding, risk of reintervention, increased risk of recurrence, and decrease of overall/disease-free survival [5, 6]. Anastomotic stricture (AS) may occur up to 30% of patients and may require endoscopic dilation with a negative effect of postoperative recovery, nutritional status, and quality of life [7].

Different techniques have been described for esophagogastric anastomosis such as hand-sewn and mechanical stapled anastomosis [8, 9]. Over the past few decades, surgeons have been improving anastomotic techniques with a gradual shift from hand-sewn to stapled anastomosis [10,11,12,13]. Nowadays, circular-stapled (CS) and linear-stapled (LS) anastomosis are largely adopted into clinical practice for both cervical and thoracic anastomoses [14]. CS and LS have their own advantages and weaknesses while the decision to use one technique over another mainly depends on surgeon expertise and personal preference [15]. Currently, a definitive indication on the best stapling technique for esophageal anastomosis is still to be defined since previous studies showed contrasting results.

Hence, aim of the present systematic review and meta-analysis was to perform an updated literature analysis to compare outcomes for LS vs. CS anastomosis in the setting of esophagectomy for the treatment of esophageal cancer.

Materials and methods

We conducted this study according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement and MOOSE guidelines [16, 17]. Institutional review board approval was not required. PubMed, MEDLINE, Scopus, Web of Science, Cochrane Central Library, and ClinicalTrials.gov were used [18]. The last date of search was the June 30th, 2022. A combination of the following MeSH terms (Medical Subject Headings) was used (“esophagectomy” (tiab), OR “esophagectomies” (tiab), OR, “esophagogastric” (tiab), OR “esoph*” (tiab)) AND (“anastomosis” (tiab), OR “suture” (tiab)) AND (“linear” (tiab), OR “circular” (tiab)) AND (“outcomes” (tiab), OR “complication” (tiab)) AND (“leak” (tiab), OR “leakage” (tiab)) AND (“stricture” (tiab), OR “stenosis” (tiab)). All titles were evaluated and suitable abstracts extracted. The search was completed by consulting the references of each article. The study protocol was registered at the PROSPERO (international prospective register of systematic reviews) (Registration Number: CRD42022328741).

Eligibility criteria

Inclusion criteria: (a) cohort studies and randomized controlled trials (RCTs) comparing outcomes for LS vs. CS among adult patients (>18 year old) undergoing elective esophagectomy for cancer; (b) English-written; (c) when two or more papers were published by the same institution, study group, or used the same dataset, articles with the longest follow-up or the largest sample size; (d) in case of duplicate studies with accumulating numbers of patients only the most complete reports were included for quantitative analysis. Exclusion criteria: (a) not English-written; (b) poor methodology; (c) no clear outcome distinction between LS vs. CS; (d) articles with less than 7 patients per study arm.

Data extraction

The following data were collected: authors, year of publication, country, study design, number of patients, sex, age, body mass index (BMI), American Society of Anesthesiologists (ASA) physical status, comorbidities, surgical indication, tumor characteristics, histological type, tumor location, cancer stage, neoadjuvant chemoradiotherapy, and postoperative outcomes. All data were computed independently by three investigators (AA, AS, FL) and compared at the end of the reviewing process. A fourth author (DB) reviewed the database and determined discrepancies.

Outcomes

Primary outcomes were postoperative AL and AS. The quantitative analysis was performed for the global population and after stratification according to the level of anastomosis (cervical and thoracic). Secondary outcomes were reflux esophagitis, pulmonary complications, operative time (OT) (minutes), hospital length of stay (HLOS) (days), and 30-day mortality. AL was defined as evidence of contrast extravasation at postoperative swallow study and/or CT scan, or endoscopic visualization of anastomotic dehiscence/fistula, or visible loss of saliva through surgical drains combined with clinical signs. AS was diagnosed in case of postoperative dysphagia, evidence of anastomotic lumen stricture, and need for endoscopic dilatation up to 6 months after the operation. Reflux esophagitis was defined as higher than grade A according to the Los Angeles Classification of severity.

Quality assessment

Three authors (AA, AS, MC) independently assessed the methodological quality of included studies. The ROBINS-I tool was used for observational studies [19]. The following domains were considered: confounding bias, selection bias, classification bias, intervention bias, missing data bias, outcomes measurement bias, and reporting bias. Each domain is evaluated with one of the following: “yes,” “probably yes,” “probably no,” or “no.” The categories of judgment for each study are low, moderate, serious, and critical risk of bias. The methodological quality of randomized controlled trials (RCTs) was appraised with the Cochrane risk of bias tool [20]. This tool evaluates the following criteria: [1] method of randomization; [2] allocation concealment; [3] baseline comparability of study groups; and [4] blinding and completeness of follow-up. Trials were graded as follows: A = adequate, B = unclear, and C = inadequate on each criterion. Thus, each RCT was graded as having low, moderate, or high risk of bias. Disagreements were solved by discussion. We used the Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) tool to assess the quality of the body of evidence across studies [21].

Statistical analysis

The results of the systematic review were summarized quantitatively into frequentist random effect meta-analysis of pooled risk ratio (RR) and standardized mean difference (SMD). An inverse-variance method and DerSimonian–Laird estimator for the variance of the true effect size (τ2) were performed [22, 23]. Heterogeneity among studies was evaluated by the I2 index and Cochran’s Q test [24]. Statistical heterogeneity was considered low, moderate, and high for I2 values of 25, 50, and 75%, respectively, and significant when p < 0.10 [25, 26]. The Wald-type 95% confidence interval (CI) was computed for pooled measurements; otherwise, the 95% CI for the I2 index was calculated according to Higgins and Thompson [27]. The prediction interval for the treatment effect of a new study was calculated according to Borenstein et al. [24]. As the sample size was not the same in all studies, we performed a sensitivity analysis by excluding one study each time and rerunning the analysis to verify the robustness of the overall results. The publication bias was also investigated with the trim and fill funnel plot and Egger test. A two-sided p value was considered statistically significant when p < 0.05. All analyses and figures were carried out using the R software program, version 3.2.2 [28].

Results

Systematic review

The PRISMA flow chart is reported in Fig. 1. Four-hundred fifty-eight publications were identified. After duplicates removal, 402 titles were screened. Sixteen articles were excluded after title assessment. Overall, 386 abstracts were reviewed while 21 articles were found possibly relevant for full-text assessment. After full-text evaluation, 18 studies meet the inclusion/exclusion criteria and were included in the quantitative synthesis. Fifteen studies were of retrospective design while three were RCTs. The quality of observational studies and RCTs is reported in Supplementary Table 1 and Supplementary Figure 1.

Fig. 1
figure 1

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) diagram

Overall, 2861 patients were included (Table 1). Of those 1371 (47.9%) underwent CS while 1490 (52.1%) underwent LS. The age of the patient population ranged from 47 to 87 years, the BMI ranged from 15.3 to 30.2 kg/m2, and the majority (74.9%) were males. The tumor was located in the upper (12.4%), middle (51.6%), and lower (36%) esophagus. Tumor histology was specified in 9 studies; squamous cell carcinoma and adenocarcinoma were diagnosed in 83% and 15.5% of patients, respectively. Pathological tumor staging according to the 7th and 8th edition of the American Joint Committee on Cancer (AJCC) and Japanese gastric cancer classification (JGCC) was detailed in 7 studies (stage 0–I: 25.1%, stage II: 37.1%, stage III: 34.5%, and stage IV: 3.3%). Open, hybrid, or totally minimally invasive esophagectomy were performed depending on operating surgeon preference and expertise. Both cervical and intrathoracic anastomoses were described according to tumor location, operating surgeon preferences, and oncological principles. The use of neoadjuvant chemoradiation therapy was heterogeneously reported in nine studies (i.e., protocols, regimens, etc.). The extent of lymphadenectomy (2-field and 3-field) was specified in 5 studies and varied depending on surgeon expertise and tumor clinical stage/location.

Table 1 Demographic, clinical, and operative data for patients undergoing linear-stapled (LS) and circular-stapled (CS) esophagogastric anastomosis. RCT, randomized controlled trial; Ret, retrospective; PS, propensity score adjusted; BMI, body mass index; U, upper esophagus; M, middle esophagus; L, lower esophagus; SCC, squamous cell carcinoma; AC, adenocarcinoma; Lymhp, level of lymphadenectomy; UICC-TNM, Union for International Cancer Control; AJCC-TNM, American Joint Committee on Cancer; JGCC, Japanese gastric cancer classification; Op, open esophagectomy; Hyb, hybrid esophagectomy; MIE, minimally invasive esophagectomy; 2FL, 2-field lymphadenectomy; 3FL, 3-field lymphadenectomy; *, robotic; NA, not available. Data are reported as numbers, mean ± standard deviation, and median (range)

Meta-analysis—primary outcomes

AL was reported in eighteen studies (2861 patients). The cumulative incidence of AL was reduced for LS vs. CS (7.9% vs. 12.2%). Compared to CS, LS was associated with a significantly reduced AL risk (RR = 0.70; 95% CI 0.54–0.91; p < 0.01) (Fig. 2A). The prediction lower and upper limits were 0.47 and 1.06, respectively. The heterogeneity was zero (I2 = 0.0%, 95% CI 0.0–50%; p < 0.01) and τ2 = 0.01. The funnel plot (Fig. 2B) and the Egger test (p = 0.49) did not show evidence of publication bias. The sensitivity analysis showed the robustness of these findings in terms of point estimation, relative confidence intervals, and heterogeneity. After stratification, LS showed a significant AL risk reduction for cervical anastomosis (7 studies; 952 patients) (RR = 0.61; 95% CI 0.38–0.96; I2 = 11%; p = 0.039) while no significant differences were found for thoracic anastomosis (9 studies, 1581 patients) (RR = 0.78; 95% CI 0.57–1.05; I2 = 0.0%; p = 0.10) (Supplementary Figure 2 A-B).

Fig. 2
figure 2

Anastomotic leak. Forrest (A) and funnel (B) plot. RR, risk ratio; 95% CI, confidence interval

AS was reported in 15 studies (1922 patients). The cumulative incidence of AS was reduced for LS vs. CS (7.7% vs. 18.9%). Compared to CS, LS was associated with a significantly reduced AS risk (RR = 0.32; 95% CI 0.20–0.51; p < 0.0001) (Fig. 3A). The prediction lower and upper limits were 0.11 and 1.05, respectively. The heterogeneity was moderate (I2 = 33.3%, 95% CI 0.0–50%; p < 0.01) and τ2 = 0.24. The funnel plot (Fig. 3B) and the Egger test (p = 0.04) showed that publication bias could not be excluded. The sensitivity analysis showed that omitting the study by Xu et al., the heterogeneity decreased to low (I2 = 12%). After stratification, LS showed a significant AL risk reduction for both cervical (7 studies; 1162 patients) (RR = 0.42; 95% CI 0.30–0.58; p < 0.001; I2 = 43%) and thoracic anastomosis (7 studies, 749 patients) (RR = 0.32; 95% CI 0.14–0.77; p < 0.01; I2 = 33%) (Supplementary Figure 3 A-B).

Fig. 3
figure 3

Anastomotic stricture. Forrest (A) and funnel (B) plot. RR, risk ratio; 95% CI, confidence interval

Subgroup analysis by including only RCTs (3 studies; 285 patients) showed a trend toward reduced AL (RR = 0.56; 95% CI 0.19–1.09; p = 0.32; I2 = 0.0%) and AS risk (RR = 0.17; 95% CI 0.02–1.88; p = 0.15; I2 = 29%) for LS vs. CS.

Meta-analysis—secondary outcomes

Reflux esophagitis (RR = 0.74; 95% CI 0.30–1.41; p = 0.36; I2 = 62%), pneumonia (RR = 0.78; 95% CI 0.57–1.06; p = 0.12; I2 = 0.0%), OT (SMD −0.25; 95% CI −0.25, 0.09; p = 0.16; I2 = 77%), HLOS (SMD 0.13; 95% CI −0.25, 0.51; p = 0.51; I2 = 91%), and 30-day mortality (RR = 1.26; 95% CI 0.72–2.21; p = 0.42; I2 = 0.0%) were similar for LS vs. CS (Fig. 4A-C). Using the GRADE tool, we rated the quality of evidence supporting each outcome as low-moderate mainly because of limitations in study design (Supplementary Table 2).

Fig. 4
figure 4

Forrest plot for reflux esophagitis (A), pneumonia (B), and 30-day mortality (C). RR, risk ratio; 95% CI, confidence interval

Discussion

This meta-analysis shows that compared to CS, LS seems associated with a reduced RR of AL and AS during esophagectomy. After stratification for both cervical and thoracic anastomosis, LS shows a trend toward reduced AL and AS risk. Reflux esophagitis, pneumonia, and 30-day mortality seem similar among treatments.

The esophagogastric anastomosis represents a central part of esophagectomy and may potentially contribute to the significant short- and long-term morbidity and mortality [4, 6, 47]. AL is a major complication with a reported incidence ranging from 10 and 20% [48]. It is associated with increased postoperative morbidity, high postoperative mortality rates, prolonged hospital stay, and increased costs [49,50,51,52]. In addition, it has been shown to decrease long-term quality of life and oncological survival [6, 53]. Factors that may contribute to anastomosis failure are lack of serosa layer, tension, inadequate blood supply of the gastric conduit, surgical technique, malnutrition, and patient comorbidities [54]. Furthermore, it has been suggested that gut microbiome may influence the suture line healing process thus possibly contributing to anastomosis breakdown [55]. Various techniques for esophagogastric anastomosis (hand-sewn vs. stapled) have been described [56,57,58]. Interestingly, there has been a gradual shift from hand-sewn to stapled anastomosis because technical simplicity, comparable safety, and time saving [59]. Currently, both LS and CS anastomosis are used for esophagogastric anastomosis. Their utilization is mainly dependent on surgeon preference while each technique has its pros and cons. Specifically, CS facilitates the anastomosis at the apex of the thorax (cupula pleuralis); however, the anastomotic lumen is dependent on the original esophageal diameter with problems related to possible size mismatch. Furthermore, CS creates an inverted anastomosis where esophageal and gastric mucosa margins are separated by muscular layers thus potentially resulting in a high stricture rates. Conversely, LS anastomosis makes a larger anastomotic diameter, minimizes problems related to visceral mismatch, and makes an extroverted anastomosis which leads to improved mucosa-to-mucosa apposition [29, 37]. Furthermore, it has been postulated that LS might be advantageous in terms of blood supply as the staple line results parallel to the axis of the gastric conduit thus leading to maximum preservation of the vasculature network at the anastomosis mainly perfused through intramural capillaries [44, 60, 61]. Recently Nickel et al. explored the principles of gastric conduit capillaries dynamics in live porcine models undergoing minimally invasive esophagectomy with LS anastomosis. Authors concluded that the dominant direction of flow through the conduit mainly arises through the right gastroepiploic artery and capillaries along the transverse conduit axis thus suggesting the use of short (30 mm) linear staplers to preserve optimal tissue oxygenation at the anastomotic site [62]. Finally, LS anastomosis requires a longer esophageal remnant (at least 2 cm); thus, fashioning an aligned high intra-thoracic tension-free anastomosis is challenging because of limited space.

While both techniques are currently used, robust evidence describing the best technique for stapled-esophageal anastomosis is to be defined. A previous study by Yanni et al. reported significantly reduced thoracic AL for LS vs. CS (4.1% vs. 15.3%; p = 0.019) [38]. Equally, Huang et al. in their 2017 retrospective study described a significantly reduced incidence of cervical AL for LS vs. CS (7.7% vs. 19%) [37]. Despite the lack of statistical significance, Fabbi et al. [46] and Hosoi and colleagues [44] reported a tendency toward reduced AL for LS. In contrast, Zhou et al. in a 2015 meta-analysis affirmed no differences for postoperative AL comparing LS vs. CS (RR = 0.80; p = 0.52) [63]. In our study, we found that LS seems associated with a significantly reduced RR for AL (RR = 0.67; p < 0.01). The related heterogeneity was 0.0% thus adding robustness to the result. Possible explanations include a better vascular/oxygen supply to the anastomotic site, superior anastomosis orientation with concomitantly reduced traction-related tension, and minimization of visceral twisting. The stratification analysis still evidences a significant difference for cervical anastomosis (RR = 0.61; p = 0.04) while no significant differences were found for thoracic anastomosis. Notably, the point estimation was below 1.00 (RR = 0.61) thus suggesting a clinical tendency toward reduced RR. However, this initial indication mandates future analysis and possible confirmation by large trials. Despite the low heterogeneity, our result should be interpreted with caution because the presence of possible confounders related to technical variations (Collard vs. Orringer procedure), closure of the anterior wall (i.e., single layer vs. double layer vs. stapling), suture material (absorbable vs. non-absorbable vs. antibacterial suture), blood flow assessment with indocyanine green, gastric ischemic preconditioning, esophageal diameter (dilated or not), omental wrapping, route of reconstruction (retrosternal vs. posterior mediastinal), patient selection, baseline comorbidities (i.e., diabetes), smoking status, neoadjuvant treatment, tumor characteristics (i.e., grading, size, location, etc.), tumor-free resection margins (R0), and surgical approach.

In our meta-analysis, LS seems associated with a significantly reduced RR for AS (RR = 0.31; p < 0.0001) irrespective from the level of the anastomosis. This is similar to what previously reported by Zhou et al. that acknowledged a significant tendency toward reduced stricture risk for LS (RR = 0.26; p = 0.002) [63]. Similar results are reported by several retrospective analyses [32, 37, 44]. Possible explanations include a wider anastomotic diameter and reduced risk of staple line scarring/fibrosis for LS. Again, these results should be interpreted prudently because lacking of standardized techniques, different sizes (25 mm vs. 28 mm) and techniques for CS (i.e., purse-string suture vs. EEA™ vs. Orvil™), patient selection, postoperative leak occurrence, patient age, comorbidities (i.e., diabetes), smoking status, neoadjuvant treatment, tumor-free resection margins (R0), surgical approach (open vs. minimally invasive), and surgeon proficiency. Interestingly, no significant differences were found for reflux esophagitis (RR = 0.74; p = 0.36). This may be attributable to the increased utilization of proton pump inhibitors to reduce acid production and esophageal exposure. This is different from Hosoi et al. that affirmed a tendency toward higher rates of esophagitis for LS anastomosis [44]. Finally, no significant differences were found for postoperative pneumonia, 30-day mortality, OT, and HLOS.

Notably, both AL and AS may not directly reflect the quality of a specific technique but are also influenced by surgeon proficiency, learning curves, structured training/mentorship programs, and hospital volumes [64,65,66]. Surgical volume and operating surgeon proficiency are critical to obtain optimal surgical outcomes after esophagectomy. It has been shown that case-load centralization in high-volume centers significantly reduces mortality and may improve outcomes [67]. Specifically, during the learning curve, AL has been shown to decrease from 18% in the early phase to 4.5% after 119 cases [68]. Based on these considerations, it should be pondered that AL and AS may not entirely reflect the anastomotic technique but are also influenced by learning curve effect and surgeon experience. Finally, the introduction of new technologies such as fluorescence imaging with indocyanine green to assess the gastric conduit perfusion/anastomosis and the advent of robot-assisted esophagectomy may potentially improve outcomes [69,70,71].

Our study presents the typical limitations of a meta-analysis including observational studies. The lack of inclusion criteria defined a priori, lack of homogenous surgical approach, and lack of globally defined postoperative management protocols. Second, AL and AS were not uniformly defined and classified among included studies. Third, the majority of included studies (12/18) were performed in Asian countries; therefore, results may not be generalizable. Fourth, surgeon experience and volume were not measured with a conceivable effect on outcomes. Lastly, the creation of a single type of anastomosis might have diverse technical variations among different esophageal surgeons and should therefore be considered as possible source of heterogeneity.

Conclusions

There is a variety of different techniques for esophagogastric anastomosis. Compared to CS, LS anastomosis seems associated with a tendency toward a reduced risk for AL and AS even after stratification according to the level of anastomosis (cervical and thoracic). Although surgeon’s own training, learning curve, and experience might direct the choice toward a definite technique for esophagogastric anastomosis, our meta-analysis encourages the implementation of LS anastomosis.