Introduction

Since 1997, obesity has been officially recognized by the World Health Organization as a worldwide epidemic. Morbid obesity and its associated comorbidities have now placed a medical and economic burden on the world [1,2,3]. Bariatric surgery could ensure effective weight loss and maintain weight at a relatively satisfactory level [4]. In addition, it is effective for comorbidities of obesity, such as T2DM and hypertension [4].

Surgical treatment for obesity is either malabsorptive, restrictive, or a combination of the two. LSG and LGCP are the most commonly used restrictive procedures for morbid obesity [5]. In recent decades, LSG has been accepted as an effective and safe bariatric procedure and has been widely performed [6]. It involves resection of most of the stomach, including the entire fundus, along the long axis of the greater curvature, leaving the remaining stomach from the gastric antrum 2–6 cm above the pylorus to the gastroesophageal junction in a “banana shape.” The procedure reduces the volume of the stomach to approximately 180–200 ml, leading to a reduction in food intake [7]. LGCP, on the other hand, is another bariatric procedure that decreases the volume of the stomach by plication of the greater curvature [8, 9]. It could achieve the same weight loss effect as LSG to a certain extent through stomach restriction. Compared to LSG, LGCP has the advantages of technical simplicity and reversibility, preservation of the integrity of the stomach, lower leakage rate, and lower operative costs [10, 11].

Both procedures have advantages and disadvantages and thus need to be carefully examined and compared. However, there are limited comparative trials of the two procedures. The previously published meta-analysis [12] included only 12 comparative studies, and the sample sizes of some of the included studies were quite low. Suarez et al. [13] have just published an in-depth and extensive systematic review of the literature on LSG vs LGCP. However, only 9 of the included 28 studies compared the two procedures. Furthermore, due to the descriptive nature, their study was not subjected to formal statistical analysis. Thus, we conducted this updated meta-analysis to compare the efficacy, safety, operation duration, length of stay, and cost between LSG and LGCP.

Materials and Methods

Study Protocol

This meta-analysis was conducted in accordance with the principles described in the Cochrane Handbook and the PRISMA recommendations [14]. The present study was not registered in any database.

Eligibility Criteria

The inclusion criteria were as follows: (1) study design: any type of comparative study, (2) study population: patients undergoing LGCP or LSG, (3) intervention: comparison between LGCP and LSG, (4) outcomes: inclusion of information on efficacy or safety, (5) availability of the full text, and (6) publication in English. When a study reporting the same patient cohort was included in several publications, only the most recent or complete study was selected. The exclusion criteria included overlapping data and publications in the form of case series, case reports, editorials, letters, conference abstracts, and expert opinions.

Search Strategy

Electronic literature search was performed on Medline, EMBASE, Scopus, and the Cochrane Central inception to December 2020. The language was limited into English. The search terms were as follows: (plication OR imbrication OR gastroplication OR plicature OR verticalplication) AND (sleeve OR vertical) AND (laparoscopy OR laparoscopic). Ongoing clinical trials were identified from the clinicaltrials.gov website. References of the chosen literatures and published systematic reviews were reviewed manually for additional relevant studies.

Study Selection and Data Extraction

Two reviewers (Haoran Li and Weiqiang Wang) independently and blindly evaluated all titles and abstracts for studies that met the inclusion criteria, and excluded any articles that clearly did not meet the selection criteria. The potential inclusions were checked by one author (Xu Wang). Full reports (where available) of potentially relevant studies were retrieved and checked for eligibility. Then data extraction was performed independently and blindly by two investigators (Haoran Li and Weiqiang Wang) according to a standardized form. Efforts were made to get exact numerical data from authors via e-mail if not available in articles. Disagreements were resolved by two reviewers or even the third one if these two reviewers’ decisions could not reach a consensus.

The following data were extracted from included studies: %EWL, BMIL, improvement of comorbidities, major and minor complications, operative time, hospital stay, and cost. Complications were categorized as “minor” or “major” based on the original authors’ discretion, or the Clavien-Dindo scale if not explicitly categorized (grades I and II categorized as mild, grades III and IV categorized as major) [15]. In addition, characteristics of articles including study design, publication year, study location, baseline demographics, baseline BMI, and follow-up were routinely extracted. It is worth emphasizing that the exact numbers of cases at different follow-up time points were retrieved and used for the data analyses of %EWL and BMIL if the article clearly provided, so as to improve the reliability of the meta-analysis results.

Quality Assessment

Two authors (Haoran Li and Weiqiang Wang) assessed the risk of bias independently. We used the revised Jadad rating scale [16] to evaluate the methodological quality of randomized control trials (RCTs), with high-quality studies scoring 4–7 points. The Newcastle–Ottawa scale was used to judge study quality, as recommended by the Cochrane Collaboration [17]. This scale allocates a maximum of nine points with a score of ≥6 being of high quality. Any dispute was solved unanimously through discussion.

Statistical Analysis

Comprehensive meta-analysis was performed using Review Manager Version 5.3. Continuous variables were pooled using weighted mean difference (WMD) with 95% confidence interval (CI), while odds ratio (OR) with 95% CI was applied to perform the statistical analysis for dichotomous variables. A p value of <0.05 was regarded as statistically significant. The χ2 and I2 statistics were adopted to analyze the statistical heterogeneity across the included studies. An I2 value of 25 and 75% indicated low and high levels of heterogeneity, respectively [18]. We used fixed-effect models or random-effect models for comparisons. We considered statistically significant heterogeneity to be present at a p value of <0.10 with the use of a random-effect model.

Risk of Bias Across Studies

The funnel plot of the primary endpoint was visually inspected, in order to determine the presence of publication bias.

Additional Analyses

No sensitivity analysis was generated.

Results

Search Yields

We got 132 search results from Medline, 446 results from EMBase, 60 results from the Cochrane Central Register of Controlled Trials, and 137 results from Scopus. Additional, one study was identified in the gray literature search. All titles and abstracts were screened and 753 studies were excluded. The remaining 23 trials were submitted to a full-text review in order to assess consistency with the predefined eligibility criteria. The full-text screening resulted in the identification and removal of 5 articles (3 repeated publications from the same medical center and 3 irrelevant records). Eighteen studies [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36] were finally selected for meta-analysis. The flow chart for study selection is shown in Fig. 1.

Fig. 1
figure 1

Flow diagram of study selection

Study Characteristics

The available study designs consisted of 6 retrospective studies, 7 prospective studies, and 5 RCTs. The analysis was performed on a total of 1329 patients, of whom 705 underwent LSG and 624 underwent LGCP. Studies were conducted in Spain, Romania, Czech Republic, Iran, Korea, Turkey, France, Ukraine, Italy, India, Egypt, and China. In total, 11 trials were conducted in a single institution, and 6 studies incorporated multiple surgical centers, except for one study that did not specify the institution. The study completion year spanned from 2011 to 2020. Postoperative follow-up extended from 1 month up to 3 years. All studies provided age, BMI, and gender data with a comparison of complete baseline profiles between two treatment groups. The baseline characteristics of included articles are represented in Table 1. Moreover, some of the included studies provided the exact number of cases at different follow-up time points, as summarized in Table 2.

Table 1 General characteristics and baseline variables of the included studies
Table 2 Number of cases at different follow-up time points

Quality of Included Studies

The quality assessment of the included studies is illustrated in Tables 3 and 4. In general, all the 13 cohort studies were considered to be of high quality by gaining a score of ≥ 6 points according to the Newcastle–Ottawa scale, and the quality of 4 included RCTs was also satisfactory by gaining a score of ≥ 4 points in accordance with the revised Jadad rating scale. The quality level of the remaining 1 RCT was estimated to be low, with a score of 2 points.

Table 3 Methodological quality assessment of included cohort studies
Table 4 Methodological quality assessment of included RCTs

Meta-Analysis Results

% EWL

This outcome was evaluated after 1, 3, 6, 12, 18, 24, and 36 months. Our meta-analysis demonstrated a significantly higher %EWL after LSG than LGCP at the follow-up time points of 1 month (MD 4.48, 95% CI [0.43, 8.53], I2 0%), 3 months (MD 5.37 ,95% CI [1.59, 9.16], I2 56%), 6 months (MD 8.47, 95% CI [5.81, 11.13], I2 24%), 12 months (MD 13.23, 95% CI [9.93, 16.54], I2 31%), 18 months (MD 10.63, 95% CI [3.72, 17.54], I2 69%), and 24 months (MD 19.62, 95% CI [1.15,38.08], I2 94%). However, no significant difference was found between these two procedures at 36 months (MD 24.63, 95% CI [−1.94, 51.21], I2 93%). High heterogeneity was noted in the 3-, 18-, 24- and 36-month analyses. Random effect was used, because it was not possible to exclude outliers. The forest plots are shown in Figs. 2, 3, 4, 5, 6, 7 and 8.

Fig. 2
figure 2

Forest plot of comparison between LSG and LGCP in terms of %EWL at 1 month

Fig. 3
figure 3

Forest plot of comparison between LSG and LGCP in terms of %EWL at 3 months

Fig. 4
figure 4

Forest plot of comparison between LSG and LGCP in terms of %EWL at 6 months

Fig. 5
figure 5

Forest plot of comparison between LSG and LGCP in terms of %EWL at 12 months

Fig. 6
figure 6

Forest plot of comparison between LSG and LGCP in terms of %EWL at 18 months

Fig. 7
figure 7

Forest plot of comparison between LSG and LGCP in terms of %EWL at 24 months

Fig. 8
figure 8

Forest plot of comparison between LSG and LGCP in terms of %EWL at 36 months

BMIL

This outcome was evaluated at 6, 12, 24, and 36 months. There was no difference between interventions at 6 months (MD 1.85, 95% CI [− 0.06, 3.76], I2 89%). The results were favorable for LSG at 12 months (MD 3.04, 95% CI [1.35, 4.74], I2 72%), 24 months (MD 5.25, 95% CI [3.65, 6.85], I2 64%), and 36 months (MD 6.03, 95% CI [4.73, 7.32, I2 13%). Random effect was used due to high heterogeneity in the 6-, 12-, and 24-month analyses. The forest plots are shown in Figs. 9, 10, 11 and 12.

Fig. 9
figure 9

Forest plot of comparison between LSG and LGCP in terms of BMIL at 6 months

Fig. 10
figure 10

Forest plot of comparison between LSG and LGCP in terms of BMIL at 12 months

Fig. 11
figure 11

Forest plot of comparison between LSG and LGCP in terms of BMIL at 24 months

Fig. 12
figure 12

Forest plot of comparison between LSG and LGCP in terms of BMIL at 36 months

Comorbidity Improvement

Diabetes, hypertension, dyslipidemia, and sleep apnea were evaluated in this meta-analysis. More specifically, no statistically significant differences between the two groups regarding improvement of diabetes (OR 1.55, 95% CI [0.55, 4.36], I2 0%), hypertension (OR 1.52, 95% CI [0.67, 3.03], I2 0%), dyslipidemia (OR 1.29, 95% CI [0.41, 4.00], I2 0%), and sleep apnea (OR 1.14, 95% CI [0.35, 3.70], I2 0%), were identified. We resorted to a fixed-effect model because there was no remarkable heterogeneity. The forest plots are shown in Fig. 13, 14, 15 and 16.

Fig. 13
figure 13

Forest plot of comparison between LSG and LGCP in terms of diabetes improvement

Fig. 14
figure 14

Forest plot of comparison between LSG and LGCP in terms of hypertension improvement

Fig. 15
figure 15

Forest plot of comparison between LSG and LGCP in terms of dyslipidemia improvement

Fig. 16
figure 16

Forest plot of comparison between LSG and LGCP in terms of sleep apnea improvement

Major Complications

In Figs. 17, 18, 19 and 20, the pooled results of the eligible studies concerning the major complications are depicted. There are no differences between LSG and LGCP with regard to bleeding (OR 1.37, 95% CI [0.61, 3.09], I2 0%), stenosis (OR 0.57, 95% CI [0.23, 1.38], I2 0%), leak (OR 1.58, 95% CI [0.61, 4.15], I2 0%), and mortality (OR 1.39, 95% CI [0.09, 22.55], I2 35%). Heterogeneity was noted in the analysis of mortality and random effect was applied.

Fig. 17
figure 17

Forest plot of comparison between LSG and LGCP in terms of bleeding

Fig. 18
figure 18

Forest plot of comparison between LSG and LGCP in terms of stenosis

Fig. 19
figure 19

Forest plot of comparison between LSG and LGCP in terms of leak

Fig. 20
figure 20

Forest plot of comparison between LSG and LGCP in terms of mortality

Minor Complications

Figures 21, 22, 23, 24 and 25 summarize the data regarding the comparisons between the two groups, in terms of minor complications. No differences were noted with regard to GERD (OR 1.38, 95% CI [0.67, 2.82], I2 0%), wound infection (OR 1.70, 95% CI [0.22, 13.27], I2 0%), port-site hernia (OR 1.26, 95% CI [0.28, 5.77], I2 0%), and abdominal pain (OR 0.48, 95% CI [0.21, 1.07], I2 0%). Statistically significant lower rate of nausea and vomiting (OR 0.36, 95% CI [0.23, 0.54], I2 0%) in the LSG group was recorded.

Fig. 21
figure 21

Forest plot of comparison between LSG and LGCP in terms of nausea and vomiting

Fig. 22
figure 22

Forest plot of comparison between LSG and LGCP in terms of GERD

Fig. 23
figure 23

Forest plot of comparison between LSG and LGCP in terms of wound infection

Fig. 24
figure 24

Forest plot of comparison between LSG and LGCP in terms of port-site hernia

Fig. 25
figure 25

Forest plot of comparison between LSG and LGCP in terms of abdominal pain

Operative Time

Twelve trials included this outcome for meta-analysis, but two were excluded because of lack of standard deviation [21, 33]. Eight hundred twenty-seven patients (435 in LSG group, 392 in LGCP group) from two studies were finally analyzed. A high heterogeneity was noted, and it was not possible to exclude outliers. Consequently, there is no difference for both surgical therapies with random effect (MD1.42, 95% CI [−9.49, 12.34], I2 94%). The forest plot is shown in Fig. 26.

Fig. 26
figure 26

Forest plot of comparison between LSG and LGCP in terms of operative time

Hospital Stay

Thirteen studies reported this information, but two could not be included because of lack of standard deviation [20, 32]. Eight hundred fourteen patients (419 in LSG group, 395 in LGCP group) were finally analyzed. A high heterogeneity was noted in the initial analysis. After identifying and excluding the outlier [36], the heterogeneity remained. A random effect was applied for analysis, and no significant difference was found (MD 0.60, 95% CI [− 1.18, 1.37], I2 97%). The forest plot is shown in Fig. 27.

Fig. 27
figure 27

Forest plot of comparison between LSG and LGCP in terms of hospital stay

Cost

Figure 28 summarizes the data regarding the comparison between the two groups, in terms of cost. A total of three articles reported this result. One hundred fifty-nine patients (80 in LSG group, 79 in LGCP group) were finally analyzed. A high heterogeneity was noted in the initial analysis. After identifying and excluding the outlier [23], the heterogeneity remained. A random effect was used for analysis, and LSG showed inferior result compared with LGCP considering cost (SMD 11.69, 95% CI [3.71, 19.67], I2 97%).

Fig. 28
figure 28

Forest plot of comparison between LSG and LGCP in terms of cost

Publication Bias

Funnel plot of the primary outcome (i.e., %EWL at 6 months) was created to access the publication bias of the literature. Through visual inspection, the shape of the funnel plot did not reveal any evidence of obvious asymmetry (Fig. 29).

Fig. 29
figure 29

Funnel plot of comparison: LSG vs LGCP in terms of %EWL at 6 months. MD mean difference

Discussion

Over the past few decades, LSG has been regarded as an effective and safe treatment for morbid obesity, not only to significantly reduce weight, but also for improving the associated comorbidities [37,38,39]. However, its irreversibility involving gastrectomy, operative complications, and partial mortality were the concern [40]. More specifically, the overall incidence of serious complications after LSG was estimated at 0.89%, which was principally attributed to the leak [41].

As an alternative to LSG, LGCP is performed by the plication of greater curvature with sutures. It could achieve the same weight loss effect as LSG to a certain extent through stomach restriction. As compared to LSG, LGCP could offer advantages including reversibility, preservation of the integrity of the stomach, and alleviating the risk of gastric content leak of the anastomosis [27]. However, its reliability is controversial. Some studies showed a comparable %EWL in the short or medium term for both procedures [42,43,44,45]. Nevertheless, some other trials have shown that LGCP is inferior to LSG regarding %EWL [46, 47]. Through our meta-analysis, although LGCP resulted in significant weight loss, LSG outperformed it in terms of %EWL and BMIL in the first 2 years. At 3 years postoperatively, both groups had achieved a similar %EWL; however, the results of BMIL were favorable for LSG. In an overall view, compared to the previously published meta-analysis, we provided further evidence of the advantage of LSG in terms of mid-term weight loss and preliminarily indicated that long-term weight loss (3 years) seems to be superior in the LSG patients. However, due to the low number of studies included and great heterogeneity of %EWL analysis, long-term results from our data have to be considered with caution.

The difference of weight loss may be due to the following major reasons. First of all, ghrelin plays a crucial role in regulating appetite and body weight [48,49,50,51]. This hormone is mainly produced by P/D1 cells in the fundus of the stomach [52]. Ghrelin secretion can cause hunger by increasing appetite, emptying the stomach, and moving the intestines. It decreased significantly after LSG but increased after LGCP [53]. This is due to the resection of the fundus of the stomach, where most ghrelin is produced, in LSG [54].

Another gastrointestinal hormone, peptide tyrosine tyrosine (PYY), must also be considered. It is produced in L cells located in the colon and distal ileum. This hormone has the effect of reducing appetite and sense of hunger. In Xanthakos’s trial [52], 12 obese and 12 lean human subjects received intravenous injection of PYY. The result showed a 30% reduction in single meal intake in both groups. In addition, PYY reduced ghrelin levels and enhanced its effect on reducing hunger. Karamanakos [55] reported that PYY did not increase after a trial meal after LGCP; however, PYY increased after LSG; this leads to better weight loss after LSG.

Another possible explanation for the different efficiencies of weight loss may be the technical difference between the two procedures. We must consider that the reduction in gastric volume in LGCP is mainly caused by the plication of the greater curvature of the stomach with sutures and the “space-occupying effect” of the mucosal in-folding. Abdelbaki et al. [56] performed gastroscopy on some patients who regained weight more than 1 year after LGCP. They found moderate atrophy of the infolded gastric mucosa. This mechanism may lead to loosening of the running suture and a diminished filling effect from the gastric mucosal in-folding, leading to a relative gastric enlargement and subsequent weight regain in patients undergoing gastric plication.

All of the above factors could explain why LSG is superior to LGCP considering weight loss. In addition, the improvement of obesity-related comorbidities also plays a non-negligible role in the evaluation of efficacy. In our meta-analysis, both procedures were effective treatments for obese patients with diabetes, hypertension, dyslipidemia, and sleep apnea, with no significant difference between the two groups. This may be due to multiple factors such as hormonal changes [57], BMI loss, and decrease in carbohydrate absorption [58] after bariatric surgery.

LGCP, considered as a less invasive technique, was expected to be better tolerated than LSG. We summarized the major complications as follows: bleeding, stenosis, leak, thromboembolism, and mortality. Other complications were secondary and included nausea and vomiting, GERD, wound infection, port-site hernia, sialorrhea, and abdominal pain. Contrary to expectation, this meta-analysis failed to show any advantage of LGCP in reducing complications. More specifically, at the time of classification, LGCP displayed a statistically significant higher rate of nausea and vomiting. This adverse effect may be attributed to the double row stitching, the large stomach folds imbricated in the gastric lumen [59], and mucosal edema due to venous stasis after LGCP [60]. In terms of major complications, bleeding, stenosis, and mortality outcomes were comparable between the two procedures. It should be noted that out of all 18 articles selected, only 12 cases of leakage were reported, with no significant differences between the two procedures. In addition to this, two of the selected articles reported leaks only after LGCP [31, 33]. As leakage is a serious complication of LSG, it is important to demonstrate that complications are not reduced with the use of LGCP.

Besides these, it is important to note that the choice between LSG and LGCP does not depend solely on their effectiveness and safety, but also on the comparison of operative time, length of stay, and economic costs. The expected outcome of LSG is a shorter operative time [11]. However, according to our pooled data, LSG was not associated with statistically shorter operative time. For this outcome interpretation, we must take into account the surgical technique. In the performance of LSG, some surgeons used reinforcement sutures in the stapling line, which may have prolonged the operation and reduce the difference between the two groups. Major complications are considered to be a crucial factor affecting the length of hospital stay. This meta-analysis demonstrated no significant difference between the two procedures regarding hospital stay, which was the same as the analysis of major complications. The cost of LSG was significantly higher as demonstrated in our meta-analysis. However, due to the high heterogeneity level, the low number of studies included, and the fact that one paper included [23] shows massive cost difference ($17100 vs $2620), the significance of the analysis was confined and we could not make a strong conclusion. It is worth emphasizing that the analysis of cost must take into account a variety of factors. Suarez et al. [13] have done important work in terms of comparing the costs of the two procedures. They examined the price of operative consumables (i.e., endoscopic stapler and sutures), length of stay, duration of surgery, and reoperation rates to evaluate the impact of these factors on the total cost. They draw a conclusion that LGCP is a little bit cheaper than LSG; however, the financial burden of treating the complications may lead to an underestimation of the overall cost of LGCP.

This meta-analysis has several limitations. Firstly, most of the included studies reported short- or mid-term results. Long-term result on follow-up of 4 years or more was lacking. Secondly, the small sample sizes of some of the included trials may have contributed to bias. Furthermore, of all 18 included studies, only 5 were RCTs, which may have biased the selection and detection. Last but not the least, only three of the included studies compared the costs of the two procedures. Therefore, the reliability of the cost advantage of LGCP was limited, which may be important for its popularity in economically underdeveloped areas.

Conclusion

The advantages of LSG in terms of short- and mid-term weight loss and minor complications were documented. However, both procedures were similar in terms of improvement of comorbidities, major complications, operation duration, and hospital stay. The cost-effectiveness of LGCP was better than that of LSG. However, given the limitations mentioned above, we should be cautious in drawing conclusions. More prospective studies with large samples and long-term follow-up are needed in the future, as well as more clinical trials on the charges for the procedures.