Introduction

Laparoscopic sleeve gastrectomy (LSG) has contributed to the treatment of extreme obesity [1,2,3], and the number of LSG cases is increasing worldwide [4]. Weight loss after LSG generally takes approximately 1 to 1.5 years to reach maximum, after which, the body weight (BW) increases slightly or plateaus [5,6,7]. Therefore, postoperative follow-up is recommended at 1, 3, 6, and 12 months after surgery [8].

Nevertheless, some patients have insufficient weight loss or weight loss failure after LSG (primary non-responders [PNRs]) [9]. There are various definitions of primary non-response [9,10,11,12,13,14]; the most widely used is the percentage excess weight loss (%EWL) of < 50% at ≥ 1 year after surgery [9,10,11, 13]. However, the use of %EWL for evaluating the postoperative weight loss effect can be disadvantageous because the %EWL tends to be high in cases with a low body mass index (BMI) [15, 16]. Another definition of primary non-response is the percentage total weight loss (%TWL) of < 20% [14]. Recently, it has been recommended that weight loss outcomes after bariatric surgery should be evaluated by %TWL, as the ideal BW does not need to be determined and %TWL does not tend to be low in cases with a high BMI [15, 16].

The rate of revisional surgery has increased to 7.4% for all bariatric surgeries, including LSG [4]. The incidence of revisional surgery after LSG is reported to be 10.4% at ≥ 3 years after LSG and 22.6% at ≥ 10 years [17]. Postoperative weight loss issues, including primary and secondary non-response (so-called weight regain), are the most common reasons for revisional surgery after LSG [17].

Currently, as a clinical problem, even if clinicians follow up patients at 1, 3, 6, and 12 months after surgery in accordance with the guidelines, there are no reference values for weight loss at < 1 year after surgery [8]. Although there are a few reports on post-LSG weight prediction models [12, 18], none of these is based on %TWL. Therefore, the aim of this multicenter study was to find the reference values for weight loss during 1 year after LSG based on %TWL by developing predictive models in Japan.

Patients and Methods

This multicenter retrospective study was conducted at five bariatric institutions in Japan, which were certified by the Japanese Society for Treatment of Obesity. Although this study is retrospective, all data, including demographic characteristics, weight metrics, and comorbidities, were prospectively recorded. Data were collected from each institution for analysis. The institutional ethical review boards of each hospital approved the study (approval number of the principal institutional ethical review board: R2017-030). All procedures performed were in accordance with the ethical standards of the institutional and/or national research committee, and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. The need for written informed consent was waived because of the retrospective nature of the study.

In total, 248 patients who underwent LSG between January 2011 and December 2015 at the five participating institutions and had complete follow-up weight metric data at 1, 3, 6, and 12 months after LSG were enrolled in this study. The criteria for LSG, based on the guidelines for bariatric surgery in Japan [19, 20], were as follows: patients with primary obesity; aged 18–65 years; and who met any of the following conditions, BMI ≥ 35 kg/m2, BMI ≥ 32 kg/m2 with type 2 diabetes mellitus (T2DM), or BMI ≥ 32 kg/m2 with more than 2 comorbidities other than T2DM. Our surgical method for LSG generally followed global standard techniques [2]: five to seven laparoscopic trocars were placed in the upper abdomen, establishing pneumoperitoneum at 12–15 mmHg. The greater omentum was dissected along the greater curvature of the stomach with an ultrasonic energy device or a vessel sealing device. After mobilizing the fundus and exposing the angle of His, the stomach wall was dissected using a 36–45 Fr. bougie with endoscopic linear staplers. The dissection was started 4–5 cm from the oral side of the pylorus up to 1–1.5 cm on the distal side of the gastroesophageal junction. All patients underwent preoperative and postoperative educational programs conducted by a multidisciplinary bariatric surgical team at each institution.

The data collected prospectively were as follows: preoperative comorbidities, including T2DM, hypertension, dyslipidemia, obstructive sleep apnea syndrome (OSAS), mental disorder (MD), and gastroesophageal reflux disease (GERD); insulin use at time of the first visit; Los Angeles classification of GERD [21] before LSG; and anthropometric measurements immediately before surgery, and at 1, 3, and 6 months, 1 year, and 2 years after LSG. Postoperative weight loss was assessed by %TWL (baseline: BW at the first visit) and %EWL (baseline: BW at the first visit; BMI at the ideal BW: 25 kg/m2), and patients with a %TWL < 20% [14] at 1 year after LSG were defined as PNRs.

Descriptive statistics were used to summarize patient characteristics and clinical outcomes. Categorical data are expressed as numbers (proportion), and numerical data are expressed as means ± standard deviation or medians (interquartile range). The comparison between primary responders and PNRs was performed using a Mann–Whitney U or chi-square tests. The relationships among %EWL at 1 year (%EWL1Y), %TWL at 1 year (%TWL1Y), and BMI at the first visit were evaluated by Pearson correlation coefficients. The relationships between %TWL1Y and %TWL at 1 month (%TWL1M), %TWL at 3 months (%TWL3M), and %TWL at 6 months (%TWL6M) were also investigated using Pearson correlation coefficients.

Seven prognostic models for predicting the %TWL1Y were created using multiple regression analyses. Ten preoperative factors, which were previously reported as the factors influencing weight loss or co-morbidities of obesity, were included as independent variables in some models: sex [10], age [10, 11, 22], BMI [11, 22], T2DM [10,11,12], OSAS [11, 12], MD [14, 23], hypertension, dyslipidemia, and GERD. Insulin use [24] was also included because it is the influencing factor for diabetes remission after LSG and Cramer’s V statistic (V = 0.268) indicated a relatively weak association between insulin use and T2DM. In addition to these factors, weight metrics in the early postoperative period (%TWL1/3/6 M) were included as independent variables in some models. The seven models were as follows: model 1, 10 preoperative factors; model 2, %TWL1M only; model 3, 10 preoperative factors + %TWL1M; model 4, %TWL3M only; model 5, 10 preoperative factors + %TWL3M; model 6, %TWL6M only; and model 7, 10 preoperative factors + %TWL6M.

These models were compared based on the adjusted R-squared, as a goodness-of-fit measure, and the area under the receiver operating characteristic (ROC) curve (AUC) for discriminating PNRs from responders, as a measure of discriminative performance. The most satisfactory model was used to create a parsimonious model. The parsimonious model was the model with the smallest corrected Akaike’s information criterion value among models with all possible combinations of the independent variables from the original model.

Finally, the parsimonious model was submitted to a ROC curve analysis to determine appropriate cutoff values for %TWL during 1 year after surgery. Four cutoff values were calculated for predicted %TWL1Y, for discriminating PNRs from responders. Specifically, one based on Youden’s index, one attaining 80% sensitivity, one attaining 95% specificity, and one attaining 98% specificity were calculated. In addition, a ROC analysis was performed to determine cutoffs of the actual value of %TWL3M (rather than the predictive value of %TWL1Y).

All statistical analyses were performed using SPSS version 25.0, for Windows (IBM Corp., Armonk, NY, USA). p < 0.05 was considered statistically significant.

Results

Patient characteristics and clinical outcomes are shown in Table 1. The median age, BW, and BMI at the first visit were 41 years, 113 kg, and 41.4 kg/m2, respectively. Median %EWL1Y and %TWL1Y were 76.4% and 30.1%, respectively. There were 28 (11.3%) PNRs: PNRs had significantly lower BMI at the first visit and lower %TWL at each time point than did primary responders (Table 1). Three patients (1.2%) underwent revisional surgery; one was a primary responder and two were PNRs. The intervals between LSG and revisional surgery in these patients were 6.7, 7.2, and 7.3 years, respectively.

Table 1 Patient characteristics and clinical outcomes; comparison between primary responders and primary non-responders

The %EWL1Y and %TWL1Y were positively correlated (r = 0.606) (Fig. 1a). The %EWL1Y and BMI at the first visit were negatively correlated (r =  − 0.516) (Fig. 1b), while the %TWL1Y and BMI at the first visit were positively correlated (r = 0.263) (Fig. 1c). The %TWL after LSG gradually increased over 1 year (Fig. 2a). Positive correlations were found between %TWL1Y and %TWL1/3/6 M, with gradually increasing correlation coefficients (r = 0.277/ 0.545/ 0.831, respectively) (Fig. 2bd).

Fig. 1
figure 1

a Relationship between %TWL1Y and %EWL1Y (r = 0.606, 95% CI: 0.521, 0.679). b Relationship between BMI at the first visit and %EWL1Y (r =  − 0.516, 95% CI: − 0.602, − 0.419). c Relationship between BMI at the first visit and %TWL1Y (r = 0.263, 95% CI: 0.143, 0.375). %TWL 1Y, percentage total weight loss at 1 year; %EWL 1Y, percentage excess weight loss at 1 year; BMI, body mass index; r, correlation coefficient; CI, confidence interval

Fig. 2
figure 2

a Change in the %TWL after LSG. b Relationship between %TWL1Y and %TWL1M (r = 0.277, 95% CI: 0.158, 0.389). c Relationship between %TWL1Y and %TWL3M (r = 0.545, 95% CI: 0.451, 0.627). d Relationship between %TWL1Y and %TWL6M (r = 0.831, 95% CI: 0.789, 0.866). %TWL, percentage total weight loss; 1 M, 1 month; 3 M, 3 months; 6 M, 6 months; 1Y, 1 year; LSG, laparoscopic sleeve gastrectomy; r, correlation coefficient; CI, confidence interval

Fitted multiple regression models 1–7 and AUCs of ROC curves representing performance of the predicted values of each model in discriminating PNRs from responders are shown in Table 2. For models 1–3, the best adjusted R-squared was 0.148 and the best AUC was 0.725. The %TWL3M was significant in both 3-month models (models 4 and 5), and sex, BMI, T2DM, MD, and OSAS were significant in model 5. Adjusted R-squared of models 4–7 were 0.295, 0.385, 0.690, and 0.718, respectively, and AUCs were 0.803, 0.858, 0.919, and 0.934, respectively.

Table 2 Eight models for predicting primary non-response after LSG

Models 6 and 7 (models including %TWL6M) had high AUCs, but could not be used until the sixth month. Therefore, model 5, which could be used earlier and had a reasonable discrimination performance, was determined to be the most satisfactory. Thus, model 8 was created as a parsimonious model based on model 5; model 8 included seven independent variables, with adjusted R-squared of 0.375 and AUC of 0.846 (Table 2 and Fig. 3ab). The formula for model 8 is:

$$Y=\hspace{0.17em}\hspace{0.17em}-\hspace{0.17em}2.4\hspace{0.17em}\times \hspace{0.17em}Sex\hspace{0.17em}+\hspace{0.17em}0.2\hspace{0.17em}\times \hspace{0.17em}BMI\hspace{0.17em}-\hspace{0.17em}2.5\hspace{0.17em}\times \hspace{0.17em}T2DM\hspace{0.17em}-\hspace{0.17em}3.2\hspace{0.17em}\times \hspace{0.17em}OSAS\hspace{0.17em}-\hspace{0.17em}2.0\hspace{0.17em}\times \hspace{0.17em}MD\hspace{0.17em}-\hspace{0.17em}3.8\hspace{0.17em}\times \hspace{0.17em}Insulin use\hspace{0.17em}+\hspace{0.17em}0.9\hspace{0.17em}\times \hspace{0.17em}\mathrm{\%}TWL3M\hspace{0.17em}+\hspace{0.17em}5.8$$

where Y = the predicted %TWL1Y, Sex = 0 for female, 1 for male; BMI = BMI at the first visit in kg/m2; T2DM = 1 if positive, and 0 if negative, for T2DM; OSAS = 1 if positive, and 0 if negative, for OSAS; MD = 1 if positive, and 0 if negative, for a MD; Insulin use = 1 if positive and 0 if negative.

Fig. 3
figure 3

a Relationship between %TWL at 1 year and predicted value in model 8 (parsimonious model with %TWL at 3 months). b ROC curve of model 8. c ROC curve of model 4, with %TWL at 3 months only. %TWL, percentage total weight loss; Y, year; ROC, receiver operating characteristic; R.2, coefficient of determination; AUC, area under the curve

The ROC curves of model 8 (7 preoperative factors + %TWL3M) and model 4 (%TWL3M only) are shown in Fig. 3bc. The AUC of model 4 was inferior, but comparable, to that of model 8 (Table 3). In model 8, the cutoff values of the predicted %TWL1Y were 26% (using Youden’s index), 30% (attaining 80% sensitivity), 25% (attaining 95% specificity), and 22% (attaining 98% specificity) (Table 3). The cutoff values of the actual %TWL3M values were 18% (using Youden’s index), 22% (attaining 80% sensitivity), 16% (attaining 95% specificity), and 15% (attaining 98% specificity) (Table 3).

Table 3 Cutoffs for predicted value using parsimonious models, and actual value of %TWL at 3 months

Discussion

Although primary non-response after LSG is known to occur [9, 12], there are currently no reference values to predict PNR in the early postoperative period. In the present study, we identified factors that affect postoperative weight loss and developed a model to predict primary non-response. Using this model, patients at risk for primary non-response can be determined at 3 months after LSG. In addition, the AUC for discriminating PNRs from responders using the actual value of %TWL3M (rather than the predictive value of %TWL1Y) was comparable to that of the parsimonious model at 3 months. Therefore, the actual %TWL3M value may be useful for discriminating PNRs from responders in clinical settings.

Although there are numerous reports on models for predicting weight loss after bariatric surgery [12, 18, 26,27,28], there are only a few models for predicting weight loss after LSG [12, 18]. For example, Cottam et al. [12] developed a chart that predicts a primary response, defined as an %EWL > 55%, using %EWL at 1 and 3 months. Another study by Cottam et al. [18] developed models that predicted %EWL and BMI decrease at 1 year after surgery using four preoperative factors (age, BMI, diabetes mellitus, and hypertension). However, these models are based on %EWL, not %TWL. The models developed in the present study used %TWL to predict a primary non-response. To the best of our knowledge, there are no reports on models developed for predicting primary non-response after LSG using %TWL.

In this study, the parsimonious model at 3 months after LSG can determine patients at risk for primary non-response using the optimally set cutoff values. However, we found the actual %TWL3M value predicted %TWL1Y, which was inferior to, but comparable to, the developed model. Actual %TWL3M value has the advantage of easier prediction in clinical settings.

Postoperative weight loss issues are the most common reasons for revisional surgery after LSG [17]. The model created here does not discriminate patients with revisional surgery. It was difficult to create such a model because only three cases underwent revisional surgery in our cohort. However, since PNR can be predicted at 3 months using our model, it is expected that interventions such as strengthening patient education for avoiding revisional surgery or considering revisional surgery can be implemented from an early stage.

The present study has several limitations, including its retrospective nature, lack of validation for the models, and short duration of observation. Therefore, we developed parsimonious models to prevent overfitting in our cohort. However, further studies are needed to validate the models in the longer term. Another limitation was that the study cohort had median preoperative BMI of 41 kg/m2 and therefore this may not have relevance to populations worldwide with higher BMI. A further limitation was the discrepancy between predicted values of the models and actual %TWL values at 1 year. A possible reason for this discrepancy may be that other factors should be included as independent factors in the multiple regression analysis to predict more accurately %TWL1Y, such as psychological factors [14, 23], surgical factors [29], endocrine factors [30], and lifestyle habits [22]. Although the predicted values of the model in the present study deviated from the actual value, the cutoffs were considered useful reference values for predicting primary non-response in the early period after LSG.

Conclusions

We developed a predictive model based on %TWL to determine patients at high risk for primary non-response at 1 year after LSG. We found that actual %TWL3M value predicts %TWL1Y comparable to the developed predictive model. Using the model and actual %TWL3M, appropriate reference values for %TWL during 1 year can be set so patients who may be PNRs can be determined at 3 months after LSG, which can aid medical staff and patients in considering medical or intense interventions such as revisional surgery.