Introduction

Bariatric surgery is rapidly gaining popularity across the USA and across the world. Despite this, there is a relative paucity of data showing what correlates with success and failure after surgery. This work has been previously done for Roux-en-Y gastric bypass but not for SG [1]. Personalized weight loss predictions are of utmost importance for bariatric surgeons due to the large unexplained variances observed in published studies. Weight loss outcomes, the primary outcome for most patients undergoing bariatric surgery, and weight loss predictions have the power to help surgeons educate the patients about procedures best suited for them or make better care plans for the specific patient. This can increase patient satisfaction following bariatric surgery and decrease revisional patients and non-responders.

Methods

Articles were identified by searching PubMed, MEDLINE, and the Cochrane Library using the search query “Sleeve gastrectomy” AND (model OR influence OR mapping OR prediction OR predictive OR predicts OR predictor OR predict OR predictors) for PubMed and similar queries for MEDLINE and the Cochrane Library. Additionally, the references of each article were reviewed for inclusion.

Papers were identified for review by title and abstract. Articles underwent full-text review if they were done on human subjects and referenced, in the abstract, the effect of any variable on weight loss. After full-text review, studies were included if they presented factors that affected weight loss at or greater than 6 months. Studies with more than one bariatric surgery included in the analysis were included if independent statistical analysis was performed for each surgery or if at least 50% of subjects underwent SG.

Each study was independently reviewed twice to collect important information like authors, number of subjects, statistical test performed, significant variables, non-significant variables, and if presented a model to predict weight loss.

Each study was also classified as univariate or multivariate depending on which statistical tests were performed and adequately reported. If both univariate and multivariate methods are reported, the significant factors are reported from the multivariate analysis.

Results

Overall, 1487 studies were identified in our initial search. We eliminated 1373 articles because they did not meet our inclusion criteria, leaving 114 for full-text review. Additionally, from the references of these 114 studies, 6 studies were identified as meeting the criteria for full-text review.

Of the 120 papers that met criteria for full-text review, 72 were excluded because they did not meet inclusion criteria leaving 48 studies for this review. In these studies, eight different statistical methods of identifying variables that affected or were correlated with weight loss were used. Of all studies reviewed, only five presented a model (two multivariate and three univariate) with two other studies presenting only a graphical explanation of their model without any numerical representation and six others presenting an incomplete model for predicting weight loss. Of 14 studies that reported results of a multilinear regression, only two reported the entire mathematical model with coefficients and variables.

There were 16 studies that used multivariate models to find variables that affect weight loss and 32 that used univariate methods to find variables that affect weight loss. Details of all 48 studies can be found in Table 1. From the 48 studies included in this review, 36 different variables were found to correlate with weight loss. All 36 variables and the studies that found them to be significant and non-significant can be found in Table 2. The majority of these variables [45] were analyzed in five or fewer analysis. There were also numerous other variables that were not found to affect weight loss as seen in Table 1. Of the 16 multivariate analyses, only 2 fully reported their finding in a way that they could be verified by other researchers.

Table 1 All 48 studies and collected information
Table 2 All 36 variables found to be significant in univariate and multivariate analysis and which studies found them to be significant

Of the variables that affect weight loss age, BMI, bougie size, DM, gender, HTN, short-term postoperative weight loss, preop weight loss, and sleep apnea (SA) were the only variables studied in more than five studies. Of these nine variables, only six were studied over five times in a multivariate analysis. All variables found to be significant in multivariate studies, and the studies that found them to be significant and non-significant can be found in Table 3.

Table 3 All variables found to be significant in multivariate analysis and which studies found them to be significant

BMI was found to be a negative predictor of weight loss. Seventeen studies investigated BMI as a predictor of weight loss. Twelve of these studies used multivariate analysis to determine the effect of BMI on weight loss. Many methods were reported including the β coefficient from the regression analysis, the odds ratio of failing for every point BMI gained, and viewing BMI as a categorical variable in a regression analysis.

Of the 12 multivariate studies that found BMI to be a significant predictor, 9 found it to be a negative predictor and 2 found it to be a positive predictor. In one of the two studies that BMI was a positive predictor, the model predicted BMI reduction instead of EWL% [23]. When predicting EWL%, the higher the BMI of the patient, the more weight they have to lose to achieve the same EWL%, and often even though larger patients are predicted to lose less EWL%, they often lose more weight.

Cottam et al. reported two models in their paper where they modeled both EWL% and BMI reduction on the same population and BMI was a strong negative predictor for EWL% and a strong positive predictor for BMI reduction [49]. The only other study where BMI was positively associated with weight had a sample size of seven sleeve patients and so is likely due to random error [3]. There was also one study that found that baseline BMI was not predictive of weight loss [27]. However, in this study, the authors included postoperative weight loss which was far more predictive of weight loss than baseline BMI, and so, BMI, while important, did not achieve statistical significance.

Bougie size was studied in 8 of the 49 papers with 3 papers finding that it did affect weight loss and 5 finding that it did not affect weight loss. The range of bougie size was fairly large (32–60) across the studies, and each study compared different sizes. In the three studies that found bougie size to be significant, the bougie size comparisons were 32 to 40, < 36 to > 44, and 32–36 to 38–60, and in the five that found it to not affect weight loss, the bougie comparisons were 32 to 36, 33 to 42, 32 to 42, 40 to 60, and 32 to 44 [7, 11, 18, 24,25,26, 40, 47].

However, these studies that found bougie size to be significant were somewhat misleading. Hady et al. compared bougie sizes 32 and 40 [7]. The initial BMI of the two groups was significantly different at the 0.001 level. Patients in the bougie size of 40 groups started with the mean BMI 5.46 points higher, and at 6 months, the patients with a bougie size of 40 have a mean BMI that is 5.05 higher. This shows that those in the larger bougie size group had a greater BMI reduction; however, since only %EWL, proportional measure, is reported, they appear to lose less weight even though they had a greater BMI reduction.

Discounting this study due to fault in their conclusion, there are two studies that found bougie size to be significant. However, Ellatif et al. only found bougie size to be significant 4 years after surgery. At every follow-up point before 4 years, it was non-significant. At 1 year, the p value was 0.3 [26]. So, during the typical postoperative weight loss period, only one study found bougie size to be significant and the difference was from 32 to 36, and 38–60. They did not report on how many patients received each specific bougie size [40].

Age was studied in 18 different papers. Of these 18 papers, 14 used multivariate analysis to look at the impact of age on weight loss. Of these 14, there was an even split with 7 finding age to affect weight loss and 7 finding that it does not affect weight loss. The studies that did find that age affects weight loss were consistent in reporting age as a negative predictor.

Diabetes was studied in 12 different papers, 10 of which used a form of multivariate analysis. It was found to be a negative predictor in the three multivariate analysis where it was significant and it was not found to be significant in 7 multivariate analysis.

Gender was studied in 16 different papers, 12 of which used a form of multivariate analysis. It was found to be predictive twice out of the 12 studies, and in both of the studies, female gender was found to be predictive of lower weight loss.

Hypertension was studied in eight different papers, all of which were multivariate analysis. It was found to be predictive three times out of the eight studies and in the two studies which reported whether it was a positive or negative predictor; it was a negative predictor. The other study never specified the effect (positive or negative) of HTN on weight loss.

Preoperative weight loss was studied in seven different papers, three of which were multivariate analysis. It was found to be predictive only once out of the three multivariate studies. It was found to be a positive predictor of weight loss in that study. However, in the other two univariate analysis that found it to significantly affect weight loss, it was found to be a positive predictor in one and a negative predictor in the other.

Short-term postoperative weight loss was studied in seven different papers, three of which were multivariate analysis. In all seven studies, it was found to be predictive of weight loss following surgery. The correlation was found as early as 1 month, but the exact best time following surgery varied from paper to paper.

SA was studied in six different papers, all of which were multivariate analysis. It was found to be predictive only once and was a negative predictor. However, all other times, SA was not found to be a predictor of weight loss. A more detailed explanation of why SA is unlikely to be truly predictive of weight loss can be found in Supplementary Materials File 2.

Discussion

While bariatric surgery is not a new field and the sleeve gastrectomy is a well-established procedure within the specialty, there is little known about why patients fail to achieve adequate weight loss after the surgery. Predictive modeling can help to mediate this problem by finding correlation between variables and future weight loss. Alarmingly, only two papers give us enough information to replicate their studies.

This foreknowledge gained through predictive modeling can help doctors immensely as once these problem patients are identified, doctors can start a more targeted approach and warn the patient of the risks of inadequate weight loss. There are two general times to make predictions: the first is presurgery based on the patient’s baseline demographics, and the other is in the short term following the procedure based on the patient’s weight loss curve and the patient’s demographics.

Preoperative predictions regarding the SG are particularly useful since the SG has been shown on average to be a less effective procedure in terms of weight loss when compared to the Roux-en-Y gastric bypass or the duodenal switch. This information is also of use if the surgeon or patient still feels that the SG is the right procedure for them. If they know that the patient has a high likelihood of not completing their goals, they can make a more targeted care plan for the patient. This could include many things such as more follow-up visits, more recommended exercise, phone calls to check up on them, and their nutrition. If a patient and surgeon know before the surgery how much weight the patient is expected to lose, they can consider other surgeries or make a more targeted approach to follow up care so that the patient can succeed.

Postoperative predictions are somewhat less useful as a patient has already undergone the procedure and so cannot choose to start with a more invasive procedure, or with a more targeted care plan. However, they allow surgeons to identify non-responders early in the postoperative treatment and change the individual treatment to better suit the patient. This information can help the surgeon make a targeted care program for the patient still within the typical weight loss window of about the first year.

There are many different ways to predict weight loss following the SG. In simple terms, these tests can be broken into two categories: univariate and multivariate analysis. Univariate analysis like t tests are easily understood and easily misinterpreted. These types of tests very often look promising but are subject to underlying causes and differences in group demographics. Multivariate analysis on the other hand can be somewhat harder to interpret; however, they consider all variables at once which allows the effect of group demographic differences to be mitigated as long as the confounding variable is measured and entered into the model. Multivariate analyses also allow for interacting variables to be more fully studied.

While neither univariate or multivariate analysis will prove causation in SG weight loss prediction, multivariate analysis offers the tools to make prediction models which account for many measurable variables.

These prediction models while not based upon a causational relationship can be very useful in a bariatric center. Patients should be informed before surgery of a personalized weight loss expectation for each procedure that they consider. This will help them choose a procedure appropriate to them when balanced with complication rates or choose a more targeted care plan to help them succeed with the SG. Also, in the early postoperative period, these models can help surgeons identify patients who will not meet their weight loss goals and try preventative measures to help their patients succeed.

Overall, these studies add to the understanding about what variables may correlate with weight loss following the sleeve gastrectomy. There are many variables that are hard to judge if they truly correlate with weight loss because of reporting practice and statistical methods. To rectify this, we recommend a change in how variables are reported so that in future studies, this effect can be measured more accurately.

In some multivariate studies, such as the one reported by Goldenshluger et al., the results are reported as the variables that affect weight loss and an associated p value [30]. This, while useful, fails to tell the reader the true impact of the results. The real question that needs to be answered is to what extent does that variable affect weight loss and in which direction, positive or negative. Also, in six studies, a multilinear regression is reported with all of the variables and beta coefficients; however, they do not report the constant. Without this, the model cannot be validated by other researchers and the entirety of their results is ineffective in a clinical setting.

We recommend reporting the entire model as did Cottam et al. and Goulart et al. so that surgeons can apply these findings in their own practices to improve bariatric care [8, 49]. These studies reported their entire models in the form %EWL = c + B1V1 + B2V2 + … + BiVi where c is a constant from linear regression output, i is the number of significant variables, each V is a different variable labeled 1-I, and each B is the beta coefficient associated with that variable. Another way which would also be equally acceptable is to present the model in a table. An example of an acceptable table can be found in Supplementary Materials File 1. That value c in the equation above is not multiplied by any variable but is necessary for the model to be validated by others.

We also recommend a decrease in the amount of studies which make their final conclusion based upon univariate comparisons. While there are numerous reasons to make univariate comparisons, which are valid and sometimes necessary in bariatric surgery, chopping up the population into subsets over and over again reduces the ability to make accurate conclusions due to the increased likelihood of a type 1 error [50]. If five tests are performed, then there is a 22% chance that incorrectly reports a difference. If you run eight tests, there is a 33% chance that at least one is reported incorrectly reported as significant. A more detailed mathematical explanation can be found in the Supplementary Materials File 2. This high type 1 error rate is why there is need for repeated testing by independent studies to determine the likelihood of a variable affecting weight loss. While this error is present in both univariate and multivariate analysis, univariate comparisons often show differences that are better ascribed to other measured variables where multivariate analysis mitigates this effect.

We also recommend more studies to use interacting variables in their models. It often is true that one variable’s effect on the outcome is mediated by another variable. An example of this would be temperature. The temperature outside does affect how hot people feel, but the degree of the response is affected by the humidity. The same thing can happen in weight loss predictions, for example, Cottam et al. found that DM and HTN were interacting variables in predicting weight loss. HTN and DM individually predicted some decrease in weight loss, but together, the effect was greater than the added effects of the individual comorbidities.

Conclusion

Early postoperative weight loss and baseline BMI are highly correlated with long-term weight loss and should be considered in future weight loss prediction models. All predictive models should be completely reported so that they can be easily validated in other research settings. Current reporting practices are inadequate for continued collaborative research and must change to allow other researcher to use and validate their results.