Prediction of thirty-day morbidity and mortality after laparoscopic sleeve gastrectomy: data from an artificial neural network

Wise, Eric S.; Amateau, Stuart K.; Ikramuddin, Sayeed; Leslie, Daniel B.

doi:10.1007/s00464-019-07130-0

2019 SAGES Oral
Published: 30 September 2019

Volume 34, pages 3590–3596, (2020)

Surgical Endoscopy

Eric S. Wise¹,
Stuart K. Amateau²,
Sayeed Ikramuddin¹ &
Daniel B. Leslie¹

Abstract

Background

Multiple patient factors may convey increased risk of 30-day morbidity and mortality after laparoscopic vertical sleeve gastrectomy (LVSG). Assessing the likelihood of short-term morbidity is useful for both the bariatric surgeon and patient. Artificial neural networks (ANN) are computational algorithms that use pattern recognition to predict outcomes, providing a potentially more accurate and dynamic model relative to traditional multiple regression. Using a comprehensive national database, this study aims to use an ANN to optimize the prediction of the composite endpoint of 30-day readmission, reoperation, reintervention, or mortality, after LVSG.

Methods

A cohort of 101,721 LVSG patients was considered for analysis from the 2016 Metabolic and Bariatric Surgery Accreditation and Quality Improvement Program national dataset. Select patient factors were chosen a priori as simple, pertinent and easily obtainable, and their association with the 30-day endpoint was assessed. Those factors with a significant association on both bivariate and multivariate nominal logistic regression analysis were incorporated into a back-propagation ANN with three nodes each assigned a training value of 0.333, with k-fold internal validation. Logistic regression and ANN models were compared using area under receiver-operating characteristic curves (AUROC).

Results

Upon bivariate analysis, factors associated with 30-day complications were older age (P = 0.03), non-white race, higher initial body mass index, severe hypertension, diabetes mellitus, non-independent functional status, and previous foregut/bariatric surgery (all P < 0.001). These factors remained significant upon nominal logistic regression analysis (n = 100,791, P < 0.001, r² = 0.008, AUROC = 0.572). Upon ANN analysis, the model fit on the training set (80% of patients) was more accurate than logistic regression (n = 80,633, r² = 0.011, AUROC = 0.581), and this was confirmed by the validation set (n = 20,158, r² = 0.012, AUROC = 0.585).

Conclusions

This study identifies a panel of simple and easily obtainable preoperative patient factors that may portend increased morbidity after LVSG. Using an ANN model, prediction of these events can be optimized relative to standard logistic regression modeling.

Laparoscopic vertical sleeve gastrectomy (LVSG), since its Medicare approval as a stand-alone procedure in 2012, has rapidly become the most commonly performed primary bariatric operation in the USA, surpassing the Roux-en-Y gastric bypass (RYGB). The safety profile of LVSG has thus far been excellent, with acceptably low rates of postoperative complications including infection, thromboembolism, stricture, bleeding, and staple line leak. In 2012, the Metabolic and Bariatric Surgery Accreditation and Quality Improvement Program (MBSAQIP) was formed to establish process and outcome measures for accreditation. Among the standards used to quantify and define safety are 30-day postoperative outcomes, notably, readmission, reoperation, reintervention (e.g. endoscopy), or mortality.

Nationwide data on LVSG is contained within the MBSAQIP participant use data file (PUF), an extensive database containing information on operations performed at all the accredited bariatric centers in the USA. In the dataset from operations performed in 2016, almost 800 different centers submitted information on over 186,000 unique bariatric operations [1]. This robust cohort is sufficiently powered to address difficult questions in bariatric surgery in an effort to improve cost, safety, and efficiency. Its data has potential to drive changes in practice with regard to operative technique, preoperative patient preparation and guidance, as well as best practices postoperatively.

Artificial neural networks (ANNs) are complex modeling systems in which an algorithm is generated by teaching the system to predict an outcome. Although far more widely used in other fields such as computer engineering, ANNs are only now emerging as potentially useful tools for projecting clinical outcomes [2]. In surgery, ANNs have been successfully developed to predict survival after liver transplantation, the diagnosis of acute appendicitis, and prompt extubation after coronary bypass, as a few representative examples [3,4,5]. Despite their limitations, ANNs can offer predictive modeling with higher fidelity than statistical techniques commonly used in surgery, such as multiple regression. In this investigation, we aim not only to use the 2016 MBSAQIP dataset to identify the panel of risk factors that may portend the composite endpoint of 30-day morbidity and mortality after LVSG, but also to optimize the modeling of the variance contained within the identified risk factors using an ANN.

Materials and methods

For this study, all patients were taken from the 2016 MBSAQIP dataset. The MBSAQIP constitutes a joint venture by the American College of Surgeons and the American Society for Metabolic and Bariatric Surgery, and its database represents the largest bariatric surgical dataset in the country. Information regarding all bariatric procedures performed at accredited centers is entered into this database, capturing over 200 patient and surgeon/center variables as well as short-term outcomes after surgery. The MBSAQIP PUF is a Health Insurance Portability and Accountability Act compliant document, and all variables and outcomes are as defined in the accompanying PUF Variables and Definitions Manual [1]. As this dataset does not contain identifying information, the study was deemed exempt from review by the University of Minnesota Institutional Review Board.

Our study cohort was derived first by querying all patients who underwent an LVSG as indicated by the designation of CPT code 43775 (n = 114,251). Patients were excluded only if they did not undergo traditional multi-port LVSG (robotic, single-incision, etc.), generating 101,721 patients who were studied. Select patient factors were chosen a priori for inclusion based on simplicity, accuracy of quantification, and plausibility of having an association with the outcome of interest. These variables included demographics, major comorbidities (severe hypertension, diabetes mellitus), non-independent functional status, and the presence of a history of previous obesity/foregut surgery. Among demographics, binary variables included female gender and non-white race. Continuous variables included age and initial body mass index (BMI₀), defined as BMI at the time of surgery. All other variables were dichotomized based on their presence or absence. Severe hypertension was defined as the use of three or more antihypertensive medications at the time of surgery.

The primary endpoint of interest was a composite of 30-day morbidity and mortality, defined by the presence of a 30-day readmission, reoperation, reintervention, or death. Specific interventions and justifications for these events, reported non-uniformly as free text, were not addressed. Patients were subsequently stratified by presence of a 30-day endpoint, and a bivariate analysis was used to determine association of the patient factors with an event. Bivariate analysis was performed using the chi-square test or the Mann–Whitney U test as appropriate. Categorical variables were represented as percentages and continuous variables as median (interquartile range).
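
To make the bivariate step concrete for a dichotomized factor, the following minimal sketch computes a Pearson chi-square statistic and its P value for a 2×2 table. The function name and the counts are hypothetical and are not taken from the MBSAQIP data. Omitting a continuity correction is also an assumption, since the text does not say which variant of the test was used.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]].

    Rows: factor present / absent. Columns: 30-day event / no event.
    Returns (statistic, p_value) with 1 degree of freedom.
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("every row and column needs at least one patient")
    stat = n * (a * d - b * c) ** 2 / denom
    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value

# Hypothetical counts, for illustration only
stat, p_value = chi_square_2x2(10, 20, 20, 10)
significant = p_value <= 0.05  # significance threshold stated in the methods
```

For continuous factors such as age and BMI₀, the corresponding step would be a rank-based comparison (Mann–Whitney U) rather than a contingency table.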

Factors significant on bivariate analysis were subsequently included in a multivariate nominal logistic regression analysis for independent association with the 30-day endpoint. Results of the multivariate analysis were expressed by an odds ratio with 95% confidence interval, and strength of association was also characterized with a P value. The quality of the multivariate analysis was characterized by an r² goodness of fit, P value, as well as the area under the receiver-operating characteristic curve (AUROC), and this analysis excluded patients without all included variables known.
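
As a rough illustration of the AUROC used to characterize both models, the sketch below computes it as the probability that a randomly chosen patient with an event receives a higher predicted risk than a randomly chosen patient without one, counting ties as one half. The function name and the toy scores are hypothetical and are not drawn from the study data.

```python
def auroc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one event and one non-event")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Toy predicted risks (hypothetical); 1 = 30-day event, 0 = no event
toy_scores = [0.02, 0.03, 0.05, 0.04, 0.08, 0.03]
toy_labels = [0, 0, 1, 0, 1, 1]
toy_auc = auroc(toy_scores, toy_labels)
```

An AUROC of 0.5 corresponds to chance discrimination and 1.0 to perfect separation. The pairwise loop is quadratic in cohort size, so a rank-based implementation would be preferable for a dataset of the size described here.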

Similarly, an ANN model was generated using a three-node back-propagation technique with k-fold validation, with each node assigned an equal training value of 0.333; these parameters were used previously as conservative settings to limit model complexity and mitigate overfitting [6]. Eighty percent of the patients were randomly selected to train the model, and the other 20% were withheld to constitute the internal validation set. The ANN model is illustrated in Fig. 1. ANN models were characterized by r² goodness of fit, P value, as well as the AUROC. The AUROC values of the two multiple variable models were then compared [7].
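
The general shape of such a model can be sketched in plain Python: a single hidden layer of three nodes trained by back-propagation on a random 80% of the records, with the remaining 20% held out. This is not the JMP Pro implementation used in the study. The tanh activation, learning rate, epoch count, and synthetic two-variable data are all assumptions made for illustration, and the "training value of 0.333" is not modeled because the text does not define it precisely.

```python
import math
import random

def make_toy_data(n, rng):
    """Synthetic stand-in for patient factors; NOT MBSAQIP data."""
    rows = []
    for _ in range(n):
        x = [rng.gauss(0.0, 1.0), float(rng.random() < 0.3)]
        # Hypothetical non-linear risk so the hidden layer has something to learn
        logit = -1.0 + 1.5 * x[0] * (1.0 if x[1] else -1.0)
        y = 1 if rng.random() < 1.0 / (1.0 + math.exp(-logit)) else 0
        rows.append((x, y))
    return rows

def init_net(n_in, n_hidden, rng):
    return {
        "w": [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)],
        "b": [0.0] * n_hidden,
        "v": [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)],
        "c": 0.0,
    }

def forward(net, x):
    """Return hidden activations and the predicted event probability."""
    h = [math.tanh(b + sum(wi * xi for wi, xi in zip(w, x)))
         for w, b in zip(net["w"], net["b"])]
    z = net["c"] + sum(vj * hj for vj, hj in zip(net["v"], h))
    return h, 1.0 / (1.0 + math.exp(-z))

def train(net, rows, epochs, lr, rng):
    """Stochastic gradient descent on log-loss with back-propagation."""
    rows = list(rows)
    for _ in range(epochs):
        rng.shuffle(rows)
        for x, y in rows:
            h, p = forward(net, x)
            d_out = p - y  # gradient of log-loss w.r.t. the output pre-activation
            for j, hj in enumerate(h):
                # Error propagated back through the tanh of hidden node j
                d_hid = d_out * net["v"][j] * (1.0 - hj * hj)
                net["v"][j] -= lr * d_out * hj
                net["b"][j] -= lr * d_hid
                for i, xi in enumerate(x):
                    net["w"][j][i] -= lr * d_hid * xi
            net["c"] -= lr * d_out

def mean_log_loss(net, rows):
    eps = 1e-12
    total = 0.0
    for x, y in rows:
        _, p = forward(net, x)
        total -= y * math.log(p + eps) + (1 - y) * math.log(1.0 - p + eps)
    return total / len(rows)

rng = random.Random(0)
data = make_toy_data(1000, rng)
rng.shuffle(data)
cut = int(0.8 * len(data))  # 80% training, 20% held-out validation
train_rows, valid_rows = data[:cut], data[cut:]
net = init_net(2, 3, rng)
loss_before = mean_log_loss(net, train_rows)
train(net, train_rows, epochs=30, lr=0.05, rng=rng)
loss_after = mean_log_loss(net, train_rows)
valid_loss = mean_log_loss(net, valid_rows)
```

Comparing `loss_after` with `valid_loss` gives a simple check on overfitting: a validation loss far above the training loss would suggest the network has memorized the training records.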

A two-tailed P value of 0.05 or less was taken as the threshold to denote statistical significance. All basic statistical analyses were performed using GraphPad Prism 8 (La Jolla, CA). Multivariate analysis and artificial neural network modeling were performed with JMP Pro 13 (Cary, NC).

Results

Of the 101,721 LVSG patients included in the analysis, 79.4% were female, and the median age was 44.3 years (n = 101,704). Additionally, 27.6% were of non-white race, and their median BMI₀ was 43.3 kg/m² (n = 100,807). Within this cohort, there were 3853 patients with a 30-day morbidity or mortality and 97,868 patients without, for a national rate of 3.8%. Of these 3853 patients, 81.5% (n = 3140) had a 30-day readmission, 23.0% (n = 887) had a 30-day reoperation, 23.0% (n = 887) had a 30-day reintervention, and 1.7% (n = 65) had a 30-day mortality. Of the patients with a 30-day endpoint, 25.3% (n = 976) were noted as having two or more of the four events that comprise the composite endpoint. These results are summarized in Table 1.
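
The percentages in this paragraph follow directly from the reported counts, as the short check below shows. It uses only the numbers stated above; the variable names are illustrative.

```python
cohort = 101_721
events = 3_853
components = {
    "readmission": 3_140,
    "reoperation": 887,
    "reintervention": 887,
    "mortality": 65,
}

# Composite 30-day event rate across the whole cohort, in percent
composite_rate = round(100 * events / cohort, 1)

# Share of patients with an event who had each component, in percent
shares = {name: round(100 * n / events, 1) for name, n in components.items()}

# Share of patients with an event who had two or more components
multi_event_share = round(100 * 976 / events, 1)

# Component counts exceed the number of patients because one patient
# can contribute to several components of the composite endpoint.
overlap = sum(components.values()) - events
```

The component counts sum to 4979, which is 1126 more than the 3853 patients with an event, consistent with 976 patients having had two or more events.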

Table 1 Baseline characteristics of the patient cohort

Bivariate analysis identified those factors with an association with a 30-day endpoint. Associated demographics include advanced age (P = 0.003), non-white race (P < 0.001), and initial BMI (P < 0.001). Presence of the comorbidities of severe hypertension and diabetes mellitus, as well as non-independent functional status and previous obesity/foregut surgery were also associated with the 30-day endpoint (P < 0.001). These factors were included in a multiple logistic regression analysis to determine independent association with a 30-day outcome. Indeed, all factors remained statistically significant (n = 100,791, P < 0.001, r² = 0.008). Results of the bivariate and multivariate analyses are summarized in Table 2.

Table 2 Bivariate and multivariate analyses of associations with the 30-day endpoint

The multivariate model was subsequently subjected to ROC curve analysis, which generated an AUROC of 0.572 (n = 100,791). The same factors considered in multivariate analysis were input into an artificial neural network as described. The algorithm derived from the ANN training set (80% of the patients chosen randomly) generated an AUROC of 0.581 (n = 80,633); similarly, the validation set (derived from the 20% of patients withheld) generated an AUROC of 0.585 (n = 20,158). A comparison of the ROC curves between the multivariable and ANN training set models is illustrated in Fig. 2, revealing an improved goodness of fit of the ANN model.
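
The methods cite Hanley and McNeil [7] for the AUROC comparison but do not spell out the computation. One way to gauge the uncertainty around a single AUROC is their standard error approximation, sketched below. The function name is illustrative, and the event and non-event counts in the example are hypothetical, because the number of events within each analysis set is not reported here.

```python
import math

def hanley_mcneil_se(auc, n_events, n_nonevents):
    """Approximate standard error of an AUROC (Hanley and McNeil, 1982)."""
    q1 = auc / (2.0 - auc)
    q2 = 2.0 * auc * auc / (1.0 + auc)
    variance = (
        auc * (1.0 - auc)
        + (n_events - 1) * (q1 - auc * auc)
        + (n_nonevents - 1) * (q2 - auc * auc)
    ) / (n_events * n_nonevents)
    return math.sqrt(variance)

# Hypothetical split of 100,791 patients into events and non-events
se_example = hanley_mcneil_se(0.572, 3_800, 96_991)
```

Because the logistic regression and ANN models were fit on overlapping patients, a formal comparison would also need to account for the correlation between the two curves, which this sketch does not attempt.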

Discussion

In this investigation, we elected to examine outcomes after the LVSG. Mechanistically, the drastically reduced stomach volume restricts bolus capacity and provides for earlier satiety, allowing for significant caloric reduction [8]. Its 5-year weight loss has been shown to be commensurate with that of the gastric bypass, though with less potential for postoperative digestive syndromes [9]. Relative to the bypass, its principal drawbacks include the potential for continued or worsening reflux and potentially inferior resolution of comorbidities, specifically, diabetes mellitus [10].

The primary outcomes of interest, and those used to help characterize whether a bariatric center meets accreditation criteria, are 30-day morbidities (readmission, reoperation, and reintervention), as well as 30-day mortality. In particular, significant attention has been given to 30-day readmission as an outcome measure, as this event is frequently avoidable, may not be reflective of a true complication, is costly, and necessitates utilization of significant emergency department resources [11, 12]. LVSG has a readmission rate of 2.8%, lower than that of the gastric bypass; readmissions are most frequently for nausea, vomiting, and dehydration. Demonstrated risk factors for readmission include black race, diabetes, hypertension, renal failure, and severe chronic obstructive pulmonary disease [13]. As such, determination of a compact, directed panel of preoperative factors chosen to be examined a priori was primarily influenced by previously characterized risk factors for readmission, the most prevalent and perhaps avoidable of the four components of the 30-day endpoint [14]. Demographics, comorbidities with high prevalence, as well as functional status and revisional surgery were thus taken as our examinable risk factors.

Predicting which patients will have a 30-day morbidity or mortality represents a challenge, as it is ostensibly governed by preoperative, intraoperative, and postoperative patient and surgeon/center factors, in addition to an element of random chance. Nonetheless, optimally characterizing the variance attributable to simple preoperative patient factors can help better identify and stratify those patients more likely to have a positive endpoint early in the course of their bariatric care. In this study, multivariable analysis demonstrated the independence of advanced age, non-white race, and higher initial BMI as predictors of a 30-day morbidity and mortality. Additionally, confirming previous reports, severe hypertension and diabetes mellitus were also risk factors, as were non-independent functional status and previous obesity/foregut surgery [15]. Using the logistic regression function to predict the development of a 30-day endpoint on the basis of only the included seven preoperative patient factors generated an AUROC of 0.572, a measure reflecting the goodness of fit of the model in its predictive ability. The ANN model, in contrast, demonstrated an improved AUROC (0.581 in the training set and 0.585 in the validation set).

The potential for ANN use in surgery is significant, as the algorithms derived from ANNs are sophisticated, non-linear, and capable of recognizing complex interactions among both continuous and categorical variables in order to optimize outcome prediction. Moreover, if desired, the algorithms can continually be refined prospectively, as new patient data is input. Analogous to neuronal synapse interaction in the brain, ANNs are model systems taught to predict an outcome by considering an input layer of variables with which to undertake pattern recognition. Intermediately derived data from each layer is subsequently processed at hidden nodes, which, similar to a neuron, are used to integrate and weight inputs, and pass on the information for further processing and prediction. The ANN model used in this investigation benefited from the 20% of patients randomly withheld to constitute a cohort providing internal validation of the algorithm, to guard against overfitting. Despite their potential, ANNs have seen only limited adoption in clinical outcomes modeling in surgery, including bariatric surgery.
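
The integration and weighting performed at the hidden nodes can be written compactly for a network with seven inputs and three hidden nodes, as in this study. The tanh activation and the symbols $w_{ji}$, $b_j$, $v_j$, and $c$ are assumptions introduced for illustration; the source does not state the activation function or report the fitted weights.

```latex
h_j = \tanh\!\Big(b_j + \sum_{i=1}^{7} w_{ji}\, x_i\Big), \qquad j = 1, 2, 3

\hat{p} = \frac{1}{1 + \exp\!\Big(-\big(c + \sum_{j=1}^{3} v_j\, h_j\big)\Big)}
```

Here $x_i$ are the preoperative patient factors, $h_j$ are the hidden node outputs, and $\hat{p}$ is the predicted probability of the 30-day composite endpoint. With the hidden layer removed, the second expression reduces to ordinary logistic regression on the inputs, which is why the hidden layer is what allows non-linear interactions to be captured.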

In 2007, Lee and colleagues used a 249-patient set to demonstrate an improvement in the prediction of post-bariatric surgery weight loss using an ANN model relative to logistic regression, on the basis of type of operation as well as preoperative triglyceride and hemoglobin A1c levels [16]. ANNs were then used to predict excess weight loss after adjustable gastric banding in two subsequent small studies [17, 18]. Most recently, we reported our use of ANN modeling to predict weight loss at 6 months and 1 year after laparoscopic Roux-en-Y gastric bypass [19]. Using over 647 patients from a single institution, five factors associated with postoperative weight loss were modeled using multiple linear regression and, optimally, by an ANN. That study aimed to overcome the biggest weakness of the ANN model, its complexity precluding it from clinical use, by constructing a web-based patient-centered tool that uses the ANN algorithm to generate an estimate of expected weight loss at 6 and 12 months postoperatively [19]. Similarly, the development of a user-friendly neural network based tool to identify high-risk patients early constitutes the best measure for early intervention on modifiable risk factors and subsequent potential improvement in 30-day morbidity and mortality. This investigation contributes to the body of work using ANNs to predict bariatric surgical outcomes, as we successfully use an ANN to demonstrate its superiority in predicting the occurrence of a 30-day composite endpoint.

The findings of this study must be considered in the context of its limitations. While nationally comprehensive, the MBSAQIP dataset used is subject to the selection bias inherent in retrospective analysis of prospectively collected data. Furthermore, despite standardized definitions of variables and outcomes, attribution of data is subject to bias due to medical record misrepresentation, misclassification, and ultimately misinterpretation by the institutional coders who contribute. In part due to this unavoidable variability, the 30-day endpoints of interest chosen for consideration in this study represented the most well-defined, and attribution was less subject to interpretation of nebulous or conflicting information in the medical record. Next, the use of our endpoint as a composite of 30-day reintervention, readmission, reoperation, or mortality fails to allow for distinguishing characteristics of each of these endpoints alone, as there may be significant variability in factors portending each of these four outcomes. Notably, one might hypothesize that risk factors for readmission may be due to factors more related to a patient’s psychiatric health and burden of comorbidities, while risk factors for patients who required a 30-day reintervention and reoperation may be more related to those that influence technical complexity of the LVSG and their ability to heal. This separation was not discerned in this study. However, use of 30-day composite endpoints after sleeve gastrectomy has been established, and in fact is generally broader in the scope of those events that constitute a 30-day morbidity and mortality [20]. We believe the endpoint chosen represents one that is clinically significant and eminently quantifiable. Next, the ANN algorithms, though beneficial due to their high fidelity and ability to continually be refined, are algorithmically complex and clinical use remains a challenge. The relative weighted contribution of each variable toward the endpoint is not known in ANNs in contrast to logistic or linear regression techniques, and thus, intervening to best improve risk profile is less straightforward. Nonetheless, ANNs remain a promising advanced modeling system in the prediction of surgical outcomes, particularly as datasets grow ever more comprehensive and complex.

In conclusion, this study reveals several risk factors for 30-day morbidity and mortality after LVSG using the best available dataset. In addition, we demonstrate, on a small scale, that ANN models can be used to optimize prediction of postoperative outcomes in bariatric surgery. We acknowledge the limited variance attributable to the factors considered, among the other limitations. However, a more comprehensive analysis of a greater number of variables using an ANN with a much larger input layer may be warranted in the future.

References

1. MBSAQIP (2016) MBSAQIP Participant Use Data File
2. Penny W, Frost D (1996) Neural networks in clinical medicine. Med Decis Mak 16:386–398
3. Yoldas O, Tez M, Karaca T (2012) Artificial neural networks in the diagnosis of acute appendicitis. Am J Emerg Med 30:1245–1247
4. Wise ES, Stonko DP, Glaser ZA, Garcia KL, Huang JJ, Kim JS, Kallos JA, Starnes JR, Fleming JW, Hocking KM, Brophy CM, Eagle SS (2017) Prediction of prolonged ventilation after coronary artery bypass grafting: data from an artificial neural network. Heart Surg Forum 20:E007–E014
5. Cruz-Ramirez M, Hervas-Martinez C, Fernandez JC, Briceno J, de la Mata M (2013) Predicting patient survival after liver transplantation using evolutionary multi-objective artificial neural networks. Artif Intell Med 58:37–49
6. Wise ES, Hocking KM, Brophy CM (2015) Prediction of in-hospital mortality after ruptured abdominal aortic aneurysm repair using an artificial neural network. J Vasc Surg 62(1):8–15
7. Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36
8. Benaiges D, Mas-Lorenzo A, Goday A, Ramon JM, Chillaron JJ, Pedro-Botet J, Flores-Le Roux JA (2015) Laparoscopic sleeve gastrectomy: more than a restrictive bariatric surgery procedure? World J Gastroenterol 21:11804–11814
9. Salminen P, Helmio M, Ovaska J, Juuti A, Leivonen M, Peromaa-Haavisto P, Hurme S, Soinio M, Nuutila P, Victorzon M (2018) Effect of laparoscopic sleeve gastrectomy versus laparoscopic Roux-en-Y gastric bypass on weight loss at 5 years among patients with morbid obesity: the sleevepass randomized clinical trial. JAMA 319:241–254
10. Melissas J, Braghetto I, Molina JC, Silecchia G, Iossa A, Iannelli A, Foletto M (2015) Gastroesophageal reflux disease and sleeve gastrectomy. Obes Surg 25:2430–2435
11. Telem DA, Yang J, Altieri M, Patterson W, Peoples B, Chen H, Talamini M, Pryor AD (2016) Rates and risk factors for unplanned emergency department utilization and hospital readmission following bariatric surgery. Ann Surg 263:956–960
12. Major P, Wysocki M, Torbicz G, Gajewska N, Dudek A, Malczak P, Pedziwiatr M, Pisarska M, Radkowiak D, Budzynski A (2018) Risk factors for prolonged length of hospital stay and readmissions after laparoscopic sleeve gastrectomy and laparoscopic Roux-en-Y gastric bypass. Obes Surg 28:323–332
13. Sippey M, Kasten KR, Chapman WH, Pories WJ, Spaniolas K (2016) 30-day readmissions after sleeve gastrectomy versus Roux-en-Y gastric bypass. Surg Obes Relat Dis 12:991–996
14. Garg T, Rosas U, Rivas H, Azagury D, Morton JM (2016) National prevalence, causes, and risk factors for bariatric surgery readmissions. Am J Surg 212:76–80
15. Lak KL, Helm MC, Kindel TL, Gould JC (2019) Metabolic syndrome is a significant predictor of postoperative morbidity and mortality following bariatric surgery. J Gastrointest Surg 23:739–744
16. Lee YC, Lee WJ, Lee TS, Lin YC, Wang W, Liew PL, Huang MT, Chien CW (2007) Prediction of successful weight reduction after bariatric surgery by data mining technologies. Obes Surg 17:1235–1241
17. Piaggi P, Lippi C, Fierabracci P, Maffei M, Calderone A, Mauri M, Anselmino M, Cassano GB, Vitti P, Pinchera A, Landi A, Santini F (2010) Artificial neural networks in the outcome prediction of adjustable gastric banding in obese women. PLoS ONE 5:e13624
18. Lee YC, Liew PL, Lee WJ, Lin YC, Lee CK, Huang MT, Wang W, Lin SC (2009) Prediction of successful weight reduction after laparoscopic adjustable gastric banding. Hepatogastroenterology 56:1222–1226
19. Wise ES, Hocking KM, Kavic SM (2016) Prediction of excess weight loss after laparoscopic Roux-en-Y gastric bypass: data from an artificial neural network. Surg Endosc 30:480–488
20. Minhem MA, Safadi BY, Habib RH, Raad EPB, Alami RS (2018) Increased adverse outcomes after laparoscopic sleeve gastrectomy in older super-obese patients: analysis of American College of Surgeons National Surgical Quality Improvement Program Database. Surg Obes Relat Dis 14:1463–1470

Acknowledgements

None.

Funding

There was no funding used for this manuscript.

Author information

Authors and Affiliations

Department of Surgery, Division of Gastrointestinal/Bariatric Surgery, University of Minnesota, 420 Delaware St SE, MMC 195, Minneapolis, MN, 55455, USA
Eric S. Wise, Sayeed Ikramuddin & Daniel B. Leslie
Division of Gastroenterology, Hepatology and Nutrition, Section of Interventional and Advanced Endoscopy, Department of Medicine, University of Minnesota, Minneapolis, USA
Stuart K. Amateau

Corresponding author

Correspondence to Eric S. Wise.

Ethics declarations

Disclosures

Eric Wise, Stuart Amateau, Sayeed Ikramuddin, and Daniel Leslie have no conflicts of interest or financial ties to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Wise, E.S., Amateau, S.K., Ikramuddin, S. et al. Prediction of thirty-day morbidity and mortality after laparoscopic sleeve gastrectomy: data from an artificial neural network. Surg Endosc 34, 3590–3596 (2020). https://doi.org/10.1007/s00464-019-07130-0

Received: 05 April 2019
Accepted: 17 September 2019
Published: 30 September 2019
Issue Date: August 2020
DOI: https://doi.org/10.1007/s00464-019-07130-0
