Introduction

Metabolic surgery is the most effective treatment for type 2 diabetes (T2D) in people with obesity. High-quality evidence from randomized controlled trials has revealed superior benefits of metabolic surgery to intensive medical therapy in improving glycemic control [1, 2]. Roux-en-Y gastric bypass (RYGB) and sleeve gastrectomy (SG) are currently the most commonly performed surgical therapies for T2D. It is important to note that not all people who undergo metabolic surgery achieve T2D remission. Therefore, identifying preoperative predictors of T2D remission would be of substantial clinical utility as they improve candidate selection for surgery and define realistic postoperative goals.

Several scoring systems have been developed to predict success of T2D remission following metabolic surgery. In Asia, Lee et al. constructed the 4-factor ABCD scoring system (age, body mass index (BMI), C-peptide and T2D duration) that is useful for predicting diabetes remission after gastric bypass and SG [3, 4]. The DiaRem score (age, HbA1c, type of anti-diabetic medication, and insulin usage) was subsequently developed based on a predominantly White Caucasian cohort [5]. The Advanced-DiaRem (Ad-DiaRem) score has two additional factors (T2D duration and number of anti-diabetic medication) and modified penalty scores for each category in the original DiaRem [6]. The individualized metabolic surgery score (T2D duration, HbA1c, anti-diabetic medication, and insulin usage) categorizes individuals into 3 stages of diabetes severity, of which mild T2D is associated with the most favorable T2D outcome [7].

The efficacy of the Asia-developed ABCD scoring system for predicting T2D remission has not been previously defined in multi-ethnic Asian populations. In this study, we assessed the predictive performance of the ABCD score for T2D remission after metabolic surgery in Singapore’s multi-ethnic Asian setting. Additionally, we devised a new predictive scoring system and compared it with the ABCD score.

Materials and Methods

Subjects

Adults scheduled for metabolic surgery were recruited between August 2007 and November 2018 (Fig. 1). This retrospective study included people with T2D who underwent either RYGB or SG, and had preoperative HbA1c of ≥ 6.5% or treated with anti-diabetic medication (n = 114). Subjects who had preoperative HbA1c < 6.5% without anti-diabetic medication (i.e. unlikely to have diabetes at baseline), had surgeries other than RYGB or SG, or were lost to 1-year follow-up were excluded from the analysis. Information on medical history and medication in use was recorded by a questionnaire.

Fig. 1
figure 1

Flow diagram of patient recruitment, selection, and follow-up

Anthropometric and Biochemical Measurements

Height and weight were measured by standard procedure, and BMI was computed as weight (in kg) divided by height (in m2). Percentage of total body weight loss was calculated as 100 × (preoperative weight – postoperative weight) / preoperative weight. HbA1c was measured using a point-of-care immunoassay analyzer certified by the National Glycohaemoglobin Standardization Programme (DCA Vantage Analyser; Siemens Healthcare Diagnostics, Erlangen, Germany). Fasting plasma glucose concentrations were measured by the glucose-oxidase technique. The fasting C-peptide levels were measured using sandwich chemiluminescence immunoassay. The lowest detection limit is 17 pmol/L, and the total imprecision ranges from 1.9 to 4.6% for a concentration range of 346–4643 pmol/L. The homeostasis model assessment (HOMA) indexes were estimated from the fasting plasma glucose and C-peptide concentrations by using the online-available HOMA2 calculator [8].

ABCD Diabetes Surgery Score System

As described by Lee et al. [4], this simple 10-point scoring system integrates age, BMI, C-peptide, and duration of diabetes. While age uses a 2-point scale (years; ≥ 40 = 0, < 40 = 1), BMI (kg/m2; < 27 = 0, 27–34.9 = 1, 35–41.9 = 2, ≥ 42 = 3), c-peptide (ng/ml; < 2 = 0, 2–2.9 = 1, 3–4.9 = 2, ≥ 5 = 3), and diabetes duration (years; > 8 = 0, 4–8 = 1, 1–3.9 = 2, < 1 = 3) use a 4-point scale. A high ABCD score implies higher probability of T2D remission after metabolic surgery.

Statistical Analysis, Definitions, and Development of the Prediction Algorithm

The statistical analyses were performed using the STATA software version 14 (Stata Corporation, College Station, TX, USA) and SPSS version 22 (IBM Corp, NY, USA). Continuous variables were presented as mean ± SD or median (interquartile range), and categorical data were presented as n (%). Parametric Student’s t test and non-parametric Mann-Whitney test statistical tests were employed for comparison of continuous variables between groups. Chi-square test was used to compare categorical variables. Backward stepwise logistic regression with a P value inclusion threshold of 0.1 was performed to identify predictors of complete T2D remission defined as HbA1c < 6% without use of anti-diabetic medication at 1 year after metabolic surgery [4]. Partial T2D remission was defined as 1-year postoperative HbA1c of between 6 and < 6.5%, irrespective of use of anti-diabetic medication. The Metabolic surgery Diabetes Remission (MDR) score was developed based on the variables retained in the final stepwise logistic regression model. Taking into consideration variable informativeness, the components of MDR that had P < 0.05 in the final stepwise regression model (HbA1c and T2D duration) were assigned a score ranging from 0 to 3 as guided by quartile cut-offs, whereas those that were marginally significant (age and HOMA-B) were graded a score ranging from 0 to 2 as guided by tertile cut-offs. The points for each component were added, and the total possible score ranged from 0 to 10 points. The diagnostic accuracy of MDR and ABCD scores was assessed using receiver operating characteristic (ROC) curves, and the areas under the ROC curves (AUC) were compared using the DeLong method [9]. In addition, the sensitivity, specificity, positive predictive value, negative predictive value, and optimal cut-off by highest Youden index [10] were calculated.

Results

Characteristics of Subjects

Table 1 shows the preoperative characteristics of the participants subgrouped by 1-year complete T2D remission status (n = 114). Compared with the non-remitters (12.3% partial remission and 40.4% persistent T2D), the remitters (47.4%) had younger age and shorter T2D duration. Those who responded favorably also displayed better preoperative glycaemic control and β-cell function (HOMA-B). At 1 year post-surgery, the HbA1c levels of non-remitters and remitters were 6.8 ± 1.2% and 5.5 ± 0.3%, respectively (P < 0.001). Despite having similar preoperative BMI, the remitters experienced greater 1 year percentage of total weight loss than the non-remitters (27.7 ± 7.3% versus 20.8 ± 7.5%, P < 0.001).

Table 1 Baseline subject characteristics subgrouped by diabetes remission status post-metabolic surgery

Predictive Ability of the ABCD Score

We assessed the predictive accuracy of the ABCD score in our cohort (Table 2). None of the participants obtained a score of 0 or 10 points. At high scores of 8–9, correct prediction of complete T2D remission was 100%. However, at mid-high scores of 6 and 7, the accuracy of correctly predicting complete T2D remission was 50–56%. Expectedly, misclassifications were less frequent when predicting partial/complete T2D remission.

Table 2 Predictive performance of ABCD for T2D remission at 1-year follow-up

Development and Performance of MDR

Backward stepwise logistic regression analysis with complete T2D remission as the outcome was performed using candidate variables selected from the univariate test shown in Table 1 (age, gender, T2D duration, HbA1c, HOMA-B, insulin usage). Of note, HOMA-B could not be computed for 6 people due to missing fasting glucose data. Hence, the cohort size analyzed was 108. The final model retained four independent predictors: age (odds ratio (OR), 0.95; 95% CI, 0.90–1.00; P = 0.054), T2D duration (OR, 0.88; 95% CI, 0.81–0.97; P = 0.006), HbA1c (OR, 0.72; 95% CI, 0.55–0.94; P = 0.016), and HOMA-B (OR, 1.07; 95% CI, 1.00–1.02; P = 0.099). Using the four variables, we devised the MDR scoring system as described in methods (Table 3). A MDR score of 6 points and higher yielded a prediction accuracy ranging from 83 to 100% and 71 to 100% for partial/complete and complete T2D remission, respectively (Table 4). Moreover, the discriminative ability of MDR to identify complete T2D remission was significantly greater than ABCD (Fig. 2; AUC 0.79 versus 0.67, P = 0.007). As determined by the highest Youden index, the optimal MDR cut-off score was ≥ 4 points, yielding a sensitivity of 76% and specificity of 70.7%.

Table 3 Variables and score used to calculate MDR
Table 4 Predictive performance of MDR for T2D remission at 1-year follow-up
Fig. 2
figure 2

Discriminative ability of ABCD and MDR. a ROC curves and AUC. b Optimal cut-off scores, sensitivity, specificity, positive predictive value, negative predictive value, and Youden index for each model

Discussion

Not all people with obesity achieve complete T2D remission after metabolic surgery. In our cohort, 47.4% of subjects experienced complete T2D remission at 1 year after metabolic surgery, while 12.3% had partial remission. Consistently, a retrospective analysis of 134 individuals who underwent RYGB revealed that 15.7% and 46.1% achieved partial and complete T2D remission during a mean follow-up of 12 months, respectively, and a non-remission rate of 38.2% [11]. Given the invasive nature of metabolic surgery, it is important to identify patients who can best benefit from surgery.

The ABCD score is a prediction algorithm of T2D remission developed in Asia [3, 4]. Although limited, the efficacy of ABCD has been tested in several Asian populations that are ethnically homogeneous. A Japanese study showed that the ROC curve of the ABCD score for T2D remission had an AUC of 0.79 [12]. More recently, Kam et al. demonstrated in Chinese adults that the AUC values of ABCD in predicting T2D remission after RYGB at 1-year and 3-year follow-up were 0.74 and 0.75, respectively [13]. The performance of ABCD in multi-ethnic Asian populations had not been previously reported. In our mixed ethnic cohort, we observed frequent misclassifications at mid-high ABCD scores. Moreover, the AUC obtained by our cohort was 0.67. The reduced discriminative ability of ABCD could be partly explained by the variability in adiposity and insulin sensitivity in different Asian ethnic groups [14]. Moreover, the cohort involved for ABCD construction had a clinical profile different from our cohort [3, 4]. Our participants generally had older age and longer T2D duration, suggesting a poorer β-cell reserve. While obesity is a feature of the ABCD score, we did not observe a marked difference in preoperative BMI between remitters and non-remitters of T2D (Table 1). Evolving body of evidence suggests that the susceptibility to obesity-associated metabolic complications is not mediated by the amount of fatness per se, but by the inability for excess energy to be stored appropriately in adipose tissue after reaching an individual’s fat threshold [15, 16]. In the face of sustained excessive nutrition, exhaustion of adipose tissue expandability creates stress on adipocytes and elicits a maladaptive inflammatory response over time, eventually causing insulin resistance and T2D [16]. Therefore, BMI alone may not sufficiently capture this concept of maladaptive excess adiposity.

In order to improve the prediction of T2D remission following metabolic surgery in our multi-ethnic cohort, we developed a new scoring algorithm (MDR). Like the ABCD [3, 4], MDR is a 4-factor 10-point scoring system, with higher scores predicting higher probability of diabetes remission. We demonstrated that MDR outperformed ABCD in our cohort, yielding a reasonable discriminative power of 0.79. At a score of 6 and above, MDR yielded a predictive accuracy of complete T2D remission ranging from 71 to 100%. However, the utility of MDR for predicting long-term T2D remission warrants further investigation.

The MDR, consisting of age, T2D duration, HbA1c, and HOMA-B, devotes great emphasis on pancreatic islet cell involvement. Specifically, age reflects the general reserve of β-cell function; T2D duration is a proxy for natural course and deterioration of β-cell function; HbA1c is associated with insulin secretion; and HOMA-B reflects β-cell secretory activity in response to plasma glucose concentrations [17, 18]. This supports that better-preserved pancreatic β-cell reserve is critical for facilitating amelioration of T2D after metabolic surgery. People with diminished β-cell reserve may not have the capacity to synthesize and secrete an adequate concentration of insulin to restore glycaemic control [19, 20]. Usage of insulin, which reflects T2D severity, is a common component of the DiaRem, Ad-DiaRem, and individualized metabolic surgery score [5,6,7]. Although our data showed that insulin usage was significantly different between non-remitters and remitters (Table 1), the variable was not retained as an independent predictor of T2D remission in the stepwise regression model, possibly due to considerable association with HbA1c and T2D duration.

Our work included subjects who underwent either RYGB or SG. We observed that a higher proportion of individuals who underwent RYGB than SG achieved complete diabetes remission (Table 1; 52.5% versus 35.3%), but the difference was not statistically significant (P = 0.092). Consistently, the 1-year data from the Surgical Treatment and Medications Potentially Eradicate Diabetes Efficiently (STAMPEDE) trial showed that RYGB did not differ from SG in terms of complete diabetes remission (42% versus 27%, respectively, P = 0.10) [21]. Similarly, the 5-year data from the Finnish Sleeve vs Bypass (SLEEVEPASS) and Swiss Multicenter Bypass or Sleeve Study (SM-BOSS) trials reported comparable rates of complete T2D remission between RYGB and SG [22, 23]. Nevertheless, it is notable that the short- and long-term analysis of the STAMPEDE trial revealed more individuals in the RYGB group achieving the primary endpoint target of HbA1c < 6% without the use of anti-diabetic medication than those in the SG group [2, 21]. These findings demonstrate superiority of RYGB over SG in T2D reversal.

To our knowledge, this study is the first of its kind in multi-ethnic Asian populations. However, we acknowledge several limitations including the retrospective design and short follow-up. In addition, the small sample size limits ethnic subgroup analysis. The proportion of SG is considerably lower than RYGB; thus, the comparison between the two procedures may not be robust. Our medication data lack granularity. Hence, the performance of other available scoring systems such as DiaRem and Ad-DiaRem could not be assessed in our cohort. Loss to follow-up, typically consisting of younger subjects with less severe T2D condition (i.e. shorter T2D duration and lower HbA1c), may bias the results. Our findings were not validated in an independent cohort and may not be generalizable beyond this study population.

Conclusion

In this study, we have devised a clinically useful algorithm MDR that outperforms ABCD score for predicting T2D remission at 1 year after metabolic surgery in Singapore’s multi-ethnic Asian cohort.