Introduction

Obstructive sleep apnea (OSA) is a highly prevalent chronic medical condition [1, 2] and of the several risk factors attributable to its pathophysiology, obesity is by far the more robust predictor [3]. Obesity commonly leads to increases in neck circumference (NC), which can reduce upper airway caliber and facilitate the emergence of increased upper airway collapsibility due to local adipose tissue deposition [4]. Consequently, since obesity is an important risk factor for OSA, a high prevalence of OSA in obese subjects and especially among morbidly obese individuals is to be expected [5, 6]. Similar to what occurs in the general population, OSA is also underdiagnosed in the bariatric population [7, 8], prompting most, but not all, bariatric surgery programs to engage in routine evaluation of all patients for the presence of OSA, regardless of any sleep complaints being reported [9, 10]. Furthermore, bariatric patients with OSA appear to be at high risk for both perioperative and postoperative complications, thereby justifying early recognition of OSA for improved outcomes [11, 12]. Indeed, bariatric patients who did not receive continuous positive airway pressure (CPAP) treatment postoperatively were more likely to develop pulmonary complications [11, 12].

OSA is characterized by recurrent upper airway obstructive episodes, resulting in intermittent hypoxemia, arousals, and has been associated with cardiovascular and metabolic consequences, while also promoting increased overall mortality [13]. Currently, the gold standard for OSA diagnosis consists of overnight polysomnography (PSG) in the laboratory; however, it is not readily available for the large number of patients with suspected OSA, mainly in areas with limited healthcare resources. Using clinical instruments, it is possible to identify patients at high risk for OSA and offer them portable diagnostic methods, and accordingly reduce the long waiting times found in many sleep laboratories [3]. This strategy has already been tested in bariatric patients subjected to unattended diagnostic methods, an approach that apparently demonstrated good diagnostic accuracy [14].

The recently developed and validated instrument “No-Apnea” [15] is a very practical tool with only two objective parameters: NC and age, with a final score ranging from 0 to 9 points. A cut-off point ≥ 3 was previously employed to classify patients at high risk of OSA, with robust areas under the curve (AUCs) for screening of OSA at different levels of severity. Furthermore, despite its simplicity, when compared to two previously validated and widely used instruments (i.e., STOP-Bang questionnaire [16] and NoSAS score [17]), there were no statistically significant differences in performance between No-Apnea and the other tools [15].

Notwithstanding, there is only a scarce number of studies using questionnaires in the morbidly obese population [5, 18, 19], such that we hypothesized that screening of OSA risk with No-Apnea would provide a reliable approach in morbidly obese patients. For the present study, our main objectives were to evaluate the predictive performance of No-Apnea in a large sample of morbidly obese Brazilian patients referred for PSG evaluation in the context of bariatric surgery (BS cohort) or referred for clinical suspicion of OSA (NBS cohort), and also to compare No-Apnea discriminatory performance with three other instruments, namely STOP-Bang [16], NoSAS [17], and Epworth Sleepiness Scale (ESS) [20].

Methods

Study design

This cross-sectional study, from January 2017 to August 2018, prospectively enrolled morbidly obese subjects who were clinically referred for overnight in-laboratory PSG evaluation by their respective treating physicians. Inclusion criteria consisted of age ≥ 18 years and body mass index (BMI) ≥ 35.0 kg/m2. Exclusion criteria were as follows: previously diagnosed OSA, use of portable sleep studies, incomplete clinical data, and technically inadequate PSG. All morbidly obese subjects enrolled were grouped into two independent groups: BS (patients undergoing assessment prior to bariatric surgery) and NBS (morbidly obese patients referred for suspected OSA). The study protocol was approved by the Ethics Committee of the Federal University of Rio de Janeiro (no. 1.764.165) and was carried out in accordance with the Declaration of Helsinki. Written informed consent was obtained from each subject and anonymity of each participant was preserved.

On the evening of the PSG, clinical data were collected in all patients: gender, age, BMI, NC, self-reported comorbidities (smoking, diabetes mellitus, and hypertension), and data from each of the screening instruments (No-Apnea, STOP-Bang, NoSAS, and ESS). The BMI was calculated by dividing the weight in kilograms by the square of the height in meters (kg/m2), while NC (in cm) was systematically measured using a flexible tape with all subjects in the upright sitting position, with the upper edge of the tape measure placed immediately below the laryngeal prominence and applied perpendicularly to the long axis of the neck.

Screening tools

The No-Apnea scoring system [15] contains two objective parameters: NC and age. NC is scored as follows: 37.0–39.9 cm (1 point), 40.0–42.9 cm (3 points), and ≥ 43.0 cm (6 points), while age is scored as follows: 35–44 years (1 point), 45–54 years (2 points), ≥ 55 years (3 points). The points for each variable are added reaching a final score of 0–9 points, being a cut-off point ≥ 3 used to classify patients at high risk of OSA [15].

The STOP-Bang questionnaire [16] was originally developed for the screening of OSA in preoperative patients and consists of eight yes-or-no questions: loud snoring, tiredness, observed apnea, hypertension, BMI > 35 kg/m2, age > 50 years, NC > 40 cm, and male gender. A STOP-Bang score ≥ 3 points is considered as positive; however, a cut-off point ≥ 4 is more suitable to screen obese and morbidly obese subjects [21], being that cut-off point ≥ 4 was employed in our study.

The NoSAS score [17] is a recently reported screening tool containing five parameters: NC > 40 cm (4 points), BMI 25.0–29.9 kg/m2 (3 points), BMI ≥ 30.0 kg/m2 (5 points), snoring (2 points), age > 55 years (4 points), male gender (2 points), totaling a score of 0–17 points. A score ≥ 8 points was considered as high risk for OSA [17].

The ESS [20] is an 8-item questionnaire that assesses the subjective likelihood of falling asleep in various contexts: each item is scored from zero (never sleeps) to three (high chance of falling asleep), with a final score from 0 to 24 points. A score ≥ 11 points was considered as excessive daytime somnolence [20].

Sleep studies

All tests were conducted in Sleep Laboratory at Centro Medico BarraShopping, Rio de Janeiro, Brazil. All patients underwent an attended, in-lab full PSG (EMBLA® S7000, Embla Systems, Inc., Broomfield, CO, USA) with video monitoring during the whole night, consisting of recording of electroencephalography, electrooculography, electromyography (chin and legs), electrocardiography, airflow, thoracic and abdominal impedance belts, oxygen saturation (SpO2), microphone for snoring, and sensors for body position. Polysomnographic records were scored manually in accordance with the latest 2012 American Academy of Sleep Medicine (AASM) guidelines [22] by two board-certified sleep physicians, both of whom were blinded for No-Apnea, STOP-Bang, NoSAS, and ESS results. These data included total sleep time (TST), sleep efficiency, sleep stages, rapid-eye movement (REM) and sleep latencies, arousal index, apnea/hypopnea index (AHI), and SpO2 values. Apneas were defined as a decrease of at least 90% of airflow from baseline, lasting at least 10 s, while hypopneas were classified with a decrease ≥ 30% of pre-event during ≥ 10 s associated with ≥ 3% oxygen desaturation or an arousal [22]. The AHI was calculated as the sum of the number of apneas and hypopneas per hour of sleep, with OSA severity assessed by three different AHI cut-off values: ≥ 5.0/h as any OSA (OSA≥5), ≥ 15.0/h as moderate/severe OSA (OSA≥15), and ≥ 30.0/h as severe OSA (OSA≥30).

Statistical analysis

Data analysis was carried out using SPSS for Windows (version 21.0; Chicago, IL, USA). Results are shown as mean ± SD for quantitative variables and as number (n) and percentage (%) for qualitative variables. Comparisons between groups were performed using the chi-square test for dichotomous variables, Student’s t test, and univariate analysis of variance (ANOVA) for continuous variables. Correlation was evaluated by Spearman correlation coefficient (r). Discrimination was estimated from the AUC obtained by receiver operator characteristic (ROC) curves. The AUCs obtained were compared using a validated algorithm [23]. Calibration was evaluated by Hosmer-Lemeshow chi-square test (p < 0.05 considered as poor calibration). Overall performance was evaluated using the Nagelkerke R2. Using the 2 × 2 contingency tables, the following parameters were calculated: sensitivity, specificity, positive and negative predictive values (PPV and NPV, respectively), and accuracy (rate of correctly screened patients). All estimates were reported with their respective 95% confidence intervals (CIs). A two-tailed p value < 0.05 was considered statistically significant.

Results

The flowchart of the study is reported in Fig. 1. Overall, 1017 morbidly obese subjects were allocated into two independent cohorts: BS (n = 411; 40.4%) and NBS (n = 606; 59.6%). As shown in Table 1, BS patients (vs. NBS patients) were younger, had a predominance of women, and had a higher BMI (all with p < 0.001). All polysomnographic measures (Table 1) were statistically different between BS and NBS groups, except for TST (p = 0.139), sleep efficiency (p = 0.333), stage N1 sleep (p = 0.148), stage REM sleep (p = 0.072), and hypopnea index (p = 0.419). Prevalence of OSA was higher in the NBS than in the BS cohort: (i) OSA≥5: 92.4 vs. 84.7% (p < 0.001), (ii) OSA≥15: 77.2 vs. 62.5% (p < 0.001), and (iii) OSA≥30: 59.1 vs. 41.4% (p < 0.001). In the BS cohort, the prevalence of OSA was higher in men than in women: (i) OSA≥5: 97.1 vs. 78.3% (p < 0.001), OSA≥15: 87.8 vs. 49.6% (p < 0.001), and OSA≥30: 73.4 vs. 25.0% (p < 0.001). Similarly, in the NBS cohort, the prevalence of OSA was higher in men than in women: (i) OSA≥5: 96.3 vs. 87.0% (p < 0.001), (ii) OSA≥15: 86.1 vs. 64.8% (p < 0.001), and (iii) OSA≥30: 75.6 vs. 36.0% (p < 0.001). Figure 2 illustrates that the frequency of patients assessed as high risk was always higher in the NBS cohort than in the BS cohort, for all screening instruments evaluated.

Fig. 1
figure 1

The flowchart of the patients. OSA obstructive sleep apnea, PSG polysomnography, BS bariatric surgery, and NBS non-bariatric surgery

Table 1 General and sleep characteristics (n = 1017)
Fig. 2
figure 2

Percentage (%) of patients at high risk for diagnosis of obstructive sleep apnea according to four screening instruments evaluated: No-Apnea, STOP-Bang, NoSAS, and Epworth Sleepiness Scale (ESS). BS bariatric surgery and NBS non-bariatric surgery

Characteristics of the No-Apnea

The mean No-Apnea score was significantly lower in BS patients compared to NBS patients (4.7 ± 2.5 points vs. 5.7 ± 2.3 points; p < 0.001). Similarly, mean AHI values increased linearly according to increasing No-Apnea scores (from 0 to 9 points): in the BS cohort, the mean AHI ranged from 9.7 ± 11.1 to 69.8 ± 22.9/h (p value for trend <0.001), while in the NBS cohort, it ranged from 14.1 ± 21.8 to 49.1 ± 28.7/h (p value for trend < 0.001). As can be seen in Table 2, No-Apnea showed statistically significant correlation with all respiratory measures during sleep obtained in the PSG (all with p < 0.001). According to Table 3, the No-Apnea instrument performed similarly among genders for screening of OSA≥5 (p = 0.973) and OSA≥15 (p = 0.817), while it performed significantly better in women than in men for screening of OSA≥30 (p = 0.033).

Table 2 Spearman correlation between No-Apnea scores and respiratory parameters obtained by polysomnography (n = 1017)
Table 3 No-Apnea discriminatory ability according to gender (n = 1017)

In the BS cohort, the No-Apnea instrument showed adequate calibration and overall performance for screening of OSA≥5, OSA≥15, and OSA≥30: Hosmer-Lemeshow test 4.446 (p = 0.727) and Nagelkerke R2 0.272; Hosmer-Lemeshow test 5.337 (p = 0.619) and Nagelkerke R2 0.351; and Hosmer-Lemeshow test 1.321 (p = 0.995) and Nagelkerke R2 0.369; respectively. Similarly, in the NBS cohort, our model displayed adequate calibration and overall performance for screening of OSA≥5, OSA≥15, and OSA≥30: Hosmer-Lemeshow test 5.266 (p = 0.729) and Nagelkerke R2 0.236; Hosmer-Lemeshow test 9.583 (p = 0.143) and Nagelkerke R2 0.197; and Hosmer-Lemeshow test 13.237 (p = 0.067) and Nagelkerke R2 0.223; respectively.

Performance of all instruments

Table 4 summarizes the predictive performance of all instruments evaluated. In the BS cohort, for screening of OSA≥5, OSA≥15, and OSA≥30, No-Apnea showed sensitivity ranging from 80.7 to 93.5% and specificity ranging from 42.9 to 34.4%, respectively. In addition, No-Apnea showed the highest sensitivity among all instruments: 80.7% (OSA≥5), 87.9% (OSA≥15), and 93.5% (OSA≥30). In the NBS cohort, No-Apnea reported the following characteristics for screening of OSA≥5, OSA≥15, and OSA≥30: sensitivity ranging from 91.6 to 96.9%, specificity ranging from 47.8 to 23.4%, and accuracy ranging from 88.3 to 66.8%; respectively. As shown in Fig. 3, No-Apnea had non-inferior discrimination to STOP-Bang and NoSAS for screening of OSA≥5, OSA≥15, and OSA≥30. The ESS was not adequate in the prediction of OSA, and No-Apnea performed significantly better than ESS in both of the cohorts evaluated (p < 0.001).

Table 4 Predicting obstructive sleep apnea with No-Apnea, STOP-Bang, NoSAS, and Epworth Sleepiness Scale (ESS)
Fig. 3
figure 3

Discriminatory performance, reported as area under the curve (95% confidence interval), of No-Apnea, STOP-Bang, NoSAS, and Epworth Sleepiness Scale (ESS) for screening of obstructive sleep apnea (OSA) assessed by an apnea/hypopnea index (AHI) ≥ 5.0/h (OSA≥5), ≥ 15.0/h (OSA≥15), and ≥ 30.0/h (OSA≥30). In the bariatric surgery (BS) cohort, No-Apnea had similar discrimination to STOP-Bang and NoSAS for predicting OSA≥5 (p = 0.979 and p = 0.358, respectively), OSA≥15 (p = 0.158 and p = 0.399, respectively), and OSA≥30 (p = 0.388 and p = 0.903, respectively). In the non-bariatric surgery (NBS) cohort, No-Apnea had similar discrimination to STOP-Bang and NoSAS for predicting OSA≥5 (p = 0.528 and p = 0.428, respectively), OSA≥15 (p = 0.825 and p = 0.108, respectively), and OSA≥30 (p = 0.458 and p = 0.186, respectively). The ESS was not a useful screening model for OSA in both subsamples. No-Apnea performed significantly better than ESS in both BS and NBS cohorts (p < 0.001)

Discussion

The present study showed that No-Apnea—an extremely simple, practical, and objective instrument containing only two parameters—can be used satisfactorily as a screening tool for OSA among morbidly obese patients, regardless of whether the indication for PSG was clinical suspicion of OSA or preoperative evaluation before bariatric surgery. No-Apnea showed adequate discrimination, calibration, and overall performance in both morbidly obese groups being evaluated. Moreover, despite its apparent simplicity, its discriminatory ability was not inferior to that achieved by the STOP-Bang or NoSAS, at all levels of OSA severity. The simplicity and ease aspects of No-Apnea can obviously confer a significant advantage, since STOP-Bang and NoSAS contain more parameters (eight and five parameters, respectively) without any incremental value being added. Moreover, No-Apnea is exclusively composed of objective parameters, thereby reducing the subjective information bias, and can also be used in individuals who sleep alone, where subjective sleep information from a bed partner is not available. The simplicity of No-Apnea can facilitate its widespread implementation as a screening model for morbidly obese individuals, allowing the referral of these patients towards diagnosis using home-based portable diagnostic systems, thus avoiding the long waiting times for full PSG. Of note, similar findings and conclusions emerged during study of development and validation of the No-Apnea [15].

As anticipated, we found a high prevalence of OSA in this large sample of consecutively enrolled morbidly obese patients. Previous studies have also reported a high prevalence of OSA in BS patients [5, 7,8,9]. Additionally, BS patients were younger, more obese, and had a predominance of women when compared to NBS patients, a finding that is compatible with other studies in bariatric cohorts [5, 9, 10], but not in studies of the general population [1, 2] or in sleep-lab setting [24], in which there is a well-established male predominance. In addition, NBS patients had increased severity of sleep-disordered breathing than BS patients reflecting the compulsory referral of BS subjects independently from the presence OSA-evoking symptoms. We are unaware of any studies to date comparing clinical and polysomnographic data in two distinct groups of morbidly obese patients according to their indication for PSG.

As reported above, our findings showed that the two cohorts that were evaluated (BS and NBS cohorts) presented clear differences in their demographic and anthropometric characteristics, as illustrated by a higher prevalence of OSA≥5, OSA≥15, and OSA≥30 in the NBS cohort compared to the BS cohort. The differences in the prevalence of OSA may account for the performance differences of No-Apnea in the BS and NBS groups, whereby it performed better in the bariatric group, particularly in the more severe categories of OSA (OSA≥15 and OSA≥30).

The STOP-Bang questionnaire [16], a widely used screening tool, was initially developed in surgical patients to screen for the presence of OSA and showed a sensitivity 83.6%, specificity 56.4%, PPV 81.0%, and NPV 60.8%. Although the original STOP-Bang [16] uses a cut-off point ≥ 3 to indicate patients with high risk of OSA, when applied to obese subjects, a cut-off point ≥ 4 is apparently more suitable [21], as illustrated by a high sensitivity (87.5%) and high NPV (90.5%), while in morbidly obese patients, the same score showed a sensitivity of 89.5% and a specificity of 25.5% [21]. The favorable yield of the STOP-Bang in screening patients for OSA among different populations was corroborated in a recent meta-analysis [25].

However, in a single-center retrospective study with 266 bariatric patients evaluated with standard overnight PSG, it was observed that neither STOP-Bang nor the Berlin questionnaire were useful screening models in this surgical population [26]. Interestingly, the results of this study were substantially different from others, being the performance obtained by the screening models substantially lower than previously reported in the literature [26]. In addition, ESS was also not predictive of OSA≥30 (AUC 0.557; 95% CI 0.476–0.639) or OSA≥15 (AUC 0.512; 95% CI 0.440–0.584) [26].

The recently developed NoSAS score [17] performed well in a general population sample from Switzerland included in the HypnoLaus cohort with an AUC of 0.74. In another general population cohort from Brazil, the EPISONO cohort, the NoSAS showed an AUC of 0.81. The NoSAS score performed significantly better than STOP-Bang questionnaire (p < 0.0001) and Berlin questionnaire (p < 0.0001) [17]. However, in a study [27] of a multiethnic Asian cohort, STOP-Bang, Berlin questionnaire, and NoSAS score performed similarly, with AUCs being clustered around 0.682–0.748. In another study [28] aiming to validate the NoSAS as a screening tool for OSA in clinical populations, this tool showed an AUC of 0.707, performing significantly better than the STOP (AUC: 0.655), STOP-Bang (AUC: 0.704), and the ESS (AUC: 0.642).

A previous study applied four models (ESS, Fatigue Severity Scale, STOP-Bang, and NoSAS) to 251 bariatric patients (76% females): STOP-Bang and NoSAS performed better than the ESS and Fatigue Severity Scale. Except for the ESS, all sleep questionnaires allowed better OSA prediction in women than in men [19]. However, this study presented some limitations that deserve mention: (i) retrospective study design, (ii) patients were evaluated with polygraphy rather than PSG, and (iii) few men were enrolled in the final analysis (n = 60). Although our study was not designed to evaluate gender differences in the performance of No-Apnea among morbidly obese patients, some of the findings deserve mention: there were no gender-related differences in the discriminatory ability for screening of OSA≥5 and OSA≥15; however, for screening of OSA≥30, No-Apnea performed significantly better in women than in men.

This present study also reported that the ESS was not shown to be useful as a screening tool for OSA in morbidly obese patients. Although ESS has been widely used in clinical practice, some studies show a poor utility of ESS as a screening model for OSA [29, 30], possibly because excessive daytime sleepiness is not always present in individuals suffering from OSA.

Strengths and limitations

Our study had some limitations that deserve comment. First, selection of patients occurred in a sleep laboratory, where a high prevalence of OSA is anticipated, and therefore the possibility of selection bias is plausible. In addition, the study was conducted at a single institution, which may limit the reproducibility of our findings in other populations. Despite these limitations, our study enrolled a large sample of consecutive morbidly obese patients. All patients underwent in-lab PSG, which was scored manually by two experienced physicians according to 2012 AASM guidelines [22]. To our knowledge, this is the first study that was effectively designed to assess differences in No-Apnea, STOP-Bang, NoSAS, and ESS predictive performance among morbidly obese subjects grouped into two independent and different groups of obese patients, namely BS and NBS.

Conclusions

OSA is a very common disorder among morbidly obese patients. A very pragmatic and objective screening tool, No-Apnea, showed adequate predictive performance for diagnosis of OSA, and no statistically significant differences emerged when compared to STOP-Bang or NoSAS, at all levels of OSA severity. On the other hand, ESS did not present satisfactory discrimination as OSA screening model in morbidly obese patients.