Introduction

Obstructive sleep apnea (OSA) syndrome is a common disorder affecting 6–13% of the industrialized world [1, 2], characterized by daytime sleepiness, snoring, and witnessed apneas [3, 4]. Currently, the most effective and widely used treatment for OSA syndrome is continuous positive airway pressure (CPAP) [5].

OSA in which obstructive apneas and hypopneas occur predominantly during rapid-eye movement (REM) sleep is known as REM-OSA and is associated with hypertension, impaired glucose metabolism, and neurocognitive function [6]. REM-OSA prevalence varies between 10 and 36% of patients with OSA, due to differences in study populations and varying REM-OSA definitions [7, 8]. Previous studies also included patients who had a significant degree of non-REM (NREM)-related OSA.

Therefore, a revised polysomnography criteria have been proposed to better define REM-OSA [9]:

  1. 1.

    Apnea-hypopnea index (AHI) during NREM sleep [AHINREM]<5 events/h

  2. 2.

    AHI during REM sleep [AHIREM]≥5 events/h

  3. 3.

    ≥30 min of REM sleep

This definition provides a more accurate polysomnography-based description of REM-OSA that effectively excludes NREM-OSA and provides clear criteria for diagnosing REM-OSA as a separate clinical OSA phenotype. There are only a few studies that have assessed patient characteristics or response to treatment using this newer REM-OSA criterion [10, 11].

REM-OSA patients are female-predominant, younger, and less obese and have less severe sleep-disordered breathing (SDB) and lower blood pressure, but have similar levels of sleepiness compared with NREM/REM-OSA [12]. It was unclear whether REM-OSA patients formed a specific clinical phenotype of OSA, or if this simply represented an early stage in the development of the more usual clinical OSA syndrome. Despite this uncertainty, careful evaluation of short- and long-term responses to treatment is required to provide evidence of treatment benefit in what would otherwise be regarded as a “milder” form of OSA syndrome.

Therefore, we applied this revised criteria to diagnose patients with moderate-to-severe REM-OSA (i.e. AHIREM≥15 events/h) referred to a tertiary hospital clinical sleep service, and to evaluate their clinical and functional outcomes after a CPAP treatment trial for 12 months.

Methods

Inclusion criteria

We conducted a prospective single-arm, pre-post study using CPAP treatment for 12 months in patients diagnosed with moderate-to-severe REM-OSA at a university teaching hospital (Westmead Hospital, New South Wales, Australia). Participants were initially referred to investigate for SDB between June 2015 and August 2018. They underwent overnight diagnostic laboratory polysomnography (Profusion 4, Compumedics, Abbotsford, Victoria, Australia), and studies were scored using AASM criteria, Version 2.5 (2018).

Eligible participants with REM-OSA were invited to participate in the trial if they were aged ≥18 years, had clinical symptoms of daytime sleepiness, and met the following polysomnography inclusion criteria: AHIREM≥15 events/h, AHINREM<5 events/h, and at least 30 min of REM sleep on the diagnostic polysomnography. We excluded participants with unstable cardiac or neurologic disease, history of stroke, a previous diagnosis of OSA, refusal to commence or use CPAP therapy, and active psychiatric disorders. Informed consent was obtained prior to study enrolment. The study was approved by the Local Health District Research Ethics Committee (HREC/14/WMEAD/383).

Protocol

Study participants underwent an overnight laboratory-based CPAP titration polysomnography as per laboratory protocol based on Australasian Sleep Association and American Thoracic Society guidelines [13, 14]. An independent qualified sleep physician determined an optimum fixed pressure. Participants were then provided with a CPAP device (S9, ResMed Ltd., Bella Vista NSW, Australia, or Respironics One, Phillips Respironics, Murrysville, PA, USA) at the recommended pressure. We contacted participants via monthly telephone calls to optimize CPAP usage. Participants were followed-up in a review clinic at 1, 3, 6, and 12 months with a clinical examination and review of CPAP adherence using a download of the CPAP device (either ResScan software, version 5.5.0.9285, or EncorePro 2 software, version 2.19.1.0).

Measurements

From our clinical sleep service database, we obtained age, gender, weight, body mass index (BMI), total AHI, REM-AHI, NREM-AHI, blood pressure, and Epworth Sleepiness Scale (ESS) data of all patients who met the polysomnography inclusion criteria during the study recruitment period (Nexus 1.6, Compumedics, Abbotsford, Victoria, Australia).

For study participants who consented for the trial, we also performed baseline functional assessments including Psychomotor Vigilance Test (PVT), Functional Outcomes of Sleep Questionnaire (FOSQ), and Depression Anxiety and Stress Scale-21 items (DASS-21). We obtained follow-up anthropometric, ESS, PVT, and CPAP adherence data at 1-, 3-, 6-, and 12-month visits, and FOSQ and DASS-21 at 1-, 3-, and 12-month visits.

The ESS is an 8-item, self-reported questionnaire that measures the subject’s average sleep propensity in daily life [15]. An ESS score > 10 units is consistent with excessive daytime sleepiness. FOSQ is a 30-item, self-reported questionnaire that assesses the impact of disorders of excessive sleepiness on multiple activities of daily living [16]. Lower scores indicate greater functional impairment. DASS-21 is a set of three self-report scales each containing 7 items designed to measure the emotional states of depression (DASS-21-D), anxiety (DASS-21-A), and stress (DASS-21-S) with higher scores indicating higher levels of negative emotional states [17]. We tested changes in alertness using a computer-based version of the 10-min PVT, which is a sustained visual vigilance/attention reaction time (RT) test [18] and assessed mean reaction time (PVT-RT) as a measure of speed.

CPAP adherence was determined by data downloaded from CPAP devices and measured as the average of total hours used per day over 12 months, including participants who did not complete the study. We defined adherent CPAP use as an average of ≥4 h/night and non-adherent as <4 h/night [19, 20].

Data analysis

Continuous variables were summarized as mean and standard deviation (SD) or median and interquartile range (IQR). Frequencies and percentages (%) were used for categorical variables. We compared baseline variables between groups using the two-sample t test or Mann-Whitney test for continuous variables, and Fisher’s exact test for categorical variables.

The primary outcome was the within subject change in ESS from baseline across the 12 months from commencement of CPAP therapy. Our secondary outcomes were the within subject changes from baseline in BP, PVT-RT, CPAP adherence, FOSQ, and DASS-21 over the same period. All variables except for FOSQ and DASS-21 were measured at 1, 3, 6, and 12 months. FOSQ and DASS-21 were measured at 1, 3, and 12 months. Our pilot study data estimated that the standard deviation of the within subject change from baseline in ESS at 6 months was 4.5 units. Assuming this standard deviation for within subject change, a sample size of 29 subjects has 80% power to detect a mean change in ESS of 2.3 units at a specific time point, or a change of 3 units at 1, 3, 6, or 12 months with an overall 5% significance level after Bonferroni correction. A sample size of 32 participants was chosen to allow for a dropout rate of up to 10%.

We used linear mixed effects models (LME) to investigate the joint effects of time (considered either a continuous covariate or as a 5-level factor: baseline, 1, 3, 6, and 12 months) and average CPAP adherence (2-level factor: <4 versus ≥4 h/night) on the continuous outcomes of interest (ESS, BP, PVT-RT, FOSQ, and DASS-21). In these models, subject ID was the group identifier, time was fitted as a random effect, and time, adherence, and their interaction were fitted as fixed effects. The interaction term was used to assess whether changes observed in an outcome variable over time depended on adherence status. Diagnostic plots were used to check that the assumptions underlying the LMEs were satisfied. PVT-RT data were log-transformed to stabilize the variance prior to analysis and presented on the natural log-transformed scale.

We performed logistic regression analysis to assess the association between baseline participant characteristics and CPAP adherence. Characteristics with univariable association p < 0.10 were included as candidate variables in a multiple logistic regression model. Backward stepwise variable selection was used to find the independent predictors of CPAP adherence. Odds ratios (OR) and their 95% confidence intervals (95% CI) were used to quantify the association. We used IBM SPSS Statistics for Windows, Version 25.0 (IBM Corp., Armonk, NY, USA), and S-PLUS Version 8.1 (TIBCO Software Inc., Palo Alto, CA, USA) for data analysis using two-tailed tests with a significance level of 5%.

Results

We identified 220 subjects from 2164 referrals who fulfilled polysomnography inclusion criteria for moderate-to-severe REM-OSA, for a prevalence of 10.2% within our sleep service population (Fig. 1). This population was middle-aged (56.0 years), predominantly female (63.6%), obese (BMI 33.5 kg/m2) but not hypertensive, and without excessive sleepiness (ESS 8.8 units). One hundred eighty-eight of 220 subjects were ineligible for the study or declined participation; therefore, we obtained a convenience sample of 32 participants from this identified population. Three participants dropped out before the 1st month data collection, and two participants did not satisfy inclusion/exclusion criteria on later review. The remaining 27 participants aged 53.2 (12.1) years (mean (SD)) were included for analysis. Subject baseline characteristics are presented in Tables 1 and 2.

Fig. 1
figure 1

Subject recruitment flow diagram. The final study sample included 27 participants who completed ≥1 month data collection and were included for analysis

Table 1 Summary statistics for clinical characteristics of patients screened for REM-OSA during study period
Table 2 Summary statistics of baseline demographics and clinical characteristics for participants by CPAP adherence

Total AHI for study participants was mildly elevated at 9.0 (4.1) events/h with an AHIREM of 34.8 (16.5) events/h. They had increased sleepiness with a baseline ESS at 12.6 (5.3) units, and sleep-associated quality of life was reduced (FOSQ 12.5 (3.2) units). DASS-21-D score was moderately elevated at 7 (3–10) units (median (IQR)), DASS-21-A score was moderately elevated at 6 (3.0–9.5) units, and DASS-21-S score was mildly elevated at 8 (5.0–9.5) units. PVT-RT was within normal limits at 414 (327–578) ms. Study participants had significantly higher ESS compared to non-participants (p = 0.0002); other baseline characteristics were comparable (Table 1).

During the CPAP trial, 8 participants (30%) stopped using CPAP (4 before 3 months, 1 before 6 months, 3 before 12 months), and 19 participants (70%) completed 12 months of treatment. Mean CPAP usage for all 27 participants over 12 months was 2.9 (2.4) hours. Ten participants (37%) were adherent to CPAP with an average nightly use ≥4 h, and 17 participants (63%) were classified as non-adherent participants (Table 2). Adherent participants were on average 14.5 years (95% CI, 7.5 to 21.4 years) older than non-adherent participants. Logistic regression analysis identified age as the only independent predictor of adherence after adjusting for diastolic BP (p = 0.052) and AHIREM (p = 0.08). For each year increase in age, the odds of CPAP adherence increased by a factor of 1.25 (95% CI, 1.04 to 1.50; p = 0.019).

CPAP treatment resulted in an overall significant decrease in ESS (Fig. 2a, linear trend −0.27 units/month) and increase in FOSQ (linear trend 0.10 units/month) at all measured time points compared with baseline values (Tables 3 and 4). Decreases in PVT-RT and DASS-21 were not sustained over 12 months. There was no significant change in BP at any time point.

Fig. 2
figure 2

a Change in ESS as whole group compared with baseline (all time points p < 0.01). Data expressed as mean with error bars representing SEM. b Change in ESS from baseline (0 months) between CPAP-adherent and non-adherent groups over 1, 3, 6, and 12 months. There was a consistent reduction in ESS at all time points for the CPAP-adherent group which was not present in the non-adherent group (linear mixed effects). Data expressed as mean with error bars representing SEM. *p < 0.05, **p < 0.01, ***p < 0.001 (all relative to 0 months)

Table 3 Summary statistics for clinical variables by months on study (n = 27) together with p value for test of linear time trend within participants from LME model
Table 4 Mean within patient change in ESS, FOSQ, log(PVT-RT), and DASS subscales over 12 months for all participants (n = 27) and for adherent and non-adherent subgroups

There was no statistically significant interaction between the effect of time on ESS treated as a 5-level factor and adherence status on ESS (p = 0.27). When time was treated as a continuous covariate, there was borderline evidence (interaction p = 0.068) of a difference between the estimated 0.45 units/month (p = 0.003) within subject ESS decline in the adherent subgroup compared to the 0.13 units/month (p = 0.245) decline in non-adherent subjects. ESS was comparable between the two groups at baseline (p = 0.19). However, the adherent group had a significant decrease in ESS at each time point which was not present in the non-adherent group (Fig. 2b). There were no significant differences between adherent and non-adherent groups for FOSQ, PVT-RT, and DASS-21 at any time point.

Discussion

To our knowledge, we present the largest study reporting clinical outcomes for CPAP treatment in sleepy patients with a strictly defined moderate-to-severe REM-OSA phenotype (i.e. REMAHI≥15 events/h). In this group of clinically sleepy REM-OSA patients, treatment with CPAP for up to 1 year was associated with a significant decrease in sleepiness and improvement in sleep-related quality of life. However, no sustained improvements in negative emotional states or alertness were observed. Overall group CPAP adherence was poor, but greater adherence was associated with increasing age and with a decrease in sleepiness.

Definitions of REM-OSA

Our prevalence of moderate-to-severe REM-OSA was 10.2% of all patients referred to our sleep service during the study period. Previously described REM-OSA prevalence using older criteria was 14–36% of OSA patients [7, 8], and this variability is likely due to the varying definitions for REM-OSA within different populations. One historical definition utilized an overall AHI≥5 events/h, with AHIREM/AHINREM ratio ≥2 (definition #1) [7], but this generalized definition of REM-OSA did not effectively exclude patients with significant coexisting NREM-OSA, potentially resulting in a heterogeneous patient population [9]. Additional qualifying criteria in other studies included various degrees of adjustment for AHINREM, including AHINREM≤15 events/h (definition #2, currently most widely used in the literature) [8] or AHINREM<10 events/h [2].

One study evaluated the effect of various definitions of REM-OSA on prevalence, #1, #2, and a “strict” definition with AHINREM<8 and > 10.5 min of REM sleep duration, and found that the prevalence was 36.7%, 24.4%, and 13.5%, respectively [21] which demonstrates that REM-OSA prevalence is highly and proportionately dependent of the quantity of NREM-OSA permitted by the definition. In order to separate the NREM component and ensure that there is adequate REM sleep time to accurately estimate the severity of REM-OSA, Mokhlesi et al. [9] proposed the definition using AHINREM≤5 events/h and ≥ 30 min of REM sleep. When this definition was applied to a population of patients referred to a sleep laboratory in Jordan for evaluation of OSA, the prevalence of REM-OSA fell from 18% using definition #1 to 2.7% [11]. Overall, our data have demonstrated that applying the new criteria resulted in a relatively high prevalence of moderate-to-severe REM-OSA. Hence, REM-OSA likely remains a prevalent OSA phenotype in a sleep clinic referred population, though its prevalence may vary dependent upon ethnic background and source of referrals.

Clinical phenotype of REM-OSA

A caveat with attempting to characterize the clinical phenotype of REM-OSA in past studies is the variable definitions of REM-OSA, and that the instruments used may not have been sensitive enough to assess the relevant phenotype characteristics.

The moderate-to-severe REM-OSA population characteristics defined using the revised diagnostic criteria from our database were in broad agreement with previous studies which have shown that REM-OSA patients are female-predominant and have less severe overall OSA [2, 7, 12], except one study that showed an association with hypertension [10]. Previous studies have also found similar levels of sleepiness in REM-OSA compared with NREM/REM-OSA patients, and no significant difference in BMI [11, 21].

Our enrolled participants had characteristics which were representative of our moderate-to-severe REM-OSA population, apart from hypersomnolence (mean ESS score 12.6 vs 8.3 units) as our study required subjective sleepiness as an inclusion criterion. At baseline, our sleepy REM-OSA participants had reduced sleep-related quality of life (FOSQ) and negative emotional states with elevated DASS-21 scores. Previous studies demonstrated increased anxiety and depression in REM-OSA patients compared to NREM/REM-OSA patients despite no difference in ESS scores [21, 22], although others have not found an independent association between REM-OSA and impaired health-related quality of life [23, 24]. We cannot exclude that the hypersomnolence contributed to the presence of negative emotional states and impaired quality of life in our participants. Despite increased sleepiness, there was no impairment in attention as shown by the normal Psychomotor Vigilance Test results which likely reflects the association of impaired performance with loss of sleep itself, rather than disruption to a particular sleep stage or change in sleep quality [25].

Clinical outcomes of CPAP treatment for REM-OSA

Most studies examining REM-OSA were limited to demographic and clinical characteristics of the patients [7, 8, 11, 12, 21]. The only study examining the effect of treatment was a retrospective observational study using the previous REM-OSA definition (i.e. AHIREM/AHINREM ratio of ≥2 and AHINREM≤15 events/h), demonstrating an improvement in functional outcomes with CPAP treatment, including ESS, depression symptoms, fatigue, and FOSQ [26].

Our main finding was that CPAP treatment reduced the ESS score in our participants with moderate-to-severe REM-OSA by 3.5 units over 12 months (ESS minimum important difference (MID) in OSA = 2 units) [27], despite the overall mild severity of OSA in the group (average AHI 8.1 events/h). There were reductions in ESS in the CPAP-adherent subgroup at all measured time points, while ESS reduction did not reach significance for the non-adherent subgroup. This is in keeping with previous literature, which showed that reduction in ESS was apparent with CPAP usage of more than 4 h [20], with a linear dose-response relationship between increased usage and achieving normal levels for subjective sleepiness in NREM/REM-OSA. Clearly, CPAP adherence is an important factor for improvement in excessive sleepiness in both REM and NREM/REM-OSA.

There was also a statistically and clinically significant improvement in functional outcomes of sleep as measured by a decrease in FOSQ of 1.6 units (MID = 1 unit) [28]. This was achieved despite the low average CPAP use of 2.9 h/night, which was surprising as it has previously been shown that improvement in FOSQ was achieved at a high CPAP usage of >7 h/night for NREM/REM-OSA [20]. There was no significant difference in FOSQ improvement between adherent and non-adherent groups.

The impact of CPAP on other measured secondary end points demonstrated only borderline significant changes that were highly influenced by subject dropouts and large variability in measurements over time points.

CPAP use in REM-OSA

The average usage of CPAP in this study is lower (2.9 h/night) compared to other CPAP studies, based on a meta-analysis over 20 years which demonstrated an average of 4.6 h/night [29]. We also had a high drop-out rate with only 19 of 27 participants completing 12 months of treatment. Major reasons for study discontinuation included poor CPAP tolerance due to nasal congestion, dryness, and mask discomfort. The non-adherent subgroup had higher levels of depression, anxiety, and stress than the adherent subgroup, but this did not reach statistical significance. The only independent variable positively associated with improved CPAP adherence was increasing age, which has previously been demonstrated for NREM/REM-OSA patients [30]. Nevertheless, approximately 30% of the enrolled participants completed 1 year of treatment with satisfactory compliance, demonstrating that CPAP treatment is feasible and worthwhile in selected sleepy moderate-to-severe REM-OSA patients with only mild OSA severity overall, with older patients likely to do better.

Strengths and limitations

There are some limitations to our study. Firstly, as our study was observational in design without a control arm or randomization, it is not possible to prove that CPAP intervention over 12 months was the sole reason for improvement in sleepiness and quality of life. However, the consistent reduction in ESS at all measured time points in the CPAP-adherent group compared with the non-adherent group supports the benefit of CPAP in this population. Secondly, our participants consisted of patients who presented to a sleep centre with subjective daytime sleepiness, and therefore, the outcomes for our participants may not be representative of all people fulfilling the revised diagnostic criteria for moderate-to-severe REM-OSA. Thirdly, we acknowledge that the final sample sizes of recruited participants (n = 27) and associated subgroups (non-adherent, n = 17; and adherent, n = 10) were small and likely limited our ability to detect significant differences. Although we did not demonstrate consistent improvements in PVT or DASS-21, we cannot exclude the possibility of a type II error leading to this outcome, and the study is likely inadequately powered to draw conclusions for some of the secondary outcomes. Finally, the duration of CPAP use by participants was highly variable (from 1 month to more than 1 year), further limiting our ability to detect significant changes in our clinical outcomes for the group.

However, there were several strengths of our study. We measured CPAP adherence objectively from CPAP downloads which eliminated recall bias. We also used standardized outcomes to assess responses to treatment, and measurements such as ESS and FOSQ have been validated for OSA. We also used novel definition criteria specific for REM-OSA in order to address the clinical outcomes in this homogenous group of patients without any significant NREM-OSA, which is distinct from the previous definitions.

Conclusion

To date, this is the first study examining the effect of CPAP in sleepy patients with moderate-to-severe REM-OSA defined by the revised diagnostic criteria [9]. At baseline, our REM-OSA participants had overall mild OSA but with symptoms of increased sleepiness, reduced quality of life, and negative emotional states. There was a significant and sustained improvement in subjective sleepiness and sleep-related quality of life during 12 months of CPAP treatment despite suboptimal CPAP adherence overall. REM-OSA likely represents a clinically relevant OSA phenotype, where patients with relatively mild OSA overall may demonstrate excessive sleepiness and reduced quality of life, which may be improved by CPAP therapy. Further studies with a larger patient group would be necessary to demonstrate if there are also clinically significant improvements in emotional states and relevant cardiovascular outcomes.