Introduction

Obstructive sleep apnea (OSA) refers to the apnea and lack of ventilation caused by the collapse of the upper airway during sleep, also known as obstructive sleep apnea hypopnea syndrome (OSAHS). It is mainly manifested as snoring, asphyxiation, daytime sleepiness, and memory decline, which can cause cardiovascular and cerebrovascular diseases and cognitive dysfunction in the long term. OSA can occur in a variety of populations, most of which are middle-aged and elderly. OSA affects a significant number of people worldwide each year. It has been estimated that approximately 1 billion adults aged 30 to 69 years had obstructive sleep apnea, and 425 million of them had moderate to severe obstructive sleep apnea [1], even with an incidence of up to 50% in some countries [2]. With the increase of obesity rate, patients with OSA present an increasing trend [3]. OSA is considered to be a systemic problem and a high-risk factor for many common conditions, such as hypertension, diabetes, and cardiovascular and cerebrovascular accidents, which can lead to sudden death and increase the likelihood of traffic accidents. In addition, studies have shown that OSA is associated with gastroesophageal reflux disease (GERD) [4, 5].

Gastroesophageal reflux disease can be defined as troublesome symptoms that result from the reflux of gastric contents into the esophagus, throat, mouth, or lungs, such as regurgitation, heartburn, and belching. It can also cause a series of long-term complications and end-organ effects, such as vocal cord leucorrhea, reflux asthma, arrhythmia, and sleep disorders [6, 7]. The diagnosis of GERD may involve symptom and sign scale, endoscopic examination, pH-metric criteria, and other indicators. PH-metry ambulatory reflux monitoring is the only test that can evaluate reflux symptom association, including the following indicators commonly used: percentage time pH < 4, the longest reflux duration, number of reflux events, and the DeMeester score. And reflux diseases questionnaire score (RDQ score) and reflux symptom index (RSI) are commonly used clinical scores for symptoms and signs. RDQ score is scored by investigating heartburn, chest pain, acid reflux, and food loss in the past 4 weeks. RSI contains nine items of reflux symptoms. Improvement of life style and acid inhibition therapy are main treatments at present. The frequent co-occurrence of OSA and GERD may be explained by the following mechanisms. Firstly, GERD and OSA patients had similar risk factors: age, obesity, smoking, and alcohol consumption [8, 9]. In addition, strong protective mechanisms for swallowing and esophageal peristalsis decline during sleep, which may delay esophageal acid clearance, thereby increasing acid-mucosal contact time [10]. Most importantly, the negative intrathoracic pressure during inspiratory efforts against the occluded airway and the repetitive arousals to counter obstruction events are responsible for the increased incidence of regurgitation [11, 12].

It is well known that continuous positive airway pressure (CPAP) can significantly improve the hypoxic status and symptoms of OSA and reduce the incidence of related complications, so it is the preferred treatment for patients with OSA. However, whether or not CPAP can improve reflux symptoms in patients with GERD is still controversial. Studies have suggested that CPAP can reduce reflux events in patients with OSA, and even alleviate reflux symptoms in patients without sleep disorders [13, 14]. However, some studies have suggested that the use of CPAP can lead to gastric distension, which can aggravate symptoms such as reflux [15]. The inconsistency of results have been due mainly to small sample sizes and differences in study design. In this study, by integrating changes in reflux symptoms and parameters before and after CPAP treatment documented in previous studies, we attempted to clarify whether or not CPAP treatment is beneficial for GERD in patients with OSA.

Methods

Search strategy

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were used. We searched PubMed, Elsevier, Cochrane library, and CNKI database to obtain the targeted articles. The retrieval time of the article is up to March 2020, with no language restrictions. Search terms included “OSAHS or OSA or obstructive sleep apnea hypopnea syndrome or obstructive sleep apnea or Sleep Apnea, Obstructive”, “gastroesophageal reflux or esophageal Reflux or reflux or GER or GERD,” and “continuous positive airway pressure ventilation or CPAP or Ventilation Modes, APRV.” Computer retrieval was combined with manual retrieval of bibliographies of all retrieved articles. Articles that were potentially relevant were evaluated for inclusion according to pre-specified inclusion and exclusion criteria.

Inclusion and exclusion criteria of literature

  1. 1.

    All subjects in this study were adults with OSAHS and reflux symptoms. OSA was diagnosed by polysomnography with apnea hypopnea index (AHI) ≥ 5, and GERD patients were mainly diagnosed on the basis of reflux parameters and symptoms.

  2. 2.

    All patients did not take acid-suppressing drugs and did not receive CPAP treatment for 1 month prior to the trial to prevent their results from being affected.

  3. 3.

    The study must include reflux-relevant values on pre-CPAP and post-CPAP. If the study data could not be provided, or the data was not expressed in the form of WMD and SD, the study was excluded.

  4. 4.

    Research is limited to humans.

  5. 5.

    Abstracts, review, editorials, and case reports were deleted.

Statistical analysis

Review Manager 5.3 and STATA (version 12.0) were used for statistical analysis. The statistical results of continuous outcomes were expressed by weighted mean difference (WMD) and 95% confidence interval (CI), while dichotomous outcomes were expressed by risk ratio (RR) and 95% CI. The Mantel-Haenszel analysis was suitable for dichotomous variables, while the inverse variance method was utilized for continuous variable [16]. The statistical significance level was P < 0.05. Statistical heterogeneity was evaluated according to the I-square (I2) value. Heterogeneity is high when I2 > 75%, moderate when I2 = 50–75%, and low when I2 = 25–50%, and no heterogeneity when I2 < 25% [17]. If I2 < 50%, the study is considered low in heterogeneity, and fixed effects models are used to aggregate the results. If I2 > 50%, the study is considered to be moderately or highly heterogeneous, and the random effects model is used to aggregate the data [18, 19].

Considering the effect of CPAP therapy duration on the outcome, we performed a subgroup analysis (CPAP therapy duration < 1 week and ≥ 1 week). Sensitivity analysis was used to assess the stability of the results. The possible sources of heterogeneity were identified by exploiting meta-regression. We obtained the forest map through Review Manager synthesis. The funnel plot, the Begg test, and the Egger test were used to assess potential publication bias [20, 21]. We also used pruning and filling methods to identify and correct the asymmetry of funnel plot caused by publication bias [22].

Results

Search results

The search and screening of the literature was carried out by two reviewers independently. We initially screened 66 articles from the database, then roughly screened the abstracts or titles of these articles and identified 28 relevant articles. And 5 articles were excluded because of data duplication. We further screened the full text of 23 articles according to the inclusion and exclusion criteria; since there were not enough data or there was a lack of the mean and standard deviation of pre-CPAP and post-CPAP reflux-related indicators, 13 articles were excluded. In the end, a total of 10 studies were included for further research. The steps of document retrieval are displayed in Fig. 1.

Fig. 1
figure 1

Flow diagram of the literature search

Characteristics of included studies

Ten studies involving a total of 272 participants were included in this review [17, 23, 24]. Our review incorporated 6 sets of data representing reflux parameters: percentage time pH < 4, the longest reflux duration, number of reflux events, DeMeester score, RDQ score, and RSI. One study provided 4 sets of data, and 3 studies provided 3 sets of data. The analysis of percentage with time pH < 4 included 5 studies, the longest reflux duration included 6 studies, the number of reflux events included 4 studies, and the DeMeester score, RDQ score, and RSI respectively included 2 studies. All the included self-controlled trials were level 2, and the design of randomized controlled trials was level 1. The authors, year of publication, national sources, sample sizes, duration of treatment, study design, and level of research evidence are shown in Table 1.

Table 1 Characteristics of included studies

Pooled analysis

We obtained a forest map that reflected the relationship between CPAP and percentage time pH < 4, the longest reflux duration, number of reflux events, DeMeester score, RDQ score, and RSI. We summarize the key information of the forest map in Table 2, including reflux parameters, subgroup, studies included, sample size, χ2 (df), P value, mean difference (95% CI), and heterogeneity (I2). The analysis shows that the I2 values of the indicators in the study are 95%, 85%, 97%, 0, 31%, and 82% respectively, and the heterogeneity varies greatly. Therefore, we use the random effects model to combine the effect size. The results illustrated that the total of WMD was − 17.68 (95% CI − 30.67 to − 4.69, P = 0.008) for percentage time pH < 4, − 24.66 (95% CI − 36.15 to − 13.18, P < 0.0001) for the longest reflux duration, − 27.53 (95% CI − 49.53 to − 5.52, P = 0.01) for number of reflux events, − 49.76 (95% CI − 60.18 to − 39.35, P < 0.00001) for the DeMeester score, − 1.85 (95% CI − 3.00 to − 0.71, P = 0.002) for the RDQ score, and − 8.95 (95% CI − 16.00 to − 1.89, P = 0.01) for RSI.

Table 2 The overall analysis and subgroup analysis of reflux parameters before and after CPAP treatment

Subgroup analysis

According to the dividing line of treatment time of 1 week, we performed subgroup analysis for three sets of parameters including percentage time pH < 4, the longest reflux duration, and the number of reflux events (Table 2).

Treatment time < 1 week: the analysis of percentage with time pH < 4 included 3 studies, the longest reflux duration included 3 studies, and the number of reflux events included 2 studies. The total of WMD for percentage time pH < 4 was − 12.55 (95% CI − 22.65 to − 2.44, P = 0.01), the corresponding value for the longest reflux duration was − 27.70 (95% CI − 47.67 to − 7.71, P = 0.007), and the corresponding value for the number of reflux events was − 9.57 (95% CI − 12.43 to − 6.71,P < 0.00001).

Treatment time ≥ 1 week: the analysis of percentage with time pH < 4 included 3 studies, the longest reflux duration included 3 studies, and the number of reflux events included 2 studies. The total of WMD for percentage time pH < 4 was − 25.44 (95% CI − 45.45 to − 5.43, P = 0.01), the corresponding value for the longest reflux duration was − 24.77 (95% CI − 36.74 to − 12.80, P < 0.0001), and the corresponding value for the number of reflux events was − 49.27 (95% CI − 60.92 to − 37.63, P < 0.00001).

Sensitivity analysis and quality assessment

Sensitivity analysis showed that the five indicators including percentage time pH < 4, the longest reflux duration, DeMeester score, RDQ score, and RSI would not destroy the current pooled analysis results after removing any study. While one indicator (number of reflux events) showed no significant difference in the pooled analysis results after the deletion of Maroun’s study. After changing the random effects model to the fixed effects model, the analysis results did not change significantly.

Publication bias

There may be a slight publication bias because the funnel plot was not completely symmetrical. However, we performed the Begg and Egger tests for the three groups of parameters in the study, which were percentage time pH < 4, the longest reflux duration, and number of reflux events. The Begg test showed no publication bias in percentage time pH < 4 (P = 0.806), the longest reflux duration (P = 0.452), and number of reflux events (P = 0.734). In analogy, there was no publication bias in Egger’test for percentage time pH < 4 (P = 0.817), the longest reflux duration (P = 0.482), and number of reflux events (P = 0.833). The pruning and filling methods showed that the symmetry of funnel plots did not require statistical correction.

At the same time, we evaluated bias sources used risk of bias from Cochrane risk-of-bias tool (Fig. 2).

Fig. 2
figure 2

Risk of bias summary and risk of bias graph

Discussion

It is well known that OSA patients with reflux symptoms are a very common phenomenon. Although CPAP is considered to be the preferred treatment for OSA patients, the influence of CPAP therapy on reflux in OSA patients remains controversial. On the one hand, some studies have suggested that CPAP treatment increases rate of GERD, which may be related to esophageal aerophagia, that is, CPAP tend to move air down to the esophagus and stomach through pressure, causing abdominal distension and further triggering lower esophageal sphincter relaxation [25, 26]. However, Ozcelik H et al. demonstrated that CPAP treatment does not cause reflux through objective parameters of the esophageal manometer and pH meter [27]. On the other hand, unlike the above, studies have demonstrated that after CPAP treatment, the reflux symptoms of OSA patients are alleviated 17,29-31. They believe that CPAP treatment can reduce airway obstruction, relieve inspiratory force, increase esophageal pressure, and lower arousal and movement frequency, all of which are associated with a reduced risk of GERD. In addition, CPAP pressure in severe OSA patients increased lower esophageal sphincter pressure and further decreased GERD [28]. Some researchers deem that improvement of reflux after CPAP treatment is the result of increased esophageal pressure and lower esophageal sphincter pressure [29].

In previous studies, we could perceive that different indicators and test methods were used to evaluate GERD. The diagnosis of GERD requires comprehensive consideration, which may involve symptoms, endoscopic examination, pH-metric criteria, manometry, PPI trail, and detection of helicobacter pylori. A presumptive diagnosis of GERD can be originated in the setting of typical symptoms of heartburn and regurgitation [28]. The reflux diseases questionnaire and the gastroesophageal reflux diseases questionnaire are currently the most widely used epidemiological studies of diagnostic questionnaire tools. “PPI trial” is often used in clinical practice, but is prohibited in our study. Although the pH value criterion, esophageal pressure measurement, and endoscopy are not used in routine clinical diagnosis, they embody a comprehensive feature of GERD, and they are important in intractable and atypical symptoms. PH-metry ambulatory reflux monitoring is the only test that can evaluate reflux symptom association [30], including the following indicators commonly used: percentage time pH < 4, the longest reflux duration, number of reflux events, and the DeMeester score.

Our study combined a number of indicators for evaluating GERD, including symptom scores and pH-metry parameters. The changes in six indicators before and after CPAP treatment showed that CPAP was significantly correlated with reflux index (P < 0.01), which effectively improved reflux symptoms and pH-metry parameters in OSA patients. There was a degree of heterogeneity in this study. Considering the effect of CPAP therapy duration on the outcome, we performed a subgroup analysis based on the treatment time to find the source of heterogeneity. The analysis demonstrated that the results were significant regardless of whether the treatment time was longer or shorter than 1 week, indicating that CPAP treatment can reduce reflux events regardless of duration of treatment. No publication bias was found in the results of Begg’s test and Egger’s test. At the same time, the sensitivity analysis showed that the overall results were basically unchanged after excluding the single study or converting the fixed effects model to the random effects model. Therefore, the outcome of our meta-analysis was strongly convincing.

Although our research acquired meaningful reliable results, it still had some limitations. First of all, the study sample size included in the analysis was too small, so we may need more studies to address the impact of this factor. Secondly, the diagnosis criteria for GERD was partly based on subjective symptoms and empirical PPI therapy (“PPI trial”), but PPI therapy was prohibited in our study, which made the diagnosis challenging. In addition, different studies adopted different indicators and measurement techniques, which may produce inconsistent results to some extent. Finally, some results had a large heterogeneity, but we could not define the source of heterogeneity from the limited research data.

Conclusion

Our study aggregated six indicators, including two symptom scale scores and four indicators related to pH-metry parameters. Our meta-analysis showed that CPAP treatment relieved reflux symptoms and reduced the value of pH-metry. In conclusion, CPAP is beneficial for gastroesophageal reflux in patients with OSAHS. Therefore, in patients with OSAHS who suffer with GERD, CPAP is a considerable option for the treatment of OSA and the improvement of reflux symptoms. However, more prospective data are needed to confirm which patients benefit.

RDQ score, reflux diseases questionnaire score; RSI, reflux symptom index; LOE, level of evidence; RCT, randomized controlled trials; SCT, self-control trials

RDQ score, reflux diseases questionnaire; RSI, reflux symptom index; NA, not appliable; CI, confidence interval