Introduction

Cognitive decline in later life can have significant effects on daily functioning, quality of life and independence (Harada et al. 2013). With the increase in the ageing population and concomitant increase in the rates of dementia, efforts to understand the contributors to cognitive decline have become imperative. Between 30 and 50% of the risk for dementia could be due to modifiable risk factors such as depression, physical inactivity and cardiovascular disease (Norton et al. 2014). Over the last few years it has also become apparent that cognitive decline may also be linked to various forms of sleep disturbance (McKinnon et al. 2014; Naismith et al. 2014; Naismith et al. 2010; Naismith et al. 2011; Yaffe et al. 2011) to the extent that sleep disturbance is now also being considered as a potential modifiable risk factor (Mander et al. 2016).

There are various aetiological mechanisms by which sleep disturbance may be linked to cognitive decline, including disruption of critical neurotrophins and neurogenesis (Meerlo et al. 2009), disruption of sleep microarchitecture and the overnight consolidation of memories (Pace-Schott and Spencer 2015), or even excessive daytime sleepiness (Jaussent et al. 2012). There is also a well-established link between obstructive sleep apnoea (OSA) and cognitive dysfunction (Lal et al. 2012; Naismith et al. 2004). OSA is a common breathing disorder that involves repetitive cessation of breathing due to collapse of the pharyngeal airway during sleep. Immediate consequences of OSA are changes in blood oxygen and carbon dioxide levels, acute increases in blood pressure, as well as intermittent sleep fragmentation and arousals caused by the apnoeic events. In addition to being associated with multiple co-morbidities such as cardiovascular and metabolic dysfunction, neurobehavioral symptoms are often common, including excessive daytime sleepiness, increased risk of motor vehicle accidents, and neuropsychological dysfunction (for review, see(Jordan et al. 2014)).

There is substantial evidence surrounding the impact of OSA on neuropsychological function in adults. In younger and middle aged adults, meta-analytic studies suggest that OSA is associated with a decline in memory (Wallace and Bucks 2013), vigilance, executive functioning and motor coordination (Beebe et al. 2003; Bucks et al. 2013; Engleman et al. 2000). While the mechanisms have not been fully delineated, it appears that both hypoxemia and sleep fragmentation may contribute differentially to cognitive decline, with the former having potentially more deleterious effects. Of significance, effective treatment of OSA with continuous positive airway pressure (CPAP) appears to ameliorate some deficits associated with OSA but there is considerable heterogeneity in the outcomes. It appears that CPAP is associated with improvements in attention and vigilance, but deficits in other domains tend to persist despite adequate treatment (Kylstra et al. 2013; Pan et al. 2015).

However, the literature pertaining to the relationship between OSA and neuropsychological functioning in later life has not been specifically or quantitatively analysed. This is despite evidence that the risk and incidence of OSA increases with age (Launois et al. 2007; Heinzer et al. 2015). Affecting approximately 2–4% of middle aged adults (Young et al. 1993), OSA has been estimated to occur in anywhere from 15% to 60% of elderly adults (Ancoli-Israel et al. 1991; Young et al. 2002). The mechanisms behind an increased risk and prevalence of OSA in aged adults are not completely understood, but have been proposed to be due to a combination of anatomical and mechanical changes in the upper airway (Launois et al. 2007; Edwards et al. 2010). Importantly, older adults are at greater risk of medical burden. Not only does this reduce the reserve and resilience of older adults to OSA, these comorbidities are themselves associated with a greater risk for cognitive decline (Biessels et al. 2006; Justin et al. 2013). Of concern, there is longitudinal evidence of a link between OSA and increased risk of cognitive decline and dementia (Chang et al. 2013; Osorio et al. 2015; Yaffe et al. 2011). Therefore, to inform clinical screening and dementia prevention strategies (Naismith et al. 2009), it is important to clearly understand the relationship between OSA and cognitive function in older adults.

Unfortunately, the neuropsychology literature involving OSA can often be inconsistent and difficult to reliably quantify. Empirical studies differ significantly in their design and sampling of specific populations, increasing the risk of bias and reducing the generalisation of results to the general population. Studies also do not always use reliable or objective diagnosis measurements of both OSA such as polysomnography (PSG). Additionally, neuropsychological tests are not always well standardized, and since neuropsychological assessments are expensive and challenging to administer, especially in large samples, epidemiological or community studies may incorporate only gross and insensitive measures of cognition. These are important factors to be considered when reviewing and analysing the literature.

Therefore, the aim of this systematic review was to both find and examine the studies that have objectively investigated the effect of OSA on neuropsychological function with a specific focus on older adults. We also aimed to analyse the literature with a particular focus on investigating how differences in these studies may contribute as moderators of the effects, to better inform understanding in this area and potentially indicate which older adults may be more vulnerable to the cognitive consequences of OSA.

Methods

This work complies with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Liberati et al. 2009) (see Checklist S1). Methods of analysis and inclusion criteria were registered in advance with PROSPERO (2015:CRD42015020239) and are documented in Protocol S1.

Eligibility

This systematic review and meta-analysis aimed to answer the question: what is the association between OSA and cognitive function across various cognitive domains in older community-dwelling adults (aged >50 years)?

Participants

Eligible studies consisted of healthy participants with a mean participant age and lower bound of 1 standard-deviation above 50 years. Healthy participants were any adult without a diagnosis of mild cognitive impairment (MCI), dementia or other neurological disorders (e.g. stroke, epilepsy). Any studies that investigated the effects of medications that may have influenced cognition (e.g. anesthesia, benzodiazepines) were also excluded from the review.

Diagnosis Measures

Only objectively defined OSA as measured by a validated sleep apnoea diagnostic device or technique was eligible. Both in-lab and home-based PSG diagnostic sleep studies were acceptable as long as comparable measures of breathing and blood oxygen saturation had been derived in order to calculate the rate of apnoeas and hypopneas per hour of sleep (the apnoea-hypopnea index; AHI) and determine the severity of OSA. In concordance with the American Academy of Sleep Medicine guidelines (Epstein et al. 2009), OSA has been defined by a minimum of AHI ≥ 5, and severity rated as mild (5 ≤ AHI < 15), moderate (15 ≤ AHI < 30), or severe (AHI ≥ 30). Although there may have been differences between studies in their calculation of AHI indices (Ho et al. 2015; Ruehland et al. 2009), we extracted the AHI scores as reported by each of the individual studies.

This study only included cases of OSA. While central sleep apnoea is another sleep related breathing disorder of interest, the apnoeas and hypopnoeas are caused by a disturbance in neural output to the respiratory motor neurons rather than airway obstruction, such as in OSA (Young et al. 2002). Therefore the pathophysiology and characteristics of central sleep apnoea are distinct from OSA, justifying exclusion from this analysis.

Comparisons

Comparator groups must have consisted of older adults that shared the same age and gender distributions as OSA participants. The comparator participants must have also undergone a diagnostic PSG to rule out the presence of OSA. Absence of AHI was defined by an AHI < 5. Studies that compared cognitive outcomes with OSA severity measures were eligible if they included control participants in the sample (i.e. a complete range of OSA severity). Unlike previous meta-analyses in this area (Beebe et al. 2003; Wallace and Bucks 2013), we did not compare outcomes to published norms. This is because published neuropsychological normative data will have not taken into consideration the potential presence of OSA in reference samples. Additionally, the use of normative data will most likely differ across studies. We also did not include studies where OSA patients were their own controls before and after treatment (e.g. with CPAP). This is because changes in cognitive function associated with OSA may require treatment longer than can be seen in short-term intervention trials, or because practice effects on neuropsychological testing may bias observations. Therefore we reduced our analysis to simply look at cross-sectional data.

Outcome Measures

Outcomes included scores on standardised neuropsychological tests (Strauss et al. 2006) that measure performance in one or more of the following domains: global cognition, verbal memory, non-verbal memory, working memory, processing speed, attention and vigilance, language, visuospatial skills and executive functions. All outcomes that met this definition were included in the review, including secondary outcomes and multiple tests of the same cognitive domain.

Study Design

Published, peer-reviewed studies presenting cross-sectional or case-control data were selected for analysis. Studies reporting associations between cognitive outcomes and measure of OSA severity (i.e. AHI) were also included. Other study designs (e.g. cohort studies or trials) were eligible if they reported analyzable baseline outcome measures; only baseline data were included in the analyses.

Information Sources and Search Strategy

We searched Embase, PsychInfo and Medline from inception to August 2016. Search terms were: (“cognit*” OR “neurocognit*” OR “neuropsycholog*” OR “attention” OR “executive speed” OR “language” OR “visuospatial” OR “fine motor” OR “memory” OR “learning”) AND (“apnoea” OR “sleep apnea” OR “snoring” OR “sleep disordered breathing” OR “sleep AND apnoea” OR “sleep AND apnea” OR “sleep-disordered breathing” OR “sleep related breathing disorder” OR “sleep-related breathing disorder”). Syntax for the search in each database is provided in Protocol S1. There were no restrictions for date of publication.

Study Selection

Search results were screened for initial eligibility based on title and abstract by N.C. Full-text manuscripts of studies that could not be definitively screened out exclusively based on the information in their title or abstract were assessed by two reviewers (N.C. and J.P.) independently. Disagreements about study eligibility were resolved by A.L., who approved the final list of included studies. Language was limited to English, German, Portuguese, Polish or Hebrew, while conference abstracts with no corresponding full text manuscripts were excluded.

Risk of Bias

Within Studies

The risk of bias in studies was assessed by N.C. using criteria modified from the NIH National Heart, Lung and Blood Institute’s Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies (National Institutes of Health 2014). Studies were regarded as being at high or unclear risk of bias if they did not meet core criteria (see table Table S1); all other studies we regarded as being at low risk of bias.

Across Studies

In order to evaluate risk of small-study effect, funnel plots of effect size against standard error for overall outcomes as well as for each cognitive domain were inspected for asymmetry (Sterne et al. 2011). When a meta-analysis included ten or more studies, further testing of funnel plot asymmetry was conducted using Egger’s test of intercepts (Egger et al. 1997). A positive intercept is indicative that studies with smaller samples were more likely to report more positive results than larger studies. When statistically significant asymmetry was found (defined as p < 0.1), a Trim and Fill analysis (Duval and Tweedie 2000) was used to quantify the impact of funnel plot asymmetry on the pooled effect size.

Data Extraction and Coding

Means and standard deviations of each neuropsychological test scores for OSA and comparator groups were extracted from studies and entered into a spreadsheet by N.C. Where available data regarding mean age of participants, years of education, the setting of the study (sleep clinic or community), and the percentage of females in the sample were also extracted from each study. Approximately 50% of the data were entered under the direct supervision of A.L. Primary outcome data from most studies were entered as means and standard deviations (SDs) for the OSA and comparator groups, however in some cases data was entered as a mean difference between groups with a p-value (Alchanatis et al. 2008; Hjelm et al. 2013), or as a correlation with OSA severity (Kim et al. 2011; Terpening et al. 2015; Yesavage et al. 1985). Outcome measures were coded into set cognitive domains by two reviewers (N.C. and A. L.) based on standard neuropsychological categorization (Strauss et al. 2006) or by consensus (for our classifications of neuropsychological tests into cognitive domains, see Table S2). Some studies reported findings between distinct groups based on OSA severity cutoffs (AHI > 5, AHI ≥ 15, AHI ≥ 30) (Ju et al. 2012; Lutsey et al. 2015; Blackwell et al. 2011). In these cases we combined means and SDs from the three groups into a single OSA arm using the formula provided in the Cochrane Handbook (Higgins and Green 2011).

Data Analysis

All analyses were performed using a random effects model in Comprehensive Meta-Analysis (CMA) version 3 (Biostat, Englewood, New Jersey). The primary outcome was standardised mean difference (SMD, calculated as Hedges’ g) in performance on neuropsychological tests between OSA and control groups, or the association between OSA severity measures. CMA flexibly allows for differently reported study outcomes to be entered into the model. Analyses were conducted for all cognitive results combined, as well as for each of the following cognitive domains: attention & vigilance, processing speed, memory, general cognitive or intellectual ability, executive function, language, visuospatial ability and motor learning. Precision of the SMD was calculated for each study by the 95% confidence interval (CI). A positive SMD suggests that controls performed better on cognitive tests than the OSA sample.

When the one study presented data from more than one neuropsychological test, these were combined in two ways. First, results for all tests were combined to produce a single SMD per study, following an established method (Lampit et al. 2014). Additionally, separate tests were classified by neuropsychological capability (see Table S1), such that each study could contribute to one or more cognitive-domain specific SMDs but no study could contribute more than once to each SMD calculated. When outcomes from a given study were combined, the effect estimate reflected the mean amongst the related tests, and the estimate’s variance was scaled up based on an assumed intercorrelation between the tests of 0.6 (Borenstein et al. 2009; Gleser and Olkin 2009). An assumed intercorrelation of 0.8 was used between tests when calculating the effect size for a specific cognitive domain within each study.

The main analysis consisted of combining all outcomes from each study and these were pooled to determine the association between OSA and overall cognition. We also performed domain-specific meta-analyses, in which only studies that reported outcomes on a specified cognitive domain were included, using one combined SMD per study. The overall and domain-specific meta-analyses were performed using a random-effects model. Heterogeneity among studies was quantified using I2 (Higgins et al. 2003).

Subgroup meta-analyses were conducted in order to examine between-study variability and identify how design elements may moderate these associations. These moderators were defined as: the setting of studies (sleep clinic or community-based), AHI cutoff used to define OSA and control participants, and risk of bias (high or low). The subgroup analyses were based on a mixed effects model, which uses a random-effects model to generate within-subgroup variance and a fixed-effects model to compare effects between subgroups. Between-subgroup heterogeneity was tested using the Cochrane’s Q statistic (Higgins and Green 2011) and was defined significant at the p < 0.05 level. Mean age of sample, years of education and percentage of females in study were examined as potential confounders through random effects meta-regression.

Results

Study Selection

The search identified a total of 10,308 records. After duplicate search results were removed, 7757 studies were screened for eligibility based on abstract and title, of which 7560 were definitively excluded. One hundred and ninety-seven full-text articles were assessed for eligibility, of which 184 were excluded using our PICOS criteria (Fig. 1). In addition, 6 articles (Boland et al. 2002; Blackwell et al. 2015; Martin et al. 2015; Mathieu et al. 2007; Spira et al. 2008; Phillips et al. 1994) met our inclusion criteria however used the same or substantial proportions of participants from studies already included in the systematic review (Berry et al. 1990; Yaffe et al. 2011; Mathieu et al. 2008; Blackwell et al. 2011; Quan et al. 2011; Sforza et al. 2010). These were excluded in order to prevent replicate analysis of the same participants. Four studies were found that did not meet our criteria of a comparator group, as they compared samples of participants dichotomously split by a cutoff of (AHI < 15) (Hjelm et al. 2013; Ju et al. 2012; Sforza et al. 2010; Yaffe et al. 2011). However, we included these studies in the systematic review and meta-analysis for additional context. We found one study that met our inclusion criteria but did not report standard deviations or standard errors, and so could not be included in the meta-analysis (Foley et al. 2003). Thirteen articles were eventually deemed eligible and included in the meta-analysis (for a detailed description of each study, see Table 1).

Fig. 1
figure 1

CONSORT summary of trial identification and selection

Table 1 Study characteristics

Characteristics of Included Studies

The 13 studies included in the review reported a total of 5104 participants (OSA, n = 3128, median group size = 41, range 8–1840; controls n = 1879, median group size = 49, range 12–618; Table 1) and 38 neuropsychological test outcomes encompassing 8 cognitive domains (attention & vigilance; processing speed; memory; executive functioning; motor learning; language; visuospatial ability, and; general cognitive ability/intelligence). The most commonly reported cognitive domains were executive function (k = 10), processing speed (k = 8) and global cognition (k = 7). Mean participant age ranged from 55 to 82 years, and approximately 28% of participants were women where this was reported. The studies were largely from the US (Berry et al. 1990; Blackwell et al. 2011; Lutsey et al. 2015; Quan et al. 2006; Yaffe et al. 2011; Yesavage et al. 1985) or Europe (Alchanatis et al. 2008; Ferini-Strambi et al. 2003; Hjelm et al. 2013; Sforza et al. 2010), in addition to studies from Canada (Mathieu et al. 2008), Australia (Terpening et al. 2015) and Korea (Ju et al. 2012; Kim et al. 2011).

Ten of the studies (77%) reported differences in cognitive outcomes between an OSA group and a control group, while 3 studies reported correlations between OSA severity (AHI) and outcome measures (Kim et al. 2011; Terpening et al. 2015; Yesavage et al. 1985). Five of the case-control studies consisted of baseline data taken from a single time-point in cohort studies (Blackwell et al. 2011; Lutsey et al. 2015; Quan et al. 2006; Sforza et al. 2010; Yaffe et al. 2011).

Risk of Bias

The sampling of participants was the major source of bias, with studies either using convenience sampling or participants from a specific cohort study. Case-control studies arising from specialist sleep clinic populations were significantly smaller and were unclear when reporting whether cases and controls were recruited from the same populations. Three studies recruited their sample from different populations (Ju et al. 2012; Yaffe et al. 2011; Yesavage et al. 1985).

Three of the case control studies using sleep clinic populations examined specifically moderate (Mathieu et al. 2008) or severe (Ferini-Strambi et al. 2003; Alchanatis et al. 2008) cases of OSA only and excluded milder cases of OSA from their analyses. Four studies did not report whether the neuropsychological assessment was conducted in a standardized manner by a trained assessor (Alchanatis et al. 2008; Berry et al. 1990; Yaffe et al. 2011; Yesavage et al. 1985).

Overall Effect of OSA on Neuropsychological Performance

The overall effect size of OSA on neuropsychological performance in the combined sample of 13 studies was small and statistically significant (Hedge’s g = 0.18, 95% CI 0.04 to 0.32, p = 0.009; Fig. 2). Heterogeneity across studies was moderate (Tau2 = 0.029 I2 = 69.2%). The funnel plot showed significant asymmetry (Egger’s intercept = 1.52, p = 0.048; Fig. 3) suggesting smaller studies reported larger effects. A trim and fill analysis imputed 6 studies; the adjusted effect size was then negligible and statistically non-significant (g = 0.02, 95%CI -0.12 to 0.16).

Fig. 2
figure 2

Forest plot of individual mean and weighted effect sizes across all cognitive domains. Effect size estimates are based on a random-effects model

Fig. 3
figure 3

The funnel plot for all reviewed studies’ effect sizes on all cognitive domains

Domain-Specific Effects

Executive functioning was the most reported cognitive domain (Fig. 4).

Fig. 4
figure 4

Association between OSA and neuropsychological functioning within individual cognitive domains

The effect size for this domain was small and statistically non-significant (k = 10, g = 0.12, 95% CI -0.03 to 0.27, p = 0.106, I2 = 69.9%, Tau2 = 0.03). The funnel plot did not reveal any asymmetry (Egger’s intercept = 1.14, p = 0.217; Fig. S1). In the processing speed domain, there was a small and statistically significant effect (k = 8, g = 0.20, 95% CI 0.02 to 0.38, p = 0.033, I2 = 57.6%, Tau2 = 0.03). The funnel plot did not reveal any asymmetry (Fig. S2).

The overall effect in declarative memory was small and statistically significant (k = 6, g = 0.17, 95% CI 0.04 to 0.30, p = 0.009, I2 = 22.5%, Tau2 = 0.01), and the funnel plot did not reveal any asymmetry (Fig. S3). The effect in working memory was negligible and statistically non-significant (k = 3, g = 0.04, 95% CI -0.08 to 0.15, p = 0.511, I2 = 0.0%, Tau2 = 0.0). The combined effect size was small and statistically non-significant for global cognition (k = 7, g = 0.14, 95% CI -0.09 to 0.36, p = 0.226, I2 = 78.3%, Tau2 = 0.05), and medium and statistically non-significant for motor learning (k = 3, g = 0.37, 95% CI -0.17 to 0.92, p = 0.18, I2 = 95.3%, Tau2 = 0.29). However, these effect sizes should be interpreted with caution as the small number of studies reporting on this cognitive domain may decrease the stability of these findings.

Moderators of the Effect of OSA on Neuropsychological Performance

Subgroup analysis based on the differences in study samples was also performed (Fig. 5). Firstly, the effect size on all combined cognitive domains in case-control studies of populations arising from sleep clinics (k = 6, g = 0.49, 95% CI 0.22 to 0.75, p < 0.001; I2 = 0.0%, Tau2 = 0.03) was significantly larger than that arising from baseline data from cohort studies (k = 5, g = 0.04, 95% CI -0.09 to 0.16, p = 0.320, I2 = 67.7%, Tau2 = 0.01; Q statistic for between-group heterogeneity = 8.08, df = 1, p = 0.004).

Fig. 5
figure 5

Subgroup analyses of moderators of the association between neuropsychological performance and OSA. Q-test was performed for between-group heterogeneity, using a mixed effects model

The analysis of studies was also stratified based around the differences in AHI cutoff used to define and separate groups. Negligible and statistically non-significant effects were observed for case-control studies split by AHI = 5 (k = 2, g = 0.14, 95% CI -0.12 to 0.40, p = 0.287, I2 = 89.7%, Tau2 = 0.03), or AHI = 15 (g = −0.02, 95% CI -0.16 to 0.13, p = 0.795, I2 = 33.1%, Tau2 = 0.01). Studies using a cutoff of AHI < 5 for controls but only including moderate to severe cases in the OSA group (i.E. minimum AHI of 10 to 40) exhibited a small trend toward better performance in controls (k = 5, g = 0.38, 95% CI -0.03 to 0.78, p = 0.077, I2 = 70.5%, Tau2 = 0.14). There were no significant differences in between-group heterogeneity (Q-between = 3.68, df = 2, p = 0.159). No asymmetry was observable on the funnel plot for these studies. There was a trend for studies with a higher risk of bias (k = 8, g = 0.40, 95% CI 0.07 to 0.73, p = 0.052, I2 = 73.0%, Tau2 = 0.15) to report larger effects than those considered at low risk (k = 5, g = 0.08, 95% CI -0.05 to 0.20, p = 0.250, I2 = 65.4%, Tau2 = 0.01; Q-between = 2.96, p = 0.067).

Meta-regression indicated that a greater mean age of study participants significantly predicted variance in the analysis and reduced the effect across all combined cognitive domains (beta = −0.02, df = 1,11, Q = 27.9, p = 0.027, R2 = 34.5%). Years of education, mean AHI or proportion of females in the study samples did not have any effect. Year of publication had no significant effect on the outcomes.

Discussion

This systematic review and meta-analysis is the first to specifically assess the evidence pertaining to the relationship between OSA and neuropsychological function in older adults. We identified thirteen articles that objectively measured the relationship between OSA and cognition in older adults. The meta-analysis of these studies showed a small negative association across a combined range of neuropsychology outcomes. However, this effect appears to be driven by publication bias, with larger studies reporting no link between the presence of OSA and cognition. Also analysis of those studies regarded as having a low risk of bias found no significant effect. Importantly the studies with low risk of bias were larger studies, and the actual pool of participants only declined by 14% within this analysis, indicating that the loss of effect was not due to a lack of remaining power.

Differences in study methodology appeared to influence the findings, and these were explored through the analysis of subgroups. Smaller case-control studies in samples of older adults from specialised sleep clinics showed a significant medium strength negative association between OSA and neuropsychological function, while no overall effect of OSA was observable in larger cohort studies sampled from the community. It is possible the larger effects seen in these sleep centre studies are due to clinical referral bias. Specifically, the types of individuals that present to these specialised clinics are generally more symptomatic, particularly with behavioural impairments such as severe hypersomnolence. Therefore, it is plausible that these patients would be at a higher risk of exhibiting neuropsychological deficits than individuals in the community who are not seeking medical care. The clinical significance of this message suggests that the link between OSA and cognition may be most pronounced in those seeking specialist assessments. In terms of dementia prevention, efforts to target OSA as a modifiable risk factor may be best focused towards older patients in sleep clinics, rather than screening in the general community. Unfortunately, however, once an individual attends a sleep clinic, irreversible cognitive impairment may have already occurred (Naismith et al. 2004).

Notwithstanding this possibility, it must be cautioned that the findings from sleep clinics were derived from studies with small samples with a higher risk of publication bias. Further, some studies were unclear with regards to the recruitment of patients and controls as to whether these arose from the same populations, while most of the clinical case-control studies investigated only severe cases of OSA, and excluded patients with mild or moderate OSA. Conversely, cohort studies included participants across a range of OSA severities. These factors could have potentially inflated the size of the association between OSA and neuropsychological function in the sleep clinic studies. However, while the effect size was larger in studies using more severe cases of OSA, between group heterogeneity was not influenced by the AHI cutoffs used to compare OSA cases with controls. Therefore these findings may suggest that neuropsychological deficits are only significantly observable between extremities of OSA severity and those without the disorder, and not within mild and moderate cases of OSA, however there is a lack of power in the current evidence to conclusively support this (Chowdhuri et al. 2016).

Therefore, while it is possible that older adults with OSA who present to sleep clinics may exhibit significant neuropsychological dysfunction, any conclusions are premature given the available evidence. Baseline data from larger cohort studies has failed to show any overall association between OSA and cognition in older adults. While these studies exhibit a lower risk of bias and consist of significantly larger samples from the community, the study designs are based on specific criteria, in effect reducing their representation of the general older population. It is also possible that OSA defined by the AHI metric is not a true reflection of the disease. A clinical diagnosis of OSA will incorporate the abnormal breathing events alongside neurobehavioral dysfunction (e.g. hypersomnolence). Therefore the presence of an elevated AHI alone may not result in neuropsychological impairment if this index is not an accurate measurement of the impact of OSA on the brain. Our choice to use the AHI as our exposure severity measure may have also been affected by the changes in the definition of that metric during the 30 year timespan of studies included in our review. Random misclassification errors in both our exposure and our outcome measures would have the effect of biasing towards a null finding– which is what we have observed. We also observed no association between year of publication and the strength of the association that might have indicated that the changes in AHI definition had caused the metric to become less predictive of dysfunction over successive revisions (Ho et al. 2015; Ruehland et al. 2009). Alternatively, there may be a particular phenotype of OSA patient with neuropsychological impairment. This review of the current evidence highlights the need for larger and higher quality studies using neuropsychological screening in clinical settings to definitively understand the association between clinical OSA and neuropsychological function in older adults. Following selective assessment and analysis of the evidence, it could be argued that OSA may have different associations with cognitive function depending on the type of older adults studied.

That certain individuals or their characteristics make some older people susceptible to the negative consequences of OSA is still unclear. In the neuropsychological context, it has been suggested that adults with increased cognitive reserve or brain resources may be more resilient to cognitive deficits associated with OSA (Adams et al. 2001; Alchanatis et al. 2005). Unfortunately the current evidence cannot answer this, as only around 60% of studies adequately reported years of education, even though this is regarded as one of the most important demographic factors in performance on neuropsychological tests (Strauss et al. 2006). Likewise, studies did not consider using more detailed measures of cognitive reserve and did not necessarily examine the potential for confounding effects of other key medical co-morbidities that are associated with cognitive decline and dementia. Determining the influence these factors may have on the association between OSA and cognition, as well as other possible covariates, should be a focus in future research in this area.

These current findings do not suggest that OSA should not be seriously considered in older adults. This review cannot rule out that neuropsychological deficits in certain older adults may be attributable to OSA, especially in light of the findings in studies from sleep clinics. Furthermore, there is longitudinal evidence linking long-term presence of OSA with an increased risk of cognitive decline, dementia and mortality (Chang et al. 2013; Pan and Kastin 2014; Marshall et al. 2014; Osorio et al. 2015; Yaffe et al. 2011). This may be because OSA is associated with a range of comorbidities that are also linked with dementia risk, such as elevated blood pressure and cholesterol, cardiovascular disease, and metabolic dysfunction (Norton et al. 2014; Calvin et al. 2009; Somers et al. 2008). There is also evidence from animal studies showing the negative effects of hypoxia and sleep fragmentation on the brain and neurobehavioral function (for review see (Pan and Kastin 2014)), while neuroimaging studies in humans have found associations with the presence of OSA and changes in grey (Torelli et al. 2011) and white matter (Kim et al. 2013), and even markers of oxidative stress (Duffy et al. 2016). Finally, OSA has been shown to significantly alter sleep microarchitecture and EEG brain activity (for review see (D’Rozario et al. 2016)), that have been linked to neurobehavioral dysfunction. Of particular interest from this review in older adults is the decline observed in declarative memory performance (e.g. word lists, stories), as these are the most affected domains in dementia. Sleep is strongly linked to memory consolidation; however research into how OSA may influence sleep-dependent memory consolidation is still in its infancy. Only a few small studies have been conducted in this area, suggesting that this process may be impaired in OSA (Kloepfer et al. 2009), possibly due to sleep fragmentation (Djonlagic et al. 2012) even in REM sleep (Varga et al. 2014).

These results are slightly different to those from previous meta-analyses of the younger and middle-aged literature. Unlike in younger adults (Beebe et al. 2003; Bucks et al. 2013; Saunamaki and Jehkonen 2007), the current analyses suggest that there is no association between OSA and executive dysfunction in older adults. A possible reason why the current results are inconsistent with previous analyses of the literature in younger cohorts could be our analysis method. The approach to average outcome data within each study to remove the dependence of effect sizes and calculate a composite score may have been a factor in the relatively small findings. However, given the level of information that was available, our approach is a robust and commonly employed method in the neuropsychological field. Furthermore, differences could be due to the particular grouping of cognitive domains. For instance, previously working memory has been included in measurements of executive function (Beebe et al. 2003; Bucks et al. 2013). However, we separated working memory as a distinct cognitive domain and coded our tests (e.g. digit span) by consulting a widely cited textbook (Strauss et al. 2006). Regardless, working memory was only reported in 3 studies with a negligible, very small pooled effect. Due to its role in managing the use of other cognitive domains, the integrity of executive functioning is considered a vital predictor for functional decline. However, it appears from these results that OSA has little effect on this cognitive domain in healthy older adults, and recent longitudinal evidence supports these findings in this domain (Martin et al. 2016).

This review only focused on studies of healthy adults (i.e. patients free of known or reported neurodegenerative or neurological diseases). Studying neuropsychological performance in cognitively intact participants can increase the risk of ceiling effects, which may reduce the chance of observing cognitive deficits. While it is difficult to ascertain whether this occurred in the reviewed studies and to what extent, this should have been accounted for by our systematic review only including studies that utilized standardized neuropsychological tests. Regardless, measuring OSA and cognitive function in adults who may be already experiencing cognitive impairments would be insightful, especially given the evidence suggesting an increased risk for further cognitive decline. Treatment with CPAP has shown positive effects in limited cognitive domains only (Kylstra et al. 2013; Olaithe and Bucks 2013; Pan et al. 2015), however some evidence suggests it may slow and reduce the progression of dementia (Osorio et al. 2015). Further research is needed to understand the effect of ongoing OSA treatment on neuropsychological performance in older adults. Ideally, this should be investigated through randomized controlled trials in individuals with increased risk of further cognitive decline such as those with MCI, in order to understand the reversibility of any association between OSA and cognitive decline.

Interestingly, while this systematic review focused on adult participants over the age of 50 years, meta-regression showed that as the mean age increased across the studies, the effect size slightly decreased. This suggests that greater age reduces the susceptibility to any cognitive impairment associated with OSA. This finding may illustrate a healthy survivor-effect which has been observed and discussed previously in a cardiovascular context (Gottlieb et al. 2010; Punjabi et al. 2009). As OSA is also associated with a range of negative health outcomes (e.g. cardiovascular disease), those individuals who show greater susceptibility to any negative consequences of OSA may have a higher mortality risk at an earlier age compared to those who are more resilient, making them less likely to be included in a research study. This possibility only serves to emphasise the importance of managing OSA in older adults for not only cognitive impairment, but overall health and quality of life.

Conclusions

The aim of this review was to refine the estimate of the association between OSA and cognitive functioning specifically in older adults. We found from our systematic search of studies that OSA appears to have no association with cognition overall, however the variance in the literature is significantly influenced by the setting. It is likely that there may be a relationship in some older adults, and evidence does suggest that OSA may increase the risk of cognitive decline. Further exploration of the factors that determine inter-individual differences in neuropsychological outcomes attributed to OSA will better aid in developing robust designs for early intervention trials aimed at slowing cognitive decline in older adults. These will better inform us whether management of OSA may significantly improve the quality of life in older adults, particularly those who are already experiencing cognitive impairment and at heightened risk for cognitive decline and dementia.