Introduction

One in two women 50 years of age or older will experience an osteoporotic fracture in their lifetime [1]. Unfortunately, an underlying etiology is often not identified when an evaluation for low bone mineral density (BMD) and increased fracture risk is performed [2,3,4]. Emerging data suggest that sleep and circadian disturbance may be unrecognized, potentially modifiable risk factors for impaired bone health [5, 6]. The Nurses’ Health Study [7] and the Women’s Health Initiative (WHI) [8] identified an increased risk of fracture in postmenopausal women who reported a history of rotating night shiftwork and self-reported short (≤ 5 h) sleep duration, respectively. Rats exposed to chronic sleep restriction demonstrated significantly lower bone formation and lower BMD [9, 10]. We reported that after approximately 3 weeks of cumulative sleep restriction and concurrent circadian disruption, men had significantly lower levels of the bone formation marker, propeptide of type 1 procollagen N-terminal (P1NP), despite no change in a bone resorption marker C-telopeptide of type I collagen (CTX) [11]. There have been numerous epidemiological studies of the association between sleep duration and bone mineral density (BMD) in humans, with mixed results. Methodological differences and limitations make the actual association and its direction unclear.

Prior studies performed in mostly or entirely female populations have found that both long [12,13,14,15,16,17,18,19,20] and short [14,15,16, 20,21,22,23] self-reported sleep durations have been associated with lower BMD, while others identified no association [24, 25]. The cutoffs used to define short and long sleep durations varied considerably. Short sleep was most commonly defined as < 6 h/night [12, 16, 18, 19, 22,23,24, 26, 27] but ranged from ≤ 5 [12, 18, 22] up to 7–8 h/night [14, 15]. Conversely, long sleep was most commonly defined as ≥ 8 or ≥ 9 h/day. Only one study examined sleep duration as a continuous variable and found no association with osteopenia/osteoporosis but had few individuals who reported < 6 h of sleep per night [25]. Two meta-analyses investigated the association between sleep duration and osteoporosis [17, 20]. The most recent indicated that both long (defined as ≥ 9 h/night) and short (defined as ≤ 7 h/night) self-reported sleep durations were associated with an increased risk of osteoporosis, with the lowest risk in those middle-aged and elderly adults sleeping ~ 8 h/night [20]. These conflicting data may be due to different covariate adjustments and study populations (Asian, United States, South American, and European countries), inadequate consideration for actual versus desired total sleep time, and BMD assessment by different methods (DXA, QUS) at various anatomical sites [6]. Most notably, all prior analyses used subjective (self-reported) sleep duration, with various definitions of short, reference, and long sleep times. Subjective sleep duration is variably correlated with objectively measured sleep duration and subjective estimates may be particularly inaccurate for those with shorter sleep duration or insomnia [28,29,30].

The National Institutes of Health (NIH) recommends 7–8 h of sleep per day for individuals 18 years of age and older [31]. The American Academy and Sleep Medicine (AASM) and Sleep Research Society (SRS) joint consensus statement recommended adults sleep “7 or more hours per night on a regular basis to promote optimal health” [32]. Despite these recommendations, over one-third of US adults report getting less than the recommended amount of sleep [33] and the Centers for Disease Control (CDC) highlighted insufficient sleep as a public health epidemic in 2014 [34]. With these recommendations in mind, we used the Study of Osteoporotic Fractures (SOF) ancillary sleep study of 3137 women to investigate the relationship between objective sleep duration measured by actigraphy and BMD in postmenopausal women. BMD was compared in older, postmenopausal women according to their sleep duration. Objective sleep duration was (a) dichotomized with short sleep duration defined as < 6 h/night and the recommended sleep duration defined as 7–8 h/night and (b) also considered as a continuous variable without any exclusions based on total sleep time. Sleep duration cutoffs for the dichotomized analysis were chosen because they were in line with the NIH recommendations [31], they facilitated comparison to some previous literature, the short sleep duration was similar to the sleep restriction imposed in our prior intervention study [11], and because the groups were sufficiently different to identify a clinically significant difference between short and recommended sleep duration groups. Based on previous interventional data from animals [9, 10] and humans [11], we hypothesized that shorter sleep duration would be associated with lower BMD.

Methods

Study design and participant selection

The study design and cohort characteristics of the Study of Osteoporotic Fractures (SOF) have been previously described [35]. In short, 9704 community-dwelling, ambulatory women ≥ 65 years were recruited using mailings to age-eligible women identified from community-based listings between September 1986 and October 1988 from 4 metropolitan areas in the USA (Baltimore, MD; Minneapolis, MN; Monongahela Valley near Pittsburgh, PA; and Portland, OR). In February 1997 through February 1998, 662 African American women were added to the original cohort for a total of 10,366 women [36].

Of the 10,366 women initially recruited for the SOF study, 3137 (66% of active survivors) participated in the ancillary SOF Sleep Study between January 2002 and April 2004. During their clinical visit, participants completed the Epworth Sleepiness Scale (ESS) and Pittsburgh Sleep Quality Index (PSQI), along with other medical questionnaires and clinical assessments. Actigraphy data were collected on 3052 women who wore the device on their non-dominant wrist for a minimum of 3 consecutive 24-h periods (average 4 nights, 5 days), except during water sports or while bathing [36]. Participants were asked to complete a sleep diary concurrent with actigraphy.

Women were included in the sleep duration/BMD analysis if they had useable actigraphy data and had a concurrent BMD assessment of the total hip and femoral neck (Fig. 1). Lumbar spine BMD was not available at this clinic visit. Women were excluded from the analysis if they had concurrent medical and/or sleep conditions that may confound the association between sleep duration and BMD including current tobacco use, current excess alcohol intake (≥ 14 drinks/week), use of bisphosphonates or oral glucocorticoids, or self-reported diabetes mellitus, hyperthyroidism, or sleep apnea diagnoses (Fig. 1). No women used non-bisphosphonate osteoporosis medications (e.g., calcitonin, denosumab, anabolic therapy) at the time the SOF sleep study was performed (2002–2004). Serum creatinine was not available on all participants at this visit. A subset of women had an assessment of renal status 5–6 years prior to this data collection. Conservatively, we excluded any woman with an eGFR < 60 ml/min/1.73m2 at that time, anticipating that renal function would decline over time (Fig. 1). Due to inconsistent results in previous sleep duration-BMD literature, we took the approach of excluding women with potential confounders to isolate the sleep duration-BMD association. Any signal detected could then be followed up in a more generalizable study population.

Fig. 1
figure 1

Subject flow diagram for SOF sleep duration and BMD analyses

To examine the association between sleep duration and BMD, sleep duration was considered as a dichotomized (primary analysis) and continuous (pre-planned secondary analysis) variable. A total of 1624 women met the above eligibility criteria for the continuous sleep duration analysis (Fig. 1). Subsequently, women with 6–7 and > 8 h of sleep/night were excluded for the dichotomous sleep duration analysis leaving 874 women who were categorized as “short” (< 6 h; N = 382) or “recommended” (7–8 h; N = 492) sleep duration, respectively (Fig. 1).

The Institutional Review Board at each SOF clinical site approved the study, and all participants provided written consent. The current analysis utilized de-identified data and was deemed non-human subjects research by the Colorado Multiple Institutional Review Board.

Wrist actigraphy and objectively measured sleep duration

The Sleepwatch O (Ambulatory Monitoring, Inc., Ardsley, NY) was used to collect actigraphy data [36]. Clinic staff in charge of collecting actigraphy data underwent centralized training and certification. The San Francisco Coordinating Center at California Pacific Medical Center and the University of California, San Francisco (San Francisco, CA) scored and processed the actigraphy data centrally using Action W-2 software [36]. The proportional integration mode (PIM) was used to assess sleep duration as this mode has been reported to have the best accuracy for older cohorts [37]. Sleep duration was averaged for the nights wrist actigraphy was worn. Nocturnal sleep duration did not include daytime sleep (naps). Daytime sleep duration (naps) was measured separately and only included in analyses that used total 24-h sleep duration.

Bone mineral density (BMD) by dual-energy X-ray absorptiometry (DXA)

Bone mineral density (BMD) at the total hip and the femoral neck was assessed using Hologic QDR 1000 (Hologic Inc., Waltham, MA) dual-energy X-ray absorptiometry (DXA) [35]. Image analysis was performed at each clinical site, with review of a random subset and of flagged scans by the San Francisco Coordinating Center.

Subjective sleep duration and other sleep questionnaires

Each participant completed a questionnaire on sleep habits on which she answered the question “On most nights, how many hours do you sleep each night?” Other questions asked about occurrence/frequency of napping, number of hours of sleep needed to feel rested, and presence of sleep disorders. Two validated sleep questionnaires (Pittsburgh Sleep Quality Index (PSQI) and Epworth Sleepiness Scale (ESS)) were also completed. A woman was considered a poor sleeper according to the PSQI if her score was > 5 [38]. A score > 10 on ESS was considered as excessive daytime sleepiness [39].

Other measurements

BMI was calculated using body weight (kg) on a balance beam scale divided by height (m2) measured using a wall-mounted Harpenden stadiometer. Self-reported questionnaires were used to determine history of physician-diagnosed medical conditions (e.g., osteoporosis, hypertension, COPD, depression, insomnia). Medication use (e.g., estrogen, calcium, vitamin D) was determined by cataloging all medications (prescription, over-the-counter) brought in by SOF participants to their study visit. Walking speed (m/s) was determined from the 6-m walk test with participants walking at their usual pace.

Statistical analysis

A priori power analysis using preliminary data determined 53 women with short and 53 women with recommended sleep duration was required to see a 0.055 g/cm2 difference in BMD (with 80% power and 2-sided α = 0.05). This was felt to be a clinically significant difference based on data demonstrating that smaller BMD changes correlate with fracture risk reduction [40]. With our final, larger sample size, the same test would have 95% power to detect a 0.025 g/cm2 difference in BMD (N = 874) with 2-sided α = 0.05. A p value ≤ 0.05 was considered statistically significant.

For the primary analysis, total hip and femoral neck BMD were compared between women with short nocturnal sleep duration (< 6 h/night) and those getting the NIH recommended amount of sleep (7–8 h/night) using a 2-sample t test (unadjusted, model A). A multivariate regression model was used to test whether BMD differed significantly between women with short vs. recommended nocturnal sleep durations in minimally adjusted (model B, adjusted for age, race, clinic site, and body mass index (BMI)) and fully adjusted (model C, additionally adjusted for use of calcium/vitamin D/estrogen, depression, walking speed, hypertension, COPD, self-reported daily naps, and number of hours needed to feel rested) analyses. An individual was excluded from an analysis if a covariate(s) was missing (< 10% had a missing covariate); therefore, the analyzed sample size is noted for each model. Covariates were selected based on clinical relevance and significant differences identified in baseline characteristics (Table 1).

Table 1 Baseline characteristics of SOF participants by exposure. Group results are presented as N(%) or mean ± SD as appropriate, unless otherwise stated

As a pre-planned analysis, the association between nocturnal sleep duration and BMD was also examined modeling sleep duration as a continuous exposure variable. A simple linear regression model was used to estimate the parameter β, representing the change in BMD (g/cm2) per each additional hour of sleep in the full SOF sleep cohort (without any exclusions for total sleep time; N = 1624). Multivariate regression models were also used to estimate β in the minimally (model B) and fully adjusted (model C) models using the same covariates as above. Although the relationship between self-reported sleep duration and BMD may be non-linear [20], visual inspection of SOF objective nocturnal sleep duration-BMD data plots and results from testing a quadratic term indicated linear models were appropriate. An interaction between BMI and sleep duration was tested by including an interaction term in model C. To facilitate comparison to prior literature, sleep duration-BMD analyses were also repeated using subjective, self-reported, sleep duration as the exposure variable. The association between BMD and objective total 24-h sleep duration including daytime naps was also examined, modeling sleep duration as a continuous exposure variable. A quadratic term was tested in model C in this 24-h total sleep duration model to investigate a non-linear association. Sensitivity analyses were performed to determine if results differed when women using bisphosphonates were included in the objective and subjective sleep duration cohorts with adjustment for bisphosphonate use. Sensitivity analyses were also performed to determine if results differed when women meeting exclusion criteria (e.g., smokers, glucocorticoid use) were included in the objective sleep duration analysis and these factors were instead adjusted for in model C.

Results

A total of 874 women met the eligibility criteria and were included in the BMD comparison between women with short (< 6 h/night, mean = 5.2 ± 0.8 h; N = 382) and recommended (7–8 h/night, mean 7.5 ± 0.3 h; N = 492) nocturnal sleep durations (Fig. 1). They were 83.3 ± 3.4 years old. Overall, approximately 90% of women were Caucasian, although the short nocturnal sleep duration group had a significantly higher percentage of African American women (Table 1). The women with short sleep duration had a higher BMI compared with those with the recommended sleep duration (27.9 ± 4.6 kg/m2 vs. 26.3 ± 4.2 kg/m2, p < 0.001). As expected, those with shorter nocturnal sleep duration had significantly higher scores on the Epworth Sleepiness Scale (Table 1). Women with short sleep duration indicated they needed 7 h of sleep per night to feel rested and their self-reported sleep duration tended to overestimate their measured sleep duration by 1.4 ± 1.4 h. Conversely, women with the recommended sleep duration reported they needed 7.6 h of sleep per night to feel rested and slightly underestimated their sleep duration on self-report compared with on actigraphy by 0.2 ± 1.3 h.

Short sleepers had significantly higher BMD than women getting the recommended amount of sleep in the unadjusted model at the total hip (0.018 g/cm2, p = 0.054), but not femoral neck (0.005 g/cm2, p = 0.54; Table 2, model A). After adjustment for BMI and other factors, there was no statistically significant difference in BMD at the total hip or femoral neck in postmenopausal women with measured short vs. recommended nocturnal sleep duration (Table 2, models B, C). In the fully adjusted model, femoral neck BMD tended to be 0.016 g/cm2 lower in women with short sleep duration compared with those with the recommended amount of sleep but this did not reach statistical significance (p = 0.06). An interaction term between BMI and sleep duration was not significant in the fully adjusted model for total hip (p = 0.77) or femoral neck (p = 0.15). Results were essentially unchanged when the AASM and SRS’s recommended sleep duration (7–9 h/night) [32] was used to define the recommended sleep group (data not shown).

Table 2 BMD (g/cm2) in Postmenopausal Women with Short (<6 hours/night) vs. Recommended (7-8 hours/night) Sleep Duration Using Objective (Actigraphy) and Subjective (Self-Report) Sleep Duration. Data are presented as adjusted means (95%CI)

As expected, more women met the inclusion criteria when measured nocturnal sleep duration was analyzed as a continuous variable (N = 1624). Consistent with the findings based on dichotomous nocturnal sleep duration, BMD was lower at the total hip (β = − 0.009, p < 0.01) with each additional hour of sleep in the unadjusted model (Table 3, model A) but there were no significant associations between objectively measured continuous nocturnal sleep duration and BMD at the total hip or femoral neck after adjustment (Table 3, models B, C).

Table 3 Association Between BMD and Sleep Duration (Continuous Variable) Using Objective (Actigraphy) and Subjective (Self-Report) Sleep Durations (Nocturnal and Total 24-h). Data are presented as β, which is the change in BMD (g/cm2) per additional hour of sleep, p-value, (95%CI)

There were no clinically or statistically significant associations between dichotomized (< 6 h vs. 7–8 h of sleep/night) or continuous subjective (self-reported) sleep duration and BMD at the total hip or femoral neck in any model (all p ≥ 0.35; Tables 2 and 3). All results in dichotomous and continuous analyses using objective and subjective nocturnal sleep durations were unchanged when bisphosphonate users were included in the analytical cohort and adjusted for in the model. Similarly, results in dichotomous and continuous analyses using objective nocturnal sleep duration were unchanged when women meeting exclusion criteria were included in the analytical cohort and instead those factors (e.g., glucocorticoid use, smoking) were adjusted for in model C (data not shown).

Only 40 women had 9+ h of sleep per night. These women were of similar age (average 83.7 ± 3.4 years) to the other nocturnal sleep duration groups and 85% were Caucasian. On average, these women slept 9.6 ± 0.6 h per night and had a lower BMI (25.9 ± 3.7 kg/m2). The sample size was too limited for formal analysis; however, osteoporosis diagnosis (27.5%), history of falls (40%), and history of fracture (60%) were more prevalent in this group than in the other sleep duration groups.

Sleep duration was significantly longer when daytime naps were included. Average actigraphic daytime nap duration was 70 ± 60 min (range 0–479 min). Longer 24-h sleep duration was associated with significantly lower total hip BMD in model C (β = − 0.005, p = 0.04; Table 3). This relationship was non-linear. The quadratic term was significant in model C for both the total hip (p = 0.001) and femoral neck (p = 0.04). This analysis suggested that maximum total hip BMD occurs at ~ 7.26 h of sleep per 24 h, but there was a large range of sleep durations with clinically similar BMD values.

Discussion

This study was the first to examine the association between objectively measured sleep duration and BMD in postmenopausal women using the NIH sleep duration recommendations [31] to categorize nocturnal sleep duration as short (< 6 h/night) and recommended (7–8 h/night). Contrary to our hypothesis, no association between nocturnal sleep duration and BMD was identified when measured or self-reported sleep durations were considered as a dichotomized or continuous variable. In fact, total hip BMD was higher in women with shorter nocturnal sleep duration in unadjusted models but this was not significant after adjustment. The association between nocturnal sleep duration and BMD in the unadjusted model was stronger in continuous vs. dichotomized analyses, likely because of the larger sample size and in consideration of extremely short and long nocturnal sleep durations in the continuous model. However, after adjustment for BMI, age, race, and clinical site in the minimally adjusted model (model B), there was no clinically or statistically significant association between nocturnal sleep duration and BMD at the total hip or femoral neck between short (< 6 h/night) and recommended (7–8 h/night) sleepers or when sleep was considered as a continuous variable. When total sleep time included daytime naps, longer 24-h sleep duration was associated with significantly lower BMD at the total hip. The relationship between 24-h sleep duration and BMD was non-linear, suggesting that maximum total hip BMD occurs at ~ 7.26 h in older postmenopausal women, but there was a large range of sleep durations with clinically similar BMD values.

Shorter nocturnal sleep duration was associated with higher total hip BMD in the unadjusted models, likely because women with shorter nocturnal sleep duration had higher BMI and a higher percentage of African American women who tend to have higher BMD than Caucasian women [41]. There were no significant associations between nocturnal sleep duration and BMD after adjusting for these two factors, along with age and clinical site. Data from the subjective nocturnal sleep duration analyses were similar, although BMD differences were even smaller in magnitude and associations weaker than when measured sleep duration was used, despite larger sample sizes.

These data contradict prior analyses that indicated a possible U-shaped association between self-reported sleep duration and BMD [14,15,16, 20, 27]. The main difference between current and previous studies was the use of actigraphy to objectively measure sleep duration in the current study. However, there was also no association of self-reported nocturnal sleep duration, as used in previous studies, with BMD. Our contradictory findings compared with prior literature may also be due to the differences in anatomical site assessed for BMD, radiographic BMD technique (central DXA vs. QUS), age/sex/sex hormone status of the study population, and sleep duration cutoffs (rationale of which were not always provided). The average age of the current cohort was older than any other study, by > 15 years in most studies. In addition, because very few women in this postmenopausal cohort had long nocturnal sleep duration (< 5% had measured sleep duration of 9+ h/night), we may have been underpowered to detect a relationship between long nocturnal sleep duration and BMD. There were trends for a higher percentage of osteoporosis diagnosis, falls, and fractures in the small group of women who had 9+ h of nocturnal sleep. In fact, when total sleep time included daytime naps and sleep durations in the group became longer, an association was observed between longer 24-h sleep duration and lower BMD at the total hip. Samples sizes were generally larger in prior analyses; however, the current study was adequately powered to detect the magnitude of BMD difference reported in prior studies.

These data are consistent with two prior reports [24, 25] that found no association between self-reported sleep duration and BMD using (a) continuous sleep duration and similar BMD assessment sites [25] as the current study and (b) similar sleep duration cutoffs [24] as the current study but with proximal femur volumetric BMD by CT. Those studies had similar [25] or larger [24] sample sizes than the current study. Although both included younger men and women, the average age in the Marques et al. study was 77 years [24], perhaps suggesting that age modifies the effect of sleep duration on bone metabolism. Finally, this was the first study to consider the amount of sleep needed to feel rested and highlighted that those getting less sleep report needing slightly less sleep than those getting the recommended amount. Overall, the findings suggest that short nocturnal sleep duration is not associated with low BMD after adjustment for other clinically relevant factors, such as BMI.

It is likely important to include daytime naps in sleep duration analyses performed in older individuals. Stone et al. previously identified an increased risk of falls and fractures with self-reported long sleep duration (> 10 h/24 h interval, including naps) compared with women who slept 8–9 h in age-adjusted analyses but not in multivariate analyses in SOF [42]. Stone et al. also noted a significantly increased risk of falls and hip fractures in older postmenopausal women in SOF who reported daily napping, after age (hip fracture) and multivariate (falls) adjustment [42]. Although that study was based on self-reported sleep duration, current results are in line with those findings when total actigraphic 24-h sleep duration included naps. In the current analysis, an association between longer sleep duration and lower total hip BMD was only observed once daytime naps were included. Daytime naps on actigraphy were, on average, over an hour in duration, representing a significant percentage of 24-h sleep duration in these women. Although sedentary behavior in older individuals can compromise accurate actigraphic sleep assessment, 24-h sleep durations (including naps) may be more representative in these older populations than nocturnal sleep durations.

There are likely confounders or mediators in the sleep-bone relationship (e.g., muscle strength, falls, BMI, naps, comorbidities) that mitigate any apparent association between sleep duration and bone outcomes. If a small difference in BMD according to nocturnal sleep duration exists, it may be difficult to detect in older postmenopausal women, in whom other factors (e.g., prolonged estrogen deficiency) have a greater effect on bone metabolism. Furthermore, only ~ 25% of women in this cohort self-reported a high physical activity level. This level of inactivity in an elderly cohort (average age 83.3 ± 3.4 years) may have made it difficult to detect a nocturnal sleep duration-BMD association, since both older age and inactivity are associated with low BMD. More detailed information on 24-h activity patterns may help to identify sleep/wake patterns that are related to low BMD in older women. In addition, the complexity of sleep phenotype (e.g., duration, efficiency, quality) and potential for night-to-night variability in sleep may preclude detection of an association with BMD, which does not acutely change from one night to another. Moreover, the magnitude of the effect of sleep disruption on bone may vary by individual based on underlying factors (e.g., baseline BTM levels, age). Lastly, it is unclear if chronically short or long sleep durations continue to alter bone turnover or if bone metabolism is only impacted by short-term perturbations and adapts over time. If the latter is true, then no appreciable difference in BMD could be detected in observational, cross-sectional analyses. Interventional studies may be more capable of detecting sleep disruption-induced skeletal changes.

This study represents the largest analysis of the association between objectively determined sleep duration and BMD in postmenopausal older women. However, there were limitations. BMD at the lumbar spine was not available in the cohort. Results may differ at skeletal sites that have higher trabecular bone content (such as the lumbar spine) as compared to those with a higher cortical bone content (such as the total hip and femoral neck). In addition, serum 25-hydroxy vitamin D was not measured, which may modify the effect of sleep duration on BMD. It is possible that excluding those on osteoporosis therapy biased our results towards the null by eliminating those with the lowest BMD. However, results were unchanged when bisphosphonate users were included in the cohort with adjustment for drug therapy. This study assessed two-dimensional BMD and therefore could not evaluate differences in bone quality or microarchitecture that may be influenced by sleep duration. Although we excluded women who reported having sleep apnea, the high prevalence of sleep-disordered breathing in older adults [43] means there may have been undiagnosed sleep apnea, particularly in the short sleep duration group. We expect this would have increased the risk of a type 1 error and, therefore, was unlikely to have changed the results. A longer actigraphy assessment period may provide a more accurate assessment of sleep duration. Women in this cohort wore their wrist actigraphy device for 4 nights, on average, and sleep duration was averaged over the duration of use so first night effect was minimized [44]. Although women were excluded for current bisphosphonate use, ~ 3% of the BMD cohorts were taking estrogen or raloxifene. This small number of users is not expected to have significantly influenced the overall results.

Conclusion

Nocturnal sleep duration, whether assessed objectively with actigraphy or subjectively by questionnaire, was not associated with BMD in older postmenopausal women after adjusting for BMI. Longer 24-h sleep duration (including naps) was associated with lower total hip BMD.