Introduction

In the United States, more than 53 million people have osteoporosis or are at risk for developing the disease due to low bone mass [1]. Osteoporosis increases the risk for bone fracture; depending on age and bone density, a 10-year fracture risk can more than double [2, 3]. Osteoporotic fractures are quite frequent; one study estimated that there are more than 2 million osteoporosis-related fractures annually in the USA resulting in almost $17 billion in healthcare spending in 2005 [4]. In a recent study of women over 55, there was an age-adjusted rate of 1124 admissions per 100,000 person-years, compared to myocardial infarction (MI) (668), stroke (687), and breast cancer (151). Costs for these events are also much higher than for other diseases in this age group: annual facility-related hospital costs for osteoporotic fracture were $5.1 billion, higher than stroke ($3.3 billion), MI ($4.3 billion), or breast cancer ($0.5 billion) [5]. The numbers of fractures and associated expenditures are projected to increase due to an aging population and increasing health care costs [4].

In addition to financial costs and specific health outcomes, osteoporosis-related fracture is associated with changes in functional status and quality of life. Functional loss post-fracture has been found to be high; one review focusing on fractures of the hip found that survivors of hip fractures had worse mobility, less functional independence, higher rates of hospitalization, and lower quality of life following the fracture, with the bulk of recovery of walking and activities of daily life occurring within the 6 months following the fracture [6]. Another study found that a large percentage (between 20 and 65%) of individuals who had been independent before their fracture needed assistance with various self-care activities one to 2 years after a hip fracture, while 53% required assistance to get to places out of walking distance [7]. In additional to functional status decreases, mortality also increases after hip fracture [8] Most of the prior research examining functional health has been shorter term, with outcomes measures examined only up to 2 years after the fracture [9,10,11]. In addition, these studies are often more than a decade old, consider a limited range of outcomes, and tend to be based on smaller sample sizes [8, 12]. Less is known about the longer-term effects of fracture on functional status.

Using a dataset combining longitudinal Medicare claim data with self-reported survey data related to labor, wellbeing, and functional status, we aimed to provide an updated, comprehensive assessment of the impacts of osteoporotic fractures on non-economic burden measures with a particular focus on how the associations between fracture and outcomes change over time. Based on the impact of many fractures on the ability to ambulate and otherwise self-care, we hypothesized that Medicare beneficiaries with an osteoporosis-related fracture would have worse overall health and functional status after the fracture event and for a sustained duration (up to 4 years after fracture) than matched comparisons.

Methods

Data source and sample

We used data from the Health and Retirement Study (HRS) linked to Medicare claims. The HRS, sponsored by the National Institutes of Aging (NIA) and the Social Security Administration and fielded by the Institute for Social Research at the University of Michigan, is a biennial panel survey, which collects health, demographic, psychosocial, and economic measures from a nationally representative sample of the US population of individuals age 50 and over. Approximately 80% of respondents who are eligible for Medicare have agreed to have their HRS data linked to their Medicare professional and facility claims [13].

We used the HRS-Medicare linked sample provided by MedRIC (Acumen) with the following restrictions (see Appendix Fig. 1). Individuals must be at least 65 years of age in the pre-period and continuously enrolled in traditional Medicare (i.e., we excluded those covered by Medicare managed care). We used the 1992 to 2012 waves of the data (collected every 2 years), but required that each respondent with a fracture have at least one wave of data prior to the index date (date of the fracture) and another wave after the fracture. We identified osteoporosis-related fracture cases as those with at least one fracture ICD-9 diagnosis during an inpatient stay or at least one fracture ICD-9 diagnosis on a professional claim with at least one fracture repair procedure (see Appendix Table 3) within 90 days of the initial professional claim with a fracture diagnosis. Among fracture cases, were excluded respondents with a cancer diagnosis (ICD-9 codes 140–172, 174–208, or 213) within 6 months before or after fracture diagnosis; respondents with a fracture diagnosis within the first 6 months of Medicare enrollment or within the last 6 months of 2012, and respondents with major trauma E-codes (E800-E848, E881-E884, E908-E909, or E916-E928) concurrent with the index fracture event. Our final analytic sample contained 745 fracture cases, whom we followed across three waves: the pre-fracture wave (baseline), the wave during which the fracture occurred according to Medicare claims (post-period 1),Footnote 1 and one additional follow-up wave (post-period 2). To create the control group, we matched fracture cases to comparison respondents (n = 745) using baseline data (as described below) and then included three additional waves of data (from the index period, based on an index date; follow-up period 1, relative to the index period; and follow-up period 2) for comparison. We refer to these time frames for both fracture cases and comparison respondents as the index period (in which a fracture does or does not occur); post-period 1 (0–2 years after the index period), and post-period 2 (2–4 years after the index period).

Measures

Primary outcomes

We examined a number of self-reported measures of health status and functioning. General health status was measured on a 5-point scale ranging from 1 (excellent) to 5 (poor) based on respondents’ answers to “Would you say your health is excellent, very good, good, fair, or poor?” There is a large body of literature on measuring physical function [14], but here we focus on the key measures consistently available in the HRS that track mobility, large muscle functioning, fine and gross motor skills, limitations in activities “which people perform habitually and universally,” [15] and functioning that reflects “instrumental self-care” [16]. The number of ADLs was summed based on whether a respondent reported difficulty bathing, dressing, eating, getting in/out of bed, and walking across a room. Similarly, respondents’ IADLs were summed based on whether they reported difficulty with using the phone, managing money, and taking medications. We examined three measures of mobility/movement: (1) a mobility index measured as a count of the respondent’s difficulty with the following: walking one block, walking several blocks, walking across a room, climbing one flight of stairs, and climbing several flights of stairs;Footnote 2 (2) a gross motor skills index measured as a count of the respondent’s difficulty with the following: walking one block, walking across a room, climbing one flight of stairs, getting in or out of bed, and bathing;Footnote 3 (3) a fine motor skills index measured as a count of the respondent’s difficulty with the following: picking up a dime, eating, and dressing activities [17].Footnote 4

Covariates

We used the following measures in our analyses as described below: age (continuous), gender (indicator), race/ethnicity (categorical), whether a proxy answered the HRS questionnaire (indicator), respondent’s body mass index (BMI), whether the respondent is a current smoker (indicator), whether the respondent drinks three or more alcoholic drinks per day (indicator), whether the respondent reported vigorous activity three or more times per week (indicator), and total baseline medical costs (based on claims). The proxy status measure is correlated with the respondent’s cognitive function and ability to answer questions and is therefore, an important matching characteristic [18].

Statistical analyses

We used propensity score matching to create an analytic sample of fracture cases matched to comparisons that were statistically similar in the pre-fracture or index period. We estimated a logistic regression where “had fracture” was a function of the following baseline measures: respondent’s gender, age, race/ethnicity, proxy status, body mass index (BMI), and total medical costs as reported in the HRS in the wave prior to the fracture. We calculated the predicted probability of a fracture event using the estimated coefficients from this regression and selected comparison cases with the nearest propensity score without replacement (i.e., comparisons could not serve as comparisons for more than one fracture case) for each fracture case. We excluded cases outside of the common support.Footnote 5 The propensity score matching was performed using SAS 9.3 [19].

Using the matched sample, we estimated regression-adjusted difference-in-differences (DD) models, which allowed us to compare changes in functional health for fracture cases over time relative to statistically similar comparisons. The DD allows us to isolate changes in functional status that are related to the fracture from other secular trends in functional health (e.g., functional health tends to decline in general as a result of aging), but relies on the assumption that trends in the pre-period were similar among fracture and comparison cases. Although we cannot examine trends beyond one pre-period, the propensity score matched comparisons will be similar to fracture cases in the pre-period by construction. We used regression adjustment for the DD analysis in order to obtain doubly robust estimates [20] and to better adjust for baseline costs. Although we used baseline costs in the propensity score matching, they were based on self-reported costs as reported in the HRS in the wave prior to the fracture and thus covered a two-year pre-period. We had to use HRS costs for this purpose because comparison cases would not have had a fracture or index date yet to use for matching. After comparison cases were matched to fracture cases, they were assigned the fracture/index date of their matched case. Then, we were able to create a claims-based measure of costs in the year prior to the fracture/index date, excluding the week prior to that date. In the DD analysis, we included this measure—costs in the year prior to the fracture/index date—as a control variable. We also included age, race, gender, and proxy status using a Poisson regression. DD regressions were estimated using Stata 13 [21]. We used HRS respondent-level sampling weights in all estimations.

Results

Table 1 presents descriptive statistics of the sample population. Even after matching on costs prior to the fracture date, fracture cases tend to report greater out-of-pocket health care costs, but these cases tend to be similar on other observable demographic characteristics. Fracture and comparison cases reported similar functional limitations at baseline, but fracture cases reported slightly worse self-reported general health status at baseline.

Table 1 Weighted sample descriptive statistics

Adjusted means and DD estimates for fracture and comparison cases across three HRS periods are provided in Table 2. Overall, fracture cases reported more functional health limitations in both post-periods relative to their counterparts without fracture. The increases in limitations over time were also greater for fracture cases with the DDs from baseline to follow-up period 1 all statistically significant at p < 0.05.

Table 2 Regression-adjusted means at baseline and post waves, and difference-in-differences in functional outcomes (Robust standard errors in parentheses)

The number of ADL limitations increased from 0.49 to 0.90 (84%) for fracture cases from baseline to the first follow-up period relative to an increase from 0.39 to 0.47 (21%) for matched comparisons. Thus, fracture cases had a 0.33 greater increase in the number of reported limitations with ADLs than matched comparisons experienced, which was about a 63 percentage point difference (84% increase vs. 21% increase). Similarly, fracture cases reported a 0.44 greater increase (31 percentage point increase) in difficulties with mobility, 0.43 greater increase (57 percentage point increase) in difficulties with gross motor skills, 0.11 greater increase (33 percentage point increase) in difficulties with fine motor skills, and 0.10 greater increase (36 percentage point increase) in limitations to IADLs than matched comparisons (all p values < 0.01 except for fine motor skills and count of IADLs, which had p value < 0.05).

The magnitude of these effects diminishes in the second follow-up fracture wave (2 to 4 years after fracture/index date) to 0.19 greater increase (27 percentage point increase, compared to the index period) in limitations to ADLs, 0.41 greater increase (27 percentage point increase) in mobility limitations, 0.32 greater increase (42 percentage point increase) in limitations to gross motor skills (all p values < 0.01), and 0.11 (34 percentage point increase) greater increase in limitations to fine motor skills 2 to 4 years after the fracture relative to matched comparisons (p value < 0.05). The difference in the count of IADLS was no longer statistically significant in the second follow-up wave.

Thus the effects seen after a fracture mostly persisted, though attenuated, with the exception of changes in IADLs, which ceased to be significantly different by the second follow-up period. There was no significant difference in the change in self-reported health status across fracture and comparison cases over time at either follow-up period.

Discussion

Prior research has established the heavy burden of fracture on functional status, particularly for those in long-term care facilities [22,23,24]. However, prior work has typically focused on the first few months after a fracture, does not look at general health, does not have a comparison group which is necessary to remove any aging-related decline in functional status, or has not been updated recently [22, 23, 25, 26].

The HRS-Medicare linked data offer a unique insight into the longer-term impact of fractures on richer measures of physical health. These data allowed us to test whether there are longer-term implications regarding physical functioning for older individuals post-fracture.

Comparing changes in functional status of individuals with an osteoporosis-related fracture to individuals with statistically similar health histories and profiles suggests that the former group experienced greater reductions in functional health relative to their comparisons. This difference persisted over time.

The increase in IADLs at follow-up period 1 becomes statistically insignificant in the second follow-up period but all other comparisons continue to be significant into the 2–4-year time frame. These findings support the hypothesis that fractures impose a burden on older individuals (at least for some measures of functional status) that lasts longer than is typically studied in the literature.

Although the magnitude of the associations between fracture and fine motor skills and IADLs were relatively small at follow-up period 1 and 2, and no longer significant for IADLs at follow-up period 2, these measures encompass movement that is less likely to be affected, especially in the longer term, by a fracture. For example, activities that typically do not require large muscle strength, such as managing money, taking medications, eating, picking up a dime, or using the phone, are less likely be impeded by a broken femur or hip. We also found no evidence of a differential effect of a fracture on self-reported general health. As people switch insurance frequently, insurance companies may focus on shorter-term costs and impact with less focus on longer-term outcomes. Medicare may be less likely to focus on the short-term as its population stays enrolled longer, but may still be influenced by shorter-term measures and costs. We encourage both researchers and insurers to consider the longer-term impact of major health events; even as the impact attenuates, the difference between fracture cases and controls remains after 2 to 4 years. Further research should explore beyond the first year post-fracture, which is the most studied. Only with more information on the longer-term impact of fractures will decision-makers (including payers and policymakers focusing on public health like CDC) be able to best prioritize investments in technologies, treatments, and other interventions to reduce the likelihood of fractures.

Limitations

Although we employed a rigorous empirical approach that is standard in the evaluation literature—DD estimation [27]—it relies on the assumption that the health trajectories of fracture and comparison cases were similar prior to the fracture. We endeavored to ensure this by using the propensity score matching approach matching fracture cases to comparison cases with similar gender, age, race/ethnicity, baseline proxy status, baseline BMI, and baseline total medical costs,Footnote 6 but note that we are only matching using one pre-period. Our approach also addresses issues of selection, whereby we might be concerned that individuals who have fractures are systematically different in both observable (e.g., frailer or sicker as indicated in the HRS and Medicare data) and unobservable ways (e.g., genetic predisposition). Because we are comparing the change in both fracture and comparison cases, any comorbidities or individual characteristics that are time-invariant are effectively removed in the DD model.Footnote 7

Additionally, as noted above, approximately 20% of HRS respondents who are eligible for Medicare opted not to have their claims linked to the HRS. Our findings may not be representative or generalizable if those who did not consent were systematically different from those who did consent (e.g., those who are sicker). Standard concerns of measurement error from self-reported data could result in attenuation bias, which is exacerbated in models with differencing.

Conclusion

The shorter-term impact of fractures has been well described, but until now, researchers have not measured the longer-term implications. Our study is one of the first to explore this issue, identifying a significant impact of these events on functional health. Future studies similarly explore the longer-term impact of fractures on cost and overall health status.