Introduction

Osteoporosis is a systemic skeletal disorder, manifested by increased fracture risk [1]. About one of every two Caucasian women and one of five Caucasian men experience an osteoporosis-related fracture during their lifetimes [2] . Bisphosphonates (BP), bone antiresorptive drugs, are the pillar stones of the medical treatment of osteoporosis. These drugs reduce osteoclasts number and activity and thus decrease bone turnover. This results in increased bone mineral density and reduction in fracture risk by 30–70% [3,4,5,6,7].

In the past decade, BP treatment has been shown to be associated with atypical femoral fractures (AFFs), which are rare fractures with a transverse morphology [8]. AFF pathogenesis is considered as the development of an unhealed stress fracture [9]. In the physiological state, a stress fracture is usually followed by normal bone remodeling and healing. Long-term use of BP impairs the capacity of this natural repair process and rarely results in atypical fragility fractures [8, 10,11,12,13]. The estimated incidence of AFF is 1.8/100,000 patient years after 2 years of BP treatment, 16/100,000 patient years after 5 years of treatment, and 113/100,000 after 10 years of treatment [14,15,16]. It should be emphasized however that the absolute risk of AFF remained very low as compared with reductions in the risk of hip and other fractures with BP treatment [17]. Due to this established correlation between the duration of BP treatment and AFF, current guidelines suggest a “drug holiday,” after 3–5 years of BP treatment [7, 15]. It was shown that even few months of “drug holiday” may reduce significantly the risk of AFF among patients who were treated with BP [17]. Additional risk factors for AFF may include diabetes mellitus, rheumatoid arthritis, and medical treatment with glucocorticoid and proton pump inhibitors (PPI) [18,19,20]. Asian ancestry, higher BMI, and specific femoral geometry (e.g., proximal femoral varus) have also been associated with increased risk of AFF [15, 17, 18, 21].

In this retrospective study, we aimed to characterize AFF among patients who attended a tertiary care hospital fracture liaison service (FLS). We compared the rates and durations of BP exposure, as well as demographic, clinical, biochemical, and radiological characteristics, and drug purchase data between patients with AFF (cases) and patients with typical osteoporotic hip fractures (controls). We examined two hypotheses. Our first hypothesis was that patients with AFF would have distinct demographic and clinical characteristics compared with patients with typical osteoporotic hip fractures, regardless of the specific site of these typical fractures. Second, we hypothesized that the risk of AFF among patients admitted to our institution with osteoporotic hip fracture would be greater among those treated with BP and would increase from a certain point of time during the course of BP exposure.

Methods

Study population

This is a retrospective, nested case–control study of patients treated at Soroka University Medical Center, a 1100-bed teaching, tertiary care referral hospital in Beer-Sheva, Israel. Patients over age 50 years, admitted to our institution with low trauma hip fracture from July 2014 to November 2018, were offered evaluation, follow-up, and treatment by our FLS. Patients were not offered to attend the service if they had a severe comorbidity (e.g., active malignancy, end-stage kidney disease treated by hemodialysis), had not undergone an operation for the fracture, had difficulty communication, or were not living in the district. In addition, patients did not attend our FLS if they were insured by a health maintenance organization other than Clalit Health Services. This is because we did not have pre-admission data for those patients, and follow-up and treatment in our institution were not covered by their health maintenance organizations. We excluded from the current analysis patients with device-associated fractures and patients with incomplete data.

Data captured

Demographic, clinical, biochemical, radiological, and drug purchase data of all the individuals in the cohort were collected from a unified electronical medical file and reviewed. The unified medical files included all the hospital and community records of each patient. Past medical diagnoses were used to calculate Charlson’s Comorbidity Index (CCI) [22]. Laboratory results of hemoglobin and creatinine were retrieved from the computerized data of admissions with hip fracture. Estimated glomerular filtration rate (eGFR) was calculated using the MDRD equation [23]. Parathyroid hormone (PTH) and 25(OH)-vitamin D serum levels were retrieved from 6 months prior and up to 1 month after the index admission. Drug purchase data were retrieved and analyzed up to 5 years prior to the index admission with an osteoporotic hip fracture. One year of BP treatment was defined as a period of at least 6 months of purchase of oral preparations or one purchase of zoledronic acid per year, as was defined by others [24,25,26]. For medications other than BP, we classified any purchase during 5 years before the fracture as a positive result and no purchase as a negative result.

Radiographic assessment

From review of lower extremity radiographs and surgical records of patients admitted with low trauma hip fracture, we classified AFF based on the revised definition of the American Society for Bone and Mineral Research [9]. Accordingly, the fracture is located along the femoral diaphysis, from just distal to the lesser trochanter to just proximal to the supracondylar flare. In addition, at least three of four major radiographic features must be present: (1) the fracture line originates at the lateral cortex and is substantially transverse in its orientation; (2) complete fractures extend through both cortices and may be associated with a medial spike, while incomplete fractures involve only the lateral cortex; (3) the fracture is non-comminuted or minimally comminuted; and (4) localized periosteal or endosteal thickening of the lateral cortex is present at the fracture site.

All fractures that were not classified as AFF were considered typical osteoporotic hip fractures, based on the guidelines of the American Association of Clinical Endocrinologists [27]. To address our first hypothesis that patients admitted with AFF had distinct demographic, clinical, and biochemical characteristics compared with patients admitted with typical osteoporotic hip fractures, regardless of the specific site of these typical fractures, we subdivided the latter according to the site of fracture: (1) femur neck (sub-capital and trans-cervical, not involving the trochanteric area); (2) inter-trochanteric; and (3) sub-trochanteric, not classified as AFF, and not involving the greater or lesser trochanter. Bilateral and recurrent fractures were documented separately.

Hip fracture images of all the patients were reviewed by our research team, which included a radiologist and an endocrinologist. The reviewers were blinded to BP treatment status. Images were viewed on our hospital picture archival and communication system (PACS). Radiographic images were magnified in order to detect and accurately classify the fractures. Following review by the study team, fracture classifications were compared with the ICD-9 code of the discharge diagnosis from the orthopedic department. In cases of discordance between the ICD-9 discharge code and our diagnosis, we used a consensus decision method among our research team members.

Statistical analyses

Baseline demographic, clinical, and medical characteristics and blood test results were summarized using descriptive statistics. Patients with AFF were compared with the 3 subgroups of patients with typical osteoporotic hip fractures using the ANOVA test or the Kruskal–Wallis test, as appropriate. Categorical data were compared using the χ2 test or Fisher’s exact test. A p value < 0.05 was considered statistically significant. We aimed to detect differences in BP exposure between patients with a fracture classified as an AFF and patients with typical osteoporotic hip fractures (the control group). The probability for AFF in patients admitted with hip fractures according to their BP exposure time was assessed in two models. The first model compared the AFF group (n = 31) with the entire typical osteoporotic hip fracture group (n = 958). To correct for the younger age of patients with AFF, a second model was used. We randomly matched each case (AFF) with three controls according to age (\(\pm\) 1 year) and sex. We used conditional logistic regression models for the multivariable analysis. Each time period of BP treatment prior to fracture has been analyzed separately. We examined four time-periods: any BP exposure, 2 years or less out of the 5 years prior to fracture, between 2 and 5 years prior to fracture and lastly, and 5 years or longer prior to fracture. We presented the models once with a BP exposure window without adjustment and second, following adjustment to the variables that were found to differ significantly between cases (AFF) and controls. Statistical analysis of pooled data was performed using SPSS version 25.0 (SPSS, Chicago, IL).

Ethics statement

The study was approved by the Helsinki committee of Soroka University Medical Center (SCRC-170–19). The requirement for informed consent was waived due to the retrospective nature of the study.

Results

Baseline characteristics

During the study period, 1616 patients over age 50 years were admitted to our institution with low trauma hip fracture. Of these, 1123 patients were offered to attend our FLS. Following exclusion of 23 patients with device-associated fractures and 111 patients with missing data (mainly due to low-quality radiographs), the final study cohort consisted of the 989 patients whose full data were available to us (Fig. 1). Radiographic assessment classified 31 (3%) as having an AFF, of whom 1 was incomplete fracture, 330 (33%) with femur neck fractures, 600 (61%) with inter-trochanteric fractures, and 28 (3%) with sub-trochanteric fractures not classified as AFF and not involving the greater or lesser trochanter. Baseline characteristics according to the site of fracture are presented in Table 1. Overall, 71% of the patients were women. Patients who presented with AFF (mean age ± SD: 72.3 ± 10.3 years) were younger than those who presented with femur neck fractures (78.0 ± 10.3) and younger than those with inter-trochanteric fractures (80.2 ± 9.6); p = 0.004 and p < 0.001 for the respective comparisons. Patients with AFF showed a better medical condition at baseline as was demonstrated by a statistically significant lower mean CCI (2.9 ± 3.7) than patients with femur neck fractures (4.9 ± 5.1), inter-trochanteric fractures (5.5 ± 4.3), and non-AFF sub-trochanteric fractures (5.2 ± 4.3), p = 0.028, p = 0.001, and p = 0.029 for the respective comparisons. Rates of any duration of PPI treatment within the 5 years prior to fracture were lower in patients with AFF (18.8%) than in patients with femur neck fractures (46.7%) and lower than in patients with inter-trochanteric fractures (50.9%), p = 0.004 and p < 0.001 for the respective comparisons. Other suggested risk factors for AFF, such as diabetes mellitus and treatment with glucocorticoids, were not found to be statistically different between the AFF group and each of the subgroups of typical osteoporotic hip fractures. The mean hemoglobin level at admission was higher among patients with AFF (13.4 ± 1.2 gr/dL) than among those with inter-trochanteric fractures (12.04 ± 1.9 gr/dL) and higher than among those with non-AFF sub-trochanteric fractures (11.9 ± 1.2), p = 0.018 and p = 0.024 for the respective comparisons. Other lab test results, including creatinine, eGFR, PTH, and 25(OH)-vitamin D levels, did not differ between the groups.

Fig. 1
figure 1

Study flow chart

Table 1 Baseline characteristics of patients included in the final study cohort according to the site and type of fracture

According to the matched groups (Table 2), the mean CCI of patients with AFF was significantly lower than that of patients with typical osteoporotic hip fractures (2.9 ± 3.7 vs. 4.7 ± 4.2; p = 0.030). In addition, rates of PPI treatment were lower in the AFF group (18.8% vs. 51.6% p = 0.002). Differences between the groups were not significant in the other parameters examined, including hemoglobin level, eGFR, PTH and 25(OH)-vitamin D levels, diabetes mellitus status, and glucocorticoid treatment.

Table 2 Baseline characteristics of patients included in the final study cohort following 3:1 matching of typical osteoporotic hip fracture to atypical femoral fractures for age (\(\pm\) 1 year) and sex

BP exposure

Computerized data of BP purchases were available for 5 years prior to the fracture date for patients with all fracture types. We compared the duration of BP exposure between patients diagnosed with AFF (n = 31) and those with typical osteoporotic hip fractures (n = 958) (Table 3). The proportion of patients treated with BP for 5 years or more was higher among those with AFF than among those with typical osteoporotic hip fractures (58.1 vs. 16.0%; p < 0.001). The only incomplete AFF patient was under prolonged BP treatment of 5 years or more.

Table 3 Bisphosphonate treatment years among patients included in the final study cohort, according to the type of fracturea

Odds ratios (ORs) for the risk for AFF at admission with hip fracture for the entire cohort and for the matched group, according to the length of BP exposure time, are presented in Table 4. In both models, the adjusted OR of being admitted with AFF increased as the number of BP treatment years increased, becoming statistically significant for 5 years of treatment or more (adjusted OR using the model that included the entire cohort- 19.34; 95% CI-7.84–47.93 and 21.7; 95% CI-4.1–113.9 for the matched cohort model). For patients treated with BP for more than 2 years and less than 5 years, the trend for increased risk did not reach statistical significance. The risk of being admitted with AFF was higher in the adjusted models than in the unadjusted models (Table 4).

Table 4 Odds ratios (95% CI) for the risk of atypical femoral fracture at admission with hip fracture among the entire cohorta and the matched cohortb, according to the exposure time to bisphosphonates

Discussion

This case control study showed that patients with AFF (cases) were younger and in better medical condition at baseline than patients with all other sites of typical osteoporotic hip fracture (controls). This was evident from a statistically significant lower value of CCI. CCI remained significantly lower (p = 0.03) following 1:3 matching for age (\(\pm\) 1 year) and sex, of cases with controls. These results, taken together, concur with our first hypothesis that patients admitted with AFF had distinct demographic and clinical characteristics compared with patients admitted with typical osteoporotic hip fractures. We also found that a higher proportion of cases than controls had been treated with BP for 5 years or more (58.1 vs. 16.0%; p < 0.001). A multivariate analysis of patients matched for age and sex found that the OR for being admitted with an AFF, adjusted for CCI and any duration of PPI treatment, increased as the number of BP treatment years increased and reached statistical significance after 5 years or more of treatment (OR-21.7; 95% CI-4.1–113.9). A second multivariant analysis for the entire cohort, adjusted for sex, age, CCI, and any duration of PPI treatment showed comparable results (adjusted OR-19.34; 95% CI-7.84–47.93) and, thus, reassure the concern regarding selection bias of using the matched group model. These findings address our second hypothesis, as to the risk of AFF in relation to the length of BP exposure. Our results demonstrated that 5 years of BP treatment may represent a point of time at which the risk of AFF should become a significant consideration in decisions about treatment management. Moreover, though not statistically significant, our results suggest that the critical point for decision making may occur even earlier, between 2 and 5 years of treatment. These results are congruent with previous studies that suggested an exponential correlation between the duration of BP exposure and the risk of AFF [14, 15, 17].

Our study is consistent with a number of reports that showed a younger age of patients with AFF than with typical osteoporotic hip fractures [9, 11, 18]. Associations of other risk factors with AFF are controversial. We found a lower rate of any duration of PPI treatment among patients admitted with AFF. This contrasts with others who demonstrated higher rates of PPI treatment among patients admitted with AFF, compared with typical osteoporotic hip fracture [11, 18]. However, since according to our methods any duration of PPI treatment within the 5 years prior to hip fracture was considered as “positive” for PPI treatment, it is questionable whether this observation has clinical significance.

Treatment with glucocorticoids (GC) is another controversial, yet commonly reported risk factor for AFF [9, 11, 17, 18, 21]. We did not find an association of any GC treatment with increased risk of AFF compared with typical osteoporotic hip fractures. Our findings concur with those of a retrospective Korean study [11] but contrast with those of a retrospective Canadian study that showed a higher rate of GC treatment among patients admitted with AFF than with other fractures [18]. A recent, large, well-designed retrospective study, by Black et al., has convincingly shown that the risk of AFF, among women treated with BP, increased significantly after one or more years of GC treatment [17]. A possible explanation for the discrepancy between the studies is the lack of information regarding the duration of GC treatment in our study and others [11, 18]. Accordingly, studies aimed to evaluate the added risk of GC to the development of AFF should include data regarding treatment duration with both drugs [28].

This study has several limitations. First, data collection was retrospective. A prospective evaluation may have achieved better quality data. Second, our cohort does not represent either the general population or the population of patients with osteoporosis. More specifically, our cohort represents high-risk osteoporotic patients [7] who were admitted to our institution with hip fracture. Accordingly, we do not claim that the increased risk found in our study represents the risk of having an AFF following certain BP exposure time in the general osteoporotic population but rather the risk of AFF among patients over age 50 years, admitted with low trauma hip fracture. Third, pre-treatment bone mineral density (BMD) measurements were available for only few patients in our cohort (less than 20%). Therefore, we could not completely exclude the possibility of reverse causality, in which the patients on longer treatment with BP are those with more severe osteoporosis. However, the lower CCI in patients with AFF shown in our study suggests that the overall morbidity load was lower in the AFF group compared with typical osteoporotic hip fractures. We assumed that this better medical condition is also applicable to the AFF patient’s bone health status. The consistency of the CCI trend after adjustment to age and sex strengthens this assumption. Fourth, the number of patients with AFF in our study was small, due to the rarity of this condition. This resulted in numerically unequal “cases” and “controls” groups. We overcame this disparity by creating a control group that was randomly matched 3:1 for sex and age (\(\pm\) 1 year). Fifth, we could not determine the exact adherence of each individual patient to BP treatment, as well as to treatment with other medications of interest, such as PPIs and glucocorticoids. Finally, a possible selection bias arises in that 310 patients were excluded from the analysis due to their being insured by a health medical organization for which we did not have drug purchase data. There is no reason to believe that their characteristics differed from the patients included in the cohort.

Despite the above limitations, this study has important strengths. The unified electronical medical files, used in our hospital and in the community medical services, enabled access to all recorded information of all the patients in our fracture liaison service. Specifically, we could easily calculate the CCI [22], a well validated index for comorbidities. Additionally, the femoral fracture images of all our patients were reviewed by the same research team members, including a radiologist and an endocrinologist, blinded to BP treatment status. The classification of the typical femoral hip fractures into subgroups and the assessment of each group separately strengthen the credibility of our hypothesis, namely that patients admitted with AFF had distinct characteristics, compared with patients admitted with typical osteoporotic hip fractures, regardless of the specific site of these typical fractures.

In conclusion, our study showed that patients with AFF compared to typical hip fractures showed better baseline medical conditions irrespective of their younger age. A higher proportion of patients with AFF than patients with typical osteoporotic hip fractures was treated by BPs for 5 or more years prior to their fractures. Further, the risk of AFF, though absolutely low, increased as the number of BP treatment years increased and reached statistical significance after 5 years of treatment. This finding concurs with the rationale of current guidelines, which suggests a “drug holiday” after 3–5 years of BP treatment, for patients with a mild to moderate fracture risk [7, 15, 29]. Nevertheless, the beneficial effect of BP treatment on the risk of vertebral and non-vertebral typical osteoporotic fractures should be considered when decisions on “drug holiday,” and its duration is taken [17]. To overcome the limitations of our study, a large, prospective study that follows patients treated with BP, including adherence data, is needed. This will enable drawing more precise conclusions regarding the time point during BP treatment at which the risk of AFF becomes significantly elevated.