Introduction

Adjuvant aromatase inhibitors (AI) reduce the risk of breast cancer recurrence and death in postmenopausal women with early-stage hormone receptor-positive disease [1]. While all three approved third generation AIs (exemestane, anastrozole, letrozole) reduce the concentration of circulating estrogens by inhibiting aromatase and improve breast cancer outcomes, these agents are associated with common toxicities leading to discontinuation of therapy [2]. Studies have demonstrated that 25–32% of women will discontinue AI prior to completing a 5-year course, and in studies where 10 years of AI therapy was recommended, adherence decreased to 38% [2,3,4]. A common cause for AI discontinuation is AI-associated musculoskeletal symptoms (AIMSS), which result in an 18–24% discontinuation rates during the first 2 years [2, 5].

Up to 61% of women treated with AIs experience AIMSS, which adversely affects quality of life [2, 6]. Factors that may contribute to AIMSS include prior chemotherapy, in particular taxanes, obesity, and the interval time since menopause [7]. Previous studies have suggested that exercise, non-steroidal anti-inflammatory drugs, acetaminophen, opioids, dietary supplementation, vitamin D, serotonin and norepinephrine reuptake inhibitors, and acupuncture may improve AIMSS; however, no confirmatory studies have been reported and there are currently no formal guidelines [7,8,9,10,11].

In women with skeletal metastasis from breast cancer bisphosphonates can decrease bone-related pain and improve quality of life [12, 13]. Furthermore, a retrospective study of women treated with AIs has demonstrated that women taking bisphosphonates and calcium were less likely to report arthralgias [14]. Bisphosphonates are a particularly attractive class of agents for management of AIMSS because they are well-tolerated, have been associated with reduced bone loss and improved bone mineral density (BMD) in patients treated with AIs, and may also be associated with improved clinical outcomes in states of estrogen-depletion [15, 16]. We hypothesized that women prescribed zoledronic acid while taking adjuvant AI would be less likely to report AIMSS compared to historical controls in a similar population of women taking adjuvant AIs but not taking bisphosphonates.

Materials and methods

Study design

We conducted a single-arm prospective clinical trial in postmenopausal women with ductal carcinoma in situ (DCIS) or stage I-III breast cancer scheduled to initiate adjuvant AI who were not taking bisphosphonates, designated the Zoledronic Acid Prophylaxis (ZAP) trial. Eligible women received 4 mg of intravenous zoledronic acid at baseline and 6 months. Participants started 2.5 mg of daily oral letrozole 1–2 weeks following the initial dose of zoledronic acid. As directed per protocol, the percentage of women who developed AIMSS over 12 months was compared to historical controls from the Exemestane and Letrozole Pharmacogenetics (ELPh) trial (NCT00263913) who were not taking bisphosphonates. The ELPh trial was a prospective multicenter randomized observational open-label trial in postmenopausal women with early-stage breast cancer (stage 0-III) evaluating the effects of 2 years of therapy with either letrozole or exemestane on a variety of biomarkers of estrogen activity and potential AI toxicity, including a prospective evaluation of AIMSS; these results have been previously published [8, 17]. Participants on the ELPh trial who were randomized to letrozole not taking bisphosphonates at any point during the initial 12 months were included in this analyses as historical controls; those randomized to exemestane or taking bisphosphonates were excluded.

The primary objective of this analysis was to compare the percentage of participants reporting AIMSS taking letrozole in both the ZAP and ELPh trials. Secondary objectives examined differences in the Health Assessment Questionnaire Disability Index (HAQ-DI) and pain scores on the Visual Analog Scale (VAS), and AI discontinuation rates between participants of the two studies [2, 8, 9]. We assessed additional patient-reported outcomes (PROs) evaluating depression, anxiety, hot flashes, menopausal symptoms, sleep quality, and overall quality of life, and determined changes in BMD, over 12 months.

Assessment of AIMSS

AIMSS in both studies was defined using the HAQ-DI and VAS instruments, using methods which have been previously validated [2, 8, 9]. AIMSS were defined as an increase of 0.22 in a scale of 0–3 in the HAQ-DI and/or an increase of 2.0 cm in a scale of 10 cm in the VAS. In both studies we assessed AIMSS using the HAQ-DI and VAS questionnaires at 1, 3, 6, and 12 months.

Collection of other patient-reported outcomes

Additional PROs were collected using validated questionnaires at baseline, 1, 3, 6, and 12 months, and included measures of depression, anxiety, hot flashes, menopausal symptoms, sleep quality, and overall quality of life. Depression or depressive symptoms were assessed via the Center for Epidemiologic Studies Depression Scale (CESD), where responses are summed to obtain total scores ranging from 0 to 60, and a higher score reflects more depressive symptoms [18]. Anxiety was assessed with the anxiety subscale of the Hospital Anxiety and Depression Scale (HADS-A); total HADS-A score was computed by summing scores ranging from 0 to 21, where a higher score reflects more depressive and anxiety symptoms [19]. The Hot Flash Related Daily Interference Scale (HFRDI) was used to assess the severity of hot flashes. Participants rated the degree to which hot flashes had interfered with each item during the previous week using a 0 (do not interfere) to 10 (completely interfere) scale, and a total score was computed by summing items and ranged from 0 to 100 [20]. Menopausal symptoms were assessed using the revised National Surgical Adjuvant Breast and Bowel Project (NSABP) menopausal symptoms questionnaire assessing 47 common symptoms. A total score was computed by summing items and ranged from 0 to 90 where higher scores reflect more menopausal symptoms [21]. Sleep quality and disturbance were assessed using the well-validated Pittsburgh Sleep Quality Index (PSQI), where a global PSQI score was computed by summing scores from the seven component scores and ranged from 0 to 21 and scores 5 or greater suggest poor sleep quality [22]. An overall rating of quality of life was assessed using the EuroQol. Participants were asked to provide a single number to describe their own health state that day using a Cantril-like ladder scale with best health at the top (100) and worst health (death) at the bottom (0) [23]. Participants were also asked to provide a rating for an average woman their own age for use as a comparison standard to judge their own rating.

Bone mineral density

BMD was measured in the anterior-posterior spine, femoral neck, trochanter, and total hip at baseline and after 12 months using dual-energy X-ray absorptiometry (DEXA) scans. T scores were extracted from individual scan reports, and T scores in their respective bones assessed at baseline and 1 year.

Statistical approach

Sample size was calculated based on the assumption that zoledronic acid would reduce the frequency of AIMSS after 12 months of treatment from the expected frequency of 50–30% within the ZAP trial. Allowing for a 20% drop out rate of the 59 planned patients for accrual, 47 evaluable patients for analysis would yield 80% power to detect an improvement in the frequency of AIMSS from 50 to 30% with a two-sided type 1 error of 5%. Characteristics of participants and scores at baseline were summarized with descriptive measures and compared between studies with Fisher’s exact test and Wilcoxon rank sum tests. AIMSS was calculated based on HAQ-DI and VAS scores as described above, and were summarized at 1, 3, 6 and 12 months for each study. The development of AIMSS at any time in the first year was explored between studies using a logistic regression model adjusting for patient age, prior chemotherapy, and prior tamoxifen use where data were available. Differences between studies was explored in subgroups (age, prior therapy, BMI, baseline VAS) using interaction analyses. Differences in the frequency of AIMSS at each time point between studies were estimated longitudinally using linear contrasts and summarized with odds ratios from a multivariable mixed-effects logistic regression model that included fixed effects for study-by-time point main effects and interaction, age and prior chemotherapy as covariates, and a random intercept for each patient to account for repeated measures.

Changes from baseline in HAQ-DI and VAS between studies were estimated by dichotomizing the HAQ-DI and VAS scores as 0 versus > 0. Multivariable mixed-effects logistic regression models and linear contrasts were used to estimate odds ratios for comparing ZAP to ELPh each follow-up time point relative to baseline. One model was fit for each outcome with fixed effects for study-by-time point main effects and interaction, age, prior chemotherapy, and prior tamoxifen use, and a random intercept for each subject. The cumulative probability of AI discontinuation or change to another endocrine therapy by 1 year was evaluated using the Kaplan–Meier method.

Changes from baseline in PROs for the ZAP cohort were analyzed using mixed-effects linear regression models that included the log-scale PRO as the dependent variable and fixed effect indicator variables for the 3 follow-up time points and a random intercept for each patient to account for repeated measures. Frequency distributions of BMD at baseline and 12 months were assessed. BMD and T score data are summarized at baseline and 12 months using means and standard deviations (SD). Changes in BMD and T scores were analyzed using paired t tests.

Results

From February 2011 to January 2013, 63 women were consented for the ZAP study. Of these, three women were found to be ineligible during screening (two based on creatinine clearance < 50 mL/min, and one due to metastatic disease observed on routine imaging). Of the 60 eligible women 59 eligible are included in the analysis (Fig. 1). All 59 (100%) women who enrolled received baseline zoledronic acid and 52 (88%) received the 6-month dose. Of the 7 who did not receive the 6-month dose, four withdrew consent or came off study early for reasons not otherwise specified; one decided to switch to tamoxifen; one decided not to receive the follow-up dose due to bone and jaw pain (not diagnosed with osteonecrosis of the jaw); and one had recurrent disease, requiring study discontinuation. Of the 503 women enrolled in the ELPh study from August 2005 until July 2009, 206 were eligible as historical controls (Fig. 1). A higher proportion of women in the ELPh trial received prior tamoxifen, and had higher VAS and HAQ-DI scores at baseline; however, there were no additional differences between cohorts that were statistically significant (Tables 1, 2).

Fig. 1
figure 1

Consort diagram for ZAP and ELPh cohorts

Table 1 Participant characteristics in ZAP and ELPh cohorts
Table 2 Median HAQ-DI and VAS scores and odds ratios for differences between ZAP and ELPh cohorts over 12 months

Women on the ZAP trial tolerated zoledronic acid well with no major toxicity, including no reports of osteonecrosis of the jaw or documented hypocalcaemia. The most common side effects attributed to zoledronic acid reported by women in the ZAP study were flu-like symptoms (n = 23, 39%) and gastrointestinal side effects (n = 14, 24%).

Participants in the ELPh cohort had higher VAS and HAQ-DI scores at baseline compared to the ZAP cohort. HAQ-DI scores stayed the same in ELPh and ZAP (interaction p-value = 0.23), however, VAS scores increased in ELPh but remained stable in ZAP (interaction p-value = 0.02). Over the 12-month study period, VAS and HAQ-DI scores remained higher in the ELPh cohort compared to the ZAP cohort (Table 2). When both HAQ-DI and VAS were scored to assess AIMSS, presence of AIMSS at each time point throughout the year was significantly lower in ZAP compared to the ELPh cohort (Fig. 2). Participants on ZAP were significantly less likely to report AIMSS at any point in the first 12 months (odds ratio (OR) = 0.28, 95% confidence interval (CI) 0.14–0.53, p < 0.0001, adjusted for baseline VAS score and prior use of tamoxifen). There was no interaction between study cohort and prior tamoxifen use or study cohort and baseline VAS on reporting AIMSS in the first 12 months. Exploratory subgroup analyses found that participants in ZAP were less likely to report AIMSS irrespective of age, prior taxanes or tamoxifen use, BMI, or baseline VAS scores (Supplementary Fig. 1). Five (8%) women on the ZAP study, and 22 (11%) on the ELPh study discontinued AI or changed endocrine therapy during the first year (p = 0.81). The cumulative probability of discontinuing AI or changing endocrine therapy by 1 year was 9% for ZAP and 14% for ELPh (log rank p = 0.57). None (0%) of the ZAP participants discontinued AI after the 6 month time point, compared to nine (5%) discontinuations on ELPh (Table 3).

Fig. 2
figure 2

Percentage of participants reporting AIMSS in the ZAP and ELPh cohorts over 12 months

Table 3 Cumulative probability of discontinuing AI or crossing over to the other treatment arm (ELPh patients)

Analysis of other PROs within the ZAP trial did not reveal any significant change at any time point during the 12-month study period compared to baseline (Fig. 3). However, by 12 months, ZAP participants rated their overall quality of life (EuroQol) higher compared to what they felt those their own age would have.

Fig. 3
figure 3

Patient reported outcomes (PRO) in ZAP participants over 12 months assessing changes in (a) depression symptoms (Center for Epidemiologic Studies Depression Scale, CESD), b anxiety and depression (Hospital Anxiety and Depression Scale, HADS-A), c hot flashes (Hot Flash Related Daily Interface Scale, HFRDI), d menopausal symptoms (National Surgical Adjuvant Breast and Bowel Project menopausal symptoms questionnaire, NSABP Menopausal Score), e sleep quality (Pittsburgh Sleep Quality Index, PSQI), and f overall quality of life

In the ZAP study, 30 patients had normal BMD (52%), 24 had osteopenia (41%), and 4 had osteoporosis (7%) at baseline. Of the 48 patients with 12 month BMD data, 42 remained with normal BMD (88%) and four patients had an improvement in BMD category (8%), while only two patients had declines in BMD category (4%). Furthermore, mean T score in the trochanter improved by 0.12 (95% CI 0–0.23, p = 0.046), and both mean T score and BMD in the spine improved by 0.23 (95% CI 0.13–0.33, p < 0.001) and 0.03 (95% CI 0.01–0.04, p < 0.001), respectively (Supplementary Table 1).

Discussion

This is the first prospective study to demonstrate that the administration of zoledronic acid in the adjuvant setting, compared to historical controls, is associated with a decreased incidence of AIMSS in women taking adjuvant letrozole. Additional exploratory analysis demonstrated probability of AI discontinuation in women using zoledronic acid was numerically lower than in those not receiving zoledronic acid (9 versus 14 month time to discontinuation (TTD), Table 3), although this was not statistically significant (HR = 0.87, 95% CI 0.32–2.35, p = 0.57). These findings are particularly important since AIMSS is a leading cause of premature AI discontinuation.

Bisphosphonates have anti-inflammatory properties, and studies have found they decrease cytokines such as tumor necrosis factor alpha, and interleukin 1 and 6 [24, 25]. Furthermore, in patients with rheumatoid arthritis, bisphosphonates have been shown to deplete synovial macrophages and decrease expression of adhesion molecules in the synovial lining [26]. Estrogens have been found to have anti- and pro-inflammatory properties, thus by decreasing estrogens, AIs may be increasing local inflammation in synovial fluid leading to pain [27, 28]. The anti-inflammatory effects of bisphosphonates, especially in synovial fluid, may explain our findings, although additional research is needed to elucidate the pathology of AIMSS and the effects of bisphosphonates.

The ZAP trial is a single-arm prospective study. Thus, results may be influenced by a placebo effect. While a placebo effect cannot be fully mitigated in a single-arm study, to compare the effects of bisphosphonates on AIMSS we used a similar cohort of women taking AIs from another prospective study with similar entry criteria as historical controls. Both the ZAP and ELPh trials were conducted close to the same time, and participants underwent evaluation of AIMSS at the same intervals using identical tools. Both cohorts had similar characteristics, with the exception of previous tamoxifen use and baseline VAS scores. Previous use of tamoxifen has been associated with joint stiffness; however, data from the ELPh trial have not demonstrated that tamoxifen use predicts AIMSS [8, 29]. ELPh participants reported higher VAS scores at baseline compared to ZAP participants; however, association of high baseline VAS scores with AI discontinuation is weak (HR 1.09, 95% CI 0.99–1.2, p = 0.067) [30]. Indeed, adjusted analysis for prior tamoxifen use and higher baseline VAS did not change our primary findings. Furthermore, exploratory subgroup analysis including age, BMI, those who previously received taxanes or tamoxifen, and baseline VAS scores, found that participants in ZAP reported improve rates of AIMSS.

While ZAP participants had a decreased probability of AI discontinuation compared to ELPh participants, this result did not reach statistical significance. These results, however, must be regarded as exploratory, since the ZAP study was not designed or powered to assess this endpoint. Rates of AI discontinuation are particularly relevant as recent data suggest extended AI regimens may improve disease free survival, and increasing data suggest that AIs may have a role in the adjuvant treatment of premenopausal women [4, 31].

While the recommended duration of adjuvant AI therapy is 5 to 10 years, in this pilot study we only evaluated AIMSS over 1 year. Although the every 6 month schedule used in this study could potentially be extended through 5 years, the majority of AIMSS are usually reported within the first 3 months and peak by 6 months [6, 8].

Although we did not observe a difference among various PROs within the ZAP trial, participants consistently reported superior perception of quality of life compared to those their own age. Similar improvements in overall quality of life have been described in patients with metastatic breast cancer taking bisphosphonates [32]. Furthermore, our study shows that 96% of patients either maintained a normal bone density or had an improvement in BMD category, and mean T scores in the trochanter and spine improved, demonstrating the known beneficial effects of bisphosphonates on bone health. Other studies have also confirmed that the benefit of zoledronic acid on BMD translates to improvements in preventing fractures [33].

Our results contribute additional rationale for the potential role of bisphosphonates in the adjuvant setting for women taking AIs. Bisphosphonates are approved for the treatment of osteoporosis by improving BMD and decreasing the risk of fractures. The detrimental effects on BMD caused by AI therapy are well studied, and we demonstrate as others have that bisphosphonates can reverse this effect [34, 35]. As studies suggest extended AI therapy may modestly improve rates of breast cancer recurrence, bone health will become an increasingly relevant factor [36]. Furthermore, several studies have demonstrated decreased rates of skeletal metastasis and improved survival in women taking adjuvant bisphosphonates for early breast cancer, particularly in estrogen-depleted states where AI therapy would be a consideration [16, 37, 38].

In summary, our data suggest that bisphosphonates are associated with reduced incidence of AIMSS in postmenopausal women with early breast cancer treated with letrozole. While these data suggest bisphosphonates may have important implications in bone health, they may also impact survival either through direct anti-cancer mechanisms or improving tolerance to AI therapy. These results add additional evidence and rationale for the potential role of bisphosphonates in the adjuvant setting; therefore, definitive randomized prospective studies evaluating the role of bisphosphonates in the adjuvant setting to prevent AIMSS are warranted.