Introduction

Five or more years of adjuvant endocrine therapy (ET) with an aromatase inhibitor (AI) is superior to selective estrogen modulator therapy for reducing deaths due to postmenopausal hormone receptor-positive (HR +) breast cancer [1]. High rates of discontinuation and higher recurrence rates in non-adherent patients, however, have emerged as a challenge in clinical practice [2,3,4,5,6,7,8].

ET side effects are reported by more than one-third of patients as the main cause for early discontinuation [9]. For AIs, which act by suppressing peripheral estrogen synthesis, new onset or worsening of musculoskeletal symptoms (MSS) (e.g., arthralgia) is a common reason given by patients for non-adherence [10]. Currently, there is no consensus definition for AI-MSS, though symmetrical joint pains characterize AI-associated MSS (AI-MSS) and involve the wrists, hands, hips, and knees along with morning stiffness and general myalgia [11, 12]. Onset and intensity of AI-MSS are variable, typically appearing within the first 2 months and peaking between 3 and 6 months, though the natural history thereafter remains poorly characterized [13,14,15]. Pooled incidence estimates of 46% using “joint pain without objective pathology” [12], and findings that symptoms are relieved by discontinuing AI use, demonstrate unequivocally that AI-MSS is a drug-induced disorder.

AI-induced estrogen depletion has been suggested to increase inflammation, mediated by bradykinin and pro-inflammatory prostaglandins [16]. However, the exact mechanism(s) that underlie AI-MSS are unknown. Experimental evidence suggests that AIs enhance sensitivity to pain signals (i.e., nociception) [17]. This includes findings that AI drugs react chemically with the transient receptor potential ankyrin 1 (TRPA1), a ‘pain-transducing’ cation channel expressed on primary sensory neurons of the dorsal root ganglia, to promote neurogenic inflammation and AI-MSS in mice [18, 19]. Such nociception mechanisms may explain benefits of the selective serotonin and norepinephrine reuptake inhibitor (SSNRI) duloxetine (Cymbalta) [20] and acupuncture [21, 22] in patients with moderate-to-severe pain.

Clinically, current practice recommendations [23] are directed at managing symptoms based on their severity. Initial therapy often includes use of non-steroidal anti-inflammatory drugs (NSAIDs) based on anecdotal evidence and on efficacy in osteoarthritis/joint symptoms in postmenopausal women [24]. For the prevention and treatment of mild-to-moderate symptomology, emphasis is now increasingly placed on weight loss and on the benefits of aerobic and strengthening exercises, including yoga, on moderate-level evidence [25] and the other associated health benefits of exercise. Use of an alternate AI or switching to tamoxifen and introduction of an SSNRI show benefit in patients with more severe symptoms and are recommended [23].

As part of our investigation of the NSAID sulindac for anti-cancer effects on breast density after 12 months of use in patients on AI therapy [26], we included longitudinal measures of MSS severity using validated patient-reported outcome (PRO) questionnaires. Sulindac, marketed as Clinoril™, is a non-selective NSAID approved for use for the treatment of osteoarthritis and rheumatoid arthritis. Its activity is attributed to the non-selective inhibition of cyclooxygenase 2 and synthesis of pro-inflammatory prostaglandin E2. MSS severity over 12 months was also evaluated in a separate, non-randomized control group of patients on AI therapy with similar eligibility criteria to patients receiving sulindac.

Methods

Study design and patient eligibility

We conducted an open-label phase II study of sulindac (Clinoril®) 150 mg given orally, twice daily for 12 months in patients on AI therapy for HR + postmenopausal breast cancer for effects on breast density [26]. A total of 58 patients enrolled and initiated a 4-month NSAID washout, of which 50 started sulindac treatment and 43 completed to 12 months. Separately, a non-randomized control arm of 56 postmenopausal breast cancer patients on AI therapy was enrolled to observation only, and 40 completed to 12 months. Eligible patients included postmenopausal women age ≤ 75 years stable on AI therapy for at least 3 months for treatment of early-stage HR + breast cancer. Patients were enrolled at The University of Arizona Cancer Center or the Stony Brook Cancer Center. For the sulindac arm, additional inclusion criteria included willingness to refrain from NSAID use (low-dose aspirin [≤ 81 mg/day] was permitted), adequate renal function, normal or controlled blood pressure, and no contraindications to NSAIDs. There was no pre-specified baseline pain or arthralgia level requirement. The study was registered with ClinicalTrials.gov (NCT01761877) and approved by the institutional review boards (IRB) at both study sites. All patients provided written informed consent.

Outcome assessment

For arthralgia symptom severity and pain, patients completed the WOMAC (version 3.1) and other questionnaires at baseline (prior to initiation of sulindac) and after 3, 6, and 12 months. The WOMAC is a 24-item instrument validated to assess lower-extremity joint symptoms in the past 7 days using 3 subscales: pain (5 items), stiffness (2 items), and physical function (17 items) [27]. All questions use a 5-point rating scale (0 = none to 4 = extreme). As discussed in Bellamy [27], for convenience and for comparison purposes to previous studies, total scores and each subscale were normalized to a range of 0–100. Higher scores indicate greater symptom severity. In the case of missing data, subscales were considered valid as long as ≤ 3 core items were missing for function, ≤ 1 for pain, and ≤ 1 for stiffness.

The Brief Pain Inventory Short Form (BPI-SF) is a 15-item questionnaire developed and validated for use in cancer [28] and osteoarthritis [29] patients to assess worst pain, pain severity, and pain interference (range from no symptoms to worst, 0–10) over the past 24 h. Pain severity was derived as the average of responses to 4 questions on worst pain, average pain, least pain, and pain right now (complete data required). In the case of the interference score, 4 of 7 responses were required, and the average interference score of the completed items was imputed for missing values, as previously described [28].

The 28-item Functional Assessment of Cancer Therapy—General form (FACT-G) uses a 5-point Likert scale [0 (not at all) to 4 (very much)] that measures functional and physical well-being (score range 0–108) comprising 4 subscales: physical well-being (7 items, score range 0–28), social/family well-being (7 items, score range 0–28), emotional well-being (6 items, score range 0–24), and functional well-being (7 items, score range 0–28). For missing data, if ≥ 50% of the items within a subscale were answered, a subscale score was computed as the prorated sum of the item responses for that subscale as described [30]. The FACT-G total score was computed as the sum of the 4 subscale scores for all participants with an overall item response of at least 80% (i.e., at least 22 of the 27 items answered). Negatively worded items (all 7 physical well-being questions, and 5 of the 6 emotional well-being questions) were reverse scored prior to summing so that higher subscale and total scores indicate better QOL [31].

Toxicity assessment

Toxicity data were collected prospectively for the sulindac group and are detailed in Thompson et al. [26].

Statistical analyses

Spearman correlation coefficients measured associations between baseline patient characteristics (age, BMI, time on AI, anastrozole, letrozole, exemestane) and each summary pain/QOL score. Change in each outcome at 3 (sulindac group only), 6, 9 (sulindac group only), and 12 months relative to baseline (after washout for sulindac) was estimated in each group separately using linear mixed effects models with an indicator variable for each time point, adjusted for age at baseline, body mass index (BMI), time on AI, and use of non-NSAID pain medication. To assess the importance of baseline symptoms to change, a sensitivity analysis was conducted for the subset of subjects whose baseline total WOMAC was greater than the sulindac group–specific median of 19.2 points (sulindac group, n = 25; observation group, n = 14). No corrections were made for multiple comparisons.

Results

Study population

Participants who started sulindac or observation had a respective median age of 61.9 and 63.0 years, median BMI of 26.9 and 28.2 kg/m2, and were primarily non-Hispanic White with some college education. Patients enrolled to both arms had a similar median time on AI therapy prior to enrollment of ~ 1 year (Table 1). Anastrozole was the most prescribed AI therapy. Patients who enrolled to sulindac reported a higher use of exemestane than those enrolling to observation, likely a reflection of switching from one of the non-steroidal AIs (letrozole or anastrozole) due to intolerance. In the sulindac intervention group, 13 (26%) participants reported taking low-dose aspirin, and 8 (16%) reported any use of non-NSAID medication for pain (e.g., tramadol, duloxetine), of which 4 were also taking low-dose aspirin. For patients enrolled to observation, 15 (30%) reported taking an NSAID or aspirin at any dose, of which 13 (26%) reported using low-dose aspirin. Of 10 (20%) who reported any use of non-NSAID medication as needed for pain, 5 were also taking low-dose aspirin.

Table 1 Baseline characteristics of AI Sulindac and AI Observation trial participants

Baseline pain in each study group

For patients enrolled to sulindac, baseline measures occurred after a 4-month NSAID washout. Mean WOMAC subscale scores (Table 2) were in the mild range of severity: Total (22.9), Pain (23.5), Stiffness (34.0), and Physical Function Limitation (21.4). Forty percent of patients reported moderate or higher symptom severity in at least one WOMAC subscale with average stiffness severity scores higher than the other subscales. The individual question ‘how severe has your stiffness been after you first woke up in the morning?’ yielded the highest group mean score (37.0). Consistent with previous reports [12], pain in the joints, particularly the hands and wrists, were more common (data not shown). Mean BPI worst pain was mild at 3.34, with 42% reporting no or minimal joint pain (BPI worst pain score ≤ 2), 26% reporting mild joint pain (BPI worst pain score 3–4), and 30% reporting moderate-to-severe pain (BPI worst pain score 5–10). Overall, the study population reported high QOL (mean total FACT-G, 89.7 of 108 points).

Table 2 Predicted means (95% confidence intervals)a in WOMAC, BPI-SF, and FACT-G scores at 0, 3, 6, 9, and 12 months in patients enrolled to sulindac (n = 50)

For patients enrolled to observation, the baseline mean WOMAC subscale scores ranged from no to mild symptom intensity: Total (12.5), pain (13.5), stiffness (22.2) and physical function limitation (11.0) (Supplemental Table 1). Only 22% of participants reported moderate symptom severity in at least one WOMAC subscale, with stiffness contributing more than pain or physical function limitations. Baseline mean pain measured by BPI worst pain was low at 2.3, with 50% reporting no or minimal joint pain (BPI worst pain score ≤ 2), 34% reporting mild joint pain (BPI worst pain score 3–4), and 16% reporting moderate-to-severe pain (BPI worst pain score 5–10).

Change in MSS over 12 months in each study group

From baseline to end of treatment at 12 months, patients enrolled to sulindac reported small, but significant, improvements in each of the WOMAC subscales (Table 2). Decreases in total WOMAC (− 5.8, P = 0.003), pain (− 5 .4, P = 0.043), stiffness (− 9.5, P < 0.001), and function (− 5.6, P = 0.006) were observed. No change was observed in BPI worst pain, severity, or interference. For QOL measures, a small but significant improvement in Total FACT-G (+ 3.0, P = 0.028) was observed, with non-significant changes in physical, social/family, emotional, or functional subscores. Responses to the FACT-G question, “I am bothered by side effects of my treatment” significantly improved (0.65–0.22, P = 0.008) over the 12-month intervention. The most clinically relevant MSS improvements are illustrated in Fig. 1A–C and included reduction of moderate-level complaints related to functional limitations (getting out of bed, doing heavy chores) and stiffness, as opposed to pain.

Fig. 1
figure 1

Representative responses to individual WOMAC questions about pain, stiffness and physical function at baseline, 6 and 12 months for sulindac and observation groups

When restricted to patients enrolled to sulindac whose baseline Total WOMAC score was above the group median of 19.2 points (n = 25), significant improvements were observed in all WOMAC subscales at 3, 6, 9 and 12 months (Table 3). By 12 months, 37% of patients with higher baseline symptoms reported ≥ 50% improvement in Total WOMAC (i.e., meaningful clinical improvement [32, 33]) and a respective 39% and 42% for WOMAC pain and physical function limitation subscales. However, only 16% of patients in this subgroup experienced ≥ 50% decrease in stiffness symptom severity by 12 months. BPI worst pain also decreased by 1.4 points, from 4.8 to 3.4 (P = 0.015) with decreases in BPI severity (0.82 points, P = 0.033) and interference (0.98 points, P = 0.008). Overall QOL improved at 12 months (increase in Total FACT-G from 84.1 to 90.3, P = 0.003) with improvements in physical well-being scores from 21.7 to 23.8 (P = 0.002) at 12 months. Underlying this change were significant improvements in specific questions related to energy level, treatment-related side effects, meeting needs of family, and pain level.

Table 3 Predicted means [95% confidence intervals]a in WOMAC, BPI-SF, and FACT-G scores at 3, 6, 9, and 12 months in patients enrolled to Sulindac with elevated musculoskeletal symptoms at baseline (n = 25)

For the observation group, there were no significant changes in patient-reported outcomes for WOMAC, BPI-SF, or FACT-G (Supplemental Table 1). Illustrated with a few specific questions in Fig. 1D–F, daily activities, pain, and stiffness levels showed no significant improvement over the 12 months of observation. In patients with complete data whose baseline Total WOMAC scores were above the sulindac baseline median (n = 13), mean WOMAC pain scores declined (35.7–32.0), as did stiffness (50.0–46.9) and function (31.0–29.2), but the decreases were small and not significant (Supplemental Table 2). Furthermore, patients in this subgroup reported significant increases in BPI-Interference (0.9, P = 0.018) and decreases in Physical Well-Being subscale (− 2.4, P = 0.039), along with no significant increase in BPI worst pain (1.3, P = 0.125). The proportion of the 14 observation patients with elevated baseline symptoms who experienced a ≥ 50% improvement in symptoms over 12 months of observation was 12.5% for Total WOMAC, 25% for WOMAC pain, and 12.5% each for WOMAC stiffness and physical function limitation.

Safety and tolerability of sulindac

Overall, sulindac therapy was well tolerated in the study sample, with grade 1 and 2 gastrointestinal adverse events (AE) [nausea, abdominal pain, gastroesophageal reflux disease (GERD)] as the primary side effect [26]. Of 50 participants, 11 experienced a grade 2 AE (n = 9) or serious AE (SAE, n = 2) considered possibly or probably related to study agent. Three AEs, including two SAEs (1 transient pancreatitis and 1 cerebral hemorrhage in a patient with amyloid angiopathy) and one grade 2 gastrointestinal event, resulted in early discontinuation of study drug.

Discussion

In this open-label study of sulindac at doses used to treat arthritis in breast cancer patients on AI therapy, 40% of patients reported moderate-to-high symptom severity in at least one of the WOMAC subscales at baseline. Moderate-to-severe stiffness, rather than pain, was the most common and disabling condition. After 12 months of sulindac intervention, the proportion of patients with moderate or higher symptom severity reduced to 18.6%, with significant declines in Total WOMAC and all three WOMAC subscale scores. Patients enrolled to observation had no improvements in Total WOMAC or any WOMAC subscale over the year of follow-up.

Consistent with the recommended use of NSAIDs for short duration [23], improvements in the sulindac group occurred by 3 months and one-third experienced ≥ 50% improvement in WOMAC pain and function by 12 months. The mean difference of − 15.8 points for function and − 20.6 points for stiffness meet consensus values of absolute change necessary for minimal clinically important improvement (MCII) [32]. However, the 14.3-point decrease for WOMAC pain is less than the consensus 20-point change for MCII.

Improvements in WOMAC physical function and stiffness in the sulindac group were most significant in patients with baseline symptoms above the group median. Notably, baseline arthralgia symptoms were 13.1 points higher in overweight and obese women relative to healthy weight women. In exploratory analyses, statistically significant improvement in Total WOMAC scores at 3, 6, 9, and 12 months was present only among overweight/obese patients in the sulindac group (Supplemental Table 3). No change in Total WOMAC was observed over 12 months in the observation group by BMI (data not shown).

A number of research groups have conducted randomized clinical trials of drug or supplement interventions for AI-MSS using change in BPI worst pain score as the primary outcome (e.g., duloxetine [20], vitamin D [33, 34], omega-3 fatty acids [ω-3 FA] [35, 36]). These studies enrolled patients selected for moderate-to-severe BPI worst pain scores > 3. Patients in the current study were stable on AI therapy with no eligibility criteria based on baseline pain severity. Among all 50 participants who initiated sulindac, 20 (40%) had a baseline BPI worst pain score ≥ 4, and 16 completed 12 months of therapy. Of these 16 patients, 13 (81%) showed any improvement (≥ 1 point), and 9 (56%) improved by ≥ 2 points. Separately, among participants in the observation group, 16 (32%) had a baseline worst pain score ≥ 4, and 11 of these patients were followed for 12 months. Only 5 patients (45%) showed improvement over 12 months without defined therapy, all by ≥ 2 points.

Supporting the potential benefit of sulindac, the change in FACT-G exceeded the 4.0% increase MCII identified from meta-analyses for small but meaningful benefit [37]. This was particularly notable in patients with higher baseline symptoms (9.0% change). The lack of any significant improvements in MSS or QOL in the observation group suggest that for patients on AI therapy with MSS, symptoms likely persist, remain stable, or even worsen, and affect QOL over time.

While the relevance of our findings to AI adherence is unknown, of 14 patients who responded at baseline to the FACT-G item, “I am bothered by side effects of treatment” and who completed 12 months of sulindac, 13 (93%) reported not being bothered at end of study. Of note, only one sulindac patient discontinued AI therapy across 12 months versus four observation patients.

Major limitations are the lack of randomization or a placebo control. As such, the results may be due in part or entirely to placebo effects. Placebo effects in pain and arthritis studies are well documented. For AI-MSS studies, several randomized placebo-controlled trials including testing effects of duloxetine [20] or supplementation with vitamin D [33, 34] or ω-3 FA [35, 36] on AI-associated pain have shown small to large placebo effects up to 50%. These and our findings may reflect differences in the persistence and possibly underlying pathology of MSS in AI patients with mild-to-moderate symptoms.

While the use of NSAIDs is recommended for new onset or worsening arthralgia in patients initiating AI therapy [23], only one randomized study to our knowledge has published on the effect of an NSAID on AI patient–reported outcomes [38]. In secondary analysis of the Celecoxib Anti-Aromatase Neoadjuvant (CAAN) trial, FACT-G QOL among women taking celecoxib + exemestane was higher than in women taking either exemestane or letrozole alone. Neither BPI nor WOMAC were assessed. Separately, in a published abstract from the ETAN Trial, a phase III randomized placebo-controlled trial of etoricoxib (60 mg/day) in breast cancer patients on AI for the prevention of breast cancer recurrence, MSS pain symptoms were reported to improve with intervention compared to placebo (though the trial discontinued early) [39]. With median etoricoxib and placebo treatment durations of 14 and 12 months, respectively, 50 of 73 treated patients (68%) reported decreased pain compared to 16 of 67 (24%) in the placebo group. However, neither the methods for pain assessment nor the magnitude of change in pain were reported.

In the current study, MSS severity reflects profiles seen in clinical practice. Further, improvement in patient-reported stiffness and functional limitations with sulindac and greater benefit in overweight and obese patients are consistent with underlying contribution of inflammation-associated arthralgia symptoms. As with other NSAIDs, GI upset was the most common and expected side effect with sulindac use occurring in 10% of subjects [26]. While long-term use of NSAIDs is discouraged due to rare cardiovascular risk with extended use [40], further examination of the benefit of short course NSAIDs, especially in combination with efforts directed at increasing physical activity and promoting weight loss among overweight and obese patients initiating AI therapy, is warranted.