Introduction

Breast cancer is the most frequently diagnosed cancer in women globally (1.68 million new cases estimated in 2012) [1, 2]. The 5-year relative survival rate in many developed countries has improved steadily in recent decades [1, 3]. As the prevalent breast cancer survivor population continues to grow [4], important questions remain regarding long-term standard of care, physical functioning, and health-related quality of life (HRQoL) in this patient group.

Treatment of breast cancer can include surgery to the breast and axilla and adjuvant chemotherapy, radiotherapy, and endocrine therapies. These interventions have increased survival [57] but can induce chronic side effects such as breast cancer-related lymphedema (BCRL) [8], upper body functional impairment [9], chronic fatigue [10], weight gain [11, 12], bone loss [13], inflammation [14], immunosuppression [15], peripheral neuropathy [16], and psychological impairments (e.g., depression) [17]. The adverse effects of breast cancer treatment are often associated with decreased physical activity [18] and fitness [9, 19], and impairments of physical functioning [20] and HRQoL [21]. Low physical functioning and HRQoL, in turn, contribute to greater mortality in this population [22, 23].

Progressive resistance training (PRT) is an anabolic exercise modality that can potentially target many of the adverse effects of breast cancer treatment, improving physical functioning and HRQoL [24]. However, there have been concerns regarding the safety of strenuous upper body PRT in women treated for breast cancer, particularly on the risk of BCRL [25]. Since 2006, several randomized controlled trials (RCTs) have investigated the safety and efficacy of PRT regimens involving upper body exertion [2640]. However, these data have not yet been systematically reviewed; accordingly, recommendations for PRT (and prescribed exercise training in general) remain absent from clinical guidelines [4143]. We therefore systematically assessed the total body of RCT evidence on the safety and efficacy of PRT to inform clinical guidelines and practice.

Methods

Search strategy and study selection

Search strategy: A systematic review of all published literature using the following electronic databases was conducted up to November 2013: MEDLINE (OvidSP, Wolters Kluwer), PubMed (NCBI, U.S. National Library of Medicine), ScienceDirect (SciVerse, Elsevier), SPORTDiscus (EBSCOhost, EBSCO), Scopus (SciVerse, Elsevier), Web of Science (Web of Knowledge, Thomson Reuters), the Cochrane Library (John Wiley & Sons), Embase (OvidSP, Wolters Kluwer), CINAHL, and Google Scholar. Search syntaxes were developed in consultation with an experienced university librarian taking into account a broad range of terms and phrases used in definitions related to breast cancer (e.g., breast cancer, breast neoplasm, breast carcinoma, breast tumor, mammary carcinoma, etc.) and resistance training (e.g., resistance training, resistance exercise, weight training, weight lifting, strength training, etc.). A sample search strategy (PubMed and Scopus) has been presented in the Electronic Supplementary Material (Appendix S1). Reference lists of retrieved full-text articles and recent reviews were examined to identify additional articles not found by our search strategy.

Study selection: Electronic references were compiled in an Endnote X6© (Thomson Reuters) file, and duplicates were identified and deleted. Two authors (BSC and EA) independently reviewed the titles and abstracts of each reference for potential inclusion. Each reviewer then performed a second screening on the full-text version of these articles, and disagreements were resolved by discussion. RCTs that investigated the isolated effects of PRT on BCRL (number of cases of incidence or exacerbation, arm volume, and severity of BCRL symptoms) and/or upper body strength, and/or lower body strength and/or HRQoL in women surgically treated for primary tumor of the breast were included. PRT interventions may have included but were not restricted to, any form of resistive type exercise using body weight (calisthenics), equipment (machine weights, free weights) or apparatus (elastic bands), and had to have been at least 6 weeks in duration. Studies that prescribed aerobic training plus PRT were excluded, unless a comparison group undertook the same dosage of aerobic training in isolation (i.e., to control for confounding effect of aerobic training). Studies that prescribed flexibility training plus PRT were included given that PRT involves aspects of flexibility training, i.e., loaded movements throughout a complete range of motion. Where multiple PRT prescriptions were tested, higher intensity regimens were prioritised over lower intensity regimens. The review was restricted to articles published in English.

Primary outcomes (safety)

Primary outcomes assessed the effect of PRT on BCRL outcomes, including: (1) cases of BCRL incidence or exacerbation during the trial, (2) arm volume outcomes, and (3) BCRL symptom severity between the treatment and control group. Where multiple measures of BCRL incidence or exacerbation were reported, we prioritized clinician-defined diagnosis with objective physical measurements over other methods. Data for BCRL incidence and exacerbation were combined given that the physiologic mechanism between cases is identical (i.e., a decrease in lymphatic transport capacity relative to lymphatic load) [44]. Where multiple arm volume outcomes were reported, we prioritized measurements of the interlimb volume difference, followed by volume of the affected limb. Where multiple BCRL symptom severity outcomes were reported, we prioritized assessments using the arm symptoms subscale of the European Organization for Research and Treatment of Cancer Breast Cancer Module (QLQ-BR23) [45].

Data about additional PRT-related adverse events (beyond BCRL) were also included for a descriptive synthesis.

Secondary outcomes (efficacy)

Secondary outcomes assessed efficacy data of PRT and included: (1) upper body strength, (2) lower body strength, and (3) HRQoL after intervention (post-treatment) between the treatment and control group. Where multiple upper body muscular strength outcomes were reported, we prioritized the most common measure (i.e., bench press) followed by shoulder press, shoulder flexion, and wrist flexion. Where assessments of upper body strength were completed bilaterally, we prioritized measures of the ipsilateral (affected) extremity over those of the contralateral extremity. Where multiple lower body muscular strength outcomes were reported, we prioritized leg press followed by knee extension. Where multiple HRQoL outcomes were reported, we prioritized domain and then summary scale scores of physical functioning, followed by global scores of HRQoL.

Data extraction

Data extraction of included studies was performed and/or verified independently by three reviewers (BSC, SLK, and PF). Discrepancies were resolved through discussion. Authors of relevant studies were contacted, where possible, for data that could not be extracted from the published articles.

Quality assessment

The following data were extracted from included studies using a standard proforma: study population characteristics, PRT intervention (e.g., specific exercises, number of sets per exercise, number of repetitions per set, intensity (load), frequency, and duration of training and loading progression). Our quality checklist was designed based on established criteria for the assessment of RCTs [46]. Quality items for RCT studies reviewed were (each worth 1.0 numerical point) as follows: (1) evidence of randomization and concealment of treatment allocation, (2) statistical similarity of groups at baseline, (3) specification of eligibility criteria, (4) blinding of outcomes assessors, (5) reporting of compliance, (6) supervision of exercise sessions, (7) reporting of dropouts, (8) presenting data for primary and secondary outcomes, (9) use of intention-to-treat analysis (if data for >90 % of baseline sample were analyzed, a score of 1.0 was given), and (10) reporting of adverse events. Summated scores ranged from 0 to 10 points with higher scores reflecting better quality. The quality assessment was completed and checked by two reviewers (BSC and SLK).

Data synthesis

Three reviewers (BSC, SLK, and EA) independently collated and/or verified extracted data to present a descriptive synthesis of important study characteristics and a quantitative synthesis of effect estimates.

Statistical methods

We pooled and weighted studies first using random effects meta-analysis models and second using fixed effects models for verification [47]. The effect was measured as the difference between groups after the treatment period without correction for possible baseline differences between groups. The mean and standard deviation of the pre- to post-treatment improvement in outcome were unavailable for the majority of papers. While these statistics could have been estimated from the pre- and post-treatment statistics [48], such estimation requires the pre–post correlation. We computed point estimates of correlation for those few papers which provided pre-, post-, and change means and standard deviations [48]. However, as the number of studies with full information was small and the estimated correlations from these studies were not fully consistent, we opted to restrict the analyses to the known post-treatment statistics without correction for possible baseline differences. We checked all studies for differences between groups at baseline and where statistically significant differences were found, we used sensitivity analysis to check the impact of these differences on the pooled results.

To examine the incidence/exacerbation of lymphedema cases, we reported the pooled odds ratio between treatment and control groups and associated 95 % confidence interval (95 % CI). Where articles reported 0 cases in either the treatment or control group, the Haldane continuity correction was applied by adding 0.5 to all four cells [49]. Articles which reported 0 cases in both the treatment and control groups were excluded from the analysis as differences in group size would produce bias in the continuity correction [50]. To examine the effects of PRT on arm swelling, lymphedema severity, upper and lower body strength, and HRQoL outcomes, the standardized mean difference (SMD) from each study was pooled to produce an overall estimate of effect and associated 95 % CI between treatment and control groups. For each meta-analysis model, the degree of heterogeneity was assessed by visual inspection, the I-squared statistic (I 2) (moderate being <50 %) [51], and the χ 2-test of goodness of fit [52]. When evidence of heterogeneity was observed, we checked data extracted from individual outlier studies, qualitatively investigated reasons for their different results, and explored the effects of study exclusion in sensitivity analyses.

We also used sensitivity analysis to investigate the robustness of the meta-analyses models. We variously excluded studies that combined PRT with other exercise modalities or physical therapies, studies that did not include a no-treatment control group, studies that prescribed PRT during the adjuvant chemotherapy treatment phase, studies conducted outside the US, studies of shorter duration (≤12 weeks), studies in older cohorts (≥60 years), studies in which BCRL was an entry criterion, and studies of lower quality (score ≤6.0). Publication bias, which reflects the tendency for smaller studies to be published in the literature only when findings are positive, was assessed visually using funnel plots [53]. All calculations were performed in Stata version 12 (StataCorp, College Station, TX, USA) using the ‘metan’ and ‘metafunnel’ commands. A two-tailed P value <0.05 was considered statistically significant throughout the analyses.

Results

Figure 1 presents a flowchart summarizing identification of potentially relevant studies and those included. Our search strategy identified 446 citations after duplicates were removed. Of these, 392 citations were excluded after the first screening of titles and/or abstracts for inclusion and exclusion criteria. After further assessment of the remaining 54 citations, 40 were excluded (Electronic Supplementary Material, Appendix S2) for reasons listed in Fig. 1. An expert in the field provided one recent citation not captured by our search. Fifteen citations were included in the present review. Most citations were excluded due to being a conference abstract only or due to being redundant citations of the same study.

Fig. 1
figure 1

Flowchart summarizing identification of studies for review. PRT progressive resistance training, RCT randomized controlled trial

Descriptive data synthesis

Table 1 presents the study characteristics of the 15 RCTs included for review, which were published between 2006 and 2013 [2640]. Ten studies were conducted in the USA [26, 27, 2933, 35, 36, 38] with others conducted in Canada [28, 40], Australia [37, 39], and Korea [34]. Major inclusion criteria typically were the completion of all breast cancer-related treatments (except hormonal therapy) [26, 27, 3033, 35, 36, 38] or the initiation of chemotherapy treatment for breast cancer [28, 40]. Lymphedema-related inclusion criteria were lymph node dissection (or sentinel node biopsy) [26, 31, 32, 37] and/or clinical diagnosis of lymphedema by clinician [31, 34, 39]. Major exclusion criteria primarily emphasized uncontrolled cardiovascular diseases and other chronic illnesses that would contraindicate PRT. Lymphedema-related exclusion criteria included bilateral lymph node dissection [3133], bilateral lymphedema [34], history of lymphedema [37], unstable lymphedema [39], and incomplete axillary surgery [28, 40]. Analyzed sample sizes ranged from 21 to 232, resulting in a total of 1,652 participants across studies. Mean age of the samples ranged from 46 to 62 years.

Table 1 Characteristics of randomized controlled trials reviewed

PRT interventions were prescribed two to three times per week in 12 studies (Table 1). Other studies prescribed a split-routine four sessions per week [29] or lower intensity training for five [34] or seven sessions per week [37]. Upper body training was prescribed in all studies (Table 1), and only two studies did not target lower body musculature [34, 37]. PRT was typically prescribed using machine and/or free weights, while two studies used resistance bands only [29, 36], and three studies incorporated a combination thereof [35, 37, 38]. Training sessions were fully supervised in only three studies [28, 39, 40], while ten studies involved partial supervision [26, 27, 3035, 37, 38], and two studies did not provide supervision [29, 36]. In general, lower body PRT was prescribed according to standard training principles for healthy adults. Upper body exercises were initiated at low intensities and progressed according to tolerance in most studies, while four studies prescribed upper body PRT at approximately 65–75 % of one repetition maximum (RM) [28, 35, 38, 40], and one study prescribed upper body PRT at 6–10 RM intensity. All studies indicated that training loads were progressively increased with strength adaptation.

Nine studies compared PRT intervention to usual care (no exercise) [2629, 3133, 37, 39], while three studies incorporated flexibility training as a sham condition [35, 36, 38]. The other three studies compared PRT plus an additional intervention (i.e., calcium and vitamin D supplement [30], complete decongestive physiotherapy [34], and aerobic training [40]) compared to the latter intervention only. Trial durations ranged from 8 to 104 weeks in duration; six studies were ≥52 weeks, three studies were 26 weeks, and six studies ranged from 8 to 17 weeks.

Primary outcomes were (1) cases of lymphedema incidence or exacerbation evaluated via clinician-defined assessment based on multiple objective tests [31, 32], the interlimb volume [28, 35, 37, 38, 39], or circumference difference [26, 34]; (2) extent of arm swelling outcomes evaluated via the interlimb volume difference [28, 31, 39] or volume of the ipsilateral extremity [34]; and (3) lymphedema symptom severity outcomes evaluated via validated [54] questionnaire [31, 32] or the arm symptoms subscale of the QLQ-BR23 [37, 39]. Secondary outcomes were upper body muscular strength evaluated via bench press [28, 31, 32, 35, 36, 3840], shoulder press [29], arm flexion [37], and wrist flexion [30], lower body muscular strength evaluated via leg press [31, 32, 35, 3840] or knee extension [2830], and HRQoL evaluated via the physical global score on the Cancer Rehabilitation and Evaluation System short form [27], the Functional Assessment of Cancer Therapy–Anemia scale [28], and the physical functioning domain score [34, 35, 39, 40] and physical component summary scale [33] of the Medical Outcomes Trust Short Form-36 (SF-36). Quality scores ranged from 5.0 to 9.5, and 13 studies received a score of 8.0 or higher (Electronic Supplementary Material, Table S1).

Quantitative data synthesis

Primary outcomes

Figure 2 presents the OR for the incidence and/or exacerbation of BCRL after PRT intervention between the treatment and control groups for five studies in 647 participants [26, 28, 31, 32, 37]. Four studies [34, 35, 38, 39] were excluded from the analysis given that no cases of BCRL were observed in either group. PRT resulted in significantly lower risk of BCRL incidence/exacerbation compared with control conditions [OR = 0.53 (95 % CI 0.31–0.90)]. There was no statistical heterogeneity between studies (I 2 = 0 %, P = 0.80).

Fig. 2
figure 2

Odds ratio for the incidence/exacerbation of BCRL outcomes after PRT between the treatment and control groups. ID identification, OR odds ratio, CI confidence interval, BCRL breast cancer-related lymphedema

Figure 3 presents the SMD for arm volume (4 studies in 384 participants [28, 31, 34, 39]) and patient-reported severity of BCRL (4 studies in 479 participants [31, 32, 37, 39]) after PRT between the treatment and control groups. PRT did not change arm volume [SMD = −0.07 (95 % CI −0.28 to 0.14)] or patient-reported severity of BCRL [SMD = −0.07 (95 % CI −0.25 to 0.11)] compared with control conditions. There was no evidence of statistical heterogeneity in either of these analyses (both I 2 = 0 %, Fig. 3). Funnel plots showed no evidence of publication bias for either outcome (Electronic Supplementary Material, Figs. S1 and S2).

Fig. 3
figure 3

Standardized mean difference in arm volume and BCRL symptom severity outcomes between the treatment and control groups. ID identification, SMD standardized mean difference, CI confidence interval, BCRL breast cancer-related lymphedema

Descriptive synthesis of additional PRT-related adverse events

Five studies reported that no adverse events occurred as a consequence of exercise training [31, 34, 35, 39, 40]. Other studies generally reported temporary muscle soreness [30] or musculoskeletal injuries. Winters-Stone et al. [38] reported episodes of back (N = 2) and knee pain (N = 1) which resulted in one participant discontinuing with lower body training. Adverse events in the study by Ohira et al. [27] have been documented in a separate article [55] which noted back injuries (N = 4) in the experimental group; however, none of these participants became unable to exercise. Musanti [36] noted two cases of tendinitis (shoulder and foot) during their study but did not specify the group allocation of the participants affected. Brown et al. [56] have summarized the adverse events encountered in three trials included in our review [3133]. Nine women randomized to the PRT group reported 10 musculoskeletal injuries related to training that impaired activities of daily living for ≥1 week [56]. Of these, there were a greater number of incidents in women with BCRL (8 injuries) as compared to those at risk for lymphedema (N = 2) [56]. Courneya et al. [28] reported on two adverse events unrelated to PRT. Three studies did not report on adverse events beyond lymphedema [26, 29, 37].

Secondary outcomes

Figure 4 presents the SMD for upper body muscular strength (11 studies in 1,252 participants [2832, 3540]), lower body muscular strength (9 studies in 1,079 participants [2832, 35, 3840]), and HRQoL outcomes (7 studies in 823 participants [27, 28, 3335, 39, 40]) after PRT between the treatment and control groups. PRT significantly improved standardized upper body [SMD = 0.57 (95 % CI 0.37–0.76)] and lower body [SMD = 0.48 (95 % CI 0.30–0.67)] muscular strength outcomes compared with control conditions. There was evidence of moderate heterogeneity between studies in each of these analyses. The sensitivity analyses (Electronic Supplementary Material, Tables S2 and S3) showed that the pooled SMD was similarly medium to large in the fixed effect model and after each of the various studies was excluded (SMD = 0.49–0.68 and 0.40–0.59 for upper and lower body muscular strength outcomes). Heterogeneity in upper body strength outcomes (I 2 = 58.4 %) could not be explained by our sensitivity analysis, whereas heterogeneity in lower body strength outcomes (I 2 = 46.7 %) was reduced with the exclusion of one study [31] that noted a significant difference between groups at baseline. Funnel plots were produced and showed little evidence of publication bias (Electronic Supplementary Material, Figs. S3 and S4).

Fig. 4
figure 4

Standardized mean difference in upper body muscular strength, lower body muscular strength and HRQoL outcomes after PRT between the treatment and control groups. ID identification, SMD standardized mean difference, CI confidence interval, HRQoL health-related quality of life

Our primary analysis revealed that PRT induced a small improvement in HRQoL [SMD = 0.17 (95 % CI −0.03 to 0.38)] compared with control conditions, but this effect was not statistically significant, and there was evidence of moderate heterogeneity (I 2 = 47.0 %). The sensitivity analyses presented in Table S4 showed that the pooled SMD was similarly small in the fixed effect model and after each of the various studies was excluded (SMD = 0.11–0.24). Notably, the findings became significant, and heterogeneity was reduced, when studies conducted during adjuvant chemotherapy [SMD = 0.30 (95 % CI 0.04–0.55), I 2 = 37.0 %] and studies that did not include a no-treatment control group [SMD = 0.24 (95 % CI 0.01–0.46), I 2 = 29.2 %] were excluded. The corresponding funnel plot showed little evidence of publication bias (Electronic Supplementary Material, Fig. S5).

Discussion

Summary of the evidence

Based on RCT evidence in women surgically treated for breast cancer, our results for safety outcomes were consistent. PRT reduced the risk of BCRL and did not exacerbate arm volume or patient-reported severity of BCRL versus control conditions (Figs. 2, 3). Our finding that PRT nearly halves the odds of BCRL incidence/exacerbation is clinically relevant given that studies have consistently shown that women with a diagnosis of BCRL suffer greater impairments of upper body functioning [57, 58] and HRQoL [59, 60] compared to their non-affected peers. Further, the null effect of PRT on measures of arm volume and BCRL symptom severity indicates that PRT does not worsen lymphedema symptoms, in contrast to prior assertions [25].

For efficacy data, our results indicate that PRT significantly improved upper and lower body muscular strength and induced a small improvement of HRQoL (Fig. 4). The mean improvement in upper body muscular strength was more than half a standard deviation in our primary and sensitivity analysis (SMD = 0.49–0.68) and is clinically relevant. A recent prospective study [61] showed that mean upper body strength of both the affected and unaffected extremity is significantly reduced from pre-surgery to 2.5 years post-treatment in women who received axillary lymph node dissection (both SMD = −0.42). The mean PRT-induced improvement of upper body muscular strength documented in our study (SMD = 0.57) is therefore greater than the expected long-term (2.5 year) decline [61] indicating that PRT can, on average, counteract treatment-induced upper body morbidity [57, 58].

The mean improvement in lower body muscular strength approached half a standard deviation in our primary and sensitivity analysis (SMD = 0.40–0.59) and is also clinically relevant. Breast cancer survivors suffer from significantly reduced leg strength compared to healthy controls (SMD = −1.16) [62]. Moreover, prospective studies have shown that mean lower body strength declines rapidly with age (2.6–3.0 % per year) [63], and the loss of lower body strength (SMD = −0.44) is a powerful predictor of all-cause mortality [64, 65]. Poor leg strength is therefore an important target for rehabilitation in the breast cancer population.

The small effect of PRT on mean HRQoL noted in our primary and sensitivity analysis is also clinically relevant. HRQoL is reduced in women with breast cancer, both at diagnosis and post-treatment, compared to the general population [66, 67]. However, higher levels of physical activity pre- or post-breast cancer treatment can contribute to higher HRQoL, particularly in the physical domains of HRQoL [68, 69]. For example, in a cancer registry study [67] that identified and recruited women at 5, 10, and 15 years post-breast cancer diagnosis, the mean score of the physical functioning domain of HRQoL was reduced at the 5-year (SMD = −0.27) and 10-year timepoint (SMD = −0.18) compared to healthy controls. The magnitude of change of HRQoL in our study was SMD = 0.30 exclusive of two studies conducted during adjuvant chemotherapy [28, 40], suggesting that women engaging in PRT post-chemotherapy can experience an improvement of HRQoL beyond the levels expected in healthy peers.

The effect of PRT on upper and lower body muscular strength remained robust in fixed effect models and after exclusion of studies that combined PRT with other exercise modalities (or therapies), studies without a no-treatment control group, studies prescribed PRT during chemotherapy treatment, studies conducted outside the US, studies of shorter duration, studies in older cohorts, studies in which BCRL was an entry criterion, and studies of lower quality.

In summary, our primary results indicate that that PRT does not exacerbate measures of BCRL and may lower risk. PRT also improves upper and lower body muscular strength, and elicits a small improvement in HRQoL. No serious PRT-induced adverse events were reported in the studies reviewed. These findings are clinically relevant. Therefore, clinical practice guidelines should be updated to inform clinicians on the benefits of PRT in this patient group.

Limitations

Several limitations require careful consideration. First, our analysis of arm volume and patient-reported severity of BCRL outcomes was based on a limited number of studies (Fig. 3), and only three of these studies included a clinical diagnosis of BCRL as a participant entry criterion. Women without BCRL are unlikely to improve these outcomes, and therefore additional studies limited to women with BCRL are warranted. Second, we did not distinguish the affected and non-affected extremity in the assessment of upper body muscular strength outcomes. There is evidence that bilateral strength deficits may be incurred by breast cancer treatment [70], and future research is required to distinguish the effect of PRT on both the surgically treated and non-treated side. Third, there was heterogeneity with respect to the exercise prescriptions, including the level of supervision, training equipment, and training frequency and intensity (Table 1); training intensity was also not quantitatively defined in many studies. We did not investigate any dose–response effects in the present review; accordingly, the optimal dosages of PRT to adapt the specific outcomes in this patient group remain unknown and require further research.

Conclusion

Our meta-analytic results are sufficiently reliable to recommend that clinicians consider PRT for reducing the risk of BCRL and improving upper and lower body muscular strength and HRQoL outcomes in women treated for breast cancer.