Introduction

Current evidence suggests that upper extremity impairments from treatment for breast cancer can extend beyond the acute stages of recovery and may be considered a component of chronic illness [1, 8, 21, 43]. Investigators have found that a proportion of women treated for breast cancer continue to experience upper extremity functional limitations two or more years after treatment [18, 20, 26, 41]. Of particular concern relative to shoulder morbidity in the breast cancer patient are nodal dissection and radiotherapy.

Upper extremity lymphedema following breast cancer treatments is a well-documented phenomenon that has received considerable attention [7]. However, the magnitude of shoulder impairments as late effects from breast cancer surgery and radiotherapy independent of lymphedema has received considerably less attention. Moreover, most studies on shoulder impairments post-treatment for breast cancer do not account for pretreatment shoulder morbidity or control in some way for the effects of aging. Because both factors are related to long-term shoulder morbidity in the breast cancer population [16, 30, 42], prevalence data on shoulder morbidity may be inflated and potentially dismissed as weak evidence. Studies that attempt to control for selected covariates by conducting comparisons over time, between groups, or between affected and unaffected arms often report only P-values rather than magnitude of effects. The actual impact of breast cancer treatment on shoulder function cannot be ascertained from P-values alone. Further, the acknowledged variability of effects among patients may mask potentially important impairments for many individuals with relatively high P-values.

Both axillary lymph node dissection (ALND) and radiotherapy have been reported to affect long term shoulder function. Studies indicate that sentinel node biopsy (SNB) reduces shoulder morbidity as compared to ALND [19, 21, 31, 40], although both procedures may contribute to shoulder impairments [19]. Shoulder impairments following radiotherapy may occur after “latent periods” of several months to several years, with late reactions continuing beyond that period in some individuals [1, 10, 47]. Patients receiving axillary radiation (as opposed to chest wall radiation alone) are at higher risk for late arm morbidity [3, 35, 41]. Cheville and Tchou noted that failure to recognize lasting sequelae from treatment for breast cancer delays treatment referrals and may lead to greater long-term shoulder morbidity [7]. Inattention to such morbidity may be an issue for both individuals who have been treated for breast cancer and their health care providers. The purpose of this systematic review was to identify literature that reported or would allow assessment of the magnitude of late effects of breast cancer treatments on shoulder function 1 year or more after diagnosis, with a particular focus on lymph node dissection and on radiotherapy. Presentation of such data in one place will assist health care professionals who have periodic contact with individuals treated for breast cancer in understanding the extent to which late effects on the shoulder may affect such an individual.

Methods

Literature search

Eligible papers had to include breast cancer subjects at least 1 year post-diagnosis as the minimum criterion for late effects. Papers had to be published after 1980 and could not include pre-1980 radiotherapy to minimize the likelihood of including subjects receiving older forms of radiotherapy no longer meeting current standards. Outcomes had to include assessment of impairments or functional activities of the shoulder and, to the extent to which they can be separated, were not to include symptoms related to lymphedema. Pain was accepted as an outcome only if it was reported with functional activity. Quality of life outcomes were not considered because the study focus was to isolate the physical effects of shoulder morbidity. Studies required at least two comparison groups or comparisons to the unaffected side. Comparisons across time were acceptable only when baseline measures were performed preoperatively or pre-radiotherapy. Randomized controlled trials, prospective, retrospective and cross-sectional designs were considered acceptable. The available data had to include means and standard deviations (SDs) or 95% confidence intervals (CIs) for both groups, or group data that would permit computation of odds ratios if not reported by the authors.

Search engines used were PubMed, Medline, CINAHL, Cochrane, Health Source Nursing, Google, and Google Scholar. Search terms were limited to the title, abstract, or keywords and included combinations of breast neoplasm, breast cancer, shoulder, arm, scapula, or humerus, along with sentinel node, brachial plexopathy, pectoralis major, pectoralis minor, latissimus dorsi, rotator cuff, teres minor, late effects, ROM, range of motion, radiotherapy, and radiation. Retrieved abstracts were reviewed for possible inclusion. When warranted, full articles were obtained for review. All full articles were reviewed by the two authors. Any differences in opinion on eligibility were resolved by discussion. There was agreement on all papers that were finally accepted as eligible for inclusion in the systematic review. A methodological checklist was adapted from several sources to assess quality of the included papers [27, 33, 44]. Each eligible paper was subjected to methodological review using the checklist (Table 1).

Table 1 Methodological criteria check-list

Data analysis

When means and standard deviations were reported for groups, standardized effect sizes were calculated as (M 1 − M 2)/SD, where M is the mean for each group and the mean difference is standardized using the standard deviation of the referent or control group [34]. Although there is no consensus on the interpretation of standardized effect sizes [34], the guidelines proposed by Cohen were used; an effect size of 0.20 is considered to be small, 0.50 to be moderate and 0.80 to be large [9]. When data were dichotomous, odds ratios were computed to estimate effect size using frequency counts. If frequency counts were not available, reported proportions were used and so noted.

Results

The searches yielded a total of 375 citations from January 1980 through May 2008. After review of all abstracts, 88 papers available in English were retrieved for further examination. Of those, 22 were determined to meet eligibility criteria [3, 6, 1214, 16, 17, 19, 20, 22, 25, 2831, 33, 3740, 42, 45]. The results of the methodological review are shown in Table 1.

Overall breast cancer treatment effects on shoulder function

Four studies were found that permitted comparison of the affected shoulders of subjects treated for breast cancer to the same shoulder pre-treatment [33], to the uninvolved shoulder [30] or to subjects without breast cancer [36, 37]. In these studies, specific treatments were not evaluated; rather, the effects of all treatments were assessed. Effect sizes are reported in Table 2. For two of the studies, standardized effect sizes were computed from available data [30, 33]. For one study, crude odds ratios and 95% CIs were computed for each of the presented age groups and are reported along with the adjusted odds ratios given by the authors [36]. For one paper, the authors reported odds ratios and 95% CIs for each age group using ridit analyses for ordinal data and logistic regression [37]. In Table 2, the effect size represents the additional morbidity associated with treatment for breast cancer compared to the referent of pretreatment condition, the untreated arm, or subjects without breast cancer. All data show increases of varying magnitudes in morbidity with breast cancer with the exception of selected tasks in the 75–84 year old group in the Satariano et al. study [37].

Table 2 Breast cancer compared to non-breast cancer shoulder outcomes

Magnitude of effect on shoulder function from ALND

Twelve studies included shoulder morbidity data for subjects who were randomized to or underwent either ALND or SNB (or no axillary dissection) [6, 12, 19, 20, 22, 25, 29, 31, 33, 39, 40, 45]. As shown in Table 3, crude odds ratios and 95% CIs were estimated from available data from eight studies [6, 12, 19, 20, 22, 39, 40, 45], while standardized effect sizes were computed for two [29, 31]. The SNB group was considered the referent group with the exception of Lash and Sillman [20] and Caban et al. [6] where the referent to ALND was no axillary surgery. Each of these effect sizes represents the excess shoulder morbidity of ALND over the referent. Mansel et al. [25] reported 95% CIs rather than SDs so effect sizes could not be computed. The magnitude of changes after 1 year in the affected arm for each group are reported in Table 3. Reitman et al. [33] reported the means and SDs of change scores for each group after 2 years (Table 3) but did not include the SD for preoperative (baseline) measures. If the SD of the change score for the SNB group is used in lieu of the SD of the SNB group’s preoperative measures, the standardized effect sizes for the difference between groups in reduction from preoperative to 2 year values can be obtained. Using this strategy, the standardized effect size for reduction in ranges of shoulder abduction and abduction/external rotation and for reductions in grip strength are 0.73, 0.46 and 0.50, respectively, with ALND showing higher morbidity.

Table 3 Axillary lymph node dissection compared to sentinel node biopsy or no axillary dissection

Most of the data in Table 3 show an increase in morbidity with ALND with at least selected functions. Caban et al. [6] did not demonstrate an effect on flexion range of motion. Lash and Sillman [20] found a paradoxical protective effect in both the crude and adjusted odds ratios of ALND compared to no nodal dissection on self-reported decline in one or more of three tasks compared to preoperative status. Schulze et al. [40] found a slight protective effect of ALND compared to SNB when self-reported loss of mobility was assessed, although objective data showed a strong positive association for reduced abduction range of motion.

Magnitude of chest wall and axillary radiotherapy effect on shoulder function

Ten studies were found that permitted one or more comparisons of shoulder morbidity between radiotherapy and no radiotherapy, or between differing radiotherapy fields [3, 6, 13, 14, 16, 17, 20, 28, 39, 42]. Five of the papers included comparisons of a subject group with any radiotherapy (or unspecified radiotherapy) to a group without radiotherapy [3, 6, 13, 14, 20]. Three papers compared chest wall radiotherapy alone to a group without radiotherapy [17, 39, 42], while four compared a group with chest wall radiotherapy to a group with both chest wall and at least axillary radiotherapy [16, 17, 39, 42]. One paper compared a group with multiple radiation fields including the full axilla to a group with multiple radiation fields but only at the apex of the axilla [28]. Johansson et al. [17] also compared a group with chest wall and axillary radiotherapy to a group with no radiotherapy. For eight of the studies, effect sizes were computed from published data or data provided by the authors (Table 4). Standardized effect sizes were computed when appropriate measurement data were available (two studies); crude odds ratios and 95% CIs were estimated when only frequencies were available (seven studies). Adjusted odds ratios obtained via logistic regression were reported in five studies and are included in Table 4 [6, 13, 16, 28, 39]. For all calculated effect sizes, the referent was the no radiotherapy group (compared to a radiotherapy group), or the chest wall radiotherapy group (compared to a chest wall and axillary radiation group). The effect sizes, therefore, represent the excess shoulder morbidity associated with radiotherapy or more extensive radiotherapy. The only exception to this generalization is the adjusted (logistic) odds ratio reported by Caban et al. [6]. Their odds ratio and 95% CI represents the reduction in morbidity (protective effect) associated with not having radiotherapy because their referent was the radiotherapy group. Unless otherwise noted in Table 4, the extent of radiotherapy (inclusion of the axilla or additional fields) was not specified.

Table 4 Radiotherapy compared to none or chest wall and axillary radiation compared to chest wall radiotherapy alone

The majority of studies in Table 4 show increased morbidity associated with more extensive radiotherapy in at least some of the assessed functions. The exceptions are Johannson et al. [17] whose data showed some small but protective effects of chest radiation compared to no radiation on selected range and strength measures, and the Lash and Sillman [20] data where the crude odds ratio showed a protective effect of radiotherapy compared to no radiotherapy on decline in upper body function. The adjusted odds ratio reported by Lash and Sillman, however, showed a small excess of morbidity associated with radiotherapy.

Discussion

Review of Tables 2, 3, and 4 show, with the few noted exceptions, that there is an excess morbidity associated with treatment for breast cancer with ALND compared to SNB, and with radiotherapy or more extensive radiotherapy. However, the effect sizes vary dramatically from small standardized effect sizes (0.20 or less or odds ratios near 1.0) to substantial standardized effect sizes well in excess of 0.80 and odds ratios of 2.0–3.0 or more. Moderate to large effects predominate, especially where abduction and flexion ROMs were reported. This review would largely appear to reinforce the conclusion of Blomqvist et al. [3] that shoulder morbidity, while most evident at the individual level, is sufficient to be seen at the group level. The variations in magnitude and in the size of standard deviations and confidence intervals also reflect the variations in morbidity that exist between studied individuals.

Variability in shoulder morbidity effects may be attributed in part to the diversity of outcome measures and the diversity of methods by which even similar outcomes were assessed and reported. In ten studies, patient self-report of loss of strength, range of motion (ROM) or functional ability were used [11, 13, 14, 20, 22, 36, 37, 39, 40, 45]. In these instances, data were either dichotomized or categorized. When categorical data were collected, data were collapsed by the study authors or by the authors of this paper into impairment as present or absent because of small cell numbers. Only the ridit analyses of Satariano et al. [37] maintained categories of effect within a functional limitation. Two studies subjectively assessed observed impairment and dichotomized the outcomes [6, 16]. Of the studies that measured ROM objectively, seven reported actual ranges [3, 17, 25, 2931, 33], while five dichotomized their findings [13, 19, 22, 40, 42]. Ranges found to be 10°, 20° or 10% less than the contralateral limb or full ROM were considered to be impaired. This is a strategy similar to that used in other studies [5, 15, 43, 46]. Box et al. and Voogd et al. [5, 46] specifically addressed losses of 20° from preoperative or contralateral measures as associated with decreased function. Some variability in effect size can be attributed to estimates of crude as opposed to adjusted odds ratios. In six of the studies, odds ratios computed from logistic regression indicated morbidity associated with radiotherapy after adjusting for other covariates including number of nodes dissected, or ALND adjusted for other covariates including radiotherapy. Another source of variability in effect may be the different follow-up periods. Caban et al. [6] used a 12 month follow-up; however, they suggested that this observation period may be insufficient to see the full impact of radiation-induced fibrosis and recommended a follow-up period of 5 years to show the late effects of radiotherapy. Lastly, self-reported estimates of morbidity may be affected by individual expectations associated, for example, with age and by the individual’s ability to adapt over time to limitations [1, 37, 38].

The methodological checklist was not intended to be a quantitative assessment; however, it included 20 possible checks for randomized controlled trials and 19 for other designs. Of note was that only two studies achieved 16 checks [12, 30]. While thirteen studies obtained fewer than 10 checks (see Table 1). Given the variability in magnitudes of effect for ascertainment of morbidity and in methodological quality among these papers, it was determined that attempts to summarize the data using meta-analytic techniques were inappropriate. Rietman et al. [32] came to a similar conclusion in their systematic review of late morbidity after treatment for breast cancer.

Most of the studies included in this review followed subjects out for 3 years or less. Bentzen et al. [2] found that time to expression of 90% of the ultimately expected damage from radiotherapy was 3.7–4.2 years depending on the clinical covariates in the model (although radiotherapy techniques in their subjects may have varied from current standards). Fathers and Thrush [10] encountered symptoms of brachial plexopathy in the affected limb 8–20 years after radiotherapy. Because many of the studies included in this review assessed subjects only 1–2 years post-diagnosis or treatment, it may be that morbidity will continue to increase over time for some percentage of the subjects who received radiation. No reference to potential time effects of morbidity related to ALND or SNB were found in the literature although increases over time cannot be ruled out.

Current and future shoulder morbidity may be linked to radiation-induced changes. Bentzen et al. [2] hypothesized that damage to the pectoralis major muscle from radiotherapy was a significant factor in shoulder movement impairment even without clinically detectable subcutaneous tissue fibrosis. Shamley et al. [41] found that pectoralis major and minor muscles decreased in size on the affected side in a series of 57 breast cancer patients from 6 months to 6 years post-surgery, most of whom had chest wall radiation. However, surgery in the vicinity of the pectoralis major or minor may also create fibrotic changes with healing. Fibrosis in the pectoralis major and minor may be a factor in range of motion limitations which may also increase the risk for shoulder impingement syndrome or rotator cuff tears [23]. Ludewig and Cook [24] found that subjects with symptoms of impingement showed greater scapular anterior tipping, decreased upward rotation, and increased scapular medial rotation under load conditions. Borstad and Ludewig [4] found that subjects with a clinically determined short pectoralis minor muscle demonstrated scapular kinematics similar to those found for subjects with shoulder impingement. Decreased scapular upward rotation and increased anterior tipping may be critical in limiting adequate clearance for the rotator cuff tendons. Ludewig and Cook [24] hypothesized that even small limitations in scapular motion (4–6°) may be clinically important given the small size of the suprahumeral space, potentially contributing to initiation or progression of shoulder impingement symptoms. These same limitations to scapular motion are likely to reduce available shoulder range of motion given that the scapulothoracic joint contributes 60° to the total shoulder range of flexion and abduction [23]. Consequently, even the small decreases shown in motions like shoulder abduction and flexion among subjects who have had breast cancer may place these individuals at increased risk for shoulder impingement or rotator cuff problems over time. The magnitude of risk may be hypothesized to increase with increases in motion restrictions.

Limitations

Numerous factors prevent precise determination of the contribution of breast cancer treatments to impaired shoulder function. Tables 2, 3 and 4 and the discussion highlight several of these factors, but are not all inclusive. Bentzen and Dische and Langer et al. [1, 19] provide a comprehensive review of confounding and interactive factors that affect study results related to determination of post-treatment shoulder morbidity, including the lack of a uniformly accepted system for recording and grading shoulder morbidity that limits comparisons across treatment types and across studies.

Conclusion

This systematic review demonstrates the magnitude and the variability in effect sizes of shoulder morbidity attributable to treatment for breast cancer, with a focus on ALND and radiotherapy. Although mathematical aggregation of data was not attempted, the large majority of studies indicated increased shoulder impairments as a late effect from these treatments, with effects varying from small to substantial. Given the functional limitations that are described and the hypothesized relation between even minor limitations to shoulder motion and the potential for shoulder impingement or rotator cuff tear, this review supports the importance of health care professionals routinely asking patients about shoulder function, assessing function, and referring patients for remediation when even subtle limitations are present. Chirikos et al. [8] found that a breast cancer group was more likely to experience adverse economic outcomes compared to age-matched controls, with work-related differences narrowly missing statistical significance. This observation suggests that economic benefits of interventions that minimize or prevent problems are potentially high. The authors of this paper also concur with the recommendations of other investigators that use of selected valid and reliable shoulder function measures is required to enable comparisons across studies and to determine meaningful conclusions. Much work remains to be done to confirm the need to attend to late effects of breast cancer treatments on shoulder function.