Introduction

Disease-free cancer survivors utilize over twice the healthcare resources consumed by their demographically and comorbidity-matched peers. [1,2,3] Limited understanding of the factors that drive utilization among survivors represents a problematic knowledge gap in light of their precipitously increasing numbers; by 2040 there will be over 26 million US cancer survivors [4]. Addressing preventable, low value, healthcare utilization is vital to sustainable cancer care; however, this need is particularly pressing for breast cancer survivors who comprise roughly 25% of the survivor population [5].

Adverse, chronic late effects of cancer treatment may contribute importantly to survivors’ healthcare consumption. Late effects are prevalent, affecting 2/3 of survivors, and heterogeneous, ranging from gait instability from peripheral neuropathy to chronic pain. Late effects have been causally linked to costly medical and functional morbidity [6,7,8]. Moreover, better late effect treatments may be a means of reducing utilization. Late effects are inconsistently detected and addressed, despite the availability of evidence-based strategies to screen for, prevent, and mitigate them [9, 10].

Breast cancer–related lymphedema (BCRL), as one of the most studied late effects, serves as a useful exemplar in estimating the contribution of late effects to cancer survivors’ overall healthcare consumption. BCRL is also of interest because it offers a pragmatic instance in which broader coverage of guideline-concordant care could potentially reduce costs. The adverse consequences of BCRL have been described; chronic pain, functional limitations, job loss, and mood disorders, as have effective strategies to prevent these poor outcomes through prevention and management practices [9, 11]. These practices are inconsistently covered by payers and therefore limitedly available to many BC survivors [12]; fee-for-service Medicare does not cover compression garments, the mainstay of BCRL treatment.

A paucity of lymphedema-related economic analyses has been previously highlighted. [13] Two reports described increased utilization during the 2 years following surgery. One noted that survivors with BCRL incurred $7300 more healthcare costs annually, in 2006 dollars, over the first 2 years following BC diagnosis [14]. The second identified a 5-fold increase in hospitalizations among survivors with complicated BCRL—defined as having been hospitalized for a diagnosis of lymphedema complications or complications thereof [15]. These reports suggest that cost shifting away from reactive measures initiated after a survivor develops BCRL toward proactive, evidence-based BCRL screening, prevention, and early treatment may be cost neutral or even saving. Anecdotal reports of models that advance early intervention approaches are promising [13]. However, determination of the duration and robustness of increased healthcare utilization among survivors with BCRL is required to support this case. This paper reports the results of a population-based, longitudinal cohort study that was designed to estimate the incremental healthcare utilization associated with BCRL over 2 decades after BC diagnosis in order to support the case for more proactive, anticipatory care. The study additionally assessed the impact of body mass index (BMI) on these differences given robust reports of its association with BCRL progression [16, 17].

Methods

Subjects

Subjects were identified through the Rochester Epidemiology Project (REP), a medical records-linkage system that has for more than 50 years identified and tracked virtually all healthcare services provided to residents of Olmsted County, MN by Olmsted County-based providers [18, 19]. The population counts obtained by the REP Census are similar to those obtained by the US Census, indicating that virtually the entire population of the county is captured by the system [20]. We used the REP Census to identify all persons who resided in Olmsted County and had granted permission to use their medical records for research, over 98%. The REP electronic indices were searched to identify BCRL-relevant ICD-9, CPT, and HCPCS codes associated with the encounter, as well as encounter date and time. Finally, the REP also tracks the vital and residential status of all Olmsted County residents, and REP data were used to censor the study population on date of death or date of last follow-up [20].

Identification of incident breast cancers in Olmsted residents

To identify incident BC cases diagnosed from 1990 to 2010, we used (1) a preexisting cohort of patients diagnosed with incident BC between 1/1/1990 and 12/31/1999 [21]; (2) the Mayo Clinic Cancer Registry (MCCR) which includes all individuals evaluated and/or treated for cancer at the Mayo Clinic (MC); and (3) the REP’s searchable index of all assigned CPT, HCPCS, and ICD codes. We then cross-referenced the entire 1990–2010 REP BC cohort with BC cases listed in MCCR and linked to Olmsted County zip codes from 1/1/1990 to 12/31/2010.

Ascertainment of BC status and abstraction of BC clinical information

The records of all identified individuals were reviewed by a trained nurse abstractor and BC clinical data were entered into computerized case report forms. BC index dates were defined as the date when a new primary breast cancer was pathologically confirmed. Individuals who did not receive a pathological diagnosis of BC, generally because workup results would not alter clinical management due to advanced age, dementia, or comorbidity burden were not included in the cohort. Abstracted clinical and treatment information included affected side, pathology type and grade, tumor size, the number of lymph nodes removed/the number that were positive as well as the type of breast, axillary, reconstructive surgery, radiation fields, and the nature of any chemo- or hormonal therapy. BC Stage was defined per the 2010 American Joint Committee on Cancer system [22] with uncertainties being resolved through review by a BC medical oncologist with over 30 years of experience (TM). If the identified BC was found to be a recurrence of BC that had been diagnosed prior to 1990, the patient was removed from the cohort. However, patients with a previous history of BC who were diagnosed with a new primary BC during the study period were included.

Ascertainment of BCRL status

Three strategies were used to identify cohort members who developed BCRL.

First, we searched for all instances, since index, in which the BCRL-specific ICD-9 code, 457.0, had been assigned to a cohort member. Other lymphedema diagnostic codes, 457.1 and 757.0, are not unique to the arm and secondary lymphedema, respectively, and were not utilized.

Second, we used the resources of the REP and the textual search tools of the MC and OMC electronic health records to review documentation from all clinical encounters for evidence of BCRL from index through December 31, 2017. Text words indicative of BCRL were identified by reviewing the charts of 100 patients with known BCRL for > 5 years and by conducting focus groups with BCRL clinical experts. Electronic search terms included lymphedema, edema, edematous, swollen, swelling, puffy, puffiness, heavy, heaviness, erythema, aching, tingling, pain, and pitting. All documents, irrespective of discipline or clinical context, describing clinical encounters, test results, or communications containing one or more of the search terms were manually reviewed for reference to arm or hand swelling, altered limb contour, generalized heaviness, or altered tissue texture or quality ipsilateral to the side of BC treatment, as outlined in Common Toxicity Criteria v.3.0 [23]. If documentation described a co-occurring condition that could produce edema or arm symptoms then the instance was coded as “possible” BCRL. Arm swelling and symptoms that occurred in the absence of an alternate etiology were considered “probable BCRL.” “Definite” instances were those documented by a lymphedema or BC specialist. A “definite” BCRL classification was also assigned to cohort members that had ≥ 3 instances of “probable” BCRL. Agreement among the four RN abstractors in reviewing a random sample of 100 documents that contained words suggestive of BCRL was excellent as reflected in a Fleiss’ Kappa of 0.85 [24].

The third BCRL ascertainment strategy consisted of a robustly validated BCRL screening questionnaire being mailed to all surviving cohort members (n = 1204). The questionnaire has a reported sensitivity > 0.86 and specificity > 0.69 for BCRL detection [25], and has been used for BCRL assessments in numerous epidemiological studies and clinical trials [26,27,28].

Discordances were assessed with manual review by an abstractor naive to the subject and blinded to the subject’s survey/ICD-9-based BCRL status and BC characteristics. Instances of persistent uncertainty were adjudicated by a BCRL physician specialist with 20 years of clinical experience (AC), who was similarly blinded to cohort members’ BC characteristics. Surviving cohort members whose status could not be definitively determined were telephonically interviewed.

Since BCRL is incurable [16, 29], we considered cohort members who developed BCRL to have been BCRL positive since their index date. Specifically, we did not specify a BCRL index date. Our rationale was two-fold. First, roughly 90% of BCRL cases develop during the first 24 months after treatments [30,31,32]. Second, incipient BCRL is not consistently detected with volume-, circumference-, or PRO-based screening [33], making the determination of a specific BCRL index date a theoretical rather than practical exercise.

Ascertainment of weight and BC recurrence status

Cohort members’ weights and heights were electronically available through the MC health record beginning in 1999; weights prior to 1999 and all OMC weights were manually abstracted from paper records. Outlying measurements, defined as a > 30% change from the next earlier or later weight, were identified and the records manually reviewed. If multiple weights were available in a single year, the average was used in analyses.

BC recurrences through December 31, 2017, among cohort members were ascertained by manually reviewing the records of those who either died during the study period or were assigned either a diagnostic code suggestive of BC recurrence or a billing code for breast or axillary surgery, radiation, or chemotherapy following completion of their primary BC treatment.

Post-index healthcare utilization

HC utilization charges from January 1, 1995, when billing codes first became available through the REP, or index date, if after 1995, through December 31, 2017, were characterized using the well validated and widely used Berenson-Eggers Type of Service (BETOS) categories [34, 35]. BETOS codes are assigned for each HCPCS procedure code. They were devised as readily understandable clinical categories for analyzing increases in Medicare expenditures. BETOS categories include (1) Evaluation and management, (2) Procedures, (3) Imaging, (4) Tests, (5) Durable medical equipment, (6) Physical/occupational therapy, (7) Other, and (8) Exceptions/Unclassified.

Statistical analysis

BETOS counts were aggregated by survivorship year of post-index follow-up. The primary analysis used all BETOS codes. Secondary analyses used specific BETOS categories. Analysis was based on Poisson regression. To account for subject-level correlation, generalized estimating equations (GEE) were used, assuming an autoregressive error pattern [36]. Model covariates included gender, age, cancer stage, breast and axillary surgery, and adjuvant hormonal therapy, chemotherapy, or radiation therapy. Splines were used to model the continuous covariates of BMI and age. Exploration of the model’s goodness of fit included use of other correlation structures (exchangeable and working independence), of zero-inflated models to account for subjects with no follow-up costs, and of over-dispersed Poisson and negative binomial models to account for possible excess between-patient variation.

Sensitivity analyses were used to evaluate the impact of BCRL ascertainment method, utilization related to BC recurrences, changes in BC care over time, and common BCRL sequelae (e.g., cellulitis). Models were constructed that (1) used BCRL status ascertained per ICD-9 code and mailed survey; (2) censored members with recurrent BC; (3) included only members with BC index dates after 2000 and 2005 when use of sentinel lymph node biopsy had become increasingly standardized; and (4) excluded years when a cohort member was assigned an ICD-9 code for cellulitis or local infection. To address the issue of potential ascertainment bias, we used the method of inverse probability weights to create causal models of utilization with BCRL. [37] First, a model was constructed with BCRL as the outcome and predictors of age, sex, Charlson score, cancer stage, and cancer treatments as predictors. From this IPW weights were constructed that balance the BCRL and non-BCRL subjects with respect to the predictors. Last a model of utilization using the IPW weights was constructed. All models were fit using the R statistical packages [38].

Results

REP BC cohort

A total of 1906 Olmsted County residents who developed incident BC from 1990 to 2010 were identified. Of these, 1800 were assessed as 25 (1.3%) had not granted permission for use of their health records and 81 (4.2%) had no recording of additional HC encounters. Table 1 describes cohort members’ characteristics at BC index, broken down by those identified as BCRL(+) and BCRL(−). An average of 86 individuals entered the cohort each calendar year (range 45 to 118). Members contributed a mean of 12.8 years of follow-up per survivor (median, 11, range 1–25 years). BC recurrence, among those who received curative treatment, was detected in 163 cohort members (9%). Prior to December 31, 2017, 483 cohort members had died and 78 had moved out of Olmsted County, MN, USA. When follow-up was completed, 1239 cohort members were alive and resided in Olmsted County.

Table 1 Demographic and clinical characteristics of the Rochester Epidemiology Project Incident Breast Cancer cohort

BCRL

BCRL was identified in 253 (14.1%) cohort members. BCRL incidence was similar as ascertained by the screening survey. Of the 1204 surviving cohort members who were mailed the survey, 785 responded (65.2%) but 65 surveys could not be scored. Of the 720 scored survey respondents, 114 (15.8%) were BCRL(+). Among cohort members with discrepant BCRL status determination by survey versus the adjudicated chart and administrative coding, 89% had breast or truncal lymphedema which was not included in the study.

Body mass index

The majority of cohort members, 71.7%, had an annual BMI for all years following index. Among those missing at least one annual BMI measurement, only 13.1% lacked one for more than two consecutive years. On average, cohort members’ BMI increased by 3.4 (paired t test statistic = 35.2, p < .001) from index to last follow-up. However, considerable variation was noted in patterns of weight loss and gain with 17.9% experiencing BMI increases of > 5, 3.8% > 10, and 1.2% > 15. The maximal BMI at any time during follow-up for the cohort was as follows: < 19 1.6%, 19–25 18.8%, 25–30 28.6%, 30–35 19.4%, 35–40 11.5%, and > 40 8.2%.

Utilization of healthcare services following index

Table 2 presents output from the multivariate model of aggregated BETOS codes and shows that patients with BCRL used an average of 31% more services, excluding the first year after index although the difference utilization between BCRL(+) and BCRL(−) survivors diminished over time.

Table 2 Multivariate model output with estimates of the association of healthcare utilization with cohort members’ demographic, breast cancer, and treatment characteristics as well as BCRL status

A number of BC clinical and treatment characteristics (age, BMI, gender, receipt of chemotherapy, and BC Stage) were significantly associated with increased utilization. Figure 1 presents a forest plot with estimates from multivariate models of BETOS codes for evaluation and management, imaging, procedures, and physical therapy. While the finding of increased service utilization among BCRL(+) cohort members was robust across all BETOS codes, the magnitude of the increase varied and was most marked for physical and occupational therapy.

Fig. 1
figure 1

Forest plot of the magnitude of the BCRL coefficient and related increased utilization in different BETOS categories

Healthcare utilization was significantly higher among cohort members as their BMIs increased. Figure 2 presents a line graph of total HC utilization, reflected by aggregated BETOS codes (procedures, imaging, DME, etc.), as a function of increasing BMI among cohort members with and without BCRL. The slope of the BRCL(+) and BCRL(−) lines are parallel as they ascend to the right and left of the nadir at BMI = 22 indicating the absence of an interaction between BMI and BCRL status. The widening of the interval for BMI > 40 and < 20 reflects the lesser representation of these BMI values in the cohort. The 95% confidence interval was identical for BCRL(+) patients, making it impossible to rule out a potential interaction between BMI and BCRL at the higher and lower BMI values. Of note, to the right of BMI = 24, the slope increased by 0.01, while to the left of BMI = 24 the slope increased by 0.10. These changes equate to a 1% and 12% increase in utilization counts per unit increase and decrease in BMI, respectively.

Fig. 2
figure 2

The line graph plots BMI against total BETOS counts per year of survivorship. The parallel slopes of the BCRL(+) and BCRL(−) lines to the right and left of BMI = 22 indicate the absences of an interaction between BMI and BCRL status. The dotted lines represent the confidence interval for the BCRL(−) line

Sensitivity analyses

The significance of the BCRL coefficient was robust in all sensitivity analyses. BCRL coefficients were 0.22 (25% increase) and 0.31 (36% increase) with survey- and code-based BCRL ascertainment, respectively; 0.24 (27% increase) with censoring after BC recurrence; 0.28 (32% increase) and 0.18 (20% increase) with restriction to BC index dates post 2000 and 2005, respectively; 0.27 (31% increase) with elimination of years when infection codes were assigned; and 0.32 (36% increase) in the models that account for Charlson index via IPW weights. All coefficient p values were < 0.001.

Discussion

This assessment of HC utilization in our population-based cohort of 1800 BC survivors yielded a number of significant findings. First, our assessment confirmed that survivors with BCRL use substantially more (> 30%) healthcare services annually. Second, the finding of increased utilization extended across all aspects of healthcare utilization and cannot be solely attributed to care delivered to address lymphedema. Third, increased utilization among BCRL(+) survivors, while it gradually lessens, persisted for at least 10 years. And fourth, increased BMI was also associated with higher utilization among BC survivors, but did not interact with BCRL in driving healthcare utilization.

The impact of chronic late effects of cancer treatment on survivors’ long-term healthcare utilization remains under-researched. Our results suggest that the increased utilization noted among survivors with BCRL in the first years after BC diagnosis [14], persists for at least a decade, and that addressable late effects may contribute importantly to the higher utilization noted among BC survivors. Obesity has also been limitedly examined as a determinant of cancer survivors’ healthcare utilization [39,40,41], yet our findings suggest that obesity may also be an addressable cause of increased utilization. The lack of models for testing and hypothesis generation that describe how survivors’ late effects may mediate utilization represents a problematic knowledge gap.

This study’s strengths include its use of a large (N = 1800) population-based cohort, virtually complete review of post-index healthcare encounters, rigorous adjustment for BC characteristics, and comprehensive utilization capture across all payers and providers. The study’s limitations include the fact that cohort members were not prospectively screened for BCRL; mildly affected individuals or those who died before developing BCRL may have been erroneously considered BCRL(−). However, systematic screening would not eliminate this potential bias as marked differences in BCRL incidence rates have been reported with volumetric and circumferential diagnostic criteria [42], and no screening approach has an area under the curve exceeding 0.83 [33, 42]. The relevance of our findings could be questioned in light changes in BCRL and BC treatment that have occurred during the 21-year follow-up interval. However, complex decongestive therapy has remained the BCRL treatment standard throughout this period [43], and the BCRL coefficients were limitedly changed in models that included members with BC index dates after 2000 and 2005.

Ascertainment bias is a concern since cohort members with frequent healthcare encounters had more opportunities to switch from the default BCRL(−), to BCRL(+) status. Several factors indicate that this bias did not impact our results. First, model BCRL coefficients were only slightly lower in the sensitivity analysis that used BCRL ascertained by the survey, which would be utilization independent and diminished by survivor bias. Second, the model BCRL coefficient increased with propensity score adjustment. Last, the cohort’s BCRL incidence replicates robust prior reports.

Conclusion

BCRL is associated with persistently increased utilization of all service types for at least a decade following BC diagnosis. BCRL appears to be in important driver of survivors’ healthcare utilization and guideline-concordant activities to reduce its incidence and severity may be cost neutral or saving.