Introduction

Joint estrogen receptor (ER) and progesterone receptor (PR), or hormone receptor (HR), status and human epidermal growth factor receptor 2 (HER2) overexpression serve as the basis of four well-established molecular subtypes of breast cancer: HR+/HER2−, HR+/HER2+, ER−/PR−/HER2−, or triple-negative (TN), and HR−/HER2+. HR+/HER2− breast cancers have the best prognosis, while TN cancers have the worst prognosis, in terms of 4-year breast cancer specific survival [1, 2]. Additionally, while HR−/HER2+tumors are generally more aggressive, they have targeted therapies available that can improve survival outcomes [3].

Black, Hispanic White, Pacific Islander, and American Indian/Alaska Native women are 20–60% more likely than non-Hispanic White women to be diagnosed with advanced stage tumors [4], and have 1.4 to 2.4 times increased risk of breast cancer-specific mortality [5], with Black women in particular having significantly worse survival than women of other racial/ethnic groups [6].

Previous population-based studies have found significant disparities between Black and White women in overall and breast cancer-specific mortality among select stages and subtypes of breast cancer, but not among all stages and subtypes [7, 8]. Specifically, these studies found that Black women had worse cancer-specific survival[7] and a higher risk of cancer-specific mortality[8] in stage II and III of the HR+/HER2− subtype [7, 8], stage II of the HR+/HER2+ subtype [7], stage IV of the HR−/HER2+ subtype [7], and stages II [7] and III [8] of the TN subtype. These studies, however, focused exclusively on comparing Black women with White women, and did not include other racial/ethnic groups. Using the Surveillance, Epidemiology, and End Results (SEER) registries, which cover about 35% of the US population as a whole and have documented HER2 status since 2010 for breast cancer cases, the present study expands these findings by examining disparities in risk of breast cancer-specific mortality across multiple racial and ethnic groups. As women of different racial/ethnic backgrounds may have differences in tumor-related factors, experiences with the health care system, socioeconomic factors, and stress over the life-course that can also impact breast cancer-specific mortality [9], we hypothesized that the risk of breast cancer-specific mortality would be different for women of racial/ethnic minorities than for non-Hispanic White women across multiple subtypes and stages at diagnosis.

Methods

Study population

The SEER program is comprised of 18 population-based cancer registries (Atlanta, Connecticut, Detroit, Hawaii, Iowa, New Mexico, San Francisco-Oakland, Seattle-Puget Sound, Utah, Los Angeles, San Jose-Monterey, Rural Georgia, Alaska Native Tumor Registry, Greater California, Greater Georgia, Kentucky, Louisiana, and New Jersey) that together cover 34.6% of the U.S. population, collecting information on all cancer cases diagnosed in their corresponding geographic regions [10]. In each of the SEER registries, patient information is extracted from medical records for each case and de-identified information is submitted to the SEER database [11]. SEER collects vital status from the National Center for Health Statistics and provides a cause-specific death classification using an algorithm that considers cause of death, tumor occurrence, original tumor site, and other comorbidities [12]. With the 2018 SEER data submission, follow-up for vital status is available through 12/31/16 [13]. SEER has documented the status of two HRs, estrogen receptor (ER) and progesterone receptor (PR), since 1990 and has captured HER2 receptor status beginning in 2010. SEER defines, and provides a variable for, four breast cancer subtypes by joint HR and HER2 status: HR+/HER2−, HR+/HER2+, HR−/HER2− (TN), and HR−/HER2+ [14] These data are publicly accessible after the submission of a signed data use agreement to the SEER program, with treatment data also publicly accessible after submission of an additional data use agreement. Data on surgery type, chemotherapy, and radiation therapy are available, but data on hormonal treatment and other targeted therapies are not included in current SEER data releases.

In this study, de-identified data on patient demographics, cancer characteristics, treatment, outcomes as of 12/31/2016, and survival time in months was collected from the SEER database [13] using SEER*Stat software [15] for women diagnosed with a first primary invasive breast cancer between 2010 and 2016 who were 20–85 years of age at diagnosis. A total of 343,499 cases meeting inclusion criteria were identified. Cases with missing follow-up information, unknown race, unknown molecular subtype, unknown stage, stage 0, or stage NOS were excluded (n = 34,474). Cases with < 1 month of follow-up information (n = 5506), missing data on definitive local treatment (n = 8936), missing data on insurance status (n = 4141), or missing cause of death (n = 397) were also excluded from analyses resulting in a final analytic set of 290,045.

Race/ethnicity

Race/ethnicity was divided into five mutually exclusive groups: Non-Hispanic White, Black, Hispanic White, Asian/Pacific Islander, and American Indian/Alaskan Native. Less than two percent of Black women in this sample were also classified as Hispanic (n = 398) so non-Hispanic Black and Hispanic Black women were combined in the same group. Detailed Hispanic subgroups, restricted to Hispanic White women, included Cuban, Dominican, Mexican, Puerto Rican, South/Central American, and other specified Hispanic origin. Data on Hispanic ethnic subgroups were only available for cases diagnosed between 2010 and 2015, with follow-up through 12/31/2015, so analyses for Hispanic ethnic subgroups are restricted to those years. Detailed Asian/Pacific Islander subgroups are available from all registries and include Korean, Chinese, Japanese, Filipino, Pacific Islander, Southeast Asian, Indian Subcontinent, and other specified Asian ethnicity.

Statistical analysis

Unadjusted Kaplan–Meier survival curves were used to summarize differences in survival between racial/ethnic groups for each subtype and a stratified log-rank test was used to compare curves within strata of stage. Multivariable-adjusted Cox proportional hazards models were fit to estimate hazard ratios (HRs) and their associated 95% confidence intervals (CIs) for the association between race/ethnicity and breast cancer-specific mortality, with Non-Hispanic White women serving as the reference group in all models. Separate models were fit for each stratum of subtype and stage. Cases who remained alive were censored at their month of last known follow-up. Models for Hispanic White subgroups excluded women who were flagged as Hispanic by surname match alone or who were Hispanic NOS (n = 410).

Effect modification by age at diagnosis and insurance status (uninsured/any Medicaid, insured/insured—no specifics) was assessed through likelihood ratio testing, Four different models were fit to adjust for different sets of confounders, adding in additional covariates with each model. Model 1 adjusted for age at diagnosis (as a continuous variable) and year of diagnosis (as a categorical variable). Model 2 adjusted for the covariates included in Model 1, and tumor grade (1, 2, 3/4, unknown). Model 3 adjusted for the covariates included in Model 2 and receipt of chemotherapy (yes, no/unknown) and definitive local treatment (breast-conserving surgery with radiation, total mastectomy with or without radiation, other). Model 4 adjusted for the covariates included in Model 3 and insurance status (uninsured, any Medicaid, insured). The county-level proportion of individuals living below the poverty level in 2010 was examined as a potential confounder but it did not change estimates by more than 10% and was not included in the final models.

The proportional hazards assumption was assessed by testing the correlation of the scaled Schoenfeld residuals and ranked failure time and by examining log–log Cox adjusted survival curves, and no strong evidence of violation of the assumption was found. Cells with fewer than five breast cancer-specific deaths are not shown. P-values less than 0.05 were considered significant, and all hypothesis tests were two-sided. All analyses were completed using Stata SE 15.0 software.

Results

In this sample, non-Hispanic White women were on average older and Hispanic White women were on average younger (means = 60.8 and 55.7 years, respectively) compared to women of other racial/ethnic groups (Table 1). Among non-Hispanic White women, the proportions diagnosed with HR+/HER2− disease, diagnosed at earlier stages, diagnosed with lower grade disease, not treated with chemotherapy (or with unknown chemotherapy status), treated with breast-conserving surgery with radiation, and insured were all greater than the equivalent proportions among other groups. Black women were more likely to be diagnosed with TN breast cancer, to be diagnosed at later stages, to be diagnosed with grade III or IV disease, to not have received definitive local treatment, and to reside in counties where larger proportions of the population have household incomes below the poverty level. Hispanic White women were less likely than Black women, but more likely than non-Hispanic White women, to be diagnosed at later stages or with grade III or IV disease and were least likely to have had breast-conserving surgery with radiation. Asian-American women were least likely to be diagnosed with TN breast cancer, to be diagnosed with stage IV disease, and to live where larger proportions of the population have household incomes below the poverty level.

Table 1 Distribution of demographic and clinical characteristics across racial/ethnic groups among women diagnosed with breast cancer between 2010–2016, SEER 18 registries

There were 26,685 deaths from any cause, 17,910 of which were attributed to breast cancer, between January 1st, 2010 and December 31st, 2016. There was a median follow-up time of 34 months and a maximum follow-up time of 83 months. Unadjusted Kaplan–Meier survival curves for women of all stages combined for each subtype are presented in Fig. 1. In each subtype, Black women had poorer breast cancer-specific survival than other racial/ethnic groups, and Asian/Pacific Islander women experienced better breast cancer-specific survival. In each subtype, the probability of breast cancer-specific mortality differed between women of different racial/ethnic groups within strata of stage (stratified log-rank test p-values < 0.001). Age at diagnosis (< 50 years, 50 + years) was a statistically significant effect modifier for over half of the stage and subtype combinations in each racial/ethnic group, but there was no effect modification by insurance status. Models additionally including an interaction term for binary age at diagnosis, with a linear age term for the main effect, were fit. Results for all subtypes are shown in Table 2, all stratified by age and adjusted for age at diagnosis, year at diagnosis, tumor grade, definitive local treatment, chemotherapy, and insurance status. Results for all models fit are reported in Supplementary Tables 1–3. Data is not shown for American Indian/Alaska Native cases due to there being less than 5 deaths for most of the subtypes and stages in each age group.

Fig. 1
figure 1

Kaplan–Meier curves for breast cancer-specific survival by molecular subtype among Non-Hispanic White, Hispanic White, Black, Asian/Pacific Islander, and American Indian/Alaska Native women diagnosed with breast cancer between 2010–2016, SEER 18 registries. p-value < 0.001 for all stratified log-rank tests, with stage as stratifying variable

Table 2 Hazard ratios and 95% confidence intervals for breast cancer-specific mortality comparing Hispanic White, Black, and Asian/Pacific Islander women to non-Hispanic White women diagnosed with four molecular subtypes breast cancer between 2010–2016, stratified by age at diagnosis, subtype and stage, SEER 18 registries

Black women younger than 50 at diagnosis had an over 50% greater risk of breast cancer-specific death across all stages of HR+/HER2− and stages II-IV HR+/HER2+ disease (data not shown for stage I HR+/HER2+ because less than 5 breast cancer deaths occurred in this group) compared to non-Hispanic White women. This observed disparity persisted after adjustment for grade, treatment characteristics, and insurance status (Supplementary Table 1). For TN and HR−/HER2+ disease, young Black women had slightly elevated risks in some stages that were mostly explained by treatment and insurance status, except for stage I HR-/HER2+ where a four times greater hazard of breast cancer-specific death was estimated after adjustment for age, year, grade, treatment, and insurance status (HR: 4.09; 95% CI 1.50, 11.15). Black women aged 50 or older at diagnosis had an over 50% greater risk of breast cancer-specific mortality for stage IV HR+/HER2− and stage III-IV HER+/HER2+ breast cancer but did not have meaningful differences in risks for TN and HR−/HER2+ disease, compared to non-Hispanic White women of the same age group.

For stages I-III HR+/HER2− and HR+/HER2+ breast cancer, Asian/Pacific Islander women generally had a lower hazard of breast cancer-specific mortality than non-Hispanic White women, and this lower hazard was more pronounced among older women for HR+/HER2− breast cancer. Older Asian/Pacific Islander women with TN or HR−/HER2+ breast cancer also generally had a lower risk of breast cancer-specific death, while younger women had similar risk of mortality compared to non-Hispanic Whites. For HR+/HER2− and stages III/IV TN disease, results did not differ greatly between Chinese, Filipino, and Indian Subcontinent ethnic groups (Table 3). For early stage TN breast cancer, older Chinese and Filipino women had significantly lower risks of mortality than non-Hispanic white women, while Indian Subcontinent women had similar risks to non-Hispanic White women. Results for HR−/HER2+or HR+/HER2+ disease by Asian/Pacific Islander ethnic subgroup could not be presented due to small sample size.

Table 3 Hazard ratios and 95% confidence intervals for breast cancer-specific mortality comparing different ethnic subgroups of Asian/Pacific Islander women to non-Hispanic White women diagnosed with HR+/HER2− or triple-negative breast cancer between 2010–2016, SEER 18 registries

Compared to non-Hispanic White women, Hispanic White women aged 50 or older at diagnosis with stages I-III HR+/HER2− disease had 20–40% lower risks of breast cancer-specific death, while younger Hispanic White women had higher risks of breast cancer-specific death for later stage HR+/HER2− disease that were not statistically significant after adjustment for treatment and insurance status, respectively (Supplementary Table 3). Hispanic White women with TN and HR−/HER2+ breast cancer had similar risks of death compared to non-Hispanic White women in both age groups, except that older women with stage II TN breast cancer had a lower risk of breast cancer-specific mortality. Heterogeneity in breast cancer-specific survival was observed among Mexican, Puerto Rican, and South/Central American ethnic groups (Table 4). Younger Puerto Rican women with early stage (I/II) TN breast cancer had the highest risks of breast cancer-specific mortality that persisted after accounting for tumor grade, treatment, and insurance status (HR: 2.94; 95% CI 1.38, 6.27); however, the sample size was small (n = 39). Similarly, younger South/Central American women with early stage HR+/HER2− disease had higher, and older women had lower, risks of breast cancer-specific mortality that was not fully explained by grade, treatment, or insurance status. Results for HR−/HER2+ or HR+/HER2+ disease by Hispanic ethnic subgroup could not be presented due to small sample size.

Table 4 Hazard ratios and 95% confidence intervals for breast cancer specific mortality comparing different ethnic subgroups of Hispanic White women to non-Hispanic White women diagnosed with HR+/HER− or triple-negative breast cancer between 2010–2015, SEER 18 registries

Discussion

In this large population-based study, disparities in risk of breast cancer-specific mortality were observed across women of different races/ethnicities and by stage, tumor subtype, and age. Black women under 50 years old at diagnosis experienced disparities of the largest magnitude compared to non-Hispanic White women. In contrast, older Hispanic White and Asian/Pacific Islander women experienced lower risks of breast cancer specific mortality. The present study is the first, to our knowledge, to examine disparities in risk of mortality among more than two racial/ethnic groups, stage, and subtype while also stratifying by age at diagnosis. Other similar studies did not assess effect modification by age at diagnosis[7, 8, 16] although young age at breast cancer diagnosis has been associated with poorer prognosis [17], particularly among HR+ subtypes [18, 19], and an interaction between race/ethnicity and linear age has been observed in a survival analysis adjusting for stage and subtype [20]. Two previous studies focused exclusively on comparing Black women with White women and did not include other racial/ethnic groups [7, 8], while this study, along with others found that other racial/ethnic groups also experience different survival than non-Hispanic White women [16, 21]. Additionally, Arciero et al. did not separate Hispanic White women from non-Hispanic White women in their reference group [7], and Hispanic White women in the present study experienced different risks of death from breast cancer than non-Hispanic White women. The overall disparities among Black women and the lower risk of breast cancer-specific mortality observed among Asian/Pacific Islander women are consistent with previous literature [5,6,7,8, 16, 21]. Additionally, results from previous studies on Hispanic disparities in breast cancer-specific mortality are conflicting, and differences in findings may be explained by unexplored effect modification by age in these studies [5, 6, 16, 21].

By considering different models adjusting for different factors we were able to observe whether adjusting for certain variables accounted for initially observed disparities. Adjusting for tumor grade reduced the disparity for Black women mainly for the HR+/HER2− subtype, and to a lesser degree for other subtypes. Generally, a greater proportion of observed disparities were explained by insurance status, so insurance status, as a proxy for other health care and social factors, may drive some of the observed differences in mortality. Factors such as access to and quality of health care, stress over the life course, racism, and health behaviors are examples of factors related to health care and social factors that could contribute to risk of breast cancer-specific mortality [9]. Many of the observed disparities were still statistically significant after adjustment for tumor grade, treatment characteristics, and insurance status, indicating that other factors, such as hormonal treatment or obesity, may be responsible for the observed disparities. Some prior studies have shown that Black and Hispanic women are less likely to receive hormonal treatment than non-Hispanic white women [22, 23], but one study restricted to postmenopausal women did not find differences in hormonal therapy use by race or ethnicity [24]. Hormonal treatment is associated with a lower risk of breast cancer-specific death, and if young Black women in this study were less likely to receive hormonal treatment than non-Hispanic White women, then differences in hormonal treatment may explain the remaining survival disparity in HR+ disease. Moreover, obesity and diabetes are associated with breast cancer-specific mortality and may differ between racial/ethnic groups, which could contribute to the disparities that remained after adjustment for measured confounders [25,26,27]. In addition to obesity and diabetes, other potential explanations for the reduced risk of breast cancer-specific mortality observed among Asian/Pacific Islander women relative to non-Hispanic White women in this study include differences in diet [28] and physical activity [29].

One limitation of this study is the completeness of radiation therapy and chemotherapy as adjustment variables [30]. For cases missing information on whether radiation or chemotherapy was received, it is unclear whether they did not receive these treatments or whether the registry failed to capture this data, which prevented sub-analyses stratifying by treatment [30]. Furthermore, we did not have detailed radiation or chemotherapy information, beyond whether the patient received it or not, or information on targeted or hormone therapies. This may result in residual confounding by treatment factors, if women of different racial/ethnic groups were more, or less, likely to receive these treatments than non-Hispanic white women. The present study, like other registry-based studies, is also limited by the lack of data on hormonal treatment and targeted therapies, other important comorbidities, lifestyle factors, individual and health systems-level social determinants of health (e.g., education, income, acculturation, occupation, health care provider details), and reproductive history that may confound the relationship between race/ethnicity and mortality. Although registry data may fail to capture these important variables, the observed disparities in this study warrant additional attention, regardless of their cause. We also excluded 15% of the initial population due to missing data, which may have introduced bias into our results if the data were not missing completely at random. Finally, this study uses broad racial/ethnic categories and heterogeneity within subgroups exists; however, small sample sizes of ethnic groups combined with multiple strata and short follow-up prevents analysis of more detailed ethnic groups beyond what was examined.

To summarize, these findings demonstrate that women of different racial/ethnic groups experience different breast cancer-specific mortality than non-Hispanic White women in certain stages and subtypes, with age at diagnosis acting as an important effect modifier. Further research of protective factors that may contribute to the reduced risk of breast cancer-specific mortality of Asian/Pacific Islander and Hispanic White women of certain subtypes is warranted. Additionally, future studies could examine whether other tumor characteristics may mediate these relationships. Efforts to address disparities in breast cancer-specific mortality should also place additional focus on young Black women, as they bear a disproportionate breast cancer mortality burden compared to non-Hispanic White women, particularly for HR+ subtypes.