Introduction

Breast cancer is the most commonly diagnosed cancer, as well as the second leading cause of cancer death, among females in the United States. An estimated 226,870 new cases of invasive breast cancer are expected to occur among females in the US during 2012, and the age-adjusted death rate from female breast cancer was 23.5 per 100,000 based on data from 2004 to 2008 [1]. Breast cancer has large variations in survival [2]. Some research suggests that survival after diagnosis with breast cancer is associated with modifiable factors including environmental exposures [3].

Ambient outdoor air pollution has been associated with increased morbidity and mortality from a variety of cancers [47]. Some epidemiological studies have reported associations between air pollution and breast cancer incidence [8, 9]. However, to our knowledge, no studies have been conducted to examine the long-term effects of air pollution on survival of breast cancer. With the advancement of urbanization and industrialization, releases of toxic pollutants into the air continue to increase [10]. Because of the high prevalence of both air pollution and breast cancer, and the observed linkage of air pollution with breast cancer incidence, it is necessary to understand whether exposure to air pollution also has adverse effects on female breast cancer survival.

Since air pollution is a complex mixture of different gaseous and particulate components, it is difficult to define, given that the biological mechanisms of air pollution’s effects on human health are largely unknown [11]. Particulate matter (PM) includes a variety of pollutants made up of extremely small particles and liquid droplets containing acids, organic chemicals, metals, and soil or dust particles, which can remain suspended in the atmosphere [12]. Inhalable particles, which have an aerodynamic diameter less than 10 μm (PM10), can reach the upper airways, while fine particles (less than 2.5 μm, PM2.5) can even reach more deeply and be retained in the pulmonary alveoli [13]. PM’s effect on human morbidity and mortality has been suggested by many studies, and it is continued to be the most reliable component of air pollution that is associated with human health [14].

In this study, we examined adverse effects of PM on female breast cancer survival using the 1999–2009 Surveillance Epidemiology and End Results (SEER) cancer data in the U.S. to be linked with PM data which were obtained from the U.S. Environmental Protection Agency (EPA). In addition, this study further investigated whether there were interactions between PM and different stages of cancer at diagnosis on female breast cancer survival.

Materials and methods

Study population

The SEER cancer registry is the premier source for cancer statistics in the United States, which collects information on incidence, prevalence and survival from specific geographic areas representing 28 % of the US population. We selected all incident breast cancer cases in 1999–2009 from all counties in California, United States available from SEER cancer data (n = 287,623). All male cases were excluded (n = 1,801). Cases from three counties (Alpine, Sierra, and Yuba) were further excluded because of lacking of PM data in those counties during the study period (n = 716). A total of 285,106 cases were then linked to ambient PM monitoring data in this study. Female breast cancers were identified based on primary site using International Classification of Disease for Oncology, Third Edition (ICD-O-3). Cancers included were nipple (including areola, C500), central portion of breast (C501), upper-inner quadrant of breast (C502), lower-inner quadrant of breast (C503), upper-outer quadrant of breast (C504), lower-outer quadrant of breast (C505), axillary tail of breast (C506), overlapping lesion of breast (C508), and breast, NOS (C509).

Outcome assessment

Death from breast cancer among females was the main outcome of interest. All deaths caused by other diseases and survival cancer patients at the end of the study period were considered as censored cases. Survival time (unit: months) was calculated as the interval from the time of diagnosis to the time of death or to the end of the study.

Exposure assessment

We obtained daily monitoring data of PM10 and PM2.5 from the U.S. EPA’s monitors located in different areas in each county. Since the monitoring data of PM2.5 were only available from 1999, we selected the incident breast cancer cases of 1999–2009 from SEER. If there were more than one monitor in a county, the mean concentrations of these different monitors were used as daily concentrations of PM10 and PM2.5 in that county. A monthly mean concentration was then calculated based on daily mean concentration for each county. Finally, we linked the monthly mean concentration of PM10 and PM2.5 to the cases to be obtained from SEER by county identifiers, and the mean concentrations of PM10 and PM2.5 during survival time were calculated for each case. For the accuracy of the estimation of exposure to PM, we further excluded cases with any missing PM data during any month of the survival period (n = 17,446). Both PM10 and PM2.5 were then categorized into three groups (low, medium and high exposures) using tertiles.

Covariates

SEER contains individual-level information for each cancer case, including demographics (age at diagnosis, race/ethnicity, and marital status at diagnosis) and cancer characteristics (date of diagnosis, site, and stages of cancer). These factors available from SEER were included as potential confounders in the statistical models. In addition, the year diagnosed was also considered as a potential confounder due to the improvement of treatments and care over time.

Data of county-level covariates were obtained from the 2000 Census and the 2006 Behavioral Risk Factor Surveillance System (BRFSS). We extracted county-level median household income, cumulative educational attainment for female population 25 years and over, population size of female 25 years and over, population inside urbanized areas, and total population of all the counties in California. Percentages of those who had at least neither a high school diploma nor GED among the female population 25 years and over, and percentages of those who live inside urbanized areas were calculated for each county. In addition, the count of hospital beds at general hospitals per 100,000 population in 2004 was obtained from the 2006 BRFSS supplement county-level data. The median household income and calculated percentages were then linked to the cancer cases and categorized into three groups (low, medium, and high) by tertiles as indicators of county-level income, education level, and accessibility to medical resources, respectively. The count of hospital beds was categorized into two groups (low, high) by the median as an indicator of the overall medical resources level in each county, which was taken into account with percentages of population inside urbanized areas.

Additional 12,532 cases were excluded from the analysis because of missing data of covariates such as age, race, marital status, and cancer stages. A total of 255,128 female breast cancer cases were finally analyzed in this study.

Statistical analysis

Descriptive statistics (e.g., χ2 tests) were performed for comparing the distributions of categorical variables among different exposure levels of both PM10 and PM2.5. Survival curves were created using Kaplan–Meier life table analyses, and log-rank tests were applied to test the significance of the difference of survival among three groups for both PM10 and PM2.5. Marginal Cox proportional hazard models to be implemented in PROC PHREG in SAS (SAS Institute Inc., Cary, NC) were used to estimate the hazard of mortality due to different exposure levels of PM10 and PM2.5 among breast cancer cases [15]. Marginal approach was used for the county-level analysis that included multiple subjects and was carried out using maximum partial likelihood estimates of regression parameters and a robust sandwich covariance matrix estimate to account for the dependence among subjects within counties [15, 16]. PM10 and PM2.5 were modeled as both categorical variables and continuous variables. We also considered the interactions between cancer stage and PM. All statistical analyses were conducted using SAS 9.3 software (SAS Institute Inc., Cary, NC).

Results

Table 1 presents the characteristics of all female breast cancer cases by PM concentration level. Exposure levels of PM10 and PM2.5 significantly differed by age at diagnosis, race/ethnicity, marital status at diagnosis, cancer stage at diagnosis, and year diagnosed as well as the education, income level, medical resources, and urbanization on county-level.

Table 1 Characteristics of breast cancer cases among females in California, U.S. from 1999 to 2009 by pollution levels (n = 255,128)

Figure 1 shows the results of Kaplan–Meier survival curves of all female breast cancer cases by PM10 and PM2.5 levels. These figures suggest that female breast cancer cases that lived in counties with higher exposures to PM10 and PM2.5 experienced significantly shorter survival than those living in counties with lower exposures (p < 0.0001).

Fig. 1
figure 1

Survival plot for female breast cancer cases in California, U.S. from 1999 to 2009 by PM10 and PM2.5 levels

Tables 2 and 3 show the unadjusted and adjusted hazard ratios (HR) and 95 % confidence intervals (CI) for death due to breast cancer among females. Cases who lived in counties with high PM10 or high PM2.5 level during survival period had a statistically significant increase of 33 and 40 %, respectively, in mortality compared to those living in counties with low PM10 or low PM2.5 level (for PM10, HR = 1.33, 95 % CI: 1.01–1.76; for PM2.5, HR = 1.40, 95 % CI: 1.14–1.73). After adjusting for both individual-level covariates such as age, race/ethnicity, marital status, cancer stage and year diagnosed, and county-level covariates such as education level, income level, and accessibility to medical resources, the rates of mortality among subjects living in counties with high PM10 levels and high PM2.5 levels remained statistically significantly higher (HR = 1.44 and HR = 1.76, respectively). In the models which PM10 and PM2.5 were analyzed as continuous variables, cases had HRs of 1.14 and 1.13 with per 10 μg/m3 increase in PM10 concentration and 1.61 and 1.86 with per 5 μg/m3 increase in PM2.5 concentration for unadjusted and adjusted models, respectively.

Table 2 Adjusted and unadjusted hazard ratios (HR) and 95 % confidence intervals (CI) for breast cancer death from 1999 to 2009 among females in California, U.S. (n = 255,128)
Table 3 Adjusted and unadjusted hazard ratios (HR) and 95 % confidence intervals (CI) for breast cancer death from 1999 to 2009 among females in California, U.S. (n = 255,128)

Table 4 presents the results of the models in which cancer stage at diagnosis was considered as an effect modifier. The HRs of cases living in areas with high PM exposure levels decrease when the cancer stage at diagnosis is advanced. Similar results were observed in the continuous models.

Table 4 Adjusted hazard ratios (HR) and 95 % confidence intervals (CI) for breast cancer death from 1999 to 2009 among females in California, U.S. by cancer stage (n = 255,128)

Discussion

In this study, we found that female breast cancer patients who live in high PM level areas had significantly shorter survival time compared to those living in a low PM level area. Our study also suggests that female breast cancer patients with earlier cancer stages are likely to be more sensitive to the adverse effects on survival time caused by air pollution. Our findings raised great concerns that air pollution exposure may have a great impact on the length of survival of female breast cancer patients. More importantly, this study also calls attention to preventive efforts to protect female breast cancer patients from air pollution, especially those with less advanced cancer stages. Clinicians may assume those with advanced cancer are more vulnerable to environmental exposures, yet our data suggest that we need to be concerned about preventing exposure among early stage cancer patients in particular.

The adverse effect of ambient PM exposures on the survival of female breast cancer patients is biologically plausible. Both short-term and long-term exposures to PM have been linked with cancer-specific mortality [1721]. A recent ecological study suggests that high levels of PM2.5 may be associated with an increased risk of death from breast cancer among 61 municipalities [22]. Results from these studies are all consistent with our findings.

The biological mechanisms by which exposures to high levels of PM can impact the length of survival of female breast cancer patients are not clear. But, several previous studies may suggest some potential pathways underlying it. Air pollution is a prevalent environmental oxidative stressor, which can trigger redox-sensitive pathways that lead to different biological processes such as inflammation and cell death [23]. Through the potential biological mechanisms of oxidative stress and inflammation, air pollution can compromise the function of cardiovascular system and/or respiratory system and increase the risk of morbidity and mortality, which has been consistently reported and may also play a role in this observed association [24]. In addition, another plausible mechanism is through DNA methylation induced by high air pollution in breast cancer patients. A study found that exposure to higher air pollution was associated with changes in p16, a gene involved in tumor suppression among breast cancer patients, which may aggregate the progression of breast cancer [25]. Moreover, DNA damage such as polycyclic aromatic hydrocarbons (PAH)–DNA adducts may be another critical underlying mechanism to explain the shorter survival time we observed. PM in urban air is comprised of a complex mixture of organic and inorganic chemicals. PAH, is an important group of chemicals with adverse health effects associated with air PM. Several animal and epidemiologic studies reported an association between PAH and DNA adducts and breast cancer [2631]. The accumulation of PAH–DNA adducts might increase mutations and genomic instability, further contributing to the cancerous phenotype of the cells, thus playing an important role in the further progression of the malignant cells. Moreover, a recent study found the links between airborne particle-induced oxidative stress and the subsequent induction of DNA damage as well as repair in human breast cancer cells, leading to more rapid progressions of breast cancer, which further confirms the potential pathways as we suggested above [32].

In addition, a recent study suggests that patients in early stages (i.e., ductal carcinoma in situ, DCIS) were most common to take radiation therapy and hormone therapy were more frequently adopted by those in advanced stages [33]. It has also been found that patients received radiation therapy had a twofold to sixfold higher adduct associations with breast cancer mortality, while those received hormone therapy had relatively lower associations and treatment is a stronger effect modifier between environmental exposures and breast cancer mortality [34]. This finding may help us explain the mechanism underlying the observed elevated risks among patients in early cancer stages in the present study.

Our study offered several strengths. First, using the SEER cancer data linked with county-level PM data, we have a unique opportunity to study the relationships between exposure to PM and survival of female breast cancer in a large multiethnic sample. What is the most novelty about our study is that we were able to link daily PM exposure data for each case’s county of residence, providing individual measurements of PM10 and PM2.5 levels post-diagnosis. Therefore, using these existing data, it provided an extremely cost-effective way of studying this topic. Furthermore, the SEER data included patient’s demographic information and information on cancer characteristics and treatment, whose potential confounding effects we were able to control for when the associations between PM exposures and breast cancer survival were investigated.

Several limitations of this study also needed to be noted. First, although we were able to choose cases from one state, which may have minimized the effects of some potential confounders, we were unable to control for differences in types of healthcare, culture, dietary habits, individual-level socio economic status, smoking, and alcohol consumption. However, confounding effects of substance abuse may be limited, as current tobacco use and alcohol consumption among cancer patients after diagnosis is not common [35, 36]. In addition, in order to minimize the potential impacts of social economic status and types of healthcare access, we used the data from the 2000 Census and the 2006 Behavioral Risk Factor Surveillance System (BRFSS) to get information on median family income, educational level and medical resources at county-level in our analysis. After controlling for these county-level predictors, the results remain consistent. Second, we used the county-level monitored air pollution data to estimate individual air pollution exposure, which may cause misclassification of exposure. The closest air monitor to individual address may provide a better estimation of individual exposure than the county average of air pollution data. However, the SEER data which we obtained only include county-level identifiers instead of residential address. We cannot link the cases with the nearest monitors by their residential addresses. Furthermore, since breast cancer patients may spend significant time indoors and individual exposure to PM for all cases may be different from ambient air monitored levels, the misclassification of exposure may still exist. However, this misclassification of exposure is more likely to be a non-differential bias, which would cause our estimation biased toward null. Third, we excluded the cases in three counties because of no availability of PM monitored data, which may introduce another source of bias. However, these counties only had a total of 716 cases (i.e., 0.25 % of all cases). Given such a small proportion of cases in these counties, it would not dramatically affect the result even if extreme scenario occurs. In addition, we compared our selected cases with all cases regarding demographical distribution. It shows that two groups of cases were no significant differences regarding demographical distribution. Finally, the status of migrant worker in some counties where have agriculture based pollution may also cause another potential sources of bias because these worker are unlikely to have health care coverage. While we were controlling for race/ethnicity in our analysis, this may not be subtle enough given the large population of Hispanic residents who are not engaged in agribusiness. As individual status of migrant worker is not available in the data, we are unable to control for the impacts of this variable in our analysis. However, we conducted sensitive analysis among only white cases and found consistent results. Moreover, we further checked our data and found that there were about 11 % of cases who were other than white and black included in this study. Given the migrant workers who are only a small proportion of this group, the results may not be largely impacted by this potential confounder.

Conclusion

Although these limitations were noted in the study, our results suggested that exposure to high PM levels may have deleterious effects on the length of survival from breast cancer among females. Besides, compared to patients with advanced cancer stages, those with earlier cancer stages were more likely to be impacted by such exposures. Although further studies are necessary to confirm our findings, our study provides preliminary evidence that air pollution may contribute to decreased length of survival among female breast cancer patients.