Introduction

Among women in the U.S., breast cancer continues to be the most frequently diagnosed cancer and the second most common cause of cancer death. African–American (AA) women have increased incidence of early onset disease and die at a higher rate than any other racial/ethnic group [1]. While a socio-demographic component is a major factor in this disparate burden, the role of tumor biology or genetics cannot be ignored as breast cancers, like all cancers, arise as a result of genetic alterations [29].

The heterogeneity of breast cancer has recently been demonstrated through genetic profiling which has identified the presence of several major breast tumor subtypes, each with distinct gene-expression patterns [1016]. These major subtypes have important implications in breast cancer etiology, the systemic therapies prescribed, the effectiveness of such therapies, and in outcome, both recurrence and survival [14, 1724]. Breast cancer-related proteins correspond with the major genetic subtypes; specifically, protein expression of estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) [10, 11, 13, 18, 25]. The triple negative tumor (TNT) phenotype, denoted by lack of expression of all three protein receptors (ER−PR−HER2−) has recently emerged as a distinct subtype. TNTs may arise at an earlier age, are basal-like and almost exclusively high grade tumors. These characteristics suggest distinct molecular origins [14, 17, 20, 26, 27]. Unfortunately, these tumors also currently have no available effective target therapies [27].

TNTs account for between 10–30% of all invasive breast cancers in the U.S. and AA women appear to be twice as likely as white women to present with these tumors [18, 2830]. Singly and in combination, the earlier age at onset, lack of viable therapeutic targets and highly aggressive features of the TNTs may contribute to poor outcome and may explain in part the historically poorest survival observed among AA women.

Several U.S. studies have recently reported a higher prevalence of TNTs among younger and African–American women [18, 2830]. However, only two published studies addressing race, breast cancer subtypes, and survival are population-based and both have several limitations [18, 28]. Testing of the biomarkers and ascertainment of other pathology components was not performed in a centralized/uniform manner, treatment information was not available hence was not accounted for in survival analyses, nearly 45% of the biomarker information to construct subtypes was missing in one study [28], and neither study examined other factors such as stage, grade, poverty index, and treatment factors which could explain racial differences in survival among the breast cancer subtypes. Our population-based study compensates for these limitations and further augments the biologic understanding of breast tumor subtypes in several ways. In our study all pathology review and immunohistochemical (IHC) testing was centrally conducted by one pathologist (P.L.P.). In addition to standard prognostic factors, our survival analyses account for delay in treatment, type of treatment received, and co-morbidities. With a study population of 476 younger women (age 20–54 at diagnosis) and 30% AA representation, we were able to simultaneously consider multiple factors in our assessment of racial differences in subtypes and survival, with the aim of identifying factors which could explain these racial differences. Finally, this is also the first population-based study to investigate the association of the subtypes with expression of breast tumor proteins important to regulation of the cell-cycle or to programmed cell death (apoptosis). The proteins we investigated are those which are fairly common in breast cancers and may have particular importance in early age breast carcinogenesis, aggressiveness, or clinical outcome [31]. We investigate the prevalence of triple subtypes of breast cancer (based on immunohistochemical (IHC) protein expression of ER, PR, and HER2) and their association with race and overall survival; adjusting for socio-demographic, prognostic, and treatment factors.

Methods

Population

This study builds on the metropolitan Atlanta arm of a multicenter population-based, case–control study of breast cancer incidence and risk factors in younger women (aged 20–54) [32]. The original study identified 950 African–American (AA) or white women, aged 20–54, who were residents of the three largest counties in metropolitan Atlanta, Georgia (Cobb, Fulton, and DeKalb) when newly diagnosed with unilateral invasive breast cancer between May 1, 1990 and December 31, 1992. The 3-county area represented a population of over 1.64 million residents (36.4% AA and 60.5% white). Case identification was through rapid ascertainment of hospital admission, surgery, and pathology records. Completeness of ascertainment was assessed by periodic checks against the Metropolitan Atlanta Surveillance Epidemiology and End Results (SEER) Cancer Registry, part of the NCI-funded SEER program. The overall response rate for Atlanta interview of patients in the parent case–control study was 87.9% (835/950): 87.2% (251/288) for AA women and 88.2% (584/662) for white women. For the current study, four of the 835 women originally interviewed were excluded (1 later self-reported ‘other’ race and 3 were interviewed as controls, prior to becoming cases), resulting in 831 eligible women for the current study (251 AA and 580 white). After obtaining Institutional Review Board (IRB) approval at each collaborating institution, a more comprehensive medical record review was conducted and tumor specimens were obtained. Of the 831 women, slides for centralized pathology review and archival tissue specimens appropriate for further laboratory analysis were obtained for 476 (116 AA and 360 white).

Data sources & elements

Data sources

There were several sources of data for this study, including information previously collected in the original study and Atlanta SEER registry data. Follow-Up telephone interviews and extensive medical record abstraction from hospitals, physicians’ offices, diagnostic and radiation facilities, and pathology laboratories were conducted beginning in 1997. Trained medical record abstractors and interviewers conducted the interviews and medical record abstraction. In cases where multiple sources for the same data conflicted, extensive expert review (J.W.E.) was conducted.

Patient-related factors

Data from the parent study were the primary source of the non-tumor factors, including race, age, education, and poverty index. A detailed description of these factors was published previously [32]. Racial identity was self-reported. The youngest age category (20–34) included by necessity a larger range because there were so few cases in this age range as previously described [33]. A poverty index (as an indicator of socio-economic status) was calculated based on the combination of annual household income and the number of people supported by that income, with the annual household income divided by the 1991 national poverty level income for a family of the corresponding size [33, 34]. The lowest ordinal category represents those living at the federally defined poverty level. Co-morbid illness information was obtained from the original patient interview and abstraction of hospital records. The co-morbid variable sums the number of co-morbid medical conditions for each patient and includes only those illnesses thought to contribute to treatment and/or survival: Diabetes, drug abuse, gastrointestinal disease, heart disease, HIV, hypertension, renal disease, liver disease, lung disease, neuropathic disease, psychiatric disorders, and rheumatoid arthritis. These measurements represent the time period up to the date of initial diagnosis.

Treatment-related factors

Treatment data were obtained from abstraction of hospital, medical (oncology) office, and radiation facility records, cancer registry data, and from the original interview. Chemotherapy, hormonal and radiation therapies were categorized as receipt, non-receipt, or unknown. Type of surgery, based on the most extensive surgery performed, was categorized as breast conserving surgery (BCS), mastectomy, or none. Diagnosis delay was defined at the elapsed time between first medical consultation and biopsy-proven diagnosis of primary invasive breast cancer. Treatment delay was defined as the time between biopsy-proven diagnosis and the date of either definitive surgery, initiation of neo-adjuvant chemotherapy, or treatment for metastatic disease.

Tumor-related factors

Stage and its components (tumor size, nodal status, and distant metastasis) were obtained from the cancer registry and/or independent abstraction of medical records. Stage at diagnosis was defined using the American Joint Committee on Cancer Staging criteria that were in use during the case ascertainment period (1990–1992) [35]. Stage represents pathologic stage at the time of the first diagnostic procedure confirming invasive breast cancer (or clinical stage if the patient received neo-adjuvant therapy) and was divided into groups, based on similarity of expected prognosis and to achieve adequate numbers in our analyses: I, IIA, IIB, and III/IV.

Pathology

A detailed description of the pathology review and laboratory methodology was previously reported [31]. All components of the pathology review and immunohistochemical (IHC) assays were conducted without knowledge of patient characteristics or clinical outcome. Tumor tissue specimens were reviewed by a single pathologist (P.L.P) for Nottingham tumor grade [36, 37]. The levels of tumor marker proteins were assayed using standard IHC techniques, including antigen retrieval when appropriate, on tumor tissue sections using the antibodies specified below [3842].

Representative tumor blocks were selected for testing, and the expression levels of the following proteins were assessed: estrogen receptor (ER; ERID5; Immunotech, Westbrook, ME) [4345], progesterone receptor (PR; 1A6; Novocastra, Newcastle-Upon-Tyne, United Kingdom) [46], c-ErbB-2 (HER-2/neu) oncogene protein (AO485/DAKO15; Dako, Capinteria, CA) [47], p53 tumor suppression gene protein (pAb 1801; Oncogene Science, Manhassett, NY) [4850], Ki-67 proliferation-related antigen (MIB-1; Immunotech) [38, 39], and the cell-cycle regulatory proteins cyclin E (cyclin E polyclonal; Dr. James M. Roberts, Fred Hutchinson Cancer Research Center, Seattle, WA) [51, 52], cyclin D1 (5D4; Immunotech) [53, 54], p16 (Ab-2(ZJ11); Neomarkers Freemont, CA) [55], p21 (EA10; Calbiochem, San Diego, CA) [56], p27 (Ab1-DCS-72.F6; Neomarkers) [52, 57], p130 (Mab 10; Transductions Labs) [58], pRb (G3–245 PharMingen) [59], and Bcl-2 (6C8 D; Hockenbery Lab, FHCRC, Seattle, WA) [60]. Apoptotic index (AI) was determined by terminal deoxyunucleotidyl transferease-mediated dUTP-biotin nick end labeling (TUNEL) assay [61].

IHC assays were scored according to staining intensity and/or percentage of tumor cells that were positive. Scores were then collapsed into positive (high) and negative (or low) categories. For ER and PR, any nuclear staining was considered indicative of positivity. Antibody staining for HER2 was performed using the AO485 antibody from Dako and was initiated before the acceptance of the HercepTest kit (Dako) as the Food and Drug Administration-approved technique for the evaluation of HER2expression [62]. Tumor specimens were categorized as positive if there was ≥2+ positive membranous staining observed relative to normal breast epithelium. The clinical use of the marker to indicate therapy was not the objective of this current study and Herceptin for the treatment of HER2+ (c-ErbB-2 +) tumors was not a standard therapeutic option during the relevant study follow-up years (1990–2003). For p53, nuclear staining of ≥10% of tumor cells was categorized as positive. Expression levels of cyclin E and p27 were assigned scores ranging from 1 (negative) to 7 (highest intensity), with subsequent groupings as previously described [31]. Bcl-2 was categorized as positive if intermediate or high intensity cytoplasmic staining was detected and negative if no or low intensity cytoplasmic staining was detected.

The remaining tumor markers (Ki-67, p21, p16, pRb, p130, cyclin D1, AI) were graded according to the percentage of tumor nuclei with positive staining in four high power fields (HPF). An average of >1000 tumor cells were evaluated for each case. The results were categorized into tertiles (excluding zero values, which comprised a fourth group). They were then further dichotomized as “low” if there was no positive staining or if the percentage of positive cells was in the lowest tertile, and “high” if the percentage of positive stainings were in the two highest tertiles.

Triple subtypes

The breast cancer cases were divided into 4 IHC subtypes based on whether ER, PR, and HER2 expression were positive (+) or negative (−): Triple negative (ER−PR−HER2−), ERand/orPR+, HER2−; ERand/orPR+, HER2+; and ER−PR−HER2+. These four subtypes are approximately equivalent to the following respective subtypes developed through genetic hierarchal clustering: basal-like, Luminal A, Luminal B, and HER2+ (but distinct from Luminal B) [27, 63].

Follow-up and survival (Time to death)

Survival information was obtained from the Atlanta SEER registry, which utilizes multiple sources for active and passive follow-up, and routinely matches all cancers with the State of Georgia vital statistics database. Women who no longer resided in Georgia were followed-up on a yearly basis through the National Death Index. Death certificates were requested from the Georgia State Health Department or from the state where the participant resided at the time of death. For living patients, the follow-up interviews and additional medical record abstraction were also used to update last contact information. Cause of death was obtained from death certificates where underlying cause of death codes 174.0–174.9 or C50.0–C50.9 were classified as death due to breast cancer (International Classification of Diseases for Oncology). Survival (time to death from any cause) was calculated from the elapsed time between the date of diagnosis and the date of death or most recent follow-up. For this study, follow-up was truncated at January 1, 2003.

Statistical analysis

As previously reported, we determined if the inability to obtain tumor specimens on all of the eligible women may have caused selection bias [31]. We compared the pathology cohort to the entire eligible population. This comparison indicated that AA women accounted for 30% of the eligible population but only 24% of the pathology cohort. AA women in the pathology cohort were also younger (16% of AA in the eligible population were aged 21–34 years, compared to 20% of the pathology cohort) and more likely to have died of disease (64% vs. 55%) than were AA women in the overall eligible population. These differences between the eligible population and the pathology cohort were not observed among white women. Therefore, to reduce the possibility of overestimating the magnitude of tumor characteristic based on race, this analysis was weighted based on the inverses of the probabilities that women in the eligible population were included in the pathology cohort. Women were categorized into 20 groups, based on race, age at diagnosis (5 groups), and vital status at the time of contact to obtain consent (1998) [31]. For each group, the statistical weighting procedure used the inverse of the proportion of women in the eligible population with the same 3 characteristics, who were included in the pathology cohort. All reported results are based on weighted analyses, as described previously. [31]

In the present analyses, frequency distributions and χ2 tests of independence (or Fisher Exact tests where expected cell counts were less than 5) were used to describe the IHC triple subtypes [ER−PR−HER2−, ER−PR−HER2+, ER/PR+HER−, ER/PR+HER2+] by race and other patient-related (socio-demographic), tumor-related, and treatment-related factors. Odds ratios (unadjusted and adjusted for age and stage at diagnosis) and 95% confidence intervals (CIs) examined the magnitude and precision of the prevalence associations. Racial differences in the prevalence of the IHC triple subtypes were further examined by adjusting for patient/socio-demographic and diagnosis-related factors (age, poverty index, education, stage, and delay in time to diagnosis). The Hosmer–Lemeshow Goodness of Fit (GOF) test was satisfied for the multivariate logistic models we present.

Kaplan Meier curves and log rank tests were used to estimate unadjusted survival functions and to compare them across strata: race, the IHC triple subtypes, and the other patient-related and tumor-related prognostic factors and treatment factors [64]. Cox Proportional Hazards (PH) models were used to estimate the effect of each of the independent variables on the risk of death (unadjusted hazard ratio (HR)). No violations of the PH assumption were observed based on graphical and statistical approaches.

Racial differences in overall survival and within the IHC triple subtypes, were assessed by additional adjustment for factors that were determined a-priori as being clinically relevant (age, stage, grade, education status, poverty index, co-morbid status, and treatment factors [receipt of chemotherapy, radiation therapy, hormonal therapy, delay in receipt of treatment, and type of surgery]). Due to the restrictive sample sizes in the triple subtypes, interaction terms between race and the other covariates were not assessed in survival models.

Results

IHC triple subtype associations with race and other factors

Characteristics of the study population by breast cancer triple subtypes and race are presented in Table 1. The most common subtype was ER+/PR+/HER2−, representing 56.3% of all breast tumors, followed by triple negative tumors (29.5%), with HER2+ expressing subtypes being the rarest (14.1%); ER−PR−Her2+ (6.7%) and ER/PR+HER2+ (7.3%). Triple negative tumors (TNTs) were the most prevalent subtype among African–American (AA) women, accounting for 46.6% of their tumors. In contrast, among white women, TNTs comprised only 21.8% of their tumors, while nearly two-thirds (64.3%) were the ER+/PR+/HER2− subtype. The racial differences in TNTs strongly persisted within age strata (Fig. 1) and stage strata (Fig. 2).

Table 1 Frequencies (weighted percentages) of IHC triple subtypes according to race, Atlanta follow-up study
Fig. 1
figure 1

Racial differences in triple negative tumor prevalence within age groups. P < 0.01 for racial differences in each age sub-group

Fig. 2
figure 2

Racial differences in triple negative tumor prevalence within stage groups. P < 0.01 for racial differences within each stage

The four triple subtypes significantly differed by several other factors, including age at diagnosis, poverty index, and multiple tumor characteristics (Table 2). Patients with TNTs were younger and impoverished (Table 2). TNTs also demonstrated the largest proportions of high grade and Ki-67; high expression of p16, p53, and Cyclin E; and low Bcl-2, Cyclin D1, and p130 expression. TNTs, along with the ER−PR−HER2+ subtype, were least likely to be found in Stage I disease. TNTs were the rarest subtype among tumors ≤1.0 cm. There were no differences among the four subtypes by nodal status or distant metastasis.

Table 2 Frequencies (weighted percentages) of IHC triple subtypes according to patient demographics and tumor factors, Atlanta follow-up study

Odds ratios for the triple subtypes, adjusted for age and stage at diagnosis, are presented in Table 3. The most common subtype (ER/PR+HER2−) serves as the referent group. The TNT subtype was strongly associated with AA women (OR = 3.0, 95% CI 2.1–4.2); as well as younger age (<40 years) and low poverty level. The TNTs were also significantly and strongly associated with high grade (OR = 9.4), as well as high expression of p53 (OR = 9.1), p16 (OR = 3.2), cyclin E (OR = 8.8), and low expression of cyclin D1 (OR = 8.3) and Bcl-2 (OR = 6.3).

Table 3 Adjusted Odds Ratios (OR) between triple subtypes, patient and tumor characteristics, Atlanta follow-up studya

The high TNT prevalence observed among AA women was attenuated, but persisted, after further adjustment for grade, poverty index, and diagnosis delay (OR = 1.9, 95% CI 1.2–2.9), (Table 4). After this same further adjustment, AA and white women did not differ with respect to the other breast cancer subtypes: ER−PR−HER2+ (OR = 1.3, 95% CI 0.6–2.7) and ER/PR+HER2+ (OR = 1.1, 95% CI 0.5–2.2).

Table 4 Adjusted Odds Ratios (OR) for race (AA vs. White)—according to triple subtypes of breast cancer—Atlanta follow-up studya

Survival, race, and the triple subtypes

Median follow-up was 11.4 years and did not differ by race (AA = 11.5 years and white = 11.3 years). Of the 476 women in the cohort, 131 were deceased at the end of the study period, 57 AA and 74 white women, representing 39.5% and 23.6% of their respective populations (P < 0.001). In this young population of women ages 20–54, breast cancer was cited as an underlying cause of death in over 88% and did not differ by race.

Survival significantly differed by the triple subtypes and by race. Because survival curves could not be adjusted for weighting, Fig. 3 presents unadjusted/unweighted survival curves, stratified on race. Table 5 presents hazard ratios (HR) for all-cause mortality, comparing AA women to white women overall and within each IHC triple subtype. Overall, AA women were twice as likely to die (HR = 1.9, 95% CI 1.5–2.5). After adjustment for age, stage, grade, and poverty index this survival disparity was fully attenuated (OR = 0.9, 95% CI 0.7–1.3). Additional adjustment for treatment, including receipt of surgery, radiation, chemotherapy, and hormonal therapy drove the racial difference toward better survival for AA women, though non-significant (OR = 0.8, 95% CI 0.6–1.2).

Fig. 3
figure 3

Unadjusted/Unweighted survival curves comparing TNT to ER/PR+HER2− tumors, stratified on race

Table 5 Survival frequencies (weighted %) and hazard ratios (all-cause mortality) for race (AA vs. White)-overall and stratified by IHC triple subtypes of breast cancer

Within every triple subtype, AA women also experienced poorer survival, when no other factors were considered (Table 5). The worst survival experiences for AA women were among the TNT group (OR = 2.1, 95% CI 1.3–3.3) and the ER−PR−HER2+ group (OR = 3.5, 1.8–7.1). Adjustment for age, stage, grade, and poverty index, had no affect on the disparate survival observed in the TNT subtype; AA women continued to experience statistically significant poorer survival (HR = 2.1, 95% CI 1.1–4.0). In the ER−PR−HER2+ group, composed of only 33 women, the survival differences were decreased by 50% (HR = 2.3, 95% CI 0.8–6.4) after this adjustment. Additional adjustment for treatment did not relevantly impact the observed survival differences in the TNT group (HR = 2.0, 95% CI 1.0, 3.7).

Discussion

Our study of this cohort of women under age 55 diagnosed with breast cancer in the early 1990s revealed that triple negative tumors (TNT) were the most common breast cancer subtype diagnosed among African–American (AA) women; TNT accounted for nearly 47% of their breast tumors, compared to 22% among white women. Moreover, every age subgroup and every stage subgroup presented this marked racial difference in TNT prevalence, suggesting that in this already young population it is not merely age or stage at diagnosis that is driving these differences. Indeed, the significant double prevalence for AA women persisted, when concurrently considering differences in age, stage at diagnosis, tumor grade, diagnosis delay, and socio-demographic factors (OR = 1.9). Our findings are consistent with the Carolina Breast Cancer Study (CBCS), where Carey et al. [18] reported that pre-menopausal AA women were most likely to present with TNTs (39%) vs. pre-menopausal non-AA women (14%); but racial differences were not observed in post-menopausal women. After adjustment for age and stage, TNTs were similarly twice as likely among AA women vs. white women in that population-based study (OR = 2.1). A California registry study, although missing nearly 45% of the data to construct triple subtypes, also found the highest prevalence for TNT among AA women (25%), followed by Hispanics (17%), and white women (11%) [28]. Among non-population based studies in Michigan and Philadelphia, higher TNT prevalences among AA women compared to white women have also been noted [29, 30]. Similar to previous studies [18, 30], we observed no racial differences in the other subtypes, including the other ER−PR− subtype (ER−PR−HER2+).

The population proportion of TNT we report is substantially greater than prior estimates [23]. This could partly be attributable to differences in assay kits and antibodies, processing and storage, cutoff points for assays, and disease stage. Additionally, most prior estimates were based on studies consisting primarily of homogenous Caucasian populations, both pre-and post-menopausal. Thus, the differences may be attributable to our younger cohort with large representation of AA women. Similar to AA women in the United States, early onset and more aggressive breast cancers are very common in sub-Saharan African women, but peak incidence occurs at an earlier age in the sub-Saharan region, between 35 to 45 years of age [65]. In a study of breast cancer among Nigerian women, 59% of cases were ER−negative and HER2−negative [66]. That this distinctly high prevalence of TNT among AA women in our study remained after accounting for age, stage, and grade at diagnosis, as well as poverty index, suggests it is not merely differences in access to care or social deprivation that drives this propensity for triple negative tumors in AA women. The consistency across the studies strongly suggests a role for African ancestry, associated genetic variations, unmeasured risk and biological factors, and gene–environment interactions in the etiology of TN breast cancers.

There is a clear need to refine etiologic heterogeneity through evaluating interactions among genes and environmental exposures i.e. behavioral risk factors or system-related factors (gene–environment interactions ) or through surrogates such as our study which bases subtypes on tumor protein expression [phenotype–environment interactions]. Very few studies have examined risk factors or racial differences in risk factors that could contribute to heterogeneity in breast cancer subtypes. A population-based study of Polish women identified increased BMI as a factor which significantly reduced risk of the Luminal A (ER+PR+HER2−) subtype (OR = 0.71), but not the basal-like (ER−PR−HER2−, CK5+ and/or HER1+) subtype (OR = 1.18, 95% CI 0.86–1.64). Furthermore, the association was exclusively in pre-menopausal women. This risk heterogeneity is consistent with our findings that overweight/obese women were at 2–3 fold increased odds for the TNT subtype (data not shown). However, the Polish study also suggests that our findings are driven more by the decreased propensity for Luminal A cancers among overweight/obese women, rather than overweight/obesity increasing risk for TNTs. In the very recent study of the CBCS population, Millikan et al. [67] suggest that racial differences in parity and breast feeding history account for much of the increased prevalence of TNTs among AA women.

There is compelling evidence that TNT, or at least those that are the basal-like phenotype, may arise through distinct stem cells or through early genetic alterations e.g. inherited susceptibility as in BRCA1 carriers, thus predisposing to earlier-onset disease [26, 63, 68]. Similar to other studies, TNT in our study exhibited a ‘BRCA1-ness’ in that they were uniquely associated with high grade and mitotic activity, medullary and atypical medullary histologies, high expression of p53 and cyclin E, and low expression of cyclin D1 [24, 68, 69]. The proclivity for hereditary BRCA-related breast cancers in Ashkenazi Jews is well established. Among AA women, a similar defect in the BRCA pathway that is yet to be identified, hereditary or sporadic (genetic or epigenetic), may underlie the high prevalence of TNT that we and others have reported.

To improve patient outcome, it will be necessary to identify tumor targets/proteins for directed therapies. Potential targets have been proposed in the last year, but the literature suggests that no one target is 100% up-regulated among triple negative tumors and those most effective treatments will be those that target a combination of proteins [23, 70]. In our study, although several proteins were uniquely up-regulated in the TNT subtype (i.e. Cyclin E, p16, and p53), the highest proportion was seen for p53; over-expression was noted in 51%. We previously reported racial differences in this population for many of these same tumor proteins, suggesting that AA descent may be a determinant, or surrogate, for defects in specific pathways [31]. Our current study suggests this may be particularly so for those associated with the aggressive TNT subtype. Thus, different and more aggressive therapeutic strategies will be required in treating these tumors. Phase I and II trials targeting other specific proteins and pathways are underway [23]. Considering the high prevalence of TNTs among AA women, current and future trials must take care that there be commensurate AA, and other minority, representation.

Through earlier detection and improved treatments, mortality rates among women with breast cancer have declined over the last decades and 5-year relative survival has risen from 75% to 90% [1]. Yet mortality differences between AA and white women have grown ever wider and survival for AA women today is equivalent to that of white women from 25 years ago. Our study suggests that, beyond factors related to socioeconomic status, a large proportion of AA women have not been able to reap the benefits of improved treatments because of the intrinsic biology of their tumors. Consistent with other studies [13, 18], the poorest survival in our study was among women with ER−PR−HER2+ expressing tumors and the TNT, the worst for the former subtype. This subtype now benefits from trastuzumab therapy (Herceptin) which specifically targets HER2+ tumors, and has lead to marked improvements in outcome [71, 72]. Thus, TNT may now carry with them the worst clinical prognosis. Furthermore, within the TN subtype, we found that the risk of death was twice as high for AA women as for white women, after adjusting for other tumor, treatment, and socio-demographic factors. Additional adjustment for co-morbid illnesses had no affect on the mortality differences (data not shown); although in this young cohort of women, very few had co-morbid illnesses that would impact treatment options or survival. This suggests that beyond socio-demographic and system-related factors that may lead to poorer outcome among AA women, the currently available chemotherapeutic regimens may be less effective in AA women. The poorer survival associated with TNT, combined with the high prevalence of TNT among young and AA women, bode a poor clinical course for young women, AA women, and particularly young AA women with breast cancer. Considering the high prevalence of TNTs among AA women and the accompanying worse prognosis, strategies must encompass socio-demographic factors and treatment that is both targeted and adherent in order to overcome the foreboding poor outcome.

This is the first population-based study to characterize major breast cancer subtypes based on centralized pathology review and testing, permitting standardized criteria for assessment thereby reducing the potential for measurement error. It is also the first to investigate the association of TNTs with breast tumor proteins important to regulation of the cell-cycle or to programmed cell death (apoptosis); proteins which may have particular importance in early age breast carcinogenesis, aggressiveness, or clinical outcome [31]. In addition to standard prognostic factors, our survival analyses are the first to account for delay in treatment, type of treatment received, and co-morbidities. The comprehensive array of standard prognostic factors, treatment factors, and socio-demographic indicators is, we believe, a major strength. Finally, the nearly 500 cases and 30% African–American representation, allowed for multivariate adjustment of these factors by major triple subtypes of breast cancer, so as to examine factors which could explain these racial differences. However, there is the possibility that we may have not completely accounted for SES and lifestyle differences across the lifecourse that may have contributed to the higher TNT prevalence among AA women. We were also limited in our ability to include interaction terms in our multivariate analyses. Also, though not completely concordant, TNT are typically associated with the basal-like phenotype, the latter characterized by over-expression of cytokeratins 5, 6, and 17 or by up-regulation of EGFR (HER1), as well as TN status. While we analyzed a number of proteins potentially associated with breast cancer early carcinogenesis and outcome, we did not analyze markers considered specific to the basal-like phenotype. Additionally, we considered HER2 IHC values of 2+ as positive (and these were not confirmed by FISH analyses), hence a large proportion of our HER2+ cancers could in fact be triple negative. This would likely increase the incidence of TNTs in both AA and white women, but should not have major impact on the racial differences we report. Also, our findings may not be generalizable to women of all ages or from other national or international regions as in this population-based setting, only younger women from three counties of Atlanta were included. A more serious limitation is that tumor characteristics could not be assessed for all women in the cohort. However, as in prior analyses, we weighted the sample to address these issues and we believe our estimates are not largely influenced by this bias, suggesting that residual selection bias does not significantly alter our findings.

In conclusion, the high prevalence of aggressive triple negative tumors among younger women and particularly younger AA women, along with the unique protein expression patterns and poorer survival noted with these breast cancers, suggests varying gene–environment etiologies with respect to age and race/ethnicity and ultimately a direct connection to prognosis [73]. The necessity to identify gene–environment interactions that program biologically distinct breast cancer subtypes such as triple negative tumors (TNTs) among young women, African–American women, and in nearly half of young AA women, is apparent. Regarding survival; poverty, tumor biology, and treatment impact overall survival, but fail to explain the poorer survival experienced by AA women with the TN subtype. The increased risk of death for women with TNTs, and particularly AA women, underscores the need for better characterization of TNT signatures and to develop effective targeted therapies and expeditiously translate them into clinical practice. It is clearly time for breast cancer to be interrogated as a heterogeneous group of diseases with acknowledgment of the observed differences in its molecular biology; racial, age, and otherwise. In doing so, we may be able to offer new hope to those in greatest jeopardy.