Introduction

Breast cancer is a heterogeneous disease of different molecular subtypes. Currently, the most simple and applicable stratification of breast cancer is based on expression of the hormone receptors for both estrogen (ER) and progesterone (PgR) as well as the human epidermal growth factor receptor 2 (HER2) [36]. Based on these three receptors, tumors are characterized as hormone receptor positive, HER2 positive (i.e., amplification or overexpression of HER2), or triple negative breast cancer (TNBC) lacking the expression of all three receptors. In addition, several refined stratifications applying genomic methods or the inclusion of additional immunohistochemical markers (e.g., Ki67) allow the distinction of “Basal-like” breast cancers as well as “Luminal A” and “Luminal B” subgroups each with different prognosis and clinical behavior [2729]. In recent years, it became increasingly clear that the subtype composition of a dataset can strongly influence the prognostic and predictive gene signatures derived from it [36, 39]. Often these “first generation” signatures represent a surrogate marker for the subtype distinction itself [29]. As a consequence, several recent guidelines have suggested to analyze subtypes of breast cancers separately and to derive subtype-specific genomic tests [12, 19].

In previous work, we had assembled a large dataset of TNBC and used an unsupervised method to derive signatures which are capable of delivering highly significant prognostic information within the TNBC subgroup [31]. In the present study, we analyzed whether this signature has also prognostic value within other subtypes of breast cancer. Interestingly, we could demonstrate that despite the signature identified similar fractions of samples among the different subtypes its prognostic significance was restricted to triple negative and basal-like breast cancer. Our results underline the importance of subtype-specific analyses and suggest a sequential multistep approach for application of future genomic tests in the clinic.

Methods

All analyses were performed according to the “REporting recommendations for tumour MARKer prognostic studies” (REMARK) [24, 35] and the respective guidelines to microarray-based studies for clinical outcomes [7]. A diagram of the complete analytical strategy and the flow of patients through the study, including the number of patients included in each stage of the analysis, is given in Fig. 1. We compiled Affymetrix gene expression data (U133A or U133Plus2.0 arrays) of 4,467 breast cancer patients from 40 publicly available datasets (Supplementary Table S1). However, 329 triple negative cases of these 4,467 samples have been used before as a finding cohort in our previous studies [17, 31]. Thus, these samples had to be excluded from all subsequent analyses resulting in 4,138 applicable samples. Affymetrix CEL files were processed with the MAS5.0 algorithm of the affy package [9] of the Bioconductor software project [10]. Data from each array were log2-transformed, median-centered, and the expression values of all the probesets from the U133A array were multiplied by a scale factor S so that the magnitude (sum of the squares of the values) equals one.

Fig. 1
figure 1

Diagram of the analytical strategy according to REMARK criteria. The analytical strategy and the flow of patients through the study is presented as recommended by the REMARK criteria [14]

To identify the intrinsic subtypes of breast cancer, we used two alternative previously described approaches: First, we used the simple method according to Hugh et al. [14] which is based on the expression of single marker genes (ESR1, PgR, HER2, and Ki67) to define TNBC-, HER2-, Luminal A-, and Luminal B-subtypes. The cutoff values for ESR1, PgR, and HER2 have been described previously [16]. For a distinction of Luminal A and Luminal B subgroups, all 2,884 ERpositive/HER2negative samples were selected and a median split according to Ki67 expression was performed. In addition, 106 ERpositive/HER2positive cases were assigned to the Luminal B subtype according to Hugh et al. [14]. As a second, alternative method for subtype determination, we applied a single sample predictor (SSP) [38] according to the centroid method using the gene set from Hu et al. [13]. The centroid analyses were performed separately in six larger datasets encompassing a total of 1,142 samples. However, 195 of those samples were used as finding cohort of triple negative cases in our previous studies [17, 31] and thus had to be excluded from further analyses presented here. This exclusion results in a reduced proportion of basal-like tumors among the remaining cohort of 947 samples (see Table 1). The individual assignments according to the two different methods of molecular subtype definition are given for each sample in Supplementary Table S2.

Table 1 Clinical characteristics of the samples compiled and those used in the study

We have previously reported an unsupervised analysis identifying metagenes that distinguish molecular subsets within TNBC [31]. In the present study, we calculated expression values for two of these metagenes, namely the B cell and IL8 metagenes, for all 4,467 samples as the mean expression of four (IL8 metagene) and 48 (B cell metagene) Affymetrix probesets, respectively. The probesets used are listed in Supplementary Table S3. We had also reported in our previous study that high expression of the B cell metagene was associated with good prognosis and high expression of the IL-8-related metagene with poor prognosis in TNBC. We had then combined the two metagenes in the previous study into a signature: Samples with both high B cell metagene and low IL-8 metagene expression were characterized as “Good Prognosis”; all the other samples were characterized as “Poor Prognosis.” The respective cutoff values (B cell metagene >0.005 and IL-8 metagene <−0.001) were defined in a discovery cohort of 394 TNBC [31]. In the present study, we now used these pre-defined cutoff values to assess the prognostic value of the B Cell/IL-8 signature within different molecular subtypes (see Supplementary Table S2 for the assignment of each individual sample). We then analyzed the event-free survival of patients according to this signature separately in the different molecular subtypes.

In the conduct of the presented analysis, event-free survival (EFS) was calculated as preferentially corresponding to the RFS endpoint, but measured with respect to the DMFS endpoint if RFS was not available. All results from survival analyses were verified by examining the effect of the different endpoints in stratified analyses. Followup data for those women in whom the envisaged end point was not reached were censored as of the last followup date or at 120 months. Subjects with missing values were excluded from the analyses. We constructed Kaplan–Meier curves and used the log-rank test to determine the univariate significance of the variables. A Cox proportional-hazards model was used to simultaneously examine the effects of multiple covariates on survival. The effect of each individual variable was assessed with the use of the Wald test and described by the hazard ratio with a 95 percent confidence interval (95 % CI).

Results

Compilation of Affymetrix microarray datasets

Figure 1 presents the analytical strategy and the flow of patients through the study. We compiled Affymetrix gene expression data of 4,467 breast cancer patients from 40 publicly available datsets (see Methods section and Supplementary Table S1). 329 triple negative cases of these samples had to be excluded since they were used in our previous studies as a finding cohort [17, 31] leaving 4,138 applicable samples for the analyses. Table 1 gives the clinical characteristics of all 4,467 samples and of the cohort of 4,138 samples used in the subsequent analyses.

Both B cell and IL-8 metagenes display highest expression in ER negative breast cancers

We first compared the expression of B cell and IL-8 metagenes among the different molecular subtypes of breast cancer. Molecular subtyping was performed by two alternative strategies as described in the Methods section: Either a single marker method according to Hugh et al. [14] or the centroid method applying a single sample predictor (SSP) [13, 38]. Results of the single marker method were available for all 4,138 samples: data from the centroid method for 947 cases. When the single marker method was used for stratification, we detected the lowest expression of both metagenes among the ER-positive Luminal subtypes, whereas high expression was observed in the ER negative subtypes, i.e., TNBC and HER2 (Fig. 2a, c). Similarly, when applying the centroid method for stratification, we detected the highest levels of both the IL-8 metagene and the B cell metagene in the basal-like subgroup (Fig. 2b, d). Again, the lowest expression of both metagenes was seen in the Luminal A and Luminal B subgroups (Fig. 2b, d).

Fig. 2
figure 2

Box plots of IL-8 and B cell metagene expression among different molecular subtypes of breast cancer. Expression of the IL-8 metagene is shown in panels (a) and (b), while expression of the B cell metagene is shown in panels (c) and (d). In a and c 4,138 breast cancer samples were stratified into molecular subtypes using single marker genes according to the method of Hugh et al. [14]. In panels b and d, a single subtype predictor (SSP) applying the centroid method was used to classify 947 samples from six larger dataset

Prognostic value of the IL-8/B cell signature depends on the ER status of the tumor

Since both B cell and IL-8 metagenes displayed higher expression in ER-negative disease, we next analyzed the prognostic value of the combined IL-8/B cell signature relation to the ER status of the tumor. The previously defined cutoff values were used to stratify patients in a “good prognosis group” characterized by both high B cell and low IL-8 expression and a “poor prognosis group” encompassing all the remaining samples. Followup information was available for 2,353 samples. 28.9 and 33.5 % of the samples were assigned a “good prognosis” in the ER-positive and ER-negative subgroup, respectively. This roughly similar relative proportion (about one third) of good prognosis samples in both ER-positive and ER-negative disease results from the reduced expression of both of the metagenes among ER-positive tumors (see Fig. 2). However, as shown in Fig. 3, the prognostic value of the signature is clearly dependent on ER status. We observed no prognostic value among ER-positive tumors (P = 0.36), while it was highly significant for ER negative disease (P = 0.039).

Fig. 3
figure 3

Prognostic value of the IL-8/B cell signature in ER-positive and ER-negative breast cancer. For 2,353 of the 4,138 samples, followup information was available (Table 1). The IL-8/B cell signatures were used to classify those samples into “good prognosis” or “poor prognosis.” Kaplan–Meier analyses of the samples are shown separetely for 1,941 ER-positive samples in (a) and 412 ER-negative samples in (b). A significant difference in survival was only observed in the ER negative subgroup (P = 0.039)

Prognostic value of the IL-8/B cell signature in the different molecular subtypes of breast cancer

For a more detailed analysis, we further stratified the samples according to the molecular subtypes of breast cancer. Again, we alternatively applied the two different methodologies for subtyping as given above. As shown in Table 2, in all subgroups, a considerable fraction of the samples (range 20.7–42.0 %) was assigned to “good prognosis” by the IL-8/B cell signature. We then analyzed the followup of the patients separately for the different subtypes. As demonstrated in Figs. 4 and 5, a significant prognostic value of the IL8/B cell signature was only observed in the TNBC subgroup (P = 0.035, Fig. 4a) and a strong trend in the Basal-like subtype (P = 0.061, Fig. 5a), respectively. Among all other subtypes, the IL-8/B cell signature displayed no significant prognostic value (Table 3).

Table 2 Frequency of samples with poor/good prognosis according to the IL-8/B cell signature among the molecular subtypes of breast cancer
Fig. 4
figure 4

Prognostic value of the IL-8/B cell signature in different molecular subtypes of breast cancer according to the single marker method. The IL-8/B cell signature was used to classify 2,353 samples with followup information as “good prognosis” or “poor prognosis.” Separate Kaplan–Meier analyses are given in panels ad for the four different subtypes of breast cancer stratified applying the single marker method. A significant prognostic value of the IL-8/B cell signature was only observed among triple negative breast cancers (panel a, P = 0.035)

Fig. 5
figure 5

Prognostic value of the IL-8/B cell signature in different molecular subtypes of breast cancer according to the centroid method. A single subtype predictor (SSP) applying the centroid method was used to classify 947 samples from six larger dataset into the molecular subtypes. For 787 of these samples followup information was available. Panels ae show separate Kaplan–Meier analyses for the different molecular subtypes classified by the IL-8/B cell signature. A strong trend for significant difference was only observed in the Basal-like subtype (P = 0.061), while no prognostic value was detected in the remaining subtypes

Table 3 Prognostic value of the IL-8/B cell signature in different molecular subtypes of breast cancer

Multivariate analysis of the prognostic value of the IL-8/B cell signature in TNBC

For 148 new TNBC samples that were not included in the finding cohort of our previous publication, both followup data and complete clinical information on lymph node status, tumor size, age, and histologic grading were available. Table 4 displays the results of a multivariate Cox regression including all these clinical parameters together with the IL-8/B cell signature. Only the IL-8/B cell signature was significant (HR 3.20, 95 % CI 1.43–7.16; P = 0.005) in that analysis.

Table 4 Multivariate cox regression of clinical parameters and the IL-8/B cell signature in TNBC

Discussion

Triple negative breast cancers (TNBC) are clinically heterogeneous and prognostic markers, and biology-based therapies are needed to better treat this disease [4, 8]. By applying a subtype-specific approach, we have previously identified a gene signature which demonstrates significant prognostic value within this subgroup of breast cancers [31]. In our current study, we show that this signature assigns similar proportions of samples to “good” or “poor” prognosis even among other subtypes of breast cancer. However, we failed to detect any significant prognostic value of the signature in subtypes other than TNBC or basal-like breast cancer. These results demonstrate the importance of subtype-specific analyses both for the development and for the application of gene signatures in a clinical setting.

Regarding the cellular source of expression of the gene signature, we had already demonstrated that lymphocyte infiltration is responsible for the high level of expression of the B cell metagene [3, 30, 32], while IL-8 is expressed by the carcinoma cells themselves [31]. Supplementary Figure S1 displays the analyses of Affymetrix microarray data including microdissected samples which are in line with our previous results. So far, we could only demonstrate a pure prognostic value of our signature since better prognosis was observed both for patients treated with chemotherapy as well as those without adjuvant treatment [31]. We detected only a modest predictive value of the B cell metagene for response to neoadjuvant chemotherapy in our datasets [17, 31], despite that an independent larger study clearly demonstrated a predictive value of lymphocyte infiltration within the TNBC subgroup [6]. Several recent studies suggest that the IL-8/B cell gene signature could potentially provide predictive value for specific therapeutic approaches. Inhibition of IL-8 signaling has been suggested as a target to block breast cancer stem cells and cancer’s inflammatory roots [11, 2022]. Consequentially, those TNBC patients characterized by high IL-8 expression and a poor prognosis would be first candidates for such an approach. On the other hand, T cell immunomodulators like anti-CTLA4 antibodies have shown great successes in some cancer patients at least in combination therapy [25, 33]. In this approach, it will be crucial to identify the right patients where one should take of the lid from immunosuppression [18, 26, 40] and a potential predictive value of the IL-8/B cell ratio could be worth testing.

Conclusions

A restriction of the prognostic value of different gene signatures and biologic processes to specific subtypes has been shown in many studies [2, 15, 23, 28, 29, 39]. For example, signatures consisting mainly of proliferation genes achieve their prognostic value only within ER-positive breast cancer. These subtype differences seem also to hold true for the next generation of genomic methods. Recent results from whole genome sequencing studies in breast cancer revealed that the number of somatic mutations varied markedly between individual tumors and the number of genes which were repeatedly found to be mutated at high frequency in is rather small [34, 37]. However, the frequency of mutations clearly differs between subtypes with TNBC and HER2-positive subtypes showing higher numbers of mutations than Luminal subtypes [1]. In addition, large scale integrated analyses of copy number variation and gene expression also suggest additional molecular stratification of breast cancer beyond the known intrinsic subtypes [5]. All these data underline the importance of stratified analyses in different subtypes and suggest a sequential multistep approach for application of future genomic tests in the clinic.