Introduction

Since current prognostic and predictive factors for breast cancer do still not provide optimal risk stratification, additional information is necessary for improved tailored treatment for the individual patient.

In a recent review, four main molecular classes of breast cancer were reported, which have been identified by gene expression profiling [1]. According to the classification of Perou et al., these four classes are proposed as basal-like type tumors, luminal cancers (A and B), and Her-2 positive cancers. These subgroups correspond reasonably well to the clinical characterization on the basis of hormone receptor status, HER2 expression, and differentiation level [2]. At least 15% of the breast cancers are basal-like, of which the majority comprises the so-called triple-negative cancers [3]. Triple-negative breast cancer is a subtype of invasive breast cancer defined as estrogen receptor (ER)-negative, progesterone receptor (PgR)-negative, and HER-2-negative phenotypes. These triple-negative breast tumors are more likely to affect younger women and are associated with a more advanced stage. Regardless of the stage at diagnosis, women with triple-negative breast cancers have a worse outcome compared to other breast cancer subtypes [4]. Currently, there are no specific treatment guidelines for triple-negative breast cancer [5]. At this moment, neither currently used clinico-pathological parameters nor molecular profiling techniques are able to subdivide this set of patients with respect to prognosis [1, 6, 7]. Therefore, there is a strong need to develop additional parameters for this subgroup.

Metastasis is the leading cause of morbidity and mortality in early breast cancer patients [8]. Tumor invasion and metastasis are considered to be a multifactor process involving complex interactions of biological pathways [9]. Tumor-associated stroma and cancer-associated fibroblasts may play an important role in these tumor progression phases [10]. It is hypothesized that processes similar to wound-healing response are activated, which results in cell motility, angiogenesis, and matrix remodeling. In fact, for breast cancer, a wound-response gene expression signature, a fibromatosis signature, and a stromal signature have been found to be predictors of patient outcome and tumor progression [1013]. These data suggest that stroma in the vicinity of tumors undergoes, or may be affected by, changes during tumor progression. A recent study of our group further supports this hypothesis. In this study, tumors with distinct patterns of intra-tumor stromal percentage were identified in a population of colon cancer stage II patients who had not received chemotherapy or radiotherapy. We defined two groups, stroma-low (<50%) and stroma-high (≥50%) patients, showing statistically significant differences in survival [14, 15]. In this study, we hypothesize that tumors with a high stroma production (stroma-rich tumors) may reflect poor tumor biology and thus a worse outcome. We have evaluated the prognostic value of this stroma parameter in a set of 574 breast cancer patients. Analyses were stratified for adjuvant systemic therapy to correct for possible interactions and furthermore for the triple-negative cancer patients to determine the prognostic value of the tumor–stroma ratio within this subset of patients.

Materials and methods

Study population

In a retrospective cohort study, patients were included with non-metastatic invasive breast cancer who were primarily treated with surgery in the Leiden University Medical Center (LUMC) between 1985 and 1994 (n = 677). Patients with a history of cancer (other than basal cell carcinoma of the skin or in situ carcinoma) or bilateral tumors were excluded. The following data were available: age, tumor grade, histological type, TNM stage, local and systemic therapy, locoregional/distant recurrence, second primaries, and overall survival. Expression of ER, PgR, Ki67, and HER2 were a priori centrally determined according to standard diagnostic procedure, using standard histological staining protocols and automated microscopy determining the histoscore for quantification. The historscore considers both intensity and percentage of stained cells and, therefore, is considered to be a very complete method for quantification. In addition, all tumors were histologically graded according to current pathological standards by a pathologist (VS).(van Nes JGH, de Kruijf EM, Faratian D, van de Velde CJH, Putter H, Falconer C, Smit VTHBM, Kay C, van de Vijver MJ, Kuppen PJK, Bartlett JMS COX2 expression in prognosis and in prediction to endocrine therapy in early breast cancer patients. Accepted for publication) Systemic treatment criteria of breast cancer patients from 1985 to 1994 were as follows: endocrine tamoxifen treatment was administered to patients with positive lymph node metastasis; chemotherapy was given to younger patients with larger and higher grade tumors and more positive lymph node metastasis. Approval was obtained from the Leiden University Medical Center Medical Ethics Committee. All the samples were handled in a coded fashion, according to National ethical guidelines (“Code for Proper Secondary Use of Human Tissue”, Dutch Federation of Medical Scientific Societies). The REMARK criteria were respected for analyses of the tumor–stroma ratio and writing of this article [16].

Staining and evaluation

Formalin-fixed paraffin-embedded (FFPE) tumor blocks of the primary tumor were collected from pathology archives. Sections were cut and stained with Haematoxylin and Eosin (H&E) according to standard histological protocols. Tumor–stroma ratio was quantified as described before [14]. Using a 5× objective, the most invasive tumor area of the whole tissue slide was selected. Subsequently, using a 10× objective, only fields were scored where both stroma and tumor were present and, most importantly, tumor cells were seen on all sides of the microscopical image field (north–east–south–west). The tumor–stroma ratio was visually estimated in a blinded manner by two investigators (EDK, JVN) and scored per tenfold percentage (10, 20, 30% etc.). In case of heterogeneity, the highest stromal percentage was considered decisive.

In some cases of necrosis, mucus-forming tumors or tumors with compartments of ductal carcinoma in situ (DCIS), quantification of the tumor–stroma ratio was more difficult, but still possible. In these cases, quantification was performed by ignoring these compartments (by visual eye balling) when determining the stroma percentages. Representative examples of microscopical fields selected for tumor–stroma ratio quantification from stroma-rich and stroma-poor tumors are shown in Fig. 1.

Fig. 1
figure 1

Haematoxylin and eosin (H and E)-stained 4-μm paraffin sections of primary breast tumors; 100× magnification (10× objective). a Tumor–stroma ratio estimated as 80%: stroma-rich. b Tumor–stroma ratio estimated as 20%: stroma-poor

Statistical analysis

Data were analyzed using the statistical package SPSS for Windows 15.0 (SPSS Inc, Chicago, IL, USA). Cut-off of the tumor–stroma ratio was taken as 50% as previously determined in colon cancer by maximum discriminative power, which was also confirmed in this breast cancer population (supplementary Table 1) [14]. Intra-observer variability was analyzed using the Cohen’s kappa coefficient. Relationship between tumor–stroma ratio and well-established factors was investigated with the chi-squared test. The Kaplan–Meier method and log-rank test were used for analysis and comparison of survival curves. Relapse-free period rates were reported as cumulative incidence functions, after accounting for death as competing risk [17]. Cox regression was used for univariate and multivariate analyses of relapse-free period (RFP) and overall survival (OS). For all the models, we performed a link test for possible interaction. In addition, goodness of fit and proportional hazard assumptions were checked with COX–Snell residuals and Schoenfeld residuals, respectively. Variables included in multivariate analysis were all those variables which were of influence on outcome of patients in univariate analysis (P < 0.1). RFP was the time of date of surgery until a relapse (locoregional or distant relapse); OS was the time of date of surgery until death. Stratifications based on adjuvant treatment, to account for adjuvant therapy effects on prognosis of patients, and on triple-negative breast carcinoma patients were performed in survival analyses.

Table 1 Correlations between tumor–stroma ratio and well-established prognostic factors

Results

Clinico-pathological features

The FFPE blocks were available from pathology archives for 86% (574/677) of the patients. No statistically significant differences in clinico-pathological parameters were seen between patients with and without available FFPE blocks. Median age of patients was 57 years (23–96) and the median follow-up time was 19 years (14–24 years) of patients alive. Clinico-pathological, local/systemic treatment, and outcome characteristics of these patients are shown in Table 1.

Tumor–stroma ratio in breast cancer

Routine H&E-stained slides from the most invasive part of the tumor were microscopically analyzed for the presence of stromal involvement using a 5× and a 10× objective. We observed areas with abundant stroma (stroma-rich) with a size as large as one microscopic field (10× objective, 100× total magnification), but also larger areas matching 2–4 fields were seen or even more, independent of the size of the tumor. Estimation of the tumor–stroma ratio was performed successfully in all the tumors (100%). Cohen’s kappa coefficient revealed an almost perfect agreement in classification (kappa = 0.85; 94% concordance in classification) for a set of tumors (32% of total set) scored by both observers.

Tumor–stroma ratio and prognostic associations with outcome

Out of the whole cohort (n = 574), 264 (46%) patients had a relapse of disease, and 370 (54%) patients deceased. A total of 388 (68%) patients were classified as stroma-rich and 186 (32%) patients as stroma-poor. A worse RFP (Hazard ratio (HR) 1.62; 95% CI 1.23–2.13; P = 0.001) (Table 2) and OS (HR1.29; 95% CI 1.03–1.60; P = 0.025) (supplementary Table 2) were found for patients with stroma-rich tumors as compared to patients with stroma-poor tumors, with 5 year RFP and OS of, respectively, 64 vs. 79%, and 71 vs. 83% (Fig. 2a, b). In multivariate analysis, the tumor–stroma ratio was an independent prognostic variable for RFP (HR 1.97; 95% CI 1.47–2.64; P < 0.001) (Table 2) and OS (HR: 1.50; 95% CI 1.18–1.91; P = 0.001) (supplementary Table 2) independent of other clinico-pathological parameters.

Table 2 Cox univariate and multivariate analysis for recurrence-free period of all patients. Number of patients (N), hazard ratio (HR), 95% confidence intervals (95%CI) and p-values are given
Fig. 2
figure 2

Kaplan–Meier graphs for tumor–stroma ratio for the total patient population. Patients with stroma-rich tumors show a significant worse relapse-free period (a) and overall survival (b) compared to patients with stroma-poor tumors

Next, in order to account for systemic therapy effects, subanalyses were performed based on systemic therapy administration. In the group of patients that received only local therapy, 66% (244/369) had a stroma-rich tumor, and 34% (125/369) a stroma-poor tumor. Patients with a stroma-rich tumor showed a worse clinical outcome for RFP (HR 1.55; 95% CI 1.10–2.18; P = 0.012) and OS (HR 1.25; 95% CI 0.97–1.62; P = 0.098), with 5 year RFP of, respectively, 66 and 81%, and OS of 76 and 82%. In multivariate analysis, tumor–stroma ratio was an independent parameter for RFP (HR 2.06; 95% CI 1.42–2.97; P < 0.001).

The prognostic influence of tumor–stroma ratio in patients treated with chemotherapy (130/574) and endocrine therapy (93/574) was determined. Within the patients treated with chemotherapy, 68% (88/130) had stroma-rich tumors and 32% (42/130) stroma-poor tumors. Patients with a stroma-rich tumor showed a trend toward a worse outcome for RFP (HR 1.66; 95% CI 0.94–2.93; P = 0.082) and OS (HR 1.72; 95% CI 0.99–2.98; P = 0.054), with 5 year RFP of respectively 63 and 79%, and OS 65 and 83%. In multivariate analysis, tumor–stroma ratio showed to be of independent influence on RFP (HR 1.83; 95% CI 1.04–3.25; P = 0.038).

In patients treated with endocrine therapy, stroma-poor and stroma-rich tumors were found in 71% (66/93) and 29% (27/93) of the tumors, respectively. Associations with outcome were similar to those found for the chemotherapy-treated patients. Patients with stroma-rich tumors showed a trend toward a significantly worse RFP (HR 1.98; 95% CI 0.95–4.11; P = 0.068) and OS (HR 1.72; 95% CI 0.99–2.98; P = 0.054), with 5 year RFP of, respectively, 58 and 70%, and OS of 61 and 89%. In a multivariate analysis for RFP, both tumor stage and tumor–stroma ratio (HR 2.59; 95% CI 1.13–5.91; P = 0.024) resulted as independent prognostic factors.

For all subanalyses based on systemic treatment, interaction models were introduced in a Cox regression analysis. These analyses revealed that there was no statistically significant interaction between tumor–stroma ratio and systemic therapy (P = 0.684), chemotherapy (P = 0.916), and endocrine therapy (P = 0.506). These results further indicated that the effect on clinical outcome of tumor–stroma ratio was unrelated to systemic therapy.

Tumor–stroma ratio and prognostic associations with outcome in triple-negative breast carcinomas

Of the set of patients with a triple-negative breast carcinoma (n = 82) 35 (43%) patients had a relapse of disease, and 55 (67%) patients deceased. Of these patients, 56% (46/82) had a stroma-rich tumour, and 44% (36/82) stroma-poor. Patients with stroma-rich tumors had a statistically significant worse outcome compared to patients with stroma-poor tumors for RFP (HR 3.91; 95% CI 1.49–6.83; P = 0.003) (Table 3) and showed a trend toward a worse survival for OS (HR 1.60; 95% CI 0.93–2.74; P = 0.088) (supplementary Table 3), with 5 year RFP and OS of respectively 56 vs. 81% and 44 vs. 83% (Fig. 3a, b). In a multivariate analysis, grade, nodal status, and tumor–stroma ratio were of independent influence on RFP (tumor–stroma ratio: HR 2.92; 95% CI 1.36–6.32; P = 0.006) (Table 3). When corrected for other clinico-pathological parameters, patients with stroma-rich tumors had a statistically significant worse OS compared to stroma-poor tumors (HR 1.87; 95% CI 1.07–3.26; P = 0.028) (supplementary Table 3).

Table 3 Cox univariate and multivariate analyses for recurrence-free period of triple negative breast carcinoma patients
Fig. 3
figure 3

Kaplan–Meier graphs for tumor–stroma ratio for triple-negative carcinoma patients. Patients with stroma-rich tumors show a significant worse relapse-free period (a) compared to patients with stroma-poor tumors. A trend was seen toward a worse survival for patients with stroma-rich tumors compared to patients with stroma poor tumors (b)

Triple-negative carcinoma patients treated with local therapy only showed similar results for influence of tumor–stroma ratio on RFP (HR 4.12; 95% CI 1.49–11.39; P = 0.006). Five-year RFP for patients with stroma-rich compared to stroma-poor tumors was, respectively, 58 and 88%. The tumor–stroma ratio was independent of all other variables, since it was the only parameter which met multivariate inclusion criteria.

Tumor–stroma ratio remained of independent influence on outcome in patients in the non-triple-negative group, when corrected for other clinico-pathological parameters (RFP: HR 1.50 95% CI 1.09–2.07 P = 0.013).

An interaction model for tumor–stroma ratio with triple-negative cancer patients was introduced in Cox regression analysis. This revealed a significant interaction between the two variables on clinical outcome (P = 0.018), further supporting our hypothesis that a relationship between tumor–stroma ratio and triple-negative cancer exists.

For all Cox regression models, we performed a link test. There seems to be no interaction in the models: For OS, the whole patient cohort and triple-negative group link test P-values were, respectively, P = 0.6 and P = 0.9; For RFP, the whole patient cohort, patients only treated with local therapy, chemotherapy-treated patients, endocrine-treated patients, all triple-negative cancer patients, and triple-negative cancer patients who received only local therapy, link test P-values were, respectively: 0.05, 0.2, 0.2, 0.5, 0.9, and 0.9. Besides this, we checked for goodness of fit visually with COX–Snell residuals; all the models showed to have a good model fit. Finally, we checked for the proportional hazards assumption based on the Schoenfeld residuals for the models with patients who received systemic therapy (chemotherapy treated: P = 0.2; endocrine treated: P = 0.3) and triple-negative patients (P = 0.6) for RFP, and found no evidence that the models violated the proportional hazards assumption.

Discussion

Our study shows that the tumor–stroma ratio is an independent prognostic factor for breast cancer patients. Stroma-rich tumors were associated with an increased risk of relapse. Importantly, for the triple-negative breast carcinoma patient group, a group for which no definite prognostic biomarkers are available, the tumor–stroma ratio appeared to be a prognostic parameter which significantly correlates with a better 5 year relapse-free period rate of 81% (stoma-poor) compared to 56% (stroma-rich) [7]. In addition, tumor–stroma ratio showed the same prognostic power for patients treated with endocrine and chemotherapy, indicating that the prognostic effect was independent of systemic therapy. Importantly, high intra-observer agreement Kappa values for tumor–stroma ratio, as found in this study and prior studies on colorectal cancer and esophageal cancer, convincingly demonstrate that tumor–stroma ratio is a highly reproducible method. For our breast cancer population, Cohen’s kappa coefficient was 0.85. For colorectal cancer, kappa values of three different pathologists’ intra-observer agreements varied between 0.596 and 0.702 [14, 15]. For esophageal cancer, Cohen’s kappa coefficient was 0.859 [18]. These kappa values can be translated as almost to be in perfect agreements. In addition, the tumor–stroma ratio can be easily assessed and provides no additional costs above standard diagnostics [15]. Therefore, we believe that the tumor–stroma ratio may be a candidate parameter to be implemented as a standard procedure within pathology laboratories.

Many prognostic and predictive factors have been found for breast cancer. The ASCO guidelines advice the use of different prognostic and predictive factors in clinical practice: ER, PgR, HER2, urokinases plasminogen activator (uPA), plasminogen activator inhibitor-1 (PAI-1), and gene profiles detected with multiparameter gene expression assays [19]. ER, PgR, and HER2 are well known parameters and are determined by the use of immunohistochemistry. For ER and PgR, different antibodies and quantification methods (i.e., percentage of positive stained cells, Allred score, and histoscore) are used, which are reported with variable results [20]. For the determination of HER2 expression, many different antibodies are utilized in clinical practice, which may result in a broad aspect of results [21]. However, because of the great prognostic and predictive effect of HER2 and the many side-effects and high cost of trastuzumab therapy, accurate determination of HER2 overexpression in tumors is of crucial importance. UPA and PAI-1 markers are still subject of debate and currently evaluated in the prospective Node-Negative Breast Cancer III (NNBC 3) Europe Trial [22]. Microarray-based prognostic tools, like the MammaPrint, a 70-gene expression profile, and Oncotype DX, a 21-gene expression profile, can be used for prognostication of breast cancer patients according to the ASCO guidelines [23]. However, the clinical value of gene expression profiling is currently being debated and under investigation [24, 25]. Recent evidence shows that gene profiles do not outperform prognostication with current clinico-pathological parameters [26, 27]. Evidence showing that complex gene signatures quantify evident tumor characteristics, such as tumor grade, ER, HER2, cell cycle, and cell proliferation, is accumulating [26, 2830]. Moreover, it is noteworthy that gene expression arrays are not suitable for all tumors. Frozen material is often needed, and when applying gene expression array analyses for tumor tissue, it is common practice to select those parts of the tissue in which tumor cells form the major component, as admixtures of abundant stroma and inflammatory cells will lead to masking of amplifications and deletions. This may lead to exclusion of tumors with a high percentage of stroma (>50%) for gene expressionarray analysis, as these samples do not meet the criteria for a reliable array. Our study shows that a high percentage of intra-tumor stroma results in a worse outcome for patients, and obviously this set of patients is not eligible for gene expression analysis. Therefore, we suspect that arraying techniques may form a selection bias for patients with a better prognosis and that the tumor–stroma ratio provides additional prognostic information to gene expression profiles. Adjuvant! Online (http://www.adjuvantonline.com) is another tool to quantitatively estimate the prognosis and response to systemic treatment of patients with early breast cancer. It is based on age, ER status, histological grade, tumor size, and lymph node status. Although Adjuvant! Online is a useful method to predict the prognosis of disease and benefit of treatment, it needs more validation and modification because it is too optimistic for subgroups enriched with adverse prognostic factors such as HER2 overexpression and negative PgR [31]. Our tumor–stroma ratio parameter was of prognostic value independently of all parameters on which Adjuvant! Online is based and is, therefore, a candidate parameter to prognostication in addition to this digital tool.

The use of systemic therapy for triple-negative breast cancer patients remains controversial. Triple-negative carcinomas, in general, have a relatively poor prognosis, but are a heterogeneous group [5]. In the triple-negative carcinomas, a subgroup of patients has been associated with improved prognosis which could not be identified by gene expression-based prognostic tools [1, 6, 7]. According to a meta-analysis of publicly available gene-expression and clinical data, all gene-expression signatures showed to have the best performance on survival prediction for ER-positive tumors, whereas they were not significantly of influence on outcome for tumors with HER2 overexpression and triple-negative carcinomas [30]. In this article we show that the tumor–stroma ratio is a strong independent prognosticator for triple-negative carcinoma patients.

The majority of studies of neoplastic transformation have focused on events that occur within cancer cells. Other studies have addressed the microenvironment of tumor cells supporting tumor progression [12, 13, 32]. Recent study now provides more insight into possible initiation and progression of malignant cells. The phenomenon of tumor–stroma ratio might be explained by biological processes involved with breast cancer invasion and metastasis. Various mechanisms underlie these processes, including loss of cell adhesion, proteolysis, matrix remodeling, and cytoskeletal rearrangements. Increasing evidence supports the role of the tumor microenvironment (i.e., fibroblasts, myoepithelial cells, macrophages, proteases etc.) in stimulating tumor progression, invasion, and metastasis [32]. Increase in abundance of fibroblasts in a tumor causes deposition of fibrotic extra-cellular matrix (ECM). Changes in ECM structure can be further stimulated by proteases, which degrade stroma. Together, this results in disruption of epithelial tissue and remodeling of the ECM, facilitating invasion of tumors cells. In addition, fibroblasts secrete growth factors, angiogenic factors, and inflammatory factors, which all contribute to tumor progression and expansion [33, 34]. In vivo studies on cell lines and in mice support this theory of cancer-associated fibroblasts. In prostatic tumors and breast tumors, it was shown that fibroblast from tumor environment, compared to fibroblasts derived from areas that were not intimately associated with invasive carcinoma, significantly increased growth of epithelium and provided better support for cancer growth [35, 36]. These findings all support the hypothesis that tumor-associated stroma is of influence on the tumor’s invasive behavior.

Future plans are necessary to validate our findings, especially for the triple-negative patients on a larger patient set, to analyze the underlying mechanisms of the stroma formation using molecular techniques and model systems, and ultimately come to the development of new agents.

In conclusion, tumor–stroma ratio has shown to be a prognostic parameter in breast cancer patients, independently of other clinico-pathological parameters and systemic therapy. Furthermore, we were able to stratify the triple-negative cancer subgroup according to their risk of relapse, with a three times higher risk of relapse for patients with stoma-rich tumors. The tumor–stroma ratio is easy to determine, reproducible, and quickly performed for all types of breast cancers. In our opinion, it is a candidate parameter that could easily be implemented in routine pathology diagnostics, to optimize risk stratification for breast cancer patients, especially for the triple-negative carcinoma subgroup.