Introduction

Attempts have been made during the last 10 years to incorporate predictive factors for both the treatment response and the prognosis into the decision-making processes involved in the treatment of breast cancer (BC) patients [1]. Some biomarkers related to molecular subtypes of BC have already been included in clinical trials [2] and are likely to be incorporated into clinical practice soon [3]. Research studies on neoadjuvant therapy are an ideal setting not only for assessing the response to specific forms of treatment, but also for correlating the treatment response with the initially assessed prognosis.

Several recent studies have confirmed that the pathological complete response (pCR) after neoadjuvant chemotherapy is an outstanding predictive factor for the prognosis. However, these findings have mainly involved HER2-positive and triple-negative subgroups of patients [47]. In patients with hormone receptor–positive BC, pCRs appear to be too infrequent for this type of effect to be seen, due to the sample sizes in the published studies. Some biomarkers have already been quite consistently associated with a pCR after neoadjuvant chemotherapy; these include the estrogen receptor (ER), progesterone receptor (PgR), grading (G), and proliferation as assessed by Ki-67 [8].

Additional molecular markers for the prognosis have mainly been described in adjuvant therapy studies or in cohort studies without treatment. These studies have led to the development of what are known as multigene marker tests, in which hundreds or thousands of markers have been tested, some 20–80 of which have been validated as predictive for the prognosis [911].

The GeparTrio study recently reported on the benefits of a preplanned treatment change to an alternative chemotherapy regimen in patients who do not respond to neoadjuvant chemotherapy. The pCR rates did not differ between the groups of patients who received response-guided treatment and those who did not. However, particularly in the groups of patients with hormone receptor positive carcinomas, the treatment change in nonresponders improved the prognosis for these patients independently of pCR [12]. These results imply that tumors may behave differently in relation to the treatment response and prognosis.

The aim of the present study was therefore to investigate the value of six BC biomarkers previously described in the literature in relation to the treatment response and prognosis, in addition to established biomarkers and well-known predictive and prognostic factors, in a cohort of patients who were treated in standardized fashion with neoadjuvant anthracycline-based chemotherapy.

Materials and methods

Patients

The study population consisted of 100 patients with histologically confirmed invasive BC who were treated with a neoadjuvant chemotherapy regimen consisting of four cycles of epirubicin (90 mg/m2 body surface area) and cyclophosphamide (600 mg/m2 BSA) between January 2000 and December 2005 at the University Breast Center of Franconia (Bavaria, Germany). No other adjuvant chemotherapy was allowed in patients included in the study. No patient received neoadjuvant or adjuvant trastuzumab. Patients were eligible for neoadjuvant chemotherapy if they had a tumor that was at least 1.5 cm in size as assessed on ultrasound or mammography prior to neoadjuvant chemotherapy. A total of 225 patients were treated with neoadjuvant chemotherapy between 2000 and 2005. Ninety-five patients were excluded because they did not receive the standardized neoadjuvant chemotherapy regimen described above. Two patients had to be excluded because they did not complete the chemotherapy regimen before surgery, and 13 patients were excluded because they had metastases at the time of the initial diagnosis. Adequate core biopsy material was not available for immunohistochemistry in 15 patients. The study was approved by the institutional ethics committee of the Medical Faculty of the University of Erlangen–Nuremberg. Informed consent was obtained from each patient before inclusion in the study.

Clinical data

The University Breast Center of Franconia has received certification from the German Cancer Society (Deutsche Krebsgesellschaft) and the German Society for the Study of Breast Diseases (Deutsche Gesellschaft für Senologie). To obtain certification, a breast center has to document each case of BC, including patient and tumor characteristics, treatment data, and some epidemiological data (details are available at http://www.onkozert.de). Follow-up information regarding local recurrences, distant metastases, and death has to be provided for up to 10 years. All histopathological data also have to be documented from the original pathological reports, including tumor size, axillary lymph-node status, grading, and estrogen receptor, progesterone receptor, and HER2/neu status. As part of the continuous certification process and quality assurance measures, breast centers and the quality of their data are audited annually. Data obtained through these processes were used in the analysis presented here.

Immunohistochemistry

Immunohistochemical analyses of whole-section slides of the core biopsies were carried out before the neoadjuvant chemotherapy, using the following primary antibodies in accordance with the manufacturers’ instructions: estrogen receptor (1:200, monoclonal mouse IgG, clone 1D5; Dako, Denmark), progesterone receptor (1:200 monoclonal mouse IgG, clone PgR 636; Dako, Denmark), HER2/neu (1:200, polyclonal rabbit IgG, clone cerB-2; Dako, Denmark), Ki-67 (1:100, monoclonal mouse IgG, clone MIB-1; Dako, Denmark), cytokeratin 5/6 (1:50, CK5/6, monoclonal mouse IgG, clone D5/16B4; Zytomed, Germany), cytokeratin 8 (1:200, CK8, monoclonal mouse IgG, clone C51; BioGenex, USA), cytokeratin 18 (1:500, CK18, monoclonal mouse IgG, clone CY-90; Sigma, USA), Bcl-2 (1:100, monoclonal mouse IgG, clone 124; Dako, Denmark), cyclin D1 (1:200, monoclonal rabbit IgG, clone SP4; DCS, Germany), P53 (1:50, monoclonal mouse IgG, clone DO-7; Dako, Denmark), and topoisomerase IIα (TOP2A, monoclonal mouse IgG, clone 3D4; Abcam, UK).

Secondary antibodies were selected relative to the primary antibodies; biotinylated rabbit antimouse IgG (Dako) and biotinylated goat antirabbit IgG (Dako) were used. Staining of Bcl-2, p53, CK5/6, CK8, and CK18 was visualized using strep-AB complex. The horseradish peroxidase (HRP) method was used for ER, PgR, HER2/neu, and MIB-1. A biotin–tyramine enhancement solution was used for staining of cyclin D1 and topoisomerase IIα.

All of the slides were evaluated independently by two observers. If there was disagreement regarding the interpretation of the immunohistochemical staining results, slides were examined with a two-headed microscope until consent was reached. The Remmele–Stegner immunoreactive score (IRS) was used [13]. HER2 status was scored in the pathology reports as 0, 1+, 2+, or 3+ in accordance with the guidelines published by Sauter et al. [14]. Tumors with a score of 0 or 1+ were regarded as HER2-negative, and those with a score of 3+ were regarded as HER2-positive. Tumors with a score of 2+ were tested for gene copy numbers of HER2 using chromogene in situ hybridization. Using a kit with two probes of different colors (ZytoDot, 2C SPEC HER2/CEN17; Zyto Vision Ltd., Bremerhaven, Germany), the gene copy numbers of HER2 and centromeres from the corresponding chromosome 17 were retrieved. A HER2/CEN17 ratio of ≥2.2 was regarded as amplification of HER2. Scoring was carried out in a standardized way by a group of dedicated pathologists in routine surgical pathology.

The pCR assessment was based on histopathological reports, all from one institution. Patients with a ypT0 ypN0 assessment were considered to have achieved a pCR. No invasive or noninvasive residual tumor tissue in the breast or nodes was allowed, as in the assessment by Sinn et al. [15].

Statistical considerations

Continuous data are presented as means with standard deviation, and categorical data as frequencies and percentages. As the biomarker values did not have a normal distribution, median values and interquartile ranges are shown. Spearman correlation coefficients between biomarkers were calculated.

The prediction of pCR was studied with logistic regression analyses. Simple logistic regression models were used for patient and tumor characteristics (age at diagnosis, tumor size before chemotherapy, histological type, and grading), for established biomarkers (Ki-67, ER, and PgR), and for investigational biomarkers (HER2 status, CK5/6, CK8, CK18, Bcl-2, p53). Odds ratios (OR) with 95 % confidence intervals are shown.

The predictive value of the investigational biomarkers, in addition to patient and tumor characteristics and the established biomarkers, was studied as follows. Initially, a multifactorial logistic regression model with the patient and tumor characteristics and established biomarkers was fitted. Backward stepwise variable selection was then carried out to obtain the best model in accordance with the Akaike information criterion (the final model without the investigational biomarkers). Next, another logistic regression model was fitted, containing the investigational biomarkers and the predictors from the final model without the investigational biomarkers. The variable selection procedure from above was carried out again, but on the condition that the selected predictors from the final model without the investigational biomarkers were retained. The resulting model (the final model with the investigational biomarkers) was compared with the final model without the investigational biomarkers using the likelihood ratio test. A significant test result means that the investigational biomarkers have predictive value independently of the established predictors. ORs with 95 % confidence intervals for the final model with the investigational biomarkers are shown. The predictive ability of the modeling procedure was estimated using the area under the curve (AUC) of the receiver operating characteristic, which was cross-validated twofold with 20 repetitions.

Overall survival and distant disease-free survival were studied using Cox proportional hazards (PH) models. Simple Cox PH models were used for the patient and tumor characteristics (age at diagnosis, tumor size before chemotherapy, histological type, grading, and nodal status), for the established biomarkers for prognosis (Ki-67, ER, PgR and HER2 status), and for the investigational biomarkers for prognosis (CK5/6, CK8, CK18, Bcl-2, and p53). Hazard ratios (HR) with 95 % confidence intervals are shown. The prognostic value of the investigational biomarkers, in addition to the patient and tumor characteristics and the established biomarkers, was studied in the same way as the logistic regression procedures described above. The proportional hazards assumptions in the final models were checked using tests correlating scaled Schoenfeld residuals with a suitable time transformation [16]. If the proportional hazards assumptions were not fulfilled, the analysis was to be repeated separately for survival times up to 5 years and from 5 years on.

All of the tests were two-sided, and P < 0.05 was regarded as statistically significant. Calculations were carried out using the R system for statistical computing (version 2.13.1; R Development Core Team, Vienna, Austria, 2011).

Results

A total of 100 patients were included in the analysis, 11 of whom had a pCR. The patients’ mean age was 53.6 years (±11.9 years). Patient characteristics and biomarker results are shown in Table 1.

Table 1 Characteristics of the patients and tumors, showing means and standard deviation for continuous variables; frequencies and percentages for categorical variables; and medians and interquartile ranges for biomarkers

Expression of each of the biomarkers was correlated with the others. Estrogen-receptor expression and progesterone-receptor expression clearly correlated with almost every biomarker except TOP2A and CK5/6. Cytokeratin 5/6 did not correlate with any of the other biomarkers, and CK8 and CK18 strongly correlated with each other. All correlations are presented in Table 2.

Table 2 Spearman correlations between the biomarkers

To assess the predictive value relative to pCR, simple logistic regression analyses were initially performed. Some of the investigational biomarkers were inversely associated with pCR, such as Bcl-2 expression (OR 0.72; 95 % CI, 0.52–0.99), CK8 (OR 0.70; 95 % CI, 0.56–0.87), and CK18 (OR 0.75; 95 % CI, 0.62–0.91). Ki-67, ER, PgR, and grading showed the expected associations (Table 3).

Table 3 Simple logistic regression analyses for predicting pathological complete remission, showing odds ratios with 95 % confidence intervals and the corresponding P-values

The preliminary multifactorial logistic regression analysis, which did not take the investigational biomarkers into account, identified grading and Ki-67 as relevant predictors. The other predictors were dropped during the variable selection process—i.e., their predictive value appeared to be irrelevant, or they were already explained by grading and Ki-67.

The ultimate logistic regression analyses, taking into account the relevant predictors from the preliminary analysis and in addition the investigational biomarkers, showed that the prediction can be improved overall by including CK5/6 and CK18 (P = 0.01, likelihood ratio test). ORs for the final model are shown in Table 4. Higher levels of CK5/6 or CK18 were associated with a decreased pCR rate. The cross-validated AUC was 0.67.

Table 4 Multiple logistic regression analysis with backward stepwise variable selection for the prediction of pathological complete response (pCR), showing odds ratios with 95 % confidence intervals and the corresponding P-values

With regard to overall survival (OS), simple Cox regression analyses showed that negative nodal status and PgR expression were significantly associated with an improved OS. In addition, a high level of p53 expression (HR 1.16; 95 % CI, 1.02–1.32) and lower TOP2A expression (HR 0.76; 95 % CI, 0.58–0.99) were also associated with OS (Table 5).

Table 5 Simple Cox proportional hazards regression analyses for overall survival, showing hazard ratios with 95 % confidence intervals and the corresponding P-values

The preliminary multifactorial Cox regression analysis, which did not take the investigational biomarkers into account, identified age, pN, Ki-67, and PgR as relevant prognostic factors. The ultimate Cox regression analyses, taking into account the relevant prognostic factors from the preliminary analysis and in addition the investigational biomarkers, did not yield any additional relevant prognostic factors—i.e., all investigational biomarkers were dropped during the variable selection process (data not shown). The HRs of the final Cox model are shown in Table 6.

Table 6 Simple Cox proportional hazards regression analyses for distant disease-free survival, showing hazard ratios with 95 % confidence intervals and the corresponding P values. The hazard ratio for pathological complete remission (pCR; yes/no) is not shown, as no patients with pCRs developed any distant metastases

With regard to distant disease-free survival (DDFS), only nodal status was found to be a significant prognostic factor in the simple Cox regression analyses (Table 6). In the multivariate model with established prognostic factors, nodal status, Ki-67 expression, and PgR expression were selected as relevant prognostic factors. None of the investigational immunohistochemical markers helped to improve the prognosis significantly (data not shown).

Discussion

In this retrospective study of 100 BC patients who were treated in a standardized fashion with four cycles of epirubicin and cyclophosphamide, grading and Ki-67 expression were found to be the strongest among the established predictors of a pCR. The investigational biomarkers CK5/6 and CK18 appeared to have an additional predictive value for pCR. Other markers such as Bcl-2 and hormone receptor expression did not remain significant predictors in the multivariate model.

Although the study has a small sample size and only a limited number of events relative to the outcome parameter pCR, with 11 patients, most of the correlations with biomarkers and pCR identified are in accordance with the published literature findings. Patients with higher proliferation, assessed either by grading or Ki-67, were associated with a higher pCR rate. This is consistent with several published studies [8]. The proliferation marker topoisomerase IIα, discussed as a target of epirubicin [17], was associated with a higher response rate (OR 1.39; 95 % CI, 0.86–2.25), although not with statistical significance. Hormone receptor expression also correlated with a decreased pCR.

Luminal cytokeratin markers 8 and 18 were inversely correlated with pCR in the univariate analysis, probably reflecting their positive correlation with expression of hormone receptors and negative correlation with proliferation markers. In addition, CK18 and CK5/6 helped improve pCR prediction in the multivariate model. Interestingly, this correlation was inverse. The greater the expression of the basal marker CK5/6 was, the less likely it was for a pCR to occur. This is consistent with the findings of other studies such as the one by Li et al., in which CK5/6-positive patients had a pCR rate of 22.2 % and CK5/6-negative patients had a pCR rate of 47.8 % [18]. Since triple-negative BCs generally have a higher pCR rate, CK5/6 positivity might reflect a more basal-like subtype that is resistant to chemotherapy. In the present small study, CK5/6 did not have any relevance for predicting the prognosis either for OS or DDFS.

The progesterone receptor (PgR) is one example of the way in which the response and prognosis may be interpreted in relation to one biomarker in this type of study. A high level of PgR expression indicates a reduced pCR and a better prognosis. The effects of the PgR on the prognosis may therefore be less likely to be mediated by a response to the chemotherapy. CK18 does not appear to have a very strong effect on the prognosis; it was not helpful in predicting OS or DDFS. However, it did appear to be associated with pCR. If its absence of effect relative to predicting the prognosis is a genuinely negative finding, then it could be hypothesized that there must be additional prognostic effects of CK18 on the prognosis that are independent of the chemotherapy response.

The predictiveness of biomarkers relative to pCR and relative to the survival prognosis was analyzed separately. Ideally, in order to investigate the relationship between biomarkers, the treatment response as assessed by pCR, and the prognosis, a Cox regression analysis including all the biomarkers of interest and pCR, as well as the interactions between the biomarkers and pCR as prognostic factors, would combine the separate analyses in the present study. Using a Cox model of this type, it would be possible to evaluate in a differentiated fashion the way in which biomarkers influence survival independently of pCR. For each of the four combinations of PgR high/low with pCR yes/no, for example, survival rates could be calculated and HRs based on these combinations could be estimated and compared. The greatest problem in carrying out this type of analysis is the sample size; as pCR occurs only relatively rarely, large numbers of patients who reach the survival event are needed.

It should be pointed out that the conclusions drawn here need to be regarded with caution, as one obvious weakness of the present study is the small sample size, with only 100 patients and 11 patients with a pCR. Therfore for example analyses in subsets of patients such as luminal A like, luminal B like HER2 positive or triple negatives were not feasible. However, the small sample size was the result of rigorously excluding patients who received any form of treatment other than the one specified, in order to restrict interpretation of the results to the chemotherapy regimen with anthracycline and cyclophosphamide. This as well restricts the conclusions to patients that did not receive trastuzumab, and it has to be kept in mind, that HER2 positive patients did not receive trastuzumab, neither in the neoadjuvant nor in the adjuvant setting, as the drug was not approved yet at the time, when the patients were treated. Some markers appeared to be significant signals for predicting both pCR and prognosis, so that a causal connection may be hypothesized. In recent years, the use of taxanes as a standard treatment has been included in most neoadjuvant study protocols [7, 1922]. Furthermore targeted therapies such as lapatinib, trastuzumab, bevacizumab, pertuzumab, everolimus and others are tested especially when a resistance against a standard chemothereapy is observed or anticipated [6, 2325]. Therefore, this might be another limitation for interpreting the results with regard to the clinics.

In conclusion, this study provides evidence that the basal marker CK5/6 and the luminal marker CK18 may be potentially helpful in predicting the response to chemotherapy regimens containing anthracycline and cyclophosphamide. In this small retrospective study, it is not possible to assess whether these effects may translate into prognostic factors. Further studies are needed in order to differentiate biomarkers with regard to their ability to predict the treatment response and prognosis.