Introduction

Breast cancer is the leading cause of cancer death among women in most western countries. The annual death rate is estimated to be 40,480 women in the U.S. [1] and 5,914 in Spain [2]. MBC is incurable in most of the cases, and has a median survival of 18–24 months [3]. Therefore, treatment is usually palliative during this stage of the disease and is intended to improve the patient’s quality of life and prolong survival.

Despite advances in early detection programmes and adjuvant treatments of early-stage disease, breast cancer remains as a significant health problem and patients with recurrent MBC usually die of their disease [4]. However, life expectancy of these patients varies greatly (from a few months to several years).

The prospects for MBC patients are generally poor, with only 10% expected to survive 10 years after diagnosis [5]. The management of these patients continues to evolve, and many new chemotherapeutic agents have become available which have improved the efficacy and tolerability of the treatments. The use of new hormonal therapies, new cytotoxic agents, the introduction of molecular targeted drugs and the different sequences of therapies administered throughout the disease have modified the natural history of MBC [6].

The value of prognostic factors in patients with early-stage breast carcinoma has been confirmed in a large number of studies. Thus, axillary nodal status, tumour size, oestrogen receptor status and histologic grade are well established [7]. In contrast, there have been only a few reports on prognostic factors in patients with MBC [8]. The analysis of short series indicates that the survival of patients with recurrent disease is related to the site of metastases (visceral recurrences have been associated with shorter survival), the hormonal receptor status, the adjuvant therapy received, the stage of the disease at diagnosis and the disease free interval [9].

It has been demonstrated that a small fraction of MBC patients can achieve long-term survival after systemic therapies. The identification of patients with a high or low life expectancy could have relevant therapeutic implications.

In the current study, we have analyzed women with recurrent MBC from a Spanish hospital register (El Alamo, GEICAM) in order to identify the most helpful prognostic factors to predict survival and to construct a practical prognostic index.

Materials and methods

Patients (for construction of the prognostic index)

The “Alamo Project” is a hospital registry that collects data on consecutive women diagnosed as having breast cancer in institutions belonging to GEICAM (Spanish Breast Cancer Research Group). This project is conducted in consecutive intervals of 4 years with the objective of analyzing the demography of the patients, the characteristics of their tumours, the treatment received and the clinical evolution of the disease. Data gathered in the “Alamo I & II” projects were obtained from 50 Spanish hospitals. “Alamo I” consisted of 4,532 patients diagnosed de novo with breast cancer between 1990 and 1993. These data were collected by end of the year 2000. “Alamo II” consisted of 10,322 patients diagnosed de novo between 1994 and 1997. These data were collected by end of the year 2003. Overall, these series includes more than 10% of all breast cancer patients diagnosed in Spain during the corresponding periods of time.

There were three versions of the data collection form: data recording form on paper, electronic recording for remote access and loading into the database on SQL Server and an electronic survey form for the recording in the database using Access®. Once the inclusion of the data into the database was concluded, an audit was performed to confirm the integrity, accuracy and coherence of the data. This involved selecting 100 forms at random (ensuring at least three from each participating hospital) and the data was manually checked with that in the database. The percentage discordance was <5% of the total data compared [10].

Protection of personal data was applied in accordance with the requirements of Spanish legislation.

The inclusion criteria covered women patients diagnosed of operable invasive breast cancer between 1990 and 1997 in GEICAM hospitals who have had metastatic recurrence. Patients with stage IV at diagnosis or with isolated locorregional recurrence were not included in this analysis. Patients with a previous neoplasm except for in situ breast cancer, basal cell or squamous cell carcinoma of the skin or in situ carcinoma of the cervix appropriately treated were excluded from the analysis. Those who developed secondary neoplasms or second primary breast cancer during the follow-up period were also excluded.

Statistical analysis

Data from patients with recurrent breast cancer after primary treatment (surgery, radiation and systemic adjuvant treatment) were used to construct the prognostic index. The main endpoint was survival from the time of metastatic recurrence.

This study was designed as descriptive, retrospective and transversal. Qualitative variables were expressed as the absolute and relative frequencies and quantitative data were expressed as the means of central tendency and of dispersion (mean, median, trends and range). Survival rates were calculated using the Kaplan and Meier method. All deaths were considered as events, regardless of their cause. Each patient was considered to be alive at the time of last evaluation unless death had been documented.

Both a univariate and a multivariate approaches were used for the analysis of potential prognostic factors. The Cox proportional hazards model was applied for the selection of the most significant variables contributing to survival. Differences with P-value less than 0.05 were considered significant. All P-values were two-sided. Potential prognostic factors were analyzed in a multivariate analysis using a forward step-wise conditional Cox proportional hazards model.

For the construction of the prognostic index, the values of the β coefficients of the Cox model were calculated. The results obtained from all of the significant factors in the Cox model were added together for each patient.

All analyses were performed using SPSS version 12.0 statistical software (SPSS Inc, Chicago, IL). The prognostic index was applied to the patients and overall survival rates were calculated in order to create three different prognostic groups.

We have followed the reporting recommendations for tumour marker prognostic studies (REMARK) published in 2006 [11]. These guidelines recommend elements and formats for these kind of studies, and the goal of these guidelines is to encourage transparent and complete reporting so that the relevant information will be available to others to help them to judge the usefulness of the data.

Results

Data from 2322 patients with recurrent breast cancer after primary treatment (surgery, radiation and systemic adjuvant treatment) who had been accrued between January 1990 and December 1997 on the “Alamo” projects were used to construct the prognostic index. The median follow-up time of the whole population was 7 years (range, 2–132 months). The median overall survival was 1.8 years form the time of recurrence. Table 1 shows the variables that showed a statistically significant influence on survival in the univariate analysis.

Table 1 Univariate survival analysis

These variables were selected for a multivariate regression analysis using the Cox proportional hazards model. Nine-hundred and sixty-two patients with complete information in all significant variables were included in the multivariate analysis. The distributions of main variables were similar in the cohort of patients included in the multivariate analysis and in the remaining patients. Stepwise regression procedures were applied to calculate the values of the β coefficients of the Cox model, which are listed in Table. 2.

Table 2 Multivariate survival analysis using Cox′s model

Age and pathological state/stage of disease at diagnosis, histological grade, hormonal receptor status, nodal ratio, administration of neo and/or adjuvant chemotherapy, dominant site of metastasis, number of hormonal lines in metastatic disease and response to the first line therapy remained significantly related to survival at the end of the multivariate analysis (Table 3).

Table 3 Prognostic index score

A quantitative score derived from the regression coefficients of each independent prognostic variable was created as described earlier. The prognostic index score for each individual patient was calculated by totalling up the scores of each independent variable. The maximum score obtainable was 26.1. Nine-hundred and sixty-two patients who had complete data of all variables were used for the computation of prognostic index score. We were able to stratify them into three prognostic groups based on the prognostic index score: 322 patients in the good risk group (score ≤13.5), 308 patients in the intermediate risk group (score 13.51–15.60) and 332 patients in the poor risk group (score ≥15.61). The median survivals for these groups were 3.69, 2.27 and 1.02 years, respectively (P < 0.0001). The area under ROC curve (AUC) for the nomogram was 0.694 (0.661–0.728). The survival curves stratified by prognostic score index groups are depicted in Fig. 1.

Fig. 1
figure 1

Comparative survival curves according to the prognostic index. The blue curve represents the good risk group (score ≤13.5), the green curve represents the intermediate risk group (score 13.51–15.60) and the red curve represents the poor risk group (score ≥15.61). The median survivals for these groups were 3.69, 2.27 and 1.02 years, respectively, (P < 0.0001)

Discussion

Despite the advances accomplished in the adjuvant treatment of breast cancer, nearly a 20% node-negative and 60% node-positive breast cancer patients will recur from their disease. Recurrence location has clearly shown to be a key prognostic factor for MBC patients in terms of overall survival. Thus, visceral involvement correlates with a shorter survival than soft tissue limited disease. Other factors reported in the literature with prognostic impact have been tumour size and stage at diagnosis, disease-free survival, hormone receptor status or the adjuvant regimen received.

A Spanish report on 439 MBC patients with a median follow-up of 30 months from the diagnosis of recurrence [9], showed that three variables were independently associated with better survival: bone or soft-tissue only metastasis, achieving an objective response (partial response + complete response) to the first-line treatment strategy and DFS over 2 years. Five-year OS was 35% in responders and virtually zero in non-responders. The value of these three prognostic variables has been confirmed in our series. In our series, patients with skin and soft-tissue recurrences had not reached the median survival time yet, while those patients with complete response to the first-line treatment for metastatic disease had a median survival time of 4.95 years.

An American study coincided to show DFS and the recurrence location, together with the hormone receptor status, as significant prognostic factors. The median OS was 15 months in patients with HR-negative tumours, visceral metastasis and DFS less than 2 years versus 90 months in patients with HR-positive tumours, soft-tissue metastasis and DFS over 2 years [12]. In our experience, the HR-status demonstrated independent prognostic value, hence 57.44% of the HR-positive patients remained alive at 2 years versus 36.12% in the HR-negative group.

The value of lymph node involvement at diagnosis as a prognostic factor of survival in patients with recurrent disease is a matter of controversy. Chang et al. [13] analyzed the primary tumour characteristics of node-positive patients who have recurred of the disease. According to their data, tumour size and number of lymph nodes affected were crucial to the prediction of the risk of recurrence, but did not correlate with survival once metastasis had already appeared. In the multivariate model, older age (>50 years), DFS less than a year, PR-negative status, high S-phase fraction (SPF) and visceral disease were significant predictors of lower survival. On the contrary, in the experience reported by Insa et al. [9], either tumour size or number of lymph nodes involved at diagnosis had independent prognostic significance. In the Alamo database, lymph node involvement, considered as nodal ratio (absolute number of involved nodes–number of nodes resected [14]), turned out to be a relevant prognostic factor in breast cancer patients with distant metastasis. As a result, for patients with <0.25 ratio median survival was 2.31 years, versus 1.78 years for those whose ratio was between 0.25 and 0.75, and 1.25 for ratios >0.75.

As we noted in the introduction, all the gathered evidences from literature do not clarify the potential prognostic value of age. In this Spanish series, the differences among the 59.5% of young patients (<35 years-old) alive at 2 years, the 51.83% between 36 and 50, the 46.89% between 51 and 75 and the 21.82% over 75 years-old were statistically significant (P < 0.001). This observation is coincident with Chang’s previous finding of worsening survival at older ages.

Regarding the prognostic impact of the metastatic location, some authors classify the metastatic involvement in two prognostic subgroups of low and high risk in order to highlight this particular aspect. The most favourable profile includes skin, subcutaneous, bone and lymph node metastasis together with nodular lung metastasis, and is characterized by a slower growth rate and a longer survival. In contrast, brain, liver and multiple lung metastasis, carcinomatous lymphangitis or massive pleural effusions are closely linked to a worse outcome. Almost all the multivariate analyses agree to emphasize that liver disease is a predictor of poor response to chemotherapy and a marker of adverse outcome, even if this is the only location of the disease. Our study is fully consistent with this reality so that median survival of MBC patients with visceral involvement (mainly liver metastases) is 1.15 years, versus the 2.21 years if patients have nodular lung metastasis, 2.55 years in the case of bone metastasis and not even reached median survival if skin and/or soft-tissue metastasis is detected.

The number of organs involved is also correlated with OS. The Alamo project analysis showed 54.12% of the patients were alive at 2 years if only one organ was affected by metastasis, which decreased to 39.01% in the case of multiple organ involvement. In a previous study of patients with MBC including the bone, median OS was nearly 2.6 years when this was the only location of the disease and dropped to 1 year if a second metastatic location existed [15]. The data from the previously mentioned study of Insa et al. [9], demonstrated a 30 months OS for the single location group, 22 months for two metastatic locations and 11 months if there were three or more locations. Conversely, the EORTC joint analysis showed 22–26 months of OS for patients with exclusive liver involvement and 14–16 months if the disease was scattered in several locations [16].

In view of our results, the treatment regimen administered to MBC patients has a potential prognostic impact. First of all, a worse survival was found for those patients who received adjuvant or neoadjuvant treatment, specially if it was an anthracycline-based regimen. This fact is consistent with the published evidence, such as the results of the Spanish study carried out over 297 MBC patients, which showed a more adverse outcome for patients with multiple metastatic involvement, a short DFS interval and previous anthracycline-based chemotherapy regimes [17]. A plausible explanation is that these treatments may select resistant clones or may introduce de novo mutations leading to a more aggressive metastatic disease behaviour.

Regarding the role of local therapies such as surgery or radiotherapy, there are no available randomized clinical trial data demonstrating the benefit of a surgical resection of metastasis over systemic therapies, so that there is no conclusive evidence at this point. However, some observational studies have provided some valuable information showing long-survival outcomes after the resection of solitary metastasis, mainly in good performance status and long disease-free interval patients [18]. It is likely that our results reflect this tendency and that probably those patients surgically managed in our series shared a more favourable prognostic profile.

Finally, two histopathologic aspects showed prognostic relevance in our experience. Histopathologic grade, extracapsular extension and tumoural stage at first diagnosis are established prognostic factors in the therapeutic decision algorithm of early breast cancer. In contrast, their prognostic significance in the metastatic setting is not so well defined in the literature. The GEICAM series showed 3.08 years median survival for stage I patients after recurrence versus 1.39 years for stage III (P < 0.001), 2.86 years median survival for grade I tumours versus 1. 48 for grade III ones (P < 0.001), and 2.21 years median survival for patients with extracapsular extension versus 1.33 if they do not present this characteristic.

The development of a prognostic model starts with the precise definition of the result variable, usually death, and the analysis of the potential prognostic factors related to it. It has always been a question of remarkable interest for human beings to predict future events in advance, in fact, prophets enjoyed a bunch of privileges when the world was controlled by uncertainty and foretelling. As humanity evolved, humans looked for the way to improve their prediction methods overcoming their insecurity. Medicine did not skip this phenomenon and has been always focused in the prediction of patients outcome.

Once a model is built, it requires a validation phase that involves different steps to confirm that the model is reliable and robust. From this perspective, the qualification of the discrimination ability of the models depends on the area under the ROC curve. Our study reached the 0.69 value for distant disease. Note that even when these models have a high discrimination ability, the value will never be over 1. For this reason, the use of a predictive score in a particular patient has a purely orientative value. A particular risk can be estimated but we could never predict the final result in this case. In other words, models with very good predictive ability can estimate a mortality, for example, of five out of 100 patients with a small error, but will not be able to determine which five patients will die.

Another big weakness that has been suggested is that scores are the representation of a snapshot from the clinical practice in an specific moment. In the Alamo series, the practice took place between 1990 and 1997 and is quite different in some important aspects form the current clinical scope. New drugs have recently joined the therapeutic store such as anti-angiogenic drugs (bevacizumab), monoclonal antibodies targeting HER2 (trastuzumab) and small tyrosine-kinase inhibitors (lapatinib).

Conclusion

Risk scores are extraordinarily valuable tools, highly recommendable in the clinical practice. However, their real use needs to be redefined and most importantly, we need to have their particular conditions of use in mind together with their limits and misinterpretation risks.

The main strengths of the present study are the big sample size of patients included in this national database (which constitutes an exceptional reflect of MBC patients diagnosed and treated in Spain), because we have considered factors at the time of recurrence and at the time of first diagnosis. The best endpoint to consider in long-term follow-up studies is survival. In our experience, we focused on OS and tried to set up the estimated probability at 12, 36 and 60 months, thanks to the long-term follow-up available for these patients that let us analyze a real survival measurement, rather than a projected survival.

The study of prognostic factors in recurrent breast cancer is a clinically relevant field of development, from a biological but also from a clinical point of view. Large population databases with a long-term follow-up are an excellent source of clinical information that reflects the real clinical practice outside clinical trials. Our study will be helpful for better stratification and more accurate predictions of survival in the metastatic breast cancer (MBC). Focusing on overall survival we tried to estimate the probability of death at 12, 36 and 60 months, constructing a prognostic index model to be further validated in another set of patients and possibly completed with new molecular markers in future Alamo projects.