Introduction

With the aging of society and changes of lifestyle, the incidence of breast cancer remains to increase all over the world, especially in developing countries including China [1]. However, the mortality rate has seen a dramatic decline mainly due to the introduction of adjuvant systemic therapy [2]. As an indispensable component of it, targeted treatment has revealed its striking effect on the improvement of breast cancer prognosis [36]. Despite the unfailing emergence of targeted agents, a significant subgroup of patients derives little benefit, whose breast cancer features absent expression of estrogen, progesterone and ERBB2 receptors, namely “triple negative”.

Triple negative category accounts for 10–15% of breast cancers [7], which is characterized by aggressive clinical history. An extensive body of literature has reported that this subgroup was associated with shorter survival times [815] and even exhibited a trend towards a poorer outcome compared with ERBB2+ phenotype [9, 15]. Furthermore, a higher percentage of visceral metastases, local relapse and cerebral metastases arose in triple negative breast cancers than that in non-triple negative counterparts, while the incidence of bone metastases was just the opposite [12, 16, 17]. On the other hand, the use of trastuzumab and lapatinib has led to considerable reduction in the recurrence or mortality risk of ERBB2+ patients [4], which may render the triple negative women at most risk of early recurrence.

However, few studies have been conducted among non-western populations. Kurebayashi and coworkers reported the survival by each subtype in Japanese breast cancer patients that ERBB2+ and luminal A subgroup remained to live shortest and longest respectively before metastatic disease occurred, whereas the curves for triple negative and luminal B phenotype were mostly intermediate and virtually superimposable for disease-free survival (DFS) as well as for overall survival (OS), with a highly significant difference between subclasses (P < 0.001 for both DFS and OS) [18]. Despite controversies, these conflicting results provide evidence for an increasing recognition that breast cancer is a heterogeneous disease categorized as different subtypes with distinct biological characteristics [9]. Therefore, it is most urgent for further investigation on the prognosis of triple negative breast cancer in Chinese patients due to the unavailability of similar data.

In most studies, prognosis was generally delineated by survival curves rather than hazard function. In the former, the cumulative event-free time distribution is usually illustrated to describe the proportion of patients who remain event-free at a given time after primary therapy, which results in the lack of insight into changes of the event probability over time. However, the hazard function is capable of highlighting such information as the risk of event at any instant among the remaining ‘at risk’ individuals. It depicts not only the timing but also the magnitude of the hazard rate [19].

In recent years, there is an increasing number of investigators who have become intrigued in the hazard function. The applicability of this method has been described in clinical trials, as exemplified by serial reports of the ATAC (Arimidex, Tamoxifen, Alone or in Combination) trial. These data clearly substantiate the long-term benefit in risk reduction for anastrozole over tamoxifen throughout the entire follow-up period [20, 21]. Therefore, the hazard function is of extraordinary value in prognostic evaluation, contributing to strategy development of both treatment and follow-up. However, the prognostic pattern for triple negative breast cancer has not been well studied by use of the hazard function, which has been a major shortcoming in probing the full complexities of this category. It is noteworthy that, due to the dearth of therapeutic targets and follow-up measures at present, this disease entity is a much-feared diagnosis among the majority of patients with breast cancer [7]. In consequence, the profound understanding of recurrence hazard for triple negative tumors will most likely lead to a shift in how we deal with such patients.

On the basis of the above points, we carried out a retrospective analysis on the clinicopathological features of triple negative breast cancer patients undergoing surgery in Cancer Hospital of Fudan University, Shanghai, China. Considering the heterogeneity of breast cancer, we sought to gather relevant information so as to get a clear picture of the prognosis for Chinese patients with triple negative phenotype and its discrepancies from other populations, accordingly, offering the implications for the underlying distinction in tumor biology between different subgroups and various races.

Materials and methods

Patients

This study was conducted retrospectively from a large database of patients undergoing surgery from January 1, 1991 to December 31, 2003 in Cancer Hospital of Fudan University, Shanghai, China. Before surgery all patients were evaluated through complete physical examination, chest radioscopy, bilateral mammography, ECG, ultrasonography of breasts, axillary fossa, cervical parts, abdomen, and pelvis, complete blood count, and routine biochemical tests. After exact staging, each patient was treated with lumpectomy or mastectomy followed by adjuvant therapy according to the standards used at the time of surgery. Follow-up information regarding tumor recurrences and survival status was accomplished through the retrieve of follow-up medical records kept in the outpatient department, personal contact with the patient as well as the assistance of Shanghai Center for Disease Control and Prevention (CDC). Thereinto, personal contact with the patient referred to routine correspondence or telephone visits, which were carried out in Cancer Hospital of Fudan University every 3 months during the first two years, every 6 months during the next two years and once a year thereafter. Recurrence or its absence was diagnosed by query to the patient, by biopsy, or by scan of bone, chest, abdomen, pelvis or skull. Whenever the tumor recurred, additional information, including sites of recurrence and therapy, was requested. All data were entered into a computerized database and verified to minimize errors in data entry.

Similar to other relevant reports [22, 23], 1993 patients were included in our study if they met all of the following criteria: female gender, an initial diagnosis of unilateral primary breast cancer without distant metastases, at least 3 months of follow-up information for disease recurrence and death, and available information on age, tumor size, number of involved axillary lymph nodes (ALN), status of estrogen receptor (ER), progesterone receptor (PR) as well as ERBB2. The mean age at diagnosis was 52 years old (range 16–90). Median follow-up was 2.81 years, ranging from 3 months to 11 years. Among them, 1696 (85.10%) were administered adjuvant chemotherapy of different regimens for 4–6 cycles. Out of 1391 patients with positive ER and/or PR (ER/PR), 696 (50.04%) received adjuvant endocrine therapy, among whom 681 took tamoxifen and 15 aromatase inhibitors. None of the patients received trastuzumab.

Immunohistochemistry and scoring

For each patient in our database, ER, PR, ERBB2, TP53 and Cathepsin-D status were determined by immunohistochemical staining, which was carried out as a standard operating procedure in the pathology department of Cancer Hospital, Fudan University. All primary monoclonal antibodies were from Dako. For TP53 worthy of note, the antibody labels both wild- and mutant-type of the p53 protein. The percentage and the intensity of tumor cells stained were assessed by at least two pathologists, and were denoted respectively as a proportion score and an intensity score. The former was interpreted as follows: a score of 0 required no staining seen, 1 required ≤25% of cells positive, 2 required 25–50% of cells stained, 3 required 50–75% of positive cells and 4 required >75% of staining cells. As to the intensity score, a negative result was defined as a score of 0, weakly positive as 1, moderately positive as 2, and strongly positive as 3. The final score was calculated as the product of the proportion score and the intensity score. Thereby, staining results ranged from score 0 to 12. This semiquantitative scoring system for nuclear ER, PR, TP53 and plasma Cathepsin-D were defined as negative for score 0 and as positive for scores of 1–12 with staining of carcinoma cells, whereas ERBB2 status was defined as negative for scores of 0–8 (namely, 0, 1+ and 2+ in the DAKO scoring system) and as positive for strong membranous staining with scores of 9–12 (namely DAKO score 3+).

Statistical analysis

Recurrence-free survival (RFS) was defined as the time from surgery to the earliest occurrence of relapse (locoregional or distant) or death from any cause. Those without any evidence of relapse were censored at the last date they were known to be alive. Clinicopathologic parameters were compared between different subgroups using one-way analysis of variance (ANOVA) test for continuous variables, chi-square test for unordered categorical variables and nonparametric Kruskal–Wallis rank test for ordinal categorical variables. Post hoc multiple comparison tests were conducted with the Bonferroni method.

Survival distributions were estimated by the Kaplan–Meier product-limit method and were compared using the log-rank test. Cox proportional hazards regression and time dependent Cox non-proportional hazards regression were applied to modeling the relationship between subgroup and RFS, adjusted for known prognostic variables, including tumor size (≤2 cm, >2 cm), ALN status (0, 1–3, ≥4), TP53 (negative and positive) and Cathepsin-D (negative and positive) expression as well as use of adjuvant chemotherapy (yes, no) and endocrine therapy (yes, no). The proportional hazards assumption was tested by global test [24, 25]. For graphical display of RFS, annual hazard rates were estimated using a Kernel method of smoothing. All statistical tests were two sided and < 0.05 was considered significant. All statistical analyses were performed with Stata statistical software package (release 9.0; Stata Corporation, College Station, Texas, USA). Relative risks (RRs) were presented with their 95% confidence intervals (CIs).

Considerable evidence revealed that many covariate effects on recurrence risk were time-varying, that is, they were not proportional throughout the entire follow-up interval [2631]. Accordingly, several estimates such as RRs were further compared within different time frames (i.e. 0–2 years from surgery and from 2 years to the end of the follow-up period).

Results

General characteristics

According to different combinations of hormone receptor (HR) and ERBB2 status, 1993 patients were categorized into the three subgroups as follows: ERBB2+ (32.41%), HR+/ERBB2− (48.97%, HR+ referred to ER+ or PR+) and triple negative (18.62%). ERBB2+ patients tended to be younger at diagnosis than HR+/ERBB2− and triple negative counterparts, although there was no statistical significance (51.4, 52.8 and 52.4 respectively, P > 0.05; Table 1). The rate of larger tumors (greater than 2 cm in diameter) in triple negative patients was moderately higher than that in HR+/ERBB2− women, but much lower than that in ERBB2+ subgroup (70.35%, 66.19% and 76.32% respectively, P = 0.0001; Table 1). In addition, 21.83% of patients whose tumors were triple negative had four or more nodes involved in the axilla as compared to 27.40% of patients with ERBB2+ tumors and 22.75% of those with HR+/ERBB2− tumors (P = 0.0056; Table 1). We made further analysis with post hoc multiple comparison (Bonferroni) tests, which demonstrated that statistical difference was discerned between ERBB2+ and triple negative tumors rather than between HR+/ERBB2− and triple negative breast cancers (data not shown) in terms of ALN status. Such was also the distribution of TP53 expression and Cathepsin-D status (Table 1; part of data not shown). When it came to histologic grade, triple negative and ERBB2+ patients were more likely to have grade III tumors than HR+/ERBB2− subgroup (33.54% and 34.36% vs. 20.35%, P = 0.0001; Table 1).

Table 1 Summary of subgroup characteristics

Initial symptoms varied from one subgroup to another. In comparison with HR+/ERBB2− and triple negative patients, nipple-related symptoms such as nipple discharge, nipple indrawn or changed accounted for a higher proportion in ERBB2+ women, whereas intramammary symptoms including a breast lump and breast pain were just the opposite (P = 0.029; Table 1).

Subgroups also influenced the proportion and timing of recurrence. When compared with triple negative tumors, more recurrence events were observed in ERBB2+ patients but less in HR+/ERBB2− ones (P = 0.002; Table 1). Besides, 71.91% recurrence occurred in ERBB2+ subgroup within two years of follow-up, while the corresponding rates for HR+/ERBB2− and triple negative subgroups were 57.32% and 47.37% respectively (P = 0.019).

Survival analysis

In the univariate analysis, we found significantly different recurrence-free survivals (RFS) among the three subgroups throughout the entire follow-up period (P = 0.0037; Table 2), with the rate of 72.89% for ERBB2+ tumors, 78.40% for HR+/ERBB2− tumors and 75.76% for triple-negative tumors at the 11th year respectively (Fig. 1). A similar effect remained to be validated till 2 years from surgery (P < 0.0001; Table 3), yet failed to be sustained for the interval from 2 years after surgery to the end of follow-up (P = 0.4539; Table 3).

Table 2 Survival analyses of RFS in 1993 breast cancer patients
Fig. 1
figure 1

Kaplan–Meier curves for recurrence-free survival in 1993 breast cancer patients by different subgroups

Table 3 Predictive value of subgroups for RFS within different time frames

In Cox proportional hazards regression analysis, we found that tumor size (P = 0.013; RR = 1.863, 95% CI 1.138–3.048), ALN status (< 0.0001; RR = 2.508, 95% CI 2.007–3.133), use of adjuvant chemotherapy (P = 0.001; RR = 0.422, 95% CI 0.251–0.708) and endocrine therapy (P = 0.002; RR = 0.546, 95% CI 0.370–0.807)were independent prognostic factors for RFS excluding subgroup (P = 0.099; RR = 0.867, 95% CI 0.731–1.060; Table 2) during the period from the beginning to the end of follow-up. To further evaluate whether prognostic effect of subgroup remained unabated over time, the test for lack of proportionality was performed [24, 25]. However, it was statistically significant (global test, P = 0.0084), which clarified a violation of proportional hazards for subgroup and hinted at a demand for the employment of Cox non-proportional hazards regression rather than Cox proportional hazards regression in this analysis. In time dependent Cox model, subgroup (P = 0.026; RR = 0.758, 95% CI 0.593–0.968) as well as tumor size (P = 0.014; RR = 1.860, 95% CI 1.136–3.043), ALN status (P < 0.0001; RR = 2.502, 95% CI 2.002–3.126), use of adjuvant chemotherapy (P = 0.001; RR = 0.425, 95% CI 0.253–0.714) and endocrine therapy (P = 0.003; RR = 0.551, 95% CI 0.373–0.814; Table 2) were included when subgroup was taken as an time-varying covariate. To identify the optimal cutoff point for time-varying pattern of subgroup, we examined all the time points in the unit of follow-up year. As expected, women with triple negative breast cancers had a significantly increased likelihood of recurrence within 2 years after surgery (P = 0.004; RR = 0.716, 95% CI 0.571–0.897) rather than thereafter (P = 0.446; RR = 1.111, 95% CI 0.847–1.457; Table 3).

Recurrence hazard analysis

As to hazard peaks, discrepancies existed in different subgroups. Triple negative patients showed an early major recurrence surge peaking at the 2.5th year after surgery, followed by a gradual decline until year 6.5 and then a modest increase. As to HR+/ERBB2− patients, the hazard plot exhibited a wide initial plateau-like wave covering at least 4 years and a subsequent rise from approximately the 8th year. The hazard rate for ERBB2+ patients displayed a tapering sharp at the 1st year, accompanied with a second ridge at year 5.5 as well as a quite similar pattern to triple negative counterparts thereafter. Furthermore, the first peak of triple negative tumors was higher than that of HR+/ERBB2− patients, but lower than that of ERBB2+ ones (Fig. 2).

Fig. 2
figure 2

Annual recurrence hazard rate for 1993 breast cancer patients by different subgroups. * The hazard rates described demonstrate hazard of recurrence for each 1-year interval

Discussion

Our study is, to the best of our knowledge, the largest retrospective analysis on clinicopathological features of the triple negative tumors in Chinese breast cancer patients. According to Carey et al., the triple negative subtype occurred with a higher incidence in pre-menopausal African American women (39%) when compared to post-menopausal African American women (14%) and non-African American women of any age (16%) in the Carolina Breast Cancer Study, a population-based, case–control study [13]. Additionally, 87 out of 149 (59%) patients were identified as triple negative in a random cohort of Nigerians [32]. By contrast, there was a rather lower frequency of triple negative tumors (8–14%) in Japanese patients [18, 33]. In this analysis, triple negative tumors accounted for 18.62% among all types of Chinese breast cancer, which was much similar to Japanese series. Therefore, the prevalence of triple negative breast cancers seemed to differ by race as had been suggested by other studies [13, 34]. Whether this racial preference is attributed to genetic or environmental factors, or to a combination of both still remains unclear and open to investigation [7].

Both tumor size and ALN status have been accepted as sufficiently established to guide risk allocation [35, 36]. In addition, TP53 together with Cathepsin-D is somewhat indicative of prognosis despite inadequate evidence for routine use in clinical practice [37]. We detected a moderate rate of larger tumors as well as a low percentage of positive ALN status, TP53 and Cathepsin-D staining in triple negative category, contributing to its better survival compared with ERBB2+ phenotype. However, these findings were discordant with western reports, which demonstrated relatively large tumors, slightly more nodes involved [10] and high p53 protein expression [38] in triple negative cancers, accordingly, well elucidating the corresponding aggressive biological behavior and poor clinical outcomes [8, 10, 11, 14]. This absolute disagreement mirrored the fact that breast cancer is a disease of heterogeneity, and therefore, race, as an important, even pivotal factor, should be taken into account when talking of clinicopathological features for triple negative tumors.

The triple negative breast cancers in this study were more prone to presenting with intramammary symptoms than nipple-related ones at diagnosis. It has been reported that the type of initial symptom was related to delay in seeking for medical care. Breast lump or breast pain were more likely to be recognized by women as a symptom provoking their eventual attendance to a doctor [3941]. Besides, several researches substantiated that a longer time period before visiting the doctor was linked to a shorter survival from breast cancer [42, 43]. By this token, patients’ interpretation of initial symptom might also exert an indirect impact on prognosis, which was quite congruent with our observations.

At present, much enthusiasm has been aroused for triple negative breast cancers due not only to tricky treatments but also to poor prognosis. In terms of survival, the previous studies were almost confined to western populations [8, 1012, 14, 15]. Besides racial disparities, subgroup taxonomy should not be ignored either when comparing survival data between different series. In the majority of studies, subgroup as an explanatory variable was dichotomized as triple negative or the other breast cancers [1012, 14, 44], which featured clarity and uniformity but obscured diversity and complexity. By contrast, it was capable of uncovering much fuller prognostic profiles in a multichotomized way. Herein existed slight differences of note. Some researchers divided subgroup mainly into HR+, ERBB2+/HR− and triple negative phenotypes [13, 15, 18]. This classification was more or less concordant with intrinsic molecular subtypes defined by microarray; nevertheless, it was not a perfect way. ERBB2 amplification, as a powerful prognostic marker [45], has been identified to confer a strong negative effect on survival in untreated ER+ patients [46]. And further, no significant difference in disease-free survival (DFS) was observed between patients with and without tamoxifen treatment for those co-expressing HR and ERBB2 [46, 47], which validated the in vitro findings that ERBB2 overexpression compromised the antitumor effect of tamoxifen [4851]. Taken together, all of these data inferred that ERBB2 status added prognostic information in HR+ breast cancer patients regardless of ET, corroborating that HR+/ERBB2+ tumors constituted a clinical entity with poor outcome separated from HR+/ERBB2− counterparts [46, 52, 53]. Based on the superiority of the ERBB2 status to HR as a prognostic factor in breast cancer [54], subgroup was classified into ERBB2+, HR+/ERBB2− and triple negative in this study just as other few publications [8]. Our results indicated that RFS for triple negative category was obviously superior to that for ERBB2+ patients but relatively inferior to that for HR+/ERBB2− subgroup, which was totally coherent with Nielson’s report [8] and partly consistent with others studies [13, 15, 18, 34].

There is growing attention toward violations of proportional hazards. Failure to recognize the patterns of time-variation might miss the effects of clinically important or biologically interesting factors [24]. Valid conclusions for early prognostic effects can be yielded with short follow-up, whereas those for late effects fail to be verified without longer follow-up, as represented by HR status [24]. Our analysis indicated that the recurrence risk for triple negative subgroup was not proportional across the time, which was consistent with previous reports [10]. Alertly detecting and explicitly modeling time-dependence, which may be a more common phenomenon than is appreciated in the medical literature, could conduce to novel biological insights and clinical use of various factors [24].

Up to now, there is no universally agreed-upon pattern of recurrence risk for triple negative patients owing to little information [10]. Moreover, such pattern had not yet been reported in Chinese populations. Dent and colleagues [10] found a single peak reaching the maximum at about 1 year after diagnosis for patients with triple negative breast cancers, as distinguished from a steady risk of recurrence spanning 17 years after diagnosis for those with other cancers [10]. It was slightly discrepant from their report that we demonstrated double-peaked time distribution of recurrence risk irrespective of subgroup and intermediate recurrence pattern for triple negative tumors between that for ERBB2+ and HR+/ERBB2− counterparts, since the first peak for triple negative subgroup lied between those for ERBB2+ and HR+/ERBB2− ones with regard to both timing and height. On this premise, prognosis for Chinese triple negative patients seemed somewhat better than that for Western counterparts. Despite contradictory results, there is no denying the fact that the risk of recurrence or death persists for a long duration, even for years or decades after surgery, which has been supported by manifold data [22, 27, 5557].

This study has some potential and inevitable limitations on account of its retrospective nature. Recurrences are probably somewhat underreported or misinformed for a substantial portion of the patients in this database; nonetheless, underreporting or misinformation of recurrences would have not varied by clinicopathological parameters [51]. Besides, we did not evaluate the effect of treatment on survival in the present study, but all the RRs were adjusted for treatment administered [10].

In conclusion, biological behavior and clinical outcome for triple negative tumors in Chinese breast cancer patients may be more favorable and somewhat different from those in Western populations. However, this does not mean the unnecessity of aggressive treatment for these patients as the sustained existence of recurrence risk should also be taken seriously. Although there is to date no specific systemic regimen for recommendation due to little data on which to base treatment selection [7], this treatment bottle-neck have prompted a better knowledge of triple negative category to shed more light on efficacious ways of post-operative management, including both treatment and follow-up [58].