Background

The 5- or 10-year disease-free survival rate is commonly used as a criterion of success in non-metastatic cancer therapy. If this rate is estimated by Kaplan–Meier method [1], it is still required that a considerable proportion of all case have a sufficient follow-up. Pocock et al. [2] proposed to curtail Kaplan–Meier plot when only around 10–20 % are still in follow-up. Reducing the follow-up period time required to estimate long-term survival rates has become an important issue in oncology. The use of parametrical statistical models can be a solution but they have to be validated in practice. From the 50 s, several authors have worked on this topic. Considering only short-term survival data, the standard lognormal model was used to predict survival rates at 10, 15 years or more [3, 4]. The values predicted by the model were then compared to results observed at these time points. This strategy was also used to predict long-term survival of patients treated for cancer of the larynx [5] or inflammatory breast cancer [6].

The methodology used in these publications considers a single or a composite endpoint. However, in breast cancer, patients are at risk of multiple types of events. A patient can experience loco-regional recurrence, distant metastasis, contralateral cancer, second primary cancer or death. According to the nature of the event types, different treatment strategies may influence the time to occurrence of only certain event types. In practice, medical searchers may find interest in nature and time of a particular event, distant metastasis for example, and the other events are considered in competition with the event of interest. Kaplan–Meier estimator treated competing events as independent censorings and overestimates the proportion of distant metastasis. The cumulative incidence function (CIF) [7] takes into account the informative nature of the censoring and corresponds to the probability of occurrence of a particular event without the assumption of independence between event types. Recently, Jeong et al. proposed a parametric model which permits to directly model cumulative incidence. As this parametric method requires a minimum follow-up, it is therefore appropriate to propose an adaptation of this method by differentiating the types of event using the competing risks methodology. The direct approach proposed by Jeong and Fine was used [8]. The main objective of this paper was to evaluate the ability of this parametric model to predict the cumulative incidence rates at 10, 15, 20, 25 years using short-term data. Breast cancer data and statistical methodology are described in second section and results presented in third section will be discussed in fourth section.

Patients and methods

Study population

Data were obtained from the Institut Curie Breast cancer database. This database contains records from 32515 non-metastatic breast cancer patients treated between 1980 and 2005 at the Institut Curie. This study focused on patients without lymph node involvement (pN0) who underwent breast conservative surgery for a tumor size <5 cm (T1 or T2) and clinical size <3 cm. Patients not treated by radiotherapy and who had neoadjuvant or adjuvant chemotherapy were also excluded. This population includes 4761 patients treated between 1981 and 2000. Follow-up was until September 4, 2013.

Statistical analysis

Competing risks

A competing risk is defined as an event whose occurrence precludes or alters the probability of occurrence of a main event under examination [9]. In this study, patients were considered at risk of three event types: loco-regional recurrence, distant metastasis and contralateral cancer or second cancer (another cancer in a different location), or death from any cause. The event of interest was the first event occurred.

It seems appropriate to distinguish loco-regional recurrence and distant metastasis, because these two event types are quite different in terms of predictive factors and survival prognosis of patients [10]. The third event is a composite event and includes contralateral cancer, second cancer and death from any cause. If a patient did not experience any event, it was censored at the date of last known contact.

In the setting of competing risks, the probability of failure overtime is described by the so-called cumulative incidence defined by

$$F_{k} (t) = P(T \le t,\Delta = k),$$

where T is the time between the diagnosis and the occurrence of the first event and Δ = 1, …, K is the indicator of the event type.

Prediction models

Non-parametric

The Kalbfleisch and Prentice method [7] is widely used in the literature to summarize competing risks data. Let t 1 < t 2 < ··· < t J represent J ordered times to first events. The non-parametric estimation of the cumulative incidence associated with an event type k (k = 1,…, 3) is defined by

$$\hat{F}_{k}^{NP} (t) = \sum\limits_{{t \le t_{j} }} {\hat{S}(t_{j - 1} ) \times \hat{h}{}_{k}(t_{j} )},$$

where \(\hat{S}(t_{j - 1} )\) corresponds to event-free survival estimated by Kaplan–Meier method [1] at time t j−1 and \(\hat{h}_{k}(t_{j} ) = {d_{kj}} /{n_{j}}\)where d kj is the number of type k events observed at time t j and n j is the number of patients at risk at time t j .

Non-parametric confidence intervals are obtained using the log(-log) transformation proposed by Choudhury [11].

Parametric

Jeong and Fine [8] proposed a direct approach to model parametrically the cumulative incidence. Each cumulative incidence is directly modelized using a Gompertz distribution. The cumulative incidence of event k is defined by

$$F_{k}^{P} (t,\alpha_{k} ,\beta_{k} ) = 1 - \exp \left[ {\frac{{\beta_{k} }}{{\alpha_{k} }}(1 - \exp (\alpha_{k} t))} \right],$$

where α k is the shape parameter and β k the scale parameter.

The Gompertz distribution is improper when α k  < 0 and β k  > 0, leading to a plateau in the tail of the distribution which is particularly well suited to capture cure rates as it can be seen in breast cancer. The probability of experiencing the event of type k is \(1 - \exp({\beta_{k}}/{\alpha_{k}})\). The parameters are estimated by the maximum-likelihood method. Here, patients were at risk of three event types, so a total of six parameters had to be estimated.

Using the model for prediction

The method used in order to evaluate the predictive ability of the model is illustrated by Fig. 1.

Fig. 1
figure 1

Schematic diagram of evaluation predictive ability of the model

The study population was divided into four periods of 5 years where patients were treated, treatment intervals were defined as follows: 1981–1985, 1986–1990, 1991–1995, and 1996–2000. For each period, follow-up period was limited at t p  = 1, 2, 3, 4, and 5 years, respectively. For example, a patient treated between 1981 and 1985 was followed at most until 1986, 1987, 1988, 1989, and 1990, respectively. After this period, it was censored if no events had occurred before. The parameters of the Jeong and Fine model were estimated using these short-term data (from T = 0 to T = 5 + t p ).

Non-parametric cumulative incidence functions estimated by Kalbfleisch and Prentice method using complete follow-up data (from T = 0 to T = T max ) were considered as the gold standard estimation. The long-term cumulative incidence functions predicted by Jeong and Fine model at 10, 15, 20, and 25 years were compared to this gold standard.

This procedure was iterated for each period of treatment and each follow-up period time.

The agreement between cumulative incidence rates predicted by the Jeong and Fine model and non-parametric estimation using Kalbfleisch and Prentice method at T = 10, 15, 20, and 25 years was defined using validation criteria based on non-parametric confidence intervals proposed by Choudhury et al. [11] and defined by

$$\Delta :\hat{F}_{k}^{NP} (T)^{{\exp \left[ {\frac{{ - c_{{{\alpha \mathord{\left/ {\vphantom {\alpha {}}} \right. \kern-0pt} {}}2}} \hat{\sigma }_{k} (T)}}{{\hat{F}_{k}^{NP} (T)\log (\hat{F}_{k}^{NP} (T))}}} \right]}} \le \hat{F}_{k}^{P} (T) \le \hat{F}_{k}^{NP} (T)^{{\exp \left[ {\frac{{c_{{_{{{\alpha \mathord{\left/ {\vphantom {\alpha {}}} \right. \kern-0pt} {}}2}} }} \hat{\sigma }_{k} (T)}}{{\hat{F}_{k}^{NP} (T)\log (\hat{F}_{k}^{NP} (T))}}} \right]}},$$

where c α/2 is the upper α/2 quantile of the standard normal distribution and \(\hat{\sigma }_{k} (T)\) is the square root of \(\text{var} (\hat{F}_{k}^{NP} (T))\). Two validation criteria were computed, Δ1 corresponding to 95 % non-parametric confidence interval \(\left( {\alpha = 1 - 0.95} \right)\) and Δ2 corresponding to 99.9 % non-parametric confidence interval \(\left( {\alpha = 1 - 0.999} \right)\) .

$$\Delta _{1} :CI_{ - }^{95\% } \left( {\hat{F}_{k}^{NP} (T)} \right) \le \hat{F}_{k}^{P} (T) \le CI_{ + }^{95\% } \left( {\hat{F}_{k}^{NP} (T)} \right)$$
$$\Delta _{2} :CI_{ - }^{99.9\% } \left( {\hat{F}_{k}^{NP} (T)} \right) \le \hat{F}_{k}^{P} (T) \le CI_{ + }^{99.9\% } \left( {\hat{F}_{k}^{NP} (T)} \right)$$

The analysis was performed under R 2.15.1 software, four packages are also used: survival, cmprsk, optimx, and numDeriv.

Results

Patient's characteristics

A total of 4761 patients were extracted from the population database of breast cancer of Institut Curie. Patient characteristics are displayed in Table 1 for each period.

Table 1 Patient's characteristics

The median follow-up was 14.48 years in the overall population. The median age was 55 years (24–90). Majority of tumor size was lower than 2 cm (71.5 % T1 vs 28.5 % T2). About 1685 patients had tumor with grade I (37.3 %), 1776 grade II (39.3 %), and 1061 grade III (23.5 %). A majority of patients had hormone receptor-positive status (87.9 %). There were 673, 1178, 1382, and 1528 patients in each cohort, respectively. The median follow-up was 23.43 years for the patients treated between 1981 and 1985 and decreased until 11.18 years for the patients treated between 1996 and 2000. The median age was 49.5 years for the patients treated between 1981 and 1985 and increased until 58 years for the patients treated between 1996 and 2000. Proportion of T1 tumor and grade I increased with time (60.6 % until 77.1 % and 22.7 % until 45.3 %). The same trend for proportion of hormone receptor-positive status (81.6 % until 91.8 %) was observed.

The number of each event types by treatment interval and follow-up period are displayed in Table 2.

Table 2 Number of each event types by treatment interval and follow-up period

In the first group, where patients were treated between 1981 and 1985, 133 loco-regional recurrences, 99 distant metastasis, and 171 contralateral cancer or second cancer or death were observed during the complete follow-up (from 1981 to today). If the follow-up was limited at 1 year (from 1981 to 1986), only 31 loco-regional recurrences, 38 distant metastasis, and 27 contralateral cancer or second cancer or death were observed. If t p  = 5 years, 67 loco-regional recurrences, 60 distant metastasis, and 50 contralateral cancer or second cancer or death were observed from 1981 to 1990.

Prediction results

Table 3 displays the cumulative incidence rates predicted by the model and non-parametric estimates with 95 and 99.9 % confidence intervals at 10, 15, 20, and 25 years for each event types using data from the first treatment interval period (1981–1985). Graphical representation of these results is shown in Figs. 2a–c. Results obtained from the three other groups of treatment periods (1986–1990, 1991–1995, and 1996–2000) are available in the supplementary materials. The values of CIF in the body of Table 3 are shaded when they satisfy the validation criteria Δ1 or Δ2.

Table 3 Cumulative incidence rates of loco-regional recurrence, distant metastasis, and contralateral/2nd cancer/death for the 1981–1985 treatment period
Fig. 2
figure 2

Cumulative incidence rates of loco-regional recurrence (a), distant metastasis (b), and contralateral/2nd cancer/death (c) for the 1981–1985 treatment period

Loco-regional recurrence

Non-parametric estimates of cumulative incidence functions of loco-regional recurrence were equal to 0.126, 0.149, 0.182, and 0.226 at 10, 15, 20, and 25 years, respectively. Cumulative incidence rates at 10 years predicted by Jeong and Fine model satisfied Δ1 (95 % CI = [0.102; 0.153]) whatever the follow-up period time. They were equal to 0.124, 0.127, 0.146, 0.134, and 0.135 with t p  = 1, 2, 3, 4, and 5 years, respectively. Predictions at 15 years satisfied Δ1 with limited follow-up at 1 or 2 years (0.168 and 0.176) and were slightly overestimated with t p  = 3, 4, or 5 years (0.203, 0.184, and 0.189). At 20 and 25 years, predictions were also slightly overestimated except that with limited follow-up at 1 year (0.205 and 0.235) and t p  = 2 or 4 years for 25 years prediction (0.250 and 0.259). A maximum difference of 5.5 % was observed between non-parametric estimation and prediction rates at 20 or 25 years with follow-up limited at 4 or 5 years (0.237 − 0.182 = 0.055).

All prediction rates satisfied Δ2 except prediction rates at 15, 20, and 25 years with t p  = 3 years and prediction rate at 20 years with t p  = 5 years.

For the other treatment periods, prediction rates were generally slightly underestimated with follow-up limited at 1, 2, or 3 years. But all prediction rates satisfied Δ1 or Δ2 with follow-up limited at 3, 4, or 5 years.

Distant metastasis

Non-parametric estimates of cumulative incidence functions of distant metastasis were equal to 0.110, 0.138, 0.147, and 0.154 at 10, 15, 20, and 25 years, respectively. Cumulative incidence rates at 10 years predicted by Jeong and Fine model satisfied Δ1 (95 % CI = [0.087; 0.135]) whatever the follow-up period time except with follow-up limited at 1 year (0.146). Predictions at 15 years satisfied Δ1 with limited follow-up at 3, 4, or 5 years (0.144, 0.156, and 0.130) and were slightly overestimated with t p  = 1 or 2 years (0.193 and 0.169). At 20 and 25 years, predictions were also slightly overestimated except that with t p  = 3 or 5 years for 20 years (0.159 and 0.141) and 25 years predictions (0.168 and 0.147). A maximum difference of 4.2 % was observed between non-parametric estimation and prediction rates at 20 or 25 years with follow-up limited at 4 or 5 years (0.196 − 0.154 = 0.042).

All prediction rates satisfied Δ2 except prediction rates at 15, 20, 25 years with t p  = 1 years and prediction rate at 25 years with t p  = 2 years.

For the other treatment periods, prediction rates were also generally overestimated with follow-up limited at 1 or 2 years. But all prediction rates satisfied Δ1 or Δ2 with follow-up limited at 3, 4, or 5 years. Except for the 1986–1990 period where prediction rates were overestimated even if follow-up was limited at 5 years.

Contralateral or second cancer or death

Non-parametric estimates of cumulative incidence functions of contralateral or second cancer or death were equal to 0.113, 0.176, 0.240, and 0.292 at 10, 15, 20, and 25 years, respectively. Cumulative incidence rates at 10 years predicted by Jeong and Fine model satisfied Δ1 (95 % CI = [0.090; 0.139]) whatever follow-up period time. They were equal to 0.105, 0.099, 0.099, 0.107, and 0.095 with t p  = 1, 2, 3, 4, and 5 years, respectively. However, predictions at 15, 20, and 25 years were all underestimated with any follow-up period time. They were equal or lower than 0.144, 0.173, and 0.197 at 15, 20, and 25 years, while lower bounds of 95 % non-parametric confidence intervals were equal to 0.148, 0.206, and 0.253, respectively.

Only prediction rates at 15 years with follow-up limited at 1, 2 or 4 years which were equal to 0.139, 0.132 and 0.144 satisfied Δ2 (99.9 % CI = [0.130; 0.226]).

For the other treatment periods, prediction rates were generally also underestimated whatever follow-up period time.

Similar results for the three event types were obtained in the subpopulation with hormone receptor-positive status (data not shown). Due to the small number of patients with hormone receptor-negative status, it was not possible to perform analysis in this subpopulation.

Discussion

Competing risks methodology has been developed for more than 30 years and this setting is increasingly used in cancer research. The aim of this paper was to evaluate the ability of competing risks parametric model to predict long-term cumulative incidences using short-term breast cancer data. A good prognosis population was selected for predictions in order to have a homogeneous sample in terms of risk of disease recurrence and overall survival. The direct approach proposed by Jeong and Fine seems to provide a useful alternative to non-parametric method of cumulative incidence estimation even when follow-up data are sufficiently extensive. Predictive ability of this model on breast cancer data was generally good considering the short follow-up period time used for estimation.

For the loco-regional recurrence risk, predictions were generally accurate even if follow-up period was very short (1 or 2 years). With follow-up limited at 3 years, Jeong and Fine model overestimated cumulative incidence rates but predictions were better with a longer follow-up period (i.e., 4 or 5 years). For the distant metastasis risk, predictions generally satisfied the validation criteria, except with follow-up limited at 1 or 2 years where long-term predictions were overestimated. Saphner et al. [12] highlighted the presence of a peak hazard of recurrence after several years of follow-up and decreasing thereafter in breast cancer after primary therapy. This phenomenon can explain overestimation of prediction rates of loco-regional recurrence and distant metastasis. Indeed, if follow-up period is not sufficiently large to capture the peak hazard of recurrence, cumulative incidence rates can be overestimated. It therefore would be preferable to fit the model using sufficiently large follow-up period in order to averaging this peak hazard of recurrence. In the second group where patients were treated between 1986 and 1990, predictions rates of distant metastasis were highly overestimated even if follow-up was limited at 5 years. Follow-up were extended to 6 or 7 years in order to capture the peak hazard of the first years, and prediction rates satisfied validation criteria (see Supplementary materials). Sensitivity analyses were also performed on treatment period (2 and 3 years) to evaluate the performance of the technique with fewer events. Due to the smaller number of patients and a shorter follow-up period, long-term predictions were less accurate (data not shown).

95 % non-parametric confidence interval (Δ1) was considered as validation criteria for the predictive ability of the model. Mould [5] suggested relaxing the validation criteria for long-term prediction at 20 and 25 years. Cumulative incidence functions were overall well predicted by the model at 10 and 15 years but they were less accurate in the long term according to Δ1. Indeed, predictions were calculated using at most 10 years of follow-up data making at least 10 and 15 years of follow-up saved rather than non-parametric method for predictions at 20 and 25 years, respectively. It seems reasonable, given the number of years saved, to apply this principle and use a wider non-parametric confidence interval as validation criteria for long-term predictions. 99.9 % non-parametric confidence interval (Δ2) was also used. In the first treatment group, width of Δ2 at 20 years (resp. 25 years) was 10.1 % (resp. 12.3 %) for the loco-regional recurrence and 9.1 % (resp. 9.6 %) for the distant metastasis. Most of the prediction rates with follow-up limited at 4 or 5 years satisfied Δ2, except prediction rate at 20 years with t p  = 4 for the loco-regional recurrence which was equal to 0.237 and 99.9 % CI = [0.134; 0.235]. Prediction rates at 20 or 25 years had a maximum difference of around 6 % compared to the gold standard estimation. The same trend for the other treatment periods was observed (see Supplementary materials). It is also possible to apply these validation criteria at any time provided that there were enough patients still at risk at this time in order to have accurate non-parametric estimates. As pointwise confidence interval is valid for a single fixed time, an alternative approach is the use of confidence bands. Upper and lower confidence band guarantee, with a confidence level, the cumulative incidence function that falls within the band. This could allow the possibility to evaluate the predictive ability of the model over time and not at a specified time points.

For the third event type (contralateral cancer or second cancer or death), predictions of cumulative incidence were less accurate than the two first event types. The heterogeneity of this composite event including contralateral cancer, second cancer, and death may explain this lack of precision. Furthermore, this model is built to be applied on a distribution where hazard decreases over time. In this study, death from any cause was considered and not only death from cancer. It is quite unusual to die from cancer without experiencing a local or distant relapsed first, so it is reasonable to think that most deaths observed as first event were not related to cancer. Given the age at the end of treatment of patients (49.5 years for the 1981–1985 period), the instantaneous hazard of death increased with time instead of decreasing as assumed by the model. This feature could also explain the underestimations of cumulative incidence of contralateral or second cancer or death at long term. If only death related to cancer would be considered instead of death from any cause, predictions of cumulative incidence of the third event type would be more accurate. Several authors [8, 13] recommend testing the goodness of fit of the data to the model before using it. The modified Gompertz model is only applicable when the hazard function decreases with time. This is why this model is more suitable to predict cumulative incidence of a particular event like loco-regional recurrence or distant metastasis than the third event where death not related to cancer was considered. In fact at infinity, all patients cured from the disease will die from other cause.

This is one of the limits of this direct estimation approach, the fitted event-free survivor function is not constrained to be a proper survivor function, i.e., the sum of the cumulative incidence functions at infinity equals 1. More recently, Shi et al. [14] proposed a parameterization that explicitly accounts this constraint [15]. This merits further investigation.

Another important aspect in oncology is the probability of cure which commonly names as the cure rate. In fact, recent advances in cancer treatment have modified the course of some cancer types and some patients will never relapse after treatment [16]. The term of cure can be used only when data present a large number of long-term survivor. Different models address the problem of cure rate estimation in the analysis of survival data [3, 17, 18] but few attentions were given in the literature concerning the evaluation of cure rate in the presence of competing risks data [19, 20]. Future work need to evaluate statistical tools to analyze competing risk data in the presence of cure fraction.

The Jeong and fine prediction model provides a useful addition to the Kalbfleisch and Prentice non-parametric method for the estimation of cumulative incidence. Using this model, it is possible to save a significant number of follow-up years for the estimation of long-term data. This could result in more rapid advances in breast cancer treatment and have the potential benefit of a reduction of cost. However, this prediction model can only be used in clinical practice when it is fully validated for a cancer site.