Introduction

Breast cancer is the most prevalent type of malignant neoplasm worldwide, and hormone receptor (HR)-positive breast cancer is the most common phenotype. Since their introduction in the 90s, aromatase inhibitors (AIs)—anastrozole, letrozole, and exemestane—became the standard of care for postmenopausal HR-positive breast cancer both in early-stage and advanced disease settings. For patients with HR-positive advanced breast cancer (ABC), clinical guidelines recommend sequential treatment with endocrine therapy, unless they are experiencing visceral crises and/or endocrine resistance is known or suspected [1, 2].

Progression-free survival (PFS) and time to progression (TTP) have been the most frequently used endpoints in ABC. In recent studies, both PFS and TTP (PFS/TTP) have been considered as surrogate endpoints for overall survival (OS), for the purpose of drug approval regulatory process. PFS/TTP are attractive endpoints because they are available earlier than OS, are less likely than OS to be influenced by competing causes of death, and are not affected by the treatments administered after progression. On the other hand, these endpoints are subject to measurement errors and bias [3].

Recently published randomized clinical trials evaluating new therapeutic approaches used AI monotherapy arms as the control group. Better than expected survival outcomes in patients treated with single-agent AI have been observed. This finding has potential implications for therapeutics decisions regarding the sequencing of endocrine agents as well as for the design and conduct of ongoing and future clinical trials.

This study aims to evaluate the evolution over time of the PFS/TTP obtained in trials testing first-line AI monotherapy in patients with HR-positive ABC.

Methods

A literature review was conducted using the MEDLINE database to identify randomized, controlled phase II or III trials which included at least one arm of first-line AI to treat HR-positive ABC patients using the following strategy: (((((breast cancer[Title]) AND aromatase inhibitor[Title/Abstract])) OR ((breast cancer[Title]) AND anastrozole[Title/Abstract])) OR ((breast cancer[Title]) AND letrozole[Title/Abstract])) OR ((breast cancer[Title]) AND exemestane[Title/Abstract]). For this analysis, first-line treatment was defined as the first endocrine therapy used for ABC irrespective of prior (neo) adjuvant endocrine therapies for early-stage disease. The search strategy included terms applicable to the patient population (postmenopausal women with locally advanced or metastatic breast cancer). Publications reporting PFS or TTP data in patients treated with first-line AI monotherapy (anastrozole, letrozole, or exemestane) were considered eligible for further assessment. However, there are some subtle differences between these outcomes that must be pointed out. While PFS is defined as the time elapsed between treatment initiation and tumor progression or death from any cause; TTP considers only disease progression as event, censoring patients who die from any cause before progression. Language was restricted to English, but there was no date of publication restriction. Data were evaluated from publications reporting either the primary or follow-up analyses. Studies designed for HER-2-positive breast cancer were excluded.

The following data were extracted from each relevant publication identified: phase, sample size, year of publication, the period of patients` accrual, estimated median PFS/TTP used for sample size calculation and median observed PFS/TTP. We also evaluated baseline patients’ characteristics such as age, the proportion of patients with visceral disease, previous endocrine therapy exposure, the proportion of patients with confirmed HR positivity and the proportion of patients with confirmed HER2 negativity.

A linear correlation was used to access the association between the year of the first patient enrolled and the PFS/TTP reported with AI as first-line treatment for HR-positive ABC in each study. We hypothesized that the reported PFS/TTP times would be longer as the year of the first patient enrolled increases. p values lower than 5% were deemed to be significant.

Results

The literature search retrieved 568 publications, of which 19 met the inclusion criteria for this review, accounting for 4552 postmenopausal patients divided into 21 separate treatment arms with an AI as first-line monotherapy treatment for HR-positive ABC. Figure 1 summarizes the flowchart for study selection. The studies included in this review are described in Table 1.

Fig. 1
figure 1

Flowchart for study selection summary. Phase III and randomized phase II clinical trials with at least one AI monotherapy arm for first-line treatment of postmenopausal patients with HR+ HER2 negative advanced breast cancer were included

Table 1 Randomized controlled trials of endocrine therapies for the treatment of postmenopausal women with hormone receptor-positive advanced breast cancer in the first-line setting

The overall median PFS/TTP ranged from 6.1 to 15.6 months among trials. Nine studies reported median PFS/TTP for anastrozole ranging from 8.2 to 15 months [4,5,6,7,8,9,10,11,12,13,14], while seven studies reported PFS/TTP between 9 and 15.6 months with letrozole [15,16,17,18,19,20,21,22] and 6.1 and 13.8 months with exemestane in three studies [7, 14, 23] Our analysis showed a positive correlation between the year of the first patient enrolled in these trials and median PFS/TTP reported (R 2 = 0.34; p < 0.01), meaning that patients treated with first-line AIs in the more recently conducted trials have longer PFS/TTP than their counterparts treated with the same drugs but in older studies (see Fig. 2). In univariate analyses, no correlation was found between the year of the first patient included in these trials and other potential prognostic factors such as the proportion of patients with visceral metastasis at baseline (R 2 = 0.26; p = 0.20), median age (R 2 = 0.10; p = 0.15), and percentage of patients who had never been exposed to adjuvant therapy (R 2 = 0.05; p = 0.18). Multivariate analysis was not carried out because of the reduced number of studies available to be included in this exploratory estimation. No other correlations with potential prognostic factors were sought. The number of patients accrued in each individual study is a factor that could influence the precision of the median PFS/TTP times reported by the studies included in this analysis. Univariate linear regression did not show significant variation in the sample size magnitude along time (R 2 < 0.01; p = 0.68). As described in Table 1, 10 studies reported the estimated 95% confidence interval for their median PFS/TTP, while nine did not. Therefore, this variable was not included in the estimations presented and represents a limitation of this analysis.

Fig. 2
figure 2

Linear correlation between the year of the first patient enrolled within first-line AI trials in HR-positive ABC and median PFS/TTP reported

Table 2 describes the nine trials in which the estimated PFS/TTP in the AI monotherapy group used for sample size calculations were reported. In all but one trial, the expected PFS/TTP was lower than the observed PFS/TTP.

Table 2 Description of the studies reporting the expected and observed PFS/TTP for first-line AI monotherapy

Discussion

Our analysis suggests that the PFS/TTP achieved with aromatase inhibitors therapy as first-line treatment for advanced breast cancer have shown progressive improvement. The PFS/TTP increased from 6 to 9 months in the earlier trials to 13–16 months in the current era, representing an absolute gain of approximately 7 months, without the addition of any other drug. These advances raise challenges on how to incorporate these findings into the interpretation of available clinical trial data and the design of future research. Immediate implications are related to sample size and follow-up time estimation for ET trials design.

Most likely, a combination of improved medical care and patient selection factors justifies most of this progressive improvement in outcomes obtained with the same therapeutic strategy. While clinical trial entry criteria have not changed significantly over time, more recent studies are probably including a different population of patients. With the widespread use of modern radiology tests, the population included in the latest trials may include a higher proportion of patients with a lower burden of metastatic disease. Exclusion of HER2 patients is routinely done in recently published endocrine therapy trials; however, HER2 status was not available in earlier studies. At the same time, older trials included a minority of patients with unknown HR status. Therefore, a small fraction of patients with HER2-positive or HR-negative breast cancer was included, and this could lead to inferior PFS/TTP in older studies.

Better outcomes may also be a result of improved loco-regional treatments and increasing experience with managing the expected adverse effects. Additionally, improved general medical care and the use of adjuvant therapies (i.e., radiotherapy, bisphosphonates, and RANKL inhibitors) is another potential explanation for the observed improvement in PFS outcomes using the same therapy on similar population, even though these factors are hard to measure and probably of small importance.

An alternative explanation that may contribute to a trend for better survival outcomes in more recent trials is a change in tumor biologic features, with a hypothetical shift towards a higher proportion of endocrine-sensitive tumors. Some studies have reported a difference in the epidemiology of breast cancer over time, with more ER-positive tumors being observed than in the past [24,25,26]. Whether these changes occur in the metastatic setting or are simply related to improvements in screening strategies of early-stage indolent tumors has not been adequately studied.

The inclusion of a population with characteristics associated with hormone sensitivity and/or less aggressive disease, such as a higher proportion of endocrine therapy-naive patients or patients with non-visceral metastases, could lead to longer PFS. However, our analysis shows that there were no significant differences over time in the proportion of endocrine therapy-naïve patients neither on the proportion of patients with non-visceral disease.

Estimating the event rate to proceed with sample size calculation in clinical trials is challenging in this scenario. For instance, PFS/TTP results for the AI monotherapy arms in the trials included in this review have consistently surpassed the expectations used for calculations of HR and sample size (see Table 2). With the exception of one study, in all studies, the observed rates surpassed the estimated PFS/TTP in trial design. The CALGB 40503 trial provides an extreme example. This phase III study was designed to investigate whether bevacizumab prolongs PFS when added to first-line letrozole as treatment of HR-positive advanced breast cancer [21]. This trial had 90% power to detect a 50% improvement in median PFS from 6 to 9 months based on literature available when the study was designed. Surprisingly, the observed PFS for the letrozole monotherapy arm was 15.6 months.

We expect that ongoing trials will face similar challenges. The VICTORIANE trial is an ongoing randomized phase 3 study assessing the addition of oral vinorelbine to aromatase inhibitors for the treatment of patients with endocrine therapy-naive HR-positive ABC. The sample size for this trial was calculated based on an expected PFS for the AI monotherapy arm of only 9 months [27]. The MONARCH 3 is a randomized phase III study of AI plus abemaciclib, a CDK4/6 inhibitor, or placebo in first-line treatment of women with ER+ advanced breast cancer [28]. Similarly, the estimated PFS in the AI monotherapy arm considered for study planning was 10 months. These trials are examples where there is a high probability that the PFS in the AI monotherapy control arm will surpass the estimative. Calculating sample sizes considering a PFS/TTP lower than the actual one and an excessively optimistic hazard ratio may result in underpowered clinical trials with critical issues impacting sample size and follow-up time.

Interestingly, similar improvements in survival outcomes have also been identified in studies of early-stage breast cancer with AI monotherapy as the control group. Because an annual risk of relapse persists over decades, the development of new adjuvant ET approaches requires protracted follow-up and large sample sizes. In the FACE trial [29], comparing anastrozole versus letrozole in the adjuvant setting, investigators were challenged with a lower-than-expected event rate and decided to report results before reaching the expected number of events. Similarly, in the SOFT [30] and TEXT [31] trials, that evaluated the role of ovarian function suppression and exemestane in premenopausal patients with early-stage HR-positive breast cancer, the expected survival outcomes used for sample size calculation were overly pessimistic. Consequently, the event rate was over-estimated and the duration of follow-up time to reach the targeted number of events for final analysis was under-estimated [32].

Other than the impact on methodological issues for future research, these findings also impact clinical practice. Given continued efforts to develop therapies that delay the emergence of endocrine resistance, the results of this analysis demonstrating the favorable outcome for these endocrine-sensitive subgroups has immediate relevance. With added clinical and financial toxicities, it is important to define HR-positive populations that are more likely to benefit from combination therapy and to identify those patients who may do well with endocrine therapy alone. [33]. Future research on prognostic and predictive biomarkers might assist in estimating the pattern of endocrine sensitivity or resistance and guide therapeutic decisions on ET sequencing strategies.

Conclusion

While acknowledging the inherent cross-trial analyses limitations, our work suggests a progressive improvement in PFS/TTP of first-line endocrine therapy with aromatase Inhibitors containing trials over time. Our scrutiny of the data does not reveal any definitive reason that fully explains this phenomenon. Nevertheless, these findings have important implications for the planning and sample calculation of currently ongoing and future randomized clinical trials.