Introduction

There is a wide perception that the methodologies which analyze breast cancer for prognostic and predictive purposes are neither accurate nor standardized [1, 2].

One of the main reason is the intrinsic limit of immunohistochemical reactions, particularly due to both the conspicuous effects of the pre-analytical phase [3] and the subjectivity of interpretative criteria [4], which are often based on unclear guidelines [5, 6].

In addition, another source of poor reproducibility is the extreme intra- and inter-tumoral heterogeneity [7], which is poorly framed by the current classification schemes.

The introduction of both molecular breast cancer classification [8] and its immunohistochemical surrogate [9] has attenuated some discrepancies, ameliorating the therapeutic choices, particularly as regards the category of luminal breast cancers (LBCs). This clinical-pathological category encompasses the luminal A-like carcinomas—characterized by estrogen receptor (ER) and progesterone receptor (PgR) positivity, HER2 negativity, and a low proliferation index (Ki-67)—which (either in adjuvant or neoadjuvant setting) need only endocrine therapy and luminal B-like carcinomas—ER and PgR positive or negative, either HER2 positive or negative and Ki-67 high neoplasms—for which (adjuvant or neoadjuvant) chemotherapy could be added to endocrine therapy [9]. However, the category of LBCs, which includes about 65% of all breast cancer cases, is characterized by a remarkable variability of biological behaviors and, consequently, diverse response to therapies. This scenario strongly disagrees with the needs of a true precision medicine which presupposes a precision pathology [10,11,12].

Multi-gene expression profiles (MGEPs) have significantly improved our skills to predict the outcome of LBCs; moreover, they, in addition to conventional prognostic and predictive markers, significantly help treatment decisions (chemotherapy use and/or extended endocrine therapy) [3], as currently recommended by several guidelines [13]. Among MGEPs, EndoPredict® (EP Myriad Genetics®, Salt Lake City, USA) is a last-generation prognostic gene signature assay that uses gene expression to calculate the risk of metastases within 10 years from the initial diagnosis of breast cancer [14]. This test has been specifically developed for LBCs. In contrast with almost all the other commercially available multigene tests, it is suitable for decentralized use instead of a single reference laboratory and allows to correlate molecular data and clinical-pathological parameters in a single integrated diagnostic algorithm useful for clinical decisions [15,16,17].

Using a quantitative RT-qPCR on formalin-fixed, paraffin-embedded (FFPE) samples, EP is based on mRNA expression of 12 genes—UBE2C, BIRC5, and DHCR7 (proliferation-associated genes); RBBP8, IL6S7, AZGP1, MGP, and STC2 (estrogen receptor-associated genes); CALM2, OAZ1, and RPL37A (reference genes); and a control gene for the presence of DNA contamination [14, 15, 18]—to calculate the EndoPredict score (EP score), which, using a validated cutoff value of 5, can classify patients with LBCs into either low- or high-risk class for developing metastases after endocrine therapy [14, 19]. The EP score can be combined with the nodal status and tumor size to obtain the integrated molecular and clinical risk score (EP clin), which allows a better risk stratification in terms of recurrences at 10 years; according to the methods described by Filipits et al. [14], a cutoff point of 3.3 corresponds to a 10-year risk (EP clin risk) of 10%.

The availability of EndoPredict® (EP) has given us the opportunity to have an experience in this field, which is reported in the current study, highlighting, in particular, how such a technique can greatly improve the service that anatomic pathology provides for the benefit of patients suffering from breast cancer.

Materials and methods

EndoPredict ®

The EP test was performed on total RNA, which, according to the instructions [19], was manually isolated from FFPE blocks of the primary resected tumor. For isolation, a 10-μm section of an FFPE block (surgical specimen or core biopsy) was used. The portion of the tumor must have been at least 30% (ratio of invasive tumor tissues to total tissue area). Adipose tissue is not to be considered as tissue, as it hardly contains RNA. The evaluation of the portion of the tumor was determined using the adjacent Hematoxylin and Eosin section. If the tumor section content was less 30%, the proportion of the tumor was enriched by macro-dissection. The isolation procedure started with the removal of the paraffin and the lysis of the tissue, and subsequently, the nucleic acid was extracted using silica-coated magnetic particles. After extraction and elution of the nucleic acids, a DNase digest was carried out. As a sufficient quantity of RNA and efficient DNA digestion is required for a valid EndoPredict test, all the samples were controlled with the EndoPredict QC Strip to verify the adequate level of RNA before subsequent analysis, according to the manufacturer’s instructions. [19].

After a specific training, we started the implementation of EP in our Department, strictly following the manufacturer’s instructions [19].

Study design and patients selection

The study was divided into three phases. Phase 1 aimed to verify the reproducibility of results of EP in difficult technical conditions, both regarding the pre-analytical phase of tumoral tissues collection—presence or absence of inadequate fixation artifacts such as tissue shrinkage, loss of cellular cohesiveness, cytoplasmic clumping, indistinct nucleoli, and loss of other cellular details; thermal stress; scarcity of useful tissue derived from core biopsies; differences between FFPE specimens which underwent a previous intraoperative examination by cryostat and thereafter routinely formalin-fixed and paraffin-embedded—and comparing the primary invasive tumors with their own axillary node metastases or their own in situ component, which were prepared for the molecular test by macrodissection, as above-mentioned for tumor section content less 30%. For this purpose, 12 samples of 6 representative cases were retrospectively recruited.

In the phase 2, the EP was applied on representative FFPE blocks of eight patients selected from our archive, having both an adequate follow-up (period of follow-up 2005–2016) and a complete therapeutic history, in order to firstly analyze retrospectively the concordance between the results of EP test and the prognostic grouping based on conventional bio-pathological parameters, the therapeutic choices made at the time of the diagnosis, the patients’ outcomes, and the variations of therapeutic choice that EP test would determine if adopted at the time of the original diagnosis.

In phase 3, the EP test was applied in daily clinical practice (period of diagnoses from 2015 to 2018) based either on pathologists’ autonomous decision—whenever a breast carcinoma belonged to the luminal B category of breast cancer or presented clinical-pathological discrepancies, such as inconsistency between tumor grading/histotype and nodal status or ambiguous bio-pathological characterization—or on oncologists’ requests—collected both as oncologists’ opinions in the frame of multidisciplinary teams and as oncologists’ independent requests—in order to assess their attitude toward it. These 100 cases, randomly collected based on the chronological order of the request, were therefore stratified in 3 risk classes (low, high, and equivocal risk), according to both clinical-pathological characteristics and the main international guidelines in force at the time of the diagnosis [20,21,22,23,24].

Descriptive statistical methods were used for both analyzing and representing all the obtained data, displaying them as numbers, counts, and percentages.

Results

Phase 1—reproducibility of EP test

The first phase of the study (Table 1) included 12 specimens obtained from 6 patients, with median age of 62.5 years and with no special type (NST) breast carcinomas G2 and G3.

Table 1 First patients’ group studied with EndoPredict®

Among these, two (33%) cases were studied for highlighting the eventual differences in EP test results between specimens from the same patient processed one as frozen biopsy (and afterward underwent the routinely FFPE processing) and the other routinely FFPE. There were not any differences among the results in these cases. Comparing the EP test performed both on the primary cancer and on one out of its two axillary node metastasis (20 axillary nodes totally examined), there were no discrepancies among the results obtained. Moreover, no differences in the results of EP test in case of core biopsy material versus (vs) surgical specimen, adequate vs inadequate fixation of the tumor tissue, and invasive vs in situ component of breast carcinoma were found.

Phase 2—concordance analysis of patients’ risk stratification

The second phase of the study (Table 2) included eight cases having a mean follow-up of 95.5 months (median follow-up of 96 months, range 51–132 months).

Table 2 Second patients’ group studied with EndoPredict®

The EP test results showed four (50%) breast cancers belonging to the low EPclin risk class and four (50%) cases included into the high EPclin risk class.

Comparing the EP test results with the patients’ outcomes, we highlighted that three (38%) cases could have received a more appropriate therapy. In fact, one (33%) patient, still alive and well (AW), belonging to the low EPclin risk class underwent adjuvant chemotherapy, while only adjuvant endocrine therapy (ET) should have been enough for her. Moreover, two (67%) deceased patients could have undergone adjuvant chemotherapy (CHT) if they had known, at the time of diagnosis, that their breast cancer belonged to the high EPclin risk class.

On the other hand, two (25%) patients still AW underwent only adjuvant ET, resulting in the correct therapeutic choice, as their low EPclin risk class confirmed. Furthermore, one (13%) patient with EP test result of high-risk disease underwent neoadjuvant CHT and showed complete response.

One (13%) case underwent the appropriate treatment at the time of diagnosis (adjuvant ET+ CHT), but this male patient died, consistently with the high-risk class demonstrated by EP test.

Finally, one (13%) case with low EPclin risk had a partial response after neoadjuvant CHT.

Phase 3—application of EP test in routinely clinical practice

As regards the third phase of the study (Table 3), EP test was performed for 100 patients, according either to pathologists’ or oncologists’ requests.

Table 3 Primary motivations of EndoPredict® request

Although the EP has been specifically developed for early LBC risk stratification, the motivation for EP test request in our study also encompassed a range of features involved in therapeutic decisions. In particular, while involved pathologist chooses EP mainly (41% of cases) for borderline bio-pathological aspects (e.g., Ki-67 close to the cut-off [20]—30%—or low hormonal receptor expression—11%), oncologists’ recourse to EP test was often (43%) connected with either tumoral staging interpretation (e.g., tumor diameter less than 10 mm—8%—minimal nodes involvement—13%) or, even, with 1–3 positive node cases (23%).

Applying the EP test in this way, both unexpected and clinically useful results have been reached, such as the finding of two (2%) G1 breast carcinomas or of a usually harmless tubular histotype with high likelihood of aggressive biological behavior (Fig. 1).

Fig. 1
figure 1

An example of a LBCs belonging to the high EP clin class, the histotype (tubular) of which usually does not show an aggressive clinical behavior. ER = estrogen receptor; PR and PgR = progesterone receptor; Ki67 = proliferative index; *surrogate = immunohistochemical surrogate classification according to St Gallen guidelines [20]; CHT = adjuvant chemotherapy; ET = endocrine therapy. HE = Hematoxylin and Eosin. The cross pointer in the chart indicates the patient’s risk stratification class in terms of recurrences at 10 years

Clinical-pathological risk prediction

As regards the clinical-pathological point of view, the EP test revealed its fundamental importance for the therapeutic choices of the 100 cases analyzed, as highlighted in Fig. 2.

Fig. 2
figure 2

EP test application in daily clinical practice, cumulative considerations obtained from the first 100 cases routinely tested

Among the 28 cases judged as high risk based upon clinical-pathological findings, the EP test allowed to identify four (14%) patients belonging to the low-risk class of EPclin. On the other hand, the EP test found 1 (5%) out of 21 cases assessed as low risk after clinical-pathological evaluation which presented, actually, a high EPclin risk, requiring CHT treatment. Finally, EP test helped to take a proper therapeutic decision for the 51 cases with equivocal risk.

Discussion

The decision of adding chemotherapy (CHT) to endocrine therapy (ET) in early ER-positive, HER2-negative breast cancer is often controversial and it is known that most patients can safely avoid it [25, 26]. In fact, conventional histopathological features (size, histotype, grade, vascular invasion, and node status) and biomarkers (ER, PgR, HER2, and Ki-67), together with guidelines and additional on-line predictive tools, are often clear enough for oncologists to either recommend or omit CHT. However, given the heterogeneity of luminal breast cancers (LBCs, about 65% of the total), accurate estimation in terms of risk of recurrence after ET alone is not possible. In fact, these parameters are unable to estimate the risk of late distant recurrence (5–10 years), particularly in women with node-negative disease or with limited nodal involvement, and to identify the subset of patients who remain at low risk of distant recurrence even after 10 years [27]. On the other hand, these bio-pathological characteristics are not always predictive for possible CHT benefit and a substantial overtreatment in these patients is a logical consequence.

On this background, the main purpose of our study was both to describe and to analyze the impact deriving from the implementation of a gene expression assay for LBCs in the routine diagnostic activity of a department of anatomic pathology, focusing on difficulties of implementing this test in routine practice and on improvements, in terms of prognostic and predictive stratification, compared with conventional bio-pathological parameters and, consequently, on oncologists’ behavior towards this useful aid for therapeutic choices.

Based on our experience, EP test can be easily implemented in an Anatomic Pathology Department which has sufficient experience in the bio-pathological characterization of breast carcinomas [15].

The first part of our study had tested the reproducibility of EP. According to the concordance of results obtained on material that undergone different pre-analytic phases, such as either frozen versus formalin fixation or adequate versus inadequate formalin fixation, we showed that EP is a resilient tool which can overcome the detrimental effects of an inaccurate pre-analytic phase [3, 12], allowing a good assessment of prognostic and predictive parameters of breast carcinoma, even in tumoral tissues suboptimally preserved. It is well known that all these conditions make immunohistochemical (IHC) interpretation of biomarkers’ expression difficult or impossible; therefore, EP test, compensating these intrinsic limitations of IHC—particularly issues regarding both accuracy and reproducibility of cell proliferation determination using the monoclonal antibody Ki-67 [4]—could improve the pathologists’ role for a more precise diagnosis and treatment of breast cancer.

An additional result, worthy of further studies, is the similar gene expression profiles obtained comparing either carcinomas in situ to their invasive counterpart, primary tumors to their nodal metastases or core biopsies to respective surgical specimens [18]. All these observations seem to support a possible ability of the EP test to overcome conditions of strong intra- and inter-tumor heterogeneity or limited representativeness of the sample.

As regards the evidences derived from the second phase of our study, we confirm the literature data [14, 17], according to which the EP test is a more useful tool for a better risk stratification of patient with early LBCs than conventional clinical-pathological parameters, allowing the choice of more personalized treatment. In fact, according to the retrospectively obtained results of the EP test, among the group of still AW patients, one received probably overtreatment with adjuvant CHT; on the other hand, two out of deceased patients did not receive adjuvant CHT treatment, either based upon oncologist’s decision or upon patient’s refusal. Obviously, if at that time, the oncologist had had the EP test available, the therapeutic choices would have been significantly different. It is also important to note that one patient died despite appropriate therapy due to an intrinsic greater aggressiveness of their tumors, as their high class of EPclin risk demonstrated.

Regarding the third phase of our study—the routine use of the EP test on current cases—one of the main aspects that we noticed was the attitude of the oncologists towards this new test. In fact, although we had proposed the introduction of the EP test as a tool for decision aids, their rate of requests was extremely low and not very enthusiastic. Thereafter, since scientific societies and international conferences have allowed the gradual emergence of specific recommendations in the guidelines issued [13, 20, 22], oncologists’ interest in the EP test grew progressively, extending their requests not only to LBCs, but also to carcinomas with other types of therapeutic uncertainty, regardless of tumor diameter, histotype, grading, and node status (Table 3). This evolution shows that EP test can alleviate the heaviness of these colleagues’ therapeutic choices [28].

Although remarkably interesting, the results obtained need additional validation on a larger number of samples, particularly regarding the usefulness of EP test to overcome conditions related to either inadequate preanalytical phase or tumor heterogeneity, observed in the first phase of the current study. Moreover, it could be interesting to confirm the accuracy of therapeutic choices taken according to EP test results once an adequate follow-up period of at least 10 years will be achieved by the patients enrolled in the third phase of the study and, consequently, corroborate the evidence of EP as a helpful tool in routine breast cancer diagnosis and management.

Conclusions

EP test, if appropriately performed in selected cases, is able to identify subtypes of LBCs with prognostic and predictive factors otherwise not accurately highlighted by conventional clinical-pathological characterization. EP test is a useful supplement to pathological routinely used methods, compensating the intrinsic limitations of both laboratory techniques and the marked intra- and inter-tumor heterogeneity, which characterize this disease. EP test, if applied widely, enhances the informative value of pathologists’ report, allowing both oncologists and patients to solve the prognostic assessment and therapeutic uncertainties, particularly avoiding chemotherapy overtreatment in breast cancer. In the era of diagnosis and stratification of distinct disease subtypes at cellular level, the multi-gene expression profiles represent an attempt to best identify the prognosis and the optimal therapeutic choices for individual patients. The pathologists’ perception is that EP test is able to introduce remarkable and useful improvements in the characterization of individual cases and, consequently, in more appropriate therapeutic choices.