Introduction

Neoadjuvant chemotherapy (NAC) is increasingly used in the treatment of patients with large but still operable breast cancer or proven lymph node metastases. Although the overall survival benefit is similar to that of adjuvant chemotherapy [1, 2], NAC has two main advantages. First, reducing the tumor burden in the breast increases the rate of breast conservation surgery [3, 4]. Second, it allows assessment of tumor response to specific chemotherapeutic agents. The conventional outcome measurement of NAC is pathologic complete response (pCR) at surgery, which has been shown to be associated with improved survival [5, 6].

The rates of pCR differ among molecular subtypes, but it is not known how tumor histology affects these rates. Invasive breast cancers can be divided into two main histologic subtypes: ductal and lobular. Invasive lobular cancers (ILCs) account for 10 % of breast cancers, comprising a small but important subgroup. They are characterized by loss of the cell adhesion marker e-cadherin, and a diffuse growth pattern characterized by non-cohesive cells arranged in a single-file pattern [7, 8].

ILC and invasive ductal cancer (IDC) differ in expression of standard histopathology markers, molecular expression patterns, and radiographic appearance [911]. Compared to IDC, ILCs are more likely to be hormone receptor (HR) positive [12], less likely to be high grade, and often appear more diffuse both radiographically and clinically. Some of these differences are attributable to differences in cell adhesion resulting from loss of e-cadherin, rather than to different anatomical origins, as both ILC and IDC are derived from the terminal duct lobular unit [11].

Several reports have suggested that ILC is less responsive to chemotherapy than IDC and thus should be treated with endocrine therapy only [10, 13, 14]. Similarly, investigators have noted a lower pCR rate to NAC among lobular cancers, which may result in a lower rate of breast conservation surgery (BCS) than in IDC [15, 16]. However, it remains unclear whether lower rates of pCR and BCS observed in ILC are due to histology or to biologic and molecular factors associated with histology, such as HR status [1619].

In view of the controversy over the role of NAC in ILC, we compared ILC and IDC in a large series of neoadjuvantly treated breast cancer patients for whom MR imaging and gene expression data were available. We compared clinical, pathological, and molecular features of ILC and IDC using pCR and BCS rates as the primary outcome variables. We report the combined results from two neoadjuvant studies with a total sample size of 676 locally advanced breast cancers, of which 75 were ILCs. We compare molecular characteristics, and relate them to differences in pathologic and surgical outcomes for ILC and IDC.

Methods

Subjects

Study subjects were treated through the I-SPY 1 trial (n = 221) or two ongoing neoadjuvant studies at the Netherlands Cancer Institute (NKI, n = 455). The I-SPY 1 trial was a multi-center NAC trial in the United States which accrued patients between 2002 and 2006 [20, 21]. Eligible patients were women with invasive breast cancer measuring at least 3 cm by clinical examination or imaging, with no evidence of distant metastatic disease. All patients received anthracycline-based NAC, with the use of taxanes left to the discretion of the treating physicians. After 2005, trastuzumab was given to patients whose tumors were HER2+ (n = 20). Patients underwent serial core needle biopsies before, during, and after NAC, as well as serial MRI examinations.

The two NKI trials accrued 181 and 274 patients between 2004 and 2010. Eligible patients had a breast carcinoma with either a primary tumor size of at least 3 cm, or the presence of axillary lymph node metastases proven by fine needle aspiration (FNA) [22, 23]. A treatment regimen was assigned to each patient, consisting of one of the following, depending on the study: (1) six courses of dose-dense doxorubicin/cyclophosphamide (ddAC) or (2) six courses of capecitabine/docetaxel (XD). If the therapy response was considered unfavorable by MRI evaluation after three courses, ddAC was changed to XD or vice versa. Patients with HER2+ tumors received three 8-week courses of trastuzumab, paclitaxel, and carboplatin (PTC). Patients underwent serial core needle biopsies before and after NAC. A subset of patients (n = 138) underwent serial MRI examinations.

Standard pathology biomarkers

All breast pathology slides were reviewed by specialty trained breast pathologists (JW for the NKI, CL for the I-SPY 1 trial). A tumor was termed ILC based on histologic growth pattern. E-cadherin staining was performed centrally for the I-SPY subjects and NKI subjects, respectively, but was not included in the definition of ILC. Standard clinical biomarkers including HR and HER2 status were measured at local sites using immunohistochemistry and/or fluorescence in situ hybridization assays (FISH) in the case of HER2. P53 expression was assessed by immunohistochemistry, centrally for the I-SPY 1 patients or at the NKI for the NKI patients. pCR was defined as the absence of invasive tumor in both the breast and axillary lymph nodes after NAC. A patient with only residual DCIS or microscopic tumor cells was considered to have a pCR.

Molecular data

High quality gene-expression arrays were available on 149 patients from the I-SPY trial (Agilent arrays) and 265 patients from the NKI trials (Illumina 6v3 arrays). From the I-Spy 1 trial, intrinsic subtype classification was determined by PAM50 50-gene assays [24]. The 70-gene prognostic profile was determined using representative probes as previously described and classifies patients as high or low risk for recurrence [25]. The wound healing signature was used to classify tumors as quiescent or activated [26].

Imaging data

For I-SPY 1 patients, centrally trained breast radiologists at each site read all MRIs and assigned an imaging phenotype for the pre-treatment MRI based on five previously described imaging patterns: (1) well defined, unicentric mass; (2) well defined, multilobulated mass; (3) area enhancement with nodularity; (4) area enhancement without nodularity; and (5) septal spreading [27]. For the subjects enrolled at NKI, a specialized breast radiologist (CL) determined whether pre-treatment MRIs showed the tumor to be either mass-like, multinodular, or diffuse. For purposes of analysis, the 5 phenotypes used in the I-SPY 1 trial were collapsed such that phenotypes 1 and 2 were defined as mass-like; phenotype 3 was multinodular; and phenotypes 4 and 5 were considered to be diffuse. A consort diagram showing availability of pathology biomarker, molecular, and imaging data (overall and by site) is shown in Fig. 1.

Fig. 1
figure 1

Consort diagram showing data availability

Statistical analysis

Data from the I-SPY 1 and NKI trials were combined and analyzed together, without adjustments for site-specific differences. Associations between histologic subtype and other clinical, pathological, molecular, and imaging parameters were assessed using the Chi-square test for categorical and Student’s t test for continuous variables, respectively. pCR and BCS rates for ILCs and IDCs were compared using the Fisher exact test over all cases and within subsets defined by histologic subtype-associated variables. Associations between pCR and BCS with histologic subtype-associated features were also assessed using univariate logistic regression and provided as supplemental results. Data were analyzed in Stata Version 11 (College Station, Texas) and JMP 9.0.0 (SAS Institute Inc).

Results

Clinical and pathological characteristics of ILC versus IDC

Subjects with ILC were older at diagnosis, more likely to have larger tumors at presentation (defined as higher T stage by pre-treatment MRI) and a multinodular pattern on MRI when compared to subjects with IDC (Table 1). However, when the analysis was restricted to HR+/HER2− tumors, there were no significant differences between MRI pattern (60 and 31.5 % multinodular in ILC and IDC, respectively, p = 0.099). Race, proportion of patients with positive lymph nodes, and chemotherapeutic regimens administered were similar between subjects with ILC and IDC.

Table 1 Clinical, histological, and molecular characteristics of the study cohort: comparison of ILCs and IDCs

Molecular characteristics of ILC versus IDC

ILC cases were more likely to be HR+/HER2− than IDC cases (79 vs. 46 %, p < 0.001). Additionally, ILC cases were less likely to be high grade, and had lower rates of p53 expression than IDC cases. Overall, 85 % of ILC designated on the basis of histopathology were negative for e-cadherin.

A subset of cases (41/75 ILC and 368/595 IDC) had evaluable gene expression data. Consistent with the HR and HER2 comparison, the luminal A subtype was more common in lobular than ductal cancers (Table 1). Despite the prevalence of the luminal A subtype, ILCs were heterogeneous, with all intrinsic subtypes represented (Table 1). In particular, 4/41 (10 %) of ILCs were found to be of the basal intrinsic subtype, compared to 26 % of IDC. All four basal ILC cases were grade 2; two were e-cadherin negative, and two were e-cadherin positive.

ILC cases had decreased expression of the activated wound healing signature compared to IDCs (22 vs. 65 %, p < 0.001) and lower risk according to 70-gene prognostic profile (56 vs. 90 %, p < 0.001). Significant differences persisted when analysis was restricted to HR+ cases only: 18 % of HR+ ILC versus 56 % of HR+ IDC expressed the activated wound healing signature, and 54 % of HR+ ILC versus 84 % of HR+ IDC had a the high risk 70-gene prognostic profile in this cohort (data not shown).

Pathologic response to neoadjuvant therapy in ILC versus IDC

ILCs were significantly less likely to have a pCR after NAC than IDCs (11 vs. 24 %, p = 0.008). However, ILC and IDC have different clinical, pathological, and molecular characteristics; and among the nine variables showing a significant association with histological subtype (Table 1), six were also significantly correlated with pCR (HR/HER2 subtype, grade, p53 expression, molecular subtype, wound healing signature, and 70-gene signature risk groups, Supplemental Table 1). When we adjusted for these six features, differences in pCR rates between ILC and IDC were no longer significant within individual clinical HR/HER2 subtypes or dichotomized subsets defined by grade (1/2 or 3) and p53 expression (≤75 or >75 %) (Table 2). Among the 41 ILC cases for which gene expression data were available, ILC had significantly lower pCR rates within the 70-gene high risk group than IDCs (4 vs. 27 %, p = 0.013). None of the basal-like ILC had pCR, compared to 41 % among the IDC. In addition, ILC cases within the wound healing quiescent group also appeared less likely to achieve a pCR than their IDC counterparts (0 vs. 15 %, p = 0.026).

Table 2 Comparison of rates of pCR in ILC and IDC: stratified analysis among histological types

Surgical outcomes in ILC versus IDC

Overall, ILCs had a significantly lower rate of successful BCS after NAC than IDCs (33 vs. 46 %, Fisher exact test p = 0.037). However, among the clinical, pathological, and molecular features associated with lobular histology, age, T stage, and MRI pattern were also significantly associated with BCS (Supplemental Table 2). Patients over the age of 49 were 1.5 times as likely to have BCS (p = 0.007, 95 % CI 1.12–2.1). Higher T stage at presentation was significantly associated with decreased odds of BCS (OR 0.38 for T2 tumors compared to T1, p = 0.006, 95 % CI 0.19–0.76), as was a non-mass pattern on MRI (OR 0.26 and 0.43, p < 0.001 and p = 0.002, 95 % CI 0.15–0.44 and 0.26–0.73 for multinodular and diffuse patterns, respectively, when compared to the mass MRI pattern). The difference in BCS rates between ILC and IDC were no longer significant when evaluated within T stage and MRI pattern defined subsets (Table 3). ILCs remained significantly less likely to have BCS than IDCs within the older cohort (32 vs. 54 %, p = 0.021), but not the younger cohort (35 vs. 41 %, p = 0.60).

Table 3 Comparison of rates of breast conserving surgery (BCS) in ILC and IDC: stratified analysis among histologic types

Discussion

While most NAC trials have not distinguished outcome between lobular and ductal cancers, those that have done so frequently find lower rates of pCR and BCS in ILC compared to IDC. This apparent lack of benefit from NAC has led many to question whether physicians should offer NAC to patients with ILC, while others have argued that insufficient evidence exists to recommend either for or against its use [9, 10, 14, 18, 28]. Many have pointed out that lobular histology may be a marker of poor response to NAC because of its association with other factors, and not because of a difference intrinsic to lobular histology [16, 18, 19]. However, previous studies have had a small number of lobular cancers available for analysis precluding the molecular subset analyses that have been performed for ductal cancers.

Given the prevailing belief that: (1) ILC represents a more homogeneous breast cancer type than IDC and that (2) ILC demonstrates a homogeneously poor response to NAC, we wished to determine whether ILC in fact consisted of different subtypes in which tumor biology rather than histology alone could be driving response to NAC. By combining findings from three NAC trials which prospectively collected clinical, molecular, and MR imaging data we had the opportunity to identify features associated with lobular histology which could explain the apparent decreased responsiveness to NAC. We postulated that a sufficiently large cohort would allow comparison of biologic and molecular predictors of response to NAC between ILC and IDC using the endpoints of pCR and BCS rate.

We confirmed that ILC differed from IDC in clinical presentation, histopathologic characteristics, gene expression, and MR imaging features. Subjects with ILC presented at older ages with higher T stage, and ILC tumors were more likely to be lower grade, HR+, HER2−, and have lower p53 expression. These findings are consistent with previous reports [7, 10, 14, 15]. In the combined cohort, pCR was achieved in 11 % of ILC and 24 % of IDC. However, when adjusted for HR and HER2 status, lobular histology was not an independent predictor of response to NAC. Regardless of histology, cancers which were HR+/HER2− had very low pCR rates (7 and 5 % among HR+/HER2− lobular vs. ductal cancers, respectively). These findings support that tumors with lobular histology generally respond poorly to NAC, but that this is due to the enrichment of the HR+/HER2− phenotype among ILC, rather than histology.

Only 41 ILC had molecular subtyping. While most of the ILC cases expressed the luminal A intrinsic subtype, nearly 10 % expressed the basal subtype. These were not due to inclusion of pleomorphic lobular cases, as none of the tumors were grade 3. The identification of a basal subset within ILC has also been reported by another group which found cytokeratin 5/6 expression, a commonly accepted marker of the basal subtype, in 17 % of ILC cases tested [29]. These observations support that both clinical and molecular heterogeneity exist among ILC.

We also found that a proportion of the ILC cases expressed the activated wound healing gene expression signature (22 % among ILC and 65 % among IDC) and more than half had a high risk 70-gene prognostic signature (56 % in ILC and 90 % in IDC), indicating the presence of biologically high risk tumors even within the ILC subset. Tumors with an activated wound healing signature express genes indicative of a wound healing environment, while tumors without expression of these genes are said to be quiescent. The activated wound healing signature is associated with poor outcomes [26]. We also compared the 70-gene prognostic signature which is a validated profile that identifies patients at a high risk of distant metastases and death [25]. The high risk 70-gene prognostic profile has also been found to be associated with pCR [30]. In this subset, the low risk signature (found in 43 % of ILC and 10 % of IDC) was 100 % predictive of failure to achieve pCR among both ILC and IDC, again supporting that the ILC phenotype includes a higher prevalence of those molecular predictors of poor NAC response when compared to IDC and that these expression patterns predict for low pCR rates in both ILC and IDC.

Radiographic findings have emerged as important clinical biomarkers, and baseline MRI characteristics in particular have been shown to be an important predictive biomarker of NAC response [31]. In this context, we sought to determine how MR findings might differ between invasive ductal and ILCs. We found that MR imaging features were significantly different between ILC and IDC, but that this observation was driven by differences between HR(+) cancers and HR(−) cancers, rather than tumor histology. Multinodular and diffuse patterns of enhancement were more prevalent among lobular cancers, with mass-like enhancement more commonly seen among ductal cancers, specifically among the HER2(+) IDC (35 % mass-like) and HR(−) IDC (61 % mass-like) compared to ILC (13 % mass-like). Although MR enhancement pattern did not independently predict for pCR, a mass-like pattern was associated with a higher breast conservation rate among women treated with neoadjuvant therapy among all tumor subtypes.

As pCR rates are low in lobular cancer, it is important to consider whether partial response rate is also a clinically meaningful goal for neoadjuvant therapy in ILC, since even if a patient does not achieve pCR, a partial response can make BCS possible. Methods for more quantitative assessment of tumor response have recently been developed including the previously described residual cancer burden and the neoadjuvant response index (NRI) [23, 32]. Analysis in the NKI cohort using NRI as the outcome yielded the same results as the pCR-based analysis, indicating that lobular histology is not independently associated with lower likelihood of partial response (data not shown). Thus, in the small subset of patients with HER2(+) or HR-negative lobular cancers, or in those patients with a mass-like pattern in MRI, there may be a role for NAC in improving outcomes of breast conserving surgery, even if pCR is not achieved.

An important limitation of this study is the small number of lobular cases for which expression data were available, even with the combined results of three clinical trials. Specifically, we discovered that although the pCR rate was 8/75 (11 %) among the total group of ILC, only one case among the 41 ILC that had expression data (2 %) had a pCR. The clinical features of those cases with and without microarray data were reviewed, and no obvious differences emerged. We confirmed that all microarray assays were performed on pre-treatment biopsies. Given the small size of samples, the cases subjected to microarray analysis were selected for high tumor cellularity; in the lobular cases, those might have represented the more chemotherapy resistant tumors. However, the numbers are too small to derive definitive answers and the source of this substantial bias remains unclear. Conclusions based on these molecular data should therefore be interpreted with caution and confirmation in other studies is necessary to understand the clinical role of molecular subtyping in predicting response to neoadjuvant therapy in lobular cancers.

In this clinical trial cohort, we confirmed that ILC has a lower pCR rate than IDC in the setting of NAC. However, this difference could be attributed to clear differences in tumor characteristics. Among these patients, HR and HER2 status determined response to NAC independent of tumor histology. The lowest pCR rate was seen in HR+/HER2− tumors for both ILC and IDC; the low pCR rate in ILC thus reflected the high prevalence of this tumor type among lobular cancers. The small subset of HR− and/or HER2+ ILC derived benefit from neoadjuvant therapy as demonstrated by a pCR rate of 25 % in this group. Molecular heterogeneity was found among ILCs including the discovery of biologically high risk tumors as defined by high NKI score or activated wound healing signature. Future marker studies will allow greater confidence to conclude whether molecular and radiologic subtyping can enhance the ability to predict response to chemotherapy above and beyond HR and HER2 status.