Axillary lymph node status provides prognostic information and guides treatment decisions for patients with breast cancer. Patients presenting with axillary nodal metastases often receive neoadjuvant chemotherapy (NAC), which eradicates nodal disease in 40–75 % of patients.1 5 Patients who achieve nodal pathologic complete response (pCR) after NAC have improved locoregional and survival outcomes.1 , 3 , 6 In fact, findings have shown nodal pCR to be a more important prognostic indicator than primary tumor response.3 , 6

Historically, clinically node-positive patients have undergone axillary lymph node dissection (ALND) after NAC regardless of nodal response. Recently, interest has focused on minimally invasive approaches to identify patients who convert to pathologic node-negative status after NAC who might avoid the morbidities associated with ALND.5 , 7 10 With the possibility of less extensive axillary surgery for patients who achieve nodal pCR, predicting nodal response to NAC has clinical implications.

This study aimed to evaluate potential predictors of nodal conversion after NAC for patients presenting with cN1 breast cancer and to generate nomograms predicting the probability of achieving a nodal pCR that might aid clinicians in treatment decisions.

Methods

Study Population

This study was approved by the Institutional Review Board. Patients with clinical T1–4, N1, M0 breast cancer who received NAC followed by ALND between 2001 and 2013 at the University of Texas MD Anderson Cancer Center were identified from a prospectively maintained database. All the patients had nodal metastases confirmed by needle biopsy.11 Patients presenting with recurrence, a history of axillary surgery including sentinel lymph node dissection (SLND), or no NAC treatment were excluded from the study. All the patients received anthracycline- and/or taxane-based chemotherapy. To reflect current practice patterns, only patients with human epidermal growth factor receptor 2+ (HER2+) tumors who received HER2-targeted therapy were included in the study.

Clinicopathologic and treatment data from 584 patients were used to determine factors predictive of nodal conversion after NAC. Complete data related to the significant factors were available for 578 patients. These data were used to create nomograms predicting conversion to pathologic node-negative status. Data from 315 cN1 patients who completed NAC followed by ALND at The European Institute of Oncology (EIO) were used as an external validation cohort.

Data Collection and Analysis

Clinic and pathology reports were reviewed to determine pretreatment data. Estrogen receptor (ER) status and progesterone receptor (PR) status were recorded as the percentage of cells staining positive by immunohistochemistry (IHC). The HER2 status was recorded as positive if immunostaining was scored as 3+ or the amplification ratio was higher than 2 as shown by fluorescence in situ hybridization. Radiology reports were reviewed to determine clinical tumor size, radiographic evidence of multifocal or multicentric disease, and number and size of abnormal axillary nodes on ultrasonography. The number of suspicious axillary lymph nodes was recorded as fewer than four or as four or more. Clinical T and N status were defined according to the seventh edition of the American Joint Committee on Cancer cancer staging system.12 Nodal status after NAC was determined from surgical pathology reports after ALND.

End Point and Statistical Analysis

Nodal pCR after NAC was defined as no evidence of residual tumor in the axillary lymph nodes. Any nodal metastases, including isolated tumor cells (ITCs), were considered to be positive. Data were summarized using standard descriptive statistics and frequency tabulation. Univariate logistic regression analysis was performed to determine the association of axillary pCR with clinicopathologic and treatment variables.

Both statistically significant and clinically relevant features were used to create multivariable logistic regression models and nomograms. One nomogram was developed using ER status as a binary variable, with ER+ defined as 1 % or more staining and ER− as less than 1 % staining. A second nomogram then was created using the percentage of cells with ER+ staining as a continuous variable to explore the possibility that the degree of ER positivity might influence the response to chemotherapy. We also created a nomogram that did not include the number of abnormal nodes for use at institutions in which this is not available [area under the curve (AUC) 77.6 %] (Supplemental Fig. 1).

The discrimination of the nomograms was evaluated using the AUC of the receiver operating characteristic (ROC) curve. The AUC and corresponding 95 % confidence interval (CI) are provided for each nomogram. In addition, to adjust for bias, the nomograms were validated with 10,000 bootstrap samples, and the bootstrap-adjusted AUC was estimated for each. A calibration curve was generated to show the association between the observed outcome frequencies and the predicted probabilities by categorizing patients on the basis of their predictive probability of converting to pathologic node-negative status. The nomogram was subsequently validated using data from the external cohort. Statistical analyses were performed using R version 3.2.0 software (http://www.r-project.org, R Foundation for Statistical Computing, Vienna, Austria). All statistical tests used a significance level of 5 %.

Results

Patient, Tumor, and Treatment Characteristics

The clinicopathologic characteristics of the development and validation cohorts are depicted in Table 1. The nodal pCR rates were 37.2 % (217/584) in the MD Anderson cohort and 37.3 % (118/315) in the EIO cohort.

Table 1 Comparison of clinicopathologic and treatment characteristics between the development (MD Anderson) and validation (European Institute of Oncology) cohorts

Uni- and Multivariate Analyses

Table 2 summarizes the univariate analyses for the MD Anderson cohort used to identify factors predictive of achieving nodal pCR. The odds of nodal conversion were significantly improved for tumors with a high nuclear grade [grade 3 vs. 1: odds ratio (OR) 13.4, 95 % CI 3.1–57.6], higher Ki-67 scores (OR 1.03 per unit, 95 % CI 1.02–1.04), ER− status (OR 3.5, 95 % CI 4.4–5.1), PR− status (OR 4.3, 95 % CI 3.0–6.2), or HER2+ status (OR 4.7, 95 % CI 3.1–7.3). The odds of conversion to pathologic node-negative status decreased as the amount of ER percentage staining increased (OR 0.98 per unit, 95 % CI 0.975–0.984). In addition, patients with multifocal/centric disease (OR 0.7, 95 % CI 0.4–1.0) had decreased odds of achieving nodal pCR (p = 0.05).

Table 2 Univariate Cox proportional hazards analysis of factors predicting axillary pathologic complete response in clinically node-positive patients treated with neoadjuvant chemotherapy

The aforementioned factors and others thought to be clinically relevant (clinical T stage, number of suspicious nodes on ultrasound categorized as <4 vs. ≥4 nodes) were included in the multivariate analysis (Table 3) and nomogram models. The analysis did not include Ki-67 because it was not known in a large number of patients (n = 181).

Table 3 Multivariable logistic regression model predicting axillary pathologic complete response in clinically node-positive patients treated with neoadjuvant chemotherapy

These variables then were used to create two nomogram models to predict the likelihood of achieving nodal pCR after NAC. The first model categorized ER status as positive or negative. The predicted accuracy as estimated by the AUC was 78 % (95 % CI 74.2–81.7 %). The bootstrap-adjusted AUC, which corrects for optimism, was 76.5 % (Fig. 1). A nomogram using the percentage of ER staining as a continuous variable then was created to investigate whether the extent of ER positivity might influence the nodal response to chemotherapy. Using this approach, the predictive ability of the model slightly improved, with an AUC of 78.7 % (95 % CI 75.0–82.5 %) (Fig. 2a) and a bootstrap optimism-corrected AUC of 77.3 %. An ROC curve plotting the tradeoff between sensitivity and specificity for different probability cutoffs is shown in Fig. 2b.

Fig. 1
figure 1

a Nomogram for predicting axillary pathologic complete response after neoadjuvant chemotherapy using estrogen receptor categorized as positive (≥1 % of cells staining) or negative (0 % of cells staining). To calculate the individual probability of nodal conversion, the number of points for each factor is first determined by drawing a vertical line from the specific characteristic to the points scale and figuring the sum of total points. A line drawn from the total points line to the lower line shows the individual probability of achieving an axillary pathologic complete response. b Receiver operative characteristic curve for the nomogram. The predicted accuracy-unadjusted area under the curve (AUC) was 78 % [95 % confidence interval (CI) 74.2–81.7 %]. The bootstrap-adjusted AUC for this model was 76.5 %

Fig. 2
figure 2

a Nomogram for predicting axillary pathologic complete response after neoadjuvant chemotherapy using estrogen receptor as the percentage of cells stained (continuous variable from 0 to 100 %). b Receiver operative characteristic curve for the nomogram. The unadjusted area under the curve (AUC) for this model was 78.7 %, 95 % confidence interval (CI) 75.0–82.5 %]. The bootstrap-adjusted AUC was 77.3 %

External Validation of the Nomogram

Data from 315 patients treated at the EIO were used to determine the external validity of the nomogram. The validation cohort differed from the MD Anderson cohort, showing a greater percentage of clinical T4 tumors (34 vs. 7 %, p = 0.0005), a higher proportion with multifocal/centric disease (32 vs. 24 %, p = 0.01), and a higher nuclear grade (grade 3: 65 vs. 52 %, p = 0.0005). It was more likely to be ER− (43 vs. 25 %, p < 0.0001), PR− (58 vs. 36 %, p < 0.0001), and HER2+ (31 vs. 20 %, p = 0.0004).

The validation cohort was used in the first model, which categorized ER status as positive or negative with an unadjusted AUC of 79.4 % (95 % CI 74.6–84.3 %). When the nomogram using percentage of ER staining as a continuous variable was evaluated using the validation cohort, it continued to show robust accuracy, with an unadjusted AUC of 82.2 % (95 % CI 77.6–86.8 %).

Discussion

In this study, we identified high nuclear grade, high Ki-67 score, ER and PR negativity, and HER2 positivity as factors associated with achieving a nodal pCR in patients with cN1 disease receiving NAC. In addition, we developed nomograms that estimate the likelihood of axillary pCR in these patients. Because many institutions worldwide assess ER status categorized as positive or negative, a nomogram was constructed using ER status as a dichotomous variable, with an AUC of 78 %, suggesting the utility of this predictive tool when details regarding the absolute percentage of ER+ staining are not available. Using ER staining percentage as a continuous variable slightly improved the nomogram (AUC 78.7 %). Both models were validated using an independent cohort of patients demonstrating that the nomogram is reliable for use at other institutions.

Nodal response to chemotherapy has been established as an important prognostic feature. In a study of 403 patients with biopsy-confirmed axillary metastases who received NAC, Hennessy et al.3 showed that patients who achieved a nodal pCR had better 5-year overall survival (OS 93 %) and relapse-free survival (RFS 87 %) than those who had residual nodal disease (OS 72 %, RFS 60 %). For patients not achieving a nodal pCR, outcomes are related to the number of nodes harboring residual disease as well as the tumor subtype. One study showed that patients with hormone receptor-positive (HR+)/HER2− tumors with zero to three positive nodes after NAC had a 5-year locoregional recurrence rate (LRR) of 2 % compared with 7 % if four or more positive nodes were identified. In contrast, those with HR−/HER2− tumors had 5-year LRR of 9 % if there were zero to three positive nodes compared with 44 % if there were four or more positive nodes.13

Traditionally, patients with axillary nodal metastases have undergone ALND, a procedure with considerable morbidity such as chronic pain, decreased range of motion, and lymphedema.14 16 Just as NAC can eradicate disease in the breast,2 , 17 , 18 it also can convert clinically node-positive patients to pathologic node-negative status in 40–75 % of cases.2 5 Unfortunately, accurate identification of these patients likely to have a nodal pCR who may not benefit from extensive axillary surgery has been difficult. Single-institution and retrospective reports using SLND to restage the axilla after NAC have shown false-negative rates of 5–20 %, which has limited its use in this setting.19 22

Several prospective trials have recently addressed this issue, showing that the false-negative rate of SLND in cN1 patients after NAC is 12.6–14.1 %.5 , 7 , 8 However, subset analyses of these trials show that the accuracy of SLND can be improved using a dual-tracer technique and retrieval of two or more SLNs, with ensured removal of the node initially confirmed to contain metastases (marked with a clip), and using IHC for pathologic evaluation.5 , 7 , 8 , 23 , 24 Several groups currently are working on ways to ensure removal of the clipped nodes using techniques such as targeted axillary dissection.9 , 10 , 25 27 As techniques to ensure reliable axillary staging are developed, predicting axillary nodal response to therapy will become increasingly important in treatment planning.

Similar to the primary tumor, nodal response to therapy varies according to tumor biology. The American College of Surgeons Oncology Group (ACOSOG) Z1071 trial, which reflects current chemotherapy approaches including the use of trastuzumab for patients with HER2+ tumors, showed differential nodal responses based on tumor biology. The overall nodal conversion rate was 41.1 %, but this varied from 21.1 % in patients with HR+/HER2− tumors to 49.4 % in patients with triple-negative breast cancer to 64.7 % in patients with HER2+ disease (p < 0.0001).28 Our study corroborated these findings, with ER, PR, and HER2 status strongly predictive of nodal response.

This nomogram may be used as an adjunct to clinical decision making about appropriateness of NAC but not as a replacement for current decision paradigms. For instance, the decision to use NAC often is driven by the goal to downsize the primary tumor, making breast-conserving therapy possible. Considering omission of ALND with eradication of nodal disease, clinicians may currently consider NAC as a route to downstaging of the axilla as well. Although the overall probability of nodal conversion in recent trials has been about 40 %,28 certain subgroups, such as patients with ER+ tumors, have a much lower chance of conversion, as described earlier. This nomogram may be especially helpful in predicting which patients with ER+ tumors might gain a benefit from NAC with regard to nodal response, allowing a more individualized assessment of nodal conversion probability than that defined by tumor receptor status alone. It also may be used to counsel patients on the expected outcomes after chemotherapy. However, many other factors go into the decision to administer chemotherapy in the neoadjuvant setting that are not related to nodal status. For some patients, tumor biology or primary tumor characteristics such as skin or chest wall involvement may drive the decision to administer NAC rather than the probability of nodal conversion.

A Dutch group has developed a similar model predicting the likelihood of nodal conversion based on 291 clinically node-positive breast cancer patients treated with NAC.29 Their model includes age, clinical T stage, tumor histology, ER, PR, and HER2 status, as well as whether the patients received trastuzumab or a taxane, with a reported AUC of 0.77 (95 % CI 0.71–0.82). Several differences need to be noted between their model and those in the current study.

First, in the Dutch cohort, only 54.2 % (52/96) of the patients with HER2+ tumors received trastuzumab, whereas 78.4 % (228/291) of the entire cohort received neoadjuvant taxane-based therapy. Because the use of HER2-targeted therapy currently is a standard component of neoadjuvant therapy for patients with HER2+ disease, to reflect contemporary practice patterns, we excluded patients with HER2+ tumors who did not receive trastuzumab. Additionally, we included only patients who received the entire planned course of chemotherapy. Our model is designed to help the clinician seeing a patient with cN1 disease to decide on the optimal order of therapy. Thus, we wanted to model the probability of nodal conversion with the expectation that patients would receive all therapy.

Second, the Dutch group considered less than 10 % ER staining to be ER−. Our study corroborated the notion that low ER expression influences nodal conversion rates both when included as a continuous variable and when categorized using a cutoff of 1 %, consistent with the current American Society of Clinical Oncology guidelines.30

Finally, our model considered any residual nodal disease as positive, even in cases of ITCs, in contrast to the model presented by Schipper et al.,29 which considered ITCs as negative. Both the ACOSOG Z1071 trial and the sentinel node biopsy following NAC trial showed that defining ITCs found in SLNs as positive decreased the false-negative rate of SLND. Although ITCs identified in a patient undergoing upfront surgery are treated as node-negative,31 , 32 the impact of ITCs after NAC is unclear, and consideration should be given to performing completion ALND.8 , 33 Our model has the added advantage of an external validation, which performed similarly to the MD Anderson cohort although it represented a subgroup with larger tumors and higher proportions of ER−, PR−, and HER2+ tumors.

In conclusion, as the paradigm shifts to omission of extensive axillary surgery after response to NAC, using NAC to eradicate nodal disease is a clinically relevant strategy. These nomograms can be used to predict the probability of achieving nodal pCR after NAC with readily available clinicopathologic features. Therefore, they may have utility for informing treatment decisions.