Esophageal cancer is a devastating disease because of the wide range of lymph node (LN) metastasis, even at the early stage.1,2 Multidisciplinary treatment combining surgery, chemotherapy, and radiotherapy has been standard to improve the outcome.3,4 Neoadjuvant chemoradiotherapy (NACRT) and neoadjuvant chemotherapy (NAC) has been used in Western countries5 and Japan, respectively, primarily for esophageal squamous cell carcinoma (ESCC).6,7,8 Although postoperative treatment such as adjuvant chemotherapy was established as a standard treatment in gastric or colorectal cancer, proving the efficacy of adjuvant therapy for esophageal cancer has been challenging because of its high surgical invasiveness.9 In fact, the percentage of patients who could receive adjuvant therapy was not high enough after esophagectomy.10,11

Recently, immune checkpoint inhibitors (ICI) were introduced for esophageal cancer and proved efficacious in the metastatic setting.12 ICIs have less gastrointestinal toxicity compared with cytotoxic chemotherapy, potentially improving their tolerability in patients who undergo esophagectomy. More recently, the CheckMate-577 trial demonstrated that adjuvant nivolumab significantly improved the survival of patients who received NACRT followed by esophagectomy.13 Therefore, while efficacy in patients who receive NAC remains unclear, adjuvant therapy including nivolumab could be the next standard treatment. On the other hand, immune-related adverse events (irAE) were reported to occur and required longer-lasting treatment in a certain number of patients.14,15 Furthermore, the high cost of treatment would destroy public insurance systems worldwide. Therefore, risk stratification for appropriate selection of candidates for adjuvant therapy is desired.

TNM staging has been a standard to identify patients at high risk of postoperative recurrence.16 In addition to the Japanese esophageal cancer classification,17,18 the UICC/AJCC classification is a standard. Independent of the TNM staging system, the response to neoadjuvant treatment has been shown to predict postoperative survival, including in our recent report.19,20,21,22 Therefore, this study hypothesized that the combination of pathological stage and histological response could be a simple and reliable risk stratification of cancer death. We developed a novel staging system combining pathological stage and histological response in ESCC patients who underwent esophagectomy after NAC, using the database from two Japanese esophageal centers.

Methods

Patient Selection

Patients with ESCC who underwent transthoracic subtotal esophagectomy at Keio University Hospital or the Cancer Institute Hospital of the Japanese Foundation for Cancer Research between 2004 and 2016 were retrospectively reviewed. Among these, patients with cStage I, II, III (excluding cT1N0 and cT4b), or IV ESCC due to supraclavicular LN metastasis who received NAC, and whose histological response was available, were included in the analysis. Patients with salvage esophagectomy after definitive chemoradiotherapy or R2 resection were excluded.

Patient characteristics, clinicopathological factors, and surgical procedures were reviewed. Clinical stage was determined by esophagogastroduodenoscopy (EGD) and computed tomography (CT) in all patients. Postoperative complications were evaluated using the Clavien–Dindo classification.23 This study was approved by the ethics committee of the Keio University School of Medicine and Cancer Institute Hospital of Japanese Foundation for Cancer Research. Extent of tumor spread was evaluated using the 8th edition of the TNM classification, which was established by the Union for International Cancer Control (Supplementary Table 1). pT1 was applied with the patient’s pN following the TNM staging manual when patients encountered complete histological response in the primary tumor while having residual disease in the lymph node.

Treatment and Follow-Up

Transthoracic esophagectomy with right thoracotomy and gastric tube reconstruction is performed as a standard curative surgical procedure. Mediastinal LNs with bilateral recurrent nerve and abdominal LNs, including paracardial LNs and LNs along the lesser curvature and the left gastric artery, were routinely dissected. The supraclavicular LNs were also dissected if the primary tumor was located between the upper- and mid-thoracic esophagus. Cisplatin and 5-FU (CF) therapy twice every 3 weeks was the standard preoperative treatment at the present study’s institutions based on the JCOG9907 study.4 A regimen of CF plus docetaxel (DCF) administered three times every 3 weeks was an alternative treatment option. DCF was considered for patients in this study with either borderline-resectable disease or multiple LN metastases at diagnosis. Regardless of response to NAC before esophagectomy, postoperative treatment was not given unless patients had apparent residual disease. All treatment strategies were discussed at a multidisciplinary oncology conference attended by surgical oncologists, medical oncologists, diagnostic radiologists, and radiation oncologists. Postoperative follow-up included EGD and CT every 6 months for 5 years after surgery. When a patient was diagnosed as having pN2 or higher, the CT follow-up interval was 3–4 months during the first year after surgery. Cancer-specific survival (CSS) was calculated from day of surgery to day of death due to primary disease or last follow-up. In CSS, noncancer deaths were dealt with as censored cases at date of death. Overall survival was calculated from day of surgery to day of death or last follow-up. Patients were followed up until death or until July 2019.

Histological Response Criteria and Patterns of Recurrence

The primary tumors were examined according to the Japanese Classification of Esophageal Cancer to evaluate the histological response to preoperative treatment.17,18 This classification scheme includes five grade (grade 0, no tumor response; grade 1a, necrotic or fibrotic change observed in less than one-third of the tumor; grade 1b, necrotic or fibrotic change observed in between one- and two-thirds of the tumor; grade 2, more than two-thirds of the tumor was necrotic or fibrotic; and grade 3, no viable tumor cells).

Statistical Analysis

Student’s t-test was conducted to identify the difference in mean age between the groups. The distribution of operative time, blood, and number of LN retrieved were described using median (range) and compared using Mann–Whitney U test. Differences between categories were identified using the chi-squared test or Fisher’s exact test. Survival curves were estimated using the Kaplan–Meier method. All comparisons between the two groups were made using the log-rank test. The hazard ratio (HR) was calculated using the Cox proportional-hazards model in cases in which more than two groups were described in survival analysis or in multivariate survival analysis. Factors with significance level of p < 0.1 on univariate analysis were included in the multivariate analysis. All statistical analyses were conducted using IBM SPSS Statistics for Windows, version 25.0 (IBM Corp., Armonk, NY, USA), and differences were considered statistically significant when associated for two-sided p-value < 0.05.

Results

Response to Neoadjuvant Chemotherapy and Clinicopathological Factors

The distribution of patients across grade 0–3 is presented in Table 1. Of the 483 patients included in this study, 82% were male. Among all patients, 50% of the tumors were located in the mid-thoracic esophagus. The numbers of cStage I, II, III, and IV patients were 52 (11%), 199 (41%), 204 (42%), and 28 (6%), respectively. In terms of NAC, 453 patients (94%) received CF and 30 (6%) received DCF. Regarding histological responses, grade 0–1a, 1b, 2, and 3 response was observed in 339 (72%), 68 (14%), 56 (10%), and 20 (4%), respectively. There was a significant association between clinical stage and histological response (Table 1). The NAC regimen did not affect the response. Regarding pathological stage, the numbers of pStage 0, I, II, III, and IV patients were 13 (3%), 81 (17%), 134 (28%), 180 (37%), and 75 (15%), respectively. Twelve (21%) and two (10%) patients in the grade 2 and 3 groups, respectively, showed pathological stage (pStage) III or higher. The response did not influence the short-term outcomes, including operative time, amount of blood loss, and postoperative complications.

Table 1 Comparison of pathological grade and patients

Cancer-Specific Survival and Histological Response

To evaluate the external validity of histological responses, the prognostic impact of histological response criteria was compared between institutions. Consequently, we found that the cancer-specific survival of responders and nonresponders was almost identical between institutions (Supplementary Fig. 1). The CSS among the groups stratified by pathologic grade is shown in Fig. 1a. The HR (95% CI) of CSS in grade 1b, 2, and 3 relative to that in grade 0–1a was 0.27 (0.14–0.53), 0.13 (0.05–0.36), and 0.09 (0.01–0.65), respectively. The differences in the CSS between grade 0–1a and the others were common among the pathologic stages (Fig. 1b–e). Therefore, grade 1b, 2, and 3 were defined as responders and grade 0–1a as nonresponders in subsequent analyses.

Fig. 1
figure 1

Histological response and cancer-specific survival. (a) Histological response of primary tumor to NAC and cancer-specific survival (CSS). (bd) Pathological histological response and CSS under stratification using the pathological stage

Prognostic Significance of Responders for Each Pathological Stage

The survival of responders was favorable compared with nonresponders in pStage 0–I [HR (95% CI), 0.29 (0.07–1.17); Fig. 2a] and II [HR (95% CI), 0.37 (0.12–1.10); Fig. 2b], but not statistically significantly so. Meanwhile, the HR (95% CI) of responders was 0.37 (0.15–0.92) and 0.24 (0.06–0.98) in pStage III (Fig. 2c) and IV (Fig. 2d), respectively, when compared with nonresponders. As shown in Supplementary Figs. 2 and 3, responders had significantly longer CSS than nonresponders in all groups stratified by pN stages, and in those with pT1 and pT3.

Fig. 2
figure 2

Comparison between responders and nonresponders for each pathological stage

Establishment of Classification and Risk Score Using pStage and Histological Response

The 5-year CSS (5y-CSS) rates of responders in pStage 0–I, II, III, and IV were 94%, 92%, 76%, and 71%, respectively. Therefore, responders in pStage 0–II were classified into one group, and those in pStage III and IV into another (Fig. 3). The 5y-CSS rates of the nonresponders in pStage 0–I and II were 81% and 77%, thus we classified these patients into one group. Finally, the patients were subdivided into five groups. Intriguingly, the 5y-CSS in pStage III–IV responders was 75%, almost identical to that of nonresponders in pStage 0–II (78%). As shown in Supplementary Fig. 4, the previously reported stratifications using pN and histological response21,24 were not as notable as our classification. When overall survival was evaluated, prognostic value was almost identical to CSS, as shown in Supplementary Fig. 5.

Fig. 3
figure 3

Development of prognostic risk stratification system of cancer-specific survival combining pStage and response to neoadjuvant chemotherapy

To develop a risk score, we conducted multivariate analysis using pStage and histological response, which were found to be significant prognostic factors on univariate analysis (Table 2). The results showed that these variables were independent prognostic factors of CSS (pStage II/III/IV, HR 1.25/3.71/7.59, p = 0.579/<0.001/<0.001; nonresponders 3.07, p < 0.001). Based on the multiplication of hazard ratio from each group in which the reference group was set as 1.0 and all HR were rounded to integers, a scoring system was developed. As shown in Fig. 4, the scoring system successfully stratified the patient’s prognosis significantly.

Table 2 Clinicopathological factors, histological response, and cancer specific survival: uni- and multivariate analysis
Fig. 4
figure 4

Development of risk scoring system using pStage and response to neoadjuvant chemotherapy

Discussion

The efficacy of NAC has been a well-known prognostic marker in ESCC patients.19,20 In line with previous reports, when responders were defined as those with grade 1b or higher, they showed significantly better prognosis. Then, based on the histological response, the patients were further stratified into two groups for each pStage. The difference between the responders and nonresponders was remarkable, especially in pStage III–IV. The current stratification could be beneficial to exclude patients who have lower risk of cancer-related death in the same pathological stage. Several previous studies have shown a survival advantage for pathological responders within the same TNM class.24 However, this is the first multicenter study to elucidate the clinical significance of the combination of pStage and histological response in patients who receive esophagectomy after NAC.

The responders showed favorable prognosis within the same pT and pN groups. Even though the locoregional tumor was completely resected by surgery, microscopic residual tumor cells inside the operative field or micrometastases to distant organs can cause recurrence. When NAC was effective, the residual tumors or micrometastases could be eradicated, resulting in lower cancer recurrence even in pStage III–IV. The current results are consistent with our recent report showing that pathological responders after NAC had less incidence of distant failure.19 In the CROSS trial, neoadjuvant chemoradiotherapy reduced not only locoregional recurrence but also distant metastases. Besides, pCR was a favorable prognostic factor for both locoregional and systemic recurrence.25

Risk stratification methods such as TNM staging must be simple and accurate to be used in daily clinical practice. When patients are classified as responders and nonresponders in the same pStage, the more detailed classification can be proposed, although the number of groups is doubled and becomes more complex, which may decrease the external validity of the stratification. In the current analysis, they were classified to be in the same group because of the almost identical CSS in pStage 0–II and III–IV. Validation using a larger dataset is required for clinical application. In terms of the novelty of the current combination, several studies have reported the clinical significance of combining pathological findings and histological response. However, they primarily used pN stage and the presence of downstaging.21,22,24 Nevertheless, especially in pStage III/IV, the prognosis of pathological responders was notably better within the same pStage in our dataset. Therefore, combining pStage and histological response would benefit these populations. Furthermore, since interinstitutional consistency is vital for establishing a new prognostic classification, the current first multiinstitutional study can be informative to predict prognosis in esophageal cancer patients who receive NAC.

Esophagectomy is a highly invasive procedure, and postoperative complications may slow down a patient’s recovery, making it challenging to receive effective adjuvant therapy. Furthermore, postoperative complications negatively impact cancer relapse and overall survival.26 Our group previously reported that the postoperative systemic inflammatory response is associated with disease recurrence independent of infectious complications.27 These previous findings further suggested the importance of reducing the number of tumor cells before esophagectomy, therefore leading to the introduction of neoadjuvant treatment. However, adjuvant treatment combined with neoadjuvant therapy can further improve survival. In 2020, the CheckMate-577 trial showed remarkable improvement of disease-free survival using adjuvant nivolumab.13 Because of the fewer gastrointestinal adverse events, immune checkpoint inhibitors may be better tolerated by patients after transthoracic esophagectomy than cytotoxic agents. However, since immune-related adverse events can be crucial, the immunotherapy indication should be carefully chosen, especially in patients with better prognosis. On the other hand, the current stratification cannot predict the efficacy of chemotherapy. Regardless of the prognosis, adjuvant chemotherapy can potentially improve the prognosis of all patients. The current prognostic stratification would be a novel guide to predict long-term outcomes after surgery and determine the indication for adjuvant chemotherapy regardless of the group involved. Based on the fact that immunotherapy will not be the only adjuvant treatment option, further investigation is needed to establish the ideal treatment strategy, including cytotoxic chemotherapy for those with unfavorable prognosis.

This study had several limitations. First, the potential selection bias cannot be fully excluded because of its retrospective nature. However, consecutive patients who underwent transthoracic esophagectomy after NAC were reviewed to minimize the bias. Furthermore, the current dataset was obtained from two Japanese high-volume esophageal centers. A multicenter trial would help improve the external validity of the current analysis. Second, the definition of the histological response to chemotherapy is another limitation. Several classifications have been proposed to evaluate the histological responses. Although the criteria were clearly stated in the JES classification, the diagnosis was provided by only two pathologists. In general, tumor size before NAC needs to be guessed to calculate the percentage of residual tumor cells, resulting in variation among investigators. However, the cutoff between grade 1a and 1b was set in the current analysis, which was relatively clear. Furthermore, the current separation would not greatly affect the results because of the remarkable difference in CSS between responders and nonresponders, with consistency between institutions (Supplementary Fig. 1). While the efficacy of neoadjuvant chemotherapy followed by surgery was proven in a randomized phase III trial,6 with comparable survival to neoadjuvant chemoradiotherapy reported in the CROSS trial,5 the application of an NAC-centered treatment strategy is limited. Therefore, the external validity of the current study is limited. On the other hand, given that triplet chemotherapy with higher response rates potentially improved the prognosis further in the JCOG1109 trial, intensified NAC treatment could be adopted as one of the treatment options in the future.

In conclusion, this novel risk stratification combining pathological stage and histological response was proven to be effective. The current risk stratification system will contribute to selecting appropriate candidates for adjuvant therapy.