Background

Esophageal cancer (EC) continues to represent a significant therapeutic challenge, with an increasing incidence and death rate, and a mere 16% overall survival (OS) rate.1,2 Despite its potential to induce significant morbidity, esophagectomy can lead to better OS results than any other treatment modality alone, especially when performed in a high volume setting that is linked to a lower postoperative mortality3 and superior long-term survival.4 Many high-volume surgical centers preferably perform extended resections, such as en-bloc esophagectomies or two- or three-field dissections, which may contribute to better regional disease control because of removal of metastatic lymph nodes (LNs), and may be linked to better survival.59 However, neither the minimum number of LNs to be removed during curative intent esophagectomy nor the optimum LN count that could be linked to the best survival results have been well established. Recommended minimum LN counts range from 12 for a greater than 90% staging sensitivity10, over 16 for greatest survival benefit11, to 18 for optimal staging accuracy.12 Few clinical studies have comparatively addressed outcomes after various degrees of LN dissections (LND). A randomized controlled trial (RCT) examined upper mediastinal and cervical LND in patients with squamous cell cancer (SCC) of the mid-esophagus; mean LN counts were 82 compared to 43 in the comparison group, and the OS at 5 years was 66% compared to 48%.13 A RCT comparing transthoracic with transhiatal esophagectomy (THE) yielded 31 versus 16 LNs and a 5-year OS of 39% versus 29%.14 A case-control study of patients with T3N1 EC undergoing en-bloc esophagectomy compared to transhiatal resection resulted in total LN counts of 52 versus 29 and an OS of 32% compared to 9%.15 Finally, a nonrandomized European study of two-field LND with THE versus THE alone reported 17 and 5 LNs, respectively, with a disease-free survival at 5 years of 41 and 10%.16 Thus, it appears that in all studies that compare different operative approaches to EC resection that are associated to different LN counts, survival results are superior for patients in whom more extensive LNDs have been performed, as evidenced through higher LN counts.

We have previously investigated the impact of LN counts on survival after operative therapy for various gastrointestinal cancers, including gastric cancer of early and advanced stages,17,18 extrahepatic cholangiocarcinomas,19 and pancreatic cancer.20 In all instances, population data revealed a strong association between increasing total or negative LN counts and better survival. The rationale for this study was to determine possible associations of LN counts and survival after esophagectomy for EC. To address this question, we resorted to US population information from the Surveillance, Epidemiology, and End-Results (SEER) data set published by the National Cancer Institute.

Patients and Methods

An EC data set was created through structured queries to the public version SEER 1973–2003 database, which includes combined records from multiple cancer registries representative of the US population. EC stage information was created according to the sixth edition American Joint Committee on Cancer tumor–node–metastasis (TNM) criteria,21 with the exception that metastatic involvement of LNs was classified as N1 disease only, as detailed information on extraregional nodal location was lacking. From 40,129 individuals with EC, 5,620 were extracted based on sufficient information regarding disease extent, operative treatment administered, and known survival outcomes. Those patients who received adjuvant radiation treatment were kept within the analysis; information on chemotherapy is not provided in the SEER data. Patients with incomplete resection information, such as “surgery, not otherwise specified,” were kept in the analysis, as long as sufficient information was available to document that resection of the primary tumor had taken place, such as through details in the pathologic findings. Several variables were recategorized or computed anew, such as the negative LN count (from total and positive LNs) and the LN ratio (positive to total LNs removed).

OS was the primary outcome component of interest. OS information in the SEER database reflects time from diagnosis to last follow-up (death or last contact) in monthly increments; censoring criteria were generated accordingly. Actuarial survival was analyzed via the Kaplan–Meier method, for the entire cohort, and for node-negative or node-positive groups separately. To eliminate early postoperative mortality and to determine the impact of LN counts on long-term survival, a conditional OS analysis was performed, only including patients who were alive at least 6 months or beyond. Univariate group comparisons utilized the log-rank test. Cox regression was used for multivariate analysis, with a backward elimination model for all covariates; we selected a threshold for keeping a variable in this elimination model at p = 0.05. All continuous variables were entered into this analysis as continuous data. Variables included into this multivariate calculation were grade (high versus low), T stage category (T1 versus T2 versus T3 + T4), total number of LNs examined (and/or number of negative LNs), N stage category (N0 versus N1), and/or number of positive LNs, race, age, gender, tumor size, year of diagnosis, presence of metastases, and tumor location (overlapping, upper, middle, or lower third). A projected 5-year survival analysis was performed based on a linear projection model as described earlier.17,18 Simple group data comparisons based on parametric statistics were done via t-test; for categorical parameters, chi-square testing was used. Significance of differences was assumed at p values of less than 0.05. Calculations were performed using the SAS 8.2 statistical software package (SAS, Cary, NC) or StatView 5.0.1 software for Macintosh computers (SAS Institute).

Results

Patient Demographics

From a cohort of 40,129 patients with an EC diagnosis within SEER, disease extent information was available in 15,417, and sufficient treatment and survival information was available for 12,102 individuals to calculate actuarial OS as postoperative outcome. Completeness of LN staging information could be identified for 5,620 individuals, which were included in the first multivariate analysis. Of these, 3,568 patients had undergone a resection. After exclusion of unspecified categories, 2,597 cases remained, which served as cohort for subsequent analyses relevant to LN count questions. The median age within the cohort was 65 years (range: 11–102), and 75% of patients were men. Ethnic information identified white patients in 82%, black patients in 12%, and other racial groups in 6% of cases. The location of the primary tumor could be classified as upper esophagus for 4%, middle esophagus for 18%, lower esophagus for 71%, and overlapping or unspecified for 7% of patients. The median tumor size was 5.0 cm (range: 0.1–30). Adenocarcinomas encompassed 57% of cases, and squamous cell carcinomas 43%. Of the resected patients with at least one LN examined, the median total LN count was 8 (range: 1–77), the median positive LN count 1 (0–46), and the negative LN count 6 (0–72). Differences were observed in the frequency of categorized number of total LNs examined when separated by N stage category (Fig. 1); patients classified as N0 tended to have fewer LNs identified more frequently than those classified as N1. The median follow-up was 15 months (range: 0–188), with a median follow-up for survivors of 25 months.

Figure 1
figure 1

Frequency of categorized number of total lymph nodes examined by N stage category.

Multivariate Survival Analysis

On multivariate analysis, the total LN count was an independent prognostic variable, aside from age, race, resection status, radiation, T category, N category (all at p < 0.0001), and M category (p = 0.0003). Parameter estimates and risk ratios for all patients selected on the basis of this Cox proportional hazards model are listed in Table 1. Total LN counts were exchangeable for negative LN counts in this model, at a similar significance level with p < 0.0001. A second multivariate model based on patients with complete pathologic staging and LN count information yielded the same prognostic variables, in addition to positive LN counts, tumor size, and race (Table 2). Again, negative LN counts were exchangeable with total LN counts. With the second model, grade and tumor location were entered into the model, but the presence of each of these factors forced the resection factor to become nonsignificant above the 0.05 level. It was difficult to interpret this conditional relationship, and so, we chose to report the model in which resection was significant.

Table 1 Parameter Estimates and Risk Ratios for all Patients Selected on the Basis of the Cox Proportional Hazards Model (n = 5,620)
Table 2 Parameter Estimates and Risk Ratios for all Staged Patients Selected on the Basis of the Cox Proportional Hazards Model (n = 2,597)

Univariate Survival Analysis of Lymph Node Count Impact

Higher total LN counts (up to >30) and negative LN counts (up to >15) categories were associated with the best OS (p < 0.0001) and the lowest 30- and 90-day mortality (p < 0.0001). The numeric total LN count effect on OS is depicted in Fig. 2. It was observed for both N0 and N1 stage subgroups, but appeared to be linked to greater OS differences for N0 EC in comparison to N1 EC (Fig. 3). A similar effect of better OS outcomes with higher total LN counts was observed for both squamous cell and adenocarcinoma EC histologies (data not shown). Negative LN counts demonstrated a strong association with OS as well. The actuarial OS for patients with EC dependent on various negative LN count categories is displayed in Fig. 4. This negative LN count impact persisted when the cohort was split by nodal status and appeared to present in a similar magnitude of OS differences (Fig. 5). Median survival and long-term OS (in percent) are listed in Table 3.

Figure 2
figure 2

Actuarial overall survival curve for patients with esophageal cancer by various total lymph node count categories.

Figure 3
figure 3

Actuarial overall survival curve for patients with esophageal cancer by various total lymph node count categories and separated by N category.

Figure 4
figure 4

Actuarial overall survival curve for patients with esophageal cancer by various negative lymph node count categories.

Figure 5
figure 5

Actuarial overall survival curve for patients with esophageal cancer by various negative lymph node count categories and separated by N category.

Table 3 Overall Survival by Total LN Count, by Nodal Staging Category

A cutpoint analysis intended to detect the total LN number related to the greatest OS differences. As tabulated in the same table, the highest chi-square statistic representing greatest group differences was observed at low LN counts: one for the overall cohort and five for N0 and N1 resected patients. However, significant differences were still encountered for cutpoints above 30 total LNs, always in favor of the group with higher total LN counts. The highest significant cutpoints were at 45 for N0 and at 35 for N1 disease stages.

Early Postoperative Deaths Based on Lymph Node Numbers

To separate esophagectomy-related (early) mortality from long-term survival in the analysis of LN count associations, we analyzed early mortality associations and conditional long-term OS separately. Death within 30 days occurred to 3% of N0 and 5% of N1 patients (p = 0.0004). Similarly, mortality at 30 days after resection was 5%, but 14% without resection (p < 0.0001); the corresponding 90-day mortality was 13% versus 30% (p < 0.0001). Significant relationships between mortality and LN counts existed for total LN counts, LN ratio, and negative LN counts, always with the lowest mortality rate for the higher LN count categories. Figure 6 depicts such mortality within 90 days by total LN count categories, LN ratio categories, and negative LN count categories. A long-term survival impact of LN counts was examined after excluding all deaths within 6 months after diagnosis. Figure 7 depicts actuarial conditional OS curves for patients with EC by various total LN count categories. Survival differences are less obvious, but still evident especially at LN counts of >30.

Figure 6
figure 6

Mortality within 90 days by total LN count categories, LN ratio categories and negative LN count categories. The horizontal bars mark the average 90-day mortality for that patient cohort.

Figure 7
figure 7

Actuarial conditional overall survival curves for patients with esophageal cancer by various total lymph node count categories. Included are only individuals alive at least 6 months from diagnosis.

Projected Numeric Lymph Node Impact on Overall Survival

Plots of actuarial OS at 5 years and at 10 years were generated for various total LN count categories (Fig. 8). The highest OS results were invariably observed at the highest LN count categories for the overall patient cohort as well as for adenocarcinoma and squamous cell carcinoma histologies. Based on a resulting linear regression model, the projected numeric total LN impact on 5-year OS was calculated for the entire cohort and separately by histologic type (Table 4). The results show a relative increase in OS at 5 years for every ten LNs identified of between 4 and 5%.

Figure 8
figure 8

Plots of actuarial overall survival at 5 and 10 years by total lymph node count categories. The shaded areas represent the 95% confidence intervals.

Table 4 Projected Numeric Total Lymph Node Impact on 5-Year Overall Survival, by Histologic Type

Implications of Lymph Node Ratio

The ratio of metastatic to total LNs (LN ratio), a previously reported prognostic parameter for EC survival, showed a strong association with OS results. When divided into quintiles, the lowest LN ratio (0.01 to 0.19) associated with the best survival (median = 1.75 years) and the highest LN ratio (0.8 and greater) with the worst OS (median = 0.67 years; p < 0.0001) in nodal positive patients. To examine the implications of total LN counts on LN ratio, we assessed median OS relationships with various LN ratio categories, again excluding 0 (i.e., N0 patients). Separation between OS outcomes of different LN ratio categories was greatly enhanced in settings of higher total LN counts, as displayed in Fig. 9.

Figure 9
figure 9

Median actuarial overall survival by various lymph node ratio and total LN count categories. The bars represent the standard error. LNR lymph node ratio. Only N+ patients are included.

Discussion

The results show a strong association between postoperative LN counts and survival after esophagectomy for EC. Invariably, higher total LN counts or negative LN counts are linked to better OS, which is observed in both N0 and N1 stage groups, as well as in both main histologic types of EC. These findings are perhaps even more profound, as they are derived from population data, with an anticipated mix between providing hospitals and surgeons regarding esophagectomy volume. Best survival after esophagectomy is usually obtained in high-volume settings, where more extensive resections including extended LNDs are the norm.57 Our findings would therefore generally corroborate those reports of others in which resection techniques linked to larger LN counts are associated with better OS results.1316 From available reports, it remains unclear which EC patients might benefit most from more extensive dissections with greater LN counts. Accordingly, among subsets that have been reported to benefit are patients with N0 SCC,22 N0 adenocarcinoma,23 T3N1 adenocarcinomas when less than nine LNs are involved,15 early SCCs where distant LN spread is more common that in early adenocarcinoma,24 or in those midthoracic lesions for which cervical and/or abdominal LND is included.6,2527 Although, in our results, the total LN count impact was more obvious in N0 than N1 disease, the observed benefits of greater LN counts are not restricted to any specific patient subsets and have thus to be explained as a more general phenomenon.

Whereas a therapeutic benefit of removing more LNs with potential metastatic disease is assumed to partake in this phenomenon, it cannot be proven from the available information. The numeric total LN effect in N1 patients, the benefit of negative LN counts in patients with more than 1 positive LN, and the conditional survival benefits of LN counts beyond 6 months, all usually within a range of 10 to 20% when comparing lowest and highest LN count groups, let us suspect some therapeutic effect because of better regional disease control. Multiple studies have described a high rate of immunohistochemically identified micrometastases to regional LNs, with generally negative prognostic implications, even when standard histopathologic examination would not reveal evidence for LN involvement.2830 Removing more of these LNs at risk may reasonably reduce avenues for subsequent oncologic progression.

There are, however, numerous caveats that need to be respected in the interpretation of our results. The large SEER population database has not been established to analyze specific surgical technical questions, and therefore, significant limitations in information accompany this analysis. Firstly, patients with sufficient information are highly selected from within the database, and coding errors or potential omissions cannot be ruled out. The selection process is necessitated in part by identifying patients who underwent surgical therapy, but also because of lack of complete data among surgically treated individuals. Naturally, this selection could introduce bias, if cases with complete data differ from others by treatment or other survival hazards; however, such potential bias cannot be controlled for in the context of numeric LN analyses. Furthermore, we lack data on LN location, the exact operative technique for local and regional dissections, any margin status, any chemotherapy given, or any response to preoperative chemoradiation. Other parameters that have been linked to post-esophagectomy survival are equally unknown, such as the institutional volume, surgeon volume, the patient’s performance or nutritional status, and the quality of macroscopic and histopathologic examination, all of which could possibly influence the LN status entered into the database. Is the total LN count or the negative LN count not just a result of more extensive regional dissection, but perhaps a surrogate for a healthier patient, or a better patient selection reflective of a high-volume, higher quality healthcare setting where better survival can be expected without actual better oncologic control of the underlying cancer? The SEER data alone do not allow controlling for volume–outcome relationships. However, high esophagectomy volume institutions frequently subscribe to standardized, wider regional dissection extents, and much of the undisputable volume–survival relationship may in fact already result from a greater lymphadenectomy extent alone.4 It is thus plausible that a large component of the LN count effects observed in the population data represents the spectrum from low-volume institutions in low LN count categories to high LN counts obtained in many high-volume settings. Obviously, LN numbers do not always equate to the true lymphadenectomy extent, but they certainly are the best surrogate available. Nevertheless, all these questions have to be considered carefully before possibly any practical implications of the results can be claimed.

Total and negative LN counts appear to be rather important for survival prediction of EC, and this information extends beyond predictive information from the TNM staging criteria. Limitations of the TNM staging system have been highlighted by others, but outside the number of positive LNs, LN counts have not been suggested as clinical staging criteria.31,32 The LN ratio does obviously reflect total LN counts aside from positive LN number. The LN ratio has been reported as prognostic variable in EC,5,32,33 including in one study based on the SEER data for EC between 1988 and 1997.34 We did not intend to merely duplicate this earlier effort with our analysis, but had a specific practical interest to define an optimal LN number to be removed at the time of esophagectomy, which would preferably represent a valid numeric target even for N0 disease, which the LN ratio is not. A definable number of LNs known preoperatively as target, to be removed by the surgeon and to be identified by the pathologist, would likely serve as a standard goal of EC care, irrespective of ultimate nodal involvement, in a system where standards throughout the population appear rather variable. Undoubtedly, wider LND influences the quality of staging,12,35 and the LN count impact on OS in N0 disease will reflect a mechanism of stage migration to a large extent. This is certainly corroborated by our findings of nodal stage assignment linked to different LN count profiles, and the largest cutoff point differences in low LN count ranges. Irrespective of the contributing mechanism being a result of better staging and/or better disease control, total LN counts of 30 or higher would appear to represent this preoperative target that can be linked to optimal survival results in our analysis. It should be noted, however, that the recommended total LN count of 30 is merely reflective of a desirable practical target; the observed numeric LN count impact is not an all-or-nothing phenomenon, but a gradual effect of a continuous biologic variable, i.e., the involved LN count. Complex biologic tumor and patient heterogeneity would suggest that the risk for residual positive LNs is not eliminated at a certain total or negative LN count, but rather likely to decrease gradually with increasing counts. Evidence for a continued numeric LN effect at higher LN count ranges and for nodal positive patients, is perhaps the strongest argument in favor of a true lymphadenectomy–survival relationship that can be extracted from the available data. In addition, these population-derived observations corroborate the findings of the few available RCTs mentioned earlier.13,14,16

Our results suggest that larger total LN counts are linked to better outcomes, with an optimal number of 30 or greater. This putative dissection goal is derived from standard LN evaluation techniques and may indeed change with qualitative analysis of LN involvement, such as through the sentinel LN technique.36 Other factors that may influence a wider LND goal in the future may be the development of specific and reliable staging criteria for early stage disease or major responses to preoperative chemoradiation,37 which could render the need for LN removal superfluous. For now, however, we interpret the findings as supportive for a more extended LN retrieval at the time of esophagectomy and recommend to obtain 30 or more LNs to expect an optimized quality of numeric EC staging, an optimal ability for survival prediction, and an optimized regional disease control with its potential for improved EC survival.