Introduction

The American Joint Committee on Cancer (AJCC) TNM staging system is the most commonly used staging system in gastric cancer, which stratifies patients into several risk groups according to the depth of tumor invasion, number of metastatic lymph nodes (mLNs), and distant metastasis. The seventh edition published in 2010 suggested that at least 16 lymph nodes should be examined for optimal staging.1

Several studies have demonstrated that an adequate number of examined lymph nodes (eLNs) were associated with improved overall survival (OS) in gastric cancer.24 Insufficient number (<16) of eLNs may cause “stage migration,” which was observed in 10–15% cases of gastric cancer,57 and further affects the treatment and prognosis of this group of patients.8

Lymph node ratio (LNR) is defined as the ratio between mLNs and eLNs. Several studies proposed the ratio-based staging systems, which have been approved to reduce stage migration.7, 911 However, the revised ratio-based classification still has some limitations.12, 13 In addition, several proposed ratio-based staging systems have different cutoff values.9, 11, 14 Thus, the extensive implication of ratio-based systems could also be limited by the different cutoff values.

Nomogram, a statistics-based tool that provides the overall probability of a specific outcome, has been widely used in many kinds of malignant tumors.15 Several nomograms have been established to predict the OS of gastric cancer patients.1620 However, they were mainly based on the data of patients with sufficient numbers of eLNs, and no current nomogram has been validated in the patients with insufficient number of eLNs. Therefore, the predictive values of these nomograms may be limited in those with insufficient number of eLNs.

In the current study, we attempt to develop and validate a nomogram which may predict accurately OS of patients with insufficient number of eLNs, compared with AJCC TNM staging systems and the revised ratio-based staging systems.

Methods

Patients

Since D2 lymphadenectomy was associated with lower loco-regional recurrence and cancer-related death and recommended as standard surgery in both Asian and Western,21, 22 patients with gastric cancer were recruited who underwent extended lymphadenectomy (≥D2) and achieved radical (R0) resection at the Department of Surgical Oncology, The First Affiliated Hospital of China Medical University between January 1980 and December 2006. The extent of lymphadenectomy was performed according to the guidelines of the Japanese Gastric Cancer Association (JGCA).23 R0 resection was defined as complete removal of all tumors without microscopic or macroscopic tumor residual.

Patients with insufficient number of eLNs (<16) and without other combined cancer were included. Patients were excluded if they received neoadjuvant chemotherapy or if any information had been missed. Patients who died in the postoperative period were also excluded. Finally, 872 patients with insufficient numbers of eLNs and complete information were included in our primary cohort. This study was approved by the Institutional Ethics Committee of China Medical University.

The following demographic and pathologic characteristics were collected: age, sex, tumor location, tumor size, macroscopic type, histological type, lymphovascular invasion, extent of lymphadenectomy, depth of invasion, the number of eLNs and mLNs, the follow-up period, and survival status. The tumor location was classified into three categories according to the location of the main lesion: upper third, middle third, and lower third. The macroscopic type was classified into five categories: early gastric cancer and advanced gastric cancer Borrmann types I to IV. The histological type was divided into two categories: differentiated type, including papillary adenocarcinoma and well- and moderately differentiated tubular adenocarcinoma, and undifferentiated type, including poorly differentiated tubular adenocarcinoma, signet ring cell carcinoma, and mucinous adenocarcinoma. The depth of invasion was classified according to the seventh edition of the AJCC TNM classification.1 In our present study, 464 (53.2%) patients received adjuvant therapy during our study period and the treatment pattern was highly variable. Therefore, adjuvant therapy may not be an ideal model covariate in our present study. In addition, Balachandran et al. suggested that treatment should be avoided as a covariate in the prediction model unless there are validated data from a randomized clinical trial.24 In the light of these considerations, we did not include adjuvant therapy in our prediction model and only used clinicopathological characteristic variables to predict survival.

After being discharged from the hospital, the patients received telephone interview or outpatient follow-up every 3 months in the first 3 years, 6 months in the fourth and fifth years, and once a year thereafter. The follow-up period was measured from the time of operation to the date of last follow-up (March 31, 2015) or death date.

Development and Validation of the Nomogram

Nomogram construction and validation were performed on the basis of the guidelines proposed by Iasonos.15 We assigned randomly two thirds of patients to the development cohort (n = 582) and one third to the validation cohort (n = 290).

To identify the independent survival predictors, multivariable analysis was performed using a Cox proportional hazards regression model and the data of development cohort. Continuous variables (tumor size, the number of eLNs and mLNs) were fit using restricted cubic splines to relax the linearity assumption.25 The nomogram was constructed to predict the 5- and 10-year OS based on the results of the multivariable Cox regression model. And the final factors included in the model were identified by a backward stepwise selection process with the Akaike information criterion.25

The performance of the nomogram was evaluated with respect to discrimination and calibration. Discrimination was measured by the concordance index (C-index),26 which refers to models’ ability to distinguish the outcomes correctly. The larger the C-index, the more accurate the model predicted.27 Calibration curves were performed by comparing the means of the nomogram-predicted OS with the observed Kaplan–Meier estimate of actual OS. We used the bootstrapping (1000 repetitions) method to reduce the estimate bias. Additionally, the model validations were also performed using the validation cohort. For patients in validation cohort, we first calculated total points according to the established nomogram. Next, we performed the Cox regression analysis using the total points as a factor. Finally, the regression analysis was used to generate the C-index and calibration curves.

Categorization of Patients in Different Staging Systems

Patients were categorized according to the seventh edition of AJCC TNM staging systems.1 A novel LNR-based staging system, named TNrM, was established according to the same T and M definitions of the seventh AJCC staging system and the Nr classification (Nr0 0%, Nr1 1∼20%, Nr2 21∼50%, Nr3 >50%). The cutoff values of LNR were determined by using the log-rank test and were consistent with the results of previous studies which came from the same patient database.9, 13

Statistical Analysis

The clinicopathological characteristics were compared using the χ 2 test for categorical variables and two-tailed Student’s t test for continuous variables. OS was estimated using the Kaplan–Meier method. The log-rank test was used to examine the survival difference between subgroups. A two-tailed P value <0.05 was considered as statistically significant. All statistical analyses were performed using STATA version 12.0 for Windows (StataCorp, College Station, TX) and R software version 3.2.0 with the rms, Hmisc, and survival package (http://www.r-project.org/).

Results

Clinicopathological Characteristics and Survival

The clinicopathological characteristics of 872 patients are presented in Table 1. In the primary cohort, three quarters of patients were male and the median age was 60 years old (range 26–90). Most of tumors were located in the lower third (n = 533, 61.1%). More than 70% patients were classified as pT3 or pT4 stage according to the seventh AJCC TNM staging system. A majority of patients underwent D2 lymphadenectomy (n = 645, 74.0%). The mean numbers of eLNs and mLNs were 9.45 ± 4.02 (range 1–15) and 2.14 ± 2.92 (range 0–14), respectively. In the development and validation cohorts, the mean numbers of eLNs were 9.36 ± 4.03 and 9.62 ± 4.01, and the mean numbers of mLNs were 2.15 ± 2.94 and 2.14 ± 2.88. No significantly different variables were observed between development and validation cohorts (all P > 0.05).

Table 1 Demographic and clinicopathologic variables of the development and validation cohorts

At the last follow-up, a total of 694 patients had died and the median and mean follow-up time was 48 and 86 months (range 3–410 months), respectively. The 5-year OS of the primary cohort was 45.2%.

Nomogram Development and Validation

In the univariate analysis, sex, age at operation, tumor location, tumor size, macroscopic type, lymphovascular invasion, depth of invasion, and the number of mLNs were associated with OS, whereas the histological type, the extent of lymphadenectomy, and the number of eLNs were not (Table 2). We also incorporated the number of eLNs in multivariate analysis to improve the model performance of our nomogram.

Table 2 Univariate and multivariable Cox proportional hazards regression analysis based on development cohort

Multivariate analysis showed that sex, tumor location, macroscopic type, lymphovascular invasion, depth of invasion, and the number of mLNs and eLNs were significantly associated with OS (all P < 0.05) (Table 2).

On the basis of the results of multivariate analysis, we built up a nomogram which could predict the 5- and 10-year OS (Fig. 1a). For each prognostic factor in the nomogram, the point can be determined by drawing a line straight upward from each predictor to the point axis. The total points are estimated by summing each point and located in the total point axis. The probability of 5- and 10-year OS can be determined by drawing a line straight down from the total point axis. The C-index for the developed nomogram to predict the OS was 0.742 [95% confidence interval (CI) 0.719∼0.764]. The calibration curves showed good agreement between the nomogram-predicted probability and the actual survival probability (Fig. 1b).

Fig. 1
figure 1

a Nomogram predicting 5- and 10-year overall survival of patients who underwent extended lymph node dissection with insufficient examined lymph nodes. Calibration curves of the nomogram in the development cohort (b) and validation cohort (c). The nomogram-predicted probability of overall survival is plotted on the x-axis; the actual overall survival estimated by the Kaplan–Meier method is plotted on the y-axis (EGC early gastric cancer, LNs lymph nodes)

In the validation cohort, the C-index for predicting OS was 0.743 (95% CI 0.710∼0.776). The calibration curves showed that the nomogram performed well in predicting the 5- and 10-year OS (Fig. 1c).

We also evaluated the discrimination ability of the nomogram by dividing the patients into quartiles according to the points and plotted the Kaplan–Meier curves of both development and validation cohorts (Fig. 2). The survival curves showed that our nomogram had good discriminatory ability for patients.

Fig. 2
figure 2

Kaplan–Meier survival curves of the development cohort (a) and validation cohort (b) according to quartiles of predicted overall survival

Comparison Between the Developed Nomogram and Other Staging Systems

The TNrM staging system was established according to the method that we have mentioned earlier. The novel staging system has the same T and M classification with the seventh AJCC staging system, and the N classification was just replaced by the Nr classification. The novel revised staging system is presented in Table 3.

Table 3 The novel TNrM staging system

The C-index of our nomogram in the development cohort was 0.742 (95% CI 0.719∼0.764), which was significantly higher than the seventh AJCC TNM staging system (C-index 0.707; 95% CI 0.683∼0.732) and the TNrM staging system (C-index 0.712; 95% CI 0.688∼0.735) (all P < 0.001). The same results were also observed in the validation cohort; the C-index of the nomogram (C-index 0.743; 95% CI 0.710∼0.776) was higher than the C-indices of the seventh AJCC staging system (C-index 0.705; 95% CI 0.670∼0.740) and the TNrM staging system (C-index 0.707; 95% CI 0.673∼0.741) (all P < 0.001). In addition, Fig. 3 shows the distribution of nomogram-predicted 5-year survival according to the seventh AJCC TNM staging system and TNrM staging system. A wide range of nomogram-predicted survival was identified in both TNM staging system and TNrM staging system. The heterogeneity of OS in each stage of TNM staging systems and TNrM staging systems is significant, especially in stages II and III.

Fig. 3
figure 3

Distribution of nomogram-predicted 5-year overall survival according to seventh AJCC staging system (a) and TNrM staging system (b)

These results revealed that our nomogram might be superior to the current staging systems for prediction of OS in gastric cancer patients with insufficient numbers of eLNs.

Discussion

For malignant tumor, staging is a critical component of therapy. The accurate staging of patients could convey the extent of disease, help oncologists make appropriate clinical therapy decisions, and evaluate the prognosis. The status of mLNs was considered one of the most important prognostic indicators of gastric cancer,2830 and the AJCC TNM staging systems defined N classification according to the absolute number of mLNs since 1997.1, 31, 32 However, some authors reported that the accurate classification of the mLNs is influenced by the number of eLNs significantly.33, 34 Insufficient number of eLNs can lead to an underestimation of stage, which is also called “Will Rogers” phenomenon.35

The number of eLNs is often determined by two independent parts: one is the extent of lymphadenectomy, and the other is lymph nodes harvested from specimen after surgery, and furthermore, the difference of lymph nodes number was mainly associated with lymph nodes harvested from the specimen after surgery, rather than the extent of lymphadenectomy.36 Surgeons may also be influenced by the suggested minimum number of LNs required for examination and carry out retrieval carelessly or not strictly.3 In addition, some clinicopathological factors and the immunologic response of patients may be associated with insufficient number of eLNs.4, 8, 37, 38 Actually, many previous studies also found that about 10%–20% of patients had insufficient number of lymph nodes examined despite the extended lymphadenectomy was performed, especially in western population.6, 39, 40

In the current study, we developed and validated a nomogram predicting the OS of patients who underwent extended lymphadenectomy with insufficient number of eLNs. The C-indices and calibration curves revealed that our nomogram performed well in predicting OS. Comparing with the conventional TNM staging systems, nomograms attempt to combine all proven prognostic factors and quantify risk as precisely as possible, rather than producing different risk groups.20 Thus, for many cancers, nomograms are proved to be more accurate in predicting individualized survival than TNM staging systems.16, 4143 In our present study, our nomogram showed better predictive accuracy of OS than the seventh edition AJCC TNM staging systems.

Several nomograms categorized the mLNs according to the seventh AJCC TNM staging system or using statistical methods and incorporated into the nomogram.1618 Categorical variable is convenient for clinical implication, but insufficient number of eLNs may limit accuracy of classification. For instance, it is impossible to categorize patients with insufficient numbers of eLNs as N3b group. Besides that, N1 group can be upstaged to N2 or N3 group when more LNs were examined.33 Thus, categorization of mLNs may be not able to reflect truly the extent of disease and assign patients inaccurate points.

One most recent nomogram based on a western database incorporated LNR but without the number of eLNs included.44 On the one hand, previous studies have showed that patients with different lymph nodes examined had different survival rates, especially for patients with insufficient number of eLNs and rN0 group.9, 12 Therefore, the impact of eLNs should not be ignored. On the other hand, patients with the same LNR may have the different survival12 and incorporating LNR into our model may introduce bias. In our nomogram, we incorporated both the number of eLNs and mLNs into the nomogram and using continuous variables to predict the OS, which may help to reflect accurately the status of lymph node involvement in some extent. In addition, we noted that including both the number of eLNs and mLNs had a larger goodness-of-fit statistic (R 2) than we included LNR or mLNs alone and showed better model performance. However, including more covariates may lead to overfitting of the model.24 In our present study, we used the bootstrapping method (1000 repetitions) to quantify the overfitting of our model. Our calibration curves showed good agreement between nomogram-predicted and the actual survival (Fig. 1b, c) These results revealed minimal evidence of overfitting.

There are some limitations in the current study. First, our nomogram was established based on clinicopathological data from a single institution. Thus, our results might be biased by institutional practice patterns.24 And whether this nomogram can be applied to other patient groups with insufficient number of eLNs remained to be evaluated. Second, our study cohort lasted for a long time period (1980–2006), which may introduce external variations in long-time outcomes.45 Third, adjuvant chemotherapy has been confirmed to improve OS in gastric cancer patients after extended lymphadenectomy in recent years.46, 47 Nonetheless, we did not include adjuvant chemotherapy in our analysis, because adjuvant therapy was not a common treatment during the long period of our study and many patients did not receive the standard adjuvant chemotherapy. Thus, our current study could not evaluate precisely the effect of adjuvant therapy, which may affect the accuracy of our conclusions. In addition, adjuvant therapy was not controlled in our predictive model; however, it may be a confounder for OS and the validity of our results may be limited. Thus, the treatment effect of adjuvant chemotherapy will be considered in our further studies.

Conclusions

In summary, the nomogram established in current study predicted accurately the OS of gastric cancer patients with insufficient number of eLNs after extended lymphadenectomies. We provided a new method to assess the prognosis for gastric cancer patients who had high risk of stage migration. The value of this nomogram for gastric cancer patients from other institutions would be determined in the future studies.