Introduction

After the discouraging results of randomized controlled trials on extended lymph node dissection in non-specialized western units at the end of last century [13], it appeared clear that a wise option for the implementation of extended lymph node dissection in such context was to limit splenectomy and distal pancreatectomy and to “tailor” the extent of surgery on the disease stage and especially on patient’s comorbidities [46]. Actually, we already demonstrated that the presence of comorbidities is the main independent predictor of operative morbidity and mortality. Maintaining a standardized tailored approach coupled with adequate coaching of less-experienced surgeons led to a safe implementation of extended lymph node dissection in our unit without any influence of surgeon-related factors [7]. The main drawback of this “tailored” approach could be a potential jeopardy of oncologic radicality in elderly and/or comorbid patients. Actually, the therapeutic basis of surgery for gastric cancer are built over the opportunity to perform a curative resection, or a R0 resection, defined as the absence of microscopic or macroscopic residual tumor by the TNM staging system [8]. This goal is reached in 50–85% of western patients, but local and/or systemic relapses are encountered in up to 50% of such cases [9, 10]. The current definition of R0 radicality in staging systems alone is insufficient to identify patients at risk of disease recurrence, as it does not take into account the impact of treatment-related variables, such as the extent of lymph node dissection. The clinical research during the past years tried to fill this gap through the development of new prognostic scores [1115].

The Italian Research Group for Gastric Cancer (IRGGC) developed and tested a prognostic score based on five independent variables [16]: pT status, pN status, type of lymph node dissection, age and maximum diameter of the tumor. A prognostic score allows to stratify the recurrence risk into low (IRGGC score ≤30), intermediate (31–60) and high (>60), with more than 80% of cases allocated into the two extreme classes.

The National Cancer Center Hospital in Tokyo developed a computer program in order to predict the percentage of lymph node metastases, based on a database of 3,843 cases treated in Japan [17]. After the input of seven variables (sex, age, macroscopic type, microscopic type, location, depth of tumor invasion, and maximum diameter) the program generates an output on the probability of metastasis in each of the 16 lymph node groups defined by the Japanese Gastric Cancer Association (JGCA) [18]. Diagnostic accuracy of this program was further tested on a German [19] and an Italian [20] population. Considering that several of the input variables are generally obtained after surgery on pathologic examination, the program was subsequently used to generate a prognostic index more than to guide the lymph node dissection in the single case: summing together the percentages of metastasis in the lymph node groups that are not removed at surgery it is possible to obtain the so-called “Maruyama index” (MI). This prognostic index has been applied to patients enrolled in the North American trial on adjuvant chemoradiation [21] and in the Dutch trial on extended lymph node dissection [22, 23], identifying a prognostic cutoff for MI = 5.

In order to overcome the problem of stage migration induced by extended lymph node dissection [24, 25], during the past years a new independent prognostic factor was largely investigated and validated: the lymph node ratio (LNR), defined as the absolute ratio between metastatic and examined lymph nodes, appeared to be a strong independent indicator of prognosis [2629], even in case of inadequate nodal staging (<15 examined lymph nodes) [30]. The IRGGC defined four prognostic intervals for LNR [21]: 0, 0.01–0.09, 0.10–0.25, >0.25. These intervals corresponded to a definite different prognosis even within the same pN status, independently from the extent of nodal dissection.

Currently, little or nothing has been published concerning these prognostic factors outside eastern or specialized centers. The purpose of this study was therefore a retrospective evaluation of factors influencing long-term results in a prospective database of R0 resections built during the implementation of extended lymph node dissection in our surgical unit.

Materials and methods

From 1998 to 2007, surgery for gastric adenocarcinoma was performed on 387 consecutive patients in our unit. Of these, 63 patients (16.3%) were not resected. The remaining 324 cases (83.7%) were submitted to resection of the tumor. Ninety-one cases were excluded from this study: 22 gastric stump neoplasms; 9 Siewert type II cardia neoplasms; 6 palliative resections with D0 lymph node dissection due to poor general conditions and emergency (bleeding) or a synchronous malignant neoplasm in other organs; 6 resections of a recurrent tumor; 2 resections after neo-adjuvant chemotherapy and 46 R1–2 resections. The remaining 233 cases constitute the population of this study. All perioperative data of these patients were prospectively recorded into a database and outpatient follow-up scheduled every 6 months during the first 3 years and yearly thereafter, up to the study censor scheduled at 31 December 2009.

Setting

All the operations were performed according to a standardized protocol of pre-, intra- and post-operative care in a tertiary care, 700-bed hospital of the Italian National Health System. In 1998, extended lymph node dissection (D2) according to the JGCA [18] was introduced into the clinical practice of the unit. Two surgeons (M.C., G.B.G.) with previous specific training in extended lymph node dissection obtained through stages in Japanese surgical centers, personally monitored its implementation and diffusion, coaching the other eight surgeons participating in the study.

Patient-related variables

There were 127 males and 106 females, mean ± SD age was 66.9 ± 11.5 years (median 68, range 36–95) and it was categorized according to its median value in univariate analysis or calculated as a continuous variable in multivariate analysis. Concerning the presence of comorbidities, patients were classified utilizing the American Society of Anesthesiologists (ASA) system: 147 were ASA class I–II and 86 ASA class III–IV. Body mass index (BMI, expressed as kg/m2) mean ± SD was 25.4 ± 3.7 (median 25.4, range 15.2–37.9) and it was categorized according to its median value in univariate analysis or calculated as a continuous variable in multivariate analysis.

Neoplasm-related variables

Location of the tumor was classified according to JGCA [18] into lower third, middle third and upper third. The mean ± SD maximum diameter of the tumor was 43.3 ± 25.0 mm (median 40, range 5–130), and it was categorized according to its median value in univariate analysis or calculated as a continuous variable in multivariate analysis. The sixth edition UICC-TNM staging system [8] was used to define depth of invasion (pT), nodal status (pN), stage grouping and grading. Microscopic type was defined as “intestinal” or as “non intestinal” (diffuse or mixed) according to Lauren [31].

Treatment-related variables

All operations were carried out in an elective setting under a strict perioperative care protocol including antibiotic and antithromboembolic prophylaxis. Details about the extent of organ resection and lymph node dissection according to patient- and neoplasm-related variables have been previously published [7, 32].

Overall mean ± SD number of examined lymph nodes per patient was 28.9 ± 14.4 (median 27, range 2–78). It was 17.8 ± 9.4 (median 15, range 2–52) after D1α-β lymph node dissection and 34.0 ± 13.9 (median 32, range 9–78) after D2 lymph node dissection, according to JGCA [18]. Overall number of cases with inadequate nodal staging (<15 examined lymph nodes) was 34 (14.6%), 29 (37.2%) after D1α-β lymph node dissection and 5 (3.2%) after D2 lymph node dissection.

At the study censor, the chief surgeon (G.B.G., surgeon #1) performed more than 50% of the operations and six out of ten participating surgeons performed at least ten cases each.

The MI [2123] (mean 17.2 ± 29.3, median 6, range 0–200) was categorized as < or ≥5 in univariate analysis or calculated as a continuous variable in multivariate analysis. The IRGGC score [16] (mean 46.9 ± 35.0, median 35.1, range 1.59–99.9) was categorized as ≤30, 31–60 and >60 in univariate analysis or calculated as a continuous variable in multivariate analysis. The LNR (mean 0.16 ± 0.24, median 0.03, range 0.0–0.94) was categorized as 0, 0.01–0.09, 0.10–0.25 and >0.25 in univariate analysis or calculated as a continuous variable in multivariate analysis. The eventual association of an adjuvant treatment (chemotherapy or chemo-radiation therapy) as well as the accrual year was also recorded and evaluated.

Endpoints and statistical analysis

All data were analyzed with StatsDirect© statistical software, version 2.7.7 (StatsDirect Ltd., Altrinchan, UK).

Disease-free survival was calculated at 5 and at 10 years from the operation by the Kaplan–Meier method [33]. Univariate analysis was performed applying the log-rank test [34]. All the resulting significant variables were considered into a multivariate analysis applying Cox’s proportional hazards model [35], generating “dummy” variable indicators for non-dichotomic variables and avoiding automated model building in order to eliminate any potential bias induced by the stepwise procedure [36]. Independent patient-related variables identified at multivariate analysis were further cross-matched with the status at censor (alive without disease, death due to operative mortality, alive with or death due to recurrent disease, death due to other causes or lost to follow-up) and the different subsets analyzed with the generalized Cochran–Mantel–Haenszel tests [37]. Statistical significance was assumed for p values <0.05.

Results

After a mean ± SD follow-up of 57.2 ± 39.2 months (median 52, range 0.1–142.2), 132 patients (56.6%) were alive at the study censor and 122 (52.4%) were without any evidence of recurrent disease. Causes of death of the remaining 101 patients were 10 cases (4.3%) of operative mortality (calculated as any death occurring within 60 days from the operation), other causes in 29 patients (12.4%), disease recurrence in 60 patients (25.7%) and two patients (0.9%) lost to follow-up.

Disease-free survival (mean ± SE) was 57.1 ± 3.1% at 5 years and 42.9 ± 4.2% at 10 years (Fig. 1). After univariate analysis (Table 1), gender, age, ASA status, stage, extent of lymph node dissection, MI, IRGGC score and LNR were considered into a multivariate analysis (Table 2). Disease-free survival was independently influenced by growing age (hazard ratio 1.03, 95% CI 1.01–1.05), ASA status III–IV (hazard ratio 2.35, 95% CI 1.53–3.61) and by growing LNR (hazard ratio 18.7, 95% CI 4.58–76.8).

Fig. 1
figure 1

Disease-free survival curve

Table 1 Univariate analysis
Table 2 Multiple variable analysis

Subset analysis of the patterns of age and ASA status cross-matched with the status at censor is represented in Table 3. No significant trends were identified (coefficient = 2.22, p = 0.136).

Table 3 Status at censor stratified by age group and ASA status

Discussion

A 5-year disease-free survival rate above 50% (Fig. 1) with an overall 30% rate of disease recurrence (Table 3) in this selected population of R0 resections is within the range of results previously reported in specialized western centers [28, 38] and well beyond those reported by western population-based studies [39]. However, the fact that long-term disease-free survival was independently affected by growing age and by the presence of comorbidities in this study can be interpreted as a lack of radicality in elderly and comorbid patients. Actually, subset analysis for the causes of death (Table 3) showed that recurrence rates are not significantly different among subgroups and that worst survival of elderly and comorbid patients is mainly due to operative mortality and death due to other causes. Particularly, the 4.3% operative mortality rate recorded in this study is higher than 1.7% reported in a series from a non-specialized western center adopting the same “tailored” policy [6]; nonetheless, 80% of these deaths were recorded in the group of elderly and comorbid patients (Table 3).

Concerning the other patient-related variable analyzed, BMI did not influence survival in our study, as it did not considering early results and lymph node retrieval [7, 32], while in eastern centers the presence of overweight (BMI > 25 in about 15% of cases) adversely affects both early and late results [40]. It should not be surprising while considering that overweight and obesity are a definite characteristic of 50–60% of western patients, western surgeons being probably more used than their eastern colleagues to deal with it [41]. Another possible issue of jeopardy of oncologic radicality may be represented by inadequate lymph node retrieval (<15 examined lymph nodes), already identified as having an independent negative impact on long-term results [42]. Notwithstanding a significantly higher incidence of inadequate lymph node retrieval in D1α-β versus D2 lymph node dissection [32, 43], it simply had no effect on long-term disease-free survival in this study. A possible explanation of this phenomenon may rely on one of the independent determinants of inadequate lymph node retrieval in our series, the single pathologist performing lymph node retrieval on formalin-fixed en-bloc specimens [32], meaning that inadequate nodal examination by the pathologist does not automatically translate into inadequate nodal clearance by the surgeon. The absence of any effect of the single surgeon or of the chief surgeon versus the others, as well as the absence of any time-trend depending on the accrual year can lead to the inference that the process of standardization and coaching during the implementation of extended lymph node dissection in our unit was effective and stable from the beginning of the experience.

Concerning the prognostic factors after R0 surgery, LNR was the only independent predictor of long-term disease-free survival, with four well-defined subgroups of patients according to the intervals determined by the IRGGC [28] (Fig. 2). This study suggests that LNR should therefore be used to evaluate the long-term prognosis after R0 surgery for gastric cancer also in a non-specialized western center. It was already demonstrated that LNR is not significantly influenced by the number of examined lymph nodes [28, 32, 44], therefore limiting the stage-migration phenomenon induced by extended lymph node dissection and a tumor-ratio-metastasis (TRM) instead of a tumor-node-metastasis (TNM) staging system already suggested [45]. In the recent new seventh edition of the TNM staging system [46], the UICC elected to retain the pN classification based on the number of metastatic nodes, modifying pN grouping with a new pN1 group with 1 or 2 metastatic lymph nodes and to introduce a different stage grouping. It seems that this new pN stage-grouping is more accurate as a prognostic predictor than its precursor [47]. It will be interesting, in a near future, to investigate the prognostic yield of this new TNM stage-grouping faced to LNR.

Fig. 2
figure 2

Disease-free survival curves according to the lymph node ratio (LNR) categories