Introduction

Gastric cancer is one of the most common cancers and is associated with a high mortality rate worldwide [1]. The prognosis of patients with advanced gastric cancer remains poor despite of the recent progress in cancer treatment. Gastrectomy with regional lymph node dissection is the most effective treatment for gastric cancer. For advanced gastric cancer, gastrectomy with D2 lymph node dissection is recommended worldwide [2,3,4]. Though subtotal gastrectomy is the preferred surgical procedure whenever possible in order to limit post-operative morbidity and to optimize quality of life, total gastrectomy (TG) is indicated in the treatment of certain gastric cancers, including that located in the upper third of the stomach or advanced gastric cancer extending to the cardia, diffuse signet ring gastric adenocarcinoma, and hereditary diffuse gastric cancer (CDH1 mutation), both in the prophylactic setting.

TG is well known as complex procedure that carries a significant risk of postoperative morbidity and mortality. Major complications after TG including esophagojejunal anastomotic leakage, duodenal stump leakage, and pancreatic fistula related to lymphadenectomy can be fatal. Serious morbidity and mortality were significantly higher in the TG group than the partial gastrectomy group [5]. Large Western centers have previously demonstrated the data regarding morbidity and mortality after TG, which conclude the 30-day morbidity and mortality rates of approximately 30% and 5%, respectively [5, 6]. In Eastern, Watanabe et al. reported that the overall morbidity was 26.2%, with a 30-day mortality rate of 0.9% based on the National Clinical Database (NCD) and established risk stratification model of postoperative outcomes and identification of patients at risk for morbidity [7].

Recently, there is a growing recognition that the multidisciplinary care in high-volume hospital can improve postoperative both short and long outcomes of gastric cancer patients [8, 9]. Claassen et al. demonstrated that gastric surgery performed in high-volume hospital was associated with better surgical quality and prognosis based on data from CRITICS trial [10, 11]. In Germany, the observational study using national hospital discharge data reveled that treatment in very high volume is associated with lower mortality compared to low-volume hospital [12]. In also Eastern, hospital volume–outcome relationship was demonstrated for perioperative mortality after TG using Taiwan National Insurance Research Database between 2000 and 2010 [13]. However, the relationship between surgeon and hospital volume with postoperative morbidity and mortality after TG for gastric cancer remains unknown in a larger and more recently cohort.

In Japan, NCD was founded in 2010 as the parent body of the database system linked to the board certification system, which contains records of ≥ 95% of the surgeries performed by regular surgeons in Japan [14]. Almost 5000 facilities have enrolled, and over 11,300,000 cases have been registered. Most recently, we revealed that hospital volume had a strong impact on postoperative mortality after distal gastrectomy (DG) for Japanese patients with gastric cancer using NCD [15]. In the current study, to clarify the impact of surgeon and hospital volume on postoperative mortality for TG, we evaluated data from 71,307 Japanese patients with gastric cancer enrolled in NCD.

Methods

Data collection

From 2011, the NCD collected data on more than 11,300,000 surgical cases from more than 5000 institutes. In the gastroenterological surgery section, the database registered all surgical cases that fell into this category; in addition, it required detailed input items for eight procedures, including gastrectomy, that were determined to represent the performance of surgery in each specialty. The NCD constructed software for an Internet-based data collection system, and the data managers of the participating hospitals were responsible for forwarding their data to the NCD office. The NCD ensures traceability of its data by maintaining continuity in the staff who approve the data, the staff of the departments in charge of annual cases, and the data-entry personnel. It also validates data consistency via random inspections of participating institutions.

In this study, we focused on the specific NCD section for gastrointestinal surgery. Briefly, potential independent variables included patient demographics, pre-existing comorbidities, preoperative laboratory values, and operative data.

Patients

A total of 71,307 patients, who underwent TG for gastric cancer at 2051 institutions between January 1, 2011 and December 31, 2015, were eligible for analysis. Records with missing data on patient age, sex, or outcome were excluded.

Endpoint

The primary outcome measure of this study was 30-day and operative mortalities. Operative mortality was defined as death during the index hospitalization, regardless of the length of hospital stay (≤ 90 days), as well as after hospital discharge within 30 days from the operation date.

Surgeon and hospital volume

We defined surgeon volume as the number of TGs performed by a patient’s surgeon in the previous year. Surgeon volume was divided into the following four groups: S1 (0–2 cases per year), S2 (3–9 cases), S3 (10–25 cases), and S4 (26+ cases). We divided hospital volume by the number of TGs performed in the previous year into the following three tertiles: H1 (0–11 cases per year), H2 (12–26 cases), and H3 (27–146).

Statistical methods

All statistical calculations were performed with STATA 15 (STATA Corp., TX, USA). We compared median values with the Kruskal–Wallis test for operation time and estimated blood loss, and Chi-squared test for all other variables. All P values were two-sided, and we considered P < 0.05 as statistically significant. First, we analyzed the relationship between annual surgical volume (hospital volume or surgeon volume) and surgical mortality after gastrectomy with hierarchical logistic regression models accounting for clustering of patients by surgeons and hospital levels. To adjust for patient-level risk factors, the following variables, which were used in the scoring system established by NCD data [16], were utilized: demographic factors, such as age category; preoperative functional status, such as need for any assistance with activities of daily living (ADL); history of cerebrovascular disease; weight loss more than 10%; uncontrolled ascites; ASA score class 3 or more; pre-existing comorbidities, such as the presence of respiratory distress, disseminated cancer, operative factors, such as emergency surgery, with pancreatectomy and laparoscopic gastrectomy; and preoperative laboratory data, such as white blood cell count more than 11,000/μL, anemia (hematocrit < 30%), serum albumin less than 3.0 g/dL, alkaline phosphatase more than 600 IU/L, serum Na less than 138 mEq/L, and prothrombin time-international normalized ratio more than 1.25, low platelet count (< 12 × 104/μL), aspartate aminotransferase more than 35 IU/L, and increased level of total bilirubin (> 2 mg/dL). In addition, to illustrate the relationship between operative mortality and surgeon volume as a continuous variable, generalized estimation equation logistic regression models were utilized, in which a restricted cubic spline model was implemented. All procedures were conducted in accordance with the ethical standards of the respective committees on human experimentation (institutional and national) and with the Helsinki Declaration. An ethics committee that includes members of the Japanese Surgical Society ethics board, lawyers, patient representatives, and experts on information security that considered the ethical propriety of the entire initiative approved it and made the review process public on the Japan Surgical Society website. The use of data from the registry for retrospective observational studies was approved by The Japanese Society of Gastroenterological Surgery committee and the institutional Review Board of Kumamoto University committee (No. 1514), and individual written or verbal informed consent was waived because of the retrospective design.

Results

We retrieved data on a total of 71,307 patients who underwent TG for gastric cancer by 10,274 surgeons at 2051 institutions from January 2011 to December 2015. Surgeon volume ranged from1 to 79 TG per year. Annual surgeon volume was divided into the following four groups: S1: 0–2 (n = 34,362), S2: 3–9 (n = 27,532), S3: 10–25 (n = 8088), and S3: 26–79 (n = 1325) cases per year (Fig. 1a).

Fig. 1
figure 1

The definition and distribution of surgeon volume and hospital volume a surgeon volume, b hospital volume

Patient demographic data and preoperative risk assessment according to surgeon volume are summarized in Table 1. Low-volume surgeons had operated on significantly older patients and poorer-risk patients with various comorbidities and organ dysfunctions. More laparoscopic TG were performed by the high-volume surgeons (P < 0.001). The operative mortality rate decreased with surgeon volume, 2.5% in S1, 1.8% in S2, 1.2% in S3, and 0.6% in S4. Regarding surgical complications, anastomotic leakage was more frequently observed among low-volume surgeons and the incidence of pancreatic fistula was higher among high-volume surgeons (P < 0.001), which is consistent with our DG data [15]. The rate of all nonsurgical complications including pneumonia, the reoperation rate and the rate of septic shock were significantly higher among low-volume surgeons (P < 0.001).

Table 1 Preoperative risk assessment and surgical outcomes according to surgeon volume category

Hospital volume ranged from 1 to 146 TG per year. Annual hospital volume was distributed among the tertiles as follows: category H1: 1–11 (n = 24,027), H2: 12–26 (n = 24,908), and H3: 27–146 (n = 22,327) cases per year (Fig. 1b). Patient demographic data and preoperative risk assessment according to hospital volume category are summarized in Table 2. Low-volume hospitals had significantly older patients and poorer-risk patients with various comorbidities and organ dysfunctions. More laparoscopic TG were performed in the high-volume hospitals (P < 0.001). Operative mortality was 3.1% in H1, 1.7% in H2, and 1.2% in H3. A significant reduction in mortality rate was observed according to hospital volume (P < 0.001). Regarding surgical complications, we found no significant difference of the rate of anastomotic leakage between in low- and high-volume hospitals. The incidence of pancreatic fistula was higher in high-volume hospitals, which is similar to low-volume surgeons (P < 0.001). The rate of all nonsurgical complications excluding pneumonia, the rate of septic shock, and event rate of central nervous system were significantly higher in low-volume hospitals (P < 0.001).

Table 2 Preoperative risk assessment and surgical outcomes according to hospital volume category

Table 3 summarizes the 95% CIs for overall mortality after TG from the hierarchical logistic regression models. The lowest volume surgeons were significantly associated with higher mortality (Fig. 2a, S1: OR, 1.29, 95% CI 1.01–1.64, P = 0.043), and higher-volume hospitals were significantly associated with a decreased risk of mortality in a dose-dependent manner (Fig. 2b, H2: OR, 0.61, 95% CI 0.53–0.71; P < 0.001, H3: OR, 0.51, 95% CI 0.43–0.61, P < 0.001). After risk adjustment for surgeon and hospital volume, hospital volume was significantly associated with operative morality (Fig. 2d, H2: OR, 0.63, 95% CI 0.54–0.73, P < 0.001; H3: OR, 0.53, 95% CI 0.43–0.63, P < 0.001), whereas surgeon volume was not (Fig. 2c), which is consistent with our DG data [15].

Table 3 Ninety-five percentage confidence intervals for overall mortality after total gastrectomy from the hierarchical logistic regression models
Fig. 2
figure 2

Forest plot for overall mortality calculated by multivariate logistic regression analysis. a Surgeon volume adjusted by risk model variables, b hospital volume adjusted by risk model variables, c, d surgeon and hospital volume adjusted by risk model variables including hospital volume. Demographic factors, preoperative functional status, pre-existing comorbidities, operative factors, and preoperative laboratory data weere utilized to adjust for patient-level risk factors

Furthermore, the OR for operative mortality gradually decreased in a surgeon volume-dependent manner after risk adjustment for both only patient factors such as demographic factors, preoperative functional status, pre-existing comorbidities, operative factors, and preoperative laboratory data (Fig. 3a) and adding hospital volume (Fig. 3b). OR for operative mortality reaches the plateau in about 40 cases/year with or without risk adjustment.

Fig. 3
figure 3

Odds ratio of mortality after gastrectomy according to surgeon caseload per year a adjusted by risk model variables and b adjusted by risk model variables including hospital volume. Solid lines: observed odds ratio; dashed lined: 95% confidence interval from the logistic regression model. Demographic factors, preoperative functional status, pre-existing comorbidities, operative factors, and preoperative laboratory data were utilized to adjust for patient-level risk factors

Discussion

In the current nationwide study, we revealed that hospital volume was associated with postoperative mortality after TG compared to surgeon volume among 71,307 Japanese patients with gastric cancer. This result is consistent with our previous data regarding DG and Taiwan study by Wu et al. [13, 15]. Recently, there are growing evidences that hospital volume is associated with postoperative mortality based on national database and large-scale clinical trials [10, 11]. However, most of studies included several types of gastrectomy such as TG and DG with or without the resection of the surrounding organs. To the best of our knowledge, this is the first largest scale of report focused on TG, which is the most complex procedure for gastric cancer using national clinical database.

Wu et al. first revealed the impact of the hospital volume of TG on the long-term surgical outcomes in 7905 patients with gastric cancer between 2000 and 2010 in nationwide study [13]. As Wu et al. discuss, this finding is attributed to some possible factors as follows: advances of perioperative care, more experienced surgical skills, and development of surgical devices in recent time period. On the issue of survey period, we believe that our data collected from NCD between 2011 and 2015 were more feasible for clinical practice in this era. Most recently, based on the data from CRITICS trial, pivotal clinical trials regarding perioperative adjuvant therapy for advanced gastric cancer conducted in Netherland, association between hospital volume and both short-, and long-term outcomes was published [10, 11]. Intriguingly, although there were no significant difference in postoperative complications or mortality between the hospital volume categories [11], high hospital volume was associated with higher overall and disease free survival [10]. Approximately 85% hospital in the CRITICS trial has more than 20 resections per year. Therefore, centralizing gastrectomy for gastric cancer naturally occurs. However, it remains unknown whether centralization of gastric cancer surgery can improve morbidity and mortality rates in clinical practice because the patients enrolled into clinical trials are likely to have fair condition.

Previous studies that examined the centralization of gastric cancer treatment produced heterogeneous and conflicting findings due to technical, regional, and demographic factors. Nelen et al. recently reported that centralizing gastrectomy improved the number of harvested lymph nodes and successfully introduced laparoscopic gastrectomy [8]. Furthermore, Lee et al. demonstrated that hospital volume did not directly affect postoperative morbidity and mortality achieved by well-trained beginners of laparoscopic gastrectomy [17]. This finding suggests that surgeon volume is an important factor affecting postoperative outcomes in laparoscopic gastrectomy. On the other hand, Nelen et al. also showed that centralization of gastrectomy did not improve the mortality rate in the Eastern part of the Netherlands [8], which is similar to the results reported by Thompson et al. [18]. However, these studies were conducted in small cohorts and in some parts of the West. In Japan, morbidity and mortality rates of gastric cancer surgery are substantially lower, even in with low surgeon and hospital volumes, compared to the West [19]. These findings lead to more questions as to how a low surgeon volume in a high-volume hospital can impact mortality.

There are several potential benefits for short-term outcomes that result from a high hospital volume. First, the surgeon can provide referrals to various experts for patients with postoperative morbidities. Second, high-volume hospitals generally have a sufficient cooperative structure for diagnostic and interventional procedures after postoperative morbidities. Third, surgeon volume will increase in high-volume centers under supervision by experts for gastrectomy, leading to lower mortality. Konno et al. reported that the participation of board-certified surgeons in gastroenterological surgery, including TG, contributes to favorable surgical outcomes of gastroenterological surgery using NCD data [20]. These potential benefits may lead to favorable short-term outcomes in higher volume hospitals. Furthermore, it is possible that the patient characteristics depend on each surgeon and hospital. Busweiler et al. reported that elderly patients might benefit specifically from centralization [21]. In this study, low-volume surgeons and hospitals had significantly older patients and poorer-risk patients with various comorbidities and organ dysfunctions (Tables 1, 2). Our study is real-world evidence from a nationwide cohort study after adjustment for these independent variables for operative mortality, which is used in the scoring system established by the NCD data.

Our study has some limitations. First, long-term outcomes, such as recurrence-free survival and overall survival, were not evaluated. Further analysis of the impact of surgeon and hospital volume on long-term outcomes after TG is required, because postoperative complications can lead to adverse effects on OS and DFS [22, 23]. In Japan, the data of long-term outcome are interlinked to NCD by the nation gastric cancer registration system by Japanese Gastric Cancer Association. Second, it remains controversial whether the cut-off value of surgeon and hospital volume defined by Japanese population is appropriate to apply to clinical practice in other Eastern and Western countries. To clarify the criteria of the surgeon volume as a continuous variable, we analyzed OR of mortality by generalized estimation equation logistic regression models. The OR of mortality according to surgeon caseload per year reaches the plateau in about 40 cases per year. However, it is difficult for a surgeon to experience 40 cases per year even if in high-volume hospitals. Furthermore, it is possible that surgeon volume drastically changes every year, depending on hospital where surgeon belongs, whereas hospital volume is assumed to be constant. This is possible reason why hospital volume had a strong impact on postoperative mortality after TG. Therefore, to determine the concrete threshold of surgeon and hospital volume, the differences of epidemiology, biology, and treatment strategy of each country should be considered. Despite these limitations, it is possible that our results have implications for improving healthcare delivery.

Conclusions

Hospital volume had a strong impact on postoperative mortality after TG for Japanese patients with gastric cancer in a nationwide web-based data entry system, NCD, which suggests that centralization may improve outcomes after TG.