Since the first report of laparoscopic gastrectomy with lymph node dissection for gastric cancer in 1991, laparoscopic distal gastrectomy (LDG) has become widely accepted by surgeons as an option of common practice [1, 2]. So far, many retrospective studies have demonstrated the benefits from its minimal invasiveness including shorter hospital stay, less bleeding, and accelerated recovery [3, 4]. Furthermore, recent phase III studies have shown that LDG is comparable to open gastrectomy in terms of the surgical outcomes [5,6,7]. On the other hand, laparoscopic total gastrectomy (LTG) for gastric cancer has not been regarded as a common procedure because reconstruction and lymph node dissection around the spleen, which is required in LTG, are more technically complicated and difficult. Although some retrospective studies have demonstrated the safety of LTG compared to open total gastrectomy (OTG), the small sample sizes of these studies have prohibited us from making firm conclusions about the non-inferiority of LTG to OTG [8,9,10,11]. In addition, to our knowledge, no phase III randomized control trials (RCTs) have evaluated the surgical outcomes of LTG against OTG. Therefore, whether the surgical outcomes of LTG are comparable to those of OTG in our current practice still remains as an important clinical question for gastrointestinal surgeons.

To evaluate and improve the quality of surgeries, a nationwide surgical case registration system, the National Clinical Database (NCD), was initiated in 2011 in Japan [12, 13]. Presently, 1,200,000 patients who underwent surgery per year are registered from 4105 institutions. This system has covered more than 90% of general surgery procedures in all of Japan. Using the registry platform, we conducted a prospective data collection of prespecified variables that we saw as the determinants for selecting LTG over OTG, to enhance our propensity score model. The aim of this study was to evaluate the surgical outcomes and safety of LTG compared to OTG.

Methods

Data collection

NCD is a nationwide registry platform in association with the board certification system for surgery in Japan. The present study was a prospective observational cohort study using this NCD registration system. To extract representative patients from all over Japan, target institutions were selected via stratified random sampling. The locations of all institutions were distributed proportionally throughout Japan, and institution locations in each area were selected randomly. First, as shown in Fig. 1, Japan was divided into ten regions, and all municipalities were classified into three urban/rural levels: metropolitan areas (population ≥ 1 million; black circles in Fig. 1) [14], larger cities (≥ 100,000), and small cities (< 100,000). Second, we randomly sampled the facilities in each group stratified by hospital volume (number of surgical cases per year), hospital type (university hospital, specialized hospital, and “others”), and location (10 regions and three urban levels), so as to reflect the distribution of these factors in Japan as a whole. One hundred seventy-nine institutions were selected from this procedure, of which 169 ultimately agreed to participate in the study.

Fig. 1
figure 1

Ten geographical regions of Japan and twelve metropolitan areas. To select representative hospital from all over Japan, the stratified random sampling was performed. Adjusting for hospital volume, types and location, the prospective cohort was developed in the present study

From August 2014 to July 2015, consecutive patients of the participating institutions who underwent LTG and OTG were enrolled prospectively.

Propensity Score (PS) modeling

Expecting a difference in the baseline preoperative characteristics between the patients undergoing LTG and OTG, we planned a PS matching analysis for confounding adjustment. To model the optimal PS, our study team consisting of surgeons who were experts of endoscopic surgery, clinical epidemiologists, and biostatisticians identified the preoperative information related to the decision making by the surgeon as to whether open surgery or laparoscopic surgery would be performed. As a result, covariates for PS estimation included patient’s age, sex, body mass index (BMI), American Society of Anesthesiologist (ASA) physical status, comorbidity, tumor depth (T), node metastasis (N), and distant metastasis (M) as clinical diagnostic factors according to 7th edition of the American Joint Committee on Cancer TNM classification, histological type on biopsy, the history of preoperative treatment, and surgical factors including the presence of concomitant cholecystectomy/splenectomy, reconstruction approach. As not all data components were collected routinely in the NCD system, we added them as survey variables collected for the purposes of conducting this study.

During the 1-year period from August 2014 to July 2015, we identified 2494 patients undergoing total gastrectomy within our system who met the study enrollment criteria: 1569 undergoing OTG and 925 undergoing LTG. After PS matching, 512 patients each in the LTG and OTG groups were included in the final analyses.

Endpoints

The primary endpoints were postoperative morbidity and mortality. Secondary endpoints included length of operative time, amount of blood loss, the number of harvested lymph node, incidence of switches to open surgery from LTG, re-operation and re-admission, and length of postoperative hospital stay.

Statistical analysis

We tabulated the proportion of patients with the baseline characteristic variables listed above for those undergoing OTG versus LTG, using means and medians for continuous variables and proportions for categorical variables. After excluding a small portion of patients who had missing baseline characteristic factors, we modeled the PS for being treated laparoscopically using multivariable logistic regression models from the variables as listed previously, separately in 4 different yearly case-volume facility groups of < 10, 10–19, 20–29, and 30 and above. This was done as we suspected a different familiarity with procedure, and to balance the proportion of patients treated at different volume groups in the matched cohort. We conducted a greedy nearest neighbor matching within the volume groups with a caliper of 0.2 standard deviation of the logit (PS) at a 1:1 ratio without replacement, using macro by Coca-Perraillon [15], and combined the matched cases into one cohort. We assessed the difference in the distribution of the confounding factors before and after PS matching using standardized differences estimated using the macro by Yang and Dalton [16].

In the matched cohort, we compared the occurrences of the study endpoints between those undergoing LTG and OTG. Pearson’s χ 2 test or Fisher’s exact test was used as appropriate for the comparison of categorical variables, and Wilcoxon rank sum test was used for comparing continuous variables. All statistical tests were two-sided, and p values less than 0.05 were considered significant. All analyses were conducted using SAS 9.4 (SAS Institute Inc., Cary, NC).

Results

Patient characteristics

Patient baseline characteristics of each treatment group are shown in Table 1. The OTG group included a higher proportion of high-risk patients with an ASA physical status of 3 or above, or those with an emergent surgical indication. In addition, there were a considerably greater number of patients with advanced clinical staging in the OTG group, with more than half of the patients having ≥ T3 invasion, and lymph node metastasis. Preoperative chemotherapy and combined resection of other organs, such as the spleen and gallbladder, was much more common in OTG patients. Preoperative endoscopic resection was more common in the LTG group. There were no substantial differences in age, BMI, or histological findings of the biopsy specimen between the two groups. LTG was more frequently performed at high-volume hospitals rather than low-volume hospitals.

Table 1 Baseline characteristics before and after propensity score matching

We excluded a small number of patients with unknown TNM classification or with missing baseline variables, and modeled the PS in 2390 patients. The four propensity models had c-statics ranging from 0.84 to 0.86 showing good discrimination between the two treatment groups. After combining the matched cases into one cohort, all baseline variables included in the model were well balanced within the standardized difference below 0.1 (Table 1).

Morbidity and mortality

A comparison of primary endpoints between the two procedure groups is shown in Table 2. The overall morbidity was observed in 84 (16.4%) cases in the OTG and 54 (10.5%) cases in the LTG groups (p = 0.01). The incidence of anastomotic leakage was not significantly different between the two groups (OTG 6.1% vs. LTG 5.3%, p = 0.59). The incidence of pancreatic fistula grade B or above was also not significantly different between the two groups (OTG 3.7% vs. LTG 2.7%, p = 0.38). There were no significant differences between the two groups regarding surgical site infection, intraabdominal abscess, wound dehiscence, mechanical bowel obstruction, pneumonia, pulmonary embolism, and sepsis.

Table 2 Morbidity, mortality, and surgical outcomes

With respect to mortality, there were no significant differences in the 30-day and in-hospital mortality rates between the two groups (OTG 0.4% vs. LTG 0.2%, p = 0.56; OTG 0.4% vs. LTG 0.4%, p = 1.00, respectively).

Secondary endpoints

The LTG groups had significantly longer operating times (median, 352 vs. 254 min, p < 0.001) and significantly lower blood loss (median blood loss, 80 vs. 342 ml, p < 0.001) than the OTG group. A total of 7 (1.4%) patients in the LTG group needed to switch to open surgery caused by intraoperative accident. There was no significant difference in the incidence of reoperations within 30 days after initial operation between the two groups (OTG 2.9% vs. LTG 4.3%, p = 0.24). With regard to reoperations due to anastomotic leakage or drainage, there was also no significant difference between the two groups (OTG 2.0% vs. LTG 2.9%, p = 0.31). The number of harvested lymph node in the LTG group was slightly larger than that in the OTG group (median number, 39 vs. 37, p = 0.03).

The LTG group had significantly shorter postoperative stay than the OTG group (median, 13 vs. 14 days, p = 0.002). There was no significant difference in the incidence of re-admission within 30 days after initial discharge between the two groups (OTG 4.1% vs. LTG 2.3%, p = 0.11).

Discussion

In the present study, based on the data from a Japanese nationwide web-based database, we assessed the feasibility of LTG for gastric cancer with regard to the surgical outcomes. The overall morbidity and 30-day mortality rates in propensity score-matched LTG and OTG groups were 10.5, 0.2% and 16.4, 0.4%, respectively. We also have demonstrated the minimal invasiveness of LTG through the evaluations of secondary endpoints. These results from our nationwide database, which is most likely representative of the Japanese healthcare provision, support the usefulness of LTG for the treatment of gastric cancer.

There were several unique characteristics to our study design. First, as the data registered in the NCD are used for the surgical board certification in Japan, we believe the data registration to be highly accurate and with near-complete coverage of all the procedures performed at the facilities. Second, in addition to the variables routinely recorded in the current NCD system, we prospectively collected some additional variables for this particular research, i.e., variables associated with surgeon’s decision making for the allocation of open or laparoscopic surgery, such as preoperative TNM stage. Third, stratified random sampling was performed based on hospital capacity and regional/metropolitan level to recruit patients who are representative of Japan as a whole. The study design enabled us to compare LTG and OTG using a PS that has good clinical persuasiveness. In this cohort, LTG was more likely to be performed for patients with earlier stage lesions including T1 and N0 lesions compared to those undergoing OTG. This suggests that the indication of LTG for gastric cancer is being considered carefully by Japanese surgeons at this moment. Consequently, our matched population included predominantly early-stage cancer patients, and our results are mainly applicable to these patients.

Many studies have previously shown postoperative morbidity and mortality rates of gastrectomy for gastric cancer [8,9,10,11, 17, 18]. They found morbidity and mortality rates for open gastrectomy to be at 17.4–24.5% and 0.6–0.8%, respectively, with examples of independent risk factors including age, combined resection, BMI, and operation time [18,19,20,22]. Among these reports, however, few have mentioned these rates for total gastrectomy alone. In a recent study for developing a risk model for total gastrectomy using NCD data, the ASA score was one of the most important variables affecting morbidity and mortality [12]. In our study, even after baseline variables such as age, combined resection with spleen for lymph node dissection, BMI, and ASA included in the model were balanced, morbidity and mortality rates in LTG were favorable compared to that in OTG. Furthermore, re-admission and re-operation rates did not differ between the LTG and OTG groups. We believe that when collectively evaluated, these data support the safety, feasibility, and minimal invasiveness of LTG for gastric cancer compared to OTG.

Among the postoperative morbidities in total gastrectomy, anastomotic leakage is one of the main causes of operative or in-hospital mortality. Esophagojejunostomy in LTG is technically demanding, and previous retrospective studies have shown that the incidence of anastomotic leakage in LTG was high ranging from 0 to 7.4% [8,9,10, 17, 23]. A national survey conducted by the Japan Society of Endoscopic Surgery also showed the incidence of anastomotic leakage to be 6.1% [3]. In our study, the incidence of anastomotic leakage in LTG was 5.3% and that in OTG was 6.1%. While the assessment of different types of esophagojejunostomy was outside the scope of our study, recent development in the anastomotic procedure, both in regard to surgical techniques and surgical instruments which enable protection of deep organs in the cavity, may have played a role in lowering the incidence of postoperative morbidities in LTG cases [23,24,25,26]. Development of useful devices or establishment of standard techniques in anastomosis might lead to the expansion of the indication of LTG and shortening of its operation time in the near future.

The present study had several limitations. First, because the study was not an RCT, the comparisons of outcome incidences between the two groups may be confounded by the differences in the baseline characteristics of the patients. We tried to minimize the effect of confounding by carefully constructing the PS from variables that our study team predetermined to affect the choice of the surgical approach and by using a matching method for balancing the factors. However, residual confounding unmeasured factors may remain. Second, our dataset might include some heterogeneity in the surgical techniques used in the procedure. No data were collected on the details of the surgical techniques including the use of energy devices, the type of staplers for anastomosis, and 2 or 3 dimensional visions of laparoscopy, which may influence the results of the study to a certain degree. Lastly, data on patients’ long-term outcomes could not be obtained in the current study. More studies are needed to evaluate the comparative safety and effectiveness of the LTG vs. OTG on long-term clinical outcomes.

Conclusions

The results from our comprehensive nationally representative data analysis showed that LTG could be a safe procedure for gastric cancer compared to OTG. The indication for LTG should be evaluated thoroughly in the clinical setting.