Background

Gastric cancer is the one of the leading cause of cancer-related deaths in the world [1,2,3]. Locally advanced gastric cancers (LAGC) are with high risk of lymph node (LN) metastasis and poor prognosis. The National Comprehensive Cancer Network (NCCN) proposes neoadjuvant chemotherapy (NAC) to induce tumor regression and downstaging before the surgery [4, 5]. Lymph node metastasis is a significant predictor of survival in gastric cancer [6]. Although the N stages will be done on histopathology by the nodal dissection of the radical gastrectomy after NAC, more accurate predicting nodal involvement on imaging may help surgeons detect suspicious lymph node metastases that are beyond the scope of the routine D2 lymph node dissection. Computed tomography features have the potential to predict pathological complete response (ypT0N0) after NAC in patients with LAGC [7]. Patients who had a full pathological response (pT0N0 stage) following NAC had a considerably higher chance of survival [8].

CT is an approach for preoperative staging and response evaluation of gastric cancer (GC) [7, 9,10,11]. CT has been a standard practice for preoperative N staging, where the identification of larger and round-shaped lymph nodes serves as an indicator of metastasis. Previous imaging research studies have found that regional lymph nodes with an increased short-axis diameter greater than 10 millimeters, necrosis, or numerous clusters on imaging are suspicious for metastasis [6, 9, 12, 13]. Nevertheless, the diagnostic accuracy of computed tomography (CT) for detecting lymph node metastasis (LNM) ranges from 50 to 70% on baseline diagnosis [14,15,16,17,18]. FDG-PET has significantly lower sensitivity compared with CT in the detection of local lymph node involvement (56% vs. 78%), although FDG-PET has improved specificity (92% vs. 62%) [19]. Furthermore, the accuracy of restaging by computed tomography (CT) following neoadjuvant chemotherapy (NAC) remains limited in patients diagnosed with LAGC. After chemotherapy, the accuracy of CT was found to be only 37% for N staging [20]. Radiomics is a new method for quantitatively diagnosing cancer by first converting medical images into radiomics features and then choosing key aspects as the signature [21]. In recent years, there has been an increasing utilization of CT-based radiomics in the realm of stage prediction and therapy selection for patients diagnosed with advanced gastric cancer (AGC) [22,23,24,25].

Thus, present study aimed to create a radiomics model for predicting the N0 or N + stages in LAGC following NAC.

Methods

Patients

This retrospective study was authorized and approved Ethics Committee of our hospital, and informed consent was waived due to retrospective nature. Patients were included based on the following inclusion criteria: (a) patients with histologically and CT-confirmed LAGC (cT2-4aNxM0); (b) patients who underwent radical gastrectomy and D2 lymph node dissection following two to four cycles of NAC; and (c) CT performed at baseline and less than two weeks prior to surgery. Patients were excluded based on the following criteria: (i) previous abdominal malignancies or inflammatory diseases; (ii) difficulty of tumor segmenting due to inadequate gastric distention; (iii) the adipose surrounding the stomach was almost entirely eradicated by the gastric cancer; and (iv) baseline or pre-operative CT examinations were performed at other hospitals.

We retrospectively reviewed data for 158 patients with gastric cancer who underwent surgery in two hospitals. The training cohort (125 patients) was included from Peking university cancer hospital. The testing cohort (33 patients) was included from our hospital. These patients in the training cohort were consecutively treated from January 1, 2019, to December 31, 2022. The NAC regimens for patients in training cohort included SOX (75 cases, S-1 plus oxaliplatin), XELOX (26 cases, oxaliplatin plus capecitabine), and mFOLFOX7 (24 cases, modified regimen of leucovorin, fluorouracil, and oxaliplatin). Patients in testing cohort were consecutively treated from August 1, 2018, to December 31, 2021. The NAC regimens for patients in testing cohort included SOX (23 cases, S-1 plus oxaliplatin) and XELOX (10 cases, oxaliplatin plus capecitabine). The patients and tumor features have been summarized in Table 1. Figure 1 displays a flowchart outlining the study design employed for the creation and testing of the radiomics model.

Table 1 The clinicopathological features of patients included in this study
Fig. 1
figure 1

Flowchart of patient selection

CT Protocol

CT examinations was performed using the 64–detector row CT scanner (LightSpeed 64; GE Healthcare, Milwaukee, Wis). Each patient had been fasting for more than 8 h prior to the CT examination. To enable gastric distention and reduce gastric motility, the patients received 8 g gas-producing crystals orally before the examination. Upper abdominal unenhanced CT scans from the diaphragmatic domes to 2 cm below the lower margin of the air-distended gastric body were acquired with a collimation of 0.625 mm, 120–140 kVp, and 300–350 mAs. Subsequently, a total of 100 ml of non-ionic contrast medium (Ultravist; Schering, Berlin, Germany) was administered intravenously through an 18-gauge angiographic catheter inserted into an antecubital vein at 3 mL/sec by using an automatic injector. Contrast-enhanced CT scans were performed in the arterial phase (30 s) and in the portal venous phase (70 s). The portal venous phase was used to evaluate lymph node status. The portal venous phase axial CT images were reconstructed with a 5-mm section thickness and a 5-mm reconstruction interval for clinical interpretation and with a 0.625-mm section thickness for MPR reconstruction.

Tumor Region Segmentation

The regions of interest (ROIs) of tumor and two largest short-axis size lymph nodes were manually delineated on arterial and portal venous phase CT images by the radiologist (J.S and Z.L.W., with 10 and 12 years of experience in abdominal CT, respectively) in each center. For each CT phase, only one slice with the largest tumor area was chosen visually by the radiologist using ITK-SNAP software (version 3.6.0; http://www.itksnap.org). Due to the diversity and irregularity of gastric cancer morphology, it is time-consuming to manually draw the ROI of gastric cancer at all slices. Regarding lymph nodes in RECIST criteria, a maximum of two lesions from each organ are selected as representatives [26]. If too many lymph nodes are selected, some small lymph nodes may disappear after chemotherapy, making it impossible to draw the ROIs. Thus, the ROI of slices with the largest and second largest lymph node areas were manually delineated on portal venous phase CT images.

Radiomic Feature Extraction

For each patient, the ROIs of tumor and lymph nodes were delineated in pretreatment and post-treatment CT images respectively, including tumor region in the artery phase, tumor region and 2 typical LNs in the venous phase. A total of 488 features were extracted from each ROI by PyRadiomics (Version 2.1.2). Totally, 488 × 8 = 3904 features were extracted.

Radiomic Feature Selection

Feature selection was performed according to the following steps:

  1. 1.

    Features from 2 LNs were averaged. A total of 488 pretreatment LN features and 488 post-treatment LN features remained. Totally 2928 features were preserved for next step.

  2. 2.

    In the training group, T-test was used to remove the features that show insignificant difference (P > 0.05) between ypN0 and ypN + groups. After this step, 918 features were preserved for next step. T value for each feature was saved for next step.

  3. 3.

    Correlations were calculated between each two features to avoid redundancy. If the absolute value of the Pearson correlation coefficients was larger than 0.5, the feature with the smaller T value was removed. After this step, 34 features were preserved for next step.

  4. 4.

    Logistic regression with least absolute shrinkage and selection operator (LASSO) was performed in the training group for both feature reduction and classification. Five-fold cross-validation was used to determine the hyperparameter that determines feature number. Finally, logistic regression returns a radiomics score by linearly combining the selected 13 features.

CT Image Analysis

The CT image analysis was performed by two radiologists with more than 10 years of abdominal diagnostic experience in the two hospitals using the workstation (Carestream Health, Shanghai, China). They reviewed the baseline and post-NAC CT images. Regional lymph nodes of gastric cancer were considered as metastases if they were solitary or separate nodes 8 mm or greater in short-axis diameter with marked enhancement, which was defined as attenuation greater than  85 HU in the post-contrast portal venous phase. Lymph nodes were also  considered to represent metastases if they were cluster nodes or solitary nodes with associated reticular strands or necrosis [12].

Statistical Analysis

All the data collected in this study were analyzed using SPSS 22.0 software. Normally distributed measurement data were expressed as mean ± standard deviation (SD), while non-normally distributed measurement data were expressed as median (interquartil range), and the comparisons were examined by Student t-test and Mann-Whitney test (non-parametric distribution). The categorical data were expressed as (%), and the differences between the two groups were examined by chi-square analysis or Fisher’s exact test. < 0.05 was considered statistically significant.

Results

After the radiomics feature selection process, 13 features (wavelet-HH_glcm_MaximumProbability, wavelet-HH_glszm_SmallAreaEmphasis, wavelet-HL_glszm_GrayLevelVariance, wavelet-LH_firstorder_Kurtosis, wavelet-HH_glszm_SmallAreaHighGrayLevelEmphasis, wavelet-HL_firstorder_Skewness, wavelet-LH_firstorder_Kurtosis, original_glszm_Zone%, wavelet-HL_gldm_DependenceVariance, wavelet-LL_firstorder_Kurtosis, wavelet-HL_glszm_LowGrayLevelZoneEmphasis, wavelet-LH_gldm_DependenceVariance, and wavelet-LH_glszm_Zone%) were selected by the linear combination with the radiomics score (Table 2).

Table 2 The name, ROI, and the weight of the selected 13 features, whose linear combination contributes the radiomics score

The radiomics model based on pre- and post-neoadjuvant chemotherapy produced good results in both the training and testing cohorts (AUC 0.846 and 0.843, respectively) (Fig. 2). The sensitivity, specificity, positive predictive value, and negative predictive value in the training cohort were 81.3%, 77.8%, 86.7%, and 70.0%, respectively, and 84.0%, 75.0%, 91.3%, and 60.0% in the testing cohort (Table 3).

Table 3 Statistical values of the prediction lymph node metastasis by radiomics score
Fig. 2
figure 2

The areas under the ROC curve of the raadiomics model for predicting lymph node metastasis were 0.846 in training group and 0.843 in testing group

The results of the radiologists to diagnose a patient with or without lymph node metastasis by traditional CT morphological findings were counted. The AUC, sensitivity, specificity, positive predictive value, and negative predictive value of baseline CT imaging were 0.561, 70.0%, 42.2%, 68.3%, nd 44.2% in the training cohort and 0.525, 80.0%, 25.0%, 71.4%, and 28.6% in the testing cohort, respectively (Table 4). By assessing preoperative CT, the AUC, sensitivity, specificity, positive predictive value, and negative predictive value were 0.542, 46.3%, 62.2%, 68.5%, and 39.4% in the training cohort and 0.533, 48.0%, 62.5%, 80.0%, and 27.8% in the testing cohort, respectively (Table 5). The P value for statistically comparisons between the AUC of radiomics model and radiologists using baseline CT imaging were < 0.001 and 0.017 respectively in the training and testing cohorts.

Table 4 Statistical values of the prediction lymph node metastasis by baseline CT
Table 5 Statistical values of the prediction lymph node metastasis by preoperative CT

Discussion

Our study was a collaborative effort between two centers, with the aim of predicting the presence or absence of LNM in LAGC following NAC. Our radiomics model demonstrated great predictability and reproducibility across centers. Restaging gastric cancer using CT after NAC was thought to be incorrect and unreliable [27]. Our radiomics model, in contrast, outperformed the distinguish between N0 vs. N + stages on a large scale. Our results showed that the model was more effective in diagnosing patients with or without lymph node metastasis than traditional CT imaging. For example, one enlarged lymph node was diagnosed as metastasis by radiologist at baseline CT, and this node shrank after neoadjuvant chemotherapy. It is difficult to determine whether the tumor is still present in this reduced lymph node using CT images (Fig. 3). Conversely, some inflammatory lymph nodes will appear to be enlarged in size, and when the short diameter is greater than 8 mm and with high enhancement, it will be diagnosed as metastasis by the radiologists (Fig. 4). Our radiomics model has significantly higher specificity compared with traditional CT imaging diagnosis and can correct some overstaging of lymph node cases. In recent years, the application of immunotherapy in gastric cancer has been more and more accepted all over the world. The combination immunotherapy performed a higher pathological complete response (pCR) rate than that of chemotherapy alone. A phase II single-arm study of SOX chemotherapy plus anti-PD-1 immunotherapy regimen for locally advanced gastric cancer showed that the postoperative pCR rate reached 26.9% [28]. Our radiomics model has the potential to be applied to the diagnosis of pathological complete response in the future.

Fig. 3
figure 3

A–D A 59-year-old male patient with gastric adenocarcinoma. He received radical gastrectomy after 3 cycles of neoadjuvant chemotherapy. Pathological results showed that there was no tumor cell residue in the stomach. Some changes after chemotherapy, including lymphocyte and foam cell infiltration and interstitial fibrosis, were observed in the gastric surgical specimen. No lymph node metastasis was detected. The pathological stage was ypT0N0 stage. A An ulcerative tumor of the gastric cardia was found on axial CT with significant enhancement in baseline examination. B After neoadjuvant chemotherapy, the thickness of gastric wall in the cardia was reduced. C A 36*25 mm enlarged lymph node (arrow) with central necrosis was detected at the lesser curvature area of stomach. This node was diagnosed as metastatic by the radiologist. D This lymph node was significantly reduced after chemotherapy with the size of 7*5 mm. The radiomics model and radiologist both wrongly diagnosed the patient as lymph node metastasis (N + stage) after chemotherapy

Fig. 4
figure 4

A–D A 62-year-old female patient with adenocarcinoma of antrum of stomach. Postoperative pathological stage was ypT3N0 stage. No lymph node metastasis was detected in pathological results. A The thickened gastric wall in the antrum was found on axial CT in baseline examination. B After neoadjuvant chemotherapy, gastric wall thickening, and enhancement decreased. C Three enlarged lymph nodes with cluster distribution (arrow) were detected at the lesser curvature area of stomach. The largest lymph nodes are 13*10 mm in size. Radiologist diagnosed the patient with lymph node metastasis. D These lymph nodes shrank slightly after chemotherapy, with the largest node shrinking to 10*8 mm. According to the diagnostic criteria of CT imaging, these three lymph nodes were diagnosed as metastases by the radiologist after chemotherapy. However, the pathological stage was N0 stage. The radiomics model correctly diagnosed the patient as stage N0

Compared to previous investigations on the use of imaging model to predict lymph node metastasis in gastric cancer [20, 22], this study was carried out on tumor ROIs from multiphase CT images of baseline and pre-operative CT scans rather than on one ROI from a single CT phase. In addition, we added image ROI data of two enlarged lymph nodes to the model. The imaging information of lymph node itself plays a complementary role in the diagnosis of N0. In medical terms, this makes radiomics models predicting lymph node metastasis more explicable. Using radiology features to predict pathological complete response after NAC is a potential component of treatment strategy for patients with LAGC. Those patients who had a pathological complete response after NAC would get significantly better survival outcomes [8].

In this study, 2D features were extracted from the ROI delineated on the largest slice. Compared with 3D features, 2D features are insensitive to the anisotropic scanning. In both training and testing groups, the intra-slice resolution is around 0.7 mm×0.7 mm, but the inter-slice resolution is around 5.0 mm. If 3D features are used, isotropic resampling is required. However, the influence from resampling still needs investigation. In contrary, 2D features are more robust, and resampling is not needed. The authors selected 13 features in the linear model, and the 2 features with largest weight are post-treatment Zone% feature in venous phase and post-treatment low gray level zone emphasis in venous phase. Both of these two features are related with the percentage of subregions in tumor ROI, especially with the percentage of low-signal subregions. Although the weight of other features are relatively less, they also reflects the changes in the tumor and lymph nodes before and after chemotherapy from the perspective of the image texture features. But in current studies, it is difficult to explain the relationship between each feature and clinical significance from a medical perspective. The negative weights show that both features are negatively related with the probability of LN metastasis. Probably, the high percentage of low-signal subregions is caused by an increase of necrosis or fibrosis after NAC, which is an indicator of good response to NAC.

sOur study had some limitations. First, this is a retrospective research, which might be with selection bias and small sample size, though it is a two-center analysis. Second, we built the model using single-layer images of the maximum length diameter of the tumor and lymph nodes, which may not represent all informations about the tumor due to the heterogeneity within the tumor. In future studies, further modeling studies with tumor and lymph node volume image should be considered. Due to the large workload of repeatedly delineating the ROI of gastric cancer, intrareader/interreader reproducibility of radiomics features were not performed in this study.

In conclusion, we demonstrated that radiomics models using baseline and restaging CT offered good predictive value for the presence of lymph node metastasis after NAC. The model had the potential to be applied to the prediction of pathological complete response in the future.