Introduction

Gastric cancer (GC) was the fifth most frequently diagnosed cancer and the fifth leading cause of cancer death in 2022 [1]. Precise staging, particularly regarding lymph node metastasis (LNM), is essential for treatment planning and prognosis [2,3,4]. LNM status dictates treatment options, such as contraindicating endoscopic mucosal resection for early GC and indicating neoadjuvant therapy for locally advanced cases [3, 4]. Additionally, LNM significantly impacts patient survival and disease progression [5,6,7]. Current preoperative diagnosis of LNM primarily relies on imaging-based assessment, with computed tomography (CT) being the recommended modality [8, 9]. However, the performance of CT in predicting LNM varies markedly [10,11,12]. Positron emission tomography has been used for staging of GC, but no significantly added benefit has been reported in detecting positive nodes in esophago-gastric cancer or GC [13, 14]. Functional magnetic resonance imaging, particularly diffusion weighted imaging (DWI), has shown additional value in the staging of GC [15, 16]. However, in mono-exponential model of DWI, water molecular motion is simulated to follow a Gaussian distribution which is not the case in heterogeneous cancerous tissue [17]. Emerging advanced DWI models, such as diffusion kurtosis imaging (DKI), stretched-exponential models (SEM), intravoxel incoherent motion (IVIM), the fractional order calculus (FROC) model and continuous-time random walk (CTRW) model are increasingly utilized to depict authentic water motion in vivo and tumor heterogeneity in the microenvironment [18,19,20,21,22,23,24], and offer new sets of imaging markers for cancers. Regarding GC, the true diffusion coefficient of IVIM is a useful index for evaluating LNM and lymphovascular invasion [19, 25]. The kurtosis of DKI is positively correlated with Ki-67 expression and useful for evaluating treatment response [18, 26]. The FROC model provides useful tools for noninvasive assessment of the Lauren classification [27]. These studies together highlight the promise of advanced DWI models in the stomach. We postulated that advanced DWI models with additional novel imaging markers can provide multiple perspectives into tumor characteristics, which may assist more accurate preoperative prediction of LNM among GC patients. Therefore, the aim of this study was to investigate the potential and added value of six DWI models (mono-exponential DWI, IVIM, SEM, DIK, FROC, and CTRW) for preoperative prediction of LNM in patients with resectable GC.

Methods

Patients

The present prospective study received approval from our hospital’s institutional review board and adhered to the principles of the Declaration of Helsinki. All participants provided written informed consent (NCT04028375). Between Nov 2022 and Nov 2023, patients diagnosed with gastric adenocarcinoma by endoscopic biopsy were enrolled following the inclusion criteria as: (a) had no history of prior anticancer treatment or any concurrent malignancy; (b) had completed gastric MRI scans before surgery; and (c) had resectable GC lesions (cT1–4a, N0-1, M0) assessed by the multidisciplinary tumor board (MDT) and scheduled direct radical gastrectomy. Patients were excluded if (a) postoperative pathology confirmed as mucinous adenocarcinomas because its mucinous nature may cause migrations of ADC values; (b) had the tumor diameter < 10 mm, which was insufficient to place a valid region of interest (ROI); or (c) had low-quality images (image score < 3). The flowchart for patient recruitment is displayed in Fig. 1. The framework of MDT for evaluating tumor resectability is shown in Figure S1, Supplementary Materials.

Fig. 1
figure 1

The flowchart of patient recruitment

MRI protocol

All participants underwent gastric MRI scans within one week before surgery (median: 3 days) using a 3.0 T MR scanner (MAGNETOM Prisma, Siemens Healthineers) equipped with an anterior 18-element body coil and an integrated posterior 32-element spine coil array. After 6 h of fasting, each patient was trained to breathe rhythmically to reduce unwanted respiratory motion artifacts, and was instructed to consume 500 mL of warm water 10 min in advance to distend the stomach. Thereafter, raceanisodamine hydrochloride (10 mg) was intramuscularly administered within 15 min before MRI acquisition. Table 1 presents the standard protocol with detailed acquisition parameters, including (a) 3D volumetric interpolated breath-hold examination (VIBE) axial T1WI. (b) Axial respiratory-triggered, fat-suppressed turbo spin echo (fs_TSE) T2WI. (c) Multi-b value DWI obtained using echo planar imaging-based the prototyped integrated specific slice dynamic Shim (iShim) sequence. With the aforementioned sequences, 2D multigradient echo images were initially acquired for each imaging slice. The orientation and field of view (FOV) for these images were adjusted based on the corresponding imaging slices. Subsequently, we distinguished fat and water by analyzing the difference in echo time between the first and last echoes. Following this, a phase difference image was computed from the two echoes, and 12 b values were applied within the range of 0–3000 s/mm2 (0, 25, 50, 75, 100, 200, 400, 800, 1200, 1600, 2400, 3000 s/mm2) [28]. The diffusion gradient parameters included a gradient amplitude (G) of 80 mT/m, diffusion gradient duration (δ) of 11.1 ms, time separation between two diffusion gradient lobes (Δ) of 23.9 ms, and a total acquisition time of 3 min and 40 s.

Table 1 MRI acquisition protocol

Diffusion models reconstruction

A domestic BoDiLab software was used to process the raw multi-b value DWI data to generate parametric maps by fitting six DWI models, including CTRW, DKI, FROC, IVIM, SEM, and conventional mono-exponential model. These sequences generated 14 parameters, including temporal diffusion heterogeneity (CTRW_α), spatial diffusion heterogeneity (CTRW_β), and anomalous diffusion coefficient (CTRW_D) from CTRW, diffusivity (DKI_D) and kurtosis (DKI_K) from DKI; the anomalous diffusion coefficient (FROC_D), intravoxel diffusion heterogeneity parameter (FROC_β), and spatial parameter (FROC_mµ) from FROC; the true diffusion coefficient (IVIM_D), the pseudodiffusion coefficient (IVIM_D*) and the perfusion fraction (IVIM_f) from IVIM; the water diffusion heterogeneity index (SEM_α) and distributed diffusion coefficient (SEM_DDC) of SEM; and the mono_ADC, respectively.

Postprocessing and image interpretation

Two board-certified radiologists with 10 and 14 years of experience in gastrointestinal (GI) radiology independently assessed the quality of MR images via the 5-point scale approach. Scores ranging from 1 to 5 indicate poor, mild, average, good and excellent tumor detection and anatomical detail display, respectively, and severe, massive, moderate, mild, and none for observed artifacts or image distortions. Images scored less than 3 by any of the radiologists were excluded from analysis. The raw data were imported to ITK-SNAP (v3.8.0, http://www.itksnap.org) for tumor segmentation using a 3D volume of interest (VOI) method. The two radiologists used T2WI as the reference and manually delineated the region of interest (ROI) along the outer boundary of tumor on the axial plane of the b = 800 s/mm2 images, and finally integrated all the ROIs into a 3D-VOI (Fig. 2). Thereafter, the data obtained from the six DWI models were uploaded as supplementary data, and the same VOI was duplicated and applied on each DWI model to generate relevant parametric maps (Fig. 3). Subsequently, the corresponding quantitative parameters were automatically computed using the “volume and statistics” function within the “segmentation” toolbar of ITK-SNAP. The interobserver agreement between the two readers’ measurements were performed, and if good agreements were achieved, the mean of their measurements were used for subsequent analysis. Regarding morphologic features, the two radiologists independently interpreted the MR images without knowing the pathological results. We determined tumor thickness based on the maximum diameter perpendicular to the longest axis plane [12]. Clinical staging in this study refers to the local-regional tumor (cT) and node staging (cN). Clinical node staging was classified into cN-positive or negative, described as positive or negative MRI-reported-LN status, following the newly proposed Node-RADS 1.0 scheme [29, 30] and patients were recorded as positive when regional nodes reached Node-RADS score ≥ 3 by any of the radiologist, vice versa. Specifically, cN-positive was determined on T2WI as the presence of a nodal morphologic abnormality (short-axis diameter ≥ 10 mm, round shape, clustering, irregular/spiculated nodal margins, and heterogeneous signal intensity); and on DWI as the presence of restricted signal intensity on b = 800 s/mm2 and hypointense on corresponding ADC maps. Clinical tumor staging refers to tumor infiltration depth (cT1 ~ 4a) in this study and were determined according to the 8th edition of the American Joint Commission on Cancer, or AJCC [8] by radiologists’ readings on T2WI, DWI and enhanced MRI [12, 14, 25]. Be specific, in this study, T1 tumors regard to invasion depth limited to mucosa or submucosa and present only mildly irregular mucosal abnormality; T2: tumoral infiltration of the muscularis propria which can be identified as focal wall-thickening with or without ulceration; T3: tumoral penetration of the subserosal fat without invasion of the visceral peritoneum or adjacent structures; and T4a: tumoral invasion of the serosa, peri-gastric fat infiltration or nodular bulging around the tumor can be clearly identified. The final diagnoses were determined by the consensus of the two radiologists, any disagreement was resolved by consulting to a third GI radiologist with 23 years of experience.

Fig. 2
figure 2

Diagrams of tumor segmentation process and relevant tumor display on conventional MR images. (A) Tumor segmentation along the outer boundary at the maximal layer on the axial plane; (B) Completed tumor segmentation occupying the whole tumor area on the axial plane; (C) Completed tumor segmentation on the coronal plane; (D) Completed tumor segmentation on the sagittal plane; (E) Fused 3D volumetric tumor segmentation; (F) Axial fat suppressed T2WI shows a mildly hyperintense tumor in the gastric body; (G) ADC map shows the tumor presents markedly hypointense appearance; (H) Axial contrast-enhanced images

Fig. 3
figure 3

Parametric maps derived from six diffusion-weighted MRI models in a LNM-positive patients with pathologically confirmed gastric adenocarcinoma by radical gastrectomy, pT3N1M0. (A) Mono_ADC grayscale map with tumor segmentation; (B) Mono_ADC jet map; (C) CTRW_α map; (D) CTRW_β map; (E) CTRW_D map; (F) DKI_D map; (G) DKI_K map; (H) FROC_β map; (I) FROC_D map; (J) FROC_ mµ map; (K) IVIM_D map; (L) IVIM_D* map; (M) IVIM_f map; (N) SEM_α map; (O) SEM_DDC map

Pathologic diagnosis

Post-surgical pathology from 4-µm-thick sections stained with hematoxylin and eosin (HE) were examined by a board-certified pathologist with 9 years of experience in GI pathology, following the diagnostic criteria of the 8th AJCC [8]. Tumor location, histodifferentiation grade, LNM, ulceration status, pathological tumor stage (pT), Lauren subtype, perineural invasion (PNI), and lymphovascular invasion (LVI) were analyzed.

Statistical analysis

The statistical analysis was conducted with Medcalc 18.0. Two-sided P < 0.05 indicated statistical significance. The intraclass correlation coefficient (ICC) and 95% confidence intervals (CIs) were used to evaluate inter-reader variability between the measurements of DWI models-derived parameters. The ICC values of 0.00–0.20, 0.21–0.40, 0.41–0.60, 0.61–0.80, and 0.81–1.00 indicated low, fair, moderate, good, and excellent correlations, respectively. Normally distributed data are presented as the mean ± standard deviation (X ± s); otherwise, the data are presented as the median (25th quartile, 75th quartile) [M (Q1, Q3)]. Continuous variables were compared using either the Mann–Whitney U test or Student’s t test. Categorical variables were analyzed using Fisher’s exact test or the chi-square test. Multivariable logistic regression was employed to identify independent predictors of LNM. Diagnostic performances were evaluated through the ROC analysis. The area under the curve (AUC), specificity, sensitivity, positive and negative predictive values (PPV/NPV) were calculated. Differences among AUCs were compared using the Delong test. The relationships between DWI parameters and LNM status were evaluated through Spearman’s rank correlation test, with r values of 0.00–0.20, 0.21–0.40, 0.41–0.60, 0.61–0.80, and 0.81–1.00 indicating low, fair, moderate, good, and excellent correlations, respectively.

Results

Clinicopathological characteristics

Fifty-eight patients were ultimately recruited, including 41 men and 17 women, aged 39–77 (average, 60.88 ± 10.48) years. After surgery, 42 patients were pathologically confirmed as LNM positive (72.4%), and 16 were LNM negative (27.6%). No evidences of differences were identified in age, sex, location, or ulceration between the two groups (all P > 0.05). As depicted in Table 2, significant differences were observed in terms of histodifferentiation, pT, Lauren subtype, LVI, and PNI. Specifically, compared to the LNM-negative group, the LNM-positive group contained more lesions with poor histodifferentiation (31 vs. 7), the diffuse Lauren subtype (28 vs. 5), advanced pT3-4a lesions (30 vs. 4), positive LVI (37 vs. 3), and positive PNI (31 vs. 4).

Table 2 Comparison of clinicopathological characteristics between LNM negative and LNM positive groups

Comparisons of MRI parameters

The values of DWI models-derived parameters showed good to perfect inter-reader variability, ranging from ICC = 0.815 to ICC = 0.981 (Table S1, Supplementary material).

As shown in Fig. 4; Table 3, the mean tumor thickness was 20.20 ± 6.49 mm in the LNM-positive group, significantly larger than those in the LNM-negative group (t = 2.345, P = 0.023). Compared with the LNM-negative group, the mean values of CTRW_D, DKI_D, FROC_D, IVIM_D*, and IVIM_f in the LNM-positive group decreased, whereas the mean value of DKI_K significantly increased (all P < 0.05). In addition, cT and MRI-reported-LN status were significantly different between the two groups (all P < 0.05). No evidences of differences were observed in CTRW_α, CTRW_β, FROC_β, FROC_mµ, IVIM_D, SEM_α or SEM_DDC between the two groups.

Fig. 4
figure 4

Box and whisker plots for the comparison of mean values of significant MRI parameters between LNM negative group and LNM positive group. (A) Thickness; (B) CTRW_D; (C) DKI_D, (D) DKI_K; (E) FROC_D; (F) IVIM_D*; (G) IVIM_f; (H) IVIM_ADC

Table 3 Comparison of MRI parameters between LNM negative and LNM positive groups

Independent predictors

Univariable regression analysis showed that cT, MRI-reported-LN, thickness, CTRW_D, DKI_D, DKI_K, FROC_D, IVIM_D*, IVIM_f, Mono_ADC were significant for predicting LNM status. Multivariable logistic regression analysis demonstrated that cT and CTRW_D were independent predictors and used to establish a combined parameter (odds ratio = 3.241, P = 0.037, for cT, odds ratio = 0.948, P = 0.020, for CTRW_D; Table 4). The Hosmer and Lemeshow test showed good consistency between the probability of LNM positivity predicted by the combined parameter and the actual incidence of LNM positivity (χ = 4.648, P = 0.794).

Table 4 Univariable and multivariable regression analysis of MRI parameters for LNM prediction

Diagnostic performance

As shown in Fig. 5, the ROC analysis results revealed the combined parameter yielded the highest AUC of 0.930 (95%CI: 0.831, 0.980), significantly higher than the AUCs of cT and CTRW_D (Z = 2.590, P = 0.010 vs. cT; Z = 2.149, P = 0.032 vs. CTRW_D). The overall accuracy of MRI-reported-LN was 65.52% with its AUC reached 0.723; compared to surgical pathology, MRI-reported-LN correctly diagnosed 14 true LNM-negative and 24 true LNM-positive. The combination of CTRW_D and MRI-reported-LN showed a significantly improved performance with AUC being 0.869. Specifically, MRI-reported-LN diagnosed LNM-negative in 32 patients, among whom 18 were pathologically confirmed as LNM-positive; when using > cT2 and CTRW_D ≤ 2.467 × 10− 3/mm2 as the diagnostic criteria, 17 and 23 were categorized as LNM-positive rather than negative. Compared to pathologic results, cT and CTRW correctly diagnosed 21 and 24 out of these 32 patients with their accuracies reaching 65.63% (21/32) and 75% (24/32), respectively. Furthermore, model-by-model comparison revealed that DWI models demonstrated non-significantly different performances. No evidences of significant differences were identified among AUCs of individual parameters (all P>0.05, Table S2 in Supplementary material). The combined parameter exhibited significantly higher performances than any other parameters and models as shown in Table 5.

Fig. 5
figure 5

Comparison of performances of MRI parameters for predicting LNM in gastric cancer patients. The combined parameter (cT + CTRW_D, the solid purple line) presents the highest AUC of 0.930

Table 5 Comparison among diagnostic efficacies of MRI parameters in predicting LNM positivity in resectable gastric cancer

Correlations between significant MRI parameters and LNM status

Tumor thickness, cT, MRI-reported-LN status, DKI_K, and the combined parameter were significantly and positively correlated with the LNM status. Inverse correlations were identified between FROC_D, IVIM_D*, IVIM_f, CTRW_D, DKI_D, Mono_ADC, and the LNM status (Table S3, Supplementary material).

Discussion

This work investigated the potential of six advanced DWI models for preoperative prediction of LNM in resectable GC patients. ADC, CTRW_D, DKI_D, DKI_K, FROC_D, IVIM_D*, and IVIM_f demonstrated clinically acceptable performance for LNM prediction. Furthermore, CTRW_D and cT were independent predictors and used to establish a combined parameter, which further improved the performance. These DWI models offer new sets of imaging markers for depicting tumor characteristics and can serve as useful tools for quantitative prediction of LNM. The mono-exponential diffusion model-derived ADC is useful for distinguishing LN status in GC [31]. The ADC values in the LNM-positive group were significantly lower than those in the LNM-negative group, and the ADC alone demonstrated a similar performance to that reported previously (0.767 vs. 0.788). Shi B et al. reported that DKI_D and DKI_K were correlated with the tumor necrosis fraction after TGF-β1 inhibition in mouse GC models [32]. A lower DKI_D is suggestive of higher risk of LNM and advanced stage for endometrial carcinoma [33]. DKI_K holds the potential to capture microstructural complexity and heterogeneity [26, 34]. Therefore, DKI_D together with DKI-K can provide insights into tumor aggressiveness and heterogeneity, which may facilitate preoperative identification of LNM. We found significantly decreased DKI_D and increased DKI_K values in the lesions of LNM-positive group, suggesting the higher levels of heterogeneity in these lesions. IVIM_f is effective for distinguishing different node stages in GC [19], and a prior study has proved that IVIM_f is a useful marker for assessing LVI [25]. GC metastasis to lymph nodes is a complicated process involving disruption of the lymphatic network and neovascularization [35, 36], which may result in an increase in the f value in LNM-positive GC lesions. Based on the previous evidences, IVIM_D* is not useful for characterizing or staging GC [19, 25]. However, we found a significant reduction of D* value in the LNM-positive group. According to the IVIM theory, D* represents microcirculation perfusion of tissues [35]. The reduced D* value may indicate that GC lesions with positive LNM have less perfusion than those without LNM; nevertheless, this was a preliminary result that needs further validation. CTRW was firstly applied in the brain for characterizing both degenerative diseases and gliomas [37, 38] and has been utilized for solid tumors such as breast cancer recently [39]; nevertheless, no study has ever applied CTRW in the stomach. Our preliminary results showed that GC lesions with positive LNM had lower CTRW_D values than those without LNM. Moreover, CTRW_D exhibited superior performance compared to the other parameters and emerged as one of the risk factors of LNM. Lower CTRW_D were closely associated with more advanced disease stages [38, 39]. Tumors with advanced stage are of high cellularity, leading to reduced extracellular space and lower CTRW_D values [39]. However, in this study, we failed to prove the usefulness of the α and β parameters for predicting LNM, which were contrary to a previous report [36]. This discrepancy may attribute to the variations in sample size, different types of cancers; the choice of b values for sequence acquisition may be another possible reason. The FROC has been applied to classify the Lauren subtype in GC [27]. The derived β, D, and mµ parameters exhibited favorable performance for distinguishing the intestinal-and-mixed subtypes from the diffuse subtypes. Besides, previous studies have proved close and inverse likelihood between D values and significant prognostic histologic markers of LVI and LNM [25, 31]; consistently, negative correlations were identified between D values and LNM status in this study. β is related to intravoxel diffusion heterogeneity, and mµ is linked to the diffusion mean free length and cellularity [27]. Theoretically, these factors together reflect tumor cellularity heterogeneity and may vary between patients with different LNM status. However, no significant differences in β or mµ were observed in this study. The possible reasons may due to the small sample size and b values adopted for DWI acquisition. Regarding morphologic MRI features, cT overweighed others and were a predictor of LNM. cT is the standard of care in clinical practice and closely associated with LNM [12, 19]. We found that cT and CTRW_D have overweighted MRI-reported-LN and exhibited added value in MRI-reported-LN negative cohort. Besides, combining CTRW_D with MRI-reported-LN exhibited significantly better performance compared to MRI-reported-LN alone, suggesting quantitative CTRW_D has incremental benefit on this issue. Furthermore, the combination of cT and CTRW_D demonstrated a more favorable performance, underscoring the feasibility and superiority of integrating both qualitative and quantitative parameters from a single MRI scan for GC patients. Certain limitations should be noted in this study. First, the sample size was small, and the results were preliminary; therefore, further researches are needed. Second, mixed adenocarcinomas were eliminated from the analysis since they may cause migration and bias in ADC values calculations. Third, the standard scanning specification for multi-b value DWI lacks global consensus, our results were generated from the sequence with the maximal b-value at 3000 s/mm2, which may not be applicable to other cancers. In conclusion, the proposed six DWI models offer new sets of quantitative imaging markers for preoperative prediction of LNM in resectable GC, with CTRW_D appearing as the most beneficial marker. The combination of CTRW_D and cT demonstrates significant added value and further enhances the predictive performance.