Introduction

The presence of lymph node metastasis is an important prognostic factor for curable esophageal cancer (EC) [7, 13, 19]. Lymphatic spread of EC is highly variable and unpredictable because of the unique submucosal lymphatic drainage system of the esophagus [9]. The presence of lymph node involvement is generally associated with worse overall survival [24]. All positive lymph nodes should be removed together with the tumor to improve long-term survival. Extended lymph node resection might increase the incidence of postoperative recurrence and worsen the prognosis of EC patients [4]. Lymph node staging is also important for the decision to administer neoadjuvant therapy [23]. Therefore, lymph node involvement should be assessed before treatment. However, the detection accuracy of positive lymph nodes on preoperative CT still remains controversial, and the reported sensitivity, specificity and accuracy ranged between 37.3%-67.2%, 63.9%-96.4% and 85.8%-87.2%, respectively [12]. MRI has been shown to be more accurate in detecting positive lymph nodes in other cancers [6, 14, 20]. However, conventional MRI has limited image quality in the chest. Recently, contrast-enhanced StarVIBE has been applied in the detection of EC because of its high image quality [18], especially in patients who are unable to suspend respiration [2]. Meanwhile, T2-TSE-BLADE involves an acquisition scheme similar to the periodically rotated overlapping parallel lines with an enhanced reconstruction (BLADE) technique [17], and it may decrease motion artifacts in non-cooperative patients.

Radiomics is gaining momentum in cancer research [11]. High-throughput mining extracts quantitative image features from digitally encrypted medical images, and this is coupled with powerful image-based signatures that could potentially enhance precision diagnosis and treatment. Radiomic research recently revealed the potential of MRI to substantially improve the ability to detect or predict lymph node metastases [5, 11].

In this study, we aim to build and validate a MR radiomic-based model based on T2-TSE-BLADE and contrast-enhanced StarVIBE for predicting LN metastasis in preoperative EC patients.

Materials & method

Patients

This prospective study was approved by the institutional review board, and the patients’ informed consents were obtained. In the present study, 181 patients with EC were enrolled between April 2015 and September 2017. Pathological confirmation of lymph node metastasis status was obtained in all cases.

Inclusion criteria were: (1) patients with endoscopically biopsy-proven potentially resectable EC and T1/T2/T3/T4a staging by CT; (2) patients who received lymph node dissection within 7 days after the baseline MRI study; (3) patients who had pathologically confirmed LN status after surgery. Exclusion criteria were: (1) patients who were under 18 years of age; (2) patients who received treatment (radiotherapy or chemotherapy) before surgery; (3) patients who had received prior treatment in other institutions; (4) histological grade was not confirmed; (e) patients who could not tolerate the MR examination. The flowchart of patients included this study is shown in Fig. 1.

Fig. 1
figure 1

(a) Original contrast-enhanced StarVIBE image. (b) Segmented tumor on contrast-enhanced StarVIBE image. (c) Original T2-TSE-BLADE image. (d) Segmented tumor on T2-TSE-BLADE image

Image acquisition

All MR images were reconstructed with a standard kernel. These MR images were retrieved from the picture archiving and communication system (PACS) (Neusoft v5.5.60801).

MRI examination was performed on a 3-T MR scanner (MAGNETOM Skyra, Siemens Healthcare) with routine sequences, T2-TSE-BLADE and contrast-enhanced StarVIBE. The total scanning time was about 40 min. T2-TSE-BLADE was performed with the following parameters: TR/TE = 5000 ms/97 ms; voxel = 0.9 mm × 0.9 mm × 3.0 mm; FOV = 240 mm × 240 mm, scan time = 240 s-360 s. StarVIBE was performed for the whole chest during free breathing at 20 s post-contrast media administration with the following parameters: TR/TE = 3.98 ms/1.91 ms; FOV = 300 mm × 300 mm × 72 mm; resolution = 1.0 mm × 1.0 mm × 3.0 mm; flip angle = 12 degrees; radial views = 1659; acquisition time = 309 s. Gadopentetate dimeglumine [0.2 ml/kg of body weight, (Consun)] was injected at a rate of 2.5 ml/s by a MR-compatible automated double-tube high-pressure injector (Spectris Solaris EP, Medrad) immediately followed by an equal volume of normal saline solution to flush the tube.

Tumor segmentation

Manual segmentation of the EC was performed on each patient's MR images utilizing “ITK-SNAP” (www.itksnap.org), which is an open-source and free software application (Fig. 2). Two radiologists (reader 1 and 2), both with 5 years of chest radiology experience, carefully contoured the tumor on all T2-TSE-BLADE and StarVIBE images to generate two 3D segmentations of the entire tumor. A senior (reader 3) with > 15 years of chest radiology experience examined all segmented tumors and selected the best segmentation to include in the analysis. These regions of interests (ROIs) were performed for subsequent feature extraction for further analysis.

Fig. 2
figure 2

Flowchart of this study: (I) Original MR images: T2-TSE-BLADE and contrast-enhanced StarVIBE. (II) Segmentation was performed on both T2-TSE-BLADE and contrast-enhanced StarVIBE images to define the tumor region. (III) Radiomic features were extracted from the tumor region, including shape, first-order histogram, texture and wavelet group analysis. (IV) Several features were selected to build the radiomic signature using the least absolute shrinkage and selection operator (LASSO) method. (V) Finally, the classification ability of the radiomic signature was tested by the receiver-operating characteristics (ROC) curves in both the training and validation cohort

Radiomic feature calculation, selection, and signature building

Patients were allocated into a training and validation cohort depending on the date of surgery. MR images of the first half of this cohort (90 patients with surgery performed between April 2015 and September 2017) were set as the training cohort. The remaining half (91 patients with surgery performed by September 2017) were set as the validation cohort.

Radiomic features were calculated based on the segmentation results from the training cohort using a homemade program in the Matlab (Matlab 2014b). Features were categorized into three primary types: (1) 8 shapes and 1 size feature; (2) 14 first-order histogram statistics and (3) 116 texture-based features. The following procedure was to add five additional wavelet filters on MR images and re-calculate type 2 and type 3 features; hence, a total of 789 features of each tumor were obtained (9 + 14 + 116 + 5 * 130 = 789). For each MRI sequence, the 789 total extracted features covered the major feature pool in recent radiomic studies. The details of the radiomic feature calculation is shown in the Appendix [10]. Since two MR sequences were utilized, there were 1578 radiomic features in total for each patient.

The proper feature selection procedure was used to simplify the building model and avoid over-fitting issues. The “elastic net” approach was used to select the calculated features, which could be considered a combination the LASSO (least absolute shrinkage selection operator) and the ridge regression approaches. Ten-fold cross validation was used in the parameter tuning of the “elastic net.” For tuning coefficient λ and α, the criterion of minimum standard deviation and maximum AUC was followed, respectively.

The logistic regression model was exploited in the “elastic net” approach to build the radiomic signature for each patient. After the training procedure, a linear combination of the selected features was extracted. The sums of those linear combinations formed the radiomic signatures of each patient.

Statistics analysis

Statistical analysis was performed in R (version 3.3.0; http://www.Rproject.org). The package used is in Appendix Table S1. The statistical significance levels were all set as two sided at p < 0.05 in the current study. Interobserver reproducibility of radiomic feature extraction was assessed by inter-class correlation coefficients (ICCs). An ICC > 0.75 was considered good agreement. A kappa test was used to evaluate the differences between the features generated by reader 1 and 2.

A two-sample t-test was conducted for common comparisons of patients’ characteristics for continuous variables, and Fisher’s exact test and the χ2 test were used for categorical variables. The Mann-Whitney U test was adopted for testing the potential correlation of the radiomic signature and the LN status in both the training and validation cohorts.

Results

Patients characteristics

Table 1 shows the demographic statistics of patients in the training and validation cohorts. There were no significant differences between the training and validation cohorts in terms of gender, age, position of the tumor (upper, middle and lower parts of the thoracic esophagus), post-treatment T stage and post-treatment N stage.

Table 1 Demographic statistics of patients in the training and validation cohorts

Radiomic features

ICCs showed the excellence of 1578 extracting features from the two MRI sequences as assessed by the two radiologists, and the kappa value was 0.943. The features were selected with non-zero coefficients, based on the elastic net approach in the training cohort. As a result, 9 out of 1578 radiomic features were included. These were the included t2_length, t2_Sphericity, t2_ a2_GLRL_GLN_90, t2_hd_GLCM_CONTRAST_0, t2_vd_GLRL_RLN_0, dyn_a1_GLCM_PROBABILITY_90, dyn_a1_GLRL_SRE_135, dyn_hd_GLRL_HGRE_90 and dyn_dd_ENTROPY. These features included two shape and size features, six texture features and one wavelet filter feature. The parameter-tuning procedure of the regression model and the feature space reduction are illustrated in Fig. 3, and the name and description of the selected features are listed in Table 2. Distribution of nine features that could distinguish between positive and negative lymph node metastasis was analyzed by t-test. Of those, the five features (t2_length, t2_Sphericity, t2_hd_GLCM_CONTRAST_0, t2_vd_GLRL_RLN_0, dyn_hd_GLRL_HGRE_90) had a p value < 0.05.The p value is shown in Fig. 4.

Fig. 3
figure 3

Feature selection using the elastic net method with a logistic regression model. (a) Tuning parameter λ in the elastic net model and λ were selected under the minimum criteria. The vertical line was drawn at the value chosen according to 10-fold cross-validation, including nine optimized nonzero coefficients. (b) The model coefficient trend lines of the 1578 radiomic features. A coefficient profile plot was performed by coefficients against the L1 norm (inverse proportional to log λ)

Table 2 Selected features with descriptions
Fig. 4
figure 4

Distribution of five features that showed the frequency for both positive and negative lymph node metastasis

Radiomic signature discrimination

The radiomic signature was built by employing the selected features in the last section, which is the linear combination of the logistic regression model of those features. The radiomic signature’s discriminative power of the LN metastasis was assessed by two ROCs in the training and validation cohorts (Fig. 5). Radiomic scores (Rad scores) of EC patients in the training and validation cohorts were calculated through the elastic net model with selected features with their corresponding weights. Each patient’s Rad scores in both the training and validation cohorts are shown in Fig. 6.

Fig. 5
figure 5

ROCs were employed to assess the MRI radiomic signature discriminative performance of the LN metastasis in preoperative esophageal cancer patients. AUC in the training cohort with 0.821 (95% CI: 0.7042-0.9376, sensitivity = 68.0%, specificity = 92.3%) and AUC in the validation cohort with 0.762 (95% CI: 0.7127-0.812, sensitivity = 80.7%, specificity = 70.3%)

Fig. 6
figure 6

Rad score for each EC patient in the training cohort (a) and Rad score for each EC patient in the validation cohort (b)

Discussion

The present study showed that the proposed MRI radiomics-based model involving the radiomic signature and radiological observation factors has potential ability in predicting LN metastasis preoperatively in EC patients. These observations are based on high-quality T2-TSE-BLADE and contrast-enhanced StarVIBE.

Lymph node status is the single most important prognostic factor in EC [1, 21]. CT is the most commonly utilized imaging modality in staging patients with EC. However, the accuracy of CT in detecting lymph node metastasis in EC is still controversial, because detection of pathological lymph nodes on CT depends primarily on size criteria [12]. MRI has higher soft tissue resolution than CT, and a prior study demonstrated its superiority to CT in T staging for EC patients [18].

The present study showed the feasibility of the MR radiomic feature for predicting LN metastasis in EC patients. We extracted 1578 quantitative image features of tumors using both T2-TSE-BLADE and contrast-enhanced StarVIBE. These sequences can provide high image quality and anatomic details in EC with the ability to accurately delineate the different layers of the esophageal wall. Hence, both T2-TSE-BLADE and contrast-enhanced StarVIBE were feasible in texture analysis. The image features assessed included shape, first order histogram, texture and wavelet group analysis and the elastic net method with a logistic regression model that was employed to reduce dimensionality. The current study showed that a stable classification mode with a similar AUC value in both the primary and validation cohorts based on the MR images and radiomic features obtained from both T2-TSE-BLADE and contrast-enhanced StarVIBE are useful to differentiate metastatic from non-metastatic lymph nodes.

Gray-level histograms and texture features have been useful in feature extraction and in discriminating between benign and malignant lesions [3, 25]. Texture features analysis on MR imaging was also proven to discriminate between benign and malignant masses of the breast [16]. In the present study, nine radiomic features, which included two shape and size features, six texture features and one wavelet filter feature, were selected to build the MR model, which was useful in assessing lymph node status in EC, and 9 features of the 181 cases had a proper ratio for building a predicting model that could avoid overfitting. It is still challenging to identify other radiomic features related to lymph node status in EC patients. Features of t2_length and t2_Sphericity, which are shape and size features, are highly consistent with the radiologists’ experience, and they describe the external contour information of the tumor. The longer length and larger sphericity indicate more tumor invasions; hence, this leads to higher risk of LN metastasis. Although these two features can be captured subjectively, additional features were extracted from MR images of EC patients, and these can be quantified and statistically analyzed. These six texture features and one wavelet filter feature include t2_ a2_GLRL_GLN_90, t2_hd_GLCM_CONTRAST_0, t2_vd_GLRL_RLN_0, dyn_a1_GLCM_PROBABILITY_90, dyn_a1_GLRL_SRE_135, dyn_hd_GLRL_HGRE_90 and dyn dd_ENTROPY and mainly represent the texture complexity of tumors, which was highly associated with the tumors’ heterogeneity, and prognosis [15, 26].

There are some limitations to this study. First, it did not analyze multimodality medical images, especially CT images. Second, MR images did not include DWI, which may expand the feature pool and show more valuable radiomic features because the image quality of DWI at 3.0 T may be associated with significant artifacts, although DWI has shown powerful ability in differentiating benign and malignant lymph nodes in some cancers [8, 22]. Third, a larger sample size will improve the confidence and performance of this EC model. Moreover, a larger multicenter database combining genomic and radiomic information could potentially improve the confidence and performance of the current model.

In conclusion, our study showed that MR radiomic features have the potential to predict lymph node status in EC patients. They could be used clinically to assess lymph node status for EC patients preoperatively.