Introduction

Lung cancer is the leading cause of cancer-related death [1]. At present, surgical resection offers the best opportunity for long-term survival and cure in patients with lung cancer [2]. However, even after curative resection, 30–55% of patients with lung cancer develop recurrence and die of the disease [3]. The prognosis of patients with lung cancer is predicted largely based on pathological stage. Although pathological staging is simple and easy to use, there are limitations to stratifying patients precisely. Patients at the same stage can exhibit wide variations in their incidence of recurrence and survival after curative resection. For guidance of adjuvant therapies, there is a need to find complementary parameters that can help to predict the prognosis more accurately in patients with resectable disease [4].

Radiomics has attracted attention as a promising prognostic tool for the prediction of patient survival [5,6,7]. Radiomics extracts and uses large numbers of quantitative imaging features, with high-throughput. Recently Aerts et al showed that radiomics has prognostic power by capturing intratumour heterogeneity in lung cancer, and is associated with underlying gene-expression patterns [8]. However, as radiomic approaches are performed using routine CT scans such as non-contrast or contrast-enhanced images, tumour perfusion and angiogenesis, which are known to be prognostic factors [9,10,11], cannot be assessed properly.

In this context, dual-energy CT (DECT) enables selective visualisation and quantification of iodine enhancement at certain time points during data acquisition, which can provide additional information on tumour perfusion. Because an iodine map portraying the distribution of iodine in the tissue correlates strongly with blood volume and vascular density of a lesion, DECT can be used as an imaging surrogate for the assessment of tumour angiogenesis [12,13,14]. Kim et al demonstrated that DECT is practically useful for evaluating the net iodine enhancement of tumours and for evaluating tumour responses to anti-angiogenic treatment, which are reflected by perfusion decreases [13]. Furthermore, according to Bae et al’s study, quantitative radiomic values derived from DECT, including uniformity and 97.5th percentile attenuation, showed potential for prediction of pathological invasiveness and prognosis in adenocarcinoma of the lung [14].

We hypothesised that radiomics based on the iodine overlay map could provide additional prognostic information for better stratification of resectable lung cancer. Thus, we performed this study to investigate whether radiomics on iodine overlay maps from DECT can predict survival outcome in patients with resectable lung cancer.

Materials and methods

Study population

Our institutional review board approved this retrospective cohort study, and informed consent was waived. Retrospective review of a database at our institution identified 131 consecutive patients who underwent DECT scans for staging workups of lung cancer at the time of diagnosis, from October 2010 to December 2012. As our study targeted patients with resectable lung cancer, 38 out of 131 patients were excluded from the study for the following reasons: 14 patients had ground-glass nodules, nine patients had stage IV disease (pleural seeding, n = 6; bone and dural metastasis, n = 1; lung metastasis, n = 1; and lung and adrenal metastasis, n =1), 11 patients underwent stereotactic body radiation therapy rather than surgery, two patients were lost to follow-up before treatment, and two patients had double primary cancers that could have affected overall survival (Fig. 1).

Fig. 1
figure 1

Schematic flow chart of patient inclusion criteria

A total of 93 patients were finally included in the study (M:F = 66:27, mean age 64.7 ± 9.0 years, range 37–84). In each patient, the primary tumour was visible on the staging CT, with distinguishable tumour margin and histopathological type of tumour, and pathological staging was confirmed through surgery (Table 1).

Table 1 Patient characteristics

The overall tumour stage of each patient was derived from pathological staging (local-regional) confirmed by surgery, CT and positron emission tomography (PET)/CT, based on the 7th lung cancer TNM classification and staging system of the American Joint Committee on Cancer [15]. In addition to pre-operative CT imaging, maximal standard uptake values (SUVmax) from PET were also collected for the patients for whom it was available (n = 84).

The primary outcomes were disease-free survival (DFS) and overall survival (OS). DFS was defined as the time from the date of surgery until either the date of relapse (event), which refers to local tumour recurrence or distant metastasis, or death, or until the date that the patient was last known to be free of relapse (censored). OS was calculated from the date of surgery until either death from any cause (event) or until the date that the patient was last known to be alive (censored). The survival state of the patients and the date of death were obtained from the Korean national demographic database. The median follow-up was 60.4 months (range 0.7–76.5).

Imaging acquisition – dual-energy perfusion computed tomography

The initial CT scans were obtained within 6 weeks prior to the surgery (interval between perfusion CT and operation: median 12 days, range 0–42). All patients underwent CT examination using a CT system (SOMATOM Definition; Siemens Healthcare, Forchheim, Germany) in the dual-energy mode (80 and 140 kV). Two sets of spiral CT data were acquired with a collimation of 14 x 1.2 mm, a pitch of 0.5 and a rotation time of 0.33 s. The tube current for the 80 kV level was adjusted to be four times greater than that for the 140 kV level. A total of 125 ml of iodinated contrast agent (Ultravist, 370 mgI/ml; Schering) was injected intravenously using a dual-head power injector at a rate of 2.5 ml/s, followed by a saline chaser of 30 ml at the same injection rate. The scan delay was fixed at 40 s. Images were reconstructed using a medium-soft convolution kernel (D30), with a 1.5 mm slice thickness and a 0.7 mm increment. Three sets of weighted average images, iodine distribution images and overlay images of the weighted images were generated on a commercially available workstation with the pulmonary blood volume application of Syngo Dual Energy (Siemens Healthcare), as described by Chae et al [16, 17]. We used these iodine distribution images as the ‘iodine overlay map’.

Data post-processing and image reconstruction

The DECT images were retrieved from the institution’s archives and loaded by our in-house software (a medical imaging solution for segmentation and radiomic analysis). Tumour segmentation was performed on weighted-average images to delineate tumour borders, and the resulting mask was applied to the iodine overlay maps. Tumours were drawn three-dimensionally using a semi-automated technique, covering as large a portion of the whole tumour as possible by the software, and then minimally corrected by a radiologist who was blinded to the pathological results and clinical outcome. The interobserver reproducibility was analysed for region-of-interest (ROI)–based texture feature extraction by two experienced radiologists (L.S.M. and J.C.; readers 1 and 2, with 6 and 2 years of experience in chest CT, respectively). Further analyses were performed using the measurements of the first reader.

Histogram- and gradient-based features, along with run-length encoding (RL), absolute gradient (GRAD), moment [18] and grey-level co-occurrence matrix (GLCM)-based features were obtained from a quantitative CT analysis of the automatically derived ROI. Histogram analysis (HIST) was performed for assessment of mean attenuation, skewness (skewness of pixel distribution), kurtosis (magnitude of pixel distribution), energy and entropy (irregularity). Texture parameters derived from the GLCM were also calculated, including angular second moment (ASM), contrast, correlation, entropy and homogeneity. The derivation of the GLCM of an image was based on the statistical relationships of the intensities of nearby pixels [19].

Statistical analysis

All statistical analyses were performed using SPSS (version 21; IBM, Armonk, NY, USA) and the R statistical package, version 3.1.2 [http://www.r-project.org]). For all comparisons, p < 0.05 was considered statistically significant.

The associations of continuous and categorical variables with OS and DFS were evaluated using Cox proportional hazards regression models. In univariate analyses, each variable was included in a Cox regression model alone. Multivariate analyses were performed using the backward elimination method, and variables with p-values < 0.05 in the univariate analysis were used as input variables for multivariate analyses. The removal of variables was based on likelihood ratio statistics, with a probability of 0.10.

Using parameter estimates from the multivariate Cox model, we calculated predictive scores. K-adaptive partitioning, a multi-group partitioning algorithm that divides data into K heterogeneous subgroups based on information from a prognostic factor, was then used to determine the cut-off value of the score that could best separate the patients into groups with poor, intermediate and good survival outcomes (as indicated by the best p-value determined using the log-rank test) [20]. Kaplan-Meier survival curves stratified according to the risk groups were generated. The incremental value of the radiomic feature to the traditional staging system was assessed with the C-index, and the area under the receiver operating characteristic (ROC) curve was calculated. The interobserver reproducibility of feature extraction was evaluated using the interclass correlation coefficient (ICC) [21].

Results

Patients

Among the 93 patients, 42 patients (45.2%) had disease recurrence and 39 patients (41.9%) died during the follow-up period. The mean DFS was 49.8 ± 3.7 months and mean OS was 55.2 ± 2.8 months. The cohort included 55 (59.1%) patients with adenocarcinoma, 30 (32.3%) patients with squamous cell carcinoma and eight (8.6%) patients with other histopathological subtypes of tumour, including sarcomatoid carcinoma and adenosquamous carcinoma. For the pathological staging, 26 (28.0%) patients were diagnosed with stage IA, 24 (25.8%) with IB, 14 (15.1%) with IIA, 10 (10.8%) with IIB and 19 (20.4%) with IIIA.

Survival analysis: Univariate

In the univariate Cox regression analysis, the significant predictors of survival were stage (stage 1 vs. 3) and radiomic parameters, including histogram energy, histogram entropy, GLCM ASM, GLCM entropy and homogeneity for both DFS and OS (Table 2). GLCM contrast (hazard radio (HR) = 1.070, p = 0.048) was the only significant predictor for DFS.

Table 2 Univariate analysis of tumour parameters for predicting disease-free survival and overall survival

SUVmax showed borderline significance as a predictor of survival (p = 0.057 for DFS and 0.076 for OS). Stage was also evaluated by regrouping into two new subgroups, stage 1/2A and stage 2B/3. The regrouped stage was a significant predictor of both DFS and OS (HR = 2.793 for DFS and 2.782 for OS).

Survival analysis: Multivariate

The multivariate Cox regression analysis identified the regrouped overall stage and histogram entropy as independent risk factors predicting DFS (regrouped stage, HR = 2.132, p = 0.034; entropy, HR = 1.497, p = 0.034; Table 3) and OS (regrouped stage, HR = 2.020, p = 0.046; entropy, HR = 1.543, p = 0.021; Table 4).

Table 3 Multivariate Cox proportional hazards regression analysis for prediction of disease-free survival and its performance
Table 4 Multivariate Cox proportional hazards regression analysis for prediction of overall survival and its performance

For the prediction of DFS, histogram entropy alone had slightly lower diagnostic performance compared to regrouped stage (C-index, 0.631for entropy and 0.671 for regrouped stage) (Table 3). For the prediction of OS, the diagnostic performance of regrouped stage and entropy were similar (C-index, 0.667 for regrouped stage and 0.677 for entropy). ROC analysis showed that adding the radiomic feature produced a better model for prediction of overall survival compared to stage only (0.720 and 0.667, respectively; p = 0.048, Table 4). Adding radiomic feature to stage also slightly improved the model performance for DFS prediction but without statistical significance (0.703 and 0.671; p = 0.090).

Applying this prediction model, a risk score can be calculated using the following formula.

Prediction of DFS:

$$ \mathrm{Risk}-\mathrm{score}=0.757\ \mathrm{x}\ \mathrm{stage}\ \left(1\ \mathrm{for}\ \mathrm{stage}\ 1/2\mathrm{A},2\ \mathrm{for}\ \mathrm{stage}\ 2\mathrm{B}/3\right)+0.404\ \mathrm{x}\ \mathrm{HIST}\_\mathrm{entropy} $$

Patients were subsequently classified into a high-risk group (G3; risk-score > 2.99, n = 13), an intermediate-risk group (G2; 1.64 < risk-score ≤ 2.99, n = 65) and a low-risk group (G1; risk-score ≤ 1.64, n = 15).

Prediction of OS:

$$ \mathrm{Risk}-\mathrm{score}=0.703\ \mathrm{x}\ \mathrm{stage}\ \left(1\ \mathrm{for}\ \mathrm{stage}\ 1/2\mathrm{A},2\ \mathrm{for}\ \mathrm{stage}\ 2\mathrm{B}/3\right)+0.434\ \mathrm{x}\ \mathrm{HIST}\_\mathrm{entropy} $$

High-risk group (G3; risk-score > 2.92, n = 15), an intermediate-risk group (G2; 1.78 < risk-score ≤ 2.92, n = 56) and a low-risk group (G1; risk-score ≤ 1.78, n = 22, Fig. 2). Kaplan-Meier survival analyses based on the risk-score showed significant differences in survival between the three groups (Fig. 3).

Fig. 2
figure 2

A 78-year-old man with non-small-cell lung cancer who underwent pre-operative dual-energy perfusion computed tomography (CT). Preoperative CT showed a 4.3 cm irregular enhancing mass in the right middle lobe in contact with the pleura and associated mild pleural thickening (a). The same anatomical tumour is shown on the iodine overlay map and the blue line denotes the region of interest (ROI) for radiomic analysis (b). Histogram entropy in this patient was 2.41. The patient underwent right middle lobectomy and chest wall invasion was confirmed. The final pathological stage of lung cancer was stage 2B (T3 N0 M0). Regarding the risk-score using regrouped pathological stage and histogram entropy, the patient was grouped into the intermediate-risk group (disease-free survival risk score 2.48, overall survival risk-score 2.45). There was no relapse and the overall survival was 56.9 months

Fig. 3
figure 3

Prediction model for survival analysis using stage and radiomic features extracted from an iodine overlay map. Kaplan-Meier curves for disease-free survival (a) and overall survival (b). G1, G2 and G3 are risk groups defined by the prediction model

Interobserver agreement for CT radiomic features

There were no statistically significant differences between the measurements of the two readers for any feature. The interobserver ICCs of all metrics calculated on the basis of the two readers’ measurements (except run-length and moment-based texture features) were excellent, ranging from 0.929 to 0.996 (all p-values < 0.0001, Table 5). The run-length-based texture features showed moderate reliability (ICC = 0.653–0.753), and the moment-based feature showed good reliability (ICC = 0.813).

Table 5 Interclass correlation coefficient for the quantitative histogram- and gradient-based parameters

Discussion

This is the first study using radiomic features extracted from a DECT iodine overlay map to predict the prognosis of patients with resectable lung cancer. Our study showed that radiomics feature from DECT iodine overlay map was an independent predictor of survival including both OS and DFS, with an incremental value when incorporated with the staging system. The combination of stage and radiomic feature successfully stratified the patients into high-, intermediate- and low-risk groups, with significant differences in 5-year OS and DFS.

In our study, histogram energy, histogram entropy, GLCM ASM, GLCM entropy and homogeneity were consistently significant in the univariate analysis for predicting both OS and DFS. These features reflect the heterogeneity of a tumour, and our results are consistent with those of recent studies that evaluated prognostic radiomic parameters and risk stratification of lung cancer in cross-sectional imaging [22,23,24]. However, the implications of heterogeneity in an iodine overlay map can be different from those of heterogeneity in non-contrast and contrast-enhanced images. Because the DECT technique allows selective visualisation and quantification of iodine-related attenuation, and thus enables quantification of tumour blood supply more accurately at the time of CT scan than routine enhanced CT [16, 25], heterogeneity on an iodine map may represent heterogeneity of intratumoral perfusion and permeability, which will be different among regions of well-oxygenated tissue, hypoxia or necrosis. Heterogeneity of intratumoral perfusion, especially for necrosis, which is associated with aggressive tumour behaviour and reduced survival, has been recognised in previous studies as a prognostic factor in patients with lung cancer [26,27,28]. Therefore, based on our results, we believe that radiomic analysis of iodine overlay maps may capture the status of blood supply within a tumour.

Our final prediction model combined histogram entropy and staging (C-index, 0.72 for OS and 0.70 for DFS). The fact that histogram entropy showed significant additional value for prediction of OS demonstrates that radiomic features of iodine overlay maps can provide new information on lung cancer prognosis. For the prediction of DFS, adding histogram entropy to stage also slightly increased the performance but did not show statistical significance, which might result from irregular follow-up intervals exhibited in the patients. The performance of our prediction model incorporating radiomic features was comparable or better than that of previous studies using radiomics (C-index: 0.65–0.72), although there were differences in study population and endpoints for survival [8, 24]. Therefore, we believe that radiomic features derived from iodine overlay maps can be used for survival estimation, and may help to determine the management of patients in terms of patient-tailored treatment. Given that DECT has several advantages in addition to evaluation of lung cancer staging, including better characterisation of tumour using iodine-enhanced images and virtual non-contrast images without additional radiation doses [29], pre-operative/post-operative functional evaluation of surgical candidates [17] and additional prognostic information [14], DECT can be a good option for comprehensive evaluation in cases of resectable lung cancer. However, as we did not compare the performance of radiomic features between routine CT images and DECT-derived iodine overlay maps, further studies are needed to verify whether a multiparametric radiomics approach using several CT images (non-contrast, contrast-enhanced and iodine overlay map images) could further improve performance in survival prediction.

The results indicated that histogram- and gradient-based features using a three-dimensional semi-automatically drawn ROI covering the whole tumour on iodine map were all reproducible. Although one of the main challenges of radiomics is tumour segmentation and reproducibility, the interobserver agreement in our study showed an excellent range of ICC. The entropy in particular was a robustly reproducible parameter (ICC = 0.987). In previous lung cancer research, interobserver agreement for texture features ranged from 0.752 to 1.00. The ICC was mostly higher than this in our study; the difference might be caused by a different method of tumour segmentation, using a semiautomatic region-growing volumetric segmentation algorithm with minimal manual correction by a reader. Semiautomatic volumetric segmentation is known to reduce interobserver variability, and significantly improve the robustness of radiomic feature quantification, compared to manual segmentation [30, 31].

Interestingly, SUVmax showed marginal significance for prediction of survival in patients with resectable lung cancer (p = 0.076). It is well known that high values of SUVmax are associated with a higher risk of disease recurrence or death in lung cancer patients [32, 33]. However, there is high measurement variability in reported SUV values derived from FDG-PET/CT in clinical practice between different centres, due to different time of FDG injection, start of scanning, instrument performance changes, execution/interpretation of the examination and condition of the patient [34, 35]. The marginal significance of SUVmax in our study might be due to the retrospective nature of the study, and to measurement variance of SUVmax values, which were based on scans acquired and evaluated in several different centres.

Our study had several limitations. First, the study was performed in a single centre using a retrospective design. Also, the study population was relatively small. Therefore, the generalizability of our results may be limited. However, the DECT data were all acquired using the same machine, which is subjected to a robust quantitative analysis. Our study demonstrated the usefulness of radiomic analysis of iodine overlay map and provided evidence for expanding the scope of radiomic approaches. Second, as we used in-house software in our study, a multicentre reproducibility study may be warranted. Finally, an inherent limitation of DECT is that it can only provide a static picture of pulmonary perfusion at a particular time point and iodine enhancement can be influenced by several confounding factors such as age, sex, body weight and cardiac function of the patient despite the same scanning parameters.

In conclusion, radiomic features extracted from iodine overlay map reflecting heterogeneity of tumour perfusion can add prognostic information for patients with resectable lung cancer. Future prospective studies are needed to further explore the potential of this technique to guide individualised post-operative management for these patients.