Introduction

In the prognosis of chronic obstructive pulmonary disease (COPD), the underlying morphological abnormalities in the airways and alveoli underlie the heterogeneity of clinical outcome and mortality [1, 2]. Long-term prospective studies in various settings have succeeded in identifying significant prognostic markers [3,4,5,6,7,8,9]. The BODE index, a multidimensional grading system, including the body mass index (B), the degree of airflow obstruction (O) and dyspnoea (D) and exercise capacity (E), has been proposed to be a better mortality predictor than a single index [4]. Specifically, in quantitative imaging, Haruna et al [6] showed that the percentage of a low attenuation area (LAA) on chest computed tomography (CT) effectively predicts mortality. In contrast, although there is an association between CT airway measurements and COPD risk [10], the relationship with mortality remains unclear. Advanced quantitative imaging technologies have not yet been used to strengthen prediction models. We propose implementing the extent of morphological complexity in CT images of COPD patients to improve survival prediction.

Mishima et al [11] showed that the size distributions of LAAs from lung CTs of COPD patients followed the power law, implying a fractal structure. The power law exponent (PLE) is associated with pulmonary function, and can be used as an indicator of the extent of lung complexity [11,12,13,14,15]. The functional capacity of the lung is mainly established by its complex morphometry [14,15,16,17]. Similar previous studies have utilised fractal properties to gain a better understanding of lung structure-function relationships [11, 14, 15, 17,18,19,20,21,22,23,24]. A longitudinal study demonstrated that emphysema progression decreases heterogeneity [9], implying that PLE carries essential information about COPD pathophysiology. However, in these aforementioned studies, complexity analyses were performed using two-dimensional (2D) images; three-dimensional (3D) information was not fully utilised.

Here, we hypothesised that emphysema compromises lung morphometric complexity, diminishing lung function and consequently death. If true, a complexity measure could predict patient survival. We applied fractal dimension methods with PLE and box-counting to baseline CT data, and followed up patient survival to ascertain the predictive performance of the model.

Materials and methods

This study was approved by the institutional review board of the Asan Medical Center (no. 2005-0345). Written informed consent was obtained from all patients.

Patients and clinical assessments

Records of subjects from a Korean obstructive lung disease (KOLD) cohort were retrospectively reviewed. All subjects met the following criteria: a diagnosis of COPD, i.e. post-BD forced expiratory volume in 1 s (FEV1)/forced vital capacity (FVC), 0.7, and no or minimal abnormality, other than emphysema, on chest radiographs. The KOLD study design has been published previously [25]. 302 COPD patients who underwent CT scans between June 2005 and June 2015 were enrolled, and their survival status at February 2017 was ascertained. Age, sex and smoking history were recorded and body mass index, 6-min walking distance and dyspnoea scale were measured at enrolment. For all patients, all test results, measurements, spirometry testing and CT scans were acquired within 1 month of enrolment. Pulmonary function tests and CT scans were performed according to the KOLD cohort study protocol [25]. The BODE index was calculated in accordance with Celli et al [4].

CT scans

All subjects underwent multi-detector row CT. Details on the CT and automatic segmentation methods are provided in Supplementary Material 1, and in previous publications [25, 26].

Complexity analysis

For complexity analysis, two methods were applied simultaneously: the PLE of the size distribution of emphysema, Dsize, and the box-counting fractal dimension, Dbox3D. The algorithm for measuring Dsize has been described previously [26, 27] and is summarised in Supplementary Material 1.

Dbox3D was measured using an in-house box-counting fractal analysis tool with MATLAB release 2015a (MathWorks Inc., Natick, MA, USA) that utilises the full-3D CT data. The box-counting fractal dimension was measured as first suggested by Grassberger [12] and presented by Ott [13]. Figure 1 shows a conceptual representation of this method. In brief, numbers of imaginary cubes needed to cover the lung parenchyma mask, which is automatically generated by subtracting the emphysema mask from the lung mask, are counted, using varying sizes, to yield a power-law distribution, and its exponent is considered as the fractal dimension. A higher box-counting fractal dimension would imply a greater space-filling property of the lung parenchyma in 3D space. Further details are provided in Supplementary Material 1.

Fig. 1
figure 1

A schematic representation of Dbox3D measurement by a box-counting fractal dimension analysis method with a representative patient image. (a) Automatically segmented lung and emphysema lesion masks are the two major inputs for the analysis. A lung mask is defined by a volume with < −400 HU, with the exception of large airways, and an emphysematous lesion mask has a volume of < −950 HU in the lung. The patient shown in this figure had EI% = 32%. The lung mask subtracted from the emphysematous lesion mask represents the lung parenchyma mask. (b) Imaginary cubes with various sizes scan the lung space in three-dimensions and the algorithm counts the number of cubes needed to cover the lung parenchyma; a logarithmic regression is performed to estimate the Dbox3D value. More details about measuring Dbox3D can be found in Supplementary Material 1

To compare the performance and reliability of the full-3D method with the conventional 2D method, 2D box-counting fractal analysis was concurrently performed, and further details are provided in Supplementary Material 1.

Statistical analysis

To investigate the prediction power of Dsize and Dbox3D for mortality, univariate and multivariate Cox proportional hazards analyses [28] were performed. In the multivariate mortality model, multicollinear predictors with a variance inflation factor (VIF) > 3 were excluded. Conditional backward elimination was applied to remove insignificant predictors with a probability > 0.05. As there are no generally accepted optimal cut-off values for risk factors involving continuous variables for differentiating these groups, we stratified patients according to Martinez et al [8] and Haruna et al [6]. The poorest performing quintile (approximately 20% of the population) were designated the ‘unhealthy’ group and the rest the reference group. Data are shown as mean ± standard deviation, and p-values as well as the 95% confidence interval (95% CI) were calculated. To evaluate the performance of the prediction models, the concordance index (C-index) and area under the receiver operating characteristic curve (AUC-ROC) were analysed. AUC-ROC was determined for 6-, 9- and 12-year survival. R statistics software version 3.4.3 (R Foundation for Statistical Computing, Vienna, Austria) and the following R packages were used: dplyr [29], survival [30] and survivalROC [31].

Results

Baseline characteristics and mortality outcome

The patients’ baseline characteristics are shown in Table 1. Eleven patients were excluded for BODE measurements due to missing results for the 6-min walk test. The median follow-up time was 6.1 years (range: 0.2−11.6 years). Sixty-three patients (20.9%) died; the median survival was 5.1 years. For 35 patients, death was lung-related, and included respiratory failure (17 patients), pneumonia (11 patients), lung cancer (eight patients), sepsis (four patients) and pneumothorax (one patient), with some patients having multiple causes. The other causes of death were non-lung cancer malignancy (seven patients), cardiovascular disease (three patients), septic shock (one patient) and unknown (17 patients).

Table 1 Baseline characteristics of 302 chronic obstructive pulmonary disease patients

Prognostic significance of predictors

The relationships among all-cause or lung-related mortality and candidate predictors were assessed using Cox proportional hazards analysis (Table 2). Older age (hazard ratio [HR], 4.44; 95% confidence interval [CI], 2.63–7.50), heavier smoking history (HR, 2.00; 95%CI, 1.17–3.40), higher BODE index (HR, 7.14; 95%CI, 4.27–11.95), lower percentage of diffusing capacity of the lung for carbon monoxide corrected with hemoglobin to an expected value (cDLCO%, HR, 4.46; 95%CI, 2.70–7.44), higher percentage of lung volume occupied by emphysema (EI%, HR, 3.95; 95%CI, 2.41–6.49), higher Dsize (HR, 3.46; 95%CI, 2.09–5.70), and higher Dbox3D (HR, 4.05; 95%CI, 2.46–6.67) were significantly associated with all-cause mortality. Sex (female sex HR, 0.73; 95%CI, 0.23–2.32) was not a significant predictor (p = 0.059). The same predictors also significantly predicted lung-related mortality.

Table 2 Univariate Cox proportional hazards regression analysis showing prognostic significance of demographic parameters, smoking history, BODE index, cDLCO%, EI%, Dsize and Dbox3D

Considerable multicollinearity (VIF > 3) between cDLCO%, EI%, Dsize, and Dbox3D were found, and thus, only one should be included in multivariate analysis. In univariate analysis, cDLCO% performed best in predicting all-cause mortality (HR, 4.46), followed by Dbox3D (HR, 4.05), whereas Dbox3D performed best in predicting lung-related mortality (HR, 6.16), followed by EI% (HR, 5.71). Because the superiority among cDLCO%, EI%, Dsize and Dbox3D as a predictor was difficult to distinguish, four models including cDLCO%, EI%, Dsize or Dbox3D, with variables age, sex, smoking history and BODE index, were tested using conditional backward elimination. In the model including Dbox3D, the variables age, smoking history, BODE index and Dbox3D were identified as significant predictors for all-cause mortality (C-index, 0.790) and lung-related mortality (C-index, 0.785) (Table 3). Other models including cDLCO%, EI% or Dsize instead of Dbox3D showed qualitatively similar results (Supplementary Tables S13). The analysis revealed that the Dbox3D value was a significant predictor of mortality of COPD patients, independent of age and the BODE index.

Table 3 Significant predictors in multivariate Cox proportional hazards regression analysis: age at baseline, smoking history, BODE index, and Dbox3D. The concordance indices of the multivariate prediction model with all four significant predictors are shown

Predicting survival by fractal dimension

The prediction power of Dbox3D on COPD patient mortality was assessed using Kaplan-Meier survival curves of the Dbox3D subgroups (Fig. 2). The low-Dbox3D subgroup is the lowest quintile (n = 60) and the high-Dbox3D subgroup comprised all the remaining patients (n = 242) (Table 2). For both all-cause death and lung-related death, the survival curves for the high- and low-Dbox3D subgroups differed significantly (p < 0.001).

Fig. 2
figure 2

Kaplan-Meier survival curves showing the outcomes of chronic obstructive pulmonary disease patients stratified by Dbox3D, for all-cause death (a) and death related to lung (b). The patients were stratified into two groups, the lowest Dbox3D quintile (n = 60) and the remaining patients (n = 242) with relatively higher Dbox3D. The patients with high Dbox3D showed a better prognosis than those with a low Dbox3D, for both all-cause death and death related to lung. Dbox3D: the Box-counting fractal dimension of the lung parenchyma shown in full in three dimensions

Contribution of complexity measures to combinatorial prediction models

Table 4 shows combinatorial prediction models and their prediction performances. A multivariate Cox proportional hazard model including age, smoking history and BODE index was used as a basis to which cDLCO%, EI%, Dsize and Dbox3D were added, individually. Dsize and Dbox3D were each added to the base model plus cDLCO% and base model plus EI%. The C-index and AUC-ROC for 6-, 9- and 12-year survival of the resulting nine combinatorial models were evaluated. The base model was concordant (C-index, 0.777) and highly predictive of 6-, 9- and 12-year survival (AUC-ROC, 0.828, 0.837 and 0.824, respectively). Adding Dsize or Dbox3D to the base model enhanced its performance subtly. Adding cDLCO% or EI% instead of Dsize or Dbox3D similarly enhanced prediction performance. Combining the base model, cDLCO% or EI%, and Dsize or Dbox3D minimally affected prediction performance. These effects were too subtle to indicate superiority among the predictors. Thus, the contribution of Dsize or Dbox3D is comparable to that of EI% or cDLCO%.

Table 4 Prediction performances of combinations of predictors. Concordance index (C-index) and area under the receiver operating characteristics (AUC-ROC) curves for various combinations of predictors are shown. Considering a prediction model comprising age, smoking history, and BODE index a base model, addition of Dsize or Dbox3D enhanced the model performance subtly. Dsize or Dbox3D benefitted the prediction model to a similar extent as cDLCO% or EI%

Comparing the 2D versus 3D box-counting methods

Figure 2 shows the difference between the 2D and 3D box-counting methods in prediction performance. Compared with the 2D box-counting (grey dots) for a small sampling number, such as three or five, the 3D box-counting method (blue line) outperformed the 2D method in both predictive significance and reliability. Forty or more 2D slices should be sampled to achieve a similar performance as the 3D method.

Discussion

We here defined methods to quantify lung parenchymal morphometric complexity, and propose two complexity measures, Dsize and Dbox3D; both are novel and significant predictors of survival in COPD patients.

In this study, we applied box-counting fractal analysis to CT scans of COPD patients, used the full-3D data, which yielded better prediction performance than 2D slice sampling, and revealed that morphometric complexity is closely associated with survival.

Lower morphometric complexity has been observed in patients with more severe COPD [11, 26]. We found that age, smoking history, BODE index, cDLCO%, EI%, Dsize, and Dbox3D are significant predictors of mortality, which was in agreement with previous studies [3,4,5,6,7,8], although the different study designs prohibited direct comparisons.

Dsize and its correlations with COPD severity and progression, and its robustness to the breath-hold level have previously been studied [9, 11, 26]; however, its significance in mortality prediction has not yet been reported. Although Dsize is a significant predictor, its performance is similar to or slightly inferior to those of EI%, cDLCO% and the BODE index.

The prognostic significance of Dbox3D for both all-cause and lung-related mortality is equivalent to those of EI% and cDLCO%, and comparable to that of the BODE index. Dsize accounts only for the size distribution of the emphysema lesions, whereas Dbox3D also reflects the lesions’ spatial distribution pattern.

The predictive significance of the BODE index and cDLCO% was similar in this and previous studies [4, 6,7,8]. The BODE index [4, 8] and/or its individual elements, FEV1 [3, 4, 6,7,8], body mass index [4, 6,7,8], dyspnoea scale [4, 7] and 6-min walking distance [4, 7, 8] were identified as significant survival predictors in our and previous studies. Similar to our results, DLCO% [8] and DLCO/VA [6] have previously been proposed as mortality predictors.

A positive association with smoking history was significant in our study, in contrast to previous studies [3,4,5,6,7,8]. We deduce that smoking history only marginally influences mortality in the KOLD cohort, in agreement with other studies. A relatively shorter smoking history (41 pack years ± 28; Table 1) in the KOLD cohort than subjects in previous studies may underlie the discrepancy.

EI% was a highly significant predictor of mortality in our and previous studies [5,6,7], but not in the study of Martinez et al [8], who found that the percentage of emphysema is not significant, but rather that the difference in the percentage of emphysema between the upper and lower lung was a predictive factor [8]. Their findings were in contradiction to those of Dawkins et al [5] and they proposed that it may be due to differences in the patient selection criteria [8]. Upper or lower lung predominance in terms of EI% and/or Dbox3D should be investigated in future to elucidate this matter.

cDLCO%, EI%, Dsize and Dbox3D had similar prediction performances, and were also considerably collinear. To avoid multi-collinearity in multivariate analysis, we chose the variable Dbox3D as it had the best HR in predicting lung-related death and the second best in predicting all-cause death. Although the HR of cDLCO% was slightly higher than that of Dbox3D in all-cause death (4.46 vs. 4.05), it ranked only third in predicting lung-related death.

The conditional backward elimination method revealed Dbox3D to be a significant variable, along with age, smoking history and BODE index; their predictive performances were similar. Dbox3D had smaller HR values than those of the BODE index (Tables 2 and 3), but was not inferior.

Dbox3D and all other predictors performed better for lung-related death than all-cause death (Tables 2 and 3 and Fig. 2). A Dbox3D value in the lowest quintile, indicating relatively low lung morphometric complexity, was a significant risk factor for mortality. The low-Dbox3D quintile 10-year survival for lung-related diseases halved, while nearly 90% of the patients with high-Dbox3D values survived.

Dsize or Dbox3D contributed to a combinatorial survival prediction model comprising age, smoking history and the BODE index, similar to EI% or cDLCO%, where the BODE index contributed the most.

We also investigated the benefits of utilising full-3D data over 2D data. The extent of agreement among the repeated 2D measurements for sampling 3, 5, 10, 20, 30 and 40 slices was almost saturated to 1 between 10–20 slices (Fig. 3, Supplementary Material 2). Even three sampling slices yielded a very high inter-class correlation coefficient (> 0.998) over 20 repetitions. Dbox2D values from the sampled slices were highly concordant between repeated measurements (Supplementary Material 2). However, considering the prediction performance, at least 40 slices should be sampled to obtain stable HR values similar in quality to those of the 3D method (Fig. 3). Overall, we recommend the full-3D method rather than applying the 2D box-counting method to an arbitrary number of sampled slices, to improve mortality prediction.

Fig. 3
figure 3

The prediction significance of a two-dimensional slice selection schema for all-cause death (a) and death related to lung (b). When a small number of slices were selected, such as three or five, some random trials give higher hazard ratios than the three-dimensional (3D) measurement, but the average performance is lower than that of the 3D measurement. Moreover, the predictive significance varies from trial to trial, which implies that the measurement is unreliable. Over 40 slices should be measured to achieve a similar performance to that of 3D box-counting fractal analysis

We speculate that morphological complexity successfully predicts patients’ survival because it represents the integrity of the lung parenchyma. Patients with healthier, more intact lung parenchyma have a higher probability of surviving longer than patients with a less complex lung. Considering the ‘lung at risk’ concept [32, 33], a lung parenchymal area that is apparently normal but under mechanical stress leads to disease progression; thus, a loss of morphometric complexity may be a prelude to progressive emphysema. By definition, fractal geometry allows irregular structures to be quantitatively characterised in geometric terms, even if they are difficult to describe by classical geometry [14, 16]. The box-counting fractal dimension method best represents the so-called ‘space-filling property’, which is similar to, but not equivalent to, EI%. More than simply estimating the volumetric ratio of the LAA with respect to the whole lung, the fractal dimension measures the complexity, the space-filling property, and the integrity of the lung parenchyma. Thus, the concept of ‘a good lung’, designed under fractal rules, is a highly space-filling and complex lung, as formulated by Weibel [14, 15, 17].

We also speculate that the predictive performance of Dbox3D would have been superior to EI% or cDLCO% in a larger cohort and with a longer follow-up period. Although both EI% [6] and cDLCO% [8] are proven predictors of mortality in COPD patients, they have variable reproducibility [26, 34, 35]. EI% is susceptible to intra- and/or inter-subject variations in the breath-hold-level in CT imaging, whereas fractal measures are more robust [26]. The DLCO test also has variable reproducibility [35], requires additional time and effort to obtain, and cannot be performed properly by some patients [34]. Dbox3D may be used in conjunction with cDLCO% in such circumstances.

The limitations of the current study are as follows: (1) The CT data were analysed only at baseline and not at follow-up. In future studies, follow-up CT data of our patients and follow-up lung clinical indices, such as the BODE index and cDLCO%, may reveal the nature and temporal evolution of these indices. (2) The causes of deaths were not completely followed up. Among 63 deaths, the cause of death in 17 patients remained unknown. (3) Accounting for the full-3D CT data acquired in quantification is a virtue of Dbox3D; however, the full-greyscale data were excluded from the current analysis. The high collinearity between Dbox3D and EI% may be due to the same -950 Hounsfield units-threshold binary masks that these parameters share. In future studies, a 4D fractal analysis could be performed with the inclusion of an additional greyscale dimension, which currently would require excessive computational time. (4) The study population was only Korean. External validation that includes populations of multinational, multi-ethnic COPD patients may be needed. Nevertheless, we did demonstrate that morphological complexity is a significant predictor of mortality in COPD patients, and is unique and non-inferior to conventional prognostic indices.

In conclusion, we have demonstrated two methods for measuring the morphometric complexity in the lung on chest CT images, and determined that low morphometric complexity in the lung is a risk factor for mortality in patients with COPD, which has not been reported previously. Further investigations of lung complexity may facilitate our understanding of the nature of COPD and enhance the ability to predict mortality in clinical practice.