Introduction

White matter hyperintensities (WMH) refer to the neuroimaging feature of small blood vessel disease found during brain magnetic resonance imaging (MRI). WMH have been associated with cognitive decline, depression, and Alzheimer’s disease [1]. Accordingly, it is particularly important to identify and prevent the progression of WMH [2]. However, this has been difficult as the pathogenesis of WMH remains unclear [3,4,5]. Certain factors such as age and hypertension have been associated with WMH [6], but the determinants involved in the formation of WMH are likely different from those involved in the progression of WMH [7]. Once the risk factors associated with the progression of WMH have been identified, early intervention strategies may be developed to attenuate brain injury.

Radiomics, which is a promising and rapidly growing discipline, is a method for extracting a large number of quantitative features from medical images and converting the information into mineable data [8,9,10]. These data can be subsequently analyzed to construct biomarkers for disease prediction and diagnosis through feature selection. Radiomics has been widely used in neurological diseases, including small vascular disease [4], multiple sclerosis [11], and traumatic brain injury [12]. Diffusion tensor imaging (DTI) or arterial spin labeling can detect microscopic changes in the whole-brain white matter (WBWM) to assess the severity and progression of WMH status, and some studies have revealed that microstructures have changed in normal-appearing white matter (NAWM) [13, 14]. NAWM is visually similar to healthy tissues, yet WMH may be detected by advanced MRI techniques. Understanding the conversion of NAWM to WMH may further help identify patients at risk of developing progressive WMH, bringing valuable information for clinicians to offer precise preventive and therapeutic options to patients. Compared with those advanced MRI studies, radiomics has the benefit of using clinical routine MRI sequences to evaluate the status of WMH.

In our previous study using MRI [15], we found that the radiomics analysis of WMH penumbra was predictive of the development of WMH while in the early form of NAWM. However, this is difficult to achieve in the traditional clinic as WMH penumbra must be drawn manually and accurately, which is difficult and time-consuming. The WBWM, including normal white matter, WMH, and the penumbra, can easily be extracted on T1-weighted fluid-attenuated inversion recovery (FLAIR) using the software. Accordingly, we utilized the radiomics score of WBWM from conventional brain MRI scans to predict the progression and identify associated risk factors of WMH. The purpose of the current study was to investigate whether radiomics of WBWM could be used to predict the progression and identify associated risk factors of WMH based on radiomics features extracted from different brain regions.

Materials and methods

This retrospective study was approved by our institutional review board, and the requirement for informed consent was waived.

Study population

The MRI data of 1073 patients who underwent brain MRI scans at our hospital from March 2014 to May 2018 were retrospectively analyzed. Among the patients, 141 patients with WMH who underwent two consecutive brain MRI scans were used in this study, including 71 males and 70 females (mean age, 67 years; range, 50–87 years). Patients were included based on the following criteria: (1) WMHs were visible on T2-weighted imaging and T2FLAIR; (2) patients were ≥ 50 years of age; (3) no stroke lesions on current diffusion-weighted imaging (except lacunar infarction); and (4) no indications of multiple sclerosis, Alzheimer’s disease, Parkinson’s disease, or traumatic brain injury. The exclusion criteria were as follows: (1) non-vascular white matter lesions (i.e., demyelinating autoimmune disease, metabolism, poison, infection); (2) signs of intracranial hemorrhage; or (3) serious head motion artifacts.

Clinicodemographic data were collected, including age, gender, diabetes status, hypertension, hyperlipidemia, and coronary heart disease. Smoke and alcohol consumption history in the past 5 years was also obtained. Smoking was defined as having more than 10 cigarettes per day, and alcohol consumption was defined as the amount of ethanol ≥ 40 g per day for men and ≥ 20 g per day for women, respectively. Hypertension was defined as a systolic blood pressure of ≥ 140 mmHg, diastolic pressure of ≥ 90 mmHg, or with the current usage of antihypertensive medications. Hyperlipidemia was defined as the overnight fasting low-density lipoprotein (LDL) of ≥ 130 mg/dl.

MRI acquisition

All brain MRI scans were performed in a local hospital on a 3.0 T MRI scanner (Discovery MR 750, GE Healthcare) with an eight-channel head coil using the same MR parameters. The routine sequences included T1FLAIR, T2-weighted imaging, diffusion-weighted imaging, and T2FLAIR. Axial T2FLAIR was used for observing the WMH with the following parameters: repetition time/echo time = 9000/120 ms and field of view = 256 × 256 mm, matrix = 256 × 256, flip angle = 160°, echo chain = 18, bandwidth = 50, section thickness = 5 mm, inter-slice gap = 0. T1FLAIR was used to segment the white matter with repetition time = 1750 ms, echo time = 24 ms, field of view = 256 × 256 mm, resolution = 256 × 256, flip angle = 111°, echo chain = 10, bandwidth = 31.25, section thickness = 5 mm, and inter-slice gap = 0.

Processing of MRI scans

All T1FLAIR baseline images were imported into the SPM12 software (https://www.fil.ion.ucl.ac.uk/spm/software/spm12/) for automatic segmentation of the white matter. The WBWM was manually modified by an experienced neuroradiologist who was blinded to the clinical data using the ITK-SNAP software (http://www.itksnap.org/pmwiki/pmwiki.php). This was accomplished by the following steps: (1) removal of non-brain tissues, brainstem, and cerebellum; and (2) modified white matter segmentation. The WBWM is shown in Figure S1.

The segmented WBWM was imported into the AK software (Artificial Intelligence Kit V3.0.0.R, GE Healthcare) for feature extraction. Radiomics features, including histogram, formfactor, gray-level co-occurrence matrix (GLCM), and run-length matrix (RLM), were calculated. Several pre-processing steps using the AK software were performed to extract the radiomics features. First, all WBWM images were resampled to change the voxels to 1 × 1 × 1 mm by linear interpolation to eliminate the effect of anisotropy on the features. Gaussian filtering was used to reduce noises in the image. The magnetic field migration correction was performed, which also assisted in reducing external interference factors. Finally, histogram mapping and intensity standardization were performed to limit the gray level of all images to 0–255 for comparative analysis of the features.

Progression of WMH

The visual rating scale proposed by Prins et al [16] was used to assess the progression of WMH in follow-up MRI scans, ranging from − 3 to + 3 in periventricular regions and − 4 to + 4 in deep regions. The progression of WMH was defined as a score of ≥ 1. Two experienced neuroradiologists, who were blinded to clinical information, reviewed the images independently to assess the changes in WMH. A disagreement between the neuroradiologists was resolved by discussion until a consensus was reached. Since the pathogenesis of periventricular WMH and subcortical WMH may be different [17], the group showing progressive WMH was divided into three subsets based on regions: periventricular white matter hyperintensity (PWMH) progression, deep white matter hyperintensity (DWMH) progression, and progression of WMH in any periventricular or deep region (AWMH). PWMH was defined as WMH directly connected to ventricles, while DWMH was defined as WMH connected to the subcortical area without extending to ventricles [18, 19].

Extraction of radiomics features and construction of the radiomics signature

Patients in each subgroup were randomly divided into the training (n = 98) and validation (n = 43) sets according to a ratio of 7 to 3, respectively, and the radiomics features were extracted for each patient. In total, 328 features were extracted for each patient, including 32 histogram features, eight formfactor features, 144 GLCM features, and 144 RLM features. The extracted features were standardized, and dimension reductions were performed. The least absolute shrinkage and selection operator (LASSO) was used to select the most powerful features of the training set to build the radiomics signature. Next, the radiomics signature was used to calculate the radiomics score (rad-score) for each patient, and this score was defined as the risk score for the progression of WMH. A radiomics score was calculated for each patient in the validation set using the same formula constructed for the training set. The predictive accuracy of the radiomics signature was quantified by the area under the receiver operating characteristic (ROC) curve (AUC) in the training and validation sets. Additional information about the process of radiomics feature extraction and feature selection can be found in the Supplementary Materials.

We initially chose 30 random images for the region of interest (ROI) segmentation and feature extraction, which was performed independently by the two neuroradiologists (combined 19 years of experience) who were blinded to the clinical information. The inter-observer reproducibility of feature extraction was initially analyzed between the two radiologists. For assessing intra-observer reproducibility, the first neuroradiologist repeated the feature extraction process twice during a 2-week period using the same procedures.

Construction and assessment of the radiomics model

The clinical data of patients with progressive WMH in each subgroup were dichotomized, and the differences in the radiomics score between the two classifications were compared for selecting potential predictive variables. In the training set, a univariate logistic regression analysis was performed for each potential predictive factor. Next, a multivariable logistic regression analysis was used to establish a radiomics model for assessing the progression of WMH, combining the independent clinical risk factors and radiomics signature. Internal validation of the radiomics model was performed with the validation set. The model was assessed using a calibration curve, and the Hosmer-Lemeshow test was performed to assess the goodness-of-fit of the model. Furthermore, the AUC was used to evaluate the discrimination of the model.

Statistical analysis

The statistical analyses were performed using SPSS 17.0, MedCalc15.8 software, and GraphPad Prism 6. The Kolmogorov-Smirnov test was applied to evaluate the normality of distribution, and the chi-square test was used for categorical data. This study carried out two classifications of continuous variables, including body mass index (BMI) with ≥ 25 kg/m2 being defined as obese, age with ≥ 60 years being defined as an older adult, and low-density lipoprotein (LDL) levels with ≥ 130 mg/dl being defined as hyperlipidemia. After conducting a comparative analysis of radiomics scores between the two subgroups for each risk factor, through the Student’s t test or Mann-Whitney test, statistically significant risk factors were included in the univariate logistic regression to evaluate whether the risk factor was independently associated with the progression of WMH. A predictive model was established using the multivariable logistic stepwise regression for progression of WMH in sub-regions, and the predictive performance was evaluated with the ROC curve. Finally, a correlation analysis was performed between the independent risk factors and radiomics signatures. Statistically significant values were those with p values of ≤ 0.05.

Results

Patient characteristics

The radiomics workflow is presented in Fig. 1. A total of 141 patients with WMH were included in the study, of which 65 subjects were divided into the progression group and 76 subjects in the non-progression group. The number of patients having coronary heart disease or hyperlipidemia reached statistically significant levels between the two groups (p ≤ 0.05). No significant differences were observed in the other clinical factors, including MR interval, age, BMI, diabetes, hypertension, smoking history, alcoholic history, and baseline Fazekas scores (all p ≤ 0.05), as shown in Table 1.

Fig. 1
figure 1

Radiomics workflow of normal white matter

Table 1 Clinical and demographic information of patients in this study

Inter-observer and intra-observer reproducibility of radiomics feature extraction

The intra-observer intra-class correlation coefficient (ICC), which was calculated based on two measurements from the first neuroradiologist, ranged from 0.826 to 0.937. The inter-observer agreement between the two neuroradiologist readers ranged from 0.773 to 0.916. These results indicated favorable intra- and inter-observer reproducibility of the feature extraction process.

Radiomics signatures

The most powerful characteristics were selected to construct the radiomics signatures, which included eight features in the AWMH subgroup, nine features in the PWMH subgroup, and 11 features in the DWMH subgroup. Additional information about these features is shown in Table S1. The radiomics properties extracted from the three subgroups were used to develop the radiomics signatures by LASSO. Additional details regarding the creation of the radiomics and the results may be found in the Supplementary Materials. Using the formula for the radiomics signature, the rad-score was calculated and found to be significantly different between progression and non-progression groups in the AWMH, PWMH, and DWMH subgroups (all p ≤ 0.05). The predictive efficacies for the radiomics scores of the training and validation sets were also significantly different between the three subgroups. The AUC values were 0.758 and 0.714 for AWMH, 0.749 and 0.697 for PWMH, and 0.775 and 0.717 for DWMH (Table 2).

Table 2 Comparison of radiomic scores between the training and validation sets in the three subgroups of AWMH, PWMH, and DWMH

Clinical factors in the progressive WMH subgroups

Among all of the clinical risk factors in subgroups, age, sex, hypertension, smoking, and hyperlipidemia were associated with AWMH progression of (p < 0.05); age, sex, hypertension, coronary heart disease, smoking, and hyperlipidemia were associated with PWMH progression (p < 0.05); age, weight, smoking, and hyperlipidemia were associated with the progression of DWMH (p < 0.05); see Table 3 for more details.

Table 3 Statistical analysis of the radiomics scores based on the subgroups of clinical risk factors

Logistic regression analyses of progression in three WMH subgroups

Multivariable logistic regression analyses revealed specific radiomics signatures that were independent predictors for all of the subgroups. Age and hyperlipidemia were independent predictors of progression of AWMH, age and BMI were independent predictors of DWMH, and hyperlipidemia was an independent predictor of PWMH. Prediction models were established based on the results of the multivariate logistic regression. The Hosmer-Lemeshow test in the three subgroups showed no significant differences in the goodness-of-fit for the training set models (p = 3.493, 2.385, and 2471 for AWMH, DWMH, and PWMH, respectively). The calibration curve showed excellent calibration in the training and validation sets of the three subgroups. Additionally, we evaluated the accuracy of the radiomics model for the training and validation sets in the three subgroups. For AWMH, the AUC was 0.848 and 0.824, for the training and validation sets, respectively. For PWMH, the AUC was 0.863 and 0.818, respectively. For DWMH, the AUC was 0.861 and 0.833, respectively (Table 4; Figs. 2, 3, and 4).

Table 4 Risk factors associated with the progression of WMH in different regions by logistic regression analysis
Fig. 2
figure 2

Calibration of the radiomics model for progression of WMH in any region of the training and validation sets. The dashed line represents a reference line where an ideal model would lie. The dotted line represents the performance of a hybrid model, while the solid line corrects for any bias in the hybrid model. The prediction performance of the ROC curves for AWMH and the AUC were 0.848 and 0.824, respectively

Fig. 3
figure 3

Calibration of the radiomics model for progression of periventricular WMH in the training and validation sets. The dashed line represents a reference line where an ideal model would lie. The dotted line represents the performance of a hybrid model, while the solid line corrects for any bias in the hybrid model. The prediction performance of the ROC curves for AWMH and the AUC were 0.863 and 0.818, respectively

Fig. 4
figure 4

Calibration of the radiomics model for progression of DWMH in the training and validation sets. The dashed line represents a reference line where an ideal model would lie. The dotted line represents the performance of a hybrid model, while the solid line corrects for any bias in the hybrid model. The prediction performance of the ROC curves for AWMH and the AUC were 0.861 and 0.833, respectively

Correlations between radiomics signatures and independent risk factors

Seven radiomics features were significantly correlated with independent risk factors (Figs. 5 and 6). In the AWMH subgroup, SurfaceArea of the formfactor feature was negatively correlated with age and LDL (r = − 0.571, − 0.403, p < 0.001), and InverseDifferenceMoment_ALLDirection_offset1_SD of GLCM feature was positively correlated with age (r = 0.638, p < 0.001). In the PWMH subgroup, Compactness2 of the formfactor feature was negatively correlated with LDL (r = − 0.592, p < 0.001), and InverseDifferenceMoment_angle0_offset7 of GLCM feature was positively correlated with LDL (r = 0.204, p < 0.001). In the DWMH subgroup, LongRunEmphasis_angle135_offset7 of RLM feature was negatively correlated with age (r = − 0.293, p < 0.001), and GLCMEntropy_angle45_offset4 of GLCM characteristic was positively correlated with age (r = 0.301, p = 0.006). Lastly, InverseDifferenceMoment_angle0_offset7 of the GLCM feature was positively correlated with BMI (r = 0.394, p < 0.001).

Fig. 5
figure 5

Correlation map between statistically significant clinical risk factors and features of the radiomics signature in each subgroup. The color bar on the right represents the size of the correlation coefficient; the larger the circle in the figure, the higher the correlation and vice versa. Asterisk indicates p < 0.05. Double asterisks indicate p < 0.005. AWMH, white matter hyperintensities in any (periventricular and/or deep) region; PWMH, periventricular white matter hyperintensities; DWMH, deep white matter hyperintensities; BMI, body mass index; LDL, low-density lipoprotein

Fig. 6
figure 6

The scatter plots between the radiomics features and different independent risk factors. The different colors represent different feature categories

Discussion

We used radiomics to predict the progression of WMH and identify the associated risk factors. Our findings suggest that radiomics of WBWM could be used to predict the progression of WMH, which may support the theory of the existence of WBWH penumbra proposed by Maillard et al [2]. We also found that the clinical risk factors associated with the progression of WMH were different in PWMH and DWMH, suggesting that the pathogenesis may be different between these regions [20, 21]. Finally, specific radiomics signatures were found to be associated with risk factors for the progression of WMH, revealing the potential pathophysiological transformations from NAWM to WMH.

One of the advantages of the current study is that we extracted radiomics features from WBWM other than sporadic and progressive WMH, which could reflect the changes in white matter structures from NAWM to WMH, suggesting that it may be more powerful than features from WMH alone. The microstructural integrity of NAWM changed due to axonal injury and demyelination [22], and we believe this may be the reason for the significant differences in radiomics scores detected between the progression and non-progression groups. In addition, the radiomics score of the DWMH subgroup was lower than that of PWMH, which indicated that the heterogeneity of PWMH was more remarkable. This finding is consistent with a previous pathological finding that progression between DWMH and PWMH is different [23], yet it supports our hypothesis that radiomics may reflect the pathophysiological changes of WMH.

Furthermore, we demonstrated that the clinical factors pertinent to the progression of WMH were selectively associated with radiomics signatures in different regions. Accordingly, the combination of the prediction model and risk factors was more effective than radiomics alone, implying that specific risk factors may accelerate the progression of WMH. In fact, microvascular structures are changed in MRI scans of visually NAWM [24,25,26]. Previously, Maillard et al [27] suggested that the early detection of microstructural damage in NAWM would require advanced MRI techniques, such as diffusion tensor imaging (DTI), rather than conventional MRI sequences like T1FLAIR to quantify the NAWM integrity. However, our findings suggest that conventional sequences, such as T1FLAIR, when combined with radiomics can be used to quantify heterogeneity of visually NAWM.

We noticed that some risk factors, such as high blood pressure, did not survive in our regression model, though hypertension is thought to be a major burden of WMH. We suspect that patients with hypertension in this study were taking antihypertensive drugs for a long time, which might lead to changes in NAWM that are difficult to detect. A previous study showed that antiplatelet therapy was not effective in slowing down the progression of WMH [28]. However, further studies are needed to determine if antihypertensive medications can decelerate the progression of WMH. Age was demonstrated as an independent risk factor for the progression of WMH, which is consistent with the literature [29, 30]. Interestingly, age was not found to be an independent risk factor for PWMH, which supports a previous pathological study showing that reduced vascular density in WMH was not associated with age [31]. Our research further demonstrated that only changes in PWMH were unrelated to age, which explains why the prediction model for PWMH showed the lowest performance. However, this conclusion is limited, as this is a preliminary study using a small sample size. In addition, hyperlipidemia was shown to be a risk factor for the progression of WMH. Previous studies have shown that hyperlipidemia leads to microvascular hemodynamic regulation disorder [32, 33], which increases viscosity and resistance of blood flow [34]. We speculate that the PWMH burden is more likely to be determined by chronic hemodynamic effects (low perfusion). Hyperlipidemia may be related to other risk factors, such as high blood pressure or obesity, for the progression of WMH [35]. BMI was found to be an independent risk factor for the progression of DWMH, which is consistent with the findings of Griffantia et al [36], indicating the progression of DWMH might be delayed by controlling the BMI of patients.

Many studies have investigated potential biomarkers for assessing the progression and severity of WMH [37, 38]. Compared with those biomarkers, radiomics may serve as a non-invasive biomarker to reflect white matter status. Our research showed that dimensionality-reduced features were primarily GLCM, RLM, and formfactor features, very similar to the existing study on the texture analysis of white matter [39, 40]; GLCM represents heterogeneity and homogeneity, which indirectly indicates the heterogeneity changes during the conversion from NAWM to WMH. The results are similar to a previous study by Loizou [41], who found that GLCM and formfactor features were significantly different in the progression of NAWM. Formfactor features, such as SurfaceArea, represent changes in volume, area, or shape that may reflect microscopic structures. The volume and morphology of WMH may change with age [40], which can also be confirmed by SurfaceArea being present in all three subgroups. In addition, the study showed that the RLM, which reflects roughness and directionality, was also associated with the progression of WMH, as features with directionality have a longer run length at a specific angle [42]. Healthy white matter fiber bundles are oriented and regulated, but myelin damaged causes them to lose their regular structure and become thickened and blurred [43], indicating that WMH progression could be indicated from WBWM using RLM, especially in the DMWH subgroup, which had a close correlation with age. This suggests that deep white matter is more susceptible to myelin damage over time, which is consistent with our results that the radiomics score was lowest in the DWMH subgroup. These results also indicate that the radiomics features of WBWM can reveal the relationship between the progression and risk factors of WMH in conventional MRI.

Our study has some limitations. First, we selected a single time interval for the progression of WMH, yet continuous changes cannot be fully observed in the process of NAWM converting to WMH. However, we could elucidate the pathological mechanism in the formation and progression of WMH based on radiomics features. Furthermore, we did not use volumetric measurements to assess the progression of WMH, which may lead to deviations in the assessment of progressive WMH. Nevertheless, the visual WMH changes scale used in the current study was highly correlated with changes in the volume of WMH [44]. Lastly, we performed a retrospective study in a single-center, and our results cannot be directly applied to patients at other institutions with WMH. Nevertheless, this study established a radiomics-based biomarker for assessing the progression of WMH using conventional MRI, which may be helpful in future multicenter prospective studies.

In conclusion, the radiomics analysis of WBWM showed that the progression of WMH involved diffuse damage far beyond visually hyperintense areas on conventional MRI sequences. High-risk populations with possible progressive WMH can be identified using radiomics of WBWM.