Introduction

Meningiomas are the most common primary intracranial neoplasms in adults, comprising 36.7% of all intracranial tumors [1]. According to the World Health Organization (WHO) classification system, 78% are benign (grade I), 20.4% are atypical (grade II), and 1.6% are anaplastic (grade III) [2]. Atypical or anaplastic tumors have an aggressive biological behavior, a tendency to recur [3], and a poor prognosis, with 5-year survival rates of 67.5 and 60.0% for patients with atypical and anaplastic meningiomas, respectively.

The initial extent of tumor resection and histological grade are key determinants of recurrence [4]. Thus, preoperative prediction of the meningioma grade is important because it influences treatment planning, including the surgical resection strategy. Further, according to the recent guideline published by the European Association of Neuro-Oncology [5], incidentally discovered and radiologically presumed meningiomas may be managed by observation only; thus, histological verification is not mandatory in these cases. Considering the numbers of tumors that are managed without histopathologic diagnosis, it is important to noninvasively differentiate between low-grade and high-grade meningiomas.

According to the 2016 WHO classification, benign meningiomas differ from atypical meningiomas in their numbers of mitoses, cellularity, nucleus-to-cytoplasm ratio, and brain invasion, as well as in their histologic patterns. The different histological grade may reflect alterations in tumor cell proliferation and microvessel density, which may be related to the characteristic imaging features. Diffusion tensor imaging (DTI) provides information regarding the motion of water protons at the cellular level [6], and the directional restriction of water diffusivity can be measured as the fractional anisotropy (FA); however, previous studies have shown controversial results regarding the prediction of the meningioma grade using diffusion-weighted imaging or DTI values [7,8,9,10,11,12,13].

Additionally, previous studies have investigated imaging features to predict the meningioma grade within conventional imaging approaches, including tumor heterogeneity on postcontrast T1-weighted (T1C) images, as well as irregular tumor margin and peritumoral brain edema [14,15,16]; however, these qualitative imaging features are highly subjective. In contrast, radiomics provides a comprehensive quantification of tumor phenotypes noninvasively by extracting a large number of high-throughput imaging features, such as shape and texture, potentially reflecting biologic properties, especially tumor heterogeneity. Recent studies using radiomics have shown promising results in differentiating molecular subtypes and predicting survival in gliomas [17,18,19].

In benign meningiomas, tumor consistency is important in determining the surgical outcome. Fibroblastic subtypes have been reported to exhibit a firmer tumor consistency, which thus requires painstaking dissection, especially for tumors located at the skull base [20]. Thus, preoperative information regarding the histological subtypes of meningiomas is also important.

The purpose of this study was to evaluate the role of radiomics features including texture and morphology of T1C images and DTI parameters, based on the entire tumor volume, in preoperatively determining the grades and histological subtypes of meningiomas.

Methods

Patient population

The institutional review board waived the requirement to obtain informed patient consent for this retrospective study. We retrospectively reviewed meningioma cases in which pathological confirmation, preoperative MRI with T1C image, and whole-brain DTI with a b value of 600 s/mm2 were performed. Exclusion criteria were the following: (1) patients with a previous history of operation, (2) patients with a history of tumor embolization or gamma knife surgery before the MRI exam, (3) patients with incomplete MRI sequences or suboptimal image quality, and (4) error in image processing. The study population was chronologically divided into two sets. First, 136 consecutive patients (108 low-grade [benign], 28 high-grade [atypical and anaplastic]), diagnosed as having meningiomas between June 2010 and October 2016, were included in the discovery set (108 women and 28 men; mean age, 56.00 ± 12.93 years), and 58 consecutive patients (46 low-grade, 12 high-grade), diagnosed as having meningiomas between November 2016 and March 2018, were included in the validation set (44 women and 14 men; mean age, 59.24 ± 13.17 years). The mean interval between the MRI examination and operation was less than 1 day for all patients. The flow chart of the study population is shown in Fig. 1.

Fig. 1
figure 1

The flowchart for the discovery and validation set

Pathological diagnosis

Pathological diagnosis was performed by a neuropathologist (S.H.K., 16 years of experience), according to the 2016 WHO criteria [21]. Criteria for atypical meningioma (WHO grade II) comprised 4–19 mitoses per 10 high-power fields, the presence of brain invasion, or the presence of at least three of these features (“sheet-like” growth, hypercellularity, spontaneous necrosis, large and prominent nucleoli, and small cells); criteria for anaplastic meningioma (WHO grade III) comprised frank anaplasia (histology resembling carcinoma, sarcoma, or melanoma) or elevated mitoses (> 20 mitoses per 10 high-power fields) [21]. Additionally, the Ki-67 labeling index was evaluated in 187 patients. In 152 of 194 patients, the mitotic count was evaluated using the mitotic marker phosphohistone-H3.

MRI protocol

Preoperative MRI was performed using a 3.0-T MRI scanner (Achieva, Philips Medical Systems) with an eight-channel sensitivity-encoding head coil. The preoperative MRI protocol included T1-weighted (T1) (TR/TE 2000/10 ms; field of view, 230 mm; section thickness, 5 mm; and matrix, 320 × 198) and T1C (TR/TE, 2000/10 ms; field of view, 250 mm; section thickness, 2 mm; and matrix, 256 × 256) images, as well as whole-brain DTI (with b values of 600 and 0 s/mm2, 32 directions, and the following parameters: TR/TE, 8400–8500/70–80 ms; field of view, 220 mm; section thickness, 2 mm; matrix, 112 × 112; acceleration factor, 2.5; and acquisition time, 5 min and 20 s). T1C images were acquired after administration of 0.1 mL/kg of gadolinium-based contrast material (Gadovist; Bayer).

Image preprocessing and postprocessing: volume acquisition

Data processing was performed offline. Preprocessing of the T1C images was performed to standardize the data analysis across patients. Before analysis, unwanted low-frequency intensity nonuniformity was removed by applying the N4 bias correction algorithm [22]. T1C image intensities were normalized using WhiteStripe [23] packages, implemented in R version 3.4.0 (R Foundation for Statistical Computing). T1C image and DTI data were processed with a multi-platform, free, and open-source software package for visualization and medical image computing (3D slicer, version 4.6.2-1; available at: http://slicer.org/). T1C images were coregistered to ADC and FA maps by affine transformation with normalized mutual information as a cost function [24,25,26]; ROIs were drawn on every tumor section on T1C images, using a semiautomatic method with an interactive level-set volume of interest using threshold-based and edge-based algorithms. Gross cystic, hemorrhagic, or necrotic areas were avoided by using conventional T1 and T1C images. The ROIs were transferred to ADC and FA maps. The ROIs were drawn by a single neuroradiologist and confirmed by another neuroradiologist (Y.W.P. and S.S.A., with 7 and 12 years of experience, respectively), both of whom were blinded to the corresponding clinical information and histopathologic results.

In total, 90 texture features (T1C, ADC, FA maps × 30 features, belonging to three categories) and eight morphology features were calculated (Supplementary Table 1). The three categories of texture features included histogram-based, co-occurrence matrix-based, and run-length matrix-based parameters. Detailed information regarding these features can be found in previous literature [27, 28]. A schematic for the data processing is shown in Fig. 2.

Fig. 2
figure 2

The workflow for imaging data processing and analysis. CM, co-occurrence matrix; RLM, run-length matrix; RF, random forest; SVM, support vector machine; AUC, area under the curve

Statistical analysis

The correlations between radiomics features and the meningioma grade were assessed using Student’s t test or the Mann-Whitney test according to the normality tests.

Machine learning classifiers from various combinations of feature subset selection, machine learning methods, and subsampling were trained to predict the meningioma grade. For feature subset selection, either no subset selection or recursive feature elimination (RFE) was performed. These feature selection methods were combined with support vector machine (SVM) and random forest (RF) machine learning methods. In addition, to overcome disparity in the frequencies of the meningioma grade, each machine learning model was trained (1) without subsampling, (2) with random over-sampling examples (ROSE), and (3) with the synthetic minority over-sampling technique (SMOTE) [29, 30]. Thus, a total of 12 combinations of feature subset selection, machine learning algorithm, and subsampling were trained and validated to classify the tumor grade.

Machine learning classifiers were trained on the discovery set (n = 136) and validated on the validation set (n = 58). For each machine learning combination, we trained the model on the discovery set using leave-one-out cross validation. The classifier yielded predicted probabilities for the meningioma grade. The area under the curve (AUC), accuracy, sensitivity, and specificity were calculated. A hypothesis test was performed (with the binom.test function) to evaluate whether the accuracy rate is higher than the no-information rate (the majority percentage for the meningioma grade, i.e., prediction by chance). Also, the precision-recall plots were calculated.

The correlation between radiomics features with the mitosis count and the Ki-67 labeling index was evaluated by Pearson correlation coefficient analysis.

Benign meningiomas were subdivided into fibroblastic meningiomas and other subtypes. Correlations between texture and morphology parameters and subgroups were assessed using Student’s t test or the Mann-Whitney test, according to the results of normality testing.

Statistical analysis was performed (by J.M.O., a biostatistician with 5 years of experience in computational biology) using statistical software R (version 3.3.1; R Foundation for Statistical Computing). The RFE feature selection and classification algorithms were performed using the caret R package [31]. ROSE and SMOTE subsampling were performed using the ROSE and DMwR R packages [32, 33]. The e1071 and pROC packages were used. Statistical significance was set at p < 0.05.

Results

Characteristics of the 194 patients in the discovery (n = 136) and validation (n = 58) sets are summarized in Table 1. In the discovery set, 108 patients were pathologically diagnosed with low-grade meningioma and 28 were diagnosed with high-grade meningioma (25 atypical, three anaplastic). In the validation set, 46 were pathologically diagnosed with low-grade meningioma and 12 were diagnosed with high-grade meningioma (11 atypical, one anaplastic).

Table 1 Patient characteristics in the discovery and validation sets

Radiomics feature analyses for determining meningioma grades

Various radiomics features were significantly different according to the meningioma grade (Supplementary Table 2). According to texture features, high-grade meningiomas showed higher histogram entropy, dissimilarity, coarseness, and less homogeneity along the run lengths of T1C, ADC, and FA values. For the morphology features with discriminative ability, high-grade meningiomas present as larger volumes with larger maximum 3D diameter and are more compact (higher “compactness 1,” defined as the ratio of volume to the [surface area]^1.5) [34] than low-grade meningiomas. Supplementary Fig. 1 shows representative cases that manifest the texture and morphology differences between different meningioma grades.

The performances of machine learning models in differentiating the meningioma grade in the validation set are demonstrated in Table 2. The AUC values from various machine learning classifiers to predict the grade are shown in Fig. 3. The best performance was yielded from a combination of feature selection by RFE and SVM classification algorithm with SMOTE, with an AUC, accuracy, sensitivity, and specificity of 0.86 (95% confidence interval [CI], 0.74–0.98), 89.7%, 75.0%, and 93.5%, respectively. The accuracy was significantly higher than the no-information rate accuracy (79.3%) on validation (p = 0.03). Supplementary Fig. 2 shows the receiver operating characteristic and precision-recall curves. Supplementary Fig. 3 shows a heat map according to meningioma grades in the discovery and validation sets, which reveals a strong relationship between texture and morphology features (p < 0.05) and meningioma grades.

Table 2 The performances of machine learning models in differentiating the meningioma grade in the validation set
Fig. 3
figure 3

The heat map of area under the curve (AUC) values from various machine learning classifiers to predict the meningioma grade in the (a) discovery and (b) validation sets. ROSE, random over-sampling examples; SMOTE, synthetic minority over-sampling technique; RFE, recursive feature elimination; RF, random forest; SVM, support vector machine

Relation of texture and morphology parameters with mitosis count and Ki-67 labeling index

Both the mitotic count and the Ki-67 labeling index were significantly correlated with multiple texture and morphology parameters (Supplementary Table 3 and Table 4). Supplementary Fig. 4 shows correlation matrix plots among texture and morphologic features, the mitosis count, and the Ki-67 labeling index (p < 0.05).

Texture and morphology analyses according to fibroblastic versus other subtypes

The subtypes of benign meningiomas were divided into 17 fibroblastic and 137 non-fibroblastic (49 meningothelial, 71 transitional, 3 psammomatous, 1 secretory, 7 angiomatous, and 6 microcystic) subtypes. There were significant differences in the various texture parameters between fibroblastic and non-fibroblastic meningioma subtypes (Supplementary Table 5).

Discussion

Preoperative prediction of the meningioma grade is clinically important. Although surgical resection is the primary treatment for meningiomas, observation or gamma knife surgery is considered in patients with asymptomatic small meningiomas without pathological confirmation [35]. Additionally, adjuvant radiotherapy has demonstrated efficacy for high-grade meningiomas [36, 37], whereas the role of adjuvant radiotherapy for low-grade (WHO grade I) meningiomas is unclear [38, 39]. Therefore, we comprehensively analyzed the texture and morphology features of T1C images, ADC, and FA maps, according to the meningioma grades and histologic subtypes. The machine learning classifiers showed fair to excellent performance to predict the meningioma grade in the discovery set (AUC, 0.7–0.1) and fair to good performance in the validation set (AUC 0.74–0.86). In benign meningiomas, there were significant differences in the various texture features between fibroblastic and non-fibroblastic meningioma subtypes.

Spatial and temporal textures are based on the destruction of normal anatomy by tumors, vasogenic edema, tumor cellularity, degenerative changes, or the compression of normal structures, as well as some that the human visual system fails to detect. In fact, the notion that texture analysis can reveal visually imperceptible tumor information extends beyond radiology to histopathology; texture analysis has been reported to be a potentially useful approach for estimating grades and molecular status in brain tumors [17, 40].

Among conventional MR features, heterogeneous enhancement was reported to be associated with atypical meningiomas [15, 41]. Heterogeneous enhancement is associated with heterogeneous distribution of tumor cells, which reflects intratumoral ischemic necrosis, calcification, hemorrhage, and cystic change. Previous reports have stated that atypical and anaplastic meningiomas have significantly more intratumoral cystic changes, compared with benign meningiomas [42, 43]. These can be quantitatively measured by texture features.

The usefulness of diffusion-weighted imaging in predicting the histological grades of meningiomas is controversial. Several studies have demonstrated significant differences between the grades and ADC values of meningiomas [7,8,9, 13]. However, other studies have shown that the mean ADC values in meningiomas were not significantly different according to grades [10,11,12]. In our study, there was no significant difference in the mean ADC or FA values. However, various texture features of ADC and FA maps were significantly different. High-grade meningiomas show a heterogeneous distribution of proliferating cells, resulting in an imbalance of cell density within the tumors [44]. This heterogeneous distribution of cell density was quantified by ADC and FA texture features in our study; high-grade meningiomas showed higher asymmetry, contrast, dissimilarity, and coarseness, as well as less homogeneity along run lengths of ADC and FA values. According to texture features, high-grade meningiomas showed higher histogram entropy, dissimilarity, coarseness, and less homogeneity along the run lengths of T1C, ADC, and FA values. Although we excluded gross cystic or necrotic areas from the radiomics analysis, radiomics features might have reflected microscopic heterogeneity within the tumors. Traditionally, studies including ADC or FA maps have focused on the enhancing solid portion of the tumor, and a recent study using both ADC and T1C radiomics features also performed tumor segmentation only in the enhancing area [45].

A recent study has also reported an association between tumor volume and the histopathological grade of meningioma, which may be due to the relatively high proliferative potential of high-grade tumors [46]. The finding that tumor morphology is an indicator of malignancy has been controversial. Some studies have reported that irregular shape is associated with meningioma aggressiveness [44, 47]. However, another study reported that irregular shape was not significant in a multivariate analysis of high-grade meningioma [15]. A recent study has reported that quantitative shape features were significantly different according to the meningioma grade [40], which is consistent with our results.

A previous study aimed to determine meningioma grades with machine learning based on T1C imaging, but the sample size was smaller with analyses of a single section on the T1C sequence rather than the analysis of the whole tumor, and the data imbalance was not of concern [40]. Due to the relatively low incidence of high-grade meningiomas, data imbalance in meningioma grading is inevitable. Imbalanced datasets can potentially cause a negative effect on fitting of machine learning classification models [30, 48]. In our study, data imbalances were mitigated by using state-of-the-art subsampling techniques that downsample the majority class and synthesize new data points in the minority class, which are recognized as suitable strategies in machine learning [48, 49]. The performance was mostly improved by both ROSE and SMOTE; the best performance was achieved by a combination of feature selection by RFE and SVM classification algorithm with SMOTE, and the accuracy was higher than the no-information rate accuracy. Also, we have assessed the precision-recall curve, which is known to be more informative than the receiver operating characteristic curve for imbalanced datasets [50].

In the present study, various texture and morphology features were correlated with the mitotic count and the Ki-67 labeling index. Previous studies have shown discrepant results regarding the relationship between ADC values (normalized or minimum ADC ratios or mean ADC values) and the Ki-67 labeling index [9, 25, 51]. In our study, there was no significant correlation between the mean ADC and the Ki-67 labeling index (according to Supplementary Table 4). However, various texture and morphology features were significantly correlated with the mitotic count and the Ki-67 labeling index. Mitosis is a complex biological process in which the cell undergoes various morphological transformations [52]; these changes may contribute to the various correlations with the texture and morphology features.

In our study, the texture features were significantly different according to fibroblastic and non-fibroblastic subtypes of meningiomas. Previous studies have shown that DTI-based measurement of FA may aid in predicting subtypes of benign meningiomas [53]. The high degree of anisotropy within fibroblastic meningiomas has been attributed to their high content of intercellular fibers and the fascicular orientation of long spindle-shaped tumor cells [54, 55]. Another study showed that the fibroblastic meningiomas are less vascular than meningothelial meningiomas [56], which may be related to differences in T1C image features. Previous studies to classify meningioma subtypes by texture analyses of histopathologic specimen slides have proven effective [57, 58], and we speculate that tumor imaging may be also effective in classifying meningioma subtypes because of the reflection of underlying histology on MRI [59]. Further studies with larger numbers of meningioma subtypes are required for texture analyses in preoperative differentiation of meningioma subtypes.

Our study had several limitations. First, it was based on a single-center, retrospectively collected dataset. Further studies with external validations are needed for evaluating the generalizability of the machine learning classifiers. Second, b = 600 s/mm2 was used, rather than b = 1000 s/mm2 or higher, to perform DTI. This low b value imaging protocol was initially set up at our hospital due to technical requirements within reasonable acquisition time. Since then, the protocol has not changed in order to obtain subsequent patients with the same protocol. This low b value may have resulted in the overestimation of ADC values because of perfusion effects. However, this would likely have a small influence on the analysis of the differences between meningioma grades, since the entire pixels are affected. Third, only the enhancing portion of the tumor was segmented and evaluated in order to assess both DTI and T1C sequences. Although most studies including ADC or FA maps have focused on the enhancing solid portion of the tumor, other studies including T1C have included the nonenhancing area in the tumor [40, 60]. Further study is indicated to compare the results in different tumor segmentation masks, especially in cases of microcystic meningiomas.

Conclusions

Radiomics feature-based machine learning classifiers of the conventional T1C imaging and DTI may be helpful for the differentiation of low-grade and high-grade meningiomas.