Introduction

Unruptured intracranial aneurysms are detected with greater frequency with the improvement of modern imaging technique, such as digital subtraction angiography, magnetic resonance angiography, and 3-dimensional computerized tomography angiography (CTA) [1]. Aneurysm rupture is a devastating disease and is associated with a high morbidity and mortality [2, 3]. The management of these unruptured intracranial aneurysms is still controversial because of the treatment-related fatality and morbidity [4, 5]. Therefore, determining the aneurysm rupture risk is of great importance to make treatment decision for unruptured intracranial aneurysms.

In recent years, many risk factors have been found to be associated with aneurysm rupture. These risk factors include clinical characteristics, such as age, smoking, and aneurysm multiplicity [6,7,8], and aneurysm morphological features, such as aneurysm size, aspect ratio, and wall irregularity [9,10,11,12,13,14]. However, the relationship among the risk factors and aneurysm rupture risk is complex; accurately predicting the aneurysm rupture risk is challenging. As a powerful and promising tool, machine learning technique is capable of identifying complex relationships in large datasets [15, 16], which makes it suited to the task of aneurysm rupture risk evaluation. In recent years, machine learning has been applied to aneurysm risk prediction. A two-layer feed-forward neural network was proposed to predict the rupture risk of anterior communicating artery aneurysms and achieved a good prediction accuracy [17]. Linear and ridge regression models were developed to predict the stability of 420 aneurysms at different locations using morphological features extracted from PyRadiomics [18]. Most recently, support vector machine, random forest, and feed-forward artificial neural networks have been adopted for aneurysm stability assessment [19]. All the aforementioned machine learning algorithms were applied to either anterior communicating artery aneurysm or aneurysms at several locations. Aneurysm rupture risk was location dependent [20]. Few studies currently focus on the aneurysm rupture risk assessment using a large dataset of middle cerebral artery (MCA) aneurysms, although approximately 35% of all intracranial aneurysm are MCA aneurysms [21]. Moreover, these machine learning models behaved like black boxes and could not be interpretable, which might limit their application in clinics. Therefore, the purpose of this study was to evaluate the MCA aneurysm rupture risk using a machine learning method and provide a nomogram that can be practically utilized for clinical applications.

Materials and methods

Patients

This study was approved by our hospital ethics committee, and written informed consent was waived. A total of 472 patients with MCA aneurysm were found from the Electronic Medical Record System from January 2009 to October 2016. Patients were excluded if they had fusiform MCA aneurysms, or Moyamoya disease, or arteriovenous malformations. To ensure the accuracy of morphology measurement, we excluded those with poor image quality.

Image acquisition and morphology definition

We adopted three CT scanner for image acquisition, including a 320-detector row CT scanner (Aquilion ONE, Toshiba Medical Systems, Tochigi, Japan) with a section thickness of 0.5 mm and a reconstruction interval of 0.5 mm, a 64-channel multidetector CT scanner (LightSpeed VCT 64, General Electric Medical Systems, Milwaukee, WI, USA) with a section thickness of 0.625 mm and a reconstruction interval of 0.625 mm, and a 16-channel multidetector CT scanner (LightSpeed pro 16, General Electric Medical Systems, Milwaukee, WI, USA) with a section thickness of 1.25 mm and a reconstruction interval of 0.625 mm.

We reconstructed three-dimensional aneurysms from CTA images using GE Workstation ver. 4.6 and manually measured lengths and angles related to aneurysm features. Figure 1 illustrates the measurement of vessel size, aneurysm height, aneurysm width, perpendicular height, flow angle, vessel angle, parent-daughter angle, and aneurysm angle. Vessel size was determined as the average cross-sectional diameter of all arteries associated with the aneurysm. The cross-sectional diameter of a particular vessel diameter was measured by averaging the cross-sectional diameter of the artery proximal to the aneurysm neck (Via) and the cross-sectional diameter at 1.5XVia away from the aneurysm neck (Vib) (see Fig. 1), i.e., (Via+Vib)/2, i = 1, 2, or 3. Aneurysm size is the largest cross-sectional length of the aneurysm dome. Aneurysm height is the greatest distance from the center of the aneurysm neck to aneurysm dome, whereas the perpendicular height is the largest perpendicular distance from the center of aneurysm neck to the aneurysm dome. Aneurysm width is the maximal diameter perpendicular to the aneurysm height. Height-width ratio was calculated as the ratio of aneurysm width to aneurysm neck. Aspect ratio was defined as the ratio between the perpendicular height and the aneurysm neck size. Size ratio was calculated as the ratio of the aneurysm height to the vessel size. Bottleneck factor is the ratio between aneurysm width and neck size. In addition, 4 angle-related parameters were measured. Flow angle is the angle between the aneurysm height line and the vector of the blood flow in the parent artery; vessel angle is the angle between the vector of the blood flow and the aneurysm neck line; aneurysm angle represents the angle between the aneurysm neck line and the aneurysm height line; parent-daughter angle is the angle between the vector of the blood flow through the daughter vessel and the vector of the blood flow through the parent vessel.

Fig. 1
figure 1

Measurements of aneurysm morphological features: Via and Vib (i = 1, 2, 3), cross-section diameter of the artery; Hmax, aneurysm height

Aneurysm lobulations include three types, i.e., regular, irregular, and daughter-sac types [17]. Locations of MCA aneurysms were divided into 4 groups: (1). aneurysms arising on the main trunk (M1) of the MCA, between the bifurcation of internal carotid artery and the main MCA bifurcation, at the origin of lenticulostriate arteries (M1-LSAAs), (2) aneurysms arising on M1 at the origin of early cortical branches (M1-ECBAs), (3) aneurysms at the main MCA bifurcation (MbifAs), and (4) aneurysms distal to the main MCA bifurcation on M2, M3, or M4 segments (MdistAs) [22]. The main projection of the MCA aneurysm [12] was determined as follows: in the axial CTA view, the aneurysm dome was grouped as anterior (in front of the MCA), posterior (behind the MCA), or neutral (neither anterior nor posterior projection); in the coronal CTA view, the aneurysm dome was categorized as superior (toward frontal lobe), inferior (toward temporal lobe), or neutral (neither superior nor inferior projection).

Model development

In this study, 80% of the included aneurysms were randomly chosen as the training dataset, and the rest were used for test. Number of features was reduced by evaluating the correlation between the features and the class, and the remaining features were further filtered by using univariate analysis. The following features were finally chosen: multiplicity, aneurysm lobulations, size ratio, bottleneck factor, height-width ratio, and aneurysm angle. A generalized linear model with logit link (https://www.rdocumentation.org/packages/stats/versions/3.6.2/topics/glm) was developed using the selected features. The prediction accuracy of the model was quantified by the sensitivity, specificity, and the area under the receiver operating characteristic (ROC) curve (AUC). For the purpose of clinical application, we further constructed a nomogram on the basis of the developed model. A nomogram represents a graphical calculation instrument that facilitates the approximate computation of a mathematical function via intersecting lines and is designed to extract the maximum amount of information from data and provide a quick approximation of the output probability [23, 24]. R statistical software (version 3.6.1) was used for model development.

Statistical analysis

Categorical variables were presented as the frequency (percentage) and continuous variables as mean ± standard deviation. Student t-test or Mann-Whitney U tests were adopted for comparisons of continuous variables, and Fisher exact tests or χ2 tests were applied for comparisons of categorical variables. A P value less than 0.05 was considered statistically significant. All data was analyzed using the software package SPSS 22.0 (IBM Corp, Armonk, New York).

Results

Baseline characteristics

A total of 403 patients were enrolled in this study. Of these patients, the mean age was 56.5 years old, and 231 (57.3%) were women. One hundred and thirty patients had multiple aneurysms. Table 1 shows patients’ baseline information between the ruptured and unruptured MCA aneurysms. Patients with ruptured aneurysms were significantly more likely to be younger (55.1 vs. 59.5 years old), whereas patients with unruptured aneurysm were significantly more likely to have multiple aneurysms (p < .001). More female patients and more smoking patients had ruptured aneurysms, although the differences were not statistically significant. The main MCA bifurcation was the most common location for both ruptured and unruptured aneurysms, and patients with aneurysms at the main MCA bifurcation were more likely to be ruptured.

Table 1 Baseline characteristics

Morphologic characteristics between ruptured and unruptured MCA aneurysms

Table 2 demonstrates the comparison of morphological characteristics between ruptured and unruptured aneurysms. Aneurysm size, aneurysm height, and perpendicular height were significantly higher in ruptured aneurysms than in unruptured ones (P<0.05), which implied that larger aneurysms were prone to be unstable. Four ratios, including aspect ratio, size ratio, bottleneck factor, and height-width ratio, were all significantly higher in ruptured aneurysms than in unruptured ones (P<0.05). Aneurysm angle and flow angle were significantly different between the ruptured and unruptured aneurysms. Aneurysm lobulations was significantly different between the ruptured and unruptured aneurysms. Unruptured aneurysms were more frequently regular type (70.4% versus 46.4%), whereas ruptured aneurysm were more frequently daughter-sac type (29.5% versus 10.4%).

Table 2 Morphological characteristics between ruptured and unruptured MCA aneurysm

Nomogram for predicting rupture risk of MCA aneurysm

The generalized linear model adopted six variables, including aneurysm multiplicity, aneurysm lobulations, size ratio, bottleneck factor, height-width ratio, and aneurysm angle. Figure 2 represents the use of a nomogram to visually interpret the developed model. Application of this nomogram is simple. One can read the scoring points from the “Point” reference line according to the variable values, including aneurysm multiplicity, aneurysm lobulations, size ratio, bottleneck factor, height-width ratio, and aneurysm angle. The total points can be obtained by summing the points of each variable. According to the total points, the predicted probability of rupture risk can be read at the bottom “Probability” line.

Fig. 2
figure 2

Nomogram to predict the rupture risk of MCA aneurysms. Aneurysm multiplicity: 1 (single aneurysm) and 2 (multiple aneurysms). Aneurysm lobulations: 1 (regular type), 2 (irregular type), and 3 (daughter-sac type)

Table 3 summarizes the prediction performance of the generalized linear model. Figure 3 illustrates the ROC curves of the developed model for both training and test datasets. The model achieved a sensitivity of 0.73 and a specificity of 0.71 in the training dataset and a sensitivity of 0.75 and a specificity of 0.72 in the test dataset. The areas under ROC curves were 0.77 (95% CI, 0.71–0.83) and 0.76 (95% CI, 0.65–0.88) for the training and testing datasets, respectively.

Table 3 Prediction accuracy of the developed model
Fig. 3
figure 3

Receiver operating characteristic (ROC) curves developed for (a) training dataset and (b) testing dataset

Discussion

In this study, we measured the morphological features of MCA aneurysm and developed a generalized linear model to predict the rupture risk of MCA aneurysm based on the measured morphological features. The model achieved a good prediction performance with AUC of 0.77 in training dataset and 0.76 in testing dataset. A nomogram was further provided to facilitate the practical application of this model in clinics.

Six features, including aneurysm multiplicity, aneurysm lobulations, size ratio, bottleneck factor, height-width ratio, and aneurysm angle, were adopted in our prediction model. Multiple aneurysms were more commonly seen in unruptured aneurysms [18], and aneurysm multiplicity was significant factor for rupture of small aneurysm [7]. Aneurysm lobulations reflect the irregularity of an aneurysm dome. Our study found that the presence of a daughter-sac was more common in ruptured aneurysms. Previous study [25] showed the strong correlation between the presence of a daughter-sac and the rupture of MCA aneurysms. The presence of a daughter aneurysm was a likely path to aneurysm rupture. A mathematical model revealed that the tensile stress in the daughter aneurysm wall first decreased to protect against rupture as the daughter aneurysm developed, and further growth of the daughter aneurysm resulted in an increase of stress and eventually led to rupture [26]. Size ratio, bottleneck factor, and height-width ratio have been reported to be related to aneurysm rupture [12]. In our study, all these three ratios were significantly higher in ruptured than in unruptured MCA aneurysms. Aneurysm angle has been shown to be a promising morphological metric for intracranial aneurysm rupture risk assessment [27].

Great efforts have been exerted for evaluating the aneurysm rupture risk. Morphological analysis, hemodynamic analysis, and clinical factor assessment have been performed to find the risk factors of aneurysm rupture [27,28,29]. A few approaches, such as scoring system and machine learning models, have been proposed to predict the aneurysm rupture risk using these risk factors. A PHASES score system [6] has been developed from a prospective cohort study. This score system used age, hypertension, a history of subarachnoid hemorrhage, aneurysm size, aneurysm location, and geographical region as predictors and was found to be an easily applicable aid for the aneurysm rupture analysis. Recent studies [30, 31], however, demonstrated that the PHASES score might only provide a weak tool for evaluating aneurysm rupture risk, and parameters beyond the features of the PHASES score might be needed to improve the prediction accuracy. With the advance of artificial intelligence, machine learning technique has been applied for aneurysm rupture risk prediction. A feed-forward artificial neural network was successfully used to predict the rupture risk of anterior communicating artery aneurysms based on patients’ basic characteristics and aneurysm morphological features [17]. Random forest, linear support vector machine, and radial basis function kernel support vector machine models were applied to classify aneurysm rupture status at different locations, and it was found that aneurysm location and size were the strongest predictors of aneurysm rupture and machine learning models were robust tools in predicting aneurysm rupture risk [32]. Linear, ridge, and lasso regression models were developed to predict the stability of 420 aneurysms at different locations based on PyRadiomics-derived morphological features; it was found that the extracted features were useful for aneurysm stratification and the developed models could be used for the prediction of aneurysm stability [18]. Recently, a random forest machine learning algorithm was applied to classify the rupture status of 226 intracranial aneurysms at different locations using only morphologic variables (model 1), only hemodynamic parameters (model 2), and both morphologic and hemodynamic parameters (model 3); corresponding accuracy was 77.0% for model 1, 71.2% for model 2, and 78.3% for model 3 [33]. These machine learning methods mentioned above were applied in either anterior communicating artery aneurysm or aneurysms at different locations. To date, application of machine learning technique to predict the rupture risk of MCA aneurysm has been rarely reported. In this study, we enrolled 403 MCA aneurysms. A generalized linear model was constructed to predict the rupture risk of MCA aneurysm, and a relatively good prediction performance was achieved. More important, a nomogram, as a model visualization figure, provided a simple and easy tool for the practical application of the developed model in clinics.

Limitations

There were several limitations in this study. First, this was retrospective and single-center study, and only Chinese population were involved. Previous research has shown that aneurysm rupture risk is population dependent [6]. Generalization of our model to other population may not be satisfied. Second, aneurysm growth is a dynamic process, and aneurysm may evolve before rupture. However, we did not perform long-term follow-up study to monitor unruptured aneurysm over time because of the ethical dilemma and patients’ safety issue [34]. Third, we did not test our model using external dataset. Further external validation in a large independent population is still required. Finally, our model includes only radiological data regarding the aneurysms, and no clinical data are included, which may limit the application of our model. Many clinical factors have been reported to be associated with intracranial aneurysm rupture [35, 36]; however, only several clinical parameters were considered in this study due to the retrospective design. Future work considering more clinical data is still needed. Nevertheless, our work may be considered as a starting point for the development of further clinical-radiological models that may guide physicians to make treatment decision for each single patient.

Conclusions

In summary, we examined the risk factors from a large MCA aneurysm dataset and developed a classification machine learning model to predict rupture status of MCA aneurysms. In addition, we constructed an easy to use nomogram tool for the practical application purpose, which might facilitate the treatment optimization.