Introduction

Glioblastoma (GBM) is the most frequent malignant primary brain tumour and the most lethal type, with a median survival of 14.6 months for those patients receiving the standard of care, i.e. maximal safe surgery plus radiotherapy and chemotherapy [1]. MRI is routinely used for diagnosis, treatment planning, response evaluation, and follow-up.

Typical GBM appearance upon diagnosis on MRI consists of an enhancing ring mass with central non-enhancing core of necrosis observed mainly on contrast enhanced T1-weighted images; this is surrounded by an area of signal hyperintensity on fluid-attenuated inversion recovery (FLAIR) or T2 images representing oedema that is well known to contain infiltrated tumour cells.

Recently there has been an increased use of advanced imaging techniques to characterize the connection of the so-called radiophenotype with the tumour genotype (see e.g. recent reviews for GBM [2, 3]). However, the use of those techniques requires further research and validation to achieve broad clinical applicability. Thus, contrast enhanced T1 and T2/FLAIR are still the gold standard for diagnosis and treatment planning [4].

Some works have studied geometrical measures measured on pretreatment T1 and FLAIR images as prognostic “biomarkers”. Specifically, Iliadis et al. [5] analyzed volumetric data of 50 GBM patients to confirm the importance of age and performance status in progression-free survival (PFS) and overall survival (OS) of patients, while volume was not significant. Mazurowski et al. [6] showed the association of survival in 68 GBM patients with the proportion of enhancing tumour and major axis length. Zinn et al. [7] used tumour volume, patient age, and Karnofsky performance status (KPS), to develop a simple 3-point scoring system, the Volume-Age-KPS (VAK) classification, classifying patients into groups with significant survival differences. Other studies have added sets of imaging descriptors to the usual age and KPS to improve the predictive power of survival models for GBM patients with modest results [8] or combined several imaging features using data mining of imaging variables [9].

Pérez-García et al. [10] developed a mathematical model predicting a positive correlation between the spherical rim width and the tumour’s biological aggressiveness, which would have an impact on overall survival.

Our purpose in this study was to study the prognostic potential of full three-dimensional geometrical measures of postcontrast T1 MRIs in a large set of GBM patients with high-resolution images in the absence of previous treatment.

Patients and methods

Patients

A retrospective study including patients from six medical institutions was organized. The respective ethical committees approved the study. Patients with pathologically confirmed diagnosis of GBM diagnosed in the period 2006–2014 were included in the study. The inclusion criteria were availability of the relevant clinical variables and availability of pretreatment postcontrast T1 MRI. Multifocal GBMs without contrast enhancement (CE) and very diffuse tumours with no clear boundaries were excluded from the study. A total of 117 patients complied with inclusion criteria. Patients’ characteristics are summarized in Table 1.

Table 1 Summary of patient characteristics, MRI data, and volumetric parameters for the 117 patients included in the study

The extent of resection was determined from the postcontrast T1 MRI usually within 48 h after surgery. Gross total resection was defined as the absence of visible CE.

PFS and OS were assessed. Recurrence-free patients at last follow-up were considered censored events for the PFS analysis. Overall survival was measured from time of surgery to death. Patients who were still alive at last follow-up were considered censored events for the OS analysis.

Image acquisition parameters

Two different field strengths were used to obtain the MRIs of this retrospective study: 1.5 Tesla (100 patients) and 3 Tesla (17 patients). MRIs were of high spatial resolution as shown in Table 1. As to the other acquisition parameters, they satisfy the recent consensus recommendations for standardized brain tumour imaging [11].

Image analysis

The pretreatment postcontrast T1 MRI Digital Imaging and Communication in Medicine (DICOM) files were imported into the scientific software package Matlab (R2015b, The MathWorks, Inc., Natick, MA, USA) and pre-processed using a semi-automatic image segmentation procedure considering two possible regions: the enhancing tumour and the inner tumour region. Then, tumour segmentations were manually corrected slice by slice. Figure 1 summarizes the segmentation process.

Fig. 1
figure 1

Image segmentation procedure: tumours described by the MRI slices (a) were semi-automatically segmented and then manually corrected by an image expert. The resulting segmented slices (b) are then joined to construct a full 3D tumour (c)

Geometrical measures

After the segmentation, a set of geometrical 3D measures were computed automatically. First, we calculated volumetric measures on the reconstructed tumours: the CE volume (V CE ), the volume surrounded by the CE areas or inner volume (V I ), and the total postcontrast T1 tumour volume (V = V CE  + V I ). In most cases the inner volume corresponds to necrotic areas. Also, the maximum tumour diameter in 3D (d max 3D) and some measures of the tumour surface irregularity were computed.

The average size for the CE rim was called spherical rim width (δ s ) because it can be obtained from a spherical approximation from the total and CE volume, using the formula.

$$ {\delta}_s=\sqrt[3]{\frac{3}{4\pi }}\left(\sqrt[3]{V}-\sqrt[3]{V_I}\right)\approx 0.62\left(\sqrt[3]{V}-\sqrt[3]{V_I}\right) $$

In order to characterize the CE rim structure we constructed the surfaces enclosing the CE areas (inner and outer). For every point on these surfaces the minimal distance to the other surface was computed, providing a large set of distances that was used to construct histograms of rim sizes. This histogram was characterized by many measures (median, mean, positions of the quartiles, etc.). Of them, the most relevant found was the geometric heterogeneity of the CE rim width (G H ). This is defined as the difference between the values of quartiles 3 and 4 of the distribution of CE rim widths divided by quartile 4, i.e..

$$ {G}_H=\frac{\left({\delta}_4-{\delta}_3\right)}{\delta_4} $$

The geometric heterogeneity uses the difference between the values of quartiles 3 and 4, i.e. the size of the region of the most asymmetric distances of the tumour, divided by the longest rim width in the tumour. This is a normalized measure of the tumour’s asymmetry with values in the range [0,1]. Figure 4 provides visual examples of the meaning of the most relevant geometrical parameters.

Statistical methods

To identify parameters associated with prognosis we used Kaplan-Meier curves and log-rank analysis. Kaplan-Meier curves measure the fraction of patients living for a certain amount of time [12]. An important advantage of this method is that it can take into account some types of censored data, particularly if a patient withdraws from a study, is lost to follow-up, or is alive without event occurrence at last follow-up. It can be calculated for two groups of subjects, showing their statistical difference in the survival. A 2-tailed significance level (p-value) of p < 0.05 was used.

We searched for robust thresholds separating patient populations in subgroups with significant differences in terms of OS and PFS. To find the optimal thresholds, we computed the p-values for the full range of thresholds of each geometrical variable with subgroups of at least 30 patients and looked for the minimum p-value. Only minima located in low-p regions of the parameter space were considered in order to discard purely statistical fluctuations.

We also computed the hazard ratio (HR) as indicator of risk by using a single-variable Cox proportional hazards regression analysis.

We also computed the correlations between the significant variables of the previous analysis. Normality of the variables was analyzed using Kolmogorov-Smirnov test. Because of the results of the normality analysis, Spearman's correlation coefficient was chosen to study the relation between independent quantitative variables.

We used multivariate proportional hazard Cox analysis with a stepwise method in order to construct a predictive model. To guarantee the robustness of the geometric significant variables, we performed two different types of analysis for both the OS and the PFS. The first one including all the significant geometrical variables and the second one also including the age in order to evaluate its effect in the multivariable model. SPSS software (v. 22.0.00) was used for the statistical analysis.

Results

Overall survival

The best thresholds for the total volume, CE volume, geometric heterogeneity and spherical rim width were 15.6 cm3 (p = 0.034), 10 cm3 (p = 0.017), 0.6146 (p = 0.015), and 4.4 mm (p = 0.007), respectively (see Table 3). The increases in the median survival times for the favourable subgroups were 133 days (V < 15.6 cm3), 199 days (V CE <10 cm3), 155 days (G H  < 0.6146), and 213 days (δ s  < 4.4 mm). Figure 2 shows the Kaplan-Meier survival curves for each of those significant variables.

Fig. 2
figure 2

Kaplan-Meier curves for the significant geometrical variables for OS. The median difference, the p-value, and the number of patients in each subgroup are shown for: a the total volume V, b the CE volume V CE , c the spherical rim width δ s , and d the geometric heterogeneity G H

Progression free survival

The geometric heterogeneity (p = 0.012) was statistically significant in terms of progression-free survival. However, the total volume (p = 0.093), the CE volume (p = 0.075), and the spherical rim width (p = 0.077) were only marginal predictors of PFS.

Spherical rim width (δ s )

The threshold for the most significant parameter, the spherical rim width (δ s ), was found to be 4.4 mm. Interestingly, a broad range of threshold values ranging from 3.7 mm to 5.1 mm were found to provide significant results (p < 0.05). Within this interval, patient subgroups ranged from 37 (below threshold) and 80 (above threshold) patients to 75 (below threshold) and 42 (above threshold) patients. Thus, δ s was not only the most significant geometrical variable for OS but also the most robust one.

Geometric heterogeneity of the rim width (G H )

The only other significant geometrical measure according to the Kaplan-Meier analysis for both OS and PFS was the geometric heterogeneity (G H ). The threshold value found was 0.6146 and the groups were well balanced with 68 and 49 patients, respectively. The median OS in both subgroups were 509 days (patients with G H  < 0.6146) and 354 days (G H  > 0.6146). Results for PFS were 351 (G H  < 0.6146) and 229 (G H  > 0.6146) days, respectively.

Classification in terms of prognosis

We classified patients in terms of prognosis by considering three groups: double positive (+,+) = (δ s and G H above the threshold), double negative (-,-) = (δ s and G H below the threshold), and single positive (+,-) = (δ s above and G H below or vice versa). We found significant OS differences between the first two groups (Fig. 3, p < 0.001). The double positive group of patients had the worse prognosis. There were no long-term double positive survivors while 40 % of double negative patients lived for more than 24 months. In general, single positive patients (whatever of both imaging biomarkers was positive) had better prognosis than double positive patients and worse than double negative patients (Fig. 3). Figure 4 shows four different tumour geometries in terms of the four previous groups.

Fig. 3
figure 3

Kaplan-Meier curves for the significant variables spherical rim width (δ s ) and geometric heterogeneity (GH) considering three different groups: double positive patients (+,+)=(δ s and G H above the threshold), double negative (-,-)=(δ s and G H below the threshold), and single positive (+,-)=(whatever imaging biomarker δ s + or G H + is positive). The median difference and p-value correspond to the comparison between the (+,+) and the (-,-) curves

Fig. 4
figure 4

Four different tumour geometries in terms of the δ s and G H possible configurations: a corresponds to a tumour characterized by high spherical rim width and high geometric heterogeneity (bad prognosis), b shows a tumour characterized by high spherical rim width and low geometric heterogeneity, c corresponds to a tumour characterized by low spherical rim width and high geometric heterogeneity, and d shows a tumour characterized by low spherical rim width and low geometric heterogeneity (good prognosis)

Correlations and multivariable cox analysis

We computed the Spearman’s correlation coefficient between every pair of variables and the associated p-values. The geometric heterogeneity (G H ), spherical rim width (δ s ), and age, all significant in the Kaplan-Meier analysis (Table 2), are mutually uncorrelated. The volumetric variables (V CE , V I , V, d max 3D) were strongly correlated.

Table 2 Summary of univariate Cox and Kaplan-Meier analysis for the more representative variables included in the study

The results of the multivariable Cox analysis are summarized in Table 3. We first performed an analysis including all the significant geometrical variables of Table 2. For OS, the geometric heterogeneity (G H ) and the spherical rim width (δ s ) were significant in terms of survival. The hazard ratios were 1.588 (G H ) and 1.696 (δ s ). For PFS, the only variable remaining significant in the multivariable model was the geometric heterogeneity (HR = 1.672).

Table 3 Results of the multivariate Cox analysis

As a second step, we performed the same multivariable analysis using age as adjusting factor. Interestingly, only the same two geometrical variables were significant for both the OS and the PFS analysis. For OS, the hazard ratios were 1.036, 1.570, and 1.536 for age, geometric heterogeneity (G H ), and spherical rim width (δ s ), respectively.

Discussion

Previous studies have identified MRI-derived variables to be associated with survival. Some of the more relevant ones are the ratio of T2-FLAIR image signal to T1 image tumour volume [13], the volume of enhancing tumour crossing the corpus callosum [14], the CE tumour volume [9, 14], the proportion of enhancing tumour [6], and the longest axis length of tumour [6, 15].

However, to our knowledge, most previous analyses have been either qualitative, or when quantitative the analysis has been two-dimensional. Moreover, most studies have been carried out using limited resolution images. No study has tried to characterize fully the complex 3D nature of irregular rim-shaped CE areas present in GBMs. The need for that kind of 3D study using high-resolution data to obtain reliable conclusions has been put forward recently [11]. A very recent study has analyzed a series of imaging features on 3D images and has identified several significant ones including the volume, CE volume and major axis length [16].

In comparison to previous studies: (i) our dataset of 117 patients was larger than that of most previous studies (typically below 100). (ii) Only high-resolution MRIs were included, thus reducing noise due to large voxel size and/or voxel interspacing. (iii) We included only pretreatment images to avoid post-therapy imaging artefacts/distortions. (iv) The analysis was fully 3D on the reconstructed tumours. In addition to the more “classical” volumetric measures of the different regions [17], we included the maximal 3D diameter (instead of the major axis), a measure of the tumour’s surface, and the full set of geometrical measures characterizing the geometry of the CE rim as described in the methods section.

Since our study was based on high resolution MRIs and our main interest was on the geometry of CE regions we considered only postcontrast preoperative T1 images. Some qualitative features of T2/FLAIR have been found in previous studies to have a limited prognostic value. However, those image modalities are not available in sufficient resolution to allow for an accurate 3D reconstruction of the tumour.

It has been hypothesized that the tumour surface may play a relevant role in tumour progression [18]. Up to the spatial resolution of our study, the irregularity of the tumour surface was neither a predictor of OS or PFS as evidenced by the non-significance of the corresponding geometric parameter.

The Kaplan-Meier analysis demonstrated that the total volume, CE volume, geometric heterogeneity, and spherical rim width were significant variables. Regarding volume, the threshold found was small (15.6 cm3), which means that small tumours have better prognosis, probably because a lower biological aggressiveness, reduced treatment side effects and much larger fraction of completely resected tumours (64 % vs. 46 % in our dataset).

It is remarkable that the simplest geometrical measure of the rim size, the spherical rim width (δ s ), was the outstanding parameter for the OS analysis, able to classify the patient population into two subgroups with the highest statistical signification (p = 0.007) and the largest different between median survivals (213 days). It is very interesting that the computation of this novel prognostic indicator is very simple, since only volumetric measures of the CE and necrotic regions are required. As the threshold for this variable was very flexible, this measure is a powerful imaging biomarker predicting survival.

The differences in treatment between the two subgroups (δ s + and δ s ) were smaller than in the case of the groups obtained from total volume (or diameter) as cutoff variables. δ s patients received complete resections in 58 % of the cases and 83 % of these patients received chemo-radiotherapy (Stupp regime). δ s + patients’ tumours were completely resected in 49 % cases and 73 % of these patients received chemo-radiotherapy. The differences in treatment between both subgroups cannot explain the large differences observed between the median survivals. This points to essentially different biological behaviour between both subgroups (i.e. not only a “size” effect).

The mathematical model of Perez-Garcia et al. [10] allows computing the tumour’s growth speed as a function of several biological parameters: the tumour cell’s average infiltration speed, the proliferation rate, and the tumour cell loss factor rate. It turns out that the rim width has a similar dependence on the parameters as the tumour’s growth speed. Thus, based on the mathematical model, one would expect tumours with large rims to grow faster than those with small rims. This would have direct implications on survival. Our results corroborate the predictions of the mathematical model.

The other significant geometrical variable was the geometric heterogeneity (G H ). The threshold value found (0.6146) points to a bad prognosis of tumours with more than 60 % of the full range of possible width values occupied by the top 25 % of the measures. The median survival of G H , low geometric heterogeneity patients, is 5 months longer than that of G H + patients. One may expect a tumour to present this kind of heterogeneity either because of the existence of underlying anatomical structures (i.e. fibre tracts) supporting growth and allowing infiltration in a specific direction or because of the presence of regions with more aggressive tumour cell phenotypes. The fact that the percentage of total resections is substantially smaller in the large G H group (60 % vs. 43 %) supports that at least a fraction of those tumours grow along fibre tracts, making complete resection without functional loss less feasible. It may happen that these patients benefit from either more aggressive resections around finger areas or radiotherapy dose escalation whenever possible in order to target those potentially more harmful cells.

The geometric heterogeneity (G H ) and spherical rim width (δ s ) parameters were identified by the multivariate Cox analysis as the most relevant geometrical variables (either age-adjusted or not), in line with the results from the Kaplan-Meier analysis. In fact, their combined use leads to the definition of a set of doubly negative patients (G H and δ s ) with the longest median survival and doubly positive patients (G H + and δ s +) with almost 1 year less median survival. Moreover, there were almost no long-term survivors in the double positive group.

As a final comment, the main interest of the analysis developed in this paper is the definition of simple geometric quantities of direct prognostic significance. A prospective study design is needed to validate the utility of these imaging biomarkers. Several recent works have studied the potential of complex sets of bidimensional (2D) or aggregated 2D image-based features to predict molecular subtypes in GBM patients [18, 19]. Genomic data was not available for our patient set. However, it is interesting that several geometrical measures were powerful predictors of survival. Their combination, together with other relevant variables (e.g., age, KPS) might be used to develop a classification of patients in terms of prognosis. The main advantage would be the simplicity of the approach in comparison with more complex sets of image features used to classify patients in some recent works [18, 19]. Because of their relevant prognostic value the geometric variables found to be significant in this study could also help to predict a benefit from surgery and other therapies.

Conclusions

Our analysis of geometrical measures of postcontrast T1 pre-treatment images found the total volume, CE volume, spherical rim width, and geometric heterogeneity of the CE rim as individual predictors of overall survival for GBM patients. Of them spherical rim width and geometric heterogeneity were the two most significant ones. Small values of those variables led to an improved survival as predicted by previous mathematical models. Small spherical rim widths and/or low geometric heterogeneity were associated with improved survival on multivariate analysis and allowed to define three subgroups: double positive, double negative, and mixed with different outcomes.

Thus, the two novel measures of the tumour’s CE rim in T1+Gd MRI images have a strong individual and combined prognostic value, reflecting different biological behaviour between the subgroups.