Introduction

Despite significant advances in diagnostics and therapy over the last decades, the prognosis for patients with glioblastoma (GBM), the most frequent form of glioma and the most aggressive and lethal primary brain tumour, remains dismal (median survival 15–20 months) [1,2,3]. Currently, the proposed first-line therapy for patients with GBM is a maximally safe resection followed by radiotherapy plus concomitant and adjuvant temozolomide chemotherapy [1, 4]. Importantly, the extent of the tumour resection is associated with improved overall survival, i.e. a “complete” resection (currently defined as a lack of contrast enhancement on the early postoperative MRI within 24–48 h of surgery [5]) is superior to resection smaller than the area of contrast enhancement or biopsy only [6,7,8]. Consequently, contrast-enhancing tumour portions on MRI are the target of neurosurgical resection in the majority of patients [4, 9]. Similar to neurosurgical resection planning, radiotherapy target volume definition is also often based on contrast-enhanced MRI [10]. In stark contrast, several studies have revealed that a considerable number of glioma cells, or even the main tumour burden, may not show contrast enhancement, particularly in patients with GBM [11,12,13]. Furthermore, comparison of the histopathological findings in whole-brain sections from patients with brain tumours with MRI data has shown that the extent of malignant gliomas is not correctly delineated by conventional MRI [14]. Finally, the precise delineation of glioma tissue is particularly problematic if no contrast enhancement is present, as is frequently observed in low-grade gliomas, and even 30–40% of anaplastic gliomas show no contrast enhancement [15]. In rare cases, this also occurs in patients with GBM [16, 17]. In such patients, the hyperintensity on T2-weighted or fluid-attenuated inversion recovery (FLAIR) MRI is frequently used for treatment planning [18].

Over the past years, several studies have shown that PET using radiolabelled amino acids detects tumour parts that show no contrast enhancement and provides better delineation of gliomas before treatment planning [19,20,21,22]. A number of amino acid PET studies using l-methyl-[11C]-methionine (MET) or O-(2-[18F]-fluoroethyl)-l-tyrosine (FET) have evaluated the correlation between imaging findings and histology. The results suggest reliable detection of glioma tissue independent of the integrity of the blood–brain barrier [19, 23,24,25]. Consistent with these findings, a post-mortem study compared MET PET and contrast-enhanced CT findings with whole-brain histopathology in a patient with an anaplastic astrocytoma. MET PET was able to correctly identify the full tumour extent while contrast-enhanced CT missed more than 50% of the tumour mass [26].

To the best of our knowledge, the fraction of untreated GBM that shows significant differences between contrast enhancement, FLAIR hyperintensity and FET accumulation, and whether this is relevant in clinical practice, remains to be investigated. To this end, we performed a systematic volumetric comparison between FET uptake and MR contrast enhancement and a visual comparison with FLAIR hyperintensity in a series of 50 patients with newly diagnosed GBM in terms of tumour size, overlap, and spatial correlation.

Materials and methods

Patients

Between February 2010 and August 2017, 50 adult patients with a newly diagnosed and untreated GBM investigated using FET PET at the Institute of Neuroscience and Medicine, Forschungszentrum Juelich, Germany, were included in this retrospective study (19 women, 31 men; mean age 56 ± 14 years, range 26–79 years). Postoperatively, neuropathological work-up confirmed GBM in all patients. The study adhered to the standards established in the Declaration of Helsinki. All patients provided written informed consent before each FET PET investigation. Given its retrospective nature, the local ethics committee of the University of Aachen waived the requirement for additional approval. Table 1 presents further details of all patients.

Table 1 Demographic and clinical data of the patient cohort and summary of results

MR imaging

Before neuropathological confirmation, all patients underwent a 3D T2-weighted FLAIR MRI scan and a 3D T1-weighted MRI scan after intravenous administration of gadoteric acid (0.1 mmol/kg body weight). Five patients were scanned using a 3 T Achieva MRI scanner (Philips Medical Systems, Inc.), 17 patients using a 1.5 T Intera MRI scanner (Philips Medical Systems, Inc.), and 28 patients using a high-resolution 3 T hybrid PET/MR scanner (3 T Tim TRIO; Siemens Medical Systems, Inc.). FLAIR images were not available or incomplete in four patients.

FET PET imaging

The amino acid FET was produced and used as described previously [27, 28]. Before neuropathological confirmation and close in time to the MRI examination, all patients underwent a dynamic PET scan from 0 to 50 min after injection of 3 MBq of FET per kg body weight. The time between MRI and PET was in the range 0–20 days (median 0 days). Fifteen patients were scanned using a stand-alone PET scanner (ECAT EXACT HR+, Siemens Medical Systems, Inc.) in 3D mode (32 rings, axial field of view 15.5 cm). The reconstructed dynamic dataset consisted of 16 time-frames (5 × 1 min, 5 × 3 min, 6 × 5 min). A transmission scan (duration 10 min) using three rotating line sources (68Ge/68Ga) was used for attenuation correction. Before reconstruction using the iterative ordered subsets expectation maximization (OSEM) algorithm (16 subsets, six iterations), data were corrected for dead time, random and scatter coincidences. The other 35 patients were scanned using a high-resolution 3 T hybrid PET/MR scanner (BrainPET, Siemens Medical Systems, Inc.; 72 rings, axial field of view 19.2 cm). Image data were corrected for random and scatter coincidences, as well as for dead time before ordinary Poisson OSEM reconstruction using software provided by the manufacturer (two subsets, 32 iterations). The reconstructed dynamic dataset consisted of 16 time-frames (5 × 1 min, 5 × 3 min, 6 × 5 min). Since the hybrid PET/MR scanner did not provide a transmission source, attenuation correction was performed by a template-based MRI approach [29].

Data analysis and calculation of tumour volumes

After adjusting the spatial resolution of the BrainPET images to those of the ECAT EXACT HR+ scanner by applying a 2.5 mm 3D gaussian filter [30], PET images were corrected for motion, and the MR and PET images (summation images from 20 to 40 min after injection) were resliced to a voxel size of 1 × 1 × 1 mm using the software PMOD, version 3.505 (PMOD Technologies Ltd.) to provide robust coregistration and accurate volume calculations. MR and PET images were then coregistered using rigid matching algorithms from PMOD. The results of the coregistration process were validated visually and, if necessary, manual correction was performed according to anatomical landmarks. Areas of contrast enhancement on the T1-weighted MR images were manually segmented on each consecutive transverse slice, and the sum of the circumscribed areas was multiplied by the slice thickness to obtain volumes of interest (VOI) representing the total volume of contrast enhancement in each patient. Necrotic parts were excluded from the volume of contrast enhancement. A second investigator validated the results of coregistration and the manual delineation. Since most patients exhibited diffuse and widespread FLAIR hyperintensities, reproducible and objective segmentation was not possible.

Standardized uptake values (SUV) were used for normalization of FET uptake by dividing the radioactivity in the tissue by the radioactivity injected per gram of body weight. A spherical VOI of constant size (diameter 30 mm) was positioned in normal-appearing brain tissue including grey and white matter in the hemisphere contralateral to the lesion. A 3D autocontouring process using a tumour-to-brain ratio (TBR) of 1.6 or more was used for segmentation of the tumour volume. This threshold was based on a biopsy-controlled study in which this value provided the best separation between vital tumour and healthy brain parenchyma on FET PET [19]. Mean TBR (TBRmean) was calculated by dividing the mean SUV of the tumour VOI by the mean SUV of the background VOI. The maximum TBR (TBRmax) was calculated by dividing the maximum SUV of the tumour VOI by the mean SUV of the background VOI.

Calculation of spatial correlation and overlap between MRI-based and PET-based tumour volumes

As a measure of spatial correlation [31], the spatial similarity between the VOIs defined by contrast enhancement on MRI and increased tracer uptake on FET PET was evaluated using the Dice similarity coefficient (DSC) [32] and the Jaccard similarity coefficient (JSC) [33]. The DSC is calculated as:

$$ \mathrm{DSC}\left({\mathrm{V}}_{\mathrm{PET}},{\mathrm{V}}_{\mathrm{MRI}}\right)=\frac{2\ \left({\mathrm{V}}_{\mathrm{PET}}\cap {\mathrm{V}}_{\mathrm{MRI}}\right)}{{\mathrm{V}}_{\mathrm{PET}}+{\mathrm{V}}_{\mathrm{MRI}}}, $$

where VPET, and VMRI are the VOIs defined by FET PET and MRI, respectively. The JSC is calculated as:

$$ \mathrm{JSC}\left({\mathrm{V}}_{\mathrm{PET}},{\mathrm{V}}_{\mathrm{MRI}}\right)=\frac{{\mathrm{V}}_{\mathrm{PET}}\cap {\mathrm{V}}_{\mathrm{MRI}}}{{\mathrm{V}}_{\mathrm{PET}}\cup {\mathrm{V}}_{\mathrm{MRI}}}=\frac{\mathrm{DSC}}{2-\mathrm{DSC}}, $$

which is the ratio of the intersection and the union of the FET PET and the MRI VOIs. Additionally, the overlap volume (OV) [31] was calculated. The OV considers, in contrast to the DSC and JSC, volumetric differences between the FET PET and MRI VOIs, and is defined as the ratio of the intersection and the smallest volume according to:

$$ \mathrm{OV}\left({\mathrm{V}}_{\mathrm{PET}},{\mathrm{V}}_{\mathrm{MRI}}\right)=\frac{{\mathrm{V}}_{\mathrm{PET}}\cap {\mathrm{V}}_{\mathrm{MRI}}}{\min \left({\mathrm{V}}_{\mathrm{PET}},{\mathrm{V}}_{\mathrm{MRI}}\right)}. $$

While DSC and JSC are similarity coefficients that range between 0 (no similarity) and 1 (perfect agreement), the OV is an overlap coefficient with a value of 100% indicating that one volume is completely contained within the other. All calculations were implemented and performed in MATLAB, version R2015a (Mathworks, Inc.). Figure 1 shows the differences between the three coefficients.

Fig. 1
figure 1

Relationships between the Dice similarity coefficient (DSC), Jaccard similarity coefficient (JSC), and the overlap volume (OV). Similarity coefficients are commonly used for comparing algorithms for image segmentation against a known reference mask (ground truth). To find out the volume differences between two different imaging modalities, e.g. FET PET and contrast enhancement, the ground truth (true tumour extent) is usually not known. Thus, the results from similarity coefficients can be misleading and have to be interpreted with caution. a Two volumes of different size (20 mL, 10 mL) completely overlap. b Two volumes of equal size (10 mL) only partially overlap. Nevertheless, the similarity coefficients DSC and JSC yield identical results in both cases (DSC 0.7, JSC 0.5), indicating the same spatial similarity. On the other hand, the measure OV depends on the volume and gives different results between the two cases (a OV 100%, b OV 70%). Therefore, the combination of OV with DSC and JSC allows improved interpretation of volumetric properties

Visual analysis of spatial correlation between FET-based and FLAIR-based tumour volumes

The spatial correlation between FET-based and FLAIR-based tumour volumes was assessed visually according to the following criteria: (1) FET-based tumour volume larger than FLAIR-based tumour volume, (2) FET-based tumour volume smaller than FLAIR-based tumour volume, (3) FET-based tumour volume similar to FLAIR-based tumour volume, and (4) FET-based tumour volume partially located outside FLAIR-based tumour volume. The results of the visual analysis were validated by a second investigator.

Statistical analysis

Descriptive statistics are presented as means and standard deviations or medians and ranges. The nonparametric Wilcoxon rank-sum test was used for intergroup comparisons. P values less than 0.05 were considered significant. Statistical analysis was performed using the statistical software packages SPSS, version 24 (IBM Corp.) and Excel (Excel for Mac 2016, version 16.12; Microsoft Corp.).

Results

Tumour volumes

One patient showed neither FET uptake nor contrast enhancement. All the remaining 49 patients showed increased FET uptake (average TBRmean 2.2 ± 0.4, range 1.2–3.3; average TBRmax 4.2 ± 1.4, range 2.1–8.7). Ten patients (20%) showed no contrast enhancement on MRI (Fig. 2). The FET PET tumour volumes and the contrast enhancement volumes were in the range 0.0–56.4 mL and 0.0–44.6 mL, respectively. In 43 patients (86%), the average FET PET tumour volume was significantly larger than the average contrast enhancement volume (21.5 ± 14.3 mL vs. 9.4 ± 11.3 mL; P < 0.001). These results are summarized in Table 1 and Fig. 3.

Fig. 2
figure 2

Contrast-enhanced (CE) MR images (left), fluid-attenuated inversion recovery (FLAIR) MR images (centre) and FET PET images (right) in representative patients without contrast enhancement (patients 49 and 39). The tumour volume segmented by FET PET based on a tumour-to-brain ratio of more than 1.6 is projected onto the MR images (red contour). A considerable portion of FET PET tumour volume is located outside the FLAIR hyperintensity in patient 49 (top). In general, there are considerable discrepancies among the CE MRI, FLAIR and FET PET images

Fig. 3
figure 3

Differences between FET PET tumour volumes (red) and contrast-enhanced MRI tumour volumes (blue). In ten patients (20%), no contrast enhancement was present. In 43 patients (86%), the FET PET volume is considerably larger than the respective contrast enhancement MRI volume. *P < 0.001

Overlap and spatial similarity between MRI-based and PET-based tumour volumes

Of the 50 patients, 40 (80%) showed increases in both FET uptake and contrast enhancement on MRI. In these 40 patients, the OV was moderate (mean OV 77 ± 26%, range 8–100%), but the spatial similarity was low (mean DSC 0.39 ± 0.21, range 0.02–0.70; mean JSC 0.26 ± 0.16, range 0.01–0.54). Furthermore, 6 of these 40 patients (15%) showed both a low OV of less than 50% (mean OV 28 ± 15%) as well as a low spatial similarity (mean DSC 0.15 ± 0.08, mean JSC 0.08 ± 0.05). In total, 16 patients (32%) showed either no contrast enhancement (Fig. 2) or an OV of <50% in combination with a low spatial similarity (DSC <0.4, JSC <0.3).

Overlap and spatial similarity depending on tumour volume

In a subgroup analysis, 6 of 50 patients (12%) with a FET PET tumour volume smaller than the contrast enhancement volume showed both a low OV (mean OV 40 ± 21%, range 8–71%) and low spatial similarity (mean DSC 0.33 ± 0.24, range 0.05–0.70; mean JSC 0.22 ± 0.19, range 0.02–0.54). Patients with a FET PET tumour volume larger than the contrast enhancement volume (n = 43) had a high OV (mean OV 84 ± 20%, range 24–100%) but a low spatial similarity (mean DSC 0.40 ± 0.24, range 0.02–0.69; mean JSC 0.27 ± 0.18, range 0.01–0.53). These results are summarized in Table 1 and Fig. 3. Figure 4 shows contrast-enhanced MRI and FET PET tumour volumes and their overlap in two representative patients.

Fig. 4
figure 4

Contrast-enhanced (CE) MR images (left), CE MR images with projection of the FET PET tumour volume segmentation using a tumour-to-brain ratio of 1.6 or more (second column, red contour), FET PET images (third column), and 3D reconstruction images of the segmented tumour volumes (right) based on CE MRI (white) and FET PET (red) in two representative patients. Patient 4: The contrast enhancement volume is larger than the FET uptake volume (21.9 mL vs. 14.0 mL). The area of FET uptake is partially outside and beyond the area of contrast enhancement. The spatial similarity and the overlap volume are low (DSC 0.16, JSC 0.09, OV 20%). Patient 28: The contrast enhancement volume is smaller than the FET uptake volume (0.5 mL vs. 14.3 mL). The contrast enhancement volume is mainly contained within the FET PET volume. Despite a high overlap volume (OV 89%), the spatial similarity is low (DSC 0.06, JSC 0.03). In both examples, only combined interpretation of the overlap and spatial similarity measures leads to a correct impression of the spatial orientation

Spatial correlation of PET-based and FLAIR-based tumour volumes

The FET PET tumour volume was clearly smaller than the FLAIR volume in 35 patients (70%). In ten patients (20%), the FET PET volume was similar to the FLAIR volume and in one patient (2%) the FET PET volume was larger than the FLAIR volume. In five patients (10%), a considerable portion of the FET PET tumour volume was outside the FLAIR volume (Fig. 2, top row). These results are summarized in Table 1.

Discussion

Using MRI and amino acid PET, previous studies have compared predominantly tumour volumes. To the best of our knowledge, this study is the first in this group of patients that has assessed and evaluated various volumetric parameters. The main finding of our study is that there were significant differences in size, overlap and spatial correlation of tumour volumes, indicating that conventional contrast-enhanced MRI substantially underestimates tumour volume. In 43 of the 50 patients (86%) the FET PET tumour volume was significantly larger than the contrast enhancement volume. Interestingly, ten of the 50 patients (20%) showed no contrast enhancement. One of these patients additionally showed no FET uptake. Furthermore, in the remaining 40 patients (80%) with increased FET uptake and contrast enhancement, the spatial similarity (evaluated using the DSC and JSC) was low despite moderate OV (average OV, 77%). Moreover, in six of these 40 patients (15%), the spatial similarity between the FET PET volume and the contrast enhancement volume was low and the OV was <50%, i.e. relevant FET PET volumes were located partially outside and hence extended beyond the contrast enhancement volumes (Fig. 4). In total, 16 of the 50 patients (32%) showed either no contrast enhancement at all or a low OV in combination with a low spatial similarity between the FET PET volume and the contrast enhancement volume.

The OV when applied alone needs to be interpreted with caution, i.e. a high OV represents only two spatially congruent volumes if both volumes are of similar size. If the volumes are different, the interpretation of the OV alone may be misleading. The addition of measures of spatial similarity such as the DSC and the JSC for combined interpretation of the results leads to a correct impression of the spatial orientation (Figs. 1 and 4). In the present study, the OV of nearly 80% between FET PET and contrast-enhanced MRI volumes in our patient cohort might have led to the premature conclusion that the contrast-enhanced MRI volume adequately represents the metabolically active tumour in the majority of patients. However, combined evaluation of OV and low spatial similarity (DSC <0.4, JSC <0.3) shows that a high OV is biased by the initial differences between the tumour volumes defined by contrast enhancement and increased FET uptake. These important differences between the volumetric measures used are illustrated in more detail in Figs. 1 and 4.

Visual analysis of the spatial similarity between FLAIR and FET PET tumour volumes revealed significant differences. Although the hyperintense areas on FLAIR images were larger than both the contrast enhancement areas and areas of increased FET uptake in 70% of patients, a considerable proportion of the tumour volume on FET PET was located outside the FLAIR signal in 10% of patients. Thus, even the combination of contrast enhancement and FLAIR hyperintensity did not cover the full extent of the tumour in all GBM patients (Fig. 2, top row).

The value of PET using the PET amino acid tracer MET in brain tumour assessment in comparison with conventional MRI has been investigated in several studies. For example, Pirotte et al. [34] found that a MET PET-guided “complete” tumour resection significantly prolonged overall survival in patients with high-grade glioma whereas “complete” resection based on contrast enhancement did not. Grosu et al. [35] investigated the value of MET PET in comparison with conventional MRI for target volume definition before radiotherapy in patients with resected high-grade glioma. In 74% of the patients, gross tumour volumes based on MET PET were considerably larger than those based on contrast-enhanced MRI, a finding confirmed by Mahasittiwat et al. [36]. Assessing the spatial differences between MET PET and contrast-enhanced MRI tumour volumes in patients with recurrent GBM, we observed that contrast-enhanced MRI substantially underestimated the metabolically active tumour volume as defined on MET PET [24]. However, the short half-life of 11C (20 min) restricts the use of MET to institutions with an on-site cyclotron.

The use of FET labelled with 18F (half-life 110 min) overcomes the logistic disadvantages of MET. FET PET has been intensively evaluated over the last two decades for the detection of vital tumour tissue for biopsy guidance and treatment planning, especially for neurosurgical resection and radiotherapy [37]. Usually, summed images obtained from 20 to 40 min after injection are used for the analysis of FET PET scans in patients with brain tumours. Unterrainer et al. [38] recently reported that the FET PET tumour volume is considerably larger in early summation images obtained from 5 to 15 min after injection, especially in high-grade gliomas. Consequently, the true discrepancy between conventional MRI and FET PET tumour volumes might be even larger if early summation images are used for FET PET tumour segmentation. However, in that study validation by histological evaluation of local stereotactic biopsies to determine the true tumour extent was not performed.

Predominantly, the impact of FET PET on radiotherapy target volume definition in patients with low-grade and high-grade gliomas has been investigated [39,40,41,42]. These studies provide converging evidence that the spatial correlation between MRI and FET PET target volumes is low and that FET PET target volumes are considerably larger than MRI target volumes. Consequently, the integration of FET PET for target volume definition has been recommended. However, these studies did not further investigate in detail the OVs or the spatial correlation using dedicated quantitative measures such as the DSC or JSC. To the best of our knowledge, only two studies have investigated the spatial similarity between FET PET and contrast enhancement tumour volumes using the DSC, in analogy to our study. Henriksen et al. [43] calculated the DSC between FET PET and contrast-enhanced MRI tumour volumes in 17 of 32 patients with pretreated low-grade and high-grade gliomas. The estimated spatial similarity was low (mean DSC 0.19) and only 8 of the 17 patients had a DSC of more than 0.1. In our study, the spatial similarity was higher (mean DSC 0.39) and only five patients had a DSC below 0.1, most likely because of a more homogeneous patient population. Henriksen et al. also found that the metabolic tumour volume as delineated using a TBR of more than 1.6 (i.e. identical to our approach) was significantly larger than the contrast enhancement volume, which is in line with our results. However, the number of patients in that study was small, and the patient population was comparatively heterogeneous (5 patients with low-grade glioma and 12 patients with high-grade glioma), which is also reflected in the variety of pretreatments including antiangiogenic therapy.

Unterrainer et al. [44] also calculated the DSC between FET PET and contrast-enhanced MRI tumour volumes in 20 patients with high-grade glioma at initial diagnosis or recurrence prior to any further therapy. The mean DSC was 0.38, which is in good agreement with our results (mean DSC, 0.39).

Considering that a maximally safe tumour resection is a critical component in the treatment of patients with GBM due to its survival benefits, adequate preoperative volumetric representation of the metabolically active tumour tissue is especially important. There is substantial evidence in the literature that the extent of tumour resection is a prognostic factor in patients with glioma [4, 8, 9, 45]. Consequently, correct preoperative spatial delineation of the vital tumour tissue is essential for maximizing the extent of resection. In many centres, 5-aminolevulinic acid (5-ALA) fluorescence is used intraoperatively to identify tumour tissue [4]. It has been demonstrated that a 5-ALA fluorescence-guided tumour resection in patients with GBM is associated with improved progression-free survival at 6 months [4]. In that study, it was also demonstrated that 5-ALA fluorescence identifies tumour tissue beyond areas of preoperative MRI contrast enhancement, indicating that 5-ALA fluorescence delineates nonenhancing parts of the tumour. This confirms the low sensitivity of MRI in correctly representing tumour tissue [46]. It is noteworthy that FET PET has been found to be even more sensitive in detecting glioma tissue than 5-ALA fluorescence [47].

A limitation of our study was the lack of confirmation of the spatial neuroimaging findings by histological evaluation of local stereotactic biopsies. Thus, prospective studies including spatial correlation between imaging findings and histological findings in stereotactic biopsies are needed to confirm our results. On the other hand, the patient cohort was fairly homogeneous (i.e. in all patients, GBM was confirmed neuropathologically according to the 2016 WHO classification of brain tumours [48]) and all tumours were untreated. Furthermore, three different quantitative measures to describe the spatial similarity and OV were used to assess volumetric and spatial differences between the FET PET and contrast enhancement tumour volumes. Furthermore, spatial differences between areas of FLAIR hyperintensity and FET PET tumour volume were evaluated visually.

In conclusion, our results suggest that the metabolically active tumour volume in patients with newly diagnosed, untreated GBM is critically underestimated by contrast-enhanced MRI. Furthermore, the spatial similarity between the contrast-enhanced MRI and FET PET tumour volumes was low, i.e. the areas of FET uptake extended substantially beyond the area of contrast enhancement and also beyond the area of FLAIR hyperintensity in some patients. In these patients, FET PET yields essential information for correct tumour delineation beyond the area of contrast enhancement and thereby contributes significantly to tailoring the tumour resection. It remains to be determined whether FET PET-guided tumour resection or target volume delineation before radiotherapy significantly affects patient survival. A randomized phase II trial (NOA-10/GLIAA study) [49] is currently evaluating whether radiotherapy target volume delineation by amino acid PET significantly prolongs survival in comparison with contrast-enhanced MRI in patients with progressive GBM.