Introduction

Next to surgery and chemotherapy, radiation is part of standard treatment in patients with glioma [1]. Postradiation effects include radiation necrosis (RN) and pseudoprogression, manifesting as magnetic resonance imaging (MRI) T1-postgadolinium contrast enhancement [2]. While RN represents a necrotic local tissue reaction to radiation, pseudoprogression may constitute an overresponse to therapy [3]. The Response Assessment in Neuro-Oncology (RANO) group proposed structural MRI including diffusion-weighted imaging (DWI) as standard protocol for glioma assessment [4]; however, postradiation features seen in structural MRI often overlap with those of progressive disease (PD), limiting the accuracy [5].

As, opposed to necrotic tissue, PD is characterized by hypervascularization and hypercellularity, MRI perfusion-weighted imaging (PWI) and DWI provide additional information for differentiation. Prior studies confirmed the usefulness of dynamic susceptibility contrast-enhanced (DSC) MRI perfusion-derived CBV and ADC increasing the specificity for differentiation of RN from PD [6]. While the efficacy of MRI-PWI is generally accepted, some results are conflicting as for instance Boxerman et al. could not differentiate treatment-related changes from PD on the basis of initial CBV measurement alone [7]. While some authors reported the usefulness of specific thresholds for CBV for differentiation of treatment-related changes from PD [8], others claimed the opposite [9] and stated a dependence of CBV on MGMT promotor methylation [10]. The CBV measurements may be heterogeneously distributed in recurrent lesions [9, 11] and overlap between PD and treatment-related changes [12, 13]. The nonquantitative nature and lack of complete lesion coverage in the presence of susceptibility artefacts limit the DSC MRI perfusion method [14,15,16].

Another modality to distinguish RN from PD is using radiolabeled amino acids like O‑(2-[18F]fluoroethyl-)-L-tyrosine ([18F]FET) [17]. After passing through the blood-brain barrier, [18F]FET is transferred into cells by amino acid transporters. This transfer is measured by positron emission tomography (PET) [18]. Overall, previous studies accredited [18F]FET-PET (using TBR = tumor to brain ratio) superior accuracy for differentiation of RN from PD compared to MRI-PWI (CBV) [17, 19, 20]. A few studies showed increased diagnostic performance when combining MRI-PWI with [18F]FET-PET for differentiation of RN from PD, and only under specific circumstances or with at best modest benefit [17, 19].

In summary, previous literature suggests that both MRI-PWI and [18F]FET-PET are potentially useful for differentiation of RN from PD in glioma patients. While, overall, current literature indicates [18F]FET-PET to be superior and combination with MRI-PWI may slightly increase diagnostic performance as discussed above. Ultimately, the overall benefit of combining MRI-PWI with [18F]FET-PET for differentiation of RN from PD compared to using a single modality is not clear to this day. Therefore, in this single center study, we evaluated the potential of multiparametric analysis combining MRI-PWI with [18F]FET-PET. Secondarily, we assessed the usefulness of cerebral blood flow (CBF), mean transit time (MTT) and time to peak (TTP) as previous studies mainly examined cerebral blood volume (CBV).

Methods

Patient Population

From our database, we retrospectively identified all [18F]FET-PET brain scans (n = 411) performed between January 2016 and July 2021 at Neuromed Campus, Kepler University Hospital Linz, Austria. Further selection was based on the criteria and patients’ digital records stated below. We conducted this study according to ethical principles of the Declaration of Helsinki.

Inclusion Criteria

  1. 1.

    Adult patients (> 18 years) with histopathological verified glioma (WHO grades II, III or IV) according to the WHO 2016 classification.

  2. 2.

    Referral for [18F]FET-PET for differentiation between PD and RN due to previous MRI findings suspicious for PD according to RANO [21].

  3. 3.

    Previous radiotherapy as part of the standard therapy of patients with glioma.

Exclusion Criteria

  1. 1.

    > 2 months between initial PET and previous MRI scan.

  2. 2.

    Change in therapy or surgery of the target lesion after initial scans in the examined period.

  3. 3.

    Lack of histopathological, radiological, and clinical follow-up and therefore compromised outcome evaluation.

MRI Protocol

Except for one patient, all scans were obtained from a single MRI scanner (3 Tesla Skyra®, Siemens, Munich, Germany). For the remaining patient, scans were obtained from another device (3 Tesla Verio®, Siemens). Scanning parameters for all sequences are available in the supplement (file name = cross-validation). A prebolus was applied before administration of the main bolus via intravenous route (gadolinium-based agent, 0.1 ml/kg body weight; infusion rate, 5 ml/s followed by 25 ml of NaCl). Structural MRI scans based on T2 and contrast-enhanced T1 sequences were obtained. Perfusion maps were generated by Syngo MR Neuro Perfusion Engine® (Siemens, Munich, Germany). We performed postprocessing leakage correction based on the Boxerman-Weisskoff method [22].

[18F]FET PET Protocol

The synthesis and use of [18F]‑FET and PET imaging was conducted in accordance with current guidelines and as described in our previous studies [23,24,25]. All patients underwent FET-PET imaging, either by a Gemini GXL® (Philips, Amsterdam, Netherlands) or a Veroes® (Philips, Amsterdam, Netherlands) device. Patients fasted for a minimum of 6 h before intravenous injection of 185 MBq of FET. PET images were acquired 30 min after injection (3‑D acquisition). Computed tomography was used for attenuation correction.

Image Analysis

For this study, a single author (JP), blinded to the disease course after imaging, re-analyzed both MRI and [18F]FET-PET data. Decisions were backed by previous assessment of a multidisciplinary tumor board including highly experienced neuroradiologists and neuronuclear medicine physicians. Both structural and perfusion MRI but not FET-PET data were analyzed with Syngo.via® (Siemens, Munich, Germany). Subsequently, structural MRI data were co-registered with [18F]FET-PET data and analyzed via Hermes® workstation (Hermes Medical Solutions, Stockholm, Sweden).

For every patient, the target lesion was defined by delineation of tumor margins based on postcontrast enhancement on T1 MRI or/and hyperintensities on T 2, drawing a 2D region of interest (ROI). Blood vessels, resection cavity and cerebrospinal fluid (CSF) spaces were excluded manually. Perfusion maps (CBVmean, CBFmean, MTT, TTP) were obtained by Syngo MR Neuro Perfusion Engine®. Perfusion maps were normalized (rCBVmean, rCBFmean, rMTT, rTTP) by drawing a standardized 2 cm circular ROI in the normal-appearing brain tissue of the contralateral hemisphere (relative value = ipsilesional value/contralesional value). While perfusion parameters are related with each other (CBV = CBF * MTT), we analyzed every parameter individually in a later step.

In a second step we fused [18F]FET-PET images with structural MRI data. The previously defined ROIs of the lesion and contralateral normal-appearing brain tissue were copied and used for measurement of standard uptake value (SUV = image activity concentration [Bq/g] · patient weight [g] / injected activity [Bq]). Mean and maximum tumor to brain ratios (TBRmean, TBRmax) were calculated by division of target lesions SUV by the SUV of the contralateral normal-appearing brain tissue.

Follow-up and Final Diagnosis

Histopathologic evaluation (resection or biopsy) or radioclinical follow-up confirmed either PD or RN. Regarding WHO grade II gliomas, radiological and clinical assessment had to be stable for at least 12 months to diagnose RN. In case of WHO grades III–IV gliomas, at least 6 months of stable radiologic and clinical follow-up controls were required for confirmation of RN. According to the RANO criteria PD was considered when growth of the target lesions in at least 2 subsequent MRI scans was present, accompanied by clinical deterioration or death of patient (not attributable to other causes than tumor-related ones) [21].

Statistics

For MRI and PET parameters intergroup differences were tested via Mann-Whitney U test. The strength of correlation between individual parameters was assessed by Pearson’s correlation coefficient. A coefficient of ≥ 0.7 marked strong, 0.4–0.6 moderate and 0–0.3 weak correlation. Using the final diagnosis as reference, we calculated the area under the curve (AUC) for each single and multiparametric model by receiver operating characteristics (ROC) analysis. Statistical significances of AUC values were tested via Wilcoxon signed-rank test [26]. Optimal cut-offs, sensitivities, specificities, and accuracies for discrimination of PD from RN were obtained by maximization of Youden’s index. A likelihood ratio test (= analysis of variance) was performed to evaluate whether a multiparametric model significantly outperforms a single parameter approach for discrimination between PD and RN. Finally, leave one out cross-validation was performed for all models to account for the small sample size and to assess the classification performance of the models for new unseen data. Fisher’s exact test was performed to test the significance of the cross-validated models. R (R Core Team 2022, Version 4.2.2, R Foundation for Statistical Computing, Vienna, Austria) was used for all statistical analyses. A p-value < 0.05 defined statistical significance.

Results

Cohort Characteristics

Out of all patients, 37 met the inclusion criteria. We repot a median age of 55 years (range 29–84 years), 51% were female and 49% were male. Of the patients 31 (83.8%) had high grade (WHO grade III or IV) and 6 patients had low-grade (WHO grade II) glioma. Glioblastoma accounted for the highest share of all tumor subtypes (64.9%). In 9 out of 37 patients, MGMT methylation was not identified due to poor quality of the resected tissue. In the remaining 28 patients, the promotor was methylated in 17 (60.7%) and unmethylated in 11 patients (39.3%). RN was found in 6 of 17 patients (35.3%) with a methylated MGMT promotor and in 1 of 10 patients (9.1%) with an unmethylated promotor. With respect to the MGMT promotor status, further statistical analysis was not possible due to small subgroups which would have compromised the validity of the results. While all patients received at least one course of radiotherapy prior to initial scanning, 18.9% received at least a second course. The chemotherapy regimen included temozolomide in 97% and bevacizumab in 40.5% of all patients and 13.5% of all patients received at least a second tumor resection prior to initial scanning. The average interval between initial [18F]FET PET and MRI scan was 5 days. The mean follow-up (based on MRI and clinical assessment) duration was 14 months. Histopathology defined the final diagnosis (PD, n = 27; RN, n = 10) in 8 cases and radioclinical follow-up in 29. Table 1 summarizes tumor-related characteristics.

Table 1 Tumor-related characteristics

Single Parameter Analysis

In PD median values for TBRmean (2.23 vs. 0.97, p < 0.001), TBRmax (2.28 vs. 1.04, p < 0.001) and rCBVmean (1.78 vs. 1.18, p < 0.001) were significantly higher and those for rTTP (1.01 vs. 1.11, p < 0.001) significantly lower than in RN (Table 2). While median values for rCBFmean (1.48 vs. 0.8, p = 0.098) and rMTT (1.1 vs. 1.04, p = 0.801) differed between PD and RN, the difference was not statistically significant. In single parameter analysis, rTBRmean (AUC = 0.91, p < 0.001), rTBRmax (AUC = 0.89, p < 0.001), rTTP (AUC = 0.87, p < 0.001) and rCBVmean (AUC = 0.84, p < 0.001) were efficacious for discrimination of PD from RN (Figs. 1, 2 and 3). rCBFmean (AUC = 0.68, p = 0.09) and rMTT (AUC = 0.47, p = 0.51) did not reach statistical significance. According to Pearson’s correlation coefficient, TBRmean and TBRmax were highly correlated with each other. Due to the correlation and the higher AUC of the former parameter, TBRmax was excluded for further analysis. Optimal cut-offs, sensitivities, specificities, and accuracies for discrimination of PD from RN are given in Table 2.

Table 2 Performance of PWI-MRI/FET-PET parameters for differentiation of progressive disease (PD) from radiation necrosis (RN)
Fig. 1
figure 1

ROC curves for PWI-MRI and [18F]FET-PET parameters ROC curves and their respective AUC values for PWI-MRI (rCBVmean, rCBVmean, rMTT, rTTP) and [18F]FET-PET (TBRmean, TBRmax) parameters for differentiation between progressive disease and radiation necrosis. TBRmean yielded the highest AUC, followed by TBRmax, rTPP, rCBVmean, rCBFmean and rMTT. ROC receiver operating characteristic, PWI-MRI perfusion weighted magnetic resonance imaging, [18F]FET-PET O-(2-[18F]fluoroethyl-)-l-tyrosine-positron emission tomography, rCBVmean mean relative cerebral blood volume, rCBFmean mean relative cerebral blood flow, rMTT relative mean transit time, rTTP relative time to peak, TBRmean mean tumor to brain ratio, TBRmax maximal tumor to brain ratio, AUC area under the curve

Fig. 2
figure 2

Progressive disease example of a patient with tumor progression, 7 months after initial diagnosis of anaplastic astrocytoma, resection and chemoradiotherapy the patient clinically deteriorated. Axial T1-weighted contrast-enhanced MRI (a) demonstrates right hemispheric progressive tumor tissue, corresponding PWI maps CBV (b), CBF (c), MTT (d), TTP (e) and [18F]FET-PET (f) imaging are depicted. MRI magnetic resonance imaging, PWI perfusion weighted imaging, [18F]FET-PET O-(2-[18F]fluoroethyl-)-l-tyrosine-positron emission tomography, CBVmean mean cerebral blood volume, CBFmean mean cerebral blood flow, MTT mean transit time, TTP time to peak, TBRmean mean tumor to brain ratio, TBRmax maximal tumor to brain ratio

Fig. 3
figure 3

Radiation necrosis example of a patient with radiation necrosis 20 months after initial diagnosis of glioblastoma, resection and chemoradiotherapy, follow-up imaging showed a new contrast enhancing lesion. Axial T1-weighted contrast-enhanced MRI (a) demonstrates right hemispheric necrosis, corresponding PWI maps CBV (b), CBF (c), MTT (d), TTP (e) and [18F]FET-PET (f) imaging are depicted. MRI magnetic resonance imaging, PWI perfusion weighted imaging, [18F]FET-PET O-(2-[18F]fluoroethyl-)-l-tyrosine-positron emission tomography, CBVmean mean cerebral blood volume, CBFmean mean cerebral blood flow, MTT mean transit time, TTP time to peak, TBRmean mean tumor to brain ratio, TBRmax maximal tumor to brain ratio

Multiparametric Analysis

In multiparametric analysis, a classification model consisting of TBRmean, rCBVmean and rTTP achieved an AUC of 0.98 (p < 0.001) and significantly outperformed the use of rTBRmean alone (p = 0.002), which was the single parametric approach with the highest AUC (Table 2). Exchanging TBRmean for TBRmax resulted in the same AUC value, sensitivity, specificity, and accuracy. Adding more parameters did not further increase the AUC. Analysis of variance was performed for the previously mentioned parameters and confirmed that the multiparametric approach was superior to the single parameter one (p < 0.002). To assess the considered models in their performances to predict new unseen data, we performed leave one out cross-validation for all models. While cross-validation attributed the highest AUC value to the model consisting of TBRmean and rCBVmean, it also suggested that the addition of rTTP resulted in the highest accuracy. Full results of cross-validation were uploaded to the supplement section.

Discussion

Our study evaluated whether a multiparametric classification model combining MRI-PWI and [18F]FET-PET is superior to a single parameter approach to differentiate PD from RN. In single parameter analysis, rTBRmean, rTBRmax, rTTP and rCBVmean were useful for discrimination of PD from RN but rCBFmean and rMTT did not reach statistical significance. In multiparametric analysis, a classification model consisting of rTBRmean, rCBVmean and rTTP significantly outperformed the use of rTBRmean alone which was the single parametric approach with the highest AUC. Secondarily, we firstly report the usefulness of rTTP for differentiation of PD from RN.

In single parameter analysis, rTBRmean, rTBRmax, rTTP and rCBVmean were useful for discrimination of PD from RN. The diagnostic performance of TBR (mean and max) and rCBV was similar to previous results [17, 19, 20]. In line with our results, CBF was reported to be not useful for differentiation of PD from RN [27]. As reported previously, in our cohort RN was associated with a higher frequency of methylated than nonmethylated MGMT status [10]. Kong et al. [10] also showed a difference of rCBVmean between patients with unmethylated MGMT promotor but not in the methylated promotor group. We were not able to verify these findings as small subgroups would have compromised the validity of statistical analysis. We firstly examined the usefulness of PWI rMTT and PWI rTTP for differentiating PD from RN. rMTT was not better than chance in discerning PD from RN. Even though median rTTP was only slightly higher in RN compared to PD, it achieved a moderately high AUC for differentiation PD from RN. As median rCBFmean was lower in RN than in PD, a reduced blood flow and therefore delayed arrival of the contrast agent in necrotic tissue may explain the rTTP increase in RN.

In a second step we assessed the performance of a multiparametric approach. We found maximum AUC for discrimination of PD from RN when combining TBRmean, rCBVmean and rTTP. This model significantly outperformed the use of rTBRmean alone, which was the single parametric approach with the highest AUC. Analysis of variance confirmed the superiority of the multiparametric over the single parameter one. In a final step we performed leave one out cross-validation to assess the robustness of the considered classification models. The approach basically confirmed the results from the multiparametric analysis. The combination of TBRmean and rCBVmean led to the highest AUC value, whereas additionally adding rTTP resulted in the highest accuracy. Overall, multiparametric models showed better discriminating performance than each of the univariate parametric models.

A few studies reported a clear advantage for combined use of MRI-PWI and [18F]FET-PET (TBR), and only under specific circumstances or with at best modest benefit. Pyka et al. (2018) also found increased sensitivity (76%) when combining [18F]FET-PET (TBR) with MRI (ADC + CBV) compared to [18F]FET-PET alone (56%) at a specificity set at 100% [19]. In contrast, we did not focus on maximization of specificity but on AUC, as it encompasses both sensitivity and specificity, making it a more relevant parameter for real-life patient care. Jena et al. (2016) reported increased accuracy (0.91) when combining TBRmean with rCBVmean and maximum accuracy (0.94) when combining TBRmean with rCBVmean and Choline/Creatine ratio [17]; however, this approach was limited by the requirement for magnetic resonance spectroscopy imaging (MRSI). This is an important practical limitation as MRSI is not widely available due to several factors, including increased MRI scanning and postprocessing times, low sensitivity and spectral resolution, requirement of expertise for specific voxel techniques, and the danger of metabolite contamination and degradation [28, 29]. Sogani et al. (2017) found increased performance when combining TBR (max and mean) with rCBVmean, ADCmean and Choline/Creatine ratio [20]. The practicality of this approach was again limited by the requirement of MRSI for achieving maximum accuracy. The results for a combined use of TBR with PWI without MRSI was not reported. We believe that our classification model is more practical for real-life patient care, as it is not dependent on MRSI. Also, Neuromed Campus, Kepler University Hospital Linz is the sole provider of FET-PET imaging for brain tumors in this federal state (Upper Austria) and bordering areas. Thus, we perform all scans without selection bias in a region encompassing a population of 1.5–2 million people. Thus, we believe to have a more representative patient collective compared to previous studies; however, prospective studies with larger samples are needed to validate our findings.

Limitations

We included both high-grade and low-grade gliomas to increase overall sample size; however, the great majority had high-grade (III–IV) glioma and only a few patients had low-grade (II) glioma, which limited the actual heterogeneity. Due to the rather small sample size, we verified the robustness of our models with leave one out cross-validation. For glioma grading we used the WHO 2016 criteria as the histopathological records date from a timespan when the old classification was in place (2016–2021); however, these grading differences have no actual influence on our results. The use of a hotspot-based method to assess quantitative values limits the reproducibility of our findings. We used this method as our neuroradiologists and neuronuclear medicine physicians have decades of experience with this approach. Instead of creating an artificial study setting we wanted to use the method that is actually used in our routine clinical work. With respect to FET-PET, our study featured only static but not dynamic parameters. As histology was only available in 21.6% of cases, we mainly used radioclinical follow-up for determination of the final diagnosis. While a single author re-analyzed the imaging data, the decisions were backed by a previous assessment of a multidisciplinary tumor board including highly experienced neuroradiologists and neuronuclear medicine physicians.

Conclusion

We report a significant benefit of multiparametric MRI-PWI and [18F]FET-PET over single parameter analysis for discrimination of progressive disease (PD) from radiation necrosis (RN) in patients with glioma. A classification model consisting of TBRmean, rCBVmean and PWI rTTP significantly outperformed the use of rTBRmean alone, which was the best single parametric approach. Secondarily, we firstly report the potential usefulness of PWI rTTP for discrimination of PD from RN in patients with glioma; however, for validation of our findings the conduction of prospective studies with larger patient samples is necessary.