Introduction

Recently the classification of lower-grade gliomas (LGG) has included the isocitrate dehydrogenase 1/2 (IDH1/2) mutations and 1p19q codeletion as major main molecular parameters [1].

IDH1/2 converts isocitrate to α-ketoglutarate respectively in the cytosol and in mitochondria, and its mutations result in the accumulation of 2-hydroxyglutarate (2HG), with intracellular concentrations on the order of millimolar [2]. Mutations in the IDH genes are reported in up to 70–80% of LGG and secondary glioblastomas (GBM), while in less than 10% of primary GBM.

The most frequent (up to 93%) is IDH1 mutation R132H (c.395G > A), detectable by immunohistochemistry (IHC) on tissue samples, while the remaining IDH1/2 mutations require direct sequencing [3].

IDH mutant (IDHmut) gliomas show distinct characteristics relative to IDH wild type (wt) and are associated to better prognosis, and identifying IDH mutations is important to tailor therapeutic approach [4]. Magnetic resonance imaging (MRI) is an ideal candidate for non-invasively providing in vivo biomarkers, including 2HG detection by MR spectroscopy (MRS) to predict IDH status [5, 6]. Detection of 2HG on biological fluids has been also proposed, but this approach is not yet confirmed in glioma [7].

The intracerebral accumulation of 2HG cannot be detected by standard MRS because of the structural complexity of the molecule that gives rise to multiplets resonating at three frequencies at 3 T (1.9, 2.25, and 4.02 ppm) whose spectrum overlaps those, greater, of N-acetyl-aspartate (NAA) and glutamate (Glu) [8, 9]. Specific MRS sequences have been implemented since 2012 to identify and quantify in vivo 2HG in patients affected by IDHmut gliomas, based on 2HG accumulation in IDHmut tumors [10,11,12].

Different MRS techniques have been used to determine 2HG in vivo in glioma patients (Table 1) reporting a range of accuracy depending on multiple factors including technical characteristics as the field strength, technique, and echo times and on tumoral features as location, volume, and cellularity [5, 9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29].

Table 1 Previous in vivo 2HG-MRS studies on human glioma patients

The aim of the present study was to determine the clinical feasibility and accuracy of different 2HG-tailored MRS PRESS sequences at 3 T in treatment-naïve, suspected LGG. As secondary aims, we investigated the associations with tumor location, histopathology, main molecular features, and concentrations of other metabolites. Preliminary analyses of 2HG level on plasma and urine were also performed.

Patients and methods

Patients

This prospective, observational study of in vivo MRS detection and quantification of 2HG in treatment-naïve suspected LGG was performed at the Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy. The Institutional Ethics Committee approved the study (protocol n. 419/2014), which conforms to the Declaration of Helsinki. Informed consent was obtained from all participants.

Between 2014 and 2016, all consecutive adult patients with suspected brain LGG according to pre-surgical conventional MRI (cMRI) were considered for inclusion.

A glioma was suspected to be lower-grade when no necrosis or relevant edema (i.e., absence of edema or minimal edema not distinguishable from tumor burden) was observed and when contrast enhancement was absent or faint on cMRI.

Patients who had already undergone biopsy/surgery or treated with chemo- or radiotherapy were excluded.

Enrolled patients underwent MRS during the pre-surgical MRI.

Magnetic resonance imaging

MRI was performed on a Philips Achieva 3 T scanner with a 32 channel head coil.

Whole brain multiplanar T1- and T2-weighted images, fluid-attenuated inversion recovery (FLAIR), and diffusion-weighted imaging (DWI, b = 0 and 1000 s/mm2) were acquired and used for guidance in the positioning of the spectroscopic voxel.

Contrast-enhanced T13D sequence was acquired after MRS.

On DWI, the apparent diffusion coefficient (ADC) value was calculated within a region of interest (ROI) corresponding to the MRS voxel site. A senior neuroradiologist placed circular ROIs in different areas of the lesion and one in the CHWM. The ROIs were placed on the basis of cMRI and ADC map appearance in order to select the areas of supposed highest cell density (inversely proportional to ADC). The tumoral ADC values were normalized (rADC) to those obtained from the CHWM. The MRS SV was then placed where the lowest ADC values were found, compatibly with the lesion site and size.

1H-magnetic resonance spectroscopy

Single-voxel (SV) MRS point-resolved spectroscopy (PRESS) sequences with TE = 30 ms (TE1/TE2/TR = 20/10/2000 ms; 144 averages) and TE = 97 ms (TE1/TE2/TR = 32/65/2000 ms; 160 averages) were performed, tailored using numerical simulation in order to maximize the 2HG 2.25-ppm signal detection, leading to improved differentiation between 2HG and nearby GABA, glutamine (Gln), and Glu signals. These sequences were previously implemented by another group [9, 10], and we more recently optimized the TE30 sequence on our scanner in a phantom study [8].

1.5 × 1.5 × 1.5-cm3 and 2 × 2 × 2-cm3 voxel sizes were tested.

Two identical SVs were acquired: one in the lesion and one in the contralateral hemisphere.

Each MRS sequence took about 6-min time scan.

LCModel

All spectra were analyzed using LCModel software version 6.3 [30] performing phase correction and using a linear combination of spectra from a simulated metabolite basis-set including 20 metabolites [10]. A spectrum with the same sequence parameters but no water-suppression was acquired before every PRESS sequence and employed as reference for the evaluation of absolute concentration. Concentration uncertainties were estimated using Cramér-Rao lower bounds (CRLBs), and 20% was considered as reliability limit [30].

See Online Resource (Supplementary Methods) for Histopathology, DNA sequencing of IDH1/2 genes, loss of heterozygosity (LOH) analysis, O6-methylguanine-DNA-methyltransferase (MGMT) promoter hypermethylation analysis by PCR on tissue samples, and circulating 2HG analysis.

Statistical analysis

Descriptive statistics were used to describe all variables recorded in this study. Significant associations were assessed with the use of Mann-Whitney test and Chi-square or Fisher exact test. Continuous variables were further analyzed through receiver operating characteristic (ROC) curves in order to identify the best cutoff in terms of sensitivity and specificity to distinguish patients in relation to IDH, 1p19q LOH, and MGMT promoter methylation status. As proposed by others [29], the optimal cutoff values were chosen to optimize specificity with sensitivity above 50%.

A logistic regression analysis was performed to assess whether measured metabolites could be used to discriminate between two patient groups based on IDH, 1p/19q LOH, and MGMT promoter methylation. First, for each metabolite (i.e., predictor), univariate logistic models were performed. Second, if 2HG and metabolites were detected at significant levels by univariate test, 2HG associated to another metabolite were entered as predictors in a single block multivariate logistic regression to assess if the prediction of this model improved in comparison with univariate logistic regression. Goodness of fit of the model was evaluated with Hosmer statistics for small samples.

Results

Fifty-nine patients were studied by MRS, underwent subsequent surgery, obtained histological diagnosis, and were investigated for the IDH mutation by histology or Sanger sequencing. Median time between MRS and neurosurgery was 6 days (range 1–37 days). Three cohorts of patients were studied with different MRS protocols.

The first cohort included 13 subjects (10 IDHmut, 3 IDHwt) and underwent MRS by the TE30 ms: only in 5 cases, 2HG peak corresponded to the IDH status; four cases were false negative (FN); two were false positive (FP) within tumor voxels; and FP also resulted in contralateral voxels.

The second cohort was studied by the TE 97 ms sequence with voxel 1.5 × 1.5 × 1.5 cm3 and included 14 subjects (10 IDHmut, 4 IDHwt). No contralateral FP were found, but there were six FN and two FP in tumor voxels.

Given the low accuracy, the above sequences were abandoned and a third cohort was studied.

A TE 97 ms sequence with larger voxel (2 × 2 × 2 cm3) was used in the last 32 patients, including 19 IDHmut and 13 IDHwt. 2HG could be quantified in 15/19 IDHmut gliomas (Fig. 1a) with CRLB ≤ 20% in all of true positive (TP) cases. True negative (TN) spectra (11/13 IDHwt) showed absence of detectable 2HG (concentration 0, CRLB 999%) (Fig. 1b). 2HG was never detected in contralateral voxels.

Fig. 1
figure 1

Panel showing some representative cases. a True positive (TP), patient 20. b True negative (TN), patient 30. c Unreliable MRS result: small tumor (volume 1.6 cm3) showing 1.85 mM 2HG level but CRLB exhibiting low reliability (36%). IHC was negative for IDH1 R132H mutation, while tumoral tissue was not enough to perform PCR analysis. d False negative (FN), patient 8: small tumor (volume 3.4 cm3) with low cellularity and proliferation index

Two cases with 1.85 and 1.3 mM 2HG at MRS analysis showed CRLB above the 20% limit (respectively 36% and 47%), suggesting low reliability, and were thus erased from subsequent analyses. Both were IDH1wt on IHC; IDH1/2 PCR analysis resulted negative for one patient, while for the other one tumoral tissue was insufficient to perform PCR (Fig. 1c).

The four FN included one case in which tumor volume (3.2 cm3) was smaller than the voxel size (Fig. 1d), and three with relative ADC (rADC) 1.6–1.9, suggesting low cellularity according to tumoral cell density evaluation (scale: cell number < 20, low (L); cell number 20–25, medium (M); cell number ≥ 26, high (H)) on histological specimens (see Supplementary Methods), as displayed in Table 2.

Table 2 Demographic features and main characteristics of the patients, including histology, proliferation index (MIB-1), cell density, IDH status, and main molecular data

Statistical analysis focused on the third cohort. Patient characteristics, histological diagnosis, molecular assessment, and 2HG level by brain MRS are shown in Table 2.

The presence of 2HG on MRS was associated with IDH mutations (P = 0.001, Fisher’s exact test). MRS showed 79% (95% confidence interval (CI), 54.43–93.95%) sensitivity, 100% (95% CI, 71.51–100%) specificity, and 87% (95% CI, 69.28–96.24 %) accuracy with respect to the IDH status. Area under the curves (AUC) was 0.895 (95% CI: 0.756–1).

1p19q codeleted gliomas (oligodendrogliomas: ODG) exhibited more frequently IDH mutations (P = 0.009, chi-square test) than 1p19q non-codel gliomas (astrocytomas: A) and 2HG-MRS levels were higher (Mann-Whitney test) (Table 3). Using ROC analysis, 2HG cutoff of 0.75 differentiated the two types with sensitivity of 66.7% and specificity of 81.3%. AUC was 0.77 (95% CI, 0.574–0.967).

Table 3 Median metabolite levels and IDH, 1p19q, MGMT status. P values of significant differences are also shown

Only one out of four GBM was IDHmut.

A 2HG cutoff 0.65 showed 60% sensitivity and 91.7% specificity to differentiate MGMT promoter methylation status; the AUC was 0.72 (95% CI, 0.526–0.919).

10q loss was more frequent in IDHwt (P = 0.03, Fisher’s exact test).

Aside from 2HG, lower concentrations of creatine (Cr), Glu, myo-inositol (mI), and glutathione (GSH) were found in IDHmut than in IDHwt gliomas (Mann-Whitney test) (Table 3). AUC were 0.73 (95% CI, 0.542–0.911), 0.86 (95% CI, 0.695–1), 0.85 (95% CI, 0.715–0.989), and 0.73 (95% CI, 0.557–0.912), respectively. Using ROC analysis, we investigated the metabolites’ levels in determining IDH-mutation status in the cohort. Cutoff values were 2.1 for Cr (sensitivity of 54% and specificity of 89.5%), 2.65 for Glu (sensitivity of 77% and specificity of 95%), 4.7 for mI (sensitivity of 77% and specificity of 84%), and 0.35 for GSH (sensitivity of 61.5% and specificity of 68.5%) respectively.

In the patient classification for IDH, univariate logistic regressions were significant for 2HG, Glu, mI, and GSH (Table 4). Forward and backward logistic regression including cross-validation showed that the goodness of fit of the model in multivariate logistic regression gained 93.5% prediction to diagnose IDH mutation combining 2HG, Glu, and mI (P < 0.01, omnibus chi-square, Hosmer and Lameshow).

Table 4 Univariate logistic regressions for significant metabolites in the patient classification for IDH

ROC curves for MRS metabolites and IDH, 1p19q, and MGMT status are shown in Supplementary Data (Figure 1S).

Serum samples from 22 patients and urine samples from 13 patients of cohort 3 were analyzed to measure 2HG: no significant differences were found in IDHmut vs IDHwt.

Patients carrying IDH mutations were younger than wt (median 34 vs 55 yrs, P = 0.005), as expected (Mann-Whitney test). IDH status did not significantly correlate to gender, lesion site, contrast enhancement, tumor grade, concentrations of metabolites other than those above described (Table 2), p53, ATRX, TERT, and MIB-1 (proliferation index).

Discussion

Diagnosis and classification of gliomas are based on tissue histology integrated with molecular markers [1, 4]. In LGG, these markers include IDH1/2 status, 1p/19q codeletion, 10q deletion, TERT and ATRX mutation, and MGMT promoter methylation status [4]. IDH mutations have prognostic relevance irrespective of glioma grade [4].

In vivo brain metabolite imaging provides a unique opportunity for the evaluation of the lesion and surrounding tissue: specifically, 2HG-MRS may help define the infiltrative tumor area, guiding surgical and radiotherapeutic approaches, and can be useful to monitor treatment response to cytoreductive or IDH-targeted therapies [2, 15, 20,21,22, 29, 31,32,33,34,35].

We enrolled adult patients with suspected LGG scheduled for surgery, aiming to evaluate the clinical feasibility and accuracy of 2HG-MRS. IDHmut and wt gliomas were represented, and we analyzed histological type and grade, the salient molecular features, radiological features as tumor site, contrast enhancement and ADC, and the main MRS metabolites, in order to find correlations with IDH status. Various techniques have been proposed to perform 2HG-MRS: PRESS, MEGAPRESS, and MEGALASER have been found to exhibit good accuracy, with some limitations in specific setting series [6]. Some authors reported better accuracy of MEGAPRESS with respect to PRESS [12, 26], differently from Choi et al. [10]. We decided to use PRESS to enhance the 2.25-ppm 2HG peak, which is most represented at 3 T. Two TE (30 and 97 ms) and voxel sizes were assessed: after confirmation by phantom study that TE 97 is more accurate than TE 30 (data not showed), we focalized on TE 97 in study patients. The first cohort was assessed using smaller voxel size, and later, based also on the literature, we moved forward and evaluated larger voxel. Likewise with Choi et al. [9], we confirmed in our cohort that TE 97 ms with the larger voxel (2 × 2 × 2 cm3) shows high accuracy. By this procedure, 2HG-MRS showed 79% sensitivity and 100% specificity with respect to the IDH status.

A contribution of our study is that we studied a prospective, homogeneous cohort of treatment-naïve consecutive patients with suspect lower-grade gliomas and we showed the gain of a good diagnostic accuracy of 2HG-MRS with TP spectra showing CRLB below 20%, a reliability limit used also in other studies [9, 12]. We excluded only two patients whose spectra showed CRLB > 20%, thus unreliable.

We addressed the issue using a technology (3 T scanner, LCModel software) available for clinical practice and MRS sequences lasting around 6 min.

Moreover, we expanded the evaluation to MRS analysis to other brain metabolites and molecular features. In particular, we found that the combination of Glu and mI with 2HG improved diagnostic accuracy. The decrease in Glu concentrations observed in our sample is consistent with the documented production of 2HG from Glu-derived α-KG through Glu [16, 36]. mI is an osmolyte normally present in astrocytes, involved in cell membrane and myelin sheet. Our results of significantly higher levels of mI in IDH1 wt glioma may support its role as promoter of PKC activation associated with the aggressive behavior of high-grade gliomas [37].

Our study found difference in 2HG level between ODG and A, as recently reported [36] using ex vivo proton high-resolution magic angle spinning spectroscopy on glioma tissue samples. The relationship between 2HG and MGMT methylation is coherent with the tight association between MGMT methylation and IDH mutation in gliomas [38].

Present study carries some limitations. MRS levels of 2HG in the majority of our patients were quite low, in spite of the good reliability (detected by CRLB). Although a recent meta-analysis identified 1.76 mM as cutoff to determine the presence of 2HG on MRS [6], in our study we found that lower levels of 2HG with CRLB < 20% identified IDH mutant gliomas while in IDH wild-type 2HG was undetectable (concentration 0, CRLB 999%) in TN cases.

FN spectroscopic results were found in one small glioma and in three lesions with low cellularity, in agreement with previous data [9]. Two cases were excluded because 2HG was measured by MRS but CRLB was above the reliability limit; both were IDHwt at IHC: one of these tumors was very small and PCR could not be performed, and the other patient was a 38-year-old man carrying a not-enhancing IDHwt anaplastic A, with tumor volume smaller than voxel size.

In a cohort of patients, we tried to quantify 2HG in plasma and urine samples, as previously performed with inconclusive results [7, 39, 40]: we assessed pre-surgery 2HG level in plasma and urine by LC-MS, without finding a significant association between circulating 2HG and IDH status. Our 2HG level range is aligned with that reported by Fathi et al. for glioma patients.

Although intratumoral 2HG accumulation provides a marker for the presence of IDH mutations, circulating 2HG may not yet be a useful marker for glioma molecular classification: further approaches need to get an actionable-reproducible test.

In summary, our data confirm that non-invasive detection and quantification of 2HG by MRS are clinically feasible at 3 T using PRESS TE 97 ms with good diagnostic performance [6].

Association with other metabolites evaluable by MRS may increase the diagnostic and predictive value of 2HG testing.

Further investigations in multi-institutional studies are needed to achieve unambiguous and reproducible accuracy widely applicable in clinical settings of SV and MV 2HG-MRS to improve LGG diagnosis and to facilitate personalized treatments (NCT02193347; NCT03343197, NCT02454634, NCT02481154, NCT02273739,. NCT02381886) and follow-up [15, 20, 21, 29; NCT03952598, NCT03677999].