Introduction

Epilepsy is a common neurological condition that confers a weighty disease-related burden on individuals and families with the condition [1, 2]. Unlike in adults, developmental lesions are a common source of seizures in children [3]. Epilepsies occurring in the setting of such lesions are predisposed to pharmacoresistance [4]. As a result, surgical resection has become an important tool in the management of localization-related epilepsy in children. In such cases, preoperative identification of a structural lesion at MRI is a dominant prognostic factor [5, 6]. Despite the central role of MRI, [F-18]2-fluoro-2-deoxyglucose (FDG) positron emission tomography (PET) imaging plays a pivotal role in the work-up of this patient population. In particular, FDG-PET can be used to improve detection of subtle developmental lesions at MRI [7] and can even localize seizure origin in patients with MR-negative temporal lobe [8, 9] and neocortical epilepsy [10]. Overall, the combination of FDG-PET and MRI exams has been shown to enhance seizure focus detection [11] and to improve the outcome of epilepsy surgery [11, 12] more than either modality alone.

The current practice standard for FDG-PET imaging is PET/CT. PET/CT scanners acquire a CT exam to be used for attenuation correction and anatomical correlation. CT, however, lacks soft-tissue contrast, especially in relation to epileptogenic lesions; this shortcoming mandates an additional, separate MR exam. The delivery of ionizing radiation beyond that associated with radiotracer administration as well as the extended anesthesia required to acquire multiple imaging exams are, therefore, significant shortcomings of PET/CT in children.

Hybrid PET/MR scanners can potentially combine the superior soft-tissue contrast of MRI and the metabolic characterization of FDG-PET in a single exam without the additional ionizing radiation inherent to PET/CT systems. Although these virtues are of great potential value to children, accurate attenuation correction algorithms based on MR images will be indispensable to widespread clinical implementation [13]. Unlike CT images, however, MR signal intensity does not have a direct, linear relationship to photon attenuation coefficients, which necessitates alternative methods of correction [14]. To date, the preponderance of studies has demonstrated relatively small, though somewhat variable, quantitative differences between PET/CT and PET/MR-acquired FDG exams [15, 16]. Scant data exist, however, regarding the impact of such small quantitative differences on the diagnostic value of PET/MR, particularly in children with epilepsy. The goal of this study was to compare the diagnostic accuracy of PET/MR-acquired FDG brain exams to that of PET/CT with respect to identifying seizure foci in children with localization-related epilepsy.

Materials and methods

We performed a single-center prospective study of consecutive pediatric patients (age ≤21 years) with localization-related epilepsy referred for a clinical FDG-PET/CT examination of the brain. All patients were prospectively recruited to undergo an additional research PET acquisition on a tandem PET/MR system immediately following the PET/CT. This eligible study population was adjusted based on the following exclusions: 1) Contraindication to MRI; 2) Patients deemed unstable to undergo extended anesthesia beyond that required for the clinical PET/CT exam; 3) Patients not requiring sedation for their PET/CT but deemed unable to lie still for the subsequent PET/MR (no patient received sedation solely for the purposes of the research PET/MR exam); 4) Patients/families choosing not to participate, 5) Scheduling conflicts in the department precluding contiguous examinations on the PET/CT and PET/MR scanners and 6) Artifactual degradation to image quality. This Health Insurance Portability and Accountability Act compliant study was approved by the local Institutional Review Board. Informed consent was obtained.

PET/CT imaging

Clinical brain FDG-PET scans were performed using a large-bore PET/CT scanner (Philips TruFlight Select PET/CT; Philips Healthcare, Andover, MA) according to a standard clinical protocol [17]. Patients were instructed to have no caloric intake for 6 h prior to the study. All patients were normoglycemic (blood glucose below 150 mg/dL) at the time of FDG injection. After the injection of a weight-based dose of 2-(18F)-fluoro-2-deoxy-D-glucose (0.1 milliCurie/kg; minimum/maximum: 1/10 mCi), patients rested for 30 min in a quiet room to allow for tissue uptake. PET scans were performed for 12 min using 3-D acquisition and time-of-flight technology over two table positions. Each data set consisted of 90 transaxial PET images with 2 mm slice thickness and 256 mm field of view (voxel: 2×2×2). CT images for attenuation correction were acquired for each patient with the same protocol (120 kV, 40 mAs, pitch: 0.813, slice thickness: 2 mm). Images were reconstructed using 3-D filtered back projection and utilizing standard CT attenuation correction.

PET/MR imaging

All research PET/MR exams were performed immediately following completion of the PET/CT exam, without additional FDG injection, on a tandem PET/MR scanner (Philips Ingenuity TF; Philips Healthcare, Andover, MA) [3]. This scanner combines PET imaging hardware identical to that of the clinical PET/CT scanner described above with a 3-Tesla magnet. PET scans were again acquired for a total of 12 min using 3-D acquisition and time-of-flight technology over two table positions; scan parameters were identical to those for the PET/CT acquisition. Attenuation correction MR images were acquired without contrast using a 32-channel head coil. Attenuation maps were created based on an axial volumetric T1-weighted spoiled gradient echo sequence (TR/TE [ms]: 4.2/2.2, flip angle [degrees]: 2, voxel [mm]: 2×2×2) using a three-segment, atlas-based algorithm implemented in software provided by the scanner manufacturer and cleared by the U.S. Food and Drug Administration [18].

Image analysis

The attenuation corrected FDG-PET exams acquired on both scanners were coregistered to the MR attenuation correction sequence for the purposes of anatomical correlation (both PET exams were coregistered to the MRI for that patient to assure blinded interpretation with respect to the type of PET scanner) (Fig. 1). It is important to note that this design was adopted to compare the diagnostic accuracy of FDG-PET data obtained from the two modalities; it does not test the added value of PET/MR per se. Five pediatric radiologists with expertise in FDG-PET imaging of the brain reviewed all exams using dedicated viewing software (Intellispace Portal; Philips Healthcare, Andover, MA). Post-fellowship experience of the reviewers was as follows: V.S.: 8 years; A.S.: 3 years; M.J.P.: 7 years, N.M.: 6 years and J.Y.J.: 15 years. Brain exams for 35 patients were included. Each FDG-PET exam was read independently; hence, each reader interpreted a total of 70 FDG-PET exams (consisting of a PET/MR and a PET/CT acquired PET exam for each of the 35 patients). MR-acquired exams were presented in the first batch in approximately half of the patients (n=18); CT-acquired exams were presented in the first batch in the other half (n=17). Readers were blinded to the scanner type as well as all other clinical and imaging data; an indication of “localization-related epilepsy” was the only information provided. Each reader interpreted each examination as “normal” versus “abnormal.” For abnormal studies, the lobe (or lobes) of abnormality was documented. Studies with abnormal tracer accumulation spanning multiple lobes in one hemisphere in non-contiguous fashion were categorized as “multifocal.” Studies with abnormal FDG uptake in both hemispheres were categorized as “bilateral.” In sum, 350 interpretations of 70 FDG-PET exams in 35 patients were then compared to the reference standard. The reference standard was considered the lobe of seizure origin determined after work-up at the institution’s interdisciplinary epilepsy surgery work rounds. This work-up included, at a minimum, a patient’s clinical history, physical exam, electroencephalography, brain PET/CT and structural MRI of the brain.

Fig. 1
figure 1

Positron emission tomography (PET)/CT- (a) and PET/MR- (b) acquired [F-18]2-fluoro-2-deoxyglucose (FDG) examinations of the brain in a 12-year-old girl demonstrate a focal area of decreased tracer accumulation consistent with a seizure focus (arrows). During blinded interpretation, both FDG exams were coregistered to the attenuation correction MR sequence (c) and presented independently to the five readers. Axial volumetric T1-weighted (d) and fast spin echo T2-weighted (e) MR images demonstrate a focal area of cortical thickening and blurred gray-white junction in association with a pyramidal tail of abnormal T2 prolongation (white arrows) characteristic of focal cortical dysplasia

A 5-point Likert scale was applied to assess image quality, again the reader was blinded to the scanner type, as follows: 5: an excellent study without artifacts; 4: a good study with minor artifacts not affecting clinical use; 3: moderate artifacts probably affecting clinical use, 2: poor quality with major artifacts not advised for clinical use and 1: extremely poor quality with major artifacts and not clinically useful.

Statistics

Statistical testing was performed using R statistical software package, version 3.0.2 (R Foundation for Statistical Computing, Vienna, Austria). Each of the 350 observations from the readers above (175 observations from PET/MR and 175 observations from PET/CT) regarding lobe of origin was compared against the reference standard. Sensitivity, specificity and overall accuracy with 95% confidence intervals of PET/MR and PET/CT-acquired FDG exams for the localization of a seizure focus were calculated using standard methods [19].

Our primary analysis was to test for non-inferiority of the accuracy of PET/MR to that of PET/CT. Our a priori non-inferiority margin was a clinically significant 10% difference in accuracy between PET/CT and PET/MR for the localization of a seizure focus. The non-inferiority test for paired data [20] was applied with the following null (H0) and alternative (H1) hypotheses:

$$ \begin{array}{c}\hfill {H}_0:{p}_{MRI}-{p}_{CT}\le -\delta \hfill \\ {}\hfill {H}_1:{p}_{MRI}-{p}_{CT}>-\delta, \hfill \end{array} $$

p MRI and p CT are the diagnostic accuracies of PET/MRI and PET/CT, respectively, and δ is the non-inferiority margin of 0.1. The Wald-type asymptotic test for this hypothesis at the level α was given by:

$$ Z=\frac{\ {\widehat{p}}_{MRI}-{\widehat{p}}_{CT}+\delta}{\sqrt{\frac{{\widehat{p}}_{01}+{\widehat{p}}_{10}-{\left({\widehat{p}}_{MRI}-{\widehat{p}}_{CT}\right)}^2}{n}}}\ge {z}_{\alpha} $$

Z is the standard score (z-score), n is the sample size, \( {\widehat{p}}_{MRI} \)and \( {\widehat{p}}_{CT} \) are respectively estimates of p MRI and p CT determined from the study data, \( {\widehat{p}}_{01} \) is the frequency that the PET/CT diagnosis is correct but PET/MR is incorrect, \( {\widehat{p}}_{10} \) is the frequency that the PET/MR diagnosis is correct but PET/CT is incorrect, and z α is the critical value for hypothesis testing determined by a standard normal distribution.

In non-inferiority testing, the type 1 error reflects the likelihood that the null is rejected (i.e. conclude non-inferiority) when the null is true (i.e. the test is inferior). In this study, the interpretations of each reader are not completely independent of one another given that multiple readers interpreted each individual examination. In order to account for the dependent structure of the data imparted by multiple readers, the P-value was adjusted using a Monte Carlo simulation as follows: 1) A simulated data set was generated according to the null hypothesis (that PET/MR is inferior to PET/CT) and the dependent structure observed in our experimental data set. 2) The Z value (see above formula) was calculated on the simulated sample. In this case, the accuracy of PET/CT (p CT )was set to the observed accuracy of PET/CT in our experimental data set and the accuracy of PET/MR (p MRI ) was set to be p CT  − δ. 3) Steps 1 and 2 are repeated 10,000 times. 4) Calculate an estimate of Z under the null hypothesis from our experimental data, denoted by \( \widehat{Z} \). 5) The adjusted P-value is calculated as the frequency with which the value of is \( \widehat{Z} \) larger than or equal to Z for the simulated data sets. In other words, at what frequency do we reject the null and spuriously conclude non-inferiority. The adjusted P-value threshold below which the null hypothesis would be rejected was set to 0.05.

Wilcoxon signed rank test was performed to compare image quality between the two modalities (alpha: 0.05). Cohen’s kappa coefficient was computed to quantify intraobserver (between a given reader’s interpretations of the PET/MR and PET/CT examinations for the same subject) and interobserver agreement [21]. To assess for potential differences at the level of individual readers, 95% confidence intervals for each reader’s accuracy were computed.

Results

Patients

The study spanned May 2013 through September 2014. During this period, 97 pediatric patients with localization-related epilepsy were referred for a clinical PET/CT examination of the brain. Of these, the following patients were excluded: 22 could not be accommodated for a subsequent PET/MR exam due to scheduling conflicts in radiology, 9 were incompatible with a 3-T magnet (all for vagus nerve stimulators), 7 were deemed by the anesthesiologist unable to undergo anesthesia long enough to span both exams, 12 were unable to complete the MR examination without sedation (but did not require sedation for the PET/CT) and 12 declined. Thirty-five patients with localization-related epilepsy composed the final study group (age: 2-19 years; median: 11 years; 21 males). Twenty-five patients were deemed to have a consistent focal seizure origin according to the reference standard. The majority of these patients (20/25) had associated structural lesions at MRI, the most common of which was focal cortical dysplasia. Distribution of seizure origin and associated structural lesions are presented in Table 1. By contrast, most patients without a consistent focal seizure origin had no structural abnormalities (8/10); one had agenesis of the corpus callosum and one had hydrocephalus.

Table 1 Distribution of seizure origin according to the reference standard and associated structural lesions

FDG-PET imaging

PET/MR exams were performed immediately following completion of the PET/CT in all patients. Mean (standard deviation) start of acquisition after FDG injection was 60 (11) min for PET/CT and 75 (10) min for PET/MR. All exams on both scanners were deemed sufficient for clinical use (Likert scale 4 or 5). Likert scores did not differ significantly between the two scanners (PET/MR mean [SD]: 4.6 [0.5]; PET/CT mean [SD]: 4.5 [0.5]). Representative case examples are presented in Figs. 1 and 2.

Fig. 2
figure 2

Positron emission tomography (PET)/CT- (a) and PET/MR- (b) acquired [F-18]2-fluoro-2-deoxyglucose (FDG) examinations in a 14-year-old boy demonstrate decreased tracer accumulation throughout the right temporal lobe (circles). Coronal fast spin echo T2-weighted image (c) demonstrates volume loss, loss of normal architecture and abnormal T2 prolongation of the right hippocampus (arrow) consistent with mesial temporal sclerosis

Test characteristics of blinded interpretation of the PET/MR- and PET/CT-acquired FDG brain exams compared to the reference standard are presented in Table 2. The accuracy of PET/MR for localization of a seizure focus was not inferior to that of PET/CT (P=0.017). Although this study was specifically designed to compare accuracy as the primary endpoint, there was no statistical difference between the two modalities in any test characteristic (Table 2). Similarly, there was no difference in diagnostic accuracy between the PET/CT and PET/MR-acquired examinations at the level of the individual readers (Table 3). Intrareader agreement was substantial for all readers (Table 3); inter-reader agreement was also substantial (kappa: 0.65).

Table 2 Diagnostic performance of [F-18]2-fluoro-2-deoxyglucose (FDG)-positron emission tomography (PET) brain exams for detecting seizure origin when acquired on a tandem PET/MR scanner versus those acquired on a PET/CT scanner
Table 3 Diagnostic accuracy of blinded interpretation of [F-18]2-fluoro-2-deoxyglucose (FDG)-positron emission tomography (PET) brain exams when acquired on a PET/CT scanner and on a tandem PET/MR scanner and Cohen’s kappa for agreement between the two exams for each reader

Discussion

We performed a single-center, prospective study of brain FDG-PET exams in children with localization-related epilepsy to compare the test characteristics of PET/MR with those of PET/CT. We report two main findings: 1) The diagnostic accuracy of FDG-PET data acquired on a PET/MR scanner was not inferior to those acquired on a PET/CT scanner for localization of a seizure focus and 2) Image quality did not differ significantly between PET/MR- and PET/CT-acquired FDG exams.

The current practice standard for FDG-PET imaging in children with focal seizures is PET/CT fused with anatomical MRI. The accuracy of PET/CT has been previously reported at approximately 78% [22]. Test characteristics of PET/CT in our study were similar to published estimates [9, 23]. Our results extend work that previously demonstrated a strong quantitative relationship between standardized uptake values measured on PET/MR- and PET/CT- acquired exams in both the body and brain [18, 24]. Our findings suggest that, beyond quantitative similarity, PET/MR-acquired FDG brain exams are diagnostically non-inferior to those acquired on a PET/CT scanner. Taken together, these results suggest that PET/MR is a valid clinical alternative to PET/CT in this pediatric population.

Although we compared the accuracy of the two modalities in toto, PET hardware was identical between the two scanners. Hence, potential differences between the two FDG exams would have resulted mainly from attenuation correction. There are three basic methods for MR-based attenuation correction: 1) Direct segmentation: assigns predefined attenuation coefficients to tissue classes segmented from a structural image, 2) Atlas-based approaches: Attenuation maps generated using transmission-based correction are applied by coregistration of CT templates to the patient’s MR images and 3) Machine learning: Tissue types are predicted on the basis of structural MR images using a pretrained computer-learning algorithm. In our study, a 40 mAs CT was used to correct the PET/CT-acquired FDG exams using a standard transmission-based algorithm [25]. The PET/MR exams were corrected according to a vendor-provided segmentation-based algorithm [18, 24]. This correction technique assigns attenuation coefficients to each voxel based on automated segmentation of a 3-D-acquired T1-weighted image into three tissue classes: air, lungs and soft tissue. The MRI-derived attenuation map is then used to correct the raw PET emission data according to the same algorithms used in PET/CT. Using this three-segment model for attenuation correction, we found that the accuracy of seizure localization using PET/MR was not inferior to that obtained using PET/CT-acquired images. It should be noted that the three-segment model for attenuation correction used in this study does not account for bone. This shortcoming would be expected to result in less accurate correction in certain patients, particularly those with asymmetries in the skull. Future implementation of MR sequences that are capable of imaging cortical bone, for example acquisitions with ultra-short TE [14, 26], will be of great value to optimizing attenuation correction in PET/MR.

Patient-related artifacts in MRI, most notably susceptibility, would be an additional potential source of differences between PET/MR and PET/CT-acquired FDG exams. Magnetic susceptibility refers to the tendency of a structure, when subjected to an external magnetic field, to produce a magnetic field contribution of its own. Not only do such artifacts reduce the image quality of a diagnostic MR exam, they have the potential to impact the efficacy of attenuation correction. Although overall we observed no significant degradation to either PET image quality or diagnostic accuracy, awareness of potential patient-related artifacts (for example, EEG leads) will be important to appropriate patient selection as PET/MR becomes more widely utilized.

To our knowledge, there is very little data evaluating the diagnostic accuracy of PET/MR-acquired brain exams in children. However, our results are in general agreement with previous studies using PET/MR to evaluate central nervous system abnormalities in adults. For example, Boss et al. [27] demonstrated similar image quality and excellent tumor delineation by both PET/CT and PET/MR in adults with brain tumors. And Schwenzer et al. [28] reported similar observations in an adult cohort with intracranial masses, head and neck tumors, and neurodegenerative disorders. Of note, both studies utilized C11-methionine as a radiotracer. However, our results are also in line with prior work using FDG-PET/MR in several oncological populations, including adult head and neck tumors [29], bone tumors [30], lung cancer [31] and a general pediatric oncological cohort [32], which has demonstrated no statistical difference in diagnostic performance compared with PET/CT.

This study has several limitations. First, this is a highly selected cohort of pediatric patients with localization-related epilepsy. Generalization of these results to patients with other central nervous system abnormalities, or even adults with epilepsy, may not be valid. Second, the PET/MR exam was acquired after the PET/CT in each case. This workflow was developed to assure that the research study did not interfere with successful completion of the clinically requested PET/CT exam. Although the effects of this limitation are difficult to predict with certainty, both exams were performed on average within the generally accepted plateau phase. In other words, since brain tissue concentrations of FDG reach a plateau within 1 h (the average time of PET/CT acquisition in our study), it is unlikely that continued uptake between the two exams would have had a significant impact on image quality [33]. Similarly, we would expect very little effect of radiotracer decay over the interval between the two PET exams given the long half-life of FDG. Third, we tested a non-inferiority margin between the two imaging modalities of 10%. Although this number represents a relatively small difference in overall diagnostic accuracy, our findings do not imply that smaller differences in accuracy are not of clinical relevance. Fourth, the gold standard was defined, in small part, using information obtained from the PET/CT (but not the PET/MR) acquired FDG-PET exam. This design could result in a small advantage for PET/CT and, therefore, may be limited in terms of its ability to identify potential superiority of PET/MR. In terms of the primary goal of this study, however, this advantage contributes to a more stringent assessment of the non-inferiority of PET/MR. Finally, it is important to note that these results do not necessitate a choice of PET/MR over PET/CT when imaging children with focal epilepsy. It is likely that both tools will continue to have their place in the clinical armamentarium of FDG-PET imaging. These data simply demonstrate that, should you choose to acquire FDG-PET exams of the brain on a PET/MR scanner in this patient population, there is no significant cost in terms of diagnostic accuracy.

Conclusion

In a cohort of children with localization-related epilepsy, we report the diagnostic accuracy of PET/MR was not inferior to that of PET/CT for localization of a seizure focus. These results suggest that PET/MR is a valid clinical alternative to PET/CT in children with epilepsy.