Introduction

An important technical challenge of integrated brain PET/MR is achieving an accurate PET attenuation correction (AC) comparable with transmission-based computed tomography (CT) [1]. It is a major prerequisite for fluoro-2-deoxy-d-glucose (FDG) PET to be quantitative, especially in the context of early diagnosis of Alzheimer disease (AD). In current PET/MR scanners, AC is accomplished by generating an MRI-based surrogate CT from which AC-maps are derived [2]. MR-based AC methods (MRAC) implemented on clinical PET/MR scanners, use a 2-point Dixon MR which is segmented into four tissues classes (i.e., lung, air, fat, and soft tissue) with predefined attenuation coefficients. In brain studies, it is supplemented by the use of a single average CT template image generated from the average of normal head CT scans (atlas-based AC) or substituted by an ultrashort or zero echo time (UTE/ZTE) MRI to capture bone information. Atlas-based AC method relies on registering the single CT template to patient MR to correct brain metabolism from bone attenuation due to the skull. However, this approach has shown some limitations when applied to unusual head anatomies [3,4,5,6], and is highly dependent on image registration quality of the CT template to individual MR image. ZTE-based AC methods have been suggested as good candidates for accurate and subject-specific AC as ZTE MRI provides signal in tissue with short T2*, enabling the capture of air and bone information [6]. They demonstrated promising performances compared with the atlas-AC in a small number of subjects [5,6,7,8]. However, their performance on brain metabolism measurements in patients with neurodegenerative diseases has not been evaluated so far. Additionally, although ZTE-AC globally improved AC compared with the atlas-AC, a remaining bias was reported in the nasal sinus cavity and temporal bones as these structures, containing bone-air-soft tissue interfaces, were not correctly segmented due to partial volume effect [9].

In this work, we aim to evaluate ZTE-AC in 50 patients referred for cognitive impairment, by comparing it with the atlas-AC method implemented in a clinical setting and to CT-AC method set as reference. The ZTE-AC generation algorithm used here was modified to improve the segmentation in the sinus and temporal bones and detailed in the “Materials and methods” section.

Materials and methods

Population

Fifty consecutive patients (70 ± 11 years old, 28 men) examined at our institution between 2016 and 2017, and fulfilling the following criteria, were included: (1) brain FDG-PET/MR scan (SIGNA 3T PET/MR, GE Healthcare) performed in the context of neurocognitive disorders investigation; (2) available brain CT scan without iodinated contrast agent injection; (3) no skull, face, or neck surgery or injury between CT and PET/MR examinations. All data were extracted from the database of the Pitié-Salpêtrière Hospital, Paris, France, and data use was approved by the French authority for the protection of privacy and personal data in clinical research (CNIL, approval no. 2111722). This study was performed according to the principles of the Declaration of Helsinki.

PET/MR acquisition

Brain PET/MR was acquired 30 min after the injection of 2 MBq/kg FDG. MR imaging included 3D T1-weighted inversion-recovery fast spoiled gradient echo acquisition, 3D FLAIR; 3D susceptibility-weighted MRI and axial diffusion-weighted imaging. Scanning time also included a 20-min single-bed position PET emission scan.

To correct for photon attenuation, a 2-point Dixon (LAVA-Flex) T1-weighted MRI (axial acquisition TR = 4 ms; TE = 1.12 and 2.23 ms; flip angle 5°; slice thickness 5.2 mm, with a 2.6-mm overlap; 120 slices; pixel size of 1.95 × 1.95 mm2, acquisition time 15 s) was acquired yielding water, fat, and in-phase and out-phase contrasts. A ZTE MRI was acquired with the following parameters: 3D center-out radial acquisition; TR = 390 ms; TE = 0 ms; flip angle 0.8°; slice thickness 2.4 mm; voxel size 2.4 × 2.4 × 2.4 mm3; field of view 26.4 × 26.4 m2; bandwidth ± 62.5 kHz; acquisition time 40 s.

AC map processing

Three AC maps were created for each patient: one attenuation image derived from a CT scan (CT-AC map) set as reference, and two MR-based images (ZTE- and atlas-based AC maps) being evaluated.

First, an atlas-AC map was generated, as previously described [10], using a single atlas-based AC method corresponding to a non-rigid registration of an average CT template on the Dixon in-phase image using the implemented process on the PET/MR scanner.

A second AC map was generated from ZTE MRI segmentation (ZTE-AC). An offline post-processing pipeline provided by the manufacturer was used to generate bone tissue estimates from the acquired ZTE data based on the method proposed by Wiesinger et al [11], including sinus-edge correction described in Delso et al [12] and Yang et al [13]. First, bias correction was applied to homogenize ZTE signal intensity over the patient tissue, followed by an intensity normalization by the value of the soft tissue peak in the intensity histogram (intensity of air voxels centered at 0, intensity of soft tissue voxels centered at 1) [14]. Second, a three-class tissue segmentation (bone, air, and tissue) was performed using histogram-based thresholding as follows: air voxels have an intensity lower than 0.25, soft tissue voxels have an intensity higher than 0.85, and bone voxels are in between. Finally, an attenuation coefficient was assigned to each tissue class to generate a pseudo-CT. A value of − 1000 HU was used for air and 42 HU was used for soft tissue. The assumption that attenuation coefficients of soft tissue are uniform is acceptable even in the case of brain atrophy since attenuation coefficients of cerebral spinal fluid (15 HU), gray matter (37–45 HU), and white matter (20–30 HU) are not significantly different. Voxels classified as bone were assigned density values of 42 + 2400·(1 − IZTE) HU, where IZTE represents the normalized ZTE voxel intensity. This relationship linking ZTE intensity to CT HU values, although not determined on this population, was published in Wiesinger et al [15]. Additionally, a number of sensitive anatomical regions were selected to impose additional constraints on the segmentation, namely to prevent false-positive bone voxels in areas affected by partial volume effect as explained in Delso et al [12]. These regions were identified in previous studies [9, 13] and were known to pose difficulties to pure intensity-based bone segmentation. The paranasal sinuses are arguably the most relevant of these regions, due to their complex hollow structure. Another challenging region is the temporal bones, where the air of the ear canal and mastoid cells are in close proximity with some of the densest sections of bone in the body.

CT-based AC method was set as the gold standard as it is used to correct for the attenuation of PET 511-keV photons in PET/CT systems. A non-enhanced, low-dose head CT (120 kVp, 50 mAs) was collected for each patient from brain PET/CT examinations performed in our department with either a Gemini GXL 16, Philips PET/CT system (3 mm slice thickness, 6.8 mm in plane pixel size), or a Biograph mCT Flow, Siemens PET/CT system (2 mm slice thickness, 5.9 mm in plane pixel size). The quality control of CT component of the PET/CT systems (CT number, uniformity, image noise, and image artifacts) was carried out regularly (three times a year). The time interval between CT scan and PET/MR examination was 16 ± 25 months (range, 0–134 months). So, for each patient, CT and MR images were compared by a physician trained in brain imaging to ensure that no anatomical discrepancies were found between MR and CT scans (no skull deformation, post-operative sequelae, or brain lesion). CTAC maps were created after registering CT images to ZTE MR images using an affine registration based on mutual information maximization algorithm with the MIPAV software (CIT-NIH) and visually checked to detect any spatial mismatch [16]. In addition, as attenuation depends on both photon energy and density of absorbing material, we used the bilinear conversion method based on the attenuation of water and cortical bone to convert linear attenuation coefficients from CT energies (120 keV) to 511-keV PET photon energy [17, 18] to generate the CTAC map used as the gold standard [19].

All AC maps (Suppl. Fig. 1) were then processed as follows with MATLAB software (MathWorks Inc.). As described in Zhang et al [20] and Eldib et al [21], the attenuation maps of non-patient objects within the FOV (coils, bed, and all surrounding materials) were added to AC maps using the PET Recon toolbox provided by GE Healthcare. AC maps were smoothed with a Gaussian filter kernel of 10-mm full-width at half-maximum, which is the default smoothing applied on the atlas-AC map by the manufacturer. Then, PET images were reconstructed with each AC map generating three images per patient (PETATLAS, PETZTE, and PETCT corrected with Atlas-AC, ZTE-AC, and CT-AC, respectively) using the following parameters: Ordered Subsets Expectation Maximization algorithm integrating TOF, PSF modeling, attenuation and scatter correction with 8 iterations and 28 subsets, a FOV of 300 × 300 mm2, and a voxel size of 1.17 × 1.17 × 2.78 mm3.

Quantitative analysis

PET images were spatially normalized in the Montreal Neurological Institute (MNI) space with SPM12 (http://www.fil.ion.ucl.ac.uk/spm/software/spm12) and processed to generate maps of mean relative difference and standard deviation for the 50 patients and each AC method as follows:

$$ {\mathrm{Diff}}_{\mathrm{MRAC}}\ \left(\%\right)=\frac{\left({\mathrm{PET}}_{\mathrm{MRAC}}-{\mathrm{PET}}_{\mathrm{CT}}\right)}{{\mathrm{PET}}_{\mathrm{CT}}}\times 100 $$

Additionally, MRAC methods were quantitatively assessed with volumes of interest (VOIs). PET images were normalized in intensity by dividing voxel activity by the mean pons uptake, yielding images of SUV ratios (SUVR). The pons was chosen as its metabolism is relatively preserved in cognitive disorders [22] and not significantly modified by MRAC in preliminary analyses. Ninety cortical and subcortical VOIs from the Automated Anatomical Labeling brain atlas (AAL, http://www.gin.cnrs.fr/fr/outils/aal-aal2/) [23] were applied on SUVR images to extract regional values from PETATLAS (SUVRATLAS), PETZTE (SUVRZTE), and PETCT (SUVRCT). The percentage of relative (ΔSUVR) and absolute (|ΔSUVR|) differences were computed with respect to PETCT, as follows:

$$ \Delta \mathrm{SUVR}\ \left(\%\right)=100\times \frac{\left({\mathrm{SUVR}}_{\mathrm{MRAC}}-{\mathrm{SUVR}}_{\mathrm{CT}}\right)\ }{{\mathrm{SUVR}}_{\mathrm{CT}}} $$

PETATLAS, PETZTE, and PETCT images—previously spatially normalized, normalized in intensity, and smoothed—were also compared on voxel basis with paired t tests, using the following contrasts: (i) PETATLAS/ZTE> PETCT to exhibit cortical regions whose metabolism is overestimated by MRAC in comparison with CTAC; (ii) PETCT> PETATLAS/ZTE to explore regions with statistically significant metabolism underestimation. The threshold was set to p < 0.001 corrected for multiple comparisons with the family-wise error (FWE) method at the voxel level. Only clusters bigger than 300 voxels were considered.

Finally, to test the impact of MRAC on between-groups analyses, two subgroups of patients were extracted from the whole population. Patient selection was established on the basis of suspected diagnosis and/or prominent symptoms at presentation, and metabolic profile on PETCT images interpreted visually by a physician with 15 years’ experience. Two groups were defined: (i) 22 patients with moderate to severe hypometabolism in associative posterior cortex compatible with Alzheimer’s disease (73 ± 8.1 years old; 12 men); (ii) 15 patients with mild cognitive impairment whose cortical metabolism was considered normal at visual inspection (67 ± 16.1 years old; 9 men). Between-groups comparisons were conducted with two-sample t tests with PETATLAS, PETZTE, and PETCT providing three SPM T maps that were subsequently compared, and the results obtained with CTAC being used as reference. Age and sex were set as covariates. Considering the small sample size, the threshold was lowered at p < 0.001 uncorrected, with a minimal cluster size of 300 voxels.

Results

VOI analysis results are detailed in Fig. 1, displaying the mean SUVR difference and standard deviation (SD). Over all VOIs, the ZTE-based AC yielded a lower bias (|ΔSUVR| = 3.6 ± 3.2%) than the atlas method (|ΔSUVR| = 4.5 ± 6.1%). The atlas-AC also yielded higher interindividual (6.8% versus 4.6% for ZTE-AC) and inter-regional variability (2.6% versus 1.4% for ZTE-AC) in comparison with the ZTE method.

Fig. 1
figure 1

Mean SUVR relative difference (a) and standard deviation (b) in PETATLAS (in red) and PETZTE (in blue) in comparison with PETCT. Differences were calculated in 90 VOIs using the AAL toolbox [23] and averaged over the 50 patients. In each VOI, the relative difference is normalized by SUVR in PETCT. In (a), the full black line depicts the zero line and the gray ring indicates the [− 5, + 5] % interval. In (b), the gray disk indicates the [0, + 5] % interval. VOIs are listed in Suppl. table 1

Maps of mean and SD relative difference between MRAC and CTAC are shown in Fig. 2. They show that atlas-based AC overestimates glucose metabolism in the bilateral frontoparietal cortex extending to the temporoparietal junction. The highest overestimation was found in regions close to the vertex, mainly the middle frontal gyrus (ΔSUVR = 4.3 ± 6.5%), supplementary motor area (ΔSUVR = 8.2 ± 7.3%), bilateral paracentral lobule (ΔSUVR = 7.9 ± 6.9%), and superior parietal cortex (ΔSUVR = 7.4 ± 13.0%) as shown in Fig. 2. These results were confirmed by the SPM approach (p < 0.001 FWE corrected; Fig.3 and Supp. table 2). Conversely, the atlas-AC underestimates the regional metabolism below the anterior commissure–posterior commissure (AC-PC) line, in the inferior temporal, occipital, orbitofrontal cortices, the temporal pole, and the cerebellum (Fig. 2). The underestimation reached statistical significance in the cerebellum only (p < 0.001 FWE corrected; Fig. 3 and Supp. table 2). SD maps revealed high interindividual variability in parietotemporal cortex especially in the superior parietal cortex (12.2%) and the right angular gyrus (11.4%) (Figs. 1, 2).

Fig. 2
figure 2

Maps of SUV relative difference and standard deviation of relative difference. Map of SUV mean relative difference (%Diff, upper) and standard deviation (SD, lower) between PETATLAS/ZTE and PETCT overlaid on T1-weighted MRI template in the MNI space

Fig. 3
figure 3

SUV comparison between PETZTE and PETATLAS compared with PETCT: SPM results. Paired t test comparisons between PETATLAS and PETCT (upper) and between PETZTE and PETCT (lower) in MNI space. Statistical maps were thresholded for significance at p < 0.001 FWE-corrected at the voxel level. PETATLAS/ZTE > PETCT are shown in hot colors with positive values to highlight the brain regions with significant metabolism overestimation and PETATLAS/ZTE < PETCT are shown in cold colors with negative values to highlight the brain regions with significant metabolism underestimation. The metabolism overestimation/underestimation is in regard with the PETCT gold standard

Regarding the ZTE-based AC, the metabolism was moderately underestimated through the whole cortex (Fig. 2). Voxel-wise statistical significance was observed in the cerebellum, and the fusiform gyrus extending through the parietooccipital cortex, as well as in a very small cluster in the prefrontal dorsolateral cortex (p < 0.001 FWE corrected; Fig. 3, Supp. table 2). ΔSUVR was − 4.2 ± 4.6% in the cuneus, − 4.9 ± 7.4% in the superior parietal cortex, and − 1.3 ± 2.6% in the cerebellum. No regional metabolism overestimation was found with ZTE-AC. Interindividual variability was lower than that of atlas-based AC (Figs. 1 and 2).

SPM comparisons between patients with normal vs. abnormal PET are shown in Fig. 4 and Supp. table 3. When performed with PETCT, SPM analysis revealed a large hypometabolism of the bilateral posterior associative cortex: the posterior cingulate in the patients with metabolic pattern suggestive of AD (p < 0.001). The same comparison performed with PETZTE provided significant clusters of slightly smaller size and lower t scores but similar topographies, whereas only scattered clusters were found in the left posterior junction and the cuneus when using atlas-based AC.

Fig. 4
figure 4

Voxel-wise comparisons between normal and abnormal brain FDG-PET performed with PETCT (a, reference), PETZTE (b), and PETATLAS (c). Two-sample t test between normal PET (n = 15) and abnormal PET suggestive of AD (n = 22) with (a) CT-AC (whose results are used as reference), (b) ZTE-AC, and (c) atlas-AC. Statistical maps were thresholded for significance at p < 0.001 uncorrected and cluster extent > 300. Results are displayed in neurological convention (left is left)

Discussion

We evaluated the impact of AC algorithms based on CT-atlas and ZTE-MRI on brain metabolism in 50 patients explored for cognitive impairment with PET/MRI. VOIs and voxel-based analyses showed that ZTE-AC method performed better than the atlas-AC, yielding a moderate bias on metabolism measurements with low interindividual and inter-regional variabilities.

PETATLAS showed a significant positive bias in the regions located above the AC-PC line especially those close to the vertex, and a negative bias in the lower regions, especially the cerebellum. Such variability across regions might result in a difficult interpretation of FDG-PET data especially when the cerebellum is used as reference for the activity normalization of the whole brain. Similarly, the interindividual variability was significantly higher with PETATLAS than the one measured with PETZTE, mainly in the parietotemporal junction that is one of the earliest regions involved in Alzheimer’s disease. The limits of the atlas-AC method may be explained by the use of average healthy subjects’ CT scans to generate AC maps, which cannot take into account the specificity of head anatomies. For example, the cranial vault thickness is often overestimated on AC maps, especially in the upper part of the frontal and parietal bones, which could cause metabolism overestimation of the adjacent cortex; on the contrary, the rostral portion of the frontal bone, the mastoid part of the temporal bone, and the skull base may be underestimated or missing, possibly leading to metabolism underestimation observed in the orbitofrontal, inferior temporal cortices, and cerebellum.

When using the ZTE-AC method, the bias decreases by 21% with respect to atlas-AC. ZTE-AC yielded a lower bias (3.6 ± 3.2%) and a reduced interindividual (4.6% versus 6.8%) and inter-regional (1.4% versus 2.6%) variabilities. ZTE-AC yielded a moderate metabolic underestimation mainly in the occipital cortex and cerebellum. On ZTE-AC maps, the skull, mastoid, and the skull base were correctly depicted. Furthermore, the improved ZTE segmentation algorithm showed good performance in the regions reported as troublesome: air cavities in the frontal sinus, maxillary sinus, and nasal fossa were correctly segmented in ZTE-AC maps as well as the mastoid cells. Nonetheless, air cells and segments of the ethmoid bone were sometimes replaced by soft tissues. The occipital bone was sometimes thinner than that observed in CT scan, which may cause metabolism underestimation in the cerebellum.

In the multicenter evaluation of Ladefoged et al [2], the three major types of clinical feasibility MR-AC methods were selected for evaluation: template/atlas-based, maximum-likelihood reconstruction-based and segmentation-based AC (including a MR-AC method using an ultrashort time echo sequence (UTE)). All of the proposed methods had an average global performance within likely acceptable limits (± 5% of CT-based reference). However, the accuracy of ZTE-based AC was not evaluated in this panel. Our results are consistent with those of previous studies [5, 9, 13, 24], which found advantages of ZTE-AC over atlas-AC method, although their results were overall less pronounced. In Sekine et al [9] and Rezaei et al [25], ZTE-AC showed a lower bias (1.77 ± 1.41% and 0.2% ± 0.8%, respectively) than in our study, whereas it was larger (5.6 ± 3.5%) in Yang et al [13]. Sousa et al showed also that ZTE-AC seems to be a more robust technique than atlas-AC for intra- and interpatient variabilities with 68-Ge transmission scan–based AC [24]. Furthermore, multi-atlas AC method was proven to be superior to the single-atlas method [4, 5], but has not yet been compared to ZTE-AC. The ZTE-AC algorithm used in this work was slightly different than the one used in previous studies. The segmentation algorithm had been improved to reduce segmentation errors in the sinus cavity and hollow temporal bones, which may explain differences in performance compared with previous reports. Also, the spatial resolution of ZTE images as well as the smoothing of PET images, which are different in the three studies, might be causing partial volume effect and thus tissue segmentation error in AC maps [8, 11].

Our aim was also to test the feasibility of ZTE-AC in clinical settings, with the shortest acquisition time and processing time possible. ZTE MR acquisition lasts 40 s and the time for PET reconstruction and attenuation correction is less than 5 min making it compatible with clinical constraints. ZTE-AC demonstrated better performances compared with the atlas-AC method in brain PET imaging. The large number of patients included allowed evaluating ZTE-AC in a heterogeneous sample of patients having mild to severe cognitive deficit and variable metabolic defects. In this context, the low interindividual variability of ZTE-AC denotes the ability of this method to successfully deal with subject specificity.

Additionally, voxel-based comparisons between patient subgroups suggested that SPM(t) maps generated with ZTE-AC were close to those obtained with CT-AC, whereas some clusters were missing in some key regions when using PETATLAS images, suggesting that ZTE-AC was more sensitive to detect pathological areas in suspected neurodegenerative dementia than atlas-AC, in our sample of patients.

This study has several advantages and limitations. Compared with previously published studies, this work included a large number of subjects and was the first one to specifically include patients with neurodegenerative disorders. The main limits of this work are its retrospective nature and the use of brain CT scans acquired on different CT systems. This limitation is common for retrospective PET/MRI studies where it is difficult to have both PET/MRI and PET/CT acquisition for the same patient in the same day. However, because spatial mismatch between the CT and MR images can occur leading to quantitative bias, a quality control was systematically performed in each patient for every coregistration process (between CT and MR images, and during spatial normalization). In addition, AC maps were smoothed with a Gaussian filter with full-width at half-maximum of 10 mm (which is the default blurring applied on the AC map by the vendor) to reduce this variability. Another limitation was the delay between PET/MRI and CT scans. For this, every CT and MR image was visually interpreted by a physician trained in brain imaging, to ensure that no morphological discrepancy was found between them. Only patients without skull abnormality or brain lesion appearing during the interval between CT and PET/MR examinations were included in this work.

Despite some limits of this study, the ZTE-AC method seemed to improve accuracy of the FDG metabolism quantification in this population referred for neurocognitive disorder investigation. It could be valuable to support and generalize interpretation of brain PET images with the lowest variability. This new AC method is readily applicable on SIGNA PET/MR and is now implemented on PET/MR software. Additionally, the use of ZTE-AC could be extended to other brain applications such as oncology.

Further improvement of ZTE-AC is expected from deep learning (DL) approaches with convolutional neural networks (CNN). DL has recently been applied to medical imaging with successful implementations showing promising results in segmenting brain structures, bone, and cartilage, and generating discrete-valued pseudo-CT scans from MR images. First results suggest it performs better than current clinical approaches (Dixon-based soft-tissue and air segmentation and atlas-based template registration) [26,27,28]. Recently, Ladefoged et al found that their UTE-based CNN improved quantification accuracy compared with UTE in pediatric brain tumors [29]. Large studies using prospective ZTE MR acquisition for deep learning MRAC are warranted.

Conclusion

The ZTE-based AC improved the accuracy of the metabolism quantification in PET—mainly in the regions close to the vertex—compared with the atlas-AC method. Our results suggest that the ZTE-AC method performed better than the atlas-AC method in patients with suspected cognitive dementia. Although ZTE-AC slightly decreased brain metabolism, it demonstrated a reduced inter-regional and interindividual variabilities. It seemed to improve the identification of regional hypometabolism in the context of cognitive impairment, and hence might be promising in other pathologies involving brain lesions.