Introduction

Head and neck cancers (HNC) are among the most prevalent cancers, accounting for more than 550,000 cases annually worldwide [1]. The primary risk factors associated with HNC include tobacco use, alcohol consumption, human papillomavirus (HPV) infection (for oropharyngeal cancer), and Epstein-Barr virus (EBV) infection (for nasopharyngeal cancer) [2]. Standard treatments for head and neck cancers include radiation therapy and surgery, and for certain types of head and neck cancer, chemotherapy. Survival is partly poor and has partly improved over the past three decades [3], possibly related to a better tumor staging system and a strict follow-up protocol, including the imaging approach.

18F-fluorodeoxy-D-glucose (FDG) positron emission tomography/magnetic resonance imaging (PET/MRI) appears to be an excellent tool for imaging evaluation of head and neck cancers due to the high soft-tissue contrast provided by the MR component and the possibly better characterization of HNC. The definition of MR protocols for PET/MRI is challenging due to the vast amount of differently weighted sequences, but acquisition time should ideally be no longer than a standard PET/CT with contrast media. Furthermore, because of the availability of the PET component, the information provided by the MR component should be complementary or confirmatory to the PET information, but not redundant.

Diffusion-weighted imaging (DWI) was initially established as a functional MR technique that helps in detecting acute stroke, and has been researched for a wide variety of extracranial applications as well [4, 5]. It analyses the structure of a biologic tissue at a microscopic level based on the motion of water molecules. The differences in water mobility are quantified by an apparent diffusion coefficient (ADC). The ADC reflects the signal loss on DWI that occurs with increasing b-values and is inversely correlated with tissue cellularity [68].

FDG is a glucose analogue that accumulates in tumor cells after its transformation to FDG-6-phosphate by hexokinase, which cannot be oxidized further through glycolysis [9]. The semi-quantitative assessment of the glucose metabolism in these cells is expressed by the standardized uptake value (SUV). The SUV is defined as the FDG uptake in a tumor over a certain time interval, considering tracer decay, the administered dose of the PET tracer, and the patient’s body weight [10].

In HNC, DWI has been widely used for tissue characterization for primary tumors and lymph node (LN) metastases, prediction and monitoring of treatment response, and differentiation of recurrent tumors from post-therapeutic changes [11]. However, even if DWI and FDG do show different molecular phenomena, in the context of PET/MRI with the FDG-PET component available, the additional DWI component might actually represent redundant information to FDG-PET. Furthermore, one study has shown that DWI does not improve lesion detection in a PET/MRI protocol for whole-body cancer staging in a mixed oncologic population [12].

Thus, the aim of this study was to analyze whether the addition of a DWI sequence to PET/MRI adds significant additional information concerning lesion detection and characterization in patients with head and neck cancer.

Materials and methods

Patient population

A total of 157 adult patients who underwent PET/CT-MR between February 2012 and March 2013, either for staging or re-staging of various head and neck cancers, were enrolled in this prospective study. The inclusion criterion was the presence of at least one suspicious lesion on full-diagnostic PET/MRI, regardless of the type of treatment for recurrent lesions. The exclusion criteria of the present study included: unwillingness to undergo an additional MR exam, claustrophobia, MR-incompatible medical devices (e.g., cardiac pacemakers, neurostimulators, cochlear implants, and insulin pumps), or possible metallic fragments in the body. The institutional ethics committee approved this study and signed informed consent was obtained from all subjects prior to the examination.

PET/CT and MR imaging

Sequential PET/CT, ceCT, and ceMR were performed on a tri-modality PET/CT-MRI setup (full ring, time-of-flight Discovery PET/CT 690, 3T Discovery MR 750 w, both GE Healthcare, Waukesha, WI, USA). The dedicated MR- and CT-compatible shuttle transfer mechanism connecting the MR system and the PET/CT system allowed for PET/CT scanning that was free of radiofrequency (RF) coil-induced artifacts, and also made it possible to ascertain the placement of dedicated RF coils for MR imaging without repositioning of the patient [13, 14].

In accordance with the European Association of Nuclear Medicine (EANM) procedure guidelines for PET imaging, patients fasted for at least 4 h prior to injection of a standard dose of 4.5 MBq per kg body weight [15]. After an uptake time of 30 min, the patients were positioned on the shuttle table in the MR suite and the MR acquisition covering the region between the orbital roof and the cranial end of the sternum was initiated. The images were acquired using a dedicated RF coil (32-Channel HD Head-Neck-Spine, GE Healthcare). The applied MR pulse sequences (PS) included an axial, T1-weighted (T1w), three-dimensional (3D), spoiled-gradient echo pulse sequence (LAVA), an axial, two-point, Dixon-based, T2-weighted (T2w) gradient echo sequence (IDEAL), an axial, two-point, Dixon-based, ceT1w-gradient echo sequence (IDEAL), coronal and sagittal, two-point, Dixon-based, ceT1w-gradient echo sequences (LAVA flex), and an axial DWI. All images were acquired with a slice thickness of 4 mm within a total duration of the MR of 20–25 min (additional scanning parameters are listed in Table 1). The intravenously (IV) injected amount of contrast medium (Omniscan, GE Healthcare) was 0.2 mL/kg body weight at a flow rate of 1.5 ml/s. After completion of the MR scan, the coils were removed and the patient was transferred to the PET/CT scanner, still being positioned on the shuttle board. This ensures an identical position of the patient during the acquisition of both the PET/CT and the MR exam.

Table 1 MR acquisition parameters

Non-enhanced low-dose CT and PET emission data were acquired from the mid-thigh to the vertex of the skull.

PET data was acquired in 3D time of flight (TOF) mode with a scan duration of 2 min per bed position, an overlap of bed positions of 23 %, and an axial field of view (FOV) of 153 mm. The emission data was corrected for attenuation using the low-dose CT (CTAC), and was then iteratively reconstructed (matrix size 256 × 256 pixels, Fourier rebinning (VIP mode), VUE Point FX (3D) with three iterations, 18 subsets).

Image processing

The acquired PET, CT, and ceMR images were transmitted to a dedicated review workstation (Advantage Workstation, Version 4.5, GE Healthcare), which enables the review of the PET, CT, and ceMR images side by side or in fused/overlay mode (PET/CT; cePET/MRI). Due to the calibrated tri-modality system, no software-based image registration was necessary. A previously conducted study validated the image registration accuracy with less than 4 mm of lateral misalignment between CT, PET, and MR data sets, similar to the intrinsic error assessed with phantom measurements [16].

Image analysis

All images were analysed in consensus by a board-certified nuclear medicine physician/radiologist and a radiologist with substantial experience in PET/CT image reading. The presence of at least one suspicious lesion on full-diagnostic PET/MRI was mandatory for further evaluation.

DWI-only, PET/MRI-only, and PET/MRI with DWI images were analyzed concerning the detection and characterization of lesions. First, only the DW sequence was evaluated, with thresholds applied as mentioned below. Then, the PET/MRI with axial T2w fat-suppressed, axial LAVA, and multiplanar ceT1 were analyzed. Last, PET/MRI with DWI was evaluated.

On DWI, a malignant lesion was defined if the mean ADC (ADCmean) value was lower than and 1.2 × 10−3 mm2/s, based on reports available in the literature [1722]. Additionally, several other thresholds were applied, ranging from 1.0 × 10−3 mm2/s to 1.5 × 10−3 mm2/s, in order to identify the threshold with best accuracy. So, the lesions were quantitatively defined as: 1 (higher than 1.4 × 10−3 mm2/s); 2 (between 1.2 and 1.4 × 10−3 mm2/s); 3 (between 1.0 and 1.2 × 10−3 mm2/s); and 4 (lower than 1.0 × 10−3 mm2/s).

For PET/MRI, a malignant lesion was defined based on both functional and morphological criteria. The functional criterion used for the PET compound was a maximum SUV (SUVmax) of at least two-fold higher than the surrounding background activity (as published before). The morphological criteria for malignancy on MRI included: (1) a mass-like lesion with irregular borders and contrast enhancement; (2) enlarged lymph nodes (LN) greater than 1.0 cm in the short axis (and 1.5 cm for angular lymph nodes), cystic, with a necrotic centre, round-shaped, in a cluster formation, with an irregular boundary of the LN capsule and/or extra capsular LN spread. If there were discordant findings between PET and MRI, the combination of the most relevant findings (morphological and functional) was taken into account (e.g., an enlarged and irregular LN was considered malignant even if there was no FDG uptake) [23].

The lesions were additionally classified both on PET/CT and MRI using a likelihood evaluation ranging from 1 to 4 (1, negative (meaning no suspicious lesion detected); 2, probably benign; 3, likely (lesion likely to be malignant); and 4, very likely (lesion with very high suspicion of malignancy).

The standard of reference consisted of the histopathology (n = 65) of the detected lesions, clinical evaluation (n = 59), and imaging follow-up including all other imaging modalities (n = 64).

ADC values were measured for each pixel with b-factors of 0 and 800 s/mm2 using the standard software on the workstation (GE Healthcare, Waukesha, WI, USA). The ADC values were evaluated within a manually drawn, oval region of interest (ROI) placed carefully within the center of the lesions, avoiding apparent cystic changes or necrosis. The b-values were also calculated using the same software after placing the ROI on the DW image (b = 800 s/mm2).

Anatomical localization, SUVmax, mean SUV (SUVmean), ADCmean ,and mean and maximum b-values (b-valuemean and b-valuemax, respectively) were also assessed.

Statistical analysis

All statistical tests were performed using SPSS Statistics Version 21 (IBM, Armonk, NY, USA). P-values < 0.05 were considered statistically significant. Wilcoxon's signed-rank test was used for the comparison of the likelihood evaluation in PET/MRI and DWI. Spearman’s correlation analysis was performed to evaluate the correlation between SUVmax, SUVmean, b-valuemax, b-valuemean , and ADCmean. The Mann–Whitney Test was applied to analyze the difference of SUVmax, SUVmean, b-valuemax, b-valuemean , and ADCmean in malignant and non-malignant lesions. McNemar's test was used to evaluate differences in the accuracy of PET/MRI and DWI compared to the standard of reference.

Results

One hundred eighty-eight (188) lesions were identified in 70 patients (53 men, 17 women; mean age 63.8 years, range 26–86 years).

The remaining 87 patients showed no FDG-positive or suspicious lesions on MRI in PET/MRI of the head and neck area, and were therefore excluded.

Of the 188 lesions, 118 were malignant (37 tumors, 74 lymph node metastases, and seven soft-tissue metastases). Of the malignant lesions, 41 lesions were confirmed by histopathology, 44 by imaging, and 33 by clinical follow-up.

Additionally, 70 benign lesions were detected (56 inflammatory/reactive lymph nodes, 10 unspecific findings, and four Whartin’s tumors) and confirmed by clinical and imaging follow-up.

Of the overall 70 patients with lesions, 16 underwent imaging for primary staging and 54 for follow-up/re-staging. Mean follow-up time after PET/MRI was 196 days (range, 43 [this patient died]–394, median 180 days). Forty-one patients were alive without disease at the end of the follow-up phase, 23 were alive with disease, and six were dead as a result of disease.

The majority of the primary tumor histology (tumor histology at initial staging) was squamous-cell carcinoma (83.1 %). Overall primary tumor staging was as follows: one patient was T0 (1.6 %), 17 were T1 (26.6 %), 19 were T2 (29.7 %), nine were T3 (14.1 %), and 18 were T4 (28.1 %). The initial N-staging was N0 in 25 patients (39.1 %), N1 in eight (12.5 %), N2 in 29 (45.3 %), and N3 in two (3.1 %). The overall patient and tumor characteristics are summarized in Table 2.

Table 2 Patient and tumor characteristics

PET/MRI without DWI had a higher accuracy in detecting malignant lesions than DWI alone (86.8 % vs. 60.6 %, p < 0.001). PET/MRI read jointly with DWI had a slightly lower accuracy, although without statistical significance (86.8 % vs. 84.0 %, p > 0.05; see Table 3). The PET/MRI likelihood evaluation was significantly different from the DWI evaluation (p = 0.001; Table 4). PET/MRI was superior to DWI for all different ADCmean threshold settings, mainly reflected by higher sensitivity (see Table 5).

Table 3 Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy of PET/MRI with DWI, PET/MRI without DWI, and DWI alone (using ADC mean threshold for malignant lesions of 1,2)
Table 4 Likelihood evaluation of PET/MRI and DWI. Numbers represent the evaluation of the lesion found in all patients (number of patients)
Table 5 Sensitivity, specificity, and accuracy of DWI with different ADC thresholds

DWI missed 48 lesions detected by PET/MRI. Thirty-one lesions were not detected by DWI due to technical reasons, such as MR artifacts (due to metal, movement) and lesion location at the edge of the field of view (FOV). The remaining 17 lesions had no restricted diffusion and were therefore missed by DWI.

Of those 48 lesions missed by DWI, 20 were malignant (nine tumors [seven recurrent and two primary], nine recurrent lymph nodes, and two metastasis). Fourteen lesions of those 20 (six tumors, six recurrent lymph nodes, and two metastasis) were missed based on technical issues (artifacts). The remaining six malignant lesions (three tumors and three recurrent lymph nodes) did not show restricted diffusion (Fig. 1).

Fig. 1
figure 1

Thirty-five-year-old male with recurrence of a mesopharynx carcinoma in the right rosenmuller fossa. PET/MRI accurately detected it, but DWI showed no restriction (ADCmean = 1.73 × 10−3 mm2/s)

DWI added 20 lesions to the PET/MRI findings, and of those, 11 were inflammatory/reactive lymph nodes and nine were malignant lymph nodes. However, none of these nine malignant lymph nodes changed the overall staging since other lymph nodes defining the N-stage were already detected in PET/MRI without DWI (Fig. 2).

Fig. 2
figure 2

ac Sixty-nine-year-old man with squamous cell carcinoma of the base of the tongue. a Axial T2w, Axial ceT1w, DWI, ADC, and PET/MRI showing the primary tumor in the base of the tongue. b PET/MRI in different levels showing seven metastatic lymph nodes. c DWI (top) and ADC map (bottom) in different levels showing nine metastatic lymph nodes. In both PET/MRI and DWI, N-staging remains N2c

Concerning the quantitative values of PET and DWI, there was a significant difference (p = 0.001) in SUVmax and SUVmean between malignant and non-malignant lesions (10.5 vs 6.6 and 6.4 vs 4.1, respectively). ADCmean as well as b-valuemean and b-valuemax showed no statistical significant difference between those lesions (Table 6).

Table 6 Quantitative values of PET and in malignant and non-malignant lesions

PET and DWI values were significantly different between subjects referred for staging and re-staging (Table 7).

Table 7 Quantitative values of PET and DWI in primary staging and follow-up/recurrence patients

Discussion

In this study, it has been shown that PET/MRI is superior to DWI alone and that PET/MRI with DWI does not achieve a higher accuracy than PET/MRI without DWI in patients with head and neck tumors. However, DWI is able to add different pathological lesions, although not changing the final staging. Therefore, DWI is not essential in clinical PET/MRI protocols for the staging or re-staging of head and neck cancers.

General aspects

PET/MRI currently is emerging as a potential diagnostic tool that combines the functional PET information on tumor metabolism with the excellent anatomical correlation provided by different PS in MRI [13, 24, 25]. Furthermore, MRI can potentially offer additional physiological sequences that may provide information on tumor microstructure, e.g., with DWI sequences [10, 26, 27].

DWI has been promoted as a useful imaging tool to detect and characterize malignant lesions and predict tumor response in oncology [28]. In head and neck cancer single-modality MRI, DWI is often performed for tumor detection and characterization, to monitor treatment response, and for the differentiation of recurrence from post-radiation changes [11, 29, 30]. However, the quality of the DWI sequence can be severely distorted by susceptibility artifacts, particularly in the head and neck area due to dental implants, and has a relatively low specificity [2931].

Diagnostic accuracy

In our patient population, PET/MRI had a significantly higher diagnostic accuracy and negative predictive value than DWI alone, independent of different thresholds selected for the ADC. Our study showed a DWI sensitivity varying from 44.6 to 82.2 %, depending on the ADC threshold used. However, the specificity decreases at the same time (from 65.9 % to 31.7 %). The current literature is controversial regarding ADC thresholds in oncological imaging. Razek and co-workers have shown that ADC values for residual or recurrent head and neck tumors were significantly lower than that for post-treatment changes [32]. Other publications have demonstrated that ADC was significantly higher in metastatic lymph nodes than in benign lymphadenopathy [33]. However, the ADC thresholds usually differ when analyzing HNC before treatment and after therapy [11]. Since our population includes patients referred for primary staging and for follow-up/recurrence imaging, identifying one ideal threshold value that could fit both at the same time is practically impossible. This is probably reflected in the high number of false-negative findings observed by DWI (22/142). This leads to the limited value of integrating DWI into clinical PET/MRI protocols. On the other hand, PET/MRI is able to clearly detect pathologies both during the initial staging and the follow-up using the same technique.

The superiority of PET/MRI as compared to DWI is also reflected by the different likelihood evaluations. It is already known that multimodality imaging provides information that is superior to morphological or functional methods alone for the detection of malignant lesions [10]. The introduction of PET/CT has demonstrated improved diagnostic performance over PET alone by reducing the number of false-positive findings in patients with initial staging and follow-up of head and neck malignancy [34, 35]. Studying a population of oro- and hypopharyngeal SCC patients, Chan and colleagues have shown that PET/CT affords higher diagnostic capability than whole-body MRI in detecting residual/recurrent tumors or associated second primary tumors [36]. However, when integrating DWI into the PET/MRI protocol in our patient population, no improvement could be found.

Malignant and non-malignant lesions

Another point regarding the ability of PET/MRI to detect and characterize malignancies was the significant difference of SUVmax between malignant and benign lesions. This finding is well-known in the PET and PET/CT literature. Ghanooni and colleagues, for example, found that SUVmax values were significantly higher in malignant than in benign, post-treatment lesions in head and neck cancer patients [37].

Conversely, the quantitative DWI parameters such as b-values and ADCs of malignant vs. benign lesions didn’t show any statistical difference between malignant and non-malignant lesions. This finding is partially in contradiction to the current literature, which states that DWI is able to differentiate benign from malignant lesions in primary staging of head and neck cancers using ADC with different threshold values [6, 11], and is also able to differentiate changes related to treatment from loco-regional recurrence [32, 38]. One potential explanation for these conflicting results is that our b-values are partly lower than previously reported in the literature (800 vs. 1,000–2,000). It is known that the ADC value decreases when the b-value increases beyond 1,000 s/mm2 [30]. The decrease in the observed ADC with an increasing b-value is explained by the decay of biexponential signal intensity [30]. Thus, this requires further investigation as to whether higher b-values should be used in head and neck cancer protocols. The problem with higher b-values is that MR imaging time increases and integration into a clinically acceptable PET/MRI protocol is thus even more difficult.

PET values and DWI parameters

Our study did not show a correlation between SUVmax and ADC or b-values. Recent studies have demonstrated for head and neck cancers that SUVmax is inversely correlated to ADC ratio between different b-values [39] or ADC ratio between minimum and mean [40]. However, Nakajo and co-authors have shown a negative and significant correlation between SUVmax and ADC mean, studying 26 patients with HNSCC before treatment [41]. Our results are probably different due to the fact that our population is heterogenous (as it is in any clinical routine), including different histologic types of HNC and not only primary tumors.

When PET values and DWI values for different populations are compared, it has been found that they yield similar results. A malignant lesion at primary staging, for example, shows a high SUVmax and b-value, and low ADC. Thus, the information provided by DWI is superfluous, since it does not provide any information that wasn't already available from the PET/MRI.

Evaluating 30 patients with different malignancies, Borra and co-workers recently have shown the prognostic potential of b-values compared to PET uptake. The b-values represent the average signal intensity from the native DW images for each ROI, and are therefore easy to use. It was found that a high b-value was more indicative of malignant FDG uptake than the traditionally used signal loss on ADC maps [42]. However, we could not demonstrate this in our patient population with HNC. Thus, b-values might be useful in certain oncological settings, but possibly not in HNC.

Limitations

One limitation of this study was a mixed-population analysis, which included primary staging and follow-up/recurrence exams, as well as different histologic types. This might have limited the ability of the ADC threshold to differentiate benign from malignant lesions. However, it does not render the conclusion invalid, since our purpose was to evaluate the utility of DWI in a PET/MRI protocol for a routine, clinical HNC population, not specifically for primary staging or follow-up. Furthermore, different ADC thresholds were applied without significant differences concerning the overall accuracy compared to significant differences in SUV. Another limitation was the lack of pathological confirmation of all lesions, however, this is not always possible in a clinical setting. The reading process may also be considered a limitation, since two independent readers could possibly enhance the results. The DWI-only evaluation might not be highly relevant in a routine clinical setting, since DWI should be read in conjunction with a complete diagnostic MRI protocol. However, to account for similar DWI studies that are available in the current literature, it has been included.

Conclusion

The use of DWI as part of PET/MRI procedures to evaluate head and neck cancers did not provide diagnostically relevant information in our study. Thus, the use of DWI might not be needed in clinical PET/MRI protocols for the staging or restaging of head and neck cancers.