Keywords

12.1 Introduction

Soon after the clinical implementation of PET/MR in 2011 and after initial feasibility studies, first comparative studies reported a draw between PET/MR and PET/CT with regard to the staging of head and neck cancer patients [1,2,3,4,5,6,7,8]. In the long run, however, PET/MR is expected to outstrip PET/CT in this field. This chapter aims to highlight current clinical applications and future directions of PET/MR in head and neck oncology.

PET/MR combines the advantages of molecular tumor imaging and high soft tissue contrast in one single examination—in contrast to the traditional approach, which often required both PET/CT and contrast-enhanced MR for a thorough workup of head and neck cancer patients. A more widespread clinical use of PET/MR is currently limited by monetary issues. On the one hand, the price for a PET/MR scanner including its requirements on building infrastructure is at least threefold the price of a PET/CT scanner. On the other hand, several countries are still lacking reimbursement models for PET/MR.

Centers with access to a PET/MR scanner generally prefer PET/MR over PET/CT in the initial staging of head and neck cancer patients—with the exception of cancers arising in the hypopharynx and larynx, where motion plays a prominent role, mainly swallowing [5]. For the nodal staging, both modalities are generally considered equivalent, although certain functional MR techniques might tip the scales toward PET/MR. The lung is the place where the majority of distant metastases of head and neck cancer patients occurs and was considered a black box for MR imaging for a long time [9]. Recent advances in the development of MR pulse sequences, some of them stimulated by the advent of PET/MR and its intrinsic need for proper lung tissue visualization, help elucidate this black box [10,11,12,13].

The scintigraphic imaging of malignant head and neck tumors today is mainly reserved to 18F-fluoro-2-deoxy-D-glucose (FDG). Other radiotracers might play a future role in the imaging of FDG-negative salivary gland malignancies, e.g., 68Ga-prostate-specific membrane antigen (PSMA); in neuroendocrine tumors, e.g., 68Ga-1,4,7,10-tetraazacyclododecane-1,4,7,10-tetraacetic acid (Tyr3)-octreotate (DOTATATE); and in certain non-malignant neoplasms, e.g., 18F-fluorocholine (FCH) in parathyroid adenomas.

Proper photon attenuation correction (AC) is important in the head and neck, owing to oftentimes small-sized lesions of interest, narrow anatomical relationships, and a multitude a neighboring spaces and compartments. This is challenged by an abundance of tissues with different attenuation properties within a small and complex area and by the typical presence of artifacts elicited by dental hardware. Time-of-flight (TOF) PET dataset reconstructions are considered important in the head and neck, both for the detection and correct localization of small lesion but also for decreasing implant-related artifacts and for optimizing the MR-based AC [14,15,16,17,18,19]. While atlas methods can be used for the head in PET/MR, MR-based AC in the neck mainly relies on the Dixon method, which yields four different tissue properties (air, fat, water, soft tissue), but neglects the bone, which represents another black box for MR. In fact, bone assumes μ map values similar to air, while the photon attenuation in both structures is entirely different. This issue might be overcome using MR pulse sequences with ultrashort echo time (UTE) or even—by definition—zero echo time (ZTE) [20,21,22].

12.2 PET/MR Protocols for the Head and Neck

The definition of valid protocols for PET/MR is a key requisite for clinical head and neck cancer assessment. MR pulse sequences should be selected to provide most complementary information or at least confirmatory information with regard to PET data. In integrated PET/MR scanners, the use of TOF PET information helps to identify small lesions, reduces artifacts—particularly those related to metallic implants such as dental hardware—and increases the accuracy of the MR-based AC [14, 15, 17,18,19]. For whole-body exams, the PET/MR protocol is divided into two separate, yet not independent, parts [23, 24]:

  1. 1.

    A fast basic whole-body oncologic PET/MR acquisition which contains the pulse sequence(s) used for MR-based AC, sufficient anatomic correlation, and basic soft-tissue characterization outside the head and neck area

  2. 2.

    A dedicated head and neck PET/MR acquisition that comprises high-resolution anatomical tumor imaging and enables the definition of specific tumor features, such as cellularity and vascularization

12.2.1 Basic Whole-Body PET/MR Acquisition

For the basic whole-body PET/MR acquisition, the patient should be positioned with arms down. The Z-axis scanning range covers the area from the vertex of the skull to the mid-thighs. A dedicated phased-array head and neck coil is used in conjunction with a body surface coil. PET datasets usually require 4–8 bed positions per patient (depending on the individual body height and on scanner geometry) of 2–5 min each (depending on scanner type and injected activity), using 3D image acquisition and reconstruction. The MR pulse sequences used for the AC are acquired during the PET acquisition. For the trunk, usually a T1-weighted Dixon-type sequence is used, whereas for the head, atlas methods may be used alternatively. Since the extremities are subject to wrapping artifacts in MR, AC of the extremities relies on PET data only. The same Dixon-type MR pulse sequence used for AC may also be used for diagnostic imaging, albeit with higher resolution. Repeating this pulse sequence after the administration of intravenous contrast is not essentially needed in head and neck cancer patients. If done, the acquisition takes place after the accomplishment of the non-contrast-enhanced MR pulse sequences of the head and neck. Using the phase-encoding gradient in anteroposterior direction helps reduce flow artifacts, especially in the posterior pharynx and larynx, although this requires a somewhat longer acquisition time [25].

Another diagnostic whole-body MR pulse sequence acquired is usually a T2-weighted image dataset, preferentially in coronal plane and with fat suppression, e.g., single-shot fast spin echo (SSFSE). One specific MR pulse sequence of the lung should also be part of this first step, especially in patients with a high likelihood of pulmonary metastases (higher T stage, lymph node metastasis in the lower neck, non-epithelial primary carcinoma, more than one primary tumor, etc.) [24, 26]. Most centers use a T2-weighted pulse sequence with motion correction, such as periodically rotated overlapping parallel lines with enhanced reconstruction (PROPELLER; GE Healthcare, Waukesha, WI). This sequence is acquired axially during free breathing and uses respiratory triggering [27, 28]. As an alternative, MR pulse sequences with ultrashort echo time or zero echo time may be considered. A recent study has shown that a free-breathing UTE pulse sequence has a high sensitivity for the detection of small pulmonary nodules (4–8 mm), including those not FDG-avid [29].

The PET acquisition and reconstruction covering the chest may take into account the respiratory motion in order to achieve an optimal standardized uptake value (SUV), particularly in small lung nodules and in lesions located in the base of the lungs. This can be accomplished using bellows-driven gating, MR-driven gating, or PET-driven gating [30]. This first part of the protocol can take from 12 to 25 min, depending on the parameters chosen for PET acquisition and MR pulse sequences.

12.2.2 Regional Head and Neck PET/MR Acquisition

A dedicated head and neck PET/MR protocol should not simply duplicate the clinical head and neck MR protocol routinely performed in the radiology department. A careful selection of MR pulse sequences is needed in order to optimize the diagnostic capability of PET/MR. The field of view and slicing of all regional MR pulse sequences need to be tailored to the head and neck and should not copy the parameters of the whole-body MR pulse sequences. Several studies have addressed the contribution of different MR pulse sequences for the assessment of head and neck tumors with PET/MR.

One typical recommendation is to acquire a T2-weighted sequence in at least two planes, using fat suppression, which gives sufficient information for tumor delineation. This can be supplanted with a T1-weighted sequence without fat suppression in at least one plane, preferably axial. This basic approach is already sufficient in many instances and avoids the injection of MR contrast medium [5]. However, this simple solution is not acceptable for presurgical planning, in stage T4 tumors, in recurrent tumors, and in cases of perineural spread [1, 5]. Here, contrast-enhanced fat-suppressed MR pulse sequences, acquired in at least two planes, are needed [1, 5, 24]. Since such a regional MR acquisition takes comparably long, a separate regional PET acquisition of the head and neck may be part of the protocol—making use of a long local bed time but without increasing the total examination time.

Functional MR sequences are also viable options for tumor characterization. This includes diffusion-weighted imaging (DWI), a surrogate marker of tumor cellularity, and perfusion-weighted imaging, a biomarker of vascularization and neoangiogenesis. Of note, information on tumoral glucose metabolism is already available from FDG-PET and might obviate the need of such rather time-consuming functional MR pulse sequences. PET data may, e.g., be sufficient for therapy response assessment or the characterization of subcentimeter lymph nodes. Occasionally, FDG-avid pathologic lesions remain occult in areas with high physiologic glucose uptake, such as the lymphoepithelial tissue commonly found in the palatine tonsils and lingual tonsils. Here, functional MR pulse sequences, particularly DWI, assume a more prominent role. Recent studies analyzed the association of glucose metabolism, cellularity, and histological parameters in patients with squamous cell carcinoma and showed that DWI and FDG-PET may work as independent and complementary biomarkers [31,32,33,34,35]. Preliminary data are also available for perfusion-weighted imaging as part of a multiparametric PET/MR protocol for head and neck cancer. Covello and coworkers have proven the feasibility of such a protocol, which enables the simultaneous collection of metabolic and functional data [36]. Such a multiparametric approach may allow for a noninvasive characterization of tumor or recurrent tumor and might facilitate treatment planning.

Acquisition time, patient throughput, and individual patient tolerance are important issues to be considered. An advanced MR head and neck protocol takes about 30–35 min. Patients are instructed to breathe softly and not to swallow during the examination. Such is usually well tolerated. Notably, a multiparametric PET/MR acquisition may take longer, and the surface coils covering the head and neck as well as the torso might be uncomfortable and might preclude long acquisition times.

In summary, a regional PET/MR protocol for head and neck cancer should be tailored according to the specific questions that need to be answered. In most cases, the use of MR contrast medium is required, and functional MR sequences may not be needed. Specific parameters of MR pulse sequences have been suggested previously [24].

12.3 PET/MR Imaging of Carcinoma and Lymphoma in the Head and Neck

12.3.1 T Staging

More than 90% of all head and neck carcinomas are of squamous cell histology. The risk factors for this type of tumor comprise smoking and alcohol in general (exponential risk), and certain viral oncogenes in specific subsets, such as Epstein-Barr virus (EBV) in the nasopharynx and human papillomavirus (HPV) in younger patients, mainly in the oropharynx.

When a patient is referred for the initial imaging staging of a head and neck tumor, typically the tumor has already been identified, and good estimates have been made on its local extension and potential infiltration of adjacent structures [37, 38]. There is ample literature on comparing PET/CT and MRI for the local staging of tumors, which basically shows that MRI identifies the tumor extent more accurately, owing to better soft tissue contrast [37]. On the other hand, MR has also been compared with PET/MR, and most authors found no significant difference in the assessment of primary tumors [5, 39]. Based on these findings, one would expect a higher accuracy of PET/MR than PET/CT in the T staging of head and neck carcinomas, but this is controversial. Most studies performed with sequential or simultaneous PET/MR scanners could not prove superiority of PET/MR over PET/CT [1, 3, 5, 6, 8]. Vice versa, this means that PET could compensate for the known shortcomings of CT in the head and neck [1]. If MR contrast medium cannot be injected, PET/MR using only T2-weighted neck imaging yields similar accuracy as contrast-enhanced PET/CT [5].

In our opinion, the imaging of head and neck tumors requires a more specific approach. Data derived from studies, where tumors from several sites in the head and neck are literally lumped together, need to be regarded with caution. A site-specific comparison of modalities is desired, but data in the literature is currently sparse. Site-dependent differences in the accuracy of PET/CT and PET/MR are expected to arise from the anatomical component, i.e., CT or MR, respectively, since uptake measurements are comparable and reproducible among both modalities [4, 40].

It is expected that tumors arising in the oral cavity and in the oropharynx should better be imaged with PET/MR than with PET/CT, owing to less artifacts from dental hardware on MR than on CT and higher soft tissue contrast of MR (Fig. 12.1) [5, 41]. The PET data is compromised both by dental artifacts on CT and MR, which deteriorate the AC—although on PET/MR purportedly to a lesser extent than on PET/CT [17, 19, 42, 43]. The puffed cheek approach has not been studied on PET/MR so far and is expected to be more challenging owing to longer regional acquisition time with PET/MR [44]. Tumors hiding within tissue with physiologically high FDG uptake, such as lymphoepithelial tissue, which might be abundant in the oropharynx, can be missed with PET/MR, unless DWI is used [8, 45,46,47,48].

Fig. 12.1
figure 1

Squamous cell carcinoma arising in the oropharynx. The axial contrast-enhanced, fat-suppressed T1-weighted 18F-FDG-PET/MR image shows an intensively FDG-avid tumor in the base of the tongue on the right side (a, arrow) and an FDG-avid lymph node metastasis in cervical level IIA (arrow head). The tumor extends from the tongue base to the vallecula, as seen on the sagittal T1-weighted 18F-FDG-PET/MR image (b, arrow). On the fat-suppressed T2-weighted image, the tumor is of inhomogeneous signal intensity with predominant hyperintense areas compared to muscle (c, long arrow) and infiltrates the intrinsic and extrinsic tongue musculature (c, long arrow) as well as the parapharyngeal adipose tissue and the right-sided submandibular gland (c, short arrow). The lymph node metastasis is seen as well (c, arrow head)

In the hypopharynx and larynx, the situation is expected to be different. Artifacts there mainly derive from patient motion, such as swallowing or breathing [5, 49]. The soft tissue contrast of CT in this area is sufficient, owing to comparably sharp density increments between the tissues there, such as intralaryngeal air, laryngeal muscles, paraglottic fat, laryngeal cartilage/ossified cartilage, and paralaryngeal fat and muscles. Additionally, CT offers higher spatial resolution. Thus, PET/CT might remain the modality of choice for the imaging of hypopharyngeal and laryngeal tumors, especially for smaller ones—although hybrid imaging is performed uncommonly in T1 tumors, and such small tumors might be missed completely with any imaging modality [5, 50]. There are, however, also studies with small patient cohorts that showed a similar performance of PET/MR and PET/CT in the staging of laryngeal carcinoma [51].

The potential resectability of head and neck tumors depends on several issues [52]. Not all of them are part of the TNM staging system, and if they are part, they not necessarily denote a T4b stage, which is generally considered unresectable. Particularly, the following findings with a head and neck tumor might preclude surgery or lead to a different surgical approach: vascular encasement, invasion of the prevertebral space, perineural spread, orbital invasion, bone infiltration, skull base invasion, dural infiltration, invasion of the laryngeal cartilage, invasion of the brachial plexus, and mediastinal invasion (Fig. 12.2). For all of these items, except bone and cartilage infiltration, PET/MR is expected to yield a higher accuracy than PET/CT, although evidence is currently sparse [1, 5, 53]. Particularly in cases of perineural spread, PET/MR is expected to be more accurate, owing to the higher soft tissue contrast of MR and the availability of differently weighted images [5, 37, 53]. Both contrast-enhanced MR pulse sequences with and without fat suppression may be used for this purpose in general, but preferably non-suppressed sequences should be used in the skull base [38, 54].

Fig. 12.2
figure 2

Esthesioneuroblastoma arising in the ethmoid air cells. The axial non-suppressed (a) and coronal fat-suppressed (b) T2-weighted images show a tumor (arrows) in the right-sided ethmoid air cells. The tumor is of intermediate signal intensity and contains cystic spaces. The osseous nasal septum is infiltrated (arrow head). No extension into the orbita or infiltration of the dura is seen; the nasal bone is preserved. The axial non-suppressed (c) and coronal fat-suppressed (d) T2-weighted 18F-FDG-PET/MR images confirm high FDG uptake of the tumor (arrows) and absence of pathologic FDG uptake in the orbit

PET/MR provides no advantage over PET/CT in specifying FDG-positive incidental findings in the head and neck area [55]. Wang and colleagues showed that FDG-PET/MR can be used for radiation therapy planning in the head and neck, yielding similar gross tumor volumes as contrast-enhanced CT [56].

It is generally agreed upon that local tumor recurrence is best assessed with PET/MR [39, 57,58,59]. PET/CT is limited in identifying a morphological correlate for focal FDG uptake in the postsurgical head and neck [57, 58]. While PET may guide biopsy in such cases, biopsy might be difficult in lesions that are located in so-called blind spots, e.g., the piriform sinus or the postcricoid area, or if submucosal tumor recurrence is suggested and in-depth biopsy (e.g., using laser) is required. On the other hand, muscle tissue of surgical flaps or orthotopic muscle adjacent to such flaps might present with unusual and remarkable FDG uptake, owing to increased muscle tone as a consequence of the altered anatomy in the postsurgical state [37, 60, 61]. The higher soft tissue contrast of MR allows for a more specific assessment of focal FDG uptake identified on follow-up exams of head and neck cancer patients and for a more reliable discrimination of normal muscle and recurrent tumor [24, 37, 39, 48, 57, 59]. Additionally, DWI may be helpful, particularly in irradiated patients [24, 47, 48, 62].

12.3.2 N Staging

The presence of nodal metastases is a very important and independent prognostic factor, which worsens the prognosis of head and neck cancer patients [37]. One single lymph node metastasis already decreases the overall survival by approximately 50% [37, 63,64,65]. The prognosis worsens with the number of lymph nodes involved, with presence of extracapsular spread, and with pathologic nodes in the lower neck (e.g., level IV) [37, 64,65,66,67].

The probably most important tool in the N staging with hybrid imaging is PET, no matter if combined with CT or with MR. Therefore, most studies could not provide evidence that PET/MR surpasses PET/CT in this field [1, 4, 6, 53, 68, 69]. This reflects various previous works on PET/CT, MR, and CT, where a similar accuracy of modalities was found, ranging from approximately 60% to 90% with regard to sensitivity and specificity [9, 37, 50, 70,71,72]. One possible advantage of PET/MR could lie in the more accurate staging of small nodes, especially those with necrotic or cystic centers (e.g., with human papillomavirus subtypes 16, 18, or 31), since these might be faintly FDG-avid or even not avid but are more easily identified on fat-suppressed T2-weighted MR images than on CT [37, 73,74,75]. Another advantage of PET/MR might be the possibility to acquire a regional PET dataset with higher resolution during the regional MR exam, without increasing the total acquisition time. This might help identify small pathologic nodes. The use of DWI does not increase the accuracy of nodal staging with PET/MR [62].

12.3.3 M Staging and Second Primaries

Approximately 10% of all head and neck cancer patients have distant metastases upon initial presentation [37]. Per year, another 5% of patients develop second primary tumors due to field cancerization. The most common site of distant metastases and second primaries in patients with head and neck cancer is the lung [9]. Therefore, PET/MR in head and neck patients should also incorporate adequate lung imaging (see above). Studies have shown that MR lung sequences used in PET/MR, e.g., PROPELLER, detect lung nodules of 3 mm [10, 29]. Moreover, more than 98% of all FDG-negative subcentimeter lung nodules are benign, and 97% of lung nodules missed on PET/MR do not grow [76, 77]. For a more detailed discussion of lung nodules, we refer to Chap. 15 of this book.

Another common site for metastases is the skeleton. Here, some studies with general oncological cohorts reported a higher confidence in PET/MR than PET/CT, although no significant difference was found [2, 78, 79]. Specific studies on bone metastases in head and neck cancer patients today are missing.

Altogether, the majority of the currently published studies report a similar accuracy of PET/CT and PET/MR in the M staging of head and neck cancer patients [1, 2, 4, 36]. An overview on published major PET/MR studies is given in Table 12.1.

Table 12.1 Overview on published major prospective studies in PET/MR of the head and neck, as of May 2017

12.3.4 Lymphoma

The most common type of primary lymphoma of the head and neck region is non-Hodgkin lymphoma (NHL). Approximately 90% of NHL of the head and neck are of B-cell lineage, while only 10% are of T-cell lineage [80]. Tumors are mainly located in the oral cavity, in the nasal cavity, in the paranasal sinuses, and in major salivary glands (Fig. 12.3). Approximately one third of head and neck lymphomas arise in the bone, one third in soft tissues, and one third in multiple structures [80]. More than half of head and neck lymphomas come without pathologic lymph nodes, while in the rest nodal involvement may skip anatomical levels. Sinonasal lymphoma most often manifests as diffuse large-cell B-cell lymphoma (DLBCL), followed by NK/T-cell lymphoma. Sinonasal lymphoma typically presents as a comparably homogeneous mass that is of intermediate signal intensity on T1-weighted images and of high signal intensity on T2-weighted images. Hodgkin lymphoma often also involves neck lymph nodes, but disease is typically not limited to the neck, but occurs also in the mediastinum and sometimes in the spleen.

Fig. 12.3
figure 3

Primary lymphoma of the head and neck, arising from the inferior portion of the submandibular gland. An intensively FDG-avid lesion is seen in the angle of the mandible on the right side on contrast-enhanced, fat-suppressed T1-weighted 18F-FDG-PET/MR image (a, arrow). The lesion is homogeneously hyperintense on the fat-suppressed T2-weighted image (b, arrow) and isointense to muscle on the T1-weighted image (c, arrow). No perilesional stranding is seen in the subcutaneous or parapharyngeal adipose tissue. The lesion displaces the superior portion of the right-sided submandibular gland. Normal lymphoepithelial tissue with moderate FDG uptake is seen as well (arrow heads on ac)

Specific PET/MR studies on head and neck lymphoma are currently lacking. PET/MR experience in more general lymphoma cohorts shows that SUVs from PET/CT and PET/MR are strongly correlated [81]. Apparent diffusion coefficient (ADC) values of lymphoma are unrelated to the SUV, which supports the assumption that both parameter sets represent independent biological information [81]. The diagnostic capability of PET/CT and PET/MR is similar and exceeds whole-body MR imaging with DWI [82]. For a more extensive discussion of PET/MR imaging of lymphoma, we refer to Chap. 22 of this book.

In summary, PET/MR has advantages over PET/CT in tumors arising in the oral cavity and in the oropharynx, as well as in recurrent tumors. The one-stop-shop opportunity, the future optimization of MR-based attenuation correction, the incremental use of MR artifact reduction techniques, and the advancement of MR pulse sequences dedicated to lung imaging might emphasize the role of PET/MR in the imaging of head and neck carcinoma and lymphoma.

12.4 PET/MR Imaging of Thyroid Carcinoma and Other Head and Neck Malignancies

The therapy of differentiated thyroid carcinoma requires thyroidectomy along with postoperative radioiodine therapy in most cases. Once there is a suspicion of recurrence, e.g., with rising levels of thyroglobulin, the detection and localization of recurrent tumor is mandatory in order to guide treatment. Recurrence may occur in the thyroid bed, in regional lymph nodes and soft tissues, or uncommonly in distant sites. The search for recurrent tumor is usually performed with hybrid imaging modalities, such as 123I-SPECT/CT or 18F-FDG-PET/CT in case of suspected dedifferentiation. However, abnormal focal radioiodine uptake or 18F-FDG uptake may be present without an obvious pathomorphological correlate in thyroid carcinoma patients. This can be either because the pathologic lesions are below the spatial resolution of CT or because of certain limitations of the CT protocol. The CT is usually acquired as “low-dose” scan, and ideally without intravenous iodinated contrast, which would render a subsequent radioiodine therapy futile for a couple of weeks.

In this regard, 124I-PET/MR gained some ground. This modality allows for a better morphological correlation than PET/CT for characterizing neck tissue, especially in lesions smaller than 10 mm, thereby improving the pretherapeutic lesion dosimetry [83]. These results are not completely reflected in more recent work, where 124I-PET/MR was indeed superior to PET/CT in detecting iodine-positive lesions, although this remains arguable [84, 85]. Yet, 124I-PET/MR could not distinguish thyroid remnant from metastasis, while the volumetric MR information was considered useful for dosimetry purposes [84, 85].

In another study, 18F-FDG-PET/MR yielded an accuracy similar to contrast-enhanced 18F-FDG-PET/CT in patients with suspicion of dedifferentiated thyroid cancer, except for the detection of lung nodules, where PET/CT was superior [86]. Thus, PET/MR using either 124I or 18F-FDG is recommended at the moment only in cases of pretherapeutic dosimetry and when the use of iodine-based contrast medium is contraindicated. In all other clinical scenarios, PET/CT should be preferred, which also provides shorter acquisition time, better cost-effectiveness, and a somewhat more accurate AC [87].

On the other hand, parathyroid hyperplasia of single or multiple glands might be well addressed with PET/MR using 18F-fluorocholine (FCH), which might be useful both for diagnosis and pretherapeutic planning (Fig. 12.4) [88]. A prospective pilot study investigated the performance of 18F-FCH-PET/MR imaging in ten patients with biochemical primary hyperparathyroidism and inconclusive results at ultrasound and 99mTc sestamibi scintigraphy. This small study reported a sensitivity of 90% for PET/MR in this challenging patient cohort, without any false-positive results, allowing for an accurate localization of adenomas and providing detailed anatomic information [89].

Fig. 12.4
figure 4

Parathyroid adenoma in a patient with hyperparathyroidism and negative ultrasound and negative dual-isotope subtraction SPECT/CT. A tiny choline-positive nodule is seen below the right thyroid lobe on contrast-enhanced, fat-suppressed T1-weighted 18F-FCH-PET/MR image (a, arrow). The nodule is isointense to muscle on the fat-suppressed T2-weighted image (b, arrow), hardly visible as isointense to muscle on the T1-weighted image (c, arrow), and does not take significant contrast on the contrast-enhanced, fat-suppressed T1-weighted image (d, arrow)

Other rare tumors of the head and neck, such as those with neuroendocrine differentiation (paragangliomas) and meningiomas, may also be imaged with PET/MR, especially in conjunction with somatostatin analogue radiotracers, such as 68Ga-1,4,7,10-tetraazacyclododecane-1,4,7,10-tetraacetic acid (Tyr3)-octreotate (DOTATATE). The combination of specific morphological features of these tumors together with 68Ga-DOTATATE uptake allows for an accurate diagnosis and also opens the opportunity for peptide receptor radionuclide therapy, e.g., with 177Lu-DOTATATE, if surgery is not possible [24, 90].

More recently established clinical imaging using 68Ga-labeled prostate-specific membrane antigen (PSMA) might gain importance in salivary gland malignancies, owing to the comparably high physiologic uptake of 68Ga-PSMA in salivary gland tissue. While the primary tumor within the gland and its extent is delineated by the MR component of PET/MR, pathologic lymph nodes and distant metastases are possibly identified using 68Ga-PSMA-PET [91]. This novel radiotracer might play a role especially in those neoplasms, which generally show only faint 18F-FDG uptake, such as adenoid cystic carcinoma or acinic cell carcinoma [24, 49]. However, comparative studies are currently lacking.

In summary, the use of PET/MR in head and neck oncology might exceed the traditional assessment of head and neck squamous cell carcinoma with 18F-FDG. The advent of different tracers in the last decade together with the diagnostic capability of MR in lesion detection and characterization may facilitate the dissemination of PET/MR for the imaging of various types of tumors that occur in the head and neck.

12.5 Multiparametric PET/MR

PET/MR in head and neck cancer is mainly performed for one or more of the following reasons: TNM staging, surgery or radiotherapy planning, prognostic information, therapy response assessment, and detection of tumor recurrence. As with anatomical MR pulse sequences, the use of functional MR pulse sequences should depend on the pertinent clinical information and questions to be answered, in order to optimize the study time and prevent a redundancy of information. The following paragraphs discuss the most commonly used functional MR pulse sequences that might enhance the diagnostic accuracy of PET/MR in head and neck cancer (Fig. 12.5). A summary of the potential benefits of each discussed MR pulse sequence in comparison to the obligatory PET is given in Table 12.2.

Fig. 12.5
figure 5

Squamous cell carcinoma of the nasopharynx. Multiparametric PET/MR acquisition including axial T2-weighted image (a), contrast-enhanced T1-weighted image with fat suppression (b), perfusion-weighted image map of maximum slope of increase (c), diffusion-weighted image (d) and corresponding ADC map (e), as well as co-registered T2-weighted 18F-FDG-PET/MR image (f) shows a heterogeneous FDG-avid tumor in the right-sided lateral wall of the nasopharynx (arrows on a–f). The tumor infiltrates the prevertebral muscles (arrow head on a, b), the right-sided hypoglossal canal (short arrow on a and b), the clivus, the parapharyngeal space including the carotid space, and the masticator space. FDG-negative fluid retention in the right-sided mastoid air cells (gray arrow on a, f) indicates obstruction of the Eustachian tube by the tumor and should not be confused with mastoiditis. Vascularization of the tumor is rather poor (c), indicating unfavorable response to treatment. The tumor shows restricted diffusion (d, e), compatible with high cellularity, and corresponding to poor differentiation, as revealed by subsequent histopathology. Several enlarged and FDG-avid lymph nodes were also seen on the right side (not shown)

Table 12.2 MR pulse sequence benefits in different clinical scenarios in comparison with PET imaging

12.5.1 Diffusion-Weighted Imaging (DWI)

Application of DWI in MR exams for head and neck cancer has been extensively studied and is used for several indications that can be roughly divided into two different scenarios: before and after treatment. The pretreatment scenario provides information on primary tumor location, on nodal status, for therapeutic planning, and for the prediction of treatment response. In the posttreatment scenario, DWI is used for the assessment of therapy response and discrimination of post-therapeutic changes and tumor recurrence.

A recent meta-analysis showed no added value of DWI in detecting the primary tumor but a potential role in nodal staging, allowing the differentiation of benign and metastatic cervical lymph nodes [92]. As to PET/MR, this information is already contained in the 18F-FDG-PET component yet with a higher level of purity [62]. Moving from macroscopic imaging tasks, such as staging, to microscopic imaging tasks, the situation is different. For the assessment of specific features of tumors and lymph nodes, such as their histopathological profile, it was shown that DWI provides information that is complementary to the PET-derived information, similarly as shown before in other parts of the body [31]. Some tumor characteristics, such as high stromal content and low cellularity, are associated with resistance to treatment and increased water diffusivity in head and neck cancers. Thus, in general, high mean ADC values are considered predictors of poor treatment response and outcome [93]. Studies investigating DWI parameters and clinical outcome have shown that high ADC values both in primary tumors and metastatic lymph nodes were able to predict tumor relapse, failure of regional control, and poor disease-free survival [94, 95]. On the other hand, poorly differentiated squamous cell carcinoma usually presents low ADC values and is at higher risk for metastatic disease, being another independent biomarker of poor treatment response [96]. This controversy confirms that prognostication by means of quantitative DWI parameters should be regarded with caution.

For therapeutic planning, DWI has been studied for dose painting in comparison to 18F-FDG-PET. It has been shown that both techniques contain different information, which might influence the target volumes of radiotherapy [97]. Similarly, Rasmussen and colleagues have recently assessed the overlap of the radiation therapy volume of interest, as measured with DWI and 18F-FDG-PET by multiparametric PET/MR in patients with head and neck cancer [32]. They showed that the target volume for radiotherapy overlapped substantially, although not completely, suggesting that glucose uptake and diffusion coefficient yield complementary information, which may be relevant for radiotherapy treatment planning [32].

DWI after treatment provides valuable information for therapy response assessment. A rising ADC value, which may be more reproducible than a single measurement, was described as an early biomarker of response to treatment and might play a role when gadolinium cannot be used [98,99,100]. Treatment-related changes in the head and neck region after surgery and radiotherapy limit the discriminability between viable tumor and therapy-induced inflammation. In this regard, DWI might tip the scales by showing significant lower ADC values in tumors compared to posttreatment changes, especially when using high b-values (higher than 1000 s/mm2) [101, 102].

Some technical issues of DWI need to be addressed. The lack of standardization, such as the choice of b-values and the method to draw the ROI, and the high prevalence of artifacts (susceptibility and movement) might hamper the image acquisition and quality, limiting its reproducibility interindividually and intraindividually [93].

In summary, DWI plays a more complementary than redundant role in the assessment of head and neck cancer using 18F-FDG-PET/MR, with potential clinical benefits in differentiating benign from malignant disease, particularly in the restaging after treatment (recurrence vs. inflammation). It may also play a role for therapy response assessment—provided the protocol is standardized and pertinent thresholds are being established and validated in future studies.

12.5.2 Perfusion-Weighted Imaging

Dynamic contrast-enhanced (DCE) sequences and dynamic susceptibility contrast (DSC) sequences are MR techniques targeting tissue perfusion. They are performed after the intravenous injection of contrast medium. Both techniques depict the vascular properties of a lesion, serving as an imaging biomarker of neoangiogenesis and hypoxia, which can be used to characterize a tissue as malignant and also to predict treatment failure. In head and neck oncology, lesions with high perfusion parameters generally tend to show better response to treatment, probably owing to more appropriate delivery of the therapeutic agents, but are more likely to develop hematogenic metastasis. Some parameters can be extracted from perfusion-weighted MRI, both quantitative parameters such as ktrans (derived through pharmacokinetic models) and semiquantitative parameters (analyzed by the time-signal intensity curve). Several parameters were studied in different clinical settings.

Two main applications of perfusion-weighted MRI in head and neck oncology are identified: pretreatment prognostic information and, more commonly, therapy response assessment. In general, high ktrans values both in primary tumors and in metastatic lymph nodes are related to favorable treatment response [103,104,105]. However, the lack of standardization of image acquisition and analysis renders perfusion-weighted MRI challenging, with oftentimes limited reproducible.

On the other hand, head and neck tumors with higher vascularization are candidates to assess treatment response with perfusion-weighted MRI. In these tumors, perfusion parameters might allow an accurate identification of patients with improved response to chemotherapy and radiotherapy and prolonged survival. This is a reflection of the association of high blood flow and increased oxygenation, resulting in better delivery of the antineoplastic agents and increased radiosensitivity [103]. Even more important, change in ktrans after therapy is much more consistent than the pretreatment measurement. Recent studies provide preliminary data on the parametric response map as an early predictor of treatment efficacy in head and neck cancer and also highlight the potential of posttreatment DCE-MRI in identifying residual masses [106,107,108]. Thus, perfusion parameters could help to individualize therapy, avoiding unnecessary treatment and improving patient survival.

In conclusion, similarly to DWI, perfusion-weighted MR imaging has a potential role in the multiparametric analysis of head and neck cancer. Its preferred applications are most likely the prediction of response to treatment by identifying primary tumors with high blood flow and the anticipation of resistance to chemotherapy and radiation therapy.

12.5.3 Other Functional MR Techniques (MR Spectroscopy, IVIM, and BOLD)

A selection of other functional MR pulse sequences may also aid in the assessment of head and neck cancer. Care should be taken on which MR sequence is essential for the PET/MR examination and which could benefit patient management without increasing the scanning time. Otherwise, the translation of PET/MR into clinical routine would be impaired, and PET/MR might only be reserved for special (and rare) occasions.

One of the first functional sequences studied was MR spectroscopy (MRS). Although challenging due to long acquisition time and complex post-processing and analysis, MRS might play a role in tumor prognostication and in monitoring treatment response [109]. High choline-to-creatine ratios in primary head and neck tumors were observed in patients with poor response to therapy [110]. In the posttreatment scenario, the presence of a choline peak in a residual mass may serve as a marker of residual cancer [111]. However, there are only a few and small studies addressing the role of MRS in head and neck cancer. Therefore, one should take into account the imaging time available and the expected findings before considering including MRS into a PET/MR protocol.

The intra-voxel incoherent motion (IVIM) represents an MR technique that allows the simultaneous evaluation of vascularization and diffusion restriction without the use of contrast medium [112]. Thus, the potential applications of DWI and MR perfusion might be covered with a single acquisition. IVIM can be used to characterize head and neck primary tumors and metastatic lymph nodes [113]. It was also shown useful for monitoring therapy response, where high perfusion parameters were associated with worse clinical outcome, and an increase in IVIM parameters (e.g., diffusion coefficient D) was observed in responders [114, 115]. However, in our experience IVIM still suffers from considerable field inhomogeneities in PET/MR imaging and complex post-processing in general.

Blood-oxygen-level-dependent (BOLD) MR imaging is an indirect biomarker of tumor hypoxia, which might reflect resistance to chemotherapy and radiotherapy of head and neck cancer and consequently poor outcome. The principle of BOLD imaging in oncology is based on tumor oxygenation. It measures the decrease of signal intensity on T2*-weighted images owing to the paramagnetic effect of deoxyhemoglobin [93]. Although promising, feasible, and potentially reproducible, research effort still is needed to prove its efficacy in a clinical environment.

Another potential utility of MR imaging is the use of superparamagnetic iron oxide (SPIO) as contrast agent. Although only preliminary data is available to date, SPIO MR was shown able to distinguish benign and malignant lymph nodes, which is oftentimes difficult with conventional cross-sectional imaging in lesions smaller than 1 cm [116]. Such might be useful if the information derived from 18F-FDG-PET is equivocal.

In summary, the aforementioned functional MR sequences have yet not translated into clinical routine at most centers. They are time-consuming and their reproducibility is often limited. IVIM has been extensively studied for therapy response assessment and may be considered clinically in the future, provided more stable acquisition and faster post-processing are available. BOLD imaging might be used to predict response to treatment but still lacks consistent literature. MRS is technically challenging and might be considered mainly for research purposes. Thus, the implementation of such MR pulse sequences as part of a clinical PET/MR protocol remains questionable, given the compulsory presence of 18F-FDG-PET. Another counterargument is the availability of different radiotracers that image similar biologic processes, such as 18F-fluoroazomycin arabinoside (FAZA) or 18F-fluoromisonidazole (FMISO) for hypoxia estimation [117,118,119].

12.6 Summary

The use of PET/MR gains ground on PET/CT for the assessment of head and neck tumor patients. Once certain technical challenges are solved, such as a further improvement of lung imaging and more robust attenuation correction methods, e.g., by using zero echo time MR pulse sequences, a stable and reliable clinical hybrid imaging modality is at hand. This happens parallel to the clinical implementation of new PET radiotracers, the validation of existing clinical PET radiotracers for new indications (e.g., parathyroid imaging with 18F-fluorocholine), and an increase in the availability of functional MR techniques. Altogether, there is prospect of a more sophisticated and complementary characterization of complex biological processes. The combined assessment of different tumor features, such as glucose metabolism, cellularity, and vascularization, opens up possibilities for a more detailed characterization of the primary tumor and for the prognostication of therapy response and clinical outcome. Therefore, PET/MR represents the optimal non-invasive diagnostic tool for a personalized therapeutic approach in head and neck cancer patients.