Introduction

Head and neck malignancies (HNM) can extend along neurovascular routes, leading to greater difficulty in locoregional control of the disease. Perineural spread (PNS) is associated with worse outcome, including poorer prognosis, lower survival rate, and increased rates of local recurrence and distant metastasis [1]. The early detection of PNS thus has a tremendous impact on patient management, with implications for risk stratification, staging, treatment planning, and follow-up.

Medical imaging guides challenging treatment decisions, with the goal of striking the optimal balance between satisfactory oncological and functional results. Medical imaging then plays a key role in assessing the efficacy of treatment strategies. There is growing interest in the power of big data from medical imaging to fill knowledge gaps and identify alternative metrics that can predict patient outcome at an early stage. In that context, FDG PET provides early prognostic and predictive biomarkers that outperform CT and MRI [2,3,4].

At our tertiary reference centre, we observed that PNS in HNM was rarely reported on 18F-FDG PET/CT, compared to direct tumour extension pathways and lymphatic dissemination [1]. Additionally, we noted that PNS was the least documented route of locoregional spread in the 18F-FDG PET/CT literature, with only a few case reports [5,6,7,8,9,10]. This prompted us to investigate whether intrinsic limiting factors of PET were responsible for poor sensitivity in the detection of cranial nerve involvement. One hypothesis is that the low rate of PNS detection by 18F-FDG PET stems from its intrinsic limitations: limited spatial resolution and the high physiological FDG uptake of the nearby brain.

The current consensus is that there are no significant differences between PET/MRI and PET/CT in HNM, although there is a trend towards higher accuracy for resectability [11,12,13]. However, patients with perineural spread represent a very specific population, requiring specific treatment strategies that are never systematically analysed in PET studies, thus creating a knowledge gap in the current literature. Theoretically, patients with PNS might benefit from hybrid modalities such as 18F-FDG PET/CT or PET/MRI.

This paper constitutes a preliminary pilot study of the extraction of PET biomarkers from PNS, with the intent that these data could enable the development of new PET-guided management strategies. The primary objective of the study was to determine the effectiveness of 18F-FDG PET for the detection of PNS using a case–control design. The secondary objective was to evaluate whether quantitative imaging biomarkers derived from metabolic activity (18F-FDG PET) and anatomical characteristics (contrast-enhanced CT scan and MRI) of the PNS could improve risk stratification of patients for overall survival.

Patients and methods

Patient selection

This case–control study (1:2 ratio) enrolled 81 patients with FDG-avid HNM during the period from May 2009 to July 2017 in a comprehensive cancer centre. The study included consecutive patients reviewed by the tumour board for initial management or locoregional recurrence of PNS of HNM. The reference standard for the diagnosis of PNS was MRI and tumour board conclusion. PNS was confirmed by analysis of histology on surgery (12 patients) or by typical tumour evolution (n = 28) during follow-up. The mean (SD) follow-up was 13 (15) months in the case group and 12 (16) months in the control group. 18F-FDG PET evaluation was performed within a mean (SD) of 21 (24) days from the MRI in the case group and 21 (20) days in the control group. The need for ethical approval was waived by the local research ethics committee.

Imaging protocols

MRI acquisitions (transverse, sagittal, coronal views) were performed with a 1.5-Tesla unit (Signa Excite HD, GE Healthcare, Milwaukee, WI, USA) using T2-weighted sequences and T1-weighted sequences with and without gadolinium chelate injection and fat signal suppression. 18F-FDG PET/CT acquisitions and reconstructions were performed according to the European Association of Nuclear Medicine/European Society of Cardiology (EANM/ESC) guidelines for PET tumour imaging (intravenously injected FDG activity: 3-4 MBq/kg). Images were acquired on a GE Discovery 690 unit (GE Medical Systems, Waukesha, WI, USA) with lutetium-yttrium oxyorthosilicate (LYSO)-based detectors. Whole-body acquisitions (2 min per bed) were obtained 60 min post-injection from head to mid-thigh, including dedicated head and neck reconstructions (3 min per bed), for detection of small lymph node metastases and decreased risk of motion artefacts [14]. Unenhanced CT scans allowed for attenuation correction. A fully 3D time-of-flight iterative reconstruction scheme (GE VUE Point FX [VPFX]; GE Healthcare) was used (ordered subset expectation maximization [OSEM] algorithm with scatter, random coincidence, dead time, attenuation, and normalisation corrections). The isotropic voxel size was 3 mm (matrix: 256×256).

Extraction of quantitative imaging biomarkers

Magnetic resonance imaging

PNS was recorded as positive if two major signs of cranial nerve involvement were registered [15,16,17,18] using a three-plane axis (Fig. 1). The major signs of cranial nerve involvement were as follows [6; 7]: (i) topography of the abnormality typically along or within the distributions of a cranial nerve (focal, linear, or curvilinear) and in close proximity to the primary lesion or the site of recurrence; (ii) enlargement of the cranial nerve in comparison to supposed healthy contralateral nerve and/or contrast enhancement asymmetry. The major signs of foramina extent included the following: (i) the abnormality concerned a foramen involved by a cranial nerve in contact with the tumour (e.g. foramen ovale and cranial nerve V3, foramen rotundum and cranial nerve V2); (ii) enlarged foramen and/or bone erosion were present. Minor indirect signs included (i) absence/replacement of normal juxta-foraminal fat (in the skull base and in the pterygopalatine fossa) on MRI identified by decreased T1 signal intensity on a T1-weighted sequence without gadolinium chelate injection; and (ii) presence of acute or chronic nerve dysfunction (muscular changes due to denervation) which was non-specific. Acute nerve dysfunction was characterised by increased T2 signal, decreased T1 signal, increased enhancement of the muscle, and increased muscle volume (e.g. masticatory space and cranial nerve V3). Muscular atrophy due to a fatty involution of the muscle was recorded as an indirect sign of chronic nerve dysfunction (increased T1 and T2 signal).

Fig. 1
figure 1

Most frequent sites of perineural spread. The most frequently involved cranial nerves are V1, V2, V3, and VII, which pass through the superior orbital fissure (SOF), the foramen rotundum (FR), the foramen ovale (FO), and the internal acoustic meatus (IAM), respectively

18F-FDG PET/CT

Metrics were calculated within the primary head and neck tumour, the PNS, and the brain (contralateral healthy cortex of the cerebellum) using PET VCAR (Volume Computer Assisted Reading; GE Healthcare). The metabolic tumour volume (MTV) was obtained using a segmented volume based on a 42% threshold of SUVmax. The standardised uptake value (SUV), SUVmax, and SUVmean were extracted within this volume of interest. We computed the SUV heterogeneity index (HISUV) as follows: HISUV = SUVmax / SUVmean [19].

Statistical analysis

Data were presented as frequency and percentages for qualitative variables and as mean with standard deviation for continuous variables. The 95% confidence intervals (95% CI) of binary variables were calculated assuming a binomial distribution. Calculated data with p values < 0.05 were considered statistically significant. Analysis of variance (ANOVA) was conducted to compare the distribution of the variables between subgroups, assuming a normal distribution. The diagnostic ability of quantitative imaging biomarkers to detect a malignant process was calculated using a binary classification system (normal tissue vs. malignant process) and a receiver operating characteristic (AUC) curve. A Cox proportional hazards regression model was used for studying the dependency of overall survival time on predictor variables in the population of patients with squamous cell carcinoma (SCC; case group: 14; control group: 53). First, we incorporated the presence of PNS into the TNM Classification of Malignant Tumours (TNM) model (i.e. the current reference standard for tumour staging and risk stratification) to determine whether it added information by multivariate analysis. We then determined the association between quantitative imaging biomarkers extracted from PNS and overall survival by univariate analysis. We computed the association with overall survival in the entire cohort of SCC patients using Kaplan-Meier analysis and divided our cohort into two groups: case and control. All statistical analyses were performed using SPSS version 24.0 software (IBM Corp., Armonk, NY, USA).

Results

Patient characteristics

Eighty-one patients were included: 28 patients in the case group and 53 in the control group. In the case group, the primary tumour histological subtypes were SCC (50%), adenoid cystic carcinoma (14%), and sarcoma (11%). Other subtypes included odontogenic tumour, esthesioneuroblastoma, and embryonal rhabdomyosarcoma. Table 1 provides a summary of patient characteristics.

Table 1 Main patient characteristics

Detection of PNS by 18F-FDG PET

Overall characteristics

See Table 2.

Table 2 Main imaging features in patients with PNS on 18F-FDG PET, CT scan, and MRI

Detection rate

18F-FDG PET/CT detected PNS in all patients in the case group (n = 28/28), with no false-positive results in the control group (n = 0/53). Consequently, the detection rate (95% confidence interval) was 100% (88:100). Figure 2 shows that the SUVmax was greater than 10 in most patients despite the limited spatial resolution of PET.

Fig. 2
figure 2

SUVmax and cranial nerve size in PNS of both SCC and non-SCC patients. Despite the limited spatial resolution of PET, the FDG uptake was significantly increased in all PNS

Typical topography of PNS

We observed a typical distribution of PNS along specific cranial nerves V3 (61%), V2 (43%), V1 (7%), and VII (4%), with a total of 35 instances of PNS detected among the 28 patients.

Increased FDG uptake within PNS

The average maximum metabolism within the primary tumour site of HNM (SUVmax: 19 ± 11) and PNS (SUVmax: 18 ± 11) did not differ significantly from the physiological healthy cortex brain uptake (SUVmax: 15 ± 6) according to ANOVA (p = 0.30 and p = 0.37, respectively). Consequently, there was no significant threshold enabling differentiation of the tumour tissue from the brain tissue: SUVmax (AUC [95% CI]: 0.53 [0.36-0.71], p = 0.69) and SUVmean (AUC [95% CI]: 0.37 [0.20-0.54], p = 0.14, assuming that the higher SUVmean should be associated with the malignant process). We concluded that healthy brain physiological metabolism could potentially mask an increased FDG uptake within PNS, although this was not observed in our series.

Heterogeneity of FDG uptake

The HISUV (mean ± SD) was 1.5 ± 0.2, 1.5 ± 0.1, and 1.2 ± 0.1 within the primary tumour, the PNS, and the brain, respectively. The HISUV of the PNS was similar to that of the primary tumour and 30% higher than that of healthy brain tissue in all cases (ANOVA, p < 0.001). HISUV > 1.3 achieved 95% sensitivity and 100% specificity for differentiating between a cranial nerve invaded by PNS and healthy cortex brain tissue (AUC [95% CI]: 0.98 [0.95-1.00], p < 0.001).

Risk stratification for the prediction of overall survival

We compared the overall survival of patients with SCC between those in the case (n = 14) and control groups (n = 50). Table 1 shows patient characteristics. On average, patients in the case group tended to be younger (53 vs. 61 years of age) and were less frequently N1 (38% vs. 83%) or M1 (0% vs. 4%) than the control group. The management was comparable.

Kaplan-Meier analysis (Fig. 3) showed significantly poorer overall survival in the case group (Kaplan-Meier analysis: p = 0.003).

Fig. 3
figure 3

Kaplan-Meier analysis showed poorer cumulative overall survival (p=0.003) in SCC patients with PNS

In the multivariate Cox proportional hazards regression model (Table 3), the presence of PNS was associated with a hazard ratio of 5.5 (1.4:20.9; p = 0.013). Univariate Cox regression analysis (Table 4) showed that the ratio of cranial nerve SUVmax displayed the strongest association with overall survival (p = 0.001). This ratio compares the metabolism in the PNS with the metabolism in the contralateral cranial nerve. Therefore, the ratio is 1 in the control group, because both cranial nerves are normal.

Table 3 Cox proportional hazards regression model for the multivariate analysis of overall survival data in squamous cell carcinoma patients
Table 4 Cox proportional hazards regression model for the univariate analysis of overall survival data in squamous cell carcinoma patients

Association with clinical symptoms

Quantitative imaging biomarkers were not associated with the presence of clinical sensory or motor symptoms. The ANOVA showed no significant differences between patients with sensory symptoms and asymptomatic patients with regard to SUVmax (p = 0.80), SUVmean (p = 0.95), HISUV (p = 0.20), or the ratio of nerve enlargement (p = 0.27). In addition, ANOVA showed no significant differences between patients with motor symptoms and asymptomatic patients with regard to SUVmax (p = 0.38), SUVmean (p = 0.37), HISUV (p = 0.86), or the ratio of nerve enlargement (p = 0.10).

Discussion

Our preliminary pilot study confirmed the feasibility of extracting 18F-FDG PET biomarkers from PNS in FDG-avid HNM. Our results should encourage the development of new PET/CT or PET/MRI-guided management strategies in further prospective studies.

In a large series including 28 patients with PNS (Figs. 4, 5, 6), we reported 100% detection of PNS by 18F-FDG PET and described five direct or indirect imaging signs: (i) increased FDG uptake; (ii) heterogeneous glucose consumption; (iii) typical topography along/within cranial nerves V3, V2, V1, and VII; (iv) cranial nerve enlargement; and (v) foramen enlargement. These excellent results are the consequence of a very sensitive reading by a pair of specialists trained in the detection of PNS in FDG-avid malignancies. Our findings suggest that the underdiagnosis of PNS using 18F-FDG PET in clinical routine should not be attributed solely to technical limitations, such as the limited spatial resolution of PET and the physiological FDG uptake of the nearby brain.

Fig. 4
figure 4

PNS along cranial nerve V1. T1-weighted sequences with gadolinium chelate (A: axial view, B: coronal view) are compared with 18F-FDG PET/CT (C: axial view, D: coronal view). Increased contrast enhancement (MRI) and increased FDG uptake are associated with a linear enlargement within the distributions of the trigeminal nerve (V1)

Fig. 5
figure 5

PNS along cranial nerve V2. An enlargement of the right cranial nerve V2 is associated with increased FDG uptake (18F-FDG PET/CT: A, B, D) and increased gadolinium enhancement (MRI: C, E). An enlargement of the right foramen rotundum (FR, coronal view: A, B, C) is noted in the skull base. Invasion of the pterygopalatine fossa (PPF) is also observed (axial view: D, E)

Fig. 6
figure 6

18F-FDG PET/CT suggested a relapse earlier than MRI. MRI (T1-weighted sequences with gadolinium chelate: A, C; T2-weighted sequence: B) and 18F-FDG PET/CT (A, B) are compared in coronal views (A, C) and axial views (B). MRI and 18F-FDG PET/CT were performed after radiotherapy (A, B) and reassessed 3 months later (C). Initially, the pattern on MRI was not specific: either a scar due to radiotherapy or relapse can cause increased enhancement. It was observed in the cavernous sinus (arrow), foramen ovale (*), pterygoid muscle, and infratemporal fossa (x). The association with increased uptake of FDG was suggestive of relapse, which was confirmed at 3 months on MRI

Nonetheless, the aim of our report is not to suggest that 18F-FDG PET might replace MRI for PNS detection. MRI is the reference standard [18, 20,21,22] and achieves 95% sensitivity in detecting PNS and 63% sensitivity for accurate mapping of the complete PNS [23], due to its multiplanar capability and superior soft tissue contrast [16]. In comparison, CT is superior in detecting indirect signs of nerve enlargement, such as foramina enlargement and foramina destruction in the bony skull base [16]. Indeed, our opinion is that PET and MRI could be combined, and that contrast-enhanced PET/MRI will outperform PET/CT if PNS is anticipated, as suggested by a study in which PNS was detected in three of 44 primary tumours (7%) [13].

The detection of PNS is clinically significant, since PNS is frequently asymptomatic at the time of initial staging and is rarely detectable at physical examination [16]. Strikingly, half of our patients were asymptomatic, yet imaging revealed a twofold enlargement of the cranial nerve and/or of the skull base foramen in the majority of cases (Table 2 and 3). Our results confirmed non-specific PNS risk factors reported in the literature, including large tumour size, rapid tumour progression, male gender, mid-face location of the tumour, stage T3 or T4 at diagnosis, recurrence after treatment, tumour subtype (SCC, salivary gland tumours), and poor histological differentiation [16]. We found a preponderance of sensory over motor dysfunction (48% sensory symptoms, 41% motor symptoms), which explains patient complaints about pain, paraesthesia, and numbness, in line with the fact that motor dysfunction usually appears at a later stage [16].

We evaluated whether quantitative imaging biomarkers extracted from PNS were effective in predicting patient overall survival. This was intended to be a proof of concept of the feasibility of PET-guided management strategies for further prospective studies. We demonstrated that the presence or absence of PNS on medical imaging is a binary imaging biomarker associated with poorer overall survival in the population of patients with SCC (p = 0.003). The hazard ratio for mortality in patients with PNS was 5.5 in a multivariate analysis including the T-stage, the N-stage, and the M-stage. As a proof of concept, we compared the prognostic value of four quantitative imaging biomarkers: the ratio of cranial nerve diameter, the ratio of foramen diameter, the ratio of cranial nerve SUVmax, and the ratio of cranial nerve HISUV. In univariate analysis, we showed that increased glucose consumption was significantly associated with poorer survival (p = 0.001). We also explored whether heterogeneity in glucose consumption was associated with a poorer outcome, given that (i) increased glucose metabolism is a hallmark of cancer, and (ii) tumour heterogeneity is associated with poorer outcome and treatment failure because of the generation of resistant phenotypes [24,25,26]. We hypothesised that the heterogeneity of FDG uptake would differ between PNS and healthy nervous tissue, as demonstrated in peripheral nerve sheath tumours in neurofibromatosis type 1 [19]. Our results demonstrate that heterogeneity in FDG uptake within the PNS is indeed similar to that of the primary tumour, and is significantly higher than physiological brain uptake. However, the association between HISUV and overall survival did not outperform SUVmax, and HISUV may be difficult to standardise among centres, further demonstrating the current limitations of heterogeneity as an imaging biomarker [27].

The detection of PNS is crucial for preoperative evaluation. The goal of surgery will be complete excision of the cranial nerve involved with free margins; skull base surgery is sometimes considered in centres with surgical expertise [28]. MRI is the reference standard for preoperative surgical mapping [23] and for the detection of postoperative complications (e.g. cerebral spinal fluid leak) and incomplete tumour resection. Because of its ability to detect distant metastases, the use of 18F-FDG PET helps avoid unnecessary surgery. In our series, the majority of patients with PNS had locoregional disease (62% N0, 91% M0).

Diagnosis of PNS is critical for radiation therapy planning, as it can change the dose distribution, fractionation, treatment time, and target volume [29,30,31]. New techniques such as intensity-modulated radiation therapy (IMRT) enable the delivery of an effective radiation dose to the pathological structure, including the entire extent of PNS, while limiting the dose to healthy tissue. Based on our series, further prospective studies should investigate whether the combination of F18-FDG PET and MRI findings could help to define optimal tumour volumes for external beam radiation therapy in patients with PNS by (i) guiding the treatment of patients with complex tumour shape or targets close to critical structures, (ii) defining the biological target volume, and (iii) defining the zone for targeting of the boost dose of radiation therapy, since the anatomical location of failure after radiation therapy is most often within the PET biological target volume [32].

The ability to detect PNS by 18F-FDG PET paves the way for further prospective studies investigating PET/CT- or PET/MRI-guided approaches for the early detection of relapse after locoregional treatment. The diagnostic accuracy of MRI is decreased after locoregional treatment because of nonspecific inflammatory contrast enhancement (Fig. 5: focal increased FDG uptake suggesting a relapse on PET; inflammatory scarring after chemoradiation masked the early relapse on MRI, which was subsequently confirmed on MRI 3 months later). MRI can fail to differentiate post-treatment changes from residual/recurrent tumours. In this context, it is important to stress the excellent negative predictive value of post-therapy 18F-FDG PET at 3 months for the evaluation of both the primary tumour and lymph nodes. Consequently, 18F-FDG PET might be of special interest for response evaluation (Fig. 5) by differentiating inflammatory changes (diffuse and homogeneous FDG uptake) from treatment failure (focal increased and heterogeneous uptake).

There are limitations to our study: (i) because of the low frequency of PNS, the study was retrospective in nature and used a case–control design; (ii) the increased demand for glucose due to the Warburg effect is a hallmark of cancer tissue [33] that is non-specific and can be observed in tissue proliferation such as infectious and inflammatory processes [34]; (iii) the reading was very sensitive by a pair of trained imaging specialists; (iv) the number of patients was limited, although this is the largest PNS series on PET to date; and (v) no alpha adjustment was made in this exploratory study.

In conclusion, 18F-FDG PET/CT enabled the detection of PNS in FDG-avid HNM using the direct and indirect signs described above. Precision medicine approaches guided by PET/CT or PET/MR should thus be evaluated in prospective studies for prognostic stratification, delineation of the biological tumour volume prior to external beam radiation therapy, treatment response evaluation, and early diagnosis of relapse in the post-treatment setting.