Introduction

Most patients with oesophageal cancer present with locally advanced disease or with distant metastases [1]. 18F‑fluorodeoxyglucose (FDG) positron-emission tomography with computed tomography (PET/CT) has an important role in staging oesophageal cancer patients and therefore in selecting patients for potentially curative treatment. In the Netherlands, potentially curative treatment consists of neoadjuvant chemoradiotherapy followed by surgery for operable patients and definitive chemoradiotherapy for inoperable patients or patients with irresectable tumours [2, 3].

PET/CT is increasingly incorporated in radiation treatment preparation. Fusion of PET images with those of the planning CT scan facilitates demarcation of the oesophageal gross tumour volume (GTV) and helps to identify regional lymph node metastases that should be included in the radiation volume [4,5,6,7]. To improve delineation accuracy, automatic threshold-based delineation has been suggested [8]. Using a threshold with a certain standardized uptake value (SUV) or a predefined percentage of the maximum (SUVmax) has been described [6, 8]. The influence of motion-compensated PET/CT techniques on these PET tumour segmentation volumes is unknown.

Furthermore, detecting all lymph node metastases before the start of treatment is of crucial importance, to increase chances of treatment success. Combined PET/CT can reveal lymph node metastases, but its sensitivity is low (approximately 51%) and its specificity is moderate (84%) [9]. Possible explanations for this suboptimal performance are 1) a small proportion (0–32%) of oesophageal tumours are non-FDG-avid [5], 2) small lymph nodes can give a metabolic signal that remains below the detection level and 3) respiratory motion can cause image blurring of the lesions, which can fade out the signal of the lymph node metastases. The technology to reduce the effects of tumour motion induced by breathing was first studied in lung cancer [10,11,12,13]. The four-dimensional (4D) PET/CT can be reconstructed in a time-averaged motion-compensated (MC) PET/CT. Compared to 3D PET/CT, image blurring was significantly reduced, resulting in better characterization of lung lesions and lymph nodes. Also for liver lesions, improved diagnostic accuracy was seen with the motion-compensated PET/CT technique [14, 15].

Like lung tumours and liver metastases, oesophageal tumours and mediastinal lymph nodes are subject to substantial respiratory-induced motion [16, 17]. Furthermore, lymph node metastases of tumours at the distal oesophagus or gastroesophageal junction are expected to be located around the distal oesophagus or celiac trunk where a larger motion is expected than at the upper mediastinum [17]. As a result of the motion-induced image blurring, we anticipate that quantitative measurements are influenced. The metabolic signal spreads during image acquisition and, consequently, we hypothesize that SUVmax will be higher and metabolic tumour volume (MTV) more confined in images acquired on a motion-compensated PET/CT. As yet, the role of motion-compensated PET/CT in oesophageal cancer is not yet defined and the influence on lymph node assessment has not been studied. The aims of this prospective study were to compare 3D and motion-compensated PET/CT for measurement of the SUVmax and MTV of the primary tumour and for detection of lymph node metastases in patients with oesophageal cancer.

Methods

Between April 2014 and December 2016, this single-centre prospective cohort study (NCT02424864) included newly diagnosed oesophageal cancer patients undergoing diagnostic workup with PET/CT in the Netherlands Cancer institute. Patients of 18 years or older, with pathologically proven oesophageal cancer or tumour highly suspicious for oesophageal cancer who were planned for FDG PET-CT scan for staging purposes, could be included in the study. Written informed consent was obtained from all patients according to the International Conference of Harmonisation/Good Clinical Practise (ICH/GCP) and national and local regulations. This study was approved by the institute’s medical ethical committee.

FDG PET/CT scans were acquired using a combined PET/CT scanner (Gemini TF/Big Bore; Philips Medical Systems, Cleveland Ohia, USA). A PET with low-dose CT scan from skull base to thighs was acquired 60 ± 10 min after injection of FDG (190–260 MBq) at 2 min per bed position in 3D mode. Two bed positions centred around the oesophageal tumour were acquired in 4D mode at 4 min per bed position. Subsequently, the 4D CT of the thorax and upper abdominal region was acquired and reconstructed with a slice thickness of 3 mm. The reconstruction voxel size of the PET data was 4 × 4 × 4 mm, while the voxel size of the CT data was 1 × 1 × 3 mm. Motion of the tumour was determined by assessing the displacement during the breathing cycle in 10 phases in left–right (LR), craniocaudal (CC) and anteroposterior (AP) directions with in-house developed software. The 10 breathing phases were used to reconstruct a time-averaged mid position CT and attenuation-corrected motion-compensated PET scan; in addition, the 3D CT scan was used to reconstruct the 3D attenuation-corrected PET scan [11]. Details on the 4D PET/CT acquisition and reconstruction are provided in the supplementary material.

The 3D and motion-compensated PET/CT reconstructions were pseudonymised. Observers (EV, FV) were blinded for technique. The nuclear medicine physician (EV) assessed the quality of the scan on a five-point scale (very good, good, acceptable, poor and very poor).

The tumour was manually indicated and a volume of interest (VOI) was subsequently computed around the tumour using an automatic region-growing algorithm based on a minimum SUV threshold using Osirix software (Pixmeo SARL, Geneva, Switzerland). SUVmax was measured. Automatic segmentations with an SUV threshold of 2.5 and a threshold of 50% of the SUVmax were generated [6, 18]. When these segmentations extended into other FDG-avid organs (e.g. heart or liver) they were manually corrected using the anatomical boundaries on CT. Subsequently, of these segmented volumes, the SUVmean, metabolic tumour volume (MTV) and craniocaudal extension were measured and compared between 3D and motion-compensated reconstructions. Craniocaudal tumour length on PET/CT was also compared with the endoscopic tumour length.

An experienced gastrointestinal nuclear medicine physician (EV) scored the degree of suspicion of the lymph nodes on a four-point scale (highly suspicious, suspicious, dubious or unlikely) based on the intensity of the metabolic signal. The two most suspicious lymph nodes were localized and SUVmax per node was measured. The numbers and locations of highly suspicious, suspicious, dubious or unlikely nodes were compared between 3D and motion-compensated PET/CT.

Measurements between the two groups (3D versus motion-compensated scans) were compared using a paired Student’s t‑test, Wilcoxon signed-rank test and Bland–Altman analysis. Pearson correlation coefficients were calculated. P-values < 0.05 were considered statistically significant. Statistical analyses were performed using SPSS (version 24; IBM Corp., Armonk, NY, USA).

Results

Thirty-nine patients were prospectively recruited for this study. In 2 patients, there was technical failure during 4D reconstruction, resulting in a total of 37 patients available for analysis. The patient characteristics are shown in Table 1. Based on all staging procedures combined, most patients had a clinically node-positive adenocarcinoma located in the distal oesophagus or gastroesophageal junction. Figure 1 shows images of a patient with a node-positive gastroesophageal junction tumour. After staging, patients were treated with neoadjuvant chemoradiotherapy (n = 23), definitive chemoradiotherapy (n = 9), neoadjuvant chemotherapy (n = 1) or palliative treatment (n = 4).

Table 1 Patient characteristics
Fig. 1
figure 1

Motion-compensated (MC) and three-dimensional (3D) positron-emission tomography/computed tomography (PET/CT) of a patient with a gastroesophageal junction tumour. Fused PET/CT (a,d,g,j) and PET only (b,e,h,k) transversal slices, and maximum intensity projection (c,f,i,l) images of the primary tumour are shown in the upper panels (af), and a highly suspicious lymph node at the celiac trunk is shown in the lower panels (gl). The primary tumour showed a maximum standardized uptake value (SUVmax) of 12.0 on 3D PET/CT and 14.7 on MC PET/CT. The pathological lymph node showed a SUVmax of 6.6 on 3D PET/CT versus 7.6 on MC PET/CT

Scan quality was rated as good in 65 and 68% of the 3D and motion-compensated PET/CT scans, respectively. Motion amplitude of the oesophageal tumour was largest in the CC direction and CC motion varied between 0 and 21 mm with a median of 6.0 mm. In 6 patients the CC motion amplitude was > 10 mm. There was no correlation between CC amplitude and 3D/motion-compensated differences in SUVmax (r = −0.18, p = 0.29), MTV2.5 (r = 0.05, p = 0.78) or MTV50% (r = −0.01, p = 0.94; see also Fig. 2).

Fig. 2
figure 2

Difference in maximum standardized uptake value (SUVmax) of the tumour between three-dimensional (3D) and motion-compensated (MC), plotted as a function of peak-to-peak craniocaudal amplitude of the tumour. The horizontal dashed line represents difference = 0. The vertical dashed line represents craniocaudal amplitude = 1.0 cm. (CT computed tomography)

The results of the quantitative analyses are shown in Table 2 and Fig. 3. In 60% of patients the automatically segmented MTVs based on SUV ≥ 2.5 (MTV2.5) had to be edited manually, because they extended into surrounding organs (such as heart, kidneys or liver). The extension of the volume in the oesophagus or stomach was not edited, because tumour extension and oesophagitis or gastritis could not be distinguished.

Table 2 Qualitative analysis of metabolic activity and metabolic volumes on 3D and MC PET/CT
Fig. 3
figure 3

Bland–Altman plot of maximum standardized uptake value (SUVmax) of the tumour (a), SUVmax of the most intense lymph node (b), MTV2.5 (c) and MTV50% (d) measurements on three-dimensional (3D) positron-emission tomography/computed tomography (PET/CT) and motion-compensated (MC) PET/CT. The x‑axis shows the mean of the measurement on 3D PET/CT and motion-compensated PET/CT and the y‑axis shows the difference between the measurements. The dotted lines show the mean differences (3D – MC) between both methods. The solid lines show the upper and lower 95% limits of agreement. (MTV2.5 metabolic tumour volume with SUV threshold of 2.5, MTV50% metabolic tumour volume with threshold 50% of SUVmax)

The primary oesophageal tumour showed a mean (95% CI) SUVmax of 13.1 (10.6–15.5) on 3D versus 13.0 (10.4–15.6) on motion-compensated PET/CT (p = 0.809). The most intense lymph node showed a mean (95% CI) SUVmax of 6.6 (4.3–8.8) on 3D versus 6.8 (4.6–9.1) on motion-compensated PET/CT (p = 0.176). MTV2.5s were not significantly different between 3D and motion-compensated PET/CT. Also, MTVs based on SUVmax ≥ 50% (MTV50%) were similar on both scans (p = 0.554). The Bland–Altman analysis showed mean differences between 3D and motion-compensated measurement (and corresponding 95% limits of agreement) of 0.08 (−3.60–3.75), −0.26 (−2.34–1.82), 4.66 (−29.61–38.92) cm3 and −0.95 (−19.9–18.0) cm3 for tumour SUVmax, lymph node SUVmax, MTV2.5, and MTV50%, respectively (Fig. 3). Percentage differences of 3D and motion-compensated MTV50% showed an interquartile range (IQR) of 92–109%. The two outliers (Figs. 3d and 4 and Supplementary Fig. 1D) were patients with a relatively low SUVmax (≤8) of the primary tumour, resulting in differences in MTV50% between −81% and +160% on the 3D and 4D scans.

Fig. 4
figure 4

Image of a patient with a distal oesophageal tumour with motion-compensated (MC) and three-dimensional (3D) reconstruction. The primary oesophageal tumour showed a maximum standardized uptake value (SUVmax) of 8.0 on MC positron-emission tomography/computed tomography (PET/CT) and 6.9 on 3D PET/CT. The corresponding metabolic tumour volume (green volume) of ≥ 50% SUVmax was 5.89 cm3 on MC PET/CT versus 12.38 cm3 on 3D PET/CT. This was probably caused by extension of the volume of interest (VOI) to an area of oesophagitis due to the lower SUVmax on 3D PET/CT

The median craniocaudal length of the MTV2.5 was 94 mm (range 44–204 mm) on 3D PET/CT and 88 mm (range 56–200 mm) on motion-compensated PET/CT (p = 0.336). The median craniocaudal length of the MTV50% was 44 mm (range 12–120 mm) on 3D and also 44 mm (12–124 mm) on motion-compensated PET/CT (p = 0.645). A comparison of the length of the tumour on endoscopy and the length of the MTVs is presented in Fig. 5. The MTV-based tumour lengths showed poor correlation with the endoscopic tumour length, endoscopic length with MTV2.5 lengths (3D: r = 0.12, p = 0.48, MC: r = 0.07, p = 0.69) and endoscopic length with MTV50% (3D: r = 0.15, p = 0.39, MC: r = 0.07, p = 0.69).

Fig. 5
figure 5

Tumour length (mm) in craniocaudal direction as measured during endoscopy (black circles), length of MTV2.5 (grey squares) and length of MTV50% (black crosses). Patients are ordered by endoscopic tumour length. (3D three-dimensional, SUVmax maximum standardized uptake value, MTV2.5 metabolic tumour volume with SUV threshold of 2.5, MTV50% metabolic tumour volume with threshold 50% of SUVmax)

The degree of suspicion for lymph node metastases based on the 3D and motion-compensated method was classified as follows: 30 and 34 nodes highly suspicious, 20 and 22 suspicious and 66 and 59 dubious, respectively. A more detailed analysis was performed in a subgroup of patients in whom there was a discrepancy in grade of suspicion that would have a clinical impact. In 8/37 (22%) patients, lymph nodes were scored as unlikely to contain metastases on 3D while scored as dubious or suspicious on motion-compensated PET/CT or vice versa. These nodes were correlated with the assessment at endoscopic ultrasound (EUS) and pathology if the nodes were investigated with EUS-guided fine-needle aspiration (EUS-FNA). In 7 of 8 discrepant cases, the nodes defined at motion-compensated PET/CT as “unlikely” nodes were negative at EUS/pathology and “dubious” or “suspicious” nodes were positive at EUS/pathology, while this was conversely so for 3D PET/CT. In one patient the dubious node at motion-compensated PET/CT was suspicious at EUS, but FNA was not representative. These cases with discrepancies were reassessed on both scans. Direct comparison of the 3D with the motion-compensated PET/CT showed no clear differences in the intensity or clarity of these discrepant nodes. If those nodes were to be reclassified, they would be scored with an equal degree of suspicion on 3D and motion-compensated PET/CT.

Discussion

In this study, motion-compensated PET/CT revealed no clear clinical benefit compared to 3D PET/CT imaging in oesophageal cancer patients. SUVmax of the primary tumour and lymph nodes was similar between 3D and motion-compensated PET/CT. The MTV2.5 and MTV50% were also similar on both scans for the complete group. In a subgroup of patients (SUVmax of the primary tumour of ≤ 8), large differences of the MTV50% were observed between 3D and motion-compensated PET/CT, probably caused by uptake around a SUV of 4 in inflammatory tissue surrounding the tumour, with the result that minor differences in SUVmax could show a large impact on the MTV50%. In addition, threshold-based and endoscopic-based tumour lengths were poorly correlated. Furthermore, the motion-compensated technique did not improve the detection rate of lymph node metastases.

Prior studies on 4D PET/CT in oesophageal cancer focused on the primary tumour volumes only. Guo et al. evaluated the clinical target volumes (CTV) and planning target volumes (PTV) based on 3D CT, 4D CT and 3D PET/CT [19]. Volumes generated with 3D PET with 4D CT were significantly larger than those generated with 3D CT or 4D CT. Direct comparison of 3D PET/CT with 4D PET/CT was, however, not performed. Wang et al. determined GTV volumes on the average 4D PET/CT with auto-contouring methods using eight different thresholds and compared these PET-derived GTVs with a CT-derived GTVs [20]. SUV ≥ 2.5 and SUV ≥ 20% volumes correlated best with tumour length on CT. However, 4D PET/CT was not compared with 3D PET/CT. Scarsbrook et al. compared PTVs on 3D PET/4D CT with 4D PET/CT in 15 oesophageal cancer patients [21]. Volumes on both scans were similar, but overlap analysis demonstrated a median Dice similarity coefficient of 0.88 between both scans, leading to chance of under-coverage. However, the lack of pathology confirmation in this study obviates the clinical interpretation when this part of the volume is excluded from the PTV.

To our knowledge, the presented study is the first study in which the influence of motion-compensated PET/CT on the detection rate of pathological lymph nodes in oesophageal cancer was studied. Unfortunately, motion compensation did not improve the lymph node detection rate. PET/CT is superior to CT in the detection of distant metastases. However, for the identification of regional lymph node metastases, the sensitivity of PET/CT is insufficient [9, 22]. Moreover, the location of suspicious lymph nodes may influence radiation target volumes and/or choice of surgical approach. Suspicious nodes are included in the radiation target volumes [4], but larger radiation volumes increase the chance of toxicity [23, 24]. Postoperative complications increase with extended mediastinal lymph node dissection; a possible long-term benefit should thus be carefully weighed against the risk of a more complicated postoperative course [25]. Preferably, suspicious nodes are cytologically confirmed by EUS-FNA [26].

In this study, the additional value of motion-compensated PET/CT for the metabolic characterisation of oesophageal cancer was limited. There are several possible explanations for this finding. Firstly, the amplitude of the tumour motion was relatively small; on average 6.0 mm in craniocaudal direction. Zhao et al. described an average oesophageal tumour motion of 8.7 mm in craniocaudal direction using 4D CT [16]. The modest motion amplitude in our study might be explained by the 1 h resting phase prior to PET/CT acquisition, which is not applicable in the abovementioned CT studies, resulting in more relaxed patients with a more superficial breathing pattern. Kruis et al. investigated the influence of motion amplitude on 4D PET/CT in lung tumours and liver metastases [11]. For targets with a CC motion of less than 5 mm the benefit of motion-compensated PET/CT was limited, while a clear effect was seen with amplitudes of more than 10 mm. This effect was not observed in our study, possibly since only 6 patients had a peak-to-peak amplitude of > 10 mm. Furthermore, a limitation of the study is the spatial resolution. The full-width half maximum was 4.9 mm. Consequently, detection of differences with amplitudes of ≤ 10 mm is expected to be difficult. Conversely, Kruis et al. showed that for objects with a small volume, e.g. lymph nodes, motion-compensated PET/CT can improve the distinctiveness even with small motion amplitudes [11].

Secondly, the identification of the borders of the oesophageal tumour on PET/CT imaging may also be difficult. Oesophageal tumours frequently show submucosal spread along the oesophagus and proximal stomach, which might complicate demarcation of the tumour boundaries [27]. Also, oesophagitis and gastritis can give an increased metabolic signal and are often seen in oesophageal cancer patients, limiting differentiation of tumour from inflammation. In patients with tumours with relatively low FDG avidity, this can be even more difficult. These factors hamper demarcation of the tumour, even if the blurring due to the respiratory signal has been compensated for. In this study, the craniocaudal extension of MTVs showed large differences with the tumour length as determined during endoscopy.

And lastly, the accuracy of threshold-based delineation is not only influenced by object motion, but also by other factors, such as metabolic activity of the tumour, homogeneity of uptake, voxel size and signal-to-noise ratio [28]. In this study, we only investigated the influence of object motion.

The threshold-based volumes in this study showed low agreement with tumour length on endoscopy. Currently, semi-quantitative PET analyses are increasingly investigated in oesophageal cancer [29, 30]. Threshold-based volumes are proposed for delineation purposes and SUVmax or metabolic changes are proposed for response assessment or as prognostic factors. However, current segmentation methods show shortcomings [28]. Advanced image segmentation algorithms are emerging that can cope with such challenges [31]. Promising results have been shown with algorithms (e.g. voxel-specific threshold method) for delineation that can cope with tumour heterogeneity by using tumour reference value and a background value instead of a simple SUV threshold [32]. Also, different methodologies for segmentation or combined thresholds can contribute to the robustness of semi-quantitative volumes. However, most of these new segmentation methods have not yet been validated and the most reliable and robust delineation method remains to be found.

The limitations of our study are the relatively low motion amplitude of both tumour and lymph nodes during PET/CT acquisition, prohibiting proper analysis of motion-compensated PET/CT in patients with larger amplitude motion. Furthermore, cytological confirmation of lymph node involvement was only available for a limited number of patients and pathological assessment of the actual pretreatment tumour length was not possible due to neoadjuvant chemoradiotherapy or a non-surgical approach. Despite these limitations, the prospective study design with a relatively large patient sample contributes to the currently available literature.

In the future, the impact of PET-based measurements may increase even further. In the prospective dose-escalation phase I/II study of Yu et al., definitive chemoradiotherapy was given with an escalated simultaneous integrated boost to the volume of ≥ 50% SUVmax [33]. Encouraging local control rates and acceptable toxicity were seen. Unfortunately, the PET scan technique and segmentation algorithm were not described in the manuscript. For interpretation and implementation of results of studies using PET-volume guided treatment, uniform segmentation methods and PET technique are necessary.

Conclusion

Metabolic tumour volumes of the primary tumour and the SUVmax of the primary tumour and lymph nodes metastases were similar between 3D and motion-compensated PET/CT in the majority of patients. Motion compensation during PET/CT in oesophageal cancer patients did not improve the detection rate of lymph node metastases. Automatic segmentation should be used with caution, especially in patients with a primary tumour with a relatively low SUVmax.