Introduction

Trimodality treatment is a frequently chosen therapeutic approach for locally advanced oesophageal cancer. The combination of neoadjuvant radiochemotherapy (RCT) and surgery has been shown to lead to higher rates of complete (R0) resections and improved overall survival (OS) compared with surgery alone [1]. As an alternative to this procedure, patients can also receive definitive RCT to higher radiation doses than prescribed in the neoadjuvant situation. As this nonsurgical approach was shown to have a slightly worse OS in a meta-analysis [2], it is usually reserved for medically unfit patients in whom the perioperative risks are too high. Though postoperative mortality and morbidity have decreased during recent decades and may be further reduced by laparoscopic resection methods [3] and other novel surgical approaches, RCT remains a potentially risky and costly procedure. Given that 20–30% of patients show complete pathological remission even after neoadjuvant RCT, identification of these patients is a pivotal issue in avoiding surgery of highly RCT-sensitive tumours. A promising method for prediction of pathological complete response is the evaluation of 18F-FDG PET parameters, as FDG PET is often performed for restaging prior to surgery [4].

Several studies have assessed the utility of an FDG PET scan during or at the end of RCT to identify patients with a pathological complete response. Some studies have shown associations between pathological complete remission/OS and tumour maximum standardized uptake value (SUVmax) and metabolic tumour volume [5, 6]. Others have not been able to confirm these findings [7, 8] and have raised doubt as to the usefulness of FDG PET during RCT, especially in patients with squamous cell carcinoma [9]. Even positive studies have had insufficient sensitivity and specificity to identify complete responders on an individual basis. Hence recommendations are that FDG PET response of the tumour should not be used as the only marker to decide whether surgery can be omitted. Nevertheless, it has been shown that treatment adaptation based on the FDG PET response is feasible in patients receiving neoadjuvant chemotherapy for carcinomas of the oesophagogastric junction [10] and the FDG PET response seems to be a promising parameter for stratifying patients to trimodality treatment or definitive RCT [11].

Regarding radiotherapy/RCT several studies of tumours at different sites have demonstrated a correlation between acute side effects and patient outcome [12, 13]. In patients with oesophageal cancer, oesophagitis, defined as worsening of odynophagia during treatment, was associated with an almost fourfold increased probability of complete pathological remission and better OS than in patients without worsening of odynophagia during therapy [14]. However, clinical assessment of oesophagitis is difficult as it needs careful consideration of the temporal course and is often observer-biased, especially in retrospective analyses. We recently identified and validated oral non-tumour FDG uptake measured on midtreatment PET, that can be assumed to ba a surrogate for inflammation, i.e. radiation-induced mucositis, as a prognostic marker for long-term local control (LC) and OS in head and neck squamous cell carcinomas [15]. The aim of the present study was to determine if FDG uptake by non-tumour-affected oesophagus (NTO) within the irradiated volume has prognostic value in oesophageal cancer.

Materials and methods

Patient characteristics

In the present study 64 consecutive patients with FDG PET/CT-staged oesophageal carcinoma were analysed retrospectively. Evaluation of the data was approved by the Institutional Ethics Committee and all patients signed written informed consent. All cases were discussed in an interdisciplinary tumour board and all patients received curatively intended neoadjuvant RCT between October 2005 and December 2014. Inclusion criteria for the analysis were: age >18 years, histologically confirmed potentially R0 resectable oesophageal carcinoma, FDG PET/CT before and at the end (last week of treatment) of RCT, no evidence of distant metastases (DM) on both PET/CT and surgery, curative treatment intention and a minimum follow up of 12 months. Only patients with successful surgical resection were included in the final analysis, which was defined as oncological resection without postoperative mortality. However, analysis of all patients undergoing surgery was also performed and yielded similar results.

Treatment and follow up

Patients were treated with normofractionated RCT with single doses of 2 Gy per fraction up to 40 Gy, and concomitant cisplatin (70 mg/m2 of body surface area) and 5-fluorouracil (5-FU, 3,000 mg/m2of body surface area) during weeks 1 and 4 of radiotherapy, or alternatively weekly carboplatin (targeted area under the curve 2) and paclitaxel (50 mg/m2 of body surface area) [1]. Radiation was prescribed to the whole elective nodal drainage and was administered as three-dimensional conformal or intensity-modulated radiotherapy. Treatment planning was performed with OTP software using the collapsed cone algorithm. The dose output of the linear accelerators was validated at least once per week with an acceptable deviation of less than ±1%. Patient position was verified by two-dimensional imaging and ExacTrac if indicated at the discretion of the treating physician. Surgery was scheduled for 6 weeks after the end of RCT.

Postoperative follow-up usually consisted of clinical examination and CT scans of the thorax and abdomen every 3 months for the first 2 years after treatment and every 6 months thereafter until completion of 5 years of follow up. Endoscopic examination was performed at least at every second follow-up visit. Additional diagnostic procedures were performed as indicated at the discretion of the treating physician. Pathological tumour regression was retrospectively evaluated from patients’ files and scored according to the method of either Mandard et al. [16] or of Becker et al. [17]. For analysis and ease of comparison Becker scores were transformed to Mandard scores (Becker 1a to Mandard 1 and Becker 1b to Mandard 2 etc.) as suggested by Thies and Langer [18]. Side effects were scored clinically according to the Common Terminology Criteria For Adverse Events (CTCAE). Usually side effects were scored during weekly visits. Table 1 provides an overview of tumour and treatment characteristics.

Table 1 Patient characteristics before treatment and chemotherapy regimens

FDG PET/CT protocol

All patients underwent a hybrid 18F-FDG PET/CT scan prior to treatment. Scans (3D PET acquisition, 3 min emission per bed position) were performed using a Biograph 16 (Siemens Medical Solutions Inc., Knoxville, TN). A second scan was performed during the last week of RCT (restaging PET). Data acquisition started 85 ± 16 min (range 59–153 min) after injection of 248–401 MBq 18F-FDG. All patients had fasted for at least 6 h prior to FDG injection. The average serum glucose concentration measured prior to injection was 6.1 mmol/ml. Tomographic images were reconstructed using attenuation-weighted OSEM reconstruction (four iterations, eight subsets, 5 mm FWHM Gaussian filter). The resulting image data had a voxel size of 4.1 × 4.1 × 5 mm.

Data analysis

To determine the tracer uptake in the NTO, a roughly cylindrical region of interest (ROI) was manually delineated. The minimum longitudinal distance to the tumour or affected lymph nodes was four slices of the PET/CT scan (20 mm). The ROI had to be within the high-dose elective treatment volume, which was verified by inspecting the treatment plan in patient files, and the minimum longitudinal length was 20 mm. The minimum volume of NTO was 5 cm3. Figure 1 shows an example NTO ROI with the corresponding PET images and treatment plan. As radiotherapy and chemotherapy can each alone cause inflammation, another ROI was placed outside the irradiated volume in the oral cavity and pharynx, comprising the lymphatic tonsils and oral mucosa and submucosa tissue (SMT). For the delineated ROIs, SUVmax and SUVmean were computed. In this retrospective study the tracer uptake times were not standardized. Therefore all SUVs were corrected for scan time to T 0 = 75 min after injection using the expression:

$$ {\mathrm{SUV}}_{tc}= SUV\times {\left(\frac{T_0}{T}\right)}^{\left(1- b\right)} $$
Fig. 1
figure 1

Example delineation of non-tumour-affected oesophagus (NTO) in two patients: left NTO contours in pink on the PET/CT fusion images; right corresponding treatment plans with isodose distributions

where T is the time at which the SUV was actually measured and b = 0.31 describes the shape and decrease in the arterial input function over time [19]. Since only the time-corrected values were investigated the index “tc” is omitted in the following discussion.

The fraction difference between SUVmax or SUVmean between the first and second scans was computed as:

$$ \varDelta SUV=100 X\frac{SUV2- SUV1}{SUV1} $$

where the indices 1 and 2 refer to the pretherapy and restaging PET scans, respectively. For comparison, we also determined the SUVmax of the primary tumour on the pretherapy and restaging PET scans and computed the fractional difference as described above. In the following discussion we refer to these quantities as SUVlesion/1, SUVlesion/2 and ΔSUVlesion. The ROIs were defined and analysed using the ROVER software, version 2.1.26 (ABX, Radeberg, Germany).

Statistical analysis

Survival was analysed with respect to the endpoints OS, LC, occurrence of DM and treatment failure (TF, defined as any locoregional recurrence or occurrence of DM) for the period from the start of radiotherapy to death and/or event. Patients who did not keep follow-up appointments and for whom information on survival or tumour status was thus unavailable were censored with the date of the last follow-up. The associations between the endpoints and clinically relevant parameters (age, grade, histology) as well as quantitative PET parameters were analysed using univariate Cox proportional hazards regression in which the PET parameters were included as binarized parameters. The clinical parameters gender, T stage and N stage were not included because the values were asymmetrically distributed (Table 1). The cut-off values used for binarization were calculated by performing univariate Cox regression for each measured value. The value leading to the hazard ratio (HR) with the highest significance was used as the cut-off value. To avoid groups being too small, only values within the interquartile range were considered as potential cut-off values. The cut-off values were separately computed for all endpoints. The probabilities of survival were computed and rendered as Kaplan-Meier curves.

The independence of PET parameters and clinically relevant parameters was analysed by multivariate Cox regression. Those parameters with at least a trend for significance in a univariate Cox regression (p < 0.1) were included. Each PET parameter was analysed separately together with the clinical parameters. The HRs and the p values of the clinical parameters were averaged over all analyses. The independence of the prognostic value of NTO SUV and tumour SUV was tested in a multivariate analysis with the clinical parameters significant in the univariate analysis as confounding factors. Goodness of fit of nested Cox regression models was compared using analysis of variance based on log likelihoods.

Correlations were tested using Spearman’s rank correlation method. Statistical significance was assumed for p values less than 0.05. Statistical analysis was performed with the R language and environment for statistical computing, version 3.1.2 [20].

Results

The 2-year, 3-year, and 5-year OS rates were 62%, 47% and 40%, respectively. These values are in line with data from current literature [1, 21]. Overall, 50% of patients died during the observation period. The rates for local tumour control, absence of TF and freedom from DM at 5 years were 75%, 42% and 53%, respectively. As expected, there were strong correlations between NTO SUVmax and SUVmean and between NTO ∆SUVmax and ∆SUVmean (Spearman’s rho 0.81 and 0.9, respectively). Spearman’s rho for the correlations between absolute NTO SUV and NTO ∆SUV ranged from 0.55 to 0.68. There were rather weak correlations between SMT SUV and NTO parameters (Spearman’s rho from 0.25 to 0.28), which indicates that both regions have the potential to provide independent prognostic information. All correlations were significant. Supplementary Fig. 1 shows the correlations between SMT and NTO SUVmean values in individual patients. SUVlesion and NTO SUV were essentially uncorrelated (p > 0.2) in most cases. The only exceptions were the correlations between ΔSUVlesion and NTO ΔSUVmax (p = 0.07) and between ΔSUVlesion and NTO ΔSUVmean (p = 0.02). However, in both cases the correlation was rather weak (rho <0.3).

Of the 64 patients, 18 (28%) showed pathological complete remission after RCT. Patients with pathological complete remission were compared to patients without pathological complete remission. Complete responders had a significantly higher NTO SUVmean, irrespective of the tumour histology (SUVmean 1.9 vs. 1.5, p = 0.002; Fig. 2b–d). Furthermore, NTO SUVmean was inversely correlated with tumour regression scores (rho −0.35, p = 0.005; Fig. 2a). No significant correlation between NTO SUV and clinical oesophagitis score was observed (data not shown).

Fig. 2
figure 2

Correlations between SUVmean of non-tumour-affected oesophagus (NTO) and pathological response parameters in patients with squamous cell carcinomas (SCC) and adenocarcinomas (ADC). a Correlation with postsurgical pathological tumour regression (1 complete regression, 5 no response). b–d NTO SUVmean in patients with complete remission (CR, group 1 in a) and without complete remission (groups 2-5 in a) in all patients (b), in patients with SCC (c) and in patients with ADC (d)

Figures 3 and 4 show the Kaplan Meier plots for the clinical outcome parameters evaluated (OS, TF, LC, DM) in patients stratified by NTO SUVmax and SUVmean. In the univariate Cox regression all normal tissue PET parameters investigated (SUVmax, SUVmean, ∆SUVmax and NTO ∆SUVmean and SMT SUVmean) were prognostic for OS, TF and DM (Tables 2 and 3; Supplementary Table 1). Univariate analysis showed significant associations between LC and NTO SUVmean (p = 0.044) and NTO ∆SUVmax (p = 0.042; Supplementary Table 2. ΔSUVlesion showed a trend for significant associations with OS, TNM and DM. SUVlesion on restaging was prognostic for TNM (P = 0.032) and showed a trend for significance for DM.

Fig. 3
figure 3

Kaplan-Meier estimates for overall survival (OS), absence of treatment failure (TF), local control (LC) and freedom from distant metastases (DM) in patients stratified by SUVmax of non-tumour-affected oesophagus (NTO)

Fig. 4
figure 4

Kaplan-Meier estimates for overall survival (OS), absence of treatment failure (TF), local control (LC) and freedom from distant metastases (DM) in patients stratified by SUVmean of non-tumour-affected oesophagus (NTO)

Table 2 Univariate and multivariate Cox regression analysis with respect to treatment failure
Table 3 Univariate and multivariate Cox regression analysis with respect to overall survival

In multivariate Cox regression only histology was an independent significant prognostic factor for OS (p = 0.03). A trend for significance was found for the NTO parameters (SUVmax p = 0.064, ∆SUVmean p = 0.064; Table 3). In the multivariate analysis with respect to TF and DM, all NTO parameters were significant prognostic factors (Table 2; Supplementary Table 1). Histology, SUVlesion/2 and ΔSUVlesion showed a trend for significance with respect to DM. SUVlesion/2 was an independent prognostic factor for TNM (P = 0.028). In the multivariate analysis with respect to LC only NTO ∆SUVmax showed a significant association (p = 0.036). However, due to the low number of events (9 of 64), the results for LC are of limited value. For SMT SUVmean no significant associations with any of the clinical endpoints were found in the multivariate analysis.

To determine whether NTO SUV provided information in addition to tumour SUV, multivariate Cox regressions including histology, NTO ΔSUVlesion and NTO SUVmax were performed. Inclusion of NTO SUVmax in the Cox regression model resulted in a significantly improved fit for OS (p = 0.028), TF (p = 0.00012) and DM (p = 0.035), whereas a nonsignificant increase in log likelihood was observed for LC (p = 0.12).

Discussion

We report here the first use of nontumour 18F-FDG PET parameters in irradiated normal tissues as prognostic markers in oesophageal carcinoma. FDG PET is known to reflect tissue inflammation and is therefore in used clinically not only in oncology but also for the staging and localization of inflammatory diseases [22, 23]. An interim FDG PET scan was recently proposed as a tool for predicting oesophageal injury in patients undergoing radiotherapy for non-small-cell lung cancer [24]. In our study inflammation in the NTO was identified as a strong prognostic factor for important oncological outcome parameters (pathological regression, TF, local recurrence). In addition it provides information in addition to clinical parameters as well as to tumour SUV.

Acute side effects of oncological treatment and their association with outcome in patients with gastrointestinal malignancies are receiving increasing attention. Pronounced side effects have been shown to be a favourable prognostic factor in patients with colorectal carcinoma undergoing chemotherapy [25, 26]. This association has also been seen in patients with rectal carcinoma treated with radiotherapy or RCT. in patients treated with RCT, it was a positive prognostic factor for pathological complete tumour remission [27, 28]. In patients with oesophageal carcinoma, one study found an association between worsening of odynophagia during treatment and higher rates of tumour regression and improved OS [14].

The underlying reason for the observed association between side effects and tumour response is not well understood. One obvious explanation is that responses of normal tissue and tumour are determined to a significant extent by genetically defined intrinsic or cellular radiation sensitivity. Thus, higher normal tissue toxicity would be associated with a better tumour response. This explanation seems reasonable as an early in vivo study suggested similar radiosensitivities in tumour and normal tissue of different strains of mice [29]. Furthermore several ex vivo studies indicated an association between ex vivo radiosensitivity of fibroblasts and lymphocytes and pronounced side effects suggesting similar individual radiosensitivities of different tissues [30, 31]. However, other large studies did not show such an association [32, 33]. Another explanation is that a proinflammatory immunological response leads to increased inflammation and tumour clearance at the same time. This is reasonable as polarization of immune cells by radiotherapy is possible [34] and CD8-positive lymphocytes have been shown to have prognostic value in head and neck cancers [35]. Besides these explanations, it is well known that side effects increase with radiation dose. However, in the present study all patients received equal fractionation schedules and total radiation doses, and the high-dose treatment volumes were evaluated.

Another possible confounding factor was the use of concomitant chemotherapy, which is known to increase side effects [36]. This is even more important, as several studies have also shown associations between chemotherapy side effects and tumour response and patient outcome [25, 37], including oral mucositis after administration of 5-fluorouracil [26]. To gain further insight into the contribution of chemotherapeutics, we additionally contoured the oral submucosa and mucosa. Increased uptake in this structure should not represent a radiation response, as it is completely outside the irradiated volume. Although increased oral FDG uptake was associated with several outcome parameters in the univariate analysis, it did not remain a significant factor nor did it show a trend in the multivariate analysis. Therefore, it is unlikely that the observed differences for NTO were only due to increased activity of concomitant chemotherapy. Furthermore, the observed correlations between NTO FDG parameters and outcome were much stronger than any correlation between tumour FDG parameters and outcome. Normal tissue FDG PET parameters seem to be of astonishing robustness, as several investigated parameters (SUVmax, SUVmax, ∆SUV) showed significant differences between prognostic groups.

Although this study was limited by its retrospective nature and lacked an independent cohort for validation, it indicates that non-tumour parameters may be relevant when analysing the prognostic impact of functional imaging. A combination of tumour and normal tissue parameters may be a very promising way to further personalize cancer treatment and identify patients with highly RCT-sensitive tumours, in whom trials on omission of surgery may be performed. This is even more important as in recent analyses the use of the full spatial and temporal FDG PET information instead of only classical PET parameters has been shown to be very promising for the identification of these patients [38,38,39,41]. However, even these features may not be sufficient to predict pathological complete responses [42]. Inclusion of tumour and nontumour parameters in current models could further increase sensitivity and specificity so that sufficient information is available to personalize treatment in clinical trials on omission of surgery and to individualize radiotherapy dose prescription [43].

Overall, our study revealed a high prognostic impact of NTO FDG PET parameters obtained during the last week of neoadjuvant RCT. This finding corroborates the prognostic importance of non-tumour parameters in PET imaging and, after validation, has the potential to be included in novel strategies for personalizing therapy.