Introduction

Thyroid incidentalomas displaying focal 18F-fluorodeoxyglucose (FDG) uptake on positron emission tomography (PET)/CT are relatively frequent, with a reported incidence of about 2.5 % [1]. The majority of these findings are benign and about one third are malignant [1, 2]. No proper standardized uptake value (SUV) cut-off able to distinguish benign from malignant lesions has been identified [1, 2]. Thus, all 18F-FDG PET thyroid incidentalomas need further clinical investigation, and, if hyperfunctioning nodules are excluded, fine-needle aspiration cytology (FNAC) is suggested [3]. Many authors have investigated the incidence of malignancy in PET incidentalomas [412], and more recently the role of 18F-FDG uptake has been evaluated in thyroid nodules with indeterminate cytology [2, 13, 14]. However, to date little can be said about the prognostic significance of 18F-FDG uptake in primary differentiated thyroid cancer (DTC) incidentally detected at the time of PET/CT for non-thyroid purposes. Indeed, no data are available in this field. Several articles support the hypothesis that 18F-FDG uptake may have prognostic value in DTC. On the one hand, 18F-FDG uptake in primary thyroid cancer is related to glucose transporter (GLUT) expression and differentiation [1517]; on the other hand, an association between 18F-FDG uptake and aggressive histological features [18], tumour size and lymph node metastases has been found [2]. Thus, the 18F-FDG avidity of primary thyroid cancer might theoretically have prognostic implications, as reported for other tumours [1921] and for DTC metastases [22].

The aim of this study was to assess the association between 18F-FDG uptake and event-free survival (EFS) in primary DTC. We also tested the positive predictive value (PPV) of 18F-FDG uptake with regard to persistence/recurrence of disease after total thyroidectomy followed by 131I remnant ablation (initial treatment). We compared the PPV of 18F-FDG uptake with that of the other initial prognostic parameters that influence the outcome of DTC patients. Finally, we also evaluated the association between 18F-FDG uptake and all prognostic variables included in this study.

Materials and methods

From a total of 884 patients with focal 18F-FDG PET thyroid incidentalomas referred to our 4 Nuclear Medicine Departments from 1 January 2006 to 31 December 2012, 219 patients underwent FNAC. We retrospectively investigated 54 patients in whom a DTC was histopathologically confirmed and clinical thyroid follow-up was available. None of the 54 patients had undergone chemotherapy during the last year. The principal indications for PET/CT study were colon/rectum cancer (n = 11), non-Hodgkin’s lymphoma (n = 9), lung cancer (n = 8), urogynaecological cancer (n = 7), breast cancer (n = 7), head and neck cancer (n = 3), melanoma (n = 1), other cancers (n = 3) and characterization of lung nodules (n = 5).

Imaging modality

In centres 1 and 2, 18F-FDG PET/CT was performed in the fasting state (at least 6 h), when the glucose level was lower than 150 mg/dl. An 18F-FDG activity of 5.5 MBq/kg was administered intravenously; 50 min after the injection, data were acquired in two-dimensional mode by means of a dedicated PET/CT system (Discovery ST, General Electric Healthcare Technologies, Milwaukee, WI, USA). The CT parameters used for acquisition were: 140 kV, 80 mA, 0.5 s per rotation and pitch 6:1, with a slice thickness of 3.25 mm equal to that of PET. PET was acquired from the upper neck to the upper thighs, by means of sequential fields of view, each covering 15 cm, over an acquisition time of 4 min.

In centre 3, patients fasted for at least 6 h before 18F-FDG injection; serum glucose levels immediately before tracer injection were below 150 mg/dl in all patients. Sixty minutes after the i.v. injection of 18F-FDG (2 MBq/kg) images were acquired in three-dimensional mode, corrected for attenuation by means of X-ray CT attenuation mapping and reconstructed by means of an iterative algorithm (ordered subset maximum likelihood expectation) on a Biograph 6 combined PET/CT scanner (Siemens Medical Solutions, Erlangen, Germany). The patients’ arms were positioned alongside the body. Five or six sequential acquisition steps were performed to scan from the upper neck to the upper thighs.

In centre 4, 18F-FDG PET/CT was performed in the fasting state (at least 6 h), when the glucose level was lower than 150 mg/dl. An 18F-FDG activity of 3 MBq/kg was administered intravenously; 50 min after the injection, data were acquired in three-dimensional mode by using a dedicated PET/CT system (Discovery ST, General Electric Healthcare Technologies, Milwaukee, WI, USA). The CT parameters used for acquisition were: 140 kV, 80 mA, 0.5 s per rotation and pitch 6:1, with a slice thickness of 3.25 mm equal to that of PET. PET was acquired from the upper neck to the upper thighs, by means of sequential fields of view, each covering 12 cm, over an acquisition time of 3 min.

Imaging analysis

In each Nuclear Medicine Department the PET images were analysed visually and semi-quantitatively by measuring the SUVmax. However, no specific SUVmax cut-off value was considered in order to discriminate focal incidental uptake or physiological activity; for this purpose, only the deviation from the activity of the normal tissue or blood pool was used.

Nevertheless, the SUVmax of the thyroid nodules and the SUVmean of the liver were recorded for each scan in each department using a circular region of interest (ROI) of 8-mm diameter. The ratio between thyroid cancer SUVmax and normal liver SUVmean (SUV ratio) was recorded for each scan. Considering that PET/CT scans were acquired in four different centres, we used the SUV ratio in order to properly semi-quantify the 18F-FDG uptake since we had previously found a high correlation (r 2 = 0.75) between SUVmax and SUV ratio. SUV ratio should be less affected by the intrinsic characteristics of each tomograph and by each PET acquisition protocol [23]. The final diagnosis of DTC was confirmed by histological examination after surgery.

Treatment and follow-up

In accordance with our protocol, all patients underwent: (1) total thyroidectomy, (2) 131I remnant ablation (radioactive iodine, RAI) with an activity ranging from 1,850 to 3,700 MBq 4–6 weeks after surgery [thyroid stimulating hormone (TSH) >30 μIU/ml] and (3) laevothyroxine (LT4) suppressive therapy (serum TSH levels <0.2 μIU/ml). All patients underwent currently accepted follow-up protocols after RAI, in accordance with Pacini et al. [24]. Patients with locoregional recurrence/metastases underwent surgery whenever possible; patients requiring further 131I treatment for distant metastases received fixed 131I doses, as recommended by the “European consensus for the management of patients with differentiated thyroid carcinoma of the follicular epithelium” [24].

Each patient was risk-stratified by means of the American Joint Cancer Committee/Union Internationale Contre le Cancer (AJCC/UICC) staging system [25] and histological subtype (virulent, non-virulent) [26, 27]. All clinical data obtained over a median of 39 months’ follow-up were used to assess the response to initial therapy and the outcome of each patient. We broke down our population into three different groups according to outcome: (A) complete remission after initial therapy (surgery + RAI), (B) persistence of disease after initial therapy and complete remission after further adequate treatment (surgery and/or 131I administration) and (C) persistence of disease after initial therapy and persistence/progression of disease after further adequate treatment (surgery and/or 131I administration).

Patients were deemed to be in complete remission at the final follow-up examination if they had an undetectable suppressive thyroglobulin(Tg) level (Tg < 0.1 μg/l), negative neck ultrasonography (US) and TSH-stimulated Tg levels <2 μg/l [27]. Patients were considered to have persistence of disease if they displayed detectable suppressive Tg levels and evidence of disease confirmed by morphological or functional imaging. Patients were considered to have stable disease at the last follow-up examination if their suppressive Tg levels (≥10 μg/l) had remained stable over the last year (<20 % increase vs previous value) and evidence of disease was seen on morphological (RECIST for CT or MRI) or functional imaging [European Organization for Research and Treatment of Cancer (EORTC) criteria for PET] [28], whether or not confirmed by cytology or histopathology. Finally, patients were considered to have progressive disease at the final follow-up examination if they displayed a higher increase (>20 % increase vs previous value) in Tg levels (≥10 μg/l on suppressive LT4 therapy) and/or evidence of disease on morphological (RECIST for CT or MRI) or functional imaging (EORTC criteria for PET), whether or not confirmed by cytology or histopathology.

Statistical analysis

Descriptive statistics included mean, standard deviation, median, percentiles, minimum and maximum of continuous factors and scores; in the case of categorical factors, number and percentage distributions were used. The Pearson chi-square and Kruskal-Wallis rank tests were used to compare categorical and continuous factors among the groups of response to initial therapy (A, B and C, as previously described). The Wilcoxon rank sum test was used to compare median SUV ratio levels between binary categories of European Thyroid Association (ETA) risk [24], tumour size, M, N and histological subtype.

PPV and negative predictive value (NPV) were used for descriptive purposes. Kaplan-Meier estimates of the cumulative probability of EFS, defined as the time from initial therapy to the onset of persistent/progressive disease, were obtained for all factors considered, including age, sex, tumour size (T), nodal status (N), distant metastases (M), histopathology, histological subtypes, divided into virulent (VS) and non-virulent subtypes (NVS), ETA initial risk classification (high/low risk) [24] and SUV ratio.

Univariate odds ratios (OR) and 95 % confidence intervals (CI) were the main measures of effect that we adopted to quantify the association between main characteristics of subjects and the persistence/progression of disease. Multivariate Cox regression analysis was adopted to assess the independent association between disease persistence/progression (EFS) and all factors under investigation.

SUV ratio was primarily tested as a continuous variable in the model (data not shown), then as a binary variable considering the median (3.0) value of the SUV ratio as a cut-off. To make the results more readable, we mainly expressed the results on the basis of this cut-off. Since SUVmax and SUV ratio were highly correlated, in order to avoid collinearity, we assessed SUV ratio and SUVmax separately in two multivariate Cox models, and showed the results of the model in terms of SUV ratio. However, due to the high correlation between SUVmax and SUV ratio, the two multivariate Cox models showed similar results.

All analyses were conducted by means of Stata (version 13, StataCorp, College Station, TX, USA) software. Two-tailed probabilities were reported and the p value of 0.05 was used to define nominal statistical significance.

Results

We retrospectively analyzed 54 DTC patients. The main characteristics of the patients are summarized in Table 1. Of 54 patients (group A), 39 achieved complete remission after initial treatment. Over a median follow-up of 39 months (range 3–78 months), we identified 15 patients with persistence of disease after initial treatment. Of these 15 patients (group B), 6 achieved complete remission after further adequate treatment, while the other 9 (group C) showed persistence of disease or disease progression despite further proper treatment. Of these nine patients, three, who were affected by poorly differentiated thyroid cancer, died of disease within 6 months after RAI. No difference in terms of follow-up length was found between these three groups (Table 1).

Table 1 Main patient characteristics by groups of response to initial therapy (n = 54)

SUV ratio levels were significantly higher in groups B and C than in group A (Table 1). Specifically, 13 of 27 patients (48 %) with SUV ratio ≥3.0 (the overall median of the SUV ratio) belonged to groups B and C (Fig. 1). By contrast, only two patients (7 %) affected by DTC primary tumour with SUV ratio <3.0 developed persistence/progression of disease.

Fig. 1
figure 1

Large part of patients (48 %) affected by DTC primary tumour with SUV ratio ≥3.0 developed persistence/progression of disease after initial treatment (belonging to groups B and C). By contrast, only two patients (7 %) affected by DTC primary tumour with SUV ratio <3.0 developed persistence/progression of disease

In patients who achieved complete remission after initial treatment (group A), the median of the SUV ratio (2.4) was significantly lower (p < 0.001) than in patients with persistence or progression of disease (6.2). An SUV ratio ≥3.0 was observed in 69 % of high-risk patients (Fig. 2). Moreover, as shown in Fig. 2, high 18F-FDG uptake was found in the majority of DTC patients with T3–T4 primary tumour (70 %), locoregional involvement (67 %), distant metastases (75 %) and virulent histological subtypes (64 %).

Fig. 2
figure 2

SUV ratio ≥3.0 was observed in 69 % of ETA high risk patients. High 18F-FDG uptake was found in the majority of DTC patients with T3–T4 primary tumour (70 %), locoregional involvement (67 %), distant metastases (75 %) and virulent histological subtypes (64 %)

The median values of the SUV ratio in ETA low-risk patients (2.2), in T1–T2 (2.2), in M0 (2.7) and in non-virulent subtypes (2.7) were significantly lower (p < 0.03) than in ETA high-risk patients (5.1), T3–T4 (5.0), M1 (7.3) and virulent subtypes (6.0).

Among the variables assessed, the presence of distant metastases (M1) displayed the highest PPV (100 %) for persistence of disease (Table 2). The 18F-FDG uptake, as expressed by SUV ratio ≥3.0, showed a relatively low PPV (48 %). On the other hand, a low 18F-FDG uptake (SUV ratio <3.0) displayed a high NPV (93 %).

Table 2 PPV and NPV of the variables studied with regard to persistence/recurrence of disease after initial treatment

In order to estimate the association between disease persistence (B and C type of response during follow-up) and all the main variables considered, we divided patients into two groups according to their response to initial treatment (surgery+RAI): patients with complete response (group A, “responders”) and patients with persistence of disease (groups B+C, “non-responders”). On univariate analysis (Table 3), SUV ratio, tumour size, nodal status, distant metastases and histological subtype were all significantly associated with disease persistence. In particular, 18F-FDG uptake was high in non-responders; indeed, we found that patients with an SUV ratio ≥3.0 had a risk of disease persistence/progression about 12-fold higher than patients with an SUV ratio <3.0 (OR = 11.6, 95 % CI 1.8–73).

Table 3 Univariate associations between main subject characteristics and persistence/progression of disease

Figures 3 and 4 show Kaplan-Meier EFS curves for all of the main factors analysed in our study. However, when we adjusted risk estimates by using a multivariate Cox model, only T remained independently associated (p = 0.05) with persistence/progression of disease (Table 4). Nonetheless, a trend towards a higher risk of disease persistence/progression emerged for patients with an SUV ratio ≥3, or with a virulent histological subtype or with positive lymph nodes (N1) or with metastases (M1), even though statistical significance was not reached, probably because of the low number of events.

Fig. 3
figure 3

Kaplan-Meier EFS curves relative to 18F-FDG uptake, expressed by SUV ratio, and histological subtype [non-virulent subtype (NVS) and virulent subtype (VS)]

Fig. 4
figure 4

Kaplan-Meier EFS curves relative to tumour size (T), lymph node involvement (N) and distant metastases (M)

Table 4 Multivariate Cox proportional hazard model of EFS (n = 54)

Similar results (data not shown) were found using a different multivariate Cox model including SUVmax instead of SUV ratio.

Discussion

DTC localizations are often characterized by low 18F-FDG uptake, especially in young patients affected by well-differentiated thyroid cancer subtypes [29, 30]. Neck US and diagnostic 131I whole-body scan (DxWBS) are the diagnostic techniques of choice when persistence/recurrence of disease is suspected in the presence of increased levels of suppressive/stimulated Tg after initial treatment [31]. 18F-FDG PET and PET/CT are mainly used in the event of suspected DTC dedifferentiation associated with the inability of iodine uptake and increased gene expression of the GLUT type 1 (GLUT1) [1, 15]. In these cases, 18F-FDG PET/CT may be helpful in detecting disease recurrence despite negative DxWBS and negative neck US findings. Moreover, as demonstrated by Robbins and colleagues [22], the detection of 18F-FDG-avid metastases has important diagnostic and prognostic implications for DTC management [3234] and is associated with an increased risk of mortality [22]. In this scenario, it is not clear why some primary DTC incidentally detected by PET/CT are characterized by high 18F-FDG uptake. Several reports have shown that 18F-FDG uptake by primary DTC is associated with the more aggressive histological subtypes [18]. Thus, it has been hypothesized that 18F-FDG uptake may be considered a prognostic factor in primary DTC. However, to date, no data have shown a direct association between 18F-FDG uptake and DTC outcome. Firstly, we evaluated the influence of histological parameters on the SUV ratio and confirmed that T and histological subtype are related to intense 18F-FDG uptake. In addition, we reported that the primary DTC of the patients with distant metastases may more often show intense 18F-FDG uptake.

Moreover, we assessed the prognostic implication of 18F-FDG uptake in primary DTC. We studied 54 consecutive DTC patients among 884 patients with focal 18F-FDG PET/CT thyroid incidentalomas and we found 29 patients at high risk (53 %) according to the ETA risk classification [24]. Specifically, 8 of 54 patients (15 %) presented distant metastases and 6 (11 %) showed lymph node involvement at the time of first diagnosis. Moreover, 15 of the 54 (28 %) showed persistence/progression of disease after initial treatment and 3 patients, affected by poorly differentiated cancer, died of disease. Thus, we probably identified a subgroup of DTC patients who are more likely destined to relapse than the general DTC population. When we identified the SUV ratio 3.0 as a cut-off point (3.0 is the median SUV ratio), we found that patients with higher 18F-FDG uptake had a higher risk of persistence/progression of disease than patients with lower uptake. Similarly, all other risk factors included in our study (histological subtype, T, N and M) were correlated with persistence/progression of disease.

Although we were aware that the low number of events would have yielded a low statistical power on multivariate analysis, we nevertheless decided to run a Cox proportional hazard model to check for the independence of the associations found between EFS and the main factors considered in the univariate analysis. We found that, after adjusting for all other factors considered, tumour size was the only independent factor which remained associated with persistence/progression of disease (p = 0.05). Although the significant association we found on univariate analysis between high 18F-FDG uptake (SUV ratio ≥ 3) and persistence/progression of disease was not confirmed in the multivariate model, the risk estimate (HR = 1.75) revealed a trend towards a direct association between SUV ratio ≥3 and disease persistence/progression even after adjusting for all other factors. We must also consider the close relationship/correlation between variables such as tumour size/histological subtype and 18F-FDG uptake [18], which, from a statistical point of view, leads to a risk of multicollinearity. Thus, the prognostic value of 18F-FDG uptake might have been underestimated. Further prospective studies including a larger number of DTC patients may better clarify this important issue.

Our findings may open a door to some considerations regarding the real prognostic value of 18F-FDG uptake in primary DTC. In other words, we can affirm that intense 18F-FDG uptake by the primary DTC may be helpful in identifying a subgroup of patients characterized by high ETA risk. Moreover, high 18F-FDG uptake is associated with persistence/progression of disease after initial treatment. Nevertheless, when all other prognostic factors are taken into account, 18F-FDG uptake does not add further prognostic information.

When we considered the prognostic implication of other risk factors, such as distant metastases, we did not find a significant association between M1 and persistence/progression of disease on multivariate analysis. This finding is probably related to the low number of patients with distant metastases and to the fact that all M1 patients presented T3 and T4 primary tumours. In other words, in the multivariate Cox model, the close relationship between these two parameters underestimated the real prognostic value of M. However, when we analysed the PPV of all risk variables included in our study, we found that the highest PPV was displayed by M1 (100 %). By contrast, the PPV of SUV ratio ≥ 3 was low (48 %), as was that displayed by N1 (66 %), T3–T4 (50 %) and VS (57 %).

The present study has some limitations, in particular: (1) the retrospective evaluation of data and (2) the period of follow-up considered. Studies with a longer follow-up are probably needed in order to properly assess the risk of recurrence. However, our median 39-month follow-up seems to be a reasonable period in which to assess clinical outcome, particularly the risk of disease persistence. Another limitation lies in the relatively low number of patients and events considered in the study and in the fact that only DTC patients in whom clinical follow-up was available were included in the study. However, to our knowledge, no other papers have investigated the outcome of more than 50 cases of DTC discovered among patients with 18F-FDG-PET focal thyroid incidentalomas.

We investigated only 54 patients from a total 219 patients who underwent FNAC. For all of these 54 patients, a DTC histopathological confirmation and a clinical follow-up were available. The other 165 patients showed negative cytological findings and for them neither histopathological confirmation nor clinical follow-up were available. This limitation reduces the possibility to find other unexpected DTC in these patients and to assess the association between 18F-FDG uptake and persistence/recurrence of disease after initial treatment.

Because this was a multicentre study, different PET/CT scanners acquiring in two- or three-dimensional mode had been used. This limitation might have influenced the correct SUV assessment. To overcome this drawback, we introduced SUV ratio, which is less affected by the intrinsic characteristics of each tomograph and by each PET acquisition protocol [23].

Finally, we included patients affected by primary DTC, all of whom were characterized by high 18F-FDG uptake on PET/CT. From a theoretical point of view, it would be more appropriate to also include patients affected by primary DTC without 18F-FDG avidity. However, the selection of this type of patient is very difficult considering the relatively low value of 18F-FDG PET/CT at the time of DTC diagnosis and staging. Moreover, we broke down our patients into low 18F-FDG-avid primary DTC (SUV ratio < 3) and high 18F-FDG-avid primary DTC according to the intensity of tracer uptake. This approach might be sufficient to assess 18F-FDG uptake as a prognostic variable.

Conclusion

The intense uptake of 18F-FDG by primary DTC is influenced by tumour size and histopathology subtype. While a high SUV ratio was associated with persistence/progression of disease after initial treatment, it displayed low PPV. When all other prognostic factors were taken into account, 18F-FDG uptake did not add further prognostic information.