Introduction

In patients with non-small-cell lung cancer (NSCLC), differentiating benign from metastatic mediastinal lymph nodes is decisive in determining further therapeutic strategies. Therefore, the importance of preoperative staging of these nodes cannot be over-emphasized [1, 2]. Surgery remains the standard treatment in patients with medically operable clinical stage I or II NSCLC when either there is no evidence of lymph node metastases or there are only hilar or intrapulmonary metastases. In the case of N2 lymph node metastasis, definitive concurrent chemoradiation therapy or induction chemotherapy with or without radiation therapy is recommended [25]. To evaluate the accuracy of preoperative staging in NSCLC, a widely and routinely used noninvasive diagnostic method is contrast-enhanced computed tomography (CT). However, numerous studies have now shown that this method of lymph node staging has limited reliability when compared with the evaluation of tumor size and the invasion of adjacent structures [68]. Currently, the combination of 18F-fluoro-2-deoxyglucose positron emission tomography and CT imaging (18F-FDG PET/CT) is widely used as a noninvasive diagnostic method. Unlike CT alone, PET/CT provides functional metabolic information along with anatomic information. The standardized uptake value (SUV) is a relative measure of FDG uptake. Recent studies have demonstrated that the diagnostic performance of 18F-FDG PET/CT is superior to that of contrast-enhanced CT in the preoperative staging of mediastinal lymph nodes [3, 911].

Although 18F-FDG PET/CT is one of the most accurate noninvasive diagnostic methods for mediastinal lymph node staging, it is limited in its ability to differentiate between benign and metastatic lymph nodes. Small lymph nodes, micrometastases or an adjacent primary mass could be the reason for false-negative results, whereas inflammation or granulation tissue could be the reason for false-positive results [12, 13]. Therefore, precise diagnostic criteria are needed for successful mediastinal lymph node staging in NSCLC using 18F-FDG PET/CT. To overcome these limitations, several kinds of semiquantitative and quantitative methods, in addition to visual assessment, have been proposed, including the maximum SUV (SUVmax) threshold, metabolic heterogeneity analysis, lymph node SUVmax (SUV-LN) to blood pool SUVmax ratio, and the SUV-LN to primary tumor SUVmax (SUV-T) ratio (SUV-R) [1216]. Although SUV-R can easily be determined during the routine measurement of SUVmax, its diagnostic performance in mediastinal lymph node staging is controversial [13, 14, 16], possibly owing to differences in the number of patients studied, patient characteristics, scanner hardware, and approaches to reconstruction. To investigate the diagnostic performance of SUV-R more accurately, we grouped patients according to their SUV-T and compared SUV-R and SUV-LN in each group.

This study aimed to determine the value of SUV-R for mediastinal lymph node staging in NSCLC in comparison with that of SUV-LN. In addition, we determined the optimal thresholds of each of these parameters for such staging.

Materials and Methods

Patients

In this retrospective study, we reviewed the records of 98 patients with NSCLC (adenocarcinoma, squamous cell carcinoma, and adenosquamous carcinoma) who underwent preoperative 18F-FDG PET/CT for staging and subsequent pathologic studies of mediastinal lymph nodes at our institution between May 2014 and October 2015. The study group comprised 65 men and 33 women with a mean age of 68.1 ± 9.1 years. Pathologic results from a total of 289 mediastinal lymph node stations were obtained by lymphadenectomy and/or endobronchial ultrasound-guided transbronchial needle aspiration (EBUS-TBNA) within 2 months of the 18F-FDG PET/CT scan. Of the 98 patients, 34 underwent lymphadenectomy only, 51 underwent EBUS-TBNA only, and 13 underwent EBUS-TBNA followed by lymphadenectomy. Patients with a history of lung cancer or who had undergone neoadjuvant chemotherapy or radiotherapy for NSCLC were excluded from the analysis. The study protocol for this retrospective study was reviewed and approved by the institutional review board at our institution.

18F-FDG PET/CT Imaging

18F-FDG PET/CT images were obtained using a Gemini TF PET/CT scanner (Philips Medical Systems, Cleveland, OH), with an axial field of view of 18 cm and a spatial resolution of 4.4 mm. All patients had fasted for at least 6 h prior to the scanning. 18F-FDG PET/CT was performed 60 min after intravenous injection of 370 to 555 MBq of 18F-FDG, depending on body weight. The PET scan was obtained from the skull base to the thigh for 1 min per bed position, and was followed by a low-dose CT scan (50 mA, 120 kV, 512 × 512 matrix) for attenuation correction. Maximum intensity projection, cross-sectional views, and fusion images were reviewed.

Image Analysis

For semiquantitative analysis, all images were reviewed on a dedicated workstation (Extended Brilliance Workspace 4.0, with a PET/CT viewer for automated image registration; Philips, Amsterdam, The Netherlands). Lymph node stations were assigned according to the Lung Cancer Staging manual of the American Joint Committee on Cancer (7th edition). We measured SUV-T and SUV-LN within a designated region of interest. SUVmax was the highest SUV of the pixels within the region of interest. If there was more than one mass in the lung, the mass with the highest SUVmax was considered the primary lung tumor. We also assumed that the lymph node with the highest SUVmax had the highest probability of malignancy, and we measured these nodes for each nodal station. Each nodal station was classified as either positive or negative based on the pathologic findings, and the SUV-R was determined in each case, as follows:

$$ \mathrm{S}\mathrm{U}\mathrm{V}-\mathrm{R}=\frac{\mathrm{SUV}-\mathrm{L}\mathrm{N}\left(\mathrm{SUVmax}\ \mathrm{of}\ \mathrm{mediastinal}\ \mathrm{lymph}\ \mathrm{node}\ \mathrm{station}\right)}{\mathrm{SUV}-\mathrm{T}\ \left(\mathrm{SUVmax}\ \mathrm{of}\ \mathrm{primary}\ \mathrm{lung}\ \mathrm{tumor}\right)} $$

Data Analysis

We classified the primary lung tumors into low, medium and high SUV-T groups because we assumed SUV-LN and SUV-R would vary according with SUV-T. In particular, we assumed that a higher SUV-T might have a lower SUV-R, and a lower SUV-T might have a higher SUV-R. SUV-T were divided into three groups based on their tertile distribution. Nodal stations with an SUV-T of ≤5, >5 – 8 and >8 were allocated to the low, medium and high SUV-T groups, respectively. From 13 patients, 17 nodal stations were evaluated by both lymphadenectomy and EBUS-TBNA. Of these nodal stations, the results for two were discordant, and in these cases the pathologic result on lymphadenectomy was selected for analysis.

Statistical Analysis

Statistical analyses were performed using MedCalc software version 12.3.0 (MedCalc, Mariakerke, Belgium). Differences were considered statistically significant when P values were less than 0.05. The significance of differences between benign and malignant lymph nodes was tested using Student’s t test for SUV-LN and SUV-R. One-way analysis of variance was used to compare SUV-LN and SUV-R in each of the three groups. Receiver operating characteristic (ROC) curve analysis was used to compare the diagnostic performance of the two variables SUV-LN and SUV-R, and to determine their optimal cut-off values for maximum sensitivity and specificity in detecting metastatic lymph nodes.

Results

Clinical Data

Of the 98 patients with NSCLC, metastatic involvement of mediastinal lymph nodes was detected in 49. Of 298 lymph node stations pathologically confirmed by EBUS-TBNA or lymphadenectomy, 213 were benign and 76 were malignant. The characteristics of the patients and lymph node stations are shown in Tables 1 and 2.

Table 1 Characteristics of patients
Table 2 Characterization of mediastinal lymph node stations by confirmed methods

18F-FDG Uptake on PET/CT

On preoperative 18F-FDG PET/CT, the mean SUVmax of malignant lymph nodes was significantly higher than that of benign lymph nodes (5.12 ± 2.70 vs. 1.92 ± 0.73, P < 0.001). The average SUV-R in patients with malignant lymph nodes was significantly higher than in those with benign lymph nodes (0.79 ± 0.45 vs. 0.36 ± 0.23, P < 0.001; Table 3). The mean SUV-T of squamous cell carcinoma was significantly higher than that of adenocarcinoma (8.89 ± 3.79 vs. 6.68 ± 4.68, P < 0.001). There was no significant difference in mean SUV-LN between patients with adenocarcinoma (5.11 ± 2.85) and those with squamous cell carcinoma (5.13 ± 2.44) or in mean SUV-R between patients with adenocarcinoma (0.82 ± 0.48) and those with squamous cell carcinoma (0.72 ± 0.39; Table 4).

Table 3 Characterization of lymph node stations on 18F-FDG PET/CT
Table 4 Metabolic characterization of adenocarcinoma and squamous cell carcinoma

The mean SUV-T was 7.63 ± 4.43. Of the 289 lymph node stations, 102 were in the low SUV-T group, 101 were in the medium SUV-T group, and 86 were in the high SUV-T group. There were significant differences in SUV-LN and SUV-R between benign and metastatic lymph nodes in all three groups (P < 0.001) (Table 5). SUV-LN of metastatic lymph nodes differed significantly among the three SUV-T groups, but SUV-LN of benign lymph nodes did not differ significantly. There were significant differences in SUV-R among the three SUV-T groups for both benign and metastatic lymph nodes (Fig. 1).

Table 5 SUV-LN and SUV-R for the three SUV-T groups
Fig. 1
figure 1

18F-FDG PET/CT images in an example patient with NSCLC. Maximum intensity projection (MIP) image (a) and axial fused image (b) show the FDG-avid lesion in lung right lower lobe (arrowhead) with SUV-T 2.8, which was adenocarcinoma. Mild FDG uptake is seen in the left upper paratracheal lymph node (arrows a c) with SUV-LN 2.4 and SUV-R 0.86, which was confirmed as metastatic. This shows the ability of SUV-R to differentiate between benign and malignant lymph nodes in patients with low SUV-T, while SUV-LN was false-negative. The FDG-avid subcarinal and right hilar lymph nodes (open arrows a, d) with SUV-LN 5.6 and SUV-R 2.0 were also confirmed as metastatic

ROC Curve Comparison

The diagnostic performances of SUV-LN and of SUV-R for each group were compared by ROC curve analysis. In the low SUV-T group, the areas under the curve (AUC) for SUV-R and SUV-LN were 0.885 and 0.810, respectively (P = 0.019). The AUCs for SUV-LN and SUV-R did not differ significantly in the medium SUV-T group (0.924 and 0.918, respectively; P = 0.31) or in the high SUV-T group (0.929 and 0.938, respectively; P = 0.47). The AUCs for SUV-LN and SUV-R did not differ significantly for all nodal stations without grouping (0.866 and 0.848, respectively; P = 0.19; Fig. 2). The optimal SUV-LN and SUV-R cut-off values were determined for each group. The optimal SUV-R cut-off value for the low SUV-T group was 0.71 (Table 6).

Fig. 2
figure 2

ROC curve analysis comparing the diagnostic performance of SUV-R and SUV-LN. a In the low SUV-T group, the AUC of SUV-R was significantly higher than that of SUV-LN (0.885 vs. 0.810, respectively; P = 0.019). b In all nodal stations without grouping, the AUCs of SUV-LN and SUV-R were not significantly different (0.866 vs. 0.848, respectively; P = 0.190)

Table 6 Optimal SUV-LN and SUV-R cut-off values

Discussion

The results of this study indicate that SUV-LN and SUV-R, as determined on 18F-FDG PET/CT imaging, have good diagnostic performance for evaluating metastatic and benign lymph nodes. In the low SUV-T group, SUV-R had better diagnostic performance than SUV-LN. SUVmax is widely used as a semiquantitative diagnostic indicator of lymph node metastasis in lung cancer. For example, both Hellwig et al. [12] and Kumar et al. [17] found that SUVmax 2.5 was the optimal cut-off value for distinguishing metastatic from benign lymph nodes. In our study, despite having classified the SUV-R into three groups, we found generally similar cut-off values to those found previously.

We also found that the SUV-R of malignant lymph nodes was significantly higher than that of benign lymph nodes. Consistent with our findings, Cerfolio et al. [14] found that the SUV-LN/SUV-T ratio in patients with NSCLC predicts mediastinal nodal pathology, and the ratio with the maximum sensitivity is 0.56 or greater (sensitivity 94 %, specificity 72 %). In agreement with that study, Cirak et al. [16] found that as SUV-R increases in patients with NSCLC, the possibility of detecting malignant lymph nodes may increase. In contrast to these results, Lee et al. [13] found that the difference in SUV-R between metastatic and benign lymph nodes is not significant, while the lymph node SUV to blood pool SUV ratio was significantly higher for malignant lymph nodes. Although the reasons for these different results are not clear, it is likely that they are related to the difference in the methods used to obtain the lymph nodes. Lee et al. obtained lymph nodes only by lymphadenectomy during surgery in patients in whom the clinical stage of the disease was relatively low and operability was sufficiently high, while Cerfolio et al. and Cirak et al. obtained biopsy specimens by mediastinoscopy for diagnostic purposes.

We classified lymph node stations into three groups assuming that the discriminating ability of SUV-LN and SUV-R vary depending on the SUV-T. Indeed, SUV-R had better diagnostic performance than SUV-LN in the low SUV-T group, which might have clinical implications. This group had a lower SUV-LN, which possibly increased the likelihood of a false-negative result, and in this situation the SUV-R value may be of help in making a clinical diagnosis. Another prospective advantage of using SUV-R, even though SUV measurements have been shown to vary by 15 – 20 % between centers because of variations in scanner hardware and reconstruction approaches [14, 1822], is that it is relatively less vulnerable to such variations [14].

Size (short-axis), maximum Hounsfield units, coefficient of variation, and LN/blood pool SUV ratio were evaluated in the previous studies as criteria for diagnosing mediastinal lymph node metastasis. The generally accepted criterion for a normal sized mediastinal lymph node on CT is ≤1 cm [23]. The optimal cut-off values for maximum Hounsfield units on unenhanced CT, for the coefficient of variation for assessing the heterogeneity of FDG uptake, and for the LN/blood pool SUV ratio have been suggested to be 136, 0.20 [15], and 1.4 [13], respectively. Concerning other cancers, several studies have evaluated SUV-R in invasive breast cancer. The axillary lymph node to breast tumor ratio is a good diagnostic parameter [24] and is more predictive than visual analysis or the SUV of axillary lymph nodes [25]. In our study, adenocarcinoma had a lower SUV-T than squamous cell carcinoma. Consistent with this finding, previous studies have shown lower metabolism of adenocarcinoma. These findings are thought to be a result of the higher cellular proliferation index of squamous cell carcinoma [13, 26]. However, there were no significant differences in SUV-LN or SUV-R between adenocarcinoma and squamous cell carcinoma in this study.

Although mediastinoscopy has been considered the gold standard for staging mediastinal lymph nodes, accumulating evidence indicates that EBUS-TBNA and mediastinoscopy achieve compatible results for this purpose [2731]. EBUS-TBNA might be preferable for histologic sampling of paratracheal and subcarinal mediastinal adenopathy because the diagnostic yield can surpass that of mediastinoscopy [29]. Recent studies have demonstrated that the sensitivity of EBUS-TBNA is about 90 % [3, 29, 32, 33]. Lymphadenectomy through surgery can confirm surgically resectable lymph nodes only at an operable stage. Therefore, in the present study, we used EBUS-TBNA in addition to lymphadenectomy through surgery to include nonresectable and nonoperable cases. In this way, we were able to minimize selection bias without prominently distinguishing the clinical stage.

Nevertheless, our study had some limitations. It was retrospective and may have shown a selection bias. Although EBUS-TBNA has good diagnostic performance, it can still result in false-negative findings. Also, it is possible that some FDG-avid lymph nodes could not be reached by either EBUS-TBNA or lymphadenectomy. Further studies are needed to confirm these results, with more data from a larger number of patients and from different centers.

Conclusions

18F-FDG PET/CT is a noninvasive and reliable diagnostic imaging modality, and SUV-LN in patients with NSCLC is known to be a helpful diagnostic parameter. We chose SUV-R to further determine its diagnostic accuracy in comparison with simply using SUV-LN and found that the SUV-R has good diagnostic performance for distinguishing metastatic from benign lymph nodes. In particular, SUV-R had better diagnostic performance than SUV-LN in the low SUV-T group with an optimal cut-off value of 0.71.