Introduction

While CT is still the gold-standard for differentiating between benign and malignant pulmonary nodules, it sometimes lacks sufficient objectivity to achieve satisfactory results. To cover these shortcomings, positron emission tomography (PET) using 18F-fluorodeoxyglucose (FDG) has been used to distinguish between benign and malignant ones. However, PET has also been known to sometimes provide false-negative results for well-differentiated pulmonary adenocarcinoma [16] and false-positive results for inflammatory nodules [3, 7].

On the other hand, recent advances in magnetic resonance (MR) gradient technology have led to the introduction of diffusion-weighted MR imaging (DWI), which provides excellent tissue contrast based on differences in the diffusion of water molecules among tissues and this modality is entirely different from ordinary T1- and T2-weighted MR images. Owing to the fact that the diffusion of water molecules is disturbed by intracellular organelles and macromolecules, any architectural changes in the proportion of extracellular to intracellular water molecules alter the values of the apparent diffusion coefficient (ADC) [8, 9]. Because malignant tumors have an increased cellularity, larger nuclei with more abundant macromolecular proteins, larger nuclear to cytoplasmic ratio, and less extracellular space in comparison to normal tissue, the values of ADC on DWI tend to decrease in malignant tumors, which thus allows for the imaging of malignant tumors by DWI [10, 11]. Until recently, the usefulness of DWI has been reported in the imaging of various kinds of malignant tumors, including lung cancer [1221].

While several authors have compared the images between DWI and PET in a small number of lung tumors [2224], there have so far been few reports comparing the sensitivity and specificity of DWI and FDG-PET for diagnosing pulmonary malignancy. Mori et al. [18] reported DWI to have a similar sensitivity and a higher specificity compared to FDG-PET for distinguishing between malignant and benign pulmonary nodules; however, they examined lesions including those larger than 3 cm in size, which could thus be diagnosed easily without using DWI or PET.

While the quantitative analysis on DWI for differentiating between malignant and benign lesions has been made by measuring the values of the apparent diffusion coefficient (ADC) [12, 13], a recent study reported that the signal intensity (SI) of the lesion-to-spinal cord ratio showed a significantly higher sensitivity and specificity than ADC for differentiating between malignant and benign pulmonary nodules [24].

In the present study, to examine the usefulness of DWI in discriminating between non-small cell lung cancer (NSCLC) and benign pulmonary nodules measuring from 1 to 3 cm in size, we compared the sensitivity and specificity between DWI and FDG-PET, using SI of the lesion-to-spinal cord ratio (LSR) on DWI. In addition to examining the usefulness of DWI for predicting tumor aggressiveness, we also compared the SI on DWI between tumors with histological invasiveness and lymph node metastasis and those without.

Materials and methods

Eligibility

The study protocol for examining FDG-PET and DWI in patients with NSCLC prior to surgery was approved by the Ethics Committee of Kameda Medical Center. Informed consent was obtained from all patients after receiving a thorough explanation from their surgeons.

Patients

Between December 2012 and March 2014, 105 patients with pulmonary nodules prospectively underwent FDG-PET and DWI within a 4-week interval. The exclusion criteria for the study are as follows: (1) tumors with a ground-glass opacity (GGO) appearance, i.e. size of consolidation/tumor ratio less than 0.5; (2) tumors measuring less than 1 cm in size; and (3) tumors that were larger than 3 cm in size, of which the number of patients was 6, 4, and 18, respectively. As a result, 77 patients with 87 nodules measuring between 1 and 3 cm in size were selected (Table 1). The nodules with a ground-glass opacity (GGO) appearance were excluded because they contained air, which makes the ADC value near 0. Of the 87 nodules, 66 were NSCLCs which were diagnosed histologically from surgical specimens. Of the 66 NSCLCs, 25 had lymphatic, vascular or pleural invasion and 12 had lymph node metastases. The 21 benign nodules were diagnosed as follows: 18 of them were diagnosed as old inflammatory nodules by their unchanged appearances for more than 2 years upon a review of retrospective chest X-ray films or CT, while the remaining three were diagnosed as active or old inflammation based on the pathological findings obtained from surgical specimens. The mean sizes of NSCLCs and benign nodules were 2.1 ± 0.6 cm (range 1.0–3.0 cm) and 1.5 ± 0.4 cm (range 1.0–2.7 cm), respectively.

Table 1 Characteristics of pulmonary nodules less than 3 cm

Diffusion-weighted imaging (DWI)

All MR images were obtained with a 1.5 T superconducting system (MAGNETOM Avanto; Siemens, Germany). Conventional MR images and DWI were acquired during the same session. The conventional MR images consisted of a coronal T1-weighted sequence (repetition time [TR] ms/echo time [TE] ms, 177/4.7), and coronal and axial single-shot spin-echo T2-weighted sequences (1000/102). The T1- and T2-weighted images were acquired at a section thickness of 5 mm with 0 mm gap, with a 320 × 156 and 256 × 194 matrix, respectively, and a 263 × 350 mm and 280 × 350 mm field of view (FOV), respectively.

DWI was performed in the transverse plane using a spin-echo, echo-planar imaging sequence with pacing to each breath using the following parameters: TR/TE, 3100/73 with each breath, diffusion gradient encoding in 3 orthogonal directions; b = 800 s/mm2; FOV 310 × 350 mm; 118 × 118 matrix; section thickness, 5 mm with 0 mm gap. Fusion images were made by sources of T2-weighted imaging and DWI to confirm the lesions.

PET-CT scanning

PET scanning in all patients was conducted at the Kameda Medical Center using an integrated PET/CT device (Discovery ST; GE Medical Systems). The patients were instructed to fast for at least 5 h before the intravenous administration of FDG. The dose of FDG was 185–230 MBq, which was determined by each patient’s body mass index.

The acquisition time in 3-dimensional mode was 2.5 or 3 min per table position. Transaxial PET/CT data were acquired at a section thickness of 3.75 mm. PET image data were reconstructed iteratively using the ordered subsets expectation–maximization algorithm with segmented attenuation correction and CT data.

Measurement of signal intensity on DWI

The SI on DWI was measured using the region of interest (ROI) on the lesion using the AZE software program (Virtual Space, AZE Co., Tokyo Japan). The ROI was placed on the nodules. The maximum SI within the ROI was measured on each nodule. To eliminate any fluctuations of SI caused by the imaging conditions in each nodule, the contrast ratio of SI between lesions and the spinal cord (SI-CR) was calculated by the maximum SI of the lesion/mean SI of spinal cord at the same level of the lesion. To examine the variability of the spinal cord SI, the average value of SI at 6 levels of the spinal cord (each three levels above and below the lesion) was compared with the SI of the spinal cord at the same level of the lesion.

PET data analysis

PET data were evaluated semiquantitatively on the basis of the contrast ratio of SUV (SUV-CR), as described previously [25, 26]. Briefly, the ROI was placed on the nodules and at the equivalent point in the contralateral lung. The maximum SUV in the tumor ROI (T) and in the contralateral normal lung ROI (L) was measured. The SUV-CR of each nodule was calculated by T/L as an index of the FDG uptake.

Determining the cutoff value of SI-CR and SUV-CR

A receiver operating characteristic (ROC) curve was constructed using the StateMate software program (StateMate III: ATOMS Ltd., Tokyo Japan) to determine the cutoff values of SI-CR and SUV-CR to discriminate between NSCLC and benign nodules. Nodules with more than the cutoff values of SI-CR and SUV-CR were defined as positive on DWI and FDG-PET, respectively.

Statistical analysis

True-positive, true-negative, false-positive, and false-negative results for detecting pulmonary malignancies were compared between DWI and PET-CT. Sensitivity was calculated as true-positive/true-positive + false-negative, and specificity as true-negative/true-negative + false-positive. The differences in the sensitivity and specificity between DWI and PET were analyzed using McNemar’s test. The differences in the SI of the spinal cord between the same level of the nodule and the other 6 levels were examined by the paired t test. The differences in the tumor size, SI-CR, and SUV-CR between tumors with histological tumor invasiveness or lymph node metastasis and those without were analyzed by the unpaired t test. A statistical analysis was performed using the StatMate software program (StateMate III: ATOMS Ltd., Tokyo Japan). All values in the text and tables are given as the mean ± standard deviation.

Results

DWI could image NSCLCs as well as FDG-PET (Fig. 1). The ROC curve for the diagnosis of malignant nodules showed the optimal cutoff value of SI-CR to be 0.45 and SUV-CR on PET to be 2.65. The mean values of SI of the spinal cord at the same level of nodules and the other 6 levels in each case were 64.0 ± 13.6 (range 39–90) and 65.1 ± 14.4 (range 37–88), respectively, and the difference was not significant.

Fig. 1
figure 1

Pulmonary adenocarcinoma: a CT image, b diffusion-weighted image, and c fluorodeoxyglucose positron emission tomography

Table 2 shows the results of DWI and PET in each histological type of the 66 NSCLCs, i.e. DWI was positive in 57 (86 %) and negative in 9 (14 %), while PET was positive in 47 (71 %) and negative in 19 (29 %). Of the 56 adenocarcinomas, a false-negative finding was shown in 9 (14 %) with DWI, while it was 19 (34 %) with PET. Of the 10 other histological types, none of the lesions (0 %) showed any false-negative findings with both DWI and PET.

Table 2 Findings of diffusion-weighted imaging and positron emission tomography in malignant nodules

Table 3 shows the results of DWI and PET in the 21 inflammatory nodules, i.e. DWI was positive in 3 nodules (14 %) and negative in 18 (86 %), while PET was positive in 4 (19 %) and negative in 17 (81 %).

Table 3 Findings of diffusion-weighted imaging and positron emission tomography in benign nodules

Tables 4, 5, and 6 show the results of comparing between DWI and PET by McNemar’s test. Table 4 shows the results of the diagnosis in NSCLCs. The DWI showed a significantly higher sensitivity than PET (0.86 vs. 0.71, p = 0.013). Table 5 shows the result of the diagnosis in lung adenocarcinoma and other histological types. The DWI showed a significantly higher sensitivity than PET for adenocarcinoma (0.84 vs. 0.66, p = 0.012), whereas all 10 of the other histological types showed a positive finding for both devices. Table 6 shows the result of the diagnosis in benign nodules. The specificities of DWI and PET were 0.90 and 0.81, respectively, of which the difference was not significant (p = 0.63). The one nodule which showed a false-positive finding with DWI and a true-negative finding with PET showed granulomatous change in the surgical materials, i.e. suggestive of old tuberculosis.

Table 4 Correlation between diffusion-weighted imaging and positron emission tomography for diagnosing non-small-cell lung cancer
Table 5 Correlation between diffusion-weighted imaging and positron emission tomography for diagnosing lung adenocarcinoma and other histological types
Table 6 Correlation between diffusion-weighted imaging and positron emission tomography for diagnosing benign pulmonary nodules

Figure 2 shows the distributions of SI-CR and SUV-CR between 25 tumors with lymphatic, vascular, or pleural involvement and 41 without. The mean SI-CR of the invasive ones was 1.66 ± 0.84, which was significantly higher than 0.88 ± 0.6 of the non-invasive ones (Fig. 2a, p < 0.001). The mean SUV-CR of the former was 10.6 ± 7.7, which was significantly higher than 3.7 ± 2.7 of the latter (Fig. 2b, p < 0.001). The mean tumor sizes of the former and latter were 2.1 ± 0.5 and 2.0 ± 0.6 cm, respectively, and the difference was not significant.

Fig. 2
figure 2

The distributions of SI-CR on diffusion-weighted imaging (a) and SUV-CR on fluorodeoxyglucose positron emission tomography (b) between 25 tumors with lymphatic, vascular, or pleural involvement and 41 tumors without

Figure 3 shows the distributions of SI-CR and SUV-CR between 12 tumors with lymph node metastasis and 54 without. The mean SI-CR value of the metastatic ones was 1.76 ± 0.97, which was significantly higher than 1.04 ± 0.6 of the non-metastatic ones (Fig. 3a, p < 0.001). The mean SUV-CR of the metastatic ones was 13.5 ± 84, which was significantly higher than 4.7 ± 4.5 of the non-metastatic ones (Fig. 3b, p < 0.001). The mean tumor sizes of the former and latter were 2.1 ± 0.5 and 2.0 ± 0.6 cm, respectively, and the difference was not significant.

Fig. 3
figure 3

The distributions of SI-CR on diffusion-weighted imaging (a) and SUV-CR on fluorodeoxyglucose-positron emission tomography (b) between 12 tumors with lymph node metastasis and 54 tumors without

Discussion

The present study highlights the following two points: (1) the SI-CR on DWI showed a higher sensitivity than SUV-CR on FDG-PET for NSCLCs measuring less than 3 cm in size, especially for adenocarcinoma, whereas there was no difference in the specificity between the two devices for benign nodules; and (2) the tumors with histological invasiveness or lymph node metastasis showed a significantly higher SI-CR on DWI than those without, similar to the findings for SUV-CR on PET.

The higher sensitivity of DWI compared with FDG-PET was dependent on that for lung adenocarcinoma, which could have been caused by the following: (1) some adenocarcinomas had a lower glucose metabolism, thus resulting in false-negatives by FDG-PET [16]; and (2) because the adenocarcinomas in the present study excluded adenocarcinoma in situ, all of their imaging on CT was solid type, but not GGO, which could be visualized by DWI because of low ADC values and a high SI, independent of their glucose metabolism. We therefore considered that while lung adenocarcinomas with poor invasiveness and a low proliferative activity could not be visualized by FDG-PET, because of their low glucose metabolism, DWI could visualize them except for adenocarcinoma with a GGO appearance.

The present study showed that the SI-CR on DWI showed significant relationships with tumor aggressiveness, as well as SUV-CR on FDG-PET. While Ohba et al. [19] reported that DWI could not predict the tumor invasiveness of NSCLC as well as FDG-PET, their study measured the minimum value of ADC on DWI in each tumor. In this study, we measured the SI-CR instead of ADC because of the following: (1) while the ADC measures the true value of diffusion of water molecules on DWI, it is sometimes affected by the position of ROI within tumors, because the ADC value with ROI including air within tumors or surrounding lung tissue will become lower than its true value; (2) while it cannot be denied that SI can be affected in T2 images, because of imaging on one b value (b = 800 in the present study), the maximum SI in ROI over the tumor is not affected by its positioning. As Wang et al. [27] described that ADC values cannot be measured for the lesions located adjacent to air-containing organs because of susceptibility artifacts, it is therefore difficult to measure the ADC value in tissue containing air. We consider that SI would be more reliable than ADC for analyzing DWI in NSCLC. However, it should be notified that the SI values in the present study are not common, because of a delicate difference in the SI among the devices of MRI.

While there was no significant difference in the specificities between DWI and PET, one benign nodule that was suggestive of old tuberculosis showed a false-positive finding with DWI and a true-negative finding with PET, which thus supported the results of our previous report showing false-positive findings for lymph node with granulomatous change [19]. It should be noted that nodules with granulomatous change or dense fibrotic scar may show false-positive findings for DWI.

While the present study showed the superiority of DWI to PET for discriminating between malignant and benign nodules, DWI still has the following advantages over PET: (1) patients do not have to fast before examination; (2) there is no radiation exposure; (3) less time is required for the examination (30 min in DWI vs. 90 min in PET-CT); and (4) the cost is considerably less ($300 for DWI vs. $1400 for PET-CT in Japan). While FDG-PET is still superior to DWI for whole body imaging, this study showed that DWI could be used for discriminating between benign pulmonary nodules and NSCLC, especially adenocarcinoma, and even for predicting tumor aggressiveness as well as FDG-PET.