The prevalence of adrenal masses incidentally detected by cross-sectional imaging modalities (incidentalomas) may range between 0.35% and 6% of the different imaged population [1, 2]. Although most such adrenal incidentalomas are benign tumors such as adenoma (50–80%) or myelolipoma (5–10%), pheochromocytoma is sometimes discovered incidentally (5%) [1]. The most common clinical conundrum lies in differentiating malignant from benign tumors. However, differentiating pheochromocytoma from other benign adrenal tumors is also important, because management strategies differ substantially.

Imaging plays a key role in the characterization of adrenal tumors. An attenuation value of ≤10 Hounsfield unit (HU) on unenhanced computed tomography (CT) is highly specific for lipid-rich adenoma [3]. However, up to 30% of adenomas exhibit attenuation values >10 HU (lipid-poor adenomas) and cannot be specifically diagnosed on unenhanced CT [4]. In such cases, other imaging techniques are required for further characterization of the adrenal tumors. Several CT washout percentage methods reportedly offer high diagnostic accuracy for discriminating between adenomas and non-adenomas [1]. Pheochromocytomas not only may show low loss of CT contrast enhancement when compared with adenomas [5], but may also show adenoma-like patterns of enhancement loss [6]. Chemical shift magnetic resonance imaging (MRI) is also useful in differentiating adenomas from non-adenomas [7]. However, some lipid-poor adenomas cannot be differentiated from other adrenal tumors using this technique [1]. Although 123I meta-iodobenzylguanidine (MIBG) imaging is the technique of choice for evaluating pheochromocytoma, offering a specificity of 95–100%, and a sensitivity of 83–100% [8], pheochromocytoma occasionally exhibit variable imaging features that may mimic other benign and malignant tumors.

Uptake of the glucose analog 18F-fluorodeoxyglucose (FDG) represents local glycolytic activity, and this substance has thus seen wide use as a tracer for positron emission tomography (PET)/CT in oncology [9]. FDG-PET has been used for evaluating adrenal lesions, and benign adrenal tumors typically show lower FDG uptake than background liver uptake [10]. Most pheochromocytomas are seen as FDG-avid tumors [11, 12], and FDG-PET may therefore be useful to differentiate pheochromocytoma from other benign adrenal tumors. However, to the best of our knowledge, no reports have focused on this theme.

Diffusion-weighted imaging (DWI) has emerged as a new diagnostic technique in the evaluation of various abdominal lesions and has been shown to be helpful in the characterization of tumors on the basis of water molecular diffusivity through the measurement of apparent diffusion coefficient (ADC) [13]. Despite previous investigational efforts, the usefulness of DWI in characterizing adrenal tumors has not been established [1418]. In particular, only a few reports have described DWI for pheochromocytoma [18, 19], and the value of this modality in differentiating between pheochromocytoma and other benign adrenal tumors is largely unknown.

Although both FDG-PET and DWI are available for investigating the biological characteristics of tumors, no reports appear to have determined FDG standardized uptake value (SUV) and ADC from DWI in the same patients with adrenal tumors. The aim of this study was to compare the diagnostic performances of FDG-PET/CT-based indices and ADC in differentiating between pheochromocytoma and other benign adrenal tumors.

Materials and methods

Patients

All study protocols for this retrospective study were approved by the institutional review board, and the need to obtain informed consent was waived. From January 2011 to June 2013, a total of 66 consecutive patients underwent MRI for evaluation of suspected adrenal disease. Clinical records were reviewed to identify patients for analysis in the study. Inclusion criteria were (1) patients who underwent both DWI and FDG-PET/CT; (2) with adrenal lesions >1 cm in diameter; and (3) diagnoses confirmed by hormonal, pathological, and/or follow-up imaging examinations.

Final diagnoses of the adrenal tumors were based on the available clinical, imaging, and pathological data. For the definite diagnosis of adenoma when pathological diagnosis was not available, we used imaging results from follow-up for ≥6 months. Lesions that showed stability on a 6-month follow-up scan were considered as benign in accordance with a previous report [20]. Adrenal myelolipoma was diagnosed based on the detection of macroscopic fat on CT [21] or surgical pathology. Pheochromocytoma and schwannoma were diagnosed based on surgical pathology.

Imaging protocols

FDG-PET/CT protocols and I-123 MIBG imaging

All PET/CT studies were performed using a Discovery 600 M PET/CT system (GE Medical Systems, Milwaukee, WI). All patients were instructed to fast for ≥5 h before the examinations. Image acquisition started 1 h after intravenous injection of FDG (140-210 MBq). At FDG injection, the mean plasma glucose level was 105 mg/dl (range 74–149 mg/dl). CT covering from the brain to the feet was performed immediately before PET using a 16-slice CT scanner [slice thickness, 3.75 mm; pitch, 1.75 mm; 120 keV; auto mA (35–100 mA depending on the total body mass of the patient)]. Whole-body PET was then performed, covering an area identical to that covered by CT. Acquisition time was 2.5 min per bed position, with 14 bed positions. Emission data were reconstructed using a 3-dimensional (3D) ordered-subset expectation maximization algorithm (16 subsets, 2 iterations: VUE Point Plus) to 192 × 192 matrices using the CT data for attenuation correction. The reconstructed transaxial spatial resolution for PET was 5.1 mm full-width half-maximum in-plane.

123I-MIBG imaging was performed as follows: Anterior and posterior whole-body planar images were acquired 24 h after intravenous injection of 111 MBq of 123I-MIBG, and single photon emission CT (SPECT) was also performed to cover the areas suspected of abnormal tracer uptake in whole-body planar images (E.COM Signature; Toshiba/Siemens, Tokyo, Japan).

MRI protocols

All MRI studies were performed using a 3.0-T system (Ingenia 3.0 T; Philips Healthcare, Best, the Netherlands) with a maximum gradient strength of 45 mT/m, slew rate of 200 mT/m/s, and a dStream Torso coil. Free breathing DWI with b values of 0 and 1000 s/mm2 (motion probing gradients in the x, y, and z directions) was performed in 20 axial slices using a single-shot echo-planar (EP) sequence and spectral presaturation with inversion recovery for fat suppression. Pulse sequence parameters were as follows: repetition time (TR), 5350 ms; echo time (TE), 50 ms; field of view (FOV), 350 mm; matrix, 152 × 112; EP imaging factor, 83; half scan factor, 0.6; sensitivity encoding (SENSE) acceleration factor, 2; section thickness, 5 mm; section gap, 0.5 mm; and number of excitations (NEX), 3. ADC maps were generated using the software supplied with the MRI unit. Prior to DWI, breath-holding transverse fast field-echo in-phase (IP) and opposed-phase (OP) T1-weighted imaging [TR, 150 ms; TE, 2.3 ms (IP) and 1.13 ms (OP); FOV, 350 mm; matrix, 225 × 220; SENSE factor, 1.4; slice thickness, 5 mm; slice gap, 0.5 mm; NEX, 1] and respiratory-triggered transverse T2-weighted turbo spin-echo with fat-suppression imaging [TR, depending on respiratory interval; TE, 73 ms; FOV, 350 mm; matrix, 198 × 256; echo train length, 17; SENSE factor, 2; slice thickness, 5 mm; slice gap, 0.5 mm; NEX, 1] were performed.

Image analysis

All PET/CT images were analyzed on a workstation (Advantage Windows Workstation; GE Healthcare, Milwaukee, WI). Two nuclear medicine radiologists (M.J. and Y.N., with 4 and 2 years of experience in PET/CT interpretation, respectively) who knew the purpose of PET/CT studies but were blinded to clinical and pathological information and results of other imaging studies, interpreted PET/CT images in consensus. They visually scored FDG uptake of a lesion using a 4-point scale [20]: 0, less than the liver; 1, equal to the liver; 2, moderately increased compared to the liver; or 3, markedly increased compared to the liver. In addition, semiquantitative analysis of adrenal tumors was performed by them in consensus using maximum SUV (SUVmax). SUVmax was automatically measured as follows: first, a rectangular 3D-search region of interest (ROI) was drawn around the tumor in a suitable reference transaxial plane, excluding adjacent avid normal structures. The volume of interest was then defined by adapting an orthogonal ROI in the sagittal plane to encompass the craniocaudal extent of the tumor and in the coronal plane to cover the mediolateral extent of the tumor. A circular ROI measuring 17.5 cm2 was placed over the central solid portion of the right hepatic lobe to obtain liver SUVmax. For each lesion, the ratio of adrenal lesion-to-liver SUVmax (A/L SUVmax ratio) was calculated by dividing the adrenal lesion SUVmax by the liver SUVmax.

All images from DWI were reviewed in consensus by two radiologists (T.S. and Y.F., with 12 and 22 years of experience in abdominal radiology, respectively) who were blinded to the clinical and pathological information and results of other imaging studies. ADC distribution was demonstrated on an ADC map using the communication system workstation monitor (SYNAPS; Fujifilm Medical Systems, Tokyo, Japan). Mean ADC was measured within a circular or ovoid ROI placed in each tumor. The size of the ROI was determined to be as large as possible, and care was taken to avoid any cystic, necrotic, or hemorrhagic components using other MRI sequences as reference. The SUVmax/ADC (SUVmax × 10−3 mm2/sec÷ADC) ratio was a newly generated index using adrenal lesion SUVmax and ADC for discriminating between pheochromocytoma and other benign adrenal tumors or lipid-poor adenomas.

CT attenuation and the size (maximum diameter) of each lesion were measured in consensus by other two radiologists (MT.N. and MY.N., with 13 and 41 years of experience in radiology, respectively) on unenhanced CT from PET/CT. Mean CT density was obtained within a circular or ovoid ROI placed in each tumor, avoiding any cystic, necrotic, hemorrhagic, and macroscopic fatty components. These two radiologists also determined in consensus whether 123I-MIBG uptake was seen in the adrenal tumors.

Statistical analysis

The Mann–Whitney U test was used to assess differences in numerical variables between pheochromocytoma and other benign adrenal tumors. The correlation between SUVmax and ADC was analyzed using the Spearman’s rank test. To examine the diagnostic performances of FDG visual score, SUVmax, A/L SUVmax ratio, ADC, and SUVmax/ADC ratio in discriminating between pheochromocytoma and other benign adrenal tumors or lipid-poor adenomas, receiver operating characteristic (ROC) analyses were conducted. Area under the curve (AUC) was compared using the Student’s t test. Data are expressed as mean value ± standard deviation (SD) and/or range. A value of P < 0.05 was considered statistically significant and all P-values presented are two-sided. All statistical analyses were performed using MedCalc statistical software (MedCalc Software, Mariakerke, Belgium).

Results

Tumor characteristics

From the initial 66 patients, 35 patients were excluded because only DWI had been performed. One patient with a clinically diagnosed metastatic adrenal tumor from rectal cancer was excluded. Finally, 30 patients (19 males, 11 females; mean (±SD) age, 57 ± 20 years; range 16–86 years) with 33 adrenal tumors met the inclusion criteria and thus were eligible for analysis (Fig. 1). Table 1 shows the characteristics of the study population. A unilateral adrenal tumor was present in 27 patients. One patient showed an adenoma in the left adrenal and a pheochromocytoma in the right adrenal. Another patient had a schwannoma in the right gland and a pheochromocytoma in the left gland. The remaining patient showed bilateral pheochromocytomas. As a result, 17 lipid-poor adenomas, 3 myelolipomas, 2 schwannomas, and 11 pheochromocytomas were identified. Fifteen lipid-poor adenomas were diagnosed by lack of both adrenal hyper-function and tumor enlargement on follow-up CT or MRI studies (mean follow-up period, 272 ± 47 days; range 195–350 days), and two adenomas were diagnosed by surgical pathology. Two myelolipomas were diagnosed by CT and one myelolipoma was diagnosed by surgical pathology. The mean interval between FDG-PET/CT and MRI studies was 25 ± 21 days (range 3–71 days), and 23 patients underwent FDG-PET/CT first.

Fig. 1
figure 1

Flow-chart of the study population.

Table 1 Characteristics of 33 adrenal tumors in 30 patients

Unenhanced CT attenuation was >10 HU in all 33 tumors (mean 31.4 ± 7.1 HU; range 20–46 HU), and no significant difference in CT density was seen between pheochromocytoma (n = 11; 34.3 ± 7.9 HU; range 22–46 HU) and other benign adrenal tumors (n = 22; 29.9 ± 6.3 HU; range 20–39 HU; P = 0.08). Pheochromocytoma (n = 11; 56 ± 32 mm; range 20–130 mm) was significantly larger than both other benign adrenal tumors (n = 22; 30 ± 27 mm; range 12–130 mm; P = 0.005) and lipid-poor adenoma (n = 17; 20 ± 8 mm; range 12–45 mm; P < 0.001).

123I-MIBG imaging was performed in 25 patients with 28 adrenal tumors. All 11 pheochromocytomas showed avid (positive) uptake, and all other 17 benign tumors showed non-avid (negative) uptake on 123I-MIBG imaging. The interval between 123I-MIBG imaging and FDG-PET/CT or MRI was 16 ± 9 days (range 1–31 days), and 15 patients underwent 123I-MIBG imaging first.

Comparison of FDG indices and ADC between pheochromocytoma and other benign adrenal tumors

All FDG indices were significantly higher, and ADC was significantly lower in pheochromocytoma (n = 11) than in other benign tumors (n = 22). Mean values for pheochromocytoma and other benign tumors were 2.8 ± 0.4 (range 2–3) vs. 1.3 ± 0.9 (range 0–3) for visual score (P < 0.001), 11.0 ± 8.9 (range 3.8–32.5) vs. 3.2 ± 1.4 (range 1.9–7.9) for SUVmax (P < 0.001), 3.96 ± 3.48 (range 1.1–10.8) vs. 0.96 ± 0.51 (range 0.58–2.59) for A/L SUVmax ratio (P < 0.001), and 1.08 ± 0.23 × 10−3 mm2/s (range 0.72–1.43 × 10−3 mm2/s) vs. 1.43 ± 0.29 × 10−3 mm2/s (range 1.10–1.99 × 10−3 mm2/s) for ADC (P = 0.003), respectively. All FDG indices were also significantly higher, and ADC was significantly lower in pheochromocytoma (n = 11) than in lipid-poor adenoma (n = 17). Mean values for lipid-poor adenoma were 1.0 ± 0.6 (range 0–2) for visual score (P < 0.001 compared with pheochromocytoma), 2.7 ± 0.4 (range 1.9–3.3) for SUVmax (P < 0.001), 0.78 ± 0.14 (range 0.58–1.00) for A/L SUVmax ratio (P < 0.001), and 1.43 ± 0.30 × 10−3 mm2/s (range 1.10–1.99 × 10−3 mm2/s) for ADC (P = 0.004), respectively. SUVmax/ADC ratio was also significantly higher in pheochromocytoma (mean 10.6 ± 8.09; range 2.71–25.1) than in other benign tumors (mean 2.28 ± 0.98; range 1.07–5.64) or lipid-poor adenoma (mean 1.91 ± 0.45; range 1.07–2.91) (both P < 0.001). SUVmax and ADC showed a significant negative correlation (ρ = −0.36, P = 0.039) (Fig. 2).

Fig. 2
figure 2

Scatter plots of primary tumor FDG SUVmax and ADC in 33 adrenal tumors. A significant negative correlation is seen between SUVmax and ADC (ρ = −0.36, P = 0.039). Filled circle: Pheochromocytoma; filled diamond: Other benign adrenal tumor. The horizontal line at an SUVmax of 3.3 and the vertical line at an ADC of 1.08 × 10−3 mm2/s represent the optimum thresholds to discriminate between pheochromocytoma and other benign adrenal tumor groups, respectively.

FDG indices and ADC to differentiate pheochromocytoma from other benign adrenal tumors

Optimal cutoff values were 2 for visual score, 3.3 for SUVmax, 1.0 for A/L SUVmax ratio, 1.08 × 10−3 mm2/s for ADC, and 4.0 for SUVmax/ADC ratio to differentiate between pheochromocytoma and other benign adrenal tumors. Diagnostic performances of the FDG indices and ADC to detect pheochromocytoma are summarized in Table 2. Accuracy and AUC were 82% and 0.91 for visual score, 91% and 0.95 for SUVmax, 91% and 0.93 for A/L SUVmax ratio, 88% and 0.83 for ADC, and 91% and 0.97 for SUVmax/ADC ratio. No significant differences in AUC were found among FDG indices, ADC and SUVmax/ADC ratio (P ≥ 0.05 each).

Table 2 Diagnostic performances of FDG-based indices and ADC for diagnosing pheochromocytoma

The accuracies of the combined diagnosis of two of FDG-PET indices, ADC and SUVmax/ADC ratio, were not superior to the individual index in differentiating pheochromocytomas from other benign tumors as shown in Table 3.

Table 3 Diagnostic performances for diagnosing pheochromocytoma using combined use of two indices

Representative images of adrenal adenoma and pheochromocytoma are shown in Figs. 3 and 4, respectively.

Fig. 3
figure 3

62-year-old man with a left adrenal lipid-poor adenoma (Tumor 6). The left adrenal tumor shows slight FDG uptake (arrow) (visual score, 1; SUVmax, 3.3; A/L SUVmax ratio, 0.73) on FDG-PET/CT (A), hyperintensity (arrow) on DWI (B) and a high ADC (1.23 × 10−3 mm2/s, arrow) on the ADC map (C).

Fig. 4
figure 4

72-year-old woman with a right adrenal pheochromocytoma (Tumor 29) The right adrenal tumor shows increased FDG uptake (arrow) (visual score, 3; SUVmax, 6.1, A/L SUVmax ratio, 1.79) on FDG-PET/CT (A), hyperintensity (arrow) on DWI (B) and a lower ADC (0.95 × 10−3 mm2/s, arrow) on the ADC map (C).

Discussion

The purpose of this study was to examine the diagnostic performances of FDG-based indices and DWI ADC to distinguish between benign pheochromocytoma and other benign adrenal tumors in the patients who were performed both FDG-PET/CT and DWI examinations. For this purpose, we selected retrospectively the eligible patients from the MRI database. As a result, the frequencies of individual incidentalomas differed considerably from those of imaged population [1]; 11 pheochromocytomas (33%) and 22 other benign adrenal tumors (67%) including 17 lipid-poor adenomas (52%), 3 myelolipomas (9%) and 2 schwannomas (6%), all of which might be difficult to be diagnosed by one-point unenhanced CT examinations unless the macroscopic fat component was present in myelolipoma [21].

FDG-PET and PET/CT have been reported to be valuable in the differentiation of adrenal masses, especially between malignant and benign lesions [20]. Although the normal adrenal gland shows uptake of FDG, SUVmax ranges from 0.95 to 2.46, generally less than that of the liver [22]. Groussin et al. [23] reported that use of a SUVmax cutoff of 3.4 offered a sensitivity of 100% and a specificity of 70% in distinguishing between adenoma and adrenocortical carcinoma. Metser et al. [24] reported that use of an SUV cutoff of 3.1 yielded a sensitivity of 100% and a specificity of 98% for diagnosing malignancy. When the SUVmax did not exceed that of the liver (A/L SUVmax ratio < 1), FDG/PET demonstrated a sensitivity of 93.8% and a specificity of 100% for diagnosing benign adrenal tumors [25]. In that report, the A/L SUVmax ratio ranged between 0.22 and 1.47 for benign lesions.

In the present study, 16 of 22 (73%) benign non-pheochromocytoma tumors showed visually weak FDG uptake (visual score 0–1), and the mean SUVmax and A/L SUVmax ratio for these 22 benign tumors were 3.2 (range 1.9–7.9) and 0.96 (range 0.58–2.59), respectively. Seventeen lipid-poor adenomas showed similar findings (mean SUVmax, 2.7 (range 1.9–3.3); mean A/L SUVmax ratio, 0.78 (range 0.58–1.00), and these results were compatible with those in the reports described above.

Shulkin et al. [11] found FDG-avid pheochromocytomas in 22 of 29 patients (76%). Taïeb et al. [12] investigated 28 patients with benign and malignant chromaffin-derived tumors (pheochromocytoma and paraganglioma) and reported that the mean SUVmax was 8.2 (range 1.9–42) for benign tumors and 9.7 (range 2.3–29.3) for malignant tumors. In our study, all benign pheochromocytomas showed FDG-avid uptake above the liver activity (visual score 2–3), and the mean SUVmax and A/L SUVmax ratio were 11.0 (range 3.8–32.5) and 3.96 (range 1.1–10.8), respectively, significantly higher than those of other benign adrenal tumors. Pheochromocytomas, even benign tumors, may thus show high FDG-avid uptake like malignant adrenal tumors.

To the best of our knowledge, only a few reports have described ADC for pheochromocytoma [18, 19]. Song et al. [18] found no significant difference in mean ADC between benign pheochromocytoma [n = 22, 1.15 (range 0.49–2.14) × 10−3 mm2/s] and lipid-poor adenoma [n = 18, 1.07 (range 0.81–1.53) × 10−3 mm2/s] at 3.0 T (b = 0 and 800 s/mm2). However, Dong et al. [19] used the same 3.0 T machine (b = 0 and 800 s/mm2) found the mean ADC (n = 29, 0.918 ± 0.124 × 10−3 mm2/s) of benign pheochromocytoma which was lower than that of Song et al. We found that the mean ADC was significantly lower in pheochromocytoma (1.08 ± 0.23 × 10−3 mm2/s; range 0.72–1.43 × 10−3 mm2/s) than in either other benign adrenal tumors (1.43 ± 0.29 × 10−3 mm2/s; range 1.10–1.99 × 10−3 mm2/s) or lipid-poor adenoma (1.43 ± 0.30 × 10−3 mm2/s; range 1.10–1.99 × 10−3 mm2/s) at the different 3.0 T machine (b = 0 and 1000 s/mm2). The reasons for wide variation of ADC values between the studies are not clear. However, they may be related to differences in used machines, imaging parameters including b values and the number of tumors.

Although the diagnostic accuracies to differentiate pheochromocytoma from other benign adrenal tumors did not differ significantly between individual FDG-based indices and ADC, the sensitivity and specificity differed between FDG-based indices and ADC: On FDG-PET/CT, the sensitivities were high (100% for all FDG-based indices [11/11]), but the specificities were low; 73% (16/22) for visual score, 86% (19/22) for SUVmax and A/L ratio, respectively. The false-positive non-pheochromocytoma tumors were 3 adenomas, 2 schwannomas and one myelolipoma for visual score, and 2 schwannomas and one myelolipoma for SUVmax and A/L ratio. It is known that occasionally, adenomas demonstrate positive FDG uptake, at levels even those in the liver [2]. However, 3 false-positive adenomas with visual score 2 showed SUVmax of 2.9–3.2 and A/L ratio of 0.97–1.0 which were less than the individual cutoff values. Thus, 2 schwannomas and one myelolipoma were definitively false positive. Although FDG uptake was reported to be low in 6 myelolipomas [24], high uptake of FDG was also noted in 3 myelolipomas [26, 27] and in a schwannoma [28] as case reports, respectively. These results suggest that some benign non-pheochromocytoma adrenal tumors (27% for visual score and 14% for SUVmax and A/L ratio in our study) can show avid FDG uptake like pheochromocytoma.

On the other hand, on MRI, the specificity was high, 100% (22/22), and the sensitivity was low, 64% (7/11) due to 4 false-negative pheochromocytomas for ADC, suggesting that some pheochromocytomas (36% in our study) can exhibit ADC values like non-pheochromocytoma tumors. The reasons why some pheochromocytomas showed the high ADC values like non-pheochromocytoma tumors are unknown. However, 3 of 4 false-negative pheochromocytomas showed relatively low FDG uptake (SUVmax; 3.8, 4, and 5.1) in comparison with 7 other pheochromocytomas (SUVmax; 6–18.9). Thus, metabolic activity may be related to ADC.

When we examined the relationship between FDG SUVmax and ADC for all tumors, a significant negative correlation was identified between SUVmax and ADC (Fig. 2). Given the higher SUVmax in pheochromocytomas compared to other benign adrenal tumors, these results appear to agree with previous reports of a significant negative correlation between SUVmax and ADC in breast cancers and head and neck cancers [29, 30]. FDG uptake in the tumor represents glycolytic activity, which is influenced by several enzymatic and biological properties [31], while ADC reflects the restriction of water molecular diffusivity by microstructural barriers in biological tissues such as cell membranes [32]. The moderate negative correlation between SUVmax and ADC revealed in the present study suggests that glycolytic activity shows a significant relationship to the microstructural environment in adrenal tumors. Thus, we generated a new index using this negative correlation, SUVmax/ADC ratio. Although this index and other indices did not differ significantly in diagnostic accuracy, it yielded the highest AUC of 0.97, the balanced sensitivity (82% [9/11]), specificity (95% [21/22]), positive predictive value (90% [9/10]), and negative predictive value (91% [21/23]).

We examined whether combination of two of each modality indices could improve the diagnostic accuracy. However, the significant improvement in diagnostic accuracy was not obtained in any combinations (Table 3). This may be due to the relatively high diagnostic accuracy of each index in this small study population. Apart from the diagnostic accuracy, the confident level would increase when concordant results are obtained in various indices.

123I-MIBG, a noradrenaline analog, accumulates in pheochromocytomas. Previous reports have shown that 123I-MIBG imaging offers a sensitivity of 83–100% and a specificity of 95–100% for diagnosing pheochromocytomas [8]. In the present study, no false-negative or false-positive tumors were identified among 11 pheochromocytomas and 17 non-pheochromocytoma tumors of the 25 patients who underwent 123I-MIBG imaging. Although 123I-MIBG imaging showed the highest sensitivity and specificity, both FDG-based indices and ADC also showed high accuracies in discriminating between pheochromocytoma and other benign adrenal tumors (Table 2). Some reports have described false-negative MIBG examinations that might have been caused by necrosis or dedifferentiation of pheochromocytomas [33], and FDG-PET/CT may help diagnose pheochromocytoma in such cases, as described by Mamede et al. [34].

Boland et al. suggested an excellent algorithmic approach including biochemical assessment, previous imaging, various CT, MRI, and PET methods, and biopsy in characterization of adrenal incidentalomas [1]. Biochemical assessment is essential for characterization of adrenal incidentalomas, and when pheochromocytoma is suspected biochemically, MIBG imaging may be the first choice of morphological imaging methods. However, biochemical examinations are not always performed before detailed morphological evaluation using contrast enhanced CT, MRI, and FDG-PET/CT in clinical practice. When adrenal incidentalomas are discovered, FDG-based indices or ADC may be useful as well as other morphological methods to differentiate pheochromocytoma from other benign adrenal tumors.

Various limitations of this retrospective study should be considered when interpreting the results. As already mentioned, the resultant study cohort was biased and selected. The study included a relatively small number of patients from a single institution. Pathological confirmation was not always obtained for all tumors. Intervals between FDG-PET/CT and MRI varied substantially (range 3–71 days), and most patients were done FDG-PET/CT first. Therefore, an incidental finding of FDG-PET/CT might lead to MRI which led to biased population. No lipid-rich adenomas were identified in the resultant study population, probably because lipid-rich adenomas were diagnosed by their low density on unenhanced CT, and further imaging studies were considered unnecessary [1]. Tumors less than 1 cm in diameter were not identified, probably because of the limited resolution of PET [2]. Comparison between benign and malignant tumors was not performed. We speculate that discriminating pheochromocytoma from malignant adrenal tumors by FDG indices or ADC might be difficult, because our study revealed that pheochromocytoma typically shows high FDG uptake and low ADC, which may be similar to malignant adrenal tumors. However, this speculation remains to be clarified.

In conclusion, although 123I-MIBG imaging may be a specific imaging modality for diagnosing pheochromocytoma, FDG indices and ADC may also prove highly diagnostic in differentiating pheochromocytoma from other benign adrenal tumors.