Introduction

The external auditory canal and middle ear contain a wide spectrum of benign and malignant tumors [1]. Since the early signs and symptoms are non-specific, benign and malignant tumors in this area can have similar manifestations [2]. Clinically, patients with malignant external auditory canal and middle ear tumors (EAMETs) often have an overall poor prognosis and generally require radiation therapy and chemotherapy as well as surgical resection [3, 4], whereas patients with benign tumors, such as papilloma, only undergo complete surgical resection [5]. Thus, the differentiation of benign and malignant EAMETs is greatly essential for therapeutic decisions and prognosis.

Temporal bone is irregular and located at the sides and base of the skull, which hinders the physical examination of the tumors in this area. Hence, imaging techniques, such as CT and MR imaging, play an important role in the diagnosis of auricular tumors. CT is highly sensitive in detecting the osseous invasion, and MRI is superior to CT in the demonstration of early lesions without bony destruction or those with an associated soft tissue component and extraosseous involvement, or the description of the possible invasion into the membranous labyrinth [6, 7]. Thus, imaging approach is important in specifying benign and malignant tumors in this region. Moreover, advanced MRI techniques, such as diffusion-weighted MRI (DWI), have the potential to reflect the physiological and functional information [8]. As previously reported, DWI has been used to discriminate between benign and malignant head and neck lesions [9, 10]. High-resolution DWI using readout-segmented echo-planar imaging (termed RESOLVE) can offer higher signal-to-noise ratio and resolution in the head and neck areas than conventional single shot echo planar image (SS-EPI) DWI [11, 12], thus potentially improving the diagnostic efficiency and reliability of ADC value. Thus, the present study aims to explore morphology and high-resolution diffusion features on CT and MRI to discriminate these two entities.

Materials and methods

Patient selection

Our institutional review board approved this retrospective study, and the requirement for patient informed consent was waived due to the retrospective nature of this study. Seventy consecutive patients with benign and malignant EAMETs were recruited from January 2015 to December 2019. The recruitment based on the following inclusion criteria: (1) all tumors were confirmed by histopathology (surgery/biopsy) and (2) standard CT, MRI, and RESOLVE-DWI were performed. The exclusion criteria were as follows: (1) image quality was insufficient because of severe artifacts (n = 5); (2) patients had a history of treatment or recurrence before CT, MRI and RESOLVE–DWI examinations (n = 10); (3) the maximum diameter of the tumors ≤ 1 cm (n = 8).

CT protocol

The CT examinations were performed on a CT scanner (Somatom Definition; Siemens Healthcare, Erlangen, Germany). The following parameters were used for data acquisition: tube voltage = 120 kVp; tube current = 240 mAs; pitch = 0.8; slice thickness = 2 mm; and field of view (FOV) = 220 × 240 mm. The soft tissue and bone algorithm reconstruction images were automatically obtained with or without intravenous injection of 70 ml iopamidol (320 mg I/ml; Isovue, Bracco Diagnostics, Inc, Singen, Germany).

MRI protocol

All scans were performed on a 3.0 T MR scanner (MAGNETOM, Verio, Siemens, Erlangen, Germany) using a 12-channel head coil. Conventional MRI consisted of an axial fast spin-echo T1-weighted imaging (T1WI) (repetition time [TR] = 384 ms, echo time [TE] = 9.1 ms), an axial fast spin-echo T2-weighted imaging (T2WI) (TR = 4000 ms, TE = 99 ms) with or without fat suppression, and axial, coronal and sagittal contrast-enhanced T1WI with fat suppression. Gadolinium-enhanced three-dimensional gradient recalled-echo T1-weighted imaging (3D-GRE-T1WI) was performed after intravenous injection of 0.1 mmol/kg of Gd-DTPA (Magnevist, Bayer Schering, Berlin, Germany) at a rate of 2 ml/s, followed by 20 ml of 0.9% saline flush using a power injector, with a matrix of 192 × 192, a field of view (FOV) of 220 × 220 mm, and a thickness/gap of 2/0.2 mm. For RESOLVE-DWI, a readout-segmented EPI, parallel imaging and two-dimensional navigator-based reacquisition were used; all DWI sequences were scanned axially. The parameters were as follows: TR = 4700 ms, TE = 66 ms; slice thickness/gap = 2/0.2 mm; matrix = 192 × 192; FOV = 240 × 240 mm; readout segments = 7; number of excitation = 1; gradient factors: b = 0, 1000 s/mm2; diffusion directions = 3.

Imaging analysis

Two radiologists (readers 1 and 2), who had 7 and 8 years of experience in head and neck imaging, respectively, were blinded to the patients' histopathology before they independently analyzed the images. Disagreements were resolved through further discussion with a third radiologist (with 20 years of experience) to reach a consensus. The features of the mass, including size, shape, signal intensity, border, enhancement degree, and bone changes, were assessed. The sizes were measured in maximum dimensions on the transverse plane. The shape was defined as round/oval or irregular. Signal intensity was classified as homogeneous or heterogeneous on both pre- and post-contrast CT and MR images. The border of the tumor was described as well-defined (more than two thirds of the border was sharply demarcated) or ill-defined (less than one-third of the border was sharply demarcated). The enhancement degree was described as poor-moderate (similar to or greater than muscles, but lower than blood vessels), or intense (similar to the blood vessels). Bone changes were defined as expansive change or invasive destruction on CT images.

To quantitatively analyze the data acquired from the RESOLVE-DWI, two radiologists blinded to the clinical and histopathological data measured the ADC values of the tumors by drawing ROIs (mean area, 117.17 mm2; range, 33.12–445.40 mm2) on an offline workstation, taking care to exclude obvious hemorrhagic and necrotic areas and avoid the most peripheral portions to exclude partial volume effects of adjacent extralesional tissue. The measurements made by readers 1 and 2 were averaged for statistical analysis.

Statistical analysis

The demographic, CT and MRI features between benign and malignant EAMETs were compared by using the chi-square test for categorical variables and the t test for noncategorical data. If the categorical count data was less than five, Fisher’s exact test was applied instead. Then, significant CT and MRI characteristics as well as the ADC value were included into multivariate logistic regression analyses to calculate the odds ratios (ORs) with 95% confidence intervals (CIs). Receiver operating characteristic (ROC) curves were used to evaluate the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy of RESOLVE-DWI, CT + MRI, and CT + MRI + RESOLVE-DWI. Delong test was used to compare the AUCs for further assess their diagnostic efficacy [13]. The kappa values and interclass correlation coefficients (ICCs) were calculated to evaluate inter-reader agreement of qualitative and quantitative features on the CT and MRI, respectively. Kappa values of 0.61 to 1.00 indicated substantial to almost perfect agreement [14], and ICC > 0.75 was considered to indicate a good to excellent reliability agreement [15]. The statistical analyses were performed using the MedCalc statistical software (Version 15.2.2, Ostend, Belgium) and SPSS statistical package (Version 22.0, Chicago, IL). A p-value < 0.05 was considered to indicate a statistically significant difference.

Results

As summarized in Table 1, a cohort of 47 patients consisting of 23 patients with benign tumors (8 males and 15 females; mean age 47 ± 11 years) and 24 patients with malignant ones (11 males and 13 females; mean age 50 ± 14 years) was included in this study. The shape, border, bone change, and enhancement degree of malignant tumors were all significantly different from those of benign ones (p < 0.05). Compared with benign tumors, the malignant EAMETs are characterized by irregular shape, ill-defined border, invasive bone destruction, and intense enhancement (Figs. 1, 2, Table 2). In addition, the size and signal intensity were not significantly different between the benign and malignant tumors (p > 0.05).

Table 1 Demographic and histology of benign and malignant EAMETs
Fig. 1
figure 1

Facial nerve schwannoman in a 35-year-old man. Axial CT soft tissue image (A) and bone algorithm reconstruction image (B) shows that a mass predominantly locates in the right middle ear and mastoid portions with expansive bone change. Axial T2W (C) and T1W (D) imaging demonstrate an oval, well-defined homogeneous signal lesion with compressive bone resorption. A post-contrast axial T1W image (E) and coronal T1W image (F) reveal a mild enhanced mass. The mass showed hypointensity on the RESOLVE-DWI (G). On the ADC map (E), the mass showed a high signal intensity with an ADC value of 974.17 × 10–6 mm2/s

Fig. 2
figure 2

Squamous cell carcinoma in a 57-year-old man. Axial CT soft tissue image (A) and bone algorithm reconstruction image (B) shows that a mass predominantly locates in the left middle ear and mastoid portions with invasive bone destruction. Axial T2W (C) and T1W (D) imaging demonstrate an irregular, ill-defined heterogeneous signal lesion with invasive bone destruction. A post-contrast axial T1W image (E) and coronal T1W image (F) reveal an intense enhanced mass. The mass showed hyperintensity on the RESOLVE-DWI (G). On the ADC map (E), the mass showed a low signal intensity with an ADC value of 706.27 × 10–6 mm2/s

Table 2 The CT and MRI features of benign and malignant EAMETs

Malignant EAMETs exhibited significantly lower ADC values than benign entities (p < 0.05), and the tumor with an ADC ≤ 920.33 × 10–6 mm2/s were more likely to be malignant lesions than those with an ADC > 920.33 × 10–6 mm2/s. Multivariate logistic regression analysis suggested that tumors with ill-defined margin, invasive bone destruction, and low ADC (≤ 920.33 × 10–6 mm2/s) were more likely to be malignant than those without this finding (OR = 13.06, 95% CI = 5.09–19.10, p < 0.001; OR = 2.76, 95% CI = 0.91–5.36, p = 0.022; and OR = 16.24, 95% CI = 9.38–31.99, p < 0.001, respectively).

As presented in Fig. 3 and Table 3, the ROC curves showed that the AUCs of RESOLVE-DWI, CT + MRI, and CT + MRI + RESOLVE-DWI for differentiating benign and malignant tumors were 0.789, 0.851, and 0.961, respectively. Significant differences of AUC were found between CT + MRI and CT + MRI + RESOLVE-DWI (Z = 2.72, p < 0.05), and between RESOLVE-DWI and CT + MRI + RESOLVE-DWI (Z = 2.398, p < 0.05).

Fig. 3
figure 3

ROC curves of RESOLVE-DWI, CT + MRI, and CT + MRI + RESOLVE-DWI for differentiating benign and malignant EAMETs

Table 3 Diagnostic performance of RESOLVE-DWI, CT + MRI and CT + MRI + RESOLVE-DWI in the differentiation between benign and malignant tumors

As shown in Table 4, the inter-reader agreements were substantial to almost perfect for qualitative features (size, ADC), good to excellent for quantitative parameters on the CT and MRI (shape, signal intensity, border, enhancement degree, and bone changes).

Table 4 Inter-reader agreement for qualitative features and quantitative parameters on the CT and MRI

Discussion

The incidence of EAMETs is low, and its signs and symptoms are non-specific [2, 3]. Hence, malignant tumors are often misdiagnosed as benign diseases with similar symptoms, and the treatment is delayed. The morphologic and diffusions characteristics can offer better characterization their anatomic relationship with surrounding structures and physiological processes. Therefore, the imaging technologies are of great significance in the differential diagnosis of benign and malignant EAMETs, contributing to select optimal treatment strategies and improve prognosis of patients. In this study, the presence of ill-defined border, invasive bone destruction, and ADC ≤ 920.33 × 10–6 mm2/s were the most valuable CT and MRI features that may be helpful for differentiating these two entities. With a combination of the morphologic and diffusions characteristics on CT and MRI, optimal sensitivity, specificity, and accuracy was achieved in the discrimination of benign and malignant EAMETs.

Malignant EAMETs typically show aggressive manner, while benign tumors, such as facial nerve schwannoma, often present as an enlargement of the bony delineation and well-defined sharp margin [16, 17]. In this study, the CT and MRI morphological characteristics of margin, and bone change were significantly different between benign and malignant EAMETs, and the lesion with ill-defined margin and invasive bone destruction was more likely to be malignant EAMETs than those without these findings. This finding was consistent with a previous study in head and neck lesions [7, 18, 19]. In addition, no previous studies have compared the enhancement degrees of benign and malignant EAMETs on CT and MRI. In our study, benign tumors showed less intense enhancement than malignant ones. This finding might be explained by increased neovascularization in malignancies, which may play a key role in supplying nutrition for the development and metastasis [20]. But it should be noted that some benign tumor of auricular area, such as glomus tympanicum tumor, may also demonstrate intense enhancement similar to malignancies [21, 22]; thus, comprehensive evaluation of the radiological features on CT and MRI is still important in clinical practice. In addition, our study found benign tumors are of regular shape (such as round or oval shape) but malignant tumor is of irregular shape. However, some benign tumors, such as facial nerve schwannoman, can also exhibit a lytic lesion when they centered on the geniculate ganglion, leading to these benign tumors with an irregular shape [23]. In our study, the signal intensity finding was not sufficiently specific for the differentiation of benign and malignant EAMETs. This may be due to some benign tumors, such as schwannoman, can have intratumoral cystic change and subacute hemorrhage [24]. However, these findings cannot be ignored when diagnosing a lesion involving this area. For example, malignant tumors, such as squamous cell carcinoma commonly appear as heterogeneous lesions on MRI [23].

Diffusion-weighted imaging (DWI) can quantitatively reflect the random motion of water molecules in biological tissues, contributing to noninvasively reflecting cellularity within lesions using the apparent diffusion coefficient (ADC) value [25]. However, the conventional single-shot echo-planar imaging (SS-EPI) DWI is prone to generate susceptibility distortion due to the air-bone interface of the auricular region [26, 27]. The newly developed RESOLVE-DWI approach significantly improves image quality by reducing the susceptibility artifacts, distortion and blurring when compared with SS-EPI [28]. Thus, RESOVLE-DWI can offer more accurate ADC values than SS-EPI for auricular tumors. In this study, we found that the mean ADC value of malignant EAMETs was significantly lower than that of the benign entities. This finding was in accordance with previous study by Yu et al., who found significantly lower ADC value in malignant tumor than that of benign lesion [29]. These may be due to high cellularity in malignancies, which can effectively restrict the water molecule diffusion of tumors, resulting in significantly low ADC values [30, 31]. In the present study, the threshold value of 920.33 × 10–6 mm2/s for ADC provided a relative high sensitivity, specificity, and accuracy for differentiating these two entities. Hence, our study revealed that ADC could be a useful imaging parameter for the differentiation of benign and malignant EAMETs. Furthermore, our results found that the combination of the morphologic and diffusions characteristics on CT and MRI can obtain the significant higher accuracy that the morphologic and diffusions characteristics alone in the differentiation of benign and malignant tumors, indicating that the comprehensive analysis of morphologic and diffusions characteristics may be the optimal approach for discriminating malignant tumors of external auditory canal and middle ear from benign ones.

The present study has several limitations. First, the retrospective study harbored potential bias. Second, the measurement of ADC value did not correlate with histological findings on a site-to-site basis. Thus, further studies are needed to investigate the findings of these correlations. Third, given the wide variety of histologic types of benign and malignant tumors, detailed subgroup analyses of the conventional CT, MRI findings and ADC values should be performed in future.

In conclusion, some CT and MRI characteristics are helpful in identifying malignant EAMETs from benign ones (especially ill-defined margin, invasive bone destruction, and low ADC value), and the combination of morphology and diffusion features on CT and MRI have best diagnostic efficiency for discriminating these two entities, which could be highly valuable for earlier therapy and improving the prognosis for patients with EAMETs.