Introduction

Neurogenic neoplasms including traumatic neuroma, Morton’s neuroma, neural fibrolipoma, nerve sheath ganglion, and peripheral nerve sheath tumors are commonly encountered in daily practice [1]. Among them, benign peripheral nerve sheath tumors (BPNSTs), which can be divided into schwannoma and neurofibroma, account for 10–12% of the benign soft tissue tumors occurring in the general population. However, malignant peripheral nerve sheath tumors (MPNSTs) are relatively rare, with an incidence ranging from 3 to 10% of soft tissue sarcomas, giving consideration to their higher prevalence in patients with neurofibromatosis type 1 (NF-1) [2, 3]. The differentiation of benign from malignant PNSTs is important for appropriate management plans, and as MPNST is highly malignant with a tendency for recurrence and metastasis, its early diagnosis is necessary. When it is certain that a mass is benign, which is the case for most sporadic neurogenic tumors, follow-up and monitoring are reasonable management options. Otherwise, indeterminate lesions, for which there is uncertainty as to whether they are benign or malignant, require subsequent biopsy or surgical removal for accurate histological diagnosis [4]. Unfortunately, biopsy of these tumors may cause severe pain, nerve palsy, or even seeding of malignant tumor cells into visceral organs [5].

Magnetic resonance imaging (MRI) plays a central role in the evaluation and monitoring of soft tissue tumors, and also in biopsy planning. There have been previous studies investigating the use of imaging morphology for the differentiation of benign from malignant PNSTs [6,7,8,9], and these have mostly suggested that some imaging features are helpful for distinguishing the tumor types, but they have not stated a single or combination of findings that can provide a more definite determination. Over the last decade, diffusion-weighted imaging (DWI) has been reported as a useful supplemental technique to conventional MRI for the differentiation of benign from malignant soft tissue tumors, as it provides additional information reflecting the cellularity of tumors, which is represented by water diffusivity in the tissue. A recent study by Demehri et al suggested that minimum apparent diffusion coefficient (ADC) values on DWI along with average tumor diameter could be useful for differentiating BPNSTs from MPNSTs [10].

In this study, we evaluated conventional MRI and DWI of malignant and benign PNSTs of an uncertain nature, and assessed the independent imaging findings on MRI and DWI that were associated with MPNSTs. If possible, accurate characterization of PNSTs by a noninvasive imaging technique could avoid unnecessary biopsies of benign tumors.

Materials and methods

Study population

This retrospective study was approved by our institutional review board, and the requirement for informed consent was waived. Eighty-seven cases of malignant or benign PNSTs located in the trunk and extremities of 83 patients between Sep. 2014 and Dec. 2017 were enrolled in this study. These PNSTs were confirmed histologically by either surgical excision or ultrasound-guided core needle biopsy at one of two different tertiary institutions. The patients underwent preoperative conventional MRI examinations with intravenous contrast enhancement and DWI. Of the total 87 cases, 32 cases of tumors presumed to be definitely benign (maximum size < 3 cm, well-defined margin, a target sign, homogeneous signal intensity, and the absence of necrosis in NF-1) based on findings of previous reports [5, 10]; recurrent tumors after surgical removal; and infiltrative subcutaneous lesions indicating neurofibroma were excluded from further analysis. Finally, the remaining 55 cases in 51 patients, which included 24 men (mean age, 42.6 years; range, 16–72 years) and 27 women (mean age, 54.7 years; range, 16–85 years), were included in this study. Sixteen masses were located in an upper extremity, 23 in a lower extremity, and 16 in the trunk. Of the 55 cases, 18 were malignant and 37 were benign. Table 1 summarizes the demographic characteristics of the patients.

Table 1 Demographic characteristics of the patients with benign and malignant peripheral nerve sheath tumors

MRI examination

All patients underwent MRI examinations on a 1.5-T (Achieva, Philips Healthcare [n = 1]; or Magnetom Avanto, Siemens Healthineers [n = 1]) or 3-T (Ingenia or Achieva, Philips Healthcare [n = 18 and 19, respectively]; or Magnetom Skyra, Siemens Healthineers [n = 16]) scanner, with adjustments to the coils, field of view, and matrix depending on the tumor location. The acquisition parameters varied depending on the anatomical region and the referral base of each institution; however, all patients in this study underwent axial and either coronal or sagittal T1-weighted imaging prior to injection of intravenous contrast material, axial and either coronal or sagittal T2-weighted imaging with or without fat saturation, and contrast-enhanced fat-suppressed T1-weighted imaging (CEFST1WI) in all three planes (Gadovist, Bayer; or Dotarem, Guerbet). Before contrast enhancement, DWI was performed in the axial plane using a spin-echo single-shot echo-planar imaging sequence with b-values of 0, 400, and 800 s/mm2. ADC maps were automatically constructed using a mono-exponential calculation from the DWI. Table 2 summarizes the detailed parameters for each MRI sequence.

Table 2 MR imaging sequence parameters

MRI analysis

Two musculoskeletal radiologists (reader 1 with 20 years and reader 2 with 2 years of experience in musculoskeletal radiology), who were blinded to the clinical information and histologic findings, interpreted the MRI features independently, assessing signal heterogeneity (homogeneous or heterogeneous) on T1- and T2-weighted images, shape (round/ovoid, irregular), and margin (well-defined; partly well-defined, > 70% of margin well-defined; ill-defined, < 70% of margin well-defined). The presence or absence of the split fat sign, fascicular sign, target sign, peritumoral edema, eccentricity to the nerve, and intratumoral hemorrhage/cystic change were recorded. The effects on the adjacent bone (no effect, remodeling, destruction) were also evaluated. The presence or absence of tumor enhancement on CEFST1WI was evaluated, and if present, the percentage of tumor enhancement (< 10%, 11–50%, and > 50%) was recorded.

A third independent radiologist (with 2 years of experience in musculoskeletal radiology) evaluated ADC values and tumor size on MRI. The tumor maximum diameter was measured in three planes (craniocaudal, anteroposterior, and transverse), and the average diameter of each tumor was calculated. ADC values were obtained from one representative axial plane. Side-by-side comparisons between conventional MR imaging and DW imaging were made when drawing the regions of interest (ROIs). ROIs were manually drawn within the solid enhancing tumor portion that also showed hyperintense signal on high b-value DWI and the most restricted area on the ADC map on visual assessment (Fig. 1) [11, 12]. The minimum and mean ADC values were recorded for each tumor using an in-house software package written as a plugin to ImageJ software (http://rsb.info.nih.gov/ij/).

Fig. 1
figure 1

A 57-year-old female with a peripheral nerve sheath tumor in the right ankle. a Sagittal T1-weighted imaging shows a soft tissue mass encasing the tibialis anterior tendon. b CEFST1WI shows that the mass is mainly enhanced in the antero-medial aspect (arrows). c High signal intensity is visible on the medial side of the tumor on the DWI with a b-value of 800 s/mm2 (arrows). This area is included in the solid enhancing areas on CEFST1WI. d Within the areas showing high signal intensity on DWI, the area that appears to be the most restricted area by visual assessment was selected as the ROI (white drawing). Areas considered as artifacts were excluded. The mean ADC value of the ROI was 0.788 × 10−3 mm2/s. This tumor was surgically removed and pathologically confirmed as a malignant peripheral nerve sheath tumor

Statistical analysis

Differences in the imaging characteristics between malignant and benign PNSTs were compared using Fisher’s exact test for categorized variables and the Mann–Whitney U test for continuous variables. P values of < 0.05 were considered to represent significant differences. Inter-reader agreement was evaluated by kappa coefficient (κ). A κ value < 0.20 was taken to indicate poor agreement, 0.21–0.40 fair agreement, 0.41–0.60 moderate agreement, 0.61–0.80 substantial agreement, and 0.81–1.00 excellent agreement. The diagnostic performance of ADC values was assessed using receiver operating characteristic (ROC) curves, with 95% confidence intervals being used to express the statistical precision of the results. Optimal ADC threshold values were obtained using ROC curves.

Univariate and multivariable logistic regression analyses were used to identify factors independently associated with MPNSTs. To deal with rare event situations in a selected population, bootstrap methods were applied for statistical inference and confidence interval building in the regression analyses, especially the logistic regression. Variables were selected as risk factors in the logistic regression model when their relative selection frequency was more than 40% among 1000 bootstrap samples. Statistical analyses were performed with commercially available software (SPSS version 25.0, IBM Corp. and MedCalc version 18.10, MedCalc software). p values of < 0.05 were considered to represent significant differences.

Results

Table 3 summarizes the MRI features evaluated by both readers. Among the imaging features, tumor margin, perilesional edema, and the presence of a split fat sign, fascicular sign, and target sign were significantly different between MPNSTs and BPNSTs. For both readers, perilesional edema was observed in 10 MPNST cases but in none of the BPNST cases with a κ value of 1.0. The inter-reader agreement for MRI features was substantial to excellent, except for signal heterogeneity on T2-weighted images, eccentricity to the nerve, and intratumoral cystic change.

Table 3 Comparison of MRI features between benign and malignant peripheral nerve sheath tumors

The maximum diameters in three planes and the average diameters were significantly larger in MPNSTs than in BPNSTs (craniocaudal, 8.6 ± 1.8 vs. 4.6 ± 2.2, p < 0.001; anteroposterior, 4.7 ± 3.3 vs. 3.2 ± 1.6, p = 0.041; transverse, 5.5 ± 0.4 vs. 3.6 ± 1.7, p = 0.025; average, 6.3 ± 3.4 vs. 3.8 ± 1.7, p = 0.001).

Both minimum and mean ADC values were significantly lower in MPNSTs than in BPNSTs (minimum ADC values, 0.73 ± 0.47 × 10−3 mm2/s vs. 1.12 ± 0.37 × 10−3 mm2/s, p = 0.002; mean ADC values, 0.94 ± 0.37 × 10−3 mm2/s vs. 1.50 ± 0.40 × 10−3 mm2/s, p < 0.0001). The AUC of the minimum ADC value was 0.759 (95% confidence interval [CI]: 0.595–0.923), while that of the mean ADC value was 0.846 (95% CI: 0.715–0.977; Fig. 2). According to the Youden index, the minimum and mean ADC threshold values for achieving the highest diagnostic performance were 0.89 × 10−3 mm2/s and 1.15 × 10−3 mm2/s, respectively.

Fig. 2
figure 2

Receiver operating characteristic curves comparing the diagnostic performance of the mean ADC values (area under the curve, 0.846 [95% confidence interval (CI), 0.715–0.977]) with the minimum ADC values (area under the curve, 0.759 [95% CI, 0.595–0.923]) in the assessment of MPNSTs

In the univariate analysis (Table 4), MPSNTs and BPNSTs showed significant differences in tumor size in the craniocaudal and transverse, and average diameters, and in the minimum and mean ADC values. For both readers, the absence of a split fat sign, fascicular sign, and target sign was significantly associated with MPNSTs. For reader 1 only, the shape and eccentricity to the nerve were significantly associated with MPNSTs, while for reader 2 only, a partly well-defined margin and bone destruction were significantly associated with MPNSTs.

Table 4 Univariate analyses for imaging findings associated with MPNSTs

The bootstrap methods identified mean ADC value and absence of a split fat sign as robust imaging findings (Table 5), and the multivariable logistic regression model was constructed using these two imaging findings (Figs. 3 and 4). The odds ratios of mean ADC values and absence of a split fat sign for MPNST were 25.67 (95% CI: 2.69–245.06) and 13.19 (95% CI: 1.84–94.74) for reader 1 and 117.91 (95% CI, 5.88 to > 999.99) and 49.05 (95% CI, 4.62–520.82) for reader 2. The final multivariable model demonstrated excellent discrimination ability for reader 1, reader 2, and both readers (C-index = 0.90, 0.95, and 0.92 respectively).

Table 5 The selection proportions among 1000 bootstrap samples
Fig. 3
figure 3

A 41-year-old male with peripheral nerve sheath tumor in the left thigh. a Sagittal T1-weighted imaging shows the soft tissue mass with an oval shape and well-defined margin. The split fat sign (arrows) is evident. b Axial fat-suppressed T2-weighted imaging shows heterogeneous signal intensity without a target or fascicular sign. There is no edema around the tumor. c CEFST1WI shows inhomogeneous enhancement. d The ADC map has a mean ADC value of 1.353 × 10−3 mm2/s. The white drawing indicates the ROI for the ADC measurements. This case was read as indeterminate PNST, and the possibility of malignancy could not be ruled out. This tumor was surgically removed and pathologically confirmed as schwannoma

Fig. 4
figure 4

A 31-year-old female with neurofibromatosis type 1. a Coronal T1-weighted imaging shows a number of peripheral nerve sheath tumors in the right upper arm. The soft tissue mass located in the most lateral aspect of the upper arm (long arrow) shows no definite fat split sign distal to the tumor (arrow heads), but a slightly obliterated fat signal in the proximal area of the tumor (short arrow). b Axial fat-suppressed T2-weighted imaging reveals more than three tumors in the right upper arm. The tumor located in the lateral aspect (long arrow) has faintly increased peritumoral signal (short arrow) at the posterior aspect. c This mass (arrow) is homogenously enhanced on CEFST1WI. d On the ADC map, the ROI shows diffusion restriction with an ADC value of 1.093 × 10−3 mm2/s. e The tumor (arrow) demonstrates high metabolic activity on PET/CT (maximum standard uptake value, 12.7). This tumor was surgically removed and pathologically confirmed as a malignant peripheral nerve sheath tumor

Discussion

In our study, tumor size, poor margin, presence of perilesional edema, absence of split fat, target, and fascicular signs, and minimum and mean ADC values were MRI and DWI features showing significant differences between benign and malignant PNSTs. These identified features are mostly consistent with previous studies [5, 7, 10, 13, 14]. We used a bootstrapping method to find imaging findings associated with MPNSTs, which identified absence of a split fat sign and mean ADC as robust imaging findings, and these were then used to construct the multivariable analysis. The combination of these two features in the multivariable models showed excellent discrimination of MPNSTs from BPNSTs, with C-indices higher than 0.9 for both readers. Bootstrapping, a resampling technique using independent sampling with replacement from existing sample data, provided a straightforward approach for calculating an approximate distribution from the empirical distribution function of the observed data [15]. We believe this method to be an appropriate metric for dealing with the imaging features of benign and malignant PNSTs, which were rare event situations in the selected population.

The split fat sign represents a rim of fat surrounding the tumor and reflects the rate at which the tumor grows. The neurovascular bundle is usually surrounded by fat in the intermuscular plane. BPNSTs arising at the neurovascular bundle maintain a rim of surrounding fat. As they slowly enlarge, the normal fat of the intermuscular space is displaced, creating a triangular fatty rind. The fatty atrophy of the surrounding muscles, caused by either a mass effect from slow growing tumors or by denervation changes, may influence the split fat sign [12]. By contrast, MPNSTs tend to be more aggressive and infiltrative in nature, resulting in the obliteration of fat at the ends of the lesion [1, 16, 17]. This suggests why the bootstrap method resulted in the absence of a split fat sign being selected as an imaging finding for MPNSTs for both readers.

Such differences in the growth patterns of BPNSTs and MPNSTs also affect the margins of the tumor and the presence of perilesional soft tissue edema [18, 19]. A previous study revealed that peritumoral infiltration of sarcomas on MRI provides information about the peripheral tumor growth, which is important for determining the prognosis and risk of metastasis [20]. In our series of 55 cases, 10 cases showing perilesional soft tissue edema were all MPNSTs, and this sign showed excellent inter-reader agreement with a κ value of 1.0 for the two readers. None of the BPNSTs showed perilesional soft tissue edema. These results are in agreement with previous studies by Wasa et al and Li et al [5, 7], who reported that lesions with perilesional edema were all MPNSTs, although perilesional edema was not seen in all MPNSTs. We assume that perilesional edema could be a useful finding for determining MPNSTs, although it could not be included in the statistical analysis because of statistical technical issues.

Histologically, neurofibromas consist of a central area of fibrocollagenous tissue and a surrounding area of myxomatous tissue. In schwannomas, more-cellular Antoni type A tissue patterns are usually distributed in the center, while less-cellular Antoni type B patterns are located in the peripheral border [16, 21]. This cellular composition in neurofibroma and schwannoma affects the MRI findings, resulting in low central and high peripheral signal intensities on T2-weighted imaging, the so-called target sign. After contrast enhancement, these tumors typically show central enhancement with peripheral hypointensity [16]. These histological characteristics also affect DWI; the peripheral area of BPNSTs, which is not well enhanced on post-contrast imaging, shows high ADC values, which likely reflect the loose acellular matrix or hypocellular Antoni B area. By contrast, the central fibrocollagenous or cellular Antoni A area may represent low ADC values. The loss of the target sign in MPNSTs may be explained by the replacement of tissue by malignant cells with increased cellularity and low diffusivity, instead of a normal cell distribution [16, 21].

In our results, both the minimum and mean ADC values were significantly lower in MPNSTs than in BPNSTs, with the AUC of the ROC curve of the mean ADC value being slightly higher than that of the minimum ADC value (mean ADC value, 0.846; minimum ADC value, 0.759). The threshold values for the minimum and mean ADC values were 0.89 × 10−3 mm2/s and 1.15 × 10−3 mm2/s, respectively. Furthermore, the mean ADC value was identified as an associated factor in the bootstrapping model. This seems to be in contrast to previously reported results, which showed the minimum ADC value to be a better predictor of the risk of MPNSTs than the average ADC value [10]. We speculate that this difference may be caused by different methods for drawing the ROIs. While Demehri et al selected the largest circle or ellipse ROI enclosed entirely within the tumor [10], we selected ROIs according to the most restricted area within the tumors on visual assessment, because we determined that the areas showing low ADC values are more likely to contain malignant cells [14]. We also used in-house software to measure the ADC values, to avoid technical errors, and to facilitate the inclusion of images from different institutions, which may also have resulted in differences.

A large tumor size is a feature of malignant soft tissue tumors, including MPNSTs [5, 7, 10]. Demehri et al [10] reported that the average tumor diameter of MPNSTs was significantly larger than that of BPNSTs, but that the maximum diameter of MPNSTs was not. They suggested that the reason for this is that BPNSTs may have a longer maximum diameter along the nerve of origin, but that the average diameter incorporating the lengths in the other two planes is smaller than the maximum diameter alone. Our study demonstrated that tumor diameters of MPNSTs and BPNSTs were significantly different in all directions, and that the frequency at bootstrap sampling was relatively higher than those of the other variables; however, the average diameter and the maximum diameters on all three planes were not found to be independent imaging findings.

Our results support the fact that DWI could complement the weakness of conventional MRI for differentiating benign and malignant PNSTs, when both PNSTs show similar morphologic MRI features. DWI can provide the functional information about water diffusivity at the cellular level, resultantly aiding in differentiation of tumor cellularity. Diffusion tensor imaging, which is a subtype of DWI, can measure the direction of water diffusion in addition to the degree of diffusion. This technique with fiber tractography could allow direct delineation of fascicular course and integrity of the nerves, even in cases of PNSTs, which could be helpful in surgical planning for tumor resection [14].

This study has several limitations. First, it is of a retrospective nonrandomized design with a relatively small number of patients. Nevertheless, this study included a larger number of patients than previous similar studies. The MRI examinations were performed using different MR scanners at two different institutions, which could have affected the absolute ADC values. In previous studies, difference in magnetic field strength or MRI construction was reported to affect DWI and ADC values [22,23,24]. Second, there was a relatively high rate (32.7%, 18/55) of MPNSTs, which was much higher than the incidence in the general population. This is probably because we included cases that were pathologically confirmed at the tertiary institutions, which might have limited the generalizability. Third, our identified ADC cut-off values were not tested in a separate cohort to verify that they were optimal, nor was the predictive value of the combined MRI features tested on an independent data set. Further prospective multicenter studies with larger numbers of patients are warranted to consolidate our findings.

We conclude that benign and malignant PNSTs showed different imaging features on conventional MRI and DWI. Among them, the absence of a split fat sign and the mean ADC value were found to be associated with the diagnosis of MPNSTs. The combination of these two findings could be useful for discriminating between benign and malignant PNSTs.