Introduction

Magnetic resonance (MR) imaging is the primary modality of choice to identify, characterize, and guide the treatment of musculoskeletal soft tissue sarcomas and malignancies [1,2,3,4]. A biopsy of the tumor, followed by histological analysis, provides a final diagnosis of soft tissue sarcoma [5]. The utilization of MR imaging to characterize soft tissue tumors has reached the extent of moderate accuracy in differentiating between benign and malignant tumors, as well as various histological grades of tumors [6,7,8]. There is considerable room for the discovery of techniques that can characterize and differentiate soft tissue sarcomas within the categorization of benign and malignancy. Treatments for different malignant tumors, ranging from liposarcomas to malignant peripheral nerve sheath tumors (MPNST) vary greatly depending on their location and underlying histology [9,10,11].

Diffusion-weighted imaging (DWI) and derived apparent diffusion coefficient (ADC) mapping are currently utilized to evaluate tumor cellularity in musculoskeletal soft tissue lesions. DWI interrogates free and restricted water diffusion within the tissue of interest. Tumors with lower free water content due to increased cellularity or protein content restrict diffusion [8]. These differences become relevant when comparing different tissue pathologies and their cellular compositions. Furthermore, the degree of diffusion as quantified using ADC mapping has been shown to differentiate between benign and malignant soft tissue tumors [12]. Difficulties in interpreting DWI and ADC measurements may arise due to possible image distortions and artifactual signals in the measurements [13]. However, ADC measurements have been found to be reproducible across readers and among different magnet strengths [8,9,10,11,12]. An analysis of soft tissue sarcomas using ADC measurement for tumor histology diagnosis characterization has not been explored.

We conducted an analysis of histology-proven soft tissue sarcomas blinded to any relevant information apart from their MR imaging findings to identify trends in their ADC measurements with respect to tumor type classification and other tumor characteristics. Inter-reader reliability analysis was also performed. Our hypothesis was that DWI and ADC measurements offer differentiation and characterization of various common malignant soft tissue tumors and soft tissue sarcomas.

Materials and methods

This study was a cross-sectional retrospective analysis conducted using relevant institutional IRB approval and regulations. Patient informed consent was waived by the institution.

Patient population

An electronic search was conducted across two institutions specifically looking for musculoskeletal extremity contrast-enhanced MR imaging studies performed for soft tissue sarcomas from 2013 to 2021 from PACS (picture archiving and communications system) and sarcoma board lists. Inclusion criteria included both genders, ages between 18 and 100 years old, upper or lower extremity malignant soft tissue sarcomas and tumors, established final histology on biopsy or tumor resection, and a complete series of MR images including T1W, T2W, contrast imaging, DWI prior to biopsy, and ADC mapping. Exclusion criteria included benign tumors, metastasis, previous surgical resection, or neoadjuvant treatment. Patient demographics included age, gender, and location of the tumor. Retroperitoneal sarcomas and gynecological sarcomas were excluded.

MR imaging

The primary MR imaging modality varied between the different scanners across both institutions (Siemens, GE, and Philips) using institution-based protocols for musculoskeletal tumor imaging. Both 1.5-T scanners and 3-T scanners were used to perform these scans (1.5 T: n = 49, 3.0 T = 29). Each imaging series included conventional imaging (T1W, fsT2W), contrast-enhanced imaging (3D volumetric fat-suppressed isotropic voxel, pre- and post-contrast imaging), and DWI/ADC were obtained in all cases as outlined in the inclusion criteria. The parameters were as follows: T1-weighted (repetition time (TR)/echo time (TE), 600–715/9–11 ms; section thickness, 4 mm; axial and sagittal planes), fat-suppressed (fs) T2-weighted (TR/TE, 3600–4500/50-60 ms; section thickness, 4 mm; axial plane), unenhanced and gadolinium-enhanced fsT1W sequence (isotropic resolution, repetition time/echo time (TR/TE), 4.6–6.3/1.4–1.5; section thickness, 1.5 mm isotropic; modified Dixon fat suppression; coronal plane acquisition with axial and sagittal reconstructions; 0.1 mmol/kg gadolinium-based contrast agent). The axial DWI was performed using single-shot echo-planar imaging (TR/TE, 8000–11,700/70–92; section thickness, 4 mm; b values, 50, 400, and 600–800 s/mm2 (600 s/mm2: n = 14, 800 s/mm2 n = 64); flip angle, 90°; matrix, 128 × 128; fat suppression, spectral adiabatic inversion recovery; time of acquisition, 5 min 10 s). The ADC value was calculated using all b values, and the ADC map was automatically generated from the scanner.

Data evaluation

Using conventional MR images and DWI, ADC measurements were performed by two readers. These two readers (R1: musculoskeletal fellowship-trained faculty, R2: current musculoskeletal radiology fellow, 6 years post ACGME-equivalent radiology residency training completed internationally) evaluated anonymized MR imaging data sets blinded to the histology-determined diagnosis as well as each other’s measurements and diagnosis predictions. Measurement of the maximum longitudinal dimension of the tumor was performed by the first reader, R1. R1 also recorded the tumor signal intensity alterations and tumor heterogeneity on T1W and fsT2W, intralesional hemorrhage, non-enhancing necrotic-cystic areas, peritumoral edema, post-contrast enhancement, and the maximum longitudinal dimension of the tumor, as mentioned before. Both R1 and R2 measured the mean and minimum ADC values for the entire tumor lesion, as well as for the hypointense area of the lesion on the ADC map. The ADC values were calculated using a free-hand measure of the region of interest. Furthermore, both R1 and R2 attempted to guess the final diagnosis of the lesion based on MR images and DWI.

Statistical analysis

Intraclass correlation (ICC) and kappa calculations were utilized to assess inter-reader agreements between continuous and categorical variables. The continuous variables were the mean and minimum ADC values of the whole tumor and the darkest (most hypointense) area of the tumor, and the categorical variable was the reader diagnosis for determining inter-reader agreements. The agreement was considered poor with ICC/kappa < 0.4; fair, 0.4–0.59; good, 0.6–0.74; and excellent, 0.75–1. Non-parametric Kruskal–Wallis test was used to assess differences in the ADC values between histology-confirmed tumor diagnoses. Non-parametric Wilcoxon rank-sum tests were also conducted between specific histology-confirmed tumor diagnoses and the remaining field of tumors to assess additional differences. Comparisons were made between the sample field as a whole as well as isolating analysis of many of the different tumor types versus the remainder of the field. More specifically, imaging features were also tested between liposarcomas and non-liposarcomas, and between myxoid tumors and non-myxoid tumors. The significance level was set at 0.05. All analyses were done in R 4.0.2 (R core team, Vienna, Austria).

Results

Patient population

In the final inclusion sample, 78 patients with 78 musculoskeletal soft tissue malignancies were included as primary de-novo malignancies. The final diagnoses were determined by surgical histopathology or percutaneous core biopsy. the most common histologies were undifferentiated pleomorphic soft tissue sarcomas, myxofibrosarcomas, and synovial sarcomas, while a few samples were included for other subtypes as well. There was nearly a 1:1 ratio of men to women (47:53). There was only one grade 1 liposarcoma. Other liposarcomas were higher grades (IIs and IIIs) (Table 1).

Table 1 Demographics information

Conventional MR imaging analysis

Among different histological tumor grades, the longitudinal tumor dimension was not determined to be significantly different among the 15 sarcoma subtypes (P = 0.583), with 1 Burkitt lymphoma registering the largest size (41.6 cm) and one epithelioid sarcoma registering the smallest size (5.6 cm). Among the other categorical signal observations, only T1W signal characteristics and the presence of hemorrhage were determined to be different across the samples (P = 0.034 and P =  < 0.001, respectively). More notably, liposarcomas were relatively consistent in having an absence of hemorrhage (9/11, 81.8%) while undifferentiated pleomorphic sarcomas consistently showed intralesional hemorrhage (18/20, 90%). Liposarcomas were shown to be heterogenous on T1W signal characteristics with a hyperintense T1W signal when compared to a tumor not classified as a liposarcoma (P = 0.046 and P = 0.010, respectively). Myxoid tumors were similarly shown to be heterogenous on T2W signal characteristics when compared to non-myxoid tumors (P = 0.044).

DWI analysis

Among the 78 malignant soft tissue tumors and sarcomas, the mean ADC value for the whole tumor and the mean ADC value for the darkest region (most hypointense-lowest signal area) of the tumor were determined to be different by both readers (whole tumor: R1 P = 0.010, R2 P = 0.016; darkest area: R1 P = 0.021, R2 P = 0.011). A separate analysis was conducted between the three tumor types with the largest sample size (undifferentiated pleomorphic sarcoma, myxofibrosarcomas, and synovial sarcomas) along with the remaining tumors grouped as a fourth conglomerate. Both readers found a difference in the mean ADC value for the whole tumor between these four groupings (P < 0.05). When comparing individual tumor classifications against the rest of the samples, lymphomas registered lower ADC values in all four dependent, continuous variables for both readers (P < 0.05). Consistently lower ADC values were demonstrated when comparing synovial sarcomas to the remainder of the sample, but only with respect to the mean ADC value of the whole tumor (P < 0.05 for R1 and R2). Both readers found significant differences in ADC values when differentiating between liposarcomas and non-liposarcomas when analyzing the whole tumor ADC mean, darkest area ADC mean, and darkest area ADC minimum (P < 0.05 for R1 and R2). Both readers found significant differences in ADC values when differentiating between synovial sarcomas and non-synovial sarcomas with respect to the minimum ADC value of the whole tumor (P < 0.05 for R1 and R2). Notable cases are displayed in Figs. 1, 2, 3 and 4, and full statistics are available in Tables 2, 3, 4, and 5.

Fig. 1
figure 1

A 46-year-old male with medial calf swelling and pain. Proven soft tissue lymphoma. A Axial T1W image, B axial T2W image of Dixon water map, C DWI, b = 800 s/mm2, D ADC map, and E post-contrast fsT1W image. Notice mildly heterogeneously enhancing bi-lobed mass (arrows) in the medial subcutaneous tissues with overlying skin thickening. The mean and minimum ADC values of the whole tumor are 0.50 and 0, while the mean and minimum ADC values of the darkest area were 0.25 and 0.20

Fig. 2
figure 2

A 42-year-old female with an undifferentiated pleomorphic soft tissue sarcoma of the left thigh. A Axial T1W image, B axial T2W image of Dixon water map, C, DWI, b = 800 s/mm2, D ADC map, and E post-contrast fsT1W image. Notice a large heterogeneously enhancing mass (arrows) of the left quadriceps and adductor musculature with cystic/necrotic changes. The mean and minimum ADC values of the whole tumor are 2.50 and 1.50, while the mean and minimum ADC values of the darkest area were 2.10 and 1.65

Fig. 3
figure 3

A 52-year-old male with a myxofibrosarcoma of the left femur. A Axial T1W image, B Axial T2W image of Dixon water map, C DWI, b = 800 s/mm2, D ADC Map, and E post-contrast fsT1W image. Notice a large heterogeneously and peripherally enhancing mass (arrows) of the left quadriceps musculature with myxoid changes. The mean and minimum ADC values of the whole tumor are 1.85 and 0.30, while the mean and minimum ADC values of the darkest area were 0.90 and 0.65

Fig. 4
figure 4

A 38-year-old female with a high-grade liposarcoma of the extensor compartment of the right forearm. A Axial T1W image, B axial fsT2W image, C DWI, b = 800 s/mm2, D ADC map, and E post-contrast fsT1W image. Notice a large heterogeneously enhancing mass (arrows) of the right forearm subcutaneous tissues with heterogeneity and areas of increased T1W signal alteration. The mean and minimum ADC values of the whole tumor are 1.80 and 0.385, while the mean and minimum ADC values of the darkest area were 1.25 and 1.20

Table 2 Apparent diffusion coefficient measurements: all tumors
Table 3 Apparent diffusion coefficient measurements: liposarcomas vs. non-liposarcomas
Table 4 Apparent diffusion coefficient measurements: myxoid vs. non-myxoid
Table 5 Apparent diffusion coefficient measurements: synovial vs. non-synovial

Inter-reader analysis

The interobserver agreement between the two readers was good to excellent for all four continuous measurements (ADC values) (ICC = 0.65–0.98). The interobserver agreement between the two readers was poor with respect to arriving at the same diagnosis given the same case (ICC = 0.31). Figure 5 displays a Bland–Altman plot of mean ADC values of the whole tumor compared between the two readers.

Fig. 5
figure 5

Bland Altman plot of mean ADC values of the whole tumor when comparing the two readers’ measurements

Discussion

This study provides insight into the utility of DWI and its associated ADC values in differentiating among various malignant soft tissue tumors and sarcoma histologies using MRI. While we generally did not find systematic ADC differences among various soft tissue malignancies and though limited in scope, there was a demonstrated difference in ADC values of the whole tumor and the darkest (most hypointense/lower signal) area across a wide array of soft tissue sarcoma subtypes by both readers (P < 0.05). Calculating a similar value when reducing the number of classifications of tumors further validated that utilizing the mean ADC value for the whole tumor shows differences between a few histological classifications. More interestingly, mean ADC values for the whole tumor were consistently lower for synovial sarcomas and lymphomas when compared to other soft tissue sarcomas. Utilizing circumstantial factors, such as age and tumor location, may enhance the utility of the ADC values to instantly provide a more plausible diagnosis of the tumor before histology can confirm the diagnosis. Using ADC, authors have been able to differentiate grade I from grade III sarcomas but not grade I from grade II [8]. This study focused on the unique aim of differentiating among various soft tissue malignancies without significant success. The work suggests the limited capability of DWI for histologic characterization apart from a few select tumors with packed cellularity, e.g., round cell tumors like lymphoma or synovial sarcomas. Ahlawat et al. demonstrated the utility of DWI in benign vs. malignant tumors, and Demehri et al. showed the utility of ADC in benign versus malignant peripheral nerve sheath tumors [12,13,14]. Our results are similar to previously reported works of Ashikyan et al. and Santos et al. using limited sample sizes and expand to the body of knowledge in this domain [15, 16]. Ashikyan et al. had also reported a minimum ADC of 0.8, and the mean ADC was 1.2 among a case series of synovial sarcomas.

The other notable finding of this study was the good to the excellent agreement seen in ADC values (ICC = 0.65–0.98), but poor agreement in diagnosis prediction by the two readers (ICC = 0.31). This implies that while multiple readers can accurately measure ADC values within a reasonable margin of error, the image alone may lend itself to widely varying interpretations and diagnosis. Myxoid soft tissue malignancies tend to exhibit higher mean ADC. The minimum ADC for myxofibrosarcomas in our study was 0.5–1.2. Although minimum ADC values were variable among different sarcomas (0.0–1.3), these were again found to be the lowest for lymphoma.

This study has a few limitations. First, soft tissue sarcomas and malignancies are uncommon, and many are diagnosed on histology before MRI with DWI is obtained. As a result, numerous tumor categorizations hold only one or a few cases, which limited the interpretation of the Kruskal–Wallis test. To minimize the impact, analyses were performed with clinically meaningful dichotomizations of the subtypes. Second, only a few centers routinely do DWI. Thus, different magnets are commonly used for their imaging. It is hoped that in the future, a multi-center effort can lead to the collation of a larger sample of cases with more uniform imaging and produce new knowledge in this domain.

In conclusion, diffusion-weighted imaging and apparent diffusion coefficient currently offer limited insight in being able to differentiate among soft tissue malignant tumors and sarcomas while preserving good to excellent inter-reader agreement in measurements. T1W and T2W signal characteristics also offer limited insight in differentiating between soft tissue malignancies.