Introduction

Thymomas are benign or low-grade malignant tumours of the thymic epithelium and represent up to half of all neoplasms of the anterior mediastinum [16]. The histological classification proposed by the World Health Organization (WHO) and the Masaoka-Koga staging system, based on surgical findings, is currently used for treatment planning and significantly correlate with prognosis [18]. The Jeong simplification of WHO classification includes low-risk thymomas, high-risk thymomas, and thymic carcinomas [9]. The Masaoka-Koga system includes early (stage I–II) and advanced thymomas (stage III–IV) [8]. The correct prediction of early or advanced disease before surgery at imaging is essential to plan treatment strategies, because the latter receives neo-adjuvant chemotherapy [2]. Unfortunately, there is no perfect correlation among Jeong and Masaoka-Koga classifications because, although low-risk thymomas are frequently grouped in the early-disease group, percentages of high-risk thymomas are reported at a rate up to 45 % in the early stage disease group [10].

Limited data are available on quantitative assessment of thymic epithelial tumours (TETs) by using diffusion-weighted magnetic resonance imaging (DW-MRI) from small cohorts of studies which considered various anterior mediastinal tumours, although those studies did not attempt to differentiate thymomas based on WHO and Masaoka-Koga classifications by using the ADC [1118]. More recently, a study involving 30 patients with TETs has demonstrated the ability of ADC in differentiating low-risk from high-risk tumours, and early from advanced disease [19]. Despite such promising results, that study is influenced by a strong correlation between Jeong and Masaoka-Koga classifications and by the inclusion of thymic carcinomas in the advanced disease group which leads to further decrease of the mean ADC value of this group compared with the early disease group [19, 20]. Moreover, perfusion-sensitive ADC values have been used for analyses by including the b value of 0 sec/mm2 for ADC computation, obtaining a measured signal which reflects not only pure molecular water diffusion, but also “pseudo-diffusion” resulting from fast-moving water molecules from blood flow in vessels and capillaries [2125]. Hence, the ability of DW-MRI in distinguishing thymomas based on WHO and Masaoka-Koga classifications remains uncertain and optimal ADC cut-points should be validated.

To date, no study has evaluated the predictive value of ADC of thymomas on disease-free survival (DFS) of patients with a thymoma for optimizing clinical decision making at diagnosis and improving patient care, although it is used as independent predictor of DFS and overall survival in different organs [2325].

The aim of our study was to investigate the reliability of DW-MRI in predicting histological types of thymomas according to WHO classification, and whether DW-MRI can differentiate low-risk from high-risk thymomas, and early from advanced disease by using perfusion-free ADC measurements, with proposed optimal cut-point values for each differentiation. Moreover, we evaluated the prognostic value of ADC of a primary tumour for predicting DFS.

Materials and methods

Patient population, reference diagnosis and patient outcome

From March 2007 through December 2014, 41 consecutive subjects [19 men, 22 women; all subjects: age range 28–80 ys, mean age ± standard deviation (SD) 56.9 ys ±14.0 ys; male: 38–80 ys, 57.8 ys ± 13.3 ys; female: 28–80 ys, 56.2 ys ± 14.7 ys), who underwent thymomectomy and no neo-adjuvant treatments, were evaluated with MRI. Institutional review board approval was obtained and data were prospectively assessed.

The histopathologic results from surgical specimens were the standard of reference for each case classified based on WHO histological classification, simplified WHO classification proposed by Jeong, and the Masaoka-Koga staging system [79]. Tables 1 and 2 report definitions for both classifications with corresponding sample size.

Table 1 World Health Organization (WHO) histological classification of thymic epithelial tumours (2004)*
Table 2 Masaoka-Koga staging system for thymoma*

For analyzing prognosis, recurrent disease was defined as evidence of local, regional, or distant disease determined by follow-up imaging and confirmed by biopsy. DFS was determined as the time interval between the date of surgery and date of documented disease recurrence or last follow-up for subjects with no recurrence. No deaths related to tumour occurred during the observational period, thus, no analysis on overall-survival was performed.

MRI protocol

DW-MRI was performed on a 1.5-Tesla (T) unit (Intera-Achieva, Philips Healthcare, Best, The Netherlands) using a single-shot spin-echo/echo-planar sequence with the following parameters: 360–500 mm field of view, 116 × 224 image matrix, 45 echo planar imaging factor, 5-mm slice thickness, 0.5-mm intersection gap, 4500-msec repetition time, 62-msec echo time, 1600-Hz/pixel bandwidth. Four signals were acquired per image with diffusion sensitizing gradients applied in three orthogonal planes with b values of 0, 150, 500, and 800 sec/mm2. All sequences were performed using a parallel imaging acquisition technique and sensitivity encoding reconstruction (reduction factor, 2). Fat was suppressed by placing a frequency-selective radiofrequency pulse before the pulse sequence.

For each case, the entire lesion was included in the MRI. The mean time between MRI and surgery was 10 days (range, 1–21 days).

MR image analysis

All exams were evaluated independently by two radiologists (A.M.P., D.G.) who were blinded to the patients’ identifications and clinical data. The longest thymoma diameter was measured at the widest dimension on transverse cross-sectional images. For quantitative assessment, ADC maps were reconstructed after excluding the b value of 0 sec/mm2 to suppress tissue perfusion for obtaining perfusion-free ADC values by eliminating vascular contributions to the calculated ADC which would increase the ADC value [2125]. For each case, within axial sections which included the tumour on ADC maps, a region of interest (ROI) was manually drawn along the circumference of the tumour, but within boundaries, to include only the solid portion of tissue, taking care to exclude obvious cystic and/or necrotic areas (which could give falsely elevated ADC values) that were identified on corresponding T2-weighted images. The mean ADC was calculated from all values obtained for each case.

Statistical analysis

Categorical parameters and continuous numeric data with a normal distribution of demographic characteristics and imaging findings were compared between defined groups of each classification by using the Fisher’s exact test and a Student’s t-test, respectively. Data concerning ADC levels were expressed as mean values ± the standard deviation (SD) and ranges, because of their normal distributions proved by the Shapiro-Wilk test. The inter-reader agreement of ADC measurements was evaluated between readers by using the intra-class correlation coefficient (ICC, range: 0–1). The ADC levels were compared between defined groups of each classification (Jeong and Masaoka-Koga) and differences in mean values were analyzed using the Student’s t-test. Moreover, for WHO classification, differences in the mean ADC value of each histological type were assessed by using one-way analysis of variance (ANOVA) with multiple comparisons of means. Logistic regression models were estimated for evaluating the ability of ADC levels to discriminate between defined groups of Jeong and Masaoka-Koga classifications. Discrimination abilities were evaluated by the area under the receiver operating characteristic curve (AUROC) and optimal cut-points were determined according to the Youden index with computation of diagnostic accuracy. For WHO classification, AUROC with optimal cut-point and relative diagnostic accuracy were also estimated to assess the ability of ADC to differentiate type-B3 from other histological types. Additional analyses aimed to control for potential confounding factors (age, gender, lesion size) were performed for both classifications by including such factors in logistic regression models. The two-way ANOVA was used to examine the influence of defined groups of Jeong and Masaoka-Koga classifications together with their interaction. Lastly, the Cramér’s V (range: 0–1) was estimated to assess the association between Jeong and Masaoka-Koga classifications in our cohort. Considering DFS, the mean ADC levels were compared between two groups divided into subjects with and without recurrence at last follow-up by using the Student’s t-test. The ROC curve analysis was used to determine optimal cut-point values of individual prognostic factors for DFS (age, gender, lesion size, resection status, WHO type, Masaoka-Koga stage, ADC), with computation of AUROC and diagnostic accuracy. Prognostic factors were analyzed by univariate analysis using the Kaplan-Meier method for estimation of DFS rates and differences between DFS curves, obtained by using the optimal cut-point for risk stratification of recurrence, were analyzed by the log-rank test. Multivariate analysis of prognostic factors was performed using the Cox proportional hazards regression model by including significant parameters obtained from univariate analysis in order to estimate the overall effect of ADC levels on DFS for ascertaining the independent prognostic value of ADC. A p < 0.05 was considered indicative of a statistically significant difference. All analyses were performed with “R” and MedCalc software.

Results

Demographic data and imaging findings at conventional MRI

Demographic characteristics and imaging findings of patients and sample size of defined groups are detailed in Table 3. For both classifications, there was a significant difference in mean age and size of tumours in the low-risk and early-disease group compared with those in the high-risk and advanced-disease group, respectively.

Table 3 Demographic characteristics and imaging findings of the 41 enrolled patients

Table 4 reports the relation between Jeong classification and Masaoka-Koga staging. Low-risk thymomas were mostly classified in the early-disease group (16/19, 84.2 %), whereas only 15.8 % (3/19) of cases were defined as advanced disease. Conversely, a substantial percentage of high-risk thymomas were grouped in the early-disease group (8/22, 36.4 %) with the remaining 63.6 % (14/22) of high-risk thymomas in the advanced-disease group. The estimation of Cramér’s index (V = 0.518) indicated a mild association between Jeong and Masaoka-Koga classifications in our cohort.

Table 4 Correlation between WHO histological classification and Masaoka-Koga staging system

Inter-reader variability for ADC measurements

For ADC measurements, the inter-reader agreement was high with an ICC of 0.885 (95 % confidence interval [95 % CI]: 0.842–0.911). Therefore, for each case, the mean ADC value of readers was used for further analyses.

WHO classification: differences in ADC between histological types

Table 5 and Fig. 1a report ADC values and box plots of each histological type of thymoma according to WHO classification. The higher mean and maximum ADC values were found for type-AB (1.62 ± 0.22 and 2.01 × 10-3 mm/sec, respectively), whereas the lower mean and minimum ADC values were found for type-B3 (1.05 ± 0.17 and 0.73 × 10-3 mm/sec, respectively). The one-way ANOVA demonstrated that ADC of type-B3 was significantly lower compared with all other types (B3 vs. B2, p = 0.023; B3 vs. B1, p = 0.003; B3 vs. AB, p < 0.0001; B3 vs. A, p = 0.0003). Contrariwise, no significant differences in mean ADC values were found between the other types, except for type-B2 compared with type-AB (p = 0.004).

Table 5 Mean ADC values of thymomas according to WHO histological classification
Fig. 1
figure 1

Box-and-whiskers plots show ADC values for all cases divided into groups according to the WHO classification (a), the Jeong simplification of WHO classification (b), and the Masaoka-Koga staging system (c). The boxes represent data from the 25th to the 75th percentile. The horizontal line inside the boxes is the median (50th percentile). The vertical lines with whiskers indicate the range from the largest to smallest observed data point, within 1.5 × IQR of the higher and lower quartile, respectively. The values outside this range (b, high-risk; c, advanced and early) are displayed as individual points. The continuous horizontal red line in the graphs shows the optimal cut-point value for discrimination among defined groups deriving from the ROC analysis. Considering the Jeong and Masaoka-Koga classifications, a larger overlap zone is appreciable for Masaoka-Koga staging system (from 0.99 to 1.81 × 10-3 mm2/sec) compared with Jeong classification (from 1.24 to 1.75 × 10-3 mm2/sec)

The AUROC of ADC in discriminating B3 from all other types was 0.925 (95 % CI: 0.841–1.000; Fig. 2a). The optimal cut-point was identified as an ADC of 1.243 × 10-3 mm2/sec (Youden index J = 0.764). Applying this cut-point for the identification of types other than B3, sensitivity was 87.5 % (28/32; 95 % CI: 71.9–95.0), specificity was 88.9 % (8/9; 95 % CI: 56.5–98.0), and accuracy was 87.8 % (36/41; 95 % CI: 74.5–94.7).

Fig. 2
figure 2

Receiver operating characteristic (ROC) curves indicate the sensitivity, specificity, and diagnostic accuracy of ADC-based differentiation between the defined groups according to the WHO classification (a, B3 vs. all other types), the Jeong simplification of WHO classification (b, low-risk vs. high-risk), and the Masaoka-Koga staging system (c, early vs. advanced). For each curve, the optimal cut-off value is highlighted with the relative values of sensitivity, specificity, and diagnostic accuracy

Jeong classification: differences in ADC between groups (low-risk vs. high-risk)

The mean ADC was significantly different between groups (1.58 and 1.21 × 10-3mm/sec, respectively, p < 0.0001; Table 6; Figs. 3 and 4). Figure 1b shows the box plots comparing the ADC values for the low-risk and high-risk groups.

Table 6 Mean ADC values of low-risk versus high-risk and early versus advanced thymoma
Fig. 3
figure 3

Low-risk and early thymoma in a 56-year-old woman (WHO type AB, Masaoka-Koga stage II). a Transverse T2-weighted fat-suppressed MR image shows a soft tissue mass (arrow; maximal diameter, 59 mm) in the left side of the anterior mediastinum, adjacent to the aortic arch, with well defined borders and slightly heterogeneous high signal intensity. b The ADC map of DW-MRI demonstrates grey appearance of the mass (arrow), similar to that of muscles, with a mean ADC of 1.473 × 10-3 mm2/sec (standard deviation, 0.184; minimum and maximum value, 1.037 and 1.851 × 10-3 mm2/sec, respectively). c Photomicrograph of a representative histological section detects tissue of high cellular density characterized by a mixed composition of pale lymphocyte-poor areas with spindle cell morphology (A, WHO type A component) and dark limphocyte-rich areas (B, WHO type B component). The tumour is encapsulated (C, capsule) although microscopic invasion by small clusters of tumour cells is appreciable into the thin capsule (dotted arrows) and into the surrounding mediastinal fatty tissue (arrows; F, mediastinal fat; haematoxylin and eosin stain, original magnification × 40)

Fig. 4
figure 4

High-risk and advanced thymoma in a 28-year-old woman (WHO type B3, Masaoka-Koga stage IVb for lymph node metastasis). a Transverse T2-weighted fat-suppressed MR image demonstrates an oval-shaped solid mass (arrow; maximal diameter, 91 mm) in the anterior mediastinum, adjacent to the pulmonary trunk, with intermediate and high signal intensity. b The ADC-map of DW-MRI demonstrates heterogeneous dark appearance of the mass (arrow), lower than that of muscles, with a mean ADC of 1.006 × 10-3 mm2/sec (standard deviation, 0.235; minimum and maximum value, 0.190 and 1.457 × 10-3 mm2/sec, respectively). c Photomicrograph of a representative histological section shows islands of thymoma (T) composed of a prominent population of epithelial cells mixed with a minor component of lymphocytes. Tumour cells are closely adjacent or interspersed with lung cells (L), consistent with lung invasion (haematoxylin and eosin stain, original magnification × 40)

Considering the logistic regression model for the groups, the probability of finding high-risk thymoma decreased with the increase of ADC [odds Ratio (OR) per 0.01 increase, 0.928; 95 % CI: 0.879–0.965; p = 0.001]. The AUROC of ADC in discriminating between groups was 0.864 (95 % CI: 0.754–0.973; Fig. 2b). The optimal cut-point was identified as an ADC of 1.309 × 10-3 mm2/sec (Youden Index J = 0.584). Applying this cut-point for the diagnosis of low-risk thymoma, the sensitivity was 94.7 % (18/19; 95 % CI: 75.4–99.1), specificity was 63.6 % (14/22; 95 % CI: 43.0–80.3), and accuracy was 78.1 % (32/41; 95 %CI: 63.3–88.0). In the adjusted model, the OR was 0.930 per an ADC increase of 0.01 (95 %CI: 0.873–0.973; p = 0.007).

Masaoka-Koga classification: differences in ADC between groups (early vs. advanced)

The mean ADC differed significantly between groups (1.43 and 1.31 × 10-3 mm/sec, respectively, p = 0.016; Table 6; Figs. 3 and 4). Figure 1c shows the box plots comparing the ADC values for the early and advanced group.

Considering the crude logistic regression model for the groups, the probability of finding advanced disease decreased with the increase of ADC (OR per 0.01 increase, 0.960; 95 % CI: 0.932–0.992; p = 0.02). The AUROC of ADC in discriminating between groups was 0.730 (95 % CI: 0.558–0.903; Fig. 2c). The optimal cut-point was identified as an ADC of 1.243 × 10-3 mm2/sec (Youden index J = 0.505). Applying this cut-point for the diagnosis of early thymoma, sensitivity was 91.7 % (22/24; 95 % CI: 74.2–97.7), specificity was 58.8 % (10/17; 95 % CI: 36.0–78.4), and accuracy was 73.2 % (30/41; 95 % CI: 59.4–87.0). In the adjusted model, the OR was 0.98 per 0.01 ADC increase but it did not yield statistical significance (p = 0.31), as was the case for the two-way ANOVA (p = 0.38).

Lastly, considering low-risk thymomas, there was insignificant difference between the mean ADC of low-risk thymomas classified in the early-disease and advanced-disease group (early: 13 cases, 1.55 ± 0.22 × 10-3 mm/sec; advanced: 3 cases, 1.67 ± 0.16 × 10-3 mm/sec; p = 0.42). Similarly, for high-risk thymomas, there was no significant difference in mean ADC among the advanced-disease and early-disease groups (advanced: 14 cases, 1.23 ± 0.27 × 10-3 mm/sec; early: 8 cases, 1.17 ± 0.13 × 10-3 mm/s; p = 0.56; Figs. 4 and 5). Furthermore, there was no difference in mean ADC among type-B2 thymomas assigned in the early and advanced groups (p = 0.08), as was the case among type-B3 thymomas (p = 0.95).

Fig. 5
figure 5

High-risk and early thymoma in a 50-year-old man (WHO type B2, Masaoka-Koga stage IIb). a Axial T2-weighted fat-suppressed MR image detects an oval-shaped unilateral soft tissue mass (arrow; maximal diameter, 74 mm), adjacent to the aortic arch, with lobulated margins and slightly heterogeneous high signal intensity. b The ADC-map of DW-MRI demonstrates heterogeneous dark appearance of the mass (arrow) with a mean ADC of 1.175 × 10-3 mm2/sec (standard deviation, 0.289; minimum and maximum value, 0.289 and 2.072 × 10-3 mm2/sec, respectively). c Photomicrograph of a representative histological section shows a tissue of high cellular density characterized by mixed composition of lymphocytes and predominant epithelial cells resembling the normal thymic cortex. The tumour is completely encapsulated (C, capsule) and microscopic areas of tumour cells are appreciable into the thick capsule (haematoxylin and eosin stain, original magnification × 40)

Patient outcome and DFS analysis

Follow-up ranged from 6 to 94 months (median, 36 months) and seven subjects (17.1 %, 7/41) had recurrent disease (median, 22 months; range, 12–51 months; Table 7). DFS rates at last follow-up for the low-risk and early-disease group were 94.7 % (18/19) and 91.7 % (22/24), respectively, whereas those for the high-risk and advanced-disease group were 72.7 % (16/22) and 70.6 % (12/17), respectively.

Table 7 Details of patients with disease recurrence during the observational period

Optimal ADC cut-point for group discrimination (recurrence vs. disease free) was determined using ROC analysis in ADC of 1.299 × 10-3 mm2/sec (Youden Index J = 0.622; AUROC = 0.834). Applying this cut-point, for prediction of recurrence, sensitivity was 85.7 % (6/7; 95 %CI:0.744–0.964), specificity was 76.5 % (26/34; 95 %CI:0.631–0.897), and accuracy was 78.0 % (32/41; 95 %CI: 0.651–0.909; Fig. 6). By using the optimal cut-point as dichotomization threshold for Kaplan-Meier survival analysis, the group with lower ADC was found to have poorer DFS compared with the group with higher ADC (57.1 % [8/14] vs. 96.3 % [26/27]; p = 0.001; Fig. 7). At univariate analysis, ADC was found to be significant prognostic factor (hazard ratio [HR] = 0.959; 95 %CI: 0.931–0.988; p < 0.01). Furthermore, at Kaplan-Meier analysis, lesion size and resection status were other factors significantly related with DFS (p = 0.007 and 0.041, respectively; Table 8). At multivariate analysis performed with significant factors by using Cox proportional hazards regression, ADC was confirmed as significant independent prognostic factor (HR = 0.969; 95 %CI: 0.939–0.999; p = 0.02) with a significant overall model fit (p = 0.02).

Fig. 6
figure 6

Receiver operating characteristic (ROC) curve shows the sensitivity, specificity, and diagnostic accuracy of ADC-based differentiation in the prediction of recurrence. The optimal cut-point value is reported with the relative values of sensitivity, specificity, and diagnostic accuracy

Fig. 7
figure 7

Kaplan-Meier curves show a comparison of disease-free survival between groups of patients with ADC of the primary tumour higher (n = 27) and lower or equal (n = 14) than the optimal threshold value of 1.299 × 10-3 mm2/sec for the prediction of recurrence

Table 8 Univariate analysis of clinicopathological factors and imaging parameters for DFS

Discussion

In recent years, the use of MRI in clinical practice for evaluating the anterior mediastinum has considerably increased due to technological improvements and characterization of tumours; even characterization by means of quantitative data has become possible [19, 2630]. For DW-MRI, our research demonstrates that quantitative assessment of thymomas is helpful in differentiating low-risk from high-risk tumours and the more aggressive type-B3 from all other histological types according to WHO classification, including type-B2, being that the ADC of type-B3 and high-risk thymomas is significantly lower compared with the other defined groups. This finding reflects the ability of DW-MRI in differentiating thymomas based on their different histological composition because lower ADC values are expected for type-B3 thymomas which are typically characterized by a prominent population of medium-sized polygonal epithelial cells and are the only histological type of thymoma that presents nuclear atypia and prominent nucleoli [7]. Indeed, in our study, diagnostic performance of DW-MRI was greater for distinguishing type B3-from all other histological types of thymoma rather than for differentiating high-risk from low-risk thymomas, demonstrating a closer correlation with the simplified WHO classification proposed by Rieker et al. (which consists of types A-AB-B1-B2 as one group and type-B3 as an other group) compared with the Jeong classification [9, 31].

For DW-MRI, there is a lack of standardization for analyzing tissue ADCs among different studies, first in the acquisition parameters (e.g., echo time, number and amplitude of b values) and image analysis, which makes it not possible to compare quantitative data among studies and to deduce optimal thresholds to use for characterizing thymomas in clinical practice [1419]. Indeed, Abdel Razek et al. have found different mean ADC values for each type of tumour according to WHO classification, a lower optimal cut-point of 1.235 × 10-3 mm/sec for discriminating high-risk from low-risk TETs, and a lower AUROC of 0.804, compared with our results [19]. This discordance may be due to different technical parameters in acquisition and analysis of DW-MRI for obtaining ADC measurements among such studies, which may have influenced the ADC value. First, although echo time should be kept as short as possible (typically 50–65 msec in thoracic MRI for avoiding underestimation of ADC measurements), the unusual choice of a long echo time of 108 msec by Abdel Razek et al. (used two b values of 0 and 1000 sec/mm2) led to a reduction in ADC which was higher for tissues with higher diffusion restriction, such as high-risk thymomas and thymic carcinomas [19, 3235]. Indeed, the underestimation of the ADC is up to 30 % by using two b values of 100 and 1000 sec/mm2 and an echo time of 92 msec compared with ADC measurements obtained with similar b values and an echo time of 52 msec [32]. Moreover, ADC measurements obtained by Abdel Razek et al. were also influenced by inclusion of a b value of 0 sec/mm2 for ADC computation, which led to “perfusion-bias” with greater overestimation in measurements for higher vascular tissues, such as type-B3 thymoma and thymic carcinoma [19]. Indeed, because low b values are able to detect water molecules with a large degree of motion or great diffusion distance and because blood flow is completely suppressed at a b value of 150 sec/mm2, the b value of 0 s/mm2 should be always excluded from ADC computation to separately assess the effects of perfusion and diffusion in order to enable calculation of perfusion-insensitive ADC measurements [23]. Hence, in quantitative DW-MRI, standardization of acquisition parameters and data analysis is needed for obtaining reproducible ADC measurements and reliable optimal cut-off points.

Conversely to WHO classification, in our study, DW-MRI did not show the excellent predictive ability for distinguishing early from advanced disease according to the Masaoka-Koga system. Although mean ADC was significantly different between groups per univariate analysis, such significance was not confirmed neither via two-way ANOVA analysis nor via an adjusted logistic regression model, suggesting the inability of DW-MRI in predicting the invasiveness of thymomas regardless of their histological type. This finding is confirmed by the insignificant differences in mean ADC values of a specific WHO histological type of thymoma between cases grouped in the early and advanced disease that we found in our study. The ability of DW-MRI in predicting early or advanced disease strongly depends on the degree of association between Jeong and Masaoka-Koga classifications in the study cohort. For example, in the study of Abdel Razek et al., in which the ADC was able to discriminate among early and advanced disease (p = 0.005), the strong correlation between classifications is suggested by the low percentage of high-risk thymomas in the early-disease group (2/9 patients, 22 %), whereas this percentage is higher in previous clinical studies and in our cohort (8/22 subjects, 36 %) [10, 19]. Furthermore, Abdel Razek et al. included in the advanced-disease group five thymic carcinomas (31 % of tumours defined as advanced disease), but none in the early-disease group [19]. Because thymic carcinomas are typically characterized by low ADC values, their inclusion in the advanced-disease group leads to a reduction of mean ADC value of such group compared with the early-disease group, promoting a finding of statistical significance. Thus, thymic carcinomas should be grouped separately because they have different histology compared with thymomas, exhibit an aggressive behaviour and occasionally are classified as early-stage disease [36].

Considering prognosis, we found a significant difference for DFS between subjects divided by an ADC cut-point value of 1.299 × 10-3 mm2/sec, DFS being poorer in patients with low ADC compared with patients with high ADC. Moreover, ADC obtained prior to treatment was a significant predictor and independent prognostic parameter of DFS for thymoma in our study population, confirming a greater risk of recurrence for patients with ADC lower than the threshold value. Similar results were obtained in studies which evaluated the prognostic value of ADC in different tumours other than thymoma [3739]. Hence, ADC has significant impact on recurrence and should be considered to have, even for thymoma, prognostic value on DFS in addition to existing quantitative imaging parameters such as SUVmax [40, 41]. Thus, DW-MRI is helpful in stratification of risk and could be used for optimizing clinical decision-making at time of diagnosis.

The present study has some limitations. First, although this was the largest series of thymomas investigated with DW-MRI, the sample size was limited to a small number of patients. However, the only research which has solely investigated TETs included mixed populations of patients with thymic carcinoma and thymoma, creating difficulties in the applicability of cut-point ADC values in the general population for their daily clinical use [19]. Second, our proposed cut-points were derived from the data analyzed and require validation in independent samples of patients before being recommended for use in clinical practice. Third, the duration of follow-up was relatively short for our patients with thymoma, which typically has higher DFS and overall-survival rates compared with the highly aggressive thymic carcinoma which was excluded from this study.

In conclusion, the ADC of DW-MRI was shown to be highly reproducible and accurate quantitative parameter for distinguishing high-risk from low-risk thymomas and the more aggressive type-B3 thymoma from all other histological types according to WHO classification. Conversely, the ability of DW-MRI in determining stage based on the Masaoka-Koga system is uncertain, depends on the degree of correlation among WHO and Masaoka-Koga classifications in the study cohort, and requires further validations. Lastly, the ADC of a primary tumour has prognostic value on DFS after surgical resection of thymomas and could be used to predict recurrence, thereby improving disease stratification and guiding the treatment strategy for each patient.