Introduction

Surgical resection, chemotherapy, radiotherapy, and their combinations are common treatments for head and neck squamous cell carcinoma (SCC), including nasal or sinonasal cavity SCC [1]. Super-selective arterial infusions of cisplatin with concomitant radiotherapy, in particular, have become popular for SCC because of its higher local control rate in advanced cases of nasal or sinonasal SCC [2, 3]. For such a nonsurgical therapy, it would be helpful to be able to predict — either before treatment or during the early treatment period — the treatment results of patients who may fail to respond to treatment at the primary site; doing so could optimize patient management. Especially, arterial infusion therapy is a great opportunity to optimize the treatment by adjusting distribution and dose of anticancer drugs, compared to the systemic therapy.

For assessing treatment results, the Response Evaluation Criteria in Solid Tumours (RECIST) criteria provide a standard method [4], but it was reported that tumour size information is not very reliable for predicting treatment outcomes [5]. A 2012 review article contended that the tumour diffusion and perfusion reflects the tumour microenvironment and reveals its relation to treatment results [6]. The apparent diffusion coefficient (ADC) obtained by diffusion-weighted imaging (DWI) noninvasively reveals the tumour microenvironment of tumour cellular density, and the ADC was useful for predicting patient prognoses [7, 8].

In recent years, extended models of DWI have been described for lesions in the head and neck [918] and also other organs such as lung [19]. In such extended models of DWI, intravoxel incoherent motion (IVIM) reveals more details about intratumoral diffusion than the ADC by using bi-exponential fitting with the division of the slow and fast diffusion components [20]. Fast and slow diffusion respectively reflect the tissue perfusion component and the extravascular component. Diffusional kurtosis imaging (DKI) reflects the non-Gaussian water diffusion and, thus, can provide a specific measure of tissue structure, such as cellular compartments and membranes [21]. IVIM and DKI can clarify the tumour microstructural condition and provide useful information to determine the treatment effect. There has been only a very limited amount of research investigating the utility of IVIM and DKI for elucidating treatment responses in SCC, and systematic studies of both IVIM and DKI in SCC patients are needed to predict treatment response.

Here, we assessed the usefulness of IVIM and DKI parameters as a predictor of local control versus failure in nasal or sinonasal SCC patients.

Materials and methods

Subjects

Our institutional review board approved the study protocol. Written informed consent was obtained from all patients. From September 2010 to June 2014, 31 consecutive patients who were referred to our hospital to undergo a super-selective arterial infusion of cisplatin with concomitant radiotherapy under a diagnosis of nasal or sinonasal cancer were prospectively enrolled with the following inclusion criteria: (1) histopathological diagnosis of SCC, and (2) planning for a full course of curative treatment with 70-Gy radiation. In all patients, magnetic resonance imaging (MRI), including multiple b-value DWI for the acquisition of IVIM and DKI data, was performed in both the pretreatment and early treatment periods.

Among these 31 patients, three discontinued treatment due to severe complications and were excluded. A total of 28 patients were thus eligible: 23 males (mean age 60.0 yrs, range 46–73 yrs) and five females (53.0 yrs, 43–59 yrs). The primary lesions involved the maxillary sinus in 25 patients and the nasal cavity in three patients. The histopathological diagnoses were SCC in all patients. The T stage was T2 in one patient, T3 in 11, T4a in 11, and T4b in five. The treatment regimen was a super-selective arterial infusion of cisplatin with concomitant radiotherapy for all patients. Treatment details were as follows: an arterial infusion of cisplatin (100–120 mg/m2 per week for 4 weeks) to the primary tumour’s dominant blood supply, using a microcatheter, with concurrent radiotherapy of a total of 70 Gy in 35 fractions [2]. MRI scans including IVIM and DKI were performed in all patients before treatment and again in the early treatment period. The time interval between the pretreatment MRI and the treatment start was 1–27 days (9.3 ± 3.9 days). MR scanning in the early treatment period was performed upon the delivery of 16–20 Gy (mean 19.2 Gy) of the total 70 Gy, immediately after the second arterial infusion of cisplatin.

Clinical endpoint

In all patients, clinical and radiological follow-ups were performed after the treatment to determine the final diagnosis of the local control/failure at the primary site. Local failure was determined by the histopathological confirmation of SCC by biopsy or surgical resection, the development of a new mass in the post-treatment granulation tissue, or residual tumour enlargement during follow-up, which was ≥ 1 year (minimum follow-up period was set at 1 year). Local control was determined by histopathological confirmation of the absence of SCC by surgical resection, absence of enlargement of the suspected lesion of the residual tumour, or absence of a new lesion in the post-treatment granulation tissue within the follow-up period.

MR imaging protocol

All MR imaging was performed using a 3.0 Tesla unit (Achieva TX; Philips Healthcare, Best, Netherlands) with a 16-channel neurovascular coil. The DWI acquisition of IVIM and DKI data used single-shot spin-echo echo-planar imaging (EPI) with three orthogonal motion probing gradients. Twelve b-values (0, 10, 20, 30, 50, 80, 100, 200, 400, 800, 1000, and 2000 s/mm2) were used. The other imaging parameters were: TR, 4500 ms; TE, 64 ms; DELTA (large delta; gradient time interval), 30.1 ms; delta (small delta; gradient duration), 24.3 ms; flip angle, 90°; FOV, 230 × 230 mm; 64 × 64 matrix; slice thickness, 5 mm × 20 slices; voxel size 3.59 × 3.59 × 5.00 mm; parallel imaging acceleration factor, 2; numbers of signal averages, b-value of 0–100 s/mm2 (one average), 200–800 s/mm2 (two averages) and 1000–2000 s/mm2 (three averages); scanning time, 4 min 37 s.

Conventional MR images were also obtained to evaluate the primary tumour. These images included (a) axial T1-weighted image (T1WI) with a spin-echo sequence (TR, 450 ms; TE, 10 ms; FOV, 240 × 240 mm; 512 × 512 matrix; slice thickness, 5 mm; inter-slice gap, 30 %; scanning time, 2 min 12 s), and (b) axial T2-weighted image (T2WI) with a turbo spin-echo (TSE) sequence with fat suppression (TR, 4500 ms; TE, 70 ms; TSE factor, 9; FOV, 240 × 240 mm; 512 × 512 matrix; slice thickness, 5 mm; inter-slice gap, 30 %; scanning time, 2 min 06 s).

Analysis of IVIM and DKI data

Using the signal intensity of all 12 b-values, we calculated the IVIM parameters perfusion fraction, f; the pseudo-diffusion coefficient, D*; and the true diffusion coefficient, D. Using the signal intensity of five b-values (0, 400, 800, 1000 and 2000 s/mm2), we calculated the DKI data. To obtain IVIM parameters, we applied the image signal intensity of each b-value to the following bi-exponential function [10]:

$$ \frac{S_{(b)}}{S_0}=f\cdot {e}^{-b\cdot D*}+\left(1-f\right)\cdot {e}^{-b\cdot D} $$
(1)

To obtain DKI parameters (i.e., the kurtosis value, K and the kurtosis corrected diffusion coefficient, Dk), we applied the signal intensity data of five b-values (0, 400, 800, 1000 and 2000) using the following equation [22]:

$$ \frac{S_{(b)}}{S_0}= \exp \left[-b\ast {D}_k+\frac{1}{6}\ast {b}^2\ast {D_k}^2\ast K\right], $$
(2)

where S (b) is the signal intensity at the b-value denoted by the subscript, S 0 is the signal intensity at the b-value of 0, and b is b-factor, in both Eqs. [1] and [2]. We fitted the signal intensity of b-values in Eqs. [1] and [2] with least square fitting using the Levenberg-Marquardt algorithm.

For the IVIM, we used a two-step fitting method for calculating the increase in the robustness of the fitting with less calculation error as follows: (1) the data of b > 400 s/mm2 were fitted for the single parameter D, because D* is assumed to be significantly > D, so that the influence of pseudo-diffusion on signal decay can be neglected for b-values > 400 s/mm2. (2) The curve was fitted for f and D* over all b-values, while keeping D constant [10, 23].

For the DKI, direct fitting to Eq. [2] was performed. The IVIM and DKI parameter calculations were performed on a pixel-by-pixel basis. Maps of f, D, D*, K and Dk values were each obtained on a pixel-by-pixel basis from each pixel’s signal intensity in the IVIM and DKI data.

The conventional ADC was also calculated using two b-values (0 and 1000). The following equation was used for the ADC calculation:

$$ \left(\mathrm{Signal}\kern0.5em \mathrm{intensity}\kern0.5em \mathrm{of}\kern0.5em \mathrm{b}=1000\right)/\left(\mathrm{Signal}\kern0.5em \mathrm{intensity}\kern0.5em \mathrm{of}\kern0.5em \mathrm{b}=0\right)= \exp \left(-1000*\mathrm{A}\mathrm{D}\mathrm{C}\right). $$

All parameter calculations were performed using MATLAB ver. 2012a (MathWorks, Natick, MA).

ROI delineation and parameter calculations

In the IVIM analysis, a board-certified neuroradiologist (A.T., 18 years of experience) delineated each tumour with a polygonal ROI on b0 images; axial T1WI and T2WI were used as reference images. Any area, which was suggested to be a necrosis or cystic lesion, was excluded from the ROI. If the tumour extended into two or more slices, all slices in which the tumour was included were used for the ROI delineation. Each delineated tumour ROI was copied on each parameter (f, D*, D, K, Dk and ADC) map. Each f, D*, D, K, Dk and ADC value was determined as the mean value in the delineated ROI by integrating all tumour voxels from all delineated slices into the total signal intensity.

For each pixel, the upper and lower limits were set for f and D* values to exclude unrealistic measurements (probably because of several outliers) to avoid including any erroneous pixels in the calculation. The lower and upper limits of f and D* were respectively set at 0–0.4 and 0–50 × 10−3 by referring to the range of each parameter in an earlier report [12]. We performed a histogram analysis by calculating the 25th, 50th and 75th percentile values of f, D*, D, K, Dk and ADC in each tumour ROI. In addition, to assess the goodness of fit in both the IVIM and DKI fitting, we calculated the coefficient of determination (R 2 value; R 2 = 1 − ESS/TSS, where ESS is the sum of the squared errors between the data points and IVIM/DKI fitting curve, and TSS is the sum of the squared differences between the data points and the mean value of all data points) in each pixel [24]. When a pixel’s R 2 value was < 0.8, the pixel was excluded from the parameter calculation.

Statistical analysis

We used Pearson’s correlation coefficient to assess the correlations of IVIM and DKI parameters (between the two sets of mean f, D*, D, K, Dk and ADC values) at pretreatment, at the early treatment period and the percentage change between these two periods.

In a univariate analysis, the Mann-Whitney U-test was used to compare mean f, D*, D, K, Dk and ADC values at the pretreatment, at the early-treatment period and the percentage change of these two periods between the local control and failure groups, respectively. The histogram (25th, 50th and 75th percentiles in the tumour) of these parameters (f, D*, D, K, Dk and ADC) were also compared in the same trend. If a significant difference was obtained for more than two parameters among the mean or histogram parameters, these parameters were analyzed by multivariate logistic regression models to determine whether they had independent predictive value with odds ratios (OR) and corresponding 95 % confidence intervals (CI). The detected predictive values were also assessed using receiver operating characteristic (ROC) curves constructed for calculating the area under the curve (AUC). The diagnostic accuracy was determined by using the closest point to the upper left corner of the ROC curve in the division of local control and failure.

We conducted a univariate Cox regression analysis to predict aspects of the outcomes including the time point at which local failure was determined, adjusting each model separately for the parameters (f, D*, D, K, Dk, ADC at pretreatment, at the early treatment period, and the percentage change between these two periods), patient age and T stage (T2–3 vs. T4). If several significant parameters were detected regarding the relation to patient outcome, we conducted a multivariate analysis. Hazard ratios (HRs) and corresponding 95 % CIs were calculated.

P-values <0.05 were considered significant. SPSS software (IBM, Armonk, NY) was used for all analyses.

Results

We successfully obtained IVIM and kurtosis parameter maps for all 28 primary tumours in both the pretreatment and early treatment periods. Among the 28 patients, nine patients were found to be local failures. Eight patients with local failure were confirmed by histopathological findings. One local failure and 19 local controls were determined by clinical diagnosis at follow-up (mean 23 mos; range 12–38 mos). All of the parameter data at the pretreatment, at the early treatment period and the percentage changes between these two periods are presented in Table 1. For each pixel within the tumour ROI, the coefficient of determination (R2 value) was 0.94 ± 0.03 in the IVIM fitting and 0.97 ± 0.02 in the DKI fitting. Figure 1 presents a case example of all DW images, all parameter maps and the map of the R 2 values.

Table 1 All IVIM and DKI parameters in the pretreatment and early treatment periods and the percentage change between the two periods
Fig. 1
figure 1

DW images in each b-value, tumour ROI delineation, IVIM and DKI parameter maps and residual sum/R2 map in patients with right maxillary cancer. The original images of all b-values (0–2000) are presented (a) with the same window level/width in the range of 0-400 of b-value, whereas the window level/width was each adjusted for well visualization of the tumour in upper 800 of b-value. Each tumour was delineated with a polygonal ROI on b0 images in the raw images of IVIM (b). Axial T2WI (c) and T1WI (d) were used as reference images for the delineation. Each delineated tumour ROI was copied on each parameter’s (f, D*, D, K, Dk and ADC) map (e). R 2 maps of IVIM (f) and DKI (g) in the tumour ROI were presented for the assessment of goodness of fit

The Pearson’s correlation coefficients between all parameters in the pretreatment period and during treatment and the percentage changes are summarized in Table 2.

Table 2 The Pearson’s correlation coefficients between the two sets of mean f, D*, D, K, Dk and ADC in the pretreatment and early treatment periods and the percentage change

The univariate analysis revealed significant differences between the local control and failure groups in pretreatment f values (mean, 25th, 50th, 75th percentiles), and the percentage change of D (mean, 25th, 50th, 75th), K values (mean, 50th, 75th), Dk values (mean, 25th, 50th, 75th), and ADCs (mean, 25th, 50th, 75th), respectively. The multivariate analysis revealed that the percentage change of D (mean, 25th, 50th, 75th percentiles), K value (mean, 50th, 75th), Dk value (mean, 25th, 50th), and ADC (mean, 25th, 50th) were predictors for determining local control (Table 3). From the ROC curve analysis, the AUC, sensitivity, specificity, positive predictive value, negative predictive value, accuracy and cut-off value were calculated (Table 4). The parameter with the highest accuracy was the percentage change of D value with the histogram 25th percentile. Figure 2 presents a case example of D maps.

Table 3 Significant parameters in the multivariate logistic regression models
Table 4 ROC analysis results
Fig. 2
figure 2

Patient with right maxillary SCC in pretreatment (a: T2WI, b: D map), early treatment period (c: T2WI, d: D map) and follow-up scanning after treatment (e: T2WI). Compared to the pretreatment T2WI (a: white arrows), it was difficult to discern a marked difference in T2WI at the early treatment period (c: white arrows). In the D map at the early treatment period (b: white arrows), an increase in the D values for most of the tumour was observed (d: white arrows); however, only a slight change in the D value on the left-dorsal area in the tumour was observed (d: black arrow). This area was finally revealed as a residual tumour by follow-up T2WI after the treatment as development of mass lesion (e: white arrows) and histopathological findings. Such focal change was detected by the histogram analysis of the 25th percentile in the D map

The univariate Cox regression analyses revealed the following significant predictors of treatment outcome: T-stage, the percentage change of D value (mean, 25th, 50th, 75th percentiles), K value (mean, 50th, 75th), Dk value (mean, 25th, 50th), and ADC (mean, 25th, 50th). The multivariate Cox regression analyses indicated that the predictors were the percentage change of D value (mean, 25th, 50th percentiles), K value (mean, 50th, 75th), Dk value (mean, 25th, 50th), and ADC (mean, 25th, 50th). The percentage change of D value with the histogram 25th percentile was also the highest HR. Table 5 summarizes these results.

Table 5 Significant parameters in the cox-proportional hazards regression models

Discussion

Our findings revealed percentage change of several IVIM and DKI parameters between the pretreatment and the early treatment period as predictors of local control in nasal or sinonasal SCC, while all absolute values of IVIM/DKI parameters at pretreatment and at the early treatment period were revealed non-significant. The IVIM parameter of D with the histogram 25th percentile was suggested to be the most powerful predictor.

In addition, a certain degree of correlations between several IVIM and DKI parameters was revealed. These IVIM and DKI parameters included the information of water diffusion, which reflects numerous tissue structures (e.g., the cellular compartment and the extracellular extravascular space), which indirectly affected the DWI signal in each b-value. Several parameters probably overlapped such tissue structural information, thus, providing a significant correlation, although mainly reflected structure in each parameter may be different.

Being able to predict the local outcome of a primary tumour would provide useful information about the need for additional chemotherapy and possible earlier salvage surgery after the current chemoradiotherapy. Moreover, detecting parameter changes related to local control during treatment could help improve treatment planning. For example, radiotherapy is often re-planned due to a reduction in irradiated tumour volume after an interim evaluation. The local distribution of IVIM or DKI parameters percentage changes might be useful for re-planning radiotherapy, just as with FDG-PET [25]. In arterial infusion therapy, the dose distribution of drug infusion in each artery can also be adjusted depending on the regional percentage change of IVIM or DKI parameters in each arterial territory [26]. If a poor outcome is strongly indicated, it may thus be possible to change the treatment plan (e.g., for a surgical resection) by interrupting the current treatment method.

The usefulness of the ADC for predicting patients’ local outcomes has been described [6], but few reports discuss the usefulness of IVIM and DKI parameters for predicting treatment outcomes in head and neck lesions. There has apparently been no study assessing long-term treatment outcomes using both IVIM and DKI parameters in pretreatment and the early treatment period in SCC patients. Our present findings systematically revealed the overall usefulness of these parameters in nasal or sinonasal SCC patients.

Concerning IVIM parameters in head and neck lesions, Xiao et al. reported that the baseline D and early change of D were correlated with nasopharyngeal cancer patients’ treatment response to neoadjuvant chemotherapy, although that series’ histopathology was undifferentiated cancer in the nasopharynx [13], unlike the present study. Similar to the Xiao study, the percentage change of D was revealed herein as a significant predictor of local control, although the baseline D was not observed to be significant. We speculate that the D value reflects more detail about intratumoral cell density and stromal space than the ADC because the calculation of D excludes the perfusion-related signal in the pixels, unlike the ADC calculation.

We suspect that another reason for the efficacy of the histogram 25th percentile of D is that it reflects the intratumoral heterogeneity for the sensitivity of treatment response in the whole tumour, e.g., a hypoxic lesion around the necrotic tissue, or a deep area of the tumour with lower perfusion [27, 28]. Such regions tend to be resistant to treatment and are more likely to be a residual tumour after treatment. These regions can be detected by the histogram analysis of the 25th percentile as a lower D-value area, while other regions of higher treatment sensitivity areas show increased D values. If the whole-tumour mean D value is used, such a focal area with treatment resistance is averaged with another region, resulting in a failure to detect such focal regions within a residual tumour.

This study also revealed significant correlations between the treatment outcome and the percentage change of K and Dk values at the early treatment period in both the logistic regression and Cox regression analyses. The associations between DKI parameters and the diagnosis of local control remain complex and unclear, but as also reported [29, 30], we suspect that K and Dk values reflect tissue microstructural complexity (e.g., tumour cell density, stromal volume of the tumour tissue, and the complexity of the membrane structure) in more detail by a multiple b-value analysis with a non-linear fitting model. Therefore, the percentage change of DKI parameters might well reflect the microstructural-level damage of tumour tissue induced by the treatment, and may also reflect the treatment response in detail.

There is a technical issue related to the parameter calculation. We used all 12 b-values for the IVIM and five b-values for the DKI in the present study, but the series of b-value data varies among the prior and present studies, and the use of different b-value data might result in bias [31]. In the present study, D was calculated using the cutoff b-value of 400 mm/s2 in reference to another IVIM report [23] to exclude perfusion-related signals as much as possible, whereas most of the other studies used b = 200 mm/s2. Additionally, we set the highest b-value of 2000 mm/s2; this was a slightly larger value than that used when targeting head and neck lesions. Such a high b-value signal will provide more details of restricted diffusion if a sufficient SNR is obtained. However, the data obtained in the present study may differ from IVIM or DKI values obtained in other studies. Careful interpretation is needed when comparing the obtained IVIM or DKI values — especially the absolute values.

This study has several limitations. First, the treatment method was the arterial infusion of cisplatin which have become popular in nasal or sinonasal SCC, but was not commonly used in cases of overall head and neck cancer; systemic therapy is frequently performed. Although we expect the reactivity in tumour-microenvironment by the chemoradiation used in the current study will not be much different from systemic chemoradiation or other non-surgical therapy, our findings should be carefully treated as reference data for other head and neck lesions. However, with this method, the early treatment period is easily set compared to systemic chemoradiation, the treatment regimen of which varies widely. Second, the number of patients (n = 28) was not large, and we, thus, could not perform a subgroup analysis with divisions of histopathological differentiation or human papilloma virus status. However, nasal and sinonasal SCC are not very common, and it would be a challenge to investigate larger numbers of patients with nasal or sinonasal SCC. Third, we used a large pixel size for the acquisition of IVIM and DKI data; the reason for this was to obtain a high SNR in each pixel. All of the primary tumours in the present study included a large mass and were sufficient for the histogram analysis with such a large pixel size. However, a large pixel size could be inappropriate when small tumours are evaluated. In addition, the tumour ROI delineation was performed in b0 images in this study. We did not investigate which b-value image was most appropriate to delineate the exact tumour ROI, and this remains to be determined in the future.

In conclusion, IVIM and DKI parameters — especially the IVIM D value with the histogram 25th percentile — were useful indicators for predicting treatment outcomes in nasal/sinonasal SCC patients. These parameters will contribute to decisions regarding treatment planning, additional treatment, and follow-up strategies, and thus to improved patient prognoses.