Introduction

Diffusion-weighted imaging (DWI), which assesses the degree of diffusion of water molecules in living tissues, has become one of the most essential techniques for the detection of various diseases, such as acute cerebral infarction, neoplasm and inflammatory lesions [1,2,3,4]. The apparent diffusion coefficient (ADC) reflects the quantitative diffusivity of protons; this might be useful in the differentiation of diseases and prediction of treatment response and prognosis [5, 6]. However, ADC cannot separate the perfusion that represents the capillary blood flow from the diffusion of water molecules. Therefore, ADC cannot accurately determine the degree of diffusion of water molecules.

Contrarily, intravoxel incoherent motion (IVIM) imaging, which is a DWI technique, can evaluate diffusion and perfusion separately using multiple b-values [7]. IVIM imaging yields the three quantitative parameters (D, D* and f) that represent the true diffusion coefficient, pseudo-diffusion coefficient and perfusion fraction, respectively. These IVIM-derived parameters can also facilitate differential diagnosis and estimations of therapeutic effects as well as ADC [8, 9]. However, the image quality of standard single-shot echo-planar imaging (EPI)-DWI (EPI-DWI) is frequently deteriorated by susceptibility artefacts, because EPI sequence is prone to phase error accumulation. Moreover, the head and neck area is especially sensitive to magnetic inhomogeneity due to the presence of dental alloy or its complex structure with many boundaries, such as air or bones. These susceptibility artefacts cause image distortion or signal loss, and they may hamper accurate disease detection or measurements of ADC and IVIM-derived parameters [10].

The single-shot turbo spin-echo (TSE) is the second most common DWI sequence [11]. Importantly, DWI using the spin-echo based TSE sequence (TSE-DWI) is less sensitive to susceptibility artefacts, because it uses radio frequency (RF) refocusing pulses [12,13,14]; however, the scan time of this sequence is long due to multiple RF refocusing pulses and the RF heating restrictions with an increase in the specific absorption rate. In recent years, TSE-DWI has been improved by the adoption of the modulus averaging method, short RF pulses and sensitivity encoding (SENSE) [15]. Elefante et al. have recently reported that multi-shot TSE-DWI has higher sensitivity for detection of cholesteatoma and lower probability of misdiagnosis compared with single-shot EPI-DWI [16]. We hypothesised that TSE-DWI can improve DWI quality and can be used as an alternative sequence for EPI-DWI in the head and neck. The reproducibility of ADC values between EPI-DWI and TSE-DWI and the comparison of IVIM parameters in normal pituitary gland between EPI-DWI and TSE-DWI have already been reported [17, 18]; however, there have been no reports comparing the IVIM-derived parameters between TSE-DWI and EPI-DWI of the head and neck. Thus, the purpose of our study was to compare the image quality, ADC and IVIM-derived parameters of TSE-DWI and EPI-DWI of the head and neck using a 3-T magnetic resonance (MR) unit.

Materials and methods

Subjects

All experiments were performed after obtaining institutional review board approval and written informed consent from all subjects. The inclusion criteria were as follows: no present illness of the head and neck areas, no contraindication for MR examinations including claustrophobia or presence of metallic biomedical materials or electronic devices in the body. We excluded patients who had metallic artificial dentition or dental bridges, because they would cause severe susceptibility artefacts in MR images. Fourteen healthy volunteers (seven men and seven women; age range, 21–23 years; mean age, 22 years) prospectively underwent DWI of the head and neck.

Imaging protocol

MR imaging was performed using a 3-T magnetic resonance system (Intera Achieva 3.0 T TX, Philips Healthcare, Best, The Netherlands) with a 16-channel SENSE Neurovascular coil. The axial TSE-T2-weighted imaging (T2WI) was performed with the following parameters: repetition time, 3,000 ms; echo time, 80 ms; field of view, 230 × 230 mm2; matrix size, 336 × 336; slice thickness, 5 mm with a 1-mm intersection gap. The slice thickness of 5 mm was chosen for T2WI to allow superimposition using DWI. Diffusion-weighted MRI data were acquired using both single-shot TSE sequence and single-shot EPI sequence. The single-shot TSE sequence had the following parameters: repetition time, 4,200 ms; echo time, 78.8–81.3 ms; field of view, 230 × 230 mm2; matrix size, 128 × 129; reconstruction matrix size, 320 × 320; reconstruction voxel size, 0.72 mm; SENSE factor, 2.0; Partial Fourier factor, 0.6; TSE factor, 40; slice thickness, 5 mm with a 1-mm intersection gap; bandwidth, 630 Hz/pixel; number of signal averages, 4; b value = 0, 300 and 750 s/mm2; fat suppression, spectral pre-saturation with inversion recovery; acquisition time, 8 min 24 s. The parameters of the single-shot EPI sequence were as follows: repetition time, 6,000 ms; echo time, 71.5–72.7 ms; field of view, 230 × 230 mm2; matrix size, 128 × 125; reconstruction matrix size, 320 × 320; reconstruction voxel size, 0.72 mm; SENSE factor, 2.5; Partial Fourier factor, no; EPI factor, 51; slice thickness, 5 mm with a 1-mm intersection gap; bandwidth in frequency direction, 2,820 Hz/pixel; number of signal averages, 4; b value = 0, 300 and 750 s/mm2; fat suppression, short-T1 inversion recovery; acquisition time, 1 min 42 s. These two DWI sequences were consecutively performed in each examination. Fujima et al. reported that the signal intensity in the high b-value might be an outlier because the decrease was small compared to that in the low b-value, and the calculated parameters were probably influenced by this outlier [19]. For shortening the scanning time, we chose b = 0, 300 and 750 s/mm2 [8, 9, 19].

Data analysis

Image quality

The quantitative assessments of image quality of all the volunteers were analysed by a radiological technologist under the guidance of an experienced radiologist (26 years’ experience in head and neck radiology). The image distortion in the phase-encoding direction, signal-to-noise ratio (SNR) and contrast-to-noise ratio (CNR) from TSE-DWI and EPI-DWI (with b = 750 s/mm2) were noted. For the evaluation of distortion, fusion images were created by superimposing T2WI and DWI findings using DxMM (Medasys, Gif-sur-Yvette, France) without use of an automatic position registration (Fig. 1a). Image distortions of each sequence at cervical spinal cords at the nasopharynx and oropharynx levels, submandibular glands, palatine tonsils and cerebellar hemispheres were quantitatively compared using the distortion ratio (DR). The DR is defined as per the following equation:

Fig. 1
figure 1

Comparison between image distortion in TSE-DWI and EPI-DWI. (a) The schema of fusion images. The black schema represents the spinal cord at T2WI and the red schema represents the distorted spinal cord at DWI. The white arrow shows the maximum diameter of the anatomical structure on T2WI and the black arrow shows the maximum displacement of the anatomical structure between T2WI and each-DWI. (b) The fusion image of TSE- DWI and T2WI. (c) The fusion image of EPI-DWI and T2WI. Note that the spinal cord at EPI-DWI (black arrow) (c) is more distorted than that at TSE-DWI (b). TSE turbo spin-echo, DWI diffusion-weighted imaging, EPI echo-planar imaging, T2WI TSE-T2-weighted imaging

$$ \mathrm{D}\mathrm{R} = \mathrm{A}/\mathrm{B} $$
(1)

A is the maximum displacement in the phase-encoding direction of the anatomical structure between T2WI and each-DWI, and B is the diameter in the phase-encoding direction of the anatomical structure on TSE-T2WI (Fig. 1a) [20].

At least three circular regions of interest (ROIs) of 50 pixels were set in the submandibular glands and parotid glands in order to measure the SNR and CNR. However, superior deep cervical lymph nodes in normal volunteers were too small to set three ROIs; therefore, we used a single ROI, as large as possible, within a superior deep cervical lymph node. For the reference tissue of CNR measurements, 50 pixel ROIs were set at muscles close to each organ. The SNR and CNR were calculated from each ROI, defined by the following equations:

$$ \mathrm{S}\mathrm{N}\mathrm{R} = {\mathrm{SI}}_{\mathrm{a}}/{\mathrm{SD}}_{\mathrm{a}} $$
(2)
$$ \mathrm{C}\mathrm{N}\mathrm{R} = \left|{\mathrm{SI}}_{\mathrm{a}} - {\mathrm{SI}}_{\mathrm{muscle}}\right|/{\mathrm{SD}}_{\mathrm{muscle}} $$
(3)

where SIa is the mean signal intensity in the anatomical regions, SDa is the standard deviation of signal intensity in the anatomical regions, SImuscle is the mean signal intensity in the muscle and SDmuscle is the standard deviation of the signal intensity in the muscle. The mean SNR and CNR of multiple ROIs were calculated at the submandibular glands and parotid glands. Noise measurement with parallel imaging was difficult because the noise level in the reconstructed images largely depends on the sensitivity profiles of receiver coils; noise distribution is heterogeneous for each given location [21]. Therefore, we used SDmuscle as the local noise estimate, because we could not set the ROIs in the background air [21, 22].

Apparent diffusion coefficient (ADC) and intravoxel incoherent motion (IVIM)-derived parameters

We analysed the ADC and IVIM-derived parameters from DWI at b-values of 0, 300 and 750 s/mm2. Under the guidance of an experienced radiologist, one of the researchers set the ROI manually in the submandibular glands, sublingual glands and palatine tonsils using SYNAPSE VINCENT (Fujifilm Medical, Tokyo, Japan), while ensuring that the ROIs were as large as possible and avoiding vessels or gland ducts. The ADCs were calculated from the signal decay curve and approximated by the least squares method. The IVIM theory is shown in the following equation:

$$ {\mathrm{S}}_{\mathrm{b}}/{\mathrm{S}}_0 = \left(1-\mathrm{f}\right)\cdot \exp\ \left(-\mathrm{b}\cdot \mathrm{D}\right) + \mathrm{f}\cdot \exp\ \left[-\mathrm{b}\ \left(\mathrm{D} + \mathrm{D}*\right)\right] $$
(4)

Here, Sb and S0 are signal intensities at arbitrary b value and b = 0 s/mm2. The IVIM-derived parameters were calculated using a segmented method (Fig. 2) [8]. As the very first step of this method, D was calculated, because the effect of D* on the signal of DWI at b-value higher than 200 s/mm2 is negligibly small. Therefore, Eq. (4) can be simplified as: Eq. (5).

Fig. 2
figure 2

Calculation of intravoxel incoherent motion (IVIM)-derived parameters. D is calculated by the equation: D = - ln (S750/S300)/(750 - 300) and f is calculated by the equation: f = (S0 – Sinter)/S0. Sinter is the intersection point of the Y-axis and the line through ln S300 and ln S750

$$ {\mathrm{S}}_{\mathrm{b}1}/{\mathrm{S}}_{\mathrm{b}2}= \exp \kern.2em \left[\left({\mathrm{b}}_2{\textstyle\ \hbox{-}}\kern.2em {\mathrm{b}}_1\right)\kern.3em \mathrm{D}\right] $$
(5)

Here, Sb1 and Sb2 are signal intensities at two b-values higher than 200 s/mm2.

The second step of the segmented method calculated f as per the following equation:

$$ \mathrm{f}=\left({\mathrm{S}}_0{\textstyle\ \hbox{-}}\kern.2em {\mathrm{S}}_{\mathrm{inter}}\right)/{\mathrm{S}}_0 $$
(6)

Here, Sinter is the intersection point of the y-axis and the line through ln S300 and ln S750. The measurement reproducibility of f and D* were poor; however, inter-slice consistency of f was better than that of D* [23]. Therefore, we evaluated f as the perfusion-related parameter and excluded D* from the analysis.

Statistical analysis

Statistical analysis was performed to compare the DR, SNR, CNR, ADC and IVIM-derived parameters of TSE-DWI and EPI-DWI of each anatomical structure using the Wilcoxon signed-rank test. The statistical significance was set at a p-value less than 0.05. In addition, the reproducibility of the mean ADC and IVIM-derived parameters of TSE-DWI and EPI-DWI were analysed by using the Bland-Altman plot [24], and 95% limits of agreement (LoA) were calculated as a percentage of the overall mean value of the two sequences. The 95% LoA, estimated by mean difference ± 1.96 standard deviation, provides an interval within which 95% of differences between the two methods of measurement are expected to lie [25]. All statistical analyses were performed with JMP Pro 11.0.0 (SAS Institute, Cary, NC, USA).

Results

Comparison of distortion ratio (DR)

The representative two fusion images, which consisted of TSE-DWI with T2WI and EPI-DWI with T2WI, are shown in Figs. 1b and c, respectively. The mean DRs of TSE-DWI and EPI-DWI are shown in Table 1. The distortion in TSE-DWI was significantly smaller than that in EPI-DWI in the cervical spinal cord of the nasopharynx and oropharynx levels, submandibular glands, palatine tonsils and cerebellar hemispheres (p < 0.05).

Table 1 The DR on TSE-DWI and EPI-DWI

Comparison of signal-to-noise (SNR) and contrast-to-noise (CNR) ratios

The ROI sizes (mean pixels ± SD) at the superior deep cervical lymph node were 85.2 ± 23.5 about TSE-DWI, and 76.4 ± 35.3 about EPI-DWI. The representative images of TSE-DWI and EPI-DWI in a healthy volunteer are shown in Fig. 3. The mean SNRs and CNRs of TSE-DWI and EPI-DWI are shown in Table 2. The SNRs of TSE-DWI were significantly higher than those of EPI-DWI (p < 0.05) in the submandibular glands, parotid glands and superior deep cervical lymph node. The CNRs of TSE-DWI were significantly higher than those of EPI-DWI in the submandibular glands and parotid glands (p < 0.05). There were no significant differences between the CNRs of TSE-DWI and EPI-DWI in the superior deep cervical lymph node.

Fig. 3
figure 3

The axial MR images at the same submandibular gland section of a 22-year-old female volunteer: (a) T2WI, (b) TSE-DWI of b = 750 and (c) EPI-DWI of b = 750. TSE turbo spin-echo, DWI diffusion-weighted imaging, EPI echo-planar imaging, T2WI TSE-T2-weighted imaging

Table 2 The SNR and CNR on TSE-DWI and EPI-DWI

Comparison of ADC and IVIM-derived parameters

The ROI sizes (mean pixels ± SD) at right submandibular gland, left submandibular gland, right sublingual gland, left sublingual gland, right palatine tonsil and left palatine tonsil were 432 ± 129, 414 ± 89, 237 ± 67, 224 ± 94, 254 ± 86 and 228 ± 70, respectively, for TSE-DWI, and 438 ± 147, 434 ± 124, 190 ± 58, 172 ± 69, 223 ± 69 and 223 ± 51, respectively, for EPI-DWI. The sublingual glands of two volunteers were excluded, because these could not be depicted in the EPI-DWI. The mean ADC and IVIM-derived parameters of TSE-DWI and EPI-DWI are shown in Table 3. The ADC and D of TSE-DWI exhibited higher values than those of EPI-DWI in the submandibular glands, sublingual glands and palatine tonsils. However, the differences were not significant. The f-value of TSE-DWI was higher than that of EPI-DWI in all parts except the right palatine tonsil. However, the differences were not significant.

Table 3 The ADC, D and f on TSE-DWI and EPI-DWI

Bland-Altman plots are shown in Figs. 4 (a–f); 95% LoA for ADC and IVIM-derived parameters between TSE-DWI and EPI-DWI are shown in Table 4. Fixed biases were not detected; however, Bland-Altman analysis showed wide limits of agreement between TSE-DWI and EPI-DWI (Table 4).

Fig. 4
figure 4

Bland-Altman plots for ADC, D and f of TSE-DWI and EPI-DWI in the bilateral submandibular glands (a, d, g), sublingual glands (b, e, h) and palatine tonsils (c, f, i). Each colour (blue = right anatomical regions, red = left anatomical regions) plot shows the value difference against the value average between TSE-DWI and EPI-DWI. Continuous lines show average measurement difference and dotted lines show 95% limits of agreement. TSE turbo spin-echo, DWI diffusion-weighted imaging, EPI echo-planar imaging, T2WI TSE-T2-weighted imaging

Table 4 The LoA for ADC and IVIM parameters between TSE-DWI and EPI-DWI

Discussion

Our study results showed that TSE-DWI had significantly smaller image distortion, and higher SNR and CNR in the head and neck than those of EPI-DWI. In general, the TSE sequence, which obtains echo by refocusing RF pulses, is less sensitive to susceptibility artefacts compared to EPI sequence [11]. Reduced image distortion may provide better reproducibility and reliability of diffusion-weighted images and quantitative parameters [10]. Regarding the advantages of TSE-DWI in SNR and CNR, rectangular-shaped and shorter RF pulses might contribute to the improvement of these factors. The image noise depends on the voxel size, receiver bandwidth and number of averages at the image acquisition [26]. In our study, two DWI sequences showed approximately the same voxel size and number of averages; however, the bandwidth of EPI-DWI (2,820 Hz/pixel) was wider than that of TSE-DWI (630 Hz/pixel); therefore, it could significantly affect the SNR and CNR. Sakamoto et al. reported that TSE-DWI has an inherently lower SNR than EPI-DWI [27]. However, our study results conversely revealed that CNR was higher than SNR at submandibular glands and superior deep cervical lymph nodes in TSE-DWI because of setting parameters to maintain image quality suitable for diagnosis. We speculate that the reason why the CNR was much higher than the SNR was that the standard deviation of the signal intensity in the muscle became smaller based on the low signal intensity in DWI at a high b-value.

In the quantitative analyses of ADC and IVIM-derived parameters, no significant difference between TSE-DWI and EPI-DWI was noted; however, TSE-DWI showed higher values than EPI-DWI. There should be many factors accounting for the differences in quantitative parameters between TSE-DWI and EPI-DWI. In the Bland-Altman plot (Fig. 4), the difference in each parameter between TSE-DWI and EPI-DWI might vary depending on volunteers. Additionally, fixed bias or proportional error was not detected in the Bland-Altman plot. Therefore, we speculate that distortion in EPI-DWI might hamper accurate measurement of quantitative parameters. Image noise can considerably influence the calculation of diffusion parameters [28]. When image noise is large, the diffusion parameters tend to show lower values [28]. TSE-DWI showed higher SNR than EPI-DWI; therefore, the influence of noise was smaller in TSE-DWI. Andreou et al. have reported that the limits of agreement for ADC, D and f in normal liver were −4% to 7%, −5% to 8%, and −24% to 25%, respectively [29]. On the other hand, the limits of agreement for ADC, D and f in our study became much wider (Table 4). Although these results were derived from different ROIs, it would be necessary to show similar LoAs for the follow-up studies. Therefore, there were unacceptable LoAs between TSE-DWI and EPI-DWI in our study. Inaccurate measurement of parameters due to the image distortion on EPI-DWI might be the cause of the wide LoA, although no significant difference between the two DWI sequences was noted. There was a severe distortion in EPI-DWI compared with TSE-DWI, therefore we could not sample the signal intensity from the same region within the organ between low and high b values in EPI-DWI. We speculate that it might affect the calculation of ADC and IVIM-derived parameters using multiple b values and lead to a wide LoA. Thus, the ADC and IVIM-derived parameters obtained from these two sequences should not be used as equivalent quantitative parameters in disease differentiation, treatment response prediction and prognosis.

There are two major limitations to this study. First, we used the segmented method, which is a non-fitting approach, to measure IVIM-derived parameters for shortening the imaging time by reducing number of b values. Therefore, f-value derived from this method are susceptible to S0 [30]. Furthermore, the f-value may lack robustness, because it did not take into account the perfusion effect based mainly on higher b values [31]. In addition, the IVIM-derived parameters become more accurate by using a large numbers of b-values. In our study, using only three b-values might have influenced the calculation of IVIM-derived parameters. Second, the acquisition time using three b-values of TSE-DWI (8 min 24 s) was longer than that of EPI-DWI (1 min 42 s). It was impossible to set the same imaging time between TSE-DWI and EPI-DWI in clinical practice, because TSE-DWI has a much longer imaging time than EPI-DWI. We certainly thought that it was ideal to make the same imaging time and parameters as identical as possible between the two sequences, but we set those as clinically applicable sequences subject to limited imaging time. These discrepancies including a small number of b-factors will have affected the measurement of the DR, SNR, CNR, ADC and IVIM-derived parameters. The longer the acquisition time, the greater the sensitivity to motion artefacts; this consequently influences the quantitative parameters. However, these artefacts were not visually observed in our study. In clinical practice, it is necessary to reduce the acquisition time to adopt TSE-DWI for assessment of diseased patients. A comparative study of image quality or IVIM-derived parameters calculated by the fitting approach using large numbers of b-values between TSE-DWI acquired in a shorter imaging time and EPI-DWI including readout-segmented multi-shot sequence that could reduce artefacts is needed in a future.

In conclusion, TSE-DWI can improve image quality in comparison with EPI-DWI in head and neck imaging. Therefore, TSE-DWI could be a good alternative to EPI-DWI for patients sensitive to distortion such as those with metallic implants. However, it could not be recommended using both TSE-DWI and EPI-DWI in follow-up studies in the same patients, because TSE-DWI might show higher ADC, D and f values compared with EPI-DWI.