1 Introduction

Diffusion weighted imaging (DWI) is sensitive to water motion, and is, therefore, an important form of imaging in magnetic resonance imaging (MRI), because it provides quantitative values [1]. Moreover, DWI has been widely used in clinical practice to detect early cerebral ischemia [2], detect tumors in the whole body [3], and visualize brain white matter [4]. Also, the apparent diffusion coefficient (ADC) is used to differentiate or determine the effects of treatments on oncogenic lesions and contributes considerably to qualitative diagnoses [5,6,7]. The ADC is a quantitative index in DWI that can differentiate malignant from benign tumors. For example, the reported threshold ADC that can distinguish a benign mass from malignant breast cancer is 0.95 ± 0.10 × 10–3 mm2/s [8]. Some malignant tumors show an ADC of less than 1.0 × 10–3 mm2/s. Therefore, it is important to quantify the ADC of malignant tumors for oncological purposes. The ADC is obviously clinically useful, but it is influenced by various factors including scan parameters, slew rates, pulse sequence design, magnetic field, and the MRI system, causing variances in the ADC even when the same object is repeatedly measured. A study of the relationship between scan parameters and ADC has found that a long TR and a short TE are effective for precise ADC measurements [9]. Even when scan parameters are fixed, differences among MRI scanners slightly influence the ADC [10]. In fact, the ADC of 12 healthy volunteers measured using 10 MRI systems significantly differed [11]. One study of seven healthy volunteers found significant inter-scanner differences in the ADC of gray matter [12]. These findings indicated that the quantifiability and reproducibility of ADC measurements requires improvement by determining the dependence of these factors before starting measurements. However, individual differences might affect the results of imaging humans. Therefore, a constant standard ADC evaluation phantom is needed.

The motion of water molecules in living tissue includes not only free diffusion but also restricted diffusion due to several factors (e.g., internal physiological perfusion, cell membrane structure, and other tissue compartments). Therefore, a standard phantom to investigate the precision of ADC measurement requires restricted diffusion. There have been several published reports on phantoms for evaluating ADCs. Among them, the ice-water phantom [13, 14] has the ADC values of approximately 1.1 × 10–3 mm2/s and has been reported to be useful as the ADC evaluation phantom. Also, the Quantitative Imaging Biomarkers Alliance (QIBA)/National Institutes of Health (NIH)/NIST isotropic diffusion phantom [15] has been reported to be useful as a reference phantom that can evaluate the repeatability and reproducibility of ADC values [16, 17]. In a multi-center study conducted with ice-water phantom [13], the standardization of the DWI protocol improved the reproducibility of ADC measurements and reported that the source of error in multi-center studies was spatial ADC non-uniformity. Also, the QIBA profile provides small inter-system, inter-site, and temporal variations that allow for quantitative evaluation. For the above reasons, the use of the QIBA/NIH/NIST isotropic diffusion phantom and measurement according to the QIBA profile is recommended for the quantitative evaluation of ADC values in multi-center studies. The ice-water phantom or the QIBA/NIH/NIST isotropic diffusion phantom, a quality assurance tool for precise ADC measurements, refers to the commissioning of the MRI machine and fitting the algorithm at 0 °C. However, the correct ADC measurements at 0 °C do not imply correct room temperature or tissue in vivo ADC measurements. On the other hand, PEG has the advantage of changing the ADC values by adjusting the PEG concentrations, although we need to observe to the temperature more, as reported in a previous study [18]. Sucrose solutions [1, 19, 20] and PAG (Polyacrylamide) phantom [21] have also the advantage of changing the ADC by adjusting the concentration when fabricating the phantom. Controlling ADC with their concentration suggests the possibility of simulating ADC of malignant tumors (ADC < 1.0 × 10–3 mm2/s). The main advantage of PEG is that they are ready to use, commercially available, without the need of in-house fabrication process. Furthermore, the PEG is inexpensive and thus affordable for any institution.

We aimed to examine the possibility that PEG phantoms can simulate ADCs of malignant tumors and the effectiveness of PEG phantoms using three 3.0 T MRI systems. In particular, the correlations between PEG concentrations and ADC values, the validation of ADC measurement precision, and the stability and reproducibility of PEG phantom were verified.

2 Materials and Methods

2.1 Preparation of PEG Phantom

Figure 1 shows a schematic view of the PEG phantom. The PEG (MP Biomedicals Inc., Santa Ana, CA, USA) used herein has a molecular weight of 8000. Plastic vials (45 × 80 mm) containing 0-, 0.625-, 1.25-, 2.5-, 5-, 10- and 20-mM PEG were placed in a 170 × 110 mm plastic container and fixed using agarose (FUJIFILM Wako Pure Chemical Corp., Osaka, Japan) dissolved in Milli-Q water to form a gel. The 0 mM indicates PEG-free water only.

Fig. 1
figure 1

Schematic view of PEG phantom. Vials containing 0, 0.625, 1.250, 2.5, 5, 10, 20 mM PEG are immobilized in agarose gel within a plastic container. Vial containing 0 mM PEG (water) is centered in phantom. Above that is vial containing 0.625 mM PEG, then vials containing 1.25–20 mM PEG are arranged clockwise around center vial

2.2 ADC Measured Using PEG Phantom

The ADC was measured using the PEG phantom with MAGNETOM Verio (Siemens Healthcare, Erlangen, Germany), Signa HDxt (GE Healthcare, Waukesha, WI, USA) and Ingenia (Philips Healthcare, Best, The Netherlands) 3.0-T MRI systems (Table 1). We defined the MAGNETOM Verio (Siemens Healthcare) as Scanner A, Signa HDxt (GE Healthcare) as Scanner B, and Ingenia (Philips Healthcare) as Scanner C. The surface temperature of the PEG phantom was fixed at 22.0 °C by placing it in the center of a head coil for several hours, and maintained at 22.0 °C during image acquisition. Table 2 shows the DWI scan parameters of the three MRI systems that are normally used for acquiring clinical images of the head. Therefore, we set the b-factor to 1000 s/mm2. The scan parameters, such as field of view (FOV) and echo time (TE), were modified to consider the effects of DWI distortion. In each MRI system, the TE was selected to be the shortest when b factor = 1000 s/mm2. The slice position of the DWI was set to the isocenter that was the center of the PEG vial. Regions of interest (ROI) were set to acquire the signal intensity from the DWI of each PEG (0–20 mM) using in-house software of the three MRI systems (Scanner A, B, and C). The determined ROI shape was circular and the ROI size was > 70% (> 816 mm2) of the area of the PEG vial. The mean ADC of five scans at each PEG concentration were calculated using Eq. (1) [11] built in Microsoft® Excel®, where b0 is 0 s/mm2, b1 is 1000 s/mm2, SI(b0) is the signal intensity at b = 0 s/mm2, and SI(b1) is the signal intensity at b = 1000 s/mm2.

$${\text{ADC}}\left[ {{\text{mm}}^{2} /{\text{s}}} \right] = \ln \left[ {{\text{SI}}\left( {b_{1} } \right)/{\text{SI}}\left( {b_{0} } \right)} \right]/\left( {b_{1} - b_{0} } \right){.}$$
(1)
Table 1 Characteristics of 3.0 T MRI systems
Table 2 Scan parameters of DWI using 3.0 T MRI systems

Correlations between PEG concentrations and ADC, and among the three MRI systems at each PEG concentration were statistically evaluated using Pearson correlation analysis by SPSS version 20.0 (IBM Corp., Armonk, NY, USA). Values with P < 0.001 were taken as being significantly different. The mean ADC (ADCmean) was calculated from the ADC measured by each of the MRI systems. An ADCmean of 0 mM was defined as the control, and those of the other PEG concentrations were statistically analyzed using t tests, with P < 0.05 indicating a significant difference. In addition, the linear equation and correlation coefficient between PEG concentration and ADC value were determined for each MRI system.

2.3 Validation of ADC Measurement Precision

Measurement errors in ADC evaluation are important to understand. To verify the setting errors of PEG phantoms by operators, we re-set the PEG phantoms at each of five image acquisitions. Analytical errors in ROI settings were verified over five repeated ROI analyses of each image. The surface temperature of the PEG phantom was fixed at 22.0 °C by placing it in the center of a head coil for several hours, and maintained at 22.0 °C during image acquisition. The scan parameters and ROI settings for the acquisition of these data were the same as in those described above. The mean ± standard deviation (SD) of the ADC at each PEG concentration was determined, then, the coefficient of variation (%CV) was calculated from these values as follows:

$${\text{\% CV}} = \frac{{{\text{Mean}}}}{{{\text{SD}}}} \times 100.$$
(2)

2.4 Change Over TIME and Reproducibility of PEG Phantom

Images were acquired from the primary PEG phantom (P0) prepared as described in the Methods subsection entitled, “Preparation of PEG Phantoms” using scanner C at 3 months later (P3) and 6 months later (P6) to study changes in the PEG phantom over time. Six months after the primary phantom (P0) was created, we also similarly created another PEG phantom (NP6). We verified the reproducibility of this PEG phantom by comparing the ADC with that of P0. The scan parameters, ROI settings, and ADC measurement methods for acquiring these data were the same as in the subsection, “ADC measured using PEG phantom”. The surface temperature of the PEG phantom was fixed at 22.0 °C by placing it in the center of a head coil for several hours, and maintained at 22.0 °C during image acquisition. The ADC at month 0 served as the control. Differences in ADC over time within the primary PEG phantom (DifferenceChange over time) and between the primary and the new phantom (DifferencePri. vs. New.) were calculated using Eqs. (3.1), (3.2), (4.1), and (4.2), respectively:

$${\text{Difference}}_{{{\text{change}}\;{\text{over}}\;{\text{time}}}} = {\text{ADC}}_{{{\text{Pri}}.{\text{month}}0 }} - {\text{ADC}}_{{{\text{Pri}}.{\text{months}} 3 {\text{or}} 6 }}$$
(3.1)
$${\text{Difference}}_{{{\text{change}} {\text{over}} {\text{time}}}} \left[ \% \right] = \left( {\frac{{{\text{ADC}}_{{{\text{Pri}}.{\text{month}}0 }} - {\text{ADC}}_{{{\text{Pri}}.{\text{months}}\;3\;{\text{or}}\;6 }} }}{{{\text{ADC}}_{{{\text{Pri}}{.} {\text{month}}\;0 }} }}} \right) \times 100$$
(3.2)
$${\text{Difference}}_{{{\text{Pri}}.\;{\text{vs}}\;{\text{New}}.}} = {\text{ADC}}_{{{\text{Pri}}.\;{\text{month}}\;0}} - {\text{ADC}}_{{{\text{New}}.\;{\text{month}}\;6}}$$
(4.1)
$${\text{Difference}}_{{{\text{Pri}}.\;{\text{vs}}\;{\text{New}}.}} \left[ \% \right] = \left( {\frac{{{\text{ADC}}_{{{\text{Pri}}.\;{\text{month}}\;0 }} - {\text{ADC}}_{{{\text{New}}\;{\text{month}}\;6}} }}{{{\text{ADC}}_{{{\text{Pri}}.\;{\text{month}}\;0 }} }}} \right) \times 100.$$
(4.2)

3 Results

3.1 Correlations Between PEG Concentrations and ADC

Figure 2 shows correlations between PEG concentrations and ADC determined by the three MRI systems. The ADC decreased as PEG concentrations increased according to all three systems. The correlation coefficients (r) between PEG concentrations and the ADC of all three MRI systems revealed a high negative correlation (Scanner A; r = − 0.949, Scanner B; r = − 0.939, Scanner C; r = − 0.961), with P < 0.001 vs. 0 mM PEG. The correlation coefficients (r) among the three MRI systems revealed a close positive correlation (Scanner B; r = 0.996, Scanner C; r = 0.997), with P < 0.001 vs. Scanner A.

Fig. 2
figure 2

Correlations between PEG concentrations and ADC determined using three MRI systems. Data are presented as means ± SD. Graph shows decreasing ADC with increasing PEG concentrations. Correlation coefficients (r) between PEG concentrations and ADC of three MRI systems show high negative correlation (r ≈ − 1.000, **P < 0.001; Pearson correlation analysis) and those among three MRI systems show high positive correlation (r ≈ 1.000, *P < 0.001, Pearson correlation analysis). ADC apparent diffusion coefficients, MRI magnetic resonance imaging, PEG polyethylene glycol

Figure 3 shows the means ± SD of the mean ADC (ADCmean) values determined from the three MRI systems. The ADCmean at 0, 0.625, 1.25, 2.5, 5, 10 and 20 mM were 2.08 ± 0.12 × 10–3, 2.03 ± 0.14 × 10–3, 1.93 ± 0.14 × 10–3, 1.77 ± 0.06 × 10–3, 1.35 ± 0.05 × 10–3 mm2/s, 0.95 ± 0.05 × 10–3 and 0.63 ± 0.07 × 10–3 mm2/s, respectively. The ADCmean significantly differed at ≥ 2.5 mM PEG in all three systems (P < 0.05, t tests).

Fig. 3
figure 3

ADCmean determined from three MRI systems. Data are presented as means ± SD. *P < 0.05 versus 0 mM PEG (t tests). ADC apparent diffusion coefficients, PEG polyethylene glycol

The results of the linear equation between PEG concentration and ADC value for each MRI system are as follows: Scanner A is y = − 0.0754x + 1.897 (r = 0.949), Scanner B is y = − 0.0831x + 2.062 (r = 0.939), and Scanner C is y = − 0.0652x + 1.873 (r = 0.961), where y is the ADC values, x is the PEG concentration, and r is the correlation coefficient.

3.2 Validation of ADC Measurement Precision

Table 3 shows that the mean (± SD) ADC and %CV of five images acquired using the three MRI systems decreased with increasing PEG concentrations. The ADC ranged from 0.57 ± 0.01 (20 mM, Scanner A) to 2.20 ± 0.03 (0 mM, Scanner B), and the %CV ranged from 1.20 (0 mM, Scanner B) to 4.62 (20 mM, Scanner B).

Table 3 ADC and %CV of three MRI systems

3.3 Stability and Reproducibility of PEG Phantom

We defined the time when the primary PEG phantom was prepared as month 0 (P0). We then compared the ADC of this phantom at 3 (P3) and 6 (P6) months later and also with the new PEG phantom at month 6 (NP6). Figure 4 and Table 4 show that the stability measured as differences in ADC over time (DifferenceChange over time) and the reproducibility measured as differences in ADC between the primary and the new phantom (DifferencePri.vs.New) ranged from 0.03 × 10–3 to 0.20 × 10–3 mm2/s (1.60–9.03%), and from − 0.02 × 10–3 to 0.19 × 10–3 mm2/s (− 0.95 to 8.19%), respectively.

Fig. 4
figure 4

Comparison of changes in ADC over time (stability) and of phantoms prepared 6 months apart (reproducibility). ADC apparent diffusion coefficient, PEG polyethylene glycol, P0, P3, P6 primary phantom at month 0, and at 3 and 6 months thereafter, NP6 a new PEG phantom prepared at 6 months after primary PEG phantom (month 0; P0)

Table 4 Differences in ADC at various PEG concentrations over time and between primary and new phantoms

4 Discussion

The main idea points of our study are to evaluate the correlation between PEG concentration and ADC, the validation of ADC measurement precision, the changes over time of PEG phantom, and the reproducibility of PEG phantom using three MRI systems under general conditions.

This study evaluated PEG phantom in different MRI systems in terms of PEG concentration, %CV, stability, and reproducibility. Moreover, we concluded that PEG concentration was determined for the simulation of ADCs of a malignant tumor. Furthermore, the ADCmean significantly differed at ≥ 2.5 mM PEG in all three systems (P < 0.05, t tests). Previous studies of ADC measurements have used phantoms containing agarose [22], polyacrylamide [23], nickel-doped agarose/sucrose gels [1], PEG [18, 24], PVP [25, 26] and ice-water [20]. One of these studies [25], obtained ADC ranging from 0.5 to 1.1 × 10–3 mm2/s by adjusting the PVP concentration between 0 and 25% w/w. Furthermore, ADC decreased as PVP concentrations increased. Similarly, ADC decreased in phantoms containing increasing concentrations of agar gels and sucrose solutions [14]. Others have also found that ADC decreases in phantoms with increasing PEG concentrations [18, 24]. The authors of these reports noted that these phantoms might serve as ADC standard phantoms. Our results were comparable to their findings. That is, PEG concentrations can be adjusted to control ADC. Therefore, the ADC of clinical lesions can be simulated by adjusting PEG concentrations in a phantom. Notably, the ADC is < 1.0 × 10–3 mm2/s at PEG concentrations ≥ 10 mM, whereas that of malignant tumors is often ≤ 1.0 × 10–3 mm2/s [27]. The novel finding that PEG concentrations can express ADC ≤ 1.0 × 10–3 mm2/s is clinically useful.

In our study, DWI data were obtained by re-setting the phantom on each scan using the clinical scan parameters routinely used in each MR system. A previous multi-center study on DWI examined the imaging position and repeatability of ADCs in each MR system using a standardized protocol [13]. This report shows that the standard deviation of bore center ADCs is less than 2%, day-to-day repeatability is within 4.5%, intra-exam repeatability at the phantom center is within 1%, and inter-site reproducibility of ADC at magnet isocenter should be within 3%, vendor and system-specific spatial non-uniformity ADC bias should be more than 10% off-center. Ogura et al. [10] reported a %CV of ADC for six MRI systems of 3.32%, and a %CV for ten repeated measurements of one individual within 1 day of 1.72%. Our ADC values were obtained from PEG phantoms in three MR systems using a routine scanning protocol in clinical practice. Our results showed an SD of 0.01–0.06 and a %CV of 1.20–4.06%, which are approximately comparable to previous studies [10, 13]. However, our data may include a variety of error factors (e.g., system, operator, analyzer, phantom setting, scan protocol, etc.). Although, these variabilities may not be enough compared to the values of previous studies, we found that our PEG phantom was able to obtain ADC values within %CV of 5%. The stability measured as differences in ADC over time (DifferenceChange over time) was within 10% and the reproducibility measured as differences in ADC between the primary and the new phantom (DifferencePri.vs.New) was within 9%. Based on the results that showed a maximum variation of 6.38% at a PEG concentration of 0 mM (water), we believe that the differences in ADC values may be a factor of measurement error rather than chemical modification of PEG. However, it was shown that the ADC values may be obtained within 10% of the remeasurement within 6 months or when the phantoms were reproduced.

The study limitations are as follows. We maintained the temperature of the PEG phantom simply by placing it in a head coil for several hours. In addition, we carefully verified that the surface temperature of the PEG phantom was fixed at 22.0 °C while acquiring images. Although we did not evaluate temperature dependence, ADC values might be stably and reproducibly measured by the PEG phantom under conditions of a constant surface temperature. Chenevert et al. described an ice-water phantom with a known diffusion coefficient and noted that ADC are temperature dependent across multiple vendors [14]. Moreover, differences among vendors remained unresolved in that study compared with the present findings. Although the correlation coefficients among the three MRI systems revealed a close positive correlation (Fig. 2), the difference in ADC values at the PEG concentration of 1.25 mM between Scanner A and Scanner B was the maximum (Table 3). We did not compare the PEG phantom with phantoms containing other materials, the influences of scan parameters of the slew rates of the three MRI systems, as this was not the aim of the study. However, the assessment of quantitative values is very important and several methods (1 imaging procedure, 2 + procedures, 3 multiple studies) have been proposed for evaluating quantitative values in multi-centers [28]. In our study, we consider the “1 imaging procedure” is the most important subject of these methods. Sullivan et al. [28] distinguish between “statistical methods based on knowing the true value” and “methods where the reference standard provides a measurement that contains errors”. Our approach corresponds to the latter approach (i.e., the methods where the reference standard provides a measurement that contains errors), and we evaluated the bias, repeatability, and reproducibility of ADC measurements using PEG phantoms in this study. On the other hand, we have determined the linearity and correlation of ADC measurements using the PEG phantom with each MRI system, however, we have not yet been able to obtain the true value of ADC with the PEG phantom. Nevertheless, the ADC value of water (0 mM PEG) at 22 °C could be considered as a reference for the true value of the phantom of this study. A phantom for ADC measurement that provides stable and reproducible ADC values is important to verify these influences (e.g., QIBA/NIH/NIST isotropic diffusion phantom). In addition, we believe that the ADC values must be measured according to the QIBA profile. QIBA phantom is an excellent means for precise measurements in ADC values, however these measurements are performed in 0 °C [16]. There is a need to examine also the ADC measurements in higher temperature (room temperate or tissue in vivo), as it is performed with other phantoms too such as PEG, PAG, sucrose, etc. Therefore, the PEG phantom and the other solutions can be used alternatively for ADC measurements.

5 Conclusion

In this study, we found that PEG concentrations ≥ 10 mM are required to simulate the ADCs of malignant tumors (ADC < 1.0 × 10–3 mm2/s). We also showed that the ADC value is easily controlled by adjusting PEG concentrations, and can be stably measured using our PEG phantom for at least 6 months. The PEG phantom can easily and stably simulate the ADC of malignant tumors with high reproducibility.