Total mesorectal excision combined with preoperative radiation therapy and chemotherapy (pCRT) is the current standard for locally advanced rectal cancer (LARC) [1,2,3]. Long-course CRT has been extensively applied, and encouraging results derive from this approach in terms of local control with a high percentage of tumor regression up to a significant rate of complete response [1,2,3]. However, short-course radiotherapy (SCR) is known to be a valuable therapeutic option in patients with LARC. A recent meta-analysis [4] reported that SCR with immediate surgery is as effective as long CRT with deferred surgery in terms of overall and disease-free survival rates, local and distant control and toxicity. Also, short-course radiotherapy with delayed surgery (SCRDS) (after 4–8 weeks), optional therapy described for patients with locally advanced tumors who are not fit for CRT, leads to similar results in terms of negative margin resection percentage and satisfactory results about the downstaging and pathological response rate compared to traditional pCRT [5,6,7,8,9,10,11,12,13].

The use of new imaging modalities to make individual assessments of therapy response could be of great clinical value to adjust subsequent strategies tailored for each patient. Such strategies range from a tailored surgical approach, to administering an adjuvant regimen, or even to a wait-and-see policy without surgery for patients with high surgical risks [14, 15].

A positive tumor response does not necessarily require a significant tumor size reduction in morphological MRI [16]; there is difficulty to differentiate necrosis, fibrotic tissue and viable residual tumor tissue within treated areas [16, 17]. Several studies focused their attention on the potential added benefit of functional quantitative parameters derived by MR image [17,18,19,20]. Dynamic contrast-enhanced MRI (DCE-MRI) has demonstrated promise to detect residual tumor after pre-surgery CRT [17,18,19,20,21]. Previous studies have investigated functional parameters derived from DCE-MRI data in rectal cancer [18,19,20,21] such as the Standardized Index of Shape proposed by Petrillo et al. [18] as a simple semiquantitative parameter capable to differentiate pathological significant and complete response after CRT in LARC and after SCRDS [22]. Moreover, in various oncology fields, researchers have recommended the use of diffusion-weighted imaging (DWI) to assess treatment response [23,24,25,26,27,28,29,30]. DWI provides functional information on the tissues’ microstructure by means of the evaluation of water proton mobility differences [23, 24]. Water diffusion characteristics depend on cell density, vascularity, viscosity of the extracellular fluid and cell membrane integrity. By quantifying these properties by means of the individual apparent diffusion coefficient (ADC) using a monoexponential model to analyze DWI data, it can be used as an imaging biomarker to reflect biologic tumor changes and to monitor and predict treatment response [25, 26]. Moreover, using a biexponential model to analyze DWI data, information on both diffusion and perfusion tissue properties derived from intravoxel incoherent motion method (IVIM) can be obtained: the pure tissue coefficient (Dt) that describes water macroscopic motion in the cellular interstitial space, the pseudo-diffusion coefficient (Dp) that describes blood microscopic motion in the vessels, and the perfusion fraction (fp) that describes the proportion of the two different motions [27,28,29,30].

Also, the conventional DWI model is based on the assumption that water diffusion within a voxel has a single component and follows a Gaussian behavior where water molecules diffuse without any restriction [31]. However, due to the presence of microstructures (i.e., two tissue types or components within one voxel, and organelles and cell membranes), random motion or diffusion of thermally agitated water molecules within biologic tissues exhibits non-Gaussian behavior [32]. Jensen and co-workers in 2005 proposed a non-Gaussian diffusion model called diffusion kurtosis imaging (DKI) [32]. This model includes the kurtosis coefficient (K), that measures the deviation of tissue diffusion from a Gaussian model, and the diffusion coefficient (D) with the correction of non-Gaussian bias.

Aim of this study is to determine the diagnostic performance of MR imaging for the assessment of tumor response after SCRDS in patients with LARC using Standardized Index of Shape (SIS) obtained by DCE-MRI, using ADC-, IVIM- and DKI-derived parameters obtained by DW-MRI. A multivariate analysis of all functional MR-derived parameters extracted by DCE- and DW-MRI using a neural network was also evaluated.

Materials and methods

Patient selection

Thirty-four patients with a median age of 67 years (range 48–83 years) who refused or were considered unfit for chemo-radiation and planned for neoadjuvant short-course radiotherapy were evaluated in this retrospective study, from May 2011 to December 2016.

Patient characteristics are described in Table 1.

Table 1 Patient characteristics and histopathological findings

All patients had a biopsy-proven rectal adenocarcinoma. Endorectal ultrasonography, MRI of pelvis and computed tomography (CT) scan of chest, abdomen and pelvis were used as staging examinations. Patients who had T2 with local lymph node involvement, T3 rectal cancer with or without local lymph node involvement and patient with T4 were included. Exclusion criteria were: inability to give informed consent, previous rectal surgery and contraindications to MRI or to MR contrast media administration (see Fig. 1). Patients were included in the study in accordance with the approved guidelines of ethical committee of National Cancer Institute of Naples and gave their written informed consent.

Fig. 1
figure 1

Flowchart showing how patients were included in the study

Radiotherapy

All patients underwent dose-planning CT in prone position. After an online CT virtual simulation, CT datasets were transferred to a dedicated treatment planning system through a DICOM network, and an individualized clinical target volume (CTV) was done, including the gross tumor volume with margins (2–3 cm depending upon tumor position, defined by MRI imaging), the mesorectum and regional lymph nodes depending upon tumor location. We contoured the small bowel, the femoral heads and the bladder as critical organs on all CT slices of every patient, and we evaluated the relative dose–volume histogram on the treatment planning console. Three-dimensional plans for 3D or intensity-modulated radiation therapy (IMRT) radiotherapy were generated for a dual-energy, 6–20 MV X-ray linear accelerator (Clinac 2100, Varian Medical Systems, Palo Alto, CA) or for a 6–15 MV X-ray linear accelerator (Elekta Agility, Elekta Instrument AB Stockholm, Stockholm, Sweden), both equipped with multileaf collimators (MLC). Patients were scheduled using a 3-field or IMRT treatment arrangement to include the planning target volume within the 95% isodose, and a dose of 25 Gy in 5 fractions over 1 week was prescribed to the ICRU 62 intersection point. Surgery was performed 5(±1) weeks on average after the end of radiotherapy.

MRI data acquisitions

Each patient underwent MR studies before and after SCR: baseline, on average 23.8 days (9 days of standard deviation) before starting radiotherapy, and delayed, on average 61.0 days (11 days of standard deviation) after the end of SCR.

DCE-MRI was performed with a 1.5T scanner (MAGNETOM Symphony, Siemens Healthcare, Erlangen, Germany) and phased-array body coil. Pre-contrast coronal T1-weighted 2D turbo spin-echo (TSE) images, sagittal and axial T2w 2D TSE images were acquired. Axial DW images were obtained with a single-shot spin-echo diffusion-weighted echo-planar pulse sequence (SE-DW-EPI) at b values of 0, 50, 100, 150, 300, 600, 800 s/mm2. Axial dynamic contrast-enhanced T1-weighted FLASH 3D gradient-echo images were obtained: one sequence before and ten sequences after intraveneous injection of 0.1 mmol/kg of a positive, gadolinium-based paramagnetic contrast agent (Gd-DOTA, Dotarem, Guerbet, Roissy-CdG-Cedex, France) at 2 mL/s of flow rate, followed by a 10-mL saline flush at the same rate. Sagittal, axial and coronal post-contrast T1-weighted 2D TSE images with and without fat saturation were also acquired. Table 2 reports MR sequence parameters. In order to reduce respiratory artefacts, each patient received bowel preparation and antispasmodic medication.

Table 2 Pulse sequence parameters on MR studies

MRI image data analysis

Image assessment was performed in a single reading session for each patient by consensus of two gastrointestinal radiologists with 25 years and 10 years of experience in reading pelvic MR images. For the image analysis, the radiologists blinded to the examinations and clinical results.

To take into account tumor heterogeneity, the radiologists, based on pre-contrast T1-weighted images using the T2-weighted images as a guide [33], manually defined regions of interests (ROIs) along the contours of the tumor to obtain a DCE-MRI volume of interest (VOI) for each study, covering the whole lesion with exclusion of peripheral fat, artefacts and blood vessels. Also, for DW-MRI the radiologists, based on diffusion-weighted imaging with the highest b value, manually defined ROIs along the contours of the tumor to obtain a DW-MRI VOI for each study.

For each MR descriptor, the mean and the standard deviation values on the VOI were registered.

Percentage change between pre- and post-treatment was calculated as ΔX = (Xpre − Xpost)/Xpre. (X is the generic shape descriptor.)

No image registration was applied to our data acquired. We took care to exclude from the analysis the slices where motion artefacts were visible. Moreover, a volumetric analysis for each parameter was performed, thus minimizing errors due to voxel misalignments.

DCE-MRI features

In order to perform SIS analysis, an OsiriX plugin has been developed by the authors [34]. Considering the segmented VOI, the maximum signal difference (MSD) and washout slope (WOS) were calculated as reported in [35]. Then for SIS analysis were evaluated the percentage change in MSD [ΔMSD = (MSD1 − MSD2)/MSD1× 100], of WOS [ΔWOS = (WOS1 − WOS2)/WOS1× 100] and their combination as described previously [18]. Standardized Index of Shape was given by the following linear combination: 0.7780*ΔMSD + 0.6157*ΔWOS.

DWI features

For each voxel, 9 features were extracted from DWI data using the monoexponential model, the diffusion kurtosis imaging model, the intravoxel incoherent motion using conventional biexponential fitting method and variable projection algorithm.

DWI signal decay is most commonly analyzed using the monoexponential model [23, 24]:

$${\text{ADC}} = \frac{{\ln \left( {\frac{{S_{0} }}{{S_{b} }}} \right)}}{b}$$
(1)

where Sb is the MRI signal intensity with diffusion weighting b, S0 is the non-diffusion-weighted signal intensity, and ADC is the apparent diffusion coefficient.

For a voxel with a large vascular fraction, the MRI data decay can deviate from a monoexponential form, in particular showing a fast decay in the range of low b values generated by the IVIM effect [23, 24]. Thus, in addition to the monoexponential model, a conventional biexponential model using Levenberg–Marquardt fitting method was used to estimate the IVIM-related parameters of pseudo-diffusivity (Dp indicated also with D*), perfusion fraction (fp indicated also with f) and tissue diffusivity (Dt):

$$\frac{{S_{b} }}{{S_{\text{o}} }} = f_{\text{p}} \cdot \exp \left( { - b \cdot D_{\text{p}} } \right) + \left( {1 - f_{\text{p}} } \right) \cdot \exp \left( { - b \cdot D_{\text{t}} } \right)$$
(2)

Moreover, diffusion kurtosis imaging was included in the analysis in order to obtain the final fitted images [mean of diffusion coefficient (MD) and mean of diffusional kurtosis (MK)].

Multi-b DW images were obtained by voxel-wise fitting using the diffusion kurtosis signal decay Eq. (3) by a two-variable linear least-squares algorithm as used in previous study [32]:

$$S(b) = S_{0} \exp \left( { - b \cdot D + \frac{1}{6}b^{2} \cdot D^{2} \cdot K} \right)$$
(3)

In this equation, D is a corrected diffusion coefficient and K is the excess diffusion kurtosis coefficient. K describes the degree that molecular motion deviates from the perfect Gaussian distribution. When K is equal to 0, Eq. (3) evolves into a conventional monoexponential Eq. (1):

The difference between D and ADC is that D is a corrected form of ADC for use in non-Gaussian circumstances.

The parameters of conventional DWI (ADC), IVIM (fp, Dt, Dp) using CBFM and DKI (MK and MD) were obtained from the multi-b DWI data with all measured b values using the prototype post-processing software Body Diffusion Toolbox (Siemens Healthcare, Erlangen, Germany).

Moreover, for each voxel on DW, data diffusion and perfusion information were also obtained modeling DW-data using the IVIM model and using VARPRO algorithm for parameters estimation [36].

Rearranging Eq. (2) the \(S(b)/S_{0} . - e^{{ - bD_{t} }}\) is the product of f and a nonlinear function of Dt and Dp:

$$f\left( {D_{\text{p}} ;D_{\text{t}} ; \, b} \right) = S(b)/S_{0} . - e^{{ - bD_{\text{t}} }} = f\left( {e^{{ - bD_{\text{p}} }} - e^{{ - bD_{\text{t}} }} } \right)$$
(3)

Letting \(f\left( {D_{\text{p}} ;D_{\text{t}} ; \, b} \right)\) the cost functional becomes:

$$S(b)/S_{0} . - e^{{ - bD_{\text{t}} }} = \left\| {y - f(D_{\text{p}} ;D_{\text{t}} ;b) \, f} \right\|_{2} \,$$
(4)

Therefore, a separable nonlinear least square algorithm known as variable projection (VARPRO) can be used to calculate the diffusion parameters. If we knew the estimate of the nonlinear parameters Dp and Dt, the estimate of the linear parameter f could be obtained by solving a linear least square problem:

$$f = e^{{ - bD_{\text{t}} }} + y + f \, (D_{\text{p}} ;D_{\text{t}} ;b)^{ + }$$
(5)

where \(f(D_{\text{p}} ;D_{\text{t}} ;b)^{ + }\) is the Moore–Penrose generalized inverse of \(f(D_{\text{p}} ;D_{\text{t}} ;b) \,\). Therefore, a new cost functional can be constructed:

$$S(b)/S_{0} . - e^{{ - bD_{\text{t}} }} = \left\| {y - f \, (D_{\text{p}} ;D_{\text{t}} ;b) \, f(D_{\text{p}} ;D_{\text{t}} ;b)^{ + } y} \right\|$$
(5)

An exhaustive description of the algorithm is beyond the scope of the present paper, and the interested reader is referred to [37]. This analysis was performed using MATLAB R2007a (The MathWorks Inc., Natick, MA).

Per each MR descriptor, the mean and the standard deviation values were registered.

Evaluation of pathologic response

The pathologic response assessment was performed according to previously published recommendations [38, 39]. Briefly, surgical specimens containing the tumor were evaluated and scored according to tumor regression grade (TRG), as proposed by Mandard et al. [39], by an expert pathologist who was not aware of MRI findings. Patients with a TRG 1 or 2 score were considered as responders, whereas the remaining patients (TRG 3, 4 or 5) were classified as non-responders. Patients with TRG 1 were considered with pathological complete response, while patients with TRG 2–5 were considered with incomplete pathological response [18].

Statistical analysis

Mann–Whitney nonparametric test was performed to assess statistically significant differences between responder versus non-responder patients and between pathological complete responders versus incomplete responders. Receiver operating characteristic (ROC) curves were also used to evaluate the diagnostic performance for each parameter. Area under ROC curve (AUC) was calculated, and optimal thresholds were obtained maximizing the Youden index. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and accuracy were performed considering optimal cutoff values.

Moreover, three different neural networks were trained and used as classifiers to test the performance of a multivariate analysis of all MR-derived parameters. We used a supervised linear classifier (linear discriminant analysis), a supervised linear support vector machine and a supervised decision tree classifier. For each training set, a leave-one-out cross-validation was performed, and the best findings were registered. An exhaustive description of each classifier is beyond the scope of the present paper, and the interested reader is referred to [40,41,42,43].

Statistical analysis was performed using the mean and standard deviation values pre-treatment to assess the capability to predict pathological response before therapy, while the mean and standard deviation values post-treatment and the percentage changes were investigated to assess the SCRDS response in the preoperative phase to guide the surgery versus more conservative treatments. For each comparisons, the gold standard is the tumor regression grade obtained on surgical specimens containing the tumor.

A p value < 0.05 was considered significant for all tests. All analyses were performed using Statistics Toolbox of MATLAB R2007a (The MathWorks Inc., Natick, MA).

Results

All patients in our series had rectal adenocarcinomas. Three patients were pathologically classified as T0, 6 as T1, 20 as T2 and 5 as T3. There were 7 patients with a TRG 1, 8 with a TRG 2, 11 with a TRG 3, 8 with a TRG 4 and none with a TRG 5. Therefore, 15 patients were classified as responders and 19 as not responders by TRG. Seven patients had pathological complete response.

Considering the high number of parameters investigated, we reported exclusively the findings for parameters that resulted statistically significant.

Mean and standard deviation values of pre-treatment CBFM Dp and mean value of VARPRO Dp pre-treatment showed statistically significant differences to predict pathological complete response. (p value at Mann–Whitney test was 0.05, 0.03 and 0.008, respectively.)

Instead, Mann–Whitney test showed statistically significant differences exclusively for SIS percentage change between responder and non-responder patients (p value << 0.001) and between pathological complete responders and incomplete responders after SCRDS (p value << 0.001).

The best results to predict pathological complete response were obtained by CBFM Dp mean value pre-treatment, VARPRO Fp mean value pre-treatment and for VARPRO Dp mean value pre-treatment (Fig. 2a–c) with area under ROC of 0.71, 0.68 and 0.84 (Table 3), respectively.

Fig. 2
figure 2

ROC analysis for the best MR-derived parameters to predict pathological complete response and responder patients of the SCRDS

Table 3 Diagnostic performance of the best MR-derived parameters to predict pathological complete response and responder patients of the SCRDS

The best results to predict responders were obtained by CBFM Fp mean value pre-treatment, CBM Dp standard deviation value pre-treatment and for VARPRO Dp mean value pre-treatment (Fig. 2d–f) with area under ROC of 0.63, 0.63 and 0.65 (Table 3), respectively.

The best results to detect after treatment complete pathological response were obtained by SIS, the percentage change in MK standard deviation and by percentage change in VARPRO Dp mean value (Fig. 3a–c) with area under ROC of 0.89, 0.70 and 0.72 (Table 4), respectively.

Fig. 3
figure 3

ROC analysis of the best MR-derived parameters to assess pathological complete response and to detect responder patients after SCRDS

Table 4 Diagnostic performance of the best MR-derived parameters to assess pathological complete response and to detect responder patients after SCRDS

The best results to differentiate responders vs. non-responders after treatment were obtained by SIS, the percentage change in MK standard deviation and percentage change in VARPRO Dp mean value (Fig. 3d–f) with area under ROC of 0.94, 0.60 and 0.65 (Table 4), respectively.

Table 5 reports the diagnostic performance of tested classifiers to predict pathological response before treatment and to assess pathological response after treatment. The best results (bold values) were obtained by a supervised decision tree with an accuracy of 88.2% to predict pathological complete response using VARPRO Dp pre-treatment mean value, ADC standard deviation value pre-treatment (Fig. 4a and b). However, these results showed a worse accuracy compared at the use of CBFM Dp mean pre-treatment, VARPRO Fp mean pre-treatment VARPRO Dp mean pre-treatment mean value alone. Instead, to predict responder patients the best results (bold values) were obtained by a supervised decision tree with an accuracy of 64.7% using CBFM IVIM Fp pre-treatment mean and standard deviation value, MK standard deviation value pre-treatment, ADC standard deviation value pre-treatment (Fig. 4c and d). However, these results showed a worse accuracy compared at the use of CBFM Fp mean pre-treatment CBFM Dp std pre-VARPRO Dp mean pre-treatment.

Table 5 Performance of multivariate analysis using DWI-derived parameters and neural networks
Fig. 4
figure 4

ROC curve and the best decision tree to predict pathological complete response using VARPRO Dp pre-treatment mean value, ADC standard deviation value pre-treatment (Fig. 4a and b) and the ROC curve and the best decision tree to predict responder patients using CBFM IVIM Fp pre-treatment mean and standard deviation values, MK standard deviation value pre-treatment, ADC standard deviation value pre-treatment (Fig. 3c and d)

The best results (bold values) to assess pathological complete response after SCRDS were obtained by a supervised decision tree with an accuracy of 85.3% using VARPRO Dp mean value post-treatment, ADC standard deviation value pre-treatment, MD standard deviation value post-treatment (Fig. 5a and b). Instead, low accuracy (55.9%) was reached in the assessment of responder patients after SCRDS considering the best results (bold values) by a supervised decision tree using VARPRO Dp mean value post-treatment, VARPRO D mean value pre- and post-treatment (Fig. 5c and d).

Fig. 5
figure 5

Figure shows the ROC curve and the best decision tree to assess pathological complete response after SCRDS using VARPRO Dp mean value post-treatment, ADC standard deviation value pre-treatment, MD standard deviation value post-treatment (Fig. 4a and b) and the ROC curve and the best decision tree to assess responder patients after SCRDS using VARPRO Dp mean value post-treatment, VARPRO D mean value pre- and post-treatment (Fig. 4c and d)

We reported representative MR images for a responder patient (TRG = 2). Figure 6 shows SIS analysis, ΔSIS = 24.79%. Figure 7 shows ADC- and IVIM-derived parameter maps pre-treatment (a, c, e, g) and post-treatment (b, d, f, h): ΔADC = − 87.25%, ΔD = − 99.36%, ΔDp= 8.42%, Δfp= 38.67%. Figure 8 shows DKI-derived parameters maps pre-treatment (a, c) and post-treatment (b, d): ΔMD = − 45.79%, ΔMK = 55.29%.

Fig. 6
figure 6

Figure shows SIS analysis for a responder patients: TRG 2 and ΔSIS = 24.79%

Fig. 7
figure 7

Figure shows ADC- and IVIM-derived parameters maps pre-treatment (A, C, E, G) and post-treatment (B, D, F, H) for a responder patients (TRG 2): ADC (A) and (B), D (C) and (D), Dp (E) and (F), fp (G) and (H)

Fig. 8
figure 8

Figure shows DKI-derived parameters maps pre-treatment (A), (C) and post-treatment (B), (D) for a responder patients (TRG 2): MD (A) and (B), MK (C) and (D)

Figure 9 shows an image representative of the correlation of tumor on MR T2-weighted images with pathology specimen.

Fig. 9
figure 9

A representative image that shows the correlation of tumor on MR T2-weighted images with the pathology specimens

Discussion

Recently, there has been growing interest in functional imaging modalities to increase diagnostic accuracy for therapy response assessment. These imaging modalities reflect the microstructural and metabolic proprieties of the tumor, allowing the assessment of treatment-induced changes before morphological changes are visible. DCE-MRI and DWI have emerged as powerful tools to predict and to assess neoadjuvant therapy response for rectal cancer. In fact, DCE- and DW-MR imaging after preoperative CRT was shown to be more valuable than morphologic MR imaging to recognize pathological significant and complete response and to discriminate viable residual tumor.

The objective of this study was to determine the diagnostic performance of DCE and DW imaging for the assessment of tumor response after SCRDS in patients with LARC comparing Standardized Index of Shape (SIS) obtained by DCE-MRI and using ADC-, intravoxel incoherent motion- and DKI-derived parameters obtained from DW-MRI and their combinations. To the best of our knowledge, there are no studies in the literature focused on a comparison of DCE-MRI-, IVIM- and DKI-derived parameters and their combination to predict and to assess therapy response in locally advanced rectal cancer after SCRDS.

There are many studies that evaluate the single modality, DCE or DWI, in preoperative long CRT assessment [18,19,20,21, 44,45,46]. In our previous studies [18], we demonstrated the ability of DCE-MRI using the Standardized Index of Shape to discriminate significant responders and pathological complete tumor response after preoperative CRT in LARC with a good accuracy also when compared to FDG-PET examinations [47]. Several studies demonstrated the role of diffusion-weighted imaging in LARC for early and late assessment of therapy response [44,45,46], and several studies evaluated the use of IVIM to elaborate DW-data in different kind of tumors [27,28,29,30, 48]. Choi et al. [44] demonstrated that ADC values have significant difference between pathological complete response and non-pathological complete response groups, demonstrating the utility of the ADC value as a quantitative objective biomarker in rectal cancer. Moreover, there are some studies with the aim to assess tumor response after SCR using metabolic change evaluations by FDG-PET with contrasting results [49,50,51,52]. In another study [19], we assessed SIS- and IVIM-derived parameters in LARC after SCRDS demonstrating that SIS obtained the best parameter to discriminate responders by non-responders (sensitivity 94%, specificity 84%, accuracy 89%, cutoff value = − 7.8%) and the best diagnostic performance also to discriminate pCR (sensitivity 86%, specificity 89%, accuracy 89%, cutoff value = 68.2%) and that no accuracy increase was obtained combining linearly each possible parameters couple or combining all functional MR-derived parameters compared to SIS alone.

Only few researches have evaluated the role of DKI for assessing treatment response of neoadjuvant chemoradiotherapy in LARC [53, 54]. Yu et al. [53] evaluated the feasibility and value of DKI in assessing treatment response to neoadjuvant CRT in LARC. In their study, histogram analysis of imaging indices (MD, MK, ADC) was performed and evaluated on 41 patients. The results by Yu et al. [53] suggested that the percentage change in MD provides high diagnostic performance for assessing treatment response. For pre-CRT parameters, histogram indices medians and 10th percentiles of pre-MD correlated statistically with TRG scores, providing a good performance for evaluating treatment to neoadjuvant CRT. Hu et al. assessed the value of DKI in evaluating pathological complete response to neoadjuvant CRT compared to conventional DWI [54] using a proper DKI scan protocol with b values of 0, 700, 1400 and 2000. They enrolled 56 patients with LARC that underwent MRI before and after CRT. They showed that among 56 rectal lesions (pCR, n = 14; non-pCR, n = 42), the MK pre- and MK post-values were much lower for pCR patients than for non-pCR patients. The ADC post and ADC percentage change was significantly higher for pCR patients than for non-pCR. The MK post showed relatively high sensitivity and high specificity in comparison with other parameters. Our findings showed that mean and standard deviation values pre-treatment of CBFM Dp and mean value of VARPRO Dp pre-treatment showed statistically significant differences between pathological complete responders and incomplete responders (p value at Mann–Whitney test was 0.05, 0.03 and 0.008, respectively), while exclusively SIS percentage change showed significant differences between responder and non-responder patients after SCRDS (p value << 0.001) and to assess pathological complete response after SCRDS (p value << 0.001).

The best results to predict pathological complete response were obtained by VARPRO Fp mean value pre-treatment with area under ROC of 0.84, a sensitivity of 96.4%, a specificity of 71.4%, a positive predictive value of 92.9%, a negative predictive value of 83.3% and an accuracy of 91.2%. The best results to assess post-treatment complete pathological response were obtained by SIS with an area under ROC of 0.89, a sensitivity of 85.7%, a specificity of 92.6%, a positive predictive value of 75.0%, a negative predictive value of 96.1% and an accuracy of 91.2%. Moreover, the best results to differentiate post-treatment responders by non-responders were obtained by SIS with an area under ROC of 0.94, a sensitivity of 93.3%, a specificity of 84.2%, a positive predictive value of 82.4%, a negative predictive value of 94.1% and an accuracy of 88.2%.

These results take the conclusion that DWI-derived parameters including pseudo-diffusion coefficient and perfusion fraction have a great potential to predict therapy response and then patient outcome before treatment in LARC, while SIS is the best parameter to assess pathological response after treatment to guide surgery.

Promising results were obtained using a decision tree tested with all ADC-, IVIM- and DKI-derived parameters. A high accuracy was reached for the assessment of pathological complete response after SCRDS in LARC. The best results were obtained with a supervised decision tree with an accuracy of 85.3% to assess pathological complete response after SCRDS using VARPRO Dp mean value post-treatment, ADC standard deviation value pre-treatment, MD standard deviation value post-treatment. However, SIS alone showed a major accuracy. In any case, these results could be interesting when contrast agent administration was contraindicate and DCE-MRI cannot performed.

Some potential limitations deserve a special consideration: two radiologists evaluated the MR images by consensus and in a single session per patient so that the intraobserver variability was not assessed. The calculation of DKI parameters may be suboptimal for the use of an acquisition protocol with 800 m/s2 as a maximum b value. A more extensive patient panel would probably strengthen the power of this study in SCR therapy assessment. A reproducibility analysis of MR-derived parameters was not performed; however, the use of mean values for each DCE and DW parameters extracted by volume of interest allows to obtain more robust measures.

Conclusions

In conclusion, perfusion fraction derived by IVIM model using a more sophisticated fitting approach compared to convention biexponential fitting using Levenberg–Marquardt had high accuracy to predict positive therapy response after SCR. On the other side, SIS is an encouraging DCE-MRI angiogenic biomarker to assess preoperative treatment response after SCR with delayed surgery, and it permits to discriminate pathological complete response allowing direct surgery for tailored and conservative treatment. Furthermore, an important prognostic role was obtained by perfusion fraction mean value pre-treatment and by a decision tree composed by VARPRO Dp mean value post-treatment, ADC standard deviation value pre-treatment, MD standard deviation value post-treatment in order to assess pathological complete response.