Introduction

Rectal cancer is a frequent malignancy in both men and women, accounting for 40,290 new cases in the USA in 2012 [1]. Despite the efforts made to introduce screening programs, most patients are diagnosed at a locally advanced stage of the disease (T3-T4, Nx, Mx). Total mesorectal excision (TME) combined with preoperative radiation therapy and concurrent chemotherapy (CRT) is the current standard for locally advanced rectal cancer (LARC) [2, 3]. TME is associated with significant morbidity and functional complications; evolving conservative treatment strategies are being developed for patients with early rectal cancer at diagnosis and patients with significant/complete tumour regression after preoperative CRT. A further conservative strategy has been to adopt a ‘wait and see’ policy, omitting surgery when a complete clinical response is obtained after preoperative CRT. This strategy has the advantage of reducing morbidity and provides a ‘true’ organ-sparing approach, considering that sphincter preservation without adequate function has questionable benefit. A major challenge for organ-sparing treatments after good pathological response in LARC is the selection of patients suitable for such an approach. Morphological MRI evaluation (mMRI) is considered the best available tool for LARC staging, allowing an accurate evaluation of the disease extent, up to, beyond and over the mesorectal fascia, and of the lymph node involvement [4]. However, there are some limitations in depicting changes after CRT through morphological MRI alone. A favourable tumour response may not correspond to an appreciable tumour size reduction. Moreover, it is difficult to differentiate between post-radiation fibrosis and fibrosis containing viable tumour remnants. Studies are therefore focusing on the potential added benefit of functional and/or quantitative methods of MR image evaluation. Dynamic contrast-enhanced MRI (DCE-MRI) has proven useful in detecting residual tumour after CRT [59]: leaky capillaries associated with tumour neovascularization determine a rapid gadolinium uptake and an early wash-out, which correlate with the vascular endothelial growth factor (VEGF) expression and micro vessel density (MVD) [1012]. DCE-MRI evaluation can be assessed through time intensity curve (TIC) analysis performed using three different approaches: qualitative DCE-MRI, quantitative DCE-MRI and semi-quantitative DCE-MRI based on pattern analysis. Qualitative DCE-MRI involves the visual inspection and classification of TIC in accordance with the scheme proposed by Daniel et al. [7]. The shape of the curve can summarize the angiogenic characteristics of the tumour, which is typically divided into two phases: wash-in (contrast agent enhancement) and wash-out (contrast agent excretion). A high or low slope in the first phase or wash-in phase (WI) suggests high or low angiogenesis, respectively [13]. With regard to the second phase or wash-out phase (WO), a steep decline (Type 3) corresponds to higher vascular permeability. A continuous rise (Type 1 or persistent enhancement) represents lower permeability. A stable curve (Type 2 or plateau) also depicts higher permeability when its signal intensity does not reach a plateau [14]. The main drawback of the qualitative approach is the operator dependence. Quantitative DCE-MRI can be achieved by applying an adequate pharmacokinetic model to the TIC [15]. This approach can yield parameters with a direct physiological interpretation. However, quantitative DCE-MRI involves many critical issues: accurate measurement of the arterial input function, accurate quantification of gadolinium, choice of an adequate model, and accurate estimation of tracer kinetic parameters [16]. Semi-quantitative DCE-MRI consists of the calculation of opportune TIC shape descriptors providing summary information, roughly related to the pathophysiology of the cancer. This approach could be more robust in clinical settings, since many critical issues of quantitative analysis are attenuated. However, semi-quantitative parameters do not have a direct interpretation in terms of physiological characteristics [17, 18]. Previous DCE-MRI studies have investigated shape descriptors for non-invasive response assessment in various malignancies, including rectal cancer [12, 13, 17, 18]. These initial investigations have focused on several outcome variables such as response, correlation with angiogenesis markers, and overall presence of tumour response, by classifying patients empirically into ‘responder’ and ‘not responder’ based on T stage or clinical down-staging. However, patients with complete tumour regression based on tumour regression grade (TRG) represent a distinct subset with a particularly favourable outcome, thus suggesting that further therapy could be tailored according to tumour staging after CRT. Moreover, individualizing a single parameter, easily translatable in clinical practice, through which one can discriminate responder from non-responder patients after CRT still remains a challenge.

The aim of this study was to investigate dynamic contrast-enhanced MRI (DCE-MRI) with the purpose of discriminating responders from non-responders after CRT for LARC and to investigate several shape parameters for the time-intensity curve (TIC) in order to identify the best parameters combination.

Materials and methods

Patient selection

Seventy-four consecutive patients (22 women and 52 men) with a median age of 62 years (range 29–76 years) were enrolled in this prospective study from March 2007 to July 2013. All patients had a biopsy-proven rectal adenocarcinoma. Endorectal ultrasound, MRI of the liver and pelvis, CT of the abdomen and pelvis, and chest x-ray were used as staging procedures. Inclusion criteria were: patients with clinical T4, nodal involvement or T3 N0 with a tumour location ≤5 cm from the anal verge or a circumferential resection margin of ≤5 mm, defined by MRI. Exclusion criteria were: inability to give informed consent, previous rectal surgery, and contraindications to MRI or to MR contrast media. All patients were enrolled within the phase I-II prospective trial described in Avallone et al. [19], which was approved by the Independent Ethics Committee of our institution. All patients provided written informed consent to participate in the trial.

Neoadjuvant therapy

External radiation therapy was performed using a three-field technique (one posterior-anterior and two lateral fields). Standard fractions of 1.8 Gy/day to the reference point were given five times a week up to a total dose of 45 Gy. Two different regimens were carried out for the concomitant chemotherapy. Fifty-four patients received bi-weekly bevacizumab at 5 mg/kg plus three bi-weekly cycles of oxaliplatin at 100 mg/m2 and raltitrexed at 2.5 mg/m2 on day 1, and levo-folinic acid at 250 mg/m2, and 5-fluorouracil at 800 mg/m2 on day 2 [2, 19]. The 20 remaining subjects received capecitabine at a dose of 825 mg/m2 twice a day, 5 days a week, for 5 weeks.

MRI data acquisitions

All patients underwent DCE-MRI examination before and after CRT (90 days on average, range 86–94 days). Imaging was performed with a 1.5 T scanner (Magnetom Symphony, Siemens Medical System, Erlangen, Germany) equipped with a phased-array body coil. Patients were placed in a supine, head-first position. Mild rectal lumen distension was achieved with 60–90 mL of undiluted ferumoxil (Lumirem, Guerbet, Roissy CdG Cedex, France) suspension introduced per rectum [20]. Precontrast coronal T1w 2D turbo spin-echo images and sagittal and axial T2w 2D turbo spin-echo images of the pelvis were obtained for the mMRI. Subsequently, axial, dynamic, contrast-enhanced T1w, FLASH 3D gradient-echo images were acquired for the sqMRI and for pattern analysis. We obtained one sequence before and ten sequences, without any delay, after the intravenous (IV) injection of 2 ml/kg of a positive, gadolinium-based paramagnetic contrast medium (Gd-DOTA, Dotarem, Guerbet, Roissy CdG Cedex, France). The contrast medium was injected using Spectris Solaris® EP MR (MEDRAD Inc., Indianola, PA, USA), with a flow rate of 2 ml/s, followed by a 10-ml saline flush at the same rate. Temporal resolution was 0.58 min, corresponding to 35 s (as reported in Table 1). Total acquisition time for pre-contrast and ten post-contrast sequences was 6.4 minutes. Then, sagittal, axial and coronal post-contrast T1w 2D turbo spin-echo images, with and without fat saturation, were obtained (Table 1). The axial images were acquired without any angulation. Axial T1-w pre- and post-contrast sequences were acquired at the same position as the T2-w sequence. MRI total acquisition time was around 30 minutes.

Table 1 Pulse sequence parameters on MR studies

Image data analysis

Image assessment was performed in a single reading session for each patient by consensus of two gastrointestinal radiologists (A.P. and V.G.) with 18 years and 5 years of experience, respectively, in reading pelvic MR images. mMRI evaluation on post-CRT T2w images followed the criteria set out by Dresen et al. [21]. Lesions showing a complete disappearance of the tumour or residual tumour surrounded by a dark, intact bowel wall were classified as T0-T2. Lesions exhibiting a residual isointense mass penetrating the rectal wall and/or the mesorectal fascia and/or the surrounding organs were classified as T3-T4. Since MRI cannot distinguish between fibrosis with and without tumour [20], patients with a thickened but hypo-intense rectal wall were also categorized as T3. For morphological MRI evaluation, tumour volumes were calculated by manually tracing the tumour boundaries slice by slice on conventional T2-weighting. Whole-tumour volume was calculated by considering the total number of pixels (slice by slice) and multiplying this by pixel size in mm2. For semi-quantitative analysis of MRI evaluation, in order to take into account tumour heterogeneity, the radiologists, based on pre-contrast T1-weighted images using the T2-weighted images as a guide, manually drew regions of interests (ROIs) along contours of the tumour, covering the whole lesion with exclusion of the peripheral fat, artefact and blood vessels. Histograms of pixel data were obtained from all acquired tumour slices, and median values and standard deviation were recorded for each study. The semi-quantitative analysis was performed on ROIs previously described. For each pixel, TICs were obtained and for each TIC, eight shape descriptors (Fig. 1) were computed using the approach previously reported in another of our papers [17]: the maximum signal difference (MSD), the time to the knee between the WI and WO segments (TTP), the WI slope (WIS), the WO slope (WOS), the WI intercept (WII), the WO intercept (WOI), the WOS/WIS ratio, the WOI/WII ratio. Each descriptor's median value and standard deviation were obtained and the percentage changes between pre- and post-treatment ΔX = (Xpre - Xpost)/Xpre were calculated (X is the generic shape descriptor). Semi-quantitative analyses were carried out using the Matlab R2007a software (The Math-Works Inc., Natick, MA, USA).

Fig. 1
figure 1

Semi-quantitative analysis: illustration of the parameters calculated from the TIC

Surgical approach and evaluation of pathological response

All patients underwent total mesorectal excision after completing CRT (58 days on average, range 48–66 days). An anterior resection or an abdominal-perineal resection was performed based on the results of post-CRT restaging. The surgical specimens containing the tumour were processed and evaluated by a single pathologist who was not aware of either the clinical or the MRI findings. The circumferential resection margin was examined by sampling a 1-mm thick slice [22] and was defined as positive if the tumour residuals were found ≤1 mm from the margin. Lymph nodes were examined by the manual dissection method [11]. The mandard TRG scoring system was used [23]: TRG 1 means a complete response with absence of residual cancer and fibrosis extending through the wall. TRG 2 is the presence of residual cancer cells scattered through the fibrosis. TRG 3 corresponds to an increased number of residual cancer cells, with predominant fibrosis. TRG 4 indicates residual cancer outgrowing fibrosis. TRG 5 is the absence of regressive changes. Patients with a TRG 1 or 2 score were considered as responders, whereas the remaining patients (TRG 3, 4, or 5) were classified as non-responders [12].

Statistical analysis

To assess the influence of the two chemotherapy regimens on the results, we compared the proportion of responders with the non-responders in the two different chemotherapy groups by means of a chi-square test. The univariate analysis was carried out to evaluate the ability of single-shape descriptors to discriminate responders from non-responders. To this aim, non-parametric tests for paired data (Wilcoxon test) were used to verify differences before and after CRT. The tests were applied to each shape descriptor obtained as median value and standard deviation. The Mann-Whitney test for unpaired data was used to verify statistically significant differences in ΔX of each shape descriptor between the two groups (responders and non-responders). Moreover, receiver operating characteristic (ROC) curves were calculated using percentage change of tumour volume and percentage change for each DCE-MRI shape parameter. For each ROC, the area under the curve (AUROC) was calculated [24]. The AUROCs were compared using nonparametric methods [25] and optimal thresholds were obtained maximizing the Youden index [26]. Multivariate analysis was performed to calculate the best combination of shape descriptors to discriminate responders from non-responders. We considered the linear classifiers that can be generated from independent pairs of the eight shape descriptors. For any two paired parameters, ∆Xp1 and ΔX2 considered as percentage change, ROCs curves were computed by applying the method referenced above to the linear combination p = ∆X1 cosθ - ∆X2 sinθ, for θ varying from −180° to 180° in 1-degree increments. For each θ, the AUROC was computed by trapezoidal rule integration. The best possible linear classifier was then determined by maximizing the AUROC as a function of θ and was indicated as ∆X1 and as ∆X2 [27]. A P-value <0.05 was considered significant for all tests. All analyses were performed using Statistics Toolbox of Matlab R2007a (The Math-Works Inc., Natick, MA, USA).

Results

Population

All patients in our series had rectal adenocarcinomas, G2 graded, with an intact and complete mesorectum after total excision. Fifteen patients were pathologically classified as T0 (20.3 %), ten as T1 (13.5 %), 27 as T2 (36.5 %), 17 as T3 (23.0 %), and five as T4 (6.7 %). There were 22 patients with a TRG 1 (29.7 %), 24 with a TRG 2 (32.4 %), 21 with a TRG 3 (28.4 %), five with a TRG 4 (6.8 %), and two with a TRG 5 (2.7 %). According to morphological MRI on T2w images, ten (13.5 %) patients were staged as T0, 22 (29.7 %) as T2, 38 (51.4 %) as T3, and four (5.4 %) as T4. The Chi-square test, used to evaluate the influence of two different chemotherapy regimens, showed that the therapy regimen did not affect the proportions in responder and non-responder patients.

Univariate analysis of DCE-MRI shape descriptors

On univariate analysis, the Wilcoxon test showed statistically significant differences before and after CRT in responders for MSD and WOS median values (Table 2, Fig. 2). However, in non-responders, no statistically significant difference was observed for each one of the eight descriptors (Table 2). Statistically significant differences between responders and non-responders in the percentage change of the MSD and WOS were also obtained with the Mann-Whitney test (Table 3).

Table 2 Semi-quantitative parameters: median values of responder and non-responder patients according to TRG criteria at two time points: pre and post neo-adjuvant therapy
Fig. 2
figure 2

Box plots of ∆MSD (a) and ∆WOS (b) representing differences in distributions of responders (R) and non-responders (NR). The middle line is the median value. The inferior and superior extremes of the box correspond to the first and third quartile, respectively. The whiskers lines correspond to values within 1.5 times the interquartile range from the ends of the box. Outlier data beyond the ends of the whiskers are displayed with a + sign

Table 3 Semi-quantitative parameters: median values of percentage change (Δ) for each parameter in responder and non-responder patients according to TRG criteria

Tumour volume morphological variation showed a low correlation between response and TRG with a sensibility and specificity, respectively, of 74 % and 67 % (33 % cut-off, AUROC 0.67). For DCE-MRI descriptors, the maximum AUROC was obtained for ∆MSD and ∆WOS, 0.73 and 0.81, respectively. The optimal cut-off value in ∆MSD of −15.1 % yields 74.0 % sensitivity and 70.4 % specificity; optimal cut-off value in ∆WOS of 18.9 % gave a sensitivity of 80.4 % and a specificity of 77.8 %.

Multivariate analysis of DCE-MRI shape descriptors

Among all ROC curves computed for each pair of the eight descriptors, the combination of ∆MSD and ∆WOS (blue ROC curve in Fig. 3a) yielded the best results. A combination of ∆MSD and ∆WOS gave an improvement in the AUROC (0.899) when compared to ∆MSD and ∆WOS alone (Fig. 3a). The linear classifier maximizing AUROC for ∆MSD and ∆WOS was given as 0.7780*∆MSD + 0.6157*∆WOS (Fig. 4). We defined this linear combination as the Standardized Index of Shape (SIS). The optimal cut-off value in SIS change was −3.0 % (Youden Index). Sensitivity and specificity values were 93.5 % and 82.1 %, respectively, and positive and negative predictive values were 89.6 % and 88.5 %, respectively. The McNemar test [25] showed that SIS AUROC was significantly greater than tumour volume morphological variation AUROC (P-value <0.01), ∆MSD and ∆WOS AUROC (P-value <0.05). When the distributions of ∆MSD, ∆WOS and SIS values were compared distinguishing complete response (TRG = 1) and non-complete response (TRG ≥ 2), the AUROCs were, respectively, 0.66, 0.61 and 0.72 (Fig. 3b). However, the SIS reached better results with a sensitivity of 71.2 % and specificity of 65.4 %. A large number of patients in the complete response group are needed to evaluate a significant statistical difference and to establish the optimal threshold to differentiate patients with a complete response.

Fig. 3
figure 3

ROC analysis for ∆MSD, ∆WOS and SIS in discriminating responders from non-responders (a). ROC analysis for ∆MSD, ∆WOS and SIS in discriminating complete response from non-complete response (b)

Fig. 4
figure 4

Scatter plot of ∆WOS versus ∆MSD for all patients. Responders were indicated by a blue round symbol and non-responders by a red triangular symbol. The linear classifier 0.7780*∆MSD + 0.6157*∆WOS ≥ −3.0 %, provided a sensitivity of 93.5 %, a specificity of 82.1 %, a positive predictive value of 89.6 % and a negative predictive value of 88.5 %. Accuracy was 89.2 %

Discussion

Since histological down-staging is not always associated with tumour size reduction, mMRI alone is not considered conclusive in tumour-response evaluation after CRT in LARC [21]. The main drawback of morphological MRI is the discrimination of active tumour tissue by post-treatment fibrosis (Figs. 5 and 6). Our findings have confirmed that T2-weighted imaging volume assessment alone is not sufficiently accurate (74 % of sensibility and 67 % of specificity), not being able to discriminate between the persistence of viable tissue and the peritumoral fibrosis present after radiation therapy. Residual tumour within the fibrotic areas is frequently represented by small tumour nests, beyond the depiction level of morphological MRI [21]. According to T stage after CRT, Chen et al. [8] reported an overstaging of 38 % and an accuracy of 52 %. With regard to qualitative DCE-MRI, the main drawback remains ROI positioning, which makes the evaluation operator dependent [7]. At the same time, quantification through visual TIC inspection is not achievable, as it misses reproducible numerical data for comparison. According to our previous experience, when patients with a partial or complete response to CRT were included, we obtained a sensitivity, a specificity and an accuracy, respectively, of 79 %, 76 % and 78 %; while considering the performance of qualitative MRI evaluation in complete responders we reached a sensitivity, a specificity and an accuracy, respectively, of 94 %, 76 % and 84 % [9]. To overcome the limitations related to visual inspection alone, the calculation of opportune TIC shape descriptors characterized the semiquantitative MRI analysis focusing on relationships that link tumour viability to angiogenesis. Zhang et al. [12] demonstrated the ability of 3T DCE-MRI to discriminate rectal carcinoma from normal tissue, showing a close relationships among functional parameters, VEGF and MVD expression. Lavini et al. [28], in order to classify the pixels according to benignity and malignancy, used the following features: maximum signal difference (sometimes referred as relative enhancement), TTP, maximum slope of increase, relative final slope and initial signal. Tuncbilek et al. [29] observed that TTP, wash-in intercept, and maximum enhancement were strongly correlated to MVD. Blomqvist et al. [30] proposed a TIC piecewise linear fitting reporting wash-out intercept as the parameter that significantly differentiated between malignant and benign lesions. Another developed approach is based on quantitative ‘model-based’ investigations that involve compartmental tracer kinetic modelling [15] and pixel-by-pixel estimation of kinetic features, through nonlinear regression. The latter has been introduced to better correlate quantitative ‘model-based’ results with physiological tissue properties. Kim et al. [31] showed that the Ktrans parameter (contrast agent transfer constant between plasma space to extracellular extravascular space) had a large decrease in the mean value after CRT and this was associated with a good therapeutic response to CRT for LARC. However, being influenced by many factors, many different models are still present in the literature and their high output variability reflects a poor clinical reliability [8, 16, 18, 32]. Results of quantitative analysis to assess therapy response on 3T MR scanners are more promising, as was reported in the studies of Intven et al. [33] and Lim et al. [34]. As an alternative, Guo et al. [18] proposed a piecewise linear fitting of the TIC in evaluating pixel-by-pixel the osteosarcoma’s perfusion. Our semi-quantitative analysis employed this fitting adding the TIC normalization with the aim of eliminating the dependence on proton density, relaxivity and other equipment parameters, and to identify more suitable indexes in detecting CRT tumour response [17]. Statistically significant changes in discriminating responders from non-responders were obtained with two parameters, MSD and WOS − although an overlap was observed when each parameter was considered alone (Fig. 2). This determines a higher number of false positives and negatives. Therefore, we considered the hypothesis that a combination of descriptors could reduce this effect by optimizing sensibility, specificity and overall examination accuracy. To this aim, the linear combination of ∆MSD and ∆WOS, SIS, showed an improvement in sensitivity up to 93.5 %, with a specificity of 82.1 % with relevant gains with respect to ∆MSD (+20.1 % in sensitivity and +11.7 % in specificity) and ∆WOS considered alone (+13.1 % in sensitivity and +4.3 % in specificity). SIS improved the negative predictive value to 88.5 % and the positive predictive value to 89.6 %. Our proposed matched index showed interesting results in re-staging LARC after CRT, providing a cut-off of −3.06 %, which was able to accurately separate responders from non-responders. No similar studies are present in the literature that focus on the matching of multiple shape descriptors with the aim of obtaining a one ‘stop shop’ index that is robust, easy to apply and able to produce adequate performance when directly applied in clinical practice. The most attractive ‘ultimate challenge’ could be to boost the accuracy of MRI examinations integrating DCE-MRI with DWI. The latter is a potential biomarker able to predict response to CRT [3541], since pre-treatment apparent diffusion coefficient (ADC) and its changes correlate with down-staging. Recent studies have investigated the potential of DWI to assess therapy response in LARC: Cai et al. [38] showed that AUROC, for identification of a complete response after neoadjuvant therapy, was low for ADC (equal to 0.66 at 3T MRI and 0.64 at 1.5T MR scanner); they concluded that ADC measurements are not accurate for assessment of complete response. Genovesi et al. [39] showed that the diagnostic accuracy of the mean ADC increase, to discriminate the complete response group from the non-complete response group, was significantly higher than the mean tumour volume reduction (0.913 vs. 0.658; p = 0.022), concluding that the mean ADC increase appears to be a reliable tool for differentiating complete response from non-complete response after CRT. Wu et al. [40] have performed a meta-analysis showing that MRI is an accurate tool in predicting pathological response after preoperative therapy in patients with LARC but suggested performing MRI with 3T devices, which might be sensitive enough to identify responders; moreover it is shown that the addition of DWI to T2-weighted imaging had a non-significant improvement in sensitivity, which deserves further investigation. Song at al. [41] reported that, utilizing 3T MR systems, the increase in signal-to-noise ratio (SNR) provided by these machines offers increased DWI quality and improvements to ADC maps, they reported that adding DWI to T2WI is helpful for detecting viable tumours after neoadjuvant CRT compared with T2 weighted imaging alone or PET in patients with LARC. The findings are still controversial; therefore, future investigations are desirable. A crucial point becomes apparent: the need to improve the diagnostic accuracy of perfusion studies with DCE-MRI and DWI. Therefore, before proceeding to integrate these different techniques we focused on developing an effective DCE-MRI approach.

Fig. 5
figure 5

Patient no. 3. A case of overestimation in mMRI evaluation. Axial T2w images before (a) and after (b) CRT and sagittal T2w images before (c) and after (d) CRT. The morphological evaluation before CRT shows a heterogeneous irregular thickening along the entire rectal wall spreading into the perirectal fat (a, arrowheads). After CRT, a hypointense spiculated area with thin digitations is still clearly visible into the perirectal fat (b, arrowheads). On the sagittal scan obtained before CRT multiple irregular rectal wall thickenings are shown (c, arrowheads). After CRT, a hypointense area with thickening of the rectal wall and straining into perirectal fat (d, arrowheads) was identified as a residual tumour probably mixed with fibrous tissue

Fig. 6
figure 6

Patient no. 3, a case of correct sqMRI estimation. Axial T1w, contrast-enhanced scans obtained before (e), and after (g) CRT. Qualitative DCE-MRI based on TIC evaluation analysis obtained from the ROIs drawn before (f) and after (h) CRT. TIC confirms the perirectal infiltration visible on the MR scan (e), showing a rapid contrast intake and a fast discharge (g). After CRT there is no fast contrast uptake identified on the TIC analysis, suggesting that the hypointense tissue is mainly due to the CRT-induced fibrosis

Our data, if obtained during therapy and if supported by stronger numerical evidence, could offer several opportunities: SIS could be used to distinguish pathological complete responders from partial responders or non-responders in order to tailor neoadjuvant therapy on single subject response, to treat full responders with less extensive surgery or even to skip immediate surgery and to follow a wait-and-watch strategy through a strict follow-up program based on a complete response assessed by clinical and imaging analysis: endoscopic, DCE-MRI, DW-MRI and [18F]-fluorodeoxyglucose positron emission tomography (PET) studies. Today, PET evaluation is considered the best technique for assessing CRT response in LARC, and the most widely used in assisting physicians when planning optimal treatment. Cascini et al. [42] performed a PET evaluation based on the maximum variation of the standardized uptake value (∆SUV) optimal threshold value of −42 % and identified 13 of 15 non-responders with a specificity of 87 %. Although PET provides a tool to monitor tumour glucose metabolism, the method is less well suited to the study of tumour-associated angiogenesis or perfusion because of the ‘uncoupling’ of blood flow and glucose metabolism that has been observed to occur in human tumours [43]. Moreover, low spatial resolutions coupled with ‘respiratory artefacts’ have a considerable impact on overall evaluation, drawbacks that have less impact on image assessment of DCE-MRI analysis.

Our study has certain limitations: the intra-observer variability was not assessed, but two radiologists evaluated the images by consensus and in a single session per patient. No precautions were taken to avoid bowel movements, nor was the administration of butylscopolamine, dicyclomine, glucagon or similar drugs carried out, because a volumetric analysis was performed thus minimizing errors due to caused voxel misalignments. The semi-quantitative approach used takes inspiration from studies on breast and osteosarcoma tumours, where leaky capillaries associated with tumour neovascularization determine rapid gadolinium tissue uptake and early washout, which correlate with VEGF and MVD [1012]. However, being rapid growing, colorectal carcinomas similar to breast tumour and most malignant tumours for their biological characteristics, using their contrast kinetics on MRI can be depicted by DCE-MRI [811].

In conclusion, our study proposes a ‘surrogate angiogenic marker’, SIS, as a simple objective method, easily transferable to clinical routine and able to assess tumour response to CRT through a reproducible semi-quantitative estimation of tumour blood perfusion. SIS is the first pattern analysis MRI index suitable for establishing the response to CRT in LARC patients. We identified a well defined threshold value of SIS, able to differentiate responders (TRG 1 or 2) from non-responders (TRG 3, 4, or 5), allowing the prediction of a pathologically good response after neoadjuvant treatment. The future endpoint could be the analysis of the inter- and intra-operator variability; the development of SIS evaluation software, easy to use and able to guarantee a user-friendly experience for the operator; a comparison of SIS with the SUV obtained in PET is desirable with data integration, performed during intermediate therapy time points, leading to a tailored CRT. Lastly, it could allow the identification of a defined threshold value of SIS to differentiate pathological complete responders (TRG 1) from other responders (TRG > 2) that we could use to increase single TRG subgroups.