Introduction

Magnetic resonance imaging (MRI) has an undisputed role in the evaluation of fistula-in-ano owing to a sensitivity of up to 100 % and a specificity of 86 % for the depiction of a fistula track [1, 2]. In addition, MRI has a sensitivity of 96 % and a specificity of 97 % in the diagnosis of abscesses [1, 2]. The association of MRI with examination under anaesthesia (EUA) has been shown to have an accuracy of 100 % (95 % IC: 88 %-100 %) for the diagnosis of perianal complications in patients with Crohn’s disease [3, 4]. Moreover, preoperative MRI helps decrease the recurrence rate of anal fistula by 75 % and its use correlates with a better outcome [5].

Perianal fistula and abscess are common complications of Crohn’s disease and occur in 30 to 50 % of Crohn’s disease patients during their lifetime [6]. Crohn’s disease patients have repeated MRI examinations for the evaluation of the perineum and anal canal and many studies have suggested that MRI can be used for the assessment of disease activity and follow-up after treatment [79].

Current MRI protocols commonly used to investigate perianal fistulas uniformly include fat-suppressed T2-weighted turbo spin-echo (TSE) and, at a lesser degree, gadolinium chelate-enhanced T1-weighted MR sequences [10, 11]. Fat suppressed T2-weighted TSE MRI is very accurate at depicting a fistula track but some reports have emphasized the difficulty in distinguishing between actual abscesses that require surgical drainage and inflammatory masses that are not drainable and require adequate medical anti-inflammatory treatment. The adjunct of gadolinium chelate-enhanced T1-weighted MR images is often needed as a problem solving tool [1, 5, 1214]. DWMRI has recently been used successfully in the field of rectal and perirectal inflammation as well as in that of Crohn’s disease [12, 13, 15]. Surprisingly, only two studies have evaluated the potential role of DWMRI in the evaluation of patients with perirectal inflammation [9, 15]. However, the main limitation of these two studies is the lack of surgical correlation.

The goal of our study was to retrospectively evaluate the capabilities of DWMRI in detecting fistula-in-ano, assessing fistula activity and in distinguishing between an abscess and an uncomplicated inflammatory mass, using fat-suppressed T2-weighted MR imaging, clinical examination, surgery and follow-up as reference.

Material and methods

Patients

This study included MRI examinations of 24 patients who were referred to our department from April 2012 to August 2013 for a preoperative MRI examination before surgical management of a fistula-in-ano with a suspected coexisting pelvic abscess. There were 18 men and 6 women with a median age of 35 years (Q1 = 26.8; Q3 = 42.5) [range: 20 - 73 years]. Seventeen patients had Crohn’s disease; of which 12 were under immunosuppressive therapy when MRI was performed. Two patients had ulcerative colitis, one patient had colorectal anastomosis after colorectal cancer resection, one had type II diabetes and three had no underlying disease. This study was approved by our Institutional Review Board and informed consent was waived.

MRI acquisition

All MRI examinations were performed with the same protocol, using a 1.5-T clinical MR unit (Magnetom Avanto, VB15 software version, Siemens Healthcare, Erlangen, Germany) with 9 receiver channels, using one anterior torso phased-array coil with six channels and a posterior spine cluster with three channels; the patient was in supine position. Gradient strength of the magnet was 45-mT/m with a maximal gradient slope of 200-mT/m/s using super quantum gradients. No specific bowel preparation was used before MRI examination and no antispasmodic agents were administrated to the patients. For all patients, the MRI protocol included frequency-selective fat-suppressed T2-weighted TSE MRI in the axial, coronal and sagittal planes and DWMRI in the axial plane. The MRI protocol is detailed in Table 1. Axial ADC maps were created using three b-values (0, 600 and 1000 s/mm2) with a mono-exponential algorithm.

Table 1 MR Imaging parameters

MRI analysis

All MRI examinations were evaluated retrospectively by two independent, board-certified abdominal radiologists with 5- and 20-years of experience in reading abdomen and pelvis MRI examinations. They were blind to clinical history and to the results of surgery, blood tests, endoscopic examinations and follow-up MRI examinations. MRI examinations where evaluated on a picture archiving and communication system (PACS) viewing station (Directview®, 11.3 sp1 version, Carestream Health Inc., Rochester, NY, USA) using a standardized form created for the study including all the information further described hereafter.

Qualitative analysis

Three different reading sessions were performed with sessions being separated by two week intervals in order to avoid recall bias. Each reading session was performed blind to the results of the others. During the first reading session the radiologists analyzed a set that consisted of the DWMRI sequence alone (“Set 1”). During the second reading session the radiologists analyzed the T2-weighted TSE MR images alone (“Set 2”). During the third reading session the radiologists analyzed the two previous sets together (“Set 1” and “Set 2,” hereafter referred to as “Set 3”).

For each MR imaging set the two radiologists recorded the number of visible fistulas-in-ano and graded their conspicuity semi-quantitatively according to a subjective 3-point scale, from 0 to 2; if there were primary and/or secondary fistula their location and topography was recorded according to the anal clock. Conspicuity was defined as the ease by which the radiologist could clearly distinguish the fistula tract along its entire length (0 = not visible, 1 = difficult to depict and 2 = easily depicted). Internal and external openings were not recorded. The presence of an abscess was suspected when there was a loculated pelvic collection or a pooling of fluid with high signal intensity on T2-weighted TSE MR images or on DWMR images obtained at b = 0.

Quantitative analysis

After qualitative analysis “Set 3” was reviewed by the two radiologists working in consensus. Any loculated collection (i. e., area with non-parallel borders) with a high signal intensity and larger than 20 mm2 was measured for each patient at the slice level where the largest dimensions were found (Fig. 1). The corresponding surface and signal intensity on the fat-suppressed T2-weighted TSE MR images were tabulated. This value was normalized using the signal of the quadriceps muscle as a reference organ. Then a region of interest (ROI) was placed on the diffusion MR images (b = 0) to cover the largest area of the suspected region and copied to the ADC map corresponding to this area and the lowest value of ADC was recorded (Fig. 1). The smallest ROI was 20 mm2; ROI sizes are reported in Table 2.

Fig. 1
figure 1

32-year-old woman with diabetes presenting with transsphincteric fistula and large perianal abscess. a. T2-weighted TSE MR image (TR/TE = 6180/112) in the axial plane shows an area of high signal intensity (arrow). b. ADC map in the axial plane. The ADC value of the pelvic perianal collection is: 0.586 × 10-3 mm2/s, corresponding to an actual perianal abscess

Table 2 Quantitative MR imaging findings in 14 patients with abscess and 10 patients with inflammatory mass

Then one of the readers manually placed ellipsoidal ROIs on the fistula tracks using the DWMR image obtained at b = 0 and the ROI was copied on the ADC map (Fig. 2). An ADC value and ROI surface were then tabulated.

Fig. 2
figure 2

28-year-old man with Crohn’s disease, transsphincteric fistula at 6 hours with a horseshoe perianal collection. a. The fistula track is visible on T2-weighted TSE MR image in the axial plane (arrow). b. On diffusion-weighted images in the axial plane (b = 1000), only the track and the collection have a high signal intensity. c. A ROI is traced on the diffusion MR image (b = 1000) and copied on the ADC map. ADC value did not correlate with MRI activity score (Van Assche classification)

Finally the two radiologists quantified pelvic inflammation according to the MRI classification as described by Van Assche et al. [16].

Standard of reference

Surgery was performed under general anaesthesia in all patients and included visual inspection, manual exploration and a methylene blue test after catheterization of the primary orifice and drainage of pelvic abscess. Extrasphincteric fistulas were sectioned whereas transsphincteric and intersphincteric fistulas were treated using a draining seton. Bacteriological examination was performed in 17 patients because of the presence of pus in the fistula and an histopathological examination was performed in 4 patients. All MR images and reports were available to the surgeon in the operating room. Diagnosis of an abscess was made when a purulent mass was found in the corresponding suspected area during MRI.

For all patients fistulas were classified by the two radiologists and the surgeon in consensus using the classification of Parks et al. [17].

All patients had blood tests including a blood count and a C-reactive protein count. All patient (n = 19) with inflammatory bowel disease were examined by a gastroenterologist specialized in inflammatory bowel diseases who evaluated disease activity and introduced an immunosuppressive therapy or increased the dose of immunosuppressive treatment already given in cases of suspected active disease.

All patients had a clinical follow-up of at least 6 months and 18 had a new MRI examination within 6 months after surgery. If MRI showed an extension that was not found at the time of surgery a second intervention was performed and if the extension remained unfound it was considered a false positive for MRI. In cases of unfound collection the radiologist performed an ultrasound examination during intervention for guiding the surgeon. When a fistula was external and considered to be inaccessible to surgical examination the fistula was considered a false-positive finding of the MRI if it was not found during the follow-up MRI.

Statistical analysis

Statistical analysis was performed using the following software: SAS, version 9.2, SAS Institute, Cary, NC; R, version 2.8; GraphPad Prism, version 5.03, GraphPad softwares, Inc, R Foundation, http://www.r-project.org/ Descriptive statistics were calculated for all variables obtained with the two sequences and reported as medians, first quartiles (Q1), third quartiles (Q3) and ranges.

The Mann-Whitney U test was used to compare the ADC values, the signal intensities and the surfaces areas of high signal intensity on T2-weighted TSE MR images between patients with abscesses and those with inflammatory mass. The Wilcoxon-signed rank test was used to compare the conspicuity of the fistula between T2-weighted TSE MRI and DWMRI. A logarithmic fitting method was used to calculate the correlation coefficient (R2) between ADC values in the main fistula track and the MRI activity score. Linear regression analysis was used to calculate the correlation coefficient R2 between signal intensity in the main fistula track on T2-weighted TSE MR images and the MRI activity score.

A McNemar test was used to compare the number of fistulas depicted on each MRI set. A Fisher exact test was used to search for differences in sensitivity between the three MRI sets. A P value of less than 0.05 was considered significant.

Receiver operating characteristic (ROC) analysis was performed to determine the area under the ROC curve and the optimal ADC cut-off value with a corresponding sensitivity, specificity and positive and negative predictive value.

Agreement in scoring with the 3-point scale between the two readers was assessed with weighted Cohen κ (kappa) statistics. Degrees of agreement were categorized as follows: κ values of 0.00 - 0.20 were considered to indicate poor agreement; κ values of 0.21 - 0.40, fair agreement; κ values of 0.41 - 0.60, moderate agreement; κ values of 0.61 - 0.80, good agreement; κ values of 0.81 - 0.99, excellent agreement; and, κ value of 1, perfect agreement [18].

Results

Classification of fistulas according to the classification system of Parks et al. is reported in Table 3. The median time interval between MRI and surgery was 1.6 days (Q1 = 1; Q2 = 2.25; range: 0-8 days). We did not evaluate the ability of DWMRI to classify fistula according to the classification of Parks et al. because of well-established high degrees of accuracy for T2-weighted TSE MRI especially in evaluating the relationships between the fistulas and the sphincter complex. Moreover, DWMRI is performed in only one plane and we assumed that T2-weighted TSE MR would outperform it.

Table 3 Fistula type in 24 patients with fistula-in-ano according to Park et al.

A loculated collection larger than 20 mm2 and of high signal intensity on T2-weighted TSE MR images was found in all cases.

Quantitative evaluation

Table 4 shows results of the quantitative analysis. Thirty-one fistulas were depicted on the T2-weighted TSE MR images, 34 via DWMRI and 34 were deemed present using the standard of reference, corresponding to a sensitivity of 91.2 % [95 % CI: 76 %-98 %] for T2-weighted TSE MRI and 100 % [95 % CI: 90 %-100 %] for DWMRI but the difference was not significant (P = 0.239). Concordant findings between DWMRI, T2-weighted TSE MRI and surgery regarding location of the fistulas according to the anal clock were found in 32 fistulas (32/34; 94 %) and discordant findings in the remaining two (2/34; 6 %). In these 2 cases the fistula was depicted via DWMRI and not on fat-suppressed T2-weighted TSE MR images. However, all fistulas initially missed via T2-weighted TSE MRI and seen on DWMRI were surgically proven and were visible via the T2-weighted TSE MRI in retrospect. Agreement between the two independent readers for the number of fistulas was good to excellent with kappa values ranging between 0.65 and 1.

Table 4 Number of fistulas depicted by each MR imaging set and corresponding sensitivity
Fig. 3
figure 3

Box and whisker plots of ADC values, which differ significantly between abscess and inflammatory mass. Boxes stretch across the interquartile range (IR), i.e., from lower quartile (Q1) to upper quartile (Q3); whiskers show smallest data point that is greater than (Q1×1.5\IR) and largest data point that is smaller than (Q3×1.5\IR). The horizontal line through each box represents the median value. ADC values of abscesses overlap with those of inflammatory mass in only one case. Lower ADC values are associated with a perianal abscess.

ADC values in suspected pelvic collections were significantly lower in patients with an abscess (n = 14) than in patients with an inflammatory mass (n = 10; P = 0.714 × 10-6; Table 2; Fig. 3). The area under the ROC curve was 0.971 and the optimal cut-off ADC value was 1.186 × 10-3 mm2/s. This optimal ADC cut-off value yielded a sensitivity of 100 % [95 % CI: 77 %-100 %], a specificity of 90 % [95 % CI: 66 %-100 %], a positive predictive value of 93 % [95 % CI: 82.8 %-100 %] and a negative predictive value of 90 % [95 % CI: 78 %-100 %] for the diagnosis of perianal abscess (Fig. 4). No differences in surface area values of high signal intensity were found between abscesses and inflammatory masses on T2-weighted TSE MRI (P = 0.285; Fig. 5). Similarly, no differences in signal intensity were found between abscesses and inflammatory masses on T2-weighted TSE MRI (P = 0.259; Table 2).

Fig. 4
figure 4

Graph shows ROC curves for ADC values in the differentiation between an abscess and an inflammatory mass. Area under the curve is 0.971

Fig. 5
figure 5

41-year-old man with Crohn’s disease, transsphincteric fistula and a large perianal collection. a. The collection is highly suggestive of abscess on T2-weighted TSE MR image in the axial plane. b. On ADC map, ADC value is 1.844×10-3 mm2/s, above the cut-off value for discriminating between inflammatory mass and abscess. Surgical examination confirmed the diagnosis of inflammatory mass

The Van Assche classification median value was 14 (Q1 = 10.5, Q3 = 16.5; range: 4-20). Neither ADC values nor values of signal intensity on T2-weighted MR images in the fistula track correlated with the Van Assche classification, with a correlation coefficient R2 of 0.1084 for ADC and R2 of 0.0031 for signal intensity on T2-weighted MR images (Figs. 6 and 7).

Fig. 6
figure 6

Diagram shows correlation between ADC values in fistula tracks and MRI activity score (Van Assche classification) with a logarithmic fitting method (R2 = 0.1084)

Fig. 7
figure 7

Diagram shows correlation between T2 values in fistula tracks on fat-suppressed T2 TSE weighted MR images and MRI activity score with a linear regression (R2 = 0.0031)

Qualitative evaluation

Conspicuity of the fistulas was significantly greater via DWMRI than T2-weighted TSE MRI for the two observers (P = 0.003 and P = 0.0007, respectively), with fair agreement between the two observers using a T2-weighted TSE MRI sequence (kappa = 0.355) and moderate agreement with DWMRI (kappa = 0.747).

Discussion

We found that DWMRI shows high degrees of sensitivity and specificity for diagnosis of a perianal abscess and helps discriminate between an abscess and an inflammatory mass (100 % sensitivity and 90 % specificity using a ADC threshold value of 1.186 × 10-3 mm2/s) and may avoid gadolinium-chelate administration. Moreover, conspicuity of a fistula-in-ano is greater with DWMRI than with T2-weighted TSE MRI. Another result of our study is that ADC values better correlate with disease activity than does signal intensity on T2-weighted MR images, although correlation remains weak for both sequences.

Accuracy of pelvic MRI for the preoperative staging of fistula-in-ano has already been evaluated in large studies [3, 4]. The basic protocol for MRI evaluation of fistula-in-ano uniformly includes fat-suppressed T2-weighted TSE MR in three planes including a strict axial plane, a sagittal plane and a plane parallel to the anal canal; the latter is of great value for depicting fistula tracks, perianal collections and sphincter involvement [19]. The added value of gadolinium chelate-enhanced T1-weighted MRI has been investigated [19]. In this regard, researchers have shown that intravenous administration of gadolinium-chelate is helpful for the diagnosis of pelvic collections and helps discriminate between abscesses and inflammatory masses [19]. However, these advantages are obtained at the penalty of an increased MR examination time and extra cost. Moreover, it has been shown that gadolinium-chelate-enhanced T1-weighted MRI may overestimate the number of active lesions or the degree of lesion activity, especially when persisting quiescent fistulas are present [10]. Occasionally, quiescent fistula tracks may show gadolinium-chelate uptake and may thus be erroneously considered as active lesions [10].

The sensitivity of DWMRI in the depiction of malignant or inflammatory lesions in the abdomen has been well demonstrated in several studies [2023]. It is assumed that ADC values in abscesses is lower than than in normal tissues because the presence of pus reduces free diffusion of water molecules [24].

The depiction of a perianal abscess in patients with inflammatory bowel diseases is a critical issue because it may have substantive impact on the treatment option. In this regard MRI has a pivotal role because it helps determine which patient should have antibiotics or surgical drainage and those who require medical immunosuppressive therapy [5]. The results of our study suggest that MRI can be used to discriminate between abscesses and an inflammatory mass. Although an improvement in the diagnostic capabilities of MRI has been shown using gadolinium-chelate, this is at the penalty of an overestimation of pelvic lesions and the need for an intravenous injection with its potential complications [10]. Our study suggests that DWMRI is highly accurate in differentiating between abscess and inflammatory mass and does not overestimate the number of lesions present. The preoperative depiction of inflammatory mass in very helpful for the surgeon who will not be able to catheterize the inflammatory tracks and insert a draining seton. Moreover, surgical excision of an inflammatory mass may be aggressive whereas medical treatment is sufficient. In addition, the acquisition time of DWMRI is similar to those reported in the literature for a gadolinium-chelate enhanced T1-weighted spoiled gradient echo MR sequence. DWMRI may thus be used instead of gadolinium-chelate enhanced T1-weighted MRI, particularly in patients with contra-indications to intravenous administration of gadolinium-chelate.

Of interest in our study, we found that a subset of patients with an abscess had a very low ADC value (i. e., <1.0 × 10-3 mm2/s). In these patients the diagnosis of a perianal abscess was easy without any overlap with the ADC values of patients with an inflammatory mass. In some cases, the signal of the collections may be heterogeneous and the ADC values range between a value around 1.100 × 10-3 mm2/s and 2.300 × 10-3 mm2/s. We assume that high ADC values are due to internal heterogeneity of the abscess, which may contain gas-polluting ADC measurement. This is the reason why we decided to consider the region of the collection with the lowest ADC value instead of the ADC value of the whole collection.

Our results show that the use of DWMRI results in better conspicuity of the fistula tracks by comparison with T2-weighted TSE MRI. Conversely, in our study DWMRI did not show more fistula tracks than T2-weighted TSE MRI did, which is consistent with the results Hori et al. obtained with a more limited patients population [15]. However, it can be assumed that our study was underpowered and thus statistically inconclusive. Yoshizako et al. evaluated the ability of DWMRI to depict active inflammatory disease using ADC measurement in the fistula track [9]. Our results show a poor correlation between ADC values in fistula tracks and lesion activity according to Van Assche’s classification [16]. Our results are consistent with those reported by Yoshizako et al. and carry the same limitation that relates to the inaccurate standard of reference used. Indeed, a definite diagnosis with respect to the status of the disease (i. e., quiescent vs. active) is difficult to make in retrospect. We consider that our results need further prospective investigation to be confirmed.

DWMRI provides a quantitative evaluation of the restriction of diffusion via use of an ADC map, even in retrospect. Conversely it is very difficult to obtain a quantitative evaluation of the degree of gadolinium enhancement on T1-weighted dynamic MRI because this would require subtle and refined analysis with dedicated software and time-consuming acquisitions [7]. This is the reason why studies evaluating inflammatory activity via gadolinium-enhanced T1-weighted MRI mainly use a semi quantitative approach that is simple and does not require any specific software for post-processing [25].

Our study has several limitations. First, the small sample size does not allow us to draw definitive conclusions, but to date no studies have evaluated the value of DWMRI using surgery as a standard of reference. In addition, in our study MRI examinations were performed less than 8 days before surgery in all patients. The second limitation is due to the difficulty to obtain a strong and valid standard of reference for pelvic evaluation of fistula-in-ano and abscesses, which today includes examination under general anaesthesia and MRI without DWMRI [4]. The third limitation relates to the fact that we did not try to search for correlation between ADC values and the perianal disease activity index (PDAI) because of the lack of prospective clinical data acquisition. However, we tried to overcome this limitation by making a correlation with Van Assche MRI classification. Nevertheless, there is no ideal reference standard to determine disease activity in perianal Crohn’s disease and several authors have already underlined the poor accuracy of PDAI and other indexes to evaluate the actual degree of inflammation [7]. The fourth limitation of our study is that we used a 3 point scale to evaluate the conspicuity of fistula. It may be thus argued that the use of a 5 point scale might have resulted in a different result regarding conspicuity of fistulas with DWMRI. Finally, we did not evaluate the added value of gadolinium-enhanced T1-weighted MRI nor compare it to DWMRI. We agree upon the fact that further studies are still needed to evaluate the added value of gadolinium-chelate enhanced T1-weighted MRI, but it can be reasonably assumed that DWMRI may be sufficient in the majority of cases.

In conclusion, we found that DWMRI is very accurate at distinguishing between a perianal abscess and an inflammatory mass while fat-saturated T2-weighted TSE MRI is not able to do so. DWMRI could thus obviate the need for gadolinium-chelate administration for this purpose. This is of interest because of recent concern raised by the use of gadolinium-chelate and also because of a resulting decreased examination time and cost [26]. In addition, we found that conspicuity of fistulas-in-ano is greater with DWMRI than with T2-weighted TSE MRI. As a limitation, however, we did not find any correlation between ADC values and disease activity. Moreover, DWMRI does not help depict more fistula tracks than T2-weighted TSE MRI did, which is already very accurate. According to our results, we suggest that DWMRI should be part of the routine MRI protocols for patients with a suspected perianal abscess especially for protocols without gadolinium-chelate administration.