Introduction

Crohn disease is a chronic relapsing inflammatory bowel disease with increasing incidence in children and adolescents. The use of novel therapies requires repetitive imaging to assess disease activity and complications and to monitor therapeutic response [1]. We can assess inflammation of the bowel wall with ultrasound (US) and colour Doppler imaging with accuracy comparable to computed tomography (CT) enterography and magnetic resonance (MR) enterography [2,3,4]. Ultrasound has many advantages. There is no need for bowel preparation, sedation or anaesthesia, and it is a radiation-free method, all of which are of particular importance for the paediatric population. However, despite the use of colour Doppler imaging, its accuracy in detecting slow-moving blood flow in small vessels and vascularity in deep-lying bowel segments is low [2], and therefore the estimation of disease activity is less precise. Contrast-enhanced US uses the intravenous administration of a microbubble contrast agent to enhance the bowel wall and mesentery. The different patterns of contrast enhancement, the creation of time-intensity curves and the calculation of different quantitative parameters yield subjective and objective demonstrations of inflammation [5, 6].

To the best of our knowledge, there are no dedicated contrast-enhanced US studies for the evaluation of Crohn disease activity in children. Previous reports include five studies with mixed populations of children older than 16 years and adults [7] and one case report in a 13-year-old boy [8]. Children have significant physical, hemodynamic and tissue differences in comparison to adults. The influence of blood pressure and tissue attenuation on quantification of perfusion with contrast-enhanced US is well documented but not yet established for children. Lower blood pressure and higher heart rate directly impact the mean size of the microbubble and the resonant frequency of the contrast agent [9]. Furthermore, children’s distinct body habitus might alter the attenuation properties of the tissues preceding the organ of interest. These patient-related factors affect the imaging results and should not be simply extrapolated to the paediatric population.

The purpose of our study was to evaluate the diagnostic accuracy of US with colour Doppler imaging, subjective contrast-enhanced US and quantitative contrast-enhanced US in the estimation of Crohn disease activity in children by comparing it with histopathology as the reference standard.

Materials and methods

Forty-six children with clinical suspicion of exacerbated inflammatory bowel disease from January 2018 to February 2019 were included in this prospective study. All patients were referred to the abdominal US with colour Doppler imaging and endoscopy with histopathological evaluation as part of clinical care. The indications for endoscopy were: a) suspected clinical diagnosis of inflammatory bowel disease needing histological confirmation (n=32), b) assessment of the activity of Crohn disease during clinical relapse after an inactive phase (n=12) and c) assessment of the activity of Crohn disease during deterioration while on medical treatment (n=2). Additionally, as part of this research patients underwent contrast-enhanced US on the same day as endoscopy (n=28) or within 14 days (n=18) with no change in treatment during the interval. Contrast-enhanced US was performed on bowel segments with ≥4-mm thickened bowel wall on the US with colour Doppler imaging. If multiple diseased segments were identified, we used the two with the most thickened bowel wall. Biopsy specimens were taken from endoscopically normal mucosa or the site most inflamed at each segment along the colon and the terminal ileum. Specimens were sent to the histopathology laboratory and accompanied by crucial clinical information, macroscopic description of the mucosa and a schematic note of the location of the biopsy sample. After receiving biopsy results, patients with a histopathological diagnosis other than Crohn disease were excluded. A flow diagram of our study group is presented in Fig. 1.

Fig. 1
figure 1

A flow diagram of the study group

The research was conducted following the Helsinki Declaration. The National Medical Ethics Committee approved the study. Patients’ parents and one 18-year old patient signed written informed consent after detailed explanation of the procedure.

Ultrasound with colour Doppler imaging

All US studies (US with colour Doppler imaging and the contrast-enhanced US) were performed by one of two experienced radiologists (D.K. and S.P.). The senior radiologist (D.K.) is skilled in paediatric US and contrast-enhanced US with 20 years of experience with intravesical and intravenous US contrast media administration (for bowel and other organ evaluation). The other radiologist has 10 years of experience in abdominal US and 4 years of experience in bowel contrast-enhanced US examination and analysis.

The US with colour Doppler imaging examinations were performed on an Aplio 500 machine (Canon Medical System Europe, Zoetermeer, the Netherlands), initially with a 3- to 6-MHz convex array transducer and then, for a detailed examination, with a 7.5- to 12-MHz linear transducer. Children were examined following overnight fasting. No special bowel preparation was needed. US with colour Doppler imaging of the intestine was performed by the standard technique as previously described by Biko et al. [10]. The following US data were obtained: location of the most affected bowel segments, bowel wall thickness, assessment of wall layer stratification and the presence of inflammatory fat. Additional observations (lymphadenopathy, free fluid) were also documented. Bowel wall vascularity was assessed with colour Doppler imaging. Images were only acquired if definitive pathology was seen.

Contrast-enhanced ultrasound

Contrast-enhanced US was performed with a 7.5- to 12-MHz linear transducer using contrast-specific software with a low mechanical index and a preset dedicated to the linear probe. The mechanical index was set at or below 0.8; the dynamic range was ≥65. The image was optimised concerning the position of the focus, gain, depth and observation area and remained unchanged during the contrast-enhanced US examination. The second-generation US contrast agent (SonoVue; Bracco, Milan, Italy) in a dose of 0.05 mL/kg was injected by bolus through one of the arm veins followed by a 10-mL saline flush. The subsequent enhancement was recorded as a continuous cine loop file for the 100-s interval immediately after injection. In cases with more than one diseased bowel segment, a second bolus was administered after at least 10 min. The raw native data were exported to the US workstation.

Image review

US with colour Doppler images and contrast-enhanced US studies were reviewed retrospectively independently by the radiologists mentioned above. Each of them was blinded to all clinical information and the results of other studies. The US studies with colour Doppler images were reviewed first and the contrast-enhanced US studies 4 weeks later in an attempt to reduce recall bias.

The estimation of Crohn disease activity with the US with colour Doppler imaging of the most diseased segments was based on the scoring system developed by Medellin-Kowalewski [11]. The inflammation was graded as no signs of inflammation/remission or mild, moderate or severe inflammation.

Contrast-enhanced US studies were reviewed on a US workstation using a proprietary software package (CHI Q, Canon [Toshiba], Tokyo, Japan). Contrast-enhanced US perfusion data are quantified by fitting a software-dedicated perfusion model (lognormal perfusion model with least-squares curve-fitting in our study) to a time-intensity curve extracted from a region of interest. Time-intensity curves were calculated using the intensity values of the pixels and the mean and median values of the image intensity [12]. Regions of interest were drawn manually (size approximately 1 cm2) at the most enhanced part of the bowel wall, excluding perivisceral tissue (Fig. 2). Since acquiring information from a single region of interest might introduce error from peristalsis or breathing, we used three non-overlapping regions of interest. Several parameters related to blood flow and volume were automatically computed by software by analysing the time-intensity curve (Fig. 2): peak enhancement, time to peak, mean transit time and area under the curve. Peak enhancement is defined as a maximum value of signal intensity and is reported in arbitrary units (AU) for linear values. Time to peak is expressed in seconds and defined as the time from zero to maximum signal intensity. Mean transit time is expressed in seconds and defined as the mean time taken by the bubbles to pass through the region of interest (mathematically it is the first moment of the fitted curve). The area under the curve is defined as the area under the time-intensity curve and reported in arbitrary units × seconds (AU×s) [13]. The mean value for each parameter from all three regions of interest was generated and used for further statistical analysis.

Fig. 2
figure 2

Contrast-enhanced US of the terminal ileum in a 16-year-old girl with Crohn disease. a Transverse contrast-enhanced US, dual-screen mode with simultaneous display of gray-scale image (right) and contrast-only image (left) shows intense transmural region of interest (blue oval). b Time-intensity curve automatically generated from the region of interest. X-axis represents time (s) and y-axis represents signal intensity (arbitrary units). Dots represent raw data and curve presents fitted lognormal perfusion model. The starting point of the time-intensity curve corresponds to the moment when the contrast media first appears in the region of interest. High peak intensity (19 arbitrary units) is consistent with severe inflammation

The subjective visual analysis included the assessment of different bowel wall perfusion patterns, as described by Migaleddu et al. [6]. Two types of graphic displays for viewing a time-intensity curve were attained: log and linear representations. Subjective grading included a visual analysis of the graphic representations of the log time-intensity curve, as defined: high peak enhancement and delayed decline at a higher enhancement level indicated severe disease; low peak enhancement with a rapid decline, more closely reaching the baseline, indicated mild disease [14].

Histopathological evaluation

An experienced histopathologist (R.D.G., with 25 years of experience) examined biopsy specimens. The degree of inflammation was evaluated according to the European Crohn and Colitis Organisation criteria as remission, mild, moderate and severe. Fibrosis was identified when abnormal depositions of collagen were seen in the edges of mucosal ulceration.

Due to the lack of validated histological scoring systems and data on outcomes, as well as concerns of sampling error, histological remission is not recommended as target treatment in Crohn disease. Instead, mucosal healing is defined as the treatment goal [15]. Given that persistent mild histopathological inflammation can be displayed in up to one-third of biopsies from patients with endoscopically healed mucosa [16], this most likely does not significantly affect clinicians’ decisions to treat or not to treat. Therefore, we decided to unite remission and mild grades of inflammation into an inactive disease group (i.e. indicating mucosal remission) and moderate and severe grades into an active disease group for dichotomisation of histopathological results.

Statistical analysis

The data for quantitative contrast-enhanced US kinetic parameters were, because of non-normality (Shapiro-Wilk W test), summarised as the median and interquartile range. The differences in the parameters between groups were tested with the Mann-Whitney U test. In multivariate logistic regression analysis, different models were formed to explore which of the kinetic parameters are most important for discriminating between the active and inactive groups. Odds ratios (OR) with 95% confidence intervals (CI) were reported. Models were evaluated with the Akaike information criterion and the Hosmer-Lemeshow test. With both tests, we assessed the fit of logistic regression models, whereby the Akaike information criterion also enabled us to compare different models. The smaller the Akaike information criterion, the better the model fits the data. However, the model with the smallest Akaike information criterion does not necessarily fit the data well, so we also used a goodness-of-fit test to assess the fit. In the case of logistic regression, the P-value of the Hosmer-Lemeshow test was checked and, if more than 0.05, provided evidence that predicted probabilities did not deviate from observed probabilities. Binned residual plots for each of the five models were also formed as the traditional residual plots of linear regression are not very helpful with logistic regression (Online Supplementary Material 1). Binned plots are not very informative in research with small sample size, so we relied instead on more global statistics (i.e. the Akaike information criterion and the Hosmer-Lemeshow test). A receiver operating characteristics curve analysis was applied to determine the threshold values of the kinetic parameters. Sensitivity, specificity and diagnostic accuracy were evaluated for the quantitative contrast-enhanced US, subjective contrast-enhanced US and US with colour Doppler imaging with 95% CI. The agreement between the two radiologists regarding their subjective contrast-enhanced US analysis was tested with kappa (κ) statistic. The values of κ were interpreted as follows: 0–0.20 indicated poor agreement; 0.21–0.40, fair agreement; 0.41–0.60, moderate agreement; 0.61–0.80, good agreement; and 0.81–1.00, excellent agreement. A P-value of less than 0.05 was considered statistically significant. All analyses were performed using statistical software (SPSS version 25; IBM, Armonk, NY; and MedCalc version 19.0.6; MedCalc Software Ltd., Ostend, Belgium).

Results

Patient characteristics

Twenty-two patients were excluded from the study: 19 patients had a histopathological diagnosis other than Crohn disease (ulcerous colitis, n=7; indeterminate colitis, n=2; and infectious colitis, n=9) and 3 patients were excluded due to inadequate histopathological results. Twenty-four patients were eligible for contrast-enhanced US final analysis (age range: 3.5–18 years; median age: 14 years; interquartile range: 13–17 years; 12 boys, 12 girls). A total of 40 bowel segments were analysed: sigmoid colon, n=10; descending colon, n=6; transverse colon, n=1; ascending colon, n=3; cecum, n=7; and ileum, n=13. One bowel segment was analysed in 8 patients and 2 bowel segments in 16 patients. Patient and segment characteristics are summarised in Table 1.

Table 1 Characteristics of patients and bowel segments

Only one transient adverse event occurred during contrast-enhanced US examination: a strong metallic taste that vanished after 10 min.

Histopathology

Histopathological examination identified remission in 10 segments (25%), mild inflammation in 12 segments (30%), moderate inflammation in 17 (42.5%) segments and severe inflammation in 1 (2.5%) segment. In agreement with our dichotomisation, 22 of 40 segments (55%) were classified as inactive disease and 18 of 40 segments (45%) as active disease group.

Ultrasound with colour Doppler imaging

The sensitivity, specificity and accuracy of US with colour Doppler imaging analysis were 55.6%, 86.4% and 72.5%, respectively. Seven falsely negative and three falsely positive results occurred (Table 2).

Table 2 Cutoff points and performance values for contrast-enhanced ultrasonography (quantitative and subjective evaluation) and ultrasonography with colour Doppler imaging

Subjective contrast-enhanced US

The sensitivity, specificity and accuracy of subjective contrast-enhanced US were 77.8%, 77.3% and 77.5%, respectively. Four segments were falsely evaluated as negative and five segments as falsely positive (Table 2). Interobserver agreement between the two radiologists for subjective contrast-enhanced US analysis was good (κ=0.74, 95% CI 0.58–0.90).

Quantitative contrast-enhanced US

Univariate analysis of quantitative contrast-enhanced US parameters showed a significant difference between the inactive and active groups concerning peak enhancement, mean transit time and area under the curve (P<0.001, P=0.015 and P=0.001) (Table 3). Since the time-to-peak parameter was not significant (P=0.103), it was excluded from further analyses. Multivariate logistic regression analyses revealed that the most predictive was the model with peak enhancement only, regardless of the models with mean transit time and/or area under the curve included. All fitted models showed no misfit (Hosmer-Lemeshow test, P>0.05), but according to the Akaike information criterion, the model with only peak enhancement performed better than all the other models (the lowest Akaike information criterion) (Table 4). The patients with higher peak enhancement were more likely to have active inflammation than the patients with lower peak enhancement (OR=1.44, 95% CI 1.11–1.87, P=0.007). A receiver operating characteristics curve for the peak enhancement parameter showed sensitivity 72.2%, specificity 100% and overall diagnostic accuracy 87.5% (Fig. 3). According to peak enhancement at a threshold value of 6.9, there were five false negatives and no false positives in our sample (Table 2) (Fig. 4). The proportions of disagreement (as extracted from Table 2) are 12.5% for peak enhancement, 22.5% for mean transit time, 20% for the area under curve, 22.5% for subjective contrast-enhanced US analysis and 24.5% for the US with colour Doppler imaging, respectively. Detailed bowel segment results, as evaluated by histopathology, US with colour Doppler imaging, subjective and quantitative contrast-enhanced US (peak enhancement parameter) are presented in Online Supplementary Material 2.

Table 3 Comparison between quantitative contrast-enhanced ultrasound kinetic parameters and histopathology
Table 4 Results of multivariate logistic regression models testing the association of quantitative contrast-enhanced ultrasound kinetic parameters with the histopathology
Fig. 3
figure 3

A receiver operating characteristics curve of peak enhancement in the prediction of active disease. Area under the curve is 84.3% (95% confidence interval 70.6–98.1%)

Fig. 4
figure 4

A scatterplot of peak enhancement versus four grades of inflammation at histopathology (0–remission, 1–mild, 2– moderate, 3–severe). The threshold value (horizontal line) for peak enhancement is set at 6.9, differentiating mucosal remission (histopathology grades 0 and 1) from active inflammation (histopathology grades 2 and 3). Mucosal remission was falsely estimated in five segments

Discussion

The European Federation of Societies for Ultrasound in Medicine and Biology states that contrast-enhanced US can be used as an alternative imaging modality for the follow-up of children with known inflammatory bowel disease [7]. However, the level of evidence is based on adult studies. Furthermore, a lack of consensus regarding contrast-enhanced US parameters for the diagnosis of active Crohn disease and software packages offering a variety of perfusion measures make translating results between different machines and vendors problematic.

The results of our study revealed that quantitative contrast-enhanced US, particularly the peak enhancement parameter, had the highest performance in detecting the inflammatory activity of Crohn disease in children. The peak enhancement parameter at a threshold value of 6.9 (specific to our quantification program and cannot be used as general cutoff value when other quantification programs are used) showed 72.2% sensitivity, 100% specificity and 87.5% overall diagnostic accuracy. Lower sensitivity in our study is not in line with the results of two previous studies in adults, which reported sensitivity of peak enhancement of 89.4% [17] and 93% [18]. This could be associated with patient characteristics in their studies. Romanini et al. [17] included 33 patients with inflammatory bowel disease, of which only 15 patients had Crohn disease. On the other hand, Ripollés et al. [18] included patients with Crohn disease who were candidates for resection. Of the 40 bowel wall segments in our study, 5 segments (4 of 24 children) were falsely considered to have negative results with peak enhancement below the cutoff value. In one case, this can be attributed to a mismatch of the segment selection on contrast-enhanced US and histopathology. In the remaining four segments, fibrosis might have contributed to falsely lowered peak enhancement values. All of these segments were falsely negative also by subjective contrast-enhanced US and US with colour Doppler imaging and demonstrated a mixed pattern of moderate fibrosis and moderate acute inflammation on histopathology. Similar observations were noted in studies in adults [18, 19]. Furthermore, fibrosis reduces blood volume and blood flow [20], which might have contributed as well.

On the contrary, Wilkens et al. [21] recently demonstrated that relative contrast enhancement could not distinguish between inflammatory activity and fibrosis. This was also noted in an animal model of Crohn disease [22]. Thimm et al. [23] used US elastography on three patients to complement the contrast-enhanced US and detect bowel wall fibrosis through evaluation of tissue stiffness, as previously suggested by Quaia et al. [24].

The time-to-peak parameter did not show any significant difference between active and inactive Crohn disease. This is not in line with the results of previous studies, where time to peak was found to be significantly lower in adults with active Crohn disease [17, 25, 26] and attributed to inflammatory oedema. Other studies [28,27,29], however, reported contradictory results of time to peak and raised incongruous discussions on the relevance of this parameter.

Subjective contrast-enhanced US analysis of the bowel wall enhancement pattern in conjunction with the graphic representation of the log time-intensity curve was superior to US with colour Doppler imaging. Sensitivity, specificity and accuracy for the subjective contrast-enhanced US were 77.8%, 77.3% and 77.5%, and 55.6%, 86.4% and 72.5% for US with colour Doppler imaging analysis, respectively. Migaleddu et al. [6] reported a much better performance of subjective contrast-enhanced US (93.5% sensitivity, 93.7% specificity, 93.6% diagnostic accuracy). This difference can be attributed to the interobserver variability, examinators’ experience and rapid microbubble inflow that does not provide a wide temporal window for an unequivocal evaluation of each enhancement pattern. Furthermore, transmural contrast enhancement pattern has been described in patients with inflammatory as well as fibrotic strictures of the small bowel [19].

Although our sample size was comparable to other studies investigating the same modality in adults, a rather small number of children and bowel segments is a limitation of our study, and was also the reason that only the dichotomous cutoff value for disease activity was provided. The histopathological heterogeneity of the disease activity, presenting in a spectrum from acute inflammation to long-standing fibrosis, is a significant limitation with the impact on our results mentioned above. Incorrect bowel segment classification due to inherent US limitation and a lack of a validated histopathology scoring system are other biases.

Overall, quantitative contrast-enhanced US’s peak enhancement is predictive of active Crohn disease in children with comparable diagnostic accuracy as in adults. Intravenous application of US contrast agent in children is currently off-label in Europe and approved only for evaluation of liver lesions in the United States of America [30]. Intravascular US contrast media use in children is associated with a very low rate of severe adverse reactions (less than 1/10,000 patients; 0.009%) [31]. In our study, one mild adverse event was noted (transient metallic taste). However, any exposure to intravascular US contrast media, with or without previous sensitisation, may result in anaphylaxis [32, 33]. Therefore, precautions should be in place with every contrast-enhanced US examination and examinations should be performed only by adequately trained medical personnel [33]. Albeit a child-friendly method, it may cause distress for the child as cannulation is needed. It does not, however, exceed the discomfort of respective alternative techniques (endoscopy and MR enterography). Besides, US contrast media can be safely administered to patients with renal insufficiency with no risk of contrast-related nephropathy or nephrogenic systemic fibrosis [34]. Hazards associated with sedation or anaesthesia needed in endoscopy and MR enterography are also avoided.

Mucosal healing, rather than symptom control or histological remission, is increasingly recommended as the optimal treatment target of Crohn disease. To evaluate disease activity, the Pediatric Crohn Disease Activity Index and faecal calprotectin are often used. However, the former can be subjective and cumbersome to calculate, and the latter can be present in intestinal infections and as such is not a specific indicator of inflammation. Evaluation of bowel wall thickness with the US with colour Doppler imaging cannot reliably distinguish between active inflammation and quiescent disease and fails to demonstrate blood flow at the capillary level. On the contrary, contrast-enhanced US displays hyperemia and neovascularisation also at perfusion level and, as demonstrated in our study, peak enhancement cut-off value provides objective evaluation of bowel wall enhancement. In our everyday clinical experience, quantitative contrast-enhanced US has shown to be a convenient, complementary method to clinical and laboratory evaluation of Crohn disease activity in children. It is useful particularly in patient follow-up where peak enhancement dynamics from baseline to following controls can be monitored and objective response to treatment noticed. As baseline contrast-enhanced US examination is usually accompanied by histopathological bowel wall evaluation, information regarding fibrosis and its possible impact on peak enhancement value is somewhat provided. In a considerable number of our patients, contrast-enhanced US also reduced the need for more invasive investigations, particularly in children with small bowel disease who refused MR enterography. In some patients, it was also possible to postpone endoscopic evaluation of disease activity.

Conclusion

Quantitative contrast-enhanced US has the potential to become a complementary method in the evaluation of Crohn disease activity in children. In particular, peak enhancement is predictive of active inflammation. Fibrosis may affect peak enhancement results and underestimate inflammatory activity. Hereof, the introduction of US elastography for assessment of fibrosis could be useful. Subjective contrast-enhanced US analysis shows a slightly better diagnostic accuracy than the US with colour Doppler imaging.