Introduction

Chronic hepatitis B (CHB) virus infection could cause damage to the hepatic parenchyma, leading to hepatic fibrosis (HF) [1,2,3]. Moreover, the progression of untreated HF may eventually cause cirrhosis, and subsequently hepatocellular carcinoma (HCC). Recent clinical studies have revealed that the use of anti-fibrotic drugs in patients with CHB in the early stages of HF may result in reversal of HF [4]. Therefore, early detection and stratification of HF is critical. Currently, invasive liver biopsy is the gold standard for evaluating HF [5, 6], but this technique has some potential limitations including sampling errors and inter-observer variations [5, 7]. Hence, reliable and noninvasive methods are essential for early detecting and staging of HF.

Diffusion-weighted imaging (DWI) is a noninvasive technique based on the Brownian motion of water molecules in biological tissue and has shown potential in the assessment of HF [8, 9]. The ADCst parameter obtained from monoexponential DWI model has been used for the detection and semi-quantification of HF and has shown promise in HF evaluations [6, 8, 10]. However, ADCst values may not accurately represent water-molecule diffusion because they are influenced by the microcirculation of blood in the capillaries.

Some previous studies have proposed that biexponential or stretched-exponential DWI models may provide more accurate information with respect to water diffusion [11,12,13,14,15,16,17,18,19,20]. The biexponential intravoxel incoherent motion (IVIM) model, which was introduced by Le Bihan et al. [11], could generate three parameters including Dp (representing capillary perfusion), Dt (representing true water molecular diffusion), and f (reflecting the fractional volume of blood flowing in the capillaries) [11], and hence could allow separation of water-molecule diffusion from microcirculation in vivo. The stretched-exponential model proposed by Bennett et al. [12] could generate two parameters including DDC (representing the mean intravoxel diffusion rate), and α (representing the intravoxel water diffusion heterogeneity), and hence could truly reflect the physiological characteristics of tissue in vivo. All the diffusion parameters can be derived from the post-reconstruction of multi-b value DWI. But the efficiency of these parameters in different reconstruction models need further exploration.

Since various diffusion parameters obtained from different DWI models may display different aspects of biological tissue, a thorough investigation and comparison of their roles in evaluating HF and inflammation may be valuable. Although some earlier studies [6, 8,9,10,11, 16, 18,19,20] have explored the value of monoexponential and biexponential DWI models in evaluation of HF from various etiologies, the degree and pattern of HF may be variable with different etiologies of chronic hepatic disease. To our knowledge, however, no study has compared various diffusion parameters obtained from monoexponential, biexponential, and stretched-exponential DWI models in the assessment of HF and inflammatory activity in CHB. Therefore, this study aimed to explore and compare the effectiveness of the diffusion parameters obtained from monoexponential, biexponential, and stretched-exponential models in evaluation of HF and inflammatory activity in patients with CHB.

Materials and methods

Study population

This prospective study was approved by the institutional review board, and informed consent was obtained from all participants. A total of 102 patients with chronic HBV infection were recruited consecutively and underwent liver magnetic resonance (MR) examinations (including routine sequences and DWI with multiple b-values) between June 2014 and December 2016. The inclusion criteria were as follows: (a) MR imaging was performed prior to liver biopsy, and the interval between MR imaging and liver biopsy was less than one month; (b) pathological results were obtained; (c) the patients had no surgical history involving the right lobe of the liver. The exclusion criteria were as follows: (a) MR data were not available due to respiratory artefacts; (b) patients had other focal lesions in the liver. Based on the exclusion and inclusion criteria, 20 patients were excluded from the study for the following reasons: four did not undergo liver biopsy, six had poor images with artefacts, four had other lesions, and six had fatty liver disease. Consequently, a total of 82 patients (55 males and 27 females; mean age: 36.7 years, age range: 22–61 years) were included in this study (Fig. 1). Concomitantly, 30 healthy subjects (8 males and 22 females, mean age: 31.3 years, age range: 22–69 years) with no history of liver disease, alcohol abuse, liver dysfunction, and liver biopsy were enrolled as the control group (Fig. 1). All the healthy subjects had undergone liver MR examinations.

Fig. 1
figure 1

Flowchart of the patient evaluation process

Image data acquisition

All patients underwent liver MR on a 3T MR imaging unit (Discovery MR750; GE Medical System, Milwaukee, WI, USA) with an eight-channel phased-array coil (GE Medical Systems). All patients fasted for at least 8 h before the MR examinations. They underwent a routine liver MRI sequence, which consisted of an axial T1-weighted fast spin-echo sequence (repetition time [TR]/echo time [TE], 180 ms/2.1 ms), and an axial T2-weighted fast spin-echo sequence with fat suppression (TR/TE, 4800 ms/76 ms).

DWI with multiple b-values was performed using a respiratory-triggered single-shot spin-echo planar sequence with the parallel imaging technique and a monopolar gradient in the axial plane. DWI with multiple b-values used the following parameters: TR/TE, 9230 ms/minimum; slice thickness, 5 mm; gap, 1 mm; field of view, 360 mm × 380 mm; and matrix, 128 × 128. Thirteen b-values from 0 to 2000s/mm2 (0, 50, 100, 150, 200, 300, 500, 800, 1000, 1300, 1500, 1700, and 2000 s/mm2) were used for performing DWI in three diffusion directions.

Data analysis

Images were obtained and transferred to a workstation (Advantage Workstation 4.6; GE Medical Systems) for processing. They were independently processed and analysed by two experienced radiologists who were blinded to the histopathologic results.

The ADCst value was calculated from all 13 b-values with a monoexponential model as follows [13]:

$$ {{S\left( b \right)} \mathord{\left/ {\vphantom {{S\left( b \right)} {S\left( 0 \right)}}} \right. \kern-\nulldelimiterspace} {S\left( 0 \right)}} = {\exp}\left( { - \,b \times {\text{ADC}}} \right) $$

where S(b) represents the signal intensity in the presence of diffusion sensitisation, S(0) represents the signal intensity in the absence of diffusion sensitisation, b represents the diffusion sensitising factor, and ADC represents an apparent diffusion coefficient.

The true diffusion coefficient (Dt), pseudo-diffusion coefficient (Dp), and perfusion fraction (f) were calculated with the biexponential model as follows [13]:

$$ {{S\left( b \right)} \mathord{\left/ {\vphantom {{S\left( b \right)} {S\left( 0 \right)}}} \right. \kern-\nulldelimiterspace} {S\left( 0 \right)}} = \left[ {\left( {1 - f} \right) \times \exp \left( { - b \times D_{t} } \right)} \right] + \left[ {f \times \exp \left( { - b \times D_{p} } \right)} \right], $$

The water-molecule diffusion heterogeneity index (a) and the distributed diffusion coefficient (DDC) were obtained using a stretched-exponential model that employed the following equation [12]:

$$ {{S\left( b \right)} \mathord{\left/ {\vphantom {{S\left( b \right)} {S\left( 0 \right)}}} \right. \kern-\nulldelimiterspace} {S\left( 0 \right)}} = \exp \left( { - \,b \times DDC} \right)^{\alpha } , $$

where α varies between 0 and 1, which represents the intravoxel water-molecule diffusion heterogeneity. A numerically high value characterises low intravoxel diffusion heterogeneity, which approaches the monoexponential decay. The index DDC represents the mean intravoxel diffusion rate.

For every patient, the two radiologists independently placed three regions of interest (ROIs) in the right lobe of the liver on the ADCst maps to acquire measurements and calculated the mean values. The areas of the ROIs varied from 150 to 200 mm2, and the ROIs were selected avoiding large vessels and bile ducts to ensure more accurate measurements [21]. The selected ROIs were copied to the maps of the other parameters (Dt, Dp, f, DDC, and α) from the same patient.

Histopathological analysis

Liver biopsy specimens from the right lobe of the liver were analysed independently by two experienced pathologists. The METAVIR scoring system was used to semi-quantitatively evaluate fibrosis and inflammation [22]. The degree of fibrosis was staged as follows: F0 = no fibrosis, F1 = portal fibrosis without septa formation, F2 = portal fibrosis with few septa, F3 = numerous septa without cirrhosis, and F4 = cirrhosis. The inflammation activity was graded as follows: A0 = no activity, A1 = mild activity, A2 = moderate activity, and A3 = severe activity. Any cases in which the final fibrosis stage or activity grade differed between the two pathologists were reevaluated and scored in consensus.

Statistical analysis

All analyses were performed using IBM SPSS 23.0(SPSS, Chicago, IL) and MedCalc 12.0 (Mariakerke, Belgium). The mean results for each parameter (ADCst, Dt, Dp, f, DDC, and α) were utilised for quantitative statistical analyses. The Kruskal–Wallis H test was employed for comparisons of each parameter among the control and the fibrosis stage groups, or the control and inflammatory activity grade groups. The Mann–Whitney U test was adopted to compare each parameter between the fibrosis stage ≤ F1 and ≥ F2, between stage ≤ F2 and ≥ F3, between stage ≤ F3 and F4. Additionally, the Mann–Whitney U test was adopted to compare each parameter between the inflammatory activity grade ≤ A1 and ≥ A2, between grade ≤ A2 and A3. Spearman rank correlation was adopted to evaluate the correlation of each parameter with fibrosis stages and inflammatory activity grades. ROC curves were performed for all parameters to assess the AUC and to establish which parameter was optimal for predicting fibrosis stages and inflammatory activity grades. The inter-observer agreement for the two independent quantitative analyses was evaluated by calculating the intraclass correlation coefficient. Results with P values < 0.05 were considered significantly different.

Results

Histological quantification of fibrosis stage and inflammatory activity grade was performed in 82 patients with CHB by liver biopsy. The fibrosis stage distribution is as follows (Fig. 1): F0, n = 1; F1, n = 16; F2, n = 31; F3, n = 19; and F4, n = 15. The inflammatory activity grade distribution is as follows: A0, n = 1; A1, n = 14; A2, n = 46; and A3, n = 21.

Figure 2 shows the DWI and ADCst, Dt, Dp, f, DDC, and α maps for a patient with fibrosis stage 2 and inflammatory activity grade 2. All parameters except Dp and α were significantly different among the control group and groups F1, F2, F3, and F4 (all P < 0.001) and showed a tendency to decrease gradually as the HF stage progressed (Fig. 3). Additionally, all parameters except Dp and α were significantly different among the control group and groups A1, A2, and A3 (all P < 0.001) and showed a tendency to decrease gradually as the inflammatory activity grade progressed (Fig. 4).

Fig. 2
figure 2

A 26-year-old female patient with CHB with fibrosis stage 2 and inflammatory activity grade 2. Diffusion-weighted image with b = 50 s/mm2 (a). The ADCst map showed that the ADCst value was 0.92 × 10−3 mm2/s (b). The Dt map showed that the Dt value was 0.67 × 10−3 mm2/s (c). The Dp map showed that the Dp value was 24 × 10−3 mm2/s (d). The f map showed that the f value was 25.9% (e). DDC map showed that the DDC value was 1.02 × 10−3 mm2/s (f). The α map showed that the α value was 0.70 (g)

Fig. 3
figure 3

Box plots of ADCst values (a), Dt values (b), Dp values (c), f values (d), DDC values (e), and α (f) values for the control group and groups F1, F2, F3, and F4. ADCst, Dt, f, and DDC values were significantly different among the above groups (all P < 0.001). However, Dp and α values did not show significant differences (all P > 0.05)

Fig. 4
figure 4

Box plots of ADCst values (a), Dt values (b), Dp values (c), f values (d), DDC values (e), and α values (f) in the control group and groups A1, A2, and A3. ADCst, Dt, f, and DDC were significantly different among the above groups (all P < 0.001). However, Dp and α did not show significant differences (all P > 0.05)

ADCst, Dt and DDC values showed moderately negative correlation with the fibrosis stage (r = − 0.418, − 0.717 and − 0.630, all P < 0.001). The ADCst, Dt, f, and DDC values were significantly lower in fibrosis stage ≥ F2 than stage ≤ F1 (all P < 0.05), significantly lower in fibrosis stage ≥ F3 than stage ≤ F2, and significantly lower in fibrosis stage F4 than stage ≤ F3 (all P < 0.05) (Table 1). However, Dp and α values showed no significant differences in these comparisons (P < 0.05).

Table 1 Comparisons of the diffusion parameters between fibrosis stages

For the evaluation of fibrosis stages (≥ F2/ ≥ F3/F4), Dt and DDC showed the higher diagnostic value than ADCst (all P < 0.05), with an exception that both Dt and DDC showed a similar diagnostic performance to ADCst in detecting stage F4 (Fig. 5, Table 2). Moreover, Dt and DDC showed a comparable diagnostic performance in detecting fibrosis stage ≥ F2/ ≥ F3/F4.

Fig. 5
figure 5

ROC curves for ADCst, Dt, f and DDC in distinguishing ≥ F2 from ≤ F1 (a). ROC curves for ADCst, Dt, DDC and f in distinguishing ≥ F3 from ≤ F2 (b). ROC curves for ADCst, Dt, f and DDC in distinguishing F4 from ≤ F3 (c)

Table 2 Performance of diffusion parameters in predicting fibrosis stage

Moreover, ADCst, Dt and DDC values showed moderately negative correlation with the inflammatory activity grade (r = − 0.514, − 0.626 and − 0.550, all P < 0.001). The ADCst, Dt, f, and DDC values were significantly lower in inflammatory activity grade ≥ A2 than in grade ≤ A1 (all P < 0.05), and significantly lower in inflammatory activity grade A3 than in grade ≤ A2 (all P < 0.05). Dp and α values showed no significant differences in the above comparisons (all P > 0.05) (Table 3).

Table 3 Comparisons of the diffusion parameters between inflammatory activity grades

For the evaluation of inflammatory activity grades (≥ A2/A3), although Dt and DDC showed higher AUCs than ADCst, there were no significant differences between the diagnostic performance of Dt and ADCst or between the diagnostic performance of DDC and ADCst (all P > 0.05) (Fig. 6, Table 4). Moreover, Dt and DDC showed a comparable diagnostic performance in detecting inflammatory activity grade ≥ A2/A3.

Fig. 6
figure 6

ROC curves for ADCst, Dt, f, and DDC in distinguishing ≥ A2 from ≤ A1 (a). ROC curves for ADCst, Dt, f, and DDC in distinguishing A3 from ≤ A2 (b)

Table 4 Performance of diffusion parameters in predicting inflammatory activity grade

The overall mean interclass correlation coefficient between the two independent radiologists was 0.871 (P < 0.001).

Discussion

In the study, we observed that the Dt, DDC and ADCst values were significantly lower in ≥ F2 than in ≤ F1, lower in ≥ F3 than in ≤ F2, lower in ≥ A2 than in ≤ A1 and lower in A3 than in ≤ A2. In addition, Dt and DDC had higher diagnostic performances than ADCst in detecting fibrosis stage ≥ F2, stage ≥ F3. Nevertheless, Dt, DDC and ADCst had similar diagnostic performance for discriminating inflammatory activity grade ≥ A2 and grade A3. Hence, Dt and DDC are optimal diffusion parameters for evaluation of HF in CHB in comparison with the other diffusion parameters.

Thus, these results indicate that the ADCst, Dt and DDC values in the fibrosis stage groups (F1/F2/F3/F4) were significantly lower than the corresponding values in the control group. The parameter ADCst obtained from monoexponential DWI model is usually used to reflect water diffusion, however, it was unable to separate the water diffusion from the microcirculation perfusion [6, 8,9,10]. The diffusion-related Dt obtained from the biexponential DWI model reflects the true water diffusion with a slower flow and is measured with b-values higher than 200 s/mm2 [11, 13, 15,16,17,18,19,20]. DDC obtained from the stretched-exponential DWI model for represents the mean intravoxel diffusion rate [12]. The limitation of water molecules diffusion can lead to the reduced ADCst, Dt and DDC values. The limitation of water molecules diffusion in the fibrotic liver could be attributed to the following aspects of HF pathogenesis: HF is associated with excessive synthesis and sedimentation of the extracellular matrix, specifically in collagen fibres, in which the protons are less abundant and tightly bound [23]. The existence of collagen fibres in the distorted lobular tissue would therefore limit water-molecule diffusion in the fibrotic liver, resulting in decreased ADCst, Dt and DDC values. Several prior studies [6, 24, 25] have reported that the ADCst and Dt values obtained using with multiple b-values in HF and cirrhosis were lower than those in the normal liver. Our study results accord with these prior study results. Regarding the diagnosis of HF with DDC values, Anderson et al. [26] has reported by using an ex vivo murine that DDC values in the HF group were significantly lower than those in the control group. Our study result is consistent with the previous study. In this study, the α values showed no significant differences among the HF groups and the control group, which is also in good agreement with the findings of the prior study by Anderson et al. [26]. Thus, the stretched-exponential model showed no clear evidence of an increase in intravoxel heterogeneity of HF in comparison with the normal liver.

It is well known that HF is associated with decreased liver perfusion. The increased arterial flow activated by intrahepatic portal hypertension in HF is inadequate to compensate for the decreased portal flow. Both Dp and f from the biexponential DWI model were perfusion-related parameter [11, 13, 27]. Dp is used for evaluating microcapillary perfusion with a fast flow which is measured with b-values lower than 200 s/mm2, and f is used for reflecting the fraction of flowing blood in the capillaries [11, 13, 15,16,17,18,19,20, 27]. Several prior studies [10, 24, 25, 28] have reported that Dp values were significantly lower in the fibrotic or cirrhotic liver group than in the control group. Interestingly, in our study, the Dp values showed no significant difference among the control and fibrosis groups. We believe that the inconsistencies between the results of our study and the prior studies were caused by the following factors: first, in our study, very low b-values (0 < b < 50 s/mm2) were not included in the b-value distribution, which may have resulted in underestimation of Dp at the lower b-values of 0, 50, 100, and 150 s/mm2. Second, the instability and the large SD of Dp could have influenced the findings [29,30,31]. Third, the HF samples in each stage were different, and the patient populations varied. The f values from the biexponential model reflect the fast diffusion fraction caused by microcirculatory blood perfusion and account for the ratio of the total diffusion components (including fast and slow diffusion). Our study revealed that the f value in the control group were higher than the f values in the HF groups. This finding was consistent with the results from some prior studies [10, 23].

Previous studies [3, 10] have reported that patients with fibrosis stage ≤ F1 have a low risk of liver failure, while stage ≥ F2 is a predictor of future hepatic cirrhosis and is an indication for therapy. In addition, patients with stage ≥ F3 require screening for portal hypertension and HCC. In our study, the Dt outperformed the ADCst in diagnosing fibrosis stage ≥ F2, ≥ F3. This could be attributed to the fact that Dt can basically eliminate the influence of microcirculation perfusion and can more accurately reflect the diffusion limitation of water molecules. However, the ADCst value was affected by the microcirculation perfusion when reflecting the diffusion of water molecules, thereby showing slightly inferior efficacy and accuracy for diagnosis of HF. Besides, our study also showed that DDC outperformed ADCst with good diagnostic performance in detecting fibrosis stages ≥ F2 and ≥ F3. This could be attributed to the fact that DDC is a weighted sum over a continuous allocation of ADCst values and reflect the multi-exponential decay properties [12, 32]. Therefore, based on the study results, we believe that Dt and DDC could be more beneficial than ADCst for diagnosing significant HF(≥ F2) and advanced fibrosis(≥ F3) and the superior performance of Dt and DDC compared with that of ADCst can have clinically important value for managing patients with HF. Thus, we assume that Dt, DDC could be used to determine the indication of anti-fibrotic treatment and as a marker for monitoring progression, and evaluating treatment efficacy.

HF is known to be accompanied by varying degrees of inflammation. Since inflammatory activity is closely related to the progression and prognosis of HF, assessment of the extent of inflammation is also very important [33]. In the process of chronic hepatitis, oedema, degeneration, and necrosis of liver cells and infiltration of inflammatory cells in the portal area and lobules may decrease the extracellular/liquid volume ratio in the cell, cause liver tissue ischaemia, and reduce liver tissue blood flow. Therefore, the presence of inflammation in chronic hepatitis may cause limited water-molecule diffusion and decreased blood perfusion in the liver. Moreover, an increase in inflammatory activity can further limit water-molecule diffusion and reduce hepatic tissue perfusion. The results of our study showed that ADCst, Dt, DDC, and f values in the inflammatory activity grade groups (A1, A2, and A3) were significantly lower than the corresponding parameters in the control group. Additionally, the mean ADCst, Dt, DDC, and f values in the groups decreased gradually. These current findings were consistent with the results from prior studies [6, 26]. Thus, the ADCst, Dt, DDC, and f values of the liver may reflect the extent of inflammatory activity.

It is widely accepted that patients with inflammatory activity ≥ A2 are at a higher risk of developing liver cirrhosis and need to receive antiviral treatment [34]. Thus, we believe that accurate diagnosis of inflammatory activity ≥ A2 may have significant clinical implications. In the study, Dt, DDC and ADCst all showed moderate diagnostic performance for detecting inflammatory activity ≥ A2 and A3 (AUC:0.7–0.9) and had comparable diagnostic performance in detecting grade ≥ A2/A3. Thus, we believe that Dt, DDC and ADCst could be used to determine the indication of antiviral treatment and as a marker for therapy surveillance.

This study had some limitations. First, the number of patients was relatively small, and the distributions of fibrosis stages and inflammatory activity grades were uneven. Second, the influence of iron or fat deposition in HF on the diffusion parameters was not assessed. Third, Dp values might have been underestimated at the lower b-values of 0, 50, 100, and 150 s/mm2 since very low b-values (0 < b < 50 s/mm2) were not selected. Finally, the liver diffusion parameters in patients with CHB were determined by both fibrosis stage and inflammatory activity grade, but it is unclear which aspect has a greater role, and further stratified research is required to address this issue.

In conclusion, the Dt derived from the biexponential model and DDC from the stretched-exponential model are more valuable than other parameters in predicting significant fibrosis, advanced fibrosis in patients with CHB. Therefore, we believe that Dt and DDC could be used clinically to diagnose and stage HF, and as a marker for guiding therapy, monitoring progression, and evaluating treatment efficacy in a noninvasive manner.