Introduction

Hepatic fibrosis (HF) is a wound-healing response to various categories of chronic liver diseases [1]. The early stage of HF is potentially reversible after treatment [2,3,4]. However, the progression of untreated HF may eventually lead to hepatic cirrhosis, portal hypertension, and even hepatocellular carcinoma (HCC) which are accountable for a high rate of morbidity and mortality as well as increased resource costs [1, 4, 5]. To date, invasive liver biopsy is the reference standard for diagnosis and staging HF. However, this technique has some limitations such as invasiveness, risks of complications, observation variations, sampling error, and poor patient tolerance [6], making it poorly suitable for dynamically monitoring patients with chronic liver diseases. Therefore, reliable and non-invasive tools are essential for early detecting and staging of HF.

Recently, there are several non-invasive and potential imaging techniques for the assessment of HF, including magnetic resonance elastography (MRE), intravoxel incoherent motion diffusion-weighted MRI (IVIM DW-MRI), and ultrasound-based transient elastography (TE). MRE has been increasingly adopted for HF evaluation in patients with chronic liver diseases [1, 3, 7,8,9,10,11,12,13,14,15,16]. As shown in some studies, the stiffness and elasticity of the liver tissue may alter along with the degree of HF, impacts of chronic hepatitis before HF, and increased portal venous pressure [12,13,14,15,16]. Moreover, liver stiffness measurement with MRE (LSM-MRE) has been used for evaluating HF in clinical practice and has received significant attention.

Intravoxel incoherent motion (IVIM), a biexponential model of DWI which was introduced by Le Bihan et al. [17], would allow separate assessment of microcirculation in the capillaries and water-molecule diffusion. Currently, IVIM with multiple b-values has become a topic of interest in the assessment HF. Several prior studies have reported that IVIM parameters could show potential ability in diagnosing HF and may provide more accurate information concerning true water molecules diffusion and incoherent motion of water molecules in the capillary network [4, 18,19,20,21,22,23].

Transient elastography (TE) is also a non-invasive method by measuring liver stiffness to evaluate HF [24,25,26]. As we all know, this technique has the following advantages: simplicity, low cost, short performance time, high patient reception, and so forth. So far, the results of prior studies suggest that liver stiffness measurement with TE (LSM-TE) has a promising application prospect for evaluation of HF in clinical practice [24,25,26,27,28,29,30,31].

To date, although some studies have reported MRE, TE, and IVIM in the assessment of HF [1, 16, 32,33,34], there is a lack of comprehensive comparison among them. Given this background, the goal of this study was to quantitatively compare the ability among LSM-MRE, LSM-TE, and IVIM parameters in evaluating HF in the same individuals.

Materials and methods

Subjects

This prospective study was approved by our institutional review board. Informed consent in written was obtained from every subject prior to participation. Between May 2014 and January 2017, a total of 108 patients with chronic liver diseases were recruited consecutively for the study. All patients had undergone liver MRI (including MRE, IVIM-DWI) and TE. The inclusion criteria for patients with chronic liver diseases were as follows: no previous antiviral therapy; without a large amount of ascites; all subjects were confirmed by liver biopsy or liver resection; liver MRI and TE were performed within one month before liver biopsy or liver resection; acceptable image quality. Of these 108 patients, 6 were excluded from the study due to lack of histopathological results (n = 2) and poor images with artifacts (n = 4). In addition, 2 were excluded from the study on account of high iron content in liver. Finally, 100 patients (64 males and 36 females; mean age: 37.6 ± 9.3 years, age range: 22–58 years) were included in our study (Fig. 1, Table 1). Among the 100 patients, 81 were with hepatitis B, 8 with hepatitis C, 3 with non-alcoholic fatty liver disease, 2 with drug-induced hepatitis, and 6 with autoimmune hepatitis (Table 1).

Fig. 1
figure 1

Flowchart of the study enrolment population. The 25 healthy volunteers without ultrasound-guided liver biopsy and with no history or of evidence alcohol abuse, chronic liver disease, focal liver lesions, or liver dysfunction were assumed to have fibrosis stage F0 and inflammatory activity grade A0 in this study. *F0/A0 group includes 1 patient with pathologically proved F0/A0 and 25 volunteers who were regarded to have healthy livers

Table 1 Characteristics of the study population

In the meantime, 25 healthy volunteers (7 males and 18 females, mean age: 31.5 ± 12.9 years, age range: 21–69 years) were recruited consecutively for the study (Fig. 1, Table 1). All the healthy subjects with no ultrasound-guided liver biopsy had undergone MRE, IVIM-DWI, and TE. The inclusion criteria for the healthy volunteers were no history of alcohol abuse; without chronic liver disease or focal liver lesions; without abnormal liver function test results; and acceptable image quality.

MR elastography

All the participants underwent liver MR examination on a 3-T MR imaging unit (Discovery MR 750; GE Medical Systems, Milwaukee, WI) using an 8-channel phased-array coil and fasted for at least 6 h before the examination. An active pneumatic driver was located outside the imaging room and produced 60-Hz mechanical vibrations. The active driver was connected with a passive driver through a plastic tube to deliver the vibrations to the right upper abdomen. As the patient stayed in the supine position, the passive driver was attached to the right chest wall via a rubber belt and the center of the driver was to be located at the xiphisternum level. Image acquisition of MRE was performed with a two-dimensional spin-echo echo-planar imaging (SE-EPI) sequence and the imaging parameters were as follows: TR/TE, 600/58 ms; flip angle, 30°; matrix, 256 × 128; bandwidth, 250 Hz/pixel; imaging frequency, 64 MHz; field of view, 42 × 42 cm; parallel imaging acceleration factor, 2; section thickness/ intersection gap, 10/0 mm; slice number, 3; number of excitations, 2; and acquisition time, 19 s. Three slices were acquired at the widest part of the liver. The liver stiffness maps were generated by processing the acquired propagating shear wave image through an inversion algorithm [1, 12]. For each participant, two radiologists independently measured the liver stiffness (LSM-MRE) values by placing ROIs in the right lobe of the liver on the stiffness maps. As a rule, each ROI was at least 150 mm2 and was positioned away from blood vessels, marginal liver tissues, poor shear wave amplitude, and multi-path wave interference. LSM-MRE value was expressed in kilopascals (kPa).

Transient elastography

LSM-TE was obtained by using ultrasound-based transient elastography (FibroScan; Echosens, Paris, France) in the light of the manufacturer’s recommendations, with the patient in the supine position. The tip of the transducer probe was daubed with coupling gel and located on the skin over the ribs above the right lobe of the liver. Ten successful acquisitions were accomplished on each patient by a trained operator. The median LSM-TE value was then calculated by the device and regarded as representative of the elastic modulus of liver for that patient [25]. The LSM-TE values were expressed in kilopascals (kPa). As per the manufacturer’s recommendations, a success rate of at least 60% and TE procedures with 10 valid measurements were thought to be reliable and could be retained [30].

IVIM-DWI

IVIM-DWI was performed with respiratory-triggered fat-saturated spin-echo echo-planar sequence in the transverse plane. The imaging parameters of IVIM-DWI with multiple b-values were as follows: TR/TE, 9230 ms/minimum; field of view, 36 × 38 cm; matrix, 128 × 128; slice thickness, 5 mm; intersection gap, 1 mm; parallel imaging reduction factor, 2; and acquisition time, approximately 10 min (variable depending on a patient’s respiratory cycle). Thirteen different b-values were used: 0, 50, 100, 150, 200, 300, 500, 800, 1000, 1300, 1500, 1700, and 2000 s/mm2. Motion proving gradient pulses were simultaneously used in three directions.

Three diffusion parameters are obtained from IVIM-DWI including the pseudo-diffusion coefficient which would reflect capillary perfusion (Dp; × 10−3mm2/s), the true diffusion coefficient which would reflect true molecular diffusion (Dt; × 10−3mm2/s), and the perfusion fraction (f; %) which characterizes the segmental volume occupied in the voxel via flowing spins. These parameter values were calculated with the biexponential IVIM model equation introduced by Le Bihan et al [18]:

$$S\left( b \right)/ \, S \, \left( 0 \right) \, = \, [(1-f)\,\times\,\exp(-b\,\times\,D_{\text{t}})] +[f\,\times\,\exp(-b\,\times\,D_{\text{p}})],$$

where S(b) characterizes the signal intensity in the presence of diffusion sensitization, S(0) characterizes the signal intensity in the absence of diffusion sensitization, and b characterizes the diffusion sensitizing factor. The ADC value which represents an apparent diffusion coefficient was calculated from all the 13 b-values with a monoexponential model as follows [18]:

$$S\left( b \right)/ \, S \, \left( 0 \right) \, = \, \exp \, \left( { - b \, \times \, ADC} \right).$$

To obtain the diffusion parameters, IVIM images were transferred to a workstation (Advantage Workstation 4.6, GE Medical Systems) for processing. They were independently processed and analyzed by two radiologists with 9 and 11 years of experience in abdominal MR imaging who were blinded to the histopathologic results. The two radiologists independently placed three regions of interest (ROIs) in the right lobe of the liver on the images with b = 0 s/mm2. The ROI was automatically copied by software to each image (b = 50–2000 s/mm2) and the mean signal intensities within the ROIs were calculated for each image. Dt, Dp, f, and ADC values were calculated using the average signal intensities in each ROI. The areas of the ROIs varied from 150 to 200 mm2, and each ROI was avoiding bile ducts and large vessels to ensure accurate measurements.

Histopathological analysis

In the work, the liver biopsy was obtained by ultrasound-guided 18-gauge core liver biopsy from the right lobe of the liver in 94 patients. In addition, the liver specimens were obtained during liver resection for hepatocellular carcinoma (HCC) in 6 patients and the distance of specimens with tumor was 1–2 cm. Histopathology of the biopsy specimens and hepatic parenchyma in surgically resected livers were analyzed independently by two hepatopathologists with 8 and 17 years of experience. The hepatopathologists were blinded to all the patients’ imaging data and provided scores in consensus by using the METAVIR system [35]. The fibrosis was staged as follows: F0 = no fibrosis, F1 = portal fibrosis without septa, F2 = portal fibrosis and few septa, F3 = numerous septa without cirrhosis, and F4 = cirrhosis. The inflammatory activity was graded as follows: A0 = no activity, A1 = mild activity, A2 = moderate activity, and A3 = severe activity.

Statistical analysis

The mean and standard deviation (SD) of each parameter (LSM-MRE, LSM-TE, Dt, Dp, f, and ADC) were calculated for each fibrosis group. The Kruskal–Wallis test was employed for comparison of each parameter among the fibrosis groups. Spearman rank correlation was used to assess the correlation of each parameter with fibrosis stages. ROC curves were obtained for all parameters to evaluate the AUC and to determine which parameter was optimal for detecting fibrosis stages. All analyses were performed using statistical software (version 17.0; SPSS, Chicago, IL and version 12; MedCalc Software, Mariakerke, Belgium). The results with P values less than 0.05 were considered significantly different. The inter-observer agreement for the two independent measurements was assessed by calculating the intraclass correlation coefficient.

Results

Fibrosis staging and inflammatory activity grading

Histological quantification of HF was performed in 94 patients by liver biopsy and 6 patients by surgical hepatic resection. The 25 volunteers with healthy livers were regarded to have fibrosis stage 0 and inflammatory activity grade 0. Finally, the fibrosis stage distribution is as follows: F0, n = 26; F1, n = 17; F2, n = 33; F3, n = 25; and F4, n = 24. The inflammatory activity grade distribution is as follows: A0, n = 26; A1, n = 20; A2, n = 56; and A3, n = 23 (Fig. 1, Table 1).

Figure 2 shows MRE, DWI (b = 0 s/mm2) images and ADC, Dt, Dp, as well as f maps for a patient with fibrosis stage 3 and inflammatory activity grade 2.

Fig. 2
figure 2

A 45-year-old male patient with chronic hepatitis B with fibrosis stage 3 and inflammatory activity grade 2. a MR elastogram image superimposed on a conventional MR image; b Diffusion-weighted image with b = 0 s/mm2; c ADC map; dDt map; eDp map; f f map

Relationships between fibrosis stage and all parameters

All parameters except Dp were significantly different among the groups F0, F1, F2, F3, and F4 (all P < 0.001). All parameters data in each fibrosis group are summarized in Table 2.

Table 2 Comparison of LSM-MRE, LSM-TE, and IVIM parameters for each fibrosis stage

There was a strongly positive correlation between LSM-MRE and fibrosis stage (r = 0.910, P < 0.0001), and between LSM-TE and fibrosis stage (r = 0.813, P < 0.0001) (Table 2). LSM-MRE and LSM-TE values increased with the fibrosis stage progressing (all P < 0.0001) (Fig. 3a, b). In addition, there was a strongly negative correlation between Dt and fibrosis stage (r = − 0.727, P < 0.001), a moderately negative correlation between f and fibrosis stage and between ADC and fibrosis stage (r = −0.503 and r = − 0.601, all P < 0.001) (Table 2). Nevertheless, no significant correlation was observed between Dp and fibrosis stage (r = − 0.121, P > 0.05) (Table 2). All diffusion parameters except Dp decreased with the fibrosis stage progressing (all P < 0.001) (Fig. 3c–f).

Fig. 3
figure 3

Box plots of a liver stiffness value from MRE (LSM-MRE), b liver stiffness value from TE (LSM-TE), cDt, dDp, e f, and f ADC values for fibrosis stage groups F0, F1, F2, F3, and F4. LSM-MRE, LSM-TE, Dt, f and ADC values were significantly different among the above groups (all P < 0.001). However, Dp values did not show significant differences (P > 0.05). Line in each box characterizes the median, and the horizontal boundaries of the boxes characterize the first and third quartiles

Diagnostic performance of MRE, TE, and IVIM-DWI

Table 3 shows the AUCs, sensitivity, and specificity of LSM-MRE, LSM-TE, Dt, f, and ADC at their optimal cut-off values in detecting fibrosis stage ≥ F1, stage ≥ F2, stage ≥ F3, and stage F4. For the evaluation of fibrosis stages (≥ F1/ ≥ F2/ ≥ F3/F4), LSM-MRE showed the higher diagnostic value than LSM-TE, Dt (all P < 0.05), with an exception that LSM-MRE and LSM-TE showed a comparable diagnostic performance in detecting stage ≥ F1 (Table 4). For the evaluation of fibrosis stages, although LSM-TE showed the higher AUCs, there were no significant differences between LSM-TE and Dt (P > 0.05) except in detecting stage F4 (P < 0.05) (Table 4).

Table 3 Performance of MRE, TE, and IVIM in the prediction of fibrosis stages
Table 4 P values from a statistical comparison of ROCs of LSM-MRE, LSM-TE, and Dt in detecting fibrosis stages

The overall mean intraclass correlation coefficient of LSM-MRE between the two independent radiologists was 0.936 (P < 0.001). The overall mean interclass correlation coefficient of IVIM parameters between the two independent radiologists was 0.874 (P < 0.001).

Discussion

In the study, we observed a significantly positive correlation between HF stage and LSM-MRE and LSM-TE, and a significantly negative correlation between HF stage and IVIM parameters with the exception of Dp. We also observed that MRE showed the highest diagnostic performance compared to TE and IVIM for discriminating fibrosis stage ≥ F2, ≥ F3 , and F4. Nevertheless, TE and IVIM had similar diagnostic performance for discriminating fibrosis stage ≥ F1, ≥ F2 , and ≥ F3.

Several prior studies [13, 14, 16, 25, 26, 29] have reported that MRE and TE showed significantly correlation with the HF stage and that the liver stiffness values increased as HF stage progressed. Our study had the similar results agreed with these prior studies. Furthermore, some studies [16, 23] have reported that Dp values from IVIM model showed negative correlation with the HF stage and the corresponding parameter values reduced as HF stage increased. Interestingly, the Dp values showed no significant correlation with the HF stage in this study; nevertheless, Dt showed a negative correlation. We considered that the discrepancies between the results of this study and the previous studies were caused by the following reasons: First, in this study, very low b-values (0 < b < 50 s/mm2) were not selected, which may result in underestimation of Dp; second, the instability and the large SD of Dp values might influence the findings. As prior studies have reported, the Dp values obtained from the IVIM model are unstable and may have low repeatability [16, 36]; third, the sample size in each stage was different and the study populations varied.

Previous studies [1, 23, 37] have indicated that patients with significant fibrosis (≥ F2) have a higher risk of developing cirrhosis in the future and need treatment; patients with advanced fibrosis (≥ F3) or cirrhosis should be screened for portal hypertension and HCC. Hence, accurate diagnosis of HF stage is very meaningful. In the study, LSM-MRE showed high diagnostic capability in detecting stage ≥ F1, ≥ F2, ≥ F3 , and F4 with cut-off values of 3.16, 3.57, 4.41, and 5.1 kPa, representing a combination of high sensitivity (86.9%, 89.0%, 87.8%, and 100%) and specificity (96.2%, 97.7%, 92.1%, and 90.1%). In comparison, Wang et al. [1] reported sensitivity of 72%, 91%, 92%, and 95% and specificity of 100%, 97%, 95%, and 87% with relatively higher cut-off values of 5.02, 5.37, 5.97, and 5.97 kPa. Venkatesh et al. [8] reported sensitivity of 96.08%, 97.4%, 100%, and 100% and specificity of 91.7%, 100%, 94.1%, and 95.2% with relatively lower cut-off values of 2.74, 3.2, 3.7, and 4.33 kPa. The current study and previous studies [1, 7,8,9,10, 12, 13] all showed that MRE could provide similarly excellent diagnostic performance for staging HF, whereas they showed variety of cut-off values which may be explained by the differences of the sample size and the etiology of study population, imaging protocols, and post processing procedures [1].

In the study, TE showed high diagnostic performance for detecting fibrosis stages, which was similar to previous studies [31, 32, 38]. However, the cut-off values of LSM-TE in detecting stage ≥ F3 and F4 (7.85 and 10.45 kPa) in this study are relatively lower than those results of prior studies (9.5 and 12–13.01 kPa) [31, 32, 38]. The inconsistencies between this study and prior studies might be ascribed to the factors including the etiology of the liver disease, the unbalanced distribution of patients with various degrees of HF, as well as the BMI of study population. Compared with MRE, TE showed inferior ability in distinguishing each stage of HF with the exceptions in detecting ≥ F1. LSM-MRE also demonstrated a combination of higher sensitivity and specificity in detecting stage ≥ F2, ≥ F3, and F4 than LSM-TE.

In the study, IVIM showed moderate diagnostic performance in detecting fibrosis stages. In a comparative study, LSM-MRE showed superior diagnostic performance than Dt from IVIM model (AUC: 0.964 vs. 0.865 for ≥ F1, 0.965 vs. 0.875 for ≥ F2, 0.957 vs. 0.879 for ≥ F3 and 0.983 vs. 0.861 for F4) and demonstrated a combination of higher sensitivity and specificity in detecting stage ≥ F1, ≥ F2, ≥ F3, and F4 than Dt. In the study by Dyvorne et al. [33] for assessment of advanced fibrosis and cirrhosis, LSM-MRE showed superior diagnostic performance than Dt from IVIM model (AUC: 0.94 vs. 0.79), which was in agreement with our study. For the detection of fibrosis stages, although LSM-TE demonstrated higher AUC values than Dt (AUC: 0.921 vs. 0.865 for ≥ F1, 0.906 vs. 0.875 for ≥ F2, 0.913 vs. 0.879 for ≥ F3 and 0.931 vs. 0.861 for F4), there was no significant difference in their capability to distinguish HF stages except for detecting F4. The study by Dyvorne et al. [33] also showed the similar result. Thus, Dt showed comparable diagnostic performance to LSM-TE in detecting significant fibrosis and advanced fibrosis.

This study has several limitations. First, it is well known that HF usually accompanies varying degrees of inflammation and the degrees of inflammation is closely correlated with the progression and prognosis of HF, whereas we did not evaluate the influence of inflammatory activity grade on LSM-MRE, LSM-TE, and IVIM parameters in the staging of HF; further research is required to address this issue. Second, the perfusion-related diffusion coefficients (Dp) obtained from IVIM model might have been underestimated as very low b-values (0 < b < 50 s/mm2) were not selected in the study. Third, our study population comprised various chronic liver diseases from multiple etiologies because of the small sample size of each etiology except chronic hepatitis B. Fourth, for the specimens, we did not take hepatic steatosis into account during pathology analysis in the study. In addition, the influence of hepatic steatosis on the liver stiffness and IVIM parameters was not assessed. Although fat deposition might not affect the stiffness measurement, steatosis could cause inflammation, and thus increase liver stiffness even in the absence of liver fibrosis [16, 39, 40]. In the study by Sandrasegaran et al. [4] for assessment of hepatic steatosis in patients with chronic liver disease, IVIM parameters showed no significant association with the severity of hepatic steatosis. Fifth, the healthy volunteers were selected and assumed to be F0 and A0 without histologic findings as the reference standard. According to both previous studies [16, 41], we selected strict clinical and laboratory data which served as the reference standard. Finally, we selected the ROIs of MRE and IVIM in the largest cross section of the right lobe of the liver and used slice position with vessels and fissures as landmarks. We selected the ROIs of TE in the right lobe of the liver, but TE provides localized spot measurements at limited depth in the liver in areas where there is an acoustic window. Thus, the ROIs of MRE, IVIM, and TE could not match exactly.

In conclusion, TE and IVIM imaging are useful techniques and have a comparable performance for detecting significant fibrosis and advanced fibrosis, while MRE shows the best performance in distinguishing significant fibrosis, advanced fibrosis, and cirrhosis compared with both TE and IVIM and may have the potential for replacement of liver biopsy as a reference standard.