Introduction

Reversible liver conditions, such as hepatic steatosis and liver iron overload, can progress to liver cirrhosis. Liver cirrhosis is an irreversible disorder for which there is no curative treatment except liver transplant. It may be complicated by portal hypertension, ascites, and clotting dysfunction, finally leading to liver failure and death. Further, subjects with liver cirrhosis have an increased risk of developing primary liver malignancies, especially hepatocellular cancer [1]. Early identification of liver fibrosis, a precursor to cirrhosis, is necessary to prevent conversion to the irreversible stage.

Liver core biopsy is the reference standard for grading chronic liver disease, including the staging of liver fibrosis. However, liver biopsy is an invasive technique and is associated with complications such as pain and bleeding [2, 3]. Further, liver core biopsy covers only a small part of the liver, and there is the possibility of sampling error [4, 5].

Noninvasive diagnostic investigations for detection of liver fibrosis and cirrhosis are clinically warranted. Ultrasound and computed tomography (CT) can only identify morphological changes of full-blown cirrhosis (enlarged left lateral and caudate lobes, irregular surface, and irregular intrahepatic septa) [68]; however, both are limited in grading liver fibrosis. MR elastography may have the potential to assess liver stiffness [9]; however, its clinical role in assessing patients with liver fibrosis remains to be investigated further.

Magnetic resonance imaging (MRI) has the potential to quantify and grade liver fibrosis [1014]. One promising technique is diffusion-weighted MRI (DW-MRI). This MR technique measures water motion in the extracellular space and water diffusion can be quantified by calculating the apparent diffusion coefficient (ADC). Liver cirrhosis is characterised by the extracellular accumulation of collagen fibres, glycosaminoglycans, and proteoglycans, which theoretically results in restricted water diffusion [12]. DW-MRI may improve the diagnosis of liver fibrosis and cirrhosis, although initial results are contradictory [11, 12, 1518]. Several investigators have successfully used DW-MRI for assessing hepatic fibrosis and cirrhosis [11, 15, 16]. Two recent studies concluded that DW-MRI is capable of staging liver fibrosis and liver cirrhosis and distinguishing these two conditions from healthy liver tissue [12, 17], whereas another group reported that DW-MRI is limited in differentiating among different fibrosis grades and is only capable of differentiating healthy tissue from cirrhotic liver tissue [18]. Interestingly, different authors have found fundamentally different ADC baseline values in healthy subjects [1922].

The impact of liver fat on ADC is currently unclear. A recent study found a significant relationship between liver fat and ADC [23], whereas another study found no such relationship [24]. Furthermore, liver iron has been found to have no influence on the ADC value, suggesting that it does not impair quantification of liver cirrhosis [11]. However, in theory and in our clinical experience, there is a signal loss in subjects with liver iron overload because iron shortens the T2/T2* relaxation time [25].

Despite these recent insights, the interpretation of diffusion-weighted images is not well understood in individuals with diffuse liver disease. Contradictory results in previous studies may indicate that DW-MRI is a nonspecific surrogate marker for diffuse liver diseases with liver fat and liver iron as possible confounders. The purpose of our study was to investigate the reliability of DW-MRI for staging liver fibrosis in the presence of liver fat and liver iron.

Materials and methods

This retrospective study was conducted in accordance with the local Institutional Review Board of Greifswald University Hospital. Written informed consent was obtained separately for liver biopsy and postinterventional MR imaging.

Subjects and study design

Between February 2009 and March 2012, 95 patients underwent a clinically indicated image-guided liver core biopsy. The study population included 48 men and 47 women with a mean age of 57.0 ± 14.2 years. Clinical indications for biopsy were confirmation of suspected focal malignancy in 44 patients and workup of elevated liver enzymes in 51 patients. Samples were obtained using 16–18-gauge biopsy devices. In accordance with our clinical standard, in patients with focal liver lesions, two samples were obtained in one session under image guidance, the first from surrounding nonlesional liver and the second from the target lesion.

Within 24–72 h after liver core biopsy, postinterventional MRI of the upper abdominal organs was performed in each subject, including a chemical-shift-encoded T1-weighted sequence and DW-MRI.

Inclusion criteria for this study were an age of at least 18 years, a valid liver biopsy including a core sample of nonmalignant tissue, and valid histopathological processing of the sample. Subjects with postinterventional haemorrhage were excluded. A haemorrhage was defined as hyperintense signal alterations in the T1-weighted data sets. No subjects had to be excluded because of postinterventional complications.

Overall, 96 valid liver core samples from 95 patients were included in this study. In one patient with geographical fatty degeneration, two samples from nonmalignant tissue in different regions were obtained in the same session.

Histological assessment

One pathologist (M.E.) with more than 14 years of experience in liver pathology reviewed the samples to stage fibrosis using an H&E-stained slide and additionally a Sirius-red-stained slide. The fibrosis stage was determined according to the modified Ishak scale [26]. This scoring system distinguishes three grades: grade 0, no fibrosis (Ishak score 0); grade 1, fibrous expansion (Ishak score 1–4); grade 2, incomplete (marked bridging)/complete cirrhosis (Ishak score 5–6).

Liver fat content was estimated in 5 % steps from 0–100 % including an intermediate step of 3 % using an H&E-stained slide as described in more detail previously [27].

Following the clinical standard, liver iron content was semiquantitavely assessed based on a Perl’s Prussian blue reaction using a four-point scoring system: grade 0, no iron; grade 1, low iron; grade 2, moderate iron; grade 3, high iron content.

Postinterventional MR imaging

All postinterventional MRI examinations were performed with a 1.5-T system (Magnetom Avanto, Siemens Healthcare, Erlangen, Germany) using a 12-channel phased-array surface coil.

To obtain anatomical information and to rule out complications after biopsy, a T2*-corrected chemical-shift-encoded multi-echo T1-weighted 3D gradient echo sequence with water-fat separation (TR/TE1/TE2/TE3: 11/2.4/4.8/9.6 ms; flip angle = 10°; bandwidth = ±1,065 Hz/pixel; matrix = 224 × 168 × 64; FOV = 410 × 308 mm; slice thickness: 3 mm) was acquired in each subject.

Thereafter, isotropic diffusion-weighted imaging was performed using a spin-echo-based echo-planar imaging sequence. Imaging series with different diffusion weightings (b-values) were acquired starting with a b-value of 50 mm2/s, followed by two b-values of 400 mm2/s and 800 mm2/s. The acquisition was gated using a prospective acquisition correction technique (PACE). Further imaging parameters were: TR = 1,700 ms; TE = 72 ms; flip angle = 90°; averages = 2; bandwidth = ±1,735 Hz/pixel; matrix: 115 × 192 × 30; FOV phase = 285 mm; parallel imaging acceleration factor = 2; slice thickness = 6 mm. A spectral adiabatic inversion recovery fat saturation technique was used for DW-MRI.

Apparent diffusion coefficient maps (ADC maps) were automatically calculated at the end of each acquisition. ADC calculation was performed using a monoexponential fitting model of the data sets: b50, b400, and b800. The ADC was calculated as follows: S(b) = S0 e−ADC b.

Image analysis

One reader (R.B.) with more than 2 years of experience in hepatobiliary imaging reviewed the ADC maps using the Osirix software (version 3.8.1, 64 bit, Pixmeo Sarl, Bernex, Switzerland). The observer was unaware of the histopathological results.

A representative liver slice was selected to avoid regions with artefacts and partial volume effects, liver tumours, and large vessels including the central portal vein and the central hepatic vein. In this slice, the liver was manually segmented, and the mean ADC was documented for each case.

In the one case of geographical fatty degeneration, two independent biopsy samples were obtained—one from the fatty tissue and one from the nonfatty tissue. Two circular ROIs were placed in the liver segments from which biopsies had been obtained.

Statistics

Values were found to be normally distributed. Variables are presented as median values and the corresponding interquartile ranges (IR). Correlation analysis was performed using a linear regression model with robust estimators of the standard errors. First, correlations between ADC and histological fibrosis stage were calculated for all samples independently of the presence of fat and iron. Thereafter, ADC correlations were repeated after exclusion of potential confounders (i.e., liver fat, liver iron, and liver fibrosis) in subgroups, defined as follows:

  • ADC versus stage of fibrosis: samples without liver fat and without liver iron;

  • ADC versus liver fat: samples without fibrosis/cirrhosis and without iron overload;

  • ADC versus liver iron: samples without fibrosis/cirrhosis and without liver fat.

Fat content was treated as a continuous variable. A liver fat content of ≤ 5 % was defined as a sample without liver fat. Liver fibrosis and liver iron load were treated as categorical data.

Calculations were performed using Stata 11.2 (StataCorp 2009. Stata Statistical Software: release 11. College Station, TX: StataCorp LP). All statistical significance tests refer to the linear regression model. Coefficients of the slopes are reported as ß-values. The level for statistical significance was set to P < 0.05.

Results

In the total study group (n = 96), histological fibrosis grades were distributed as follows: no fibrosis (grade 0) in 60 samples, fibrosis expansion (grade 1) in 16 samples, and incomplete/complete cirrhosis (grade 2) in 20 samples. In this group, the median ADC value of 995.0 × 10−3 mm2/s (IR: 849.0 × 10−3 mm2/s – 1,114.3 × 10−3 mm2/s) did not differ significantly between stages of fibrosis or cirrhosis (P = 0.17) (Fig. 1a).

Fig. 1
figure 1

The graphs represent liver ADC values for the three histologically determined grades of liver fibrosis (a) for the whole study population and (b) for the subgroup without liver iron and with liver fat content ≤ 5 %. There are no significant differences in median ADC values among the three fibrosis grades. However, the three fibrosis groups have higher median ADCs and smaller interquartile ranges after exclusion of subjects with liver fat and iron

ADC versus fibrosis (subgroup analysis)

After exclusion of cases with potential confounders such as liver fat or iron, a subgroup of 40 cases remained: grade 0: 22 samples; grade 1: 6 samples; grade 2: 12 samples. Analysis within this subgroup also revealed no significant differences in ADC values among fibrosis grades (P = 0.75) (Fig. 1b). However, after exclusion of subjects with fat and iron, median ADCs were higher, and variability was smaller.

ADC versus fat (subgroup analysis)

Thereafter samples with liver fibrosis and liver iron were excluded to test the effects of liver fat. This subgroup included 42 samples: 22 samples without liver fat (histological fat content ≤ 5 %) and 20 samples with liver fat (fat content > 5 %). Median histologically determined hepatic fat content was 25 % [interquartile range (IR): 15 %–45 %]. Linear regression of subjects in the subgroup without fibrosis and without liver iron revealed that ADC values decreased significantly with increasing liver fat content (R 2 = 0.28; ß = -5.3; P < 0.001) (Figs. 2 and 3). In this subgroup, the median ADC of samples without liver fat was 1,012.0−3 mm2/s (IR: 264.8−3 mm2/s – 1,334.7−3 mm2/s) compared to a median ADC of 868.5−3 mm2/s (IR: 161.7−3 mm2/s – 1268.8−3 mm2/s) in samples with liver fat.

Fig. 2
figure 2

Comparison of image sets obtained with different diffusion coefficients (50/400/800 mm2/s) and their corresponding ADC maps in clean subgroups of no liver disease, manifest cirrhosis, high liver fat content, and high liver iron content. There was no visually apparent difference in ADC maps between healthy and cirrhotic liver tissue. However, lower ADC values were calculated in the presence of liver fat and high liver iron content. Furthermore, changes in ADC from iron overload are likely due to signal decay and noise floor effects in subjects with severe iron overload

Fig. 3
figure 3

Linear regression analysis of ADC values for samples without fibrosis/cirrhosis and without liver iron (a) revealed an inverse relationship between ADC and hepatic fat content (R 2 = 0.28; ß = -5.3; P < 0.001). Furthermore, the median ADC of patients with high iron load (without fibrosis/and without liver fat) (b) was significantly different (**) from that of patients without liver iron (β = -489; P = 0.005). Therefore, liver fat or high liver iron concentrations affect ADC values, possibly complicating quantitative staging of fibrosis

ADC versus iron (subgroup analysis)

The subgroup for analysis of the relationship between ADC and liver iron content in subjects without liver fibrosis/cirrhosis and fat ≤ 5 % included 33 samples: 22 samples without iron (grade 0) and 11 samples with an increased liver iron content (grade 1: 5; grade 2: 3; grade 3: 3 samples). Linear regression analysis in this subgroup without fibrosis and without liver fat demonstrated a moderate correlation between ADC values and iron load (R 2 = 0.29). However, ADC values in patients with a high iron load (grade 3) were significantly different from those of patients without liver iron (grade 0) (β = -489; P = 0.005). The median ADC in group 0 (samples without iron) was 1,094.6 × 10−3 mm2/s (IR: 993.7 × 10−3 mm2/s–1,134.7 × 10−3 mm2/s), whereas the median ADC in samples with high iron content (grade 3) was 506.6 × 10−3 mm2/s (IR: 319.5 × 10−3 mm2/s–905.5 × 10−3 mm2/s). Only low (group 1) and moderate iron (group 2) levels did not differ significantly from the reference group without liver iron (P = 0.93 and P = 0.54) (Figs. 2 and 3).

The regression analysis presented in Fig. 3 revealed one sample with a very low ADC of 264.8-3 mm2/s. Liver fat, liver iron, and fibrosis were ruled out by biopsy. The biopsy did not reveal hepatitis in this case (Fig. 3).

Discussion

This study investigated the reliability of DW-MRI for the staging of liver fibrosis and liver cirrhosis in the presence of common liver diseases, specifically fatty liver and liver iron overload. Our results suggest that the DW-MRI technique used in our study has limitations both in staging liver fibrosis and in differentiating between healthy liver and cirrhosis. Furthermore, liver fat and high liver iron levels result in lower ADC and both are possible technical confounders hampering the reliability of DW-MRI independent of the operating mode.

DW-MRI allows sensitive evaluation of the molecular motion of random microscopic molecules, which can be measured by the apparent diffusion coefficient (ADC) [2830]. This technique is commonly used for the staging of liver fibrosis, evaluation of cirrhosis, quantification of hepatitis, therapy monitoring, and assessment of treatment response [12, 17, 28, 3133]. Despite the broad spectrum of indications for DW-MRI, our results suggest that the technique is not specific for staging liver fibrosis.

The ADC is affected by several factors, such as permeability and cellular structure [29]. Therefore, both diffusion and perfusion influence the ADC value. At present it is still unclear which effect is more important for the staging of liver fibrosis. Annet et al. found a correlation of ADC and stage of liver fibrosis in rats, which they no longer observed after the animals had been killed [18]. The authors suggest that the abolished perfusion may explain the decrease of the hepatic ADC measured in vivo in rats with liver fibrosis. To avoid microperfusion effects and to investigate only the impact of true water diffusion, a monoexponential model starting with a b-value of 50 was chosen in this study. Our study identified no relation between ADC values and histopathologically determined stages of fibrosis, suggesting that the molecular water diffusion technique is inadequate for staging liver fibrosis.

In theory, in subjects with liver cirrhosis, arterial and portal venous pressure increases. Additionally, in cirrhosis, intrahepatic and extrahepatic portal venous shunts reduce vascular liver input, leading to decreased microperfusion. First results in humans suggest that microcirculation or perfusion plays a more important role than molecular water diffusion in differentiating cirrhotic from normal livers [34]. Kim et al. used another MR technique for the staging of liver fibrosis—T1-relaxometry before and after oxygen inhalation—in patients with and without liver cirrhosis and concluded that microperfusion is related to the stage of liver fibrosis [10]. Further studies are necessary to assess how microperfusion can be exploited for staging liver fibrosis.

Our findings show that fat affects the ADC of the liver, confirming the results of Poyraz et al. [23]. In theory, liver fat increases the size of hepatocytes. We assume that the increased size of liver cells is associated with a smaller extracellular space. Therefore, molecular motion of water molecules is restricted, and ADC values may be lower than in nonfatty liver. Another possible explanation for the decrease in observed ADC is incomplete fat suppression in fatty livers. Fat suppression in DW-MRI is typically achieved using spectrally selective techniques centered around the water resonance frequency (4.7 ppm). These techniques effectively remove the signal from the main (methylene) fat peak at 1.3 ppm and nearby fat peaks. However, the MR fat signal has a complex spectrum, including several peaks near the water peak (e.g., glycerol and olefinic protons) [35]. These fat peaks are probably not suppressed by spectrally selective techniques and may contribute to the measured DW signal, thus confounding ADC measurement.

The DW-MRI technique is most commonly performed using an echo-planar imaging technique, and it is well known that this technique is prone to susceptibility artefacts [36, 37]. Naganawa et al. reported a signal decay of the raw data in nonmalignant liver tissue after injection of superparamagnetic iron oxide particles: the combination of superparamagnetic iron oxides and DW-MRI significantly improved the detection rate [38]. Our results show a decreased ADC as an expression of the signal decay in the raw data in liver tissue with high iron concentrations. Changes in ADC from high iron overload are likely due to signal decay and noise floor effects in the fitting with severe iron overload. These noise floor effects may influence quantitative ADC analysis.

Furthermore, the ADC we measured in one patient without fibrosis and in the absence of liver iron and liver fat (see outlier in graph 3) suggests that ADC analysis is impaired by at least one additional confounder other than iron and fat.

Our study has several limitations. First, we investigated an inhomogeneous study population including patients with different types of liver lesions and patients on medical treatment, for example. DW-MRI must prove to be robust before it can be used clinically. Further, in this study, biopsy was the gold standard. Liver core biopsy covers only a small region and there is the possibility of a sampling error (e.g., surround lesions), which may limited our study results. Furthermore, diffusion-weighted single-shot echo-planar imaging is limited in terms of image quality, including a poor signal-to-noise ratio, and spatial resolution. This may explain part of the variability observed in ADC measurements observed by us and other investigators. Another limitation is the choice of b-values for DW-MRI and the monoexponential model used for ADC calculation. ADC measurements using a bi-exponential fitting model and a wider range of low (0–50 mm2/s) and high (50–800 mm2/s) b-values would have been better for visualising the effects of microperfusion and real water diffusion separately.

In conclusion, DW-MRI using a monoexponential fitting model and starting with a b-value greater than 50 mm2/s does not seem to discriminate adequately among healthy tissue, fibrosis, and cirrhosis. More studies are necessary to assess DW-MRI using a bi-exponential fitting model and to elucidate the role of microperfusion in the detection and quantification of liver fibrosis and cirrhosis. Biological factors such as hepatic steatosis and high liver iron level were found to influence quantitative ADC assessment. The effects of fat may be due to an actual change in water diffusion or to residual fat signal in fatty livers. Changes in ADC from iron overload are likely due to signal decay and noise floor effects in the fitting with severe iron overload.