Introduction

Hepatocellular carcinoma (HCC) is the second leading cause of cancer-related death worldwide [1]. HCC is typically diagnosed using dynamic contrast-enhanced CT or MRI. The imaging diagnosis of HCC typically relies on enhancement characteristics such as hyperenhancement during the arterial phase relative to the surrounding liver parenchyma and washout in portal venous or delayed venous phases post-contrast on CT and MRI [2]. While CT has the advantage of being widely available and quicker, it exposes patients to ionizing radiation. On the other hand, by combining multiple pulse sequences, MRI offers better tissue characterization, which aids in HCC diagnosis [3]. The sensitivity for HCC detection using CT has been reported to vary between 50%–80%, compared to 52%–93% for MRI [4,5,6,7,8,9,10,11,12,13], with superiority of MRI for detecting small HCCs [3, 9, 13].

Various gadolinium-based contrast agents (GBCAs) can be utilized when performing liver MRI, including extracellular (EC) agents and liver-specific agents (Gadobenate Dimeglumine and Gadoxetic acid EOB-DTPA]) [6, 14,15,16]. Even if EOB-MRI demonstrates high sensitivity for HCC detection with the addition of T1-weighted imaging obtained during the hepatobiliary phase (HBP) (92%), there is no clear demonstrated superiority of EOB-MRI over EC-MRI in terms of HCC detection [11].

Prior studies have assessed the diagnostic performance of CT and EC-MRI using liver explant pathology as the reference, with diagnostic per-lesion sensitivity for HCC detection ranging between 46%–98% for CT [17, 18] and 77.8%–100% for EC-MRI [6, 19, 20]. The published evidence for EOB-MRI in comparison with liver explant pathology is small, with 3 studies demonstrating per-lesion sensitivity ranging between 44.8%–94.6% [21,22,23]. It is generally accepted that EC-MRI is superior to CT for detection of small HCCs [10, 24]. Although there have been several studies comparing the diagnostic performance of CT or EC-MRI to EOB-MRI [4, 7,8,9, 25,26,27], the incremental value of EOB-MRI for HCC detection with explant pathology needs to be established. Current North American and European guidelines recommend either CT or MRI without specifying the type of gadolinium based contrast to be used for HCC diagnosis and staging before transplantation, due to lack of published evidence and absence of consensus in favor of one technique [28, 29]. Thus, the comparison of CT, EC-MRI and EOB-MRI is warranted to establish the optimal method for HCC diagnosis.

The main objective of our study was to compare the diagnostic performance of CT, EC-MRI and EOB-MRI for HCC diagnosis in pre-transplant cirrhotic patients, using liver explant as the reference. A secondary objective was to evaluate the added value of HBP for HCC detection when using Gadoxetic acid.

Materials and methods

Patients

This single-center HIPAA compliant retrospective study was approved by the local IRB, with a waiver for informed consent. All consecutive patients who underwent liver transplantation and had undergone CT or MRI, using either EC-GBCA or gadoxetic acid between November 2006 and October 2015 were identified through the institutional liver transplant database. Inclusion criteria were: a) Adult patients with liver cirrhosis that underwent liver transplantation over a period of 9 years, b) CT or MRI performed with an adequate technique and quality within 90 days prior to liver transplantation. Exclusion criteria were: a) Interval treatment between imaging and transplantation, b) Suboptimal image quality (exams with major breathing artifacts) or single-phase CT. Among 534 patients, 277 patients were retained for the study (M/F: 199/78, mean age 60 ± 11y, range:23-77y). Patients were divided into three groups based on the type of pre-transplant imaging modality performed: CT (n = 100), EC-MRI (n = 77) and EOB-MRI (n = 100) (Fig. 1). The choice of imaging modality (CT or MRI) was based on the physician’s preference. No patient had multiple imaging modalities within 90 days to transplantation. Consequently, each patient was included only in one study group.

Fig. 1
figure 1

Flow chart of patient population. Abbreviations: CE-CT: contrast-enhanced computed tomography, EC-MRI: extracellular gadolinium based contrast-enhanced MRI, EOB-MRI: gadoxetic acid-enhanced MRI.

All patients had liver cirrhosis, ranging in etiology as follows: chronic hepatitis C (n = 154), alcohol abuse (n = 29), chronic hepatitis B (n = 24), primary biliary cirrhosis (n = 16), nonalcoholic steatohepatitis (n = 21), autoimmune hepatitis (n = 13), primary sclerosing cholangitis (n = 12) and cryptogenic cirrhosis (n = 8). All patients were transplanted based on United Network for Organ Sharing (UNOS) liver allocation selection criteria [2], including 157 patients (57%) that received locoregional therapies for HCC downstaging.

Imaging

All imaging studies were performed with state-of-the-art systems, using standard liver dedicated protocol. Details on the imaging systems and protocols are provided in supplemental material.

Image analysis

Three radiologists (CS, reader 1; SL, reader 2, and MC, reader 3) with 1, 7, and 8 years of experience in abdominal imaging, respectively, independently reviewed the images on a picture archiving and communication system (PACS). Exams were reviewed in a different random order (including the imaging modality) but if an exam contained multiple lesions all lesions were scored in the same reading session. The readers were asked to record all hepatic observations measuring ≥ 1 cm (excluding cysts and hemangiomas), with mention of lesion size and Couinaud segment location.

The readers categorized lesions using the Liver Imaging Reporting and Data System (LI-RADS) v2017 diagnostic algorithm relying on dynamic contrast-enhanced sequences [30]. A lesion was diagnosed as HCC if it was categorized as LI-RADS 5. Washout and capsule enhancement were assessed on portal venous or equilibrium/transitional phases for the EC-MRI group and on portal venous phase only for the EOB-MRI group. The readers were blinded to all radiological and pathological clinical reports and to treatment history. As diffusion-weighted imaging (DWI) was made systematically available on routine MRI protocols only after 2011, it was not assessed in this study.

The images from the EOB-MRI group were reviewed in a second reading session at least 6 weeks after the initial assessment, to minimize recall bias. The readers were asked to re-assess all lesions ≥ 1 cm, using dynamic contrast-enhanced characteristics with the addition HBP sequence, and assign appropriate LI-RADS scores, without the information from the first reading. Hypointensity on HBP was used as an ancillary finding according to the LI-RADSv2017 diagnostic algorithm in the hope of improving lesion detection. Any finding detected on this sequence was reported by the reader and analyzed back on dynamic contrast-enhanced in order to apply a LI-RADS score.

Reference standard

The study coordinator (SS) reviewed the images and correlated the lesions recorded on imaging with the explant pathologic findings, based on lesion size and segment location. Cholangiocarcinoma and mixed HCC-cholangiocarcinoma tumors were excluded from the analysis. Only viable HCCs ≥ 1 cm on pathology were included in the study. Lesions scored LI-RADS 1–4 and LI-RADS M were considered negative for HCC. Completely necrotic tumors were excluded from analysis. Lesion size on pathology was used to define sub-groups of analysis: 1–1.9 cm and ≥ 2 cm. On a lesion-basis, true positive (TP) was defined as a lesion scored LI-RADS 5 with corresponding HCC in the same segment location at pathology. True negative (TN) was defined when no lesion was reported or scored LI-RADS 5 and no HCC was found in the same segment location at pathology. False positive (FP) was defined when a lesion was scored LI-RADS 5 and no HCC was found in the same segment location at pathology. False negative (FN) was defined when an HCC was found at pathology and no lesion was scored LI-RADS 5 in the same segment location. On a patient-basis, TP was defined when at least one TP lesion was found. TN was defined when no lesion was found or all observations were considered TN. FP was defined when no HCC was found at pathology and at least one FP lesion was recorded. FN was defined when at least one HCC was found at pathology and no lesion was considered TP.

Statistical analysis

All statistical tests were conducted at the two-sided 5% significance level using SAS version 9.3 (SAS Institute). Logistic regression for correlated data was used to assess and compare the imaging methods in terms of per-patient and per-lesion diagnostic accuracy for the detection of HCC using explant pathology used as reference standard. Per-patient and per-lesion analyses are important for patient classification and liver transplant eligibility [2]. For the per-lesion analysis, an indicator variable identifying the lesion associated with each observation was included in the model as a blocking factor so that methods could be compared on a per-lesion basis and results then pooled over lesions. Per-lesion and per-patient sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and accuracy were calculated for each modality, and the performance characteristics of the imaging modalities were compared. The per-lesion analysis for HCC detection were first conducted using data from all lesions regardless of size and then stratified according to lesion size at pathology (1–1.9 cm or ≥ 2 cm). Inter-reader agreement for HCC detection was assessed using kappa (κ) coefficients.

Results

Histopathologic results

A total of 376 HCCs were identified in 177 patients. There were 5 patients with 5 non-HCC malignancy (1 cholangiocarcinoma and 4 mixed HCC-cholangiocarcinomas). These patients were not excluded (as they also had HCC), however the 5 non-HCC malignant lesions were excluded from per-lesion analysis. Out of 277 patients, 100 (36.1%) patients had no HCC, 84 (30.3%) patients had 1 HCC, 39 (14.1%) patients had 2 HCCs, 24 (8.7%) patients had 3 HCCs and 30 (10.8%) patients had more than 3 HCCs. Out of these tumors, 265 HCCs were viable (n = 72 in CT group, n = 61 in EC-MRI group, and n = 132 in EOB-MRI group) and 106 HCCs were completely necrotic (n = 40 in CT group, n = 11 in EC-MRI group, and n = 55 in EOB-MRI group). Differentiation of viable tumors was as follows: 85 well differentiated, 163 moderately differentiated, and 17 poorly differentiated. The mean size of the viable lesions was 2.1 cm ±1.4 cm, ranging from 1.0–9.1 cm. When stratified according to size, 158 HCCs were 1–1.9 cm (n = 46 in CT group, n = 30 in EC-MRI group, and n = 82 in EOB-MRI group) and 107 HCCs measured ≥ 2 cm (n = 26 in CT group, n = 31 in EC-MRI group, and n = 50 in EOB-MRI group).

Interobserver agreement

There was substantial agreement among readers for LI-RADS 5 categorization, with concordance rates of 90.7% for the CT group (κ = 0.810), 95.2% for the EC-MRI group (κ = 0.905), 88.7% for the dynamic EOB-MRI group (κ = 0.754), and 89.0% for the full EOB-MRI group (κ = 0.735).

HCC detection per patient

There was no significant difference in the specificity, PPV or accuracy between the groups when pooling all three readers together (Table 1). However, dynamic EOB-MRI was found to be superior to CT in terms of sensitivity (p = 0.032) and NPV (p = 0.043). There was no significant difference between dynamic EOB-MRI and EC-MRI in terms of sensitivity (p = 0.292), however full EOB-MRI had a significantly higher sensitivity than CT (p = 0.002) and EC-MRI (p = 0.047).

Table 1 Per-patient diagnostic performance for each modality for HCC detection with 95% confidence intervals

HCC detection per lesion

Both MRI techniques had significantly higher per-lesion sensitivity for HCC than CT (Table 2). There was no significant difference in the sensitivity for HCC between EC-MRI and dynamic EOB-MRI (p = 0.949). Despite this, the addition of HBP significantly improved the sensitivity of EOB-MRI for HCC detection (p = 0.006). There were no significant differences in specificity or PPV between the imaging modalities. Examples of HCCs with CT, EOB-MRI and EC-MRI are shown in Figs. 24.

Table 2 Per-lesion sensitivity for each modality for HCC detection with 95% confidence intervals
Fig. 2
figure 2

52-year-old male patient with chronic hepatitis C-related liver cirrhosis and HCC diagnosed with CT. (a) Axial arterial phase post-contrast image demonstrates hyperenhancement of the lesion (arrows). (b) Axial portal venous phase post-contrast image demonstrates lesion washout and capsule (arrows). Abbreviations: HCC: hepatocellular carcinoma, CT: contrast-enhanced computed tomography.

Fig. 3
figure 3

56-year-old male patient with chronic hepatitis C-related cirrhosis and HCC, evaluated with gadoxetic acid-enhanced MRI. (a) Axial fat suppressed T1-weighted image obtained during the arterial phase demonstrates 1.3 cm partially hyperenhancing lesion in segment 4 (arrow) with corresponding washout at the portal venous phase (b, arrow). The lesion demonstrates hypointensity on axial fat suppressed T1-weighted image obtained at the hepatobiliary phase (c, arrow). On initial evaluation, none of the readers identified the lesion on dynamic imaging, but the lesion was reclassified as LI-RADS 5 during the second reading session by all 3 readers. Abbreviations: HCC: hepatocellular carcinoma, LI-RADS: Liver Imaging Reporting and Data System.

Fig. 4
figure 4

57-year-old male patient with alcoholic liver cirrhosis and HCC, evaluated with MRI using EC-GBCA (gadobutrol). (a) Axial fat suppressed T1-weighted image obtained at the early arterial phase demonstrates 2.1 cm hyperenhancing lesion in segment 2 (arrow) with corresponding washout and capsule at the portal venous phase (arrow, b). All three readers characterized the lesion as LI-RADS 5. (c) Gross section of the explanted liver demonstrates 2.2 cm HCC (arrows). Abbreviations: HCC: hepatocellular carcinoma, EC-GBCA: extracellular gadolinium based contrast agents, LI-RADS: Liver Imaging Reporting and Data System.

Per-Lesion Sensitivity Stratified by Tumor Size

There was no significant difference in sensitivity for HCCs ≥ 2 cm between modalities for any individual reader or when pooling readers together, with sensitivity ranging between 83.9% and 93.8% for the pooled reader data (Table 3). For HCCs 1–1.9 cm, the sensitivity of MRI using either type of GBCA (64.6% with EC-MRI and 67.3% for full EOB-MRI using pooled reader data) was significantly higher than CT (34.4% sensitivity using pooled reader data, p = 0.012 and 0.003, respectively). There was no significant difference between the EC-MRI and EOB-MRI groups for detection of 1–1.9 cm HCCs (p = 0.9 and 0.249, pooled reader data, for dynamic and full EOB-MRI, respectively).

Table 3 Per-lesion sensitivity for each modality for HCC detection stratified by lesion size with 95% confidence intervals

Added value of HBP sequence

The addition of the HBP sequence obtained in the EOB-MRI dataset significantly improved per-lesion sensitivity for HCC detection (from 69.7% to 76.8%; p = 0.006) (Table 2). Readers 1, 2 and 3 detected 12, 13 and 15 additional HCCs with the addition of HBP images, in comparison with dynamic EOB dataset, respectively (Fig. 3). This improvement was also significant for HCCs 1–1.9 cm (p = 0.002) (Table 3). Despite the improvement in HCC detection with the addition of HBP sequences, there was no significant difference with EC-MRI. All HCCs but two were hypointense on HBP; the two isointense HCCs demonstrated typical findings on dynamic imaging and were both moderately differentiated on pathology. There were no hyperintense HCCs.

False-positive lesions

Two FP lesions were recorded in the CT group (1 for reader 1 and 2 for reader 2, none for reader 3, mean size 15.7 mm, range: 10–18 mm). Two FP were recorded in EC-MRI group (both for reader 2, sizes: 11 and 13 mm). For the dynamic EOB-MRI group, four FP were recorded (3 for reader 1 and 1 for reader 2, none for reader 3, mean size: 15.1 mm, range 11-20 mm). For the full EOB-MRI group, two FP for each reader were recorded (mean size: 14.5 mm, range 11-18 mm). When correlating with pathology, three of the FP lesions identified represented dysplastic nodules (11, 17 and 18 mm) and the rest had no correlate on explant and were thus presumed to represent areas of shunting or posttreatment changes.

Discussion

Our study demonstrates that EC-MRI and EOB-MRI are superior to CT for HCC detection on both per-patient and per-lesion basis, particularly for lesions 1-1.9 cm. We observed that EC-MRI was slightly superior to dynamic EOB-MRI on a per-lesion analysis, although without statistical difference between both techniques. We also observed that the addition of HBP sequence to dynamic EOB-MRI significantly improved HCC detection, particularly for lesions 1–1.9 cm, as shown before [14]. Despite this improvement in performance, there was still no significant difference in HCC detection compared to EC-MRI. On a per-patient perspective, sensitivity of full EOB-MRI was significantly higher than that of CT and EC-MRI.

It is generally accepted that dynamic MRI has high diagnostic accuracy for HCC detection, outperforming CT, for lesions < 2 cm [4, 7, 8, 24]. Regarding comparison of CT and EOB-MRI, two meta-analyses including comparative studies concluded that EOB-MRI was more sensitive for HCC detection [4, 31]. The present study is in line with these results as it shows a higher diagnostic accuracy for both MRI groups compared to CT, albeit in different populations. CT has the advantage over MRI as being more available, rapid and robust. The main disadvantages of CT are the radiation exposure and lower sensitivity for small lesions (as shown in the present study). Compared to CT, MRI provides higher contrast resolution and assessment of a greater number of tissue properties, which are valuable for lesion detection and characterization [3]. However, MRI is limited by accessibility, longer acquisition time, the need for optimization and expertise to be accurately interpreted, and increased sensitivity to motion [3].

EOB-MRI is often presented as a viable alternative to ECCM, with the added value of HBP. However, compared to ECCM, gadoxetic acid have several limitations: 1) Higher risk of arterial phase respiratory artifact (called transient severe motion), described in up to 17% of MRI examinations with gadoxetic acid, compared to 2% with ECCM in a prospective study [32]. 2) Dynamic imaging quality is also limited due to the smaller amount of gadolinium injected (0.025 mmol/kg for Gd-EOB-DTPA compared to 0.1 mmol/kg for ECCM), counterbalanced by a higher R1 relaxivity [33]. 3) Quality of the transitional phase is limited by early hepatocyte uptake of contrast, confounding assessment of lesion washout [34]. 4) Detection of HCC can be challenging in the setting of severe liver dysfunction, cholestasis, or biliary obstruction, in which hepatocyte contrast uptake can be significantly reduced or absent. 5) Cost of gadoxetic acid is higher compared to ECCM, in addition to the longer acquisition time. Limitations related to the dynamic phases can explain the low sensitivity for HCC detection per-lesion of dynamic EOB-MRI (69.7% with dynamic EOB-MRI, compared to 78.5% with EC-MRI, pooled reader data) in the present study, albeit not statistically different to EC-MRI. The addition of HBP has shown to improve lesion conspicuity compared to arterial phase imaging alone [25]. Additionally, the use of gadoxetic acid provides diagnostic information regarding both lesion characteristics and background hepatocellular function [34]. In the present study, we showed that the use of HBP allows the detection of additional lesions that are then characterized as HCC using dynamic sequences, as suggested before [21, 22, 27]. This is mainly due to increased conspicuity of HCC, which appears hypointense during HBP compared to the background liver parenchyma in over 90% of cases [25, 35]. Moreover, HBP may help differentiate benign nodules from HCC based on gadoxetic acid uptake [34]. In addition, gadoxetic acid may allow improved confidence in the evaluation and characterization of pseudolesions and peritumoral areas. The potential benefit of EOB-MRI has to be balanced with the risk of gadolinium tissue deposition in the brain when used repeatedly for surveillance that has been described to be higher with gadoxetic acid than macrocyclic GBCAs [36,37,38]. Future work is needed to assess the long-term effect of gadolinium deposition in tissues and investigate the risk-benefit ratio of gadoxetic acid compared to macrocyclic EC-GBCAs for HCC diagnosis. To the best of our knowledge, there is no data comparing the performance of EC-MRI to EOB-MRI using explant pathology as the reference. Kierans et al presented the lack of explant correlation as an important bias of their meta-analysis (only 13% of the selected studies included explant correlation) [39]. A small prospective crossover study from Besa et al demonstrated similar image quality for both GBCAs, with EOB-MRI being slightly superior to EC-MRI in terms of HCC detection [25]. More recently, in a prospective study in 91 patients with explant and resection specimen correlation, Min et al found that EC-MRI was superior to EOB-MRI for the diagnosis of HCC (sensitivity 77.9% vs. 66.3% for EC-MRI and EOB-MRI, respectively) [27]. The authors performed this analysis on a by-lesion perspective and explained the worse sensitivity of EOB-MRI with the known limitations of gadoxetic acid during the dynamic phase. In the present study, we found the same range of sensitivity for dynamic EOB-MRI (69.7%) but with a significant increase in lesion detection when adding HBP (76.8% for full EOB-MRI, p = 0.006). On a per-patient basis, we showed superiority of EOB-MRI for HCC detection compared to EC-MRI and CT, which may motivate the use of abbreviated EOB-MRI for HCC screening and surveillance [40].

Using liver explant pathology as the reference standard presents the advantage of sampling the whole liver, compared to biopsy. Moreover, biopsies are limited by intra-tumoral heterogeneity that limits biopsy performance known to have a sampling error rate as high as 20.3% [41, 42]. Our results are thus not limited to a patient-level but can be extended to a lesion-level analysis. This revealed that sensitivity was lower in smaller HCC. This could be explained by lack of hypervascularity in early stage HCC [43].

There are several limitations to our study. It was a retrospective single-center study. The CT and MR systems used were different, and the time ranges in which the various modalities were used differed, which may result in varying image quality and signal-to-noise ratios. For this reason, we were not able to assess DWI, as it was not obtained in all the MRIs. The added value of DWI has already been shown, including with explant correlation [5]. The patient populations in the different groups were different, with a higher HCC prevalence and treated patients in EOB-MRI group. The patients were grouped according to the type of imaging that was performed based on the referring physician choice, inducing a possible selection bias between imaging modalities. A large crossover prospective study would be ideal, however more difficult to perform. As we excluded 5 lesions from the analysis (cholangiocarcinoma and mixed HCC-cholangiocarcinoma), we acknowledge a slight overestimation of the diagnostic performance. We believe that this effect should be limited as the total of included lesions is high (265 lesions) and as we only excluded the lesions but not the patients.

In conclusion, our study demonstrates that MRI outperforms CT in detecting HCC, particularly for lesions measuring 1–1.9 cm, without significant difference in the performance of EC-GBCAs and Gadoxetic acid. EOB-MRI outperforms EC-MRI for per-patient HCC detection, and the addition of HBP improves the detection of small HCCs.