Rapid advancements in percutaneous, medical, and surgical therapy, including liver transplantation, rely on accurate noninvasive diagnosis and staging of hepatocellular carcinoma (HCC). While ultrasound may be used in screening or surveillance of HCC, contrast-enhanced CT and MRI are recommended for final noninvasive diagnosis by many societies including the American Association for the Study of Liver Diseases (AASLD), the Organ Procurement and Transplantation Network (OPTN), and others [16].

The Liver Imaging Reporting and Data System (LI-RADS) provides standardized reporting of imaging features in patients at risk for HCC. The goal of LI-RADS is to allow for consistent application of terminology, to reduce variability in reporting, to improve communication with referring physicians, and to provide data for quality assurance and research purposes [7]. For OPTN listing, LI-RADS 5 or OPTN 5 lesions are considered diagnostic of HCC without the need for histologic confirmation [1, 2]. This necessitates accurate diagnosis of HCC to minimize false positives and negatives because many patients with cirrhosis and imaging features consistent with HCC receive MELD exception points and thereby improve their standing on liver transplant waiting lists [811]. Existing literature suggests that using the characteristic enhancement patterns on which the LI-RADS criteria are structured, contrast-enhanced CT and MRI are moderately sensitive for small HCCs between 1 and 2 cm and are highly sensitive for larger HCCs measuring greater than 2 cm [1217].

Retrospective studies have shown that careful application of more stringent diagnostic criteria to smaller lesions may reduce the rate of false positives at liver transplant [18]. Other studies have shown high inter-observer reliability in the overall grading of lesions using LI-RADS and other criteria [19]. Reported inter-observer reliability for individual features is variable, with high agreement in arterial phase hyperenhancement at MRI but poor agreement on washout or capsule appearance [20]. The inter-reader agreement for major imaging features in HCCs has not yet been compared at CT vs. MRI, and the frequency of LI-RADS features and categories assigned to HCCs at CT vs. MRI is not yet known.

The primary aim of this project was to estimate the frequency of observation and compare inter-reader agreement for LI-RADS v2014 major features at CT vs. MRI in pathology-proven hepatocellular carcinoma. The secondary purpose was to estimate and compare the frequency of LI-RADS categorization at CT vs. MRI. Both goals would increase understanding of the impact that choice of imaging modality has on final LI-RADS category.

Methods

This study was reviewed by the local institutional review board, and informed consent was waived for this retrospective study. Pathology reports from liver specimens (explants and partial hepatectomies) of 605 sequential patients with cirrhosis between December 2008 and October 2013 were reviewed to identify specimens with at least one focus of histologically proven hepatocellular carcinoma. Lesions which were completely necrotic and without residual findings to confirm a diagnosis of HCC at pathology were excluded. Patients having undergone localized therapy (transarterial chemoembolization, wedge resection, thermal or ethanol ablation) prior to transplant or resection without available pre-treatment imaging were also excluded, as were lesions evaluated with gadoxetate disodium enhanced MRI (n = 8) or those with suboptimal arterial phase imaging (n = 4).Finally, lesions smaller than 1 cm at pathology were excluded because these lesions were felt to be too small to reliably identify at imaging, particularly given the inevitable delay between imaging and surgery. If multiple remote pre-treatment examinations were available, the two most recent pre-treatment studies were evaluated for determination of threshold growth.

Image acquisition

For CT studies performed at our institution, quad-phase technique was used with noncontrast imaging of the abdomen followed by late arterial phase, portovenous phase, and 3–5 min delayed phase imaging following the intravenous administration of iodinated contrast (iohexol, GE Healthcare, Milwaukee, WI). Patients undergoing MRI were imaged at 1.5 or 3.0 Tesla. Typical sequences included coronal T2 SSFSE, axial T2 with fat saturation, axial in and out of phase, axial diffusion-weighted imaging (DWI) with B = 500, precontrast axial T1 with fat saturation, and multiphase post-contrast T1 with fat saturation imaging of the abdomen following the administration of IV contrast.

Image interpretation

Each lesion described in the pathology report was correlated with the most recent pre-treatment multiphase imaging examination, CT or MRI. No lesions were evaluated at both CT and MRI. Studies were independently reviewed by two board-certified abdominal imagers who had access to full imaging datasets as well as the lesion size and segmental location as noted in the pathology report. In cases where identifying lesions of the correct size and location was difficult, or the two readers disagreed on lesion location, lesions were excluded from further analysis. Each reader measured the size of each lesion on the most conspicuous imaging phase and threshold growth compared to prior imaging (if available) was determined. Each lesion was then assessed by both readers in terms of arterial phase hyperenhancement, portal venous and delayed phase enhancement characteristics as well as the presence or absence of washout and capsule appearance. Mirroring the LI-RADS lexicon, washout was defined as a temporal reduction in enhancement relative to liver from an earlier to later phase and capsule appearance was defined as a peripheral rim of smooth hyperenhancement in the portal venous or delayed phase that is unequivocally thicker or more conspicuous than the rims of surrounding background nodules. In cases of disagreement between the two readers, images were re-reviewed and a consensus categorization and grading was performed. Each lesion was then assigned a LI-RADS category based on the v2014 guidelines [7]. Readers were blinded to the radiology report, although knew that all lesions were histologically confirmed as HCC.

Statistical analysis

Interval data are presented as an average and standard deviation. Inter-observer agreement of categorical data was evaluated using Cohen’s kappa statistic. A Fisher’s exact test was used to compare the proportion of lesions demonstrating arterial phase hyperenhancement, washout, and capsule appearance as well as rates of LI-RADS category between MRI and CT. Excel 12.1.0 (Microsoft, Redmond, WA) and R 3.0.2 (The R Foundation for Statistical Computing, Vienna Austria) were used for statistical analysis. A p value of <0.05 was used to determine significance for all tests.

Results

There were a total of 134 patients with 184 hepatocellular carcinomas with pre-therapy imaging who were included in this study. 108 patients were males and 26 were females, with a mean age of 58.2 ± 8.1 years. The average time between the most recent pre-treatment prior imaging study and surgery, either hepatectomy or transplant, was 12.7 ± 8.7 months. Mean lesion size was 2.6 ± 1.3 cm with ICC of 0.92 and 95% CI [0.88, 0.95]. Overall, 63 (34%) lesions measured ≥1 and <2 cm, while 121 (66%) were ≥2 cm. Because patients may have had multiple lesions and different lesions may have been noted at different times during serial follow-up imaging while awaiting transplant, a total of 108 unique CT and 33 unique MRI studies (141 total studies) were reviewed. By lesion, 141 (77%) were imaged by multiphase CT, and 43 (23%) lesions by MRI. Many of the CT and MRI studies reviewed were performed at outside institutions because our institution is a tertiary referral center for liver transplant. 79/141 (56%) studies reviewed were external, while 62/141 (44%) were performed at our institution. Additionally, CT studies spanned a time period during which a transition occurred where a delayed phase was commonly acquired. As a result, only 71/141 (50%) of CT studies included a delayed phase. CT studies without a delayed phase were not included in the washout, capsule, or LI-RADS analysis unless otherwise explicitly stated.

Major criteria

Table 1 shows the number of lesions that demonstrated each LI-RADS major criterion. Figure 1 shows the examples of each major criterion.

Table 1 Rate of LI-RADS major criteria in total and by modality, percent agreement between readers and Cohen’s kappa statistic for LI-RADS major criteria in total and by imaging modality
Fig. 1
figure 1

Examples of LI-RADS major criteria for HCC. A A large peripheral HCC in a 48-year-old man shows arterial enhancement. B HCC (white arrow) in a 65-year-old man demonstrates washout on delayed phase imaging. C Capsule appearance in addition to washout on delayed phase images is seen in a 58-year-old woman with cirrhosis and HCC. D Graphic representation of threshold growth defined as greater than 50% increase in diameter in 6 months or greater than 100% increase in 1 year

Arterial enhancement was the most commonly observed major criterion, seen in 159 (86%) of lesions, and was seen slightly more frequently at CT vs. MRI (87% vs. 86%, p = 1.00). Between the two raters, there was agreement on arterial phase characteristics in 156 (95%) cases (κ = 0.75).

Washout appearance was the second most frequently seen major criterion, seen in 94 (82%) lesions, and seen similarly at MRI and CT with delay (79% vs. 85%, p = 0.46). Agreement between readers on the presence or absence of washout occurred in 89% of cases (κ = 0.61).

Capsule appearance was seen in 31 (27%) lesions and was more commonly observed at MRI (44%) than at CT (17%), p = 0.002. Capsule appearance was most discrepant between the two readers, with 21 (18%) disagreements (κ = 0.56), with 13 (18%) disagreements at CT and 8 (19%) disagreements at MRI. Figure 2 shows four examples of cases in which there was disagreement between our readers on the presence or absence of capsule appearance. Subgroup analysis performed on CT studies with a delayed phase vs. those without a delayed phase revealed that washout was seen in 60/71 (85%) of studies with a delay compared to 37/70 (53%) of studies without a delay (p = 0.0001). Capsule was seen in 12/71 (17%) of CT studies with a delayed phase and in 7/70 (10%) of studies without a delayed phase (p = 0.32).

Fig. 2
figure 2

Discrepancy of capsule appearance assessment between readers. Axial contrast-enhanced MRI images in the delayed phase from a 57-year-old man (A) and a 66-year-old woman (B). Axial contrast-enhanced CT images also in the delayed phase from a 64-year-old man (C) and a 68-year-old man (D). All patients had pathologically proven hepatocellular carcinomas, and there was disagreement between readers on the presence of capsule appearance. In each case, the ultimate determination affected the LI-RADS category for the lesion. This effect was most pronounced for lesions between 1 and 2 cm, because arterially enhancing lesions ≥2 cm with washout qualify as LI-RADS 5 without additional major criteria

Of the 99 lesions with a relevant prior, 47 (47%) lesions demonstrated threshold interval growth (new lesion ≥1 cm, greater than 100% diameter increase in 12 months, or 50% in 6 months), with a similar distribution at both CT and MRI (49% and 42% respectively, p = 0.64). Figure 3 shows a LI-RADS 5 lesion demonstrating all 4 major criteria, arterial hyperenhancement, delayed phase washout, capsule appearance, and threshold growth at both CT and MRI.

Fig. 3
figure 3

Comparison of MRI and CT imaging of a LI-RADS 5 lesion in a 67-year-old man with cirrhosis and hepatocellular carcinoma. Arterially enhancing (A, B) segment 5/6 lesion which demonstrates washout and capsule appearance (C, D) was initially imaged at MRI (A, C). The lesion was imaged 4 months later at CT (B, D) re-demonstrating the same features in addition to a diameter increase meeting criteria for threshold growth

LI-RADS categorization

Figure 4 and Table 2 detail the distribution of consensus LI-RADS grading. ICC for LI-RADS category between individual readers was 0.59 with 95% CI of [0.30, 0.95]. There were 10 (9%) LI-RADS 3 lesions, 42 (37%) LI-RADS 4 lesions, and 61 (54%) LI-RADS 5 lesions. At CT with delay, the rates of LI-RADS 3, 4, and 5 lesions were 10%, 38%, and 52%, respectively. At MRI, these rates were 7%, 35%, and 56% (1 lesion was categorized LI-RADS 2 by MRI), respectively. At CT, 8% (n = 2) of 1–2 cm lesions were categorized LI-RADS 5, and at MRI, 43% (n = 6) were categorized LI-RADS 5 (p = 0.01). The presence or absence of capsule appearance changed the LI-RADS category (either between LI-RADS 3/4 or LI-RADS 4/5) in 7/8 (88%) small lesions where readers disagreed on capsule. In contrast, only 3/25 (12%) disagreements in capsule appearance resulted in change of category (all LI-RADS 4/5) of larger lesions (p = 0.0002).

Fig. 4
figure 4

Distribution of LI-RADS scores by modality. Although not significant, there was a trend toward increased proportion of LI-RADS 5 lesions at MRI compared to CT

Table 2 Distribution of LI-RADS category by modality and size

Discussion

We evaluated 184 pathologically proven HCCs in 134 patients who underwent liver transplantation or partial hepatectomy at our institution between December 2008 and October 2013. The most commonly visualized major feature was arterial phase hyperenhancement followed by washout appearance. Inter-observer reliability for arterial hyperenhancement and washout were substantial. Capsule appearance was rarely seen. Inter-observer reliability for overall LI-RADS category was moderate.

Mirroring results of prior reports comparing the rates of portal venous and delayed phase washout at both CT and MRI [21, 22], there was a significant increase in the rate of HCCs demonstrating washout on CTs with a delayed phase compared to CTs without a delayed phase The rate of washout seen at CT with a delayed phase and MRI was similar. These results reaffirm added value in delayed phase imaging despite the increased radiation dose resulting from an additional CT acquisition.

Only 54% of lesions were categorized as LI-RADS 5. A lesion categorized LI-RADS 5 was modestly sensitive (50%–59%) for HCC. This is an expected result because the LI-RADS categories were designed to have high specificity (not tested in this study) at the cost of sensitivity. Creating a composite category of LI-RADS 4 (probably HCC) and LI-RADS 5 (definitely HCC) would provide 90%–91% sensitivity. Further study including non HCC lesions such as other liver tumors and benign nodules will be needed to determine the specificity of this combined category.

Capsule appearance was the least commonly seen major feature and had only moderate inter-observer reliability. Nonetheless, capsule is an important characteristic, particularly in arterially enhancing lesions between 1 and 2 cm. In lesions of this size, both capsule and washout appearance are required in order to characterize a lesion as LI-RADS 5. Because of the increased incidence of capsule appearance on MRI compared to CT, more lesions between 1 and 2 cm were categorized as LI-RADS 5 on MRI. This is an important finding as MRI may up-categorize observations affecting transplant listing using the Milan or UCSF criteria [23, 24], though a single lesion measuring less than 2 cm will not influence a patient’s standing on the transplant list as it does not qualify as T2 disease [25].

Analysis of inter-reader variability between CT and MRI showed an unexpectedly low Cohen’s kappa statistic for arterial phase hyperenhancement at MRI (κ = 0.22). Because the calculation of Cohen’s kappa involves removing agreement by chance, the result is dependent on the prevalence of the finding being compared as well as the sample size being examined. In the case of arterial phase enhancement at MRI, the vast majority of cases (86%) demonstrated arterial phase hyperenhancement, and therefore, a nonenhancing lesion was rare, increasing the expected rate of agreement by chance and artificially decreasing kappa the smaller MRI group.

There were a number of limitations of this single-center, retrospective study. As a tertiary referral center for liver transplant, 56% of the studies reviewed were from other institutions, resulting in heterogeneous technique and image quality. Accordingly, we were unable to evaluate the presence of ancillary features such as T2 signal intensity and DWI in patients imaged using MRI.

It is important to note that the absolute time between imaging and pathology appears lengthy (12.9 ± 8.9 months). This reflects the length of the transplant waiting list in our region. Patients with an imaging diagnosis of HCC often received local therapy such as TACE or ablation. Because imaging after treatment was excluded from this study, only the pre-therapy imaging could be reviewed. Because a majority of patients underwent localized therapy prior to transplantation, we were forced to limit analysis to those lesions that were proven to be HCC on pathology. This prevented the detection of false positive lesions and specificity was not calculated. Lastly, patients are typically referred to our institution at the initial diagnosis of HCC, so prior studies were not always available which prevented evaluation of threshold growth in some cases.

In conclusion, we have evaluated the sensitivity and reproducibility of LI-RADS in a large series of pathologically proven HCCs. Arterial phase hyperenhancement and washout appearance are the most common major criteria and demonstrate the most robust inter-reader variability. Rates of observed arterial hyperenhancement and capsule were similar at both CT and MRI. While infrequently seen overall, capsule appearance is more common at MRI and may impact LI-RADS categorization primarily in lesions between 1 and 2 cm.