Introduction

Hepatocellular carcinoma (HCC) usually develops in patients with chronic liver disease, and in this population, HCC has been recognized as a leading cause of death [1]. The prolonged subclinical course of this cancer and the identification of a target population at risk of developing HCC provide the opportunity for early detection through the implementation of surveillance programs [2, 3]. HCC surveillance relies on ultrasonography (US), and the unequivocal diagnosis of a US-detected nodule within a cirrhotic liver represents a major clinical challenge.

Aimed to develop a system for standardizing the terminology, technique, interpretation, reporting, and data collection of liver imaging in patients at risk of developing HCC, the American College of Radiology (ACR) developed the Liver Imaging Reporting and Data System (LI-RADS) [4]. The purpose of this system is to guide the diagnosis and management of liver observations identified by imaging techniques. LI-RADS classifies the full spectrum of liver lesions and pseudolesions in categories, ranging from benign to malignant/HCC lesions.

LI-RADS is a dynamic document originally released in 2011, which has been updated several times incorporating information regarding probabilities of benign, HCC, and other malignancies in each LI-RADS category. In that sense, the diagnostic performance of LI-RADS v2013 by MRI was evaluated in cirrhotic patients with a US-detected solitary nodule 20 mm or smaller. In those US-detected nodules, the LR-4 profile had a near-absolute specificity and, when combining both LR-4 and LR-5 as definitely HCC, the specificity was maintained with a significant increase in the sensitivity [5]. According to these results, an update of LI-RADS in 2014 (v2014) implemented the possibility of categorizing as LR-5us those LR-4 observations (10–19 mm with arterial phase hyperenhancement (APHE) and washout (WO) if visible on surveillance US), and this criterion was maintained until v2017. Another important finding of that study was that a relevant proportion of LR-3 observations (29 out of 42 patients, 69%) was finally diagnosed as HCC. Contrarily, in other studies, the proportion of HCC in LR-3 observations was lower, and most of these lesions remained stable or even decreased in category during follow-up, thus supporting the recommendation of imaging follow-up with no need for recalling an invasive work-up including percutaneous biopsy [6,7,8,9].

In 2018, the latest version of LI-RADS was released [10, 11]. The main changes compared to the previous version of 2017 were the simplification of the definition of threshold growth and the assignment of LR-5 category to observations between 10 and 19 mm displaying non-rim APHE and non-peripheral WO or threshold growth, regardless of their identification by US. With this latest update, LI-RADS is consistent with and fully integrated into the American Association for the Study of Liver Diseases (AASLD) clinical practice guidance [3].

The aim of this study is to evaluate the diagnostic accuracy of each LI-RADS category by MRI according to LI-RADS v2018 in a cohort of patients with cirrhosis in whom a solitary nodule ≤ 20 mm was detected during screening US, with particular interest in identifying which features are associated with the HCC diagnosis in nodules categorized as LR-3.

Materials and methods

This study was approved by the institutional ethics committee for clinical research of our center, and the need to obtain consent for analysis of the data was waived. The inclusion was initiated in November 2003, and this assessment includes the patients recruited until February 2017 in order to ensure enough follow-up of patients with non-malignant diagnosis. Patients were Child-Pugh A-B with no history of HCC in whom a new solitary, well-defined, solid nodule between 5 and 20 mm was detected by screening ultrasound (US). Upon initial detection of hepatic nodule at screening US, patients were examined by dynamic MRI with extracellular contrast agent and finally submitted to ultrasound-guided biopsy. Pathology result was considered the gold standard, and biopsy was repeated if a conclusive diagnosis was not achieved. Since non-invasive criteria by MRI have been externally validated [12,13,14] and adopted as criteria for HCC diagnosis, we considered, after 2007, also the specific vascular profile (APHE and WO) by MRI as gold standard for HCC diagnosis, and if this was not present, pathology confirmation was requested. Nodules with neither pathological confirmation nor specific vascular pattern by MRI were followed with US every 3 months and MRI every 6 months, and a new biopsy was performed in case of growth or appearance of hypervascularization during the follow-up.

MR imaging

MRI examinations were performed using clinical 1.5-T systems (Symphony or Aera; Siemens Healthineers, or Signa; General Electric Healthcare) with a torso phased-array coil for signal detection. A standard clinical liver MRI protocol was used in all patients with axial unenhanced breath-hold T1-weighted gradient-echo in-phase and out-of-phase and T2-weighted half-Fourier single-shot turbo spin-echo or single-shot fast spin-echo sequences. Then, an axial multiphase dynamic study with a 3D breath-hold fat-suppressed T1-weighted interpolated spoiled gradient-echo sequence with extracellular gadolinium, at the recommended dose and rate (gadodiamide 0.5 mmol/mL, dose of 0.1 mmol/kg and rate of 2 mL/s; gadobutrol 1 mmol/mL, dose of 0.1 mmol/kg and rate of 2 mL/s), was done. After pre-contrast imaging, bolus tracking technique was used to obtain arterial-phase images, approximately 20 s after contrast injection. Portal venous and delayed venous phase images were acquired 60 to 65 s and 100 to 110 s, respectively, thereafter. Finally, a post-contrast axial 2D breath-hold fat-suppressed T1-weighted gradient-echo sequence was performed. Subtraction of the pre-contrast images from the arterial-phase images was performed. MRI with liver-specific contrast agent was not included in the analysis.

Image interpretation

The first MRI obtained within 1 month after US nodule detection was read by one of three radiologists with more than 10 years of experience in imaging of the liver (A.D., J.R., and C.A.).

The analysis considers only the target lesion initially detected by screening US. The following items obtained by MRI were separately evaluated for each target lesion and prospectively registered in a case report form by J.R. and C.A.: size (in mm) and signal intensity in each sequence and in each phase of the dynamic study after intravenous contrast injection compared with that of the liver parenchyma and categorically classified as hyperintense, isointense, or hypointense at visual inspection. Occurrence of APHE (defined as non-rim-like enhancement in arterial phase unequivocally greater in whole or in part than the liver), and WO in venous phases, and presence of fat and “capsule” were specifically registered. Subtraction images were used for assessment of APHE in observations that were hyperintense on pre-contrast T1-weighted images.

A LI-RADS category according to LI-RADS v2108 [10] was retrospectively assigned by A.D. and C.C. by consensus. Only major features were used for category assignment of LI-RADS 3, 4, and 5 observations. Both radiologists were unaware of the final diagnosis of the lesion.

Pathology diagnosis

Ultrasound-guided fine-needle aspiration was performed using a 20-gauge needle and/or core biopsy with an 18-gauge needle biopsy by an expert abdominal radiologist. Specimens were routinely processed and stained with hematoxylin–eosin. Diagnosis of HCC was made according to the criteria of the International Consensus Group for Hepatocellular Neoplasia [15].

Statistical analysis

Comparison of patients with HCC and patients with non-HCC nodules was done by using Student’s t test, ANOVA, or the Mann–Whitney test for continuous variables and the chi-square test or Fisher’s exact test for categorical variables. A p value of less than 0.05 was considered significant. The diagnostic accuracy for each LI-RADS v2018 category was described by sensitivity, specificity, and positive/negative predictive values and was expressed with their 95% confident interval and compared with the accuracy of current European Association for the Study of the Liver (EASL) criteria. Calculations were done with the Stata package version 14 (StataCorp, 1985–2015).

Results

Three hundred and fifteen cirrhotic patients with a single nodule equal or smaller than 20 mm detected by surveillance US were included. Fifty-one target nodules were not detected by MRI, and in two patients, the MRI was not technically adequate for a reliable evaluation: forty-two corresponded to benign lesions and the remaining eleven were finally diagnosed as HCC. Accordingly, LI-RADS v2018 categories were determined in the 262 target nodules identified in the first MRI after US detection (83.2% of the whole cohort). The final diagnosis of the 262 patients were as follows: 197 HCC (75.2%), 5 intrahepatic cholangiocarcinoma (iCCA) (1.9%), 2 metastases of a poorly differentiated neuroendocrine tumor and a colorectal adenocarcinoma (0.8%), and 58 benign lesions (22.1%) that were followed for a median of 16 months (interquartile range, 11.6–24.2) to assure the absence of malignancy (Fig. 1). Biopsy was done in 209 out of 262 nodules (79.8%), and the final diagnosis was pathologically confirmed in 196 nodules (74.8%): in 153 out of 197 HCC nodules, in 36 out of 58 benign lesions, and in all non-HCC malignant lesions. The main patients’ characteristics are described in supplementary Table 1.

Fig. 1
figure 1

Flow chart of patients included in the analysis

LI-RADS v2018 categories

The LI-RADS v2018 categories and final diagnosis are summarized in Tables 1 and 2. Fifteen nodules were categorized as LR-1 (5.7%), 26 as LR-2 (9.9%), 74 as LR-3 (28.2%), 12 as LR-4 (4.6%), 127 as LR-5 (48.5%), and 8 as LR-M (3%).

Table 1 LI-RADS v2018 category of the 262 target lesions identified in the baseline MRI
Table 2 Categorization of the nodules considering major features (nodule size and presence of arterial phase hyperenhancement, washout and capsule)

All nodules categorized as LR-1 were confirmed as benign conditions. Six out of 26 LR-2 (23.1%) and 51 out of 74 LR-3 (68.9%) nodules were finally diagnosed as HCC. On the other hand, eleven out of 12 LR-4 (91.7%) and 126 out of 127 LR-5 (99.2%) nodules were finally diagnosed as HCC. The two false-positive diagnoses in LR-4 and LR-5 categories were an 18-mm poorly differentiated metastasis of a neuroendocrine tumor categorized as LR-5 and an arterial-hyperenhancing 20-mm nodule in an alcohol-related cirrhotic patient with a biopsy negative for malignancy, which disappeared during the work-up and did not reappear during more than 4 years of follow-up. It was classified as LR-4, and the final diagnosis was considered a regenerative nodule.

The eight nodules categorized as LR-M were 4 iCCA nodules of 14 mm, 20 mm, 20 mm, and 36 mm; 3 HCC nodules of 23 mm, 20 mm, and 15 mm; and a 15-mm nodule in a non-alcoholic fatty liver disease (NAFLD)-cirrhotic patient with two consecutive biopsies negative for malignancy, which disappeared during the work-up and did not reappear after more than 5 years of follow-up, categorized as regenerative nodule.

The diagnostic accuracies according to LI-RADS v2018 definitions are summarized in Table 3.

Table 3 Summary of the diagnostic accuracies for confident HCC diagnosis in nodules detected during US surveillance in patients with cirrhosis

Since our study includes patients recruited during a long time period, in which the imaging acquisition has evolved, we separately analyzed those patients included from 2010 and results are summarized in supplementary Table 2. The distribution of LI-RADS categories and the probability of HCC were very similar to the whole cohort. Also, we evaluated the LI-RADS categories in only those patients with final diagnosis confirmed by pathology (n = 196) (supplemental Table 3). In addition, tumor size was correlated with LI-RADS categories: Only 3 out of 15 LR-1 (20%) but 69 out of 127 LR-5 (54.4%) nodules were > 15 mm (supplementary Table 4).

LI-RADS 2 category

Twenty-six nodules (9.9%) were categorized as LR-2. Six out of the 26 LR-2 nodules were finally diagnosed as HCC (23.1%) and the remaining 20 as benign nodules after a median follow-up of 12.3 months (IQR, 7–21). Five HCC nodules were diagnosed by histology, and median time from MRI to final HCC diagnosis was 4.5 months (IQR, 1.2–12 months). As seen in Table 4, the 6 LR-2 nodules finally diagnosed as HCC did not show any difference compared to those finally classified as benign.

Table 4 MRI features of lesions categorized as LR-2

LI-RADS 3 category

The main patients’ characteristics are described in supplementary Table 2, and the main MRI findings are summarized in Table 5. Seventy-four nodules (28.2%) were categorized as LR-3. Fifty-one out of the 74 (68.9%) were finally diagnosed as HCC (Fig. 2), two as non-HCC malignant lesions (one 14-mm iCCA and a 15-mm colorectal adenocarcinoma metastasis) and the remaining 21 as benign nodules after a median imaging follow-up of 17 months (IQR, 12–27.3). Forty-nine out of 51 HCC nodules were confirmed histologically and the remaining by non-invasive diagnosis criteria during the diagnostic work-up. The median time from MRI to final HCC diagnosis was 1.4 months (IQR, 0.6–6.2 months). HCC diagnosis was closely related to baseline tumor size: All 5 nodules smaller than 10 mm were diagnosed as benign. Oppositely, 31 out of 48 nodules 11–15 mm (64%) and 20 out of 21 LR-3 nodules > 15 mm (95.2%) were diagnosed as HCC. The nodule size was statistically associated with final HCC diagnosis (p < 0.001). The only LR-3 nodule > 15 mm without diagnosis of HCC was a 56-year-old woman with NAFLD-related cirrhosis in whom a 15-mm iso-hypoechoic nodule was detected on US. The MRI showed a 20-mm nodule hyperintense in T1-weighted images and hypointense in T2-weighted images, not identified in dynamic studies. A biopsy was indicated but was not feasible due to lack of an adequate percutaneous access. The nodule remained stable during more than 2 years and was categorized as benign.

Table 5 MRI features of lesions categorized as LR-3
Fig. 2
figure 2

Eleven-millimeter US-detected nodule (arrow) in segment V (a). The nodule (arrows) is slightly hyperintense on T2-weigthed images (b) and hyperintense on T1-weighted images before contrast administration (c). It shows arterial phase hyperenhancement (arterial phase and subtraction sequences; d and e, respectively) but not washout nor pseudocapsule on delayed venous phase (f). A LI-RADS 3 category was assigned. Histological analysis after FNB showed that the nodule corresponded to a well-differentiated HCC

Other MRI findings, except more frequent hyperintense signal in T1-weighted imaging in HCC nodules, were not significantly different between HCC and non-HCC lesions (Table 5).

Discussion

The main aim of the current study was to assess the diagnostic accuracy of each LI-RADS category by MRI according to LI-RADS v2018 and to evaluate the findings associated with HCC diagnosis in US-detected nodules ≤ 20 mm categorized as LR-3. In our cohort, 71.6% of LR-3 nodules corresponded to malignant lesions (51 HCC, 1 ICC, and 1 CRC metastasis). In addition, the median time from MRI to final HCC diagnosis in our series was 1.4 months (IQR, 0.6–6.2 months), which suggests that most of the LR-3 nodules were already HCC and not regenerative/dysplastic nodules that have evolved to HCC during the follow-up. This high proportion of malignancy in LR-3 category in US-detected nodules has been also reported by other authors [16, 17]. However, the high proportion of HCC in LR-3 category diverges from retrospective studies exploring the outcome of LR-3 observations by CT/MRI [6,7,8,9] and from a recent systematic review and meta-analysis which reported a 38% (95% CI, 31–45%) of HCC risk in LR-3 category [18]. However, to properly interpret the results of studies about LI-RADS categorization, it is a key to evaluate if the cohort of patients has been collected prospectively, if there was prior US detection within a surveillance program of the population at risk of developing HCC, and if the final diagnosis has been set by pathology as an optimal gold standard. For instance, in a study of Tanabe et al [7] that suggests that LR-3 observations should be seen as a very low likelihood of HCC development, it is reported that 151 out of 166 LR-3 observations remained stable or decreased in category after a median imaging follow-up of 538 days. However, none of these LR-3 observations was assessed histologically, and the LI-RADS category assignment was based on the retrospective evaluation of the radiological reports.

When we compared the main radiological findings of LR-3 observations between HCC and non-HCC lesions, ancillary features of malignancy as hyperintensity on T2-weighted imaging or presence of intralesional fat [19,20,21] were not more frequent in LR-3 HCC lesions. Nodule size at MRI became the most relevant parameter associated with HCC diagnosis. In our series, 20 out of 21 LR-3 nodules > 15 mm (95.2%) were finally diagnosed as HCC. Contrarily, none of the LR-3 nodules < 10 mm was finally confirmed as HCC. Based on our results, in our opinion, the cutoff size to stratify LI-RADS categories could be set at 15 mm. This suggestion should be explored by other groups and, if confirmed, will change the current proposal. Keeping the 20 mm cutoff may be appealing because of the policy for liver transplantation indication, but it is worth to recall that patients with very early HCC may benefit from ablation or resection, while never being considered for transplant because of comorbidities or lack of such option in their country.

The high probability that a new nodule detected by US in cirrhotic patients categorized as LR-3 corresponds to an HCC justifies triggering an active diagnostic work-up including biopsy and/or close imaging follow-up if we are intended to diagnose and treat the HCC at a very early stage, when the probability of dissemination is very low [22] and the outcome after resection or ablation is excellent [2, 3].

Furthermore, this study confirms our previous observation that near one-quarter of nodules categorized as LR-2 are finally diagnosed as HCC (6 out of 26, 23.1%). More interestingly, the median time for HCC diagnosis was 4.5 months, and thus the recommendation of not engaging any additional studies except continuing the routine surveillance every 6 months [10] may not be supported by our results. In our opinion, a 23% risk of HCC may justify an active diagnostic work-up [23], but additional studies including more LR-2 observations in US-detected nodules are needed to support our suggestion.

The major modification implemented in the LI-RADS v2018 was the change of categorization in observations 10–19 mm with arterial phase hyperenhancement and non-peripheral washout from LR-4 to LR-5 regardless of appearance on antecedent surveillance US images [11, 24]. In our study, observations categorized as LR-5 as defined in LI-RADS v2018 displayed a sensitivity of 64% (95% CI, 56.8–70.7), figures very similar when EASL criteria [2] were applied in our cohort, and in line with the reported by other authors when these criteria were evaluated [12,13,14, 25,26,27]. Very interestingly, with the new update of v2018, there is a significant drop of observations categorized as LR-4 compared with previous studies [5, 18]. Furthermore, 11 out of 12 observations categorized as LR-4 were finally diagnosed as HCC, and when combined all together LR-4 and LR-5 observations, the diagnostic accuracy did not differ significantly and thus questioning the need of distinguishing between both categories in newly detected nodules by screening US.

Our study has limitations: LI-RADS category assignment was done after consensus interpretation by two radiologists, and we have not evaluated the inter-observer agreement. This is relevant since, in real life, LI-RADS categories are assigned by one radiologist, and there is a substantial variation in liver observation reported by both experts and novices when a standardized reporting schema is used [28]. Furthermore, we have not considered ancillary findings in our assessment of LR-3, LR-4, and LR-5 observations. However, as clearly stated in the LI-RADS v2018 proposal, ancillary features are optional and may be used at radiologist discretion, and their use does not allow to upgrade the LI-RADS category to LR-5 [29]. The application of these ancillary findings has been shown useful for increasing the sensitivity but in detriment of worse specificity [30]. In addition, avoiding the use of ancillary findings, we reduce the radiologist-dependent subjectivity and allow translating our results to CT since most of those ancillary findings are only available when using MR, and some of them only when using hepatobiliary agents.

In conclusion, 69% of our newly detected nodules by screening US categorized as LR-3 are HCC, being particularly true in nodules larger than 15 mm. Our data suggest that if LI-RADS is applied, an active diagnostic work-up including biopsy is justified in US-detected LR-3 observations if aimed to treat HCC at a very early stage.