Introduction

Hepatocellular carcinoma (HCC) is the sixth most common cancer in the world and the fourth leading cause of cancer-related death worldwide [1]. The early detection and management of small (< 20 mm) HCCs can prolong patient survival [2]. Unlike other malignancies that require histological diagnosis, HCC can be diagnosed noninvasively according to imaging-based criteria without confirmatory pathology [3,4,5].

The Liver Imaging Reporting and Data System (LI-RADS) is widely accepted as a good scheme for interpreting and reporting imaging features of liver observations in patients at high risk for HCC [6,7,8]. LI-RADS version 2018 (hereafter LI-RADS v2018) represents the fourth update of this HCC imaging system since its first release in 2011. In LI-RADS v2018, a 10–19 mm observation with nonrim arterial-phase hyperenhancement (APHE) plus nonperipheral “washout” or threshold growth is now categorized as LR-5. The updated LI-RADS v2018 have been integrated into the 2018 American Association for the Study of Liver Disease (AASLD) guidance and is expected to improve the diagnosis of HCC. However, few studies have reported the diagnostic performance of LI-RADS v2018 for 10–19 mm HCC lesions on gadoxetic acid-enhanced magnetic resonance imaging (MRI) [9].

In clinical practice, small hepatic nodules (< 20 mm) often pose a diagnostic challenge due to their atypical imaging feature on either contrast-enhanced computed tomography (CECT) or MRI. Prior studies involving small nodules (< 20 mm) have demonstrated higher sensitivity for small HCC lesions on gadoxetic acid-enhanced MRI than those observed on multiphasic CT or extracellular contrast-enhanced MRI [10,11,12] at the expense of reduced specificity. The hypointensity on TP is not a real washout when using Gd-EOB-DTPA, since it is a combination of a true vascular washout and an increased enhancement of the background liver [13,14,15,16]. Therefore, in the latest versions of AASLD guidance, the European Association for the Study of the Liver (EASL) and LI-RADS v2018, the nonperipheral washout appearance on gadoxetic acid-enhanced MRI can be depicted only in the portal venous phase (PVP). On the other hand, that results in reduced diagnostic sensitivity for HCC (66.3–77.9%) [15, 17, 18]. Furthermore, some ancillary features (AFs), such as hepatobiliary phase (HBP) hypointensity and restricted diffusion, are strong indicators of malignancy and play important roles in the diagnosis of HCC [19]. However, in the current LI-RADS version, the diagnosis of HCC is primarily based on major features (MFs), and the use of AFs is optional. In addition, using AFs, an observation can be upgraded by one category up to LR-4 but not to LR-5. Therefore, the weight of AFs in the diagnosis of HCC may be somehow overlooked.

Therefore, the purpose of this study was to evaluate the diagnostic performance of LI-RADS v2018 for 10–19 mm HCC and to explore whether modifications to LI-RADS using AFs could further improve sensitivity without reducing specificity for HCC on gadoxetic acid-enhanced MRI.

Materials and methods

Patients

This retrospective study was approved by our institutional review board, and the requirement for written informed consent was waived. From June 2015 to April 2017, a total of 830 patients who underwent gadoxetic acid-enhanced 1.5-T or 3.0-T MRI examination for the evaluation of suspicious nodules at our center were included. The inclusion criteria were as follows: (a) age ≥ 18 years old; (b) high risk for HCC according to LI-RADS v2018 (cirrhosis or chronic hepatitis B viral infection); (c) observation size of 10–19 mm and ≤ 4 nodules observed in one patient using contrast-enhanced MRI with gadoxetate disodium; and (d) no previous treatment for hepatic lesions. The exclusion criteria were as follows: (a) no observations other than hepatic cysts on MRI; (b) congestive hepatopathies, such as hereditary hemorrhagic telangiectasia, Budd–Chiari syndrome, chronic portal vein occlusion, and cardiac congestion; and (c) an insufficient final diagnosis (e.g., histopathologic diagnosis more than 2 months after MRI examination, inconclusive histopathologic diagnosis, immediate local treatment or systemic therapy without pathologic diagnosis, and insufficient follow-up to determine size stability) or unsatisfactory image quality for analysis. The process of patient selection for the study cohort is shown in Fig. 1.

Fig. 1
figure 1

Process of patient selection for the study cohort. HCC hepatocellular carcinoma, MRI magnetic resonance imaging, cHCC-CCA combined HCC-cholangiocarcinoma, ICCA intrahepatic cholangiocarcinoma. *Patients with an insufficient final diagnosis (n = 120) included patients with a histopathologic diagnosis more than 2 months after the MRI examination (n = 30), an inconclusive histopathologic diagnosis (n = 17), immediate local treatment or systemic therapy without a pathologic diagnosis (n = 50), and insufficient follow-up to determine size stability (n = 23)

MRI examination

All patients were scanned in a supine position on a 1.5-T (Optima MR360, GE Healthcare) or a 3.0-T whole-body MRI scanner (Discovery MR750, GE Healthcare) with an eight-channel phased-array torso coil centered over the abdomen. Routine MRI sequences included T2-weighted, diffusion-weighted, in-and out-of-phase, unenhanced T1-weighted, dynamic, and HBP sequences using gadoxetate disodium (Primovist; Bayer Healthcare). The contrast agent was automatically administered intravenously using a power injection device (Spectris Solaris EP; MEDRAD) at a rate of 2.0 ml/s for a total dose of 0.025 mmol/kg body weight, followed by a 20-ml saline flash. The dual arterial phase (AP) was initiated 15–30 s after the contrast agent arrived at the pulmonary artery using bolus triggering. The dual PVP and transitional phase (TP) were acquired at 50–80 s and 3 min, respectively. HBP images were obtained 20 min after contrast agent injection. Imaging examinations were in compliance with LI-RADS technical requirements. Scanning parameters are listed in Supplementary Table 1.

MRI analysis and LI-RADS category assignment

All MRI scans were retrieved from the picture archiving and communication system (PACS) and reviewed by two abdominal radiologists [with 14 years (Sidong Xie) and 9 years (Jingbiao Chen) of experience in liver MRI]. Blinded to the reference standard, they independently evaluated the following imaging features defined in LI-RADS v2018 for every observation found on each patient’s MRI examination [20]: MFs and AFs favoring malignancy, and AFs favoring benignity, targetoid features and tumor in vein (TIV). If an observation showed high signal intensity on T1-weighted imaging (T1WI) before the administration of gadoxetic acid, APHE was evaluated by subtracting arterial-phase images. Discrepancies were resolved by consensus. After scoring all the imaging features, the two radiologists assigned a LI-RADS category to each observation according to only the major imaging features of LI-RADS v2018. Subsequently, LI-RADS categorization were assigned using MFs and independent significant AFs in combination.

Reference standard

The final diagnoses of HCC and non-HCC malignancies were confirmed by histopathology (hepatectomy specimen or liver biopsy). An experienced histopathologist (Jianning Chen) blinded to all clinical data and MRI results reviewed the histological specimens and confirmed the histological diagnosis according to the World Health Organization classification system [21]. The benign lesions were confirmed by pathology or typical imaging features and stability on imaging for at least 2 years.

The presence or absence of liver cirrhosis was evaluated by the same pathologist according to the background liver adjacent to the tumor in patients who underwent surgical treatment. In patients who did not undergo surgery, the diagnosis of cirrhosis was determined by the two radiologists according to the morphological MRI features, including a nodular contour of the liver, segmental atrophy, and widened fissures or gallbladder fossa, regardless of varices, ascites, or splenomegaly [22].

Statistical analysis

Continuous data are summarized as the mean and standard deviation or median and range. Categorical data are summarized as counts and percentages. Per-observation estimates of the diagnostic performance for each MF and AF were calculated. Univariate and multivariate logistic regression analyses were performed to determine the independent predictors for HCC among the AFs according to LI-RADS v2018. Variables with a p value < 0.05 in the univariable analysis were entered into the multivariable analysis to identify independently significant AFs for HCC diagnosis. For the multivariable analysis, a stepwise backward elimination method was used. The sensitivity and specificity of different modified LI-RADS for HCC were compared with those of LI-RADS v2018 using McNemar’s test.

The kappa (κ) statistic was used to determine interobserver agreement for the LI-RADS features. The interpretation of κ values was as follows: poor, 0.00–0.20; fair, 0.21–0.40; moderate, 0.41–0.60; good, 0.61–0.80; and excellent, 0.81–1.00.

SPSS software version 22.0 (IBM) was used for all statistical analyses. p < 0.05 was considered statistically significant.

Results

Patients

A total of 133 patients (129 males and 4 females; median age 52.0 years) with 174 observations were finally included. The demographics and clinical and biological characteristics of the patients are summarized in Table 1. Of the 133 patients, 103 (77.4%) patients had a single observation, and 29 (21.8%) patients had more than one observation. Among the 133 patients, 128 (96.2%) patients had hepatitis B virus (HBV) infection, while 82 (61.7%) patients had liver cirrhosis. Regarding the liver function estimation, the majority of patients (85.0%) were Child–Pugh class A.

Table 1 Clinicopathologic characteristics of patients and lesions

Final diagnoses

Among the 174 observations, 104 (59.8%) and 70 (40.2%) were malignant and benign lesions, respectively. A total of 115 (66.1%) observations were pathologically proven, including all HCC (n = 98) and non-HCC (n = 6) malignancies [i.e., combined HCC-cholangiocarcinoma (cHCC-CCA, n = 2), intrahepatic cholangiocarcinoma (ICCA, n = 1), lymphoma (n = 1), and metastasis (n = 2)], and 11 benign lesions [i.e., pseudo-lymphoma (n = 2), dysplastic nodule (n = 6) and high-grade dysplastic nodule (n = 3)]. The types of procedures performed for histopathologic confirmation of hepatic observations included surgical resection (n = 48), ablation after US-guided needle biopsy (n = 64), and liver transplantation (n = 3). The other benign lesions (n = 59) were confirmed by a composite of clinical and MRI follow-up, including 24 (40.7%) hemangiomas and 4 (6.8%) perfusion alterations based on typical imaging features by the two radiologists in consensus. The other 31 (52.5%) lesions were interpreted by the two radiologists as benign hepatocellular nodules based on imaging features and longitudinal stability.

Interobserver agreement of the MRI features

The interobserver agreement of MFs and AFs favoring malignancy of LI-RADS v2018 is shown in Supplementary Table 2. The agreement between the two observers for LI-RADS features was moderate to excellent, with κ values ranging from 0.66 to 1.00.

Independently significant AFs for diagnosing HCC

The univariate analysis demonstrated that AFs favoring malignancy (restricted diffusion, mild–moderate T2 hyperintensity, TP hypointensity, HBP hypointensity, US visibility, fat in mass and mosaic architecture) had significant value for predicting HCC (p < 0.001, < 0.001, < 0.001, < 0.001, < 0.001, 0.001, and 0.049, respectively; Table 2). According to the multivariate analysis, mild–moderate T2 hyperintensity [odds ratio (OR), 5.1; 95% confidence interval (CI), 2.1–12.7; p < 0.001], TP hypointensity (OR, 3.6; 95% CI, 1.1–11.8; p = 0.032) and fat in mass (OR, 3.3; 95% CI, 1.0–10.2; p = 0.043) were independently significant AFs for identifying 10–19 mm HCC (Table 2).

Table 2 Logistic regression analyses of ancillary features favoring malignancy in LI-RADS v2018 for diagnosing 10–19 mm HCC

Based on the above results of the univariate and multivariate analyses, we developed a modified LI-RADS with the upgraded LR-5 criteria by combining LR-3/LR-4 v2018 classified by MFs alone, with these three independently significant AFs identified from the multivariable analysis. In different modified LI‑RADS versions, except for the LR-5 observations as defined by LI-RADS v2018, observations that meet the following criteria were also categorized as LR-5 in the mLI-RADS versions. The modified LI-RADS was as follows:

mLI-RADS I: LR-4 + mild–moderate T2 hyperintensity;

mLI-RADS II: LR-4 + TP hypointensity;

mLI-RADS III: LR-4 + fat in mass;

mLI-RADS IV: LR-3 + mild–moderate T2 hyperintensity;

mLI-RADS V: LR-3 + TP hypointensity;

mLI-RADS VI: LR-3 + fat in mass; and

mLI-RADS VII: LR-4 + mild–moderate T2 hyperintensity/TP hypointensity or LR-3 + fat in mass (Figs. 2, 3).

Fig. 2
figure 2

Axial gadoxetic acid-enhanced MRI images from a 54-year-old man with chronic hepatitis B and surgically confirmed HCC. a Precontrast, b arterial-phase, c portal venous-phase, d T2WI FS, e transitional phase (TP), f hepatobiliary phase (HBP), g T1WI dual echo in-phase (IP) and h T1WI dual echo out-of-phase (OP) images showing an 18-mm observation in segment VI of the liver. The observation shows a precontrast isohypointensity (arrow), b nonrim arterial-phase hyperenhancement (arrow), c no nonperipheral washout and capsular enhancement (arrow) in the portal venous phase, d mild–moderate T2WI hyperintensity (arrow), e TP hypointensity, f HBP hypointensity, and slight signal loss (arrow) of OP imaging f compared to that of IP imaging (e). This hepatic observation was classified as LR-4 and could not be classified as LR-5 using Liver Imaging Reporting and Data System (LI-RADS) version 2018. In contrast, our upgraded LR-5 criteria of mLI-RADS I–III or mLI-RADS VII, utilizing independently significant ancillary features, helped achieve the correct diagnosis of HCC

Fig. 3
figure 3

Axial gadoxetic acid-enhanced MRI images from a 52-year-old man with chronic hepatitis B and surgically confirmed HCC. a Precontrast, b arterial-phase, c portal venous-phase, d T2WI FS, e transitional phase (TP), f hepatobiliary phase (HBP), g T1WI dual echo in-phase (IP), and h T1WI dual echo out-of-phase (OP) images showing a 16-mm observation in segment IV of the liver. The observation shows a precontrast hypointensity (arrow), b arterial-phase hyperenhancement (arrow), c no washout (arrow) in the portal venous phase, d mild–moderate T2WI hyperintensity (arrow), e TP hypointensity (arrow), f HBP hypointensity (arrow), and signal loss (arrow) on OP imaging (h) compared to that of IP imaging (g). This hepatic observation was classified as LR-3 and could not be classified as LR-5 using Liver Imaging Reporting and Data System (LI-RADS) version 2018. In contrast, our upgraded LR-5 criteria of mLI-RADS IV–VII, utilizing independently significant ancillary features, helped achieve the correct diagnosis of HCC

Diagnostic performance of LI-RADS v2018 and different LI‑RADS versions

Distribution and interobserver agreement for conventional LI-RADS v2018 categories are shown in Supplementary Table 3. Using LI-RADS v2018, 18 (10.3%), 26 (14.9%), 58 (33.3%), 8 (4.6%), 55 (31.6%), and 9 (5.2%) observations were categorized as LR-1, LR-2, LR-3, LR-4, LR-5, and LR-M, respectively. No observations were categorized as LR-TIV. Among 174 observations, 0% (0/18) of LR-1, 0% (0/26) of LR-2, 56.9% (33/58) of LR-3, 87.5% (7/8) of LR-4, 98.2% (54/55) of LR-5, and 44.4% (4/9) of LR-M were HCCs in LI-RADS v2018. The number of HCC identified as LR-5 in LI-RADS v2018, mLI-RADS I, mLI-RADS II, mLI-RADS III, mLI-RADS IV, mLI-RADS V, mLI-RADS VI and mLI-RADS VII were 54/174, 61/174, 60/174, 55/174, 81/174, 80/174, 62/174, and 69/174, respectively (Table 3). The numbers of LR-4 in LI-RADS v2018 upgrading to LR-5 in mLI-RADS I, mLI-RADS II, and mLI-RADS III were 8/8, 7/8, and 1/8, respectively. The numbers of LR-3 in LI-RADS v2018 upgrading to LR-5 in mLI-RADS IV, mLI-RADS V, and mLI-RADS VI were 40/58, 47/58, and 10/58, respectively. The numbers of LR-4 and LR-3 in LI-RADS v2018 upgrading to LR-5 in mLI-RADS VII were 8/8 and 10/58, respectively.

Table 3 Diagnostic performance of various versions for the diagnosis of 10–19 mm HCC lesions

The diagnostic performance of various versions for the diagnosis of 10–19 mm HCC lesions is shown in Table 3. LR-5 v2018 showed a sensitivity of 55.1% and a specificity of 98.7%. mLI-RADS I and mLI-RADS II showed higher sensitivity (62.2% vs. 55.1%, p = 0.016; 61.2% vs. 55.1%, p = 0.031, respectively) than LR-5 v2018 without compromising specificity (both 97.4% vs. 98.7%, p > 0.999). mLI-RADS III showed almost the same sensitivity and specificity as LR-5 v2018. mLI-RADS IV and mLI-RADS V showed higher sensitivity (82.7% vs. 55.1%, p < 0.001, and 81.6% vs. 55.1%, p < 0.001, respectively) than LR-5 v2018 but low specificity (81.6% vs. 98.7%, p < 0.001, and 71.1% vs. 98.7%, p < 0.001, respectively). mLI-RADS VI and mLI-RADS VII showed higher sensitivity (63.3% vs. 55.1%, p = 0.008, and 70.4% vs. 55.1%, p < 0.001, respectively) than LR-5 v2018 but similarly high specificity (96.1% vs. 98.7%, p = 0.500, and 94.7% vs. 98.7%, p = 0.250, respectively).

Discussion

Our study demonstrates that three of the AFs favoring malignancy (mild–moderate T2 hyperintensity, TP hypointensity and fat in mass) were independently significant features for the noninvasive diagnosis of 10–19 mm HCC observed on gadoxetic acid-enhanced MRI. When using mild–moderate T2 hyperintensity/TP hypointensity to upgrade an observation from LR-4 to LR-5 or using fat in mass to upgrade an observation from LR-3 to LR-5, the sensitivity was significantly improved compared with that of LI-RADS v2018, while the specificity was not significantly reduced for diagnosing 10–19 mm HCC observed on gadoxetic acid-enhanced MRI.

Recent studies have shown that modified MFs, such as APHE evaluated using an arterial subtraction sequence [23, 24] or extended washout or capsule criteria on gadoxetic acid-enhanced MRI [17, 25, 26], could improve the sensitivity of LR-5 for diagnosing HCC. Previous studies have revealed that some AFs, such as hypointensity in the transitional or HBP, nonenhancing “capsule” and restricted diffusion, could improve sensitivity for the diagnosis of HCC [11, 17, 26,27,28,29,30,31]. However, few studies have assessed the role of AFs in LI-RADS using gadoxetic acid-enhanced MRI for the diagnosis of 10–19 mm HCC. Our study included all AFs of LI-RADS v2018 to determine independent features for predicting 10–19 mm HCC lesions on gadoxetic acid-enhanced MRI, and our results demonstrate that only mild–moderate T2 hyperintensity, TP hypointensity and ‘fat in mass’ were significant and independent features for the diagnosis of 10–19 mm HCC. This implies that mild–moderate T2 hyperintensity, TP hypointensity and ‘fat in mass’ may be more important than other AFs, and therefore, we try to explore the role of the independent significant AFs to upgrading LR-3/4 to LR-5.

In our study, mild–moderate T2 hyperintensity was a significant independent feature for the diagnosis of 10–19 mm HCC on gadoxetic acid-enhanced MRI. Moreover, when using mild–moderate T2 hyperintensity to upgrade an observation from LR-4 to LR-5, the sensitivity was significantly improved (from 55.1 to 62.2%) without impairing specificity (from 98.7% to 97.4%) for diagnosing 10–19 mm HCC on gadoxetic acid-enhanced MRI. Previous studies have reported that mild–moderate T2 hyperintensity is an important feature for the early detection of HCC or predicting the malignant transformation of hypovascular nodules [31,32,33,34]. Our study is consistent with these studies. In terms of pathological changes, T2 hyperintensity may reflect intratumoral sinusoid dilation and edema. T2WI signals correlate with intranodular arterial flow and inversely with intranodular portal venous flow, pathophysiological alterations associated with hepatocarcinogenesis [32]. In addition, 83–86% of all HCCs have T2 hyperintensity [35]. Additionally, using mild–moderate T2 hyperintensity to upgrade an observation from LR-3 to LR-5 provided the highest sensitivity (82.7%) for the diagnosis of 10–19 mm HCC, but with reduced specificity (81.6% vs. 98.7%, p < 0.001). The reason for this result may be the higher proportion of non-HCC LR-3 lesions with mild–moderate T2 hyperintensity in our study (13/25, 52.0%). In our study, while mild–moderate T2 hyperintensity was highly suggestive of malignancy, it was also seen in non-HCC malignancies [35]. Thus, the presence of mild–moderate T2 hyperintensity alone may not be used to establish a diagnosis of HCC in the absence of MFs.

TP hypointensity was also a significant independent feature for the diagnosis of 10–19 mm HCC lesions on gadoxetic acid-enhanced MRI. Our study results are in good agreement with the results of a previous study by Choi et al. [30], which demonstrated that TP hypointensity is an independent predictor of HCC in lesions ≤ 3 cm with gadoxetic acid-enhanced dynamic MRI. Furthermore, our results showed that when using TP hypointensity to upgrade an observation from LR-4 to LR-5, the sensitivity was higher (55.1% vs. 61.2%, p = 0.031) while maintaining sufficiently specificity (98.7% vs. 97.4%, p < 0.05) than compared to that of LI-RADS v2018 for diagnosing 10–19 mm HCC on gadoxetic acid-enhanced MRI. This may be due to the portal supply and organic anion transporter (OATP) expression usually diminishing during hepatocarcinogenesis [36], resulting in TP hypointensity in gadoxetic acid-enhanced MRI. However, when using TP hypointensity to upgrade an observation from LR-3 to LR-5, the sensitivity was improved, and the specificity was significantly impaired. The reason for this result may be the higher proportion of non-HCC LR-3 lesions with TP hypointensity (21/47, 44.6%) and all of these lesions were benign lesions in our study. Further study with a larger sample size is warranted to validate our results.

Fat in mass is another significant independent feature for the diagnosis of small HCC with gadoxetic acid-enhanced MRI. Recent studies have revealed that the MRI of detection of intralesional fat in patients at high risk for HCC on MRI favors the diagnosis of HCC [37,38,39]. Previous research has shown that up to 40% of early HCCs exhibit fat in histology [40]. Furthermore, Kutami et al. found that fatty changes in small HCC lesions are closely related to tumor size [40]. Intralesional fat is most frequent in small HCCs, and the frequency decreases with increasing size. Importantly, fat in mass is extremely rare in non-HCC hepatic malignancies; thus, the detection of fat may help to exclude ICC in problematic cases [39]. The presence of intralesional fat has been considered a marker of carcinogenesis from dysplasia to HCC in CT or MRI [41, 42]. This feature is attributable to a decrease in intratumoral arteries without alterations of intratumoural portal tracts [40]. This may explain why fat in mass could upgrade an observation from LR-3 to LR-5 with higher sensitivity without significantly reducing the specificity compared with that of LI-RADS v2018 for diagnosing 10–19 mm HCC with gadoxetic acid-enhanced MRI. However, when using fat in mass to upgrade an observation from LR-4 to LR-5, the sensitivity and specificity remained nearly the same as those of the original LI-RADS v2018. This may be because that only one LR-4 lesion in this study showed ‘fat in mass’. This may explain why ‘fat in mass’ did not contribute to the upgrading of lesions from LR-4 to LR-5. Further research with a larger number of LR-4 lesions is warranted to validate our findings.

HBP hypointensity has previously been reported as an independently significant feature for the diagnosis of HCC [11, 43, 44]. In particular, in a recent similar study performed by Lee et al. [31], four AFs (mild–moderate T2 hyperintensity, HBP hypointensity, nonenhancing “capsule” and mosaic architecture) were independently significant features, along with all the MFs. The results of these studies are different from those of our study. This may be because the number of cases, etiology and observation size (10–19 mm) in our study were different from those in the other studies. Another reason may be that HBP hypointensity showed relatively low specificity (25%) in our study.

There are several limitations to our study. First, this was a single-center retrospective study, which may have selection bias. Multicentric prospective studies are warranted to validate our results. Second, most of the patients included in this study had compensated liver function, and most patients had hepatitis B infection. The results of this study might not be directly adapted to, for example, Western populations with a higher expected proportion of patients with decompensated liver function and different underlying chronic liver diseases (hepatitis C, alcoholic steatohepatitis, nonalcoholic steatohepatitis, etc.). Third, the proportion of males was higher than that of females in our study. That may be due to HBV infection is mostly in males in China and our study was based on 10–19 mm observations, which may have selection bias. Fourth, the number of LR-4 lesions included in the study was relatively small. Further work using a population with larger LR-4 lesions is needed to confirm the value of AFs for diagnosing 10–19 mm HCC lesions on gadoxetic acid-enhanced MRI. Finally, we used a composite reference standard for the determination of benignity. However, biopsy is not always performed in the clinic for highly suspected benign lesions, and the application of a strict reference (e.g., pathology) for benign observations may lead to additional selection bias. Patients with observations without pathological results were followed up for at least 2 years to make a final diagnosis, which better represents clinical practice.

In conclusion, our results show that upgrading an observation from LR-3/LR-4 to LR-5 using independently significant AFs (mild–moderate T2 hyperintensity, TP hypointensity or fat in mass) can improve sensitivity without impairing specificity in the diagnosis of 10–19 mm HCC lesions on gadoxetic acid-enhanced MRI.