Introduction

Hepatocellular carcinoma (HCC) is the most common primary liver malignancy and third leading cause of cancer-related death worldwide, with a 5-year survival rate of 18% [1]. As the incidence of hepatic cirrhosis and chronic liver disease continues to rise in the USA, there has been a concomitant increase in the frequency of HCC [2, 3, 4•, 5••]. Early detection of HCC is critical for improved overall survival; however, greater than 70% of patients with HCC have advanced disease at diagnosis [2]. Per American Association for the Study of Liver Disease (AASLD) guidelines, the diagnosis of HCC is frequently made based on cross-sectional contrast-enhanced imaging alone, without the need for confirmatory percutaneous biopsy in at-risk populations [2]. Therefore, it is imperative for a radiologist to provide an accurate interpretation of liver imaging in this population, in order to provide appropriate and timely diagnosis that allows for early treatment, as studies show that early detection and improved survival are strongly linked [2, 3].

Initially released in 2011, and with several subsequent revisions, the Liver Imaging Reporting and Data System (LI-RADS) was created to provide standardization of liver imaging, strict diagnostic criteria, a diagnostic algorithm and reporting guidelines for interpretation of observations in at-risk patients, in order to improve radiologist interpretation, reporting [4•] and communication with clinicians. Based on the imaging features, each liver observation is assigned a LI-RADS category which reflects its likelihood of being benign, malignant, or HCC [5••]. Standardization improves communication between radiologists and other clinicians, improves consistency in interpretation and management recommendations [6, 7], and potentially facilitates data collection for research and quality improvement with the long-term goal of positively impacting clinical care, education, and research [8] .

In 2018, LI-RADS was endorsed by and integrated into the AASLD practice guidance for HCC [1]. LI-RADS offers four individual algorithms designed for different clinical contexts: ultrasound (US) LI-RADS for screening and surveillance, CT/MRI LI-RADS algorithm for diagnosis, contrast-enhanced US (CEUS) LI-RADS for diagnosis, and LI-RADS treatment response algorithm (LR-TRA) for response assessment following locoregional therapies (LRT) and resection [4•, 9].

In this article, we review the emerging evidence supporting use of the LI-RADS v2018 CT/MRI diagnostic algorithm. We will also briefly discuss the role of abbreviated MRI for screening at-risk patients. Finally, we will provide a review of emerging evidence supporting the LR-TRA.

Current LI-RADS Imaging Criteria for HCC Diagnosis

The LI-RADS CT/MRI diagnostic algorithm can only be applied to a specific target population in which the pretest probability of HCC is sufficiently high to attain high specificity (> 95%) of LR-5 (definite HCC) criteria [10]. Each liver observation is categorized from LR-1 (definitely benign) to LR-5, with the categories from 1 to 5 reflecting increasing likelihood of HCC [11••]. LR-1 and LR-2 observations are categorized as definitive or probably benign, respectively [5••]. LR-3 to LR-5 categories correspond to intermediate probability of malignancy, probable HCC, and definite HCC, respectively. The LR-5 category is intended to be 100% specific for HCC, which would preclude the need for percutaneous biopsy for definitive diagnosis [12]. In addition to LR-1 through LR-5, an LR-M (probably or definitely malignant, not HCC specific) category can be assigned to observations that are probably or definitely malignant, but the imaging features are not specific for HCC [13]. While approximately 36% of LR-M observations are HCC, a majority of LR-M represent other malignancies, such as intrahepatic cholangiocarcinoma (iCCA), which may arise in the same at-risk population [14]. The final category, LR-TIV (definite tumor in vein) is used for malignancy that has definite macrovascular invasion on imaging [15]. The assignment of LR-3 to LR-5 categories is based on the combination of major imaging features (i.e., “nonrim arterial phase hyperenhancement [APHE], nonperipheral washout appearance [WO], enhancing capsule appearance, size, and threshold growth [16]) as well as ancillary imaging features, from which a category can be determined by using the CT/MRI diagnostic table [9].

Emerging Evidence on LI-RADS Diagnostic Validation

Since the diagnosis of HCC has predominately become an imaging diagnosis without tissue confirmation, it is critical that the LR-5 category retains a high specificity for HCC. A multitude of studies have investigated the diagnostic accuracy of LI-RADS, including radiology–pathology correlation and outcomes studies (Table 1) [17]. Recently, a large systemic review by van der Pol et al. evaluated the percentage of HCC and malignancy seen in each of the LR categories based on pathology, follow-up imaging analyses, and treatment response [11••]. The review was based on 17 retrospective studies that included 2760 patients, 3556 observations, of which 2482 were HCCs [11••]. Percentage of HCC and other malignancy were 94% and 97% for LR-5, 74% and 80% for LR-4, 38% and 40% for LR-3, 13% and 14% for LR-2, 79% and 92% for LR-TIV, and 36% and 93% for LR-M, respectively [11••]. There were no malignancies found in the LR-1 category, and there was a significant difference in the percentage of confirmed HCCs and overall malignancies (p < 0.00001) in the LR 2–5 categories [11••]. Based on this analysis, the risk of HCC and overall malignancy increases with increasing LR category [11••]. Of note, of 240 LR-M observations, 93% were found to be other malignancy, and 36% were found to be HCC on pathological correlation [11••]. Surprisingly, HCC was found in 14% of LR-2 [11••]. This number is likely elevated as a result of selection bias, however, since the majority of LR-2 observations do not require a biopsy in clinical practice and a subgroup of LR-2 observations necessitating biopsy likely harbor a higher proportion of malignancy than the entire population of LR-2 observations.

Table 1 Studies evaluating LI-RADS diagnostic performance, inter-reader reliability, and efficacy of abbreviated MRI

In 2017, LI-RADS introduced specific criteria for LR-M category, and LI-RADS v2018 updated LR-5 diagnostic criteria in order to align AASLD with LI-RADS [9]. There have been multiple recent studies evaluating the impact of these changes on diagnosis of HCC. For example, Ren et al. [18•] found that LI-RADS v2018 was superior to v2017, increasing the sensitivity of LR-5 for HCC to 81% vs. 71%, negative predictive value (NPV) 70% vs. 61%, and accuracy 84% vs. 78%, respectively, while maintaining the specificity (92% vs 90%, respectively). Similarly, Lee et al. [19] reported an improved sensitivity of v2018 LR-5 criteria compared to those of v2017 (75% vs 62%, respectively), at the expense of minimal decrease in specificity (91% vs 94%, respectively), A recently published study by Smereka et al. [20] compared the diagnostic efficacy of LI-RADS v2018 with v2017 in categorizing newly identified nodules with nonrim APHE in cirrhotic livers. In this study, LR-3 and LR-4 criteria based on v2018 were more accurate in predicting the rate of progression to HCC, as compared to v2017: the rate of progression of LR-3 observations was 38% based on v2017 and 44% based on v2018, and the rate of progression of LR-4 observations was 58% based on v2017 and 70% based on v2018. A study by Chernyak et al. [21] assessed the concordance in categorization and radiologic T-staging using LI-RADS v2017 and v2018 and Organ Procurement and Transplantation Network (OPTN) criteria. In this study, 73/182 (40%) of 641 total observations which were categorized as LR-4 by LI-RADS v2017 were upstaged to LR-5 according to v2018, while only 4/196 (2%) observations categorized as LR-5 were down-staged to LR-4 based on changes in threshold growth definition between the two versions. Furthermore, in 12% (49/398) of patients, T stage was higher using LI-RADS v2018 compared to v2017 [21]. In summary, a number of recent studies indicate that v2018 has increased sensitivity and improved accuracy of LR-5 category for the diagnosis of HCC.

Ko et al. [22] compared the diagnostic performance of LI-RADS v2018 with v2017 for HCC utilizing gadoxetic acid-enhanced MRI, in which 76% (64/84) of observations with threshold growth using v2017 were reclassified as subthreshold growth when using v2018, ultimately downstaging 14% (9/64) of the cases. They also found that v2018 was inferior to v2017 for observations measuring between 10 and 19 mm in size when considering only major features (accuracy 86% versus 80%, p < 0.013) but was comparable after incorporating the ancillary features (accuracy 87% versus 86%, p = 1). Thus, the two versions demonstrated similar diagnostic performance with added value of ancillary features in combination with major features in v2018 for increased detection of HCC.

A number of recent studies evaluating the utility of ancillary features suggest that their use in combination with major features increases sensitivity while preserving high specificity for HCC diagnosis. For example, Cerny et al. [23] compared the performance of major features alone and major and ancillary features in combination on MRI using LI-RADS v2014 criteria. In this study, the authors found that the combination of major and ancillary features resulted in recategorization of 15% of the lesions, with increased sensitivity for HCC diagnosis on a per-lesion level from 76 to 88%, while preserving specificity. In a study by Joo et al. [24], the frequency of recategorization with ancillary features on MRI was similar (18%); all changes were LR-3 upgraded to LR-4. Finally, a more recent study by Ren et al. [25] reported similar results, with 9% recategorization of observations on CT, when combining major and ancillary features as opposed to major features alone. Ren et al. [25] also reported increased specificity, positive predictive value, and accuracy when LR-5 was used a predictor for HCC when applying combination of major and ancillary features as opposed to major features on multiphasic CT (92% versus 90%, 93% versus 92%, and 69% versus 68%, respectively).

The LI-RADS diagnostic algorithm has a high diagnostic performance and also benefits from high inter-reader reliability [26]. Ludwig et al. [27] analyzed the diagnostic performance and inter-reader agreement of LI-RADS v2018 for distinguishing HCC from non-HCC primary hepatic malignancy in patients at risk for HCC and also assessed the impact of changes introduced in the revised 2018 version. They found a high specificity for distinguishing HCC from non-HCC malignancy (89% and 90% for two readers), with moderate inter-reader reliability (IRR) for overall LI-RADS category (k = 0.5) and substantial IRR for LR-5 versus LR-M and LR-TIV (k = 0.64) observations. Multiple additional publications have also reported excellent IRR of all LR categories on MRI when using LI-RADS v2018. For example, Razek et al. [28•] demonstrated an interobserver agreement of k = 0.89 for LR-4 observations and k = 0.84 for LR-5 observations (p = 0.001), with 91% agreement. Stocker et al. [29] compared qualitative and quantitative assessment of APHE and WO to assess diagnostic accuracy and inter-reader agreement of LI-RADS v2018, with results revealing similar performance of LI-RADS on quantitative and qualitative analysis (k = 0.38 and 0.4–0.47). However, there was improved agreement for quantitatively assessed APHE (κ = 0.65 vs. 0.81) when compared to WO (κ = 0.53 vs. 0.78).

Despite the successes of the LI-RADS diagnostic algorithm, the management of LR-3 observations remains an open question. LR-3 category suggests intermediate probability of malignancy, and studies show that 72–93% of LR 3 observations are benign on final pathology [30, 31]. Nevertheless, a non-negligible percentage of LR-3 observations are indeed HCC on pathology, and a substantial proportion of LR-3 observations progress to LR-5. Therefore, it is important to determine the clinical significance of these intermediate probability lesions and their potential to evolve into HCC. For example, a retrospective study utilizing hepatobiliary contrast agent to evaluate LR-3 observations over the period of 11.2 months, found that only 6% of LR-3 observations were upgraded to probable or definite HCC, while 80% remained unchanged and 14% decreased in size or disappeared on follow-up [32]. Shropshire et al. [33] showed that major and ancillary features were not significantly associated with a change in the LR-3 observation category, and instead, this category resulted in longer follow-up times (mean follow-up 20.3 ± 13.4 months, range 2.9–45.2 months), as 40% of lesions remained categorized as LR-3, 50% were down categorized to LR-2 or LR-1, and only 10% progressed to LR-4 or LR-5 on follow-up. Regardless of the low risk of malignancy, the LR-3 observations should be continued to follow. Future studies are needed to better risk-stratify LR-3 observations, to determine which require shorter interval follow-up.

The LR-M category was designed to include malignancies of non-hepatocellular origin in order to preserve the high specificity of LR-5 category for diagnosing HCC. Up to 28–36% of HCCs do not demonstrate “classic” imaging features and are thus characterized as LR-M observations [11••, 34]. An LR-M category is assigned when at least one targetoid LR-M feature (i.e., rim APHE, peripheral WO, delayed central enhancement, targetoid restricted diffusion, or targetoid appearance on transitional or hepatobiliary phase) is present [9]. LR-M category is also assigned to observations not meeting LR-5 or LR-TIV criteria [9] and with nontargetoid LR-M features (infiltrative appearance, severe ischemia/necrosis, markedly restricted diffusion). Kim et al. compared the diagnostic performance of LR-5 and LR-M in LI-RADS v2018 for distinguishing HCC from other malignancy (OM) [35•]. In this study, imaging findings were evaluated from 55 patients with OM (histologically proven cases of iCCA (n = 16), combined HCC and cholangiocarcinoma (cHCC-CCA) (n = 31), metastatic adenocarcinoma (n = 7) and angiosarcoma (n = 1)) and 165 histologically proven HCC [35•]. The LR-M category was found to have a sensitivity and specificity of 86% and 48% for the diagnosis of OM, respectively [35•]. On the other hand, 11% (6/55) of the patients with OM satisfied the LR-5 criteria [35•]. The LR-5 category showed a high specificity (89%) for the diagnosis of HCC and was significantly more accurate than LR-M (78% versus 58%, p < 0.01) [35•]. The low specificity of LR-M category for other malignancy was explained by high number of biopsy-proven HCC (86/165) exhibiting imaging features of LR-M in this retrospective study [35•].

Given that a third of LR-M lesions are ultimately HCC on pathology, Park et al. assessed different imaging features to identify if there is a way to differentiate HCC categorized as LR-M from non-HCC malignancies on gadoxetic acid enhancement MRI using LI-RADS v2018 [36]. The authors devised a classification tree analysis model that had a sensitivity of 64%, specificity of 90%, and accuracy of 81% for differentiating HCC categorized as LR-M from non-HCC malignancy [36]. Results of the study revealed a total of 36 (28%) HCC and 70 (78%) non-HCC malignancies categorized as LR-M, with features such as enhancing capsule (p = 0.0293), blood products in the mass (p = 0.0393), and nontargetoid restriction (p = 0.018) as independent predictors of HCC, compared to other malignancies. A recent study by Kim et al. in which 165 high-risk patients for HCC had pathology confirmed primary liver cancer showed a high rate of sensitivity of the LR-M category for diagnosing malignancy, with sensitivities of 88–93% [37].

Role of Abbreviated MR

While US is an accepted modality for HCC screening and surveillance, its performance can be limited in some patients, particularly in the setting of steatosis and severe cirrhosis [38]. In some patients, CT or MRI serve as both surveillance and diagnosis tools. Although MRI is reported to have the highest accuracy for HCC diagnosis of all available modalities [39, 40], its cost and length of exam may be prohibitive from widespread utilization. Consequently, there has been a recent surge in literature evaluating the use of an abbreviated MRI (aMRI) protocol for screening and surveillance of HCC (Table 1). In all published studies to date, a radiologist interprets both the full and abbreviated MRI to compare the sensitivity of each. A recent study by Khatri et al. retrospectively compared the sensitivity of a complete MRI to aMRI (coronal T2-weighted and axial dynamic post-contrast T1-weighted fat-suppressed sequences). In this study, in which 93 patients with 121 liver observations met inclusion criteria, they reported a sensitivity and specificity of aMRI at 93 and 89% compared to complete MRI at 94 and 88%, respectively, suggesting that an abbreviated protocol screening MRI is not only interchangeable, but equivalent to a complete diagnostic MRI in cirrhotic patients for HCC detection [41•]. In another study, cirrhotic patients who underwent gadoxetic acid-enhanced abbreviated MRI utilizing axial DWI with apparent diffusion coefficient (ADC) maps, axial T2-weighted single-shot fast spin echo, and axial T1-weighted three-dimensional gradient echo fat-suppressed hepatobiliary phase imaging revealed that the abbreviated MRI had a sensitivity of 92% and specificity of 91% for detection of HCC [42]. A third recent study with 188 patients investigated the role of abbreviated non-contrast MRI protocol, utilizing axial T2-weighted sequence, axial T1-weighted Dixon sequence (in-phase, opposed-phase, fat and water), DWI with ADC maps, as a screening alternative in patients with poor-quality screening ultrasounds. They too reported a high specificity (93%) and negative predictive value (97%) with a sensitivity of 85% when compared to that of ultrasound and gadoxetic acid contrast-enhanced MRI [38]. Thus, early data suggests that an abbreviated MRI protocol can be used for screening and surveillance of HCC.

Treatment of HCC

Approximately 80% of cirrhotic patients diagnosed with HCC are non-surgical candidates, resulting in increasing use of locoregional therapies (LRT). Over the past few years, the number of LRT options for patients with HCC have increased; these include thermal ablation (e.g., microwave, radiofrequency, cryoablation), percutaneous ethanol injection (PEI), transarterial chemoembolization (TACE), transarterial bland embolization (TAE), transarterial radioembolization (TARE), and stereotactic body radiotherapy (SBRT) [43, 44••, 45, 46]. LRT can be used alone, in combination with each other, or in combination with systemic therapy, and has been shown to improve disease-free and overall survival in non-surgical candidates [47,48,49]. In addition, LRT can help prolong time to progression, palliate symptoms, prevent lesions from progressing outside of Milan criteria in order to maintain liver transplant candidacy (bridge to transplant), and convert nontransplant candidates to transplant candidates based on Milan criteria (downstage to transplant) [50,51,52]. Treatment decisions are usually made by a multidisciplinary tumor board and depend on patient factors, including tumor location, size and multiplicity, disease stage, liver function, performance status, technical feasibility, and potential for future transplant candidacy [53, 54].

Following treatment with LRT, it is important to provide accurate treatment response assessment to help guide clinical management. The imaging appearance of HCC following LRT depends on type of LRT and is distinct for each treatment modality. In general, complete loss of APHE is suggestive of nonviable disease. However, post-treatment imaging evaluation is complicated after radiation-based therapies, in which there can be intra-tumoral APHE or nodular or geographic peri-tumoral enhancement that can persist for up to 1 year after therapy and often longer [55••, 56]. Thus, it is imperative to be aware of the different types of LRTs, as well as their expected post-treatment imaging appearance. HCC response assessment can be performed according to various response assessment guidelines, such as the European Association for the Study of Liver Disease (EASL) [57] and modified Response Evaluation Criteria in Solid Tumors (mRECIST) [58], which are patient-level response systems. Alternatively, LI-RADS TRA [45••] can be used to assess response on a per-lesion level.

LI-RADS Treatment Assessment of HCC

LI-RADS treatment response algorithm was released in 2017, to provide a step by step approach for assessment of lesions following LRT [5••, 59]. LI-RADS TRA, modeled after mRECIST, not only includes APHE as a criterion to detect viable tumor but also incorporates imaging findings of washout appearance or treatment-specific enhancement pattern [5••]. Another distinguishing characteristic of LI-RADS TRA is that it provides assessment at a lesion level, which is important for clinical management, as individual lesions may be treated by different LRT over time, with distinct responses.

The LI-RADS treatment response categories include LR-TR nonviable, LR-TR equivocal, LR-TR viable, and LR-TR nonevaluable. A unique aspect of LI-RADS TRA is the inclusion of the category of LR-TR equivocal, which allows reporting of lesions in which the imaging appearance does not meet criteria for viable or nonviable disease. Treated tumors are characterized as equivocal when enhancement is atypical for treatment-specific enhancement pattern and not meeting criteria for probably or definitely viable tumor. In this scenario, short-term follow-up imaging is recommended, as opposed to immediate retreatment [1, 5••, 12, 13]. The LR-TR equivocal category is particularly important for tumors treated with intra-arterial and radiation-based therapies which target not only the tumor itself but also the hepatic parenchyma adjacent to the treated tumor, resulting in perfusional alterations visible on post-treatment imaging. In such instances, use of the LR-TR equivocal category helps to prevent unnecessary retreatment. While there is a risk that viable tumor is left untreated, HCC is generally a slow-growing tumor with a doubling time of 86–117 days [60,61,62], thus a wait and watch approach with a 3-month interval imaging may be a safe approach to differentiate residual viable disease from benign parenchymal perfusional alterations.

Current Evidence Validating LI-RADS TRA

Since LR-TRA is relatively novel, only a few publications have assessed its performance thus far (Table 2). Chaudhry et al. [64] and Shropshire et al. [65] both reported that following TACE and TAE, 81–86% and 67–71% lesions categorized as LR-TR nonviable demonstrate 100% necrosis on pathology. This study supports the LR-TR nonviable category for prediction of treatment response.

Table 2 Performance of LI-RADS treatment response algorithm, inter-reader reliability, and challenges of the LR-TR equivocal category

Both Chaudhry et al. [64] and Shropshire et al. [65] have reported high sensitivity and specificity of the LR-TR viable category. Chaudry et al. [64] reported that 73% of treated lesions categorized as LR-TR viable, were viable on pathology while Shropshire et al. [65] reported 60–65% of LR-TR viable observations were similarly viable on pathology, defined as < 99% pathologic necrosis.

Interestingly, most LI-RADS TRA validation studies have shown high rates of viable disease in the LR-TR equivocal category. In these studies, 71% of tumors treated with TAE and 83% of tumors treated with thermal ablation, which were categorized as LR-TR equivocal, demonstrated viable disease at pathology [64, 65]. This high rate of pathologic viability is likely the result of microscopically viable tumor which cannot be seen on imaging, but can be detected pathologically. Despite these early reports, the LR-TR equivocal category may still serve a role, especially in patients treated with radioembolization or SBRT, to avoid early and repeat retreatment, particularly for those lesions which are truly nonviable.

Several studies have evaluated inter-reader agreement of LR-TRA response categorization. These show moderate inter-reader agreement in assigning LR-TR categories following ablation and nonradiation arterial based therapies for HCC. Cool et al. [66] and Chaudhary et al. [64] have shown an inter-reader agreement of 90% and 95% with kappa of 0.75 and 0.71, respectively, following thermal ablation. Seo et al. evaluated HCC tumors treated with TACE, which were imaged with either CT or MRI and reported kappa values of 0.69 and 0.56, respectively [56]. Shropshire et al. [65] reported kappas of 0.55, in which all tumors were treated with TAE and imaged with MRI. Lower inter-reader agreement in patients treated with intra-arterial therapy, as compared to thermal ablation, is likely related to the complex imaging features seen after transcatheter arterial based therapies, in which abnormal parenchymal enhancement may contribute to uncertainty in imaging findings early post-treatment. A recent study by Abdel Razak et al. [67] evaluated the reproducibility of LI-RADS TRA for HCC after thermal ablation and TACE. They found excellent inter-reader agreement of all LI-RADS TRA categories, 97% for overall treated nonviable HCC (k = 0.94), 98% agreement for LR-TR viable disease (k = 0.96) and 97% for LR-TR equivocal HCC (k = 0.7) [58].

Unlike the LI-RADS diagnostic algorithm, the LI-RADS TRA currently does not use ancillary criteria for imaging assessment. However, new studies evaluating the role of ancillary features suggest increasing sensitivities to detect viable tumor. For example, one recent study [68•] evaluated LR-TRA on MRI with or without the addition of ancillary features to the existing enhancement-based criteria. Using pathologic necrosis confirmed by liver explant or surgical resection as their reference standard in 138 patients with HCC, the authors found that the use of ancillary features increased the sensitivity to detect viable disease on MRI from 76 to 84%, without sacrificing specificity (80% vs 83%). These results are comparable to a recent study by Kim et al., who found an increase in sensitivity of detecting viability from 40–67 to 83–87%, when adding the ancillary features [69]. In both studies, the ancillary features which were used included hepatobiliary phase (HBP) hypointensity, restricted diffusion, and intermediate signal intensity on T2-weighted imaging (WI). In both studies, HBP hypointensity restricted diffusion and intermediate signal intensity on T2-WI were seen in nearly all cases where the readers upgraded an LR-TR equivocal category to LR-TR viable. All three ancillary features resulted in improved sensitivities, (81%, 79%, and 83%, respectively) for detection of viability when compared to MRI alone without ancillary features (76%). Conversely, HBP isointensity was treated as an ancillary feature favoring nonviable tumor and may prove useful when arterial phase hyperenhancement is seen at the margin of an otherwise necrotic treated HCC. In these instances, isointensity on HBP may help increase a radiologist’s confidence that the observed arterial phase hyperenhancement represents perfusional abnormality and not viable tumor.

Of note, both studies revealed a marked reduction in use of the LR-TR equivocal category when incorporating ancillary features into treatment response interpretation. In Park et al., strict adherence to the LR-TR algorithm resulted in the use of LR-TR Equivocal in 9 and 12 cases out of 138 on MRI, by two readers, but incorporation of ancillary features reduced LR-TR Equivocal categorization to 2 and 0 cases, respectively [68•]. In this study, all up-categorized cases had pathologic confirmation of viable tumor [68•]. Similarly, Kim et al. [69] showed that the use of ancillary features reduced the use of the LR-TR equivocal category, from 29 and 59 assignments out of 183 cases by two radiologists, to only 2 and 14 cases, respectively. Both studies suggest improved sensitivities for detection of viable HCC when using ancillary features by correctly changing a majority of LR-TR equivocal assignments to LR-TR viable. They have also reported an excellent inter-reader agreement for LR-TR categories when using ancillary features, with a weighted kappa statistic of 0.75, which is comparable to the use of the original LR-TR algorithm [69]. These early studies support incorporation of ancillary features into the LR-TRA, as it seems to improve response assessment accuracy without apparent downside.

While tumor response assessment after thermal ablation and nonradiation intra-arterial therapy seems relatively straight-forward, HCC treated with radiation-based therapies have a unique and complex post-treatment imaging appearance and thus categorization based on LI-RADS TRA is much more challenging. There is limited data on radiologic-pathologic correlation following radiation-based therapy using LI-RADS TRA. Mendiratta-Lala et al. [55••] evaluated 10 stereotactic body radiation (SBRT)-treated HCC, in which all tumors were necrotic on explant pathology at 12 months, out of which 40% exhibited persistent APHE and 90% exhibited WO on imaging prior to transplant. Although only a very small cohort, this study suggests that persistent arterial enhancement post-SBRT can be an expected post-treatment imaging finding. Further radiology–pathology studies in HCC treated with SBRT will be needed in order to validate the application of the LI-RADS TRA.

Likewise, there is limited radiology–pathology data for HCC treated with TARE. In a study evaluating 37 lesions post-TARE, Riaz et al. [70] compared the degree of tumor necrosis at pathologic diagnosis to the imaging appearance, albeit utilizing EASL tumor response categories, in which a complete response was seen in 32% of lesions on histopathology at a median time of 34 days. Another study by Vouche et al. [71] compared EASL, mRECIST, WHO, and RECIST tumor response categories by comparing imaging to explant pathology to determine degree of tumor necrosis. Sixteen HCC treated with TARE were evaluated and complete pathological necrosis could not be predicted by WHO, RECIST, EASL, or mRECIST. Furthermore, response to treatment was categorized as equivocal in all tumors when using EASL and mRECIST, as neither could reliably predict complete tumor necrosis. This may be explained by the persistent enhancement seen in TARE-treated HCC limiting the utility of enhancement-based response assessment guidelines (EASL, mRECIST). Further evaluation with outcome data such as time to progression, disease-free survival, and overall survival will be needed in conjunction with radiology and pathology data. To date, no studies have been performed evaluating the reproducibility of the LI-RADS TRA categories or their diagnostic performance compared to explant pathology for radiation-based therapies.

Limitations, Knowledge Gaps, and Controversies

Though there is early promising data on the sensitivity/specificity on LRT, further validation studies are needed given the increasing number of different LRTs, as well as in patients receiving combination LRT and systemic therapies. LI-RADS TRA after radiation-based therapies for HCC may also require modifications of the current algorithm. Furthermore, longitudinal follow-up studies are needed to determine the utility of the LR-TR equivocal category, particularly since many recent studies have suggested high frequency of viable tumor within this category after locoablative and nonradiation embolotherapies. Secondly, LI-RADS TRA is not yet applicable to patients on systemic and/or biologic treatments and further research comparing outcomes using the different treatment response guidelines such as RECIST, mRECIST, and LI-RADS TRA are needed. Lastly, post-treatment imaging follow-up recommendations and standardized reporting templates need to be developed.

Conclusion

Diagnostic LI-RADS CT/MR is a validated tool for diagnosis of HCC, improving characterization of lesions in at-risk patients and improving communication between radiologists and clinicians. Since its introduction in 2011, LI-RADS has clarified communications between radiology researchers with multiple studies leading to refinements in LI-RADS categories over time. LI-RADS TRA is a relatively new but promising approach for response assessment following LRT, which needs further validation and refinement to be applicable to radiation-based therapies.