Introduction

Hepatocellular carcinoma is the sixth most commonly diagnosed cancer and the third leading cause of cancer-related deaths in the world, accounting for 75–85% of primary liver cancer, and its incidence is still expected to increase in the future [1].

Patients with HCC often undergo locoregional therapy (LRT) to downstage tumor as a bridge to liver transplant, resection or treatment for advanced disease. LRTs include ablation therapies (radiofrequency ablation, microwave ablation, percutaneous ethanol injection, and cryoablation), transcatheter therapies (transarterial embolization or transarterial chemoembolization, TACE) and radiation therapy (stereotactic body radiotherapy, SBRT) [2]. Of the LRTs, TACE is the most widely used as the first-line treatment for intermediate stage HCC according to the Barcelona Clinic Liver Cancer (BCLC) staging system [3]. After LRT, accurate assessment of viable tumors is crucial to guide further management. The concept of “LR-Treated” was first introduced in 2014 version of LI-RADS, in 2017, a dedicated treatment response algorithm was developed to provide an image-based assessment of HCC after locoregional therapy, and there was no modification of this algorithm in 2018 version of LI-RADS. The treated lesions are categorized to LR-TR viable, LR-TR equivocal and LR-TR nonviable according to contrast-enhanced CT/MRI features, including arterial phase hyperenhancement (APHE), washout, and enhancement similar to pretreatment enhancement [4]. Some recent studies have assessed the diagnostic performance of LR-TR v2018 after different LRTs [5,6,7], but most of them are based on MRI with hepatobiliary agent, HBA [8,9,10], the evidence of added value of ancillary features to LR-TR for TACE-treated HCC is insufficient, especially on ECA-MRI.

The purpose of this study was to assess the diagnostic performance of LR-TR v2018 algorithm and the value of ancillary features for HCC treated with TACE.

Methods

Institutional Review Board in our hospital approved this retrospective study. The requirement for written informed consent was waived.

Study population

We retrospectively reviewed electronic pathology records in our hospital (a tertiary care center) to search consecutive HCC patients (age ≥ 18 years) who were first treated by TACE and subsequently underwent hepatic surgery (local hepatic surgery or liver transplantation) between January 2019 and June 2023 (n = 176). The exclusion criteria were as follows: (a) underwent surgery or LRTs before baseline MRI (n = 9), or combined treatment with TACE including ablation (n = 2) and systemic treatment (n = 37), (b) HCC was treated at an outside hospital with no pretreatment MRI available (n = 61) or no posttreatment MRI examination prior to surgery (n = 4), (c) > 30 days between post-TACE MRI examination and surgery (n = 2).

Clinical and laboratory data from electronic medical record were collected for each patient including age at time of TACE, sex, etiology of chronic liver disease, type of surgery, alpha-fetoprotein, and protein induced by vitamin K absence or antagonist-II level before TACE, type of TACE, interval days between post-treatment MRI and surgery.

Reference standard

All lesions were confirmed by surgical histopathology. Reference standard for viable and nonviable tumors was based on histopathological results from pathologic reports. Tumor necrosis was categorized as complete pathologic necrosis (100%, CPN) and non-complete pathologic necrosis (< 100%, non-CPN). For complete necrotic tumors, the pathologist could see the outlines of dead HCC cells which can confirm that they were treated-HCCs.

A radiologist with 10 years abdominal MRI experiences annotated each target lesion matched with pathologic results on pre- and post-treatment MRI examinations. For patients who had multiple sets of MRI before TACE, chosen the examination closest to TACE. For patients undergone multiple TACE treatments, chosen the examination after last TACE and closest to surgery.

Imaging protocol on ECA-MRI

All patients underwent MRI examinations with 3.0-T scanners (Siemens Healthcare, Prisma and uMR770, United Imaging Healthcare, Shanghai, China). The protocol included: spoiled gradient-echo T1-weighted in/opposed phase imaging, turbo spin–echo T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI, b values = 0, 50, and 500 s/mm2) and apparent diffusion coefficient (ADC) map, dynamic contrast-enhanced imaging using a T1-weighted, fat-suppressed sequence. The arterial, portal venous, and delayed phase images were obtained respectively at 20–30 s, 70–90 s, and 180–300 s. The gadopentetate dimeglumine (Gd-DTPA) was administered intravenously at a rate of 2 mL/s and a dose of 0.1 mmol/kg, followed by a 20 mL saline flush. Scanning sequence parameters are summarized in Table E1.

TACE procedure

Both cTACE and DEB-TACE were performed by selectively introducing a catheter into vessels feeding the tumor through a right femoral artery access. cTACE was performed by injecting a mixture of an iodized oil (4–20 mL, Lipiodol) and antineoplastic drugs followed by gelatin sponge particles. DEB-TACE was performed by using drug-eluting beads with diameters ranging from 100 to 300 µm loaded with antineoplastic drugs.

Imaging analysis

Two radiologists with 10 and 14 years of abdominal MRI experience (reader 1 Di Wang and reader 2 Yang Zhang) who knew the target observations but blinded to the pathologic results independently assessed the pre- and post-treatment images and categorized all lesions by using the LI-RADS v2018 (pre-TACE), LR-TR and mLR-TR algorithm (post-TACE). On pre-TACE images, readers assessed tumor location and size, then gave the category of LI-RADS based on major features only, including nonrim arterial phase hyperenhancement (APHE), non-peripheral washout and enhancing capsule appearance. On post-TACE images, readers assessed the whole tumor size and viable/equivocal tumor size, then assigned a category by using LR-TR and modified LR-TR algorithm, respectively. The lesion was categorized as LR-TR viable, LR-TR nonviable and LR-TR equivocal on basis of the LR-TR algorithm, LR-TR viable had nodular, masslike, or irregular thick tissue in or along the treated lesion (NMLIT) with any of the following: (a) arterial phase hyperenhancement or (b) washout appearance or (c) enhancement similar to pretreatment. Washout was defined as reduction in enhancement relative to liver tissue in portal venous phase or delayed phase. Enhancement similar to pretreatment was defined as the observation had the same enhancing pattern with baseline images in all contrast enhancement phases. LR-TR nonviable showed no lesional enhancement or treatment-specific expected enhancement pattern (i.e., a thin smooth rim of enhancement surrounding a non-enhancing lesion or typical treatment-related changes in parenchymal perfusion); LR-TR equivocal had atypical for treatment-specific expected enhancement pattern and not meeting criteria for probably viable or definitely viable. The mLR-TR criteria for viable category were as follows: (a) meet the LR-TR viable criteria; (b) NMLIT with indeterminate enhancement or no enhancement plus two ancillary features (restricted diffusion and intermediate T2-weighted hyperintensity), restricted diffusion defined as DWI hyperintensity at high b value or low ADC. Otherwise, the lesion was categorized as mLR-TR nonviable. The consensus reading was provided by a third radiologist (reader 3 with 28 years of abdominal MRI experience Peng-ju Xu).

Statistical analysis

Quantitative variables (age) consistent with normal distribution were expressed as mean ± standard deviation, variables (tumor size, laboratory data, interval days) not consistent with normal distribution were expressed as median and range (minimum and maximum). Qualitative variables (sex, etiology, LI-RADS and LR-TR categories, type of surgery, type of TACE) were expressed as frequencies and percentages.

The diagnostic performance including sensitivity, specificity, accuracy of LR-TR and mLR-TR for predicting viable and nonviable tumors was compared in consensus by using the McNemar test with pathological results as a reference standard. For each post-TACE MRI feature, the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy for the diagnosis of viable tumors were calculated. Interreader agreement for post-TACE assessment categories and MRI features was computed using Cohen's weighted κ [viable (score 1) vs. equivocal (score 0.5) vs. nonviable (score 0)] and unweighted κ with 95% confidence intervals, the κ value was interpreted as slight (0.01–0.20), fair (0.21–0.40), moderate (0.41–0.60), substantial (0.61–0.80), or almost perfect (0.81–1.00).

SPSS software (for Windows, version 26.0, IBM Corp, USA) was used for all statistical analysis. P < 0.05 indicated a significant difference.

Results

Patient characteristics

Our final study population consisted of 61 patients (mean age at TACE 59 years ± 10; 47 men) with 79 lesions. This included 14 women (23%, mean age, 60 years ± 7) and 47 men (mean age, 60 years ± 11, Fig. 1). Hepatitis B viral infection (87%, 53 of 61) was the predominant etiology of underlying liver disease. 47 (77%, 47 of 61) Patients had a single lesion, 11 (18%, 11 of 61) patients had two lesions, and 3 (5%, 3 of 61) patients had three or more lesions. Most patients (97%, 59/61) underwent hepatic resections and only two underwent liver transplantation. The median interval between post-MRI and hepatic surgery was 6 days (1–27 days). Consensus LI-RADS category assignments for pre-TACE MRI findings were as follows: 4 as LI-RADS 4, 75 as LI-RADS 5. The majority (81%, 64 of 79) of lesions had been treated with conventional TACE, and the remaining 15 (19%, 15 of 79) lesions had been treated with DEB-TACE. 65 Lesions underwent one time TACE treatment, 12 lesions underwent two times TACE treatment, and 2 lesions underwent three times TACE treatment. The pre-TACE characteristics of patients are summarized in Table 1.

Fig. 1
figure 1

Study flowchart. TACE transarterial chemoembolization, HCC hepatocellular carcinoma, LRT locoregional therapy

Table 1 Patient characteristics prior to TACE

Category assignment according to LR-TR and modified LR-TR Algorithms

On histopathology, 57 lesions were confirmed to be non-complete pathologic necrosis (non-CPN), and the remainder were confirmed to be complete pathologic necrosis (CPN). By consensus, 47 (59%, 47 of 79) of treated lesions were categorized as LR-TR viable, with 43 (91%, 43 of 47) lesions demonstrating non-CPN, 19 (24%, 19 of 79) of treated lesions were categorized as LR-TR nonviable, with 14 (74%, 14 of 19) lesions demonstrating CPN. 13 (16%, 13 of 79) of treated lesions were categorized as LR-TR equivocal, with 9 lesions showing non-CPN and the other 4 showing CPN. According to the mLR-TR algorithm adding ancillary features, in consensus, 7 of equivocal lesions were assigned to viable, all of these were incompletely necrosis on histopathology, 6 of LR-TR equivocal lesions were assigned to nonviable, and 4 lesions were completely necrosis on histopathology, the percentage of viable tumors were both less than 10% in 2 false negative lesions.

Diagnostic performances of LR-TR and modified LR-TR algorithm

The diagnostic performance in predicting complete and incomplete tumor necrosis based on different algorithms is summarized in Table 2, each reader’s performance is presented in Table E2 (online). For non-CPN prediction, the sensitivity, specificity and accuracy of LR-TR viable were 75% (43 of 57), 82% (18 of 22) and 77% (61 of 79) respectively when LR-TR equivocal category was treated as nonviable (Fig. 2). For CPN prediction, the sensitivity, specificity and accuracy of LR-TR nonviable were 64% (14 of 22), 91% (52 of 57) and 84% (66 of 79), respectively, when LR-TR equivocal category was treated as viable.

Table 2 Diagnostic performances of treatment response algorithms in consensus reading in predicting histopathologic incomplete necrosis and complete necrosis
Fig. 2
figure 2

52-year-old woman with a history of hepatitis B infection. ac MRI examination performed before treatment. Precontrast fat-suppressed T1-weighted image (a) shows an 8.6-cm hypointensity lesion in right posterior lobe of the liver with non-rim arterial phase hyperenhancement (APHE) in arterial phase (b) and non-rim washout as well as enhancing capsule in portal venous phase (c). According to Liver Imaging Reporting and Data System (LI-RADS) algorithm, this observation is categorized as LI-RADS 5. di MRI examination performed after conventional transarterial chemoembolization (cTACE). A 7.6-cm TACE-treated lesion shows hypointensity on precontrast fat-suppressed T1-weighted image (d) with nodular APHE in arterial phase (e) and persistent enhancement in delayed phase (f). The lesion is deemed as viable category according to LI-RADS Treatment Response algorithm (LR-TRA) by the presence of nodular APHE. The lesion shows restricted diffusion on diffusion-weighted image (b = 500 s/mm2) (g) and ADC map (h), and intermediate hyperintensity on T2-weighted image (i) which also approve the viable diagnosis. Pathologic examination after surgery revealed noncomplete pathologic necrosis with a histological subtype of microtrabecular

Compared to LR-TR v2018, the sensitivity of the mLR-TR version in predicting non-CPN increased from 75 to 88% (50 of 57) with P value = 0.016 (Fig. 3), the specificity was slightly decreased, with no statistical differences (77%, 17 of 22, P = 1.000). For CPN prediction, the sensitivity of mLR-TR version was higher than LR-TR and the specificity was slightly decreased but both with no statistical differences (P = 0.250 vs. 0.500). Interreader agreement of LR-TR and mLR-TR category was moderate (k = 0.50, [95% CI 0.33, 0.67], k = 0.42, [95% CI 0.20, 0.63]).

Fig. 3
figure 3

61-year-old woman with cirrhosis due to hepatitis B infection. ac MRI examination performed before treatment. Precontrast fat-suppressed T1-weighted image (a) shows a 2.8-cm hypointensity lesion (arrow) in left lobe of the liver with nodular APHE in arterial phase (b) and washout in delayed phase (c) that is assigned a LI-RADS 5 category. dh MRI examination performed after cTACE. A 2.9-cm TACE-treated lesion shows hypointensity on precontrast fat-suppressed T1-weighted image (d) with indeterminate enhancement in arterial phase (e) and delayed phase (f) that is deemed LR-TR equivocal category. The lesion shows hyperintensity on diffusion-weighted image (b = 500 s/mm2) (g) and intermediate hyperintensity on T2-weighted image (h), the findings of these ancillary features support a modified LR-TR viable category. Pathologic examination after surgery revealed noncomplete pathologic necrosis. There was the other LI-RADS 5 lesion in left lobe of the liver with no enhancement after TACE (arrowhead in f) which revealed complete pathologic necrosis after surgery

Imaging features in predicting histological tumor viable

The diagnostic performance of each MRI feature is shown in Table 3, each reader’s performance is presented in Table E (online). In terms of viable tumor prediction, among the LR-TR MRI features, NMLIT with APHE showed the highest sensitivity (72%, 41 of 57), while “NMLIT with washout” and “enhancing similar to pretreatment” showed the same specificity (86%, 19 of 22) which was slightly higher than “NMLIT with APHE” (82%, 18 of 22). The feature of “NMLIT with washout” (n = 25) was mostly concomitance with that of “NMLIT with APHE” (n = 23) and independently successfully predicted tumor viable in only two lesions. The feature of “NMLIT with enhancement similar to pretreatment” (n = 28) did not change the categorization because it was mostly concomitance with that of “NMLIT with APHE” (n = 27) or “NMLIT with washout” (n = 21). For the ancillary features of mLR-TR, Intermediate hyperintensity on T2-weighted image showed high sensitivity (86%, 49 of 57) but low specificity (59%, 13 of 22). Restricted diffusion showed higher specificity (91%, 20 of 22) and slightly higher sensitivity (74%, 42 of 57) than NMLIT with APHE. Interreader agreement was moderate to substantial with κ values ranging from 0.56 to 0.66 for all assessed features.

Table 3 Diagnostic performance and interreader agreement of imaging features for detection viable tumors in consensus reading

Discussion

Our results showed that the modified LR-TR algorithm had higher sensitivity in predicting viable tumors compared with the LR-TR algorithm when LR-TR equivocal category was treated as nonviable (88%, 50 of 57 vs. 75%, 43 of 57, P = 0.016) and without significantly decreasing in specificity (77%, 17 of 22 vs. 82%, 18 of 22, P = 1.000). The results are in accordance with some previous studies on the EOB-MRI-modified TR algorithm [8,9,10], in which AFs including not only “intermediate hyperintensity on T2-weighted image” and “restrict diffusion” but also “TP washout” and “HBP washout”. Compared with studies on EOB-MRI, our mLR-TR algorithm on ECA-MRI achieved comparable sensitivity with fewer AFs, which could be related to several possible reasons. First, our approach is strictly interpret the LR-TR algorithm by emphasizing the presence of nodular, masslike, or irregular thick tissue morphology, even for ancillary features [11]. Second, the motion artifacts and weak enhancement in arterial phase for HBA-MRI may affect the identification of APHE, therefore, using ECA rather than HBA-MRI as a follow-up imaging modality after LRT may be more useful [12, 13].

Our study had a specificity of 82% to 86% in predicting viable tumors, which was consistent with most previous studies, but lower than the result by Kim et al. with a specificity of 93.3% to 100% [9]. The difference could be related to different reference standard, Kim et al. adopt both histological and clinical follow-up as reference standards, resulting in a significantly higher proportion of CPN (54%, 99 of 183) than in our study cohort (28%, 22 of 79) and most previous studies [5, 8, 11].

The interreader agreement was moderate for LR-TR category and mLR-TR category, which is accordance with previous studies [5, 11]. The variability in interreader agreement may be related to the following reasons. Despite the emphasis on NMLIT morphology, it is sometimes difficult to identify irregular thick tissue along the treated lesion, in addition, the judgment of indeterminate enhancement is different between readers. The interreader agreement of all posttreatment MRI features was similar (k range 0.56–0.66), and readers had the highest agreement regarding the feature of NMLIT with APHE (k = 0.66), which is consistent with previous studies 5, 7, 11.

The sensitivity of NMLIT with washout in previous studies varied widely, ranging from 12.8 to 73% [8, 9, 11, 14], which may be related to different imaging modalities and different type of LRT. Our study had a sensitivity of 39%, which was lower than that of majority of studies. For viable tumors after treatment, the proportion of lesions (49%, 20 of 41) demonstrating both “NMLIT with APHE” and “NMLT with washout” was lower than pre-treatment lesions (88%, 57 of 65), in the lesions demonstrating APHE but not showing washout after TACE, the observations were presented as isointensity (13 of 21) or hyperintensity (8 of 21) in delayed phase, while 17 of 21 ones showed washout on baseline MRI. The persistent enhancement manifestation was more common in the DEB-TACE group, accounting for 75% (6 of 8), which was much higher than in cTACE group (45%, 15 of 33). The result was consistent with the work by Adam et al., who found only 18.8% of cases with baseline washout also showed washout on post-treatment imaging [15]. The persistent enhancement may represent fibrosis as a response to tissue injury and repair [2], further investigations are needed to confirm the precise mechanism.

In our study, the presence of “NMLIT with enhancement similar to pretreatment” did not change the categorization because these features were concurrent with APHE or washout which was in accordance with some previous studies [5, 7, 8, 16]. This finding may have significant implications for future revisions of the LI-RADS TR algorithm [17].

There is no demonstrable difference in tumor response or posttreatment appearances between cTACE and DEB-TACE on MRI, but the ethiodized oil agent deposition in the tumor could influence the evaluation of posttreatment enhancement on CT [18,19,20]. Our result also showed the sensitivity of both LR-TR and mLR-TR algorithms in predicting viable tumors between cTACE and DEB-TACE did not have significant difference (cTACE: 76%, 89% vs. DEB-TACE: 73%, 82%).

There were some limitations in our study. First, our data was selected from a single tertiary care center with a relatively small sample, especially the patients underwent DEB-TACE treatment. In the future, more cases will be collected in multiple centers. Second, we only assessed the performance of ECA-MRI and did not compare with other imaging modalities such as contrast-enhanced CT and hepatobiliary contrast agents MRI. Third, although we adopt histopathologic results as reference standard, it is difficult to match lesions on imaging with pathology for small foci viable tumors. In addition, a maximum interval of 30 days between post-treated MRI examination and resection might introduce selection bias.

In conclusion, applying ancillary features on ECA-MRA to LI-RADS v2018 TRA can improve the sensitivity to predict pathologic tumor viability in patients treated with TACE for hepatocellular carcinoma without a loss of specificity. For TACE-treated HCC, viable tumors demonstrating “NMLIT with APHE” do not always show “NMLIT with washout”. The feature of “NMLIT with enhancement similar to pretreatment” do not change the categorization.