Introduction

Liver resection is one of the radical treatment options for patients with resectable hepatocellular carcinoma (HCC) [1]. Post-hepatectomy liver failure (PHLF) is the most serious complication of liver resection, accounting for more than 50% of postoperative deaths [2,3,4,5]. Previous reports have shown that the incidence of PHLF after liver resection varies and ranges from 0.7 to 39.6% [6, 7]. The occurrence of PHLF is decided by both the quality and the quantity of remaining liver parenchyma [8]. Preoperative assessment of background liver function is essential for preventing the development of PHLF.

Several clinical models are commonly used for predicting PHLF in patients with resectable HCC, including Child–Pugh stage, Model for End-Stage Liver Disease (MELD) scoring system, and albumin–bilirubin (ALBI) score. Zou et al [9] determined that the ALBI score is an independent predictor of PHLF and showed superior predictive value over Child–Pugh stage and MELD score. Unfortunately, the predictive ability of the ALBI score is still unsatisfactory due to limited data.

Multiphasic contrast-enhanced computed tomography (CT) is a widely recommended modality for the preoperative assessment of HCC [10, 11]. The liver consists of three distinct spaces, including the intravascular space, intracellular space, and extravascular extracellular space. The expansion of extracellular volume (ECV), due to the deposition of collagen and matrix proteins, is the histopathologic feature of liver fibrosis [12]. Several studies [12,13,14,15,16,17] have indicated that ECV, which is calculated using nonenhanced and equilibrium-phase CT images, can predict the degree of liver fibrosis. HCC patients usually suffer from liver fibrosis and cirrhosis, which is associated with PHLF [2, 8, 18, 19]. However, whether CT–derived ECV can predict PHLF in patients with resectable HCC is unclear.

Nomogram [20], simple visualization of complex regression equation, has been widely used. It is considered to be evidence-based, personalized, and accurate risk assessment. In this study, we aimed to develop and validate a nomogram based on CT–derived ECV for predicting PHLF in patients with resectable HCC and to compare the preoperative prediction efficacy of this nomogram with that of clinical models (including Child–Pugh stage, MELD score, and ALBI score).

Materials and methods

Patients

This retrospective study was approved by the Institutional Review Board of the two participating hospitals. Informed consent was waived.

Between June 2013 and May 2021, 1150 consecutive patients who underwent liver resection with histopathologically proven HCC at Chongqing University Cancer Hospital were reviewed retrospectively. A total of 121 patients who met the inclusion criteria were enrolled and formed the training cohort for this study. From November 2019 to July 2021, the other 81 consecutive patients in Chongqing General Hospital with the same selection criteria were enrolled as an independent external validation cohort. The inclusion criteria were as follows: (1) patients with histopathologically confirmed HCC after hepatectomy, (2) patients with Child–Pugh stage (grade A or B) liver function before operation, and (3) patients who underwent multiphasic contrast-enhanced CT examination and serum marker testing within 2 weeks before operation. The exclusion criteria were patients with any antitumor therapy before surgery. None of these included patients were analyzed in another study, nor as a sub-cohort of another published study.

Collection of data

Preoperative laboratory data (including biochemical and coagulation markers) were recorded, including plasma levels of platelet (PLT) counts, alanine aminotransferase (ALT), aspartate aminotransferase (AST), albumin (Alb), creatinine (Cr), total bilirubin (Tbil), hematocrit (Hct), international normalized ratio (INR), prothrombin time (PT), and prothrombin activity (PTA). The following preoperative comorbidities were recorded: ascites, high blood pressure, diabetes mellitus, the presence of viral hepatitis, and liver cirrhosis. Surgical data were recorded, including operation time, hepatic inflow occlusion duration (the total time of occlusion of hepatic blood flow during hepatic resection), blood loss in surgery, blood transfusion, and hepatectomy (major or minor). Major hepatectomy was defined as resection of four or more liver segments, and minor hepatectomy was defined as resection of three or fewer liver segments [21]. All resected specimens were subjected to histopathological analysis, and fibrosis stage (S0–4) and the presence of steatosis in the background liver were recorded.

The MELD score and the ALBI score were calculated using the following formulas [9]: MELD score = 11.2 × ln (INR) + 9.57 × ln (Cr, mg/dL) + 3.78 × ln (Tbil, mg/dL) + 6.43 × etiology (0 for cholestatic and alcoholic, 1 for liver cirrhosis due to virus and other causes); ALBI score = 0.66 × log10 (Tbil [μmol/L]) − 0.085 × (Alb [g/L]).

According to the International Study Group of Liver Surgery (ISGLS) [4], PHLF was defined as hyperbilirubinemia (serum Tbil > 22 μmol/L or above preoperative value) and increased INR (INR > 1.5 or PTA < 50%) on or after postoperative day 5, except for observed biochemical and clinical changes due to other reasons.

CT examination protocol

Multiphasic contrast-enhanced CT was conducted for all patients using two CT scanners (SOMATOM Definition AS and SOMATOM DRIVE, Siemens Healthineers). The scanning range was from the lung base to the iliac crest. Nonenhanced images were acquired using the conventional helical scan mode. Contrast material (Ioversol, 320 mg/mL iodine, HENGRUI Medicine) was injected via antecubital venous injection at a rate of 3.0–3.5 mL/s for a total of 80–100 mL (1.5 mL/kg of body weight). Bolus tracking was used. Arterial phase scanning began 7 s after the trigger attenuation threshold (100 HU) reached the level of the supraceliac abdominal aorta. Portal phase scanning began at a delay of 30 s after arterial phase scanning, and the equilibrium phase (EP) began at a delay of 90 s after portal phase scanning. The scanning parameters were as follows: tube voltage = 120 kV, reference tube current = 180 mAs, increment collimation = 128 × 0.6 mm, pitch = 1.0, and rotation time = 0.5 s. The reconstruction parameters were as follows: thickness = 1.5 mm, increment = 1 mm, and soft-tissue convolution kernel (I31). CT images were reconstructed with projection-based material-decomposition software using a standard reconstruction kernel.

Imaging analysis

With the known clinical and surgery data, radiologist 1 (15 years of experience in abdominal CT imaging) drew the regions of interest (ROIs) on the nonenhanced images and EP images in the PACS (DJ HealthUnion Systems Corp version 2.0). For each patient, three anatomical cross sections were chosen, including the intrahepatic inferior vena cava confluence, portal hilum, and gallbladder fossa [15]. Handcrafted ROIs were drawn along the margins of the future remnant liver in the preoperative baseline images, taking care to avoid blood vessels and bile ducts. To minimize the difference in image acquisition time, circular ROIs were placed in the abdominal aorta on the same plane (Fig. S1). The CT values (HU) of the liver (HUliver) and the abdominal aorta (HUaorta) were measured. The ECV values of the background liver at the section of the intrahepatic inferior vena cava confluence (ECV1), portal hilum (ECV2), and gallbladder fossa (ECV3) were calculated using the following formula:

$$ \mathrm{ECV}=\left[100-\mathrm{Hct}\left(\%\right)\right]\times {\Delta \mathrm{HU}}_{\mathrm{liver}}\div \left({\Delta \mathrm{HU}}_{\mathrm{aorta}}\right) $$

where ΔHU means the change in HU between EP and nonenhanced images. The mean value of ECV1, ECV2, and ECV3 was taken as the CT–derived ECV for the patient (ECV). The calculation process of the CT–derived ECV is also shown in Fig. S1.

To assess intraobserver and interobserver reliability of the CT–derived ECV, radiologist 1 and radiologist 2 (10 years of experience in abdominal CT imaging) drew ROIs at 4 weeks after the first assessment in the PACS in the same way.

Statistical analysis

All statistical analyses were performed using SPSS statistics version 25.0 (International Business Machines Corporation (IBM)). The Kolmogorov–Smirnov test was used to test whether the variable obeyed a normal distribution. Continuous variables are presented as the mean ± standard deviation (SD) or median and interquartile range. Categorical variables were summarized as counts and frequencies. Intraclass correlation coefficients (ICCs) and Bland–Altman plots were used to assess the intraobserver and interobserver reliability of the CT–derived ECV. Spearman’s rank correlation was used to assess the association between CT–derived ECV and the fibrosis stage of the background liver. To identify independent predictors for PHLF, we used univariate and multivariate analyses. For the univariate analysis, the Wilcoxon rank sum test or Student’s t test was used for continuous variables, and the χ2 test or Fisher’s exact test was used for categorical variables. Then, multivariate logistic regression analysis was performed to identify the independent variables for PHLF by using variables (p value < 0.05 in the univariate analyses). The β coefficients from the multivariate logistic regression analysis results were used to form the nomogram to assess the risk for PHLF using R software (version 3.6.2). The predictive performance of the nomogram and clinical models was assessed by receiver operator characteristic (ROC) analysis and nomogram, and the areas under the ROC curve (AUCs) were calculated. The Delong test was used to compare the AUCs. The Hosmer–Lemeshow test was used to assess the calibration of the model. A p value < 0.05 (two-tailed) was considered statistically significant.

Results

Characteristics of the study patients

The characteristics of the study patients in the training cohort and validation cohort are summarized in Table 1. The characteristics included the following categories: preoperative laboratory data, preoperative comorbidity, surgical data, histopathological data, and PHLF. In our study, 121 and 81 consecutive patients with resectable HCC were finally enrolled in the training and validation cohorts, respectively. In the training and validation cohorts, the incidences of PHLF were 39.67% (48/121) and 38.27% (31/81), respectively.

Table 1 The characteristics of the study patients in the training cohort and validation cohort

The intraobserver and interobserver reliability of the CT–derived ECV

The intraobserver and interobserver reliability of the CT–derived ECV were good in both the training and validation cohorts, with all ICC values greater than 0.80 (Table S1). The Bland–Altman plots of the CT–derived ECV showed good intraobserver and interobserver agreement in both the training and validation cohorts (Fig. S2). Therefore, the average value of ECV measured by the two radiologists was used for further analysis.

Correlation between CT–derived ECV and pathological fibrosis stage in background liver

Spearman correlation analysis revealed that CT–derived ECV had a strong correlation with the postoperative pathological fibrosis stage of the background liver (p < 0.001, r = 0.591; Fig. 1).

Fig. 1
figure 1

Scatterplot showed strong correlation between CT–derived ECV and the postoperative pathological fibrosis stage of the background liver (p < 0.001, r = 0.591). CT, computed tomography; ECV, extracellular volume

Independent predictors of post-hepatectomy liver failure in the training cohort

The univariate analyses of predictors for PHLF according to the ISGLS criteria in the training cohort are shown in Table 1. CT–derived ECV (28.02 vs. 25.67; p < 0.001), serum Tbil (14.94 μmol/L vs. 13.43 μmol/L; p = 0.013), and hepatic inflow occlusion duration (22.50 min vs. 0 min; p = 0.032) were higher in patients with PHLF than in patients without PHLF. Serum Alb (38.55 g/L ± 5.24 vs. 41.84 g/L ± 4.24; p < 0.001) was lower in patients with PHLF than in patients without PHLF. These four risk factors were accepted in the multivariate logistic regression analysis. With a method of input, CT–derived ECV (OR: 1.308, 95% CI: 1.139–1.502, p < 0.001), serum Alb (odds ratio [OR]: 0.855, 95% confidence interval [CI]: 0.771–0.949, p = 0.003), and serum Tbil (OR: 1.015, 95% CI: 1.000–1.030, p = 0.048) were independent predictors of PHLF (Table 2).

Table 2 Multivariate logistic regression analysis for PHLF in the training cohort

The median ALBI score of patients with PHLF was significantly higher than that of patients without PHLF (− 2.54 vs. − 2.73; p < 0.001). No significant differences in Child–Pugh stage or MELD score between patients with and without PHLF were found (both p > 0.05).

Development of the nomogram

These significant independent predictors were combined to form a nomogram for PHLF estimation in the training cohort (Fig. 2). The nomogram demonstrated good performance for PHLF prediction, with an AUC of 0.828 (95% CI: 0.756–0.901). The AUC of the nomogram for PHLF prediction was higher than that of the single predictor, including CT–derived ECV (AUC = 0.712), serum Alb (AUC = 0.680), serum Tbil (AUC = 0.595), and hepatic inflow occlusion duration (AUC = 0.645; Delong test: p = 0.008, p = 0.002, p < 0.001, and p < 0.001, respectively; Fig. 3). The importance ranking of these factors in the nomogram was CT–derived ECV > serum Alb > serum Tbil. Therefore, CT–derived ECV was the most crucial factor in the nomogram. The nomogram was well calibrated according to the Hosmer–Lemeshow test (p = 0.621) (Table 3 and Fig. 2).

Fig. 2
figure 2

The nomogram and calibration curve for prediction of PHLF in patients with resectable HCC. a The nomogram was developed by combined CT–derived ECV, serum Alb, and serum Tbil in the training cohort. The calibration curves of the nomogram in the training cohort (b) and validation cohort (c) showed a good consistency. PHLF, post-hepatectomy liver failure; HCC, hepatocellular carcinoma; CT, computed tomography; ECV, extracellular volume; Alb, albumin; Tbil, total bilirubin

Fig. 3
figure 3

Receiver operating characteristic (ROC) curve analyses of CT–derived ECV, serum Alb, serum Tbil, hepatic inflow occlusion duration, and the nomogram to predict PHLF. The nomogram showed the highest AUC value (0.828; 95% CI: 0.756–0.901) for PHLF prediction compared with CT–derived ECV, serum Alb, serum Tbil, and hepatic inflow occlusion duration (Delong test: all p < 0.05). PHLF, post-hepatectomy liver failure; Alb, albumin; Tbil, total bilirubin; CT, computed tomography; ECV, extracellular volume; AUC, areas under the curve

Table 3 Comparison of predictive efficacy for PHLF between the nomogram and clinical models by the training cohort

Comparison of predictive efficacy for PHLF between the nomogram and clinical models in the training cohort

The predictive power for PHLF prediction between the nomogram and clinical models was compared. There was no significant difference in Child–Pugh stage or MELD score between patients with and without PHLF (both p > 0.05). The AUC of the nomogram was significantly higher than that of the ALBI score (0.828, 95% CI: 0.756–0.901 vs. 0.708, 95% CI: 0.612–0.803; Delong test: p = 0.004) (Fig. 4). The cutoff value of the nomogram was 0.473, with a sensitivity of 68.8% and a specificity of 83.6% for predicting PHLF (Table 3).

Fig. 4
figure 4

Comparison of predictive power for PHLF between the nomogram and ALBI score. The AUCs of the nomogram were significantly higher than those of the ALBI score both in the training cohort (a) and validation cohort (b) (Delong test: p = 0.004, p < 0.001, respectively). PHLF, post-hepatectomy liver failure; ALBI, albumin-bilirubin; AUC, areas under the curve

Validation of the nomogram

For the purpose of validating the nomogram, we prepared a validation cohort from another hospital (n = 81) which met the inclusion and exclusion criteria (Table 1). In the validation cohort, the AUC of the nomogram was significantly higher than that of the ALBI score (0.821, 95% CI: 0.727–0.914 vs. 0.630, 95% CI: 0.516–0.735; Delong test: p < 0.001) (Fig. 4). According to the Hosmer–Lemeshow test, the nomogram showed good agreement between the prediction and actual observation (p = 0.697, Fig. 2). Two examples were given to illustrate the clinical application of the nomogram (Fig. S3).

Discussion

In our study, we developed and validated a nomogram for the prediction of PHLF in patients with resectable HCC. The nomogram integrated three independent factors, including CT–derived ECV, serum Alb, and serum Tbil. In addition, CT–derived ECV was the most crucial factor in the nomogram. The performance of the nomogram was superior to that of the clinical model with AUCs of 0.828 and 0.821 in the training and validation cohorts, respectively. The nomogram may be helpful for providing pretreatment consultation for patients who are suitable for hepatectomy.

We found that CT–derived ECV had a strong positive correlation with the postoperative pathological fibrosis stage of the background liver. The higher the ECV of the background liver, the more severe the liver fibrosis. Our results are consistent with those of previous studies showing that ECV was useful in estimating the liver fibrosis stage and the risk of liver-related events [12,13,14,15,16,17]. The histopathological mechanism of liver fibrosis is the expansion of extracellular volume due to the deposition of collagen and matrix proteins [12]. Based on these theories, we evaluated the efficacy of preoperative CT–derived ECV in predicting PHLF after hepatectomy for patients with HCC. CT–derived ECV as an independent and crucial predictor was integrated into the nomogram in our study. A higher ECV means a higher risk for PHLF in patients with resectable HCC.

Serum Alb and serum Tbil were both independent predictors for PHLF in our study, which was consistent with previous studies [20, 22, 23]. These two parameters are well-known indicators of liver function. In line with this, some previous studies have shown that impaired preoperative liver function and potential liver disease are important predictors of postoperative liver function impairment [24, 25]. In the present study, hepatic inflow occlusion duration, a factor in surgery, was identified as a significant predictor of PHLF by univariate analysis, while it was not included as a final independent predictor. The longer the hepatic inflow occlusion duration, the higher the risk for PHLF in HCC patients. During surgery, even though hepatic inflow occlusion is helpful in controlling bleeding, repeated hilar blocking and opening can lead to hepatic ischemia–reperfusion injury and thus increase the risk of PHLF. The reason why it was not included among the final independent predictors may be that the strong discrimination ability of the selected CT–derived ECV, serum Alb, and serum Tbil diminished the contribution of hepatic inflow occlusion duration in the final nomogram. Our result is different from that of a previous study by Shen et al [26]. Their study showed that hepatic inflow occlusion was not associated with PHLF in patients with HCC. The reasons may be that, on the one hand, hepatic inflow occlusion in their study was a categorical variable (yes/no), and the quantitative data in our study; on the other hand, the samples of the two studies are different in the present study; the ALBI score was an independent predictor of PHLF. This was in accordance with the study reported by Zou et al [9], which also showed that the ALBI score was a predictor of PHLF. The AUC of the ALBI score for PHLF prediction was 0.708 in our study, which was near the 0.745 published by Zou et al [9]. Zou et al also found that the MELD score and Child–Pugh stage could predict PHLF with AUCs of 0.649 and 0.665, respectively. However, our study demonstrated that the MELD score and Child–Pugh stage failed to predict PHLF. The following factors may explain the different results. First, the MELD score was determined based on serum INR, Cr, and Tbil according to the formula. Although serum Tbil was different between patients with and without PHLF in our study, the absence of differences in serum INR and Cr between patients with and without PHLF may lead to the failure of the MELD score to predict PHLF. Second, our study population consisted of patients with resectable HCC; thus, the preoperative baseline Child–Pugh stage of these patients was mostly grade A/B (grade A dominated). This may be one reason why the Child–Pugh stage did not show a significant difference in our study.

The performance of our nomogram was superior to the clinical models both in the training and validation cohorts (0.828 vs. 0.708, p = 0.004; 0.821 vs. 0.630, p < 0.001, respectively). The calibration curve showed that the predicted value of the nomogram was in good agreement with the actual observed. Shen et al [26] also constructed a nomogram in predicting PHLF with a C-index of 0.818 and 0.906 in the training and externally validated cohort, respectively. Their study included volumetric data (future liver remnant (FLR) ratio, spleen volume/FLR), the generation of these parameters needed the help of additional post-processing software, and the process was time consuming. Several studies [20, 27, 28] have shown that preoperative liver stiffness (LS) derived from ultrasound elastography imaging is a reliable predictor of PHLF. However, ultrasound-derived LS measurement requires not only well-trained operators but also other factors that must be controlled, such as abdominal obesity/body mass index [29], liver inflammatory activity, and adverse effects of abnormal bilirubin or transaminase [30, 31]. CT–derived ECV in our nomogram is easy to operate, has low requirements for operators, and has good reliability [17]. Meanwhile, our nomogram is a simple method, which does not add any additional examination to those already performed in routine.

This study still had several limitations. First, it was a retrospective study. Second, it was based on two centers in Chongqing, China, and the population sample was relatively limited. To further verify the clinical reliability and effectiveness of the proposed nomogram in this study, more centers and larger samples from different regions and countries are needed. Third, the proposed nomogram in this study was only compared with clinical models (including blood biochemical indices), and other indices (LS based on ultrasound, residual liver volume, and MR–based indices, among others) need to be added for further comparison and demonstration. Fourth, patients with PHLF were not further classified in detail (grade A/B/C) in this study. Due to the small number of cases, the data after classification and grouping were seriously unbalanced, and most patients were grade A or B.

Conclusions

The proposed novel nomogram, combined with CT–derived ECV, serum Alb, and serum Tbil, showed superior prediction of PHLF in patients with resectable HCC than the Child–Pugh stage, MELD score, and ALBI score. As a feasible and promising noninvasive tool, this nomogram may provide a new strategy for the preoperative prediction of PHLF.