Introduction

Hepatocellular carcinoma (HCC) ranks as the fifth most common malignant cancer and the third most frequent cause of cancer leading death worldwide [1, 2]. Chiefly, due to the high prevalence of hepatitis virus infection and alcoholic liver disease, the disease burden of HCC is still quite tremendous [3, 4]. Along with the improvement in the population surveillance program and imaging modalities in the past decades, the proportion of patients at early or intermediate stages are expected to increase [5, 6].

Liver resection (LR) is considered as the mainstream treatment option for patients with well-preserved liver function, anatomic resectable lesions, and no evidence of distant metastasis [711]. Although liver transplantation removes the tumor lesions and the impaired liver simultaneously, its application is limited by the lack of liver grafts and by huge economic costs [12]. The prognosis of HCC patients largely depends on the interplay of the tumor burden and liver function reserve. Usually, the tumor burden is appraised by the number and size of the tumor nodules. The Milan criteria (solitary HCC diameter ≤ 5 cm or up to three nodules smaller than 3 cm), which was initially proposed by Mazzaferro et al. [13] in 1996, have been accepted as one of the standards selecting the candidates for liver transplantation. And, its prognostic efficacy was later validated in the realm of surgical resection [14]. However, a substantial proportion of HCC patients exceeding the Milan criteria have been found to survive longer than expected after curative liver resection [15]. It therefore seems reasonable to develop a tumor burden assessing parameter that gains optimal prognostic efficacy as well as further reduces unnecessary exclusions due to the strict application of the narrow selection criteria.

Recently, total tumor volume (TTV), which incorporates the number and size of the tumor nodules, has been proven to be a useful parameter to describe tumor burden in HCC patients awaiting liver transplantation [1618]. However, scarce clinical evidences regarding the prognostic value of TTV in the LR-treated HCC patients have been reported. Lee et al. [19] compared the prognostic ability of Milan criteria and TTV in patients with TTV not more than 65.5 cm3. But, there have been no studies comprehensively evaluating the prognostic efficacy of TTV in HCC patients who underwent hepatectomy especially for patients with TTV > 65.5 cm3. We thus performed this retrospective analysis to evaluate the prognostic value of TTV in LR-treated patients.

Materials and methods

Patient selection

From January 2001 to December 2008, 539 patients with HCC as the only primary cancer who underwent curative liver resection (R0 resection) without previous history of treatments at the Department of Abdominal Surgical Oncology, Cancer Hospital, Chinese Academy of Medical Sciences, Beijing, People’s Republic of China, were identified. The diagnosis of HCC was determined by the evidence-based clinical guidelines and was confirmed by the pathological examination of the resected specimen. The negative resection margin was validated by the pathological examination. Of them, 18 patients were lost in the follow-up. Thus, finally, a total of 521 patients underwent R0 resection as the initial therapy for HCCs were taken into the study. Clinical data including patient demographics (e.g., age and sex), etiology of underlying liver disease, serum biochemistry, tumor number, tumor size, pathological reports, and liver function reserves were obtained from patients’ medical records. Information about tumor staging systems such as the seventh tumor-node-metastasis (TNM) staging of the American Joint Committee on Cancer (AJCC) [20], Cancer of the Liver Italian Program (CLIP) scoring system [21], Barcelona Clinic Liver Cancer (BCLC) staging system [22], and the Okuda staging system [23] for HCC were also collected. Patients’ informed consent was not required owing to the retrospective nature of the study.

Treatment

In our institution, the decision of curative intention LR was made by joint discussions. LR was usually considered in patients with (i) anatomically resectable tumors which were assessed by imaging examinations of the tumor size, tumor number, tumor location and vascular involvement; (ii) adequate liver function reserve (Child-Pugh grade A or B); and (iii) less than 25 % retention of indocyanine green 15 min after injection (ICGR-15).

Follow-up

Overall survival time of patients was calculated from the date of surgery to the date of last follow-up or death. The last date of follow-up was August 31, 2015. The follow-up was performed through face-to-face or telephone interview every 3 months in the first 2 years after operation and every 6 months thereafter or when tumor recurrence was highly suspected. At each follow-up visit, the information about physical examination, liver function tests, alpha-fetoprotein (AFP) level test, chest radiography, abdominal ultrasonography (US), contrast-enhanced computer tomography (CT) scans, and (or) liver magnetic resonance imaging (MRI) were obtained.

Definitions

The number and size of tumor nodules were measured by the CT scan in the preoperative evaluation. TTV was calculated as the sum of each tumor nodule volume; the volume of each tumor nodule was calculated as (4/3) × 3.14 × (maximum radius of the tumor nodule in cm)3 as previously described [16]. The patients were divided into three subgroups according to the tertiles of TTV (the first tertile <17.1 cm3, the second tertile 17.1–73.5 cm3, and the third tertile >73.5 cm3).

Statistical methods

Continuous variables were expressed as means with standard deviation (SD) or medians with range. Categorical variables were expressed as frequencies with percentages. For group comparisons, chi-squared test or Fisher’s exact test (categorical variables) and ANOVA test or Kruskal-Wallis test (continuous variables) were used to compare the differences between subgroups. Cumulative overall survival rates were appraised by the Kaplan-Meier method, and the differences were analyzed by the log rank tests. Significant factors identified in univariate analysis were subsequently enrolled in the multivariate Cox’s proportional hazard model. The likelihood ratio related to a Cox’s proportional hazard model was used to evaluate the homogeneity within categories of each system [24]. Discrimination for survival data was evaluated using the Harrell’s concordance index (C-index) [25]. The results of Cox’s regression were also presented using the Akaike information criterion (AIC) [26]. Cancer staging system with higher χ 2 value by the likelihood ratio test, higher C-index, and lower AIC value was considered to gain superior prognostic ability [26]. A two-tailed P < 0.05 was considered statistically significant. All statistical analyses were performed using SPSS 11.5 for Windows (SPSS Inc., Chicago, IL) and STATA version 10.0 (STATA Corp., College Station, TX, USA).

Results

The baseline characteristics of the patients

The baseline clinicopathological factors of the patients were summarized in Table 1. Among them, 453 (453/521, 86.9 %) were male patients and 68 (68/521, 13.1 %) were female patients. The mean age of the patients was 54.7 years old. Hepatitis B virus (HBV) was the dominant etiology of the underlying chronic liver disease in the study population (427/521, 81.9 %). Solitary tumor was detected in 442 (442/521, 84.8 %) patients. The maximum diameter of the tumor lesion was 4.7 ± 2.5 cm. The numbers of the patients at Child-Pugh class A and Child-Pugh class B were 509 (509/521, 97.7 %) and 12 (12/521, 2.3 %), respectively.

Table 1 Baseline clinicopathological features of the patients involved in this study

The distribution of the TTV

The TTV of the patients was 112.0 ± 222.4 cm3. The distribution of the patients with reference to the TTV, tumor number, and tumor size was shown in Fig. 1a, b. Adopting the cutoff values as 17.1 and 73.5 cm3, 174 (33.40 %) patients fell into the first tertile (<17.1 cm3), 172 (33.01 %) patients fell into the second tertile (17.1–73.5 cm3), and 175 (33.59 %) patients fell into the third tertile (>73.5 cm3). We compared the clinicopathological factors between the three groups. Patients with larger TTV tended to be featured with higher AFP level (P = 0.004; Table 2), presence of macrovascular invasion (P = 0.026; Table 2), multiple tumor nodules (P = 0.003; Table 2), larger tumor size (P < 0.001; Table 2), advanced TNM stage (P < 0.001; Table 2), advanced CLIP score (P < 0.001; Table 2), advanced BCLC stage (P < 0.001; Table 2), and advanced Okuda stage (P = 0.013; Table 2).

Fig. 1
figure 1

The distribution of total tumor volume (TTV) of all studied patients (a). The distribution of the number and size of tumor lesions of all studied patients when stratified by TTV (b)

Table 2 Comparison of the clinicopathological characteristics between patients in the TTV tertiles

Prognostic factors for the patients’ survival outcome

With a median follow-up of 116 months (range, 80–175 months), the median overall survival (OS) of the patients was 58 months. The 1-, 3-, and 5-year overall survival rates were 93.1, 69.9, and 46.3 %, respectively. The median OS of patients in the first tertile was 69 months, which was significantly longer than that of patients in the second tertile as 60 months (P = 0.005; Table 3 and Fig. 2). And, the median OS of patients in the second tertile was also significantly longer than that of patients in the third tertile (median OS 60 vs. 40 months, P < 0.001; Table 3 and Fig. 2). In the univariate analysis, gender as male (P = 0.048; Table 3), presence of macrovascular invasion (P < 0.001; Table 3), albumin < 35 g/l (P < 0.001; Table 3), AFP ≥ 400 ng/ml (P = 0.018; Table 3), and larger TTV (P < 0.001; Table 3 and Fig. 2) were significantly associated with inferior survival outcome. These factors were taken into the subsequent Cox multivariate analysis. Compared with patients in the first tertile, the adjusted relative risk for mortality was 1.328 (95 % confidence interval (CI) 1.053–1.675, P = 0.016) for patients in the second tertile and 1.898 (95 % CI 1.502–2.397, P < 0.001) for patients in the third tertile. Besides, presence of macrovascular invasion (hazard ratio (HR) = 3.693, 95 % CI 2.140–6.374, P < 0.001; Table 3), albumin < 35 g/l (HR = 1.957, 95 % CI 1.364–2.809, P < 0.001; Table 3), and AFP ≥ 400 ng/ml (HR = 1.253, 95 % CI 1.016–1.546, P = 0.035; Table 3) were identified as the independent prognostic factors regarding the OS.

Table 3 Univariate analysis and multivariate analysis of the overall survival
Fig. 2
figure 2

Kaplan-Meier survival analysis of overall survival (OS) stratified by the tertile of TTV (<17.1 vs. 17.1–73.5 vs. >73.5 cm3) (P < 0.001)

Comparison of the survival outcome within and beyond the Milan criteria in the first and second TTV tertile groups

Within the first and second tertiles of TTV (TTV ≤ 73.5 cm3), 312 patients were within Milan criteria and 34 patients were beyond Milan criteria. The median survival time of patients within and beyond the Milan criteria were 66 and 66 months, respectively. No significant difference in overall survival between the above two subgroups was detected (P = 0.183; Fig. 3).

Fig. 3
figure 3

The comparison of survival distribution between HCC patients within versus beyond the Milan criteria but within the first and second tertiles of TTV (TTV ≤ 73.5 cm3). No significant difference in long-term outcome between these two groups was detected (P = 0.183)

Evaluation of TTV-based CLIP score

We further appraised the prognostic value of TTV by replacing the “tumor morphology” section of the original CLIP score with the TTV category. In the new system, TTV < 17.1 cm3 was assigned as score = 0, TTV within the range from 17.1 to 73.5 cm3 was assigned as score = 1, and TTV > 73.5 cm3 was assigned as score = 2 (Table 4). Statistically significant differences of OS were found in the pair-wise comparison across the TTV-CLIP scores. The median overall survival time across the TTV-CLIP scores 0–6 were 70.5, 65.5, 50, 41, 32, 20.5, and 15 months, respectively (Fig. 4a). We then compared the prognostic performance of TTV-CLIP score (Fig. 4a), CLIP score (Fig. 4b), TNM staging system (Fig. 4c), BCLC staging system (Fig. 4d), Okuda staging system (Fig. 4e), and Milan criteria (Fig. 4f). TTV-CLIP gained the lowest AIC (AIC = 4932.804), the highest likelihood ratio test (χ 2) as 34.99, and the highest C-index as 0.6126 among the seven prognostic scoring systems (Table 5).

Table 4 Construction of the total tumor volume-based CLIP score (TTV-CLIP)
Fig. 4
figure 4

Comparison of the survival distribution of the TTV-CLIP scoring system (a), CLIP scoring system (b), TNM staging system (c), BCLC staging system (d), Okuda staging system (e), and Milan criteria (f)

Table 5 Comparison of prognostic performance of the tumor prognostic systems in patients with hepatocellular carcinoma

Discussion

Few previous studies have extensively evaluated the prognostic value of TTV, which combined the information of tumor number as well as tumor size, in patients receiving liver resection. Our results suggested that patients with larger TTV gained a shorter overall survival than those with smaller TTV. Larger TTV was associated with elevated AFP level, multiple tumors, larger tumor size, presence of macroscopic vascular invasion, and advanced tumor stages. We further incorporated the TTV into the CLIP score and built the TTV-CLIP. The outperformance of TTV-CLIP score in predicting the clinical outcome further upheld the prognostic value of TTV in HCC patients.

Our results showed that TTV might provide a simplified way to describe the tumor burden and was also a potential parameter in the staging system for HCC. Compared with reporting the tumor burden by the tumor number and size, TTV has several advantages. TTV incorporates the number and size of tumor nodules into one continuous variable. Analyzing a single continuous index may be much simpler than analyzing the size and number of tumor nodule simultaneously. It was observed by someone that patients with multiple nodules but moderate tumor size may gain better clinical prognosis than those of patients with single large tumor [27]. It may be partially explained by the fact that the TTV of the former ones might be smaller than that of the latter ones. Regarding these, reporting the value of TTV instead of the size and number of tumor nodule may describe the tumor burden more accurately. We found that 34 patients in the first and second TTV tertiles were beyond the Milan criteria. The percentage of patients exceeding the Milan criteria but with TTV less than 73.5 cm3 was as high as 10.9 %, which was higher than the reported 6.5 % by Lee et al. [19] using the cutoff value as 65.5 cm3. The survival outcome of patients within Milan criteria was comparable with those of the patients beyond Milan criteria but with TTV ≤ 73.5 cm3. And, distinct difference in the OS was observed between patients with TTV > 73.5 cm3 and patients with TTV ≤ 73.5 cm3. The results suggested that the second tertile of TTV (73.5 cm3) in our study could be used as an expanded criteria selecting candidates receiving hepatectomy. Further studies are warranted to validate the prognostic efficacy of the criteria.

Patients with larger TTV predisposed to have AFP ≥ 400 ng/ml and macroscopic vascular invasion. Elevated AFP level and macroscopic vascular invasion were proved to be adverse prognostic indicators in our analysis, which were also consistent with previous studies [28, 29]. AFP level has long been regarded as being closely linked with the aggressive behavior of the tumor cells as well as the disease progression process [29]. And, several studies revealed that AFP concentrations often increase concomitantly with the augment of tumor burden [30]. Since TTV can be used to rating the tumor burden, it is quite possible that a larger TTV is expected to be more often associated with elevated AFP level. Besides, it is also assumed that larger tumors are often accompanied with the higher incidence of vascular invasion and presence of satellite nodules [27]. The close relationship between larger TTV and adverse clinicopathological features further defended the prognostic value of TTV.

CLIP score was initially derived from a retrospective cohort study of 435 patients and was then externally validated in a lot of studies [21, 31]. In our study, two of the components of CLIP score, including AFP level and presence of macroscopic vascular invasion, both obtained statistical significance in the multivariate analysis for OS. The original CLIP score has an inherent defect as the scale for “tumor morphology” is relatively subjective, without specific size criteria. Therefore, its objectivity and reliability in predicting outcomes may be somehow compromised [32]. We thus replaced the tumor morphology section with TTV in the modified CLIP score system. The TTV-based CLIP score gained a better prognostic ability than that of the original CLIP score, TNM staging system, Okuda staging system, BCLC staging system, TTV tertile category, and Milan criteria. Further external validation studies are warranted to establish the value of TTV-CLIP score.

This study has a few potential limitations. First of all, the retrospective nature of the study may be susceptible to the selection bias and recalling bias. Secondly, as HBV is the predominant etiology of HCC in China, the results of the study may have more implication for HBV-dominant areas. Our results should still be interpreted with caution in areas where other etiologies such as HCV, alcohol liver disease, and non-alcoholic fatty liver disease predominate. Thirdly, the calculation of TTV is based on the assumption that all tumor nodules are spherical. In fact, many tumor lesions were irregularly shaped. In order to enhance the applicability of our results in the surgeon’s preoperative evaluation, the value of radius was measured by the preoperative CT scan, which might be somehow different from the actual values. In addition, our study chiefly focused on patients who underwent liver resection. It should be reminded that before entering into the clinical practice, the prognostic values of TTV and TTV-CLIP scores need more external validations.

Conclusions

TTV is a good parameter measuring the tumor burden of HCC. Patients with TTV ≤ 73.5 cm3 may gain more survival benefits from liver resection. And, the TTV-CLIP score may provide a good prognostic performance for LR-treated HCC patients.