Introduction

Hepatocellular carcinoma (HCC) is a common malignancy that ranks as the fourth leading cause of cancer-related deaths worldwide [1]. Although most patients present with advanced disease when first seen by a physician, some patients are actually diagnosed at the very early and early stage (BCLC 0-A) [2]. Partial hepatectomy is the best modality for treating these patients [3]. However, it was found that the prognosis of very-early- and early-stage HCC after hepatic segmentectomy varied [4]. In some patients, the disease progresses slowly and has along disease-free survival (DFS), while in some patients, the tumors recur and progress quickly [4]. The varied prognosis may have resulted from the high heterogeneity of the tumors [5, 6]. Some tumors manifest as having highly aggressive characteristics, and the patients have a poor prognosis. Alternatively, some tumors appear as low-grade tumors with slow growth.

Microvascular invasion (MVI) is defined as the invasion of HCC cells to the microvascular of peritumoral tissues, which can only be visible on the pathological section under the microscopy after operation. It is one of the pathological features reflecting the invasiveness of the tumor. Highly aggressive HCC often exhibits obvious MVI, while low-grade HCC often appears with no MVI. HCC with MVI has a 4.4-fold higher risk of tumor recurrence, and the patient has a much shorter DFS [7]. Therefore, MVI is recognized as an important prognostic factor in HCC [8,9,10]. Preoperative prediction of MVI in HCC using noninvasive imaging modalities, such as β-2-[18F] fluoro-2-deoxy-d-glucose positron emission tomography/computed tomography ([18F]FDG PET/CT), magnetic resonance imaging(MRI), computed tomography(CT), or ultrasound (US) [11,12,13], has attracted clinical interest.

[18F]FDG PET/CT is an integrated imaging modality of PET and CT, which is always used for evaluating and staging the malignant tumors before surgery, including HCC [14, 15]. The semi-quantitative indices such as maximum standardized uptake value (SUVmax) and TLR (tumor-to-liver ratio) on PET images [16] are often used to quantify tumor glucose metabolism, which may have a positive relationship with MVI and the prognosis of HCC patient [17, 18]. Pseudocapsule sign of the lesion on CT was also reported to prompt the low grade of the tumor. On the other hand, fuzzy and irregular boundary of the lesion often indicates the invasiveness of the tumor, which often indicates MVI. However, both the SUVmax and TLR can only provide the gross glucose metabolism of the tumor, but not reflect the subtle heterogeneity of metabolism in different parts of the tumor. In addition, edge information of the lesion analyzed by visual assessment has great inter-observer variability.

Radiomics is an emerging method for imaging analysis using algorithms or statistical analysis tools to capture distinct phenotypic differences of tumors from diagnostic images. The radiomics features can provide a great deal of information beyond analysis with the naked eye and can sensitively determine the subtle heterogeneity of the morphology and function among different parts within the tumor at the cellular level [19]. Radiomics has shown its powerful potential in diagnosis and prognosis predicting in some tumors based on analysis of image features extracted from CT, MRI, or PET/CT. To the best of our knowledge, no article has introduced radiomics analysis to predict MVI and prognosis in very-early- and early-stage HCC. Therefore, in this paper, we investigate the capacity of the radiomics signature extracted from preoperative [18F]FDG PET/CT to predict MVI and DFS in patients with very-early- and early-stage HCC.

Materials and methods

Patients

Ethical approval was waived by the local Ethics Committee of Guangzhou Medical University and Southern Medical University in view of the retrospective nature of the study.

We retrospectively reviewed data from patients with histologically confirmed very-early- and early-stage (BCLC0, BCLCA) HCC who underwent radical surgical resection between March 2013 and April 2019 at the First Affiliated Hospital of Guangzhou Medical University and Southern Medical University Nanfang Hospital. The tumors were staged according to the Barcelona Clinic Liver Cancer (BCLC) system. Inclusion criteria included (1) pathologically confirmed primary HCC, (2) [18F]FDG PET/CT performed less than 2 weeks prior to surgery, (3) availability of follow-up data and clinical-pathologic characteristics, and (4) no history of preoperative anti-cancer treatment. Patients were excluded if he/she had multiple primary cancers or had a previous history of cancer. The endpoint of this study was DFS calculated from the day of surgery to the date of disease progression (locoregional recurrences or metastases), death from any cause (censored), or the date of the last follow-up visit (censored). Local recurrence and distant metastasis were diagnosed based on clinical symptoms, physical examination, AFP essay, and imaging findings including chest X-ray, abdominal ultrasound, enhanced contrast CT or MRI, whole-body bone scan, and PET/CT. A total of 80 consecutive patients who met the criteria were enrolled and divided into two cohorts (training and validation cohort) with a ratio of 5:3 using computer-generated random numbers. As a result, 50 patients were allocated to the training cohort, and 30 patients were allocated to the independent validation cohort. Clinical information is presented in Table 1. Of these 80 patients, 63 patients underwent isolated enhanced CT, 7 had isolated MRI examinations, and 4 underwent both CT and MRI prior to radical surgical resection.

Table 1 Clinical and histopathological characteristics and imaging indicators of patients with very early and early HCC in the training and validation cohort

Histopathology

All patients underwent radical surgical resection within 2 weeks of [18F]FDG PET/CT examination. All surgical specimens were reviewed by a pathologist with more than 10 years of experience in liver pathology, in particular to estimate the presence of MVI. MVI was defined as the presence of tumor in a portal vein, hepatic vein, or a large capsular vessel of the surrounding hepatic tissue lined by the endothelium that was visible only on microscopy [20, 21].

PET/CT imaging

[18F]FDG was synthesized using the Tracer lab FXFN synthesis system (TRACERlab FXFDG; GE Healthcare, USA). PET/CT scan was performed using a BiographmCTx scanner (Discovery ST 8, GE Healthcare, WI, USA, and Siemens, Germany). Patients were instructed to fast for at least 4 h. The blood glucose levels of patients ranged from 3.6 to 6.1 mmol/L before [18F]FDG injection. Approximately 60 min after the intravenous injection of 3.70–5.55 MBq/kg of [18F]FDG, PET/CT data were acquired according to the guidelines for tumor imaging with [18F]FDG PET/CT 1.0. Then, PET images were converted into SUV units by normalizing the activity concentration to the dosage of [18F]FDG injected and patient body weight after decay correction.

VOI drawing and image analyzing

[18F]FDG PET/CT images were exported to the Lifex software (version 5.10) [22]. PET/CT images were analyzed by an experienced nuclear medicine physician with more than 20 years of experience in PET/CT diagnosis (XL Wang). Areas with abnormal uptake of [18F]FDG on PET and/or abnormal density on CT were defined as lesions. On the PET image, an ellipse iso-contour was drawn covering the lesion, and the volume of interest (VOI) in 3D was obtained semi-automatedly with an iso-contour threshold of 30% maximum. If the PET and CT images matched well, the VOI of PET was then copied to CT images to obtain the VOI on CT. If the lesion had a low uptake of [18F]FDG, the VOI was calculated on CT images and was then copied to PET to obtain the VOI on PET. On the CT images, the regions of interest (ROIs) were drawn section by section to measure the VOI. Enhanced contrast CT and MRI were sometimes used to help determine the VOI. VOIs for PET and CT were drawn respectively in case of the mismatching of lesions border on PET and CT. The texture features were then automatically extracted from the PET and CT images. Tumor FDG avidity was measured by tumor-to-normal liver standardized uptake value ratio (TLR) calculated with the following equation: TLR = maximum SUV of the tumor/mean SUV of the normal liver.

Then, the images were resampled to 1 × 1 × 1 mm3 voxels by three-dimensional Lagrangian polygon interpolation. On the CT images, Hounsfield units were then resampled into 400 discrete values (called bins) with absolute discretization from − 1000 to 1000 Hounsfield units, leading to a fixed bin size of 5.0 Hounsfield units. Four gray-level matrices were calculated in three dimensions, giving 49 radiomics features (including first-order and second-order features and volumes) for each of the CT-tumor VOIs. On the PET images, SUV units were resampled into 64 discrete values with absolute discretization from 0 to 30 SUV units, leading to a fixed bin size of 0.476 SUV units. Four gray-level matrices were calculated in three dimensions, giving 52 radiomics features for each of the PET-tumor VOIs. Values of extracted radiomics features were normalized using the Z-transform method [23]. All the texture features were summarized and defined in detail in the Supplemental Materials.

Feature selection and radiomics signature construction

The least absolute shrinkage and selection operator (LASSO) method was used to select the most useful features from the training cohort for prediction. A radiomics score (Rad-score) was calculated for each patient via a linear combination of selected features that were weighted by their respective coefficients.

Development of individualized prediction models

Univariable and multivariable logistic regression analyses or Cox regression analyses were used to analyze the value of clinical candidate predictors. Radiomics signature was applied to develop a diagnostic model for the prediction of MVI status and DFS. Backward stepwise selection was applied using the likelihood ratio test with Akaike’s information criterion as the stopping rule. Patients were classified into the high-risk group and the low-risk group using the X-tile analyses for the radiomics signature [24]. To provide the clinician with a quantitative tool to predict MVI and DFS, a radiomics nomogram was built based on multivariable analysis.

Assessment of nomogram performance

Calibration curves were plotted to assess the calibration of the radiomics nomograms and were accompanied by Hosmer-Lemeshow tests (a significant test statistic implies that the model does not calibrate perfectly). To quantify the discrimination performance of the radiomics nomogram, area under the curve (AUC) or Harrell’s C-index was calculated.

Validation of the radiomics nomogram

The radiomics nomogram was subjected to bootstrapping validation (10 bootstrap resamples) to draw the time-dependent C-index curves in the training cohort. Then, the model created in the training cohort was applied to all patients in the validation cohort and the performance of the internally validated nomogram was tested in the validation cohort. The AUC/C-index and calibration curves were derived based on the regression analyses.

Clinical utility of the radiomics nomogram

To estimate the clinical utility of the nomograms, decision curve analyses (DCA) were performed by calculating the net benefits for a range of threshold probabilities in the training cohort and validation cohort.

Statistical analysis

All statistical tests were performed using SPSS 24.0 and R statistical software (version 4.0.2). The differences in age, sex, BCLC stage, mean follow-up time, and other related clinical information between the training and validation cohort were assessed using independent sample t test, χ2 test, or Mann-Whitney U test, where appropriate.

The selected features after Z-transformation and the clinical features were revealed by a heatmap using an online tool called Morpheus (https://software.broadinstitute.org/morpheus). Univariate and multivariate regression analyses were used to evaluate the prediction value by SPSS 24. To predict MVI status, binary logistic regression analyses were used. To predict DFS, we used Cox regression analyses. A two-sided P < 0.05 was considered significant.

We used the “glmnet” package to perform the LASSO regression analysis. Nomogram construction and calibration plots were performed using the “rms” and “pec” packages. Bootstrapping validation of the radiomics nomogram was accomplished using the “hdnom” package. Comparison of two time-dependent AUCs was performed using the “time ROC” package. The survival curves were obtained by the “survminer” package. DCA was performed using the “rmda” and “ggDCA” package.

Results

Clinical characteristics

The study flowchart for predicting MVI and DFS via analysis of PET and CT texture features and clinical risk factors is presented in Fig. 1. Patient characteristics in the training cohort and validation cohort are given in Table 1. Of the 80 patients with BCLC0-A HCC, 45.0% (36/80) had the tumors with MVI (MVI+) and 55.0% (44/80) patients had the tumors without MVI (MVI−). MVI positivity was detected in 44.0% (22/50) of tumors in the training cohort, similar to 46.7% (14/30) seen in the validation cohort (P = 0.816). Follow-up data showed that 50.0% of patients in the training cohort experienced tumor relapse, which was not significantly different from the 46.7% seen in the validation cohort (P = 0.773). Univariate analyses revealed that there was also no significant difference in patients’ clinical and histological characteristics between training and validation cohort (P > 0.05, Table 1). The above information indicated that the patients in the training and validation cohort had a balanced distribution of survival and baseline clinical-pathologic characteristics.

Fig. 1
figure 1

Study flowchart for predicting MVI and DFS by analyzing PET and CT texture features and clinical risk factors in training and validation set

Development of the predictive models for MVI status

Univariable analyses were used to determine the relationship between clinical and histopathological features and imaging indicators with MVI status (Table 2). According to the results of the univariable analysis, none of the clinical factors was found to be closely related with MVI status (Table 2, P > 0.05 for all clinical features), neither the histopathological features, including BCLC stage, tumor differentiation and Child-Pugh classification (Table 2, P > 0.05). For the three conventional imaging indicators, the diameter was also not related to MVI status (Table 2, P > 0.05); however, close associations were observed between SUVmax and TLR with MVI in training cohort (P < 0.05).

Table 2 Univariate cox regression analyses for MVI in the training cohort

Potential radiomics features were selected to establish PET/CT model (M-PET/CT model) for predicting MVI status using the LASSO regression on the basis of 50 patients in the training cohort. LASSO analysis revealed that five PET texture features (HISTO_Entropy_log2, HISTO_Energy, GLCM_Contrast, NGLDM_Busyness, GLZLM_ZP), and six CT texture features (CONVENTIONAL_HUmax, CONVENTIONAL_HUQ3, HISTO_Kurtosis, HISTO_ExcessKurtosis, NGLDM_Coarseness, GLZLM_ZLNU), were most useful for predicting the MVI status. From the above-selected radiomics features, Rad-score for PET/CT (M-R-score) was calculated. Calculation formula for the Rad-score based on radiomics features was presented in the Supplemental Data. As a result, M-R-score was proved to be statistically associated with MVI (P = 0.001, Table 2). The heatmap, which was used to visualize the rough association of clinical features and M-R-score with MVI status, is provided in Figs. S1 and S2 in the Supplemental Data. From the results of univariable analyses, SUVmax, TLR and M-R-score were then brought into the multivariable logistic analysis, which finally demonstrated that only M-R-score was the independent predictor for predicting the MVI status (P = 0.001, Table 4). Only the M-R-score was used to build predictive model (named M-PET/CT model) for MVI status.

Assessment and validation of prediction models for MVI status

In order to identify the prediction performance, the ROC curves for the M-PET/CT model in the training and validation cohort were drawn (Fig. 2a, b). The AUC of M-PET/CT model in the training cohort was found to be 0.891(95% CI: 0.799–0.984) (Fig. 2a). The optimum cut-off of M-R-score generated by the AUC was 0.07. In this condition, the predicting sensitivity was 0.682 and the specificity was 0.964. The AUC of the M-PET/CT model in the validation cohort was 0.692 (95% CI: 0.497–0.887). The Hosmer and Lemeshow tests showed good fits not only in the training cohort (P = 0.796) but also in validation cohort (P = 0.874). The calibration curve of the M-PET/CT model for predicting MVI demonstrated good agreement between prediction and observation in training cohort (Fig. 2c). However, the unfavorable calibration of the M-PET/CT model was observed in validation cohort (Fig. 2d).

Fig. 2
figure 2

a, b ROC curve for the M-PET/CT model in the training (a) and validation (b) cohorts. c, d Calibration curves of the M-PET/CT model for MVI prediction in the training (c) and validation (d) cohorts. e, f Decision curve analysis for M-PET/CT model in training cohort (e) and validation cohort (f)

Clinical use

The decision curve analysis for the M-PET/CT model in the training and validation cohort is presented in Fig. 2e, f. The decision curve showed that if the threshold probability of a patient or doctor is > 10%, using the M-PET/CT model to predict MVI adds more benefit than either the treat-all-patients scheme or the treat-none scheme.

Feature selection and radiomics nomograms building for predicting DFS

According to the results of follow-up, a total of 39 patients, including 25 patients (50%) in the training cohort and 14 patients (46.7%) in the validation cohort, had the tumor relapse.

The LASSO regression analyses revealed that three PET texture features (HISTO_Energy_Uniformity, GLCM_Contrast_Variance, NGLDM_Coarseness) and four CT texture features (CONVENTIONAL_HUQ2, CT_HISTO_Kurtosis, CT_HISTO_ExcessKurtosis, CT_NGLDM_Busyness) were most useful for predicting DFS. PET/CT Rad-score (D-R-score) was then calculated based on the above texture features. The formula of Rad-score for predicting DFS was listed in the Supplemental Materials. The D-R-score optimum cut-off point generated by the X-tile analysis was identified at 0.5. Accordingly, patients were classified into the low-risk group (Rad-score < 0.5) and the high-risk group (Rad-score ≥ 0.5). The relationship between DFS and the clinical factors and histopathologic and imaging indicators were investigated. Among them, 6 variables including 4 clinical variables (INR, CB, TB and MVI) and 2 conventional PET imaging metrics (SUVmax and TLR) were identified to be closely related with DFS by univariate analysis (P < 0.05 for all) (Table 3 and Table 4, Fig. 3). The D-R-score was observed to be significantly associated with DFS (P < 0.001) (Table 3, Fig. 4). In the training cohort, Kaplan-Meier analysis showed the D-Rad-score was significantly correlated with tumor relapse and the patients with higher Rad-scores had a significantly higher incidence of tumor recurrence (P < 0.05, Fig. 4a). 13 (34.2%) patients were confirmed to have tumor relapses in the low-risk group; 12 (100%) patients in the high-risk group were confirmed to have tumor relapse (mean DFS, 40.82vs9.08 months, P < 0.0001). A similar result was also observed in the D-R-score in the validation cohort (Fig. 4b). Further multivariate analysis revealed that the INR, TB and D-R-score were statistically significant for predicting DFS. These three features were applied to develop the predictive model for DFS, which was named the D-PET/CT model. Meanwhile, a clinical model was also built after multivariate analysis of the clinical variables in the training cohort. The clinical model was integrated by the INR, TB, and MVI status. To provide the clinician with a quantitative method to predict patients’ probability of 1-, 2- and 3-year DFS, the nomograms of the D-PET/CT model and clinical model were developed (Fig. 5a–d).

Table 3 Univariate cox regression analyses for disease-free survival in the training cohort
Table 4 Univariate and multivariable Cox regression analysis of predictors in three predicting Models for MVI and DFS in the training cohort
Fig. 3
figure 3

 Kaplan-Meier analyses of the 6 selected features for patients in the training cohort

Fig. 4
figure 4

a, b Survival of patients stratified by the risk classification according to D-PET/CT radiomics signature for patients in the training (a) and validation (b) cohorts

Fig. 5
figure 5

a, b Radiomics nomogram of the D-PET/CT model and clinical model for predicting DFS. c, d Time-dependent area under ROC curve of the D-PET/CT model and clinical model for DFS prediction. eh Calibration curves of D-PET/CT model and clinical model for DFS prediction in the training cohort at the 1-year (e), 2-year (f), and 3-year (h) time points

Assessment and validation of prediction models for DFS

The C-indices of the D-PET/CT model and clinical model in the training cohort were 0.831(95% CI, 0.761 to 0.900) and 0.776(95% CI, 0.689 to 0.862), respectively. The C-index of the D-PET/CT model was superior to that of clinical model in the training cohort (P = 0.04). Using time-dependent receiver operating characteristic curve analysis, we found that the D-PET/CT model had higher predictive power for HCC recurrence than the clinical model at various time points in the training cohort (Fig. 5c, d). The C-indices of the two models for predicting DFS in the validation cohort were 0.640 (95% CI, 0.487 to 0.793) and 0.692 (95% CI, 0.552 to 0.832), respectively.

The calibration curves for predicting the probability of tumor recurrence at 1, 2, or 3 years after surgery of each model after 1000 times of bootstraps are shown in Fig. 5e–h. The calibration curves showed similarly good agreement between the estimation and actual observation in the training cohort. Correlation matrix for all the eligible features in the training cohort is shown in the Fig. S3.

Clinical use

The decision curve analyses for the D-PET/CT model and clinical model are presented in Fig. 6. Decision curve analyses showed that the D-PET/CT radiomics nomogram had a higher overall net benefit than the clinical-pathologic nomogram across most of the range of risk threshold (Fig. 6a). Decision curve analyses in the validation cohort were a little unsatisfactory but kept a similar trend with that in the training cohort (Fig. 6b).

Fig. 6
figure 6

Decision curve analysis for each model in training cohort (a) and validation cohort (b). The y-axis measures the net benefit. The net benefit was calculated by summing the benefits (true positive results) and subtracting the harms (false-positive results), weighting the latter by a factor related to the relative harm of an undetected cancer compared with the harm of unnecessary treatment. The radiomics model had the highest net benefit compared with both clinical model and simple strategies such as follow-up of all patients (green line) or no patients (horizontal black line) across the full range of threshold probabilities at which a patient would choose to undergo imaging follow-up in the training cohort

Discussion

The prediction of MVI and the tumor recurrence in patients with very early and early HCC (BCLC stage 0-A) are important for establishing a precise therapeutic strategy. Despite improvements in diagnostic and therapeutic modalities, the recurrence rate after treatment for HCC remains high [25]. Approximately half of the patients in our study developed tumor relapse. MVI is an important factor related to the recurrence of HCC. Preoperatively predicting MVI and DFS in patients with very early or early HCC (BCLC stage 0 or A) is important for establishing a personalized therapeutic strategy and helps to select a more aggressive treatment for those with high-risk factors in order to reduce the post-operation recurrence. In this study, we found that the radiomics signature of preoperative [18F]FDG PET/CT from patients with very-early- and early-stage HCC was an independent predictor for MVI status, while the clinical features were not. Meanwhile, we also demonstrated the role of the radiomics of preoperative [18F]FDG PET/CT in predicting DFS. Based on the above findings, we developed and validated a radiomics feature-based [18F]FDG PET/CT nomogram and a clinical nomogram for predicting DFS. Our data revealed that the D-PET/CT nomogram successfully stratified patients into high-risk and low-risk groups, with significant differences in DFS within 3 years. The D-PET/CT model performed better than the clinical-pathologic nomogram, indicating the incremental value of the D-PET/CT radiomics signature for individualized DFS in patients with very-early- and early-stage HCC.

Some studies reported that preoperative metabolic biomarkers of the HCCs, such as SUVmax and TLR, are independent predictors of MVI [26,27,28]. In agreement with their results, univariate analysis in our study also found that SUVmax and TLR were closely associated with MVI (P < 0.05), as higher SUVmax and TLR showed higher frequencies of microvascular infiltration than the lower glucose metabolism. However, all of the clinical features were not significantly associated with MVI (P > 0.05), which is different from reports by Xu and Seung et al. [18, 21]. In their studies, some clinical features such as clinical stage, AST, and AFP are important predictors of MVI [21, 29]. This difference is likely due to the difference in study object, as their studies included advanced liver cancer, whereas we only enrolled patients with very-early- and early-stage HCC. In our study, most patients had the low grade of HCC and preserved liver function, which might be one of the reasons that the clinical features were not significant predictors for MVI status. However, multivariate analysis showed that only the radiomics signature (M-R-score) was an independent predictor for MVI status in very-early- and early-stage HCC (P < 0.05). Our study revealed that the radiomics signature of [18F]FDG PET/CT was a stable predictor of MVI status with relatively high AUCs and calibrations. Ultrasound, contrast-enhanced CT and the gadoxetic acid–enhanced MRI based models had been used for predicting MVI status. The gadoxetic acid–enhanced MR model had the highest AUC (0.936; 95% CI 0.895–0.976) for predicting MVI status [30], followed by the contrast-enhanced CT model, both of which worked better than the ultrasound-based model [12]. The blood supply could be assessed via contrast-enhanced CT or MR, which may contribute to the high AUCs for prediction. When compared with other reported models, our study demonstrated that the M-PET/CT model (0.891, 95% CI: 0.799–0.984) exhibited similar performance with the contrast-enhanced CT model (0.909, 95% CI, 0.869–0.928). The CT component of the PET/CT had a relatively poor performance in assessing MVI due to its specific image acquisition (low dose without contrast enhancement). The performance improvement of the M-PET/CT model was likely due to the addition of the PET component.

Resection for early-stage HCCs is still plagued by a high recurrence rate, and approximately half of the patients in our study developed tumor relapse, which was in agreement with the results of previous studies [31, 32]. In this study, we revealed that the radiomics signature of [18F]FDG PET/CT was a stable predictor for MVI. However, the relationship between MVI, PET/CT based radiomics, and early tumor recurrence in very-early- and early-stage HCC remains unclear. Although the Barcelona Clinic Liver Cancer system is crucial for assessing the prognosis and establishing a treatment strategy, this staging system performs poorly in predicting tumor recurrence after hepatic resection or orthotopic liver transplantation for HCC [29]. In this study, Kaplan-Meier analysis revealed that there was no significant difference in the DFS distribution between BCLC 0 stage and the group of BCLC A stage (P = 0.269, Fig. S4a). Within BCLC A, the radiomics signature successfully identified high-risk patients with poor survival outcomes, for whom more intensified treatment was needed (Fig. S4b). Unfortunately, the radiomics signature failed to discriminate subgroups within stage BCLC 0 (n = 14), which may be largely attributed to the limited size of the study population. Unlike prior prognostic investigations that mainly analyzed patients at all stages of the disease, our current study focused exclusively on patients with very-early- and early-stage disease. Therefore, we sought to improve the predictive ability for prognosis of very early and early HCC by developing a novel method for categorizing patients into low- and high-risk groups for early tumor recurrence. To our knowledge, the association between radiomics features of PET/CT images and DFS of HCC patients has not been evaluated previously.

Radiomics refers to the comprehensive quantification of tumor phenotypes by applying a large number of quantitative imaging features, which may reflect changes in human tissues at the cellular and genetic levels, and provides more detailed information on tumor biology and microenvironment that are complementary to visual features [33, 34]. In this study, the extraction of advanced PET imaging features allowed us to assess intratumor heterogeneity quantitatively on the glucose metabolism scale. Cox regression analysis showed that the radiomics signature was an independent prognostic factor for DFS in very-early- and early-stage hepatocellular carcinoma, even after adjustment for the same stage (Fig. S4b). In order to use it for clinical practice, we established a radiomic-based prediction model and a clinical model for predicting HCC recurrence. We found that the prognostic performance of the radiomic-based model (D-PET/CT model) was superior to that of the clinical model in the training cohort (P < 0.05). The D-PET/CT model had a higher capability for risk assessment of early recurrence after radical resection of very early and early HCC and yielded the best discriminatory capacity (Fig. 5c, d). The metabolic signature of [18F]FDG PET/CT, a multi-parametric approach using fully integrated [18F]FDG PET/CT, successfully stratified those patients into high-risk and low-risk groups with significant differences in DFS within 3 years. Patients at high risk for recurrence are potential candidates for more intensive surveillance and adjuvant therapy after curative surgery. Study findings have supported the prognostic value of the signature and allowing clinicians to potentially identify candidates for systemic approaches with greater effectiveness to improve treatment outcomes. Moreover, incorporation of D-PET/CT radiomics signatures into a clinical model can add prognostic information to better identify patients that may have different outcomes, and the radiomics nomogram is a good witness. The decision curve analysis demonstrated that the D-PET/CT radiomics nomogram was superior to the clinical-pathologic nomogram across most of the range of reasonable threshold probabilities, indicating that the D-PET/CT radiomics signature added incremental value to other clinical-pathologic risk factors for individualized DFS estimation.

Microvascular invasion is a well-known independent prognostic factor associated with more advanced tumor stage, tumor progression, and poorer clinical outcomes. In this study, the microvascular invasion was a high-risk factor associated with DFS (Fig. 3a). However, further analysis revealed that the radiomics signature is the independent prognostic factor, but not MVI (Table 4). As demonstrated in the current study, radiomics feature-based [18F]FDG PET/CT imaging signature by applying a large number of quantitative image features could be suggestive of survival outcomes, which may reflect changes of human tissues at the cellular and genetic levels and provide different information from MVI and thus is associated with patient prognosis. Except for radiomics signature, TB and INR were also significantly associated with DFS even in very early and early HCC and were risk factors for DFS prediction models (Fig. 3b, c), similar to the results of other studies that liver function is another independent prognosticator of tumor recurrence [35,36,37]. Previous studies have explored the relationship between SUVmax and HCC outcomes. HCCs with increased [18F]FDG uptake showed molecular features of more aggressive biologic properties than those with a low [18F]FDG uptake [14]. The findings of this study support the above hypothesis that patients with a high SUVmax and TLR demonstrated a worse overall survival (Fig. 3e, f).

Our study has some limitations. First is the small sample size and the retrospective evaluation of data. The relationship between the clinicopathological parameters, MVI, and prognosis could have been influenced by the small sample size. To overcome these limitations, we are aiming at validating the model in prospective multi-center clinical trials. Finally, whether adjuvant therapy can provide a better survival benefit to patients classified as high Rad-score based on [18F]FDG PET/CT nomogram should be assessed in the future.

Conclusion

This study provided [18F]FDG predictive model that incorporated both the radiomics signature of [18F]FDG PET/CT and clinical risk factors. The radiomics signature can be conveniently used to facilitate the preoperative individualized prediction for MVI status and DFS in patients with very-early- and early-stage HCC. The radiomics signature of [18F]FDG PET/CT effectively predicted MVI status. Incorporating D-PET/CT radiomics into a nomogram along with other clinical-pathologic risk factors to estimate the disease-free survival for very early and early HCC was more accurate than the clinical-pathologic nomogram. Our predictive models are potential, noninvasive, and effective complements to clinical practice. The radiomics analysis could renew the application of [18F]FDG for management of HCC.