Introduction

Hepatocellular carcinoma (HCC) is one of the most widespread neoplasms and is the third global cause of cancer-related deaths [1]. Microvascular invasion (MVI) is a significant risk factor responsible for tumor recurrence, specifically early recurrence, in HCC patients [2, 3]. The incidence of MVI in small HCC ≤ 3 cm is 18.1 to 37.0% [4, 5]. For patients with MVI-positive small HCC, surgical resection proves more beneficial than radiofrequency ablation (RFA) because the former technique is capable of clearing a larger percentage of micrometastasis. Furthermore, anatomic resection (complete removal of tumor-bearing portal territory) can avert early recurrence [2, 3, 6]. However, the major drawback lies in the fact that MVI is detected by histological examination after surgery. Therefore, it is important to explore methods that identify MVI at the preoperative stage.

Gadoxetic acid–enhanced magnetic resonance imaging (MRI) is highly sensitive in detecting HCC [7, 8]. Literature reports indicate the capability of specific MRI features in predicting the presence of MVI in HCC [9,10,11]. However, these features have limited capacities. Radiomics and deep learning are emerging methods utilized for predicting MVI in HCC. However, they are burdened by several key problems, such as the stability of high-throughput data, modeling methodology, and evaluation criteria [12,13,14].

The macroscopic classification of HCC is associated with MVI. The recurrence of the single nodular type with extra-nodular growth, confluent multinodular type, and infiltrative type is higher than that of the single nodular type [15, 16]. High-resolution images obtained from gadoxetic acid–enhanced MR can classify HCC at the macroscopic level. However, the interobserver agreement among the reviewers is not strong [17, 18]. Three-dimensional (3D) quantitative analysis is a technique that furnishes a more detailed and repeatable quantitative assessment of the tumor characteristics. Reports indicate the application of 3D quantitative parameters extracted from tumor lesions in preoperative diagnosis and risk stratification. Additionally, they display strong interobserver agreement [19,20,21]. These results enabled the hypothesis that 3D quantitative analysis of gadoxetic acid–enhanced MRI may help predict MVI in HCC.

Thus, the current study aimed to explore the significance of preoperative 3D quantitative analysis and imaging features of gadoxetic acid–enhanced MR in MVI and the relationship between the significant predictors and early recurrence after surgery of single HCC ≤ 3 cm.

Material and methods

The institutional review board approved this retrospective study, and informed consent from the participants was waived.

Study design

The patients were divided into the training cohort and the validation cohort according to time. The data from the training cohort enabled the screening of the significant image features and established the MVI prediction model. The data from the validation cohort verified the diagnostic performance of the prediction model.

Patients

Consecutive patients who underwent gadoxetic acid–enhanced MRI before HCC surgical resection between March 2012 and October 2019 were retrospectively included. The patients with the following criteria were as included: pathologically proven single HCC without suspicious lymph nodes, longest tumor diameter ≤ 3 cm in MR imaging, and examination of gadoxetic acid–enhanced MRI within 30 days before surgery. The inclusion and exclusion criteria are summarized in the patient flow diagram shown in Fig. 1.

Fig. 1
figure 1

Flow diagram showing the inclusion and exclusion criteria for the study

MR imaging data

All patients were scanned in a 1.5 T scanner (MAGNETOM Aera, Siemens Healthcare) and administered an intravenous bolus injection of 0.025 mmol/kg gadoxetic acid (Primovist, Bayer HealthCare Pharmaceuticals Inc.). The following MRI sequences were recorded: diffusion-weighted imaging (DWI), T2-weighted phase with fat suppression, pre-contrast T1-weighted phase and post-contrast dynamic T1-weighted volumetric-interpolated breath-hold examination (VIBE) at arterial phase (20–30 s), portal venous phase (60–70 s), transitional phase (180 s), and hepatobiliary phase (HBP, 20 min). The details of these MRI sequences are summarized in Table S1.

Imaging analysis

The image features analyzed in this study included 3D quantification parameters, traditional quantitative image features, and qualitative image features. Three radiologists (with 6, 8, and 18 years of experience in abdominal image diagnosis, respectively; hereafter referred to as reader 1, reader 2, and reader 3, respectively) participated in the lesion image evaluation. Reader 1 and reader 3 evaluated the quantitative image features, and reader 2 and reader 3 together investigated the qualitative image features.

3D quantitative analysis

The entire tumor segmentation exercise was accomplished using the LIFEx software (http://www.lifexsoft.org). Earlier reports have indicated that the boundary of the lesion is best visualized on the HBP images [22, 23]. Hence, in the current study, the regions of interest (ROI) were manually drawn along the tumor boundary on the HBP images. The tumors were first segmented by reader 1. Next, tumor segmentation was independently performed by reader 3 on randomly chosen 50 lesions (40 in the training cohort and 10 in the validation cohort) to test the reproducibility of the features extracted from the segmentation [24]. Both reviewers were aware of the HCC diagnosis but blinded to clinical, laboratory, histopathological, and follow-up results. The HCC contouring on MR images is shown in Fig. 2.

Fig. 2
figure 2

Examples of small HCC. ac Hepatocellular carcinoma with microvascular invasion in a 61-year-old woman. Tumor recurrence occurred in the liver 8 months after surgical resection. a Gadoxetic acid–enhanced arterial phase MR image shows a 2.8-cm hypervascular mass with peritumoral enhancement in the hepatic segment VI. b Hepatobiliary phase image shows as non-smooth tumor margin and peritumoral hypointensity. c Segmentation display at this level of the lesion on the HBP images. The tumor sphericity is 0.730. df Hepatocellular carcinoma without microvascular invasion in a 49-year-old man. Tumor recurrence did not occur during 24 months of the follow-up period after surgical resection. d Gadoxetic acid–enhanced arterial phase MR image shows a 2.9-cm hypervascular mass without peritumoral enhancement in the hepatic segment V. e Hepatobiliary phase image shows smooth tumor margin and no peritumoral hypointensity. f Segmentation display at this level of the lesion on the HBP images. The tumor sphericity is 0.818

After profiling the entire tumor, three 3D quantitative parameters were calculated using the LIFEx software: (1) volume (mL), the volume of interest in mL; (2) sphericity, ranging from 0 to 1. The sphericity of a perfect sphere is equal to 1; (3) compacity, reflecting how compact the lesion is. The 3D shape parameter formulae are given in the supplementary material.

Traditional imaging features

The following traditional quantitative image features were recorded: tumor size, tumor apparent diffusion coefficient (ADC) value, tumor-to-liver signal intensity (SI) ratio on DWI, and tumor-to-liver SI ratio on HBP. The maximum diameter on the transverse images defined the tumor size. The other parameters were measured at the level of the maximum diameter of the tumor. Manually, the ROI was placed on the ADC map, DWI (b value of 500 s/mm2), and HBP images to approximately include the entire tumor area. The peripheral portions were excluded to prevent interference by partial volume effects from the adjacent tissue. All lesions were first measured by reader 1. Reader 3 then independently measured 50 randomly chosen lesions (same as those mentioned in “3D quantitative analysis”) to evaluate interobserver reproducibility. Further details are given in the supplementary material.

The qualitative image features were independently reviewed by reader 2 and reader 3 by a picture archiving and communication system (PACS). The two readers arrived at a consensus, and a final decision was made regarding the inconsistent cases. The following qualitative imaging features were evaluated: (a) arterial rim enhancement, (b) arterial peritumoral enhancement, (c) tumor margin, (d) tumor capsule, (e) tumor hypointensity on HBP, (f) peritumoral hypointensity on HBP, and (g) enhancement pattern. The qualitative imaging features are described in the supplementary material.

Histopathological diagnosis and follow-up

The appearance of tumor emboli in the tiny blood vessels in the vicinity of the primary tumor is defined as MVI; the emboli can be visualized only microscopically. The histopathological data were obtained from surgical and pathologic reports completed initially by two experienced pathologists (with 15 and 21 years of experience in liver histopathological diagnosis, respectively). The etiology of liver disease for the patients recruited in the current study included chronic viral hepatitis B, viral hepatitis C, and others (alcohol intake and aflatoxin exposure). The number of cases with the respective etiologies is given in Table 1. After surgical resection, the patients were monitored with multiphase liver computed tomography (CT) or MRI and alpha-fetoprotein (AFP) every 3–6 months to evaluate tumor recurrence. Early recurrence was defined as an occurrence within 2 years after surgery of HCC [25].

Table 1 Patient characteristics

Nomogram construction and evaluation

The nomogram was constructed from the predictive model and illustrated graphically. Harrell’s C-index evaluated the discrimination performance of the nomogram. Calibration curves analyzed the diagnostic performance of the nomogram in the training and validation cohorts. The decision curve determined the clinical efficacy of the nomogram by quantifying the net benefits of the entire cohort under different threshold probabilities.

Statistical analysis

The statistical differences among the variables were evaluated using the two-sample t-test or the Mann–Whitney U test for continuous variables and the chi-square test or Fisher’s exact test for categorical variables. Univariate and multivariate logistic regression analyses assessed the independent risk factors for MVI. Nomograms were built using this prediction model. Statistically significant variables were further investigated using the receiver operating characteristic (ROC) curve analysis. The model’s capacity to predict MVI was calculated from the area under the curve (AUC). The best cutoff values for ROC curves were determined using Youden’s index. Interobserver agreement on qualitative and quantitative image findings was determined using kappa statistics and intraclass correlation coefficients (ICC), respectively. These values are explained as follows: 0.00–0.20, no agreement; 0.21–0.40, weak agreement; 0.41–0.60, moderate agreement; 0.61–0.80, strong agreement; and 0.81–1.0, almost perfect agreement. The sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) of each significant imaging finding and the combinations of the findings that predicted MVI were also calculated. Early recurrence curves were evaluated by using the Kaplan–Meier method, and differences among the subgroups were compared by the log-rank test. All statistical analyses were performed on the SPSS software (version 26.0) and R software (version 3.6.1). p < 0.05 indicated statistical significance.

Results

Demographic and pathologic characteristics

Table 1 shows the baseline clinical and pathologic characteristics of HCC. The final cohort of 214 patients (184 men and 30 women) was classified into two cohorts. The training cohort included 169 patients (37 for MVI-positive, 132 for MVI-negative; 147 men and 22 women) and was conducted from March 2012 to December 2017. The time-independent validation cohort involved 45 patients (12 for MVI-positive, 33 for MVI-negative; 37 men and 8 women) and was conducted from January 2018 to October 2019.

In the training cohort, statistical difference between the MVI-positive and MVI-negative groups was observed only in the Edmondson-Steiner grade (p = 0.003). However, this was a postoperative feature and not included in the multivariate analysis. Other baseline variables, including age, gender, background liver, and etiology of liver disease, were similar in both groups (p > 0.05). No significant difference was observed between the training cohort and the validation cohort considering other variables; total bilirubin (p = 0.001) was an exception. No significant difference was observed in the MVI status between the two cohorts (p = 0.498).

MR imaging characteristics

The quantitative MR imaging features of HCC are described in Table 2. In the training cohort, statistical differences between the MVI-positive and MVI-negative patients were observed in the following image features: tumor volume (p < 0.001), sphericity (p < 0.001), compacity (p = 0.002), tumor size (p < 0.001), and tumor-to-liver SI ratio on DWI (p = 0.034). The ICCs for the imaging findings were 0.913–0.980 (p < 0.001). The tumor volume of HCCs with MVI was larger than that of HCCs lacking MVI. The tumor sphericity of HCCs with MVI was smaller than that of HCCs without MVI, suggesting that the MVI-positive HCC lesions were less spherical. The tumor compacity of the HCCs with MVI was larger than that of HCCs without MVI, suggesting the more compact nature of the MVI-positive HCC lesions. The tumor size and tumor-to-liver SI ratio on DWI of the HCCs with MVI were larger than the respective parameters of the HCCs without MVI. No statistical difference was identified in the quantitative imaging features between the training and the validation cohorts (p > 0.05).

Table 2 Quantitative MR imaging findings

The qualitative MR imaging findings of HCC are described in Table 3. In the training cohort, statistical difference between the MVI-positive and MVI-negative patients was observed in the following image features: arterial peritumoral enhancement (p = 0.001), non-smooth tumor margin (p < 0.001), and peritumoral hypointensity on HBP (p < 0.001). Cohen’s kappa coefficients for the imaging results were 0.774–0.841 (p < 0.001). No statistical difference was identified in the qualitative imaging features between the training and the validation cohorts (p > 0.05).

Table 3 Qualitative MR imaging findings

Prediction model

The results of the univariate and multivariate analyses of imaging findings related to MVI expression in the training cohort are described in Table 4. The variables showing p < 0.05 in the univariate logistic regression analysis were applied to multivariate logistic regression analysis. Statistical difference was observed in the tumor sphericity (odds ratio [OR], 0.000; p < 0.001), non-smooth tumor margin (OR, 3.353; p = 0.015), and peritumoral hypointensity on HBP (OR, 14.067; p = 0.003) finally for MVI in HCC. The optimal tumor sphericity cutoff value of 0.800 was obtained based on Youden’s index of the ROC.

Table 4 Univariate and multivariate analyses for predicting MVI-positive HCC in the training cohort

The sensitivity, specificity, accuracy, PPV, and NPV for predicting MVI using the three significant factors and their combinations are described in Table 5. In the training cohort, the combination of all three factors produced the values of 45.9 (17/37) for sensitivity, 97.0 (128/132) for specificity, and 85.8 (145/169) for accuracy. In the validation cohort, when all three factors were combined, the values for sensitivity, specificity, and accuracy were 66.7 (8/12), 87.9 (29/33), and 82.2 (37/45), respectively.

Table 5 Diagnostic performance of significant imaging findings and combinations for predicting MVI-positive HCC

For the two cohorts, the diagnostic model was combined with the three significant factors. The resulting AUC values were 0.885 (95% confidence interval [CI]: 0.825, 0.946; training cohort) and 0.869 (95% CI: 0.753, 0.985; validation cohort) (Fig. 3). The diagnostic model (with the significant factors combined) exhibited the highest predictive value compared to that obtained using each significant MRI finding alone.

Fig. 3
figure 3

Receiver operating characteristic (ROC) curve of the three significant MR imaging findings and diagnostic model in the prediction of microvascular invasion for small hepatocellular carcinoma. a ROC curve in the training cohort; b ROC curve in the validation cohort. AUC: area under the curve

Development and validation of the nomogram

The nomogram for predicting MVI using the three significant MR imaging findings is illustrated in Fig. 4. The training cohort (C-index 0.885) and the validation cohort (C-index 0.869) were predicted satisfactorily. The calibration curves for prediction and observation agreed well in both cohorts. The net benefit of the decision curve for the predictive nomogram in the whole cohort was higher than that when it was assumed that all or no patients expressed MVI. Moreover, the threshold probability was between 1 and 85%, indicating that the therapy strategy based on our nomogram would be capable of improving clinical outcomes.

Fig. 4
figure 4

Nomogram for preoperative estimation of microvascular invasion (MVI) risk and its predictive performance. a Nomogram for predicting MVI probabilities. When using the nomogram, find the position of each variable on the axis and the corresponding point was identified vertically. Then, the points of all variables were added and converted into the MVI-positive probability on the bottom scale. b, c Calibration curves of the nomogram in the training and validation cohorts. The X-axis is the nomogram-predicted probability of MVI; Y-axis is the observed MVI, and the diagonal dashed line represents the ideal prediction by a perfect model. d Decision curves of the nomogram in the whole cohort. The green line is the net benefit obtained by assuming that all patients have manifested MVI; the black line is the net benefit obtained by assuming that no patients have manifested MVI; the red line is the expected net benefit of per patient based on the predictive nomogram

Early recurrence rates of patients

The early recurrence rates of patients with no, one, and two or three significant MR imaging findings are described in Fig. 5. In the training cohort, during the entire follow-up period (range, 28–731 days; median, 648 days), the recurrence rates of the three groups were 10.2% (6/59, no MR finding), 14.5% (8/55, one MR finding), and 29.1% (16/55, two or three MR findings), respectively. In the validation cohort, during the entire follow-up period (range, 24–731 days; median, 701 days), the recurrence rates of the three groups were 6.7% (1/15), 12.5% (1/8), and 36.4% (8/22), respectively. Early recurrence rates of patients expressing two or three significant factors were significantly higher than the patients that manifested no significant factors in the training (29.1% vs. 10.2%, p = 0.007) and the validation cohorts (36.4% vs. 6.7%, p = 0.037).

Fig. 5
figure 5

Early recurrence rates of patients. a In the training cohort. b In the validation cohort. Early recurrence rates of patients with single hepatocellular carcinoma with no (blue line), one (red line), and two or three (green line) significant MR imaging findings for predicting MVI

Discussion

In the present study, tumor sphericity, non-smooth tumor margin, and peritumoral hypointensity on gadoxetic acid–enhanced HBP were identified as independent and significant variables that predicted MVI in single HCC ≤ 3 cm. Moreover, these factors exhibited good interobserver agreement. The diagnostic specificity and accuracy of MVI were higher when the three MRI findings were combined. The nomogram constructed from the prediction model was predicted satisfactorily and calibrated well in both cohorts. Additionally, the early recurrence rate in patients with two or three MR imaging findings was significantly higher than that in patients without any MRI findings after surgery in both cohorts.

Tumor sphericity manifested as a significant MRI finding for predicting MVI in 3D quantitative analysis in the current study. This observation is consistent with previous results that described the significance of tumor sphericity in tumor preoperative diagnosis and risk stratification [19,20,21]. The macroscopic appearance of single nodular type with extra-nodular growth, confluent multinodular type, and infiltrative type indicates low tumor sphericity. The single nodular type corresponds to high tumor sphericity because tumors with longer shapes or more protrusions presented lower sphericity. Moreover, literature reports have indicated a higher postoperative recurrence rate of the single nodular type with extra-nodular growth, confluent multinodular type, and infiltrative type of HCCs than that of the single nodular type HCCs [15, 16].

Non-smooth tumor margin was an important MRI finding for predicting MVI in the present study, and this observation was consistent with that from previous studies [9, 26]. The aggressive biological tendencies of HCC for invading the tumor capsule and protrusion into the non-tumoral parenchyma are demonstrated by non-smooth tumor margins. Previous studies have related non-smooth tumor margin with early recurrence of HCC [26, 27].

Peritumoral hypointensity on HBP images was another important factor that predicted MVI in the current study. This observation was also consistent with those from previous studies [26, 28]. Peritumoral perfusion changes caused by MVI affect the organic anion-transporting polypeptide (OATP) expression of the hepatocyte membrane around the tumor. This changes the hepatic function and reduces the uptake of gadoxetic acid by hepatocytes around the tumors. This could be the probable mechanism of peritumoral hypointensity on HBP images of HCC with MVI. Additionally, previous studies have shown that the peritumoral hypointensity on HBP was related to the early recurrence of HCC [29, 30].

Total bilirubin was significantly different between the training cohort and the validation cohort, possibly due to the selection bias of the retrospective nature of the study. However, there was no significant difference in the other clinical-pathological variables, MVI states, and imaging features between the two cohorts. Thus, the data were similar between the training and the validation cohorts in the present study.

The current study exhibited good diagnostic performance in predicting MVI in HCC. When the three significant imaging findings of tumor sphericity (p < 0.800), non-smooth tumor margin, and peritumoral hypointensity on HBP were combined, the diagnostic specificity and accuracy of MVI increased in the two cohorts. The nomogram based on the prediction model performed satisfactorily in the two cohorts and with good calibration. The predictive nomograms based on clinical-radiological factors of previous studies [31, 32] were not as accurate as those obtained in the current study. Therefore, the current nomogram was capable of improving the diagnostic performance of MVI prediction. This nomogram might assist surgeons in treatment decision-making and promote a personalized therapeutic regime for HCC patients.

The current study confirmed that patients manifesting two or three significant MRI findings had a significantly higher early recurrence rate than patients with none of the findings after HCC radical resection in both cohorts. The combination of significant MR imaging findings for predicting MVI might have a synergistic effect in predicting early recurrence of HCC preoperatively, compared to each significant MRI finding considered individually.

The study has several limitations. First, the retrospective study design could result in selection bias. Second, the tumor size was limited to less than 3 cm in the study. Hence, the results cannot be generalized to HCCs that contain tumors with the largest diameter greater than 3 cm. Third, only the quantitative 3D parameters on HBP were analyzed; the differences among the sequences were not compared. Finally, the pathological macroscopic classification of HCC was not compared with the quantitative characteristics of HCC.

In conclusion, lower sphericity in 3D quantitative analysis, non-smooth tumor margin, and peritumoral hypointensity on HBP can function as preoperative imaging biomarkers for speculating MVI with good predictive performance. These factors are associated with early recurrence after surgery of single HCC ≤ 3 cm.