Introduction

Hepatocellular carcinoma (HCC) is the most common primary malignancy of the liver and is a major cause of mortality [1, 2]. Despite considerable advances in early diagnosis and surgical techniques, and improved post-operative management, the tumor recurrence rate remains high (50–60%) after curative resection [3, 4]. Studies have shown that microvascular invasion, poor histologic differentiation, and microsatellite nodules are more likely to be associated with recurrence after hepatic resection [5, 6]. Most of these risk factors, however, can only be assessed at post-operative pathologic examination and are therefore of limited value in guiding treatment decisions prior to hepatic resection.

Recently, preoperative imaging features such as rim enhancement and arterial peritumoral enhancement on dynamic contrast-enhanced magnetic resonance imaging (MRI) [7, 8], as well as non-smooth tumor margins [9], and tumor size [10] have proven useful for predicting HCC recurrence. Other imaging features, such as HCC capsule and hypointense nodules in the hepatobiliary phase (HBP) on hepatobiliary contrast agent-enhanced MRI, have also been proposed for predicting tumor behavior [11, 12]. Most of these imaging indices, however, are qualitative measurements and therefore subject to interpreter bias, possibly resulting in low diagnostic performance in predicting tumor recurrence after hepatectomy.

Diffusion-weighted imaging (DWI) can quantitatively evaluate the apparent mobility of water molecules in tissues, as indicated by the apparent diffusion coefficient (ADC) [13]. Previous studies have shown that low ADC values derived from DWI can be used to evaluate tumor behavior and recurrence after hepatectomy in patients with HCC [14,15,16]. However, because ADC values reflect a combination of diffusion and blood perfusion effects, they may not entirely account for all tissue characteristics [17, 18]. Conversely, intravoxel incoherent motion (IVIM) values derived from DWI with multiple b value models can reflect tissue diffusion and microcapillary perfusion separately [19]. A recent study has suggested that IVIM parameters correlate well with the degree of HCC differentiation [20]. The aim of our study was to determine whether IVIM values can be used to predict tumor recurrence in patients with HCC after curative resection.

Materials and methods

Patients

This retrospective study followed the principles of the Declaration of Helsinki and was approved by the Institutional Review Board. Written informed consent was waived. Between 1 March 2014 and 31 July 2017, all patients referred for abdominal MRI for suspected HCC due to chronic hepatitis B virus (HBV) infection were evaluated. Patients were ineligible for inclusion if they (a) had a tumor other than HCC, (b) had undergone previous anti-tumor treatment, (c) underwent hepatectomy at > 1 month after the MRI examination, (d) had simultaneous hepatitis C virus infection or any other chronic hepatic diseases carrying the risk of cirrhosis, (e) a lesion size < 1 cm, (f) massive necrosis and hemorrhage that could potentially impact accurate measurement, (g) obvious slice misregistration or motion artifacts, (h) no or irregular anti-HBV-DNA treatment after hepatectomy, (i) were unavailable or lost to follow-up, and (j) had a tumor with macrovascular invasion, which was defined as gross invasion into the main portal veins or their branches, hepatic veins or the inferior vena cava at gross examination or at imaging including MRI and computed tomography (CT), because such tumors are known to have a greater risk of recurrence [21,22,23,24] (Fig. 1).

Fig. 1
figure 1

Flowchart of inclusion and exclusion criteria. HCC, hepatocellular carcinoma; MRI, magnetic resonance imaging; HBV, hepatitis B virus

Clinical information, laboratory data, and pathology reports of eligible patients were retrospectively retrieved from our database. All patients were treated by curative resection, which was defined as complete macroscopic removal of the tumor with ample free section margins. The median time between the MRI examination and surgery was 7 days (range, 1–14 days).

Post-operative follow-up in all of the study patients after curative resection for HCC was performed with abdominal ultrasound and either contrast-enhanced CT or MRI, initially at 3-month intervals during the first year, and thereafter at 6-month intervals. In addition, serum liver profiles (alpha-fetoprotein [AFP] and HBV DNA levels) were obtained. Tumor recurrence was suspected when the serum AFP levels were progressively elevated and ultrasonography revealed a new hepatic lesion. Recurrence was diagnosed by dynamic CT (n = 15) or MRI (n = 32) according to typical imaging characteristics (hyper-enhancement on arterial phase images and washout on portal venous and/or delayed-phase images) [25, 26]. Recurrence was confirmed by surgical pathology when patients underwent re-resection (n = 8) or with biopsy (n = 5). All patients were followed until December 2017. The mean and median follow-up times for surviving patients after hepatic resection were 15 and 14 months (range, 3–43 months), respectively.

MRI protocol

MRI investigations were performed on a 3.0 T scanner (Discovery MR750; GE Healthcare) equipped with an eight-channel phased-array abdominal coil. All patients had fasted for at least 4 h before examination. DWI was performed by using a respiratory-triggered single-shot spin echo echo-planar imaging sequence in the transverse plane before contrast-enhanced imaging. Eleven b values from 0 to 1500 s/mm2 (0, 30, 50, 100, 150, 200, 300, 500, 800, 1000, and 1500) were applied and the number of excitations (NEX) for each b value was 1, 1, 1, 1, 1, 1, 2, 4, 4, 6, and 6, respectively. The acquisition parameters of the multiple b value DWI sequences were as follows: repetition time = 6000 to 10,000 ms, echo time = 56 ms, bandwidth = 250 kHz/pixel, acquisition matrix = 128 × 128, field of view = 38 × 30 cm, slice thickness = 5 mm, and slice gap = 1 mm. The acquisition of the DWI sequence with multiple b values typically took between 4 and 7 mins. The three perpendicular x, y, z gradient directions were applied simultaneously (i.e., 3-in-1 option on GEHC system). Gadobenate dimeglumine (MultiHance; Bracco) was administered intravenously using a dual-head power-injected bolus injector (Spectris Solaris EP; Medrad) at a dose of 0.1 mmol/kg bodyweight and at a flow rate 2.0 mL/s, followed by a 20-mL saline flush at the same rate. The liver protocols and the detailed acquisition parameters of the other MRI sequences used are summarized in Table 1.

Table 1 MRI acquisition parameters

IVIM post-processing and analysis

The ADC values were estimated by fitting diffusion-weighted signals at all b values (0–1500 s/mm2) to the following mono-exponential equation:

$$ {S}_{\mathrm{b}}/{S}_0=\exp \left(-b\cdotp \mathrm{ADC}\right) $$

where Sb is the signal intensity at a given b value and S0 is the signal intensity without diffusion weighting.

The IVIM model uses a bi-exponential fitting of the diffusion decay curve to separate true diffusion from “pseudodiffusion” which reflects tissue microperfusion. In the IVIM model, the signal intensity curves from a DWI acquisition with multiple b values are described by Eq. [1] as follows [19]:

$$ {S}_{\mathrm{b}}/{S}_0=\left(1-f\right)\cdotp \exp \left(-b\cdotp D\right)+f\cdotp \left(-b\cdotp {D}^{\ast}\right) $$
(1)

where Sb and S0 are signal intensities with and without diffusion weighting, respectively, D (true diffusion coefficient) represents pure molecular diffusion, D* (pseudodiffusion coefficient) represents incoherent microcirculation within the voxel, and f (pseudodiffusion fraction) is the proportion of the pseudodiffusion. To robustly separate the diffusion and perfusion effects in Eq. [1], a typical multi-step approach is applied [27,28,29]. Initially, D is estimated by mono-exponential fitting of the diffusion-weighted signals at high b values (b > 200 s/mm2) assuming that perfusion contributions are negligible in Eq. [2].

$$ {S}_{\mathrm{b}}/{S}_0=\exp \left(-b\cdotp D\right) $$
(2)

The pseudodiffusion fraction f and pseudodiffusion coefficient D* are subsequently estimated by fitting the measured signal intensity at all b values to Eq. [1] with fixed D.

Image analysis was performed independently by two abdominal imaging radiologists (S.K. with 14 years and Y.Z. with 3 years of experience in liver MRI), who were blinded to all clinical, laboratory, pathologic, and follow-up information. The workstation with FuncTool software (version AW 4.6, GE Healthcare) was used for post-processing. The region of interest (ROI) was manually drawn to encompass as much of the lesion as possible in the maximum tumor cross-section on a single axial (b = 1000 s/mm2) image using T2-weighted imaging (T2WI) as a reference, after reviewing all T2WI, DWI, and dynamic enhancement images. ROI boundaries were placed at least 5 mm away from the margin of the lesion to avoid partial volume effects [30]. Regions of necrosis and hemorrhage were avoided using the T2WI and dynamic enhancement on the T1-weighted images as reference. Subsequently, the IVIM parameters were calculated automatically. The final value of each parameter was the average of the values measured by the two radiologists. Additionally, the respective measures obtained by the two radiologists were used to determine inter-observer agreement expressed in terms of the intra-class correlation coefficient (ICC). A third abdominal imaging radiologist (H.Y. with 3 years of experience in liver MRI) performed an analysis of tumor size by measuring the longest cross-sectional diameter on axial images according to the liver imaging reporting and data system, version 2017 [31]. In patients with multiple tumors, a target tumor with the longest cross-section was selected for analysis.

Statistical analysis

Data for continuous measures are reported as the mean ± standard deviation or as the median and range when the two-sample t tests or nonparametric Mann–Whitney tests were used. Categorical variables are presented as numbers of patients and percentages and were analyzed by means of the chi-square or Fisher’s exact test. The best cutoff value for each IVIM-derived parameter was determined based on receiver operating characteristic (ROC) curves. Inter-observer agreement in measuring the IVIM values was determined by means of the ICC with 95% confidence interval (CI) and was interpreted as follows: 0.00–0.20, poor correlation; 0.21–0.40, fair correlation; 0.41–0.60, moderate correlation; 0.61–0.80, good correlation; and 0.81–1.00, excellent correlation [32, 33]. The follow-up time was calculated from the date of resection to the date when tumor recurrence was diagnosed, or from the date of the resection to the last visit if recurrence was not diagnosed and the patients were censored on the date of death or on the last date of follow-up. Recurrence-free survival rates were computed using the Kaplan–Meier method and compared using the log-rank test. Univariate analysis was performed to assess significant clinical characteristics and IVIM values associated with tumor recurrence after curative resection. Multivariate Cox regression (stepwise forward model) analysis was performed to identify independent predictors of recurrence. The results are expressed as hazard ratios (HRs) with 95% CIs. Finally, ROC curve analysis was used to evaluate the obtained multivariate Cox regression model. All statistical analyses were performed using SPSS 22.0 (IBM). p < 0.05 was considered statistically significant.

Results

Study population

The characteristics of the patients evaluated in this study are summarized in Table 2. Overall, tumor recurrence after hepatectomy occurred in 47 (29.9%) of the 157 patients included in the analysis. In 13/47 patients, recurrence was confirmed by biopsy (n = 5) or by pathologic assessment after re-resection (n = 8). In the remaining, the recurrence was diagnosed on follow-up CT (n = 8), MR (n = 21), and both CT and MRI (n = 5). The median time from hepatectomy to HCC recurrence was 14 months (range 4–32 months). The principal sites of recurrence were the distal segment (n = 21 [45%]), the adjacent segment (n = 15 [32%]), and the marginal segment (n = 3 [6%]). In eight patients [17%], recurrence was multisegmental. The median longest cross-section of largest recurrent tumor was 17 mm (range 5–47 mm).

Table 2 Demographic and clinical characteristics of the 157 patients with HCCs

Comparison of clinical and pathologic characteristics according to IVIM values

The inter-observer agreement was very good for ADC (ICC = 0.896, 95% CI = 0.841–0.932), D (ICC = 0.889, 95% CI = 0.831–0.927), and f (ICC = 0.870, 95% CI = 0.803–0.915). Moderate inter-observer agreement was observed for D* (ICC = 0.600, 95% CI = 0.437–0.725). ROC analysis showed that the optimal cutoff values for ADC, D, D*, and f for predicting recurrence were 0.858 × 10−3 mm2/s, 0.985 × 10−3 mm2/s, 12.5 × 10−3 mm2/s, and 23.4%, respectively.

The patients were divided into two groups with optimal cutoff value for each of the IVIM metrics for predicting recurrence. The clinical and pathological characteristics of the two groups were then compared based on optimal cutoff value for each of the IVIM metrics (Table 3). Patients with ADC values ≤ 0.858 × 10−3 mm2/s tended to have higher serum AFP levels than those patients with ADC values ˃ 0.858 × 10−3 mm2/s (30 [48%] vs 25 [27%], p = 0.007). Furthermore, there were more well-differentiated HCC lesions in patients with ADC levels ˃ 0.858 × 10−3 mm2/s (38 [40%]), D values ˃ 0.985 × 10−3 mm2/s (15 [65%]), and f values ˃ 23.4% (22 [41%]) than in patients with ADC values ≤ 0.858 × 10−3 mm2/s (2 [2%]), D values ≤ 0.985 × 10−3 mm2/s (25 [19%]), and f values ≤ 23.4% (18 [17%]) (p < 0.001–0.001). Finally, HCC lesions with f values ≤ 23.4% more frequently demonstrated microvascular invasion (51 [50%] vs 16 [30%], respectively, p = 0.017). No significant differences in baseline clinical or pathological characteristics were observed between groups with D* ≤ 12.5 × 10−3 mm2/s and D* ˃ 12.5 × 10−3 mm2/s (p > 0.05 for all variables).

Table 3 Patient demographic data and comparison of the patient characteristics according to the IVIM metrics

Comparison of preoperative characteristics of patients with and without HCC recurrence

A comparison of patients with and without HCC recurrence is presented in supplementary material 1.

Prediction of HCC recurrence

The Kaplan–Meier method and log-rank test revealed that patients with ADC values ≤ 0.858 × 10−3 mm2/s, D values ≤ 0.985 × 10−3 mm2/s, and f values ≤ 23.4% had shorter recurrence-free survival times (p = 0.015, p = 0.042, and p = 0.025, respectively; Fig. 2). D* was not significantly related to recurrence-free survival (p = 0.475; Fig. 2). The univariate Cox proportional hazards model showed that age > 60 years (p = 0.024), aspartate aminotransferase > 40 U/L (p = 0.017), AFP > 400 mg/dL (p = 0.014), ADC values ˃ 0.858 × 10−3 mm2/s (p = 0.048), D values ˃ 0.985 × 10−3 mm2/s (p = 0.030), and f values ˃ 23.4% (p = 0.031) were significant factors for recurrence. The multivariate Cox proportional hazards model revealed that among these factors, age (HR, 0.328; CI, 0.117–0.920, p = 0.034), AFP level (HR, 2.079; CI, 1.168–3.701, p = 0.013), and D (HR, 0.190; CI, 0.046–0.794, p = 0.023) were significant risk factors associated with tumor recurrence after hepatic resection (Table 4). The accuracy based on age and AFP alone was 67.5%, whereas accuracy based on age, AFP, and D value combined was 73.8%. Additional ROC analysis showed that the area under the curve (AUC) of the obtained Cox regression model was improved from 0.68 for the combination of AFP and age to 0.724 for the combination of AFP, age, and D (Fig. 3).

Fig. 2
figure 2

Comparison of recurrence rates according to the cutoff values of the IVIM values. The Kaplan–Meier analysis shows that the survival time was higher in patients with ADC values ˃ 0.858 × 10−3 mm2/s than in patients with ADC values ≤ 0.858 × 10−3 mm2/s (a), that the survival time was higher in patients with D values ˃ 0.985 × 10−3 mm2/s than in patients with D values ≤ 0.985 × 10−3 mm2/s (b), that the survival times were equivalent in patients with D* values ˃ 12.5 × 10−3 mm2/s and in patients with ADC values ≤ 12.5 × 10−3 mm2/s (c), and that the survival time was lower in patients with f values ≤ 23.4% than in patients with f values ˃ 23.4% (d). IVIM, intravoxel incoherent motion; ADC, apparent diffusion coefficient; HCC, hepatocellular carcinoma

Table 4 Univariate and multivariate analysis of the factors related to recurrence
Fig. 3
figure 3

Comparison of the ROC curve analyses of the obtained Cox regression model for the combination of AFP and age alone (A) and for the combination of D, AFP, and age (B)

Discussion

Our results demonstrated that HCC lesions with lower ADC, D, and f values tended to have poorer histologic differentiation and that lower f values in particular are associated with a higher incidence of microvascular invasion. More pertinently, we showed that patients with lower ADC, D, and f values tend to have a higher incidence of HCC recurrence. Our results also showed that D, along with age and serum AFP levels, is an independent risk factor for HCC recurrence after hepatic resection for HBV-related HCC. When combined with age and AFP levels, D, as derived from IVIM model, can further improve predictions regarding tumor recurrence (Figs. 4 and 5).

Fig. 4
figure 4

A patient with HCC that was treated with curative resection. The lesion had a low D value. Tumor recurrence occurred 4.67 months after surgery. a Fat-suppressed axial T2WI showing a focal liver lesion with an obvious hyperintense signal. b Axial diffusion-weighted image with a b = 800 s/mm2 shows a hyperintense focal liver lesion. cf ADC, D, D*, and f pseudocolorized maps showing that the mean ADC, D, D*, and f values of the tumor were 0.817 × 10−3 mm2 /s, 0.709 × 10−3 mm2/s, 5.78 × 10−3 mm2 /s, and 25%, respectively. ADC, apparent diffusion coefficient

Fig. 5
figure 5

A patient with HCC treated with curative resection without recurrence during a follow-up period of 19.75 months. The lesion had a high D value. a Fat-suppressed axial T2WI showing a focal liver lesion with obvious hyperintense signal. b Axial diffusion-weighted image with a b = 800 s/mm2 shows a hyperintense focal liver lesion. cf ADC, D, D*, and f pseudocolorized maps show that the mean ADC, D, D*, and f values of the tumor were 1.09 × 10−3 mm2/s, 0.994 × 10−3 mm2/s, 20.8 × 10−3 mm2/s, and 18.3%, respectively. ADC, apparent diffusion coefficient

Numerous studies have looked to evaluate radiologic features as markers to predict HCC recurrence. Previous studies have shown that tumor hypointensity on Gd-EOB-DTPA-enhanced HBP images may be a useful imaging biomarker to predict poor prognosis in patients with HCC after surgical resection [11, 34]. However, whereas the sensitivity of lesion hypointensity on HBP images was high, the specificity was relatively low potentially limiting its value as a preoperative marker for predicting tumor recurrence. In another study, Lee et al [7] reported that the combination of two or three imaging features (i.e., tumor size, non-smooth tumor margins, and peritumoral enhancement) was indicative of higher rates of early recurrence in patients with a single HCC. In Asia, HBV infection is the predominant etiology of HCC [35], and the mechanisms underlying the related hepatocarcinogenesis, as well as tumor factors and prognosis of HBV-related HCC patients, are different from those caused by other non-viral etiologies [36, 37]. Quantitative biomarkers that can predict tumor recurrence after hepatectomy in patients with HBV-related HCC remain unknown.

In our study, we found that tumor recurrence was more frequent in HCC patients with lower ADC values (≤ 0.858 × 10−3 mm2/s), D values (≤ 0.985 × 10−3 mm2/s), and f values (≤ 23.4%) derived from IVIM DWI. In support of our findings, Muhi et al [14] found that the recurrence rate was increased and the 3- and 5-year survival rates significantly decreased in HCC patients with ADC values ≤ 0.898 × 10−3 mm2/s, i.e., at an ADC cutoff very similar to that in our study.

It is established that histologic differentiation is one of the most important predictive factors of recurrence and survival after liver transplantation and hepatic resection [24, 38, 39]. A possible reason for the higher recurrence rate in patients with lower tumor ADC, D, and f values in our study may therefore be the greater proportion of poorly differentiated HCC lesions in our population. It is likely that the increased cellular density and decreased blood supply associated with poorly differentiated HCC result in decreased ADC, D and f values [20, 40, 41].

Our results revealed that HCCs with microvascular invasion, which is related to a 4.4-fold increase in the risk of tumor recurrence [42, 43], tend to have low f values. This may be another explanation for the higher recurrence rate of HCCs in patients with lower f values. The reason for which lower f values tend to be observed in HCCs with microvascular invasion is not completely understood but is most likely related to a decreased hemodynamic perfusion in HCC. Unfortunately, the bi-exponential IVIM model for DWI cannot explain the differential T2s in the model compartments, which results in an overestimation of pseudodiffusion fraction f in the liver [44, 45]. Therefore, the observed changes in pseudodiffusion fraction f in a clinical setting should be interpreted with caution. In our study, there was no correlation between the D* values and tumor recurrence which may have been due to the poor reproducibility of D*. Similar to our results, a previous study also demonstrated poor measurement reproducibility for D* [46, 47]. Heterogeneity of HCC could be one of the contributing factors to the poor measurement reproducibility of D* [46]. In addition, respiratory-triggering techniques, distribution of b values, and bi-exponential fitting problems may be other contributing factors [47,48,49,50]. Hence, further technical improvements are needed to strengthen the robustness of D*.

Whereas low ADC, D, and f values were associated with tumor recurrence, D, along with age and serum AFP levels, was found to be an independent risk factor for tumor recurrence after hepatic resection for HBV-related HCCs. The reason why D is an independent risk factor for predicting HCC recurrence might be that increased cellular density and increased nuclear-to-cytoplasmic ratio would be more prominent pathophysiologic phenomena than decreased blood perfusion in the recurrence of HCC [20]. Importantly, D values can be calculated from DWI at two b values. Therefore, a shorter DWI acquisition tailored to derive D alone may be a more practical and time-saving approach in clinical practice. Further research is needed to investigate this possibility.

In addition to MRI features, clinical markers can be potentially useful to preoperatively predict prognosis after curative resection of HCCs. In our study, elevated preoperative serum AFP level was associated with HCC recurrence. This finding was consistent with those of previous studies [51, 52]. However, An et al [8] reported that AFP was not a significant predictor of recurrence. This discrepancy in observations may be attributed to the heterogeneity of the patient populations evaluated. In addition, younger age was also associated with tumor recurrence. The reasons for the poorer prognosis of younger HCC patients may be due to their having a more advanced disease stage at diagnosis, a higher number of tumor nodules, and a more biologically aggressive tumor [53, 54].

In our study, the addition of D to AFP and age improved the preoperative predictive performance of HCC recurrence. An integrated model derived from these variables revealed an AUC of 0.724 for prediction of the risk of tumor recurrence. Previous studies reported that elevated serum AFP levels and younger age were associated with HCC recurrence [51,52,53]. An et al [8] reported that four MRI features, namely, rim enhancement, peritumoral parenchymal enhancement in the arterial phase, satellite nodules, and tumor size, were independently associated with early recurrence only in patients with a single tumor. Unlike these morphological features, quantitative IVIM values derived from DWI do not depend on contrast agents and could be used in patients with liver or renal insufficiency.

There were several limitations to our study. First, we only focused on HBV-related HCCs. Diffusion and blood perfusion effects may be different in HCCs caused by other etiologies. The cutoff values derived from IVIM warrant validation in other HCC populations with different etiologies for chronic liver diseases. Second, we measured IVIM parameters in the target tumor with the longest cross-section. However, single-slice ROI methods may not reflect the overall characteristic of the tumor. It is known that intra-tumor heterogeneity can be better captured by analyzing the entire tumor volume. Future work should therefore focus on the whole tumor volume rather than using one-section ROI methods. Third, histopathological proof was absent in 34 of 47 recurrent tumors. Last, the follow-up time was relatively short. Future work should prolong the follow-up period to better evaluate the long-term survival outcome and the role of IVIM parameters in predicting HCC recurrence.

In conclusion, our results suggest that lower ADC, D, and f values are associated with increased HCC recurrence in patients with HCC after hepatectomy. The D value may be a useful biomarker for preoperative prediction of recurrence after hepatectomy in patients with HBV-related HCC. When combined with age and AFP levels, D values can further improve the performance of predicting tumor recurrence. Patients with low D values, younger age, and high AFP levels may need more regular follow-up after curative hepatectomy to detect possible recurrence. Therefore, a shorter DWI acquisition tailored to derive D alone may be more practical and time-saving in clinical practice.