Introduction

Transarterial chemoembolisation (TACE) is the main form of treatment for hepatocellular carcinoma (HCC) patients with multifocal or large lesions and compensated liver function who are untreatable by resection [1]. Partial response was observed in 15–55% of patients who had received TACE, which showed significant delay in tumour progression including macrovascular invasion [2]. In patients with TACE failure, additional therapy may be considered. In order to decide the optimal therapeutic pretreatment option for each patient with an acceptable balance of risks and benefits, a key aspect lies in the assessment of patients via Barcelona Clinic Liver Cancer (BCLC) staging. This criterion is widely used to determine suitable treatment strategies validated by various scientific associations and research consortia [1,2,3,4]. Previous studies have focused on response evaluation with parameter changes reflecting HCC diffusion and perfusion characteristics derived from uni- or bi-exponential model of diffusion-weighted magnetic resonance imaging (DW-MRI) before and after TACE [5,6,7]. However, the results do not show consistent correlation between pre-TACE apparent diffusion coefficient (ADC) values of hepatic lesions and survival benefits.

A histogram is a particularly sensitive feature for pathological assessment; it provides additional parameters beyond the mean and quantitatively reflects the distribution of signal intensity throughout all tumour voxels [8], which has been widely used in the characterisation of benign versus malignant lesions in different tumours [8,9,10,11,12]. A biexponential model of intravoxel incoherent motion (IVIM) can noninvasively provide microcirculatory perfusion measurement in addition to diffusivity [13]. In this present study, we aimed to determine prospectively the feasibility of IVIM imaging based on histogram analysis to predict the efficacy of TACE.

Materials and methods

Patients

In this prospective study, we included 55 patients (mean age, 58 years, range, 35–83 years; 45 males and 10 females) with unresectable liver-confined HCC who were eligible for TACE between May 2014 and September 2016 under the approval of our hospital’s Institutional Review Board. All patients gave their written informed consent. Subsequently, we enrolled patients according to: (1) Existence of single or multinodular HCC diagnosed independently by two experienced clinicians on the basis of typical dynamic contrast-enhanced imaging combined with alpha fetoprotein (AFP) serology, according to the American Association for the Study of Liver Diseases (AASLD) guidelines [4]. (2) Presence of pretreatment MRI, including IVIM-DWI; brain and chest computed tomography (CT) were also implemented to exclude extrahepatic metastasis. (3) BCLC stage A or B and Child-Pugh class A or B [2, 4]. Exclusion criteria consisted of the following: (1) previous oncological treatment or recurrent HCC patients within 1 year after the last treatment; (2) tumour invasion of branches and trunk of the portal vein or hepatic veins on MRI; and (3) extrahepatic metastasis.

Transarterial chemoembolisation (TACE)

To identify all the feeding arteries of the tumour, angiography of the coeliac, hepatic, superior mesenteric, left gastric and bilateral inferior phrenic artery was performed with a 5-F RH catheter (Cook). After inserting a 2.7-F microcatheter (Renegade, Boston Scientific) into the target artery, 10–50 mg/m2 of epirubicin hydrochloride (Pharmorubicin, Pfizer) was mixed with 5–20 ml of iodised oil (Lipiodol Ultrafluide, Laboratoire Guerbet) via a pumping method. The dose of iodised oil and epirubicin was determined according to the tumour size and patient’s liver function. Under fluoroscopic monitoring, the mixture was infused through the microcatheter at a rate of 0.5–1 ml/min until flow in the tumour vascularity became stasis. Finally, a gelatin sponge 1–2 mm in diameter (Jingling) was used to embolise the feeding artery.

Imaging technique

All MR images were acquired on a 1.5T scanner (Magnetom Area; Siemens Healthineers) with phased-array coil 1 week before TACE (range, 1–7 days). The following sequences were used: A single-shot spin-echo planar free breathing IVIM DW imaging sequence with a bipolar DW scheme was performed in the axial view with 12 b values (0, 10, 20, 30, 40, 50, 70, 100, 200, 300, 500 and 800 s/mm2) (repetition time [TR]/echo time [TE] = 4,100/70 ms; section thickness, 6 mm; intersection gap, 1 mm; field of view [FOV] optimised to patients’ body habitus, 285 × 214–308 × 380 mm; matrix, 128 × 128) with a total acquisition time of about 6–7 min. Transverse respiratory-triggered T2-weighted with fat suppression turbo spin echo sequence (TR/TE = 3,400/106 ms; slice thickness, 5 mm; intersection gap, 1 mm; FOV optimised to patients’ body habitus, 285 × 214–308 × 380 mm; matrix, 168 × 320) with a total acquisition time about 3–4 min and transverse T1-weighted breath-hold in-phase and opposed-phase gradient echo (TR/TE = 6.87/4.75 [in-phase], 2.38 [opposed-phase] ms; section thickness, 4 mm; intersection gap, 1 mm; FOV optimised to patients’ body habitus, 285 × 214–308 × 380 mm; matrix, 180 × 320). A T1-weighted with fat-suppressed volumetric interpolated breath-hold examination (VIBE) sequence (TR/TE = 4.36/2.00 ms; section thickness, 3 mm; no intersection gap; FOV optimised to patients’ body habitus, 285 × 214–308 × 380 mm; matrix, 270 × 360) was performed before and after injection of contrast media. Arterial, portal venous and delayed phases were obtained at 20–30 s, 70–80 s and 180 s after a 0.2 ml/kg bolus injection of gadopentetate dimeglumine at a rate of 2 ml/s.

Imaging assessment

Diffusion-weighted imaging (DWI)

The acquired DWI data were post-processed using an IVIM work-in progress package (provided by Siemens Healthineers) so as to obtain the following parameters: true diffusion coefficient (Dslow), pseudodiffusion coefficient (Dfast) and perfusion fraction (PF), which were calculated using the formula Sb/S0 = (1 – PF) • e(-b • Dslow) + PF • e(-b • Dfast), where S(b) is the mean signal intensity (SI) under the given b value. In addition, ADCtotal and ADC(0,500) were calculated using 12 b values (0, 10, 20, 30, 40, 50, 70, 100, 200, 300, 500 and 800 s/mm2) and two b values (0, 500 s/mm2) with monoexponential fit using the model equation Sb = S0 • e-bADC.

Histogram analysis

A prototype software (MR OncoTreat, Siemens Healthineers) was used for histogram analysis on a Windows personal computer. The qualitative analysis was performed by one radiologist (L.Y., with 7 years of experience) who was blinded to any other results and was initially trained before the qualitative evaluation. For histogram analysis, DICOM data including IVIM images of b value = 0 s/mm2, Dslow, Dfast, PF, ADC(0,500) and ADCtotal maps were loaded into the OncoTreat software. The largest lesion was chosen as the target lesion. The radiologist manually painted the seed points for the whole lesion as large as possible covering both the viable and necrotic portion on IVIM (b value = 0 s/mm2) sequences from representative axial, sagittal and coronal images while avoiding large vessels, then automated co-registration of morphological images and corresponding parametric maps, and finally ran the semiautomatic 3D tumour segmentation (Figs. 1 and 2). The results of segmentation can be corrected by adjusting seed points to assure the voxel-based histogram data was generated for the entire target lesion without adjacent liver parenchyma. The following parameters were calculated: mean, median, standard deviation, skewness, kurtosis and the 25th, 75th and 90th percentiles.

Fig. 1
figure 1

A partial responder with hepatocellular carcinoma (HCC) in the right lobe (segment 6) before transarterial chemoembolisation (TACE). Regions of interest were manually circumscribed on intravoxel incoherent motion (IVIM) (b value = 0 s/mm2) sequences from representative axial (a), sagittal (b) and coronal images (c). Semiautomatic 3D tumour segmentation (d). Corresponding ADCtotal map (e). mRECIST response indicated a more than 30% decrease in the longest diameter of the enhancing tumour component between g, imaging before treatment and h, imaging 1 month after treatment (partial response). Histogram analyses of entire lesion shows low kurtosis of 2.90 and high median ADCtotal of 1.33×10-3 mm2/s. The patient had a time to progression (TTP) of 263 days

Fig. 2
figure 2

A non-responder of stable disease with hepatocellular carcinoma (HCC) in the right lobe (segments 5, 6, 7 and 8) before transarterial chemoembolisation (TACE). Regions of interest were manually circumscribed on intravoxel incoherent motion (IVIM) (b value = 0 s/mm2) sequences from representative axial (a), sagittal (b) and coronal images (c). Semiautomatic 3D tumour segmentation (d). Corresponding ADCtotal map (e). mRECIST response shows no diameter change between g, imaging before treatment and h, imaging 1 month after treatment (stable disease). Histogram analysis of the entire lesion shows high kurtosis of 5.18 and low median ADCtotal of 1.01×10-3 mm2/s. The patient had a time to progression (TTP) of 86 days

Follow-up strategy

All patients were followed up with monthly abdominal DCE MRI or CT and combined chest examinations, tumour markers and biochemical liver function tests for at least 6 months. Considering cost effectiveness and inter-patient variability, a 2-month interval at most has been adopted according to the idiomatic follow-up protocol in our hospital, which also coincides with Japanese guidelines [14, 15], in the next 6 months. Thereafter follow-up is conducted every 3–6 months as necessary. Patients with documented disease progression during the follow-up underwent additional TACE and/or systemic therapy.

Reference standard

TACE response was evaluated according to the modified Response Evaluation Criteria in Solid Tumors (mRECIST) [16], for which a maximum of two enhancing lesions were chosen as target lesion if multinodular. mRECIST assessment is based on the sum of unidimensional measurements of arterially-enhancing lesions of pre-TACE MR as well as 1 month after treatment MR or CT, defining complete response (CR) as disappearance of any intra-tumoral enhancement, partial response (PR) as at least a 30% decrease from baseline; progressive disease (PD) as at least a 20% increase from baseline; and stable disease (SD) as any response that did not qualify for inclusion in other categories [2, 17]. The appearance of new lesions or extra-hepatic metastases is also considered as PD. Patients with SD and PD were classified as non-responders while CR and PR were classified as responders. Time to progression (TTP) was defined as the time between the start of treatment and PD.

Statistical analysis

Statistical analyses were performed using SPSS (v. 20.0) and MedCalc software (MedCalc). Baseline characteristics of the patients were expressed as mean and standard deviation or count. Continuous variables were compared with Student’s t-test or the Mann–Whitney U-test if not normally distributed. Categorical variables were compared using Pearson’s chi-squared test or Fisher’s exact test. The histogram analyses for DWI parameters were compared between responders and non-responders using Student’s t-test or Mann–Whitney U-test. Additionally, ROC analyses were conducted on significant parameters to assess the ability of the imaging measures to discriminate between the responders and non-responders. Univariate analysis was performed using Kaplan–Meier and log rank tests to correlate the response variables with TTP. Multivariate analysis was performed by using Cox proportional hazard regression to determine independent factors associated with disease progression. A difference with a p-value of less than 0.05 was considered statistically significant.

Results

Patient characteristics

Patient pretreatment characteristics are listed in Table 1. There were 25 responders (2 complete responders, 23 partial responders) and 30 non-responders (18 stable disease, 12 progressions) according to mRECIST. The TTP of patients ranged from 27 days to 819 days with 14 censored data until September 2017.

Table 1 Patient baseline characteristics

Histogram measurements

The mean values of histogram metrics (mean, median, 25th, 75th and 90th percentiles, standard deviation, skewness and kurtosis) for Dslow, Dfast, PF, ADC(0,500) and ADCtotal maps are provided in Table 2. Median Dslow (p = 0.045), mean (p = 0.022), median (p = 0.043), 25th percentile (p < 0.001) and 75th percentile (p = 0.031) of PF, mean (p < 0.001), median (p < 0.001), 25th percentile (p < 0.001), 75th percentile (p = 0.001) and standard deviation (p = 0.049) of ADC(0,500), mean (p = 0.005), median (p = 0.008) and 25th percentile (p = 0.039) of ADCtotal were higher, while skewness and kurtosis of PF (p = 0.001, p = 0.005, respectively), kurtosis of ADC(0,500) (p = 0.005) and kurtosis of ADCtotal (p = 0.001) were lower in responders compared with non-responders.

Table 2 Differences in histogram metrics for intravoxel incoherent motion (IVIM) parameter maps between responders and non-responders

Diagnostic performance for identifying responders and non-responders

With regard to the above-mentioned parameters that showed a significant difference in the histogram analyses between responders and non-responders (Table 2), ROC curve analyses were further performed to evaluate diagnostic performance. The optimal cut-off value, as well as corresponding AUCs, p values, sensitivities, specificities, positive predictive values (PPVs) and negative predictive values (NPVs) are shown in Table 3 and Fig. 3. The p value of median Dslow (p = 0.063), 75th percentile of PF (p = 0.059) and standard deviation of ADC(0,500) (p = 0.061) were > 0.05, which meant that these parameters failed to distinguish responders from non-responders. Among the rest of the significant parameters, the best one, kurtosis of ADCtotal, had an AUC of 0.82, with a sensitivity of 76% and specificity of 86.7% at the optimum threshold of 4.69.

Table 3 Diagnostic performance of the valuable parameters for identifying responders and non-responders
Fig. 3
figure 3

Receiver operating characteristic (ROC) curve and corresponding areas under the ROC curve (AUCs) of significant parameters for prediction of responders to transarterial chemoembolisation (TACE). Numbers in parentheses represent 95 % confidence intervals

Association with time to progression (TTP)

A significant association was found between TTP and mRECIST criteria (p < 0.001). With regard to pretreatment clinical characteristics (Table 1) and histogram metrics that showed a significant difference between different groups (Table 3), mean PF (p = 0.076), median ADC(0,500) (p = 0.096) and kurtosis of ADC(0,500) (p = 0.691) failed to establish an association with TTP via univariate analysis. The result of association between the rest of the variables and TTP with univariate and multivariate analysis is shown in Table 4.

Table 4 Factors associated with time to progression (TTP) (n = 55)

TTP was significantly higher in responders with mRECIST assessment than in non-responders (p < 0.001). Patients with a kurtosis value of ADCtotal below 4.69 had significantly higher TTP than those above 4.69 (p = 0.036) (Fig. 4).

Fig. 4
figure 4

Kaplan-Meier curves show comparison of time to progression (TTP) between responders and non-responders as assessed with mRECIST (a), kurtosis of ADCtotal (b). P values are provided in the figure

Table 5 showed relationships between independent factors and TTP. mRECIST criteria (hazard ratio [HR] = 642.088) and 75th percentile of ADC(0,500) (HR = 5.742) demonstrated positive correlations with TTP, while kurtosis of ADCtotal (HR = 0.266) showed negative correlations.

Table 5 Independent factors for time to progression (TTP) with multiple-factor analysis

Discussion

Our study showed that histogram-derived parameters for PF, ADC(0,500) and ADCtotal maps were useful for discrimination between responders and non-responders to TACE in HCC patients. With multivariate analysis, we demonstrated that mRECIST assessment was associated with TTP independently, and that kurtosis of ADCtotal had the best predictive performance for disease progression.

Previously retrospective studies concentrating on ADC measurement showed conflicting results for predicting response to TACE in patients with HCC. Mannelli et al. [18] found that HCCs with poor and incomplete response to TACE (< 50% necrosis on post-TACE MRI) had significantly lower pre-treatment ADC calculated by using 3 b values (0, 50 and 500 s/mm2). Likewise, Park et al. [7] found a higher pretreatment ADC value calculated by using b-values of 0, 25, 50, 75, 100, 200, 500 and 800 s/mm2 in the lipiodol good uptake group than the lipiodol poor uptake group; however, there was no statistical difference between these two groups (p = 0.073). Furthermore, Dong et al. [19] found that the overall survival was significantly longer in the group with lower prechemoembolisation ADC calculated by using 2 b-value (0, 500 s/mm2) than in the group with higher ADC. In our prospective study, we strictly enrolled patients according to the inclusion criteria and ensured that regular follow-ups were conducted for an extended period of time without failure. Thus, we are confident in the homogeneity of our patients as far as staging and surveillance protocols are concerned with far more accurate TTP compared to other retrospective studies. The mRECIST adopted in our study is the main form of assessing response to TACE for HCC based on the European Association for the Study of the Liver (EASL) and the AASLD [20, 21]. We also demonstrated that mRECIST was independently associated with TTP. Similarly, Gillmore et al. [16] have shown that overall responses measured by mRECIST between 2 and 3 months after the first TA(C)E are independently associated with 1-year survival. Moreover, in our study, parameters were derived from histogram analysis of the entire lesion, while ADCs were derived from single slice-based regions of interest (ROIs) within tumours in their studies. The findings of our study showed that mean, median and 25th percentile of ADC(0,500) and ADCtotal, and 75th percentile of ADC(0,500) were higher in responders compared with non-responders. We hypothesised that tumours with lower pre-TACE ADC may exhibit higher tumour cellularity and/or higher tumour grade, as shown in previously published papers [12, 22], which may lead to poor prognosis. In addition, lower ADC may be secondary to reduced tumour perfusion, as ADC is a parameter contaminated by pseudo-diffusion [18].

By using IVIM with multiple b values and a bi-exponential model, a quantitative estimation for perfusion-sensitive parameters is generated. PF is defined as the fractional volume of capillary blood flowing in each imaging voxel [13, 23]. In our study, we found that the mean, median and 25th percentile of PF were higher in responders compared with non-responders. Woo et al. [22] found higher PF value in arterially hyperenhanced HCC. Chandarana et al [24] reported that the PF values of enhanced renal lesions were significantly higher compared with non-enhanced lesions (p = 0.037). As such, a low PF value may suggest slow movement or stagnant blood flow into capillaries of tumour, resulting in less chemoembolisation materials delivered to these areas. Likewise, Lewin et al. [25] put forward that increased PF values at 2 weeks in advanced HCC following sorafenib treatment is consistent with the effects of improved remaining vessels as well as basement membrane thickness and pericyte coverage. In this study, kurtosis of ADCtotal was independently associated with TTP, and had the best discriminative performance. Kurtosis of ADCtotal in non-responders was higher than in responders. Kurtosis reflects the peak and heaviness of the tails of ADC distribution values. A higher kurtosis value indicates more heterogeneous internal components and poorer cell differentiation [26,27,28,29]. HCC are known to exhibit substantial tumour heterogeneity, due to prolonged development of the disease. In a previous study [28], kurtosis of ADC values was also found to be higher in triple negative breast cancer (BC) that had significantly more recurrence events on multivariate analysis than in ER-positive subtype BC. Guan et al. [26] found that the normal cervix displays significantly lower ADC kurtosis than cervical cancer due to the homogeneity of normal tissues. Tumours with high heterogeneity generally have poorer prognoses, which may be secondary to a more aggressive biological or treatment resistance [30]. Identification of imaging features that correlate with TTP may potentially serve as a form of non-invasive prediction of TACE outcome and the decision of personalised treatment.

This study had several limitations that need to be acknowledged. Firstly, it was not possible to calculate correlations between different IVIM parameters and the pathology of HCC in our study after treatment with TACE because none of the cases was sufficiently down-staged by local TACE for hepatectomy or liver transplantation. Secondly, the threshold values obtained in our study may be preliminary due to absent unification in using b values. However, adoption of a large number of b values including enough low b values that are less than 200 s/mm2 enable precisely calculated perfusion-sensitive parameters and ADCtotal to be obtained. Thirdly, with regard to patient baseline characteristics, there were statistical differences in hepatitis B virus, the serum level of ALP, albumin and AFP, and tumour size between responders and non-responders, which may potentially affect the patients’ survival. Therefore, it is necessary to predict prognosis based on a combination of multiparametric imaging approach and individual clinical conditions.

In conclusion, we demonstrated that pre-TACE kurtosis of ADCtotal can be used as the best independent predictor of subsequent responses to TACE.