Introduction

Hepatocellular carcinoma (HCC) is the sixth most common malignancy and the fourth leading cause of cancer-related mortality worldwide [1,2,3]. HCC with extrahepatic metastasis (M1 or N1) or locally advanced disease not amenable to locoregional therapy is considered an advanced stage [4,5,6]. Generally, advanced HCC patients demonstrate extremely poor prognosis with a median overall survival (OS) of 6–9 months [2, 7, 8]. Sorafenib, an oral multi-tyrosine kinase inhibitor, is the first regulatory-approved systemic therapeutic agent for advanced HCC patients [2, 8] and has been used as first-line therapy for these patients since 2006 [4,5,6].

Sorafenib targets vascular endothelial growth factor receptors, platelet-derived growth factor receptors, and the Raf family; it has demonstrated significant anticancer effects against HCC [9]. Although sorafenib is a widely used treatment option for advanced HCC, its clinical benefit in terms of OS is modest at only an average of approximately 3 months [8]. Because the clinical response to sorafenib is often variable and unpredictable, reliable biomarkers are urgently needed for predicting its efficacy and safety in candidate advanced HCC patients.

Sorafenib-induced liver injury mainly develops as idiosyncratic hepatotoxicity [10]. Most patients with preserved liver function recover even after the hepatotoxicity. However, in advanced HCC patients whose underlying liver function is impaired, even minor hepatocellular damage could cause irreversible liver damage [11]. Therefore, it is necessary to establish a treatment plan by identifying high-risk patients with impaired liver function who may experience serious adverse events caused by sorafenib-induced liver injury.

Magnetic resonance elastography (MRE) is a non-invasive, magnetic resonance imaging (MRI)-based tool for the quantitative assessment of liver stiffness (LS) [12, 13]. MRE is considered the most accurate non-invasive technique for staging liver fibrosis [13]. Recently, it has been highlighted as a potential radio-omics biomarker for predicting the prognosis of patients with chronic liver disease and HCC [14,15,16]. However, the clinical implications of pretreatment LS by MRE in sorafenib-treated advanced HCC patients have not been explored.

Thus, we aimed to identify the clinical significance of LS by MRE in sorafenib-treated advanced HCC patients. We analyzed whether LS by MRE before sorafenib administration could be an independent biomarker for predicting OS or progression-free survival (PFS). We also analyzed whether pretreatment MRE-assessed LS could predict significant liver injury (grade ≥ 3) after sorafenib administration.

Material and methods

Patient selection

The institutional review board approved this retrospective study and waived the informed consent requirement (AJIRB-MED-MDB-18-484). Between June 2015 and June 2019, 360 advanced HCC patients treated with sorafenib were identified from a prospectively maintained cohort of chronic liver disease patients who underwent MRE. Among them, 101 patients underwent MRE before the first sorafenib treatment. Fourteen patients with unsuccessful MRE acquisition owing to ascites, large abdominal girth, or mechanical failure (n = 8) and loss of MRE data (n = 6) were excluded. Furthermore, we excluded patients with (a) MRE acquired more than 3 months before treatment (n = 31), (b) hepatic parenchyma totally replaced by tumor (n = 5), and (c) prior liver transplantation (n = 1). Finally, 50 patients were included in the analysis (Fig. 1).

Fig. 1
figure 1

Flow chart demonstrating the inclusion and exclusion criteria for patient selection. HCC, hepatocellular carcinoma; MR, magnetic resonance

MRE measurement of LS

MRE was performed as part of the baseline liver MRI using 1.5-T (Signa HDxt; GE Healthcare) and 3-T (Discovery 750w; GE Healthcare) scanners. Hepatic shear wave was induced by 60-Hz acoustic waves transmitted via passive driver and was imaged with motion-sensitized 2D gradient echo and 2D spin echo-echo planar pulse sequences in 1.5-T and 3-T scanners, respectively, in four contiguous image slices. Stiffness maps were generated for each image slice using inversion algorithm provided by a commercially available MRE package (MR Touch, GE Healthcare) [17]. The detailed MRE parameters are provided in Supplementary Table 1.

One abdominal radiologist (4 years of experience in liver MRE) measured the LS using a dedicated post-processing software program (READY View, version 12.3; GE Healthcare). The region of interest (ROI) was manually drawn on the stiffness map bounded by 95% confidence map while avoiding incoherent shear waves, large vessels, and tumors. The ROI was drawn 2–3 cm away from the tumor by referring to anatomic images to exclude tumor-bearing parenchyma [14]. The measured stiffness of four ROIs was averaged to represent the LS of the patient.

Treatment of HCC, assessment of outcomes, and term definition

HCC was diagnosed based on the American Association for the Study of Liver Diseases (AASLD) practice guideline and the European Association for the Study of the Liver guideline [4, 18, 19]. The patients received sorafenib 400 mg twice daily until the occurrence of disease progression or sorafenib-induced adverse events. Following adverse event occurrence, the sorafenib dose was modified or stopped according to the type of adverse event and grade of severity. Multiphasic liver computed tomography was performed every 2–3 months after the initiation of sorafenib administration to evaluate treatment response. All patients with chronic hepatitis B were treated with antiviral agents during the follow-up period based on the guideline [20].

The following medical data were collected: sex, age, pre-existing underlying diseases including diabetes, Child-Pugh class, previous history of HCC treatment, tumor number, tumor size, vascular invasion, extrahepatic metastasis, HCC stage according to the modified Union for International Cancer Control (mUICC) criteria [21], duration of sorafenib treatment, and adverse events during sorafenib treatment (including hand-foot skin reaction, diarrhea, and liver injury). The tumor size was measured by its largest diameter in centimeters. In case of multiple tumors, the size of the largest tumor was measured. The baseline laboratory data collected were platelet count, total bilirubin, albumin, alanine transaminase (ALT), international normalized ratio (INR) of prothrombin time, and alpha-fetoprotein (AFP) levels. Tumor response was evaluated by one of the four attending radiologists according to the modified Response Evaluation Criteria in Solid Tumors [22].

Adverse events associated with sorafenib treatment were monitored using the National Cancer Institute Common Terminology Criteria for Adverse Events, version 4 [23]. The severity of liver injury after sorafenib administration was assessed using the grading system of the Drug-Induced Liver Injury Network [24]. PFS and OS were defined as the time from the initiation of sorafenib administration to disease progression and death from any cause, respectively. The time to termination of sorafenib owing to significant liver injury was defined as the time of sorafenib administration until drug cessation owing to the development of grade ≥ 3 liver injury.

Statistical analysis

Statistical analysis was performed using the Statistical Package for the Social Sciences (SPSS) (version 20.0; SPSS Inc.) and R software package (R, version 3.2.5; R Foundation for Statistical Computing). The Kolmogorov-Smirnov test was performed to test the normality of continuous variables. Continuous variables with normal distribution were presented as mean ± standard deviation, while non-normal variables were reported as median (interquartile range). To compare baseline characteristics between the groups, an independent t test and chi-square test were used for continuous and categorical variables, respectively. For survival analysis, the Kaplan-Meier analysis with log-rank test was performed. Patients were divided into two groups according to the value of MRE-assessed LS (high or low) for the Kaplan-Meier analysis.

Because there was no established clinical information on cutoff values of MRE-assessed LS in advanced HCC, binary cutoff values were determined based on the best Youden’s index on the receiver operating characteristic curve of OS. To identify risk factors associated with poor PFS and OS and shorter time to termination of sorafenib owing to grade ≥ 3 liver injury, univariate and multivariate Cox regression analyses were performed. In multivariate Cox regression analysis, multiple statistical models were derived using various combinations of the variables; the final model was determined, considering the discrimination power by concordance index. P values < 0.05 (two-sided) were considered significant.

Results

Baseline characteristics and treatment outcomes

Table 1 shows the patient baseline characteristics. Study patients (age, 61.9 ± 11.3 years) consisted of 40 men (90%) and 10 women (10%); 39 (78.0%) and 11 (22.0%) had Child-Pugh class A and B status, respectively. Thirty-five (70%) and 15 (30%) patients were categorized into mUICC stages IVa and IVb. Twenty-three patients (46%) experienced diarrhea or hand-foot syndrome or both after sorafenib administration; 8 (16%) experienced grade 3 adverse events.

Table 1 Baseline characteristics and treatment outcomes

ALT levels were elevated in 30 (60%) patients, including 18 (34%) with grade 1 liver injury who showed only a mild, transient increase without constitutional symptoms, which was resolved without interrupting sorafenib. However, 12 (24%) patients with grade ≥ 3 liver injury demonstrated impaired liver function with total bilirubin ≥ 2.5 mg/dL, requiring hospitalization and sorafenib termination. Only conservative care was available for all 12 patients after they developed liver injury because their hepatic function and performance status did not recover subsequently. The mean MRE-assessed LS was 5.48 ± 2.07 kPa. Cutoff value of MRE-assessed LS for poor OS was determined as 7.5 kPa. Among the 50 patients, 43 patients (86%) showed LS ≤ 7.5 kPa, while 7 patients (14%) showed LS > 7.5 kPa.

Risk factors associated with poor OS

OS analysis was performed for all included patients using mortality data obtained from the Central Cancer Registry database of the National Cancer Center. The median OS was 155 days with a range of 8–1435 days. The Kaplan-Meier analysis was performed to identify risk factors associated with poor OS; the curves were constructed (Fig. 2). In the Kaplan-Meier analysis, patients with higher MRE-assessed LS (> 7.5 kPa) demonstrated a significantly poorer OS than those with lower LS (≤ 7.5 kPa) (p = 0.021). In addition to higher MRE-assessed LS, advanced tumor stage (mUICC IVb) was also identified as a risk factor for poor OS (p = 0.002).

Fig. 2
figure 2

Kaplan-Meier analysis comparing overall survival (OS) according to (a) liver stiffness (LS) assessed using magnetic resonance elastography (MRE), (b) Child-Pugh score, (c) modified Union for International Cancer Control (mUICC) stage, and (d) lymph node (LN) metastasis

In the univariate Cox regression analysis, advanced tumor stage (mUICC IVb) and higher MRE-assessed LS (> 7.5 kPa) were identified as significant risk factors for poor OS (Table 2). Multivariate Cox regression analysis was performed with backward stepwise model selection (Table 2). Model 1 included MRE-assessed LS as a continuous variable, whereas model 2 included MRE-assessed LS as a categorical variable using the cutoff value of 7.5 kPa.

Table 2 Univariate and multivariate analyses of variables associated with overall survival (OS) in patients with hepatocellular carcinoma (HCC) treated with sorafenib

In model 1, male sex, advanced age, higher AFP (≥ 400 ng/mL), and advanced tumor stage (mUICC IVb) were identified as independent risk factors in addition to MRE-assessed LS (kPa, hazard ratio (HR) 1.54, 95% confidence interval (CI) 1.23–1.92, p < 0.001). In model 2, higher MRE-assessed LS (> 7.5 kPa, HR 4.06, 95% CI 1.40–11.79, p = 0.010) was also revealed as an independent risk factor for poor OS along with higher AFP (≥ 400 ng/mL), higher ALT (> 40 IU/L), and advanced tumor stage (mUICC IVb). Harrell’s C index values were calculated as 0.746 (standard error (SE) estimate = 0.043) for model 1 and 0.75 (0.045) for model 2, indicating sufficient discrimination power of the model.

Risk factors associated with development of grade ≥ 3 liver injury

Four of the 50 patients were transferred to other hospitals before the sorafenib toxicity assessment. Therefore, the adverse effects of sorafenib were evaluated in 46 patients. Figure 3 shows comparisons of Kaplan-Meier curves for developing significant liver injury between groups divided according to MRE-assessed LS, Child-Pugh score, tumor stage, and serum albumin level. Patients with higher baseline MRE-assessed LS (> 7.5 kPa) or higher Child-Pugh score (≥ 6) showed significantly earlier development of grade ≥ 3 liver injury than those with lower LS or Child-Pugh score (p = 0.001 and p = 0.013, respectively).

Fig. 3
figure 3

Kaplan-Meier analyses comparing the development of grade ≥ 3 liver injury according to (a) liver stiffness (LS) assessed using magnetic resonance elastography (MRE), (b) Child-Pugh score, (c) modified Union for International Cancer Control (mUICC) stage, and (d) serum albumin level

Univariate Cox regression analysis revealed that higher Child-Pugh score (≥ 6) and MRE-assessed LS (> 7.5 kPa) were significant risk factors associated with earlier development of grade ≥ 3 liver injury (Table 3). Both the continuous and categorical variables (≤ 7.5 kPa vs. > 7.5 kPa) of MRE-assessed LS were significantly associated with the development of liver injury in the univariate analysis. For the adjustment of confounding variables, multivariate analysis was performed. In model 1, MRE-assessed LS was included as a continuous variable, whereas it was included as a categorical variable in model 2. In the multivariate model 1, high MRE-assessed LS (kPa, HR 1.62, 95% CI 1.21–2.17, p = 0.001) was identified as the only significant risk factor for predicting the development of grade ≥ 3 liver injury (Table 3).

Table 3 Univariate and multivariate analyses of variables associated with significant liver injury after sorafenib administration in patients with hepatocellular carcinoma (HCC)

Harrell’s C index of model 1 was 0.814, indicating the good discriminatory power of the model. In the Cox model 2, sex (male), Child-Pugh score (≥ 6), and MRE-assessed LS (> 7.5 kPa) were finally selected for the multivariate Cox model. Higher MRE-assessed LS (> 7.5 kPa, HR 10.11, 95% CI 2.41–42.46, p = 0.002) and higher Child-Pugh score (≥ 6, HR 5.09, 95% CI 1.28–20.19, p = 0.021) were identified as independent risk factors for predicting significant liver injury after sorafenib administration. Harrell’s C index of model 2 was calculated as 0.747. The Kaplan-Meier analysis was performed to compare the OS between patients with grade ≤ 2 or without liver injury and patients with grade ≥ 3 liver injury (Supporting Fig. 1). The results indicated that patients with grade ≥ 3 liver injury demonstrated markedly poorer OS than those with grade ≤ 2 or without liver injury (p < 0.001).

Risk factors associated with poor PFS

PFS analysis was conducted in 32 patients following the exclusion of 4 who were transferred to other hospitals; sorafenib was discontinued in 14 patients before the first response evaluation because of grade ≥ 3 adverse events. Figure 4 shows comparisons of Kaplan-Meier survival curves of PFS between groups classified according to pretreatment tumor stage, MRE-assessed LS, serum AFP, and status of lymph node (LN) metastasis. Patients with mUICC stage IVb showed significantly earlier HCC progression than those with mUICC stage IVa in the Kaplan-Meier analysis (p = 0.038).

Fig. 4
figure 4

Kaplan-Meier analyses comparing progression-free survival (PFS) according to (a) modified Union for International Cancer Control (mUICC) stage, (b) Child-Pugh score, (c) serum alpha-fetoprotein (AFP) level, and (d) lymph node (LN) metastasis status

In the univariate and multivariate Cox regression analysis, advanced tumor stage (mUICC IVb, HR 5.70, 95% CI 1.82–17.81, p = 0.001) and presence of LN metastasis (present, HR 2.57, 95% CI 1.07–6.14, p = 0.035) (Table 4) were revealed as independent risk factors for predicting poor PFS. MRE-assessed LS was not a significant predictor in the analysis of PFS.

Table 4 Univariate and multivariate analyses of the variables associated with progression-free survival in patients with hepatocellular carcinoma treated with sorafenib

Correlation analysis between clinical variables and MRE-assessed LS

To determine the correlation between clinical variables and MRE-assessed LS, Pearson correlation coefficients were calculated (Table 5). The MRE-assessed LS showed significant negative and positive correlations with serum albumin and bilirubin (Pearson R = − 0.478 and 0.487, respectively; both p < 0.001) (Fig. 5).

Table 5 Pearson correlation analyses between clinical factors and liver stiffness measured using magnetic resonance elastography (MRE)
Fig. 5
figure 5

Correlation between liver stiffness (LS) assessed using magnetic resonance elastography (MRE) and clinical variables (a albumin and b total bilirubin)

Discussion

In this study, the prognostic implications of MRE-assessed LS were explored in advanced HCC patients treated with sorafenib. A higher pretreatment MRE-assessed LS was strongly associated with poor OS in these patients along with traditional predictors such as advanced tumor stage and high AFP; it was also significantly associated with the development of grade ≥ 3 liver injury after sorafenib administration. To our knowledge, this is the first study to report these associations.

In the present study, 30 (60%) patients experienced liver injury after sorafenib administration. Among them, 18 patients had only transient and mild elevation of ALT, which was normalized without discontinuation of sorafenib. However, 12 (24%) patients demonstrated marked hepatic impairment with elevated total bilirubin > 2.5 mg/dL that required hospitalization and drug termination, leading to a shorter OS than that of the patients with milder liver injury. In clinical trials, including the SHARP and Asia-Pacific trials, severe liver injury after sorafenib administration was reported as an uncommon adverse event [2, 5]. However, in this study, grade ≥ 3 liver injury occurred in 24% of sorafenib-treated patients. Similarly, several real-world studies reported the prevalence of sorafenib-induced severe liver injury as 18–24% [25,26,27]. Ozenne et al [25] reported that 26% of sorafenib-treated patients experienced grade 3 or 4 liver injury. In a retrospective study in Japan, liver failure occurred in 19% of sorafenib-treated HCC patients [26], which suggests that sorafenib-induced severe liver injury is not a rare adverse event and is strongly associated with poor OS of HCC patients. Thus, sorafenib-induced liver injury risk assessment in determining optimum treatment strategies is essential to improve prognosis of advanced HCC patient.

High MRE-assessed LS was identified as a potent independent biomarker for predicting severe liver injury after sorafenib administration. Patients with pretreatment LS > 7.5 kPa demonstrated a more markedly increased risk of severe liver injury than that of patients with LS ≤ 7.5 kPa (HR 10.11). Thus, patients with LS > 7.5 kPa should be monitored very carefully during sorafenib administration with a frequent performance of liver function tests and clinicians should consider sorafenib dose modification to minimize sorafenib-induced liver injury.

Most HCC patients have underlying liver cirrhosis; therefore, hepatic reserve is a high priority in selecting HCC treatment strategies and is strongly associated with post-treatment complications [4, 28]. Thus, extensive efforts have been made to assess hepatic functional reserve. For instance, indocyanine green retention rate at 15 min (ICG 15) has been widely used to assess preoperative hepatic reserve and predict post-hepatectomy liver failure especially in Asian countries [29, 30]. Recently, Lee et al [31] identified MRE-assessed LS as a more potent biomarker for predicting post-hepatectomy liver failure than ICG 15 and other traditional serum markers. Additionally, MRE-assessed LS was negatively correlated with postoperative hepatic regeneration [32]. These findings suggest that LS measured by MRE is a potent biomarker to assess underlying hepatic reserve by reflecting the degree of hepatic fibrosis that restricts the hepatic reserve. Similarly, in our study, MRE-assessed LS was identified as a strong predictor of severe liver injury after sorafenib administration. Majority of sorafenib-induced liver injury is known to be idiosyncratic hepatotoxicity [10], which seems to occur as a transient and mild event that most patients with adequate hepatic reserve and endurance recover from. However, hepatic failure occurs more frequently in patients with insufficient hepatic reserve who cannot endure even minimal hepatocellular damage.

In this study, MRE-assessed LS was significantly correlated with serum bilirubin and albumin, which are assessment components of the Child-Pugh score, the most widely used scoring system for assessing hepatic reserve in clinical practice. Interestingly, MRE-assessed LS demonstrated a much more potent power for predicting OS and severe liver injury than that of the Child-Pugh score or traditional serum markers such as albumin or bilirubin. These results suggest that MRE-assessed LS can predict hepatic reserve more accurately than traditional serum marker–based models. MRE can be easily integrated into routine MRI protocols, and the clinical use of this attractive image-based biomarker would be relatively convenient. In the future, active use of MRE-assessed LS could contribute to developing precision medicine strategies for HCC, which would improve the prognosis of HCC patients.

A major challenge in cancer treatment is predicting therapeutic response to anticancer drugs on a precision basis. MRE has shown a future role in precision medicine for HCC by demonstrating that MRE-assessed tumor stiffness is associated with histopathology feature and treatment response to pembrolizumab [33, 34]. The result seems promising especially in advanced HCC patients with limited options for treatment and whose hepatic parenchyma has been completely replaced by the tumor. In the present study, MRE-assessed LS, but not tumor stiffness, was evaluated as a prognostic marker in sorafenib-treated HCC. Because LS reflected hepatic functional reserve, but not tumor characteristics, it was a significant predictor of OS, but not associated with PFS. With this, further studies investigating the potential association between MRE-assessed tumor stiffness and therapeutic effect of anticancer drug along with PFS or even OS would be an interesting research focus.

Our study has a few limitations. First, there was a possible selection bias because of the retrospective study design. Specifically, a considerable number of patients who did not underwent liver MRI and missed MRE were excluded from the analysis. Second, although we obtained significant results, a relatively small number of patients were included in the study; therefore, further studies with a larger number of patients are warranted. Third, MRE was acquired across field strengths, which may potentially have affected the LS values. However, because LS measurement was shown to be highly reproducible across field strengths, we believe it would have little effect on the results [17, 35]. Finally, the measurement was performed by one reader, which might limit the transferability. However, high inter-reader agreement for LS by MRE was previously reported [17, 35].

Thus, we confirmed the clinical feasibility of applying MRE-assessed LS in predicting the prognosis of advanced HCC patients treated with sorafenib. A high MRE-assessed LS is a predictor of poor OS and is associated with the development of sorafenib-induced significant liver injury. Proactive use of MRE-assessed LS as a prognostic predictor in pretreatment assessment would facilitate the planning of personalized therapeutic strategies for advanced HCC patients.