Introduction

Primary liver cancer, including hepatocellular carcinoma (HCC), intrahepatic cholangiocarcinoma (ICC), and other types, has become the sixth most prevalent cancer and the third leading cause of cancer-related death worldwide [1, 2]. HCC is the most common primary liver cancer and may be cured by a variety of treatment modalities (liver resection, liver transplantation, local ablation, etc.) [3, 4]. Unfortunately, the prognosis of ICC is significantly worse than that of HCC, with radical surgery being the only curative treatment, and extended hepatectomy is required [5, 6]. The treatment strategies and prognosis between HCC and ICC are significantly different, and accurate noninvasive preoperative diagnosis is essential [7].

Contrast-enhanced ultrasound (CEUS) has been widely used for noninvasive diagnosis before treatment. Typical features of CEUS in ICC include rim enhancement, early washout time (<  60 seconds), and marked washout [8]. However, the presence of early washout time in some poorly differentiated hepatocellular carcinomas (P-HCCs) can lead to overlapping CEUS features of P-HCC and ICC, thereby making accurate identification challenging [9, 10].

In recent years, Sonazoid® has been launched in some countries as a contrast agent with some differences from other contrast agents. One characteristic of Sonazoid® is its unique Kupffer phase (KP), which enables continuous scanning without microbubbles being destroyed [2, 11]. Sugimoto K, et al. [12] demonstrated that the perfusion performance in KP was closely related to the pathological grade of HCC, suggesting that KP perfusion may serve as an independent and effective predictor for differentiating between well-differentiated HCC, moderately differentiated HCC, or P-HCC [13, 14].

There are few reports on the preoperative diagnosis of ICC and P-HCC by Sonazoid contrast-enhanced ultrasound (SCEUS). Therefore, we aimed to develop a nomogram using SCEUS and clinical features to improve diagnostic accuracy in distinguishing between ICC and P-HCC.

Material and methods

Study population

Using data from a prospective, multicenter study designed for sample collection (Clinical Trials.gov identifier: NCT04563897), we retrospectively analyzed SCEUS scans and clinical data from a total of 90 adults who were diagnosed with ICC and P-HCC. This study included 16 medical institutions (Supplement Table 1) and was approved by the Institutional Review Boards at each center (S2020-300-01). All participants provided written informed consent. The research protocol, which included the eligibility criteria and standardized data access procedures, was implemented consistently across all participating institutions. The flowchart of the study population selection process is shown in Fig. 1.

Fig. 1
figure 1

Flowchart of the study population selection process

Clinicopathologic information

Clinical data were collected, including age, sex, and serum markers. The target lesions were diagnosed histopathologically (surgery, n = 26; biopsy, n = 64). Histological diagnosis was made by at least two experienced pathologists according to the World Health Organization criteria [15].

Sonazoid contrast-enhanced ultrasound

Gray-scale ultrasound and contrast-enhanced ultrasound were performed in medical institutions using ultrasound equipment with contrast-enhancing software (GE Healthcare, Philips, Siemens, Mindray). The abdominal probe (frequency 2-6 MHz) was used, and the mechanical index ranged from 0.16 to 0.21. The second-generation contrast agent Sonazoid (GE Healthcare AS, Oslo, Norway), a lipid-encapsulated perfluorobutane microbubble, was injected at a dose of 0.015 ml/kg through the antecubital vein, followed immediately by a flush of 5 mL of 0.9% normal saline solution. The largest lesion was observed continuously, and videos of arterial phases (0-30 s) and portal phases (31–120 s) were stored. After vascular phases (120 s), the first 10 s of intermittent imaging was stored every minute until 10 min or the disappearance of the liver parenchyma contrast agent (for evaluating KP) [16]. All the images were stored on hard drives for offline analysis.

Analysis of ultrasound and Sonazoid contrast-enhanced ultrasound features

Ultrasound and SCEUS images were retrospectively reviewed by two sonographers who had more than six years of experience in abdominal ultrasound imaging, independently. All lesions were assessed and classified according to CEUS LI-RADS version 2017 [17]. If there was any discordance, the images were rereviewed, and a consensus was reached by discussion.

The SCEUS features of the lesion were characterized as follows: (1) the echo and size (mm) of the lesion;(2) enhancement levels in the AP ( hypo-enhancement /iso-enhancement /hyper-enhancement); (3) enhancement pattern in the AP(Rim enhancement and mosaic architecture [18, 19]); (4)washout time(≤  45 s) [20, 21]; (5) critical features according to CEUS LI-RADS version 2017, including AP hyper-enhancement and late and mild washout (in contrast to the liver in the portal venous phase, the lesion showed hypo-enhancement within 60 seconds of contrast injection and no apparent hypo-enhancement or contrast defect within 2 minutes of contrast injection) [16, 22]; and (6) KP hypo-enhancement (lesions with low enhancement compared to the liver in KP) or KP defect (lesions with similar non-enhancement compared to the liver in KP) [20]. If two or more lesions were present in a single patient, the histologically confirmed lesion or the largest lesion was selected as the target lesion.

Gray-scale analysis of Kupffer phase images by ImageJ

KP images were exported to a personal computer in JPEG format and analyzed using ImageJ version 1.47 software (National Institutes of Health, Bethesda, MD). Two sonographers freehand outlined the lesion, which was marked as a region of interest (ROI)1 and the surrounding liver parenchyma as ROI2. The shape, size, and depth of ROI2 are consistent with those of ROI1. Then, the ROI Manager mode was used for a gray analysis of the outlines. The mean gray value, modal gray value, minimum gray value, maximum gray value, and standard deviation of the gray value of the lesions were measured [23]. The measurement was repeated five times, and the average gray value was calculated. The average mean gray value was then used to calculate the gray value ratio (gray value ratio=average mean gray value of liver parenchyma/average mean gray value of lesion).

Development and validation of a nomogram to distinguish between ICC and P-HCC

Multivariate logistic regression and LASSO regression were used to filter variables, and a nomogram model was constructed to predict ICC. The Hosmer-Lemeshow test was used to assess the goodness of fit of the model. A receiver operating characteristic (ROC) curve, the area under the ROC curve (AUC), the concordance index (C-statistic), and the calibration curve were used to evaluate the predictive accuracy and consistency of the model. Discrimination and calibration were assessed by 1000 bootstrapping validations and internally validated by 400 times repeated 3-fold cross-validation. Decision curve analysis (DCA) reflects the model’s net benefit to the patient.

Statistical analysis

All analyses were performed using IBM SPSS Statistics 26 for Windows (IBM Corp, Armonk, NY, USA) , R software (version 4.2.1), and MedCalc version 9.0 software (MedCalc Software, Mariakerke, Belgium). The differences in clinical and SCEUS features were compared between ICC and P-HCC using independent samples t -tests, chi-square tests, Fisher’s exact tests, or Mann–Whitney U-tests. Interobserver agreement of features between two sonographers was assessed using Cohens kappa coefficients and 95% confidence intervals. Univariate and multivariate regression analyses were performed via SPSS 26.0. ICC was predicted using binary logistic regression analysis. ROC curve analysis was drawn via MedCalc. The McNemar test was used to compare the clinical usefulness of each diagnostic method. The difference was statistically significant when the two-tailed P-value was <  0.05.

Via R software, the “glmnet” package was used for LASSO regression, the “rms” package was used to plot the nomogram, the "pec" package was used to construct the calibration curves, the "caret" package was used for bootstrapping validation and k-fold cross-validation, and clinical decision curves were constructed using the “ggDCA” package.

Results

Patient characteristics

The baseline characteristics of the clinical and ultrasound images are shown in Table 1, including 41 ICC and 49 P-HCC. A total of 13 patients showed hyperechoic lesions, including 11 patients (11/13) in the P-HCC group and two patients (2/13) in the ICC group. Of the 13 hyperechoic lesions, 12 had hepatitis.

Table 1 Patients clinical and ultrasound features

As shown in Table 2, among the SCEUS features, there were significant differences between the two groups of patients in the washout time (whether ≤ 45 s) and the enhancement level in the KP. In addition, the consistency of subjective assessment of the enhancement level in the KP between the two sonographers was high, with a kappa-value of 0.773.

Table 2 SCEUS features in ICC and P-HCC

Gray-scale analysis of Kupffer phase images by ImageJ

The gray-scale analysis is shown in Supplemental Table 2. The cutoff value of the mean gray value between ICC and P-HCC was 24.92, and the cutoff value of the modal gray value, minimum gray value, and gray value ratio were 10.00, 0.00, and 2.24, respectively. These values were significantly different between ICC and P-HCC (P <  0.05).

Multivariate regression and LASSO regression analysis

Patient features with P < 0.05 in the univariate analysis were incorporated into the multivariate logistic regression model, and the independent influencing factors were determined via ENTER, as shown in Table 3. Meanwhile, LASSO regression was applied to solve the multicollinearity relationships of all features, and the coefficient of each variable was generated (Fig. 2). We screened out four variables with the optimal lambda (λ = 0.1095613). These variables were enhancement level in the KP (0.9029231), hepatitis (− 0.3887257), AFP level (> 20 µg/L) (− 0.6085682), washout time (≤ 45 s) (0.6220081), and multivariate logistic regression analysis with P < 0.05.

Table 3 Univariate and Multivariate Analysis of clinical and SCEUS features
Fig. 2
figure 2

Clinical and ultrasound feature selection with least absolute shrinkage and selection operator (LASSO) regression. A coefficient profile plot was produced against the log(lambda) sequence (a). Four variables with nonzero coefficients were selected by the optimal lambda. By verifying the optimal parameter (lambda) in the LASSO model, the partial likelihood deviance (binomial deviance) curve was plotted versus log(lambda), and dotted vertical lines were drawn based on 1 standard error criterion (b)

Development and validation of the nomogram

The following model was constructed based on the independent variables selected by multivariate regression analysis and LASSO regression: Predicted value = − 5.399 + 2.404Age − 1.023Hepatitis-1.540AFP+1.452washout time +2.801 Enhancement level in the KP (age> 51 y, value 1; hepatitis, value 1; AFP> 20 µg/L, value 1; washout time≤ 45s, value 1; enhancement level in the KP (defect), value 1).

Therefore, we constructed a nomogram (Fig. 3). The cutoff value of the nomogram was 0.430. When it was more significant than 0.430, the diagnosis was ICC (Fig. 4), and vice versa, the diagnosis was P-HCC (Fig. 5). Using this nomogram, the C-statistic was 0.930 (95% CI: 0.856-0.973) (Table 4).

Fig. 3
figure 3

A nomogram was developed with predictors including age, hepatitis, AFP level, washout time, and KP enhancement level. Draw a vertical straight line from the variable value to the axis labeled “Points”. Then calculate five variables’ points. The total points on the bottom scales that correspond to the predicted value are shown

Fig. 4
figure 4

ICC in a 60-year-old patient. a Gray-scale ultrasound showed a hypoechoic nodule (arrow) in liver S8, with a maximum diameter of approximately 2.0 cm. The edge was blurred, and the morphology was irregular. b Hyper-enhancement (arrow) in the AP (30 s) after injection of the Sonazoid contrast agent. c At 108 s after injection, the lesion began to wash out (arrow). d After 10 minutes (Kupffer phase), the lesion (arrow) showed marked washout. According to the LI-RADS CUES, the lesion was classified as LI-5. According to the nomogram, the risk value was 0.830(> 0.430); this lesion was predicated to be ICC. The pathological diagnosis was ICC

Fig. 5
figure 5

P-HCC in a 33-year-old patient. a Gray-scale ultrasound showed a hypoechoic lesion(arrow) at the junction of liver S8 and S5, with a maximum diameter of approximately 3.3cm.The edge was clear and the morphology was regular. b Hyper-enhancement (arrow) in the early AP (16 s) after injection of the Sonazoid contrast agent. c At 48 s after injection, the lesion began to wash out (arrow). d After 10 minutes (Kupffer phase), the lesion (arrow) shows mild washout. According to the LI-RADS CUES, the lesion was classified as LI-M. According to the nomogram, the risk value was 0.024(< 0.430); this lesion was predicated to be P-HCC. The pathological diagnosis was P-HCC

Table 4 Diagnostic efficacy of different methods

The Hosmer-Lemeshow goodness-of-fit test showed good fit in the cohort (P = 0.437). The calibration curve (Fig. 6a) showed that the predicted incidence was more consistent with the actual incidence of ICC, and 400 times repeated 3-fold cross-validation revealed good discrimination with a mean AUC of 0.851. Decision curve analysis showed that the nomogram could increase the net benefit to the patient (Fig. 6b).

Fig. 6
figure 6

Calibration curves (a) of the nomogram prediction. The y-axis indicates the actual diagnosed ICC. The x-axis indicates the predicted risk of ICC. The diagonal dotted line indicates a perfect prediction by an ideal model. The solid line represents the performance of the cohort, which indicates that a closer fit to the diagonal dotted line represents a better prediction. Decision curve analysis (b) showed that it would be more accurate to use this nomogram to predict the risk of ICC. The quantified net benefits can be measured at different threshold probabilities. The y-axis denotes the standardized net benefit, and the x-axis indicated the risk threshold probabilities. The orange line represents the nomogram, the blue dotted line represents the condition that all patients have ICC, and the pink dotted line represents the condition that none have ICC

Sonographers’ Diagnosis and CEUS LI-RADS Categories

According to the sonographers’ diagnosis given by each center, only 17 ICCs were correctly diagnosed, 14 ICCs were diagnosed as HCC, 8 ICCs were diagnosed as metastasis, and 2 ICCs were diagnosed as benign lesions. However, among 49 P-HCCs, 48 P-HCCs were correctly diagnosed as HCC.

According to CEUS LI-RADS, 10 (24.39%) ICCs were finally classified as non-LI-M, and 17 (34.69%) P-HCCs were classified as LI-M (Supplement Table 3). The classification of LI-M according to the CEUS LI-RADS criteria by two sonographers had a kappa-value of 0.608.

The nomogram was compared with the above methods, and the validity of the nomogram was better than that of the sonographers’ diagnosis and CEUS LI-RADS categories (P <  0.001), as shown in Table 4. The diagnostic performance of ICC is shown in Fig. 7.

Fig. 7
figure 7

Discrimination of the nomogram was evaluated by the ROC curve. The y-axis indicates the true-positive rate of the risk prediction. The x-axis indicates the false-positive rate of the risk prediction. The orange line represents the performance of the nomogram. AUC=0.930 which is equal to the c-statistic

Discussion

In this study, we report the ability to differentiate ICC from P-HCC by developing a nomogram model that uses clinical and SCEUS features. Compared to sonographers’ diagnosis and CEUS LI-RADS categories, our model showed a higher sensitivity and improved accuracy in identifying ICC from P-HCC with atypical vascular phase; it demonstrated higher predictive performance (AUC = 0.930); and the accuracy rate reached 88.9%. Notably, the innovation was that the model was based on clinical and SCEUS features for noninvasive differentiation of difficult-to-identify lesions prior to treatment.

Sonazoid® can be used to detect the presence of Kupffer cells, since these microbubbles are easily taken up by Kupffer cells [24].Therefore, it has an additional Kupffer phase (KP), which starts approximately 10 min postinjection, when the microbubbles have been eliminated from the blood pool [25, 26]. Previous studies [27, 28] have demonstrated that the enhancement level in KP varies in benign and malignant lesions and could differentiate HCC grades. Kupffer cells rarely or almost never exist in malignant lesions, making the contrast intensity different between lesions during the Kupffer phase. Malignant lesions showed hypo-enhancement in KP, while non-HCC malignant lesions were more likely to show a cavity-like appearance in the KP with approximately no enhancement [26, 29]. In 2020, Sugimoto, K. [30] proposed a modified CEUS LI-RADS for Sonazoid, which included the enhancement level in the KP and improved the accuracy of diagnosing focal liver lesions. However, the KP enhancement level was not further classified into the LI-5 and LI-M classification criteria. Our study demonstrated that the enhancement level (hypo-enhancement /defect) in the KP, as subjectively assessed by sonographers, was an independent risk factor that could affect the differential diagnosis of P-HCC and ICC, OR: 10.726 (95% CI 1.471-78.204), P=0.019. In our cohort, 33 (80.49%) ICCs exhibit KP defects, which could initially distinguish them from P-HCC, with an AUC of 0.780 (95% CI 0.681-0.879). This finding was confirmed in the gray-scale analysis using ImageJ software (Supplement Table 2), which showed a significant difference in the mean gray value of ICC and P-HCC in the KP (= 0.004).

Several studies have reported that certain ultrasound features, such as bile duct dilatation or cholangiolithiasis, irregular rim enhancement, and early washout time, were found to have diagnostic value for ICC [31, 32]. However, in our study, there were no significant differences in gray-scale ultrasound features, arterial phase enhancement patterns, and rim enhancement between P-HCC and ICC, which was partially inconsistent with Yuan M's study [32]. This may be attributed to the different patients enrolled. Furthermore, it is noteworthy that some ICCs do not exhibit typical signs such as bile duct dilatation, and these patients may have a history of hepatitis, leading to subjective misclassification of them as HCCs by sonographers. However, sonographers could accurately diagnose the ICC with typical CEUS features, resulting in a high specificity of 97.96%.

According to the vascular phase (<  120 s), the CEUS LI-RADS categories was evaluated, and the sensitivity and specificity of classifying ICC as LI-M were 75.6% and 65.3%, respectively. These were slightly lower than those reported by Zheng et al. [33], who found the sensitivity and specificity to classify the ICC as LI-M to be 89% and 88%, respectively. Our analysis revealed that some P-HCCs were difficult to distinguish from ICCs in the vascular phase, with high enhancement in the arterial phase and the early washout time (<  60 s), which may lead to misclassification by the CEUS LI-RADS categories. We also found that P-HCC and ICC could be better discriminated (P <  0.001) when the early washout time was defined as 45 seconds.

Given the similarity of the vascular phase enhancement patterns between some P-HCCs and ICCs, the accuracy of sonographers' diagnosis may be compromised. To address this issue, we developed an ICC-predicted nomogram that is comprised of five risk factors. The AUC of the nomogram was 0.930, which was higher than the sonographers’ diagnosis and CEUS LI-RADS categories. The 400 times repeated 3-fold cross-validation showed that the mean AUC was 0.851, which indicates a relatively reliable result. This approach may provide a new diagnostic method for tumors that are challenging to differentiate by gray-scale ultrasound and SonoVue CEUS. With the ability to diagnose more ICC, more appropriate treatment options can be further selected to improve the overall prognosis.

Nevertheless, our study has some limitations. While it is a multicenter study, the number of cases of P-HCC and ICC was small. Further validation is necessary to indicate the model’s general applicability and improve it if necessary. Additionally, some tumor specimens were obtained by biopsy. Due to the heterogeneity of tumors, combined hepatocellular carcinoma may not be ruled out. But owing to its low incidence, it was included in this article.

Conclusion

This study found that clinical indicators such as age, hepatitis, and AFP were helpful in the diagnosis of ICC. The enhancement level in the KP and early washout time (≤  45 s) were identified as independent risk factors for differentiating ICC from P-HCC. The nomogram constructed based on SCEUS and clinical features has the potential to noninvasively diagnose the ICC before surgery and may provide some support for clinical treatment decision-making.