Introduction

Hepatic steatosis is characterized by the accumulation of triacylglycerol-rich macrovesicular and/or microvesicular lipid droplets within the hepatocytes [1, 2]. Hepatic steatosis or fatty liver is defined as intrahepatic triacylglycerol of at least 5% of liver weight or 5% of hepatocytes containing lipid vacuoles [3, 4]. Non-alcoholic fatty liver disease (NAFLD) is becoming increasingly prevalent along with obesity and type 2 diabetes mellitus, and it is now the most common cause of chronic liver disease worldwide. The symptoms of NAFLD vary widely, from simple steatosis to non-alcoholic steatohepatitis [5, 6]. Untreated NAFLD may progress to more damaging conditions, including cirrhosis, hepatocellular carcinoma, and liver failure [7,8,9,10]. Controlled attenuation parameter (CAP), measured via transient elastography, has recently been endorsed as a noninvasive method for the detection and quantification of steatosis [11]. CAP has excellent diagnostic accuracy for diagnosing hepatic steatosis in patients with chronic liver disease [12,13,14]. Meanwhile, conventional abdominal B-mode ultrasound has been widely used to diagnose fatty liver (bright liver, hepato-renal contrast, deep attenuation, or vessel blurring). B-mode ultrasound has been shown to have an acceptable diagnostic accuracy of > 15–20% for detecting hepatic steatosis [15, 16]. However, its usefulness in identifying steatosis of ≥ 5% remains unclear.

A quantification method using attenuation coefficients has recently been utilized in B-mode ultrasound [17, 18]. Given that these B-mode imaging-guided attenuation coefficient methods are relatively new, only a few validation studies have been conducted. Thus, the diagnostic capability of B-mode ultrasound for detecting steatosis of ≥ 5% is controversial. There has been no evidence of accurately diagnosing steatosis of ≥ 5% using B-mode ultrasound. This meta-analysis aimed to comprehensively assess the diagnostic performance of B-mode ultrasound imaging for evaluating steatosis of ≥ 5%.

Materials and methods

Ethics

This study was performed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-analyses of Diagnostic Test Accuracy Studies (PRISMA-DTA) [19]. The review was registered in the International Prospective Register of Systematic Reviews (PROSPERO: CRD42021255110).

Literature search

Studies on the diagnostic characteristics of B-mode ultrasound for quantifying liver steatosis with histology as the reference standard were extracted from the PubMed, Embase, and Web of Science databases until May 15, 2021. Only English-language manuscripts published or accepted by a peer-reviewed journal between January 2000 and May 2021 were included in the meta-analysis. Study results published only in abstract form were not considered. The final query for PubMed was: “(fatty liver) AND ((ultrasound) AND (attenuation)) NOT (Children)) NOT (review)) AND ((biopsy)” [All Fields] AND 2000/1/1:2021/5/15[Date–Entry]. An analogous query was used for Web of Science.

Selection criteria

Eligible studies were identified according to the following criteria: (1) the diagnostic accuracy for steatosis (≥ 5%) in adults with chronic liver disease; (2) liver biopsy was performed; (3) sufficient data were provided to calculate the true positive (TP), false-positive (FP), false-negative (FN), and true-negative (TN) rates of ultrasound findings for hepatic steatosis of ≥ 5%; and (4) at least 50 patients were evaluated to obtain good reliability. Only articles available in full text and written in English were included. Duplicate publication, animal studies, and ex vivo studies were excluded.

Data extraction and quality assessment

Two reviewers independently extracted the data and evaluated the quality of the included studies, with disagreements resolved by consensus. The following data were retrieved: first author, publication year, location, study design, technique of ultrasound, the number of patients, age, sex, body mass index (BMI), and the duration between ultrasound and liver biopsy. TP, FP, FN, and TN were directly extracted or calculated. The quality of the studies was assessed by two reviewers according to the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool [20].

Statistical analysis

The diagnostic accuracy of B-mode ultrasound attenuation was evaluated according to the summary sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), positive predictive value (PPV), negative predictive value (NPV), and diagnostic odds ratio (DOR) with corresponding 95% confidence intervals (CI) calculated using the bivariate random-effects model. Then, the hierarchical summary receiver-operating characteristic (HSROC) curve and the area under the curve (AUC) were calculated. Heterogeneity was evaluated using the Cochrane Q-test and the Higgins inconsistency index (I2), with p < 0.05 or I2 > 50% considered to indicate substantial heterogeneity [21, 22].

Sensitivity analysis was restricted to patients with chronic viral liver disease. Univariate meta-regression analysis and subgroup analysis were also performed to explore possible sources of heterogeneity. The covariates included the following: (1) measurement technique (attenuation coefficient vs. conventional B-mode findings), (2) number of patients (≥ 150 vs. < 150), (3) BMI (≥ 26 vs. < 26 kg/m2), (4) publication date (2000–2009 vs. 2010–2021), and (5) the duration between liver biopsy and ultrasound (≥ 30 vs. < 30 days). The findings of steatosis observed using conventional B-mode ultrasound were defined as follows: bright liver [23], hepato-renal contrast [23], deep attenuation [23], or vascular blurring [23]. Fagan plots were used to assess the clinical utility of ultrasound findings for diagnosing hepatic steatosis. Publication bias was assessed using Deeks’ funnel plot, with a p value of < 0.1 for the slope coefficient suggesting significant asymmetry. All statistical analyses were performed using the “midas” and “metandi” modules of Stata version 15.0 (Stata Corp, College Station, TX, USA).

Results

Search results and study characteristics

A total of 304 articles were initially analyzed. After removing 40 duplicates and 264 irrelevant studies, 43 studies were further evaluated. Of these, 24 studies were excluded after a full-text review, because they were of undesirable article type, were not diagnostic accuracy studies, were not relevant to chronic liver disease, included a small sample size (< 50 participants), had insufficient data (TP, FP, TN, and FN not reported or could not be calculated), and not written in the English language. Ultimately, 19 articles involving 3240 patients were analyzed [24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42]. The study selection flowchart is presented in Fig. 1. The detailed characteristics of the 19 studies are summarized in Table 1.

Fig. 1
figure 1

Study selection flowchart

Table 1 Characteristics of studies that evaluated the diagnostic performance of B-mode ultrasound and attenuation coefficients for mild steatosis

The 19 original articles included eight prospective studies, ten retrospective studies, and one cross-sectional study. The quality assessment results are presented in Fig. 2. Risk-of-bias analysis showed that four studies were high risk for flow and timing because of the long duration between ultrasound and liver biopsy. Most studies were identified as low risk for bias and applicability concerns, with all of the studies satisfying four or more of the seven total domains. Deeks’ plot showed that there was no potential publication bias (p = 0.68 for all patients, p = 0.24 for the patients with attenuation coefficients, and p = 0.97 for the patients with conventional B-mode ultrasound) (Fig. 3).

Fig. 2
figure 2

Quality assessment of the included studies according to Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) criteria

Fig. 3
figure 3

Deeks’ funnel plot asymmetry test for publication bias. a All patients. b Attenuation coefficients. c Conventional B-mode ultrasound

Diagnostic accuracy of B-mode ultrasound for the detection of mild steatosis

In all patients, the pooled sensitivity and specificity of B-mode ultrasound for detecting S1 were 0.70 (95% CI 0.63–0.77) and 0.86 (95% CI 0.82–0.89), respectively (Table 2). The pooled PLR, NLR, and DOR were 4.90 (95% CI 3.69–6.51), 0.35 (95% CI 0.27– 0.44), and 14.1 (95% CI 8.7–23.0), respectively. Figure 4a shows the HSROC curve, with an AUC of 0.87 (95% CI 0.82–0.91).

Table 2 Summary of the diagnostic accuracy and the post-test probabilities of B-mode ultrasound and attenuation coefficients for mild steatosis (5%)
Fig. 4
figure 4

Hierarchical summary receiver-operating characteristic (HSROC) curve for detecting mild steatosis. a All patients. b Attenuation coefficients. c Conventional B-mode ultrasound. d Conventional B-mode ultrasound with non-quantification methods

Meanwhile, the diagnostic accuracies according to the attenuation coefficient and conventional B-mode findings were as follows. The attenuation coefficient group showed better diagnostic accuracy (sensitivity, specificity, PPV, NPV, PLR, NLR, DOR, and AUC) than the conventional B-mode findings group (Table 2; Fig. 4b, c). In the conventional B-mode ultrasound group, semi-quantification methods were performed to diagnose the steatosis grade in three studies [36, 37, 40]. By quantifying the level of echogenicity, the hepato-renal contrast index was calculated. Thus, we analyzed the impact of the exclusion of semi-quantification methods on the diagnostic value of conventional B-mode ultrasound. Conventional B-mode findings that included only non-quantification methods had low diagnostic value (Table 2, Fig. 4d). The forest plots are presented in Figs. 5 and 6.

Fig. 5
figure 5

Sensitivity and specificity forest plots of attenuation coefficients for diagnosing mild steatosis

Fig. 6
figure 6

Sensitivity and specificity forest plots of conventional B-mode ultrasound findings for diagnosing mild steatosis

No heterogeneity was found in the attenuation coefficient group (Fig. 5). The pooled sensitivity and specificity of attenuation coefficients for detecting mild steatosis were 0.75 (95% CI 0.70–0.79) and 0.86 (95% CI 0.82–0.90), respectively (Fig. 5). The pooled sensitivity and specificity of conventional B-mode ultrasound for detecting mild steatosis were 0.69 (95% CI 0.57–0.79) and 0.85 (95% CI 0.79–0.89). There were heterogeneities in both sensitivity and specificity in the conventional B-mode ultrasound group (Fig. 6).

Clinical utility of B-mode ultrasound for hepatic steatosis in suspected fatty liver patients

Among patients with suspected hepatic steatosis, the PLR and NLR in those with S1 steatosis were 4.90 and 0.35, respectively (Fagan plot analysis). In this subset of patients with a 25% pre-test probability (based on clinical suspicion), a positive B-mode finding achieved a 62% probability of correct diagnosis, and a negative B-mode value achieved a 10% probability of incorrect diagnosis. When the pre-test probability (based on clinical suspicion) was set to 50%, a positive B-mode finding yielded an 83% probability of correct diagnosis, and a negative B-mode result yielded a 26% probability of incorrect diagnosis. When the pre-test probability (based on clinical suspicion) was set to 75%, a positive B-mode value showed a 94% probability of correct diagnosis, and a negative B-mode value showed a 51% probability of incorrect diagnosis.

In patients with suspected hepatic steatosis based on attenuation coefficients, the PLR and NLR in those with S1 steatosis were 5.66 and 0.28, respectively (Fagan plot). In this subset of patients with a 25% pre-test probability (based on clinical suspicion), a positive B-mode finding had a 64% probability of correct diagnosis, and a negative B-mode value revealed a 9% probability of incorrect diagnosis. When the pre-test probability (based on clinical suspicion) was set to 50%, a positive B-mode finding yielded an 84% probability of correct diagnosis, and a negative B-mode result yielded a 23% probability of incorrect diagnosis. When the pre-test probability (based on clinical suspicion) was set to 75%, a positive B-mode value showed a 94% probability of correct diagnosis, and a negative B-mode value showed a 47% probability of incorrect diagnosis.

In patients suspected of having hepatic steatosis based on B-mode ultrasound findings, the PLR and NLR among those with S1 steatosis were 4.48 and 0.37, respectively (Fagan plot). In this subset of patients with 25% pre-test probability (based on clinical suspicion), a positive B-mode finding revealed a 62% probability of correct diagnosis, and a negative B-mode value revealed a 11% probability of incorrect diagnosis. When the pre-test probability (based on clinical suspicion) was set to 50%, a positive B-mode finding yielded an 83% probability of correct diagnosis, and a negative B-mode finding yielded a 26% probability of incorrect diagnosis. When the pre-test probability (based on clinical suspicion) was set to 75%, a positive B-mode value showed a 94% probability of correct diagnosis, and a negative B-mode value showed a 52% probability of incorrect diagnosis.

Results of meta-regression and subgroup analysis

Univariate meta-regression showed that the sample size, method of ultrasound, BMI, publish year, and duration between liver biopsy and ultrasound were associated with the heterogeneity. B-mode ultrasound showed better performance in the diagnosis of a large sample size, attenuation coefficient, BMI > 25 kg/m2, recently reported paper, and short time between biopsy and ultrasound. The details of the subgroup analysis are presented in Table 3.

Table 3 Results of subgroup analysis of attenuation coefficients for grading steatosis

Discussion

The accuracy of attenuation coefficients and B-mode ultrasound for diagnosing hepatic steatosis is yet to be fully elucidated. The results of this meta-analysis indicate that B-mode ultrasound performed using current techniques has a good accuracy for detecting 5% steatosis (S1), with the AUC reaching 87%. In particular, when attenuation coefficients are utilized, the AUC for the diagnosis of S1 reached 89%, while the AUC was 0.80 for conventional B-mode ultrasound.

Hepatic steatosis is defined as intrahepatic fat of at least 5% of hepatocytes. The degree of steatosis in liver biopsies is usually assessed through a morphological semiquantitative approach in which the histopathologist uses a four-grade scale. However, Franzen et al. reported that semiquantitative evaluation overestimates the degree of steatosis in liver biopsies, with 5% of hepatocytes corresponding to only a small percentage of the total area. This meta-analysis primarily investigated whether a small percentage of fat area in tissue specimens could be diagnosed using current ultrasound imaging methods. CAP, which is based on the properties of ultrasonic signals acquired using transient elastography, is currently widely used to diagnose hepatic steatosis [11]. Some meta-analysis studies have reported that CAP is a promising noninvasive test for diagnosing mild hepatic steatosis [14, 43]. Petroff et al. recently reported the diagnostic accuracy of CAP in 2346 patients with mild steatosis [14]. CAP was found to be a reliable method to distinguish S0 from S1–3 in patients with NAFLD, with an AUC of 0.807, sensitivity of 0.790, and specificity of 0.740.

Apart from CAP, B-mode ultrasound findings and attenuation parameters have also been used to diagnose hepatic steatosis. A meta-analysis by Hernaez et al. [44] showed that B-mode findings allowed for reliable and accurate detection of moderate-to-severe steatosis (S2 and S3). B-mode ultrasound images can be improved. For example, new ultrasound machines have lower signal attenuation and generate clearer intrahepatic echo images. However, the diagnostic accuracy of B-mode ultrasound for mild steatosis (S1) has not been established. Although there have been studies on its diagnostic capability for mild steatosis [34,35,36, 39], to our knowledge, no meta-analysis has been performed to analyze attenuation coefficients guided by B-mode ultrasound. Previous studies also had a small sample size [24,25,26,27,28,29,30, 38].

In this study, the AUC in the total population was higher than that in previous CAP reports [14]. Despite using the same principle, attenuation coefficients guided by B-mode ultrasound had an even better AUC than CAP. The attenuation coefficients guided by B-mode can be measured while checking the B-mode image. Given that measurement by TE is based on M- and A-mode imaging, it is influenced by the respiratory status, size of the liver, and experience of the operator. It has already been reported that the accuracy of stiffness measurement is higher when B-mode is utilized [45]. The same applies to the attenuation coefficients guided by B-mode ultrasound.

Previous studies have shown an NAFLD prevalence rate of approximately 25% [23, 46, 47]. In this study, post-test probabilities were calculated using the Fagan plot. When the pre-test probability was set to 25%, B-mode findings showed high accuracy for ruling out fatty liver. This indicates the importance of paying attention to patients with a low level of attenuation coefficients or without conventional B-mode findings. Meanwhile, if obese patients are evaluated, positive findings are crucial as most of them have fatty liver. Patients with positive B-mode findings can be considered to have fatty liver. Interestingly, regarding the diagnosis of mild steatosis, B-mode findings had diagnostic accuracy inferior to that of attenuation coefficients. However, the post-test probability of B-mode was not inferior to that of attenuation coefficients. This may be the rationale for the recent improvements in ultrasound machines, and these improvements may indicate that the conventional B-mode ultrasound may also be useful in obese patients.

Meanwhile, HSROC analysis showed a wide variability in B-mode ultrasound. Forest plots indicated heterogeneity for both sensitivity and specificity in the conventional B-mode ultrasound. In subgroup analysis, a large sample size, attenuation coefficients, BMI < 25 kg/m2, recent publication year (from 2010), and duration between biopsy and ultrasound were better for diagnostic accuracy when mild steatosis was diagnosed. Although funnel plots showed no publication bias, clinical studies have to include a large sample size. Enrolling patients with BMI < 25 kg/m2 may lower the proportion of those with fatty liver. Moreover, when older studies were analyzed, B-mode ultrasound had insufficient diagnostic performance. Attenuation coefficients had better diagnostic value than conventional B-mode ultrasound with non-quantification methods. Conventional B-mode ultrasound had lower pooled sensitivity, indicating that it has insufficient diagnostic capability for ruling out S0. Thus, attenuation coefficients should be used to distinguish S0 from S1–3, with this method being reliable.

This study had some limitations. First, there have been only a few reported studies on attenuation coefficients guided by B-mode ultrasound to date. More high-quality studies are needed to establish the diagnostic value of attenuation coefficients. Second, there were few recent studies that analyzed conventional B-mode. We will perform another meta-analysis when more studies become available. Third, a few clinical trials have compared attenuation coefficients with other tests (CAP or magnetic resonance imaging). Comparative studies are needed to establish the accuracy and usefulness of attenuation coefficients. Fourth, the diagnostic ability of attenuation coefficients was not evaluated between different ultrasound devices as data from only a limited number of participants were included and, therefore, they were insufficient to perform this sub-analysis. When more papers are published in the future, we would like to re-analyze this aspect.

Conclusion

Conventional B-mode ultrasound has sufficient diagnostic capability for steatosis ≥ 5%. In particular, attenuation coefficients guided by B-mode ultrasound are reliable and reproducible.