Introduction

Metabolic dysfunction-associated steatotic liver disease (MASLD), formerly known as nonalcoholic fatty liver disease (NAFLD), is one of the most prevalent forms of liver disease evaluated in gastroenterology and hepatology clinics in the United States [1,2,3]. The severity of MASLD is graded on a spectrum and ranges from steatotic liver with no steatohepatitis or fibrosis to metabolic dysfunction-associated steatohepatitis (MASH) with fibrosis that can ultimately lead to cirrhosis complicated by portal hypertension [4,5,6]. Since MASH is a histologic diagnosis requiring an invasive and time-consuming liver biopsy, the field has embraced non-invasive tests (NITs) to recognize the presence of fibrosis (clinically significant, advanced or cirrhosis) [7,8,9,10,11]. Several cross-sectional studies have demonstrated a correlation between liver stiffness measurement (LSM) and the severity of fibrosis in MASLD [8, 12,13,14,15,16,17,18]. More recently, a few investigators have reported a correlation between baseline LSM and change in LSM over time with liver-related clinical outcomes [18, 19]. The recent BAVENO VII established the “rule of five,” which proposes that there is no concern for compensated advanced chronic liver disease (cACLD) for those with LSM < 10 kilopascals (kPa), while an LSM ≥ 25 kPa suggests the presence of clinically significant portal hypertension (CSPH) [20]. In patients with MASLD, due to increased probe-to-capsule distance resulting in a false increase in LSM, the ANTICIPATE-NASH model is recommended, incorporating LSM with platelet count and body mass index (BMI) to predict the presence of CSPH[21].

There is a need for NITs that can estimate the presence of cACLD and CSPH with high diagnostic accuracy in patients with MASLD. Although the evaluation of a compensated cirrhotic by hepatic vein pressure gradient is ideal for risk stratification, the procedure is cumbersome, invasive, non-standardized, and often restricted to tertiary care centers due to limited expertise [22]. Spleen stiffness measurement (SSM) is an emerging NIT that has been studied to assess the severity of portal hypertension. In 2011, the utility of the standard liver 50-Hz vibration-controlled elastography to measure spleen stiffness as an alternative means to successfully predict the presence of esophageal varices (EV) was proposed [23]. Additional studies analyzing this standard 50-Hz elastography have demonstrated inconsistencies in detecting cutoff values for the presence of EV and the accuracy with which it can differentiate between variceal grades [23,24,25]. The 50-Hz elastography is limited by its narrow range of measurement [1.5–75 kPa]. As a result, the FibroScan 630 Expert was recently made available to include a broader range of SSM (5–100 kPa) and a B‐mode ultrasound to locate the spleen [26]. Prior studies have concluded that the FibroScan 630 Expert can more accurately detect EV than the original 50-Hz elastography probe [27].

Currently, the main diagnostic NITs utilized to assess the severity of liver disease rely on measuring LSM using elastography combined with blood tests [13, 28, 29]. However, using LSM to predict the severity of CSPH in individuals with liver disease has limited accuracy [30,31,32]. To address this issue, some studies have suggested that adding SSM to LSM may improve the ability to predict the presence of high-risk EV in those with liver disease and elevated LSM [33,34,35]. However, the usefulness of SSM by itself in routine clinical practice still needs to be clarified, as the data on its diagnostic accuracy are limited, particularly in MASLD [36,37,38].

Recently, the BAVENO VII guidelines recommended an SSM cutoff value of < 21 kPa to rule out CSPH and a cutoff of > 50 kPa to rule it in [20]. Additionally, an SSM cutoff of ≤ 40 kPa was proposed to identify individuals who are unlikely to have high-risk EV [20]. However, these recommendations were based on studies of patients with predominantly viral hepatitis, and it is unclear if they are applicable to individuals with MASLD [20]. Moreover, it is unknown whether SSM alone can detect cirrhosis and EV in those with MASLD. To address these gaps in knowledge, this study aims to assess SSM's clinical usefulness for detecting cirrhosis, EV, and high-risk EV in individuals with MASLD. Additionally, the study also evaluated the diagnostic performance of SSM cutoffs proposed by the BAVENO VII guidelines (≥ 21 kPa, > 40 kPa, and > 50 kPa) for identification of cirrhosis, EV (any size), and high-risk EV.

Methods

Study Design

In this prospective study (ClinicalTrials.gov Identifier: NCT03778411), subjects ≥ 18 years of age presenting to the hepatology clinics at Indiana University Hospital for the care of their liver disease were recruited to participate. All subjects provided Institutional Review Board or Ethics Committee approved informed consent before undergoing study-specific procedures. Patients were excluded if they had clinically apparent ascites or declined to provide informed consent. Simultaneous LSM, SSM, and controlled attenuation parameter (CAP) were obtained during the study using the FibroScan 630 Expert (Echosens, Paris) without SmartExam. All measurements were obtained by a single technician (AM). All scans were performed in the fasting state (at least 3 h) using a standard protocol per manufacturer recommendations.

Study Participants and Clinical Data

Between November 2020 and May 2022, a total of 419 patients presenting to the liver clinic were assessed for simultaneous LSM and SSM using the FibroScan 630 Expert. Pertinent demographics, clinical data, and variables related to blood tests, histology, imaging modalities, and endoscopy were extracted from the electronic health records. Of these, 154 patients with MASLD met the study inclusion criteria of a clinical diagnosis of MASLD by imaging or liver biopsy without a clinical history of excess alcohol use. Of these, 93 (60.1%) had MASH cirrhosis diagnosed through a combination of either imaging with thrombocytopenia (n = 33) and/or a liver biopsy (n = 60) [39].

Patients were excluded if they had primary biliary cholangitis (n = 23), primary sclerosing cholangitis (n = 16), autoimmune hepatitis (n = 25), hepatitis B (n = 11), hepatitis C (n = 48), alcohol-associated liver disease (n = 79), hereditary hemochromatosis (n = 3), Budd-Chiari syndrome (n = 3), alpha-1 antitrypsin deficiency (n = 8), hepatocellular carcinoma (n = 18), current alcohol use (n = 17), or an unknown liver etiology (n = 14). Fibrosis-4 (FIB4) was calculated for all enrolled study participants from data extracted through the electronic health record using the following formula ([age]x[AST]/[platelet count]x[√ALT]) [40].

Measurement of LSM and SSM

The FibroScan 630 Expert is a vibration-controlled transient elastography (VCTE) was used to measure the stiffness of the spleen and liver in kilopascals. Briefly, each patient was placed supine with right arm raised behind his or her head and remained still during the procedure. The probe was placed over the liver region, and measurements were obtained. After the completion of the liver elastography, the device was moved and positioned to obtain SSM as per the manufacturer's protocol. For LSM, the measurement was considered successful if at least 10 measurements were available and reliable when the interquartile range/median (IQR/M) was ≤ 30%, as per the previous recommendations [8, 41]. In contrast, SSM was considered successful if at least 10 measurements were available. Currently, there are no criteria to establish a reliable SSM.

Endoscopic Evaluation

In the current study cohort, 96 (62%) participants had undergone an upper endoscopy, and 73 (47%) had an upper endoscopy within a year of undergoing VCTE (Supplementary Fig. 1). The endoscopy records were reviewed, and the EV were classified according to Baveno VI consensus: no varix, small varices, moderately enlarged, beady varices, and large varices [42, 43]. EV were categorized as high-risk if they were large, required endoscopic band ligation, or had a red wale sign irrespective of the size of the varices as per BAVENO VI criteria [42, 43].

Statistical Analysis

Statistical analyses were performed using SPSS version 28 (IBM, New York, 10,504) and Stata MP version 17.0 (StataCorp, College Station, TX). Data were analyzed by the Mann–Whitney U test and chi-squared for continuous and categorical variables, respectively. P values less than 0.05 were deemed statistically significant. Area under the receiver operating characteristic curves (AUROC), specificity, sensitivity, positive predictive values (PPV), and negative predictive values (NPV) was established to determine the aptitude for the three non-invasive markers (SSM, LSM, and FIB4) to detect cirrhosis, EV (any size), and high-risk EV. The best Youden index was identified to determine the sensitivity and specificity of these NITs. The diagnostic performance of SSM cutoffs as proposed in the recent BAVENO VII consensus document, i.e., SSM cutoffs of ≥ 21 kPa, > 40 kPa, and > 50 kPa, was examined in the current cohort and reported in terms of sensitivity, specificity, PPV, and NPV [20].

Results

Study Participant Demographics

Table 1 illustrates the demographics and laboratory findings of the total cohort, valid SSM, and invalid SSM subgroups. The median age was 59 years (range 24–79 years), 46% were males, 88% were Caucasian, and the mean BMI was 34.6 ± 6.3 kg/m2 for the total cohort (Table 1). The mean platelet count, INR, and albumin were significantly different between the valid and invalid SSM subgroups (Table 1). The median duration between those who received an upper endoscopy and VCTE evaluation within one year was 67 days (IQR, 25th-75th: 4–168), with 18 (25%) undergoing both tests on the same day.

Table 1 Patient demographics and laboratory values of the patients in the study cohort

FibroScan Metrics

Of the 154 participants, 29 had an invalid SSM [inability to locate the spleen (n = 28) or < 10 valid spleen stiffness measurements (n = 1)] with a failure rate of 19% (Fig. 1). 8 participants had an invalid LSM [< 10 valid liver stiffness measurements (n = 8) or IQR/M > 30% (n = 0)] with a failure rate of 5% (Fig. 1).

Fig. 1
figure 1

Proportion of study participants with valid and invalid vibration-controlled transient elastography in the current study cohort of study participants with metabolic dysfunction-associated steatotic liver disease (N = 154). a Evaluation for spleen stiffness measurement (SSM) with failure rate of 18.8%. b Liver stiffness measurements (LSM) with failure rate of 5.2%

Patients with a valid SSM had a lower BMI [33.8 ± 5.9 vs. 37.9 ± 6.9 kg/m2, P = 0.002] and waist circumference [109.2 ± 14.4 vs. 123.2 ± 17.4) centimeters, P = 0.004] compared to those with an invalid SSM (Table 1). The valid SSM cohort had a greater degree of fibrosis than the invalid SSM group (P = 0.015) (Table 1). There were no significant differences between the two groups regarding the presence of EV, gastric varices, portal hypertensive gastropathy, splenomegaly, ascites, or hepatomegaly (Table 1).

Discriminating Ability of SSM

The mean SSM for MASH cirrhosis vs. non-cirrhotic MASLD was 56.2 ± 25.8 kPa vs. 33.0 ± 15.9 (P < 0.001), respectively (Table 2). The AUROC for SSM was 0.78 (95% CI 0.70–0.85) for detecting MASH cirrhosis (Fig. 2a). SSM cutoff of ≥ 21 kPa had a sensitivity of 96% and PPV of 88%, while SSM cutoff of > 50 kPa had a specificity of 78% and PPV of 94% for identifying NASH cirrhosis (Table 3). Of the patients with a valid SSM, 79 patients had an SSM < 50 kPa, 37 (47%) of whom had MASH cirrhosis and 42 (53%) who did not. Forty-six individuals had an SSM > 50 kPa, of which 43 (93%) had MASH cirrhosis and 3 (7%) did not. With this cutoff point, 68% of the cohort was correctly classified.

Table 2 Differentiating cutoff values of various NITs for cirrhosis and associated manifestations of clinically significant portal hypertension in study participants with metabolic dysfunction-associated steatotic liver disease (MASLD) and metabolic dysfunction-associated steatohepatitis (MASH)
Fig. 2
figure 2

Diagnostic performance characteristics of spleen stiffness measurement in the current study cohort of subjects with metabolic dysfunction-associated steatotic liver disease: area under the curve for identifying cirrhosis (a) and associated manifestations of clinically significant portal hypertension (CSPH) such as esophageal varices (b) and high-risk esophageal varices (c)

Table 3 Diagnostic performance of prior suggested SSM cutoffs18 in the current study participants with and without cirrhosis from metabolic dysfunction-associated steatohepatitis

73 participants had an upper endoscopy within one year of their VCTE and were included in this analysis. Of these 73 individuals, 63 had a valid SSM. 33 subjects in this valid SSM cohort had EV (19 small, 5 moderate, 9 large), of which 13 were banded, and 35 had portal hypertensive gastropathy. The valid LSM subgroup had 30 patients with EV (17 small, 4 moderate, and 9 large). The values of SSM in the groups with EV (any size) and without EV were 69.2 ± 23.4 kPa vs. 47.8 ± 25.1 kPa, respectively (P < 0.001) (Table 2). The mean SSM for small, medium, and large EV was 57.6 ± 19.7, 79.8 ± 27.8, and 85.0 ± 17.2 kPa, respectively (P = 0.007) (Table 2). SSM values were statistically different among small, medium, and large varices, with the largest difference between small vs. moderate and large groups. The LSM and FIB4 were not different among small, medium, and large varices (P = 0.48 and 0.65, respectively) (Table 2).

The AUROC for SSM was 0.74 (95% CI 0.61–0.84) for detecting the presence of EV (any size) (Fig. 2b). The AUROC for SSM for detecting high-risk EV was higher at 0.82 (95% CI: 0.75–0.98) (Fig. 2c). The optimal cutoff based on the Youden index for SSM associated with the presence of high-risk EV was > 67 kPa with sensitivity of 79%, specificity of 76%, PPV of 48%, and NPV of 93% (Table 4). 40 patients had SSM < 67 kPa, 3 had high-risk EV, and 37 did not. Among the 23 individuals that had SSM > 67 kPa, 11 had high-risk EV, and 12 did not. With this cutoff, 76% of the subjects were correctly classified.

Table 4 Diagnostic performance of non-invasive tests in the detection of metabolic dysfunction-associated steatotic liver disease-related cirrhosis and manifestations of clinically significant portal hypertension

Comparative Analysis

We next compared the discrimination ability of LSM and FIB4 in comparison with SSM in the overall cohort and in the cirrhosis subgroup for cirrhosis, EV (any size), and high-risk EV. For the overall cohort, the AUROC for LSM was 0.89 (95% CI 0.82–0.93, P < 0.001) and was higher than SSM in detecting MASH cirrhosis (Table 4). The LSM cutoff of > 18.0 kPa demonstrated a sensitivity of 74%, specificity of 88%, PPV of 90%, and NPV of 69% for NASH cirrhosis (Table 4). The valid LSM had 30 patients with EV (17 small, 4 moderate, 9 large), of which 11 were banded. The AUROC of LSM was low for detecting EV at 0.61 (95% CI 0.49–0.73, P = 0.1) (Table 4). For the detection of high-risk EV, LSM had an AUROC of 0.59 (95% CI 0.35–0.76, P = 0.4) (Table 4). The AUROC for FIB4 was excellent at 0.90 (95% CI 0.83–0.94, P < 0.0001) with a sensitivity of 84%, specificity of 79%, PPV of 89%, and NPV of 72% for detecting MASH cirrhosis with a FIB4 cutoff of > 0.375 (Table 4). The values of FIB4 in the group with EV were 1.236 ± 0.819 and were statistically significantly higher compared to those without varices (P =  < 0.001) (Table 2). The FIB4 values were not statistically significantly different between the different sizes of EV (small, moderate, and large) (P = 0.65) (Table 2). The AUROC for FIB4 was 0.76 (95% CI 0.65–0.85) for detecting EV (Table 4). A FIB4 cutoff of > 0.481 had a sensitivity of 92%, specificity of 55%, PPV of 64%, and NPV of 89% for EV (Table 4). For detecting high-risk EV, FIB4 > 0.976, we had a sensitivity of 72%, specificity of 69%, PPV of 39%, and NPV of 90% (Table 4).

In the cirrhosis subgroup (n = 93), 64 subjects had undergone an upper endoscopy within a year of undergoing evaluation with FibroScan 630 Expert. The median age was 63 years, with 44% male and a body mass index of 34 ± 6 kg/m2. The mean platelet count was 120 ± 64 k/cumm with a FIB4 score of 0.963 ± 0.707. There were 55 cirrhosis patients with valid SSM and 57 cirrhosis patients with valid LSM who underwent an upper endoscopy within a year. The AUROC of SSM, LSM, and FIB4 for detecting the presence of EV was 0.70 (95% CI 0.56–0.84), 0.54 (95% CI 0.39–0.70), and 0.72 (95% CI 0.60–0.84), respectively. In those subjects who had all three NITs and upper endoscopy within a year (n = 48), the AUROC of SSM, LSM, and FIB4 were 0.74 (95% CI 0.60–0.89), 0.54 (95% CI 0.37–0.71), and 0.69 (95% CI 0.54–0.84), respectively, for the presence of EV. For the diagnosis of high-risk EV, the AUROC of SSM, LSM, and FIB4 was 0.85 (95% CI 0.73–0.98), 0.50 (95% CI 0.28–0.72), and 0.64 (95% CI 0.47–0.82), respectively.

Discussion

SSM is an additional NIT in the toolbox for clinicians caring for patients with chronic liver disease. In the current study, we obtained SSM measurements from all participants without any selection bias to examine its independent role as an NIT in evaluating MASLD. The failure rate for obtaining a valid SSM was 19% in the current study and higher than the 3–4% failure rates reported in prior studies that used the FibroScan 630 Expert [44, 45]. We believe this higher failure rate is related to patient factors and device limitations. Notably, higher BMI and larger waist circumference decreased the probability of acquiring a valid SSM. We suspect this is related to the limitation of measuring the SSM with the medium (M+) probe and the lack of the SmartExam option for SSM. In previous studies, failure rates were lower, likely because participants had lower BMIs [44, 45]. Another reason for the higher failure rate in the current study was the inability to locate the spleen with the probe despite the availability of an ultrasound probe for visualization in the FibroScan 630 Expert. We suspect that the lack of splenomegaly in non-cirrhotic MASLD adds an additional limitation along with the M + probe and lack of the SmartExam. Although one might argue that failure to measure SSM could be a good qualitative prognostic sign indicating the absence of CSPH, caution should be exercised with that assumption in a patient with MASLD with a higher waist circumference or higher BMI. We hope that newer versions of the device will incorporate both the SmartExam and extra-large (XL+) probe to measure SSM. The SmartExam option has overcome the limitation associated with increased probe-to-capsule distance regarding LSM, as there were no patients with unreliable criteria of > 30% IQR/M.

In this study, SSM is a useful tool for detecting MASH cirrhosis using the SSM ≥ 21 kPa cutoff proposed by Baveno VII. This cutoff had a sensitivity of 96% with a PPV of 88%. Alternatively, SSM > 50 kPa has both a high specificity (78%) and high PPV (94%). Both these cutoffs performed well in the current MASLD cohort and, therefore, would be very useful in the evaluation of MASLD patients, particularly in those with invalid LSM or where data to calculate FIB4 are not readily available or in the indeterminate range. In our opinion, it is not unreasonable to routinely evaluate SSM in those with LSM > 15 kPa to evaluate for the presence of cirrhosis and CSPH. Furthermore, SSM ≥ 21 kPa cutoff had high NPV for EV (any size) and high-risk EV, making it clinically useful to prognosticate patients with suspected cirrhosis. Lastly, SSM > 50 kPa had a high NPV. Clinically, this would be useful to prioritize these patients for early evaluation with an upper endoscopy. As the mortality rate for bleeding EV can be high, it is important to have an NIT such as SSM with a high NPV for detecting high-risk EV [46]. Finally, these findings suggest that SSM by itself may be used as an alternative diagnostic tool for risk prognostication in patients with MASLD.

Certain aspects of our study merit further discussion. The study enrolled patients from a hepatology clinic in a prospective manner, but there is still a possibility of referral bias, as the number of patients with advanced fibrosis or cirrhosis in the study is higher than what would be expected in a primary care clinic. To detect high-risk EV, the cutoff value for SSM, > 67 kPa, needs to be validated in a separate independent cohort, even though it is in line with the > 50 kPa proposed by the BAVENO VII. The current dataset could have been more reliable if all participants had same-day FibroScan and upper endoscopies. However, we think there may not be a significant increase in portal hypertension within a year. Additionally, several participants without cirrhosis did not have an upper endoscopy, as an endoscopy was not clinically warranted. The combination of LSM < 20 kPa and platelet count of > 150 k/cumm criteria did not show high diagnostic accuracy (data not shown) in the current cohort, presumably due to the high proportion of patients with cirrhosis. Interestingly, SSM was the only NIT demonstrating a positive correlation with variceal size. Lastly, FIB4 appears to perform with high diagnostic accuracy to identify CSPH but has limited potential to differentiate between cirrhosis, the presence of EV (any size), and high-risk EV.

In conclusion, our study supports using SSM as a NIT for detecting cirrhosis and portal hypertensive manifestations associated with CSPH in patients with MASLD. SSM by itself is most effective in identifying the presence of EV, particularly high-risk EV. The availability of SSM, in addition to other NITs, may provide additional insights for risk stratification of patients with MASLD.