Keywords

The Baveno VI Criteria

The first five Baveno consensus conferences recommended performing endoscopy on every patient with a diagnosis of cirrhosis. For the first time, Baveno VI conference introduced a two-step strategy for the screening of esophageal varices in patients with compensated advanced chronic liver disease (cACLD), establishing what became known as the Baveno VI criteria [1]. These resulted from the combination of a liver stiffness measurement by transient elastography (LSM by TE) of less than 20 kPa, together with a platelet count over 150 × 109/L. The risk of high-risk varices (HRV) when fulfilling both criteria was considered low enough to circumvent the performance of an endoscopy. In those patients outside Baveno VI criteria, an endoscopy should be performed. These criteria have been widely adopted in practice and recommended by subsequent guidelines for the management of a chronic liver disease [2]. Furthermore, since the Baveno VI conference, the criteria have undergone an unprecedented level of validation, with 28 fully published manuscripts up to March 2021 testing its performance.

The basic tenet of the proposal at Baveno VI was that “for high risk varices (HRV: medium-large varices or small with red signs) the acceptable risk of missing varices should be near 0 or 5% at the most”) [3]. This put the emphasis on not missing an opportunity of doing primary prophylaxis of bleeding (either with beta-blockers or variceal ligation), and considered much less harmful the performance of an unneeded endoscopy since the invasiveness of endoscopy is low. Indeed, setting that threshold at the 5% level means that we would accept to do an endoscopy with up to 95% chance of a false-positive result, and up to 5% chance of a false-negative result, or what would be equivalent to the value of not missing a case of HRV (a false negative) was 19 (95/5) times greater than the value of doing an unneeded endoscopy (false positive).

The subsequent ANTICIPATE study showed that the pointwise estimate of a combination of LSM by TE of 20 kPa and a platelet count of 150 × 109/L corresponds to the predicted risk of HRV of ~5% [4]. The Baveno VI criteria, therefore, set the maximum allowed risk of missing varices at the 5% mark (Fig. 10.1). Provided that the ANTICIPATE model is well calibrated, the negative predicted value (NPV) of the criteria would tend to be >95%, since all patients with negative criteria would have a theoretical risk of HRV below 5%. This seems very reasonable in a two-step strategy where, in the first step, a high NPV is favored.

Fig. 10.1
A nomogram of H R V risk. 3 horizontal sections have scales labeled as follows. Top, points. Middle, liver stiffness and platelet count. Bottom, total points and risk of H R V. The maximum risk of H R V is 5 percent. 2 vertical lines connect the following values on the scale, points 0 liver stiffness 8 and platelet count 150, points 40 liver stiffness 19.

ANTICIPATE nomogram for predicting HRV based on LSM by TE and platelet count. A LSM of just below 20 kPa, together with a platelet count of 150 × 109/L yields a probability of HRV of 5% [4]

The interpretation of the 5% threshold raised some questions [5], including whether this should correspond to a sensitivity of >95% for the criteria or a NPV of >95%. It is important to emphasize that sensitivity is a backward probability. For example, the sensitivity of the Baveno VI criteria would be the probability of having positive criteria provided the patient has HRV. Therefore, sensitivity never reflects a clinical question, but might be relevant in the early development of a diagnostic test, especially when the study is based on a case-control study. The NPV better reflects the clinical question addressed here: what would be the probability of not having HRV provided that the patient has values within the Baveno VI criteria.

Evolving recommendations for the management of compensated cirrhosis, based on the result of the PREDESCI study [6] might result in the use of beta-blockers in patients with clinically significant portal hypertension (CSPH), regardless of the presence of varices. This shift in paradigm, if widely adopted, will decrease the relevance of Baveno VI criteria for the assessment of varices, since patients already on beta-blockers do not require endoscopy. However, a relevant proportion of patients would have either contraindications or intolerance to beta-blockers, and these patients would require endoscopic assessment unless they are within the Baveno VI criteria.

Validation of Baveno VI Criteria

We performed a systematic search of fully published studies up to March 2021 assessing the performance of Baveno VI criteria. The search strategy is reported in detail in supplementary data 11.1. We identified 28 studies, of which the main characteristics are reported in Table 10.1.

Table 10.1 Main characteristics of the included validation studies

We performed a univariate quantitative meta-analysis of proportions to pool NPVs since, as discussed above, this is the metric we consider the benchmark for validation of the performance of the criteria. The forest plot is shown in Fig. 10.2, with further methodological details in the figure legend. The pooled NPV was 0.99 (95% CI 0.99–1.00), with no significant heterogeneity. The proportion of saved endoscopies ranged from 8% to 60%. The interpretation of the proportion of saved endoscopies must be taken with caution, since they are highly dependent on the spectrum of diseases assessed in individual studies. Results of the meta-analysis of sensitivity are provided in supplementary data 11.2 (pooled sensitivity 0.99; 95% CI 0.98–0.99). We did not use bivariate models (which take into account the covariance of sensitivity and specificity) in our meta-analysis for two reasons. First, bivariate models require continuity correction, which adds a 0.5 to cells with a zero value. Since the number of zero cells was high (a number of studies had a sensitivity of 100% or an NPV of 100%), adding that 0.5 would artificially bring down sensitivity and NPV. Second, as detailed above, Baveno VI is meant to be used in the first step of the approach to diagnosing varices, so the main goal is a high NPV, and positive predictive value and specificity have much less relevance.

Fig. 10.2
An illustration of the studies is listed on the left followed by N P V, C I, previous H R V, and saved percentage. The pooled N P V is 0.99, 95 percent C I, left bracket 0.99, semicolon, 1.00, right bracket. Heterogeneity is I squared, T squared, and p equal 0 percent, 0, and 0.69, respectively. The forest plot is at 0.99 N P V.

Forest plot showing NPVs of the Baveno VI validation studies. Studies were ordered by prevalence. Prevalence and proportion of saved endoscopies are shown in the right columns. Meta-analysis was performed after double arcsine transformation of the proportions, pooled with random effects. Note: Calés et al. 2018 study data was extracted from the following systematic review, since more complete data was obtained from the authors [34]

Impact of Etiology on the Performance of Baveno VI Criteria

Even if there was no heterogeneity in the NPVs of Baveno VI across studies, to confirm that Baveno VI criteria perform well across etiologies, we conducted a subgroup meta-analysis. Only 12 of the 28 studies performed etiology-specific analysis. Figure 10.3 demonstrates the forest plot of the studies. There were no significant differences in NPVs according to etiology subgroups.

Fig. 10.3
An illustration of the studies is listed on the left followed by N P V, C I, previous H R V, and saved percentage. The pooled N P V is 1.00, 95 percent C I, left bracket 0.99, semicolon, 1.00, right bracket. Heterogeneity is l square, T square, chi-square subscript 22, and p equal 0 percent, 0, 19.16, 0.64, respectively. The forest plot is at 1 N P V.

Etiology-specific pooled negative predictive values of Baveno VI criteria

Can Baveno VI Criteria Be Expanded?

The Baveno VI criteria is undoubtedly a validated tool to select for low-risk cACLD patients who can safely avoid a surveillance gastroscopy. However, it has been suggested that the number of saved endoscopies is low. As stated above, this is an unsound metric to compare different studies since it largely depends on how early in the natural history of cACLD these criteria are applied.

In an attempt to increase the proportion of saved endoscopies, the Expanded Baveno VI criteria were proposed after the Baveno VI conference, in which the LSM by TE threshold was increased to 25 kPa, and the platelet threshold decreased to 110 × 109/L [7]. A systematic search identified 16 studies assessing the expanded Baveno VI criteria, and results are shown in Fig. 10.4. Pooled NPV was 0.97 (95% CI 0.95–0.98). However, distinct from Baveno VI criteria, performance of Expanded Baveno VI showed significant heterogeneity (p < 0.0001). Results of the meta-analysis of sensitivities are provided in Supplementary data 11.3 (pooled sensitivity 0.90; 95% CI 0.87–0.93).

Fig. 10.4
An illustration of the studies is listed on the left followed by N P V, C I, previous H R V, and saved percentage. The pooled N P V is 0.97, 95 percent C I, left bracket 0.95, semicolon, 0.98, right bracket. Heterogeneity is l square, T square, and p equal 71 percent, 0.0038, and less than 0.01, respectively. The forest plot is at 0.97 N P V.

Forest plot showing NPVs of Expanded Baveno VI validation studies. Studies were ordered by prevalence. Methodology to pool the NPVs was similar to that shown in Fig. 10.2

To address the sources of heterogeneity, we first evaluated whether etiology was associated with different performance of the expanded criteria. Eight studies showed etiology-specific data. Subgroup meta-analysis did not show any differences in performance across etiologies (Fig. 10.5).

Fig. 10.5
An illustration of the studies is listed on the left followed by N P V, C I, previous H R V, and saved percentage. The pooled N P V is 0.96, 95 percent C I, left bracket 0.94, semicolon, 0.97, right bracket. Heterogeneity is l square, T square, and p equal 30 percent, 0.0013, 21.53, and 0.12, respectively. The forest plot is at 0.96 N P V.

Stratified meta-analysis of NPVs of Expanded Baveno VI criteria according to etiology. There were no significant subgroup differences across different etiologies

We then evaluated the impact of the prevalence of HRV on the performance of Expanded Baveno VI criteria. The group of patients within the Expanded Baveno VI criteria comprises those who are within Baveno VI, and those beyond Baveno VI. The latter is the group that either shows a LSM of 20–25 kPa or a platelet count between 110 and 150. The pointwise risk of HRV of a LSM of 25 and a platelet count of 110 according to the ANTICIPATE model is ~12% [4]. Therefore, patients beyond Baveno VI but within Expanded Baveno VI would have a predicted risk of HRV between 5% and 12%. The prevalence of varices in patients within the Expanded Baveno VI criteria would depend largely on the distribution of the patients in those two groups (within Baveno VI and beyond Baveno VI). We therefore predicted that in series with higher prevalence of HRV, that would predictably have a higher number of patients beyond Baveno VI, the NPV of Expanded Baveno VI would decrease.

To assess this hypothesis, we performed a meta-regression analysis of NPV on the prevalence of HRV. There was a strong association between NPV and prevalence of HRV (Fig. 10.6a), with prevalence explaining 77% of the heterogeneity in NPVs observed across studies. There was no significant association between prevalence of HRV and the NPV of original Baveno VI (Fig. 10.6b), which is likely explained by the fact that all patients within Baveno VI have a theoretical <5% risk of HRV [4].

Fig. 10.6
Two dot plots of H R V prevalence on the x axis versus negative predictive value on the y axis. Plot a titled expanded Baveno 6 criteria has the best fit line with the coordinates (0, 1.00), (15, 0.97), (20, 0.95), (30, 0.90), (35, 0.88). The dots are on both sides of the line. Plot b titled Baveno 6 criteria has the best fit line with the coordinates (0, 1.00), (10, 0.99), (35, 0.98). The dots are mostly on the line. All values are estimated.

Meta-regression assessing the association between prevalence of HRV and NPV of the Expanded Baveno VI criteria (a) and Baveno VI criteria (b)

Other Elastography Methods and Baveno VI Criteria

Point shear wave elastography (pSWE) and 2D-SWE have witnessed increased use in the last few years [35]. The main unsolved issue with these methods is the multiplicity of devices with proprietary algorithms that lead to differences in the quantification of the speed of shear wave, and consequently provide values of liver stiffness that are not identical [34,35,36,37,39]. Therefore, the same liver stiffness thresholds defined for TE cannot be directly applied to pSWE or 2D-SWE [40].

Methods Beyond Baveno VI and Expanded Baveno VI Criteria

Several other models have been proposed for the noninvasive prediction of HRV, including the use of spleen stiffness, spleen diameter or blood-based tests, with only limited or no external validation. Several of these models are reviewed in specific chapters of the book.

Summary and Conclusions

The Baveno VI criteria have been extensively validated as a decision rule for not doing an endoscopy in patients with compensated cirrhosis. The pooled NPV in series with a wide range of prevalences of HRVs (from 5% to 33%) is 99% (95% CI 99%–100%). The expanded Baveno VI cannot be recommended at the present time in any etiology.