Keywords

Introduction

The principle of hepatic elastography is to generate an elastic shear wave and to measure its speed through the liver, from which is calculated the liver stiffness (LS) itself correlated with the severity of chronic liver diseases. For more details, see also book section II “Techniques to measure liver stiffness.” As for all medical exams, some precautions must be taken before and during liver stiffness measurement (LSM) to ensure that the most relevant and clinically meaningful results are obtained. Therefore, conditions related to the patient (fasting, alcohol withdrawal status), to the operator (experience with the device), and to the procedure of examination (measurement site, choice of the FibroScan probe, intrinsic characteristic of the examination) must be carefully controlled to reach the highest quality of liver stiffness measurement.

Fasting

Several studies have evaluated the effect of meal intake on elastography results (Table 41.1). Most of these works evaluated TE whose results significantly increase after eating in half of the patients [1]. The peak increase in LS occurs between 15 and 60 min after meal intake [2,3,4,5], and the increase rises up to 20–40% of the baseline value [2, 3, 6, 7]. LS recovers to initial level within 2–3 h [1, 2, 5, 8]. LS measured with two-dimensional shear wave elastography (2D-SWE) shows also a peak increase before the first hour after meal intake [3, 9], but at lesser extend with a mean 10–20% increase [3, 9, 10]. As for TE, LS recovers to the initial level within 2 h [9]. When measured with magnetic resonance elastography (MRE), LS increases by 5–20% after meal intake [11,12,13].

Table 41.1 Impact of meal intake on liver stiffness measurement

The portal blood flow increases after eating [3, 6,7,8, 11], and some studies have highlighted a significant correlation between both portal blood flow and LS variations [6, 11]. However, others failed to replicate these findings [3, 7, 8]. The decrease in arterial hepatic blood flow is a physiologic response to increased portal blood flow after a meal (hepatic arterial buffer response; HABR). In a study conducted in 19 cirrhotic patients, the increase in LS was more pronounced in patients lacking this postprandial HABR, suggesting it is an important factor modulating postprandial change in LS [7]. Increasing LS after a meal has a significant impact on the diagnosis of liver fibrosis at the individual level. 11% of healthy volunteers with normal LS shift to >6.0 kPa [1] or to >6.7 kPa [10] after meal intake. In patients with chronic liver disease, performing LSM with TE early after eating leads to overestimation of liver fibrosis in around one-third of the patients [1, 3]. As a consequence, international guidelines recommend to perform LSM after fasting for at least 2 h [14, 15].

The Controlled Attenuation Parameter (CAP) included in the FibroScan device evaluates liver steatosis through quantification of the ultrasound attenuation during TE examination on the FibroScan platform [16, 17]. For more details, see also book section VI “Assessment of hepatic steatosis using CAP.” The data available about CAP evolution after meal intake remains conflicting (Table 41.1): some works have shown a significant decrease [4], whereas others demonstrated a significant increase [3] or no modification [5, 18].

Alcohol Withdrawal

Liver stiffness has been shown to significantly and rapidly decrease after curing the cause of chronic liver diseases, especially in chronic viral hepatitis [19, 20]. In fact, this early decrease is mainly due to inflammation regression rather than immediate fibrosis improvement. This is also the case in alcoholic liver disease, with studies showing around 3 kPa decrease in LS within the month after alcohol withdrawal in half of the patients [21,22,23,24]. LS could continue to decrease 6 months after alcohol withdrawal, up to 6 kPa [24]. Therefore, LS results should be interpreted with caution in this situation to avoid underestimation of liver fibrosis [22, 24]. Inflammation-adapted cutoff values may be used for optimal LS interpretation [25]. A recent study has shown that CAP also significantly decreases in 78% of the patients who stop alcohol consumption [26]. For more details, see also book section IV “Important (patho)physiological confounders of LS.”

Operator Experience

An important point for clinical practice is to assess when an operator is sufficiently trained to perform liver stiffness measurement (Table 41.2). In a large series of 13,369 liver stiffness examinations with FibroScan, operator experience fewer than 500 examinations was independently associated with a higher rate of measurement failure and a higher rate of unreliable examinations [27]. However, two other works performed in 2335 patients with chronic liver disease [28] and 992 NAFLD patients [29] found no independent association between the operator experience and the reliability of FibroScan examination.

Table 41.2 Impact of operator experience on liver stiffness measurement

Beyond reliability of the examination, training can also be evaluated through interobserver reproducibility between novice and experienced operators. A recent study showed an excellent novice-expert agreement for the FibroScan result as early as the first 100 examinations [30]. Due to its high ease of use, LSM with FibroScan could be delegated to nonmedical staff such as nurse or specialized technicians. To explore this possibility, the training with FibroScan has been evaluated in five novices having different medical status: a physician specializing in hepatology, a medical intern, a third-year medical student, a nurse, and a non-physician clinical research assistant [31]. The novices showed a progressive increase in the success rate of their LSM, especially the two non-physicians who finally required 50 examinations training. Interestingly, novice-expert agreement for LS results was excellent from the ten first patients with no learning curve for any of the five novices. Taken together, these results suggest that increasing experience allow to perform LSM more easily, but that results are relevant from the first examinations. In line with these findings, a study performed in patients with biopsy-proven chronic liver disease showed that operators with moderate experience (50–500 LSM) were as accurate as experienced operators for the diagnosis of cirrhosis using FibroScan (respective AUROC: 0.93 vs. 0.94), and only slightly less accurate for the diagnosis of significant fibrosis (AUROC: 0.89 vs. 0.91) [32].

There are few data available about the training with 2D-SWE or point shear wave elastography (pSWE). The rate of unreliable examinations with 2D-SWE in obese patients is higher for a novice compared to an experimented operator, but the difference is no longer significant in patients with normal weight and in overweight patients [33]. Interobserver agreement for liver stiffness results between novice and expert operators are very good for 2D-SWE [34] as for pSWE [35, 36]. However, one study that used liver biopsy as reference has suggested that around 130 examinations with pSWE are required to optimize diagnostic accuracy [36]. Since both technologies are run on conventional ultrasound machines, normally a dedicated ultrasound knowledge is required.

In summary, it can be considered that around 100 liver stiffness examinations are required before considering an operator as totally trained in the use of an elastography device.

FibroScan Probe

The classic FibroScan M probe is impaired by measurement failure rates reaching 8% in overweight patients and 17% in obese patients [27]. To circumvent this limitation, the manufacturer has developed the XL probe specifically dedicated for obese patients with skin-liver capsule distance >25 mm. Compared with the classic M probe, the XL probe uses a lower central frequency (2.5 vs. 3.5 MHz for the M probe), has a larger tip diameter (12 vs. 9 mm), and measures more deeply below the skin surface (3.5–7.5 cm vs. 2.5–6.5 cm with the M probe). The XL probe provides a lower rate of measurement failure and a similar diagnostic accuracy than the M probe [37,38,39,40]. However, the XL probe result is lower than that of the M probe with consequently a potential risk of underestimation of liver fibrosis. In contrast with the phantoms, the XL probe consistently produced approximately 20% lower liver stiffness values in humans compared with the M probe [40]. In addition to a long skin-liver capsule distance, a high degree of steatosis was also responsible for this discordance. Adjustment of cutoff values for the XL probe (<5.5, 5.5–7, 7–10, and <10 kPa for F0, F1–2, F3, and F4 fibrosis, respectively) significantly improved agreement between the two probes from r = 0.655 to 0.679 [40]. However, a recent work has shown that liver stiffness results obtained with the M probe in patients with BMI <30 kg/m2 are not significantly different from those obtained with the XL probe in obese patients (BMI ≥30 kg/m2) [41]. Therefore, by following the EASL-ALEH Clinical Practice Guidelines (M probe in patients with BMI <30 kg/m2 and XL probe in obese patients) [14], the same diagnostic cutoffs for both probes displayed similar diagnostic accuracy [41]. These results have been confirmed by another study which also evaluated the Automatic Probe Selection tool included in the recent versions of the FibroScan software [42]. The Automatic Probe Selection tool automatically measures the skin-liver capsule distance and indicates the probe to be used as a function of the patient’s morphology. According to their study results, the authors proposed to use the M probe first in patients with BMI <32 kg/m2 and eventually switch to the XL probe according to the recommendation made by the Automatic Probe Selection tool, and to use the XL probe in all patients with BMI ≥32 kg/m2.

Measurement Site

Liver stiffness measurement is performed in patients lying in dorsal decubitus with the right arm behind the head in maximal abduction. The operator has first to choose the correct measurement site, between two ribs at the level of the right lobe of the liver. By evaluating four different measurement sites, it has been suggested that the interobserver reproducibility for FibroScan results is the highest when the measurement is performed at the crossing of the median axillary line and the first intercostal space under the upper limit of the liver dullness [43] (Table 41.3). Another work showed no significant difference in liver stiffness result, AUROC for significant fibrosis or AUROC for cirrhosis among the different measurement sites evaluated [44]. In fact, it is important to correctly place the probe in front of the liver and to ensure a measurement in a liver portion at least 6 cm thick free of large vascular structures. To do this with FibroScan, the operator should control the real-time ultrasound signal on the screen of the device to obtain a typical acoustic signature of the liver characterized by a layered TM mode without heterogeneity and a linear decrease of the A mode. After the shot, the operator must also control that the elastogram displayed on the FibroScan screen is visible throughout the entire window with parallel margins.

Table 41.3 Impact of the measurement site on liver stiffness measurement

Imaging devices including elastography modulus have the advantage to allow the visual selection of the best region of interest within the liver parenchyma. However, liver stiffness results obtained with pSWE are significantly lower when the measurement is performed in the deep portion compared to the superficial portion of the right lobe of the liver, and by intercostal compared to subcostal approach [45]. 2D-SWE seems to perform less for the diagnosis of significant fibrosis when the measurement is performed in the left lobe of the liver [46, 47].

Reliability Criteria

The correct interpretation of elastography results is crucial to ensure appropriate patient management but remains a challenge for physicians because several conditions other than liver fibrosis can increase liver stiffness: liver inflammation [48, 49], cholestasis [50], and central venous pressure [51]. Conflicting data have been observed on steatosis [52,53,54,55] and they are discussed in more detail in chapter “Histological confounders of liver stiffness” in book section IV and in chapter “Liver steatosis (CAP) as modifier of liver stiffness” in book section VI. In addition, intrinsic characteristics of elastography examination should also be carefully considered for the best interpretation of elastography result.

FibroScan

A reliable FibroScan examination has initially been defined as an exam with ≥10 valid shots and ≥60% success rate and an interquartile range/median ratio (IQR/M) ≤30%. However, some works have found that this “classical” definition does not lead to a significant improvement of the noninvasive diagnosis of liver fibrosis [56, 57]. The first study which specifically evaluated the intrinsic characteristics of FibroScan examination demonstrated that IQR/M is a key parameter to consider [58]. Liver stiffness was converted into fibrosis stage according to published cutoffs, discordance was defined as ≥2 stages difference with liver biopsy result, and the multivariate analysis identified IQR/M as independently associated with discordances. There was a 15% discordance rate in FibroScan examinations with IQR/M ≥0.21 vs. 7% in those with IQR/M <0.21. These results were confirmed by another work in which the discordance rate was 22% for FibroScan examinations with IQR/M ≥0.17 vs. 7% for those with IQR/M <0.17 [57]. In this last work, neither the criteria <10 valid shots nor the success rate was associated with discordance between FibroScan and liver biopsy.

The effect of IQR/M on the discordance rate observed in the Lucidarme and Myers studies did not translate in a significant effect on diagnostic accuracy as evaluated with the AUROC [57, 58]. Therefore, another work has used diagnostic accuracy as endpoint rather than discordance between FibroScan and liver biopsy [56]. Multivariate analysis demonstrated that IQR/M independently interacted with the level of liver stiffness to predict liver fibrosis, leading the authors to define three new categories of reliability (Table 41.4): “very reliable” (IQR/M ≤0.10), “reliable” (0.10< IQR/M ≤0.30, or IQR/M >0.30 with liver stiffness <7.1 kPa), and “poorly reliable” (IQR/M >0.30 with liver stiffness ≥7.1 kPa). AUROCs and rate of well classified patients were significantly lower in poorly reliable examinations compared to the two other very reliable and reliable categories. 9.1% of FibroScan examinations were poorly reliable versus 24.3% unreliable examination with the classical definition (≥10 valid shots, ≥60% success rate, and IQR/M ≤30%). The IQR/M ratio reflects the dispersion of the valid acquisitions obtained during the examination and, when increased, it indicates a limitation in correctly assessing the true level of liver stiffness. However, by definition, a high IQR/M implies a smaller interval in low liver stiffness levels. For example, an IQR/M at 0.30 represents a 1.5 kPa interval when liver stiffness is 5.0 kPa, but a 4.5 kPa interval when liver stiffness is 15.0 kPa. Consequently, IQR/M has little impact in low liver stiffness levels, thus explaining why FibroScan examination with IQR/M >0.30 can be considered “reliable” when liver stiffness is <7.1 kPa. Therefore, reliability criteria based only on IQR/M without consideration for the level of LS erroneously exclude reliable examinations and artificially increase the rate of unreliable examinations. An independent validation study has confirmed that the new reliability criteria increase the number of patients with valid FibroScan examinations without compromising the diagnostic accuracy [59]. In this work including 55% cirrhotic patients, the rate of reliable examinations according to the classical definition was 71.6% versus 83.2% of very reliable/reliable examinations according to the new criteria. Compared to classically defined reliable examinations, reliable/very reliable examinations according to the new criteria yielded a similar correlation with fibrosis stages and hepatic venous pressure gradient and showed the same diagnostic accuracy for significant fibrosis or cirrhosis. The new reliability criteria for FibroScan have recently been validated in a cohort of 938 NAFLD patients [60].

Table 41.4 New reliability criteria for liver stiffness measurement with FibroScan

Point Shear Wave Elastography (pSWE)

pSWE examinations with IQR/M ≥0.30 have a higher rate of discordance with liver biopsy and a lower diagnostic accuracy for significant fibrosis and for severe fibrosis/cirrhosis [61]. The same methodology used for FibroScan has been applied in 1094 patients with biopsy-proven chronic liver disease to define three categories of reliability for pSWE: “very reliable” (IQR/M <0.15), “reliable” (0.15≤ IQR/M <0.35, or IQR/M ≥0.35 with pSWE result <1.37 m/s), and “poorly reliable” (IQR/M ≥0.35 with pSWE result ≥1.37 m/s) [60]. Unreliable examinations produced a very low diagnostic accuracy for advanced fibrosis (AUROC: 0.657, rate of well classified patients: 57.8%) as well as for cirrhosis (AUROC: 0.659, rate of well classified patients: 50.0%), which made these exams as not suitable for the evaluation of liver fibrosis in clinical practice. Unreliable examinations accounted for 21.4% of all exams and, interestingly, the rate of unreliable examinations significantly increased with the skin-liver capsule distance to reach 52.7% in patients with a distance higher than 30 mm. These reliability criteria for pSWE examination need now to be independently validated.

2D Shear Wave Elastography (2D-SWE)

A recent work performed in a small series of 88 patients with chronic liver disease took clinically significant portal hypertension defined by hepatic venous pressure gradient ≥10 mmHg as endpoint and proposed three categories of reliability for 2D-SWE examination using the standard deviation/mean ratio (SD) and the depth of measurement: “highly reliable” (SD ≤0.10 and depth <5.6 cm), “reliable” (SD >0.10 or depth ≥5.6 cm), “unreliable” (SD >0.10 and depth ≥5.6 cm) [62]. Accuracy of 2D-SWE for the noninvasive diagnosis of clinically significant portal hypertension was significantly different between highly reliable, reliable, and unreliable examinations, with respectively 96%, 76%, and 44% correctly classified patients. Another study performed in 142 patients with alcoholic liver disease or chronic viral hepatitis C did not find any association between reliability and SD below 10% [63]. In this work, 2D-SWE measurements with both SD ≤1.75 kPa and a ROI diameter ≥18 mm had excellent accuracy for the diagnosis of cirrhosis (AUROC = 0.99) whereas AUROC was only 0.75 for the exams with SD >1.75 kPa and diameter <18 mm. Finally, it seems that the SD is an important factor to consider for the interpretation of 2D-SWE results, but further studies performed in large cohorts are required to clearly define and validate the reliability criteria of 2D-SWE.

Controlled Attenuation Parameter (CAP)

It has been recently suggested that CAP examinations with an IQR/M >30 dB/m [64] or >40 dB/m [65] are less accurate for the diagnosis of fatty liver. The interest of using IQR/M >40 dB/m as criteria to identify unreliable CAP examinations has been replicated in a study including patients with alcoholic liver disease [26], but not in another work performed in NAFLD [66]. Further studies are required to determine and validate the reliability criteria for CAP examination.