Chronic liver disease has many potential etiologies, ranging from viral infection to non-alcoholic fatty liver disease, some of which are on the rise, with an estimated 30 million Americans affected [1, 2]. The end stage of liver disease is fibrosis and eventual cirrhosis. Assessment of fibrosis is not only important for diagnosis, but also increasingly for management decisions and follow-up, particularly in evaluating candidacy for and response to novel and emerging therapies. Although liver biopsy has been the reference standard in the diagnosis and staging of fibrosis, it has disadvantages in that it is an invasive technique with relatively high-cost and potential sampling error [3, 4]. Non-invasive imaging techniques for staging hepatic fibrosis have shown great promise and increased adoption, with the most attention given to elastography techniques. There is a growing body of data showing that MR and US elastographies are safe, non-invasive, and reliable techniques for evaluating fibrosis [1, 512]. However, these techniques must be performed prospectively with dedicated equipment, may be influenced by operator/technical and patient factors, and rely on measurement of tissue stiffness [1, 7, 9, 13]. Diseases other than fibrosis, such as inflammation, biliary obstruction, and congestion, can also increase liver stiffness [1].

There is an ongoing clinical need for robust non-invasive imaging biomarkers to detect and stage hepatic fibrosis. A variety of morphologic and volumetric changes in the liver and spleen have been evaluated on CT, recently including the liver segmental volume ratio (LSVR), splenic volume, and liver surface nodularity [1417]. These measurements can be performed retrospectively without special equipment and have preliminarily shown performance similar to elastography techniques. CT texture analysis is a technique that quantifies heterogeneity of a region of interest by analyzing the distribution and/or relationship of pixel or voxel gray levels in the image [18]. This technique has been most intensely studied in oncologic applications and has shown association with pathologic features and clinical outcomes in a variety of tumor types [1932]. Some non-oncologic applications are being evaluated, with several groups using texture analysis to assess emphysema and fibrosis in the lung [33, 34]. One small series using CT texture analysis to evaluate hepatic fibrosis found that texture parameters showed some ability to discriminate between stages of fibrosis, but the results were not very convincing [35], and some groups have applied texture analysis to US or MRI [36]. Like volumetric assessment and surface nodularity, this technique can be easily retrospectively applied to CT images. Since it is assessing heterogeneity in the liver rather than tissue stiffness or macroscopic liver morphology, it has the potential to be complementary to other techniques. The purpose of this study was to further evaluate CTTA for staging of hepatic fibrosis (stages F0–F4).

Methods

This retrospective study was HIPAA compliant and IRB approved. The requirement for signed informed consent was waived.

Patient population

The final cohort consisted of 289 adult patients (158 M, 131 F, mean age 51 years). Patients were categorized based on stage of hepatic fibrosis (METAVIR stages F0–F4) [37], ranging from normal controls (F0, n = 77) through intermediate stages of fibrosis (F1, n = 42; F2, n = 37; F3, n = 53) to end-stage cirrhosis (F4, n = 80). Inclusion criteria included adult patients, with both a liver biopsy and CT performed within 1 year for intermediate stages of fibrosis (F1–F3). Normal patients were asymptomatic healthy patients being evaluated for kidney donation and did not have a biopsy. The F4 group was defined by either a liver biopsy (n = 36) or clear-cut chronic end-stage liver disease, as established by imaging and electronic medical review by an experienced abdominal radiologist. The latter definition was based on consultations with our hepatologists and our institutional practice. It required clear cross-sectional imaging evidence of cirrhosis and portal hypertension, identifiable clinical cause/risk factors for liver disease, and complication of liver disease/portal hypertension(hepatic encephalopathy and variceal bleed). A portion of this cohort participated in separate investigations of hepatosplenic volume changes and surface nodularity, and similar inclusion criteria were used [1517]. In the majority of cases, the cause of underlying liver disease was hepatitis C, alcohol-related liver disease, or non-alcoholic fatty liver disease, although small numbers of cases had other causes (primary sclerosing cholangitis, primary biliary cirrhosis, alpha 1 antitrypsin disease, and cryptogenic).

MDCT technique

All CT scans were obtained on 16- or 64-MDCT scanners. The specific CT protocol varied slightly based on the indication (i.e., triphasic liver for transplant evaluation, biphasic liver for cirrhotic liver evaluation, and multiphasic exam for renal donor evaluation). For the CTTA measurement, portal venous phase images were used from all of the exams, using patient size-based scan parameters (auto-mA, kV 100-140). Portal venous phase images were generally reconstructed with 5-mm slice thickness at 3-mm intervals at a matrix of 512 × 512 × 16. Several prior studies have looked at the liver in the portal venous phase [38, 39] and shown that analysis of hepatic texture in this phase of contrast is useful. While delayed phases may emphasize fibrosis [40], this portal venous phase is fairly standard for most institutions and we wanted to evaluate an imaging biomarker that could be widely retrospectively applicable.

CT texture analysis (CTTA)

CTTA was performed by a single-trained reader under the supervision of two experienced abdominal radiologists (12, 22 years). Representative single-slice images at the level of the porta hepatis (largest cross section of the liver) were then sent to a commercially available texture analysis research software platform (TexRAD Ltd, part of Feedback Plc, Cambridge, UK). This slice selection at the level of the porta hepatis was used based on several other whole liver studies [38, 39, 41]. Using the software, a region of interest (ROI) was manually drawn on a single slice at the level of the porta hepatis to include the entire liver (but exclude the major vessels) for a total liver measurement (Fig. 1). A second ROI was drawn around the left lateral lobe and caudate (Couinaud segments I–III) with a third drawn around the medial left lobe and right lobe (Couinaud segments IV–VIII). This additional evaluation of segments I–III versus IV-VIII was based on the morphologic changes seen in cirrhosis and prior work looking at the liver segmental volume ratio (LSVR) [16, 17]. Previous studies have demonstrated that CTTA from single-slice analysis have the ability to extract sufficient information to answer the clinical question. Multi-slice or volume analysis may not be necessary in some cases [23, 42]. Although it makes intuitive sense that multi-slice analysis may be useful, it seems that single-slice analysis is sufficient in many cases and the trade-off in the computational time is more practical in a routine radiological workflow and complexity associated with multi-slice/volumetric analysis related to operator variability associated with ROI drawing, particularly if the ROI drawing process involves manual/semi-automated segmentation approaches, for slightly less data may be worthwhile. CTTA technique employed in this study uses a filtration-histogram technique where initial filtration step using Laplacian of Gaussian (LoG) spatial band-pass (non-orthogonal Wavelet) filter to selectively extract and enhance features of different sizes and intensity variation [20, 31]. This produces a series of derived images that show features ranging from fine (spatial scaling factor, SSF 2 mm which corresponds to object size of 2 mm in radius), medium (SSF 3, 4, 5 mm which corresponds to object size of 3, 4, 5 mm in radius) to coarse (SSF 6, which corresponds to object size of approximately 6 mm in radius) texture maps (Fig. 1) [41, 43]. This is followed by quantification using histogram-based statistical parameters (first, second, and higher order) which includes mean gray-level intensity (Mean), standard deviation (SD, dispersion from the mean), entropy (irregularity), mean of the positive pixels (MPP), skewness (asymmetry), and kurtosis (peakedness or sharpness) at each SSF value. Additional texture features described by Haralick were not available with this software [44]. Also, these histogram parameters were quantified from the conventional image without filtration (SSF = 0) as a control for all texture features. These values were recorded for each patient case and subsequently underwent statistical analysis. Texture features were correlated with stage of hepatic fibrosis.

Fig. 1
figure 1

Texture measurements of the liver. Single-slice CT image demonstrates an ROI drawn around the total liver at the level of the porta hepatis (A) in a pt with F3 disease. Fine (B), medium (C), and coarse filter (D) texture output is obtained. Similarly, ROIs were place on segments IV–VIII (E) and segments I–III (F) for additional analysis

Statistical analysis

Examination of boxplots and Spearman rank correlates was used as a data reduction step to select features most highly correlated with fibrosis. This was seen in conjunction with a trend in significant correlation between a texture parameter vs fibrosis across the different SSF values (e.g., fine, medium, and coarse) indicated a robust association, which would not be attributed to a chance correlation. A Mann–Whitney non-parametric U test was used to assess differences in the texture parameters selected from the data reduction step among the discrete F0–F4 cohorts with emphasis placed on the clinically relevant distinctions/groupings of the presence of fibrosis (F0 vs. F1–F4), significant fibrosis (≥F2), advanced hepatic fibrosis (≥F3), and cirrhosis (F4). ROC curves were obtained for each significant candidate metric from the Mann–Whitney test, and AUC was calculated with DeLong 95% CI. Exploratory cutoffs for fibrosis categories were derived from the ROC analysis. A value of p < 0.05 (two-sided) was the criterion for statistical significance. All statistical analyses were performed with the R program (version 3.3.1, R Core Team, 2016).

Results

Mean gray-level intensity (Mean) showed a statistically significant association with stage of fibrosis at nearly every SSF value (fine, medium, and coarse texture scale) and for nearly all categorizations of fibrosis. Mean was seen to increase from F0 to F4 in the total liver measurement, and even more strongly in the segment IV–VIII measurement, particularly at medium (SSF 5 p < 0.001) and coarse feature size (SSF 6, p < 0.001). This association was seen less strongly (or possibly moving in the opposite direction) in segments I–III (Table 1; Fig. 2). ROC analysis of mean demonstrated an AUC of 0.78 at fine (SSF 2), 0.76 at medium (SSF 5, p<0.001), and 0.76 at coarse (SSF 6 p<0.001) feature size for differentiating F0 vs. F1-4 (presence of any fibrosis, p<0.001, Table 2). A threshold of Mean >0.175 demonstrated a sensitivity and specificity of 74% and 74%, respectively (Table 2). Mean demonstrated an AUC of 0.73 (medium texture scale, SSF 5) for identifying significant fibrosis (≥F2) and AUC of 0.73 for advanced fibrosis (≥F3; coarse texture scale, SSF 6). Mean demonstrated an AUC of 0.76 (coarse texture scale, SSF 6) for identification of cirrhosis (F4) with threshold of mean >0.94 with both sensitivity and specificity at 70% (Table 2; Fig. 3). Entropy and SD also showed some promise in identifying the presence of fibrosis with lower entropy and SD values in the F0 group compared to F1–F4, particularly at fine (SSF 2) filter levels, but at medium and coarse texture size filters for entropy as well. Again, this was seen in the total liver, but more prominently in segments IV–VIII.

Table 1 Summary statistics for mean, skewness, Kurtosis at coarse filter (ssf 6)
Fig. 2
figure 2

Boxplots for mean gray-level intensity (y-axis) across fibrosis stages (x-axis, 0–4) at coarse feature size (SSF 6) for segments IV–VIII (A), total liver (B), and segments I–III (C). Note how the values increase across levels of fibrosis seen most prominently in segments IV–VIII, with a much less prominent or possibly opposite effect in segments I–III and a slightly diluted effect seen in the combined total liver

Table 2 Texture parameter associations with hepatic fibrosis
Fig. 3
figure 3

ROC analysis for mean gray-level intensity (mean) compared to stage of fibrosis in segments IV–VIII. AUC for mean at fine feature size (SSF 2) for detecting the presence of fibrosis (stage 0 vs 1–4, A) was 0.78; AUC for mean at medium feature size (SSF 5) for detecting significant fibrosis (≥F2, B) was 0.73; AUC for mean at coarse feature size (SSF 6) for detecting advanced fibrosis (≥F3, C) was 0.73; and AUC for mean at coarse feature size (SSF 6) for detecting cirrhosis (F4, D) was 0.76

Skewness and kurtosis of the pixel histogram also demonstrated association with hepatic fibrosis, which was particularly strong at identifying cirrhosis (F4). Skewness values were higher in the F4 group compared to the earlier stages of fibrosis, with a similar trend seen with kurtosis. This was seen at all filter levels (fine, medium, and coarse texture scale) for kurtosis and mainly at medium (SSF 5) and coarse (SSF 6) texture scales for skewness. This was again seen in the total liver, but more prominently in segments IV–VIII for both measures, and to a lesser extent (or perhaps opposite direction for kurtosis) in segments I–III (Table 1; Fig. 4). For skewness at coarse filter (SSF 6), an AUC of 0.87 (95% CI 0.827, 0.919) was seen (F0-3 vs. F4), with a threshold of <1.3 demonstrating sensitivity of 84% and specificity of 75%. Similarly, for kurtosis at coarse filter (SSF 6), an AUC of 0.86 (95 % CI 0.806, 0.909) was seen, with threshold <5.2 showing sensitivity and specificity of 81% and 76% respectively (Table 2; Fig. 5).

Fig. 4
figure 4

Boxplots for skewness (y-axis) at coarse feature size (SSF 6) comparing cirrhotics (F4) to other stages of fibrosis (F0–F3, x-axis) measured in segments IV–VIII (A), total liver (B), and segments I–III (C). There is a decrease in skewness values for F4 (cirrhosis) compared to F0–F3, seen most prominently in segments IV–VIIII. A similar trend is seen for kurtosis at coarse feature size (SSF 6) comparing F4 to other stages of fibrosis measured in segments IV–VIII (D), total liver (E), and segments I–III (F). In fact, kurtosis in F4 appears to have the opposite pattern relative to F0–F3 for segments IV–VIII vs. segments I–III

Fig. 5
figure 5

ROC analysis for skewness at coarse filter in segments IV–VIII for detecting cirrhosis (F4, A) with AUC of 0.87 and for kurtosis at coarse feature size (SSF 6) for cirrhosis (F4, B) with AUC of 0.86

Discussion

We found that changes in liver parenchymal CT texture features are associated with degree of underlying hepatic fibrosis. As many of these patients undergo CT for other reasons, this is a measurement that can easily be performed retrospectively on routinely acquired portal venous phase scans without special equipment or complex technique. This may be helpful not only in patients with known liver disease undergoing serial monitoring, but also for patients with unsuspected liver disease that is incidentally detected or first suggested on CT imaging. As improved therapies become available, the ability to detect and characterize fibrosis in at risk populations or monitor response to therapy in a non-invasive, global way becomes increasingly desirable. Techniques like CTTA would not necessarily eliminate the need for biopsy but minimize the frequency/streamline its usage, as a method to triage need for biopsy or act as an adjunct to biopsy in discordant or complex cases.

Daginawala et al [35] looked at the use of texture analysis in assessing hepatic fibrosis in a cohort of 83 patients stratified by Ishak fibrosis scale and found that mean of the pixel histogram was a useful parameter, but with AUC of 0.68 for Ishak 0–2 vs. 3–6, AUC of 0.68 for Ishak 0–3 vs. 4–6, and 0.69 for Ishak 0–4 vs. 5–6. They also saw some association with entropy [35]. We saw similar texture features (mean, entropy) associated with stage of fibrosis, but with more robust AUC values of approximately 0.75 in most cases (in the fair range of ROC curves), and with skewness and kurtosis showing ROC AUC values of 0.86–0.87 (good accuracy, near excellent). It is not clear that a filtration step was used in the texture analysis process used by Daginawala et al, and it is possible that employment of a filtration-histogram technique used here extracts and enhances subtle features/objects potentially improving the diagnostic capability.

There has been increasing interest in non-invasive imaging biomarkers of hepatic fibrosis, with particular attention to elastography (US, MR) techniques. For identifying significant fibrosis (≥F2), US elastography has reported ROC AUC values ranging from 0.840 to 0.870 [45] with sensitivity and specificity from 70–79% to 81–85%, respectively. A meta-analysis looking at MR elastography for identifying significant fibrosis showed AUC 0.880, sensitivity 79%, and specificity 81%, similar to the US-based results [7]. However, US is operator dependent and may be limited in obese or very ill patients, and with MRI failure rates may exceed 5% [12, 13]. Both techniques must be performed prospectively with special equipment, and both are based on changes in stiffness, which is not specific to fibrosis. Increases in stiffness can also be seen in the setting of inflammation, which can overlap or coexist with hepatic fibrosis. CT-based imaging biomarkers in general may be more easily and consistently applied and may have the ability to differentiate inflammation vs fibrosis as they do not rely solely on changes in stiffness in the assessment, although this needs to be an area of future study. For example, in a study looking at lung cancer, the investigators were able to create regional histopathologic maps using CT features (HU histogram) to identify things like inactive fibrosis, active fibrosis, necrosis, red blood cells, and neoplastic cells. In a liver that has a combination of processes such as inflammation and fibrosis, such maps may be useful in determining the predominant process, targeting biopsy, and prioritizing treatment [46].

A number of other promising CT-based imaging parameters have been assessed. One is the liver segmental volume ratio (LSVR), which looks at the ratio of volume changes between segment I–III and segment IV–VIII[16]. As the degree of fibrosis increases, there is increasing hypertrophy of the left lateral segment and caudate lobe compared to the right lobe and medial left lobe, so the LSVR increases with increasing fibrosis. LSVR showed an ROC AUC value of 0.854 with sensitivity of 68% and specificity of 88% for identifying significant fibrosis (≥F2) using a threshold of 0.336 and of 0.880, 72%, and 88%, respectively, for identifying advanced fibrosis (≥F3) using a threshold of 0.347. Similarly, changes in splenic volume were also found to correlate as well or better with stage of hepatic fibrosis, and when LSVR and splenic volume were taken together, improved ROC AUC values were seen. However, changes in total liver volume were not useful in predicting stage of fibrosis [16, 17]. This idea of differential changes in segments I–III vs. IV–VIII is what compelled us in our study to measure these sections of liver separately in addition to the total hepatic texture. This may be why CT texture assessment of segment IV–VIII was the best predictor of stage of hepatic fibrosis, as segments I–III may be changing more slowly, or possibly in the opposite direction and measurements of the total liver become diluted as a result. This also supports the idea that measurements such as CTTA may be used for triaging for or even targeting of hepatic biopsy. Our practice frequently targets the left lobe for random biopsy, but these data suggest that that may underestimate the global amount of fibrosis in the liver. One potential drawback of the LSVR is that it may take a period of time and a certain amount of fibrosis for volume changes to begin, so it was more difficult to separate earlier stages of fibrosis (F0, F1) using this technique. However, CTTA seems to capture some of the early changes happening within the parenchyma and multiple features, but particularly the mean gray-level intensity at different filter scales (fine, medium, and coarse), showed promise in differentiating no fibrosis (F0) from fibrosis (F1–F4), suggesting texture and volumetric measurements may be complementary. These changes in mean gray-level intensity and entropy may occur as the homogeneous normal background liver develops areas of higher attenuation fibrosis (increasing mean gray-level intensity with increased degree of fibrosis). In addition, CTTA is less time and labor intensive than volumes, although both can be obtained retrospectively.

Another CT imaging biomarker that has shown promising in assessing stage of hepatic fibrosis is liver surface nodularity (LSN), which is a simple objective measurement that quantifies the amount of surface nodularity along a section of liver [14]. For significant fibrosis, LSN showed an ROC AUC value of 0.902, for advanced fibrosis, 0.932, and for cirrhosis, 0.959 [15]. This also showed promise in early stages of disease with an ROC AUC of 0.903 for F0 vs. F1–F4 [15]. Like CTTA, the liver surface nodularity score is likely capturing changes that occur before substantial volumetric changes are seen.

Given that all three of these measures can be easily retrospectively obtained and may be complementary, multi-parametric assessment using all three may demonstrate improved performance, and this is a goal of ongoing future work.

Limitations of this study include that texture analysis was only performed on a single slice of the liver rather than the entire volume of the liver. However, several studies have shown that use of a single slice is sufficient for sampling and extracting subtle features relevant for the clinical application being evaluated [23, 42]. In addition, the cohort was a pooled group of liver disease with a variety of etiologies included. Future work with larger, disease-specific cohorts is planned. Although all the patients with intermediate stages of fibrosis underwent percutaneous biopsy, none of the patients in the normal cohort and some of the patients in the cirrhotic cohort did not have biopsy/tissue sampling. It is possible that some of our healthy controls may have had early unsuspected fibrosis, although the differentiation of F0 from F1–F4 seen consistently in this cohort might argue against that. For our F4 cohort, the decision not to require biopsy (although about half had a biopsy) was based on clinical practice of our hepatologists, who often do not order liver biopsy if the patient has an established etiology for liver disease, imaging findings of cirrhosis, and clinical complications of cirrhosis/portal hypertension. There were slight technical differences in some of the CT scan parameters, although the same phase of contrast was always utilized. A filtration-histogram method was used to reduce some of the heterogeneity of the images due to technical factors, but additional gray-scale normalization was not performed.

In conclusion, CT texture analysis of the liver parenchyma, particularly segments IV–VIII, may be a useful, non-invasive imaging biomarker for staging hepatic fibrosis that can easily be performed retrospectively or from routinely acquired CT and on serial CT examinations.