Introduction

Nonalcoholic fatty liver disease (NAFLD) is a common condition worldwide, with a prevalence in the US of 19% [1]. Within the NAFLD spectrum, non-alcoholic steatohepatitis (NASH) causes severe liver diseases including advanced fibrosis, hepatic failure, and hepatocellular carcinoma [2]. Ultrasound-guided liver biopsy is the gold standard diagnostic test for NASH, but screening biopsies are discouraged as they are costly and invasive [3]. Thus, it is important that a method is developed for the noninvasive identification of patients who are at high risk of NASH, and in whom a biopsy as part of further diagnostic work-up is warranted.

Previous research has shown that several serum markers are elevated in patients with NASH compared with their levels in controls. Elevated markers include alanine aminotransferase (ALT) [4, 5], gamma glutamyl-transferase [5], and cytokeratin-18 [6]. However, other studies have shown a lack of an apparent threshold for serum ALT [7] and low diagnostic sensitivity for serum cytokeratin-18 in patients with NASH [8].

Regarding imaging techniques, some studies have demonstrated a correlation between steatosis grade obtained using abdominal ultrasonography and the risk of NASH [4, 5]. In addition, several groups have used computed tomography (CT) texture analysis to aid in the staging of hepatic fibrosis [9,10,11] as related to the progression of NASH. However, despite the routine availability of non-contrast-enhanced CT (NECT), no study has yet sought to diagnose NASH using this modality.

Texture analysis quantifies regional heterogeneity [12, 13], which is a recognized feature of malignant tumour biology [14]. In texture analysis, regional heterogeneity is represented by texture parameters calculated based on a variety of mathematical methods to evaluate the grey-level intensity of a pixel and its position within an image [12, 14]. The most common method is to quantify statistical values from the grey-level frequency distribution within the region of interest (ROI), which can be obtained from the histogram of pixel intensities. Recently, texture analysis has been applied to the assessment of several pathologies, including tumours [15,16,17,18].

We hypothesized that CT texture analysis could be used to identify patients at high risk of NASH. As pre-existing liver fibrosis can affect CT texture features [19], we chose to group patients based on the biochemical suspicion of pre-existing fibrosis. CT texture and its relationship to NASH were assessed separately in these groups. Our purpose was to explore whether CT texture analysis could be used to predict the presence of NASH accurately.

Materials and methods

This study was approved by the Research Ethics Committee as a retrospective data analysis for medical imaging-based diagnoses (2561-12). The committee waived the requirement for informed consent.

Patients

Initially, a total of 143 patients who underwent a liver biopsy at our institution between August 2006 and April 2016 for the diagnosis of suspected NASH were recruited for this study. Patients who used alcohol (>30 g/day in men and >20 g/day in women) were excluded. A total of 88 patients (average age 64.5 ± 12.0 years, 50 men and 38 women) with suspected NASH based on abnormal liver function test results underwent both abdominal NECT and liver biopsy during this period. The mean interval between NECT and liver biopsy was 2.9 months. All relevant laboratory data were obtained from the patient electronic medical records, including the levels of serum albumin (ALB), aspartate aminotransferase (AST), ALT, serum hyaluronic acid, total bilirubin (T-Bil), prothrombin time (PT), and the international normalized ratio (INR). Histories of alcohol intake were obtained from a database of liver biopsies maintained by the Department of Gastroenterology and Hepatology.

CT image acquisition

CT was performed using a 320-detector row scanner (Aquilion One; Toshiba Medical Systems Corp., 11 patients) or various 64-detector row scanners (Aquilion Prime in 11 patients, Aquilion 64 scanner in 34 patients, both Toshiba Medical Systems; Discovery CT750 HD in 27 patients, LightSpeed VCT in 5 patients, both GE Medical Systems). All scanners used the identical clinical protocol: 120 kV, 100–450 mA (depending on the anatomical location; automatic exposure control systems were employed), 0.5 s per rotation, pitch 0.8–1.4:1, matrix 512, thickness 5 mm, and field-of-view 400–500 mm. For image reconstruction, filtered back-projection was used with a soft-tissue kernel (FC03 in 25 patients, FC04 in 31 patients) with the Aquilion One, Aquilion Prime and Aquilion 64 scanners; or a standard kernel in 32 patients with the Discovery CT750 HD and LightSpeed VCT scanners). Five NECT examinations were performed as noncontrast studies alone and 83 NECT examinations were combined with contrast-enhanced CT.

Texture analysis

A single image slice from the level of the umbilical portion of the portal vein was loaded into the workstation for texture analysis. The TexRAD (TexRAD Ltd.) proprietary research algorithm was used to quantify the textural properties of each image. A circular ROI 2 cm in diameter was placed in the right anterior segment of the liver by the operator, a board-certified radiologist with 5 years’ experience with abdominal CT in conjunction with a board-certified radiologist with 23 years’ experience of abdominal CT. The diameter was chosen so as to maximize the size of the ROI while excluding intrahepatic vessels. The right anterior segment was chosen as it is typically the location selected for biopsy, and avoids potential artefacts caused by the vertebral bodies and the heart.

Texture assessment involved an initial filtration step. A Laplacian of Gaussian spatial band-pass filter was used for the selective extraction of features of differing size and intensity with three spatial scaling factors (SSF): fine (feature width about 1 mm), medium (feature width about 2 mm), and coarse (feature width about 4 mm; Fig. 1). Filtration was applied to reduce CT photon noise, which may affect texture assessment [14]. The SSF was noted following the analyzed parameters. For example, the calculated mean in unfiltered images was recorded as mean0 and the calculated entropy in filtered images with an SSF of 4 mm was recorded as entropy4. Heterogeneity was obtained by analysis of the ROI histogram and each feature of heterogeneity was set as a parameter according to previous studies [12, 14]. The average grey-level intensity of pixels inside the ROI and the variation from the mean intensity were set as parameter mean and standard deviation, respectively (Fig. 2). The average intensity of the positive grey-level signals from pixels within the ROI was set as parameter mean of positive pixels (Fig 3). Asymmetry and pointedness of the distribution were set as parameters skewness and kurtosis, respectively (Figs. 4 and 5). The irregularity or complexity of the grey-level intensity was set as parameter entropy. Generally, a lower entropy represents a more regular (i.e. homogeneous) distribution and a higher entropy represents a more irregular/complex (i.e. heterogeneous) distribution.

Fig. 1
figure 1

Texture analysis of CT images of the liver. a A non-contrast-enhanced CT image in a 57-year-old man shows a region of interest in the right anterior lobe. a–c The corresponding postprocessed CT images reveal (b) fine, (c) medium and (d) coarse textures obtained with fine (1 mm), medium (2 mm) and coarse (4 mm) filtration, respectively. The images are displayed using a red–blue scale indicating negative–positive pixels respectively.

Fig. 2
figure 2

Illustration of the parameter ‘mean’ (the simple average of all the pixel values)

Fig. 3
figure 3

Illustration of the mean positive pixel value (MPP)

Fig. 4
figure 4

Illustration of positive and negative skewness. The parameter ‘skewness’ represents the degree and direction of asymmetry

Fig. 5
figure 5

Illustration of positive and negative kurtosis. The parameter ‘kurtosis’ represents the extent of the distribution’s tail (single line positive kurtosis, double line negative kurtosis, dashed line normal distribution)

Liver biopsy and pathology

A percutaneous 16-gauge needle core biopsy of the right hepatic lobe was performed using ultrasound by a hepatologist blinded to the clinical data and study design. The Department of Pathology processed all specimens. Biopsy reports were prepared using the NASH Clinical Research Network histological scoring scheme [20]. This score incorporates both the NAFLD Activity Score (NAS; grades 0–8) and fibrosis stage (grades 0, 1a, 1b, 1c, 2, 3 and 4). The NAS is the sum of the steatosis score (0–3), the lobular inflammation score (0–3), and the hepatocyte ballooning score (0/2).

Statistical analysis

Patients were divided into two groups in ascending order of the date of liver biopsy to form a learning dataset comprising 53 patients (60%) and a validation dataset comprising 35 patients (40%). As fibrosis influences CT texture [19], the patients were further subdivided based on the suspicion of pre-existing fibrosis (Fig. 6). A serum hyaluronic acid cut-off level of 50 μg/L was used to predict fibrosis grades 3 and 4 [21].

Fig. 6
figure 6

Patient flow diagram (s/o suspicion of)

The learning dataset group with and without suspicion of fibrosis was used to select appropriate variables and to create a multiple logistic regression model that predicted the presence or absence of NASH. NASH was considered present when the NAS was ≥3, as scores of 0 and 2 are not diagnostic of steatohepatitis [20]. First, we constructed receiver operating characteristic (ROC) curves to assess which parameters derived from texture analysis best predicted NAS ≥3, and the areas under the ROC curves (AUCs) of each parameter were measured, including the mean, standard deviation, entropy, mean of positive pixels, skewness, and kurtosis. Parameters with AUCs >0.65 were then subjected to stepwise regression analysis including bidirectional elimination (inclusion and exclusion thresholds, 0.25) to create a multiple logistic model predictive of NASH. The cut-off value was selected according to the Youden index (the highest sum of sensitivity plus specificity minus one). Finally, the multiple logistic model and the defined cut-off values were applied to the validation dataset, and diagnostic performance, including accuracy, sensitivity, specificity, and positive and negative predictive values, was calculated.

The patients’ background characteristics including age, ALT, AST, serum hyaluronic acid, ALB, T-Bil, PT-INR and NAS were compared using a t test, and Child–Pugh class, ascites grade, steatosis scores, lobular inflammation status, hepatocyte ballooning and fibrosis stage were compared using the chi-squared test between patients with and without suspicion of fibrosis. All statistical analyses were performed with JMP 11.2.0 (SAS Inc., Cary, NC). All data are presented as means ± standard deviation.

Results

Patient data grouped by serum hyaluronic acid status are shown in Table 1. Patients with elevated serum hyaluronic acid levels had significantly higher age, Child–Pugh scores, T-Bil levels, PT-INR, pathological steatosis scores, and fibrosis stage. In addition, patients with elevated serum hyaluronic acid levels had significantly lower ALT and ALB. Of the 36 patients without elevated levels of serum hyaluronic acid (<50 μg/L), 28 (77.8%) had fibrosis stages 0–2. Of the 52 patients with elevated serum hyaluronic acid levels (≥50 μg/L), 45 (86.5%) had fibrosis stage 3 or 4. A cut-off value of 50 μg/L was effective in predicting the presence or absence of liver fibrosis in 83% of patients (73/88). The NAS scores did not differ significantly between the two groups.

Table 1 Patient data

Table 2 shows the AUCs of the texture parameters predicting NAS ≥3, derived from ROC analysis of the learning dataset. There were 12 parameters with AUCs >0.65 in patients without elevated serum hyaluronic acid levels. Mean0 and skewness2 remained after stepwise regression analysis, and a multiple logistic model with these parameters was used for prediction of NASH in this group. The cut-off value obtained was 0.45:

$$ \ln \left(p/\left(1-p\right)\right)=-12.4+0.27\times \mathrm{mean}0+4.06\times \mathrm{skewness}2 $$
Table 2 AUCs of texture parameters for NASH prediction

When the predictive model was applied to the learning and validation datasets, the AUC values were 0.93 and 0.94, respectively. The diagnostic performance values of the predictive model using the cut-off value obtained were: accuracy 94%, specificity 92%, sensitivity 100%, positive predictive value 100%, and negative predictive value 80%.

In the learning dataset patients with elevated levels of serum hyaluronic acid, only mean0 and kurtosis4 had AUCs >0.65. Both parameters remained following stepwise regression analysis. A multiple logistic model using these two parameters was used for prediction of NASH in this group. The cut-off value obtained was 0.81:

$$ \ln \left(p/\left(1-p\right)\right)=-10.18+0.18\times \mathrm{mean}0+0.69\times \mathrm{kurtosis}4 $$

When the predictive model was applied to the learning and validation datasets, the AUC values were 0.81 and 0.60, respectively. The diagnostic performance values of the predictive model with the cut-off value obtained were: accuracy 42%, specificity 31%, sensitivity 100%, positive predictive value 100%, and negative predictive value 21%.

Discussion

This is the first study to show that NECT texture features can predict the risk of NASH. As discussed above, NAS ≥3 indicates the strong possibility of steatohepatitis and NASH. In patients without elevated levels of serum hyaluronic acid, mean0 and skewness2 were modest predictors of NAS ≥3. A predictive model using these two parameters predicted NASH with an accuracy of 92% in the validation dataset. In patients with suspected pre-existing liver fibrosis as indicated by elevated levels of serum hyaluronic acid, mean0 and kurtosis4 were modest predictors of NAS ≥3. In this group, a predictive model using these two parameters predicted NASH in the validation dataset with only 42% accuracy. Although CT is widely used to screen for hepatocellular carcinoma in patients with a fatty liver, CT has not been used to diagnose NASH [2]. We found that NECT texture features accurately predicted a higher NAS in patients without elevated serum hyaluronic acid levels.

A recent study found that multiparametric assessment using MRI can distinguish NASH from simple steatosis [22]. However, MRI signal values and acquisition protocols vary, and between-hospital results may be more variable than those found using CT, with which absolute tissue densities are calculated in Hounsfield units. Furthermore, MRI is more expensive and less accessible than CT. We used NECT data in the present study because NECT is essential for the diagnosis of fatty liver, and texture features of contrast-enhanced CT images are influenced by both the acquisition protocol and haemodynamic factors. Dynamic contrast-enhanced CT is recommended for all patients with a diagnosis of NASH but not for all patients with suspected NASH. Whatever the indication, NECT is commonly available and could also be used to provide information for texture analysis without the need for additional imaging.

The unfiltered means are the CT attenuations. Low-level attenuation reflects both macroscopic fat and fat-containing lesions with various proportions of fat and water [23]. NASH is pathologically characterized by steatosis, inflammation and hepatocellular ballooning. Both steatosis and inflammation reduce liver attenuation. The accumulations of fat and water induced by inflammation are less dense than normal hepatic parenchyma. Therefore, it is expected that low-level CT attenuation accompanies NASH. We found that skewness2 also contributed to NASH prediction. Skewness reflects asymmetry of the pixel histogram of the ROI [14]. This may be related to an unusual distribution of hepatic tissue in patients with NASH.

In patients with suspected fibrosis, defined as a serum hyaluronic acid level >50 μg/L, texture features did not accurately predict NAS ≥3. Liver CT texture features are modified by fibrosis [9,10,11], and fibrosis may mask the texture features of NASH. It is therefore difficult to predict NASH in patients with advanced liver fibrosis. In such patients, an alternative noninvasive imaging method is required.

We performed first-order texture analysis by constructing histograms. Some reports have indicated that higher-order texture analysis is helpful in the diagnosis of liver metastasis [24], prostate cancer [25] and glioblastoma [26]. Histogram analysis is simpler and more practical, and should soon be routinely available on DICOM viewers.

We used elevated serum hyaluronic acid levels to predict liver fibrosis of stages 3 and 4. Previous studies have shown that a cut-off value of 42.0 μg/L has high sensitivity and specificity (100% and 89%, respectively) [27], as does a cut-off value of 46.1 μg/L (85% and 80%, respectively) [21]. In our study, a serum hyaluronic acid cut-off value of 50 μg/L was effective in predicting the presence or absence of liver fibrosis in 83% (73/88) of patients. A previous study has shown that fibrosis influences texture features in CT images of patients with varying extents of hepatic fibrosis [19]. In this study, we found that hepatic fibrosis masks the characteristic texture features of NASH, and that these features are more pronounced in patients without suspicion of pre-existing fibrosis.

Our study had several limitations. First, several different CT scanners were used. The scan parameters were controlled in a single institution and interscanner differences in radiomic features were known [28]. In addition, filtration was used, which reduces the effect of photon noise when assessing texture features [28] and may have decreased interscanner differences. Second, the study population was small. This was because the indications for liver biopsy in patients with suspected NASH are limited. NASH is typically diagnosed by excluding other possible causes of hepatic impairment such as alcohol consumption, hepatitis B or C virus infection, and drug-induced hepatitis. However, pathological data were available for all patients, and our new noninvasive diagnostic method was useful when selecting patients for liver biopsy.

Third, despite avoiding visible hepatic vessels by manually creating ROIs in the anterior right lobe of the liver, it is possible that some ROIs may have contained some portion of a vessel. It can be difficult to identify vessels in a fatty liver because the vessels and the liver parenchyma are of similar density. However, the vessel area in the peripheral zone of the liver is small. Fourth, we used a single slice for texture analysis in this study. In some reported studies, three-dimensional ROIs have been used for texture analysis, particularly when performing assessments for neoplasm [29,30,31]. The heterogeneity inside the tumour is thought to be associated with variations in genomic subtype, gene expression, angiogenic factors, and the tumour microenvironment [32]. These variations depend on the location in the tumour and a three-dimensional analysis reveals texture features in the whole lesion. The liver in NASH may show less location-dependent variance than tumours. Therefore, we believe that the texture features were sufficiently demonstrated using a single-slice, two-dimensional assessment.

Fifth, radiation exposure is an inevitable issue in CT. Ultrasonography is a simple and noninvasive examination; however, the procedure depends on the skill of the operator, patient body habitus, and overlying bowel gas. MRI can provide reproducible images without radiation exposure, but CT is less expensive and more accessible than MRI, as discussed above. Therefore, we believe that it is meaningful to establish the assessment of NASH using CT. Finally, as our population consisted of only patients with the suspicion of NASH from their clinical information, it initially had a high pretest probability of NASH, which was considered to have contributed to the high sensitivity of the results.

Our study results might be useful in assessment before liver biopsy, allowing an unnecessary, invasive procedure to be avoided. To confirm the applicability of our results for distinguishing patients with NASH from the more general population, further cohort studies including more patients without suspicion of NASH are required.

Conclusions

In conclusion, texture analysis of NECT images allowed prediction of the presence of NASH with an accuracy of 92% in patients without fibrosis. In patients with elevated hyaluronic acid levels, associated with the presence of fibrosis, no model provided accurate prediction.