Introduction

Spontaneous bacterial peritonitis (SBP) developed ranging from 20 to 30% in patients with cirrhosis and ascites at 1-year follow-up, with a mortality rate of 20–40% [5, 12, 19]. After an episode of SBP, recurrence and mortality rates increased sharply [10]. The available evidence indicated that antibiotic prophylaxis was effective in the prevention of SBP [8, 16]. However, several problems would arise in long-term primary prophylaxis for first episode of SBP. Cost issues aside, appearance of resistant bacteria strains and the risk of subsequent development of SBP caused by resistant organisms should be considered [4, 7]. It is urgent to identify subgroups of cirrhotic patients with ascites who is at high risk of developing first episode of SBP, and can benefit from antibiotic prophylactic therapy.

Several risk indicators of SBP had been proposed in recent years, such as low ascites fluid protein level, platelet count and high serum bilirubin [2, 11, 15]. Moreover, model of end-stage liver disease (MELD) score, which was originally established to evaluate short-term mortality risk in patients with cirrhosis undergoing transjugular intrahepatic portosystemic shunt [17], had also been used to assess risk for SBP in patients with cirrhosis [9, 18]. However, clinically practical method of risk stratification of SBP in patients with different combinations of parameters was not well established. Combinations of the variables with optimal cut-off values would be an easier approach in the evaluation of risk for SBP.

Classification and regression tree (CART) analysis belongs to a family of nonparametric regression methods. The CART analysis is a tree-based classification and prediction method that uses recursive partitioning to split the training records into segments with similar output field values. At each split, the best predictor is automatically selected based on the modeling method used and 2 child nodes were generated. In turn, two child nodes may become parent nodes yielding additional child nodes. This process continues until no additional significant variable was detected or the sample size of the subgroup was below 20. As a result, the decision tree models are produced, which are easy to use and understand. It can be used simply to identify patients at different levels of risk by combining different variables. CART analysis had provided accurate prognostic models in different medical areas [3, 13, 21]. However, this method had not been used in the risk assessment of SBP development in cirrhotic patients with ascites.

In this study, CART analysis was used to develop a simple and accurate prediction model for stratifying cirrhotic patients with ascites, according to their risk of developing a first episode of SBP. Furthermore, the novel model would be tested in an independent cohort. Furthermore, we tested how well the new discrimination model could stratify SBP among patients with MELD scores under 15, in which SBP occurrence rate was low.

Patients and methods

Study population for development of CART model (derivation cohort)

Medical records of 1,381 consecutive cirrhotic patients with ascites who underwent diagnostic paracentesis within 48 h of admission to the First Affiliated Hospital of Wenzhou Medical College were reviewed from January 2007 to December 2009. Data concerning demographic information, medical history, clinical characteristics, laboratory values, co-morbidities, and physical exam findings were collected retrospectively. Exclusion criteria were: (1) patients admitted with gastrointestinal hemorrhage, or a previous history of gastrointestinal hemorrhage or SBP, because most of such patients had received antibiotic prophylaxis for SBP [10]; (2) liver cancer; (3) patients who had antibiotic administration within 2 weeks before admission; (4) patients with a potential confounding etiology for ascites unrelated to cirrhosis, such as peritoneal carcinomatosis, pancreatitis, tuberculosis, hemorrhage into ascites, or congestive heart failure; (5) for early identification of patients at risk for SBP, patients with fever, abdominal pain, or hepatic encephalopathy (HE) were excluded from the study; (6) nosocomial-acquired SBP. Seven hundred and five patients were excluded from the analysis: 116 subjects suffered from gastrointestinal hemorrhage, 137 patients had a previous history of SBP, 68 patients had a previous history of gastrointestinal hemorrhage, 132 patients had liver cancer, 58 patients received antibiotic treatment within 2 weeks before admission, 35 patients had confounding etiologies for ascites, 86 patients with fever, abdominal pain, or HE, and 73 patients with nosocomial-acquired SBP. Thus, a total of 676 eligible patients were included in the derivation cohort. The flow of inclusion and exclusion was shown in Fig. 1. This study was approved by the Ethical Committees of the First Affiliated Hospital of Wenzhou Medical College. All patients provided written informed consent.

Fig. 1
figure 1

Flow diagram of patients included in the study. *An independent cohort was prospectively enrolled for the validation of the derived CART model between January 2010 and December 2010. #Patients with MELD scores under 15 in derivation cohort and validation cohort. Dotted line showed that the patients in subgroup analysis were selected from derivation cohort and validation cohort. AUROC area under the receiver operating characteristic curve, CART classification and regression tree, HE hepatic encephalopathy, MELD model of end-stage liver disease, SBP spontaneous bacterial peritonitis

Definitions

The diagnosis of cirrhosis was based on the results of the combination of physical, laboratory, and radiologic examination results or endoscopic signs of portal hypertension. Ascites was confirmed by ultrasonography. SBP was defined as polymorphonuclear cells count greater than or equal to 250 per cubic millimeter with or without a positive culture of the ascetic fluid, in the absence of finding suggestive of secondary peritonitis [5]. A community-acquired SBP episode was considered in any case diagnosed during the first 48 h of hospitalization. The diagnosis of liver cancer was made based on computer tomography and/or magnetic resonance imaging and/or hepatic angiogram.

CART analysis

In the study, CART analysis was applied to determine which cirrhotic patients with ascites would develop a first episode of SBP, solely based on the pre-paracentesis objective variables. The terminal subgroups were most homogeneous with respect to the probability of SBP development. Then, cirrhotic patients at low, intermediate, and high risk for SBP were identified according to the rates of SBP of subgroups. SBP occurrence rates for these risk groups and the odds ratios (ORs) and 95% confident interval (CI) between risk groups were determined.

Sample size for the validation cohort

To have an appropriate set for validation, it was planned that the validation set should have at least 5 events (SBP) for each variable included in the model derived from the derivation set [1]. Therefore, an independent cohort which consisted of 198 consecutive cirrhotic patients with ascites underwent diagnostic paracentesis was prospectively enrolled at the First Affiliated Hospital of Wenzhou Medical College between January 2010 and December 2010. The study protocols for these subjects were identical to those used in the development phase. The values of pre-paracentesis variables were taken within 24 h of admission.

CART model validation and comparison

The ability of the derived tree to stratify cirrhotic patients into different risk of SBP was tested in the independent validation cohort. The patients from the validation cohort were allocated to subgroups using the flowchart of the derived CART tree. The patients at low, intermediate, and high risk for SBP were identified and the SBP OR and 95% CI between risk groups were determined. In order to assess model calibration, the correlation between the model derivation and the validation dataset was also calculated. To validate the model further, we compared its performance with that of MELD score using the area under the receiver operating characteristic curve (AUROC) in both cohorts. MELD score had been proved to be associated with risk for SBP and then was chosen as the reference standard. It was calculated according to the modified Malinchoc formula: R = 9.57 × loge(creatinine [mg/dl]) + 3.78 × loge(bilirubin [mg/dl]) + 11.2 × loge(INR) + 6.43 × (aetiology: 0 if cholestatic or alcoholic, 1 otherwise).

Subgroup analysis

An important application of the new discrimination model in cirrhosis with ascites was to identify patients at risk for SBP with low MELD scores. We were interested in determining how well the new discrimination model could stratify SBP among patients with MELD scores under 15, in which SBP occurrence rate was low [18]. Therefore, we performed a subgroup analysis in which we applied the new prediction rule exclusively to patients with MELD scores under 15 in derivation cohort and validation cohort.

Statistical analysis

Continuous variables were expressed by mean ± SD and categorical values were described by count and proportions. For comparison of different groups, the Mann–Whitney U test and the χ2 test were used to compare continuous and categorical variables, respectively. CART analysis was performed using data mining software Clementine version 12.0 (SPSS Inc, Chicago, IL). Additional statistical analysis was performed in SPSS 13.0 software (SPSS Inc, Chicago, IL). ROC curve analysis was computed using MedCalc 10.0 software (Mariakerke, Belgium). For all analyses, a P value of <0.05 was considered statistically significant.

Result

Characteristics of patients in the derivation and validation cohorts

In the derivation cohort, 676 cases were enrolled for CART model development, of which 153 patients (22.6%) had SBP. The mean age was 54.3 ± 10.0 years, 493 were male (72.9%), and hepatitis B virus (HBV) was the most frequent etiology of cirrhosis (64.8%). A total of 198 subjects were prospectively enrolled in the validation cohort. Sixty-nine patients (34.8%) were found to have SBP. The mean age was 53.6 ± 10.6 years, 145 were male (73.2%), and HBV was also the predominant etiology of cirrhosis (57.1%). There was a significant increase in overall SBP occurrence between the derivation and validation cohorts (P = 0.001). Distributions for demographic and clinical features between the two cohorts were depicted in Table 1.

Table 1 Baseline characteristics of the derivation cohort and validation cohort

CART analysis

Twenty variables were evaluated in this CART analysis, of which the CART method identified serum creatinine (Scr) as the variable of initial split with an optimal cut-off of 79.5 μmol/l. Among patients with Scr lower than 79.5 μmol/l, prothrombin time (PT) was selected as the variable of second split at a discrimination level of 22.95 s (s). The next best predictor of SBP in the lower PT node was total bilirubin (TB) with an optimal cut-off of 72.5 μmol/l. For the node with patients having a Scr level of lower than 79.5 μmol/l, PT of less than 22.95 s, and TB level of lower than 72.5 μmol/l, white blood cell (WBC) counts was selected as additional significant variable, dichotomized at a level of 6.85 × 103/mm3. Among patients with Scr levels higher than 79.5 μmol/l, TB was selected as the next best predictor of SBP and dichotomized at a level of 63.5 μmol/l. Any additional risk nodes involving additional variables could not be generated to offered incremental risk discrimination. Therefore, a total of 6 subgroups of patients were produced by 4 predictive variables selected in this CART analysis: subgroup 1 (Scr ≤ 79.5 μmol/l and PT > 22.95 s), subgroup 2 (Scr > 79.5 μmol/l and TB ≤ 63.5 μmol/l), subgroup 3 (Scr > 79.5 μmol/l and TB > 63.5 μmol/l), subgroup 4 (Scr ≤ 79.5 μmol/l, PT ≤ 22.95 s and TB > 72.5 μmol/l), subgroup 5 (Scr ≤ 79.5 μmol/l, PT ≤ 22.95 s, TB ≤ 72.5 μmol/l and WBC ≤ 6.85 × 103/mm3), and subgroup 6 (Scr ≤ 79.5 μmol/l, PT ≤ 22.95 s, TB ≤ 72.5 μmol/l and WBC > 6.85 × 103/mm3) (Fig. 2).

Fig. 2
figure 2

Predictors of SBP and risk stratification for the derivation cohort. Terminal subgroups of patients discriminated by the analysis were numbered from one to six. PT prothrombin time, SBP spontaneous bacterial peritonitis, Scr serum creatinine, TB total bilirubine, WBC white blood cell

The probabilities of SBP occurrence for the 6 subgroups were highly distinct. Patients were stratified into 3 risk levels: a low risk group (subgroup 5) with SBP rate of 2.0%, an intermediate risk group (subgroup 2, subgroup 4 and subgroup 6) with the SBP rate ranging from 27.5 to 33.3%, and a high risk group (subgroup 1 and subgroup 3) with SBP rate ranging from 60.6 to 86.4%. Subjects in the intermediate risk and high risk groups had 22.21-fold (95% CI, 9.98–49.45, P < 0.001) and 173.50-fold (95% CI, 77.68–634.33, P < 0.001) increased rates of SBP, respectively, compared with those in the low risk group (Table 2).

Table 2 SBP between risk groups

CART tree validation and comparison

The decision tree generated by CART analysis was validated for its ability to risk-stratify patients in the validation cohort of 198 subjects, which was independent of the model building dataset. Each patient was allocated to subgroups according to flowchart of the derived CART tree. The rates of SBP in each subgroup were shown in Supplementary Fig. 1, which were closely correlated with those in the model building dataset (r2 = 0.963) (Fig. 3). This CART tree was also able to stratify patients in the validation cohort into high, intermediate, and low risk. Subjects in the intermediate risk and high risk groups had 19.33-fold (95% CI, 5.56–57.09, P < 0.001) and 369.75-fold (95% CI, 107.89–2175.98, P < 0.001) increased rates of SBP, respectively, compared with those in the low risk group, which were similar to those of the derivation cohort (Table 2).

Fig. 3
figure 3

Consensus analysis for derivation cohort and validation cohort: subgroup-stratified comparison of the SBP rate. The rate of SBP in each subgroup was plotted. The X axis represents the model building, and the Y axis represents the validation datasets. SBP spontaneous bacterial peritonitis

The performance of the derived CART tree was compared with MELD score. The CART tree had an excellent predictive accuracy, with AUROC significantly better than those of MELD score (0.881 vs. 0.791, P < 0.001) (Fig. 4a), and also was reproducible when applied to the validation set (0.924 vs. 0.858, P = 0.025) (Fig. 4b).

Fig. 4
figure 4

Comparison of the AUROC for CART Model and MELD score. a Derivation set; b validation set; c subgroup analysis. AUROC area under the receiver operating characteristic curve, CART classification and regression tree, CI confident interval, MELD model of end-stage liver disease

Subgroup analysis (stratification patients with MELD scores under 15)

There were 512 patients with MELD scores under 15 in the derivation and validation cohorts for subgroup analysis. In these cases, there were 51 patients with SBP (10.0%). The observed patients in this subgroup analysis were stratified into high, intermediate, and low risk according to the CART tree (Supplementary Table 1). This risk tree displayed its reliable ability to identify patients with increased risk of SBP in this subgroup analysis. The AUROCs of the CART tree and MELD score for prediction of SBP in the subgroup analysis were 0.869 and 0.663, respectively (P < 0.001) (Fig. 4c).

Discussion

The data from this large single-center cohort indicated that the prevalence of the first episode SBP was not improved in the past few years compared with historical cohorts [14]. In our study, there was a significant increase in overall SBP occurrence in validation cohort. Based on the data presented in Table 1, the difference was mainly owing to the different severity of the disease between the two cohorts; more severe disease was observed in the validation cohort. This would probably be due to higher distribution of the patients with HBV and Alcohol related cirrhosis. Furthermore, the incidence of SBP varied greatly in the different risk groups (low risk group, 2.0–2.2%; high risk group, 57.1–94.0%). Therefore, it was important to identify the patients with high risk of SBP who could benefit from antibiotic prophylactic treatments.

In the present study, we had derived a novel prediction model to assess risk of SBP in cirrhotic patients with ascites. Four laboratory variables were highlighted in the analysis: Scr, TB, PT and WBC count. The patients were classified to 6 subgroups with different probabilities of SBP based on these variables. Patients were stratified into 3 risk groups according to the rate of SBP: 2.0% for low risk group, 27.5–33.3% for intermediate risk group and 60.6–86.4% for high risk group, according to the result of CART analysis. Patients could be allocated to specific subgroups by following the flow-chart form. In addition, the reproducibility of the prediction model was validated by an independent prospective cohort, in which some of the characteristics of the patients, such as the rate of SBP and laboratory values, differed from the derivation cohort. Thus, the novel model might have the potential to support decisions making in selecting high risk patients for antibiotic prophylaxis.

In our study, the Scr was selected as the first split variable with 79.5 μmol/l in the CART model. Patients with cirrhosis and ascites had a circulatory dysfunction which played important roles in the pathophysiological processes of renal failure and would be aggravated by infection, such as SBP [6, 20]. Level of Scr would elevate in the process of development of SBP. The finding raised the possibility that improvement of circulatory function accompanying with Scr decreasing in cirrhotic patients with ascites might contribute to decrease the occurrence of the SBP. A large prospective study is needed to be conducted to confirm the hypothesis.

The association between the first SBP episode and liver insufficiency had been proofed in previous studies [2, 9]. In a large series of cirrhotic patients with ascites, increased PT and TB, which were considered as markers of liver dysfunction, were significant and independent predictors of a first SBP episode [2]. In risk analysis of a first community-acquired SBP in cirrhosis with low ascitic fluid protein levels, SBP risk increased with bilirubin concentration above 57.4 μmol/l [11]. Our data showed that PT and TB were selected as the second factors for splitting by CART in patients with Scr ≤ 79.5 μmol/l and Scr > 79.5 μmol/l, respectively. In addition, TB was also the third most important predictor for SBP in patients with PT ≤ 22.95 s. These findings indicated that antibiotic prophylaxis was needed for patients with severe liver insufficiency.

WBC defending against bacterial infection is usually the first responders to microbial infection. In cirrhotic patients, the spleen frequently enlarges and holds the WBC, reducing the number of these cells in blood. Elevating WBC count may indicate microbial infection. In our study, WBC count (6.85 × 103/mm3) could be a predicator for SBP in selected patients with Scr ≤ 79.5 μmol/l, PT ≤ 22.95 s and TB ≤ 72.5 μmol/l who were at low risk of SBP (4.6 and 4.1% in derivation and validation cohorts, respectively).

In addition to these parameters, other parameters were correlated with SBP in patients with cirrhosis include ascitic fluid opsonic activity, serum albumin, poor nutritional status, serum AST levels, ascitic fluid protein levels [2, 11]. Because multiple risk factors could exist in the same patient, risk factor analysis must consider factors in combination rather than isolation. Although no SBP risk stratification scheme was available for patients with cirrhosis and ascites, it had been proofed that increasing MELD score was independently associated with a higher risk of SBP. However, the complexity nature of MELD score had greatly limited its use. The number of variables and mathematical functions involved frequently require access to a computer or an electronic calculator to generate a score and then to determine risk made it impractical for bedside assessment. Similar to multivariate regression analysis, the CART method can detect interactions between variables. Moreover, it yields a decision tree that is relatively easy to apply at the bedside, leading to its potential use in a wide variety of clinical conditions. In current study, an intuitive decision tree (Fig. 2), based on the combined use of four variables (Scr, TB, PT and WBC count), usually available at patient’s bedside, allowed an early discrimination of low, intermediate, or high risk groups with clearly distinct rate of SBP. Each of variables could be easily obtained early in the course of general hospital admission. Furthermore, the accuracy of the new prediction model was better than that of the MELD score. The CART model could also accurately identify patients with MELD score below 15.

The validated CART model could be helpful in difficult medical decision making in cirrhotic patients with ascites. Patients stratified to be at the higher risk group should receive a paracentesis and might undergo higher-level monitoring and earlier antibiotic prophylaxis for SBP, while patients judged to be at lower risk group might be reassured and receive less intensive and delayed antibiotic prophylaxis. In addition, the model might be valuable in designing clinical trials involving antibiotic prophylaxis of SBP. It could identify homogeneous patients into uniform groups and balance the risk factors across treatment groups, especially when the outcome of interest is determined by many different variables. For example, patients at high risk for SBP which were selected by CART analysis could be chosen as inclusion subjects in the design of controlled trials.

There were some potential limitations of the current study must be acknowledged. First, the study population came exclusively from a single center and the derivation set were collected retrospectively. The novel model should be validated in the external cohorts prospectively. Second, for early identification of patients with cirrhosis and ascites at risk for SBP, only pre-paracentesis variables were analyzed in this study, and ascitic fluid characteristics, such as ascitic fluid protein level, were not assessed in the current model. Third, data in this study was obtained from inpatients which were at a relatively more advanced stage of the disease and were at higher risk of SBP than outpatients. It would be interesting and necessary to validate the novel model in these patients.

In summary, cirrhotic patients with ascites at low, intermediate, and high risk for SBP can be easily identified using CART model, which provides clinicians with a validated, practical bedside tool for SBP risk stratification.