Introduction

Hepatocellular carcinoma (HCC) is the second leading cause of cancer death in China and the third in the world (Tang 2001; Parkin et al. 2005). Surgical resection plays a major role in treatment for HCC and offers a chance of cure for patients (Bruix et al. 2006; Bruix and Sherman 2005). However, the prognosis for HCC patients with extrahepatic metastasis is poor (Uchino et al. 2011). Bone metastasis (BM) is one of the major sites of extrahepatic metastasis. We have previously reported that the frequency of BM in HCC patients who had undergone curative resection was 11.7% (Xiang et al. 2011a). BM from HCC itself rarely causes death, but it is a cause of pain and other significant symptoms that are detrimental to the quality of the patient’s life (He et al. 2009).

We have used a cDNA-mediated annealing, selection, extension, and ligation assay method to screen for predictive BM biomarkers, and we identified intratumoral connective tissue growth factor (CTGF) and interleukin (IL)-11(Xiang et al. 2011a). We also have reported previously that CXCR4 overexpression in primary tumor tissues was associated with BM from HCC (Xiang et al. 2009a). We therefore have become interested in the possibility of combining these biomarkers with certain clinical factors to establish a model for predicting BM from HCC. Until now, there has been no such prediction model or clinical scoring system. If HCC patients at high risk of BM may be screened out early, there are therapeutic measures (including administration of bisphosphonates) that may be taken to decrease the probability of BM and increase the quality of life.

The aim of the present study was to use the biomarkers and clinical factors, which we identified with relevance to BM, to set up a model to predict BM in HCC patients.

Methods

Patients and tissue specimens

The present retrospective study was based on two cohorts. Cohort 1 was used as the training set to derive a model for predicting BM in HCC patients, and cohort 2 served to validate the model.

From May 1999 to October 2005, there were 201 patients with pathologically proven HCC who underwent hepatectomy (all performed by the same surgical team) at the Liver Cancer Institute, Fudan University, and these were included in cohort 1. This group consisted of 171 men and 30 women with a mean age of 50.9 ± 10.3 years (range, 25–81 years). Of these patients, 156 (77.6%) were hepatitis B surface antigen (HBsAg)-positive.

The validation cohort, enrolled from January 2000 to March 2005, comprised 179 pathologically proven HCC patients who had all undergone hepatectomy by a second team at the same institution as cohort 1. This study population consisted of 156 men and 23 women with a mean age of 51.2 ± 10.0 years (range, 25–75 years). Of the 179 patients, 142 (79.3%) were infected with hepatitis B virus.

Tumor size of patients was based on the largest dimension of the tumor specimen. Vascular invasion was determined by microscopic examination of the resected specimen. Tumor stage was determined according to the UICC tumor-node-metastasis (TNM) classification system (6th edition). Tumor differentiation was graded by the Edmondson grading system. All patients received a bone scan prior to surgery to exclude BM. BM was not detected in any of the patients of cohort 1 or 2 at the time of surgery.

The inclusion criteria of tissue specimens were limited to the followings: a curative resection; an HCC diagnosis based on pathology; suitable formalin-fixed, paraffin-embedded tissue; no prior anticancer treatment; and complete clinicopathological and follow-up data for the patient.

The study protocol was approved by the Zhongshan Hospital research Ethics Committee. Informed consent was obtained from each patient in accordance with this committee’s regulations.

Follow-up and postoperative treatment

Follow-up was performed at 3-month intervals after hepatectomy. At every visit, the patient history was taken and a physical examination was performed. Chest radiography was performed every 6 months. Ultrasound images of the liver and abdominal lymph nodes and laboratory tests (liver function, α-fetoprotein, and hematologic parameters) were independently evaluated every 3 months by doctors who had no knowledge of the study. A bone scan was performed annually, and bone scanning or magnetic resonance imaging was immediately performed upon any report of localized bone pain. A diagnosis of BM was based on a history of HCC, presence of symptoms, and radiological imaging studies. The time interval between the date of surgery and the date of presentation of BM was recorded. Treatment modalities after relapse were administered as follows: when a diagnosis of BM was made, external beam radiotherapy was focused on the involved bone. Other site relapses received radiotherapy, interventional therapy, or surgery.

Tissue microarray and immunohistochemistry

Tissue microarrays (TMAs) were constructed as previously described (Xiang et al. 2011b, c). Hematoxylin- and eosin-stained TMA slides were screened to identify the optimal intratumoral tissue to use for analysis. TMA slides of training cohort and validation cohort, containing samples from a total of 380 HCC patients, were then constructed in collaboration with Shanghai Biochip Company, Ltd., Shanghai, China. Two tissue cores were collected from non-necrotic areas of tumor foci. Punch cores with a longest dimension of 1.0 mm were used. Sections (4-μm thickness) of the resulting TMA blocks were prepared by using standard techniques.

Immunohistochemistry was carried out as previously described (Xiang et al. 2009b). Primary antibodies used were mouse anti-human CXCR4 monoclonal antibody (R & D Systems, Minneapolis, MN); mouse anti-human CTGF monoclonal antibody, and rabbit anti-human IL-11 polyclonal antibody (Santa Cruz Biotechnology, Santa Cruz, CA). TMA slides were incubated with primary antibodies overnight at 4°C and then washed to remove excess primary antibody. The Envision-plus system (En-Vision+/HRP/Mo, Dako, Carpinteria, CA) was used for the detection step. Reaction products were visualized by incubation with 3,3′-diaminobenzidine. Sections were dehydrated, counterstained with hematoxylin, and mounted. Negative controls were identically treated, but the primary antibody incubation step was omitted.

Positive staining was quantified with a computerized imaging system consisting of a Leica CCD camera DFC420 connected to a Leica DM IRE2 microscope (Leica Microsystems Imaging Solutions, Ltd., Cambridge, UK). Immunohistochemical staining was assessed by three of the authors, who were blinded with respect to outcome. For CXCR4, the intensity of staining (brown color) was scored semi-quantitatively, as follows: +, weak; ++, medium; +++, strong; and ++++, very strong. Samples receiving a score of ++ or greater were considered CXCR4-positive (Shim et al. 2006). For CTGF and IL-11, we randomly selected 10 high-power fields (×400magnification; 100 cells/field) and counted 1,000 cells in each core. The percentages of positive cells expressing CTGF and IL-11 were categorized as follows: CTGF was considered highly expressed if there were ≥50% positive cells (Lin et al. 2005); IL-11 expression was considered positive if ≥10% of the cells were reactive (Yamazumi et al. 2006).

Statistical analysis

Statistical analyses were performed with SPSS 16.0 software (SPSS, Chicago, IL). A simple risk score was devised by using significant variables (P < 0.05) obtained from stepwise multivariate analysis. The score was the weighted sum of the variables; the individual variable weights were defined as the quotient (rounded to nearest integer) of the corresponding estimated coefficient from a Cox regression analysis that was then divided by the smallest chi-square (χ2) coefficient (Wong et al. 2010). The performance of the cutoff was determined by linear trend χ2 test in terms of the discriminatory ability and monotonicity (Feinstein 1972; Ueno et al. 2001). The score was then categorized as either low- or high-risk group. Time-to-BM was defined as the interval from the date of surgery to the date of presentation of BM, and it was analyzed by Kaplan–Meier and log-rank tests. Area under the curve (AUC) and the 95% confidence interval were used to assess the power of the model for predicting BM. A P value of <0.05 by a two-tailed test was judged to be significant.

Results

Patients’ background data

The characteristics of all 380 HCC patients (201 in cohort 1 and 179 in cohort 2) are summarized in Table 1. There was no significant difference in incidence of BM between the two cohorts. All patients of the training cohort were observed until January 2010, and the median follow-up time was 54.3 months (range, 3.9–118.9 months). For the validation cohort, observation continued until May 2010 and the median follow-up time was 52.5 months (range, 3.0–116.1 months). During the follow-up time, 23 patients (11.4%) in the training cohort and 19 patients (10.6%) in the validation cohort developed BM.

Table 1 Baseline demographics and clinicopathological characteristics of the study population

Expression of immunohistochemical biomarkers in TMA

CXCR4 staining was detected both in the cytoplasm and in nucleus. CTGF and IL-11 expression was mainly localized in the cytoplasm of tumor cells or hepatocytes. In the training cohort, CXCR4-positive expression was detected in 92 of 201 patients (45.8%), high CTGF expression in 35 (17.4%), and positive IL-11 expression in 51 (25.4%). In the validation cohort, CXCR4 positive expression was detected in 76 of 179 patients (42.5%), high CTGF expression in 34 (19.0%) and positive IL-11 expression in 43 (24.0%).

Predictors of BM

For the training cohort, 17 clinicopathological features were considered in the Cox proportional hazards regression univariate analysis. These consisted of age, gender, HBsAg, hepatitis C virus antibody, α-fetoprotein, alanine aminotransferase, liver cirrhosis, Child-Pugh score, tumor differentiation, tumor size, tumor number, tumor encapsulation, vascular invasion, TNM stage, CXCR4, CTGF, and IL-11. Table 2 summarizes the association of clinicopathological factors with BM in HCC patients of the training cohort. By univariate comparison, tumor differentiation, tumor number, vascular invasion, TNM stage, CXCR4, CTGF, and IL-11 were significantly associated (all P < 0.03) with subsequent BM in HCC patients. Age (P = 0.859), gender (P = 0.486), HBsAg (P = 0.118), HCV-Ab (P = 0.327), AFP (P = 0.332), ALT (P = 0.750), liver cirrhosis (P = 0.240), Child-Push score (P = 0.819), tumor size (P = 0.661), and tumor encapsulation (P = 0.864) were not significantly associated with BM. Variables that displayed prognostic significance by univariate analysis were adopted for multivariate analysis. By multivariate analysis, the following five independent variables were found to be significant in predicting the risk of BM: vascular invasion (P = 0.006), TNM stage (P = 0.025), CXCR4 (P = 0.005), CTGF (P = 0.003), and IL-11 (P = 0.001).

Table 2 Clinicopathological factors associated with bone metastasis of hepatocellular carcinoma in the training cohort

Derivation of prediction model of BM

In Table 2, the smallest χ2 of multivariate analysis was 3.157, and then the χ2 of the five factors (vascular invasion, TNM stage, CXCR4, CTGF, and IL-11) were divided by 3.157. As shown in Table 3, a simple risk score was devised using significant variables in the multivariate model. A score was attributed to each variable according to its relative contribution in the Cox proportional hazards model, as determined by the χ2 score. Then, every patient was scored according to the five clinicopathological factors status. Every patient got a total score from the sum of the five factors. The scores ranged from 0 to 19 both in the training cohort and in the validation cohort. In the training cohort, the cutoff point of 9.4 was the best in terms of discriminating between low and high risk of BM using linear trend χ2 test in terms of the discriminatory ability and monotonicity. In the training cohort, using 9.4 as cutoff points, 155/201 patients (77.1%) were in the low-risk category, and 46/201 (22.9%) were high risk. In the low-risk group, 8/155 patients (5.2%) developed BM, and 15/46 (32.6%) developed BM in the high-risk group. In the training cohort, analysis by receiver operating characteristic curve demonstrated that this model can predict BM in HCC patients, with an AUC of 0.809 (95% CI, 0.694–0.923; P < 0.001). The prediction sensitivity and specificity were 69.6% and 79.0%, respectively, over 5 years. Cox regression analysis identified that the hazard ratio for BM of the high- versus low-risk groups was 12.132 (95% CI: 5.248–28.049; P < 0.001).

Table 3 Components of the bone metastasis prediction score

Validation of results

In the validation cohort, 19/179 patients (10.6%) developed BM. Forty-eight patients had a score of ≥9.4. Using 9.4 as a cutoff point, 131/179 patients (73.2%) were categorized as low risk, and 48/179 (26.8%) as high risk. In the low-risk group, 5/131 patients (3.8%) developed BM, and 14/48 patients (29.2%) developed BM in the high-risk group. Receiver operating characteristic analysis demonstrated that the present model is able to predict BM in HCC patients (AUC of 0.762; 95% CI, 0.642–0.883; P < 0.001). The prediction sensitivity and specificity were 73.7 and 78.7%, respectively, over 5 years. The 5-year positive predictive value was 29.2%, and the negative predictive value was 96.2%.

Time-to-BM was analyzed by Kaplan–Meier and log-rank tests. Patients with a high score (≥9.4) were more likely to develop BM (P < 0.001). The 1- and 2-year cumulative BM rates were (respectively) 10.8 and 27.4% in the high-risk group, and 2.4 and 4.3% in the low-risk group. By Cox regression analysis, the hazard ratio for BM of the high- versus low-risk group was 9.240 (95% CI: 3.319–25.722; P < 0.001). Log-rank test showed patients in high-risk group had poor disease-free survival (P = 0.011) and overall survival (P = 0.007) than those in low-risk group.

Discussion

The increasing incidence of BM that develops from HCC may be attributed to the prolonged survival of HCC patients due to recent progress in both the diagnosis and treatment for the disease (Fukutomi et al. 2001). HCC patients with BM not only have a poor prognosis but also suffer from pain and other significant symptoms that are detrimental to quality of life. Once tumors metastasize to bone, they are usually incurable, the consequences of BM are often devastating (Roodman 2004). Severe pain, pathological fracture, and spinal cord compression may result; additionally, malignant hypercalcemia may develop, which can be life-threatening in patients with BM. We have reported that external beam radiotherapy can effect relief from bone pain (He et al. 2009). But the clinical outcome of HCC patients with BM is still very poor. Only by screening HCC patients at high risk for developing BM, which depends on an effective prediction model, we can carry out more effective individualized therapy that will prevent its development.

In the present study, we have developed a simple model composed of clinicopathological factors to predict the future risk of BM in HCC patients who have undergone curative resection. The predictive model we propose incorporates the tumor properties of vascular invasion and TNM stage, and the expression levels of CXCR4, CTGF, and IL-11 proteins. Of these properties, vascular invasion and TNM stage are known prognostic factors for HCC (Zhu et al. 2008); CXCR4 is known to significantly decrease BM-free survival in vivo, IL-11 to stimulate osteoclasts, and CTGF to enhance BM in vivo (Horak and Steeg 2005; Kang et al. 2003; Xiang et al. 2011a). We have previously reported these factors to be associated with BM in HCC. In this study, we show that they are independent prognostic factors for BM in HCC. The proposed model provides a more refined and systematic stratification of BM risk in HCC, and it has potential clinical implications. The sensitivity and specificity of this model in predicting BM were approximately 70% and 80%, respectively, both in the training and in validation cohorts used in the study. Both cohorts had approximately 25% patients at high risk. Using a score of 9.4 as a cutoff point, the method based on our model may be used to accurately screen HCC patients at high risk of BM. In the validation cohort, the hazard ratio for developing BM of the high-versus low-risk group was 9.240. To the best of our knowledge, this study is the first report of such a prediction model in this disease context. Using this method, HCC patients at high risk of BM may be identified at the time of surgery for additional therapy, for example, by treatment with bisphosphonates, which are known to inhibit BM (Fournier et al. 2010).

The proposed model has some unique features. First, the factors upon which it is based are easily measured in clinical pathology laboratories. Second, it was formulated and then validated in independent cohorts, and the method based on it shows high accuracy in predicting BM. Finally, this model has the potential to change the way BM is treated during HCC therapy, and it may provide a useful prediction tool for clinicians. The traditional treatment for BM is palliation radiotherapy when symptoms appear; our model adds an earlier screening procedure to identify and treat high-risk patients to prevent the development of BM. Once HCC patients categorized in the high-risk group by this model, more frequent screening of the BM may be required, it help to find BM early. Administration of bisphosphonates may help to decrease the frequency of BM in high-risk group. On the other hand, bone scan may not be necessary for those in the low-risk group after 3 years.

There are some limitations in the present study. As it was a retrospective cohort study, and because of the limited number of patients involved, the results need to be further validated in a prospective study. It was reported that the detection of BM of HCC can be enhanced by PET/CT (Ho et al. 2011), so preoperative PET/CT may be a useful tool to detect BM of HCC.