Introduction

Lung cancer is the most common cause of cancer-related deaths worldwide [1]. Non-small cell lung cancer (NSCLC) is a frequent type of lung cancer, accounting for over 80 % of all lung cancer patients [2]. In spite of improved surgical techniques, advanced chemotherapeutic regimens, and novel targeted therapy, the clinical outcome of NSCLC patients remains unsatisfactory. The 5-year survival rate of patients with NSCLC is only 15 % [3]. Moreover, most patients with NSCLC are diagnosed at advanced stages because of the difficulty in making an early diagnosis. Thus, it is necessary to identify biomarkers which provide early diagnosis, accurate prognosis prediction.

The high-mobility group A (HMGA) proteins contain three AT hook DNA-binding motifs that mediate binding to the AT-rich regions in the minor groove of DNA. Two types of HMGA proteins, HMGA1 and HMGA2, with similar functions are encoded by two different genes at chromosomal loci 6p21.3 and 12q15, respectively [4]. HMGA plays an important role in several biological processes such as regulation of transcription, differentiation, and neoplastic transformation [5].

HMGA1 oncogene is widely expressed during embryonic development and is low or absent in most of the differentiated tissues in adults [6]. Recent study suggested that HMGA1 was induced by the Wnt/β-catenin pathway and maintains proliferation of gastric cancer cells [7]. Meanwhile, decreased expression of HMGA1 suppressed anchorage-independent cell proliferation, cellular motility, migration, and invasion in several cancer cell lines [811]. The high level of HMGA protein has been found in several malignant neoplasias, including colon cancer [12], breast cancer [13], prostatic cancer [14], pancreatic cancer [15], and ovarian cancer [16]. Furthermore, HMGA1 overexpression was associated with a highly malignant phenotype, also representing a poor prognostic factor since HMGA overexpression often correlated with the presence of metastasis, and with a short survival [17, 18]. Although the role of HMGA1 has been studied in many types of cancer, but little is known about the significance of HMGA1 in patients with NSCLC.

The aim of this study was to identify the pathological roles of HMGA1 in lung cancer. This study suggested that the messenger RNA (mRNA) and protein level of HMGA1 were increased in lung cancer tissue and positively associated with differentiated degree, clinical stage, and T/N/M classification. Furthermore, we found that overexpression of HMGA1 was a significant predictor of poor prognosis for NSCLC patients.

Materials and methods

Analysis of microarray data

Microarray data set (Gene Expression Omnibus (GEO) accession number: GSE19804) from 60 pairs of tumor and adjacent normal lung tissue specimens submitted by Lu was retrieved from the GEO database. Those differentially expressed genes were screened and identified by real-time PCR for the following study.

Samples collection

One hundred forty-five paraffin-embedded lung cancer tissues and 29 adjacent normal lung tissue samples were retrieved from the Affiliated Hospital of Jilin University and Occupational Disease Control and Prevention Hospital. Twenty pairs of fresh tumor and adjacent normal lung tissue samples were collected from the Affiliated Hospital of Jilin University and Occupational Disease Control and Prevention Hospital. All fresh samples were immediately preserved in liquid nitrogen. No patients had received any form of tumor-specific therapy before diagnosis. The histopathological diagnosis of all samples was respectively diagnosed by two pathologists. In these 145 NSCLC cases, there were 89 males and 56 females with age ranging from 23 to 76 years. The clinical follow-up time of patients ranged from 6 to 96 months. The clinical processes were approved by the Ethics Committees of the Affiliated Hospital of Jilin University and Occupational Disease Control and Prevention Hospital. Informed consent was obtained from all patients. The clinical stage and system treatment of all specimens were confirmed according to the seventh edition of the AJCC TNM system and NCCN guideline, respectively.

Real-time PCR

To investigate the mRNA level of HMGA1, total RNA of fresh tissues was reversely transcribed using PrimeScript® RT reagent Kit (TaKaRa). Quantitative real-time reverse transcription-PCR (qRT-PCR) was performed with SYBR® Premix Ex Taq™ II (TaKaRa) on a LightCycler (Roche), following the manufacturer’s instructions. The sequence-specific forward and reverse primers sequences for HMGA1 mRNA were 5′-TCCAGGAAGGAAACCAAGG-3′ and 5′-AGGACTCCTGCGAGATGC-3′, respectively. Forward and reverse primers sequences for GAPDH mRNA were 5′-GAGTCCACTGGCGTCTTC-3′ and 5′-GATGATCTTGAGGCTGTTGTC-3′, respectively. Relative quantification of mRNA expression was calculated by using the 2-△△Ct method. The raw data were presented as the relative quantity of HMGA1 mRNA, normalized with GAPDH, and relative to a calibrator sample. All qRT-PCR reactions were performed in triplicate.

Immunohistochemistry

To detect HMGA1 protein, immunohistochemistry experiments were performed. Paraffin-embedded sections were deparaffinized in xylene and rehydrated in a descending ethanol series (100, 90, 80, and 70 % ethanol) and double-distilled water according to standard protocols. High-pressure antigenic retrieval was performed in citrate buffer (pH 6.0) and boiled for 2 min. After antigen retrieval, sections were treated with 3 % hydrogen peroxide and 1 % bovine serum albumin to block the endogenous peroxidase activity and non-specific binding. The sections were incubated with HMGA1 antibody (Cell Signaling, #12094, dilution 1:250) overnight at 4 °C. The sections were washed three times and incubated with the biotinylated secondary antibody and streptavidin horseradish peroxidase complex for 25 min at room temperature, respectively. Sections were reacted with diaminobenzidine for 2 min, rinsed with tap water, and then counterstained with hematoxylin. In the end, sections were viewed under a light microscopy.

Evaluation of staining

Immunostaining results were reviewed and scored independently by two pathologists without knowledge of the clinical parameters. Any intensity of nuclear immunoreactivity was considered to present immunopositivity for HMGA1. A total of 1000 cells were counted at several high-power fields (×400) selected from different reactivity density regions including areas of high, moderate, low, and negative reactivity. The percentage of HMGA1-stained tumor cells was scored on a scale of 0–4 (0, no staining; 1, ≤5 %; 2, ≤30 %; 3, ≤50 %; 4, >50 %) [19]. For statistical analysis, a final staining scores of 0–2 and 3–4 were respectively considered to be low and high expression.

Statistical analysis

All data were analyzed by using SPSS 13.0 software. The unpaired T test was applied to test the differential mRNA expression of HMGA1 in lung cancer tissue compared to adjacent normal lung tissue. The chi-square test was used to analyze the correlation between HMGA1 expression and the clinicopathologic parameters of NSCLC patients. The Kaplan–Meier method and the log-rank test were used to the correlations between HMGA1 expression and the overall survival time of patients. The significance of survival variables was analyzed using the Cox multivariate proportional hazards model. Data was presented as mean ± SD. P values less than 0.05 were considered statistically significant.

Results

HMGA1 is highly expressed in NSCLC

From our microarray data, HMGA1 was highly expressed in NSCLC tissues compared with adjacent normal lung tissues with an average of 1.87 folds (P < 0.001, Fig. 1). Using real-time PCR to measure the expression of HMGA1 transcripts, we found that the HMGA1 expression level was significantly increased with an average increase of 2.49-fold in NSCLC tissue in comparison to adjacent normal lung tissue (P < 0.001, Fig. 2). Furthermore, we measured subcellular localization and the expression levels of HMGA1 protein in 145 paraffin-embedded NSCLC samples and 29 paraffin-embedded normal lung tissues using immunohistochemical staining (Fig. 3). Specific HMGA1 protein staining was found in the nucleus. The expression of HMGA1 was significantly elevated in NSCLC compared with normal lung tissues (P = 0.010, Table. 1).

Fig. 1
figure 1

Increased HMGA1 expression was shown in NSCLC by microarray data analysis of GSE19804 data set retrieved from the GEO database

Fig. 2
figure 2

Expression of HMGA1 mRNA is increased in NSCLC tissues compared with adjacent normal lung tissues by real-time PCR

Fig. 3
figure 3

Immunohistochemical staining of HMGA1 in NSCLC tissues. a Negative expression of HMGA1 in normal lung tissues (original magnification ×400). b Negative expression of HMGA1 in lung cancer (original magnification ×400). c, d Weak expression of HMGA1 in lung cancer (original magnification ×400). e, f Strong expression of HMGA1 in lung cancer (original magnification ×400)

Table 1 Expression of HMGA1 protein between lung cancer and normal tissues

Relationship between clinicopathological characteristics and expression of HMGA1 in NSCLC patients

The relationship between clinicopathological characteristics and HMGA1 expression levels in patients with NSCLC were summarized in Table 2. We did not find any significant association of HMGA1 expression levels with patient’s gender (P = 0.227), age (P = 0.799), smoking (P = 0.077), and pathology classification (P = 0.363). However, HMGA1 was positively associated with clinical stage (I–II vs. III–IV, P < 0.001), T classification (T1–T vs. T3–T4, P = 0.003), N classification (N0–N1 vs. N2–N3, P < 0.001), M classification (M0 vs. M1, P = 0.002), and differentiated degree (high or middle vs. low or undifferentiated, P = 0.003) in NSCLC.

Table 2 Correlation between the clinicopathologic characteristics and expression of HMGA1 protein in NSCLC

Survival analysis

To explore the prognostic role of HMGA1 expression for NSCLC, we analyzed the association between HMGA1 expression and patient survival using Kaplan–Meier analysis with the log-rank test. In 145 NSCLC cases with prognosis information, we found that the nuclear expression level of HMGA1 protein was significantly correlated with the overall survival of NSCLC patients. Patients with higher levels of HMGA1 expression had shorter survival rates than those with lower HMGA1 expression levels (P < 0.001, Fig. 4). Univariate analysis indicated that HMGA1 expression level, clinical stages, and T/N/M classifications were significantly correlated with patients’ survival (all P < 0.001, Table 3). To determine whether HMGA1 is an independent prognostic factor for NSCLC, we performed multivariate analyses using the Cox proportional hazards model. The results indicated that the level of high expression of HMGA1 was not an independent prognostic factor for NSCLC (P = 0.016, Table 3).

Fig. 4
figure 4

Increased HMGA1 protein expression predicts an unfavorable prognosis. The association between patient survival and HMGA1 expression was estimated using the Kaplan–Meier method and the log-rank test (P < 0.001)

Table 3 Summary of univariate and multivariate Cox regression analyses of overall survival duration

Discussion

HMGA1, which is a member of HMGA family, has been implicated in many biological processes [20]. HMGA1 proteins modulate gene expression by altering chromatin structure and orchestrating the assembly of transcription factor complexes to enhanceosomes within enhancer or promoter regions throughout the genome [20]. Recent studies indicated that the HMGA1 as a key factor enriched in undifferentiated tumors and embryonic stem cells [21], and is low or absent in most of the differentiated tissues [22], which suggested that HMGA1 plays an important role in cell proliferation and differentiation. In addition, present studies demonstrated that HMGA1 is also involved in processes of tumor invasion and metastasis [18, 23, 24]. In pancreatic cancer, HMGA1 promotes cell invasive and metastatic potential through PI3K/Akt/MMP-9 [25]. Furthermore, HMGA1 activity is not restricted to the regulation of cell proliferation and mobility and is necessary for several processes involved in induction of epithelial–mesenchymal transition and pluripotent stem cells [2628].

HMGA1 is highly expressed in several types of human cancer, such as colon cancer [12], breast cancer [13], prostatic cancer [14], pancreatic cancer [15], and ovarian cancer [16], but little is known about HMGA1 in NSCLC patients. In a microarray analysis performed by Lu et al. (GSE19804), we found that HMGA1 level was higher in 60 NSCLC samples than in adjacent paired normal lung samples. Then, we present the evidence that mRNA and protein expressions of HMGA1 were increased in NSCLC through real-time PCR and immunohistochemistry, which were similar to the microarray data.

In order to further identify the role of HMGA1 in the development and progression of lung cancer. We analyzed the expression of HMGA1 in 145 NSCLC patients and found HMGA1 overexpression was significantly associated with clinical stage, T classification (tumor size), N classification (lymph node metastasis), M classification (distant metastasis), and differentiated degree. Overexpressed HMGA1 in NSCLC may accelerate tumor growth and enhance local cell invasion and metastasis. Our results may indicate that HMGA1 plays significant roles in NSCLC progression, including tumor proliferation, invasion, and metastasis. Similar to Wang et al.’s report, HMGA1 overexpression positively associated with clinical stage, lymph node metastasis, and histological grade in patients with laryngeal squamous cell carcinoma [29]. In human pituitary adenomas, HMGA1 expression also correlated with tumor size, tumor invasion, and histological grade [19]. Moreover, Takaha et al.’s study in vitro showed that HMGA1 knockdown markedly inhibited colony formation, significantly induced apoptosis, inhibited invasion potential, and induced anoikis in renal cell carcinoma [30]. These studies consistently suggest that overexpressed HMGA1 may play an unfavorable role in NSCLC pathogenesis. However, the correlation between HMGA1 expression and the survival of NSCLC patients has been seldom reported.

In the past few years, HMGA1 overexpression in tumor cells has been shown to be an independent poor prognostic factor in several types of tumors, such as lung cancer [31], colorectal cancer [17], liver cancer [18], pancreatic adenocarcinoma [3234], and uveal melanomas [35]. In this study, we presented that HMGA1 expression in NSCLC was inversely correlated with patient’s overall survival in protein level. The patients with higher expression of HMGA1 protein had shorter survival time. According to multivariate analyses, overexpression of HMGA1 protein was a significant predictor of poor prognosis for NSCLC patients. These results were consistent with Sarhadi et al.’s report, which suggested that increased nuclear expression of HMGA1 correlated with poor survival of NSCLC patients [31]. Similarly, Takahashi et al. demonstrated that the expression of HMGA1 was significantly correlated with clinical stage and lymph node metastasis and as an independent biomarker for poor prognosis of colorectal cancer patients [17]. In pancreatic cancer, Liau et al. reported HMGA1 expression was predictive of poor patient survival and was an independent prognostic indicator through multivariate analysis [34].

In conclusion, this study indicated that the expression level of HMGA1 was significantly increased in NSCLC and correlated with the malignant status of NSCLC. Because of the limited sample size of patients in our study, further studies would be needed to verify the role of HMGA1 as a convictive clinical predictor for the survival of NSCLC patients.