Introduction

SOX4 is a major member of the SOX family [1]. SOX4 gene is located on 6p22.3 and encodes a protein of 474 amino acids with three distinguishable domains: an HMG box, a glycine-rich region, and a serine-rich region [2]. The HMG box acts as a DNA-binding region, whereas the serine-rich domain acts as a transactivation domain. The central domain containing the glycine-rich region located between the HMG box and serine-rich domains acts as a novel functional region for promoting apoptotic cell death [3]. SOX4 involves in many developmental processes such as the T cell differentiation and the development of thymocyte, nervous system, and embryonic cardiovascular system [46].

In recent years, the clinical significance of SOX4 has gained more and more attention, with many reports suggesting that SOX4 may involve in tumor development and progression. Present studies have suggested that SOX4 as one of the most frequently overexpressed protein in several types of human cancers mediated regulation of tumorigenesis and tumor progression such as hepatocellular carcinoma [7, 8], prostate cancer [9, 10], breast cancer [11, 12], colorectal cancer [13, 14], and bladder cancer [15, 16]. Moreover, SOX4 overexpression in tumor cells has been shown to be an independent prognostic factor in several types of tumors, which has a favorable or unfavorable prognostic significance according to tumor types [8, 1426]. Some studies indicated that positive SOX4 expression was significantly associated with a poor prognosis in colorectal cancer [14, 17], gastric cancer [18, 21], acute lymphoblastic leukemia [20], and prostate cancer [24], while others studies suggested patients with higher levels of SOX4 expression had better survival than those with lower levels of SOX4 expression in bladder carcinoma [15, 23], melanoma [19], and hepatocellular carcinoma [8].

In order to identify the pathological roles of SOX4 in human cancers, we evaluated the status of SOX4 expression in twenty kinds of human cancers through a protein database and performed a meta-analysis aiming to evaluate the relationship between SOX4 expression and prognosis in patients with cancers.

Materials and methods

Analysis of protein database

The data of SOX4 expression in twenty kinds of human cancers were retrieved from the protein database (The Human Protein Atlas, www.proteinatlas.org). The status of SOX4 expression was evaluated in twenty kinds of human cancer samples including breast cancer, carcinoid, cervical cancer, colorectal cancer, endometrial cancer, glioma, head and neck cancer, liver cancer, lung cancer, lymphoma, melanoma, ovarian cancer, pancreatic cancer, prostate cancer, renal cancer, skin cancer, stomach cancer, testis cancer, thyroid cancer, and urothelial cancer.

Inclusion and exclusion criteria

Studies eligible for inclusion in this meta-analysis met the following criteria: (1) Expression of SOX4 was evaluated in all cancers by cDNA microarray, tissue microarray, or immunohistochemistry; (2) hazard ratios (HRs) for overall survival related to SOX4 expression were provided or were extractable from the published data; (3) all patients diagnosed with malignant tumors or cancer must be confirmed by histopathologic examinations; and (4) there was sufficient information about study population, origin of country, and cancer type. The following criteria were used to exclude published studies: (1) Any studies that did not meet all inclusion criteria and (2) studies of case reports, letters, and reviews without original data. When a study reporting the same patient cohort was included in several publications, the most complete study was selected.

Search strategy

The MEDLINE, EMBASE, SCOPUS, WEB OF SCIENCE and Cochrane Library databases were systematically searched till October 2014. Publications with the following search words in the title, abstract, or keywords were included: SOX4, SRY (sex-determining region Y)-box 4, Carcinoma, Cancer, Tumor and Prognosis. Keywords were combined and Boolean operators (and, or, not) were used in the search strategies. The studies identified through the search were independently screened by two authors (JC and HJ) for inclusion. Any disagreements were arbitrated by a third author (BL). We did not limit our search by country, race, or date.

Data extraction

Data were independently extracted by two authors (JC and XY) using the same standardized table. Data were extracted from the included studies, and the major information included the following: first name, publication year, country, cancer type, total cases, survival analysis method (multivariate or univariate), end point results, and hazard ratio with 95 % confidence interval (CI). For articles with the same population resources or overlapping datasets, data were extracted and reported as a single trial. Overall survival in relation to survivin expression was estimated by the HR. If authors reported HR and 95 % CI, these data were extracted from the included articles. Otherwise, HR and 95 % CI were calculated as described previously [27, 28]. The number of patients at risk in each group, number of events, and the log-rank statistic or its p value were used to calculate the HR estimate and its variance. If the study did not provide a HR but reported the survival curve, we need to obtain data from survival curves. Survival curve could be read by Engauge Digitizer.

Statistical analysis

The meta-analysis was performed through using Cochrane Collaboration Review Manager 5.1 software. The details of statistical analysis were shown as described previously [29]. Heterogeneity of hazard ratio was assessed by use of the Chi-square and I 2 tests. When heterogeneity was significant (I 2 > 50 %and p < 0.05 for χ2), we used a random effects model with the DerSimonian and Laird method for the meta-analysis. Otherwise, we used a fixed effects model with the inverse variance method.

Overall survival (OS) of gastric cancer and recurrence-free survival (RFS) of colorectal cancer were determined in subgroup analyses. The sensitivity was analyzed by changing the effect model to estimate confidence. Funnel plot was executed for evaluating the potential publication bias. An asymmetric plot indicates there was potential publication bias; otherwise, the plot should be shaped like a funnel.

Results

SOX4 expression is positive in most types of human cancer tissues

From protein database, SOX4 expression was significantly positive in most types of human cancer tissues such as breast cancer, carcinoid, cervical cancer, colorectal cancer, endometrial cancer, glioma, head and neck cancer, liver cancer, lung cancer, lymphoma, melanoma, ovarian cancer, pancreatic cancer, prostate cancer, renal cancer, skin cancer, stomach cancer, testis cancer, thyroid cancer, and urothelial cancer (Fig. 1A; SOX4 expression in normal tissues was showed in http://www.proteinatlas.org/ENSG00000124766-SOX4/tissue). The high and medium staining rate was about 78 % in overall cancer tissues (Fig. 1B). Specific SOX4 protein staining was found in nucleus at twenty kinds of human cancers (Fig. 1C).

Fig. 1
figure 1

Status of SOX4 in human cancer tissues through protein database (The Human Protein Atlas). A The expression of SOX4 in twenty kinds of human cancers. B Overall cancer tissues staining statistics of SOX4 expression. C Immunohistochemical staining of SOX4 expression in twenty kinds of human cancers: a breast cancer, b carcinoid, c cervical cancer, d colorectal cancer, e endometrial cancer, f glioma, g head and neck cancer, h liver cancer, i lung cancer, j lymphoma, k melanoma, l ovarian cancer, m pancreatic cancer, n prostate cancer, o renal cancer, p skin cancer, q stomach cancer, r testis cancer, s thyroid cancer, and t urothelial cancer

Eligible studies

Electronic database search identified a total of 684 articles by using our search criteria (Fig. 2). After carefully reading the title and abstract, 668 articles were excluded, because they did not present any data about the association of SOX4 expression with patients’ outcome. After reviewing the full text of the remaining 15 studies, we ultimately included 10 studies in the final analysis. Five studies were excluded from the final review because of insufficient survival data (n = 5) [8, 15, 16, 25, 26].

Fig. 2
figure 2

Flow diagram of the identification and selection of studies

Study characteristics

A total of 1348 cases from the 10 studies that had pathological results and clinical data were included in this meta-analysis. Publication year of selected studies ranged from 2006 to 2014. These cases of 11 studies were from different populations (six studies in China and other four studies in Canada, USA, Denmark, and France, respectively) and eight kinds of cancers (lung cancer, gastric cancer, colorectal cancer, pancreatic cancer, cutaneous melanoma, acute lymphoblastic leukemia, gallbladder carcinoma, and prostate cancer). SOX4 expression was evaluated by immunohistochemistry, cDNA microarray, or tissue microarray. The positive expression rate of SOX4 varied from 32.8 to 80.0 %. Overall, five studies calculated HR and 95 % CI by multivariate analysis; the other five studies used univariate analysis. Table 1 shows the main characteristics of all of the studies.

Table 1 Characteristics of the eligible studies

Meta-analysis

There was obvious heterogeneity among those eight studies (I 2 = 82.0 %). Thus, the random effects model was used to calculate the pooled HR with corresponding 95 % CI. Meta-analysis of those eight studies showed that SOX4 expression was obviously associated with poor OS outcome in various cancers, with the pooled HR of 1.67 (95 % CI 1.01–2.78) (Fig. 3).

Fig. 3
figure 3

Forest plot for the correlation between SOX4 expression and overall survival of patients with human cancers

Subgroup analysis was performed to explore the between-study heterogeneity. No significant heterogeneity was observed among the studies on colorectal cancer for RFS (I 2 = 0.0 %, P = 0.93) and on gastric cancer for OS (I2 = 75.0 %, P = 0.05). The random effects model was used to calculate the pooled HR (95 % CI) in subgroups of colorectal cancer and gastric cancer. The result indicated that pooled HRs (95 % CI) were 1.73 (95 % CI 1.04–2.88) for colorectal cancer patients’ RFS and 3.74 (95 % CI 1.04–13.45) for gastric cancer patients’ OS (Fig. 4).

Fig. 4
figure 4

Forest plot of subgroup analysis showed the correlation of SOX4 expression with recurrence-free survival in colorectal cancer and overall survival in gastric cancer

Sensitivity analysis suggested that changing the effect model had little effect on pooled HR and did not change the strength of the association between SOX4 expression and OS for patients with cancers. The HRs (95 % CI) were 1.75 (95 % CI 1.42–2.15) for changing the effect model. Publication bias of this meta-analysis was evaluated by funnel plot. No evidence of asymmetry was observed in the funnel plots (Fig. 5).

Fig. 5
figure 5

Funnel plot for identifying publication bias in the correlation between SOX4 expression and overall survival of patients with human cancers

Discussion

SOX4, which is a member of the SOX family, has been implicated in multiple biological processes for the T cell differentiation and the development of thymocyte, nervous system, and embryonic cardiovascular system [46]. In recent years, more and more evidences have been presented suggesting that SOX plays an important role in tumor development and progression through involving in cell proliferation, migration, invasion, apoptosis, and epithelial-to-mesenchymal transition [3].

In our study, we found that SOX4 expression was significantly positive in twenty kinds of human cancer tissues including breast cancer, carcinoid, cervical cancer, colorectal cancer, endometrial cancer, glioma, head and neck cancer, liver cancer, lung cancer, lymphoma, melanoma, ovarian cancer, pancreatic cancer, prostate cancer, renal cancer, skin cancer, stomach cancer, testis cancer, thyroid cancer, and urothelial cancer through protein database, and the total positive rate was about 78 % in overall cancer tissues. Our result is consistent with reviews about the relationship between SOX4 and cancer performed by Jafarnejad et al. [2] and Vervoort et al. [3]. These studies consistently imply that overexpressed SOX4 may serve as unfavorable role in human cancer pathogenesis. Furthermore, the experiment in vitro showed that downregulated SOX4 expression decreased cell viability and induced apoptosis, and promoted malignant phenotype of adenoid cystic carcinoma cells [30]. Vervoort et al. [31] reported that TGFB1 induced the mesenchymal phenotype of mammary epithelial cells during epithelial-to-mesenchymal transition depending on upregulation of SOX4 expression. Similar to Zhang et al. [11] study in breast cancer, they found that SOX4 expression induced epithelial-to-mesenchymal transition and accelerated metastasis in vitro and in vivo.

In the past 10 years, SOX4 overexpression has been suggested to be an independent predictor for OS in various human cancers, which has a favorable or unfavorable prognostic significance according to the type of cancer. In primary gallbladder carcinoma, SOX4 overexpression was significantly correlated with better OS and disease-free survival, and was an independent favorable prognostic factor for OS and disease-free survival through multivariate analyses [23]. Moreover, Jafarnejad et al. reported that the SOX4 expression in metastatic melanoma was remarkably decreased in comparison with primary melanoma, and increased SOX4 expression associated with a better disease-specific survival for melanoma patients and was an independent favorable prognostic factor [19]. Conversely, there was more evidence suggesting that overexpression of SOX4 was unfavorable prognosis factor in gastric cancer [18, 21], acute lymphoblastic leukemia [20], colorectal cancer [14, 17], etc. In gastric cancer patients, higher level of SOX4 expression was markedly associated with distant metastasis, low differentiation, and unfavorable prognosis [21]. Similarly, Ramezani-Rad et al. [20] study showed that acute lymphoblastic leukemia (ALL) patients at the time of diagnosis with SOX4 overexpression in ALL cells had a poor OS outcome, and SOX4 was identified as a key regulator of MAPK and PI3K/AKT signaling in ALL cells. Meanwhile, SOX4 overexpression also has been shown to be an unfavorable prognostic factor in prostate cancer [24].

In order to evaluate the prognostic value of SOX4 in human cancer, we are to carry out a meta-analysis to analyze the association between SOX4 and human cancer. In the meta-analysis of 10 studies containing 1348 cases, we found that SOX4 expression was obviously associated with unfavorable OS in varied cancers, with the pooled HR of 1.67 (95 % CI 1.01–2.78). Furthermore, the subgroup analysis was performed to identify the heterogeneity of tumor types. The subgroup analysis showed that there was no significant subgroup difference between gastric cancer and colorectal cancer (both I 2 = 17.2 %). The SOX4 overexpression was significantly correlated with unfavorable OS in gastric cancer (HR 3.74, 95 %CI 1.04–13.45) and unfavorable RFS in colorectal cancer (HR 1.73, 95 %CI 1.04–2.88). Among those 10 studies, SOX4 overexpression served as a favorable prognostic factor only in gallbladder carcinoma (HR 1.73, 95 %CI 0.10–0.91) [23]. A similar report which could not be included in this meta-analysis because of insufficient data shows that SOX4 overexpression had a favorable OS outcome for bladder carcinoma patients through tissue microarray analysis [15]. The discrepancy of SOX4 expression in different types of human cancers may be attributed to tumor heterogeneity. We thought that SOX4 overexpression was a credible favorable prognostic factor for bladder carcinoma patients through systematically reviewing two relevant studies.

The meta-analysis may have some potential limitations. On the one hand, although 1348 cases of 10 studies were included in this meta-analysis, well-designed and large clinical studies for each cancer were insufficient. On the other hand, there was statistical heterogeneity in this meta-analysis which may be attributed to the differences in evaluation criterions of SOX4 expression, population, disease stages, and tumor types.

In summary, SOX4 overexpression was significantly correlated with unfavorable OS in most types of cancers except bladder carcinoma. SOX4 may be considered to serve as a credible prognostic factor in human cancers. In the future, more studies are necessary to verify and strengthen the role of SOX4 in human cancers.