Introduction

Lung cancer (LC), as one of the most common cancers, is the leading cause of cancer-related mortality in the world, and approximately 85% of lung cancers are non–small cell lung cancers (NSCLC), which is much more aggressive than other types (Bray et al. 2018; Ettinger et al. 2010; McAleese et al. 2019). In spite of a declining global incidence (Chen et al. 2016), lung cancer remains poor prognosis with its 5-year overall survival rate no higher than 15%. Therefore, to fully evaluate and identify its prognostic factors become crucially important to gain a better understanding of this cancer.

Transcriptional factor plays a critical role in cellular process, biological behaviors and functions, which has an effect on the prognosis of patients. TFAP2 (AP-2), a family of basic helix-span-helix, have been identified to participate in vertebrate evolution (Eckert et al. 2005).The family consists of five paralogs in mammals: TFAP2A to TFAP2E, encoding the proteins AP-2α to AP-2ε, respectively. AP-2α knockout animals damage neural-crest-derived facial structures while AP-2β is essential for kidney development (Eckert et al. 2005). In addition to their effects on development, AP-2 family have also been studied in cancer. Previous studies showed that KRT16 was found to be an oncogene induced by TFAP2A in lung adenocarcinoma (Yuanhua et al. 2019) and TFAP2C plays a role in tumorigenesis and aggressiveness in lung cancer (Kang et al. 2017). Yet, little is known regarding their dysregulated expression level and prognostic values in clinic.

An increasingly large number of bio-information involved in lung cancer has been accessible from the public database so far, including genomics and transcriptomes as well as the corresponding clinical information. Therefore, the present study is conducted to evaluate the prognostic values of AP-2 in patients with lung cancer through bioinformatics methods which may serve as a reference to further confirm their important roles in lung cancer.

Materials and methods

Oncomine database

The gene expression levels of TFAP2 in different types of cancer were evaluated through Oncomine (https://www.oncomine.org/) which is a cancer microarray platform from genome-wide expression (Rhodes et al. 2004, 2007). To compare transcriptional level between clinical neoplastic and normal tissues, the cutoff of p value and fold change were defined as 1E−4 and 2 respectively by using the Student’s t test.

GEPIA analysis

Gene Expression Profiling Interactive Analysis (GEPIA) (https://gepia.cancer-pku.cn), as an online tool for in-depth analysis of gene functions based on the TCGA and the GTEx datasets featuring 198,619 isoforms and 84 cancer subtypes, can perform multiple interactive functions such as differential expression analysis, correlation analysis, patient survival analysis as well as similar gene detection analysis (Tang et al. 2017, 2019). We compared the mRNA level as well as the clinical tumor stage of TFAP2 factors between lung carcinoma tissue and normal tissue.

Kaplan‐Meier plotter

Kaplan–Meier Plotter (www.kmplot.com) could implement survival analysis of different cancer subtypes and is supervised and extended regularly (Nagy et al. 2018). We assessed TFAP2 mRNA for their prognostic power, and the risk ratios with 95% confidence intervals and logarithmic rank p values were calculated. Median expression was used as a cut-off point for dividing patients into high and low expression groups as mentioned previously (Mihaly et al. 2013). The JetSet best probe set of each TFAP2 was selected in order to carry out the overall survival analysis, and the numbers at risk are shown below the main plot.

LinkedOmics portal

The LinkedOmics database (https://www.linkedomics.orglogin.php) can access, analyze and compare cancer multi-omics data within and across all 32 TCGA cancer types (Tomczak et al. 2015). LinkedOmics three analytical modules could be applied to explore attributes that are associated with a query attribute, perform enrichment analysis and compare integrated association results (Vasaikar et al. 2018). The association of TFAP2 mRNA expression with clinical features and overall survival in this study was accessed by the LinkedOmics portal.

c-BioPortal databases

The cBioPortal for Cancer Genomics is a web-based interactive tool for analysis of RNA sequencing expression data of cancer and normal samples from TCGA (Gao et al. 2013). In the present study, c-BioPortal was applied to evaluate lung adenocarcinoma (TCGA, Provisional) dataset including data from 586 samples with clinicopathologic reports for interactive analysis of TFAP2 factors. The targeted genes include mutation information from GISTIC and mRNA expression Z-scores (RNASeq V2 RSEM). TFAP2 genes were accessed by Z-score ± 2.0. Co-expression and network were also performed.

Results

The comparison of TFAP2 expression levels between lung cancer and normal tissue

From the mRNA level, the expression of TFAP2 family (including TFAP2A, B, C) in 18 different types of cancer were compared with adjacent normal tissues as shown in Fig. 1a. An obvious up-regulation of TFAP2A mRNA level in patients with LC was found in eleven datasets while TFAP2C mRNA expression was moderately upregulated in five datasets. Particularly, Bhattacharjee’s dataset demonstrated that TFAP2A in squamous cell lung carcinoma was 98 times higher than the paired normal tissue as shown in Table 1. In other databases, TFAP2A mRNA expressions were 3–15 times more expressed in squamous cell lung carcinoma tissues than in normal lung tissues. In the dataset of Garber, Hou, Su and Selamat, 3 times higher expressions of TFAP2A mRNA were found in lung adenocarcinoma tissues compared with that in normal tissues. Meanwhile, expressions of TFAP2C mRNA were, on average, approximately twice as common in lung adenocarcinoma tissues as in normal lung tissues as shown in three datasets. And Bhattacharjee’s database reported a ten-fold increasing in squamous cell lung carcinoma compared with normal lung tissue.

Fig. 1
figure 1

The comparison of TFAP2 expression levels between lung cancer and normal tissue. a The transcriptonal levels of TFAP2 in different types of cancers (ONCOMINE). b The elevated expressions of TFAP2A and TFAP2C in lung carcinoma compared with that in normal tissue (GEPIA). c Immunohistochemistry staining for TFAP2 in normal and tumor lung tissue from a Human Protein Atlas Portal

Table 1 The changes of TFAP2 transcription factors expression in transcription level between Lung cancer and normal lung tissues (Oncomine Database)

Furthermore, we verified the mRNA expressions of TFAP2 between LC and normal lung tissues via GEPIA database. We found that TFAP2A and TFAP2C in LC were both higher than that in normal lung tissues for adenocarcinoma and squamous cell carcinoma, but a larger difference was found in lung squamous cell carcinoma. However, no difference was found in the TFAP2B mRNA expression (Fig. 1b).

Additionally, we identified the expression of TFAP2 at the protein level using Human Protein Atlas database. Immunohistochemistry staining showed a higher expression in cytoplasmic and membranous parts of lung cancer cells for TFAP2A, while no marked difference was found for TFAP2C between lung cancer and normal lung samples. Interestingly, TFAP2B was found to be higher expressed in LC compared with normal lung tissue (Fig. 1c).

Correlation between TFAP2 and clinicopathological features in LC patients

Next, we evaluated the relationship of TFAP2 family with other clinicopathological features. All the TFAP2 family members including TFAP2A (p = 0.451), TFAP2B (p = 0.140) and TFAP2C (p = 0.050) in Fig. 2a were found to be not correlated to the age in lung cancer patients. No significant difference was found between histology and three TFAP2A (p = 0.322) and TFAP2C (p = 0.477) in Fig. 2b, but we found that mucinous subtype of lung carcinoma was more likely to be enriched with higher expression of TFAP2B (p = 0.028). Meanwhile, no significant difference was found between TAFAP2 family members and stage TFAP2A (p = 0.223), TFAP2B (p = 0.479) and TFAP2C (p = 0.197) in Fig. 2c.

Fig. 2
figure 2

Relationship between the mRNA levels of TFAP2 and the clinicopathological parameters of patients with LC. a Spearman correlation of age (LinkedOmics). b Kruskal–Wallis Test of histological type (LinkedOmics). c One-way ANOVA analysis of pathological stage (GEPIA)

The prognostic significances of TFAP2 in patients with LC

We further evaluated the prognostic significances of TFAP2 in LC patients using Kaplan–Meier Plotter. As illustrated in Fig. 3a, the results showed that only TFAP2A was significantly associated with patient prognosis. In particular, a higher level of TFAP2A mRNA expression predicted a poor overall survival (OS) in lung cancer. However, there were no statistical differences for TFAP2B and TFAP2C in Kaplan–Meier analysis (Fig. 3b, c).

Fig. 3
figure 3

Kaplan–Meier survival curves for overall survival in patients with LC (Kaplan–Meier plotter). a The high expression of TFAP2A correlates with poor prognosis. b, c No prognosis significance was found for the expression of TFAP2B and TFAP2C. The high expression of TFAP2A correlates with poor prognosis in smoking patients (d) compared with that in non-smoking patients (g). The high expression of TFAP2A correlates with poor prognosis in non-chemotherapy patients h compared with that in chemotherapy patients (e). The high expression of TFAP2A correlates with poor prognosis in non-radiotherapy patients i compared with that in radiotherapy patients f

In the subgroup analysis, the upregulated TFAP2A showed significant relation to the overall survival in patients with smoking, non-chemotherapy and non-radiotherapy, whereas no significant survival difference was found in patients with non-smoking, chemotherapy and radiotherapy (Fig. 3d–i).

Sequence alterations and correlations of TFAP2 in lung cancer

To further explore TFAP2 sequence alterations and their correlations, we used the cBioPortal for lung adenocarcinoma (TCGA, Provisional). As shown in Fig. 4a, TFAP2 genes were altered in 20.7% of 517 cases with lung adenocarcinoma and mRNA high expression accounted for 12.96%. Compaerd with 3% mutation rate of TFAP2B, higher rate of sequence alterations was found for the genes, TFAP2A and TFAP2C, with 10% and 12% mutation rate, respectively (Fig. 4b). Genes co-expressed with TFAP2 families were detected in lung adenocarcinoma, and the result showed that TFAP2A was positively correlated with MUC16, KRT16, PCDH7 and RHOV. Besides, TFAP2C was positively correlated with GRHL1, DUSP9, SERTAD4 and NECTIN4 (Fig. 4c).

Fig. 4
figure 4

TFAP2 genes alteration and correlation analysis in lung adenocarcinoma (cBioPortal). a, b The genetic alteration of TFAP2 in lung adenocarcinoma. c MUC16,EPHA7 and GRHL1 are the most correlated genes with TFAP2A, TFAP2B and TFAP2C in mRNA expression level, respectively. d Correlation between TFAP2 transcription factors

We also evaluated the association among TFAP2A, TFAP2B and TFAP2C by Spearman analysis via the GEPIA dataset and found that they were all positively correlated with each other (Fig. 4d). In particular, TFAP2A was moderately correlated with TFAP2C (R = 0.53, p < 0.01) but was low correlated with TFAP2B (R = 0.3, p < 0.01). And TFAP2B was also low correlated with TFAP2C (R = 0.26, p < 0.01).

Network analysis of functions and pathways affected by TFAP2 in patients with LC

To investigate the biological functions and molecular interaction networks of TFAP2 factors and their associated genes, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were utilized by DAVID database.

As illustrated in Fig. 5, the results of GO analysis showed that GO:0006915 (apoptotic process), GO:0045944 (positive regulation of transcription from RNA polymerase II promoter), and GO:0000122 (negative regulation of transcription from RNA polymerase II promoter) were markedly regulated by TFAP2 alterations in LC in the aspect of biological process (Fig. 5a). As for the cellular components, GO:0005634 (nucleus), and GO:0005654 (nucleoplasm) were mostly associated with TFAP2 alterations (Figs. 5b). Furthermore, the alterations also prominently regulated molecular functions including GO:0001077 (transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding) and GO:0042802 (identical protein binding) (Figs. 5c).

Fig. 5
figure 5

The functions of TFAP2 and genes significant associated with TFAP2 alterations were predicted by analysis of Gene Ontology (GO) by DAVID database. GO enrichment analysis based on biological processes (a), cellular components (b), and molecular functions (c)

Meanwhile, eight most associated pathways in view of TFAP2 alterations were found through KEGG analysis. In particular, two pathways were of significant correlation with TFAP2 alterations including hsa04210: (Apoptosis) and hsa04064: (NF-kappa B signaling pathway). NF-kappa B signaling pathway and apoptosis pathway were shown in Fig. 6.

Fig. 6
figure 6

NF-kappa B signaling pathway and apoptosis pathway are regulated by TFAP2 alterations in lung adenocarcinoma (cBioPortal)

Discussion

As an important transcriptional factors family, TFAP2 in this study was demonstrated to be a prognostic factor impacting the overall survival for lung cancer patients. We investigated three family members, including TFAP2A, TFAP2B and TFAP2C, and found that they played different roles regarding the biological function and clinical course.

Specifically, TFAP2A is the most frequently studied AP-2 family, which is expressed at the early stage of embryogenesis and is involved in determining the cell fate during neural crest and epidermis formation (Zhang et al. 1996). Reduced, or overexpressed TFAP2A expression has been reported in several cancers including of ovarian, nasopharyngeal, breast, melanoma, glioma, gastric and colon (Anttila et al. 2000; Bar-Eli 1999; Douglas et al. 2004; Heimberger et al. 2005; Schwartz et al. 2007; Shi et al. 2014; Sumigama et al. 2004; Wang et al. 2011), indicating that regulation of TFAP2A gene expression may play a key role in tumorigenesis and development. This study showed that an up-regulation level of TFAP2A in LC compared with normal lung tissue, which is consistent with a previous study in which TFAP2A was illustrated to upregulate the expression of KRT16, an independent prognosis predictor for LC (Yuanhua et al. 2019).

Besides, no significant correlation between TFAP2A expression and the other clinicopathological features, e.g. age, pathological stage and histology type existed in the patients with LC. However, a high TFAP2A expression was markedly related to a poor OS in patients with LC, demonstrating the oncogenic potential of TFAP2A in LC, due to the consideration that it might play a key role in promoting tumorigenesis via epithelial- mesenchymal transition (Yuanhua et al. 2019). We found that TFAP2A was positively correlated with MUC16, KRT16, PCDH7 and RHOV, which are all related to bad survival for cancers.

Of note, the upregulation of TFAP2A was found to be significantly related to the overall survival for the patients with smoking in the subgroup analysis, not for the nonsmokers, indicating that TFAP2A played much more important role for smokers than nonsmokers. Moreover, for patients with chemotherapy or radiotherapy, no survival discriminatory difference was found, but for patients without chemotherapy or radiotherapy, the different expression of TFAP2A showed distinct significant survival, which verified that chemotherapy or radiotherapy could mitigate the effect of TFAP2A in LC.

How the cancer-related gene expressions are impacted by the TFAP2B remains largely under debate. TFAP2B was initially known for its association with the Char syndrome, a human autosomal dominant disorder (Satoda et al. 2000), and later reported to be related to type 2 diabetes mellitus (Comasco et al. 2013; Maeda et al. 2005). TFAP2B was reported previously to be correlated with endometrial cancer, lobular breast cancer, neuroblastoma and thyroid cancer (Cui et al. 2015; Ikram et al. 2016; Raap et al. 2018; Wu and Zhang 2018). Fu et al. found that TFAP2B was upregulated in lung adenocarcinomas and correlated with poor prognosis of the patients (Fu et al. 2014). However, the present study indicated that there was no difference in the expression of TFAP2B between lung cancer and normal tissues, and no significant survival discrimination was found between the patients with high and low expression level. In our view, the reasons for the difference may be due to the sample size, clinical stage and demographic characteristics. Fu et al., studied 147 patients while our prognosis results contain 1145 patients. As the relative much smaller number of patients were found in Fu’s study, we cannot deny that selection bias might have been existed between these two studies. Hence, if we want to gain a better understanding about the potential role of TFAP2B in lung cancer, further studies with much larger sample involved the biological function and the prognostic evaluation were needed.

It has been reported that TFAP2C participated in the regulation of tumorigenesis and was relevant to poor prognosis in some cancers (Ailan et al. 2009; Aqeilan et al. 2004; Woodfield et al. 2010). Previous studies have shown that TFAP2C overexpression led to oncogenic activity and promoted lung tumorigenesis (Kang et al. 2017; Kim et al. 2016). In the present study, a significantly increased expression of TFAP2C was detected in lung cancer compared with normal lung tissue. However, there was no correlation between TFAP2C expression and age, tumor histology types and tumor stage as well as overall survival for patients with LC.

Furthermore, we also evaluated the mutual interaction among the TFAP2 family members and found that they were all positively correlated with each other. TFAP2A was moderately correlated with TFAP2C but was low correlated with TFAP2B (R = 0.3, p < 0.01). And TFAP2B was also low correlated with TFAP2C (R = 0.26, p < 0.01). To investigate the biological functions and molecular interaction networks of TFAP2, we found it was related to the three cellular biological process and function, apoptosis, NF-kappa B signaling pathway and amyotrophic lateral sclerosis, which means TFAP2 could exert an impact for the LC patients via these three pathways.

Several limitations in the present study should be acknowledged. Firstly, the present study was based on in silico predictions, follow-up validation and extension in vivo as well as in vitro experiments are needed to clarify the biological functions of TFAP family in lung cancer. Secondly, the sample selection bias may be existed in the study, as the relatively small sample size may generate insufficient power to detect the majority of associations (Zhao et al. 2018), thus the prediction role of TFAP2 family should be combined with large-scale clinical samples to verify its value for clinical prognosis and treatment.

In all, the expression, mutation, and prognostic values of TFAP2 in LC patients were systematically analyzed in the present study. Our results demonstrated that up-regulated expression of TFAP2A was related to an unfavorable overall survival in lung cancer. Therefore, TFAP2A could be a potential marker for the improvement of survival and prognostic accuracy in patients with lung cancer.