Introduction

Pancreatic adenocarcinoma (PAAD) is one of the most malignant gastrointestinal tumors with difficulties in early diagnosis and treatment [1]. Despite of the development of diagnosis and treatment of PAAD, the 5-year survival rate is still as low as 8% [2]. Besides, since there is no obvious sign at early stage, the early diagnosis rate of PAAD is rather low. Treatment of PAAD includes surgery, chemotherapy, radiation therapy and palliative care, while surgery remains to be the only potentially curative option [3]. PAAD can occur in any part of the pancreas, but most of PAADs are found in the head of the pancreas, while pancreatic body/tail cancers account for only 20–25% [4]. Branching intraductal papillary mucinous tumors typically occur in the head of the pancreas, while mucinous cystic tumors are more common in the body or tail of pancreas [5]. Several studies have suggested that the anatomic location of PAADs plays an important role in survival. These studies indicate that there might be differences in clinical presentation, therapy and malignant potentials between pancreatic head cancer and pancreatic body/tail cancer [6,7,8]. Usually, pancreatic head cancer has a higher incidence and is easier to detect and has a better prognosis compared with pancreatic body/tail cancer [9]. Moreover, compared with patients with pancreatic head cancer, patients with early stage pancreatic body/tail cancer have a lower tumor recurrence rate after curative resection [10].

The effect of anatomic site on prognosis of PAAD has recently been studied extensively, yet the exact genetic differences between pancreatic head cancer and pancreatic body/tail cancer have not been fully elucidated [11]. Therefore, the main purpose of this study is to analyze the DEGs and mutation signatures of pancreatic head cancer and pancreatic body/tail cancer, and to find out the significant DEGs, GO functions, KEGG pathways and mutations which are related to PAAD sites, so as to contribute to the better diagnosis, treatment and prognosis of PAAD.

Methods

Data source

RNA-seq expression profiles, clinical data and mutation data were downloaded and collected from TCGA (https://xenabrowser.net/datapages/), CBioPortal (https://www.cbioportal.org/datasets) and FireHose, respectively. The RNA-seq expression profiles of PAAD were downloaded from TCGA, including 139 pancreatic head cancer cases and 28 pancreatic body/tail cancer cases (Fig. 1). Clinical data of PAAD was downloaded from CBioPortal, including age, gender, race, alcohol history, OS, DFS, and AJCC stage. Mutation data were downloaded from FireHose.

Fig. 1
figure 1

Flow Chart for cases involvement

Clinical information analysis

To ensure the patients of pancreatic head cancer and pancreatic body/tail cancer were comparable, the χ2 test was performed on the age, sex, race, AJCC stage and alcohol history between the two groups. Then overall survival was calculated by R package survival, using the Kaplan–Meier method and log-rank test. The criterion for statistical significance was p < 0.05 on above analysis.

Analysis of RNA-seq data

Firstly, RNA-seq with count per million (CPM) > 1 in more than 10% of the PAAD samples were retained for differential expression genes analysis. Then R package DESeq2 (https://github.com/mikelove/DESeq2) was applied to screen DEGs of pancreatic head cancer and pancreatic body/tail cancer. We performed multiple comparisons using the Benjamini and Hochberg approach to acquire the false discovery rate (FDR). DEGs were defined with the thresholds of FDR < 0.05 and |log2 FoldChange| > 1.5. To analyze the function and the potential pathway of DEGs, the online tool DAVID was used to conduct the functional annotation GO and KEGG enrichment analysis. FDR < 0.05 was defined as the criteria of statistical significance. Moreover, the top 300 genes were used to build PPI network by online tool String and Cytoscape software. We used nodes to represent the proteins and edges to represent interactions between two proteins. The nodes and edges indicate proteins and interactions between two proteins, respectively. The minimum required interaction score was set up as 0.400.

Tumor immune estimation resource (TIMER)

To analyze the immune infiltration of the tumor microenvironment of PAAD, we used the TIMER, a web-based tool which was validated in several cancers (https://cistrome.shinyapps.io/timer/). Six types of tumor-infiltrating cell populations (CD4 + T cells, CD8 + T cells, B cells, Macrophages, Neutrophils and Dendritic cells) were included in the analysis based on the PAAD data from TCGA database.

Analysis of somatic mutation data

Mutation annotation format (MAF) files for somatic mutation were downloaded from FireHose for the analysis. A total of 140 patients were available for somatic mutation analysis, including 20 pancreatic head cancer and 106 pancreatic body/tail cancer. Then, the R packages maftools (https://github.com/PoisonAlien/maftools) and GenVisR (https://bioconductor.org/packages/GenVisR/) were used to analyze the top 10 frequently mutated genes of PDAC. To compare the top 10 frequently mutated genes of pancreatic head cancer and pancreatic body/tail cancer, Chi-square test was performed on them. Furthermore, to evaluate the degree of intra-tumor heterogeneity, MATH was calculated for each patient by R package maftools. Student’s t test was used to compare the MATH score between two groups.

Result

Clinicopathological characteristics

There was no significant difference in the Chi-square test of age (p = 0.89982), sex (p = 0.99982), race (p = 0.14), AJCC stage (p = 0.61) and alcohol history (p = 0.16) of pancreatic head cancer and pancreatic body/tail cancer (Table 1). The Kaplan–Meier survival analysis showed that the anatomic location of PAAD was closely related to the prognosis of patients with PAAD. The survival of patients with pancreatic body/tail cancer was superior to those with pancreatic head cancer (p = 0.048) (Fig. 2).

Table 1 Clinical characteristics of pancreatic head cancer and pancreatic body/tail cancer
Fig. 2
figure 2

Kaplan–Meier curves with orange indicating pancreatic body/tail cancer and blue indicating pancreatic head cancer, showed the overall survival was higher in the patients with pancreatic body/tail cancer than in those with pancreatic head cancer(p = 0.048)

Analysis of DEGs

There were 496 significant DEGs (|log2 FoldChange| > 1.5, p < 0.05), of which 243 genes were upregulated and 253 genes were downregulated (Fig. 3a, b; Supplementary Table S1). The GO enrichment analysis showed that there were 13 significant functions/structures (FDR < 0.05), 4 biological processes, 6 cellular components and 3 molecular functions, including digestion, chemical synaptic transmission, regulation of insulin secretion, glucose homeostasis, extracellular space, extracellular region, plasma membrane, secretory granule, integral component of plasma membrane, voltage-gated potassium channel complex neuropeptide hormone activity, serine-type endopeptidase activity and heparin binding terms (Fig. 3c). The KEGG demonstrated that there were 3 significant pathways (FDR < 0.05), including pancreatic secretion, fat digestion and absorption and protein digestion and absorption (Fig. 3d).The PPI network of top 300 genes consisted of 278 nodes and 684 edges (Fig. 3e, Supplementary Table S2). Genes CCR7, PENK, PPY, SSTR2, CXCL13, GNG4, ADCY1, CCR4, CPR18 and CNR2 were at the core of interaction network (Fig. 3f).

Fig. 3
figure 3

a Volcano plots of DEGs between pancreatic head cancer and pancreatic body/tail cancer. X-axis indicates the fold change (log scaled), whereas the Y-axis shows the p values (log scaled). Each symbol represents a different gene, and the red/blue color of the symbols categorize the upregulated/downregulated genes falling under different criteria (p value and fold change threshold). p value < 0.05 is considered as statistically significant, whereas fold change = 1.5 is set as the threshold. b Heatmaps of the top 20 and last 20 DEGs between pancreatic head cancer and pancreatic body/tail cancer in TCGA. c GO analyses of the DEGs according to their biological process, cellular component and molecular function. X-axis indicates the gene number, whereas the Y-axis shows the most enriched GO terms. d Bubble plot of significant KEGG pathways. X-axis indicates the gene ratio, whereas the Y-axis shows the most enriched KEGG pathways. The size of bubble presents genes number, and the color of bubble presents the p value. e PPI network of the top 300 DEGs. And the red/blue color of the symbols categorize the upregulated/downregulated genes. f Ten hub genes were identified by Cytoscape Cytohubba plug-in

Mutation signatures

Among the 186 patients with PAAD, 140 had both mutation and tumor site information, including 118 pancreatic head cancer and 22 pancreatic body/tail cancer. By analyzing the MAF files of PAAD. The top 10 frequently mutated genes of PAAD were respectively KRAS (90.7%), TP53 (69.0%), TNN (2.9%), MUC16 (7.8%), CDKN2A (14.3%), SMAD4 (25.0%), FLG (7.1%), GNAS (7.1%), RYR (7.1%), and OBSCN (7.1%). Among them, the KRAS gene mutation rate was the highest and 127 out of 140 patients with PAAD had KRAS gene mutations. Moreover, the results of Chi-square test showed no significant difference in KRAS (p = 0.2213), TP53 (p = 0.2558), TNN (p = 1.000), MUC16 (p = 0.379), CDKN2A (p = 1.000), SMAD4 (p = 0.283), FLG (p = 0.949), GNAS (p = 0.362), RYR (p = 0.193), and OBSCN (p = 0.363) (Table 2). Then, to evaluate the degree of intra-tumor heterogeneity, MATH was calculated. The result showed the median MATH score of pancreatic body/tail cancer (91.5) was significantly higher than pancreatic head cancer (78.1) (p = 0.009) (Fig. 4b).

Table 2 Top 10 frequently-mutated genes of pancreatic head cancer and pancreatic body/tail cancer
Fig. 4
figure 4

a Waterfall of top 10 frequently mutated genes of pancreatic head cancer and pancreatic body/tail cancer. b Boxplot of MATH scores between pancreatic head cancer and pancreatic body/tail cancer

Infiltration level of immune cells in pancreatic head cancer and pancreatic body/tail cancer

With the estimation of TIMER, the infiltration levels of CD4+ T cells, CD8+ T cells, B cells, Macrophages, Neutrophils and DCs in two groups were retrieved. The infiltration levels of CD4+ T cells and B cells were higher in pancreatic head cancers than those in the pancreatic body/tail cancers (Fig. 5, both p < 0.01). There is no significant difference of CD8+ T cells (p = 0.450), Macrophages (p = 0.450), Neutrophils (p = 0.051) and DCs (p = 0.067) between two groups.

Fig. 5
figure 5

Infiltration level of immune cells with immune scores of pancreatic head cancer and pancreatic body/tail cancer

Discussion

Pancreatic head cancer and pancreatic body/tail cancer may not be the same type of tumor. Several studies have demonstrated that there are differences in clinical manifestations, treatment, prognosis and recurrence [10, 12]. As for clinical presentation, most patients with PAAD are asymptomatic until the disease progresses to an advanced stage [13]. Patients with PAAD usually presents jaundice, indigestion, pain and weight loss. Usually, pancreatic head cancer presents jaundice early due to the obstruction of the common bile duct, while pancreatic body/tail cancer presents with weight loss and pain, symptoms more in keeping with advanced disease [14]. As for treatment, tumor location is an important factor in treatment of PAAD. Operation plan is different between pancreatic head cancer and pancreatic body/tail cancer. Pancreatic head cancer requires a pancreaticoduodenectomy (Whipple operation), whereas pancreatic tail cancer requires a distal pancreatectomy with an en bloc splenectomy. Pancreatic neck/body cancer may require a pancreaticoduodenectomy, distal pancreatectomy or, rarely, a total pancreatectomy [15, 16]. The lower frequency of pancreatic body cancer resection may be explained by the fact that pancreatic body lesions may be the most challenging to manage due to the co-involvement of major vessels and the late diagnosis [17].

The anatomic location of PAAD also plays an important role in prognosis. Our study based on 174 PAAD cases from TCGA cohort indicates that the prognosis of patients with pancreatic body/tail cancer is better than those with pancreatic head cancer. However, the prognosis of pancreatic head cancer and pancreatic body/tail cancer is still controversial. Some studies support that patients with pancreatic body/tail cancer have an increased risk of death compared to those with pancreatic head cancer [6, 18]. This may result from late diagnosis of pancreatic body/tail cancer, increased frequency of metastasis and lower resecting rate [6]. However, some studies have shown that patients with pancreatic head cancer have a poorer survival than those with pancreatic body/tail cancer. Although patients with pancreatic head cancer have higher resection rates and utilization of adjuvant therapy, those with pancreatic body/tail cancer experience better overall survival [19]. Besides, study shows that the earlier stage at diagnosis has better survival than later stages, especially for pancreatic body/tail cancer [8]. Study has demonstrated that the patients with early stage pancreatic body/tail cancer have a lower tumor recurrence rate after curative resection [10].

In our study, digestion, chemical synaptic transmission, regulation of insulin secretion, glucose homeostasis, extracellular space, extracellular region, plasma membrane, secretory granule, integral component of plasma membrane, voltage-gated potassium channel complex neuropeptide hormone activity, serine-type endopeptidase activity and heparin binding terms are the most enriched GO terms. KEGG is mainly enriched in three pathways, including Pancreatic secretion, Fat digestion and absorption and Protein digestion and absorption. We can find that the most enriched GO terms and KEGG pathways are intensely correlated to digestion and pancreatic excretion. The main functions of the pancreas include digestion and excretion, while the head of pancreas and body/tail of pancreas have different roles in digestion and secretion. Study has confirmed that the concentration of insulin-positive endocrine cells in the tail of the pancreas is higher than that in the head [20]. Therefore, different clinical features of the cancers from head and tail could be caused by the different histological components and different biological functions.

PPIs plays an important role in gene expression, cell growth, proliferation and apoptosis [21]. Numerous studies have shown that PPIs is the basis of a variety of aggregation-related diseases, especially related to the occurrence and progression of cancer [22, 23]. In our study, genes CCR7, PENK, PPY, SSTR2, CXCL13, GNG4, ADCY1, CCR4, CPR18 and CNR2 are at the core of PPI network. However, CCR7 is the most significant gene of PPI network. As one of chemokine receptors, CCR7 expression is highly associated with lymph node metastasis in PAAD [24]. Although CCR7 expression may have no power to predict the recurrence of PAAD, CCR7 positivity is closely associated with a high incidence of metastasis [24]. Study has identified that tumor metastasis is highly related to cancer mortality, especially PAAD [25]. Therefore, we can conclude that the expression of CCR7 may affect the survival and prognosis of patients with PAAD. KRAS, TP53, TTN, SMAD4, CDKN2A, MUC16, FLG, GNAS, RYR1 and OBSCN are top 10 frequently mutated genes of PAAD. However, the results of our study show little difference of the top 10 frequently mutated genes between pancreatic head cancer and pancreatic body/tail cancer. Study shows heterogeneity may have a significant impact on tumor invasiveness, disease prognosis, and response to treatment, and thus may be a major obstacle to effective treatment of cancer and individualized medicine [26]. MATH has been proven to be a simple, quantitative and universally applicable method to evaluate intra-tumor heterogeneity [27]. In our study, the MATH score of pancreatic body/tail cancer is higher than pancreatic head cancer, which means that pancreatic body/tail cancer might be more heterogeneous than pancreatic head cancer. Tumor microenvironment analysis regarding the immune cells infiltration showed that the levels of B cells and CD4 + cells are higher in the pancreatic head cancer. RNA-seq data above suggests that CXCL13 is highly expressed in the pancreatic head which is an important recruitment factor for B cells [28]. B cells have a cancer-promoting effect which might play a role in the worse outcome of pancreatic head cancer. Tregs (T regulatory cells) are important subpopulation of CD4 + cells and can promote the pancreatic cancer progression [29].

The main advantage of this study is the reliable data source, which is downloaded from TCGA. The TCGA is a public funded project which aims to catalog and discover major cancer-causing genome alterations in large cohorts of over 30 human tumors through large-scale genome sequencing and integrated multi-dimensional analyses. In addition, TCGA can provide comprehensive analysis of cancer genome profiles, which is constantly updated [30]. So, the RNA-seq expression profiles, clinical data and genes mutation data of our study are believable and comprehensive. However, there are some limitations to our study. Firstly, the sample size in the confirmation by qRT-PCR is small and large numbers of samples of pancreatic head cancer and pancreatic body/tail cancer are needed for further research. Secondly, the DEGs obtained in our study are from TCGA cohort, which lacks the validation of other cohorts. Thirdly, the analyses of the current study are mainly based on the gene expression and mutation of the PAADs in TCGA cohort. To fully explore genetic differences of pancreatic head cancer and pancreatic body/tail cancer, vascular stability and immune response genes, in vitro and in vivo experimental approaches are needed.

In conclusion, by comparing DEGs and mutation signatures of pancreatic head cancer and pancreatic body/tail cancer from TCGA cohort, our study identifies that there are prognostic and genetic differences between pancreatic head cancer and pancreatic body/tail cancer, which can provide the basis for the individualized treatment and prognosis assessment of pancreatic head cancer and pancreatic body/tail cancer.